Query         psy17098
Match_columns 204
No_of_seqs    199 out of 317
Neff          5.5 
Searched_HMMs 46136
Date          Fri Aug 16 21:49:20 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy17098.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17098hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0196|consensus              100.0 3.8E-53 8.3E-58  408.9  14.2  165    1-195   146-317 (996)
  2 KOG0196|consensus               99.8 2.9E-21 6.3E-26  188.3   4.8   75  130-204   201-277 (996)
  3 PF12662 cEGF:  Complement Clr-  89.3    0.21 4.6E-06   28.9   1.2   19  186-204     2-20  (24)
  4 PF02793 HRM:  Hormone receptor  83.1    0.89 1.9E-05   31.5   2.0   49  130-184     2-55  (66)
  5 PF02793 HRM:  Hormone receptor  80.3     0.8 1.7E-05   31.7   0.9   49   52-106     2-55  (66)
  6 KOG1226|consensus               79.4     4.7  0.0001   40.9   6.2  138   38-195   403-575 (783)
  7 PHA02817 EEV Host range protei  78.8     6.3 0.00014   34.4   6.2   91   79-189    48-146 (225)
  8 smart00180 EGF_Lam Laminin-typ  77.7     1.9 4.1E-05   28.1   2.0   24  107-138    17-40  (46)
  9 KOG1225|consensus               75.3       4 8.6E-05   39.9   4.3   69  105-195   262-336 (525)
 10 PF07974 EGF_2:  EGF-like domai  74.4       3 6.6E-05   25.4   2.2   23  173-195     6-28  (32)
 11 smart00008 HormR Domain presen  72.1     5.3 0.00011   28.2   3.3   48  130-183     3-53  (70)
 12 PF12661 hEGF:  Human growth fa  70.0     2.5 5.4E-05   20.9   0.8    9  187-195     1-9   (13)
 13 cd00055 EGF_Lam Laminin-type e  69.6     3.4 7.4E-05   27.1   1.7   27  106-140    17-43  (50)
 14 PHA02639 EEV host range protei  67.5      21 0.00046   32.2   6.9   90   79-189   109-206 (295)
 15 PF00053 Laminin_EGF:  Laminin   67.1     2.4 5.1E-05   27.6   0.6   28  106-141    16-43  (49)
 16 PF07645 EGF_CA:  Calcium-bindi  64.7     3.4 7.3E-05   26.2   0.9   20  184-203    23-42  (42)
 17 PF00084 Sushi:  Sushi domain (  64.3     9.1  0.0002   24.3   3.0   32   79-110    25-56  (56)
 18 PF14670 FXa_inhibition:  Coagu  61.0     4.8  0.0001   25.2   1.1   19  185-203    18-36  (36)
 19 smart00032 CCP Domain abundant  60.9      12 0.00025   23.5   3.0   32   79-110    26-57  (57)
 20 smart00008 HormR Domain presen  58.5       7 0.00015   27.5   1.7   48   52-105     3-53  (70)
 21 KOG4260|consensus               57.8     8.8 0.00019   35.0   2.6   22  173-195   156-177 (350)
 22 cd00033 CCP Complement control  57.0      13 0.00028   23.4   2.8   32   79-110    25-56  (57)
 23 PF07699 GCC2_GCC3:  GCC2 and G  56.5     6.4 0.00014   25.6   1.2   13  128-140    10-22  (48)
 24 PF01683 EB:  EB module;  Inter  54.1      18 0.00039   23.6   3.1   17  185-203    36-52  (52)
 25 smart00032 CCP Domain abundant  53.1      14  0.0003   23.1   2.4   30  159-188    28-57  (57)
 26 KOG0994|consensus               52.8      12 0.00025   40.2   2.9   59  106-195  1035-1093(1758)
 27 PF00084 Sushi:  Sushi domain (  52.4      10 0.00023   24.0   1.7   31  158-188    26-56  (56)
 28 KOG1214|consensus               42.1      62  0.0013   34.0   6.0   98   32-203   764-863 (1289)
 29 KOG4260|consensus               41.6      13 0.00027   34.0   1.0   39   94-141   155-193 (350)
 30 PF06725 3D:  3D domain;  Inter  39.6      12 0.00026   27.0   0.4   55    1-56      2-57  (75)
 31 KOG1836|consensus               31.8      33 0.00072   38.1   2.4   38   95-140   398-438 (1705)
 32 PF12946 EGF_MSP1_1:  MSP1 EGF   30.8      16 0.00035   23.3  -0.0   18  185-204    20-37  (37)
 33 PF01393 Chromo_shadow:  Chromo  29.3      13 0.00027   25.6  -0.8   31   30-61     19-49  (58)
 34 PHA03001 putative virion core   29.2      51  0.0011   26.7   2.5   28   25-61     48-75  (132)
 35 cd00185 TNFR Tumor necrosis fa  29.0      31 0.00066   25.6   1.2   29  112-140    17-46  (98)
 36 PHA02831 EEV host range protei  28.3 1.9E+02  0.0041   25.9   6.3   33  157-189   164-196 (268)
 37 PF12947 EGF_3:  EGF domain;  I  27.7      30 0.00066   21.4   0.8   18  184-203    19-36  (36)
 38 PF06138 Chordopox_E11:  Chordo  27.7      55  0.0012   26.4   2.5   28   25-61     49-76  (130)
 39 PF11767 SET_assoc:  Histone ly  27.0      40 0.00086   23.9   1.4   14   26-39     31-44  (66)
 40 PHA02927 secreted complement-b  26.1 1.8E+02  0.0038   25.4   5.6   32  157-188   230-261 (263)
 41 PF13121 DUF3976:  Domain of un  25.1      15 0.00033   23.5  -0.9    8   25-32     14-21  (41)
 42 PHA02639 EEV host range protei  24.9 3.5E+02  0.0077   24.4   7.5   90   79-189    46-145 (295)
 43 KOG4564|consensus               24.1      81  0.0018   30.6   3.4   49  130-184    62-115 (473)
 44 smart00179 EGF_CA Calcium-bind  23.9      93   0.002   17.9   2.5   10  186-195    24-33  (39)
 45 PF04260 DUF436:  Protein of un  23.6      34 0.00075   28.8   0.7   22   26-63     56-77  (172)
 46 cd00034 ChSh Chromo Shadow Dom  23.5      27 0.00058   23.6  -0.0   29   31-61     18-46  (54)
 47 KOG0994|consensus               23.4      80  0.0017   34.3   3.3   39   95-141   765-807 (1758)
 48 COG4475 Uncharacterized protei  21.7      45 0.00097   28.0   1.0   28   25-68     60-87  (180)
 49 cd00054 EGF_CA Calcium-binding  21.4 1.1E+02  0.0024   17.2   2.5   10  186-195    24-33  (38)
 50 PF07354 Sp38:  Zona-pellucida-  21.4      50  0.0011   29.7   1.3   31  111-141   221-252 (271)
 51 PF00954 S_locus_glycop:  S-loc  20.8      45 0.00098   24.9   0.8   10  188-197   100-109 (110)
 52 TIGR01440 conserved hypothetic  20.1      50  0.0011   27.8   1.0   26   26-67     56-81  (172)

No 1  
>KOG0196|consensus
Probab=100.00  E-value=3.8e-53  Score=408.89  Aligned_cols=165  Identities=45%  Similarity=0.907  Sum_probs=144.8

Q ss_pred             CccCCCCCcCC-CC---ceeeeeEEEEe-ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhccccCCCCCCCCCccc
Q psy17098          1 RIAAGEGRFNT-NT---EVNINTEVKSI-PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAHFPTTPTGREVTF   75 (204)
Q Consensus         1 tiAade~~f~~-~~---~~~~N~e~r~~-pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~Fp~T~ag~~~~s   75 (204)
                      ||||||+ |++ +.   .||||||+|+| ||||+||||||||+||||||+|||||||+||++++|||.||||++|++.++
T Consensus       146 tIAades-f~~~d~~~r~mklNte~rs~Gpltk~GfYlAFqd~GAC~aLlsVrvyYkkCp~~~~nlA~FpeTvtGa~~ts  224 (996)
T KOG0196|consen  146 TIAADES-FTQSDLGDRVMKLNTEVRSVGPLTKKGFYLAFQDQGACLALLSVRVYYKKCPETTRNLAHFPETVTGAESTS  224 (996)
T ss_pred             eeeccCc-eecccCccceeEeeeeeEeccccccceEEEEEecCCceEEEEEEEeeeecChHHHhhhccCCCcCCCCCccc
Confidence            8999999 555 32   38999999999 999999999999999999999999999999999999999999999999999


Q ss_pred             eEeEEeEEccCceec--CCCceEEeccceEEeeccceeecCCcccccccccccCCcccCCccccccccCCCCCCCCceee
Q psy17098         76 IEQATGRCVENAEEV--GTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTF  153 (204)
Q Consensus        76 lv~v~G~CV~~A~~~--~~P~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~s  153 (204)
                      ||+|.|+||+||++.  ..|+|||++||+|++|+|+|.|++|||+...+.+|    ++||.|+||+..      | +   
T Consensus       225 lv~v~GtCV~nA~~~~~~~p~m~C~~dGeWlvpiG~C~C~aGye~~~~~~~C----~aCp~G~yK~~~------~-~---  290 (996)
T KOG0196|consen  225 LVEVRGSCVPNAEEEPVPVPRMYCSGDGEWLVPIGGCVCKAGYEEAENGKAC----QACPPGTYKASQ------G-D---  290 (996)
T ss_pred             eEEecccccCCCccCCCCCceeEEcCCCcEEEEcCceeecCCCCcccCCCcc----eeCCCCcccCCC------C-C---
Confidence            999999999999975  56999999999999999999999999986556665    999999999984      2 1   


Q ss_pred             EEeeecccccCccccCCCeEEecccCeEEccCCCeecCCCce
Q psy17098        154 IEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCKCKPGFE  195 (204)
Q Consensus       154 lv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~C~~Gye  195 (204)
                        .....|.+|+.....             ....|.|..||.
T Consensus       291 --~~C~~CP~~S~s~~e-------------ga~~C~C~~gyy  317 (996)
T KOG0196|consen  291 --SLCLPCPPNSHSSSE-------------GATSCTCENGYY  317 (996)
T ss_pred             --CCCCCCCCCCCCCCC-------------CCCcccccCCcc
Confidence              246789999874322             245678888876


No 2  
>KOG0196|consensus
Probab=99.83  E-value=2.9e-21  Score=188.29  Aligned_cols=75  Identities=49%  Similarity=0.994  Sum_probs=71.0

Q ss_pred             ccCCccccccccCCCCCCCCceeeEEeeecccccCcccc--CCCeEEecccCeEEccCCCeecCCCceeccccCccC
Q psy17098        130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEV--GTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCN  204 (204)
Q Consensus       130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~--~~P~~~C~~~G~W~vp~g~C~C~~Gye~~~~~~~C~  204 (204)
                      ++||..+.+||+||+|++|++.++||+|+|+||+||++.  +.|+|||++||+|+||+|+|.|++|||+..++.+|+
T Consensus       201 kkCp~~~~nlA~FpeTvtGa~~tslv~v~GtCV~nA~~~~~~~p~m~C~~dGeWlvpiG~C~C~aGye~~~~~~~C~  277 (996)
T KOG0196|consen  201 KKCPETTRNLAHFPETVTGAESTSLVEVRGSCVPNAEEEPVPVPRMYCSGDGEWLVPIGGCVCKAGYEEAENGKACQ  277 (996)
T ss_pred             ecChHHHhhhccCCCcCCCCCccceEEecccccCCCccCCCCCceeEEcCCCcEEEEcCceeecCCCCcccCCCcce
Confidence            799999999999999999999999999999999999854  569999999999999999999999999887888885


No 3  
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=89.27  E-value=0.21  Score=28.88  Aligned_cols=19  Identities=26%  Similarity=0.927  Sum_probs=15.4

Q ss_pred             CCeecCCCceeccccCccC
Q psy17098        186 GGCKCKPGFEADVEKQTCN  204 (204)
Q Consensus       186 g~C~C~~Gye~~~~~~~C~  204 (204)
                      -.|.|.+||+.+..+.+|.
T Consensus         2 y~C~C~~Gy~l~~d~~~C~   20 (24)
T PF12662_consen    2 YTCSCPPGYQLSPDGRSCE   20 (24)
T ss_pred             EEeeCCCCCcCCCCCCccc
Confidence            3699999999877777774


No 4  
>PF02793 HRM:  Hormone receptor domain;  InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products.  This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=83.13  E-value=0.89  Score=31.46  Aligned_cols=49  Identities=29%  Similarity=0.552  Sum_probs=36.5

Q ss_pred             ccCCccccccccCCCCCCCCceeeEEeeecccccCcc-----ccCCCeEEecccCeEEcc
Q psy17098        130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAE-----EVGTPTYLCKGDGKWYLP  184 (204)
Q Consensus       130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~-----~~~~P~~~C~~~G~W~vp  184 (204)
                      ..||..+-...-.|+|.+|.      .+.-.|.....     .....+..|..+|+|...
T Consensus         2 ~~C~~~~d~~~cWp~T~~G~------~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~   55 (66)
T PF02793_consen    2 SYCPATWDGWLCWPPTPAGE------TASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH   55 (66)
T ss_dssp             SCB--EEESSCEE-SBETTE------EEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred             CCCChhhCCcCCCCCCCCCC------EEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence            46899999999999999984      35678998884     123467899999999974


No 5  
>PF02793 HRM:  Hormone receptor domain;  InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products.  This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=80.35  E-value=0.8  Score=31.70  Aligned_cols=49  Identities=29%  Similarity=0.538  Sum_probs=36.2

Q ss_pred             ecChhhhhccccCCCCCCCCCccceEeEEeEEccCcee-----cCCCceEEeccceEEee
Q psy17098         52 ITCPETTINFAHFPTTPTGREVTFIEQATGRCVENAEE-----VGTPTYLCKGDGKWYLP  106 (204)
Q Consensus        52 ~~CP~~~~~la~Fp~T~ag~~~~slv~v~G~CV~~A~~-----~~~P~~~C~~~G~W~~p  106 (204)
                      ..||.+..++.-.|+|.+|.      .|.-.|-.....     ...-+..|+.+|+|...
T Consensus         2 ~~C~~~~d~~~cWp~T~~G~------~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~   55 (66)
T PF02793_consen    2 SYCPATWDGWLCWPPTPAGE------TASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH   55 (66)
T ss_dssp             SCB--EEESSCEE-SBETTE------EEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred             CCCChhhCCcCCCCCCCCCC------EEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence            36999999999999999974      266799998842     13457889999999873


No 6  
>KOG1226|consensus
Probab=79.37  E-value=4.7  Score=40.88  Aligned_cols=138  Identities=23%  Similarity=0.536  Sum_probs=82.8

Q ss_pred             CCceEeE-------EEEEEEEecChhhh--hccccCCCCCCCCCccceEeEEeEEccCce-ecCCCceEEeccceEEeec
Q psy17098         38 QGACISL-------LAIKVYYITCPETT--INFAHFPTTPTGREVTFIEQATGRCVENAE-EVGTPTYLCKGDGKWYLPS  107 (204)
Q Consensus        38 ~GaCval-------~sVrVyy~~CP~~~--~~la~Fp~T~ag~~~~slv~v~G~CV~~A~-~~~~P~~~C~~~G~W~~p~  107 (204)
                      .+.|..|       ..|.|=-++||...  ..+..-|--.+   .+-.+.|.-.|.=.=+ ..+....+|+++|.  ..=
T Consensus       403 ~~~c~~i~~gd~v~f~v~~~~~~C~~~~~~~~~~i~pvgf~---e~l~v~v~~~C~C~C~~~~e~~s~~C~g~G~--~~C  477 (783)
T KOG1226|consen  403 RRKCSGITIGDEVTFEVSVTAKKCPPEDQKGSFIIRPVGFT---ETLEVIVQYNCECDCQDQGEPNSALCHGNGT--FVC  477 (783)
T ss_pred             cCCCCCcccCceeEEEEecccccCCCccccceEEEccCCCC---cceEEEeeccccccccccCCCCccccCCCCc--EEe
Confidence            4566654       56777788999332  44555444333   2334556666632222 23667889997774  446


Q ss_pred             cceeecCCcccccccccccCCcccCCccccccc------cCCCC---CCCCceeeEEeeecccccCcccc----------
Q psy17098        108 GGCKCKPGFEADVEKQTCNGTCHACPETTINFA------HFPTT---PTGREVTFIEQATGRCVENAEEV----------  168 (204)
Q Consensus       108 g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A------~Fp~T---~~g~~~~slv~v~G~CV~na~~~----------  168 (204)
                      |.|.|++||-.    ..|     -||..-+..+      .++..   .+|    .=.=+.|+|+-|....          
T Consensus       478 G~C~C~~G~~G----~~C-----EC~~~~~ss~~~~~~Cr~~~~~~vCSg----rG~C~CGqC~C~~~~~~~i~G~fCEC  544 (783)
T KOG1226|consen  478 GQCRCDEGWLG----KKC-----ECSTDELSSSEEEDKCRENSDSPVCSG----RGDCVCGQCVCHKPDNGKIYGKFCEC  544 (783)
T ss_pred             cceecCCCCCC----Ccc-----cCCccccCcHhHHhhccCCCCCCCcCC----CCcEeCCceEecCCCCCceeeeeeec
Confidence            99999999973    345     5666655553      11111   122    1123567777776422          


Q ss_pred             -C--CCe---EEecccCeEEccCCCeecCCCce
Q psy17098        169 -G--TPT---YLCKGDGKWYLPSGGCKCKPGFE  195 (204)
Q Consensus       169 -~--~P~---~~C~~~G~W~vp~g~C~C~~Gye  195 (204)
                       +  -++   ..|.+-|+  =--|+|.|.+||.
T Consensus       545 DnfsC~r~~g~lC~g~G~--C~CG~CvC~~Gwt  575 (783)
T KOG1226|consen  545 DNFSCERHKGVLCGGHGR--CECGRCVCNPGWT  575 (783)
T ss_pred             cCcccccccCcccCCCCe--EeCCcEEcCCCCc
Confidence             1  122   45888887  6789999999998


No 7  
>PHA02817 EEV Host range protein; Provisional
Probab=78.82  E-value=6.3  Score=34.40  Aligned_cols=91  Identities=23%  Similarity=0.374  Sum_probs=48.5

Q ss_pred             EEeEEccCc-----eecCCCceEEeccceEEeeccceeecCCcccccccccccCCcccCCcc-cccccc--CCCCCCCCc
Q psy17098         79 ATGRCVENA-----EEVGTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPET-TINFAH--FPTTPTGRE  150 (204)
Q Consensus        79 v~G~CV~~A-----~~~~~P~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g-~~k~A~--Fp~T~~g~~  150 (204)
                      |.=+|.++.     .......++|+.||+|..+.-.|.      .           ..||.- ..+-..  .|.   ...
T Consensus        48 Vty~C~~G~~~~gY~L~G~~~i~C~~dG~Ws~~~P~C~------~-----------v~C~~P~i~NG~v~~~~~---~~~  107 (225)
T PHA02817         48 VTFFCGNNTRGVRYTLVGEKNIICEKDGKWNKEFPVCK------I-----------IRCRFPALQNGFVNGIPD---SKK  107 (225)
T ss_pred             EEEEeCCCCCCCCEEECCCCeEEECCCCcCCCCCCeee------e-----------eECCCCCCcCceeEcccc---CCc
Confidence            566898862     223567899999999975443342      1           133311 111000  010   000


Q ss_pred             eeeEEeeecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098        151 VTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK  189 (204)
Q Consensus       151 ~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~  189 (204)
                      ..--..+.=+|-++=.-.+.....|..+|.|.-+.-.|.
T Consensus       108 y~yg~~Vty~C~~Gy~L~G~~~~tC~~~G~WSp~~P~C~  146 (225)
T PHA02817        108 FYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICS  146 (225)
T ss_pred             eEcCCEEEEEcCCCCEEcCCCceEECCCCeECCCCCEee
Confidence            001112445665443333456788999999997766665


No 8  
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=77.73  E-value=1.9  Score=28.05  Aligned_cols=24  Identities=29%  Similarity=0.883  Sum_probs=20.4

Q ss_pred             ccceeecCCcccccccccccCCcccCCccccc
Q psy17098        107 SGGCKCKPGFEADVEKQTCNGTCHACPETTIN  138 (204)
Q Consensus       107 ~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k  138 (204)
                      +|.|.|++|+..    ..|    ..|++++|.
T Consensus        17 ~G~C~C~~~~~G----~~C----~~C~~g~~g   40 (46)
T smart00180       17 TGQCECKPNVTG----RRC----DRCAPGYYG   40 (46)
T ss_pred             CCEEECCCCCCC----CCC----CcCCCCcCC
Confidence            689999999984    344    999999998


No 9  
>KOG1225|consensus
Probab=75.30  E-value=4  Score=39.86  Aligned_cols=69  Identities=32%  Similarity=0.715  Sum_probs=43.1

Q ss_pred             eeccceeecCCcccccccccccCCcccCCccccccccCCCCCCCCceeeEEeeecccccCcc-cc---CCCe--EEeccc
Q psy17098        105 LPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTFIEQATGRCVENAE-EV---GTPT--YLCKGD  178 (204)
Q Consensus       105 ~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~-~~---~~P~--~~C~~~  178 (204)
                      .--|.|.|.+||.-.    .|. . ..||.. .         +|    -.+-+.|.|+-|-. ..   +.++  -.|+..
T Consensus       262 c~~G~CIC~~Gf~G~----dC~-e-~~Cp~~-c---------s~----~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~  321 (525)
T KOG1225|consen  262 CVEGRCICPPGFTGD----DCD-E-LVCPVD-C---------SG----GGVCVDGECICNPGYSGKDCSIRRCPADCSGH  321 (525)
T ss_pred             EeCCeEeCCCCCcCC----CCC-c-ccCCcc-c---------CC----CceecCCEeecCCCccccccccccCCccCCCC
Confidence            334788888888731    230 0 345554 1         11    23456788888874 11   1122  568888


Q ss_pred             CeEEccCCCeecCCCce
Q psy17098        179 GKWYLPSGGCKCKPGFE  195 (204)
Q Consensus       179 G~W~vp~g~C~C~~Gye  195 (204)
                      |.+.  .|+|.|.+||.
T Consensus       322 G~Ci--~G~C~C~~Gy~  336 (525)
T KOG1225|consen  322 GKCI--DGECLCDEGYT  336 (525)
T ss_pred             Cccc--CCceEeCCCCc
Confidence            9998  99999999997


No 10 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=74.38  E-value=3  Score=25.41  Aligned_cols=23  Identities=35%  Similarity=1.008  Sum_probs=19.6

Q ss_pred             EEecccCeEEccCCCeecCCCce
Q psy17098        173 YLCKGDGKWYLPSGGCKCKPGFE  195 (204)
Q Consensus       173 ~~C~~~G~W~vp~g~C~C~~Gye  195 (204)
                      ..|+..|.=..+.+.|.|.+||.
T Consensus         6 ~~C~~~G~C~~~~g~C~C~~g~~   28 (32)
T PF07974_consen    6 NICSGHGTCVSPCGRCVCDSGYT   28 (32)
T ss_pred             CccCCCCEEeCCCCEEECCCCCc
Confidence            45888887777789999999997


No 11 
>smart00008 HormR Domain present in hormone receptors.
Probab=72.09  E-value=5.3  Score=28.17  Aligned_cols=48  Identities=23%  Similarity=0.473  Sum_probs=38.2

Q ss_pred             ccCCccccccccCCCCCCCCceeeEEeeecccccCcccc---CCCeEEecccCeEEc
Q psy17098        130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEV---GTPTYLCKGDGKWYL  183 (204)
Q Consensus       130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~---~~P~~~C~~~G~W~v  183 (204)
                      ..||..+....-.|.|.+|.      .+.-.|.......   .....+|..+|.|..
T Consensus         3 ~~C~~~wD~~~CWp~t~~G~------~~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~   53 (70)
T smart00008        3 LGCPATWDGIICWPQTPAGQ------LVEVPCPDYFSGFSYKTGASRNCTENGGWSP   53 (70)
T ss_pred             CCCccccccccCCCCCCCCC------EEEecCCCcccCcCCCCCEEEEcCCCCCCcc
Confidence            46999999999999999985      3467899887532   126789999999985


No 12 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=69.99  E-value=2.5  Score=20.94  Aligned_cols=9  Identities=44%  Similarity=1.497  Sum_probs=7.0

Q ss_pred             CeecCCCce
Q psy17098        187 GCKCKPGFE  195 (204)
Q Consensus       187 ~C~C~~Gye  195 (204)
                      .|.|.+||.
T Consensus         1 ~C~C~~G~~    9 (13)
T PF12661_consen    1 TCQCPPGWT    9 (13)
T ss_dssp             EEEE-TTEE
T ss_pred             CccCcCCCc
Confidence            499999998


No 13 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=69.59  E-value=3.4  Score=27.14  Aligned_cols=27  Identities=26%  Similarity=0.765  Sum_probs=21.8

Q ss_pred             eccceeecCCcccccccccccCCcccCCccccccc
Q psy17098        106 PSGGCKCKPGFEADVEKQTCNGTCHACPETTINFA  140 (204)
Q Consensus       106 p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A  140 (204)
                      ..|.|.|++|+..    ..|    ..|++++|.+.
T Consensus        17 ~~G~C~C~~~~~G----~~C----~~C~~g~~~~~   43 (50)
T cd00055          17 GTGQCECKPNTTG----RRC----DRCAPGYYGLP   43 (50)
T ss_pred             CCCEEeCCCcCCC----CCC----CCCCCCCccCC
Confidence            4799999999983    334    99999999765


No 14 
>PHA02639 EEV host range protein; Provisional
Probab=67.50  E-value=21  Score=32.21  Aligned_cols=90  Identities=21%  Similarity=0.320  Sum_probs=48.3

Q ss_pred             EEeEEccC---c-eecCCCceEEeccceEEeeccceeecCCcccccccccccCCcccCCcc---ccccccCCCC-CCCCc
Q psy17098         79 ATGRCVEN---A-EEVGTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPET---TINFAHFPTT-PTGRE  150 (204)
Q Consensus        79 v~G~CV~~---A-~~~~~P~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g---~~k~A~Fp~T-~~g~~  150 (204)
                      |.=+|-++   - .......+.|..||+|..+.-.|.      .    ..|    . -|..   ..+....... .-|  
T Consensus       109 V~y~C~~g~~~gY~L~G~~~~~C~~dG~WS~~~P~C~------~----i~C----~-~P~i~nG~v~~~~~~~~~~yg--  171 (295)
T PHA02639        109 IYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICK------M----INC----R-FPALQNGYINGIPSNKKFYYK--  171 (295)
T ss_pred             EEEEeCCCCCCceEECCCCeEEECCCCeECCCCCeee------e----EEe----C-CCCCCCCceEcccCCCceecC--
Confidence            55678774   1 223567899999999986544442      0    012    1 1211   1110000000 111  


Q ss_pred             eeeEEeeecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098        151 VTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK  189 (204)
Q Consensus       151 ~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~  189 (204)
                          ..|.=+|.+.=.-.+.+.+.|..+|+|.-+.=.|.
T Consensus       172 ----~~V~fsC~~GY~L~Gs~~~tC~~nG~Ws~~~P~C~  206 (295)
T PHA02639        172 ----TRVGFSCKSGFDLVGEKYSTCNINATWFPSIPTCV  206 (295)
T ss_pred             ----CEEEEEcCCCCeEcCCCcEEECCCCeECCCCCeEe
Confidence                12345666544434567889999999997776674


No 15 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=67.06  E-value=2.4  Score=27.57  Aligned_cols=28  Identities=29%  Similarity=0.793  Sum_probs=21.1

Q ss_pred             eccceeecCCcccccccccccCCcccCCcccccccc
Q psy17098        106 PSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAH  141 (204)
Q Consensus       106 p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~  141 (204)
                      ..|.|.|+++|+.    ..    |..|..++|.+..
T Consensus        16 ~~G~C~C~~~~~G----~~----C~~C~~g~~~~~~   43 (49)
T PF00053_consen   16 STGQCVCKPGTTG----PR----CDQCKPGYFGLPS   43 (49)
T ss_dssp             TCEEESBSTTEES----TT----S-EE-TTEECSTT
T ss_pred             CCCEEeccccccC----Cc----CcCCCCccccccC
Confidence            5789999999983    34    4999999998864


No 16 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=64.69  E-value=3.4  Score=26.15  Aligned_cols=20  Identities=40%  Similarity=0.934  Sum_probs=14.9

Q ss_pred             cCCCeecCCCceeccccCcc
Q psy17098        184 PSGGCKCKPGFEADVEKQTC  203 (204)
Q Consensus       184 p~g~C~C~~Gye~~~~~~~C  203 (204)
                      ..-.|.|++||+.+....+|
T Consensus        23 Gsy~C~C~~Gy~~~~~~~~C   42 (42)
T PF07645_consen   23 GSYSCSCPPGYELNDDGTTC   42 (42)
T ss_dssp             TEEEEEESTTEEECTTSSEE
T ss_pred             CCEEeeCCCCcEECCCCCcC
Confidence            45679999999966666555


No 17 
>PF00084 Sushi:  Sushi domain (SCR repeat);  InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13.  Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=64.30  E-value=9.1  Score=24.32  Aligned_cols=32  Identities=31%  Similarity=0.662  Sum_probs=24.2

Q ss_pred             EEeEEccCceecCCCceEEeccceEEeeccce
Q psy17098         79 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC  110 (204)
Q Consensus        79 v~G~CV~~A~~~~~P~~~C~~~G~W~~p~g~C  110 (204)
                      +.=+|-.+-.......+.|+.+|+|..+.-.|
T Consensus        25 ~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C   56 (56)
T PF00084_consen   25 VTFSCNPGYELSGSSTITCQSNGQWSPPPPTC   56 (56)
T ss_dssp             EEEEESTTEEEESSSEEEEETTSEESSTTSEE
T ss_pred             EEEcCCCCCEecCCCEEEECCCCEECCCCCCC
Confidence            55688777776666899999999998664433


No 18 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=60.98  E-value=4.8  Score=25.23  Aligned_cols=19  Identities=37%  Similarity=0.945  Sum_probs=13.7

Q ss_pred             CCCeecCCCceeccccCcc
Q psy17098        185 SGGCKCKPGFEADVEKQTC  203 (204)
Q Consensus       185 ~g~C~C~~Gye~~~~~~~C  203 (204)
                      .-.|.|.+||..+.++.+|
T Consensus        18 ~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen   18 SYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             SEEEE-STTEEE-TTSSSE
T ss_pred             ceEeECCCCCEECcCCCCC
Confidence            4579999999987777665


No 19 
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short  consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino  acid residues and have been identified in several proteins of the  complement system. A missense mutation in seventh CCP domain causes  deficiency of the b subunit of factor XIII.
Probab=60.93  E-value=12  Score=23.46  Aligned_cols=32  Identities=31%  Similarity=0.647  Sum_probs=22.6

Q ss_pred             EEeEEccCceecCCCceEEeccceEEeeccce
Q psy17098         79 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC  110 (204)
Q Consensus        79 v~G~CV~~A~~~~~P~~~C~~~G~W~~p~g~C  110 (204)
                      +.=.|-++=.......+.|..+|+|..+.-.|
T Consensus        26 ~~~~C~~Gy~l~g~~~~~C~~~g~Ws~~~p~C   57 (57)
T smart00032       26 VTYSCNPGYTLIGSSTITCLEDGTWSPPAPTC   57 (57)
T ss_pred             EEEEcCCCCEEcCCCeeEECCCCEECCCCCcC
Confidence            45577777554556899999999998654333


No 20 
>smart00008 HormR Domain present in hormone receptors.
Probab=58.49  E-value=7  Score=27.53  Aligned_cols=48  Identities=23%  Similarity=0.486  Sum_probs=37.2

Q ss_pred             ecChhhhhccccCCCCCCCCCccceEeEEeEEccCceec---CCCceEEeccceEEe
Q psy17098         52 ITCPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEV---GTPTYLCKGDGKWYL  105 (204)
Q Consensus        52 ~~CP~~~~~la~Fp~T~ag~~~~slv~v~G~CV~~A~~~---~~P~~~C~~~G~W~~  105 (204)
                      ..||.+-.++.-.|.|.+|..      |.-.|=......   ..-...|+.+|.|..
T Consensus         3 ~~C~~~wD~~~CWp~t~~G~~------~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~   53 (70)
T smart00008        3 LGCPATWDGIICWPQTPAGQL------VEVPCPDYFSGFSYKTGASRNCTENGGWSP   53 (70)
T ss_pred             CCCccccccccCCCCCCCCCE------EEecCCCcccCcCCCCCEEEEcCCCCCCcc
Confidence            469999999999999999753      455787766532   126788999999975


No 21 
>KOG4260|consensus
Probab=57.79  E-value=8.8  Score=34.98  Aligned_cols=22  Identities=45%  Similarity=0.978  Sum_probs=17.2

Q ss_pred             EEecccCeEEccCCCeecCCCce
Q psy17098        173 YLCKGDGKWYLPSGGCKCKPGFE  195 (204)
Q Consensus       173 ~~C~~~G~W~vp~g~C~C~~Gye  195 (204)
                      -+|.+||. .-..|.|.|.+||.
T Consensus       156 G~C~GdGs-R~GsGkCkC~~GY~  177 (350)
T KOG4260|consen  156 GSCHGDGS-REGSGKCKCETGYT  177 (350)
T ss_pred             CcccCCCC-CCCCCcccccCCCC
Confidence            45777773 45789999999997


No 22 
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=56.99  E-value=13  Score=23.45  Aligned_cols=32  Identities=34%  Similarity=0.670  Sum_probs=23.4

Q ss_pred             EEeEEccCceecCCCceEEeccceEEeeccce
Q psy17098         79 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC  110 (204)
Q Consensus        79 v~G~CV~~A~~~~~P~~~C~~~G~W~~p~g~C  110 (204)
                      +.=.|-++=.......+.|..+|+|..+.-.|
T Consensus        25 ~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C   56 (57)
T cd00033          25 VTYSCNEGYTLVGSSTITCTENGGWSPPPPTC   56 (57)
T ss_pred             EEEECCCCCeEeCCCeeEECCCCeECCCCccc
Confidence            45578776555567799999999998665444


No 23 
>PF07699 GCC2_GCC3:  GCC2 and GCC3;  InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []:   Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction [].      Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [].   This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=56.48  E-value=6.4  Score=25.57  Aligned_cols=13  Identities=31%  Similarity=0.777  Sum_probs=9.9

Q ss_pred             CcccCCccccccc
Q psy17098        128 TCHACPETTINFA  140 (204)
Q Consensus       128 ~C~~Cp~g~~k~A  140 (204)
                      .|+.||.|+|+..
T Consensus        10 ~C~~Cp~GtYq~~   22 (48)
T PF07699_consen   10 KCQPCPKGTYQDE   22 (48)
T ss_pred             ccCCCCCCccCCc
Confidence            4588888888755


No 24 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=54.12  E-value=18  Score=23.55  Aligned_cols=17  Identities=35%  Similarity=1.079  Sum_probs=13.0

Q ss_pred             CCCeecCCCceeccccCcc
Q psy17098        185 SGGCKCKPGFEADVEKQTC  203 (204)
Q Consensus       185 ~g~C~C~~Gye~~~~~~~C  203 (204)
                      .|.|.|.+||.+  .+.+|
T Consensus        36 ~g~C~C~~g~~~--~~~~C   52 (52)
T PF01683_consen   36 NGRCQCPPGYVE--VGGRC   52 (52)
T ss_pred             CCEeECCCCCEe--cCCCC
Confidence            489999999985  44454


No 25 
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short  consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino  acid residues and have been identified in several proteins of the  complement system. A missense mutation in seventh CCP domain causes  deficiency of the b subunit of factor XIII.
Probab=53.14  E-value=14  Score=23.10  Aligned_cols=30  Identities=30%  Similarity=0.661  Sum_probs=19.5

Q ss_pred             cccccCccccCCCeEEecccCeEEccCCCe
Q psy17098        159 GRCVENAEEVGTPTYLCKGDGKWYLPSGGC  188 (204)
Q Consensus       159 G~CV~na~~~~~P~~~C~~~G~W~vp~g~C  188 (204)
                      =.|.+.=.-.....+.|..+|+|..+.-.|
T Consensus        28 ~~C~~Gy~l~g~~~~~C~~~g~Ws~~~p~C   57 (57)
T smart00032       28 YSCNPGYTLIGSSTITCLEDGTWSPPAPTC   57 (57)
T ss_pred             EEcCCCCEEcCCCeeEECCCCEECCCCCcC
Confidence            345444332344789999999999765544


No 26 
>KOG0994|consensus
Probab=52.77  E-value=12  Score=40.20  Aligned_cols=59  Identities=36%  Similarity=0.835  Sum_probs=40.5

Q ss_pred             eccceeecCCcccccccccccCCcccCCccccccccCCCCCCCCceeeEEeeecccccCccccCCCeEEecccCeEEccC
Q psy17098        106 PSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPS  185 (204)
Q Consensus       106 p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~  185 (204)
                      .+|.|.|.+.-+    +..|    -.|.++++|+|+      |.       -.-.|--|++  ..|  .||      .-+
T Consensus      1035 ~tGQCpClpNv~----G~~C----DqCA~N~w~laS------G~-------GCe~C~Cd~~--~~p--qCN------~ft 1083 (1758)
T KOG0994|consen 1035 FTGQCPCLPNVQ----GVRC----DQCAENHWNLAS------GE-------GCEPCNCDPI--GGP--QCN------EFT 1083 (1758)
T ss_pred             ccCcCCCCcccc----cccc----cccccchhcccc------CC-------CCCccCCCcc--CCc--ccc------ccc
Confidence            457777775444    3444    899999999995      42       2456666663  344  566      368


Q ss_pred             CCeecCCCce
Q psy17098        186 GGCKCKPGFE  195 (204)
Q Consensus       186 g~C~C~~Gye  195 (204)
                      |+|.|+|||-
T Consensus      1084 GQCqCkpGfG 1093 (1758)
T KOG0994|consen 1084 GQCQCKPGFG 1093 (1758)
T ss_pred             cceeccCCCC
Confidence            9999999995


No 27 
>PF00084 Sushi:  Sushi domain (SCR repeat);  InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13.  Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=52.40  E-value=10  Score=24.04  Aligned_cols=31  Identities=32%  Similarity=0.682  Sum_probs=21.7

Q ss_pred             ecccccCccccCCCeEEecccCeEEccCCCe
Q psy17098        158 TGRCVENAEEVGTPTYLCKGDGKWYLPSGGC  188 (204)
Q Consensus       158 ~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C  188 (204)
                      .=+|-.+-...+...+.|..+|.|.-+.-.|
T Consensus        26 ~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C   56 (56)
T PF00084_consen   26 TFSCNPGYELSGSSTITCQSNGQWSPPPPTC   56 (56)
T ss_dssp             EEEESTTEEEESSSEEEEETTSEESSTTSEE
T ss_pred             EEcCCCCCEecCCCEEEECCCCEECCCCCCC
Confidence            4456555554455889999999999776544


No 28 
>KOG1214|consensus
Probab=42.12  E-value=62  Score=33.99  Aligned_cols=98  Identities=21%  Similarity=0.566  Sum_probs=0.0

Q ss_pred             EEEeccC-CceEeEEEEEEEEecChhhhhccccCCCCCCCCCccceEeEEeEEccCceecCCC-ceEEeccceEEeeccc
Q psy17098         32 YFAFRDQ-GACISLLAIKVYYITCPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEVGTP-TYLCKGDGKWYLPSGG  109 (204)
Q Consensus        32 YlAfqD~-GaCval~sVrVyy~~CP~~~~~la~Fp~T~ag~~~~slv~v~G~CV~~A~~~~~P-~~~C~~~G~W~~p~g~  109 (204)
                      |-+|-|- +.||.|.-=                     +.+.         .|+..+..=..+ +..|.-.|   -..-.
T Consensus       764 gy~F~dd~~tCV~i~~p---------------------ap~n---------~Ce~g~h~C~i~g~a~c~~hG---gs~y~  810 (1289)
T KOG1214|consen  764 GYEFADDRHTCVLITPP---------------------APAN---------PCEDGSHTCAIAGQARCVHHG---GSTYS  810 (1289)
T ss_pred             cceeccCCcceEEecCC---------------------CCCC---------ccccCccccCcCCceEEEecC---CceEE


Q ss_pred             eeecCCcccccccccccCCcccCCccccccccCCCCCCCCceeeEEeeecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098        110 CKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK  189 (204)
Q Consensus       110 C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~  189 (204)
                      |.|.+||.  .++..|... -.|.+.---.+                  -.|+..+                  ....|.
T Consensus       811 C~CLPGfs--GDG~~c~dv-DeC~psrChp~------------------A~Cyntp------------------gsfsC~  851 (1289)
T KOG1214|consen  811 CACLPGFS--GDGHQCTDV-DECSPSRCHPA------------------ATCYNTP------------------GSFSCR  851 (1289)
T ss_pred             EeecCCcc--CCccccccc-cccCccccCCC------------------ceEecCC------------------Ccceee


Q ss_pred             cCCCceeccccCcc
Q psy17098        190 CKPGFEADVEKQTC  203 (204)
Q Consensus       190 C~~Gye~~~~~~~C  203 (204)
                      |.+||+  ..+.+|
T Consensus       852 C~pGy~--GDGf~C  863 (1289)
T KOG1214|consen  852 CQPGYY--GDGFQC  863 (1289)
T ss_pred             cccCcc--CCCcee


No 29 
>KOG4260|consensus
Probab=41.59  E-value=13  Score=33.98  Aligned_cols=39  Identities=31%  Similarity=0.684  Sum_probs=28.3

Q ss_pred             ceEEeccceEEeeccceeecCCcccccccccccCCcccCCcccccccc
Q psy17098         94 TYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAH  141 (204)
Q Consensus        94 ~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~  141 (204)
                      ..+|.+||+ .-.+|.|.|.+||..    ..    |..|-.++|...+
T Consensus       155 nG~C~GdGs-R~GsGkCkC~~GY~G----p~----C~~Cg~eyfes~R  193 (350)
T KOG4260|consen  155 NGSCHGDGS-REGSGKCKCETGYTG----PL----CRYCGIEYFESSR  193 (350)
T ss_pred             CCcccCCCC-CCCCCcccccCCCCC----cc----ccccchHHHHhhc
Confidence            567777774 356899999999983    23    4888887776543


No 30 
>PF06725 3D:  3D domain;  InterPro: IPR010611 This short presumed domain contains three conserved aspartate residues, hence the name 3D. This conservation is suggestive of a cation binding function. The central aspartate is found in a DTG motif that is suggestive of a peptidase like active site.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0009254 peptidoglycan turnover, 0019867 outer membrane; PDB: 2G5D_A 2G6G_A 3CZB_A 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 2PNW_A.
Probab=39.63  E-value=12  Score=26.96  Aligned_cols=55  Identities=22%  Similarity=0.224  Sum_probs=29.3

Q ss_pred             CccCCCCCcCCCCceeeeeEEEEe-ecCCCceEEEeccCCceEeEEEEEEEEecChh
Q psy17098          1 RIAAGEGRFNTNTEVNINTEVKSI-PVTKKGVYFAFRDQGACISLLAIKVYYITCPE   56 (204)
Q Consensus         1 tiAade~~f~~~~~~~~N~e~r~~-pls~~GfYlAfqD~GaCval~sVrVyy~~CP~   56 (204)
                      |||.|-+.++.++.+-|+.+...- ...-.|+|+ -||+|+=+-=-++-||+-.=.+
T Consensus         2 SiAvD~~viPlGt~v~i~~~~~~~~~~~~~~~~~-aqDtGgAIkG~riDif~g~g~~   57 (75)
T PF06725_consen    2 SIAVDPSVIPLGTPVYIPGPPLDDGGKPFNGLYI-AQDTGGAIKGNRIDIFFGSGDE   57 (75)
T ss_dssp             EEEE-TTTS-TT-EEEEEEETTETTTSEEEEEEE-EEEE-TT--SSEEEEEEEESHH
T ss_pred             eEEECCccCCCCCEEEEEeccccCCCceeeEEEE-EeccCccccCCEEEEEeCCHHH
Confidence            689999878777776666641000 011246677 6999985554577777654333


No 31 
>KOG1836|consensus
Probab=31.75  E-value=33  Score=38.14  Aligned_cols=38  Identities=29%  Similarity=0.769  Sum_probs=26.6

Q ss_pred             eEEeccceEE---eeccceeecCCcccccccccccCCcccCCccccccc
Q psy17098         95 YLCKGDGKWY---LPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFA  140 (204)
Q Consensus        95 ~~C~~~G~W~---~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A  140 (204)
                      -+|++.|+-.   .-.|+|.|++|+..    +.|    -.|-++++.+-
T Consensus       398 C~c~~~gsl~~~~~~~g~c~c~P~v~g----~~c----D~ca~g~~~~~  438 (1705)
T KOG1836|consen  398 CICNSAGSLSAQCDDTGRCQCKPGVTG----QKC----DRCAPGFYGLP  438 (1705)
T ss_pred             cCCCCccchhhhhccCCcceecccccc----ccc----CccCcccccCc
Confidence            4566666543   33599999999984    234    78888888775


No 32 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=30.84  E-value=16  Score=23.27  Aligned_cols=18  Identities=28%  Similarity=0.737  Sum_probs=11.9

Q ss_pred             CCCeecCCCceeccccCccC
Q psy17098        185 SGGCKCKPGFEADVEKQTCN  204 (204)
Q Consensus       185 ~g~C~C~~Gye~~~~~~~C~  204 (204)
                      .-+|.|.+||..  .++.|+
T Consensus        20 ~eecrCllgyk~--~~~~C~   37 (37)
T PF12946_consen   20 SEECRCLLGYKK--VGGKCV   37 (37)
T ss_dssp             EEEEEE-TTEEE--ETTEEE
T ss_pred             CEEEEeeCCccc--cCCCcC
Confidence            346899999995  555563


No 33 
>PF01393 Chromo_shadow:  Chromo shadow domain Web page maintained by Rein Aasland;  InterPro: IPR008251 Chromo shadow domain is distantly related to chromo domain. It is always found in association with a chromo domain.  The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting. Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain [], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1); and mammalian modifier 1 and modifier 2. The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigenand and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1.; GO: 0005634 nucleus; PDB: 3Q6S_C 2FMM_B 3P7J_B 1E0B_B 3I3C_A 1DZ1_B 1S4Z_A 3KUP_D.
Probab=29.29  E-value=13  Score=25.65  Aligned_cols=31  Identities=23%  Similarity=0.463  Sum_probs=26.4

Q ss_pred             ceEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098         30 GVYFAFRDQGACISLLAIKVYYITCPETTINF   61 (204)
Q Consensus        30 GfYlAfqD~GaCval~sVrVyy~~CP~~~~~l   61 (204)
                      -||+=|.|.|- +.+++-++-|.+||..+..|
T Consensus        19 ~~likwk~~~~-~~~v~~~~~~~k~Pq~vI~F   49 (58)
T PF01393_consen   19 MFLIKWKNSGE-KDLVPSKEANEKCPQKVIKF   49 (58)
T ss_dssp             EEEEEETTSSS-EEEEEHHHHHHHSHHHHHHH
T ss_pred             EEEEEECCCCc-eEEeeHHHHHHHCcHHHHHH
Confidence            38999999886 88888999999999988654


No 34 
>PHA03001 putative virion core protein; Provisional
Probab=29.20  E-value=51  Score=26.66  Aligned_cols=28  Identities=25%  Similarity=0.660  Sum_probs=22.0

Q ss_pred             ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098         25 PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINF   61 (204)
Q Consensus        25 pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~l   61 (204)
                      .|.+.=||||++|.-         |||++|---...+
T Consensus        48 ~v~eStFylvvrd~d---------iFyfkcdkG~is~   75 (132)
T PHA03001         48 HVDKSTFYLVVKDKD---------IFYFKCDKGSISQ   75 (132)
T ss_pred             EecccEEEEEEecCc---------EEEEEecCCeEee
Confidence            567888999999974         9999997654433


No 35 
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=28.99  E-value=31  Score=25.62  Aligned_cols=29  Identities=24%  Similarity=0.532  Sum_probs=17.0

Q ss_pred             ecCCccccccc-ccccCCcccCCccccccc
Q psy17098        112 CKPGFEADVEK-QTCNGTCHACPETTINFA  140 (204)
Q Consensus       112 C~~Gye~~~~~-~~C~~~C~~Cp~g~~k~A  140 (204)
                      |.+|+.....- ..-+..|+.||+|+|...
T Consensus        17 C~pG~~~~~~C~~~~~t~C~~C~~g~ys~~   46 (98)
T cd00185          17 CPPGTYLSKDCTPGSDTVCEPCPPGTYTDS   46 (98)
T ss_pred             CCCCccCCCcCCCCCCCeecCCCCCCcccC
Confidence            78888853210 000234688888888665


No 36 
>PHA02831 EEV host range protein; Provisional
Probab=28.28  E-value=1.9e+02  Score=25.89  Aligned_cols=33  Identities=18%  Similarity=0.407  Sum_probs=21.6

Q ss_pred             eecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098        157 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK  189 (204)
Q Consensus       157 v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~  189 (204)
                      |.=+|-+.=.-.+.....|..+|.|.-+.=.|.
T Consensus       164 Vty~C~~GY~L~Gss~~tC~~nG~Wsp~~P~C~  196 (268)
T PHA02831        164 VNFKCKKGFILLGSSVSTCDINSIWYPGIPKCV  196 (268)
T ss_pred             EEEEcCCCCEECCCccEEECCCCeECCCCCCcc
Confidence            345565443334556889999999986555553


No 37 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=27.72  E-value=30  Score=21.45  Aligned_cols=18  Identities=44%  Similarity=0.990  Sum_probs=11.8

Q ss_pred             cCCCeecCCCceeccccCcc
Q psy17098        184 PSGGCKCKPGFEADVEKQTC  203 (204)
Q Consensus       184 p~g~C~C~~Gye~~~~~~~C  203 (204)
                      +.-.|.|++||+-  ++..|
T Consensus        19 ~~~~C~C~~Gy~G--dG~~C   36 (36)
T PF12947_consen   19 GSYTCTCKPGYEG--DGFFC   36 (36)
T ss_dssp             TSEEEEE-CEEEC--CSTCE
T ss_pred             CCEEeECCCCCcc--CCcCC
Confidence            3578999999994  44443


No 38 
>PF06138 Chordopox_E11:  Chordopoxvirus E11 protein;  InterPro: IPR009201 This group represents a virion core protein, vaccinia E11L type.
Probab=27.67  E-value=55  Score=26.40  Aligned_cols=28  Identities=21%  Similarity=0.618  Sum_probs=21.9

Q ss_pred             ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098         25 PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINF   61 (204)
Q Consensus        25 pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~l   61 (204)
                      .|.+.=|||+++|.-         |||++|---...+
T Consensus        49 ~veeStFylvvrd~d---------iFyfk~dkG~vs~   76 (130)
T PF06138_consen   49 DVEESTFYLVVRDKD---------IFYFKCDKGRVSP   76 (130)
T ss_pred             EecccEEEEEEecCc---------EEEEEecCCeEee
Confidence            667888999999953         9999997655443


No 39 
>PF11767 SET_assoc:  Histone lysine methyltransferase SET associated;  InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases []. 
Probab=27.04  E-value=40  Score=23.90  Aligned_cols=14  Identities=29%  Similarity=0.634  Sum_probs=11.8

Q ss_pred             cCCCceEEEeccCC
Q psy17098         26 VTKKGVYFAFRDQG   39 (204)
Q Consensus        26 ls~~GfYlAfqD~G   39 (204)
                      .-+.|||+.|.|..
T Consensus        31 ~d~tGfYIvF~~~~   44 (66)
T PF11767_consen   31 DDRTGFYIVFNDSK   44 (66)
T ss_pred             ecCCEEEEEECChH
Confidence            46899999999864


No 40 
>PHA02927 secreted complement-binding protein; Provisional
Probab=26.11  E-value=1.8e+02  Score=25.44  Aligned_cols=32  Identities=16%  Similarity=0.357  Sum_probs=21.9

Q ss_pred             eecccccCccccCCCeEEecccCeEEccCCCe
Q psy17098        157 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC  188 (204)
Q Consensus       157 v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C  188 (204)
                      |.=+|-+.=.-.+.+.+.|..+|.|.-+.=.|
T Consensus       230 v~y~C~~Gy~l~G~~~~~C~~~g~Ws~~~P~C  261 (263)
T PHA02927        230 VDFKCKYGYKLSGSSSSTCSPGNTWQPELPKC  261 (263)
T ss_pred             EEEECCCCCeEcCCCCeEECCCCEECCCCCee
Confidence            44566554444466789999999998655555


No 41 
>PF13121 DUF3976:  Domain of unknown function (DUF3976)
Probab=25.10  E-value=15  Score=23.50  Aligned_cols=8  Identities=50%  Similarity=1.032  Sum_probs=6.9

Q ss_pred             ecCCCceE
Q psy17098         25 PVTKKGVY   32 (204)
Q Consensus        25 pls~~GfY   32 (204)
                      -|+|||||
T Consensus        14 tltkrgfy   21 (41)
T PF13121_consen   14 TLTKRGFY   21 (41)
T ss_pred             eeehhhHH
Confidence            57999998


No 42 
>PHA02639 EEV host range protein; Provisional
Probab=24.88  E-value=3.5e+02  Score=24.38  Aligned_cols=90  Identities=20%  Similarity=0.304  Sum_probs=48.0

Q ss_pred             EEeEEccCceecCCCceEEecc---ceEEeeccceeecCCcccccccccccCCcccCCc--cccccccCCCCCCCCceee
Q psy17098         79 ATGRCVENAEEVGTPTYLCKGD---GKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPE--TTINFAHFPTTPTGREVTF  153 (204)
Q Consensus        79 v~G~CV~~A~~~~~P~~~C~~~---G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~--g~~k~A~Fp~T~~g~~~~s  153 (204)
                      |.=+|.++=.......++|..|   |.|..+.-.|.      .           ..||.  ...+-...+..   . ...
T Consensus        46 V~Y~Cn~GY~L~G~~~~~C~~d~~nG~WS~~~P~C~------~-----------~~C~~Pp~~~nG~i~~~~---~-~y~  104 (295)
T PHA02639         46 IEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCM------L-----------KECNDPPSIINGKIYNKR---E-MYK  104 (295)
T ss_pred             EEEEeCCCCEECCCCeEEEeCCCCCCEECCCCCEEe------e-----------ccCcCCCCCCCcEEecCC---C-ceE
Confidence            5568887766666678999975   68985544443      0           22321  11111111110   0 000


Q ss_pred             -EEeeecccccC---cc-ccCCCeEEecccCeEEccCCCee
Q psy17098        154 -IEQATGRCVEN---AE-EVGTPTYLCKGDGKWYLPSGGCK  189 (204)
Q Consensus       154 -lv~v~G~CV~n---a~-~~~~P~~~C~~~G~W~vp~g~C~  189 (204)
                       -..+.=+|.++   -. -.+..++.|..+|+|.-+.-.|.
T Consensus       105 ~G~~V~y~C~~g~~~gY~L~G~~~~~C~~dG~WS~~~P~C~  145 (295)
T PHA02639        105 VGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICK  145 (295)
T ss_pred             CCCEEEEEeCCCCCCceEECCCCeEEECCCCeECCCCCeee
Confidence             01234567664   11 22456789999999996555563


No 43 
>KOG4564|consensus
Probab=24.13  E-value=81  Score=30.58  Aligned_cols=49  Identities=27%  Similarity=0.457  Sum_probs=37.5

Q ss_pred             ccCCccccccccCCCCCCCCceeeEEeeecccccCcc-----ccCCCeEEecccCeEEcc
Q psy17098        130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAE-----EVGTPTYLCKGDGKWYLP  184 (204)
Q Consensus       130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~-----~~~~P~~~C~~~G~W~vp  184 (204)
                      ..|+..+.+..=.|.|.+|.      .|.-.|..-=.     ..+....+|.+||.|.-+
T Consensus        62 ~~Cn~twD~~lCWp~t~~G~------~v~~~CP~yf~~~~~~~~~~v~R~C~~~G~W~~~  115 (473)
T KOG4564|consen   62 LGCNGTWDGILCWPRTPAGT------LVTVPCPDYFPGFSYSTSGNVTRNCTSNGTWSER  115 (473)
T ss_pred             CCCCCcCCccccCCCCCCCc------eEEecCccccCCCcccCCCceEeecCCCCccCCC
Confidence            57999999999999999985      24567774332     345678999999999954


No 44 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=23.90  E-value=93  Score=17.94  Aligned_cols=10  Identities=40%  Similarity=1.258  Sum_probs=8.3

Q ss_pred             CCeecCCCce
Q psy17098        186 GGCKCKPGFE  195 (204)
Q Consensus       186 g~C~C~~Gye  195 (204)
                      -.|.|.+||.
T Consensus        24 ~~C~C~~g~~   33 (39)
T smart00179       24 YRCECPPGYT   33 (39)
T ss_pred             eEeECCCCCc
Confidence            4589999997


No 45 
>PF04260 DUF436:  Protein of unknown function (DUF436) ;  InterPro: IPR006340 Members of this family are uncharacterised proteins of about 180 amino acids from the Bacillus/Clostridium group of Gram-positive bacteria, found in no more than one copy per genome. ; PDB: 1V8D_C.
Probab=23.56  E-value=34  Score=28.81  Aligned_cols=22  Identities=23%  Similarity=0.662  Sum_probs=13.8

Q ss_pred             cCCCceEEEeccCCceEeEEEEEEEEecChhhhhcccc
Q psy17098         26 VTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAH   63 (204)
Q Consensus        26 ls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~   63 (204)
                      +.++|+|||||                -|-..+|.|..
T Consensus        56 ~~~~gi~LA~Q----------------cCEHlNRALvv   77 (172)
T PF04260_consen   56 LKERGIYLAFQ----------------CCEHLNRALVV   77 (172)
T ss_dssp             HHTTT-EEEEE------------------GGGTT-EEE
T ss_pred             HHHcCcEEEEE----------------chhhhhHHHHh
Confidence            47899999998                36666666654


No 46 
>cd00034 ChSh Chromo Shadow Domain,  found in association with N-terminal chromo (CHRromatin Organization MOdifier) domain; Chromo domains mediate the interaction of the heterochromatin with other heterochromatin proteins, thereby affecting chromatin structure (e.g. Drosophila and human heterochromatin protein (HP1) and mammalian modifier 1 and modifier 2)
Probab=23.48  E-value=27  Score=23.59  Aligned_cols=29  Identities=28%  Similarity=0.474  Sum_probs=25.0

Q ss_pred             eEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098         31 VYFAFRDQGACISLLAIKVYYITCPETTINF   61 (204)
Q Consensus        31 fYlAfqD~GaCval~sVrVyy~~CP~~~~~l   61 (204)
                      ||+=|+|  .=..+++.++-|.+||..+..|
T Consensus        18 fl~kwk~--~~~~lVp~~~~~~k~P~~vI~F   46 (54)
T cd00034          18 FLAKWKD--GQASLVPNKELNVKCPLLVISF   46 (54)
T ss_pred             EEEEEeC--CeEEEEEHHHHHhhCcHHHHHH
Confidence            8888888  6678999999999999988754


No 47 
>KOG0994|consensus
Probab=23.36  E-value=80  Score=34.33  Aligned_cols=39  Identities=28%  Similarity=0.673  Sum_probs=26.9

Q ss_pred             eEEeccceEE----eeccceeecCCcccccccccccCCcccCCcccccccc
Q psy17098         95 YLCKGDGKWY----LPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAH  141 (204)
Q Consensus        95 ~~C~~~G~W~----~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~  141 (204)
                      ..|++.|+-.    +.-|.|.|+|+--    +.+|    --|.+|+|.|-+
T Consensus       765 CnCnptGSlS~vCn~~GGqCqCkPnVV----GR~C----dqCApGtyGFGP  807 (1758)
T KOG0994|consen  765 CNCNPTGSLSSVCNPNGGQCQCKPNVV----GRRC----DQCAPGTYGFGP  807 (1758)
T ss_pred             cccCCCccccccccCCCceecccCccc----cccc----cccCCcccCcCC
Confidence            4466666543    2348999997764    3445    999999999863


No 48 
>COG4475 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.69  E-value=45  Score=28.01  Aligned_cols=28  Identities=21%  Similarity=0.330  Sum_probs=20.3

Q ss_pred             ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhccccCCCCC
Q psy17098         25 PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAHFPTTP   68 (204)
Q Consensus        25 pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~Fp~T~   68 (204)
                      -+.++|+|||||.                |-..++.|..=-++.
T Consensus        60 ~~~~tGv~lA~Q~----------------CEHlNRAlvvEr~~a   87 (180)
T COG4475          60 VLKKTGVHLAFQG----------------CEHLNRALVVEREVA   87 (180)
T ss_pred             HHhccceEEEeeh----------------hhhhhhHhhhhHhhh
Confidence            3578999999984                777777776544443


No 49 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=21.41  E-value=1.1e+02  Score=17.19  Aligned_cols=10  Identities=40%  Similarity=1.222  Sum_probs=8.4

Q ss_pred             CCeecCCCce
Q psy17098        186 GGCKCKPGFE  195 (204)
Q Consensus       186 g~C~C~~Gye  195 (204)
                      ..|.|.+||.
T Consensus        24 ~~C~C~~g~~   33 (38)
T cd00054          24 YRCSCPPGYT   33 (38)
T ss_pred             eEeECCCCCc
Confidence            4699999997


No 50 
>PF07354 Sp38:  Zona-pellucida-binding protein (Sp38);  InterPro: IPR010857 This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90 kDa family of zona pellucida glycoproteins in a calcium-dependent manner []. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur [].; GO: 0007339 binding of sperm to zona pellucida, 0005576 extracellular region
Probab=21.39  E-value=50  Score=29.74  Aligned_cols=31  Identities=26%  Similarity=0.720  Sum_probs=23.0

Q ss_pred             eecCCcccc-cccccccCCcccCCcccccccc
Q psy17098        111 KCKPGFEAD-VEKQTCNGTCHACPETTINFAH  141 (204)
Q Consensus       111 ~C~~Gye~~-~~~~~C~~~C~~Cp~g~~k~A~  141 (204)
                      .|.|||--| .....|.+-|-.|++|+|+...
T Consensus       221 sC~PGfG~N~~~h~~C~~CCVvCsPgTysp~~  252 (271)
T PF07354_consen  221 SCRPGFGKNDILHSDCPSCCVVCSPGTYSPDD  252 (271)
T ss_pred             ccCCCCCcCcccCCCCCCeeEECCCcccCCCC
Confidence            578999854 2234576779999999998763


No 51 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=20.80  E-value=45  Score=24.93  Aligned_cols=10  Identities=60%  Similarity=1.551  Sum_probs=9.0

Q ss_pred             eecCCCceec
Q psy17098        188 CKCKPGFEAD  197 (204)
Q Consensus       188 C~C~~Gye~~  197 (204)
                      |.|.+||+|+
T Consensus       100 C~Cl~GF~P~  109 (110)
T PF00954_consen  100 CSCLPGFEPK  109 (110)
T ss_pred             eECCCCcCCC
Confidence            9999999983


No 52 
>TIGR01440 conserved hypothetical protein TIGR01440. Members of this family are uncharacterized proteins of about 180 amino acids from the Bacillus/Clostridium group of Gram-positive bacteria, found in no more than one copy per genome.
Probab=20.12  E-value=50  Score=27.83  Aligned_cols=26  Identities=23%  Similarity=0.401  Sum_probs=18.8

Q ss_pred             cCCCceEEEeccCCceEeEEEEEEEEecChhhhhccccCCCC
Q psy17098         26 VTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAHFPTT   67 (204)
Q Consensus        26 ls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~Fp~T   67 (204)
                      +.++|+|||||                -|-..+|.|..=-++
T Consensus        56 ~~~~gi~lA~Q----------------~CEHlNRALvvEr~~   81 (172)
T TIGR01440        56 LKKTGVTLAFQ----------------GCEHINRALVMERSV   81 (172)
T ss_pred             hhhcCeEEEEe----------------chhhhhHHHHHhHHH
Confidence            35899999998                477777777654433


Done!