Query         psy14262
Match_columns 482
No_of_seqs    447 out of 2860
Neff          7.5 
Searched_HMMs 46136
Date          Fri Aug 16 23:30:56 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14262.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14262hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1215|consensus               99.8 3.5E-20 7.6E-25  211.5  18.4  324  115-481    25-426 (877)
  2 KOG1215|consensus               99.2 3.8E-11 8.3E-16  137.6  12.4  166  120-312   142-319 (877)
  3 PF00057 Ldl_recept_a:  Low-den  98.9   9E-10 1.9E-14   75.0   3.3   36    4-45      2-37  (37)
  4 cd00112 LDLa Low Density Lipop  98.9 8.4E-10 1.8E-14   74.3   2.5   35    5-45      1-35  (35)
  5 PF00057 Ldl_recept_a:  Low-den  98.8 3.6E-09 7.8E-14   72.0   3.3   35  164-199     2-36  (37)
  6 cd00112 LDLa Low Density Lipop  98.7   6E-09 1.3E-13   70.1   2.7   34  165-199     1-34  (35)
  7 smart00192 LDLa Low-density li  98.7 1.7E-08 3.7E-13   67.0   2.8   33    4-42      1-33  (33)
  8 smart00192 LDLa Low-density li  98.5 1.1E-07 2.3E-12   63.1   2.7   32  165-197     2-33  (33)
  9 KOG1214|consensus               98.4   3E-07 6.5E-12   99.0   6.4  147  290-455   695-865 (1289)
 10 PF12999 PRKCSH-like:  Glucosid  98.4 2.2E-07 4.8E-12   84.8   4.7   68  167-237    35-110 (176)
 11 PF14670 FXa_inhibition:  Coagu  98.4 2.8E-07   6E-12   62.2   3.5   33  421-453     4-36  (36)
 12 KOG1219|consensus               98.2 1.8E-06 3.9E-11  100.8   5.7  105  337-455  3864-3978(4289)
 13 PF12999 PRKCSH-like:  Glucosid  98.1   3E-06 6.4E-11   77.5   4.2   68  207-282    35-110 (176)
 14 PF07645 EGF_CA:  Calcium-bindi  98.1 5.3E-06 1.2E-10   58.2   4.2   39  415-453     1-42  (42)
 15 PF12662 cEGF:  Complement Clr-  98.0 3.8E-06 8.2E-11   51.1   2.1   24  435-458     1-24  (24)
 16 KOG1219|consensus               97.4 0.00017 3.6E-09   85.2   4.9  102  289-414  3866-3977(4289)
 17 KOG1214|consensus               97.3 0.00049 1.1E-08   75.0   6.8  112  344-462   699-833 (1289)
 18 KOG4260|consensus               97.2 0.00082 1.8E-08   64.8   6.3   46  401-446   221-269 (350)
 19 PF14670 FXa_inhibition:  Coagu  97.0 0.00018 3.8E-09   48.6   0.0   30  384-413     5-36  (36)
 20 PF12662 cEGF:  Complement Clr-  96.8 0.00079 1.7E-08   41.0   1.7   22  397-418     1-24  (24)
 21 cd01475 vWA_Matrilin VWA_Matri  96.2  0.0039 8.5E-08   60.0   3.5   44  409-452   180-224 (224)
 22 KOG2397|consensus               96.1  0.0055 1.2E-07   63.9   4.3   69  168-239    43-115 (480)
 23 smart00179 EGF_CA Calcium-bind  95.9   0.009   2E-07   40.3   3.4   35  416-453     2-38  (39)
 24 KOG1217|consensus               95.9    0.21 4.6E-06   52.6  15.2  116  338-461   272-405 (487)
 25 KOG4289|consensus               95.4    0.02 4.3E-07   66.2   5.2   88  357-455  1221-1317(2531)
 26 KOG1217|consensus               95.3    0.15 3.1E-06   53.8  11.3  150  278-459   158-315 (487)
 27 PF07645 EGF_CA:  Calcium-bindi  95.0  0.0073 1.6E-07   42.2   0.3   26  381-406     4-33  (42)
 28 KOG4260|consensus               94.8    0.06 1.3E-06   52.2   6.0   76  365-447   221-306 (350)
 29 KOG3509|consensus               94.3    0.17 3.7E-06   57.8   9.0  104  257-374     2-109 (964)
 30 KOG2397|consensus               93.8   0.056 1.2E-06   56.6   3.8   68  209-284    44-115 (480)
 31 cd00054 EGF_CA Calcium-binding  93.4   0.099 2.1E-06   34.5   3.3   30  416-445     2-33  (38)
 32 smart00181 EGF Epidermal growt  93.4   0.094   2E-06   34.5   3.1   25  423-447     6-31  (35)
 33 KOG4289|consensus               93.0   0.087 1.9E-06   61.2   3.7   49  394-445  1218-1269(2531)
 34 PF12947 EGF_3:  EGF domain;  I  92.9   0.099 2.1E-06   35.2   2.6   30  422-453     5-36  (36)
 35 KOG3509|consensus               92.9   0.098 2.1E-06   59.7   4.1  103  177-286     2-111 (964)
 36 smart00179 EGF_CA Calcium-bind  91.4    0.16 3.6E-06   33.9   2.4   32  381-413     4-38  (39)
 37 cd00053 EGF Epidermal growth f  89.3    0.46   1E-05   30.6   3.1   24  423-446     6-31  (36)
 38 PF12947 EGF_3:  EGF domain;  I  89.2   0.087 1.9E-06   35.5  -0.5   25  389-413    12-36  (36)
 39 smart00181 EGF Epidermal growt  88.7    0.34 7.3E-06   31.7   2.1   22  385-406     6-28  (35)
 40 PF00008 EGF:  EGF-like domain   84.6       1 2.2E-05   29.3   2.6   23  423-445     4-29  (32)
 41 PF00008 EGF:  EGF-like domain   84.2    0.23 4.9E-06   32.4  -0.7   23  384-406     3-28  (32)
 42 cd00054 EGF_CA Calcium-binding  82.0     1.2 2.6E-05   29.1   2.2   26  381-406     4-32  (38)
 43 cd00053 EGF Epidermal growth f  81.3     1.3 2.8E-05   28.4   2.1   18  389-406    12-29  (36)
 44 KOG1225|consensus               80.2       7 0.00015   42.3   8.3   72  210-310   260-331 (525)
 45 cd01475 vWA_Matrilin VWA_Matri  78.7     1.4   3E-05   42.2   2.3   23  384-406   194-216 (224)
 46 KOG1225|consensus               78.3     9.8 0.00021   41.2   8.7   62  365-445   299-362 (525)
 47 PF06247 Plasmod_Pvs28:  Plasmo  68.3     2.9 6.3E-05   38.8   1.6   57  392-448    14-82  (197)
 48 PF09064 Tme5_EGF_like:  Thromb  60.6     8.9 0.00019   25.4   2.3   24  424-448     7-30  (34)
 49 PF12661 hEGF:  Human growth fa  59.0     7.8 0.00017   20.0   1.5    9  437-445     1-9   (13)
 50 PF12946 EGF_MSP1_1:  MSP1 EGF   31.0      48   0.001   22.5   2.2   22  426-447    10-32  (37)
 51 PF06247 Plasmod_Pvs28:  Plasmo  27.1      13 0.00028   34.6  -1.4   98  346-450    51-165 (197)

No 1  
>KOG1215|consensus
Probab=99.84  E-value=3.5e-20  Score=211.54  Aligned_cols=324  Identities=32%  Similarity=0.614  Sum_probs=250.1

Q ss_pred             CCCCcceeeCCceecCceeeCCCCCCCcccCCCCCCccccccCCCC----------------------------------
Q psy14262        115 GENRSPLVTKNIQWPSSLVIDYNQDILYWCDSHLNTVESVRVDGSH----------------------------------  160 (482)
Q Consensus       115 ~~~~~~~~~~~~ci~~~~~cdg~~dc~dw~d~~~~sdE~~~~~~~c----------------------------------  160 (482)
                      |+.+..++-...-||+++..|....+.+|.|++...+|..++++.-                                  
T Consensus        25 g~~~~~~~~~~~~~~ng~~id~~~~~~y~~d~~~~~i~~~~~dg~~r~~l~~~~~~~y~~d~~v~~~~~~sg~~~~~~~~  104 (877)
T KOG1215|consen   25 GGERKILEKEEFEWPNGLTIDLAWQRIYWADAKNDLIESANYDGSGRRALTLFEDGLYWTDKSVSAANKKTGKDVTRLSQ  104 (877)
T ss_pred             CcceEEeeccceeCCCcceecchhheeeeccccCCceEEeccCCccceeeeeeccceeeccchhhhhccCCCCcceeehh
Confidence            3444455555677888888888899999999999888887776631                                  


Q ss_pred             -----------------------------CCccCCCCCeecC--CCCceecCCcccCCCCCCCCCCCCCCCCC---CCCC
Q psy14262        161 -----------------------------NAVLCHPNELRCG--SSNQCVALSKFCDGKPDCSNGEDEGPFCN---AECT  206 (482)
Q Consensus       161 -----------------------------~~~~C~~~~f~C~--~~~~Ci~~~~~CDg~~dC~dgsDE~~~C~---~~C~  206 (482)
                                                   +...|...+|+|.  +. +||+..|+|||..+|.+|+||.+ |.   ....
T Consensus       105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~-~Cip~~~~cd~~~~C~dg~de~~-~~~~~~~~~  182 (877)
T KOG1215|consen  105 DSHFPLDIHAYHPSSQPLAPDPCAESGNGPCSHCCLDKFSCRTGSC-KCIPGDWLCDGEADCPDGSDELN-CAVRRCEPR  182 (877)
T ss_pred             cCCCCcceeEEecCCCCCCCCcccccCCCCCccccCCCCCCcCccc-cCCCCceeCCCCCccccchhhhc-ccccccCcc
Confidence                                         0123566789998  44 89999999999999999999998 64   1222


Q ss_pred             CCceecCCCCeeccCCCCCCCcCCCCCCCCCCCccccccc---CCccccCCCCceecCCCccCccCCCCCCCCCCCCCCC
Q psy14262        207 PGQFSCGMGKCIPESWKCDGFVDCLEDGSDEPNTCIHQCF---NGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDED  283 (482)
Q Consensus       207 ~~~f~C~~g~Ci~~~~~CDg~~dC~~dgsDE~~C~~~~C~---~~~f~C~~~g~Ci~~~~~CDg~~dC~~~~~~dgsDE~  283 (482)
                      ...|+|     |...++||+..+| .+++|+.......+.   ...++|.+.++||...+.|||..+|.     +++||.
T Consensus       183 ~~~~~~-----~~~~~~~d~~~~~-~~~~d~~~~~~~~~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~-----~~~de~  251 (877)
T KOG1215|consen  183 GASLDC-----IVAIKVCDIQHDC-ADDYDESEGRIYWTDDSRIEVTRCDGSSRCILISEVCDGPRDCV-----DGPDEG  251 (877)
T ss_pred             cccccc-----ceeeeecCccccc-ccccccccCcccccCCcceeEEEecCCCcEEeehhccCCCcccc-----cCCcCc
Confidence            344555     9999999999999 999999887544443   46788985569999999999999999     999994


Q ss_pred             --CCCCccccCcccccCCCCccCCCCCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCcccccccccC
Q psy14262        284 --WDKFHQCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQCVALSKFCD  361 (482)
Q Consensus       284 --~C~~~~C~~~~f~C~~g~~Ci~~~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~Ci~~~~~CD  361 (482)
                        ++....|...++.|.++ .|++...+|+|..||+     +|+||..   |..  ..+.+..|.|..+ + |...++|+
T Consensus       252 ~~~~~~~~~~~~e~~~~~~-~~~~~~~~~~g~~d~p-----dg~de~~---~~~--~~~~~~~~d~~~~-~-i~~~~~~~  318 (877)
T KOG1215|consen  252 VMNCSDATCEAPEIECADG-DCSDRQKLCDGDLDCP-----DGLDEDY---CKK--KLYWSMNVDGSGR-R-ILLSKLCH  318 (877)
T ss_pred             eeEeeccccCCcceeecCC-CCccceEEecCccCCC-----Ccccccc---ccc--ceeeeeecccCCc-e-eeecccCc
Confidence              66667788889999876 9999999999999999     9999987   754  2233667888877 8 99999998


Q ss_pred             CCCCCCCCCCCCCCCCCCCCcc--CCCCcCCCCCCCCCccccCCCCc--cCCCCCccccccccCCCCCCceee-ecCCce
Q psy14262        362 GKPDCSNGEDEGPFCNNKADCE--NLKCSHKCSPTPRGAMCFCPAGQ--NPNKTQCVDADECLIDGSCDQTCT-NTHLSY  436 (482)
Q Consensus       362 g~~dC~dgsDE~~~C~~~~~C~--~~~C~~~C~~~~~~~~C~C~~G~--~~~~~~C~di~eC~~~~~Cs~~C~-n~~gsy  436 (482)
                      +     ...+..+      ++.  ...+.+.+.+......+.|..++  .....+ ....++..++.|+|.|. +.++.|
T Consensus       319 ~-----~~~~~~~------~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~Csq~C~~~~p~~~  386 (877)
T KOG1215|consen  319 G-----YWTDGLN------ECAERVLKCSHKCPDVSVGPRCDCMGAKVLPLGART-DSNPCESDNGGCSQLCVPNSPGTF  386 (877)
T ss_pred             c-----ccccccc------cchhhcccccCCCCccccCCcccCCccceecccccc-cCCcccccCCccceeccCCCCCce
Confidence            8     1233322      443  45666778888888999999988  222322 12233336799999999 569999


Q ss_pred             EEecCCCeEECCCCCccccCCCCCCCCcEEEeccCCceEEEeccC
Q psy14262        437 VCSCTEGYTLRPDGKSCTAVNVPQDEPASIIFSSTVDIRRIHLNG  481 (482)
Q Consensus       437 ~C~C~~Gy~l~~d~~tC~didec~~~p~~l~~~~~~~ir~~~l~~  481 (482)
                      +|.|..||.+..++  |...+   ...++|+|+++++||++.|..
T Consensus       387 ~c~c~~g~~~~~~~--c~~~~---~~~~~l~~s~~~~ir~~~~~~  426 (877)
T KOG1215|consen  387 KCACSPGYELRLDK--CEASD---QPEAFLLFSNRHDIRRISLDC  426 (877)
T ss_pred             eEecCCCcEeccCC--ceecC---CCCcEEEEecCccceecccCC
Confidence            99999999998766  66554   334599999999999999875


No 2  
>KOG1215|consensus
Probab=99.24  E-value=3.8e-11  Score=137.56  Aligned_cols=166  Identities=27%  Similarity=0.421  Sum_probs=136.8

Q ss_pred             ceeeC--C-ceecCceeeCCCCCCCcccCCCCCCccccccCCCCCCcc--CCCCCeecCCCCceecCCcccCCCCCCCCC
Q psy14262        120 PLVTK--N-IQWPSSLVIDYNQDILYWCDSHLNTVESVRVDGSHNAVL--CHPNELRCGSSNQCVALSKFCDGKPDCSNG  194 (482)
Q Consensus       120 ~~~~~--~-~ci~~~~~cdg~~dc~dw~d~~~~sdE~~~~~~~c~~~~--C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dg  194 (482)
                      .|.|.  . .|||..|+||+..+|.+      |+||.     .+....  .....|.|      |...++||+..+|.++
T Consensus       142 ~~~c~~~~~~Cip~~~~cd~~~~C~d------g~de~-----~~~~~~~~~~~~~~~~------~~~~~~~d~~~~~~~~  204 (877)
T KOG1215|consen  142 KFSCRTGSCKCIPGDWLCDGEADCPD------GSDEL-----NCAVRRCEPRGASLDC------IVAIKVCDIQHDCADD  204 (877)
T ss_pred             CCCCcCccccCCCCceeCCCCCcccc------chhhh-----cccccccCcccccccc------ceeeeecCcccccccc
Confidence            45565  4 99999999999999999      99999     454221  22233444      8899999999999999


Q ss_pred             CCCCCCCC-CCCC---CCceecCCC-CeeccCCCCCCCcCCCCCCCCCC--CcccccccCCccccCCCCceecCCCccCc
Q psy14262        195 EDEGPFCN-AECT---PGQFSCGMG-KCIPESWKCDGFVDCLEDGSDEP--NTCIHQCFNGQVQCALSKKCIPHNWLCDE  267 (482)
Q Consensus       195 sDE~~~C~-~~C~---~~~f~C~~g-~Ci~~~~~CDg~~dC~~dgsDE~--~C~~~~C~~~~f~C~~~g~Ci~~~~~CDg  267 (482)
                      +|+.. +. ..+.   ...++|..+ +||..++.|||..|| .+++||.  ++....|...++.|. ++.|++..++|+|
T Consensus       205 ~d~~~-~~~~~~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc-~~~~de~~~~~~~~~~~~~e~~~~-~~~~~~~~~~~~g  281 (877)
T KOG1215|consen  205 YDESE-GRIYWTDDSRIEVTRCDGSSRCILISEVCDGPRDC-VDGPDEGVMNCSDATCEAPEIECA-DGDCSDRQKLCDG  281 (877)
T ss_pred             ccccc-CcccccCCcceeEEEecCCCcEEeehhccCCCccc-ccCCcCceeEeeccccCCcceeec-CCCCccceEEecC
Confidence            99987 43 1122   467899875 999999999999999 9999994  676667888899998 9999999999999


Q ss_pred             cCCCCCCCCCCCCCCCCCCCccccCcccccCCCCccCCCCCCCCC
Q psy14262        268 EMDCSTDDIVDNTDEDWDKFHQCFNGQVQCALSKKCIPHNWLCDE  312 (482)
Q Consensus       268 ~~dC~~~~~~dgsDE~~C~~~~C~~~~f~C~~g~~Ci~~~~~CDg  312 (482)
                      ..+|+     +|+||..|....+....|.|..+ + |+...+|++
T Consensus       282 ~~d~p-----dg~de~~~~~~~~~~~~~d~~~~-~-i~~~~~~~~  319 (877)
T KOG1215|consen  282 DLDCP-----DGLDEDYCKKKLYWSMNVDGSGR-R-ILLSKLCHG  319 (877)
T ss_pred             ccCCC-----CcccccccccceeeeeecccCCc-e-eeecccCcc
Confidence            99999     99999998866665778899866 8 999998876


No 3  
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.91  E-value=9e-10  Score=74.98  Aligned_cols=36  Identities=39%  Similarity=0.985  Sum_probs=33.2

Q ss_pred             ccCCCCeeeCCCCCeecCccccCCCCCCCCCCCCCCCCcccc
Q psy14262          4 QCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWD   45 (482)
Q Consensus         4 ~C~~~~f~C~~~~~Ci~~~~~CDg~~dC~~~~~~d~sDE~~~   45 (482)
                      .|.+++|+|. ++.|||..|+|||+.||+     |+|||.+|
T Consensus         2 ~C~~~~f~C~-~~~CI~~~~~CDg~~DC~-----dgsDE~~C   37 (37)
T PF00057_consen    2 TCPPGEFRCG-NGQCIPKSWVCDGIPDCP-----DGSDEQNC   37 (37)
T ss_dssp             SSSTTEEEET-TSSEEEGGGTTSSSCSSS-----SSTTTSSH
T ss_pred             cCcCCeeEcC-CCCEEChHHcCCCCCCCC-----CCcccccC
Confidence            6899999999 888999999999999998     99999754


No 4  
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.89  E-value=8.4e-10  Score=74.27  Aligned_cols=35  Identities=43%  Similarity=1.048  Sum_probs=32.0

Q ss_pred             cCCCCeeeCCCCCeecCccccCCCCCCCCCCCCCCCCcccc
Q psy14262          5 CFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWD   45 (482)
Q Consensus         5 C~~~~f~C~~~~~Ci~~~~~CDg~~dC~~~~~~d~sDE~~~   45 (482)
                      |.+++|+|. +++|||..|+|||+.||+     |+|||..|
T Consensus         1 C~~~~f~C~-~~~Ci~~~~~CDg~~DC~-----dgsDE~~C   35 (35)
T cd00112           1 CPPNEFRCA-NGRCIPSSWVCDGEDDCG-----DGSDEENC   35 (35)
T ss_pred             CCCCeEEcC-CCCeeCHHHcCCCccCCC-----CCcccccC
Confidence            678999998 799999999999999998     99999743


No 5  
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.81  E-value=3.6e-09  Score=72.01  Aligned_cols=35  Identities=51%  Similarity=1.155  Sum_probs=18.4

Q ss_pred             cCCCCCeecCCCCceecCCcccCCCCCCCCCCCCCC
Q psy14262        164 LCHPNELRCGSSNQCVALSKFCDGKPDCSNGEDEGP  199 (482)
Q Consensus       164 ~C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE~~  199 (482)
                      .|.+.+|+|.++ +||+..|+|||+.||.|||||.+
T Consensus         2 ~C~~~~f~C~~~-~CI~~~~~CDg~~DC~dgsDE~~   36 (37)
T PF00057_consen    2 TCPPGEFRCGNG-QCIPKSWVCDGIPDCPDGSDEQN   36 (37)
T ss_dssp             SSSTTEEEETTS-SEEEGGGTTSSSCSSSSSTTTSS
T ss_pred             cCcCCeeEcCCC-CEEChHHcCCCCCCCCCCccccc
Confidence            345555555554 35555555555555555555543


No 6  
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.74  E-value=6e-09  Score=70.11  Aligned_cols=34  Identities=47%  Similarity=1.116  Sum_probs=20.9

Q ss_pred             CCCCCeecCCCCceecCCcccCCCCCCCCCCCCCC
Q psy14262        165 CHPNELRCGSSNQCVALSKFCDGKPDCSNGEDEGP  199 (482)
Q Consensus       165 C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE~~  199 (482)
                      |.+.+|+|.++ +||+..++|||+.||+|||||.+
T Consensus         1 C~~~~f~C~~~-~Ci~~~~~CDg~~DC~dgsDE~~   34 (35)
T cd00112           1 CPPNEFRCANG-RCIPSSWVCDGEDDCGDGSDEEN   34 (35)
T ss_pred             CCCCeEEcCCC-CeeCHHHcCCCccCCCCCccccc
Confidence            44556666664 66666666666666666666654


No 7  
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.65  E-value=1.7e-08  Score=67.00  Aligned_cols=33  Identities=48%  Similarity=1.145  Sum_probs=30.2

Q ss_pred             ccCCCCeeeCCCCCeecCccccCCCCCCCCCCCCCCCCc
Q psy14262          4 QCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDE   42 (482)
Q Consensus         4 ~C~~~~f~C~~~~~Ci~~~~~CDg~~dC~~~~~~d~sDE   42 (482)
                      .|.+.+|+|. ++.|||..|+|||++||+     |+|||
T Consensus         1 ~C~~~~f~C~-~~~Ci~~~~~Cdg~~dC~-----dgsDE   33 (33)
T smart00192        1 TCPPGEFQCD-NGRCIPLSWVCDGVDDCS-----DGSDE   33 (33)
T ss_pred             CCCCCeEECC-CCCEECchhhCCCcCcCc-----CCCCC
Confidence            3777899998 899999999999999998     88997


No 8  
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.47  E-value=1.1e-07  Score=63.14  Aligned_cols=32  Identities=50%  Similarity=1.183  Sum_probs=24.6

Q ss_pred             CCCCCeecCCCCceecCCcccCCCCCCCCCCCC
Q psy14262        165 CHPNELRCGSSNQCVALSKFCDGKPDCSNGEDE  197 (482)
Q Consensus       165 C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE  197 (482)
                      |...+|+|.++ .||+..++|||+.||+|||||
T Consensus         2 C~~~~f~C~~~-~Ci~~~~~Cdg~~dC~dgsDE   33 (33)
T smart00192        2 CPPGEFQCDNG-RCIPLSWVCDGVDDCSDGSDE   33 (33)
T ss_pred             CCCCeEECCCC-CEECchhhCCCcCcCcCCCCC
Confidence            44567888766 688888888888888888887


No 9  
>KOG1214|consensus
Probab=98.43  E-value=3e-07  Score=99.03  Aligned_cols=147  Identities=32%  Similarity=0.762  Sum_probs=100.3

Q ss_pred             ccCcccccCCCCccCCC---CCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCccccc--ccccCCCC
Q psy14262        290 CFNGQVQCALSKKCIPH---NWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQCVAL--SKFCDGKP  364 (482)
Q Consensus       290 C~~~~f~C~~g~~Ci~~---~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~Ci~~--~~~CDg~~  364 (482)
                      |..+.-.|..+..|.+.   .|.|    .|+.++-+||   .+   |.. +..|......|+....||+.  +++|    
T Consensus       695 Cy~gsh~cdt~a~C~pg~~~~~tc----ecs~g~~gdg---r~---c~d-~~eca~~~~~CGp~s~Cin~pg~~rc----  759 (1289)
T KOG1214|consen  695 CYDGSHMCDTTARCHPGTGVDYTC----ECSSGYQGDG---RN---CVD-ENECATGFHRCGPNSVCINLPGSYRC----  759 (1289)
T ss_pred             ceecCcccCCCccccCCCCcceEE----EEeeccCCCC---CC---CCC-hhhhccCCCCCCCCceeecCCCceeE----
Confidence            43334445555566653   4555    5664443333   33   443 23566666788887799976  7888    


Q ss_pred             CCCCC---CCCCCCCC-------CCCCcc--CCCCc----CCCCCC-CCCccccCCCCccCCCCCccccccccCCCCC--
Q psy14262        365 DCSNG---EDEGPFCN-------NKADCE--NLKCS----HKCSPT-PRGAMCFCPAGQNPNKTQCVDADECLIDGSC--  425 (482)
Q Consensus       365 dC~dg---sDE~~~C~-------~~~~C~--~~~C~----~~C~~~-~~~~~C~C~~G~~~~~~~C~di~eC~~~~~C--  425 (482)
                      .|.-|   +|+...|.       .+ .|.  ++.|.    .+|+.. .+.|.|.|.+||..++..|.+++||. +..|  
T Consensus       760 eC~~gy~F~dd~~tCV~i~~pap~n-~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~-psrChp  837 (1289)
T KOG1214|consen  760 ECRSGYEFADDRHTCVLITPPAPAN-PCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECS-PSRCHP  837 (1289)
T ss_pred             EEeecceeccCCcceEEecCCCCCC-ccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccC-ccccCC
Confidence            77776   46655576       23 565  34442    145543 34799999999988999999999998 7788  


Q ss_pred             CceeeecCCceEEecCCCeEECCCCCcccc
Q psy14262        426 DQTCTNTHLSYVCSCTEGYTLRPDGKSCTA  455 (482)
Q Consensus       426 s~~C~n~~gsy~C~C~~Gy~l~~d~~tC~d  455 (482)
                      ..+|+|++|+|.|.|.+||.  .|+..|..
T Consensus       838 ~A~CyntpgsfsC~C~pGy~--GDGf~CVP  865 (1289)
T KOG1214|consen  838 AATCYNTPGSFSCRCQPGYY--GDGFQCVP  865 (1289)
T ss_pred             CceEecCCCcceeecccCcc--CCCceecC
Confidence            45999999999999999998  45777764


No 10 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=98.43  E-value=2.2e-07  Score=84.76  Aligned_cols=68  Identities=35%  Similarity=0.688  Sum_probs=60.1

Q ss_pred             CCCeecCCCCce-ecCCcccCCCCCCCCCCCCCCCCCCCCCCCceecCCC----CeeccCCCCCCCcC---CCCCCCCC
Q psy14262        167 PNELRCGSSNQC-VALSKFCDGKPDCSNGEDEGPFCNAECTPGQFSCGMG----KCIPESWKCDGFVD---CLEDGSDE  237 (482)
Q Consensus       167 ~~~f~C~~~~~C-i~~~~~CDg~~dC~dgsDE~~~C~~~C~~~~f~C~~g----~Ci~~~~~CDg~~d---C~~dgsDE  237 (482)
                      .+.|+|.++.+= |+.+.+.|++-||+|||||.++  ..|+...|.|.|.    .-||.+++=||+-|   | -|||||
T Consensus        35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPGT--sAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~C-CDGSDE  110 (176)
T PF12999_consen   35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPGT--SACSNGKFYCENKGHIPRYIPSSRVNDGICDYDIC-CDGSDE  110 (176)
T ss_pred             CCceEecCCCCceecHHHccCcceeCCCCCCcccc--ccCcCceEeeccCCCCCceeehhhhcCCcCccccc-CCCCCC
Confidence            357999988666 9999999999999999999872  3688789999875    67899999999999   9 999999


No 11 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.40  E-value=2.8e-07  Score=62.18  Aligned_cols=33  Identities=48%  Similarity=1.095  Sum_probs=28.8

Q ss_pred             CCCCCCceeeecCCceEEecCCCeEECCCCCcc
Q psy14262        421 IDGSCDQTCTNTHLSYVCSCTEGYTLRPDGKSC  453 (482)
Q Consensus       421 ~~~~Cs~~C~n~~gsy~C~C~~Gy~l~~d~~tC  453 (482)
                      .++.|+|+|++++++|+|.|++||+|..|+++|
T Consensus         4 ~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    4 NNGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             GGGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred             CCCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence            358899999999999999999999999999987


No 12 
>KOG1219|consensus
Probab=98.18  E-value=1.8e-06  Score=100.77  Aligned_cols=105  Identities=28%  Similarity=0.726  Sum_probs=86.5

Q ss_pred             cccccCCCcccCCCCccccc---ccccCCCCCCCCCCCCCCCCCCC-CCccCCCCcC--CCCCCCCCccccCCCCccCCC
Q psy14262        337 AVLCHPNELRCGSSNQCVAL---SKFCDGKPDCSNGEDEGPFCNNK-ADCENLKCSH--KCSPTPRGAMCFCPAGQNPNK  410 (482)
Q Consensus       337 ~~~C~~~~f~C~~g~~Ci~~---~~~CDg~~dC~dgsDE~~~C~~~-~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~  410 (482)
                      +..|+  ..+|.+|+.|++.   .|.|    .|+..+-+.+ |.+. .+|.++||..  +|....++|.|.|+.||  .|
T Consensus      3864 ~d~C~--~npCqhgG~C~~~~~ggy~C----kCpsqysG~~-CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gy--TG 3934 (4289)
T KOG1219|consen 3864 TDPCN--DNPCQHGGTCISQPKGGYKC----KCPSQYSGNH-CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGY--TG 3934 (4289)
T ss_pred             ccccc--cCcccCCCEecCCCCCceEE----eCcccccCcc-cccccccccCCCCCCCCEEEecCCCeeEeCCCCc--cC
Confidence            34563  4589999999986   7899    9999888888 8732 2788888875  89999999999999999  78


Q ss_pred             CCcc--ccccccCCCCC--CceeeecCCceEEecCCCeEECCCCCcccc
Q psy14262        411 TQCV--DADECLIDGSC--DQTCTNTHLSYVCSCTEGYTLRPDGKSCTA  455 (482)
Q Consensus       411 ~~C~--di~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~l~~d~~tC~d  455 (482)
                      .+|+  .++||+ ...|  .++|+|++|+|.|.|.+||.    +++|.+
T Consensus      3935 ~~Ce~~Gi~eCs-~n~C~~gg~C~n~~gsf~CncT~g~~----gr~c~~ 3978 (4289)
T KOG1219|consen 3935 KRCEARGISECS-KNVCGTGGQCINIPGSFHCNCTPGIL----GRTCCA 3978 (4289)
T ss_pred             ceeecccccccc-cccccCCceeeccCCceEeccChhHh----cccCcc
Confidence            8885  378998 5677  56999999999999999997    677643


No 13 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=98.09  E-value=3e-06  Score=77.46  Aligned_cols=68  Identities=44%  Similarity=0.785  Sum_probs=58.4

Q ss_pred             CCceecCCC-Ce-eccCCCCCCCcCCCCCCCCCCCcccccccCCccccCCCC---ceecCCCccCccCC---CCCCCCCC
Q psy14262        207 PGQFSCGMG-KC-IPESWKCDGFVDCLEDGSDEPNTCIHQCFNGQVQCALSK---KCIPHNWLCDEEMD---CSTDDIVD  278 (482)
Q Consensus       207 ~~~f~C~~g-~C-i~~~~~CDg~~dC~~dgsDE~~C~~~~C~~~~f~C~~~g---~Ci~~~~~CDg~~d---C~~~~~~d  278 (482)
                      .+.|.|.+| .= |+.+++.|++.|| +|||||.+-  ..|..+.|+|.|.|   .-||.++|=||+-|   |=     |
T Consensus        35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC-~DGSDEPGT--sAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CC-----D  106 (176)
T PF12999_consen   35 NGKFTCLDGSKIVIPFSQINDDYCDC-PDGSDEPGT--SACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICC-----D  106 (176)
T ss_pred             CCceEecCCCCceecHHHccCcceeC-CCCCCcccc--ccCcCceEeeccCCCCCceeehhhhcCCcCcccccC-----C
Confidence            457999887 34 8999999999999 999999853  35777899999655   57889999999999   98     9


Q ss_pred             CCCC
Q psy14262        279 NTDE  282 (482)
Q Consensus       279 gsDE  282 (482)
                      ||||
T Consensus       107 GSDE  110 (176)
T PF12999_consen  107 GSDE  110 (176)
T ss_pred             CCCC
Confidence            9999


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.06  E-value=5.3e-06  Score=58.16  Aligned_cols=39  Identities=51%  Similarity=1.016  Sum_probs=32.7

Q ss_pred             cccccc-CCCCCC--ceeeecCCceEEecCCCeEECCCCCcc
Q psy14262        415 DADECL-IDGSCD--QTCTNTHLSYVCSCTEGYTLRPDGKSC  453 (482)
Q Consensus       415 di~eC~-~~~~Cs--~~C~n~~gsy~C~C~~Gy~l~~d~~tC  453 (482)
                      ||+||. ....|.  +.|+|+.|+|+|.|++||++...+.+|
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C   42 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTTC   42 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence            689998 457885  799999999999999999977666554


No 15 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=98.00  E-value=3.8e-06  Score=51.13  Aligned_cols=24  Identities=54%  Similarity=1.266  Sum_probs=22.5

Q ss_pred             ceEEecCCCeEECCCCCccccCCC
Q psy14262        435 SYVCSCTEGYTLRPDGKSCTAVNV  458 (482)
Q Consensus       435 sy~C~C~~Gy~l~~d~~tC~dide  458 (482)
                      ||+|+|++||+|.+++++|.||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            699999999999999999999986


No 16 
>KOG1219|consensus
Probab=97.38  E-value=0.00017  Score=85.21  Aligned_cols=102  Identities=23%  Similarity=0.651  Sum_probs=81.4

Q ss_pred             cccCcccccCCCCccCC---CCCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCccccc--ccccCCC
Q psy14262        289 QCFNGQVQCALSKKCIP---HNWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQCVAL--SKFCDGK  363 (482)
Q Consensus       289 ~C~~~~f~C~~g~~Ci~---~~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~Ci~~--~~~CDg~  363 (482)
                      .|..  ..|.+||+|++   ..|.|    .|+     .-....+   |+.....|.++  +|.+|++||+.  .++|   
T Consensus      3866 ~C~~--npCqhgG~C~~~~~ggy~C----kCp-----sqysG~~---CEi~~epC~sn--PC~~GgtCip~~n~f~C--- 3926 (4289)
T KOG1219|consen 3866 PCND--NPCQHGGTCISQPKGGYKC----KCP-----SQYSGNH---CEIDLEPCASN--PCLTGGTCIPFYNGFLC--- 3926 (4289)
T ss_pred             cccc--CcccCCCEecCCCCCceEE----eCc-----ccccCcc---cccccccccCC--CCCCCCEEEecCCCeeE---
Confidence            4554  35989999994   78999    898     5555566   76656678654  89999999986  8899   


Q ss_pred             CCCCCCCCCCCCCC---CCCCccCCCCcC--CCCCCCCCccccCCCCccCCCCCcc
Q psy14262        364 PDCSNGEDEGPFCN---NKADCENLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCV  414 (482)
Q Consensus       364 ~dC~dgsDE~~~C~---~~~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~  414 (482)
                       .|+.|.-+.. |.   ++ +|+.+.|.+  .|.|.+++|.|.|.+||  -+++|.
T Consensus      3927 -nC~~gyTG~~-Ce~~Gi~-eCs~n~C~~gg~C~n~~gsf~CncT~g~--~gr~c~ 3977 (4289)
T KOG1219|consen 3927 -NCPNGYTGKR-CEARGIS-ECSKNVCGTGGQCINIPGSFHCNCTPGI--LGRTCC 3977 (4289)
T ss_pred             -eCCCCccCce-eeccccc-ccccccccCCceeeccCCceEeccChhH--hcccCc
Confidence             9999998887 87   34 798888876  89999999999999999  555663


No 17 
>KOG1214|consensus
Probab=97.28  E-value=0.00049  Score=75.01  Aligned_cols=112  Identities=27%  Similarity=0.676  Sum_probs=76.1

Q ss_pred             CcccCCCCccccc---ccccCCCCCCCCCC--CCCCCCCCCCCcc--CCCCcC--CCCCCCCCccccCCCCc--cCCCCC
Q psy14262        344 ELRCGSSNQCVAL---SKFCDGKPDCSNGE--DEGPFCNNKADCE--NLKCSH--KCSPTPRGAMCFCPAGQ--NPNKTQ  412 (482)
Q Consensus       344 ~f~C~~g~~Ci~~---~~~CDg~~dC~dgs--DE~~~C~~~~~C~--~~~C~~--~C~~~~~~~~C~C~~G~--~~~~~~  412 (482)
                      ...|..+..|.+.   .|.|    .|.-|+  |... |...++|.  .+.|.+  .|++.+++|+|.|..||  ..++.+
T Consensus       699 sh~cdt~a~C~pg~~~~~tc----ecs~g~~gdgr~-c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~t  773 (1289)
T KOG1214|consen  699 SHMCDTTARCHPGTGVDYTC----ECSSGYQGDGRN-CVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHT  773 (1289)
T ss_pred             CcccCCCccccCCCCcceEE----EEeeccCCCCCC-CCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcc
Confidence            3344444445443   3444    555553  4445 76444777  566754  89999999999999999  667788


Q ss_pred             ccc------ccccc-CCCCC----CceeeecCC-ceEEecCCCeEECCCCCccccCCCCCCC
Q psy14262        413 CVD------ADECL-IDGSC----DQTCTNTHL-SYVCSCTEGYTLRPDGKSCTAVNVPQDE  462 (482)
Q Consensus       413 C~d------i~eC~-~~~~C----s~~C~n~~g-sy~C~C~~Gy~l~~d~~tC~didec~~~  462 (482)
                      |.-      ++.|. ..+.|    ...|+...| .|.|.|.+||.  .|+..|.++|||+..
T Consensus       774 CV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfs--GDG~~c~dvDeC~ps  833 (1289)
T KOG1214|consen  774 CVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFS--GDGHQCTDVDECSPS  833 (1289)
T ss_pred             eEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCcc--CCccccccccccCcc
Confidence            864      34565 33555    125666654 59999999998  579999999999854


No 18 
>KOG4260|consensus
Probab=97.17  E-value=0.00082  Score=64.77  Aligned_cols=46  Identities=37%  Similarity=0.733  Sum_probs=31.3

Q ss_pred             cCCCCccCCCCCcccccccc-CCCCC--CceeeecCCceEEecCCCeEE
Q psy14262        401 FCPAGQNPNKTQCVDADECL-IDGSC--DQTCTNTHLSYVCSCTEGYTL  446 (482)
Q Consensus       401 ~C~~G~~~~~~~C~di~eC~-~~~~C--s~~C~n~~gsy~C~C~~Gy~l  446 (482)
                      .|..||..+...|.||+||. .+.+|  .|+|+|+.|||+|.+.+||+.
T Consensus       221 kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  221 KCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             hhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence            46777744455677777776 44555  467777777777777777764


No 19 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.98  E-value=0.00018  Score=48.57  Aligned_cols=30  Identities=40%  Similarity=0.841  Sum_probs=24.8

Q ss_pred             CCCCcCCCCCCCCCccccCCCCc--cCCCCCc
Q psy14262        384 NLKCSHKCSPTPRGAMCFCPAGQ--NPNKTQC  413 (482)
Q Consensus       384 ~~~C~~~C~~~~~~~~C~C~~G~--~~~~~~C  413 (482)
                      +++|+|.|++++++|+|.|++||  .+++++|
T Consensus         5 NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    5 NGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             GGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred             CCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence            57899999999999999999999  6677765


No 20 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.78  E-value=0.00079  Score=41.03  Aligned_cols=22  Identities=41%  Similarity=0.939  Sum_probs=19.7

Q ss_pred             CccccCCCCc--cCCCCCcccccc
Q psy14262        397 GAMCFCPAGQ--NPNKTQCVDADE  418 (482)
Q Consensus       397 ~~~C~C~~G~--~~~~~~C~di~e  418 (482)
                      +|+|.|++||  .+++++|.||+|
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            5899999999  578999999986


No 21 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=96.20  E-value=0.0039  Score=59.97  Aligned_cols=44  Identities=43%  Similarity=0.872  Sum_probs=38.3

Q ss_pred             CCCCcccccccc-CCCCCCceeeecCCceEEecCCCeEECCCCCc
Q psy14262        409 NKTQCVDADECL-IDGSCDQTCTNTHLSYVCSCTEGYTLRPDGKS  452 (482)
Q Consensus       409 ~~~~C~di~eC~-~~~~Cs~~C~n~~gsy~C~C~~Gy~l~~d~~t  452 (482)
                      .+..|.++++|. .++.|.+.|.+++|+|.|.|++||.|..++++
T Consensus       180 ~~~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~~  224 (224)
T cd01475         180 QGKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT  224 (224)
T ss_pred             ccccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCCC
Confidence            567788899997 56789999999999999999999999877754


No 22 
>KOG2397|consensus
Probab=96.13  E-value=0.0055  Score=63.89  Aligned_cols=69  Identities=36%  Similarity=0.649  Sum_probs=60.2

Q ss_pred             CCeecCCCCceecCCcccCCCCCCCCCCCCCCCCCCCCCCCceecCCC----CeeccCCCCCCCcCCCCCCCCCCC
Q psy14262        168 NELRCGSSNQCVALSKFCDGKPDCSNGEDEGPFCNAECTPGQFSCGMG----KCIPESWKCDGFVDCLEDGSDEPN  239 (482)
Q Consensus       168 ~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE~~~C~~~C~~~~f~C~~g----~Ci~~~~~CDg~~dC~~dgsDE~~  239 (482)
                      ..|+|.++..=|+.+.+=|..-||.||+||..+  ..|+.+.|.|.+.    .=|+.+.+=||+.|| -|||||..
T Consensus        43 ~~~~CLdgs~~i~f~qlNDd~CDC~DGsDEPGt--sACpngkF~C~N~G~~p~~i~ssrV~DGICDC-CDgSDE~~  115 (480)
T KOG2397|consen   43 SMFKCLDGSKTISFSQLNDDSCDCLDGSDEPGT--SACPNGKFYCVNQGHQPKYIPSSRVNDGICDC-CDGSDEYL  115 (480)
T ss_pred             cceeeccCCcccCHHHhccccccCCCCCCCCcc--ccCCCCceeeeecCCCceeeechhccCccccc-ccCCCCcc
Confidence            479999988889988888999999999999762  4688999999764    678889999999999 99999974


No 23 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.94  E-value=0.009  Score=40.26  Aligned_cols=35  Identities=51%  Similarity=1.123  Sum_probs=25.7

Q ss_pred             cccccCCCCCCc--eeeecCCceEEecCCCeEECCCCCcc
Q psy14262        416 ADECLIDGSCDQ--TCTNTHLSYVCSCTEGYTLRPDGKSC  453 (482)
Q Consensus       416 i~eC~~~~~Cs~--~C~n~~gsy~C~C~~Gy~l~~d~~tC  453 (482)
                      +++|....+|.+  .|+++.++|.|.|++||.   +++.|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCc---cCCcC
Confidence            566752256754  899999999999999987   35554


No 24 
>KOG1217|consensus
Probab=95.88  E-value=0.21  Score=52.56  Aligned_cols=116  Identities=28%  Similarity=0.606  Sum_probs=79.8

Q ss_pred             ccccCCCcccCCCCccccc--ccccCCCCCCCCCCCCCCCC---CCCCCcc----CCCCcC--CC--CCCCCCccccCCC
Q psy14262        338 VLCHPNELRCGSSNQCVAL--SKFCDGKPDCSNGEDEGPFC---NNKADCE----NLKCSH--KC--SPTPRGAMCFCPA  404 (482)
Q Consensus       338 ~~C~~~~f~C~~g~~Ci~~--~~~CDg~~dC~dgsDE~~~C---~~~~~C~----~~~C~~--~C--~~~~~~~~C~C~~  404 (482)
                      ..|..... |.+++.|+..  .++|    .|+.|..... |   ....+|.    ...|..  .|  ......+.|.|..
T Consensus       272 ~~C~~~~~-c~~~~~C~~~~~~~~C----~C~~g~~g~~-~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~  345 (487)
T KOG1217|consen  272 DSCALIAS-CPNGGTCVNVPGSYRC----TCPPGFTGRL-CTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGP  345 (487)
T ss_pred             cccCCCCc-cCCCCeeecCCCccee----eCCCCCCCCC-CccccccccccccccCCcCCCCcccccCCCCCCCCcCCCC
Confidence            34555444 8777789887  3788    6998887766 4   1112553    333443  45  3344578899999


Q ss_pred             CccCCCCCccccc-cccCC-CCCCceeee-cCCceEEecCCCeEEC--CCCCccccCCCCCC
Q psy14262        405 GQNPNKTQCVDAD-ECLID-GSCDQTCTN-THLSYVCSCTEGYTLR--PDGKSCTAVNVPQD  461 (482)
Q Consensus       405 G~~~~~~~C~di~-eC~~~-~~Cs~~C~n-~~gsy~C~C~~Gy~l~--~d~~tC~didec~~  461 (482)
                      |+  .+..|+... +|... ......|++ +.++|.|.|+.+|.+.  .+...+.++++|..
T Consensus       346 ~~--~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~  405 (487)
T KOG1217|consen  346 GF--TGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG  405 (487)
T ss_pred             CC--CCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC
Confidence            96  778898774 88732 223678999 7999999999999863  57788888888753


No 25 
>KOG4289|consensus
Probab=95.40  E-value=0.02  Score=66.15  Aligned_cols=88  Identities=25%  Similarity=0.643  Sum_probs=65.5

Q ss_pred             ccccCCCCCCCCCCCCCCCCC-CCCCccCCCCcC--CCCCCCCCccccCCCCccCCCCCcccc---ccccCCCCC--Cce
Q psy14262        357 SKFCDGKPDCSNGEDEGPFCN-NKADCENLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCVDA---DECLIDGSC--DQT  428 (482)
Q Consensus       357 ~~~CDg~~dC~dgsDE~~~C~-~~~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~di---~eC~~~~~C--s~~  428 (482)
                      .++|    .|+.|..... |. ..++|-+++|.+  +|....++|+|.|.+||  .+..|+-.   -.|. ++.|  ..+
T Consensus      1221 glrC----rCPpGFTgd~-CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~--tGehCEvs~~agrCv-pGvC~nggt 1292 (2531)
T KOG4289|consen 1221 GLRC----RCPPGFTGDY-CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGF--TGEHCEVSARAGRCV-PGVCKNGGT 1292 (2531)
T ss_pred             ceeE----eCCCCCCccc-ccchhHhhhcCCCCCCCceEEecCceeEEecCCc--cccceeeecccCccc-cceecCCCE
Confidence            4677    8999987776 87 223788889976  89988999999999999  77778532   2243 6788  468


Q ss_pred             eeecC-CceEEecCCCeEECCCCCcccc
Q psy14262        429 CTNTH-LSYVCSCTEGYTLRPDGKSCTA  455 (482)
Q Consensus       429 C~n~~-gsy~C~C~~Gy~l~~d~~tC~d  455 (482)
                      |++.+ |+|.|.|+.| .  .....|+-
T Consensus      1293 C~~~~nggf~c~Cp~g-e--~e~prC~v 1317 (2531)
T KOG4289|consen 1293 CVNLLNGGFCCHCPYG-E--FEDPRCEV 1317 (2531)
T ss_pred             EeecCCCceeccCCCc-c--cCCCceEE
Confidence            98875 7799999999 3  23555653


No 26 
>KOG1217|consensus
Probab=95.27  E-value=0.15  Score=53.78  Aligned_cols=150  Identities=26%  Similarity=0.599  Sum_probs=86.8

Q ss_pred             CCCCCCCCCC--ccccCcccccCCCCccCC--CCCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCcc
Q psy14262        278 DNTDEDWDKF--HQCFNGQVQCALSKKCIP--HNWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQC  353 (482)
Q Consensus       278 dgsDE~~C~~--~~C~~~~f~C~~g~~Ci~--~~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~C  353 (482)
                      .+.....+..  ..|......|.+++.|..  ..|.|    .|.     .+.-...   |...           .++..|
T Consensus       158 ~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C----~c~-----~~~~~~~---~~~~-----------~~~~~c  214 (487)
T KOG1217|consen  158 EGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLC----SCP-----PGYTGST---CETT-----------GNGGTC  214 (487)
T ss_pred             CCcccccccccccccccCCCCcCCCcccccCCCCeeE----eCC-----CCccCCc---CcCC-----------CCCceE
Confidence            5555544442  356655556776667773  44667    566     4433333   3211           222344


Q ss_pred             cccccccCCCCCCCCCCCCCCCCCCC-CCccCCCCcCCCCCCCCCccccCCCCccCCC-CCccccccccCCCCC--Ccee
Q psy14262        354 VALSKFCDGKPDCSNGEDEGPFCNNK-ADCENLKCSHKCSPTPRGAMCFCPAGQNPNK-TQCVDADECLIDGSC--DQTC  429 (482)
Q Consensus       354 i~~~~~CDg~~dC~dgsDE~~~C~~~-~~C~~~~C~~~C~~~~~~~~C~C~~G~~~~~-~~C~di~eC~~~~~C--s~~C  429 (482)
                      +.. +.|    .+..+..... |... .+|....  .+|.+..++|.|.|++||.... ..|.++++|.....|  .++|
T Consensus       215 ~~~-~~~----~~~~g~~~~~-c~~~~~~~~~~~--~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C  286 (487)
T KOG1217|consen  215 VDS-VAC----SCPPGARGPE-CEVSIVECASGD--GTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTC  286 (487)
T ss_pred             ecc-eec----cCCCCCCCCC-cccccccccCCC--CcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCee
Confidence            443 222    3444443322 3321 1232221  4788888899999999994344 467889999843236  4699


Q ss_pred             eecCCceEEecCCCeEECCCCCccccCCCC
Q psy14262        430 TNTHLSYVCSCTEGYTLRPDGKSCTAVNVP  459 (482)
Q Consensus       430 ~n~~gsy~C~C~~Gy~l~~d~~tC~didec  459 (482)
                      ++..+.|.|.|++||..... ..+.++.+|
T Consensus       287 ~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C  315 (487)
T KOG1217|consen  287 VNVPGSYRCTCPPGFTGRLC-TECVDVDEC  315 (487)
T ss_pred             ecCCCcceeeCCCCCCCCCC-ccccccccc
Confidence            99999999999999986544 334455555


No 27 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.01  E-value=0.0073  Score=42.16  Aligned_cols=26  Identities=31%  Similarity=0.810  Sum_probs=21.5

Q ss_pred             Ccc--CCCCc--CCCCCCCCCccccCCCCc
Q psy14262        381 DCE--NLKCS--HKCSPTPRGAMCFCPAGQ  406 (482)
Q Consensus       381 ~C~--~~~C~--~~C~~~~~~~~C~C~~G~  406 (482)
                      ||.  .+.|.  ..|+|+.|+|.|.|++||
T Consensus         4 EC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy   33 (42)
T PF07645_consen    4 ECAEGPHNCPENGTCVNTEGSYSCSCPPGY   33 (42)
T ss_dssp             TTTTTSSSSSTTSEEEEETTEEEEEESTTE
T ss_pred             ccCCCCCcCCCCCEEEcCCCCEEeeCCCCc
Confidence            665  34565  489999999999999999


No 28 
>KOG4260|consensus
Probab=94.84  E-value=0.06  Score=52.22  Aligned_cols=76  Identities=28%  Similarity=0.621  Sum_probs=56.5

Q ss_pred             CCCCCC--CCCCCCCCCCCcc--CCCCcC--CCCCCCCCccccCCCCccCCCCCcccccccc-CCCCC---CceeeecCC
Q psy14262        365 DCSNGE--DEGPFCNNKADCE--NLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCVDADECL-IDGSC---DQTCTNTHL  434 (482)
Q Consensus       365 dC~dgs--DE~~~C~~~~~C~--~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~di~eC~-~~~~C---s~~C~n~~g  434 (482)
                      .|..|.  ||.. |...+||.  ..+|.+  .|+|+.++|.|.+.+||..      ++++|. ....|   ...|+|+.+
T Consensus       221 kCkkGW~lde~g-CvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~------g~d~C~~~~d~~~~kn~~c~ni~~  293 (350)
T KOG4260|consen  221 KCKKGWKLDEEG-CVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK------GVDECQFCADVCASKNRPCMNIDG  293 (350)
T ss_pred             hhcccceecccc-cccHHHHhcCCCCCChhheeecCCCceEecccccccC------ChHHhhhhhhhcccCCCCcccCCc
Confidence            455563  7777 88545897  566753  8999999999999999932      256665 22444   357999999


Q ss_pred             ceEEecCCCeEEC
Q psy14262        435 SYVCSCTEGYTLR  447 (482)
Q Consensus       435 sy~C~C~~Gy~l~  447 (482)
                      +|+|.|+.|+...
T Consensus       294 ~~r~v~f~~~~~~  306 (350)
T KOG4260|consen  294 QYRCVCFSGLIII  306 (350)
T ss_pred             cEEEEecccceee
Confidence            9999999998643


No 29 
>KOG3509|consensus
Probab=94.30  E-value=0.17  Score=57.75  Aligned_cols=104  Identities=29%  Similarity=0.628  Sum_probs=86.0

Q ss_pred             ceecCCCccCccCCCCCCCCCCCCCCCCCC--CccccCcccccCCCCccCCCCCCCCCCCCCCCCCcCCCCCCCcCCcCc
Q psy14262        257 KCIPHNWLCDEEMDCSTDDIVDNTDEDWDK--FHQCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDKYFS  334 (482)
Q Consensus       257 ~Ci~~~~~CDg~~dC~~~~~~dgsDE~~C~--~~~C~~~~f~C~~g~~Ci~~~~~CDg~~DC~~g~~~dgsDE~~~~~C~  334 (482)
                      .|....+.|++..++.     +.+|+.+..  ...+.+++|.|.++ ++.-..+.+|....+.     +++.+.+   |.
T Consensus         2 ~c~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~p~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~---~~   67 (964)
T KOG3509|consen    2 ECVKNRYACDRQPDCR-----DRSDVANDPAIGSACSPNEFKCNNP-RCVQPEALLDADSTCG-----PNSTPSG---CN   67 (964)
T ss_pred             chhhhhhhhccchhhH-----hhcccCCCccccccCCcchhccCCc-cccCchhhhccccccC-----CCCCcCC---cc
Confidence            4666778899999998     999997765  34688899999887 9999999999999999     7776665   43


Q ss_pred             c--CcccccCCCcccCCCCcccccccccCCCCCCCCCCCCCC
Q psy14262        335 H--NAVLCHPNELRCGSSNQCVALSKFCDGKPDCSNGEDEGP  374 (482)
Q Consensus       335 ~--~~~~C~~~~f~C~~g~~Ci~~~~~CDg~~dC~dgsDE~~  374 (482)
                      .  .+..|.+..+.|.+-.++...+..|+|..+|.++++|..
T Consensus        68 ~~~~~s~~~~~~~~c~~~~~~~~~~~~~~g~~~~~~~~~~~~  109 (964)
T KOG3509|consen   68 AKPSASDCKPTETQCRDRLRCNPQSFQCDGTNDCKDGSDEVG  109 (964)
T ss_pred             ccccccccCCcccccccchhcCCccccccCCCCCCccchhcc
Confidence            2  245677888999887678888999999999999999977


No 30 
>KOG2397|consensus
Probab=93.84  E-value=0.056  Score=56.56  Aligned_cols=68  Identities=43%  Similarity=0.638  Sum_probs=56.2

Q ss_pred             ceecCCC-CeeccCCCCCCCcCCCCCCCCCCCcccccccCCccccCCCC---ceecCCCccCccCCCCCCCCCCCCCCCC
Q psy14262        209 QFSCGMG-KCIPESWKCDGFVDCLEDGSDEPNTCIHQCFNGQVQCALSK---KCIPHNWLCDEEMDCSTDDIVDNTDEDW  284 (482)
Q Consensus       209 ~f~C~~g-~Ci~~~~~CDg~~dC~~dgsDE~~C~~~~C~~~~f~C~~~g---~Ci~~~~~CDg~~dC~~~~~~dgsDE~~  284 (482)
                      .|.|.+| .=|+.+++-|...|| .||+||..=  ..|..+.|+|.|-|   .=|+.+.+=||+.||-     |||||..
T Consensus        44 ~~~CLdgs~~i~f~qlNDd~CDC-~DGsDEPGt--sACpngkF~C~N~G~~p~~i~ssrV~DGICDCC-----DgSDE~~  115 (480)
T KOG2397|consen   44 MFKCLDGSKTISFSQLNDDSCDC-LDGSDEPGT--SACPNGKFYCVNQGHQPKYIPSSRVNDGICDCC-----DGSDEYL  115 (480)
T ss_pred             ceeeccCCcccCHHHhccccccC-CCCCCCCcc--ccCCCCceeeeecCCCceeeechhccCcccccc-----cCCCCcc
Confidence            5788877 678888899999999 999999752  35888999998543   4688888999999999     9999953


No 31 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=93.42  E-value=0.099  Score=34.51  Aligned_cols=30  Identities=53%  Similarity=1.011  Sum_probs=22.6

Q ss_pred             cccccCCCCC--CceeeecCCceEEecCCCeE
Q psy14262        416 ADECLIDGSC--DQTCTNTHLSYVCSCTEGYT  445 (482)
Q Consensus       416 i~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~  445 (482)
                      +++|....+|  .+.|+++.++|+|.|++||.
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~   33 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYT   33 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence            4556522467  46899999999999999987


No 32 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=93.42  E-value=0.094  Score=34.45  Aligned_cols=25  Identities=56%  Similarity=1.077  Sum_probs=20.5

Q ss_pred             CCCCc-eeeecCCceEEecCCCeEEC
Q psy14262        423 GSCDQ-TCTNTHLSYVCSCTEGYTLR  447 (482)
Q Consensus       423 ~~Cs~-~C~n~~gsy~C~C~~Gy~l~  447 (482)
                      .+|.+ +|+++.++|.|.|++||.+.
T Consensus         6 ~~C~~~~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNGTCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCCCEEECCCCCeEeECCCCCccC
Confidence            46755 89999999999999999753


No 33 
>KOG4289|consensus
Probab=92.95  E-value=0.087  Score=61.19  Aligned_cols=49  Identities=33%  Similarity=0.880  Sum_probs=40.4

Q ss_pred             CCCCccccCCCCccCCCCCcc-ccccccCCCCC--CceeeecCCceEEecCCCeE
Q psy14262        394 TPRGAMCFCPAGQNPNKTQCV-DADECLIDGSC--DQTCTNTHLSYVCSCTEGYT  445 (482)
Q Consensus       394 ~~~~~~C~C~~G~~~~~~~C~-di~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~  445 (482)
                      ...+++|.||+||  .+..|+ .+++|- .++|  ...|....|+|.|.|.+||+
T Consensus      1218 pvnglrCrCPpGF--Tgd~CeTeiDlCY-s~pC~nng~C~srEggYtCeCrpg~t 1269 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGF--TGDYCETEIDLCY-SGPCGNNGRCRSREGGYTCECRPGFT 1269 (2531)
T ss_pred             ccCceeEeCCCCC--CcccccchhHhhh-cCCCCCCCceEEecCceeEEecCCcc
Confidence            4568999999999  445674 578886 5788  46899999999999999997


No 34 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=92.94  E-value=0.099  Score=35.24  Aligned_cols=30  Identities=53%  Similarity=1.114  Sum_probs=20.8

Q ss_pred             CCCC--CceeeecCCceEEecCCCeEECCCCCcc
Q psy14262        422 DGSC--DQTCTNTHLSYVCSCTEGYTLRPDGKSC  453 (482)
Q Consensus       422 ~~~C--s~~C~n~~gsy~C~C~~Gy~l~~d~~tC  453 (482)
                      ++.|  ...|++++++|.|+|.+||..  +|..|
T Consensus         5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~G--dG~~C   36 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTCTCKPGYEG--DGFFC   36 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEEEE-CEEEC--CSTCE
T ss_pred             CCCCCCCcEeecCCCCEEeECCCCCcc--CCcCC
Confidence            3556  358999999999999999984  35543


No 35 
>KOG3509|consensus
Probab=92.91  E-value=0.098  Score=59.68  Aligned_cols=103  Identities=28%  Similarity=0.593  Sum_probs=87.2

Q ss_pred             ceecCCcccCCCCCCCCCCCCCCCCC---CCCCCCceecCCCCeeccCCCCCCCcCCCCCCCCCCCcc----cccccCCc
Q psy14262        177 QCVALSKFCDGKPDCSNGEDEGPFCN---AECTPGQFSCGMGKCIPESWKCDGFVDCLEDGSDEPNTC----IHQCFNGQ  249 (482)
Q Consensus       177 ~Ci~~~~~CDg~~dC~dgsDE~~~C~---~~C~~~~f~C~~g~Ci~~~~~CDg~~dC~~dgsDE~~C~----~~~C~~~~  249 (482)
                      .|....+.|++..|+.+.+|+.+ .+   ..+++++|.|.++++.-..+.+|....+ ..++++.+|.    +..|.+..
T Consensus         2 ~c~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~s~~~~~~   79 (964)
T KOG3509|consen    2 ECVKNRYACDRQPDCRDRSDVAN-DPAIGSACSPNEFKCNNPRCVQPEALLDADSTC-GPNSTPSGCNAKPSASDCKPTE   79 (964)
T ss_pred             chhhhhhhhccchhhHhhcccCC-CccccccCCcchhccCCccccCchhhhcccccc-CCCCCcCCccccccccccCCcc
Confidence            57778899999999999999987 54   6788999999999999999999999999 8888877763    23467778


Q ss_pred             cccCCCCceecCCCccCccCCCCCCCCCCCCCCCCCC
Q psy14262        250 VQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDK  286 (482)
Q Consensus       250 f~C~~~g~Ci~~~~~CDg~~dC~~~~~~dgsDE~~C~  286 (482)
                      +.|.+--++.+.+..|+|.++|.     ++++|..+.
T Consensus        80 ~~c~~~~~~~~~~~~~~g~~~~~-----~~~~~~~~~  111 (964)
T KOG3509|consen   80 TQCRDRLRCNPQSFQCDGTNDCK-----DGSDEVGCK  111 (964)
T ss_pred             cccccchhcCCccccccCCCCCC-----ccchhcccc
Confidence            88984447888999999999999     999886553


No 36 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=91.45  E-value=0.16  Score=33.89  Aligned_cols=32  Identities=28%  Similarity=0.789  Sum_probs=23.9

Q ss_pred             CccC-CCCcC--CCCCCCCCccccCCCCccCCCCCc
Q psy14262        381 DCEN-LKCSH--KCSPTPRGAMCFCPAGQNPNKTQC  413 (482)
Q Consensus       381 ~C~~-~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C  413 (482)
                      +|.. .+|.+  +|+++.++|.|.|++||. .+..|
T Consensus         4 ~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~-~g~~C   38 (39)
T smart00179        4 ECASGNPCQNGGTCVNTVGSYRCECPPGYT-DGRNC   38 (39)
T ss_pred             cCcCCCCcCCCCEeECCCCCeEeECCCCCc-cCCcC
Confidence            5554 56665  799999999999999994 35544


No 37 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=89.25  E-value=0.46  Score=30.59  Aligned_cols=24  Identities=50%  Similarity=0.953  Sum_probs=19.0

Q ss_pred             CCCC--ceeeecCCceEEecCCCeEE
Q psy14262        423 GSCD--QTCTNTHLSYVCSCTEGYTL  446 (482)
Q Consensus       423 ~~Cs--~~C~n~~gsy~C~C~~Gy~l  446 (482)
                      ..|.  ++|+++.++|+|.|+.||..
T Consensus         6 ~~C~~~~~C~~~~~~~~C~C~~g~~g   31 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCVCPPGYTG   31 (36)
T ss_pred             CCCCCCCEEecCCCCeEeECCCCCcc
Confidence            4553  68888888999999999874


No 38 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=89.23  E-value=0.087  Score=35.51  Aligned_cols=25  Identities=24%  Similarity=0.553  Sum_probs=18.6

Q ss_pred             CCCCCCCCCccccCCCCccCCCCCc
Q psy14262        389 HKCSPTPRGAMCFCPAGQNPNKTQC  413 (482)
Q Consensus       389 ~~C~~~~~~~~C~C~~G~~~~~~~C  413 (482)
                      +.|.+++++|.|.|++||..++..|
T Consensus        12 A~C~~~~~~~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen   12 ATCTNTGGSYTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             CEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred             cEeecCCCCEEeECCCCCccCCcCC
Confidence            3788889999999999996666554


No 39 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=88.69  E-value=0.34  Score=31.71  Aligned_cols=22  Identities=41%  Similarity=0.886  Sum_probs=18.9

Q ss_pred             CCCcC-CCCCCCCCccccCCCCc
Q psy14262        385 LKCSH-KCSPTPRGAMCFCPAGQ  406 (482)
Q Consensus       385 ~~C~~-~C~~~~~~~~C~C~~G~  406 (482)
                      .+|.+ +|+++.++|.|.|++||
T Consensus         6 ~~C~~~~C~~~~~~~~C~C~~g~   28 (35)
T smart00181        6 GPCSNGTCINTPGSYTCSCPPGY   28 (35)
T ss_pred             CCCCCCEEECCCCCeEeECCCCC
Confidence            45666 79888999999999999


No 40 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=84.59  E-value=1  Score=29.29  Aligned_cols=23  Identities=39%  Similarity=0.998  Sum_probs=19.2

Q ss_pred             CCCC--ceeeecC-CceEEecCCCeE
Q psy14262        423 GSCD--QTCTNTH-LSYVCSCTEGYT  445 (482)
Q Consensus       423 ~~Cs--~~C~n~~-gsy~C~C~~Gy~  445 (482)
                      .+|.  .+|++.. ++|.|.|++||+
T Consensus         4 ~~C~n~g~C~~~~~~~y~C~C~~G~~   29 (32)
T PF00008_consen    4 NPCQNGGTCIDLPGGGYTCECPPGYT   29 (32)
T ss_dssp             TSSTTTEEEEEESTSEEEEEEBTTEE
T ss_pred             CcCCCCeEEEeCCCCCEEeECCCCCc
Confidence            3664  4899998 899999999997


No 41 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=84.16  E-value=0.23  Score=32.44  Aligned_cols=23  Identities=35%  Similarity=0.873  Sum_probs=18.4

Q ss_pred             CCCCcC--CCCCCC-CCccccCCCCc
Q psy14262        384 NLKCSH--KCSPTP-RGAMCFCPAGQ  406 (482)
Q Consensus       384 ~~~C~~--~C~~~~-~~~~C~C~~G~  406 (482)
                      ..+|++  +|++.. .+|+|.|++||
T Consensus         3 ~~~C~n~g~C~~~~~~~y~C~C~~G~   28 (32)
T PF00008_consen    3 SNPCQNGGTCIDLPGGGYTCECPPGY   28 (32)
T ss_dssp             TTSSTTTEEEEEESTSEEEEEEBTTE
T ss_pred             CCcCCCCeEEEeCCCCCEEeECCCCC
Confidence            345655  788777 89999999999


No 42 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=81.98  E-value=1.2  Score=29.10  Aligned_cols=26  Identities=31%  Similarity=0.778  Sum_probs=19.8

Q ss_pred             CccC-CCCc--CCCCCCCCCccccCCCCc
Q psy14262        381 DCEN-LKCS--HKCSPTPRGAMCFCPAGQ  406 (482)
Q Consensus       381 ~C~~-~~C~--~~C~~~~~~~~C~C~~G~  406 (482)
                      +|.. .+|.  ..|.+..++|.|.|++||
T Consensus         4 ~C~~~~~C~~~~~C~~~~~~~~C~C~~g~   32 (38)
T cd00054           4 ECASGNPCQNGGTCVNTVGSYRCSCPPGY   32 (38)
T ss_pred             cCCCCCCcCCCCEeECCCCCeEeECCCCC
Confidence            4443 4564  378888999999999999


No 43 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=81.26  E-value=1.3  Score=28.41  Aligned_cols=18  Identities=39%  Similarity=0.922  Sum_probs=16.0

Q ss_pred             CCCCCCCCCccccCCCCc
Q psy14262        389 HKCSPTPRGAMCFCPAGQ  406 (482)
Q Consensus       389 ~~C~~~~~~~~C~C~~G~  406 (482)
                      .+|++..++|.|.|+.||
T Consensus        12 ~~C~~~~~~~~C~C~~g~   29 (36)
T cd00053          12 GTCVNTPGSYRCVCPPGY   29 (36)
T ss_pred             CEEecCCCCeEeECCCCC
Confidence            478888889999999999


No 44 
>KOG1225|consensus
Probab=80.21  E-value=7  Score=42.31  Aligned_cols=72  Identities=25%  Similarity=0.495  Sum_probs=38.4

Q ss_pred             eecCCCCeeccCCCCCCCcCCCCCCCCCCCcccccccCCccccCCCCceecCCCccCccCCCCCCCCCCCCCCCCCCCcc
Q psy14262        210 FSCGMGKCIPESWKCDGFVDCLEDGSDEPNTCIHQCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDKFHQ  289 (482)
Q Consensus       210 f~C~~g~Ci~~~~~CDg~~dC~~dgsDE~~C~~~~C~~~~f~C~~~g~Ci~~~~~CDg~~dC~~~~~~dgsDE~~C~~~~  289 (482)
                      ++|..|+||           | ..|+...+|....|...   |.+++.+++      |+-.|.     ++.-...|....
T Consensus       260 g~c~~G~CI-----------C-~~Gf~G~dC~e~~Cp~~---cs~~g~~~~------g~CiC~-----~g~~G~dCs~~~  313 (525)
T KOG1225|consen  260 GQCVEGRCI-----------C-PPGFTGDDCDELVCPVD---CSGGGVCVD------GECICN-----PGYSGKDCSIRR  313 (525)
T ss_pred             ceEeCCeEe-----------C-CCCCcCCCCCcccCCcc---cCCCceecC------CEeecC-----CCcccccccccc
Confidence            566666666           3 55555555543334332   332344443      223376     666666676555


Q ss_pred             ccCcccccCCCCccCCCCCCC
Q psy14262        290 CFNGQVQCALSKKCIPHNWLC  310 (482)
Q Consensus       290 C~~~~f~C~~g~~Ci~~~~~C  310 (482)
                      |..   .|..+++||+..-+|
T Consensus       314 cpa---dC~g~G~Ci~G~C~C  331 (525)
T KOG1225|consen  314 CPA---DCSGHGKCIDGECLC  331 (525)
T ss_pred             CCc---cCCCCCcccCCceEe
Confidence            653   477777888544333


No 45 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=78.70  E-value=1.4  Score=42.20  Aligned_cols=23  Identities=30%  Similarity=0.788  Sum_probs=20.4

Q ss_pred             CCCCcCCCCCCCCCccccCCCCc
Q psy14262        384 NLKCSHKCSPTPRGAMCFCPAGQ  406 (482)
Q Consensus       384 ~~~C~~~C~~~~~~~~C~C~~G~  406 (482)
                      ++.|.+.|.+++++|.|.|++||
T Consensus       194 ~~~c~~~C~~~~g~~~c~c~~g~  216 (224)
T cd01475         194 SHVCQQVCISTPGSYLCACTEGY  216 (224)
T ss_pred             CCCccceEEcCCCCEEeECCCCc
Confidence            44677889999999999999999


No 46 
>KOG1225|consensus
Probab=78.32  E-value=9.8  Score=41.22  Aligned_cols=62  Identities=29%  Similarity=0.790  Sum_probs=35.1

Q ss_pred             CCCCCCCCCCCCCCCCCccCCCCcC--CCCCCCCCccccCCCCccCCCCCccccccccCCCCCCceeeecCCceEEecCC
Q psy14262        365 DCSNGEDEGPFCNNKADCENLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCVDADECLIDGSCDQTCTNTHLSYVCSCTE  442 (482)
Q Consensus       365 dC~dgsDE~~~C~~~~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~di~eC~~~~~Cs~~C~n~~gsy~C~C~~  442 (482)
                      .|.+|+-... |++- .|. ..|..  +|+    .-+|.|.+||  .+..|.... |...    +.|+|  |   |.|.+
T Consensus       299 iC~~g~~G~d-Cs~~-~cp-adC~g~G~Ci----~G~C~C~~Gy--~G~~C~~~~-C~~~----g~cv~--g---C~C~~  359 (525)
T KOG1225|consen  299 ICNPGYSGKD-CSIR-RCP-ADCSGHGKCI----DGECLCDEGY--TGELCIQRA-CSGG----GQCVN--G---CKCKK  359 (525)
T ss_pred             ecCCCccccc-cccc-cCC-ccCCCCCccc----CCceEeCCCC--cCCcccccc-cCCC----ceecc--C---ceecc
Confidence            6777776655 6644 332 22322  444    3478899998  566665432 4322    24544  2   77778


Q ss_pred             CeE
Q psy14262        443 GYT  445 (482)
Q Consensus       443 Gy~  445 (482)
                      ||+
T Consensus       360 Gw~  362 (525)
T KOG1225|consen  360 GWR  362 (525)
T ss_pred             Ccc
Confidence            876


No 47 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=68.33  E-value=2.9  Score=38.79  Aligned_cols=57  Identities=28%  Similarity=0.680  Sum_probs=39.1

Q ss_pred             CCCCCCccccCCCCc-cCCCCCcccccccc----CCCCCC--ceeeecC-----CceEEecCCCeEECC
Q psy14262        392 SPTPRGAMCFCPAGQ-NPNKTQCVDADECL----IDGSCD--QTCTNTH-----LSYVCSCTEGYTLRP  448 (482)
Q Consensus       392 ~~~~~~~~C~C~~G~-~~~~~~C~di~eC~----~~~~Cs--~~C~n~~-----gsy~C~C~~Gy~l~~  448 (482)
                      ..+...|.|.|.+|| .....+|+...+|.    ...+|.  ..|++..     ..|+|.|.+||.|..
T Consensus        14 iQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~   82 (197)
T PF06247_consen   14 IQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ   82 (197)
T ss_dssp             EEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS
T ss_pred             EEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC
Confidence            345667999999999 55677898888886    246673  5788876     469999999999874


No 48 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=60.64  E-value=8.9  Score=25.41  Aligned_cols=24  Identities=33%  Similarity=0.798  Sum_probs=14.8

Q ss_pred             CCCceeeecCCceEEecCCCeEECC
Q psy14262        424 SCDQTCTNTHLSYVCSCTEGYTLRP  448 (482)
Q Consensus       424 ~Cs~~C~n~~gsy~C~C~~Gy~l~~  448 (482)
                      .|...|.... .++|.|++||.|..
T Consensus         7 ~CpA~CDpn~-~~~C~CPeGyIlde   30 (34)
T PF09064_consen    7 ECPADCDPNS-PGQCFCPEGYILDE   30 (34)
T ss_pred             cCCCccCCCC-CCceeCCCceEecC
Confidence            3444554322 23788999998864


No 49 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=58.97  E-value=7.8  Score=20.00  Aligned_cols=9  Identities=44%  Similarity=1.331  Sum_probs=5.5

Q ss_pred             EEecCCCeE
Q psy14262        437 VCSCTEGYT  445 (482)
Q Consensus       437 ~C~C~~Gy~  445 (482)
                      .|.|++||+
T Consensus         1 ~C~C~~G~~    9 (13)
T PF12661_consen    1 TCQCPPGWT    9 (13)
T ss_dssp             EEEE-TTEE
T ss_pred             CccCcCCCc
Confidence            467777776


No 50 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=30.98  E-value=48  Score=22.52  Aligned_cols=22  Identities=27%  Similarity=0.475  Sum_probs=17.4

Q ss_pred             CceeeecC-CceEEecCCCeEEC
Q psy14262        426 DQTCTNTH-LSYVCSCTEGYTLR  447 (482)
Q Consensus       426 s~~C~n~~-gsy~C~C~~Gy~l~  447 (482)
                      ...|++.. |++.|.|..||+..
T Consensus        10 NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen   10 NAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             TEEEEEETTSEEEEEE-TTEEEE
T ss_pred             CcccEEcCCCCEEEEeeCCcccc
Confidence            34788887 89999999999875


No 51 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=27.08  E-value=13  Score=34.64  Aligned_cols=98  Identities=29%  Similarity=0.743  Sum_probs=50.3

Q ss_pred             ccCCCCccccc-------ccccCCCCCCCCCCCCCC-CCCCCCCccCCCCcC-CCCCC---CCCccccCCCCc-cCCCCC
Q psy14262        346 RCGSSNQCVAL-------SKFCDGKPDCSNGEDEGP-FCNNKADCENLKCSH-KCSPT---PRGAMCFCPAGQ-NPNKTQ  412 (482)
Q Consensus       346 ~C~~g~~Ci~~-------~~~CDg~~dC~dgsDE~~-~C~~~~~C~~~~C~~-~C~~~---~~~~~C~C~~G~-~~~~~~  412 (482)
                      +|++.++|+..       .+.|    +|..|+.-.. .|... .|.+-.|.. .|+-.   +....|+|.-|+ ..+...
T Consensus        51 ~Cgdya~C~~~~~~~~~~~~~C----~C~~gY~~~~~vCvp~-~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~k  125 (197)
T PF06247_consen   51 PCGDYAKCINQANKGEERAYKC----DCINGYILKQGVCVPN-KCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKK  125 (197)
T ss_dssp             EEETTEEEEE-SSTTSSTSEEE----EE-TTEEESSSSEEEG-GGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTE
T ss_pred             cccchhhhhcCCCcccceeEEE----ecccCceeeCCeEchh-hcCceecCCCeEEecCCCCCCceeEeeeceEeccCCc
Confidence            45554466643       4566    6777642211 14433 444333332 44322   234589999999 466777


Q ss_pred             ccc--cccccCCCCC--CceeeecCCceEEecCCCeEECCCC
Q psy14262        413 CVD--ADECLIDGSC--DQTCTNTHLSYVCSCTEGYTLRPDG  450 (482)
Q Consensus       413 C~d--i~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~l~~d~  450 (482)
                      |..  ..+|+  -.|  ...|..+.+-|+|.+..|+.+...+
T Consensus       126 Ctk~G~T~C~--LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~~  165 (197)
T PF06247_consen  126 CTKTGETKCS--LKCKENEECKLVDGYYKCVCKEGFPGDGEG  165 (197)
T ss_dssp             SEEEE----------TTTEEEEEETTEEEEEE-TT-EEETTT
T ss_pred             ccCCCcccee--eecCCCcceeeeCcEEEeecCCCCCCCCCc
Confidence            743  34454  344  4589999999999999999876544


Done!