Query         psy9822
Match_columns 380
No_of_seqs    312 out of 1858
Neff          6.9 
Searched_HMMs 46136
Date          Fri Aug 16 20:35:25 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9822.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9822hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0994|consensus               99.7 4.7E-17   1E-21  173.2   9.2  173  174-379   848-1063(1758)
  2 KOG0994|consensus               99.6   4E-16 8.8E-21  166.2   9.8  158  192-379   774-960 (1758)
  3 KOG1388|consensus               99.3 1.4E-12   3E-17  118.5   4.5  127  113-369     1-128 (217)
  4 KOG3512|consensus               99.2 2.8E-11   6E-16  120.4   5.6  102  259-379   277-386 (592)
  5 KOG1836|consensus               99.1 6.2E-10 1.4E-14  127.5  11.3  160  190-379   734-936 (1705)
  6 KOG3512|consensus               99.0 4.3E-10 9.3E-15  112.1   6.6   78  274-378   358-439 (592)
  7 KOG1836|consensus               98.8 4.6E-09   1E-13  120.5   7.5  108  244-380   696-825 (1705)
  8 KOG1225|consensus               98.6 7.7E-08 1.7E-12   99.5   8.3   74  207-290   232-310 (525)
  9 KOG1225|consensus               98.6 1.3E-07 2.9E-12   97.7   8.7   99  174-291   239-342 (525)
 10 KOG1214|consensus               98.2 5.4E-06 1.2E-10   88.0   8.8  107  179-287   728-859 (1289)
 11 KOG4289|consensus               98.1   7E-06 1.5E-10   90.9   7.5  107  226-373  1715-1843(2531)
 12 cd00055 EGF_Lam Laminin-type e  97.9 1.2E-05 2.7E-10   57.5   3.9   28  350-378    17-44  (50)
 13 KOG1219|consensus               97.9 1.1E-05 2.5E-10   92.4   5.5  101  186-294  3865-3981(4289)
 14 PF07645 EGF_CA:  Calcium-bindi  97.9 6.5E-06 1.4E-10   56.7   2.1   36  184-220     1-36  (42)
 15 PF00053 Laminin_EGF:  Laminin   97.9 4.3E-06 9.3E-11   59.5   0.9   29  350-379    16-44  (49)
 16 KOG4289|consensus               97.7 2.3E-05   5E-10   87.1   3.9   82  205-290  1218-1316(2531)
 17 KOG1219|consensus               97.7 3.6E-05 7.9E-10   88.6   5.5  102  231-371  3869-3981(4289)
 18 KOG1214|consensus               97.6 9.9E-05 2.1E-09   78.7   5.8  111  184-301   691-822 (1289)
 19 smart00180 EGF_Lam Laminin-typ  97.6 7.8E-05 1.7E-09   52.4   3.3   25  350-375    16-40  (46)
 20 KOG1226|consensus               97.5  0.0003 6.4E-09   74.9   8.0   75  210-290   479-580 (783)
 21 cd00055 EGF_Lam Laminin-type e  97.0  0.0011 2.3E-08   47.4   4.4   40  260-302     2-45  (50)
 22 KOG1226|consensus               97.0  0.0023 5.1E-08   68.3   8.0   87  194-293   515-623 (783)
 23 PF12947 EGF_3:  EGF domain;  I  96.9 0.00034 7.3E-09   46.6   0.8   31  190-221     3-33  (36)
 24 KOG3509|consensus               96.8   0.002 4.4E-08   71.1   6.6  116  243-379   718-853 (964)
 25 KOG1217|consensus               96.8   0.012 2.6E-07   59.5  11.5  104  174-287   157-263 (487)
 26 smart00179 EGF_CA Calcium-bind  96.7  0.0018 3.9E-08   42.8   3.3   36  184-221     1-37  (39)
 27 PF00053 Laminin_EGF:  Laminin   96.6 0.00089 1.9E-08   47.5   1.1   32  267-301    12-43  (49)
 28 smart00180 EGF_Lam Laminin-typ  95.9  0.0081 1.7E-07   42.1   3.2   29  267-298    12-40  (46)
 29 cd00054 EGF_CA Calcium-binding  95.8   0.012 2.5E-07   38.2   3.2   35  185-221     2-36  (38)
 30 PF01437 PSI:  Plexin repeat;    95.7  0.0038 8.3E-08   44.7   0.8   32   43-79      1-32  (51)
 31 PF00008 EGF:  EGF-like domain   95.4  0.0058 1.3E-07   39.5   0.8   26  194-220     5-31  (32)
 32 KOG4260|consensus               95.2   0.019   4E-07   54.8   3.6   66  247-313   132-207 (350)
 33 smart00051 DSL delta serrate l  94.7    0.03 6.5E-07   42.1   2.9   39  247-289    21-63  (63)
 34 PF07974 EGF_2:  EGF-like domai  94.5   0.022 4.9E-07   36.8   1.6   21  236-256    11-32  (32)
 35 smart00423 PSI domain found in  94.1   0.034 7.3E-07   38.8   2.0   30   44-78      1-30  (46)
 36 KOG4260|consensus               94.1    0.06 1.3E-06   51.4   4.2   63  274-375   128-190 (350)
 37 cd00053 EGF Epidermal growth f  94.1   0.054 1.2E-06   34.3   2.8   27  193-220     6-32  (36)
 38 KOG1217|consensus               94.0    0.16 3.5E-06   51.2   7.6  102  181-286   267-388 (487)
 39 smart00051 DSL delta serrate l  93.7   0.061 1.3E-06   40.4   2.8   41  212-256    20-63  (63)
 40 PF01437 PSI:  Plexin repeat;    93.7   0.018 3.9E-07   41.1  -0.1   30  105-141     1-32  (51)
 41 PF12661 hEGF:  Human growth fa  93.5   0.023 4.9E-07   29.3   0.1   12  245-256     2-13  (13)
 42 smart00181 EGF Epidermal growt  93.0    0.11 2.4E-06   33.4   2.9   25  193-219     6-30  (35)
 43 KOG3509|consensus               92.9    0.12 2.5E-06   57.6   4.6   75  274-378   717-794 (964)
 44 PF14670 FXa_inhibition:  Coagu  89.6    0.21 4.6E-06   33.2   1.5   22  200-221    10-31  (36)
 45 PF12662 cEGF:  Complement Clr-  89.0    0.26 5.7E-06   29.8   1.5   20  208-227     1-20  (24)
 46 PF07974 EGF_2:  EGF-like domai  88.8    0.31 6.7E-06   31.5   1.8   26  261-289     7-32  (32)
 47 cd01475 vWA_Matrilin VWA_Matri  84.5    0.83 1.8E-05   42.4   3.0   41  178-221   180-220 (224)
 48 KOG1218|consensus               83.9      11 0.00025   36.2  10.8   41  248-291    96-139 (316)
 49 smart00179 EGF_CA Calcium-bind  83.6     1.1 2.3E-05   29.1   2.4   27  262-289    11-38  (39)
 50 KOG1388|consensus               83.4     1.4 3.1E-05   40.9   3.9   98  201-301    68-169 (217)
 51 PF00008 EGF:  EGF-like domain   83.4    0.25 5.4E-06   31.7  -0.7   26  261-287     5-31  (32)
 52 PF06247 Plasmod_Pvs28:  Plasmo  81.5    0.21 4.6E-06   45.3  -2.1   89  194-286    51-161 (197)
 53 cd00054 EGF_CA Calcium-binding  79.0     1.9   4E-05   27.4   2.3   27  262-289    11-37  (38)
 54 smart00423 PSI domain found in  78.3     1.2 2.6E-05   30.8   1.3   20  106-125     1-22  (46)
 55 cd00053 EGF Epidermal growth f  78.1     2.7 5.8E-05   26.1   2.8   26  261-287     7-32  (36)
 56 PF01414 DSL:  Delta serrate li  78.0    0.68 1.5E-05   34.7  -0.1   10  280-289    54-63  (63)
 57 KOG1218|consensus               74.9      23  0.0005   34.0   9.7   13  176-188    98-110 (316)
 58 PF01414 DSL:  Delta serrate li  74.9    0.61 1.3E-05   35.0  -1.0   45  208-256    16-63  (63)
 59 PHA02887 EGF-like protein; Pro  73.1     1.8   4E-05   36.3   1.2   25  234-258    94-123 (126)
 60 smart00181 EGF Epidermal growt  71.4     3.8 8.2E-05   26.0   2.2   23  262-286     8-30  (35)
 61 PF12946 EGF_MSP1_1:  MSP1 EGF   67.7     1.9 4.1E-05   28.9   0.2   24  198-221     9-33  (37)
 62 PHA03099 epidermal growth fact  65.5       3 6.6E-05   35.6   1.0   25  234-258    53-82  (139)
 63 KOG0196|consensus               55.7      17 0.00037   40.2   4.8   36  274-310   258-297 (996)
 64 PTZ00214 high cysteine membran  51.6      98  0.0021   34.7  10.0  133  197-380   504-662 (800)
 65 PHA03099 epidermal growth fact  50.6     9.5 0.00021   32.7   1.6   18  350-368    65-82  (139)
 66 PF03302 VSP:  Giardia variant-  50.6 1.2E+02  0.0026   30.9   9.9   21  288-311    92-112 (397)
 67 PF01508 Paramecium_SA:  Parame  46.0      16 0.00035   26.5   2.1   34   41-78     27-60  (63)
 68 PF01683 EB:  EB module;  Inter  40.9      37  0.0008   23.7   3.2   29  251-284    17-45  (52)
 69 PTZ00382 Variant-specific surf  37.3      76  0.0017   25.7   4.9   22  288-310     5-26  (96)
 70 PF00954 S_locus_glycop:  S-loc  34.2      38 0.00081   27.7   2.7   33  184-219    76-108 (110)
 71 PHA02887 EGF-like protein; Pro  31.8      29 0.00063   29.3   1.6   18  350-368   106-123 (126)
 72 smart00187 INB Integrin beta s  31.2      27 0.00059   35.9   1.6   20  111-132     1-21  (423)
 73 KOG3848|consensus               30.1      23 0.00049   36.2   0.8   32   42-77    317-348 (516)
 74 PF14843 GF_recep_IV:  Growth f  29.7      17 0.00038   31.0  -0.0   22  352-373    73-95  (132)
 75 PF00362 Integrin_beta:  Integr  29.2      25 0.00055   36.3   1.0   19  111-131     1-20  (426)
 76 PF05283 MGC-24:  Multi-glycosy  24.6      73  0.0016   29.2   3.0   39   40-81     52-92  (186)
 77 cd00185 TNFR Tumor necrosis fa  22.4 1.7E+02  0.0038   23.3   4.6   27  287-314    33-60  (98)

No 1  
>KOG0994|consensus
Probab=99.69  E-value=4.7e-17  Score=173.20  Aligned_cols=173  Identities=32%  Similarity=0.795  Sum_probs=137.0

Q ss_pred             CCCccCCCCCCCccccCCCC-CCCCCCCeeec---CCCCccc-ccCCCceecCC---CCCcccc-CCCCCcC-----Ccc
Q psy9822         174 SFSWNYVKCPPENECLNDHH-TCDPQSEQCVD---LADGFEC-VCGRGYNKSEE---GGECVPV-CSQGCVR-----GVC  239 (380)
Q Consensus       174 ~~~w~~~~C~d~deC~~g~~-~C~~~~g~C~n---~~~g~~C-~C~~Gy~G~~~---~~~C~~~-C~~~C~~-----G~C  239 (380)
                      ...|+|+.|... .| |+|. .|++..|.|++   ...|+.| +|..||+|+..   ...|.|. |+.+-..     -.|
T Consensus       848 pG~WgFPeCr~C-qC-NgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC  925 (1758)
T KOG0994|consen  848 PGYWGFPECRPC-QC-NGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSC  925 (1758)
T ss_pred             CCccCCCcCccc-cc-cCcccccCccccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccc
Confidence            356999999763 67 7776 89999998876   6689999 99999999976   5678888 8766322     245


Q ss_pred             cC-----CCccccCCCcccccccc----------------cCCCCCCC------cccCCCCCcccccCCCCCCCCCCCCC
Q psy9822         240 SE-----PDKCQCDFGYVGVNCSI----------------QCQCNGHA------DCAGPDKLDVCLRCHNHTKGPQCDKC  292 (380)
Q Consensus       240 ~~-----~~~C~C~~G~~G~~C~~----------------~c~C~~~~------~C~~~~~~g~C~~C~~~~~G~~C~~C  292 (380)
                      ..     .-.|.|..||+|.+|++                .|+|+++.      .|.  ..++.|++|...++|.||+.|
T Consensus       926 ~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD--~~TG~CLkCL~hTeG~hCe~C 1003 (1758)
T KOG0994|consen  926 YLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACD--VATGACLKCLYHTEGDHCEHC 1003 (1758)
T ss_pred             cccccccceeeecccCccccchhhhcccccCCcccCCccccccccCCcCccCCCccc--hhhchhhhhhhcccccchhhc
Confidence            54     23899999999999975                35565542      243  368999999999999999999


Q ss_pred             CCCccCCCCCCCCCcCCCCccCCCCCc--ccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCC
Q psy9822         293 EPLYVGDPRDNGECVPCSEYCNGHTGL--CINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCV  370 (380)
Q Consensus       293 ~~g~~G~~~~~~~C~~C~~~Cng~~~~--C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~  370 (380)
                      ++||||++.. ..|+.|.  ||-.++.  |+     +|                    ..+|+|. |.+|..|.+|++|+
T Consensus      1004 k~Gf~GdA~~-q~CqrC~--Cn~LGTn~~~~-----CD--------------------r~tGQCp-ClpNv~G~~CDqCA 1054 (1758)
T KOG0994|consen 1004 KDGFYGDALR-QNCQRCV--CNFLGTNSTCH-----CD--------------------RFTGQCP-CLPNVQGVRCDQCA 1054 (1758)
T ss_pred             cccchhHHHH-hhhhhhe--ccccccCCccc-----cc--------------------cccCcCC-CCcccccccccccc
Confidence            9999999864 5699998  9855432  21     12                    2679999 99999999999999


Q ss_pred             CCCCCCCCC
Q psy9822         371 EGKENRQSR  379 (380)
Q Consensus       371 ~Gy~g~~~~  379 (380)
                      +-+|.+.|+
T Consensus      1055 ~N~w~laSG 1063 (1758)
T KOG0994|consen 1055 ENHWNLASG 1063 (1758)
T ss_pred             cchhccccC
Confidence            999998875


No 2  
>KOG0994|consensus
Probab=99.64  E-value=4e-16  Score=166.16  Aligned_cols=158  Identities=33%  Similarity=0.777  Sum_probs=117.6

Q ss_pred             CCCCCCCCCeeecCCC--Cccc-ccCCCceecCCCCCcccc-CCCC-CcCCcccC-CCcccc------------CCCccc
Q psy9822         192 HHTCDPQSEQCVDLAD--GFEC-VCGRGYNKSEEGGECVPV-CSQG-CVRGVCSE-PDKCQC------------DFGYVG  253 (380)
Q Consensus       192 ~~~C~~~~g~C~n~~~--g~~C-~C~~Gy~G~~~~~~C~~~-C~~~-C~~G~C~~-~~~C~C------------~~G~~G  253 (380)
                      .+.|.+++|.|...|+  |.+| +|.+|++|.+ +..|.+. |... =....|.. .++|+|            .+||.|
T Consensus       774 S~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFG-PsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~Wg  852 (1758)
T KOG0994|consen  774 SSVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFG-PSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWG  852 (1758)
T ss_pred             cccccCCCceecccCccccccccccCCcccCcC-CccCccccccccccccccccccccceeeccccchhhccccCCCccC
Confidence            3478889999999886  8999 9999999988 4567665 5322 11122322 334444            455444


Q ss_pred             -ccccccCCCCCCCc-ccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC--CCCCcCCCCccCCC-------CCcccc
Q psy9822         254 -VNCSIQCQCNGHAD-CAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD--NGECVPCSEYCNGH-------TGLCIN  322 (380)
Q Consensus       254 -~~C~~~c~C~~~~~-C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~--~~~C~~C~~~Cng~-------~~~C~~  322 (380)
                       +.|. +|+||+|+. |..  .++.|+.|++.++|.+|++|..||||+|.-  +..|+||+  |...       ++.|..
T Consensus       853 FPeCr-~CqCNgHA~~Cd~--~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCp--CP~gp~Sg~~~A~sC~~  927 (1758)
T KOG0994|consen  853 FPECR-PCQCNGHADTCDP--ITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCP--CPDGPASGRQHADSCYL  927 (1758)
T ss_pred             CCcCc-cccccCcccccCc--cccccccccccccccchhhhhccccCCcccCCCCCCCCCC--CCCCCccchhccccccc
Confidence             3444 599999975 864  689999999999999999999999999975  57899999  8743       223322


Q ss_pred             cccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822         323 ASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR  379 (380)
Q Consensus       323 ~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~  379 (380)
                           |                  .....-+|. |+.||+|.+|+.|+++|||+|++
T Consensus       928 -----d------------------~~t~~ivC~-C~~GY~G~RCe~CA~~~fGnP~~  960 (1758)
T KOG0994|consen  928 -----D------------------TRTQQIVCH-CQEGYSGSRCEICADNHFGNPSE  960 (1758)
T ss_pred             -----c------------------ccccceeee-cccCccccchhhhcccccCCccc
Confidence                 1                  113568899 99999999999999999999975


No 3  
>KOG1388|consensus
Probab=99.31  E-value=1.4e-12  Score=118.50  Aligned_cols=127  Identities=39%  Similarity=0.994  Sum_probs=95.9

Q ss_pred             hhhcCCCCCccCcCCCCCCCCceeeccCCCCCCCCCCCCCccccccccCCCCcccccccCCCCCccCCCCCCCccccCCC
Q psy9822         113 STCLRHAHCGWCALQRDTTGGMGVCTEGSLNSPSSGPESSTCDILFYQTYPDQVIEVNNNDSFSWNYVKCPPENECLNDH  192 (380)
Q Consensus       113 ~~Cl~~~~CGwC~~~~~~~~g~G~C~~G~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~w~~~~C~d~deC~~g~  192 (380)
                      ++|+++|.||||+  +++.+|.|+||+|..++|.+...   +.             -+   ...|.|.+|| ..+|    
T Consensus         1 ~~c~~~P~cgw~a--~~~~~G~grcm~g~~~~p~d~~~---~~-------------~~---~~~W~fl~cP-~~~c----   54 (217)
T KOG1388|consen    1 SHCLRNPACGWCA--DGSNTGDGRCMEGHYRGPLDCVQ---LE-------------QP---REIWRFLFCP-LCQC----   54 (217)
T ss_pred             CCcccCCCccccc--CCCCCCCCeEEECccccccchhh---cc-------------Cc---cchhhhhcCh-HHHh----
Confidence            3689999999999  99999999999999999987200   00             01   4579999998 3344    


Q ss_pred             CCCCCCCCeeecCCCCcccccCCCceecCCCCCccccCCCCCcCCcccCCCccccCCCcccccccccCCCCCCCcccCCC
Q psy9822         193 HTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPVCSQGCVRGVCSEPDKCQCDFGYVGVNCSIQCQCNGHADCAGPD  272 (380)
Q Consensus       193 ~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~C~~~C~~G~C~~~~~C~C~~G~~G~~C~~~c~C~~~~~C~~~~  272 (380)
                                                                                            ++|+.|..  
T Consensus        55 ----------------------------------------------------------------------NGh~~c~t--   62 (217)
T KOG1388|consen   55 ----------------------------------------------------------------------NGHSDCNT--   62 (217)
T ss_pred             ----------------------------------------------------------------------cCCCCccc--
Confidence                                                                                  34444443  


Q ss_pred             CCcccccCCCCCCCCCCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCc
Q psy9822         273 KLDVCLRCHNHTKGPQCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRA  352 (380)
Q Consensus       273 ~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g  352 (380)
                       .-.|.+|+++++|.+|++|+.+|+|| .+++.|++|.  |++....|+.                           ..+
T Consensus        63 -~~v~~~~~N~~~g~~c~kc~~g~~Gd-tN~g~c~~~~--~~g~~~~~~~---------------------------~~~  111 (217)
T KOG1388|consen   63 -QHVCWRCENGTTGAHCEKCIVGFYGD-TNGGKCQPCD--CNGGASACVT---------------------------LTG  111 (217)
T ss_pred             -ceeeeeccCccccccCCceEEEEEec-CCCCccCHhh--hcCCeeeeec---------------------------cCC
Confidence             33444588999999999999999999 8899999999  9987665542                           346


Q ss_pred             ccccC-CCCCCCCCCCcC
Q psy9822         353 RCMHC-GNHTTGPKCEDC  369 (380)
Q Consensus       353 ~C~~C-~~g~~G~~Ce~C  369 (380)
                      +|. | ..++.|++|++|
T Consensus       112 ~c~-c~~kgvvgd~c~~~  128 (217)
T KOG1388|consen  112 KCF-CTTKGVVGDLCPKC  128 (217)
T ss_pred             ccc-cccceEecccCccc
Confidence            676 7 457778777765


No 4  
>KOG3512|consensus
Probab=99.16  E-value=2.8e-11  Score=120.42  Aligned_cols=102  Identities=35%  Similarity=0.839  Sum_probs=83.8

Q ss_pred             cCCCCCCCc-ccCCC-CCcccccCCCCCCCCCCCCCCCCccCCCCC------CCCCcCCCCccCCCCCcccccccCCCCC
Q psy9822         259 QCQCNGHAD-CAGPD-KLDVCLRCHNHTKGPQCDKCEPLYVGDPRD------NGECVPCSEYCNGHTGLCINASLASLPF  330 (380)
Q Consensus       259 ~c~C~~~~~-C~~~~-~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~------~~~C~~C~~~Cng~~~~C~~~~~~~~p~  330 (380)
                      .|.||+|++ |+.+. ...+|+ |.+|++|+.|++|++.|+..|..      -..|.+|.  ||+|+.+|..+.+     
T Consensus       277 RCKCNgHAs~Cv~d~~~~ltCd-C~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~--Cn~harrcrfn~E-----  348 (592)
T KOG3512|consen  277 RCKCNGHASRCVMDESSHLTCD-CEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACN--CNGHARRCRFNME-----  348 (592)
T ss_pred             eeeecCccceeeeccCCceEEe-cccCCCCCCcccccccccCCCccccccCCCccccccc--cchhhhhcccchh-----
Confidence            588999976 87543 348995 99999999999999999988843      35799999  9999998875442     


Q ss_pred             CCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822         331 SPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR  379 (380)
Q Consensus       331 ~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~  379 (380)
                       .+.++          |+.+.++|++|++++.|+||..|++|||++.+.
T Consensus       349 -ly~lS----------gr~SggvClnCrHnTaGrhChyCreGyyRd~s~  386 (592)
T KOG3512|consen  349 -LYRLS----------GRRSGGVCLNCRHNTAGRHCHYCREGYYRDGSK  386 (592)
T ss_pred             -hhccc----------CccccceEeecccCCCCcccccccCccccCCCC
Confidence             33333          455789999999999999999999999999874


No 5  
>KOG1836|consensus
Probab=99.06  E-value=6.2e-10  Score=127.51  Aligned_cols=160  Identities=34%  Similarity=0.799  Sum_probs=120.9

Q ss_pred             CCC-CCCCCCCCeeecCC--CCccc-ccCCCceecCC---CCCcccc-CCCCCcCCcccC-----CCccc-cCCCccccc
Q psy9822         190 NDH-HTCDPQSEQCVDLA--DGFEC-VCGRGYNKSEE---GGECVPV-CSQGCVRGVCSE-----PDKCQ-CDFGYVGVN  255 (380)
Q Consensus       190 ~g~-~~C~~~~g~C~n~~--~g~~C-~C~~Gy~G~~~---~~~C~~~-C~~~C~~G~C~~-----~~~C~-C~~G~~G~~  255 (380)
                      ++| ..|++.+|.|...+  .+.+| +|.+||+|...   .+.|.+. |++.   +.|..     ...|. |+.||+|..
T Consensus       734 ngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~---~~~~~~~~~~~~iCk~Cp~gytG~r  810 (1705)
T KOG1836|consen  734 NGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNG---GACGQTPEILEVVCKNCPPGYTGLR  810 (1705)
T ss_pred             CCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCCCccCCCCCC---hhhcCcCcccceecCCCCCCCcccc
Confidence            566 48999999887644  68899 99999999876   3348877 7765   33332     45897 999999988


Q ss_pred             ccc--------------------cCCCCCCC------cccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC---CCCC
Q psy9822         256 CSI--------------------QCQCNGHA------DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD---NGEC  306 (380)
Q Consensus       256 C~~--------------------~c~C~~~~------~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~---~~~C  306 (380)
                      |+.                    .|+|+++.      .|..  .++.|+.|.+++.|.+|+.|+++|+|++..   .+.|
T Consensus       811 Ce~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~--~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c  888 (1705)
T KOG1836|consen  811 CEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNR--LTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKC  888 (1705)
T ss_pred             cccCCCccccCCCCCCCCcccCccceeccccCccccccccc--cccceeeccCCcccccccccccCccccccCCCcCCcc
Confidence            863                    34565542      3542  578898899999999999999999999865   4678


Q ss_pred             cCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822         307 VPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR  379 (380)
Q Consensus       307 ~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~  379 (380)
                      ..|.  |+..+..        .|    .+          .-+..+|+|. |.++.+|..|..|.+|||++++.
T Consensus       889 ~~c~--c~p~gs~--------~~----~~----------~c~~~tGQce-c~~~v~g~~c~~c~~g~fnl~s~  936 (1705)
T KOG1836|consen  889 FACG--CVPAGSE--------LP----SL----------TCNPVTGQCE-CKPNVEGRDCLYCFKGFFNLNSG  936 (1705)
T ss_pred             cccc--CccCCcc--------cc----cc----------cCCCccccee-ccCCCCccccccccccccccCCC
Confidence            8888  7754321        00    00          1123679999 99999999999999999999854


No 6  
>KOG3512|consensus
Probab=99.01  E-value=4.3e-10  Score=112.07  Aligned_cols=78  Identities=28%  Similarity=0.648  Sum_probs=63.9

Q ss_pred             CcccccCCCCCCCCCCCCCCCCccCCCCC----CCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCC
Q psy9822         274 LDVCLRCHNHTKGPQCDKCEPLYVGDPRD----NGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPT  349 (380)
Q Consensus       274 ~g~C~~C~~~~~G~~C~~C~~g~~G~~~~----~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~  349 (380)
                      .++|+.|++++.|.+|..|++|||.|+..    ...|..|+  ||..+..                     ++.   -++
T Consensus       358 ggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~Cd--ChpVGs~---------------------gkt---CNq  411 (592)
T KOG3512|consen  358 GGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACD--CHPVGSA---------------------GKT---CNQ  411 (592)
T ss_pred             cceEeecccCCCCcccccccCccccCCCCCCchhhhhhhcC--Ccccccc---------------------ccc---ccc
Confidence            57899999999999999999999999753    46799999  9876421                     011   123


Q ss_pred             CCcccccCCCCCCCCCCCcCCCCCCCCCC
Q psy9822         350 TRARCMHCGNHTTGPKCEDCVEGKENRQS  378 (380)
Q Consensus       350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~  378 (380)
                      .+|+|. |++|.+|..|++|++||+-..+
T Consensus       412 ~tGqCp-CkeGvtG~tCnrCa~gyqqsrs  439 (592)
T KOG3512|consen  412 TTGQCP-CKEGVTGLTCNRCAPGYQQSRS  439 (592)
T ss_pred             cCCccc-CCCCCcccccccccchhhcccC
Confidence            679999 9999999999999999997655


No 7  
>KOG1836|consensus
Probab=98.84  E-value=4.6e-09  Score=120.52  Aligned_cols=108  Identities=32%  Similarity=0.815  Sum_probs=88.2

Q ss_pred             ccccCCCcccccccc-------------------cCCCCCCC-cccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCCC
Q psy9822         244 KCQCDFGYVGVNCSI-------------------QCQCNGHA-DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRDN  303 (380)
Q Consensus       244 ~C~C~~G~~G~~C~~-------------------~c~C~~~~-~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~~  303 (380)
                      .|.|+.||+|..|+.                   +|.|++|+ +|..  .++.|. |++++.|.+|++|++||||++..+
T Consensus       696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~--~tG~C~-C~~~t~G~~C~~C~~GfYg~~~~~  772 (1705)
T KOG1836|consen  696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDP--RTGQCK-CKHNTFGGQCAQCVDGFYGLPDLG  772 (1705)
T ss_pred             hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccC--CCCcee-cccCCCCCchhhhcCCCCCccccC
Confidence            599999999988863                   57788884 5864  789996 999999999999999999999764


Q ss_pred             C--CCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCCC
Q psy9822         304 G--ECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSRN  380 (380)
Q Consensus       304 ~--~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~~  380 (380)
                      .  .|++|.  |... +.|....                       ....++|.+|+++|+|.+|+.|..|||+++..+
T Consensus       773 ~~~dC~~C~--Cp~~-~~~~~~~-----------------------~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~  825 (1705)
T KOG1836|consen  773 TSGDCQPCP--CPNG-GACGQTP-----------------------EILEVVCKNCPPGYTGLRCEECADGYFGNPLGH  825 (1705)
T ss_pred             CCCCCccCC--CCCC-hhhcCcC-----------------------cccceecCCCCCCCcccccccCCCccccCCCCC
Confidence            3  399999  8843 4443211                       135789999999999999999999999999764


No 8  
>KOG1225|consensus
Probab=98.63  E-value=7.7e-08  Score=99.45  Aligned_cols=74  Identities=32%  Similarity=0.891  Sum_probs=54.5

Q ss_pred             CCcccccCCCceecCCCCCcccc-CCCCCc-CCcccCCCccccCCCcccccccc-cC--CCCCCCcccCCCCCcccccCC
Q psy9822         207 DGFECVCGRGYNKSEEGGECVPV-CSQGCV-RGVCSEPDKCQCDFGYVGVNCSI-QC--QCNGHADCAGPDKLDVCLRCH  281 (380)
Q Consensus       207 ~g~~C~C~~Gy~G~~~~~~C~~~-C~~~C~-~G~C~~~~~C~C~~G~~G~~C~~-~c--~C~~~~~C~~~~~~g~C~~C~  281 (380)
                      ....|.|..+|+|..    |... |+..|. +|.|+ +++|+|++||+|.+|+. .|  .|++|..+.    .++|+ |+
T Consensus       232 ~~~ic~c~~~~~g~~----c~~~~C~~~c~~~g~c~-~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~----~g~Ci-C~  301 (525)
T KOG1225|consen  232 FDGICECPEGYFGPL----CSTIYCPGGCTGRGQCV-EGRCICPPGFTGDDCDELVCPVDCSGGGVCV----DGECI-CN  301 (525)
T ss_pred             cCceeecCCceeCCc----cccccCCCCCcccceEe-CCeEeCCCCCcCCCCCcccCCcccCCCceec----CCEee-cC
Confidence            345788888888876    6655 877776 45554 46888999999988886 33  366777776    45886 88


Q ss_pred             CCCCCCCCC
Q psy9822         282 NHTKGPQCD  290 (380)
Q Consensus       282 ~~~~G~~C~  290 (380)
                      ++|+|..|+
T Consensus       302 ~g~~G~dCs  310 (525)
T KOG1225|consen  302 PGYSGKDCS  310 (525)
T ss_pred             CCccccccc
Confidence            888888885


No 9  
>KOG1225|consensus
Probab=98.59  E-value=1.3e-07  Score=97.70  Aligned_cols=99  Identities=32%  Similarity=0.835  Sum_probs=72.4

Q ss_pred             CCCccCCCCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCcccc-CCCCCc-CCcccCCCccccCCCc
Q psy9822         174 SFSWNYVKCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPV-CSQGCV-RGVCSEPDKCQCDFGY  251 (380)
Q Consensus       174 ~~~w~~~~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~-C~~~C~-~G~C~~~~~C~C~~G~  251 (380)
                      ..+|.++.|.. -.|.   +.|. .++.|++.    +|.|++||+|+.    |... |+..|. ++.+++. +|+|++||
T Consensus       239 ~~~~~g~~c~~-~~C~---~~c~-~~g~c~~G----~CIC~~Gf~G~d----C~e~~Cp~~cs~~g~~~~g-~CiC~~g~  304 (525)
T KOG1225|consen  239 PEGYFGPLCST-IYCP---GGCT-GRGQCVEG----RCICPPGFTGDD----CDELVCPVDCSGGGVCVDG-ECICNPGY  304 (525)
T ss_pred             CCceeCCcccc-ccCC---CCCc-ccceEeCC----eEeCCCCCcCCC----CCcccCCcccCCCceecCC-EeecCCCc
Confidence            34566666652 2331   2233 33556544    899999999988    6665 887787 4566554 99999999


Q ss_pred             ccccccc---cCCCCCCCcccCCCCCcccccCCCCCCCCCCCC
Q psy9822         252 VGVNCSI---QCQCNGHADCAGPDKLDVCLRCHNHTKGPQCDK  291 (380)
Q Consensus       252 ~G~~C~~---~c~C~~~~~C~~~~~~g~C~~C~~~~~G~~C~~  291 (380)
                      +|..|++   +-.|++|+.|+    .+.|. |.++|+|..|..
T Consensus       305 ~G~dCs~~~cpadC~g~G~Ci----~G~C~-C~~Gy~G~~C~~  342 (525)
T KOG1225|consen  305 SGKDCSIRRCPADCSGHGKCI----DGECL-CDEGYTGELCIQ  342 (525)
T ss_pred             cccccccccCCccCCCCCccc----CCceE-eCCCCcCCcccc
Confidence            9999986   35699999998    68995 999999998864


No 10 
>KOG1214|consensus
Probab=98.18  E-value=5.4e-06  Score=87.97  Aligned_cols=107  Identities=29%  Similarity=0.744  Sum_probs=73.8

Q ss_pred             CCCCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCcccc--------CCC---CCc-CC--cccC---
Q psy9822         179 YVKCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPV--------CSQ---GCV-RG--VCSE---  241 (380)
Q Consensus       179 ~~~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~--------C~~---~C~-~G--~C~~---  241 (380)
                      +..|.|++||+.+.+.|. .+.+|+|.+++|+|+|..||.-.+....|.++        |..   .|. .|  +|+.   
T Consensus       728 gr~c~d~~eca~~~~~CG-p~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGg  806 (1289)
T KOG1214|consen  728 GRNCVDENECATGFHRCG-PNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGG  806 (1289)
T ss_pred             CCCCCChhhhccCCCCCC-CCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCC
Confidence            467999999999999999 67899999999999999998644433456544        211   133 23  4443   


Q ss_pred             -CCccccCCCcccc--cccc-----cCCCCCCCcccCCCCCcccccCCCCCCCC
Q psy9822         242 -PDKCQCDFGYVGV--NCSI-----QCQCNGHADCAGPDKLDVCLRCHNHTKGP  287 (380)
Q Consensus       242 -~~~C~C~~G~~G~--~C~~-----~c~C~~~~~C~~~~~~g~C~~C~~~~~G~  287 (380)
                       .+.|.|.+||.|.  .|..     +..|...+.|.++++.+.| +|+++|.|.
T Consensus       807 s~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC-~C~pGy~GD  859 (1289)
T KOG1214|consen  807 STYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSC-RCQPGYYGD  859 (1289)
T ss_pred             ceEEEeecCCccCCccccccccccCccccCCCceEecCCCccee-ecccCccCC
Confidence             3599999999973  2322     3446666777777777777 355555443


No 11 
>KOG4289|consensus
Probab=98.08  E-value=7e-06  Score=90.95  Aligned_cols=107  Identities=34%  Similarity=0.868  Sum_probs=61.2

Q ss_pred             ccccCC-CCCc-CCcccC-----CCccccCCCcccccccc--cCCCC----CCCcccCCCCCcccccCC---------CC
Q psy9822         226 CVPVCS-QGCV-RGVCSE-----PDKCQCDFGYVGVNCSI--QCQCN----GHADCAGPDKLDVCLRCH---------NH  283 (380)
Q Consensus       226 C~~~C~-~~C~-~G~C~~-----~~~C~C~~G~~G~~C~~--~c~C~----~~~~C~~~~~~g~C~~C~---------~~  283 (380)
                      |.++|. ++|. .|+|+.     .++|+|++||+|++|+.  +-+|-    ++-+|      +.| .|.         +-
T Consensus      1715 C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~C------gpC-~CavsKgfdp~CnK 1787 (2531)
T KOG4289|consen 1715 CVDVCSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCELRADQPCPRGWWGFPTC------GPC-NCAVSKGFDPDCNK 1787 (2531)
T ss_pred             ccchhcccccccCceeecCCCCCceeEECCCcccCcchhhhccCCCCCcccCCCCc------cCc-cccccCCCCCCccc
Confidence            666663 4555 477765     35788888888877764  12221    11111      111 111         01


Q ss_pred             CCCCCCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCC
Q psy9822         284 TKGPQCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTG  363 (380)
Q Consensus       284 ~~G~~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G  363 (380)
                      +.| .| +|+..+|...   +.|.+|+  |. ++..                +.        +. ..+|+|. |+++..|
T Consensus      1788 t~G-~C-qCKe~hy~~~---~~Cl~Cd--C~-~Gs~----------------Sr--------~C-~adGqC~-C~pgaiG 1833 (2531)
T KOG4289|consen 1788 TNG-QC-QCKENHYRPI---GSCLPCD--CY-FGSD----------------SR--------EC-DADGQCP-CKPGAIG 1833 (2531)
T ss_pred             cCc-ce-eeccccccCC---Ccceeec--cc-cCCC----------------cc--------cc-cCCCcCC-CCCcccc
Confidence            122 23 6777666543   3599999  77 4310                00        01 1579999 9999999


Q ss_pred             CCCCcCCCCC
Q psy9822         364 PKCEDCVEGK  373 (380)
Q Consensus       364 ~~Ce~C~~Gy  373 (380)
                      ++|++|..-|
T Consensus      1834 RqCdrCd~pf 1843 (2531)
T KOG4289|consen 1834 RQCDRCDNPF 1843 (2531)
T ss_pred             ccccccCChh
Confidence            9999997543


No 12 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.92  E-value=1.2e-05  Score=57.46  Aligned_cols=28  Identities=32%  Similarity=0.730  Sum_probs=26.2

Q ss_pred             CCcccccCCCCCCCCCCCcCCCCCCCCCC
Q psy9822         350 TRARCMHCGNHTTGPKCEDCVEGKENRQS  378 (380)
Q Consensus       350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~  378 (380)
                      .+++|+ |+++|+|++|++|++|||+++.
T Consensus        17 ~~G~C~-C~~~~~G~~C~~C~~g~~~~~~   44 (50)
T cd00055          17 GTGQCE-CKPNTTGRRCDRCAPGYYGLPS   44 (50)
T ss_pred             CCCEEe-CCCcCCCCCCCCCCCCCccCCC
Confidence            579999 9999999999999999999975


No 13 
>KOG1219|consensus
Probab=97.91  E-value=1.1e-05  Score=92.45  Aligned_cols=101  Identities=27%  Similarity=0.729  Sum_probs=80.0

Q ss_pred             ccccCCCCCCCCCCCeeecCC-CCcccccCCCceecCCCCCcccc---C-CCCCc-CCcccC---CCccccCCCcccccc
Q psy9822         186 NECLNDHHTCDPQSEQCVDLA-DGFECVCGRGYNKSEEGGECVPV---C-SQGCV-RGVCSE---PDKCQCDFGYVGVNC  256 (380)
Q Consensus       186 deC~~g~~~C~~~~g~C~n~~-~g~~C~C~~Gy~G~~~~~~C~~~---C-~~~C~-~G~C~~---~~~C~C~~G~~G~~C  256 (380)
                      +.|.  ..+|. ++|+|+-.+ ++|.|.|++-|.|..    |+.-   | .++|. -|+|+.   .+.|.|+.||||.+|
T Consensus      3865 d~C~--~npCq-hgG~C~~~~~ggy~CkCpsqysG~~----CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~C 3937 (4289)
T KOG1219|consen 3865 DPCN--DNPCQ-HGGTCISQPKGGYKCKCPSQYSGNH----CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRC 3937 (4289)
T ss_pred             cccc--cCccc-CCCEecCCCCCceEEeCcccccCcc----cccccccccCCCCCCCCEEEecCCCeeEeCCCCccCcee
Confidence            4552  35787 899999877 699999999999987    6532   5 45676 489986   469999999999999


Q ss_pred             ccc----C---CCCCCCcccCCCCCcccccCCCCCCCCCCCCCCC
Q psy9822         257 SIQ----C---QCNGHADCAGPDKLDVCLRCHNHTKGPQCDKCEP  294 (380)
Q Consensus       257 ~~~----c---~C~~~~~C~~~~~~g~C~~C~~~~~G~~C~~C~~  294 (380)
                      +..    |   .|...+.|++.++.+.|. |.+++.|.+|..-+|
T Consensus      3938 e~~Gi~eCs~n~C~~gg~C~n~~gsf~Cn-cT~g~~gr~c~~~~p 3981 (4289)
T KOG1219|consen 3938 EARGISECSKNVCGTGGQCINIPGSFHCN-CTPGILGRTCCAEKP 3981 (4289)
T ss_pred             ecccccccccccccCCceeeccCCceEec-cChhHhcccCccccC
Confidence            863    2   466678899999999994 999999999964443


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.90  E-value=6.5e-06  Score=56.69  Aligned_cols=36  Identities=28%  Similarity=0.804  Sum_probs=32.5

Q ss_pred             CCccccCCCCCCCCCCCeeecCCCCcccccCCCceec
Q psy9822         184 PENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKS  220 (380)
Q Consensus       184 d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~  220 (380)
                      |+|||..+.+.|. .++.|+|+.++|+|.|++||...
T Consensus         1 DidEC~~~~~~C~-~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    1 DIDECAEGPHNCP-ENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             ESSTTTTTSSSSS-TTSEEEEETTEEEEEESTTEEEC
T ss_pred             CccccCCCCCcCC-CCCEEEcCCCCEEeeCCCCcEEC
Confidence            6899998888996 89999999999999999999843


No 15 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.87  E-value=4.3e-06  Score=59.48  Aligned_cols=29  Identities=34%  Similarity=0.819  Sum_probs=24.9

Q ss_pred             CCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822         350 TRARCMHCGNHTTGPKCEDCVEGKENRQSR  379 (380)
Q Consensus       350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~  379 (380)
                      .+++|+ |+++|+|++|++|++|||+++.+
T Consensus        16 ~~G~C~-C~~~~~G~~C~~C~~g~~~~~~~   44 (49)
T PF00053_consen   16 STGQCV-CKPGTTGPRCDQCKPGYFGLPSD   44 (49)
T ss_dssp             TCEEES-BSTTEESTTS-EE-TTEECSTTS
T ss_pred             CCCEEe-ccccccCCcCcCCCCccccccCC
Confidence            579999 99999999999999999999764


No 16 
>KOG4289|consensus
Probab=97.72  E-value=2.3e-05  Score=87.06  Aligned_cols=82  Identities=24%  Similarity=0.713  Sum_probs=62.8

Q ss_pred             CCCCcccccCCCceecCCCCCcccc---CC-CCCc-CCcccC---CCccccCCCcccccccc--------cCCCCCCCcc
Q psy9822         205 LADGFECVCGRGYNKSEEGGECVPV---CS-QGCV-RGVCSE---PDKCQCDFGYVGVNCSI--------QCQCNGHADC  268 (380)
Q Consensus       205 ~~~g~~C~C~~Gy~G~~~~~~C~~~---C~-~~C~-~G~C~~---~~~C~C~~G~~G~~C~~--------~c~C~~~~~C  268 (380)
                      ..++++|+|++||+|+.    |+..   |. .+|. +|+|..   .++|.|.+||+|..|++        +--|.+.++|
T Consensus      1218 pvnglrCrCPpGFTgd~----CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC 1293 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDY----CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTC 1293 (2531)
T ss_pred             ccCceeEeCCCCCCccc----ccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEE
Confidence            34789999999999987    7644   75 5688 799987   57999999999999996        2357788889


Q ss_pred             cCCCCCcccccCCCC-CCCCCCC
Q psy9822         269 AGPDKLDVCLRCHNH-TKGPQCD  290 (380)
Q Consensus       269 ~~~~~~g~C~~C~~~-~~G~~C~  290 (380)
                      .+..+.+.|-.|+.+ +++++|+
T Consensus      1294 ~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1294 VNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred             eecCCCceeccCCCcccCCCceE
Confidence            875554444358654 5777886


No 17 
>KOG1219|consensus
Probab=97.72  E-value=3.6e-05  Score=88.56  Aligned_cols=102  Identities=27%  Similarity=0.576  Sum_probs=77.1

Q ss_pred             CCCCcC-CcccC----CCccccCCCcccccccc------cCCCCCCCcccCCCCCcccccCCCCCCCCCCCCCCCCccCC
Q psy9822         231 SQGCVR-GVCSE----PDKCQCDFGYVGVNCSI------QCQCNGHADCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGD  299 (380)
Q Consensus       231 ~~~C~~-G~C~~----~~~C~C~~G~~G~~C~~------~c~C~~~~~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~  299 (380)
                      .++|+| |+|+.    .++|.|+.-|+|..|++      ..+|..+++|+...+.+.| +|+++|+|.+|+.=  |    
T Consensus      3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~C-nC~~gyTG~~Ce~~--G---- 3941 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLC-NCPNGYTGKRCEAR--G---- 3941 (4289)
T ss_pred             cCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeE-eCCCCccCceeecc--c----
Confidence            355775 79987    36999999999999997      4578899999987889999 69999999999752  1    


Q ss_pred             CCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCC
Q psy9822         300 PRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVE  371 (380)
Q Consensus       300 ~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~  371 (380)
                         -..|..-.  |. +++.|.+..                         +.+.|. |.+++.|++|..=+|
T Consensus      3942 ---i~eCs~n~--C~-~gg~C~n~~-------------------------gsf~Cn-cT~g~~gr~c~~~~p 3981 (4289)
T KOG1219|consen 3942 ---ISECSKNV--CG-TGGQCINIP-------------------------GSFHCN-CTPGILGRTCCAEKP 3981 (4289)
T ss_pred             ---cccccccc--cc-CCceeeccC-------------------------CceEec-cChhHhcccCccccC
Confidence               12343222  33 778887421                         458999 999999999965443


No 18 
>KOG1214|consensus
Probab=97.57  E-value=9.9e-05  Score=78.68  Aligned_cols=111  Identities=29%  Similarity=0.705  Sum_probs=72.1

Q ss_pred             CCccccCCCCCCCCCCCeeecCC-CCcccccCCCceecCCCCCcccc--C---CCCCc-CCcccC---CCccccCCCcc-
Q psy9822         184 PENECLNDHHTCDPQSEQCVDLA-DGFECVCGRGYNKSEEGGECVPV--C---SQGCV-RGVCSE---PDKCQCDFGYV-  252 (380)
Q Consensus       184 d~deC~~g~~~C~~~~g~C~n~~-~g~~C~C~~Gy~G~~~~~~C~~~--C---~~~C~-~G~C~~---~~~C~C~~G~~-  252 (380)
                      .++.|..+.+.|+ .++.|.-.+ -.|+|.|..||.|++  ..|...  |   ...|. +..|++   .++|.|..||. 
T Consensus       691 ~~npCy~gsh~cd-t~a~C~pg~~~~~tcecs~g~~gdg--r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F  767 (1289)
T KOG1214|consen  691 PVNPCYDGSHMCD-TTARCHPGTGVDYTCECSSGYQGDG--RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEF  767 (1289)
T ss_pred             ccccceecCcccC-CCccccCCCCcceEEEEeeccCCCC--CCCCChhhhccCCCCCCCCceeecCCCceeEEEeeccee
Confidence            4567777888888 777887655 489999999999997  456554  3   45587 789998   35888887774 


Q ss_pred             ---cccccc------cCCCCCC-CcccCCCCCcccccCCCCCCCCCCCCCCCCccCCCC
Q psy9822         253 ---GVNCSI------QCQCNGH-ADCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPR  301 (380)
Q Consensus       253 ---G~~C~~------~c~C~~~-~~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~  301 (380)
                         +.+|-.      ...|-.. .+|..   .+++ .|...=.+.+-..|.|||.|++.
T Consensus       768 ~dd~~tCV~i~~pap~n~Ce~g~h~C~i---~g~a-~c~~hGgs~y~C~CLPGfsGDG~  822 (1289)
T KOG1214|consen  768 ADDRHTCVLITPPAPANPCEDGSHTCAI---AGQA-RCVHHGGSTYSCACLPGFSGDGH  822 (1289)
T ss_pred             ccCCcceEEecCCCCCCccccCccccCc---CCce-EEEecCCceEEEeecCCccCCcc
Confidence               245532      2344433 34642   3444 24433333333378889988874


No 19 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=97.55  E-value=7.8e-05  Score=52.41  Aligned_cols=25  Identities=28%  Similarity=0.801  Sum_probs=23.8

Q ss_pred             CCcccccCCCCCCCCCCCcCCCCCCC
Q psy9822         350 TRARCMHCGNHTTGPKCEDCVEGKEN  375 (380)
Q Consensus       350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g  375 (380)
                      .+++|+ |+++++|++|++|++||||
T Consensus        16 ~~G~C~-C~~~~~G~~C~~C~~g~~g   40 (46)
T smart00180       16 DTGQCE-CKPNVTGRRCDRCAPGYYG   40 (46)
T ss_pred             CCCEEE-CCCCCCCCCCCcCCCCcCC
Confidence            468999 9999999999999999999


No 20 
>KOG1226|consensus
Probab=97.48  E-value=0.0003  Score=74.94  Aligned_cols=75  Identities=28%  Similarity=0.749  Sum_probs=48.7

Q ss_pred             ccccCCCceecCC--CCCc------cccCCC-----CCc-CCcccCCCccccCCCcc----cccccc---cC------CC
Q psy9822         210 ECVCGRGYNKSEE--GGEC------VPVCSQ-----GCV-RGVCSEPDKCQCDFGYV----GVNCSI---QC------QC  262 (380)
Q Consensus       210 ~C~C~~Gy~G~~~--~~~C------~~~C~~-----~C~-~G~C~~~~~C~C~~G~~----G~~C~~---~c------~C  262 (380)
                      .|.|.+||.|..+  ....      ++.|..     .|. +|.|.- ++|+|.....    |..|+-   .|      .|
T Consensus       479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC  557 (783)
T KOG1226|consen  479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDNFSCERHKGVLC  557 (783)
T ss_pred             ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccCcccccccCccc
Confidence            4689999999885  0000      111321     243 444432 4788887777    777762   12      37


Q ss_pred             CCCCcccCCCCCcccccCCCCCCCCCCC
Q psy9822         263 NGHADCAGPDKLDVCLRCHNHTKGPQCD  290 (380)
Q Consensus       263 ~~~~~C~~~~~~g~C~~C~~~~~G~~C~  290 (380)
                      .+|+.|.    -++|+ |.+||+|..|+
T Consensus       558 ~g~G~C~----CG~Cv-C~~GwtG~~C~  580 (783)
T KOG1226|consen  558 GGHGRCE----CGRCV-CNPGWTGSACN  580 (783)
T ss_pred             CCCCeEe----CCcEE-cCCCCccCCCC
Confidence            8888886    78995 99999999983


No 21 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.03  E-value=0.0011  Score=47.40  Aligned_cols=40  Identities=43%  Similarity=1.076  Sum_probs=31.7

Q ss_pred             CCCCCCCc----ccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC
Q psy9822         260 CQCNGHAD----CAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD  302 (380)
Q Consensus       260 c~C~~~~~----C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~  302 (380)
                      |.|+.+++    |..  .+++|. |+++++|.+|++|+++|++.+..
T Consensus         2 C~C~~~g~~~~~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~~~~~~   45 (50)
T cd00055           2 CDCNGHGSLSGQCDP--GTGQCE-CKPNTTGRRCDRCAPGYYGLPSQ   45 (50)
T ss_pred             CcCcCCCCCCccccC--CCCEEe-CCCcCCCCCCCCCCCCCccCCCC
Confidence            44555443    753  579995 99999999999999999998854


No 22 
>KOG1226|consensus
Probab=96.96  E-value=0.0023  Score=68.31  Aligned_cols=87  Identities=31%  Similarity=0.819  Sum_probs=55.3

Q ss_pred             CCCCCCCeeecCCCCcccccCCCce----ecCCCCCcccc-CCCC----Cc-CCcccCCCccccCCCcccccccc-----
Q psy9822         194 TCDPQSEQCVDLADGFECVCGRGYN----KSEEGGECVPV-CSQG----CV-RGVCSEPDKCQCDFGYVGVNCSI-----  258 (380)
Q Consensus       194 ~C~~~~g~C~n~~~g~~C~C~~Gy~----G~~~~~~C~~~-C~~~----C~-~G~C~~~~~C~C~~G~~G~~C~~-----  258 (380)
                      .|. ..|.|...    +|.|.+...    |..+  +|... |...    |. ||+|.- ++|+|.+||+|..|.-     
T Consensus       515 vCS-grG~C~CG----qC~C~~~~~~~i~G~fC--ECDnfsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~C~~std  586 (783)
T KOG1226|consen  515 VCS-GRGDCVCG----QCVCHKPDNGKIYGKFC--ECDNFSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACNCPLSTD  586 (783)
T ss_pred             CcC-CCCcEeCC----ceEecCCCCCceeeeee--eccCcccccccCcccCCCCeEeC-CcEEcCCCCccCCCCCCCCCc
Confidence            455 55556553    566655444    5541  22233 5322    65 777643 6899999999998862     


Q ss_pred             ------cCCCCCCCcccCCCCCcccccCCCC-CCCCCCCCCC
Q psy9822         259 ------QCQCNGHADCAGPDKLDVCLRCHNH-TKGPQCDKCE  293 (380)
Q Consensus       259 ------~c~C~~~~~C~~~~~~g~C~~C~~~-~~G~~C~~C~  293 (380)
                            .-.|+++++|.    -++| +|.+. +.|.+||.|.
T Consensus       587 ~C~~~~G~iCSGrG~C~----Cg~C-~C~~~~~sG~~CE~cp  623 (783)
T KOG1226|consen  587 TCESSDGQICSGRGTCE----CGRC-KCTDPPYSGEFCEKCP  623 (783)
T ss_pred             cccCCCCceeCCCceee----CCce-EcCCCCcCcchhhcCC
Confidence                  11367777775    6778 48766 8999998775


No 23 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.91  E-value=0.00034  Score=46.64  Aligned_cols=31  Identities=19%  Similarity=0.728  Sum_probs=24.6

Q ss_pred             CCCCCCCCCCCeeecCCCCcccccCCCceecC
Q psy9822         190 NDHHTCDPQSEQCVDLADGFECVCGRGYNKSE  221 (380)
Q Consensus       190 ~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~  221 (380)
                      ...+.|+ .+++|++++++|+|+|++||.|++
T Consensus         3 ~~~~~C~-~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    3 ENNGGCH-PNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             TGGGGS--TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCC-CCcEeecCCCCEEeECCCCCccCC
Confidence            3456788 789999999999999999999987


No 24 
>KOG3509|consensus
Probab=96.84  E-value=0.002  Score=71.10  Aligned_cols=116  Identities=27%  Similarity=0.541  Sum_probs=76.5

Q ss_pred             CccccCCCcccccccc-------------------cCCCCCCCc-ccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC
Q psy9822         243 DKCQCDFGYVGVNCSI-------------------QCQCNGHAD-CAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD  302 (380)
Q Consensus       243 ~~C~C~~G~~G~~C~~-------------------~c~C~~~~~-C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~  302 (380)
                      .+|+|+.|+.|..|+.                   .+.|+.|+. |..  ..+.|..|++++.|.+|+.|.+|+++++..
T Consensus       718 ~~C~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~--~~~~nt~~q~~~~~~~~~~~~~g~~~da~~  795 (964)
T KOG3509|consen  718 EQCQCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCECNSHISQCED--DLGYNTDCQNNTEGDRCELCSPGTYGDARR  795 (964)
T ss_pred             cccccCccccCcccccccccccccccCCcCcccCcccccCCCcccccc--cccccccccccCccceeeecCCCccccCcc
Confidence            3788888888877652                   356777764 763  467787899999999999999999999876


Q ss_pred             CCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822         303 NGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR  379 (380)
Q Consensus       303 ~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~  379 (380)
                      +....... .+-.. ..++.          .+..+..       .......|..|.+|++|.+|+++..||+++.++
T Consensus       796 g~~~D~~p-~~~l~-~~~~~----------~~r~~l~-------~~~~~~~~~~~p~~~~g~~~~~~~~~~~~~atd  853 (964)
T KOG3509|consen  796 GTPEDCRP-ATALT-IQCSC----------NNRSPLS-------CDGFGPGCLLCPHNTEGTTCERVKAGYYGFATD  853 (964)
T ss_pred             CCcccCCc-cchhh-hhhhh----------cccCccc-------cccCCCCcccCCCCccccchhhhccccccccCc
Confidence            43321111 01100 01100          0011100       011345777799999999999999999998764


No 25 
>KOG1217|consensus
Probab=96.78  E-value=0.012  Score=59.46  Aligned_cols=104  Identities=22%  Similarity=0.603  Sum_probs=62.1

Q ss_pred             CCCccCCCCCCC-ccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCccccCCCCCcCCcccCCCccccCCCcc
Q psy9822         174 SFSWNYVKCPPE-NECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPVCSQGCVRGVCSEPDKCQCDFGYV  252 (380)
Q Consensus       174 ~~~w~~~~C~d~-deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~C~~~C~~G~C~~~~~C~C~~G~~  252 (380)
                      ..+|.+..|... ++|....+.|. +.+.|.+..++|.|.|++||.+...    ... ..   .+.|+....|.+..|+.
T Consensus       157 ~~g~~~~~~~~~~~~C~~~~~~c~-~~~~C~~~~~~~~C~c~~~~~~~~~----~~~-~~---~~~c~~~~~~~~~~g~~  227 (487)
T KOG1217|consen  157 TEGYEGEPCETDLDECIQYSSPCQ-NGGTCVNTGGSYLCSCPPGYTGSTC----ETT-GN---GGTCVDSVACSCPPGAR  227 (487)
T ss_pred             CCCcccccccccccccccCCCCcC-CCcccccCCCCeeEeCCCCccCCcC----cCC-CC---CceEecceeccCCCCCC
Confidence            567888888644 79976667787 7889999999999999999998863    211 00   12333334456666666


Q ss_pred             ccccccc-CCCCCC-CcccCCCCCcccccCCCCCCCC
Q psy9822         253 GVNCSIQ-CQCNGH-ADCAGPDKLDVCLRCHNHTKGP  287 (380)
Q Consensus       253 G~~C~~~-c~C~~~-~~C~~~~~~g~C~~C~~~~~G~  287 (380)
                      +..|.+. ..|... ++|.+..+.+.|. |..++++.
T Consensus       228 ~~~c~~~~~~~~~~~~~c~~~~~~~~C~-~~~g~~~~  263 (487)
T KOG1217|consen  228 GPECEVSIVECASGDGTCVNTVGSYTCR-CPEGYTGD  263 (487)
T ss_pred             CCCcccccccccCCCCcccccCCceeee-CCCCcccc
Confidence            6555531 122211 4444433444552 44444444


No 26 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.71  E-value=0.0018  Score=42.78  Aligned_cols=36  Identities=25%  Similarity=0.766  Sum_probs=28.4

Q ss_pred             CCccccCCCCCCCCCCCeeecCCCCcccccCCCce-ecC
Q psy9822         184 PENECLNDHHTCDPQSEQCVDLADGFECVCGRGYN-KSE  221 (380)
Q Consensus       184 d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~-G~~  221 (380)
                      ++|||... +.|. ..++|+++.++|.|.|++||. |..
T Consensus         1 d~~~C~~~-~~C~-~~~~C~~~~g~~~C~C~~g~~~g~~   37 (39)
T smart00179        1 DIDECASG-NPCQ-NGGTCVNTVGSYRCECPPGYTDGRN   37 (39)
T ss_pred             CcccCcCC-CCcC-CCCEeECCCCCeEeECCCCCccCCc
Confidence            35677543 5677 678999999999999999998 543


No 27 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=96.59  E-value=0.00089  Score=47.45  Aligned_cols=32  Identities=41%  Similarity=1.040  Sum_probs=26.4

Q ss_pred             cccCCCCCcccccCCCCCCCCCCCCCCCCccCCCC
Q psy9822         267 DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPR  301 (380)
Q Consensus       267 ~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~  301 (380)
                      .|..  .+++|. |+++|+|.+|++|+++||+.+.
T Consensus        12 ~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~~~~~   43 (49)
T PF00053_consen   12 TCDP--STGQCV-CKPGTTGPRCDQCKPGYFGLPS   43 (49)
T ss_dssp             SEEE--TCEEES-BSTTEESTTS-EE-TTEECSTT
T ss_pred             cccC--CCCEEe-ccccccCCcCcCCCCccccccC
Confidence            5764  689996 9999999999999999999875


No 28 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=95.94  E-value=0.0081  Score=42.11  Aligned_cols=29  Identities=34%  Similarity=0.950  Sum_probs=25.3

Q ss_pred             cccCCCCCcccccCCCCCCCCCCCCCCCCccC
Q psy9822         267 DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVG  298 (380)
Q Consensus       267 ~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G  298 (380)
                      .|..  .+++|+ |+++++|.+|++|+++|||
T Consensus        12 ~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~g   40 (46)
T smart00180       12 TCDP--DTGQCE-CKPNVTGRRCDRCAPGYYG   40 (46)
T ss_pred             cccC--CCCEEE-CCCCCCCCCCCcCCCCcCC
Confidence            4753  478996 9999999999999999999


No 29 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.77  E-value=0.012  Score=38.19  Aligned_cols=35  Identities=26%  Similarity=0.803  Sum_probs=27.1

Q ss_pred             CccccCCCCCCCCCCCeeecCCCCcccccCCCceecC
Q psy9822         185 ENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSE  221 (380)
Q Consensus       185 ~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~  221 (380)
                      +++|... ..|. .++.|++..++|.|.|++||.|..
T Consensus         2 ~~~C~~~-~~C~-~~~~C~~~~~~~~C~C~~g~~g~~   36 (38)
T cd00054           2 IDECASG-NPCQ-NGGTCVNTVGSYRCSCPPGYTGRN   36 (38)
T ss_pred             cccCCCC-CCcC-CCCEeECCCCCeEeECCCCCcCCc
Confidence            4566432 4576 678999999999999999998854


No 30 
>PF01437 PSI:  Plexin repeat;  InterPro: IPR002165 This is a cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in plexin (P70206 from SWISSPROT) []. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein (Q19981 from SWISSPROT) contains four copies of the repeat, while the Met receptor contains a single copy of the repeat.; GO: 0016020 membrane; PDB: 3NVQ_B 3NVN_B 3OL2_B 3OKT_A 3AL8_A 3OKW_A 3OKY_B 3AFC_B 1OLZ_B 1SHY_B ....
Probab=95.73  E-value=0.0038  Score=44.66  Aligned_cols=32  Identities=44%  Similarity=0.917  Sum_probs=27.6

Q ss_pred             CCCCCCCchhhhcCCCCCCCccceeecCCCCcccCcc
Q psy9822          43 RCDQRTDCQDCLTSSGGEGGWHECRWSSMLNECISPS   79 (380)
Q Consensus        43 ~C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~~~   79 (380)
                      .|+.|++|++||.++.     -.|+||..+++|+...
T Consensus         1 ~C~~~~sC~~Cl~~~d-----p~CgWc~~~~~C~~~~   32 (51)
T PF01437_consen    1 NCSQYTSCSSCLSSRD-----PYCGWCSSENRCVSGS   32 (51)
T ss_dssp             -GGGGSSHHHHHHSTC-----TTEEEETTTTEEEECC
T ss_pred             CCcCCCcHHHHHcCCC-----cCccccCCCCcccCCc
Confidence            3789999999999873     3799999999999876


No 31 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=95.45  E-value=0.0058  Score=39.45  Aligned_cols=26  Identities=35%  Similarity=1.032  Sum_probs=23.1

Q ss_pred             CCCCCCCeeecCC-CCcccccCCCceec
Q psy9822         194 TCDPQSEQCVDLA-DGFECVCGRGYNKS  220 (380)
Q Consensus       194 ~C~~~~g~C~n~~-~g~~C~C~~Gy~G~  220 (380)
                      +|. ++|+|++.. ++|+|.|++||+|+
T Consensus         5 ~C~-n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    5 PCQ-NGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             SST-TTEEEEEESTSEEEEEEBTTEEST
T ss_pred             cCC-CCeEEEeCCCCCEEeECCCCCccC
Confidence            566 789999988 99999999999986


No 32 
>KOG4260|consensus
Probab=95.19  E-value=0.019  Score=54.77  Aligned_cols=66  Identities=30%  Similarity=0.756  Sum_probs=46.1

Q ss_pred             cCCCcccccccc-----cCCCCCCCcccC---CCCCcccccCCCCCCCCCCCCCCCCccCCCCC--CCCCcCCCCcc
Q psy9822         247 CDFGYVGVNCSI-----QCQCNGHADCAG---PDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD--NGECVPCSEYC  313 (380)
Q Consensus       247 C~~G~~G~~C~~-----~c~C~~~~~C~~---~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~--~~~C~~C~~~C  313 (380)
                      |+.|..|++|..     .-+|++++.|..   ..+++.| .|+.+|+|..|..|.++|+-.-.+  .-.|..|.+-|
T Consensus       132 Cp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkC-kC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C  207 (350)
T KOG4260|consen  132 CPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKC-KCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC  207 (350)
T ss_pred             cCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcc-cccCCCCCccccccchHHHHhhcccccchhhhhhhhh
Confidence            555666666653     235888888864   2468999 599999999999999999976554  23566655333


No 33 
>smart00051 DSL delta serrate ligand.
Probab=94.70  E-value=0.03  Score=42.06  Aligned_cols=39  Identities=33%  Similarity=0.647  Sum_probs=19.7

Q ss_pred             cCCCcccccccccCCC----CCCCcccCCCCCcccccCCCCCCCCCC
Q psy9822         247 CDFGYVGVNCSIQCQC----NGHADCAGPDKLDVCLRCHNHTKGPQC  289 (380)
Q Consensus       247 C~~G~~G~~C~~~c~C----~~~~~C~~~~~~g~C~~C~~~~~G~~C  289 (380)
                      |+.+|.|..|+..|.+    .+|.+|..   .+.++ |.++|+|..|
T Consensus        21 C~~~~yG~~C~~~C~~~~d~~~~~~Cd~---~G~~~-C~~Gw~G~~C   63 (63)
T smart00051       21 CDENYYGEGCNKFCRPRDDFFGHYTCDE---NGNKG-CLEGWMGPYC   63 (63)
T ss_pred             CCCCCcCCccCCEeCcCccccCCccCCc---CCCEe-cCCCCcCCCC
Confidence            4444455544443332    44555643   45553 6666666554


No 34 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.51  E-value=0.022  Score=36.85  Aligned_cols=21  Identities=48%  Similarity=1.341  Sum_probs=15.1

Q ss_pred             CCcccCC-CccccCCCcccccc
Q psy9822         236 RGVCSEP-DKCQCDFGYVGVNC  256 (380)
Q Consensus       236 ~G~C~~~-~~C~C~~G~~G~~C  256 (380)
                      ||+|+.+ ++|+|+.||+|+.|
T Consensus        11 ~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen   11 HGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             CCEEeCCCCEEECCCCCcCCCC
Confidence            5777765 67778877777764


No 35 
>smart00423 PSI domain found in Plexins, Semaphorins and Integrins.
Probab=94.15  E-value=0.034  Score=38.77  Aligned_cols=30  Identities=33%  Similarity=0.857  Sum_probs=26.0

Q ss_pred             CCCCCCchhhhcCCCCCCCccceeecCCCCcccCc
Q psy9822          44 CDQRTDCQDCLTSSGGEGGWHECRWSSMLNECISP   78 (380)
Q Consensus        44 C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~~   78 (380)
                      |+.+++|++||..+.     ..|+||..+++|+..
T Consensus         1 C~~~~sC~~C~~~~~-----~~C~Wc~~~~~C~~~   30 (46)
T smart00423        1 CSKYTSCSECLLARD-----PYCAWCSSQGRCTSG   30 (46)
T ss_pred             CCCCCcHHHHHcCCC-----CCCCccCCCCcccCc
Confidence            678899999999873     369999999999986


No 36 
>KOG4260|consensus
Probab=94.11  E-value=0.06  Score=51.37  Aligned_cols=63  Identities=33%  Similarity=0.701  Sum_probs=44.8

Q ss_pred             CcccccCCCCCCCCCCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcc
Q psy9822         274 LDVCLRCHNHTKGPQCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRAR  353 (380)
Q Consensus       274 ~g~C~~C~~~~~G~~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~  353 (380)
                      .-.|  |++++.|+.|..|.-+..         ++    |+++ +.|+-+..                      +.+.+.
T Consensus       128 LkvC--Cp~gtyGpdCl~Cpggse---------r~----C~Gn-G~C~GdGs----------------------R~GsGk  169 (350)
T KOG4260|consen  128 LKVC--CPDGTYGPDCLQCPGGSE---------RP----CFGN-GSCHGDGS----------------------REGSGK  169 (350)
T ss_pred             heec--cCCCCcCCccccCCCCCc---------CC----cCCC-CcccCCCC----------------------CCCCCc
Confidence            4456  999999999988864321         23    4444 34543211                      136799


Q ss_pred             cccCCCCCCCCCCCcCCCCCCC
Q psy9822         354 CMHCGNHTTGPKCEDCVEGKEN  375 (380)
Q Consensus       354 C~~C~~g~~G~~Ce~C~~Gy~g  375 (380)
                      |. |.+||+|+.|..|.++||-
T Consensus       170 Ck-C~~GY~Gp~C~~Cg~eyfe  190 (350)
T KOG4260|consen  170 CK-CETGYTGPLCRYCGIEYFE  190 (350)
T ss_pred             cc-ccCCCCCccccccchHHHH
Confidence            99 9999999999999999984


No 37 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=94.09  E-value=0.054  Score=34.35  Aligned_cols=27  Identities=30%  Similarity=0.912  Sum_probs=22.7

Q ss_pred             CCCCCCCCeeecCCCCcccccCCCceec
Q psy9822         193 HTCDPQSEQCVDLADGFECVCGRGYNKS  220 (380)
Q Consensus       193 ~~C~~~~g~C~n~~~g~~C~C~~Gy~G~  220 (380)
                      ..|. .+++|++.+++|+|.|+.||.|.
T Consensus         6 ~~C~-~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           6 NPCS-NGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCC-CCCEEecCCCCeEeECCCCCccc
Confidence            3555 57899999999999999999886


No 38 
>KOG1217|consensus
Probab=94.04  E-value=0.16  Score=51.24  Aligned_cols=102  Identities=26%  Similarity=0.691  Sum_probs=68.6

Q ss_pred             CCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCcccc--CC-----CCCcCC-cccC-----CCcccc
Q psy9822         181 KCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPV--CS-----QGCVRG-VCSE-----PDKCQC  247 (380)
Q Consensus       181 ~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~--C~-----~~C~~G-~C~~-----~~~C~C  247 (380)
                      .|.++++|....+ |. .+++|++.++.|.|.|++||.|..+ ..+...  |.     ..|.++ .|..     .+.|.|
T Consensus       267 ~~~~~~~C~~~~~-c~-~~~~C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c  343 (487)
T KOG1217|consen  267 TCVDVDSCALIAS-CP-NGGTCVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCAC  343 (487)
T ss_pred             eeeeccccCCCCc-cC-CCCeeecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCCCcccccCCCCCCCCcCC
Confidence            4668899976654 77 7899999999999999999999884 234332  52     226543 7732     247999


Q ss_pred             CCCcccccccccC-CCC-----CCCcccC-CCCCcccccCCCCCCC
Q psy9822         248 DFGYVGVNCSIQC-QCN-----GHADCAG-PDKLDVCLRCHNHTKG  286 (380)
Q Consensus       248 ~~G~~G~~C~~~c-~C~-----~~~~C~~-~~~~g~C~~C~~~~~G  286 (380)
                      ..+|+|..|+... .|.     ..+.|.+ ..+.+.|. |..++.+
T Consensus       344 ~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~-~~~~~~~  388 (487)
T KOG1217|consen  344 GPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCA-CPAGFAG  388 (487)
T ss_pred             CCCCCCCccccCCccccCCccccCCEeccCCCCCeEec-CCCcccc
Confidence            9999999998643 333     3344554 34556663 5555554


No 39 
>smart00051 DSL delta serrate ligand.
Probab=93.71  E-value=0.061  Score=40.37  Aligned_cols=41  Identities=24%  Similarity=0.606  Sum_probs=30.4

Q ss_pred             ccCCCceecCCCCCccccCCCC--Cc-CCcccCCCccccCCCcccccc
Q psy9822         212 VCGRGYNKSEEGGECVPVCSQG--CV-RGVCSEPDKCQCDFGYVGVNC  256 (380)
Q Consensus       212 ~C~~Gy~G~~~~~~C~~~C~~~--C~-~G~C~~~~~C~C~~G~~G~~C  256 (380)
                      .|.++|+|..    |...|...  +. +.+|...+.++|.+||+|++|
T Consensus        20 ~C~~~~yG~~----C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C   63 (63)
T smart00051       20 TCDENYYGEG----CNKFCRPRDDFFGHYTCDENGNKGCLEGWMGPYC   63 (63)
T ss_pred             eCCCCCcCCc----cCCEeCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence            7888999987    55555322  33 567866778999999999876


No 40 
>PF01437 PSI:  Plexin repeat;  InterPro: IPR002165 This is a cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in plexin (P70206 from SWISSPROT) []. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein (Q19981 from SWISSPROT) contains four copies of the repeat, while the Met receptor contains a single copy of the repeat.; GO: 0016020 membrane; PDB: 3NVQ_B 3NVN_B 3OL2_B 3OKT_A 3AL8_A 3OKW_A 3OKY_B 3AFC_B 1OLZ_B 1SHY_B ....
Probab=93.66  E-value=0.018  Score=41.15  Aligned_cols=30  Identities=50%  Similarity=1.244  Sum_probs=24.9

Q ss_pred             CCCCcccchhhcCCC--CCccCcCCCCCCCCceeeccCC
Q psy9822         105 SCSAYTQCSTCLRHA--HCGWCALQRDTTGGMGVCTEGS  141 (380)
Q Consensus       105 ~Cs~~~~C~~Cl~~~--~CGwC~~~~~~~~g~G~C~~G~  141 (380)
                      .|+.|++|.+||...  .||||..       .++|+.+.
T Consensus         1 ~C~~~~sC~~Cl~~~dp~CgWc~~-------~~~C~~~~   32 (51)
T PF01437_consen    1 NCSQYTSCSSCLSSRDPYCGWCSS-------ENRCVSGS   32 (51)
T ss_dssp             -GGGGSSHHHHHHSTCTTEEEETT-------TTEEEECC
T ss_pred             CCcCCCcHHHHHcCCCcCccccCC-------CCcccCCc
Confidence            489999999999875  9999992       57888884


No 41 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.48  E-value=0.023  Score=29.32  Aligned_cols=12  Identities=58%  Similarity=1.442  Sum_probs=6.4

Q ss_pred             cccCCCcccccc
Q psy9822         245 CQCDFGYVGVNC  256 (380)
Q Consensus       245 C~C~~G~~G~~C  256 (380)
                      |+|++||+|.+|
T Consensus         2 C~C~~G~~G~~C   13 (13)
T PF12661_consen    2 CQCPPGWTGPNC   13 (13)
T ss_dssp             EEE-TTEETTTT
T ss_pred             ccCcCCCcCCCC
Confidence            556666666554


No 42 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=93.05  E-value=0.11  Score=33.40  Aligned_cols=25  Identities=24%  Similarity=0.885  Sum_probs=20.8

Q ss_pred             CCCCCCCCeeecCCCCcccccCCCcee
Q psy9822         193 HTCDPQSEQCVDLADGFECVCGRGYNK  219 (380)
Q Consensus       193 ~~C~~~~g~C~n~~~g~~C~C~~Gy~G  219 (380)
                      ..|. .+ +|++..++|+|.|++||.|
T Consensus         6 ~~C~-~~-~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        6 GPCS-NG-TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCC-CC-EEECCCCCeEeECCCCCcc
Confidence            4565 34 8999999999999999988


No 43 
>KOG3509|consensus
Probab=92.92  E-value=0.12  Score=57.62  Aligned_cols=75  Identities=28%  Similarity=0.690  Sum_probs=59.4

Q ss_pred             CcccccCCCCCCCCCCCCCCCCccCCCCC---CCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCC
Q psy9822         274 LDVCLRCHNHTKGPQCDKCEPLYVGDPRD---NGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTT  350 (380)
Q Consensus       274 ~g~C~~C~~~~~G~~C~~C~~g~~G~~~~---~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~  350 (380)
                      .-+| +|..++.|.+|+.|.++|.-..+.   .+.+..|.  |+.|+..|..                           .
T Consensus       717 ~~~C-~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~--~~~h~~~c~~---------------------------~  766 (964)
T KOG3509|consen  717 VEQC-QCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCE--CNSHISQCED---------------------------D  766 (964)
T ss_pred             cccc-ccCccccCcccccccccccccccCCcCcccCcccc--cCCCcccccc---------------------------c
Confidence            4578 499999999999999988766532   24566777  8888877753                           3


Q ss_pred             CcccccCCCCCCCCCCCcCCCCCCCCCC
Q psy9822         351 RARCMHCGNHTTGPKCEDCVEGKENRQS  378 (380)
Q Consensus       351 ~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~  378 (380)
                      .++|..|++++.|.+|+.|.+||+++.-
T Consensus       767 ~~~nt~~q~~~~~~~~~~~~~g~~~da~  794 (964)
T KOG3509|consen  767 LGYNTDCQNNTEGDRCELCSPGTYGDAR  794 (964)
T ss_pred             ccccccccccCccceeeecCCCccccCc
Confidence            4566669999999999999999999863


No 44 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.61  E-value=0.21  Score=33.19  Aligned_cols=22  Identities=32%  Similarity=0.877  Sum_probs=18.0

Q ss_pred             CeeecCCCCcccccCCCceecC
Q psy9822         200 EQCVDLADGFECVCGRGYNKSE  221 (380)
Q Consensus       200 g~C~n~~~g~~C~C~~Gy~G~~  221 (380)
                      ..|++++++|+|.|++||+...
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L~~   31 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKLAE   31 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE-T
T ss_pred             CCCccCCCceEeECCCCCEECc
Confidence            3799999999999999998765


No 45 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=89.04  E-value=0.26  Score=29.77  Aligned_cols=20  Identities=35%  Similarity=0.961  Sum_probs=13.8

Q ss_pred             CcccccCCCceecCCCCCcc
Q psy9822         208 GFECVCGRGYNKSEEGGECV  227 (380)
Q Consensus       208 g~~C~C~~Gy~G~~~~~~C~  227 (380)
                      +|+|+|++||........|+
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~   20 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE   20 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc
Confidence            58899999998654333444


No 46 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=88.83  E-value=0.31  Score=31.50  Aligned_cols=26  Identities=35%  Similarity=1.039  Sum_probs=21.3

Q ss_pred             CCCCCCcccCCCCCcccccCCCCCCCCCC
Q psy9822         261 QCNGHADCAGPDKLDVCLRCHNHTKGPQC  289 (380)
Q Consensus       261 ~C~~~~~C~~~~~~g~C~~C~~~~~G~~C  289 (380)
                      .|++|++|+.  ..++|+ |.++|+|..|
T Consensus         7 ~C~~~G~C~~--~~g~C~-C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVS--PCGRCV-CDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeC--CCCEEE-CCCCCcCCCC
Confidence            5889999985  158995 9999999876


No 47 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.46  E-value=0.83  Score=42.40  Aligned_cols=41  Identities=24%  Similarity=0.632  Sum_probs=34.2

Q ss_pred             cCCCCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecC
Q psy9822         178 NYVKCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSE  221 (380)
Q Consensus       178 ~~~~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~  221 (380)
                      ....|.+.+||....+.|.   ..|++++++|.|.|++||....
T Consensus       180 ~~~~C~~~~~C~~~~~~c~---~~C~~~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         180 QGKICVVPDLCATLSHVCQ---QVCISTPGSYLCACTEGYALLE  220 (224)
T ss_pred             ccccCcCchhhcCCCCCcc---ceEEcCCCCEEeECCCCccCCC
Confidence            4566999999987777886   3799999999999999997643


No 48 
>KOG1218|consensus
Probab=83.85  E-value=11  Score=36.16  Aligned_cols=41  Identities=37%  Similarity=0.873  Sum_probs=25.8

Q ss_pred             CCCcccccccccCCCCCC---CcccCCCCCcccccCCCCCCCCCCCC
Q psy9822         248 DFGYVGVNCSIQCQCNGH---ADCAGPDKLDVCLRCHNHTKGPQCDK  291 (380)
Q Consensus       248 ~~G~~G~~C~~~c~C~~~---~~C~~~~~~g~C~~C~~~~~G~~C~~  291 (380)
                      ..+|.|..|...++|+.+   .+|.+...  .| .+...+.+.+|..
T Consensus        96 ~~~~~g~~C~~~~~~~~~c~~~~C~~~~~--~c-~~~~~~~~~~C~~  139 (316)
T KOG1218|consen   96 LNGYEGPQCESPCPCGDGCAEKTCANPRR--EC-RCGGGYIGEQCGE  139 (316)
T ss_pred             CCCCCcccccCCCCcCCcccccccCCCcc--ce-ecCCcCccccccc
Confidence            688888888887777655   56654211  35 2556666666644


No 49 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=83.58  E-value=1.1  Score=29.05  Aligned_cols=27  Identities=22%  Similarity=0.566  Sum_probs=15.2

Q ss_pred             CCCCCcccCCCCCcccccCCCCCC-CCCC
Q psy9822         262 CNGHADCAGPDKLDVCLRCHNHTK-GPQC  289 (380)
Q Consensus       262 C~~~~~C~~~~~~g~C~~C~~~~~-G~~C  289 (380)
                      |..++.|.+..+.+.|. |..+|. |.+|
T Consensus        11 C~~~~~C~~~~g~~~C~-C~~g~~~g~~C   38 (39)
T smart00179       11 CQNGGTCVNTVGSYRCE-CPPGYTDGRNC   38 (39)
T ss_pred             cCCCCEeECCCCCeEeE-CCCCCccCCcC
Confidence            44445565555566663 666665 5554


No 50 
>KOG1388|consensus
Probab=83.42  E-value=1.4  Score=40.86  Aligned_cols=98  Identities=20%  Similarity=0.312  Sum_probs=63.3

Q ss_pred             eeecCCCCccc-ccCCCceecCCCCCccccCCCCCcCCcccCCCcccc-CCCcccccccccCCCCCCCcccCCCCCcccc
Q psy9822         201 QCVDLADGFEC-VCGRGYNKSEEGGECVPVCSQGCVRGVCSEPDKCQC-DFGYVGVNCSIQCQCNGHADCAGPDKLDVCL  278 (380)
Q Consensus       201 ~C~n~~~g~~C-~C~~Gy~G~~~~~~C~~~C~~~C~~G~C~~~~~C~C-~~G~~G~~C~~~c~C~~~~~C~~~~~~g~C~  278 (380)
                      .|.|.+.+++| .|..||+|+.-...|.+..-++.+.+-....++|.| ..|.+|..|..-..++.+.....   .+.|.
T Consensus        68 ~~~N~~~g~~c~kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~c~~kgvvgd~c~~~e~~N~~r~~~~---kgtcy  144 (217)
T KOG1388|consen   68 RCENGTTGAHCEKCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCFCTTKGVVGDLCPKCEVPNRYRGKPL---KGTCY  144 (217)
T ss_pred             eccCccccccCCceEEEEEecCCCCccCHhhhcCCeeeeeccCCccccccceEecccCcccccccccccCcc---cccee
Confidence            37788899999 599999997434566666333333333444678899 57899988886444555544422   22231


Q ss_pred             --cCCCCCCCCCCCCCCCCccCCCC
Q psy9822         279 --RCHNHTKGPQCDKCEPLYVGDPR  301 (380)
Q Consensus       279 --~C~~~~~G~~C~~C~~g~~G~~~  301 (380)
                        -..++....+.....++|+....
T Consensus       145 y~l~id~~ftf~l~~~d~~fv~sd~  169 (217)
T KOG1388|consen  145 YELLIDGQFTFHLLQEDDGFVTSDN  169 (217)
T ss_pred             eeeecccccccceeecCCCceeecc
Confidence              14566667777777888886653


No 51 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=83.38  E-value=0.25  Score=31.75  Aligned_cols=26  Identities=19%  Similarity=0.411  Sum_probs=13.5

Q ss_pred             CCCCCCcccCCC-CCcccccCCCCCCCC
Q psy9822         261 QCNGHADCAGPD-KLDVCLRCHNHTKGP  287 (380)
Q Consensus       261 ~C~~~~~C~~~~-~~g~C~~C~~~~~G~  287 (380)
                      +|.++++|+... ..+.|+ |+++|+|.
T Consensus         5 ~C~n~g~C~~~~~~~y~C~-C~~G~~G~   31 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCE-CPPGYTGK   31 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEE-EBTTEEST
T ss_pred             cCCCCeEEEeCCCCCEEeE-CCCCCccC
Confidence            455555555433 455563 55555554


No 52 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=81.53  E-value=0.21  Score=45.30  Aligned_cols=89  Identities=30%  Similarity=0.830  Sum_probs=57.8

Q ss_pred             CCCCCCCeeecCC-----CCcccccCCCceecCCCCCcccc-CCCC-CcCCcccC-C-----CccccCCCcc---ccccc
Q psy9822         194 TCDPQSEQCVDLA-----DGFECVCGRGYNKSEEGGECVPV-CSQG-CVRGVCSE-P-----DKCQCDFGYV---GVNCS  257 (380)
Q Consensus       194 ~C~~~~g~C~n~~-----~g~~C~C~~Gy~G~~~~~~C~~~-C~~~-C~~G~C~~-~-----~~C~C~~G~~---G~~C~  257 (380)
                      .|. +-++|++.+     ..|.|.|.+||....  +.|.|. |.+. |..|.|+. +     ..|.|.-|+.   ...|.
T Consensus        51 ~Cg-dya~C~~~~~~~~~~~~~C~C~~gY~~~~--~vCvp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCt  127 (197)
T PF06247_consen   51 PCG-DYAKCINQANKGEERAYKCDCINGYILKQ--GVCVPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCT  127 (197)
T ss_dssp             EEE-TTEEEEE-SSTTSSTSEEEEE-TTEEESS--SSEEEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTESE
T ss_pred             ccc-chhhhhcCCCcccceeEEEecccCceeeC--CeEchhhcCceecCCCeEEecCCCCCCceeEeeeceEeccCCccc
Confidence            566 667898755     589999999999887  689987 8764 99999986 2     2899999998   22333


Q ss_pred             c----c--CCCCCCCcccCCCCCcccccCCCCCCC
Q psy9822         258 I----Q--CQCNGHADCAGPDKLDVCLRCHNHTKG  286 (380)
Q Consensus       258 ~----~--c~C~~~~~C~~~~~~g~C~~C~~~~~G  286 (380)
                      .    .  -.|..+-.|...++.|.|. |+.++.+
T Consensus       128 k~G~T~C~LKCk~nE~CK~~~~~Y~C~-~~~~~~~  161 (197)
T PF06247_consen  128 KTGETKCSLKCKENEECKLVDGYYKCV-CKEGFPG  161 (197)
T ss_dssp             EEE--------TTTEEEEEETTEEEEE-E-TT-EE
T ss_pred             CCCccceeeecCCCcceeeeCcEEEee-cCCCCCC
Confidence            2    1  2355556687666788884 7666643


No 53 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=79.04  E-value=1.9  Score=27.39  Aligned_cols=27  Identities=22%  Similarity=0.575  Sum_probs=14.9

Q ss_pred             CCCCCcccCCCCCcccccCCCCCCCCCC
Q psy9822         262 CNGHADCAGPDKLDVCLRCHNHTKGPQC  289 (380)
Q Consensus       262 C~~~~~C~~~~~~g~C~~C~~~~~G~~C  289 (380)
                      |..++.|.+..+.+.|. |..+|.|.+|
T Consensus        11 C~~~~~C~~~~~~~~C~-C~~g~~g~~C   37 (38)
T cd00054          11 CQNGGTCVNTVGSYRCS-CPPGYTGRNC   37 (38)
T ss_pred             cCCCCEeECCCCCeEeE-CCCCCcCCcC
Confidence            44445565555556663 6666666554


No 54 
>smart00423 PSI domain found in Plexins, Semaphorins and Integrins.
Probab=78.30  E-value=1.2  Score=30.84  Aligned_cols=20  Identities=55%  Similarity=1.472  Sum_probs=17.1

Q ss_pred             CCCcccchhhcCCCC--CccCc
Q psy9822         106 CSAYTQCSTCLRHAH--CGWCA  125 (380)
Q Consensus       106 Cs~~~~C~~Cl~~~~--CGwC~  125 (380)
                      |+.|++|++||....  |+||.
T Consensus         1 C~~~~sC~~C~~~~~~~C~Wc~   22 (46)
T smart00423        1 CSKYTSCSECLLARDPYCAWCS   22 (46)
T ss_pred             CCCCCcHHHHHcCCCCCCCccC
Confidence            678999999998633  99998


No 55 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=78.10  E-value=2.7  Score=26.15  Aligned_cols=26  Identities=19%  Similarity=0.521  Sum_probs=14.9

Q ss_pred             CCCCCCcccCCCCCcccccCCCCCCCC
Q psy9822         261 QCNGHADCAGPDKLDVCLRCHNHTKGP  287 (380)
Q Consensus       261 ~C~~~~~C~~~~~~g~C~~C~~~~~G~  287 (380)
                      +|..++.|.+..+.+.|. |..+|.|.
T Consensus         7 ~C~~~~~C~~~~~~~~C~-C~~g~~g~   32 (36)
T cd00053           7 PCSNGGTCVNTPGSYRCV-CPPGYTGD   32 (36)
T ss_pred             CCCCCCEEecCCCCeEeE-CCCCCccc
Confidence            444455666555566663 66666655


No 56 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=77.99  E-value=0.68  Score=34.72  Aligned_cols=10  Identities=40%  Similarity=1.009  Sum_probs=5.0

Q ss_pred             CCCCCCCCCC
Q psy9822         280 CHNHTKGPQC  289 (380)
Q Consensus       280 C~~~~~G~~C  289 (380)
                      |.++|+|..|
T Consensus        54 C~~Gw~G~~C   63 (63)
T PF01414_consen   54 CLPGWTGPNC   63 (63)
T ss_dssp             E-TTEESTTS
T ss_pred             CCCCCcCCCC
Confidence            4556665544


No 57 
>KOG1218|consensus
Probab=74.89  E-value=23  Score=33.97  Aligned_cols=13  Identities=15%  Similarity=0.544  Sum_probs=7.9

Q ss_pred             CccCCCCCCCccc
Q psy9822         176 SWNYVKCPPENEC  188 (380)
Q Consensus       176 ~w~~~~C~d~deC  188 (380)
                      .|.+..|....+|
T Consensus        98 ~~~g~~C~~~~~~  110 (316)
T KOG1218|consen   98 GYEGPQCESPCPC  110 (316)
T ss_pred             CCCcccccCCCCc
Confidence            4566667665555


No 58 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=74.85  E-value=0.61  Score=34.96  Aligned_cols=45  Identities=24%  Similarity=0.579  Sum_probs=19.0

Q ss_pred             CcccccCCCceecCCCCCccccCCCC---CcCCcccCCCccccCCCcccccc
Q psy9822         208 GFECVCGRGYNKSEEGGECVPVCSQG---CVRGVCSEPDKCQCDFGYVGVNC  256 (380)
Q Consensus       208 g~~C~C~~Gy~G~~~~~~C~~~C~~~---C~~G~C~~~~~C~C~~G~~G~~C  256 (380)
                      .++-.|.+.|+|..    |...|...   =.|-+|...+.=+|.+||+|++|
T Consensus        16 ~~rv~C~~nyyG~~----C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   16 RIRVVCDENYYGPN----CSKFCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             -------TTEETTT----T-EE---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred             EEEEECCCCCCCcc----ccCCcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence            44567888888877    44444222   11345555556678888888765


No 59 
>PHA02887 EGF-like protein; Provisional
Probab=73.09  E-value=1.8  Score=36.34  Aligned_cols=25  Identities=36%  Similarity=1.079  Sum_probs=18.5

Q ss_pred             CcCCcccC-----CCccccCCCcccccccc
Q psy9822         234 CVRGVCSE-----PDKCQCDFGYVGVNCSI  258 (380)
Q Consensus       234 C~~G~C~~-----~~~C~C~~G~~G~~C~~  258 (380)
                      |.||+|.-     ...|.|+.||+|.+|+.
T Consensus        94 CiHG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         94 CINGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             eeCCEEEccccCCCceeECCCCcccCCCCc
Confidence            66777764     24788888888888863


No 60 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=71.36  E-value=3.8  Score=25.96  Aligned_cols=23  Identities=22%  Similarity=0.582  Sum_probs=12.4

Q ss_pred             CCCCCcccCCCCCcccccCCCCCCC
Q psy9822         262 CNGHADCAGPDKLDVCLRCHNHTKG  286 (380)
Q Consensus       262 C~~~~~C~~~~~~g~C~~C~~~~~G  286 (380)
                      |..+ .|.+..+.+.| .|..+|.|
T Consensus         8 C~~~-~C~~~~~~~~C-~C~~g~~g   30 (35)
T smart00181        8 CSNG-TCINTPGSYTC-SCPPGYTG   30 (35)
T ss_pred             CCCC-EEECCCCCeEe-ECCCCCcc
Confidence            3344 55554455666 36666655


No 61 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=67.69  E-value=1.9  Score=28.85  Aligned_cols=24  Identities=29%  Similarity=0.582  Sum_probs=19.9

Q ss_pred             CCCeeecCC-CCcccccCCCceecC
Q psy9822         198 QSEQCVDLA-DGFECVCGRGYNKSE  221 (380)
Q Consensus       198 ~~g~C~n~~-~g~~C~C~~Gy~G~~  221 (380)
                      .++.|++.. +.++|+|..||..++
T Consensus         9 ~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    9 ANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             CCcccEEcCCCCEEEEeeCCccccC
Confidence            678899877 899999999998775


No 62 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=65.46  E-value=3  Score=35.61  Aligned_cols=25  Identities=36%  Similarity=1.063  Sum_probs=16.2

Q ss_pred             CcCCcccC-----CCccccCCCcccccccc
Q psy9822         234 CVRGVCSE-----PDKCQCDFGYVGVNCSI  258 (380)
Q Consensus       234 C~~G~C~~-----~~~C~C~~G~~G~~C~~  258 (380)
                      |.||+|.-     ...|.|+.||+|.+|+.
T Consensus        53 ClHG~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         53 CLHGDCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             eECCEEEeeccCCCceeECCCCcccccccc
Confidence            55566653     24677777777777763


No 63 
>KOG0196|consensus
Probab=55.74  E-value=17  Score=40.23  Aligned_cols=36  Identities=25%  Similarity=0.781  Sum_probs=26.4

Q ss_pred             CcccccCCCCC----CCCCCCCCCCCccCCCCCCCCCcCCC
Q psy9822         274 LDVCLRCHNHT----KGPQCDKCEPLYVGDPRDNGECVPCS  310 (380)
Q Consensus       274 ~g~C~~C~~~~----~G~~C~~C~~g~~G~~~~~~~C~~C~  310 (380)
                      .+.|. |++||    .|..|+.|.+|+|-.......|.+|+
T Consensus       258 iG~C~-C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP  297 (996)
T KOG0196|consen  258 IGGCV-CKAGYEEAENGKACQACPPGTYKASQGDSLCLPCP  297 (996)
T ss_pred             cCcee-ecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCC
Confidence            57784 88887    46788889888886655456677776


No 64 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=51.57  E-value=98  Score=34.71  Aligned_cols=133  Identities=20%  Similarity=0.461  Sum_probs=0.0

Q ss_pred             CCCCeeecC---------CCCccc-ccCCCceecCCCCCcccc-CCCCCcCCcccC--CCcc-ccCCCcccccccccCCC
Q psy9822         197 PQSEQCVDL---------ADGFEC-VCGRGYNKSEEGGECVPV-CSQGCVRGVCSE--PDKC-QCDFGYVGVNCSIQCQC  262 (380)
Q Consensus       197 ~~~g~C~n~---------~~g~~C-~C~~Gy~G~~~~~~C~~~-C~~~C~~G~C~~--~~~C-~C~~G~~G~~C~~~c~C  262 (380)
                      |.++.|+..         .....| .|.++|+...  ..|... ...+  +-.|..  .++| .|..||.-..=..=-.|
T Consensus       504 p~nG~C~~~~~~~~~C~~~~~g~Ct~C~~gyfL~~--GGCY~~~~~PG--~~~C~~a~~g~Ct~C~~GYyl~~~~~C~~C  579 (800)
T PTZ00214        504 PFNGVCIPNTQRDAYCTSTANGACTTCSGAAFLMN--GGCYTTEHYPG--STICDKQSNGKCTTTKKGYGISPDGKLLEC  579 (800)
T ss_pred             CCCCceecccCCCccccCCCCCcccCCCCCcEecC--CceeccCcCCC--CCccccCCCCcccccccCcccCCCCccccc


Q ss_pred             CCC-CcccCCCCCcccccCCCCCCCC-----------CCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCC
Q psy9822         263 NGH-ADCAGPDKLDVCLRCHNHTKGP-----------QCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPF  330 (380)
Q Consensus       263 ~~~-~~C~~~~~~g~C~~C~~~~~G~-----------~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~  330 (380)
                      ... .+|. .+....|++|..++.-.           .|..|.+|||.+   +..|.+|.                    
T Consensus       580 d~~C~~C~-~~~~~~C~sC~~g~yL~~~~~~~~g~Cv~~~~C~~GYY~d---~~~C~~C~--------------------  635 (800)
T PTZ00214        580 DPTCLACT-APGPGRCTRCPSDKLLKRASGAATGSCVDPGACVDGYYAD---GDACLPCA--------------------  635 (800)
T ss_pred             cchhhhcc-CCCCCcceECCCCCEeccccccccccccCcccCCCCcccC---CCccccCC--------------------


Q ss_pred             CCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCCC
Q psy9822         331 SPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSRN  380 (380)
Q Consensus       331 ~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~~  380 (380)
                                          ...|..|   .....|.+|..+||....++
T Consensus       636 --------------------~~~C~tC---~~~~~C~~C~~~~~ls~~~~  662 (800)
T PTZ00214        636 --------------------TPGCKTC---GHASFCTECAGELFVSLDGQ  662 (800)
T ss_pred             --------------------ccccccc---cCCCCcCcCCCCceecCcCC


No 65 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=50.58  E-value=9.5  Score=32.67  Aligned_cols=18  Identities=28%  Similarity=0.720  Sum_probs=16.2

Q ss_pred             CCcccccCCCCCCCCCCCc
Q psy9822         350 TRARCMHCGNHTTGPKCED  368 (380)
Q Consensus       350 ~~g~C~~C~~g~~G~~Ce~  368 (380)
                      ....|. |..||+|.+||.
T Consensus        65 ~~~~Cr-C~~GYtGeRCEh   82 (139)
T PHA03099         65 DGMYCR-CSHGYTGIRCQH   82 (139)
T ss_pred             CCceeE-CCCCcccccccc
Confidence            568899 999999999986


No 66 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=50.56  E-value=1.2e+02  Score=30.91  Aligned_cols=21  Identities=38%  Similarity=0.982  Sum_probs=14.6

Q ss_pred             CCCCCCCCccCCCCCCCCCcCCCC
Q psy9822         288 QCDKCEPLYVGDPRDNGECVPCSE  311 (380)
Q Consensus       288 ~C~~C~~g~~G~~~~~~~C~~C~~  311 (380)
                      .|..|++|||-+.   ..|.+|.+
T Consensus        92 ~c~~C~~G~y~~~---~~C~~C~~  112 (397)
T PF03302_consen   92 CCSECPDGYYKNG---NKCVPCHE  112 (397)
T ss_pred             cccCCCCCccccC---CCCCCCCc
Confidence            5778899998643   46777663


No 67 
>PF01508 Paramecium_SA:  Paramecium surface antigen domain;  InterPro: IPR002895 The G surface protein of Paramecium primaurelia has important internal homologies and a periodic structure, which could be dictated in part by the rigid scaffolding of cysteine residues. The predicted secondary structure shows a quasi absence of alpha-helix and an abundance of beta-pleated sheets and random coils. The monotony of the amino acid sequence is in favour of a structural role for the protein []. This structure is based on the presence of 37 periods of about 75 residues, each period containing eight cysteine residues []. Homologies with other proteins are limited to surface antigens of trypanosomes.
Probab=45.98  E-value=16  Score=26.55  Aligned_cols=34  Identities=24%  Similarity=0.612  Sum_probs=28.3

Q ss_pred             CCCCCCCCCchhhhcCCCCCCCccceeecCCCCcccCc
Q psy9822          41 PLRCDQRTDCQDCLTSSGGEGGWHECRWSSMLNECISP   78 (380)
Q Consensus        41 p~~C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~~   78 (380)
                      |..|+.+++=..|+....    ...|+|-.++..|+..
T Consensus        27 ~~~C~~y~~~~~C~~~~~----~~~C~W~~~~~~C~~~   60 (63)
T PF01508_consen   27 PSSCSDYKTQEQCTSSKS----GGKCYWDSSTSKCQDK   60 (63)
T ss_pred             cccccccchhhcCcccCC----CCEEEEcCCCCcCcCC
Confidence            577999999999999853    2589999999999864


No 68 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=40.86  E-value=37  Score=23.75  Aligned_cols=29  Identities=28%  Similarity=0.547  Sum_probs=19.3

Q ss_pred             cccccccccCCCCCCCcccCCCCCcccccCCCCC
Q psy9822         251 YVGVNCSIQCQCNGHADCAGPDKLDVCLRCHNHT  284 (380)
Q Consensus       251 ~~G~~C~~~c~C~~~~~C~~~~~~g~C~~C~~~~  284 (380)
                      ..|..|+..-+|..++.|.    .++|. |.+++
T Consensus        17 ~~g~~C~~~~qC~~~s~C~----~g~C~-C~~g~   45 (52)
T PF01683_consen   17 QPGESCESDEQCIGGSVCV----NGRCQ-CPPGY   45 (52)
T ss_pred             CCCCCCCCcCCCCCcCEEc----CCEeE-CCCCC
Confidence            3455666666777777786    57884 77665


No 69 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=37.26  E-value=76  Score=25.71  Aligned_cols=22  Identities=32%  Similarity=0.726  Sum_probs=12.5

Q ss_pred             CCCCCCCCccCCCCCCCCCcCCC
Q psy9822         288 QCDKCEPLYVGDPRDNGECVPCS  310 (380)
Q Consensus       288 ~C~~C~~g~~G~~~~~~~C~~C~  310 (380)
                      .|..|.++|+-+. .+..|.+|.
T Consensus         5 ~Ct~C~~g~~~~~-~~~~C~~C~   26 (96)
T PTZ00382          5 VCTSCDSDKKPNK-DGSGCVLCS   26 (96)
T ss_pred             ccCcCCCCCccCC-CCCcCCcCC
Confidence            4567777776543 233466655


No 70 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=34.20  E-value=38  Score=27.67  Aligned_cols=33  Identities=21%  Similarity=0.636  Sum_probs=22.7

Q ss_pred             CCccccCCCCCCCCCCCeeecCCCCcccccCCCcee
Q psy9822         184 PENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNK  219 (380)
Q Consensus       184 d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G  219 (380)
                      ..|+| .....|. ..+.|. ......|.|.+||.-
T Consensus        76 p~d~C-d~y~~CG-~~g~C~-~~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   76 PKDQC-DVYGFCG-PNGICN-SNNSPKCSCLPGFEP  108 (110)
T ss_pred             cccCC-CCccccC-CccEeC-CCCCCceECCCCcCC
Confidence            44567 3456787 678894 345667999999863


No 71 
>PHA02887 EGF-like protein; Provisional
Probab=31.78  E-value=29  Score=29.32  Aligned_cols=18  Identities=28%  Similarity=0.857  Sum_probs=16.1

Q ss_pred             CCcccccCCCCCCCCCCCc
Q psy9822         350 TRARCMHCGNHTTGPKCED  368 (380)
Q Consensus       350 ~~g~C~~C~~g~~G~~Ce~  368 (380)
                      ....|. |..||+|.+||.
T Consensus       106 ~epsCr-C~~GYtG~RCE~  123 (126)
T PHA02887        106 DEKFCI-CNKGYTGIRCDE  123 (126)
T ss_pred             CCceeE-CCCCcccCCCCc
Confidence            568999 999999999985


No 72 
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular  matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=31.19  E-value=27  Score=35.91  Aligned_cols=20  Identities=25%  Similarity=1.095  Sum_probs=16.1

Q ss_pred             cchhhcC-CCCCccCcCCCCCCC
Q psy9822         111 QCSTCLR-HAHCGWCALQRDTTG  132 (380)
Q Consensus       111 ~C~~Cl~-~~~CGwC~~~~~~~~  132 (380)
                      +|.+|++ ++.|+||.  +..+.
T Consensus         1 sC~eCi~~~p~CaWC~--~~~f~   21 (423)
T smart00187        1 SCGECIQSGPNCAWCT--DENFT   21 (423)
T ss_pred             ChhHHhCCCCCccccC--CccCC
Confidence            6899997 68999999  55553


No 73 
>KOG3848|consensus
Probab=30.06  E-value=23  Score=36.15  Aligned_cols=32  Identities=31%  Similarity=0.720  Sum_probs=28.3

Q ss_pred             CCCCCCCCchhhhcCCCCCCCccceeecCCCCcccC
Q psy9822          42 LRCDQRTDCQDCLTSSGGEGGWHECRWSSMLNECIS   77 (380)
Q Consensus        42 ~~C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~   77 (380)
                      +-|-++.+|..|+++.-.    -.|.||.-+|+|-+
T Consensus       317 PTCLqfn~Cd~C~ns~lt----FnClWC~vlqrCSs  348 (516)
T KOG3848|consen  317 PTCLQFNTCDTCTNSTLT----FNCLWCHVLQRCSS  348 (516)
T ss_pred             CceeccCCcChhhccCCC----ceeehHhhhhhhcc
Confidence            569999999999998764    68999999999985


No 74 
>PF14843 GF_recep_IV:  Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=29.73  E-value=17  Score=31.05  Aligned_cols=22  Identities=32%  Similarity=1.013  Sum_probs=10.4

Q ss_pred             cccccCCCCCCCCCC-CcCCCCC
Q psy9822         352 ARCMHCGNHTTGPKC-EDCVEGK  373 (380)
Q Consensus       352 g~C~~C~~g~~G~~C-e~C~~Gy  373 (380)
                      ..|+.|.....+..| +.|+.||
T Consensus        73 ~~C~~C~~~~~~~~Cv~~CP~~~   95 (132)
T PF14843_consen   73 DQCLSCAHFRDGGQCVSSCPEGT   95 (132)
T ss_dssp             TTBSSBSSEEETTEEESS--EEE
T ss_pred             chhhhccccCCCCcccCcCCCCC
Confidence            445555554455555 3466664


No 75 
>PF00362 Integrin_beta:  Integrin, beta chain;  InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus.  Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another.  The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices.  Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=29.18  E-value=25  Score=36.26  Aligned_cols=19  Identities=32%  Similarity=1.209  Sum_probs=15.3

Q ss_pred             cchhhcCC-CCCccCcCCCCCC
Q psy9822         111 QCSTCLRH-AHCGWCALQRDTT  131 (380)
Q Consensus       111 ~C~~Cl~~-~~CGwC~~~~~~~  131 (380)
                      +|.+|++. +.|+||.  ++.+
T Consensus         1 SC~eCI~~~p~CaWC~--~~~f   20 (426)
T PF00362_consen    1 SCSECIQSGPSCAWCS--DENF   20 (426)
T ss_dssp             SHHHHHTT-TTEEEE---STTC
T ss_pred             ChhHHhCCCCCCcccC--CCCc
Confidence            69999996 8999999  6665


No 76 
>PF05283 MGC-24:  Multi-glycosylated core protein 24 (MGC-24);  InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein [].  Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution [].  CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments. 
Probab=24.63  E-value=73  Score=29.16  Aligned_cols=39  Identities=23%  Similarity=0.651  Sum_probs=27.0

Q ss_pred             CCCCCCCCCCchhhhcCCCCCCCccceeec--CCCCcccCccch
Q psy9822          40 CPLRCDQRTDCQDCLTSSGGEGGWHECRWS--SMLNECISPSYE   81 (380)
Q Consensus        40 cp~~C~~~~~C~~Cl~~~~~~~~~~~C~Wc--~~~~~C~~~~~~   81 (380)
                      =+.-|.++++|..|++...   ....|+|-  .....|+..+.+
T Consensus        52 ~~~~C~~~~~C~sCVn~t~---n~s~C~W~~C~~~~~Cv~~stV   92 (186)
T PF05283_consen   52 APETCEQHNSCESCVNATH---NNSTCVWMECKGESYCVNNSTV   92 (186)
T ss_pred             CcccCCCCccchhhhCccC---ccCceEeeecCCCCcccCCccc
Confidence            3667889999999997643   24579994  445666655543


No 77 
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=22.36  E-value=1.7e+02  Score=23.33  Aligned_cols=27  Identities=26%  Similarity=0.810  Sum_probs=16.9

Q ss_pred             CCCCCCCCCccCCCCCC-CCCcCCCCccC
Q psy9822         287 PQCDKCEPLYVGDPRDN-GECVPCSEYCN  314 (380)
Q Consensus       287 ~~C~~C~~g~~G~~~~~-~~C~~C~~~Cn  314 (380)
                      ..|..|.+|+|-+..+. ..|++|. .|.
T Consensus        33 t~C~~C~~g~ys~~~~~~~~C~~c~-~C~   60 (98)
T cd00185          33 TVCEPCPPGTYTDSWNHLPKCLSCR-TCD   60 (98)
T ss_pred             CeecCCCCCCcccCCCCCCcCCcCc-cCC
Confidence            45667888877665543 4577765 354


Done!