Query         psy13144
Match_columns 1031
No_of_seqs    386 out of 2779
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 20:40:54 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy13144.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13144hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.7 4.7E-17   1E-21  192.8  18.5   91    8-124  1216-1308(2531)
  2 KOG4289|consensus               99.6 1.1E-14 2.3E-19  173.2  18.8   90  533-698  1218-1308(2531)
  3 KOG1217|consensus               99.6 1.9E-13 4.2E-18  163.5  25.8  333  418-969    95-436 (487)
  4 KOG1217|consensus               99.6 1.6E-13 3.5E-18  164.2  24.0  312   45-486    95-421 (487)
  5 KOG1214|consensus               99.5 6.4E-14 1.4E-18  160.2  14.2  202  101-336   699-919 (1289)
  6 KOG1214|consensus               99.5 4.4E-13 9.5E-18  153.5  14.7  196  791-1022  715-920 (1289)
  7 KOG0994|consensus               99.4 5.1E-10 1.1E-14  132.5  31.9   46  117-162   300-345 (1758)
  8 KOG1219|consensus               99.3 8.7E-12 1.9E-16  154.0   8.8  120  874-1026 3869-3989(4289)
  9 KOG1219|consensus               99.2 1.1E-11 2.4E-16  153.1   8.4  112   39-187  3864-3977(4289)
 10 KOG0994|consensus               99.1 2.2E-09 4.8E-14  127.2  16.9   72  163-270   824-898 (1758)
 11 KOG1225|consensus               99.0 2.8E-09   6E-14  123.0  12.7  132  748-955   234-365 (525)
 12 KOG1225|consensus               98.9 8.4E-09 1.8E-13  119.1  13.1  132  601-827   255-386 (525)
 13 KOG4260|consensus               98.7 1.2E-08 2.5E-13  104.3   6.0  128   16-182   131-270 (350)
 14 KOG4260|consensus               98.5 8.8E-08 1.9E-12   98.0   5.4  162  796-1009  132-304 (350)
 15 KOG1226|consensus               97.9 6.4E-05 1.4E-09   88.7  10.7  140  734-907   468-620 (783)
 16 KOG1226|consensus               97.8 0.00017 3.7E-09   85.2  12.5  137  675-844   466-617 (783)
 17 KOG1836|consensus               97.7  0.0017 3.6E-08   85.6  20.0  108  118-269   696-809 (1705)
 18 PF07645 EGF_CA:  Calcium-bindi  97.6 3.8E-05 8.3E-10   58.3   2.3   34   38-71      1-36  (42)
 19 PF00008 EGF:  EGF-like domain   97.6 3.6E-05 7.8E-10   54.5   1.7   30   42-71      1-31  (32)
 20 PF07645 EGF_CA:  Calcium-bindi  97.6 5.4E-05 1.2E-09   57.5   2.7   34  964-1009    1-34  (42)
 21 PF00008 EGF:  EGF-like domain   97.5 4.2E-05   9E-10   54.2   1.2   30  925-954     1-31  (32)
 22 smart00179 EGF_CA Calcium-bind  97.4 0.00018   4E-09   53.4   3.7   34  921-954     1-36  (39)
 23 KOG1836|consensus               97.4  0.0055 1.2E-07   81.0  19.5   53  117-187   756-812 (1705)
 24 smart00179 EGF_CA Calcium-bind  97.4 0.00019 4.2E-09   53.3   3.8   35  150-186     2-38  (39)
 25 PF12947 EGF_3:  EGF domain;  I  97.2 0.00016 3.4E-09   52.5   1.7   30  985-1014    6-35  (36)
 26 PF06247 Plasmod_Pvs28:  Plasmo  97.1 0.00011 2.3E-09   72.5  -0.2  139  830-1013   17-164 (197)
 27 PF12947 EGF_3:  EGF domain;  I  97.0 0.00027 5.9E-09   51.3   1.4   30  101-130     5-34  (36)
 28 cd00054 EGF_CA Calcium-binding  96.9  0.0012 2.6E-08   48.5   3.7   33  922-954     2-35  (38)
 29 cd00054 EGF_CA Calcium-binding  96.9  0.0012 2.7E-08   48.4   3.8   35  150-186     2-37  (38)
 30 PF06247 Plasmod_Pvs28:  Plasmo  96.9 6.6E-05 1.4E-09   74.0  -4.1  144   46-223     7-163 (197)
 31 PF12662 cEGF:  Complement Clr-  95.9  0.0067 1.4E-07   39.5   2.4   24   59-90      1-24  (24)
 32 cd00053 EGF Epidermal growth f  95.9  0.0094   2E-07   42.9   3.5   28   44-71      5-32  (36)
 33 cd00053 EGF Epidermal growth f  95.9    0.01 2.2E-07   42.7   3.6   28  927-954     5-32  (36)
 34 smart00181 EGF Epidermal growt  95.8    0.01 2.2E-07   42.9   3.4   32  153-186     2-34  (35)
 35 smart00181 EGF Epidermal growt  95.7   0.013 2.7E-07   42.3   3.6   28   42-70      2-30  (35)
 36 PF12662 cEGF:  Complement Clr-  95.7  0.0094   2E-07   38.8   2.3   24  942-967     1-24  (24)
 37 PF07974 EGF_2:  EGF-like domai  94.2   0.067 1.4E-06   37.8   3.6   26  986-1013    7-32  (32)
 38 KOG1218|consensus               94.1     3.3 7.2E-05   46.5  19.3  151  746-941    47-199 (316)
 39 PF07974 EGF_2:  EGF-like domai  93.6   0.069 1.5E-06   37.7   2.7   23  783-805     8-32  (32)
 40 PF14670 FXa_inhibition:  Coagu  93.2   0.042 9.1E-07   39.9   1.2   21   51-71     10-30  (36)
 41 PF12661 hEGF:  Human growth fa  90.8    0.11 2.4E-06   28.7   0.8   13 1001-1013    1-13  (13)
 42 KOG1218|consensus               90.5     9.1  0.0002   42.9  16.8  163  747-960    14-177 (316)
 43 KOG3512|consensus               90.3     2.4 5.2E-05   48.2  11.2   79  104-183   280-384 (592)
 44 PF14670 FXa_inhibition:  Coagu  89.7    0.14   3E-06   37.3   0.7   21  162-182    10-30  (36)
 45 smart00051 DSL delta serrate l  87.9    0.65 1.4E-05   38.6   3.6   43  794-844    19-61  (63)
 46 smart00051 DSL delta serrate l  87.0    0.74 1.6E-05   38.3   3.5   47  943-1013   17-63  (63)
 47 PF12946 EGF_MSP1_1:  MSP1 EGF   84.8    0.22 4.8E-06   36.1  -0.6   32  925-956     2-34  (37)
 48 KOG3512|consensus               82.4       4 8.7E-05   46.5   7.6  118  881-1018  285-432 (592)
 49 PF12946 EGF_MSP1_1:  MSP1 EGF   81.7    0.73 1.6E-05   33.5   1.0   31  597-627     2-33  (37)
 50 cd00055 EGF_Lam Laminin-type e  80.0     1.8 3.8E-05   34.1   2.8   28  992-1021   13-40  (50)
 51 PF00053 Laminin_EGF:  Laminin   78.6     1.1 2.4E-05   35.0   1.4   26  991-1018   11-36  (49)
 52 cd00055 EGF_Lam Laminin-type e  77.3     3.4 7.4E-05   32.5   3.8   25  171-223    19-43  (50)
 53 cd01475 vWA_Matrilin VWA_Matri  72.7       3 6.5E-05   44.4   3.2   33  148-182   185-219 (224)
 54 PF00053 Laminin_EGF:  Laminin   72.3       2 4.4E-05   33.6   1.3   22  162-187    11-32  (49)
 55 cd01475 vWA_Matrilin VWA_Matri  71.5     3.4 7.3E-05   44.0   3.2   32   38-71    186-219 (224)
 56 smart00180 EGF_Lam Laminin-typ  69.7     4.4 9.5E-05   31.3   2.6   27  992-1020   12-38  (46)
 57 smart00180 EGF_Lam Laminin-typ  69.0     5.6 0.00012   30.7   3.1   14  172-187    19-32  (46)
 58 PHA03099 epidermal growth fact  58.2     9.6 0.00021   35.8   3.0   30  986-1016   52-83  (139)
 59 PF01683 EB:  EB module;  Inter  57.8      15 0.00033   29.0   3.8   19  783-801    28-46  (52)
 60 PF01683 EB:  EB module;  Inter  57.1      16 0.00034   28.9   3.8   21  986-1010   27-47  (52)
 61 PHA02887 EGF-like protein; Pro  49.5      16 0.00034   33.9   2.8   29  986-1015   93-123 (126)
 62 PHA02887 EGF-like protein; Pro  48.6      13 0.00029   34.3   2.3   32  816-854    92-123 (126)
 63 PF00954 S_locus_glycop:  S-loc  47.4      16 0.00034   34.1   2.7   32   38-70     76-108 (110)
 64 KOG3516|consensus               46.7      14  0.0003   47.2   2.8   42  144-187   539-581 (1306)
 65 PHA03099 epidermal growth fact  42.0      17 0.00036   34.3   1.8   32  816-854    51-82  (139)
 66 KOG3516|consensus               40.5      23  0.0005   45.3   3.3   36  234-269   542-578 (1306)
 67 PF00954 S_locus_glycop:  S-loc  38.4      28  0.0006   32.4   2.9   33  149-182    76-109 (110)
 68 PF09064 Tme5_EGF_like:  Thromb  37.1      25 0.00055   25.1   1.7   21    5-25     10-30  (34)
 69 KOG3514|consensus               36.3      23  0.0005   44.6   2.4   36  924-961   625-661 (1591)
 70 KOG3514|consensus               31.4      32 0.00069   43.4   2.4   35  152-188   625-660 (1591)
 71 PF01414 DSL:  Delta serrate li  26.5      20 0.00043   29.9  -0.3   44  793-844    18-61  (63)
 72 PF03302 VSP:  Giardia variant-  20.5   3E+02  0.0066   32.0   7.8   41  174-222    95-135 (397)

No 1  
>KOG4289|consensus
Probab=99.73  E-value=4.7e-17  Score=192.78  Aligned_cols=91  Identities=33%  Similarity=0.859  Sum_probs=78.5

Q ss_pred             ccCCCce-eeCCCCCCCCCCCccccccCCCCCCCCCCCCCCCCCCeecccCCceeeeCCCCCcCCCCCCCCCCccCCCCC
Q psy13144          8 INTYEVF-YSCPPGTTGSPFVQCKPIVHEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCP   86 (1031)
Q Consensus         8 ~~~~~~~-C~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~   86 (1031)
                      |++...+ |+|||||+|+.   |+      +.||+|-+.||++||+|....|+|+|.|.+||+|.      .|+...   
T Consensus      1216 i~pvnglrCrCPpGFTgd~---Ce------TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe------hCEvs~--- 1277 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDY---CE------TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE------HCEVSA--- 1277 (2531)
T ss_pred             ccccCceeEeCCCCCCccc---cc------chhHhhhcCCCCCCCceEEecCceeEEecCCcccc------ceeeec---
Confidence            4666777 99999999994   98      78999999999999999999999999999999998      787643   


Q ss_pred             CCccccCCCCCCCCCCCCCCCCeeeec-CCCceeeCCCC
Q psy13144         87 LDKSCQNQKCADPCPGTCGQNANCKVI-NHSPICRCKAG  124 (1031)
Q Consensus        87 ~~~~C~~~~C~~~c~~~C~~~~~C~~~-~g~~~C~C~~G  124 (1031)
                           ..++|.   ++.|.++|+|++. .|+|.|+|+.|
T Consensus      1278 -----~agrCv---pGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1278 -----RAGRCV---PGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             -----ccCccc---cceecCCCEEeecCCCceeccCCCc
Confidence                 134454   5789999999995 68999999998


No 2  
>KOG4289|consensus
Probab=99.62  E-value=1.1e-14  Score=173.24  Aligned_cols=90  Identities=31%  Similarity=0.777  Sum_probs=75.2

Q ss_pred             cCCCceeecCCCCcCCCcCccccccCcCcchhhccccccccCCCCCCCcccccccCCCCCCCCCCCCCCCCCCCeeeecC
Q psy13144        533 INHSPICTCKPGFTGDALAYCNRIPLSNYVFEKILIQLMYCPGTTGNPFVLCKLVQNEPVYTNPCQPSPCGPNSQCREVN  612 (1031)
Q Consensus       533 ~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~~~~g~~G~~~~~C~~~~~~~~~~~eC~~~~C~~~~~C~~~~  612 (1031)
                      ..+++.|+|++||+|+.|+                                        +.||+|.+.||.++++|....
T Consensus      1218 pvnglrCrCPpGFTgd~Ce----------------------------------------TeiDlCYs~pC~nng~C~srE 1257 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCE----------------------------------------TEIDLCYSGPCGNNGRCRSRE 1257 (2531)
T ss_pred             ccCceeEeCCCCCCccccc----------------------------------------chhHhhhcCCCCCCCceEEec
Confidence            4577899999999999876                                        348999999999999999999


Q ss_pred             CceeeeCCCCcccCCCCCcCcCccCCCCCCCCcccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCceeec-CCCC
Q psy13144        613 HQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACFNQKCVDPCPDSPPPPLESPPEYVNPCIPSPCGPYSQCRDI-GGSP  691 (1031)
Q Consensus       613 g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~~~~~~~~~~dideC~~~~C~~~g~C~~~-~g~y  691 (1031)
                      |+|+|+|.+||+|.      +||+..                              ..-.|.+..|.++|+|++. .|+|
T Consensus      1258 ggYtCeCrpg~tGe------hCEvs~------------------------------~agrCvpGvC~nggtC~~~~nggf 1301 (2531)
T KOG4289|consen 1258 GGYTCECRPGFTGE------HCEVSA------------------------------RAGRCVPGVCKNGGTCVNLLNGGF 1301 (2531)
T ss_pred             CceeEEecCCcccc------ceeeec------------------------------ccCccccceecCCCEEeecCCCce
Confidence            99999999999999      687532                              1234566778888999876 5788


Q ss_pred             eeecCCC
Q psy13144        692 SCSCLPN  698 (1031)
Q Consensus       692 ~C~C~~G  698 (1031)
                      .|.|+.|
T Consensus      1302 ~c~Cp~g 1308 (2531)
T KOG4289|consen 1302 CCHCPYG 1308 (2531)
T ss_pred             eccCCCc
Confidence            8999887


No 3  
>KOG1217|consensus
Probab=99.58  E-value=1.9e-13  Score=163.50  Aligned_cols=333  Identities=27%  Similarity=0.643  Sum_probs=225.1

Q ss_pred             CCCCCCCeeeecCCceEeeCCCCCccCCCccccccCCCCCCCCCCCCCC--CCCCCccccc---CCceeeecCCCccCCC
Q psy13144        418 GTCGEGAICDVINHAVSCNCPAGTTGNPFVLCKPVQNEPVYTNPCHPSP--CGPNSQCREV---NHQAVCSCLPNYFGSP  492 (1031)
Q Consensus       418 ~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~--C~~~g~C~~~---~g~~~C~C~~Gy~G~~  492 (1031)
                      .+......+......|.|.|.+||.+.   .++..       .+|...+  +...+.|...   ...+.|.|..||.+. 
T Consensus        95 ~~~~~~~~~~~~~~~~~c~c~~g~~~~---~~~~~-------~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~-  163 (487)
T KOG1217|consen   95 PCLLLCGECVDCVGSYECTCPPGYQGT---PCEGE-------CECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGE-  163 (487)
T ss_pred             CcccCCccccCCCCCceeeCCCccccC---cCCcc-------eeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccc-
Confidence            334445566677888999999999998   55411       1355444  3455677764   358999999999998 


Q ss_pred             CCCcCcCcccCCCCCCccccCCCcCCCCCCCCCCCCeeeecCCCceeecCCCCcCCCcCccccccCcCcchhhccccccc
Q psy13144        493 PACRPECTVNTDCPLDKACFNQKCVDPCPGTCGQNANCRVINHSPICTCKPGFTGDALAYCNRIPLSNYVFEKILIQLMY  572 (1031)
Q Consensus       493 ~~C~~~C~~~~~c~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~~  572 (1031)
                           .+....     .+|...      ...|.+++.|.+..++|.|.|++||.+..+..                    
T Consensus       164 -----~~~~~~-----~~C~~~------~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~--------------------  207 (487)
T KOG1217|consen  164 -----PCETDL-----DECIQY------SSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET--------------------  207 (487)
T ss_pred             -----cccccc-----cccccC------CCCcCCCcccccCCCCeeEeCCCCccCCcCcC--------------------
Confidence                 555431     133321      24688889999999999999999999987541                    


Q ss_pred             cCCCCCCCcccccccCCCCCCCCCCCCCCCCCCCeeeecCCceeeeCCCCcccCCCCCcCcCccCCCCCCCCcccCCccC
Q psy13144        573 CPGTTGNPFVLCKLVQNEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACFNQKCV  652 (1031)
Q Consensus       573 ~~g~~G~~~~~C~~~~~~~~~~~eC~~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~  652 (1031)
                                            .       .++++|++.   +.|.+.+||.+.      .|+                 
T Consensus       208 ----------------------~-------~~~~~c~~~---~~~~~~~g~~~~------~c~-----------------  232 (487)
T KOG1217|consen  208 ----------------------T-------GNGGTCVDS---VACSCPPGARGP------ECE-----------------  232 (487)
T ss_pred             ----------------------C-------CCCceEecc---eeccCCCCCCCC------Ccc-----------------
Confidence                                  0       344566555   678999999876      233                 


Q ss_pred             CCCCCCCCCCCCCCCCcCCcCCCCCCCCCCceeecCCCCeeecCCCCcCCCCCCCcCcccCCCCCCccccccCcCCCCCC
Q psy13144        653 DPCPDSPPPPLESPPEYVNPCIPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPECVMNSECPSNEACINEKCGDPCP  732 (1031)
Q Consensus       653 ~~C~~~~~~~~~~~~~dideC~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~c~  732 (1031)
                                     +++.+|...   + ++|++..++|+|.|++||.+.  .+ ..+.++++|...            .
T Consensus       233 ---------------~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~--~~-~~~~~~~~C~~~------------~  278 (487)
T KOG1217|consen  233 ---------------VSIVECASG---D-GTCVNTVGSYTCRCPEGYTGD--AC-VTCVDVDSCALI------------A  278 (487)
T ss_pred             ---------------cccccccCC---C-CcccccCCceeeeCCCCcccc--cc-ceeeeccccCCC------------C
Confidence                           234455444   4 899999999999999999998  42 234567777765            1


Q ss_pred             CCCCCCCeeeccCCCceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCC
Q psy13144        733 GSCGYNAECKIINHTPICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPEC  812 (1031)
Q Consensus       733 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C  812 (1031)
                       .|.++++|.+..++|.|.|++||+|..   | ..               +.....       |..++.           
T Consensus       279 -~c~~~~~C~~~~~~~~C~C~~g~~g~~---~-~~---------------~~~~~~-------C~~~~~-----------  320 (487)
T KOG1217|consen  279 -SCPNGGTCVNVPGSYRCTCPPGFTGRL---C-TE---------------CVDVDE-------CSPRNA-----------  320 (487)
T ss_pred             -ccCCCCeeecCCCcceeeCCCCCCCCC---C-cc---------------cccccc-------cccccc-----------
Confidence             277789999999999999999999984   3 10               111111       111111           


Q ss_pred             cCCCCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceee-cCCCee
Q psy13144        813 ILNNDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRV-INHNAV  891 (1031)
Q Consensus       813 ~~~~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~-~~g~~~  891 (1031)
                        ...|.+++.|......+.++|.|..||.|.      .|+...     ++|..        ..+..++.|++ ..++|+
T Consensus       321 --~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~------~C~~~~-----~~C~~--------~~~~~~~~c~~~~~~~~~  379 (487)
T KOG1217|consen  321 --GGPCANGGTCNTLGSFGGFRCACGPGFTGR------RCEDSN-----DECAS--------SPCCPGGTCVNETPGSYR  379 (487)
T ss_pred             --CCcCCCCcccccCCCCCCCCcCCCCCCCCC------ccccCC-----ccccC--------CccccCCEeccCCCCCeE
Confidence              134666667733332367889999999888      666542     12332        24677899999 688999


Q ss_pred             eEeCCCCccC-C--CccccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCcCCCCCCCCCcccCCCC
Q psy13144        892 CNCKPGFTGE-P--RIRCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFIGAPPNCRPECIQNSEC  968 (1031)
Q Consensus       892 C~C~~Gy~G~-~--~~~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~eC  968 (1031)
                      |.|+.+|.+. .  ...+             .++++|..     .+.|++..++|.|. ++ + ... .+  .|.+++++
T Consensus       380 c~~~~~~~~~~~~~~~~~-------------~~~~~c~~-----~~~c~~~~~~~~c~-~~-~-~~~-~~--~~~~~~~~  435 (487)
T KOG1217|consen  380 CACPAGFAGKANGDGVGC-------------EDIDECSG-----CGDCVNGPGGGACT-PP-G-LVS-PG--TCDDIDEC  435 (487)
T ss_pred             ecCCCccccCCccccccc-------------cccccccC-----CcceeccCCCCccc-cC-c-ccC-Cc--ceeccccc
Confidence            9999999984 1  1123             35666754     57799999999999 88 5 432 22  56677666


Q ss_pred             C
Q psy13144        969 P  969 (1031)
Q Consensus       969 ~  969 (1031)
                      .
T Consensus       436 ~  436 (487)
T KOG1217|consen  436 P  436 (487)
T ss_pred             c
Confidence            4


No 4  
>KOG1217|consensus
Probab=99.57  E-value=1.6e-13  Score=164.15  Aligned_cols=312  Identities=28%  Similarity=0.656  Sum_probs=210.5

Q ss_pred             CCCCCCCeecccCCceeeeCCCCCcCCCCCCCCCCccCCCCCCCccccCCCCCCCCCCCCCCCCeeee---cCCCceeeC
Q psy13144         45 SPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKV---INHSPICRC  121 (1031)
Q Consensus        45 ~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~---~~g~~~C~C  121 (1031)
                      .+....+.++....+|.|.|+.||.|..      +....      +|...+      ..+...+.|++   ....|.|+|
T Consensus        95 ~~~~~~~~~~~~~~~~~c~c~~g~~~~~------~~~~~------~C~~~~------~~~~~~~~c~~~~~~~~~~~c~C  156 (487)
T KOG1217|consen   95 PCLLLCGECVDCVGSYECTCPPGYQGTP------CEGEC------ECVTGP------GVCCIDGSCSNGPGSVGPFRCSC  156 (487)
T ss_pred             CcccCCccccCCCCCceeeCCCccccCc------CCcce------eecCCC------CCeeCchhhcCCCCCCCceeeee
Confidence            3444456677788899999999999982      22211      222221      11223345555   345899999


Q ss_pred             CCCCccCCccccccCCCCCCCCCCCCCCCCCCC--CCCCCCCCceecCCCCceeeCCCCCcCCCCCCcccCCCCCCCCCC
Q psy13144        122 KAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCY--PSPCGPYSQCRDINGSPSCSCLPSYIGSPPNCRPECIQNSECPYD  199 (1031)
Q Consensus       122 ~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~--~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~~C~~~  199 (1031)
                      ..||.+..   +.             ...++|.  ..+|.+++.|.+..++|.|.|++||.+.  .++..       ...
T Consensus       157 ~~g~~~~~---~~-------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~--~~~~~-------~~~  211 (487)
T KOG1217|consen  157 TEGYEGEP---CE-------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS--TCETT-------GNG  211 (487)
T ss_pred             CCCccccc---cc-------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC--cCcCC-------CCC
Confidence            99999987   32             2347887  4569999999999999999999999999  66521       223


Q ss_pred             CcccCCCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCCCCCCCCCcccccCCCeeeecCCCCcCCCCCCCCCCcc
Q psy13144        200 KACINEKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTV  279 (1031)
Q Consensus       200 ~~C~n~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~  279 (1031)
                      ..|++.   +.+  .+.+||.+..   |.      +.+.+|...   + ++|++..++|+|.|++||.+.      .+  
T Consensus       212 ~~c~~~---~~~--~~~~g~~~~~---c~------~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~------~~--  265 (487)
T KOG1217|consen  212 GTCVDS---VAC--SCPPGARGPE---CE------VSIVECASG---D-GTCVNTVGSYTCRCPEGYTGD------AC--  265 (487)
T ss_pred             ceEecc---eec--cCCCCCCCCC---cc------cccccccCC---C-CcccccCCceeeeCCCCcccc------cc--
Confidence            445544   556  8889998763   44      234555443   4 899999999999999999998      21  


Q ss_pred             CCCCCCCccccCCCcCCCCCCCCCCCCeeeeeCCCceeeCCCCCccCCcCcccccCccccCCCCCCCCCCCCCcccCCcc
Q psy13144        280 NSDCPLDKSCQNQKCADPCPGTCGQNANCKVINHSPICRCKAGFTGDPFTYCNRIPLQYLMPNNAPMNVPPISAVETPVL  359 (1031)
Q Consensus       280 ~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~  359 (1031)
                       ..+.+.+.|+..       ..|.++++|++..++|.|.|++||+|..+..+.+..+|....                  
T Consensus       266 -~~~~~~~~C~~~-------~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~------------------  319 (487)
T KOG1217|consen  266 -VTCVDVDSCALI-------ASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRN------------------  319 (487)
T ss_pred             -ceeeeccccCCC-------CccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccc------------------
Confidence             112222344432       238888999999999999999999999752233333332100                  


Q ss_pred             CCCCCCCCCCeecc------ceeecCCCcccCCCCcCCCCCcCCCCCCCCcccccCcccCCCCCCCCCCCCeeee-cCCc
Q psy13144        360 EDTCNCAPNAVCKD------EVCVCLPDFYGDGYVSCRPECVLNNDCPSNKACIKYKCKNPCVSGTCGEGAICDV-INHA  432 (1031)
Q Consensus       360 ~~~~~C~~~g~C~~------~~C~C~~G~~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~-~~g~  432 (1031)
                       ....|.+++.|..      +.|.|.+||.|..|+...                     ++|...++..++.|.+ ..++
T Consensus       320 -~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~---------------------~~C~~~~~~~~~~c~~~~~~~  377 (487)
T KOG1217|consen  320 -AGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSN---------------------DECASSPCCPGGTCVNETPGS  377 (487)
T ss_pred             -cCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCC---------------------ccccCCccccCCEeccCCCCC
Confidence             1123666767722      469999999998864211                     2566667888999998 7899


Q ss_pred             eEeeCCCCCccC-C--CccccccCCCCCCCCCCCCCCCCCCCcccccCCceeeecCC
Q psy13144        433 VSCNCPAGTTGN-P--FVLCKPVQNEPVYTNPCHPSPCGPNSQCREVNHQAVCSCLP  486 (1031)
Q Consensus       433 ~~C~C~~Gy~G~-~--~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~  486 (1031)
                      |.|.|+.+|.+. .  ...+       .++++|..     .+.|++..+++.|. ..
T Consensus       378 ~~c~~~~~~~~~~~~~~~~~-------~~~~~c~~-----~~~c~~~~~~~~c~-~~  421 (487)
T KOG1217|consen  378 YRCACPAGFAGKANGDGVGC-------EDIDECSG-----CGDCVNGPGGGACT-PP  421 (487)
T ss_pred             eEecCCCccccCCccccccc-------cccccccC-----CcceeccCCCCccc-cC
Confidence            999999999984 1  1122       34555543     46677778888888 77


No 5  
>KOG1214|consensus
Probab=99.53  E-value=6.4e-14  Score=160.17  Aligned_cols=202  Identities=25%  Similarity=0.592  Sum_probs=149.9

Q ss_pred             CCCCCCCCeeeecCC-CceeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCC--CCCCCCCceecCCCCceeeCCC
Q psy13144        101 PGTCGQNANCKVINH-SPICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYP--SPCGPYSQCRDINGSPSCSCLP  177 (1031)
Q Consensus       101 ~~~C~~~~~C~~~~g-~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~  177 (1031)
                      +|.|..++.|....+ .|+|.|.+||.|++.. |              .++++|+.  ..|+.+++|++.+++|+|.|..
T Consensus       699 sh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~-c--------------~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~  763 (1289)
T KOG1214|consen  699 SHMCDTTARCHPGTGVDYTCECSSGYQGDGRN-C--------------VDENECATGFHRCGPNSVCINLPGSYRCECRS  763 (1289)
T ss_pred             CcccCCCccccCCCCcceEEEEeeccCCCCCC-C--------------CChhhhccCCCCCCCCceeecCCCceeEEEee
Confidence            689999999988665 6999999999998855 5              47889985  5699999999999999999999


Q ss_pred             CCc--CCCCCCc--------ccCCCC-CCCCCCCc--ccC-CCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCCC
Q psy13144        178 SYI--GSPPNCR--------PECIQN-SECPYDKA--CIN-EKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQPS  243 (1031)
Q Consensus       178 Gy~--g~~~~C~--------~~C~~~-~~C~~~~~--C~n-~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~  243 (1031)
                      ||.  +++.+|.        +.|++. +.|...+.  |+. +.+.|.|  .|.|||.|++.. |       .++|+|+++
T Consensus       764 gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C--~CLPGfsGDG~~-c-------~dvDeC~ps  833 (1289)
T KOG1214|consen  764 GYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSC--ACLPGFSGDGHQ-C-------TDVDECSPS  833 (1289)
T ss_pred             cceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEE--eecCCccCCccc-c-------ccccccCcc
Confidence            986  4444675        246655 67776554  443 5678999  999999999753 4       457999999


Q ss_pred             CCCCCCcccccCCCeeeecCCCCcCCCCCCCCCCccCCCCCCCccccCCCcCCCCCCCCCCCCeeee--eCCCceeeCCC
Q psy13144        244 PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKV--INHSPICRCKA  321 (1031)
Q Consensus       244 ~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~--~~g~~~C~C~~  321 (1031)
                      -|...+.|.+++|+|.|+|.+||+|++..|+|.=.....|....   .+      +-.|+.++.|..  .+..|.+.|.+
T Consensus       834 rChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er---~h------pl~chg~t~~~~~~Dp~~~e~p~~~  904 (1289)
T KOG1214|consen  834 RCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQER---FH------PLQCHGSTGFCWCVDPDGHEVPGTQ  904 (1289)
T ss_pred             ccCCCceEecCCCcceeecccCccCCCceecCCCccCCcccccc---cc------ceeeccccceeEeeCCCcccCCCCC
Confidence            99999999999999999999999999999987511112222110   00      123555544432  24567888888


Q ss_pred             CCccCCcCcccccCc
Q psy13144        322 GFTGDPFTYCNRIPL  336 (1031)
Q Consensus       322 G~~G~~~~~C~~~~~  336 (1031)
                      +--|+.-..|-.+++
T Consensus       905 ~ppG~~~~~c~~~~~  919 (1289)
T KOG1214|consen  905 TPPGSTPPHCGPSPE  919 (1289)
T ss_pred             CCCCCCCCCCCCccc
Confidence            877776555655544


No 6  
>KOG1214|consensus
Probab=99.47  E-value=4.4e-13  Score=153.49  Aligned_cols=196  Identities=26%  Similarity=0.549  Sum_probs=138.8

Q ss_pred             CceeecCCCCccCCcc-cCCCCCcCC-CCCCCCCccccCCCCCceeeecCCCC--cCCCCCCCCCcccCCCCCCcccccC
Q psy13144        791 DGVCVCLPDYYGDGYV-SCGPECILN-NDCPSNKACIRNKFNKQAVCSCLPNY--FGSPPACRPECTVNTDCPLDKACVN  866 (1031)
Q Consensus       791 ~~~C~C~~G~~G~~c~-~~~~~C~~~-~~C~~~g~C~~~~~~~~~~C~C~~G~--~g~g~~C~~~C~~~~~C~~~~~C~~  866 (1031)
                      +.+|.|..||.|++-. .++++|... ..|..++.|++.+  ++|+|.|..||  ++++.+|+..=..    ...+.|+.
T Consensus       715 ~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~p--g~~rceC~~gy~F~dd~~tCV~i~~p----ap~n~Ce~  788 (1289)
T KOG1214|consen  715 DYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLP--GSYRCECRSGYEFADDRHTCVLITPP----APANPCED  788 (1289)
T ss_pred             ceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCC--CceeEEEeecceeccCCcceEEecCC----CCCCcccc
Confidence            3489999999988643 356788865 6899999999997  99999998885  6777788731110    12234444


Q ss_pred             CcccCCCCCCCCCCC--ceeecCC-CeeeEeCCCCccCCCccccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCc
Q psy13144        867 QKCVDPCPGSCGQNA--NCRVINH-NAVCNCKPGFTGEPRIRCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSP  943 (1031)
Q Consensus       867 ~~C~~~C~~~C~~~~--~C~~~~g-~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~  943 (1031)
                      +      .+.|.-.+  .|+...+ +|.|.|.|||.|++.. |             .|+|||.++-|...|.|++++|+|
T Consensus       789 g------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c-------------~dvDeC~psrChp~A~Cyntpgsf  848 (1289)
T KOG1214|consen  789 G------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-C-------------TDVDECSPSRCHPAATCYNTPGSF  848 (1289)
T ss_pred             C------ccccCcCCceEEEecCCceEEEeecCCccCCccc-c-------------ccccccCccccCCCceEecCCCcc
Confidence            3      34565544  4555443 5999999999999864 5             368999999999999999999999


Q ss_pred             eeeCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeee---ecCCCceeeCCCCCccCCCCCCcCCC
Q psy13144        944 SCSCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCK---VINHSPICTCPDGFVGDAFSGCYPKP 1020 (1031)
Q Consensus       944 ~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~---~~~g~~~C~C~~G~~G~~c~~c~~~~ 1020 (1031)
                      .|.|.+||.|++.+|-+-=.....|.      .++ ..  +..|+.++-|.   +. ..|.+.++++=.|++-.+|-+.|
T Consensus       849 sC~C~pGy~GDGf~CVP~~~~~T~C~------~er-~h--pl~chg~t~~~~~~Dp-~~~e~p~~~~ppG~~~~~c~~~~  918 (1289)
T KOG1214|consen  849 SCRCQPGYYGDGFQCVPDTSSLTPCE------QER-FH--PLQCHGSTGFCWCVDP-DGHEVPGTQTPPGSTPPHCGPSP  918 (1289)
T ss_pred             eeecccCccCCCceecCCCccCCccc------ccc-cc--ceeeccccceeEeeCC-CcccCCCCCCCCCCCCCCCCCcc
Confidence            99999999999987753101112222      111 01  23577666433   44 45789999999998888787665


Q ss_pred             Cc
Q psy13144       1021 PE 1022 (1031)
Q Consensus      1021 ~~ 1022 (1031)
                      ..
T Consensus       919 ~~  920 (1289)
T KOG1214|consen  919 EQ  920 (1289)
T ss_pred             cc
Confidence            53


No 7  
>KOG0994|consensus
Probab=99.37  E-value=5.1e-10  Score=132.49  Aligned_cols=46  Identities=20%  Similarity=0.311  Sum_probs=34.2

Q ss_pred             ceeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy13144        117 PICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYS  162 (1031)
Q Consensus       117 ~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g  162 (1031)
                      ..|.|.---.|.+++.|.+++.-.||....-.+.++|..-.|..++
T Consensus       300 G~C~C~HNT~G~nCE~C~~fYnDlPWrpAeG~~~neCrkC~CNgHa  345 (1758)
T KOG0994|consen  300 GRCMCKHNTAGLNCEHCAPFYNDLPWRPAEGKTSNECRKCECNGHA  345 (1758)
T ss_pred             ceeEeccCCCCCChHHhhHhhcCCCCCccCCCCcccccccCCCCCc
Confidence            4799998888999888888877777655444567888776665544


No 8  
>KOG1219|consensus
Probab=99.26  E-value=8.7e-12  Score=154.01  Aligned_cols=120  Identities=24%  Similarity=0.628  Sum_probs=105.8

Q ss_pred             CCCCCCCCceeecC-CCeeeEeCCCCccCCCccccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCc
Q psy13144        874 PGSCGQNANCRVIN-HNAVCNCKPGFTGEPRIRCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFI  952 (1031)
Q Consensus       874 ~~~C~~~~~C~~~~-g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~  952 (1031)
                      .++|.++|+|+..+ ++|.|.|++-|+|.   .|+            .++..|.++||..|++|+...++|.|.|+.||+
T Consensus      3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CE------------i~~epC~snPC~~GgtCip~~n~f~CnC~~gyT 3933 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCE------------IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYT 3933 (4289)
T ss_pred             cCcccCCCEecCCCCCceEEeCcccccCc---ccc------------cccccccCCCCCCCCEEEecCCCeeEeCCCCcc
Confidence            46899999999986 47999999999999   898            678999999999999999999999999999999


Q ss_pred             CCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCccCCCCCCcCCCCccccc
Q psy13144        953 GAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFVGDAFSGCYPKPPERTMW 1026 (1031)
Q Consensus       953 g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~G~~c~~c~~~~~~~~~~ 1026 (1031)
                      |.  +|+..  .++||+.              +.|..+|.|+++.|+|.|.|.+||.|..|..=+|....-+.|
T Consensus      3934 G~--~Ce~~--Gi~eCs~--------------n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~~pni~~~~~~ 3989 (4289)
T KOG1219|consen 3934 GK--RCEAR--GISECSK--------------NVCGTGGQCINIPGSFHCNCTPGILGRTCCAEKPNILSTVLW 3989 (4289)
T ss_pred             Cc--eeecc--ccccccc--------------ccccCCceeeccCCceEeccChhHhcccCccccCccccccch
Confidence            99  88842  3777763              479999999999999999999999999996566666666666


No 9  
>KOG1219|consensus
Probab=99.24  E-value=1.1e-11  Score=153.09  Aligned_cols=112  Identities=29%  Similarity=0.820  Sum_probs=100.1

Q ss_pred             CCCCCCCCCCCCCeeccc-CCceeeeCCCCCcCCCCCCCCCCccCCCCCCCccccCCCCCCCCCCCCCCCCeeeecCCCc
Q psy13144         39 TNPCQPSPCGPNSQCREV-NHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKVINHSP  117 (1031)
Q Consensus        39 ~d~C~~~~C~~~g~C~~~-~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~  117 (1031)
                      .++|..+||+++|+|+.. .|+|.|.|++.|+|.      .|+...     ..|.        +++|..+|+|+...++|
T Consensus      3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~------~CEi~~-----epC~--------snPC~~GgtCip~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN------HCEIDL-----EPCA--------SNPCLTGGTCIPFYNGF 3924 (4289)
T ss_pred             ccccccCcccCCCEecCCCCCceEEeCcccccCc------cccccc-----cccc--------CCCCCCCCEEEecCCCe
Confidence            389999999999999985 668999999999998      888754     3555        37888999999999999


Q ss_pred             eeeCCCCCccCCccccccCCCCCCCCCCCCCC-CCCCCCCCCCCCCceecCCCCceeeCCCCCcCCCCCCc
Q psy13144        118 ICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEP-VNPCYPSPCGPYSQCRDINGSPSCSCLPSYIGSPPNCR  187 (1031)
Q Consensus       118 ~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~-~d~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C~  187 (1031)
                      .|.|+.||+|..   |+             .. +++|+.++|.++|.|+|..|+|.|.|-+||.|.  .|.
T Consensus      3925 ~CnC~~gyTG~~---Ce-------------~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr--~c~ 3977 (4289)
T KOG1219|consen 3925 LCNCPNGYTGKR---CE-------------ARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR--TCC 3977 (4289)
T ss_pred             eEeCCCCccCce---ee-------------cccccccccccccCCceeeccCCceEeccChhHhcc--cCc
Confidence            999999999997   76             44 899999999999999999999999999999999  664


No 10 
>KOG0994|consensus
Probab=99.07  E-value=2.2e-09  Score=127.23  Aligned_cols=72  Identities=35%  Similarity=0.949  Sum_probs=43.7

Q ss_pred             ceecCCCCceeeCCCCCcCCCCCCcccCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCC
Q psy13144        163 QCRDINGSPSCSCLPSYIGSPPNCRPECIQNSECPYDKACINEKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQP  242 (1031)
Q Consensus       163 ~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~  242 (1031)
                      .|....|  +|+|.+|-+|.  .|.                          +|.|||+|-+  .|..- ..=-..|.|.+
T Consensus       824 ~Cd~~tG--QC~C~~g~ygr--qCn--------------------------qCqpG~WgFP--eCr~C-qCNgHA~~Cd~  870 (1758)
T KOG0994|consen  824 YCDKITG--QCQCRPGTYGR--QCN--------------------------QCQPGYWGFP--ECRPC-QCNGHADTCDP  870 (1758)
T ss_pred             ccccccc--ceeeccccchh--hcc--------------------------ccCCCccCCC--cCccc-cccCcccccCc
Confidence            3543333  79999998888  554                          8999999987  34310 00001223322


Q ss_pred             C--CCCCCCcccccCCCeee-ecCCCCcCCC
Q psy13144        243 S--PCGPNSQCREVNHQAVC-SCLPNYFGSP  270 (1031)
Q Consensus       243 ~--~C~~~g~C~~~~g~y~C-~C~~Gy~g~~  270 (1031)
                      .  -|.   .|.+..+++.| .|..||+|++
T Consensus       871 ~tGaCi---~CqD~T~G~~CdrCl~GyyGdP  898 (1758)
T KOG0994|consen  871 ITGACI---DCQDSTTGHSCDRCLDGYYGDP  898 (1758)
T ss_pred             cccccc---cccccccccchhhhhccccCCc
Confidence            1  122   25567778888 7999999985


No 11 
>KOG1225|consensus
Probab=98.98  E-value=2.8e-09  Score=122.95  Aligned_cols=132  Identities=33%  Similarity=0.902  Sum_probs=103.3

Q ss_pred             ceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCCcCCCCCCCCCccccC
Q psy13144        748 PICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPECILNNDCPSNKACIRN  827 (1031)
Q Consensus       748 ~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~  827 (1031)
                      +.|.|..+|+|.   .|+...-        .  -.|...+.|+++.|+|++||+|..|..  ..|..  .|+.++.+++.
T Consensus       234 ~ic~c~~~~~g~---~c~~~~C--------~--~~c~~~g~c~~G~CIC~~Gf~G~dC~e--~~Cp~--~cs~~g~~~~g  296 (525)
T KOG1225|consen  234 GICECPEGYFGP---LCSTIYC--------P--GGCTGRGQCVEGRCICPPGFTGDDCDE--LVCPV--DCSGGGVCVDG  296 (525)
T ss_pred             ceeecCCceeCC---ccccccC--------C--CCCcccceEeCCeEeCCCCCcCCCCCc--ccCCc--ccCCCceecCC
Confidence            489999999999   5663210        0  026667899999999999999999854  45663  48888888863


Q ss_pred             CCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecCCCeeeEeCCCCccCCCcccc
Q psy13144        828 KFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHNAVCNCKPGFTGEPRIRCS  907 (1031)
Q Consensus       828 ~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~C~  907 (1031)
                            +|+|++||+|.      +|+..                .|+..|.++|.|+.  |  +|.|.+||+|.   .|+
T Consensus       297 ------~CiC~~g~~G~------dCs~~----------------~cpadC~g~G~Ci~--G--~C~C~~Gy~G~---~C~  341 (525)
T KOG1225|consen  297 ------ECICNPGYSGK------DCSIR----------------RCPADCSGHGKCID--G--ECLCDEGYTGE---LCI  341 (525)
T ss_pred             ------EeecCCCcccc------ccccc----------------cCCccCCCCCcccC--C--ceEeCCCCcCC---ccc
Confidence                  89999999999      66542                23667999999993  3  99999999999   675


Q ss_pred             CCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCcCCC
Q psy13144        908 KIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFIGAP  955 (1031)
Q Consensus       908 ~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~  955 (1031)
                      .              .     +|.+++.|++.     |.|..||.|..
T Consensus       342 ~--------------~-----~C~~~g~cv~g-----C~C~~Gw~G~d  365 (525)
T KOG1225|consen  342 Q--------------R-----ACSGGGQCVNG-----CKCKKGWRGPD  365 (525)
T ss_pred             c--------------c-----ccCCCceeccC-----ceeccCccCCC
Confidence            2              1     38888999863     99999999984


No 12 
>KOG1225|consensus
Probab=98.91  E-value=8.4e-09  Score=119.07  Aligned_cols=132  Identities=30%  Similarity=0.806  Sum_probs=96.3

Q ss_pred             CCCCCCeeeecCCceeeeCCCCcccCCCCCcCcCccCCCCCCCCcccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCC
Q psy13144        601 PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACFNQKCVDPCPDSPPPPLESPPEYVNPCIPSPCGP  680 (1031)
Q Consensus       601 ~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~~~~~~~~~~dideC~~~~C~~  680 (1031)
                      .|..+++|++.    +|+|++||+|..      |+                                  +-.|... |+.
T Consensus       255 ~c~~~g~c~~G----~CIC~~Gf~G~d------C~----------------------------------e~~Cp~~-cs~  289 (525)
T KOG1225|consen  255 GCTGRGQCVEG----RCICPPGFTGDD------CD----------------------------------ELVCPVD-CSG  289 (525)
T ss_pred             CCcccceEeCC----eEeCCCCCcCCC------CC----------------------------------cccCCcc-cCC
Confidence            45566778776    799999999993      54                                  2224333 777


Q ss_pred             CCceeecCCCCeeecCCCCcCCCCCCCcCcccCCCCCCccccccCcCCCCCCCCCCCCCeeeccCCCceeeCCCCcccCC
Q psy13144        681 YSQCRDIGGSPSCSCLPNYIGAPPNCRPECVMNSECPSNEACINEKCGDPCPGSCGYNAECKIINHTPICTCPDGFIGDP  760 (1031)
Q Consensus       681 ~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~  760 (1031)
                      ++.+++.    .|+|++||+|.  .|+.     ..|.               ..|.++|.|+  .|  .|.|.+||+|. 
T Consensus       290 ~g~~~~g----~CiC~~g~~G~--dCs~-----~~cp---------------adC~g~G~Ci--~G--~C~C~~Gy~G~-  338 (525)
T KOG1225|consen  290 GGVCVDG----ECICNPGYSGK--DCSI-----RRCP---------------ADCSGHGKCI--DG--ECLCDEGYTGE-  338 (525)
T ss_pred             CceecCC----EeecCCCcccc--cccc-----ccCC---------------ccCCCCCccc--CC--ceEeCCCCcCC-
Confidence            7877776    79999999999  7763     2233               3588899998  33  89999999999 


Q ss_pred             CCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCCcCCCCCCCCCccccC
Q psy13144        761 FTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPECILNNDCPSNKACIRN  827 (1031)
Q Consensus       761 ~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~  827 (1031)
                        .|...              .|++++.|+++ |+|..||.|..  ...+.++....|.....++..
T Consensus       339 --~C~~~--------------~C~~~g~cv~g-C~C~~Gw~G~d--~~~~~~~~~~~cs~~~~~~~~  386 (525)
T KOG1225|consen  339 --LCIQR--------------ACSGGGQCVNG-CKCKKGWRGPD--VADPSLLLITECSPPSLCIAG  386 (525)
T ss_pred             --ccccc--------------ccCCCceeccC-ceeccCccCCC--cCCchhhcccccCCCceeecc
Confidence              57642              28999999999 99999999987  333444443456555556554


No 13 
>KOG4260|consensus
Probab=98.74  E-value=1.2e-08  Score=104.34  Aligned_cols=128  Identities=32%  Similarity=0.701  Sum_probs=83.7

Q ss_pred             eCCCCCCCCCCCccccccCCCCCCCCCC---CCCCCCCCeecc---cCCceeeeCCCCCcCCCCCCCCCCccCCCCCCCc
Q psy13144         16 SCPPGTTGSPFVQCKPIVHEPVYTNPCQ---PSPCGPNSQCRE---VNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDK   89 (1031)
Q Consensus        16 ~C~~G~~g~~~~~C~~~~~~~~~~d~C~---~~~C~~~g~C~~---~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~   89 (1031)
                      -|++|.+|++   |.          .|.   .-||..+|.|.-   ..|+..|.|.+||+|..  |. .|.+.--=...+
T Consensus       131 CCp~gtyGpd---Cl----------~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~--C~-~Cg~eyfes~Rn  194 (350)
T KOG4260|consen  131 CCPDGTYGPD---CL----------QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPL--CR-YCGIEYFESSRN  194 (350)
T ss_pred             ccCCCCcCCc---cc----------cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcc--cc-ccchHHHHhhcc
Confidence            4899999996   53          342   467999999974   57788999999999982  21 121110000000


Q ss_pred             cccCCC---CCCCCCCCCCCCCeeeecCCCcee-eCCCCCccCCccccccCCCCCCCCCCCCCCCCCCC--CCCCCCCCc
Q psy13144         90 SCQNQK---CADPCPGTCGQNANCKVINHSPIC-RCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCY--PSPCGPYSQ  163 (1031)
Q Consensus        90 ~C~~~~---C~~~c~~~C~~~~~C~~~~g~~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~--~~~C~~~g~  163 (1031)
                      + ++..   |-.+|...      |.. .++-.| +|+.||..+... |              .|||||.  +.||.....
T Consensus       195 e-~~lvCt~Ch~~C~~~------Csg-~~~k~C~kCkkGW~lde~g-C--------------vDvnEC~~ep~~c~~~qf  251 (350)
T KOG4260|consen  195 E-QHLVCTACHEGCLGV------CSG-ESSKGCSKCKKGWKLDEEG-C--------------VDVNECQNEPAPCKAHQF  251 (350)
T ss_pred             c-ccchhhhhhhhhhcc------cCC-CCCCChhhhcccceecccc-c--------------ccHHHHhcCCCCCChhhe
Confidence            0 0111   21122222      322 223455 799999987533 6              5899997  578999999


Q ss_pred             eecCCCCceeeCCCCCcCC
Q psy13144        164 CRDINGSPSCSCLPSYIGS  182 (1031)
Q Consensus       164 C~~~~g~~~C~C~~Gy~g~  182 (1031)
                      |+|+.|||.|..++||.+.
T Consensus       252 CvNteGSf~C~dk~Gy~~g  270 (350)
T KOG4260|consen  252 CVNTEGSFKCEDKEGYKKG  270 (350)
T ss_pred             eecCCCceEecccccccCC
Confidence            9999999999999999875


No 14 
>KOG4260|consensus
Probab=98.53  E-value=8.8e-08  Score=98.03  Aligned_cols=162  Identities=26%  Similarity=0.618  Sum_probs=102.9

Q ss_pred             cCCCCccCCcccCCCCCcCC--CCCCCCCccccCC-CCCceeeecCCCCcCCCCCCCCCcccCCCCCCc-----ccccCC
Q psy13144        796 CLPDYYGDGYVSCGPECILN--NDCPSNKACIRNK-FNKQAVCSCLPNYFGSPPACRPECTVNTDCPLD-----KACVNQ  867 (1031)
Q Consensus       796 C~~G~~G~~c~~~~~~C~~~--~~C~~~g~C~~~~-~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~-----~~C~~~  867 (1031)
                      |++|.+|..|..    |...  .+|..+|.|.-.. ..|+..|.|.+||+|.      .|..   |-+.     ..=...
T Consensus       132 Cp~gtyGpdCl~----Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp------~C~~---Cg~eyfes~Rne~~l  198 (350)
T KOG4260|consen  132 CPDGTYGPDCLQ----CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGP------LCRY---CGIEYFESSRNEQHL  198 (350)
T ss_pred             cCCCCcCCcccc----CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCc------cccc---cchHHHHhhcccccc
Confidence            789999998753    4322  4688888887643 2378899999999999      4431   1000     000000


Q ss_pred             cccCCCCCCCCCCCceeecCCCeee-EeCCCCccCCCccccCCCCCCCCCCCCCcCCCCC--CCCCCCCCceeecCCCce
Q psy13144        868 KCVDPCPGSCGQNANCRVINHNAVC-NCKPGFTGEPRIRCSKIPPPPPPQDVPEYVNPCI--PSPCGPNSQCRDINGSPS  944 (1031)
Q Consensus       868 ~C~~~C~~~C~~~~~C~~~~g~~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~~C~--~~~C~~~~~C~~~~g~~~  944 (1031)
                      .|. .|...|.  +.|+... +-.| .|..||..+.. .|             .|||||.  +.||.....|+|+.|||+
T Consensus       199 vCt-~Ch~~C~--~~Csg~~-~k~C~kCkkGW~lde~-gC-------------vDvnEC~~ep~~c~~~qfCvNteGSf~  260 (350)
T KOG4260|consen  199 VCT-ACHEGCL--GVCSGES-SKGCSKCKKGWKLDEE-GC-------------VDVNECQNEPAPCKAHQFCVNTEGSFK  260 (350)
T ss_pred             hhh-hhhhhhh--cccCCCC-CCChhhhcccceeccc-cc-------------ccHHHHhcCCCCCChhheeecCCCceE
Confidence            000 0111222  2343322 2234 58999987732 36             5899995  579999999999999999


Q ss_pred             eeCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCc
Q psy13144        945 CSCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFV 1009 (1031)
Q Consensus       945 C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~ 1009 (1031)
                      |...+||.+..+.|+          .-        .+.|.   ..+..|.++.++|+|+|..|+.
T Consensus       261 C~dk~Gy~~g~d~C~----------~~--------~d~~~---~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  261 CEDKEGYKKGVDECQ----------FC--------ADVCA---SKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             ecccccccCChHHhh----------hh--------hhhcc---cCCCCcccCCccEEEEecccce
Confidence            999999988533333          10        01111   1367899999999999999974


No 15 
>KOG1226|consensus
Probab=97.86  E-value=6.4e-05  Score=88.68  Aligned_cols=140  Identities=24%  Similarity=0.542  Sum_probs=93.9

Q ss_pred             CCCCCCeeeccCCCceeeCCCCcccCCCCCCCCCCCCCCC-----CCccC-CCCCCCCCCeeeCceeecCCCCc----cC
Q psy13144        734 SCGYNAECKIINHTPICTCPDGFIGDPFTSCSPKPPEPVQ-----PVIQE-DTCNCVPNAECRDGVCVCLPDYY----GD  803 (1031)
Q Consensus       734 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~-----~~~~~-~~c~C~~~~~C~~~~C~C~~G~~----G~  803 (1031)
                      .|+.+|+.+-.    +|.|.+||.|.   .|+-.......     .|... +.=+|+..|.|+=+.|+|.+...    |.
T Consensus       468 ~C~g~G~~~CG----~C~C~~G~~G~---~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~  540 (783)
T KOG1226|consen  468 LCHGNGTFVCG----QCRCDEGWLGK---KCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGK  540 (783)
T ss_pred             ccCCCCcEEec----ceecCCCCCCC---cccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeee
Confidence            45555554433    89999999999   45532211111     11111 11169999999999999987776    88


Q ss_pred             CcccCCCCCcCC--CCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCC
Q psy13144        804 GYVSCGPECILN--NDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNA  881 (1031)
Q Consensus       804 ~c~~~~~~C~~~--~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~  881 (1031)
                      .|+-+.-.|...  ..|.++|+|.-.      +|+|.+||+|+      .|+-..   ..+.|++..     -.+|+..|
T Consensus       541 fCECDnfsC~r~~g~lC~g~G~C~CG------~CvC~~GwtG~------~C~C~~---std~C~~~~-----G~iCSGrG  600 (783)
T KOG1226|consen  541 FCECDNFSCERHKGVLCGGHGRCECG------RCVCNPGWTGS------ACNCPL---STDTCESSD-----GQICSGRG  600 (783)
T ss_pred             eeeccCcccccccCcccCCCCeEeCC------cEEcCCCCccC------CCCCCC---CCccccCCC-----CceeCCCc
Confidence            898776677754  479999999864      89999999999      553211   112344321     24688888


Q ss_pred             ceeecCCCeeeEeCCC-CccCCCcccc
Q psy13144        882 NCRVINHNAVCNCKPG-FTGEPRIRCS  907 (1031)
Q Consensus       882 ~C~~~~g~~~C~C~~G-y~G~~~~~C~  907 (1031)
                      +|.=    .+|+|... |.|.   .|+
T Consensus       601 ~C~C----g~C~C~~~~~sG~---~CE  620 (783)
T KOG1226|consen  601 TCEC----GRCKCTDPPYSGE---FCE  620 (783)
T ss_pred             eeeC----CceEcCCCCcCcc---hhh
Confidence            8866    38999766 9999   786


No 16 
>KOG1226|consensus
Probab=97.78  E-value=0.00017  Score=85.17  Aligned_cols=137  Identities=20%  Similarity=0.490  Sum_probs=91.9

Q ss_pred             CCCCCCCCceeecCCCCeeecCCCCcCCCCCCC--cCcccC----CCCCCccccccCcCCCCCCCCCCCCCeeeccCCCc
Q psy13144        675 PSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCR--PECVMN----SECPSNEACINEKCGDPCPGSCGYNAECKIINHTP  748 (1031)
Q Consensus       675 ~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~--~~C~~~----~~C~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~  748 (1031)
                      +..|+.+|+.+-.    .|.|.+||.|.  .|+  ......    +.|....    +      ..+|++.|.|+=.    
T Consensus       466 s~~C~g~G~~~CG----~C~C~~G~~G~--~CEC~~~~~ss~~~~~~Cr~~~----~------~~vCSgrG~C~CG----  525 (783)
T KOG1226|consen  466 SALCHGNGTFVCG----QCRCDEGWLGK--KCECSTDELSSSEEEDKCRENS----D------SPVCSGRGDCVCG----  525 (783)
T ss_pred             ccccCCCCcEEec----ceecCCCCCCC--cccCCccccCcHhHHhhccCCC----C------CCCcCCCCcEeCC----
Confidence            3456655665544    59999999999  775  111111    2333321    0      2378888888765    


Q ss_pred             eeeCCCCcc----cCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccC--CCCCcCC--CCCCC
Q psy13144        749 ICTCPDGFI----GDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSC--GPECILN--NDCPS  820 (1031)
Q Consensus       749 ~C~C~~G~~----g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~--~~~C~~~--~~C~~  820 (1031)
                      +|+|.+...    |.   .|+-..    +.|....--.|..+|+|.-+.|+|.+||+|..|...  .+.|...  ..|..
T Consensus       526 qC~C~~~~~~~i~G~---fCECDn----fsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSG  598 (783)
T KOG1226|consen  526 QCVCHKPDNGKIYGK---FCECDN----FSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSG  598 (783)
T ss_pred             ceEecCCCCCceeee---eeeccC----cccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCC
Confidence            899988777    55   455221    111111222489999999999999999999998743  5677754  46888


Q ss_pred             CCccccCCCCCceeeecCCC-CcCC
Q psy13144        821 NKACIRNKFNKQAVCSCLPN-YFGS  844 (1031)
Q Consensus       821 ~g~C~~~~~~~~~~C~C~~G-~~g~  844 (1031)
                      .|+|+=.      +|+|.+. |.|.
T Consensus       599 rG~C~Cg------~C~C~~~~~sG~  617 (783)
T KOG1226|consen  599 RGTCECG------RCKCTDPPYSGE  617 (783)
T ss_pred             CceeeCC------ceEcCCCCcCcc
Confidence            8888864      7999887 9998


No 17 
>KOG1836|consensus
Probab=97.66  E-value=0.0017  Score=85.63  Aligned_cols=108  Identities=31%  Similarity=0.686  Sum_probs=69.8

Q ss_pred             eeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCC---ceecCCCCceeeCCCCCcCCCCCCcccCCCCC
Q psy13144        118 ICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYS---QCRDINGSPSCSCLPSYIGSPPNCRPECIQNS  194 (1031)
Q Consensus       118 ~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g---~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~  194 (1031)
                      .|.|+.||+|..++.|.+......++.   -+.  +...+|.-++   +|...  +-.|.|.+--.|.  .|+       
T Consensus       696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~---~~~--~~c~~C~cngh~~~Cd~~--tG~C~C~~~t~G~--~C~-------  759 (1705)
T KOG1836|consen  696 QCTCPVGYTGQFCESCAPGFRRLSPQL---GPF--CPCIPCDCNGHSNICDPR--TGQCKCKHNTFGG--QCA-------  759 (1705)
T ss_pred             hccCCCCcccchhhhcchhhhcccccC---CCC--CcccccccCCccccccCC--CCceecccCCCCC--chh-------
Confidence            399999999998666653332221111   122  3223444333   45433  3467888777777  665       


Q ss_pred             CCCCCCcccCCCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCCCCCCCCCcccccC--CCeeee-cCCCCcCC
Q psy13144        195 ECPYDKACINEKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQPSPCGPNSQCREVN--HQAVCS-CLPNYFGS  269 (1031)
Q Consensus       195 ~C~~~~~C~n~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~--g~y~C~-C~~Gy~g~  269 (1031)
                                         +|..||.|.+..        .. .+.|++.+|.+++.|..+.  ....|. |++||+|.
T Consensus       760 -------------------~C~~GfYg~~~~--------~~-~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~  809 (1705)
T KOG1836|consen  760 -------------------QCVDGFYGLPDL--------GT-SGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL  809 (1705)
T ss_pred             -------------------hhcCCCCCcccc--------CC-CCCCccCCCCCChhhcCcCcccceecCCCCCCCccc
Confidence                               899999998621        01 1238888999998887764  567897 99999999


No 18 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.59  E-value=3.8e-05  Score=58.27  Aligned_cols=34  Identities=26%  Similarity=0.587  Sum_probs=30.1

Q ss_pred             CCCCCCC--CCCCCCCeecccCCceeeeCCCCCcCC
Q psy13144         38 YTNPCQP--SPCGPNSQCREVNHQAVCSCLPNYFGS   71 (1031)
Q Consensus        38 ~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~Gy~g~   71 (1031)
                      |||||+.  ++|..+++|+|+.|+|+|.|++||...
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence            5899974  579989999999999999999999843


No 19 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.56  E-value=3.6e-05  Score=54.51  Aligned_cols=30  Identities=30%  Similarity=0.892  Sum_probs=27.6

Q ss_pred             CCCCCCCCCCeecccC-CceeeeCCCCCcCC
Q psy13144         42 CQPSPCGPNSQCREVN-HQAVCSCLPNYFGS   71 (1031)
Q Consensus        42 C~~~~C~~~g~C~~~~-g~~~C~C~~Gy~g~   71 (1031)
                      |.++||.++|+|++.. ++|+|+|++||+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            5678999999999988 99999999999996


No 20 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.56  E-value=5.4e-05  Score=57.48  Aligned_cols=34  Identities=32%  Similarity=0.711  Sum_probs=30.1

Q ss_pred             cCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCc
Q psy13144        964 QNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFV 1009 (1031)
Q Consensus       964 ~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~ 1009 (1031)
                      |||||...            .+.|..++.|+|+.|+|+|.|++||.
T Consensus         1 DidEC~~~------------~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEG------------PHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTT------------SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CccccCCC------------CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            67888865            35788899999999999999999998


No 21 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.49  E-value=4.2e-05  Score=54.19  Aligned_cols=30  Identities=33%  Similarity=0.979  Sum_probs=27.5

Q ss_pred             CCCCCCCCCCceeecC-CCceeeCCCCCcCC
Q psy13144        925 CIPSPCGPNSQCRDIN-GSPSCSCLPTFIGA  954 (1031)
Q Consensus       925 C~~~~C~~~~~C~~~~-g~~~C~C~~Gy~g~  954 (1031)
                      |.++||+++|+|++.. ++|+|+|++||+|+
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            4567999999999998 99999999999997


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.39  E-value=0.00018  Score=53.44  Aligned_cols=34  Identities=29%  Similarity=0.826  Sum_probs=30.4

Q ss_pred             cCCCCCC-CCCCCCCceeecCCCceeeCCCCCc-CC
Q psy13144        921 YVNPCIP-SPCGPNSQCRDINGSPSCSCLPTFI-GA  954 (1031)
Q Consensus       921 ~~~~C~~-~~C~~~~~C~~~~g~~~C~C~~Gy~-g~  954 (1031)
                      ++++|.. ++|.++++|+++.++|+|.|++||+ |.
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~   36 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR   36 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence            3688887 7999999999999999999999999 55


No 23 
>KOG1836|consensus
Probab=97.39  E-value=0.0055  Score=80.96  Aligned_cols=53  Identities=32%  Similarity=0.746  Sum_probs=39.1

Q ss_pred             cee-eCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCceecCC--CCceee-CCCCCcCCCCCCc
Q psy13144        117 PIC-RCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYSQCRDIN--GSPSCS-CLPSYIGSPPNCR  187 (1031)
Q Consensus       117 ~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g~C~~~~--g~~~C~-C~~Gy~g~~~~C~  187 (1031)
                      -+| +|..||.|....               ...-| |.+-+|.+++.|..+.  ..+.|+ |++||+|.  .|+
T Consensus       756 ~~C~~C~~GfYg~~~~---------------~~~~d-C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~--rCe  812 (1705)
T KOG1836|consen  756 GQCAQCVDGFYGLPDL---------------GTSGD-CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL--RCE  812 (1705)
T ss_pred             CchhhhcCCCCCcccc---------------CCCCC-CccCCCCCChhhcCcCcccceecCCCCCCCccc--ccc
Confidence            344 688999987621               01223 8888999988887654  567899 99999999  776


No 24 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.38  E-value=0.00019  Score=53.32  Aligned_cols=35  Identities=37%  Similarity=0.963  Sum_probs=31.2

Q ss_pred             CCCCCC-CCCCCCCceecCCCCceeeCCCCCc-CCCCCC
Q psy13144        150 VNPCYP-SPCGPYSQCRDINGSPSCSCLPSYI-GSPPNC  186 (1031)
Q Consensus       150 ~d~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~-g~~~~C  186 (1031)
                      +++|.. .+|.++++|+++.++|+|.|++||. |.  .|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~--~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR--NC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC--cC
Confidence            678887 7999999999999999999999999 66  55


No 25 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.23  E-value=0.00016  Score=52.52  Aligned_cols=30  Identities=43%  Similarity=0.835  Sum_probs=24.8

Q ss_pred             CCCCCCCeeeecCCCceeeCCCCCccCCCC
Q psy13144        985 GSCGYNALCKVINHSPICTCPDGFVGDAFS 1014 (1031)
Q Consensus       985 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~c~ 1014 (1031)
                      +.|+.+|+|+++.++|+|+|++||+|++..
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~   35 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFF   35 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence            469999999999999999999999999864


No 26 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.11  E-value=0.00011  Score=72.54  Aligned_cols=139  Identities=24%  Similarity=0.559  Sum_probs=81.1

Q ss_pred             CCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecC-----CCeeeEeCCCCccCCCc
Q psy13144        830 NKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVIN-----HNAVCNCKPGFTGEPRI  904 (1031)
Q Consensus       830 ~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~-----g~~~C~C~~Gy~G~~~~  904 (1031)
                      ++.|+|.|.+||.....   +.|+...+|...     . -+   ..+|...|+|++..     ..|+|.|.+||..... 
T Consensus        17 SNHfEC~Cnegfvl~~E---ntCE~kv~C~~~-----e-~~---~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-   83 (197)
T PF06247_consen   17 SNHFECKCNEGFVLKNE---NTCEEKVECDKL-----E-NV---NKPCGDYAKCINQANKGEERAYKCDCINGYILKQG-   83 (197)
T ss_dssp             SSEEEEEESTTEEEEET---TEEEE----SG------G-GT---TSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-
T ss_pred             cCceEEEcCCCcEEccc---cccccceecCcc-----c-cc---CccccchhhhhcCCCcccceeEEEecccCceeeCC-
Confidence            37899999999976531   133333332210     0 01   23678889999876     3599999999987633 


Q ss_pred             cccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeec---CCCceeeCCCCCc-CCCCCCCCCcccCCCCCCcchhcccccc
Q psy13144        905 RCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDI---NGSPSCSCLPTFI-GAPPNCRPECIQNSECPFDKACIREKCI  980 (1031)
Q Consensus       905 ~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~---~g~~~C~C~~Gy~-g~~~~C~~~C~~~~eC~~~~~C~~~~c~  980 (1031)
                      .|.              .++|....|.. |.|+-.   +....|+|..|+. .+...|..         .+        .
T Consensus        84 vCv--------------p~~C~~~~Cg~-GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk---------~G--------~  131 (197)
T PF06247_consen   84 VCV--------------PNKCNNKDCGS-GKCILDPDNPNNPTCSCNIGKVPDDNKKCTK---------TG--------E  131 (197)
T ss_dssp             SEE--------------EGGGSS---TT-EEEEEEEGGGSEEEEEE-TEEETTTTTESEE---------EE---------
T ss_pred             eEc--------------hhhcCceecCC-CeEEecCCCCCCceeEeeeceEeccCCcccC---------CC--------c
Confidence            364              35676667884 899743   3355999999999 33222221         01        2


Q ss_pred             CCCCCCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13144        981 DPCPGSCGYNALCKVINHSPICTCPDGFVGDAF 1013 (1031)
Q Consensus       981 ~~c~~~C~~~~~C~~~~g~~~C~C~~G~~G~~c 1013 (1031)
                      .+|..-|..+-+|..+.+.|+|.+..||.+++=
T Consensus       132 T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~  164 (197)
T PF06247_consen  132 TKCSLKCKENEECKLVDGYYKCVCKEGFPGDGE  164 (197)
T ss_dssp             -------TTTEEEEEETTEEEEEE-TT-EEETT
T ss_pred             cceeeecCCCcceeeeCcEEEeecCCCCCCCCC
Confidence            334456778899999999999999999987664


No 27 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.04  E-value=0.00027  Score=51.29  Aligned_cols=30  Identities=43%  Similarity=0.912  Sum_probs=24.6

Q ss_pred             CCCCCCCCeeeecCCCceeeCCCCCccCCc
Q psy13144        101 PGTCGQNANCKVINHSPICRCKAGFTGDPF  130 (1031)
Q Consensus       101 ~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~  130 (1031)
                      ++.|+.+|+|+++.++|+|+|++||+|++.
T Consensus         5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~   34 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGF   34 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred             CCCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence            378999999999999999999999999874


No 28 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.87  E-value=0.0012  Score=48.51  Aligned_cols=33  Identities=33%  Similarity=0.899  Sum_probs=29.9

Q ss_pred             CCCCCC-CCCCCCCceeecCCCceeeCCCCCcCC
Q psy13144        922 VNPCIP-SPCGPNSQCRDINGSPSCSCLPTFIGA  954 (1031)
Q Consensus       922 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~Gy~g~  954 (1031)
                      +++|.. .+|.+++.|++..++|+|.|++||.|.
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~   35 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR   35 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence            578877 799988999999999999999999986


No 29 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.87  E-value=0.0012  Score=48.41  Aligned_cols=35  Identities=40%  Similarity=0.982  Sum_probs=30.9

Q ss_pred             CCCCCC-CCCCCCCceecCCCCceeeCCCCCcCCCCCC
Q psy13144        150 VNPCYP-SPCGPYSQCRDINGSPSCSCLPSYIGSPPNC  186 (1031)
Q Consensus       150 ~d~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C  186 (1031)
                      +++|.. .+|.++++|++..++|+|.|++||.|.  .|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~--~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR--NC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC--cC
Confidence            567876 789988999999999999999999997  55


No 30 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.86  E-value=6.6e-05  Score=74.01  Aligned_cols=144  Identities=22%  Similarity=0.513  Sum_probs=89.3

Q ss_pred             CCCCCCeecccCCceeeeCCCCCcCCCCCCCCCCccCCCCCCCccccCCCCCCCCCCCCCCCCeeeecC-----CCceee
Q psy13144         46 PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKVIN-----HSPICR  120 (1031)
Q Consensus        46 ~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~-----g~~~C~  120 (1031)
                      .|. ||.-+...+.|.|.|.+||.+..   .++|+...+|...+.      .   .-.|..-|+|++..     ..|+|.
T Consensus         7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~---EntCE~kv~C~~~e~------~---~K~Cgdya~C~~~~~~~~~~~~~C~   73 (197)
T PF06247_consen    7 ICK-NGYLIQMSNHFECKCNEGFVLKN---ENTCEEKVECDKLEN------V---NKPCGDYAKCINQANKGEERAYKCD   73 (197)
T ss_dssp             --B-TEEEEEESSEEEEEESTTEEEEE---TTEEEE----SG-GG------T---TSEEETTEEEEE-SSTTSSTSEEEE
T ss_pred             ccc-CCEEEEccCceEEEcCCCcEEcc---ccccccceecCcccc------c---CccccchhhhhcCCCcccceeEEEe
Confidence            344 45778888899999999998853   136776665543211      1   25677889999865     479999


Q ss_pred             CCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCceecCC---CCceeeCCCCCc-CCCCCCc----ccCCC
Q psy13144        121 CKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYSQCRDIN---GSPSCSCLPSYI-GSPPNCR----PECIQ  192 (1031)
Q Consensus       121 C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g~C~~~~---g~~~C~C~~Gy~-g~~~~C~----~~C~~  192 (1031)
                      |.+||+..... |.               .+.|..-.|+ .|.|+-.+   ....|+|.-|+. .+...|.    ..|. 
T Consensus        74 C~~gY~~~~~v-Cv---------------p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-  135 (197)
T PF06247_consen   74 CINGYILKQGV-CV---------------PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-  135 (197)
T ss_dssp             E-TTEEESSSS-EE---------------EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE------
T ss_pred             cccCceeeCCe-Ec---------------hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-
Confidence            99999876533 53               3566666788 58997433   345999999999 4444565    2343 


Q ss_pred             CCCCCCCCcccCCCCCCCCCCCCCCCCCCCC
Q psy13144        193 NSECPYDKACINEKCADPCPGFCPPGTTGSP  223 (1031)
Q Consensus       193 ~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~~  223 (1031)
                       -.|..+..|....+-|+|  .+.+||.++.
T Consensus       136 -LKCk~nE~CK~~~~~Y~C--~~~~~~~~~~  163 (197)
T PF06247_consen  136 -LKCKENEECKLVDGYYKC--VCKEGFPGDG  163 (197)
T ss_dssp             ----TTTEEEEEETTEEEE--EE-TT-EEET
T ss_pred             -eecCCCcceeeeCcEEEe--ecCCCCCCCC
Confidence             246678899999999999  9999998764


No 31 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.91  E-value=0.0067  Score=39.53  Aligned_cols=24  Identities=33%  Similarity=0.573  Sum_probs=17.4

Q ss_pred             ceeeeCCCCCcCCCCCCCCCCccCCCCCCCcc
Q psy13144         59 QAVCSCLPNYFGSPPACRPECTVNSDCPLDKS   90 (1031)
Q Consensus        59 ~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~   90 (1031)
                      ||+|+|++||++.+        ++..|.++||
T Consensus         1 sy~C~C~~Gy~l~~--------d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSP--------DGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCC--------CCCccccCCC
Confidence            69999999999873        4445555553


No 32 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.88  E-value=0.0094  Score=42.90  Aligned_cols=28  Identities=29%  Similarity=0.776  Sum_probs=25.7

Q ss_pred             CCCCCCCCeecccCCceeeeCCCCCcCC
Q psy13144         44 PSPCGPNSQCREVNHQAVCSCLPNYFGS   71 (1031)
Q Consensus        44 ~~~C~~~g~C~~~~g~~~C~C~~Gy~g~   71 (1031)
                      ..+|.++++|++..++|+|.|+.||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            5788889999999999999999999886


No 33 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.88  E-value=0.01  Score=42.70  Aligned_cols=28  Identities=32%  Similarity=0.873  Sum_probs=25.7

Q ss_pred             CCCCCCCCceeecCCCceeeCCCCCcCC
Q psy13144        927 PSPCGPNSQCRDINGSPSCSCLPTFIGA  954 (1031)
Q Consensus       927 ~~~C~~~~~C~~~~g~~~C~C~~Gy~g~  954 (1031)
                      ..+|.++++|+++.++|+|.|+.||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            5689889999999999999999999987


No 34 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.84  E-value=0.01  Score=42.88  Aligned_cols=32  Identities=41%  Similarity=1.021  Sum_probs=26.1

Q ss_pred             CCC-CCCCCCCceecCCCCceeeCCCCCcCCCCCC
Q psy13144        153 CYP-SPCGPYSQCRDINGSPSCSCLPSYIGSPPNC  186 (1031)
Q Consensus       153 C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C  186 (1031)
                      |.. .+|.++ +|+++.++|+|.|++||+|. ..|
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~-~~C   34 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGD-KRC   34 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCccC-Ccc
Confidence            444 578887 99999999999999999993 155


No 35 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.72  E-value=0.013  Score=42.34  Aligned_cols=28  Identities=36%  Similarity=0.950  Sum_probs=25.0

Q ss_pred             CCC-CCCCCCCeecccCCceeeeCCCCCcC
Q psy13144         42 CQP-SPCGPNSQCREVNHQAVCSCLPNYFG   70 (1031)
Q Consensus        42 C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~g   70 (1031)
                      |.. .+|.++ +|++..++|+|.|++||+|
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence            555 688888 9999999999999999998


No 36 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.67  E-value=0.0094  Score=38.85  Aligned_cols=24  Identities=33%  Similarity=0.641  Sum_probs=18.8

Q ss_pred             CceeeCCCCCcCCCCCCCCCcccCCC
Q psy13144        942 SPSCSCLPTFIGAPPNCRPECIQNSE  967 (1031)
Q Consensus       942 ~~~C~C~~Gy~g~~~~C~~~C~~~~e  967 (1031)
                      ||+|.|++||+..+.  .+.|+||||
T Consensus         1 sy~C~C~~Gy~l~~d--~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPD--GRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCC--CCccccCCC
Confidence            699999999997642  356778876


No 37 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.21  E-value=0.067  Score=37.80  Aligned_cols=26  Identities=23%  Similarity=0.595  Sum_probs=22.9

Q ss_pred             CCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13144        986 SCGYNALCKVINHSPICTCPDGFVGDAF 1013 (1031)
Q Consensus       986 ~C~~~~~C~~~~g~~~C~C~~G~~G~~c 1013 (1031)
                      .|.++++|+...+  +|+|.+||+|..|
T Consensus         7 ~C~~~G~C~~~~g--~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSPCG--RCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence            5888999998865  8999999999875


No 38 
>KOG1218|consensus
Probab=94.11  E-value=3.3  Score=46.48  Aligned_cols=151  Identities=25%  Similarity=0.599  Sum_probs=80.0

Q ss_pred             CCceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeec-CCCCccCCcccCCCCCcCCCCCCCCCcc
Q psy13144        746 HTPICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVC-LPDYYGDGYVSCGPECILNNDCPSNKAC  824 (1031)
Q Consensus       746 g~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C-~~G~~G~~c~~~~~~C~~~~~C~~~g~C  824 (1031)
                      .+..|.+..+|.+.   .|................+.|..+.........+ ..+|.|..|... .+|..  .|.. .+|
T Consensus        47 ~~~~~~~~~~~~~~---~c~~~~~~~~~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~g~~C~~~-~~~~~--~c~~-~~C  119 (316)
T KOG1218|consen   47 NSGECGLGYGFVGS---VCRIECVCGNAGGGCSQPCRCKNGGTCVSSTGYCHLNGYEGPQCESP-CPCGD--GCAE-KTC  119 (316)
T ss_pred             CceeEecccccCCC---ccccccccCCCCCcccCccccCCCCcccCCCCcccCCCCCcccccCC-CCcCC--cccc-ccc
Confidence            34588889999988   3554433322222223333455555555444455 688998887532 12221  1333 567


Q ss_pred             ccCCCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecCCCeeeEeCCCCccCCCc
Q psy13144        825 IRNKFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHNAVCNCKPGFTGEPRI  904 (1031)
Q Consensus       825 ~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~  904 (1031)
                      .+..   . .|.+..+|.+.      .|...  -.....|.         ..|.....+....  ..|.|.+||+|.   
T Consensus       120 ~~~~---~-~c~~~~~~~~~------~C~~~--~~~g~~C~---------~~c~~~~~~~~~~--~~c~c~~g~~g~---  173 (316)
T KOG1218|consen  120 ANPR---R-ECRCGGGYIGE------QCGEE--NLVGLKCQ---------RDCQCTGGCDCKN--GICTCQPGFVGV---  173 (316)
T ss_pred             CCCc---c-ceecCCcCccc------ccccc--CCCCCCcc---------CCCCCccccCCCC--CceeccCCcccc---
Confidence            7653   2 68888888777      45430  00011122         1222222222222  378999999999   


Q ss_pred             cccCCCCCCCCCCCCCcCCCCCC-CCCCCCCceeecCC
Q psy13144        905 RCSKIPPPPPPQDVPEYVNPCIP-SPCGPNSQCRDING  941 (1031)
Q Consensus       905 ~C~~~~~~~~~~~~~~~~~~C~~-~~C~~~~~C~~~~g  941 (1031)
                      ++...            ...|.. ..+.+++.|+...+
T Consensus       174 ~~~~~------------~~~c~~~~~~~~g~~C~~~~~  199 (316)
T KOG1218|consen  174 FCVES------------CSGCSPLTACENGAKCNRSTG  199 (316)
T ss_pred             ccccc------------CCCcCCCcccCCCCeeecccc
Confidence            67532            111442 35677677776654


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.59  E-value=0.069  Score=37.72  Aligned_cols=23  Identities=35%  Similarity=0.807  Sum_probs=20.8

Q ss_pred             CCCCCeee--CceeecCCCCccCCc
Q psy13144        783 CVPNAECR--DGVCVCLPDYYGDGY  805 (1031)
Q Consensus       783 C~~~~~C~--~~~C~C~~G~~G~~c  805 (1031)
                      |+++|+|+  .++|+|.+||+|..|
T Consensus         8 C~~~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    8 CSGHGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             cCCCCEEeCCCCEEECCCCCcCCCC
Confidence            88999999  679999999999865


No 40 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.21  E-value=0.042  Score=39.92  Aligned_cols=21  Identities=29%  Similarity=0.542  Sum_probs=17.9

Q ss_pred             CeecccCCceeeeCCCCCcCC
Q psy13144         51 SQCREVNHQAVCSCLPNYFGS   71 (1031)
Q Consensus        51 g~C~~~~g~~~C~C~~Gy~g~   71 (1031)
                      .+|++.+++|+|+|++||++.
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE-
T ss_pred             CCCccCCCceEeECCCCCEEC
Confidence            389999999999999999997


No 41 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=90.83  E-value=0.11  Score=28.70  Aligned_cols=13  Identities=38%  Similarity=1.135  Sum_probs=10.5

Q ss_pred             eeeCCCCCccCCC
Q psy13144       1001 ICTCPDGFVGDAF 1013 (1031)
Q Consensus      1001 ~C~C~~G~~G~~c 1013 (1031)
                      +|.|++||+|..|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5999999999876


No 42 
>KOG1218|consensus
Probab=90.47  E-value=9.1  Score=42.91  Aligned_cols=163  Identities=22%  Similarity=0.503  Sum_probs=75.5

Q ss_pred             CceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCCcCCCCCCCCCcccc
Q psy13144        747 TPICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPECILNNDCPSNKACIR  826 (1031)
Q Consensus       747 ~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~  826 (1031)
                      +..|.|.+||.|. . .+.. .......   ...+++    .=...+|.+..+|.|..|......-.....|.....|..
T Consensus        14 ~~~c~c~~~~~g~-~-~~~~-~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~   83 (316)
T KOG1218|consen   14 SGQCFCDPGYTGR-L-QCEH-QAVTSAC---SGICPC----EVNSGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKN   83 (316)
T ss_pred             CCceecCCCcccc-c-cccC-CCCCccc---cccCCc----cCCceeEecccccCCCccccccccCCCCCcccCccccCC
Confidence            4589999999995 1 2332 1110000   011111    002346888899998876432211111123444444444


Q ss_pred             CCCCCceeeec-CCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecCCCeeeEeCCCCccCCCcc
Q psy13144        827 NKFNKQAVCSC-LPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHNAVCNCKPGFTGEPRIR  905 (1031)
Q Consensus       827 ~~~~~~~~C~C-~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~  905 (1031)
                      ......+.-.+ ..+|.|.      .|+....+...               |.. .+|.+...  .|.+..+|.+.   +
T Consensus        84 ~~~~~~~~~~~~~~~~~g~------~C~~~~~~~~~---------------c~~-~~C~~~~~--~c~~~~~~~~~---~  136 (316)
T KOG1218|consen   84 GGTCVSSTGYCHLNGYEGP------QCESPCPCGDG---------------CAE-KTCANPRR--ECRCGGGYIGE---Q  136 (316)
T ss_pred             CCcccCCCCcccCCCCCcc------cccCCCCcCCc---------------ccc-cccCCCcc--ceecCCcCccc---c
Confidence            33223334444 5677777      55443221110               222 34444332  57777777777   5


Q ss_pred             ccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCcCCCCCCCC
Q psy13144        906 CSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFIGAPPNCRP  960 (1031)
Q Consensus       906 C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~~~C~~  960 (1031)
                      |..  +.+.       ...|... |.+...+.-.  ...|.|++||.+.  .++.
T Consensus       137 C~~--~~~~-------g~~C~~~-c~~~~~~~~~--~~~c~c~~g~~g~--~~~~  177 (316)
T KOG1218|consen  137 CGE--ENLV-------GLKCQRD-CQCTGGCDCK--NGICTCQPGFVGV--FCVE  177 (316)
T ss_pred             ccc--cCCC-------CCCccCC-CCCccccCCC--CCceeccCCcccc--cccc
Confidence            653  1111       1222211 1111122212  2378899999998  5553


No 43 
>KOG3512|consensus
Probab=90.31  E-value=2.4  Score=48.15  Aligned_cols=79  Identities=23%  Similarity=0.502  Sum_probs=57.0

Q ss_pred             CCCCC-eeeecCCC-ceeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCC-------------------
Q psy13144        104 CGQNA-NCKVINHS-PICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYS-------------------  162 (1031)
Q Consensus       104 C~~~~-~C~~~~g~-~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g-------------------  162 (1031)
                      |..++ .|+-...+ ++|.|.-.-+|..+..|.++.--.|++...-.++++|....|..++                   
T Consensus       280 CNgHAs~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~Sgg  359 (592)
T KOG3512|consen  280 CNGHASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGG  359 (592)
T ss_pred             ecCccceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccc
Confidence            43333 58876665 9999999999999888988877788888877889999876665443                   


Q ss_pred             ceec----CCCCcee-eCCCCCcCCC
Q psy13144        163 QCRD----INGSPSC-SCLPSYIGSP  183 (1031)
Q Consensus       163 ~C~~----~~g~~~C-~C~~Gy~g~~  183 (1031)
                      +|+|    +.|. +| .|.+||.-++
T Consensus       360 vClnCrHnTaGr-hChyCreGyyRd~  384 (592)
T KOG3512|consen  360 VCLNCRHNTAGR-HCHYCREGYYRDG  384 (592)
T ss_pred             eEeecccCCCCc-ccccccCccccCC
Confidence            4443    3333 45 4999998553


No 44 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.67  E-value=0.14  Score=37.27  Aligned_cols=21  Identities=38%  Similarity=0.744  Sum_probs=17.7

Q ss_pred             CceecCCCCceeeCCCCCcCC
Q psy13144        162 SQCRDINGSPSCSCLPSYIGS  182 (1031)
Q Consensus       162 g~C~~~~g~~~C~C~~Gy~g~  182 (1031)
                      .+|++++++|+|.|++||++.
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE-
T ss_pred             CCCccCCCceEeECCCCCEEC
Confidence            589999999999999999987


No 45 
>smart00051 DSL delta serrate ligand.
Probab=87.85  E-value=0.65  Score=38.59  Aligned_cols=43  Identities=28%  Similarity=0.612  Sum_probs=32.7

Q ss_pred             eecCCCCccCCcccCCCCCcCCCCCCCCCccccCCCCCceeeecCCCCcCC
Q psy13144        794 CVCLPDYYGDGYVSCGPECILNNDCPSNKACIRNKFNKQAVCSCLPNYFGS  844 (1031)
Q Consensus       794 C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~~~~~~~~C~C~~G~~g~  844 (1031)
                      =.|+++|+|..|.   ..|...+....+.+|...     ..++|.+||+|.
T Consensus        19 v~C~~~~yG~~C~---~~C~~~~d~~~~~~Cd~~-----G~~~C~~Gw~G~   61 (63)
T smart00051       19 VTCDENYYGEGCN---KFCRPRDDFFGHYTCDEN-----GNKGCLEGWMGP   61 (63)
T ss_pred             eeCCCCCcCCccC---CEeCcCccccCCccCCcC-----CCEecCCCCcCC
Confidence            4688999999885   355544556778888643     489999999998


No 46 
>smart00051 DSL delta serrate ligand.
Probab=87.04  E-value=0.74  Score=38.26  Aligned_cols=47  Identities=19%  Similarity=0.393  Sum_probs=33.8

Q ss_pred             ceeeCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13144        943 PSCSCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFVGDAF 1013 (1031)
Q Consensus       943 ~~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~G~~c 1013 (1031)
                      +.=.|+++|.|.  .|...|...+                   ....+..|.. .|  .++|++||+|..|
T Consensus        17 ~rv~C~~~~yG~--~C~~~C~~~~-------------------d~~~~~~Cd~-~G--~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYYGE--GCNKFCRPRD-------------------DFFGHYTCDE-NG--NKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCcCC--ccCCEeCcCc-------------------cccCCccCCc-CC--CEecCCCCcCCCC
Confidence            344799999999  7877664322                   2345677865 45  7999999999875


No 47 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=84.77  E-value=0.22  Score=36.06  Aligned_cols=32  Identities=31%  Similarity=0.729  Sum_probs=22.4

Q ss_pred             CCCCCCCCCCceeecC-CCceeeCCCCCcCCCC
Q psy13144        925 CIPSPCGPNSQCRDIN-GSPSCSCLPTFIGAPP  956 (1031)
Q Consensus       925 C~~~~C~~~~~C~~~~-g~~~C~C~~Gy~g~~~  956 (1031)
                      |....|..++.|++.. |++.|.|.+||..++.
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~   34 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGG   34 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence            4455788889998876 9999999999987753


No 48 
>KOG3512|consensus
Probab=82.38  E-value=4  Score=46.46  Aligned_cols=118  Identities=20%  Similarity=0.518  Sum_probs=69.3

Q ss_pred             CceeecCCC-eeeEeCCCCccCCCccccCCC-CCCCCCCCCCcCCCCCCCCCCCCC-------------------cee--
Q psy13144        881 ANCRVINHN-AVCNCKPGFTGEPRIRCSKIP-PPPPPQDVPEYVNPCIPSPCGPNS-------------------QCR--  937 (1031)
Q Consensus       881 ~~C~~~~g~-~~C~C~~Gy~G~~~~~C~~~~-~~~~~~~~~~~~~~C~~~~C~~~~-------------------~C~--  937 (1031)
                      ..|+...++ ++|.|..+-.|..-.+|.+.. .-+|......++++|....|..++                   +|+  
T Consensus       285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnC  364 (592)
T KOG3512|consen  285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNC  364 (592)
T ss_pred             ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeec
Confidence            357666555 999999999998444454321 112333334567777655554333                   343  


Q ss_pred             --ecCCCcee-eCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCC----CCeeeecCCCceeeCCCCCcc
Q psy13144        938 --DINGSPSC-SCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGY----NALCKVINHSPICTCPDGFVG 1010 (1031)
Q Consensus       938 --~~~g~~~C-~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~----~~~C~~~~g~~~C~C~~G~~G 1010 (1031)
                        |+.|. +| .|.+||.-++..  +. .+-..|..              -.|+.    +-+|..+.|  .|.|++|.+|
T Consensus       365 rHnTaGr-hChyCreGyyRd~s~--pl-~hrkaCk~--------------CdChpVGs~gktCNq~tG--qCpCkeGvtG  424 (592)
T KOG3512|consen  365 RHNTAGR-HCHYCREGYYRDGSK--PL-THRKACKA--------------CDCHPVGSAGKTCNQTTG--QCPCKEGVTG  424 (592)
T ss_pred             ccCCCCc-ccccccCccccCCCC--CC-chhhhhhh--------------cCCcccccccccccccCC--cccCCCCCcc
Confidence              23333 44 689999877521  00 01111110              02333    446888888  8999999999


Q ss_pred             CCCCCCcC
Q psy13144       1011 DAFSGCYP 1018 (1031)
Q Consensus      1011 ~~c~~c~~ 1018 (1031)
                      ..|++|.+
T Consensus       425 ~tCnrCa~  432 (592)
T KOG3512|consen  425 LTCNRCAP  432 (592)
T ss_pred             cccccccc
Confidence            99999953


No 49 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.68  E-value=0.73  Score=33.46  Aligned_cols=31  Identities=26%  Similarity=0.606  Sum_probs=21.5

Q ss_pred             CCCCCCCCCCeeeecC-CceeeeCCCCcccCC
Q psy13144        597 CQPSPCGPNSQCREVN-HQAVCSCLPNYFGSP  627 (1031)
Q Consensus       597 C~~~~C~~~~~C~~~~-g~y~C~C~~Gy~g~~  627 (1031)
                      |....|..++.|++.. |++.|.|..||..++
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence            4456788899998865 999999999997664


No 50 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=79.97  E-value=1.8  Score=34.15  Aligned_cols=28  Identities=21%  Similarity=0.419  Sum_probs=23.0

Q ss_pred             eeeecCCCceeeCCCCCccCCCCCCcCCCC
Q psy13144        992 LCKVINHSPICTCPDGFVGDAFSGCYPKPP 1021 (1031)
Q Consensus       992 ~C~~~~g~~~C~C~~G~~G~~c~~c~~~~~ 1021 (1031)
                      .|....|  +|.|+++|+|..|++|.+...
T Consensus        13 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~~   40 (50)
T cd00055          13 QCDPGTG--QCECKPNTTGRRCDRCAPGYY   40 (50)
T ss_pred             cccCCCC--EEeCCCcCCCCCCCCCCCCCc
Confidence            4776677  899999999999999976443


No 51 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=78.60  E-value=1.1  Score=35.03  Aligned_cols=26  Identities=27%  Similarity=0.657  Sum_probs=21.1

Q ss_pred             CeeeecCCCceeeCCCCCccCCCCCCcC
Q psy13144        991 ALCKVINHSPICTCPDGFVGDAFSGCYP 1018 (1031)
Q Consensus       991 ~~C~~~~g~~~C~C~~G~~G~~c~~c~~ 1018 (1031)
                      +.|....|  +|.|+++|+|..|++|.+
T Consensus        11 ~~C~~~~G--~C~C~~~~~G~~C~~C~~   36 (49)
T PF00053_consen   11 QTCDPSTG--QCVCKPGTTGPRCDQCKP   36 (49)
T ss_dssp             SSEEETCE--EESBSTTEESTTS-EE-T
T ss_pred             CcccCCCC--EEeccccccCCcCcCCCC
Confidence            47888777  999999999999998875


No 52 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=77.31  E-value=3.4  Score=32.51  Aligned_cols=25  Identities=40%  Similarity=0.995  Sum_probs=21.2

Q ss_pred             ceeeCCCCCcCCCCCCcccCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCC
Q psy13144        171 PSCSCLPSYIGSPPNCRPECIQNSECPYDKACINEKCADPCPGFCPPGTTGSP  223 (1031)
Q Consensus       171 ~~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~~  223 (1031)
                      .+|.|+++|+|.  .|+                          +|.+||.+.+
T Consensus        19 G~C~C~~~~~G~--~C~--------------------------~C~~g~~~~~   43 (50)
T cd00055          19 GQCECKPNTTGR--RCD--------------------------RCAPGYYGLP   43 (50)
T ss_pred             CEEeCCCcCCCC--CCC--------------------------CCCCCCccCC
Confidence            379999999999  776                          8889998874


No 53 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=72.74  E-value=3  Score=44.42  Aligned_cols=33  Identities=24%  Similarity=0.464  Sum_probs=26.8

Q ss_pred             CCCCCCCC--CCCCCCCceecCCCCceeeCCCCCcCC
Q psy13144        148 EPVNPCYP--SPCGPYSQCRDINGSPSCSCLPSYIGS  182 (1031)
Q Consensus       148 ~~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~Gy~g~  182 (1031)
                      .++++|..  ++|.  ..|.++.|+|.|.|++||++.
T Consensus       185 ~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         185 VVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL  219 (224)
T ss_pred             cCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence            46788864  3465  589999999999999999875


No 54 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=72.32  E-value=2  Score=33.60  Aligned_cols=22  Identities=27%  Similarity=0.801  Sum_probs=16.4

Q ss_pred             CceecCCCCceeeCCCCCcCCCCCCc
Q psy13144        162 SQCRDINGSPSCSCLPSYIGSPPNCR  187 (1031)
Q Consensus       162 g~C~~~~g~~~C~C~~Gy~g~~~~C~  187 (1031)
                      .+|...  ..+|.|+++|+|.  .|+
T Consensus        11 ~~C~~~--~G~C~C~~~~~G~--~C~   32 (49)
T PF00053_consen   11 QTCDPS--TGQCVCKPGTTGP--RCD   32 (49)
T ss_dssp             SSEEET--CEEESBSTTEEST--TS-
T ss_pred             CcccCC--CCEEeccccccCC--cCc
Confidence            367653  3489999999999  776


No 55 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=71.50  E-value=3.4  Score=44.02  Aligned_cols=32  Identities=19%  Similarity=0.480  Sum_probs=26.8

Q ss_pred             CCCCCCC--CCCCCCCeecccCCceeeeCCCCCcCC
Q psy13144         38 YTNPCQP--SPCGPNSQCREVNHQAVCSCLPNYFGS   71 (1031)
Q Consensus        38 ~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~Gy~g~   71 (1031)
                      ++++|..  ++|.  ..|++..|+|.|+|++||++.
T Consensus       186 ~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         186 VPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL  219 (224)
T ss_pred             CchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence            6788863  4565  589999999999999999986


No 56 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=69.65  E-value=4.4  Score=31.30  Aligned_cols=27  Identities=22%  Similarity=0.513  Sum_probs=22.1

Q ss_pred             eeeecCCCceeeCCCCCccCCCCCCcCCC
Q psy13144        992 LCKVINHSPICTCPDGFVGDAFSGCYPKP 1020 (1031)
Q Consensus       992 ~C~~~~g~~~C~C~~G~~G~~c~~c~~~~ 1020 (1031)
                      .|....|  .|.|+++|+|..|++|.+..
T Consensus        12 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~   38 (46)
T smart00180       12 TCDPDTG--QCECKPNVTGRRCDRCAPGY   38 (46)
T ss_pred             cccCCCC--EEECCCCCCCCCCCcCCCCc
Confidence            5666666  89999999999999997643


No 57 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=69.01  E-value=5.6  Score=30.70  Aligned_cols=14  Identities=36%  Similarity=1.094  Sum_probs=12.6

Q ss_pred             eeeCCCCCcCCCCCCc
Q psy13144        172 SCSCLPSYIGSPPNCR  187 (1031)
Q Consensus       172 ~C~C~~Gy~g~~~~C~  187 (1031)
                      +|.|+++|+|.  .|+
T Consensus        19 ~C~C~~~~~G~--~C~   32 (46)
T smart00180       19 QCECKPNVTGR--RCD   32 (46)
T ss_pred             EEECCCCCCCC--CCC
Confidence            89999999998  776


No 58 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=58.18  E-value=9.6  Score=35.80  Aligned_cols=30  Identities=20%  Similarity=0.440  Sum_probs=23.6

Q ss_pred             CCCCCCeeeecC--CCceeeCCCCCccCCCCCC
Q psy13144        986 SCGYNALCKVIN--HSPICTCPDGFVGDAFSGC 1016 (1031)
Q Consensus       986 ~C~~~~~C~~~~--g~~~C~C~~G~~G~~c~~c 1016 (1031)
                      -|-++ .|.-..  ..+.|.|..||+|.+|++=
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCcccccccce
Confidence            46644 888554  6789999999999999843


No 59 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=57.77  E-value=15  Score=28.95  Aligned_cols=19  Identities=37%  Similarity=1.095  Sum_probs=17.2

Q ss_pred             CCCCCeeeCceeecCCCCc
Q psy13144        783 CVPNAECRDGVCVCLPDYY  801 (1031)
Q Consensus       783 C~~~~~C~~~~C~C~~G~~  801 (1031)
                      |..++.|++++|+|++||.
T Consensus        28 C~~~s~C~~g~C~C~~g~~   46 (52)
T PF01683_consen   28 CIGGSVCVNGRCQCPPGYV   46 (52)
T ss_pred             CCCcCEEcCCEeECCCCCE
Confidence            6688999999999999985


No 60 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=57.10  E-value=16  Score=28.90  Aligned_cols=21  Identities=33%  Similarity=0.865  Sum_probs=16.0

Q ss_pred             CCCCCCeeeecCCCceeeCCCCCcc
Q psy13144        986 SCGYNALCKVINHSPICTCPDGFVG 1010 (1031)
Q Consensus       986 ~C~~~~~C~~~~g~~~C~C~~G~~G 1010 (1031)
                      .|..++.|++  |  +|.|++||+-
T Consensus        27 qC~~~s~C~~--g--~C~C~~g~~~   47 (52)
T PF01683_consen   27 QCIGGSVCVN--G--RCQCPPGYVE   47 (52)
T ss_pred             CCCCcCEEcC--C--EeECCCCCEe
Confidence            4667888944  4  8999999863


No 61 
>PHA02887 EGF-like protein; Provisional
Probab=49.51  E-value=16  Score=33.90  Aligned_cols=29  Identities=28%  Similarity=0.582  Sum_probs=22.9

Q ss_pred             CCCCCCeeeecC--CCceeeCCCCCccCCCCC
Q psy13144        986 SCGYNALCKVIN--HSPICTCPDGFVGDAFSG 1015 (1031)
Q Consensus       986 ~C~~~~~C~~~~--g~~~C~C~~G~~G~~c~~ 1015 (1031)
                      -|- +|.|.-..  ..+.|.|+.||+|.+|+.
T Consensus        93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             Eee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence            466 57888544  468999999999999983


No 62 
>PHA02887 EGF-like protein; Provisional
Probab=48.60  E-value=13  Score=34.30  Aligned_cols=32  Identities=25%  Similarity=0.537  Sum_probs=25.8

Q ss_pred             CCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCccc
Q psy13144        816 NDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTV  854 (1031)
Q Consensus       816 ~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~  854 (1031)
                      +-|. ||+|..........|.|+.||+|.      +|+.
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~------RCE~  123 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTGI------RCDE  123 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCcccC------CCCc
Confidence            4466 478988776788999999999999      7764


No 63 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=47.36  E-value=16  Score=34.09  Aligned_cols=32  Identities=34%  Similarity=0.828  Sum_probs=25.2

Q ss_pred             CCCCCC-CCCCCCCCeecccCCceeeeCCCCCcC
Q psy13144         38 YTNPCQ-PSPCGPNSQCREVNHQAVCSCLPNYFG   70 (1031)
Q Consensus        38 ~~d~C~-~~~C~~~g~C~~~~g~~~C~C~~Gy~g   70 (1031)
                      ..|.|+ ...|+.+|.|... .+..|.|.+||.-
T Consensus        76 p~d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNSN-NSPKCSCLPGFEP  108 (110)
T ss_pred             cccCCCCccccCCccEeCCC-CCCceECCCCcCC
Confidence            346787 4889999999654 4567999999974


No 64 
>KOG3516|consensus
Probab=46.72  E-value=14  Score=47.16  Aligned_cols=42  Identities=26%  Similarity=0.666  Sum_probs=36.4

Q ss_pred             CCCCCCCCCCCCCCCCCCCceecCCCCceeeCC-CCCcCCCCCCc
Q psy13144        144 EDVPEPVNPCYPSPCGPYSQCRDINGSPSCSCL-PSYIGSPPNCR  187 (1031)
Q Consensus       144 ~~~~~~~d~C~~~~C~~~g~C~~~~g~~~C~C~-~Gy~g~~~~C~  187 (1031)
                      .|.+..+|.|.+++|.++|.|......|.|.|. .||.|.  +|.
T Consensus       539 id~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH  581 (1306)
T KOG3516|consen  539 IDMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH  581 (1306)
T ss_pred             ecccccccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence            344456788899999999999998889999999 999999  887


No 65 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=42.03  E-value=17  Score=34.31  Aligned_cols=32  Identities=25%  Similarity=0.583  Sum_probs=25.6

Q ss_pred             CCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCccc
Q psy13144        816 NDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTV  854 (1031)
Q Consensus       816 ~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~  854 (1031)
                      +-|.++ .|....+...+.|.|..||+|.      +|+.
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGe------RCEh   82 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTGI------RCQH   82 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCcccc------cccc
Confidence            346554 8887766788999999999999      8875


No 66 
>KOG3516|consensus
Probab=40.47  E-value=23  Score=45.34  Aligned_cols=36  Identities=25%  Similarity=0.675  Sum_probs=32.6

Q ss_pred             CCCCCCCCCCCCCCCCcccccCCCeeeecC-CCCcCC
Q psy13144        234 PVYTNPCQPSPCGPNSQCREVNHQAVCSCL-PNYFGS  269 (1031)
Q Consensus       234 ~~~~d~C~~~~C~~~g~C~~~~g~y~C~C~-~Gy~g~  269 (1031)
                      |.-+|.|.+++|..+|.|......|.|.|. .||+|.
T Consensus       542 C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga  578 (1306)
T KOG3516|consen  542 CGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA  578 (1306)
T ss_pred             cccccccCCccccCCCcccccccceeEeccccccccc
Confidence            344688999999999999999999999999 899998


No 67 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=38.43  E-value=28  Score=32.41  Aligned_cols=33  Identities=42%  Similarity=0.983  Sum_probs=25.6

Q ss_pred             CCCCCC-CCCCCCCCceecCCCCceeeCCCCCcCC
Q psy13144        149 PVNPCY-PSPCGPYSQCRDINGSPSCSCLPSYIGS  182 (1031)
Q Consensus       149 ~~d~C~-~~~C~~~g~C~~~~g~~~C~C~~Gy~g~  182 (1031)
                      ..|.|. ...|+.++.|.. ..+..|.|.+||.-.
T Consensus        76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence            467887 478999999954 456689999999753


No 68 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=37.08  E-value=25  Score=25.10  Aligned_cols=21  Identities=14%  Similarity=-0.007  Sum_probs=15.6

Q ss_pred             CccccCCCceeeCCCCCCCCC
Q psy13144          5 DTKINTYEVFYSCPPGTTGSP   25 (1031)
Q Consensus         5 ~~c~~~~~~~C~C~~G~~g~~   25 (1031)
                      |.|.....+.|.|+.||..+.
T Consensus        10 A~CDpn~~~~C~CPeGyIlde   30 (34)
T PF09064_consen   10 ADCDPNSPGQCFCPEGYILDE   30 (34)
T ss_pred             CccCCCCCCceeCCCceEecC
Confidence            445555566799999999774


No 69 
>KOG3514|consensus
Probab=36.33  E-value=23  Score=44.60  Aligned_cols=36  Identities=28%  Similarity=0.679  Sum_probs=32.1

Q ss_pred             CCCCCCCCCCCceeecCCCceeeCC-CCCcCCCCCCCCC
Q psy13144        924 PCIPSPCGPNSQCRDINGSPSCSCL-PTFIGAPPNCRPE  961 (1031)
Q Consensus       924 ~C~~~~C~~~~~C~~~~g~~~C~C~-~Gy~g~~~~C~~~  961 (1031)
                      .|.++||+|+++|....+.|.|.|. .||.|.  .|+++
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~CerE  661 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCERE  661 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCc--cccce
Confidence            7899999999999999999999998 679888  78753


No 70 
>KOG3514|consensus
Probab=31.37  E-value=32  Score=43.45  Aligned_cols=35  Identities=23%  Similarity=0.622  Sum_probs=32.1

Q ss_pred             CCCCCCCCCCCceecCCCCceeeCC-CCCcCCCCCCcc
Q psy13144        152 PCYPSPCGPYSQCRDINGSPSCSCL-PSYIGSPPNCRP  188 (1031)
Q Consensus       152 ~C~~~~C~~~g~C~~~~g~~~C~C~-~Gy~g~~~~C~~  188 (1031)
                      .|.++||.|+|+|.....+|.|.|. .||.|.  +|+.
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCER  660 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCc--cccc
Confidence            6899999999999999999999996 699999  9984


No 71 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=26.47  E-value=20  Score=29.86  Aligned_cols=44  Identities=30%  Similarity=0.577  Sum_probs=20.3

Q ss_pred             eeecCCCCccCCcccCCCCCcCCCCCCCCCccccCCCCCceeeecCCCCcCC
Q psy13144        793 VCVCLPDYYGDGYVSCGPECILNNDCPSNKACIRNKFNKQAVCSCLPNYFGS  844 (1031)
Q Consensus       793 ~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~~~~~~~~C~C~~G~~g~  844 (1031)
                      .-+|.+.|+|..|..   .|...+.=..+-+|...     ++=+|.+||+|.
T Consensus        18 rv~C~~nyyG~~C~~---~C~~~~d~~ghy~Cd~~-----G~~~C~~Gw~G~   61 (63)
T PF01414_consen   18 RVVCDENYYGPNCSK---FCKPRDDSFGHYTCDSN-----GNKVCLPGWTGP   61 (63)
T ss_dssp             -----TTEETTTT-E---E---EEETTEEEEE-SS-------EEE-TTEEST
T ss_pred             EEECCCCCCCccccC---CcCCCcCCcCCcccCCC-----CCCCCCCCCcCC
Confidence            356889999998852   33322222334456643     255789999998


No 72 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=20.46  E-value=3e+02  Score=32.04  Aligned_cols=41  Identities=24%  Similarity=0.703  Sum_probs=22.8

Q ss_pred             eCCCCCcCCCCCCcccCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCC
Q psy13144        174 SCLPSYIGSPPNCRPECIQNSECPYDKACINEKCADPCPGFCPPGTTGS  222 (1031)
Q Consensus       174 ~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~  222 (1031)
                      .|++||+..+..|. .|.  ..|   .+|.+. ....|. .|++||...
T Consensus        95 ~C~~G~y~~~~~C~-~C~--~~C---~~C~~~-~~~~Ct-~C~~g~~L~  135 (397)
T PF03302_consen   95 ECPDGYYKNGNKCV-PCH--ESC---ATCSGG-APNQCT-SCKPGKVLK  135 (397)
T ss_pred             CCCCCccccCCCCC-CCC--ccc---cccCCC-CCCCCc-ccCCCcccc
Confidence            79999997655665 111  111   133322 223343 889998766


Done!