Query         psy826
Match_columns 329
No_of_seqs    366 out of 2268
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 22:45:40 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy826.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/826hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4260|consensus              100.0 1.7E-61 3.6E-66  418.2   7.2  272    2-284    42-317 (350)
  2 KOG1214|consensus               99.2 8.6E-12 1.9E-16  123.4   6.5  127   93-256   720-866 (1289)
  3 PF11938 DUF3456:  TLR4 regulat  98.9 1.6E-10 3.4E-15   96.4  -1.2   88    2-93     16-151 (151)
  4 KOG1219|consensus               98.9 1.3E-09 2.8E-14  116.8   3.6  101  109-249  3869-3974(4289)
  5 KOG4260|consensus               98.7 1.3E-08 2.8E-13   89.6   3.6  107  132-264   132-244 (350)
  6 KOG0994|consensus               98.5   4E-07 8.6E-12   93.6   9.0  188   47-255   742-966 (1758)
  7 KOG4289|consensus               98.3 7.6E-07 1.7E-11   93.1   5.8  121   28-185  1168-1308(2531)
  8 KOG1225|consensus               98.3   2E-06 4.3E-11   84.3   7.5   40   93-142   238-279 (525)
  9 KOG0994|consensus               98.3 1.9E-06 4.2E-11   88.7   7.6  150   93-249   834-1010(1758)
 10 KOG1214|consensus               98.1 4.9E-06 1.1E-10   83.5   5.9  118  109-262   699-833 (1289)
 11 KOG1217|consensus               98.1 2.5E-05 5.5E-10   76.2  10.7  123   92-222   155-296 (487)
 12 KOG1225|consensus               98.0 1.1E-05 2.4E-10   79.1   6.9   96   90-250   266-365 (525)
 13 PF14843 GF_recep_IV:  Growth f  98.0 4.1E-06 8.8E-11   68.2   2.2  104  125-240    14-132 (132)
 14 PF07645 EGF_CA:  Calcium-bindi  97.9 4.6E-06   1E-10   53.8   1.8   34  196-248     1-35  (42)
 15 PTZ00214 high cysteine membran  97.7 0.00027 5.8E-09   73.5  11.5   62  126-195   590-665 (800)
 16 KOG4289|consensus               97.7 3.6E-05 7.7E-10   81.2   4.8   92  125-254  1219-1316(2531)
 17 KOG1836|consensus               97.6  0.0002 4.2E-09   79.2   9.6   96   91-192   697-812 (1705)
 18 KOG1219|consensus               97.6 5.2E-05 1.1E-09   83.0   4.5   81   93-190  3890-3975(4289)
 19 PF03302 VSP:  Giardia variant-  97.5 0.00032   7E-09   67.6   8.3   33  218-250   104-136 (397)
 20 PTZ00214 high cysteine membran  97.5 0.00041 8.9E-09   72.2   9.1  106  140-257   560-668 (800)
 21 PF14843 GF_recep_IV:  Growth f  97.5 7.4E-05 1.6E-09   60.8   2.9   85  163-260     2-94  (132)
 22 KOG1836|consensus               97.3 0.00085 1.9E-08   74.3   9.1   92   92-190   760-879 (1705)
 23 PF03302 VSP:  Giardia variant-  97.3  0.0024 5.1E-08   61.7  11.0  106  140-256    92-224 (397)
 24 PF12947 EGF_3:  EGF domain;  I  97.1  0.0002 4.3E-09   44.4   1.0   31  204-253     6-36  (36)
 25 cd00064 FU Furin-like repeats.  97.0 0.00054 1.2E-08   45.5   2.7   38  223-260     1-38  (49)
 26 PF12662 cEGF:  Complement Clr-  97.0 0.00042 9.2E-09   38.7   1.4   23  176-199     1-24  (24)
 27 smart00261 FU Furin-like repea  96.9 0.00085 1.8E-08   43.9   3.0   39  219-257     2-40  (46)
 28 KOG1217|consensus               96.9  0.0058 1.3E-07   59.5   9.8  133   92-260   255-404 (487)
 29 PF07645 EGF_CA:  Calcium-bindi  96.6  0.0009   2E-08   43.0   0.9   31  104-137     4-34  (42)
 30 smart00179 EGF_CA Calcium-bind  96.5  0.0031 6.7E-08   39.2   3.0   33  196-247     1-33  (39)
 31 PF07974 EGF_2:  EGF-like domai  96.4  0.0043 9.3E-08   37.3   3.2   27  110-141     6-32  (32)
 32 PF00008 EGF:  EGF-like domain   96.4 0.00082 1.8E-08   40.5   0.0   28  109-139     3-31  (32)
 33 PF12947 EGF_3:  EGF domain;  I  96.3  0.0011 2.3E-08   41.1   0.2   29  108-139     4-32  (36)
 34 smart00179 EGF_CA Calcium-bind  96.3  0.0052 1.1E-07   38.1   3.4   29  110-141     9-38  (39)
 35 KOG1226|consensus               96.3   0.017 3.6E-07   58.7   8.4   87   91-202   480-590 (783)
 36 KOG4052|consensus               96.3 0.00015 3.3E-09   59.4  -4.9   58   26-87     76-185 (190)
 37 smart00051 DSL delta serrate l  95.9  0.0091   2E-07   41.9   3.4   44   91-141    19-63  (63)
 38 KOG1025|consensus               95.9  0.0064 1.4E-07   62.4   3.6   83  163-258   489-572 (1177)
 39 cd00055 EGF_Lam Laminin-type e  95.8   0.012 2.5E-07   39.3   3.5   27  125-151    16-42  (50)
 40 KOG0196|consensus               95.8   0.012 2.5E-07   60.2   4.7   60  126-190   257-320 (996)
 41 cd00053 EGF Epidermal growth f  95.8   0.013 2.8E-07   35.2   3.2   30  109-141     5-35  (36)
 42 KOG1025|consensus               95.7   0.024 5.1E-07   58.4   6.5  105  125-242   501-615 (1177)
 43 PF12662 cEGF:  Complement Clr-  95.7  0.0052 1.1E-07   34.3   1.0   17  242-258     6-24  (24)
 44 cd00054 EGF_CA Calcium-binding  95.6   0.017 3.7E-07   35.2   3.3   29  110-141     9-37  (38)
 45 PF00053 Laminin_EGF:  Laminin   95.5  0.0061 1.3E-07   40.4   1.2   26  125-150    15-40  (49)
 46 smart00180 EGF_Lam Laminin-typ  95.4   0.013 2.9E-07   38.3   2.5   25  126-150    16-40  (46)
 47 smart00181 EGF Epidermal growt  95.4   0.019 4.2E-07   34.7   3.0   28  110-141     6-34  (35)
 48 smart00261 FU Furin-like repea  95.1   0.032 6.8E-07   36.3   3.4   35  159-195     3-37  (46)
 49 PF14670 FXa_inhibition:  Coagu  95.0    0.01 2.2E-07   36.7   0.9   28  164-192     5-33  (36)
 50 KOG3512|consensus               95.0   0.062 1.3E-06   51.7   6.5   76  112-190   280-384 (592)
 51 PF12661 hEGF:  Human growth fa  94.5   0.015 3.2E-07   27.5   0.6   13  129-141     1-13  (13)
 52 PF14670 FXa_inhibition:  Coagu  94.3   0.018 3.9E-07   35.6   0.7   24  205-249     7-30  (36)
 53 cd00064 FU Furin-like repeats.  94.1   0.087 1.9E-06   34.7   3.8   32  162-195     1-32  (49)
 54 cd00054 EGF_CA Calcium-binding  94.0   0.056 1.2E-06   32.9   2.7   34  197-249     2-35  (38)
 55 PF00008 EGF:  EGF-like domain   93.9   0.024 5.2E-07   34.0   0.8   27  204-249     4-31  (32)
 56 PF06247 Plasmod_Pvs28:  Plasmo  93.7  0.0094   2E-07   50.5  -1.8  122   83-249    20-162 (197)
 57 PF00594 Gla:  Vitamin K-depend  93.4   0.019 4.2E-07   36.8  -0.3   39   19-69      1-39  (42)
 58 KOG1226|consensus               93.4    0.16 3.4E-06   51.9   5.9   38   98-142   538-580 (783)
 59 cd00053 EGF Epidermal growth f  92.8    0.13 2.7E-06   30.6   2.8   27  204-249     6-32  (36)
 60 PF12946 EGF_MSP1_1:  MSP1 EGF   92.7    0.04 8.6E-07   34.0   0.4   32  204-254     5-37  (37)
 61 PTZ00382 Variant-specific surf  92.5    0.15 3.3E-06   39.0   3.4   27  227-255    29-55  (96)
 62 smart00181 EGF Epidermal growt  92.4    0.15 3.2E-06   30.7   2.7   25  205-249     7-31  (35)
 63 smart00069 GLA Domain containi  92.2   0.094   2E-06   37.0   1.8   38   18-66     24-61  (65)
 64 PF06247 Plasmod_Pvs28:  Plasmo  91.7   0.033 7.2E-07   47.3  -1.1   61  175-255    18-87  (197)
 65 KOG3512|consensus               90.0    0.84 1.8E-05   44.2   6.4   65  126-192   357-428 (592)
 66 PTZ00382 Variant-specific surf  88.7    0.74 1.6E-05   35.2   4.1   45  140-190     5-50  (96)
 67 PF01414 DSL:  Delta serrate li  86.6    0.15 3.2E-06   35.8  -0.7   44   91-141    19-63  (63)
 68 cd01475 vWA_Matrilin VWA_Matri  85.8     0.5 1.1E-05   41.7   2.1   37  192-249   182-219 (224)
 69 PF09064 Tme5_EGF_like:  Thromb  84.3     1.1 2.3E-05   27.1   2.3   22  170-192    11-32  (34)
 70 cd00185 TNFR Tumor necrosis fa  83.9     3.3 7.1E-05   31.6   5.6   20  170-190    68-87  (98)
 71 cd00185 TNFR Tumor necrosis fa  83.7     4.2   9E-05   31.0   6.1   18  233-251    71-88  (98)
 72 PHA03099 epidermal growth fact  83.2    0.14 3.1E-06   40.6  -2.3   32  204-257    51-84  (139)
 73 PF01683 EB:  EB module;  Inter  82.0     2.1 4.6E-05   28.3   3.4   21  110-137    26-46  (52)
 74 PHA02887 EGF-like protein; Pro  80.1     1.4 3.1E-05   34.5   2.3   37  104-142    85-122 (126)
 75 PF01683 EB:  EB module;  Inter  76.3       4 8.7E-05   27.0   3.4   12  242-253    41-52  (52)
 76 PHA02637 TNF-alpha-receptor-li  74.6     5.4 0.00012   31.9   4.2   12  140-151    64-75  (127)
 77 KOG4611|consensus               68.2     5.4 0.00012   37.5   3.3   25  140-168    99-123 (747)
 78 KOG3782|consensus               67.5    0.79 1.7E-05   38.1  -2.0   18   26-43     69-86  (189)
 79 KOG1218|consensus               64.4      49  0.0011   30.3   9.1  107   94-221    96-203 (316)
 80 PF01826 TIL:  Trypsin Inhibito  61.6     5.4 0.00012   26.7   1.6   22  178-200    34-55  (55)
 81 cd01475 vWA_Matrilin VWA_Matri  58.7     8.4 0.00018   33.9   2.7   30  104-138   189-218 (224)
 82 PF00757 Furin-like:  Furin-lik  56.5     5.4 0.00012   33.0   1.0   34  224-259    51-85  (149)
 83 KOG0196|consensus               52.8      14 0.00029   38.8   3.3   49   89-137   259-317 (996)
 84 PHA03099 epidermal growth fact  52.3      12 0.00026   30.0   2.2   37  104-142    44-81  (139)
 85 PF07699 GCC2_GCC3:  GCC2 and G  41.2      46   0.001   21.4   3.5   10  140-149    10-19  (48)
 86 PHA02887 EGF-like protein; Pro  40.0      20 0.00043   28.2   1.8   20   78-103   103-122 (126)
 87 PHA02637 TNF-alpha-receptor-li  39.5      61  0.0013   26.0   4.5    9  141-149    44-52  (127)
 88 PF00954 S_locus_glycop:  S-loc  39.1      28 0.00061   26.8   2.6   33  196-248    76-108 (110)
 89 PF04863 EGF_alliinase:  Alliin  37.0      13 0.00028   25.1   0.2   32  111-142    18-50  (56)
 90 PF04706 Dickkopf_N:  Dickkopf   34.3      66  0.0014   21.5   3.4   32  200-231     2-33  (52)
 91 PF00954 S_locus_glycop:  S-loc  33.9      39 0.00085   26.0   2.6   26  109-138    83-108 (110)
 92 PF07562 NCD3G:  Nine Cysteines  33.5      11 0.00023   25.3  -0.5    8  142-149     7-14  (54)
 93 KOG0814|consensus               30.0     9.2  0.0002   32.5  -1.6   26   16-41    189-215 (237)
 94 KOG3516|consensus               25.9      55  0.0012   35.8   2.8   32  108-142   549-581 (1306)
 95 KOG4258|consensus               25.0      81  0.0018   33.5   3.7   88   82-172   213-335 (1025)
 96 PHA02714 CD-30-like protein; P  24.2      86  0.0019   23.8   2.8   18  170-188    52-69  (110)
 97 PF12955 DUF3844:  Domain of un  21.1      43 0.00093   25.9   0.7   30  107-136    10-41  (103)
 98 KOG4611|consensus               20.9 1.2E+02  0.0025   28.9   3.6   18  177-194    98-115 (747)
 99 KOG3509|consensus               20.6   1E+02  0.0022   33.4   3.5   62  128-190   718-793 (964)

No 1  
>KOG4260|consensus
Probab=100.00  E-value=1.7e-61  Score=418.15  Aligned_cols=272  Identities=43%  Similarity=0.952  Sum_probs=235.8

Q ss_pred             cchhhccCCCCCCCCchhhhhccCcccccceEEEecccCcccCcc--cCccchhhHHhhhHHHHHHhhhhccCCcccCCC
Q psy826            2 SGIEKTAKGNFAGGDTAWEEEKQKIYAKSEVRLIEIQEKMCSEVS--GFLDQCHNFAADIESEIEEWWFKVQHSKAKDSD   79 (329)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~ee~~~~~y~~se~~~~e~~e~~C~~~~--~~~~~c~~~~~~~~~~~e~~~~~~~~~~~~~~~   79 (329)
                      +||+||+|+||+||||||||++|++|+.||+||+||+|++|+..+  ..||+||.+++.+|++||.||++.|+   ..||
T Consensus        42 ~GlerT~r~hfaGGdTAWEEknL~kYk~SE~RLvEilEglCsks~~~n~DfeCh~lle~hEellE~w~~hkq~---e~Pd  118 (350)
T KOG4260|consen   42 EGLERTARHHFAGGDTAWEEKNLSKYKTSETRLVEILEGLCSKSSLPNMDFECHTLLEKHEELLEEWWYHKQH---ESPD  118 (350)
T ss_pred             HHHHHHhhhccCCCchhhhhhhhhhccccchhHHHHHHHhhhccCCCCCChHHHHHHHHHHHHHHHHHHHhhc---CCch
Confidence            699999999999999999999999999999999999999999321  23899999999999999999999999   8899


Q ss_pred             ccceeecccccCcCCCCCCCCCCccCCCCC-CCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCCC
Q psy826           80 LYTWLCINKLKRCCPVDHYGADCKPCLGFP-NVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKTI  158 (329)
Q Consensus        80 ~~~~~C~~~~~~~C~~G~~G~~C~~C~~~~-~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~~  158 (329)
                      +.+|+|++..+++||+|.||++|.+||+++ .+|+++|+|.+++++.|++.|.|.+||.|+.|..|.++||....++...
T Consensus       119 l~~WlCvdqLkvCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~l  198 (350)
T KOG4260|consen  119 LFNWLCVDQLKVCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHL  198 (350)
T ss_pred             HHhHhhhhhheeccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccc
Confidence            999999999999999999999999999866 7999999999999999999999999999999999999999999999999


Q ss_pred             CCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCCC-CCCCCCceeecCCCCcccCCCcCCCCCCcCCccc
Q psy826          159 LCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDE-NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPD  237 (329)
Q Consensus       159 ~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~-~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~  237 (329)
                      +|..||..|...|++.++..|..|+.||.++. ..|+|||||... .+|...++|+|+.|||+|.. .+++..    ++.
T Consensus       199 vCt~Ch~~C~~~Csg~~~k~C~kCkkGW~lde-~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~d-k~Gy~~----g~d  272 (350)
T KOG4260|consen  199 VCTACHEGCLGVCSGESSKGCSKCKKGWKLDE-EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCED-KEGYKK----GVD  272 (350)
T ss_pred             hhhhhhhhhhcccCCCCCCChhhhcccceecc-cccccHHHHhcCCCCCChhheeecCCCceEecc-cccccC----ChH
Confidence            99999999998999999999999999999995 789999999987 89999999999999999982 433332    466


Q ss_pred             ccccCcccccccCCcccccCcCCCCccccceeeeeeeceeeeEEEee
Q psy826          238 MCEACAEGYKLQQNICINTQAKSQNTNENLYRYGVYVGLCVATYIIF  284 (329)
Q Consensus       238 ~C~~C~~Gy~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  284 (329)
                      .|..|..++.+.+..|.+++....  .+.+..+.+++..++...+.+
T Consensus       273 ~C~~~~d~~~~kn~~c~ni~~~~r--~v~f~~~~~~~g~cV~~~~p~  317 (350)
T KOG4260|consen  273 ECQFCADVCASKNRPCMNIDGQYR--CVCFSGLIIIEGFCVWHGSPV  317 (350)
T ss_pred             HhhhhhhhcccCCCCcccCCccEE--EEecccceeeeeeeeccCCch
Confidence            777777777777777766653322  334445555555554444333


No 2  
>KOG1214|consensus
Probab=99.25  E-value=8.6e-12  Score=123.37  Aligned_cols=127  Identities=29%  Similarity=0.689  Sum_probs=98.7

Q ss_pred             CCCCCC--CCCCc---cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCC----CCCCC---------cCCCCccccccC
Q psy826           93 CPVDHY--GADCK---PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYT----GELCN---------ECNTGYFQSYKD  154 (329)
Q Consensus        93 C~~G~~--G~~C~---~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~----g~~C~---------~C~~g~y~~~~~  154 (329)
                      |..||.  |..|.   +|......|..++.|.+   .+++++|.|..||.    +-+|.         .|.+|       
T Consensus       720 cs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin---~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g-------  789 (1289)
T KOG1214|consen  720 CSSGYQGDGRNCVDENECATGFHRCGPNSVCIN---LPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDG-------  789 (1289)
T ss_pred             EeeccCCCCCCCCChhhhccCCCCCCCCceeec---CCCceeEEEeecceeccCCcceEEecCCCCCCccccC-------
Confidence            666776  55665   56666677999999999   89999999999986    34662         33332       


Q ss_pred             cCCCCCcCCcccccCCcc--cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCc
Q psy826          155 EKTILCSKCHASCESGCS--TGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCH  232 (329)
Q Consensus       155 ~~~~~C~~C~~~C~~~C~--~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~  232 (329)
                        ...|.. ...  ..|+  +.+.|.| .|.+||.|++ ..|.|+|||.. ..|.++++|.|++|++.|+          
T Consensus       790 --~h~C~i-~g~--a~c~~hGgs~y~C-~CLPGfsGDG-~~c~dvDeC~p-srChp~A~CyntpgsfsC~----------  851 (1289)
T KOG1214|consen  790 --SHTCAI-AGQ--ARCVHHGGSTYSC-ACLPGFSGDG-HQCTDVDECSP-SRCHPAATCYNTPGSFSCR----------  851 (1289)
T ss_pred             --ccccCc-CCc--eEEEecCCceEEE-eecCCccCCc-cccccccccCc-cccCCCceEecCCCcceee----------
Confidence              222222 112  2455  5678999 9999999987 67999999996 7899999999999999999          


Q ss_pred             CCcccccccCcccccccCCccccc
Q psy826          233 GDGPDMCEACAEGYKLQQNICINT  256 (329)
Q Consensus       233 g~~~~~C~~C~~Gy~~~~~~C~~~  256 (329)
                               |.+||.|++.+|++.
T Consensus       852 ---------C~pGy~GDGf~CVP~  866 (1289)
T KOG1214|consen  852 ---------CQPGYYGDGFQCVPD  866 (1289)
T ss_pred             ---------cccCccCCCceecCC
Confidence                     999999999999865


No 3  
>PF11938 DUF3456:  TLR4 regulator and MIR-interacting MSAP;  InterPro: IPR021852  This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 182 to 356 amino acids in length. 
Probab=98.89  E-value=1.6e-10  Score=96.37  Aligned_cols=88  Identities=38%  Similarity=0.684  Sum_probs=73.8

Q ss_pred             cchhhccCCCC-----CCCCchhhhhccCcc-cccceEEEecccCcccCcc-----------------------------
Q psy826            2 SGIEKTAKGNF-----AGGDTAWEEEKQKIY-AKSEVRLIEIQEKMCSEVS-----------------------------   46 (329)
Q Consensus         2 ~~~~~~~~~~~-----~~~~~~~ee~~~~~y-~~se~~~~e~~e~~C~~~~-----------------------------   46 (329)
                      .+|++|..+++     -|+|++|+++++.+| ++||+||+|++|++|....                             
T Consensus        16 ~~l~~~~~~~~v~~~~~r~d~~g~~~~~~~~y~~SE~~l~EvlE~vC~~~~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~   95 (151)
T PF11938_consen   16 HRLAKTDPKKFVDVGGFRLDPDGERKKKKIPYARSELRLVEVLENVCEKMKDYVLHKDKSTGKRVYKRMSTTGGTNPELS   95 (151)
T ss_pred             HHHHHhCCcCceeccccccccccccccccCccccchhHHHHHHHHHccCcccceeeecCCCCceEEEecCCCcCcccccc
Confidence            36889999998     389999998888765 9999999999999999211                             


Q ss_pred             -------------cCccchhhHHhhhHHHHHHhhhhccCCcccCCCccceeecccccCcC
Q psy826           47 -------------GFLDQCHNFAADIESEIEEWWFKVQHSKAKDSDLYTWLCINKLKRCC   93 (329)
Q Consensus        47 -------------~~~~~c~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~C~~~~~~~C   93 (329)
                                   .-.+.|+.+++++|+.|+.||++.+.   . .++..|+|.+..+.||
T Consensus        96 ~~~~~~~~~~~~~~L~~~C~~ilEe~Ed~i~~~~~~~~~---~-~~l~~~LC~~~~~~C~  151 (151)
T PF11938_consen   96 EKDVKVDPSDENKRLKFYCERILEEYEDEIEEWFFSEQD---E-DDLEEWLCSETSKVCC  151 (151)
T ss_pred             cccceeecHHHHhHHHHHHHHHHHHhHHHHHHHHHhcCC---c-cCHHHHhhhcccccCC
Confidence                         12467999999999999999998876   4 6889999998877654


No 4  
>KOG1219|consensus
Probab=98.86  E-value=1.3e-09  Score=116.78  Aligned_cols=101  Identities=30%  Similarity=0.734  Sum_probs=82.4

Q ss_pred             CCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC----cCCCCccccccCcCCCCCcCCcccccCCcc-cCCCCCCccCC
Q psy826          109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN----ECNTGYFQSYKDEKTILCSKCHASCESGCS-TGGPKGCTKCK  183 (329)
Q Consensus       109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~----~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~  183 (329)
                      .+||.++|+|...  -.+.|+|.|++-|+|+.|+    .|..              .||..+  ++|. ..+.+.| .|+
T Consensus      3869 ~npCqhgG~C~~~--~~ggy~CkCpsqysG~~CEi~~epC~s--------------nPC~~G--gtCip~~n~f~C-nC~ 3929 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQ--PKGGYKCKCPSQYSGNHCEIDLEPCAS--------------NPCLTG--GTCIPFYNGFLC-NCP 3929 (4289)
T ss_pred             cCcccCCCEecCC--CCCceEEeCcccccCcccccccccccC--------------CCCCCC--CEEEecCCCeeE-eCC
Confidence            3789999999862  2566899999999999993    4544              577777  7888 6778999 999


Q ss_pred             CCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          184 SGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       184 ~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      .||.|..+.. ..++||.. +.|..++.|+|.+|+|.|.                   |-+||.+.
T Consensus      3930 ~gyTG~~Ce~-~Gi~eCs~-n~C~~gg~C~n~~gsf~Cn-------------------cT~g~~gr 3974 (4289)
T KOG1219|consen 3930 NGYTGKRCEA-RGISECSK-NVCGTGGQCINIPGSFHCN-------------------CTPGILGR 3974 (4289)
T ss_pred             CCccCceeec-cccccccc-ccccCCceeeccCCceEec-------------------cChhHhcc
Confidence            9999976321 12788986 7999999999999999999                   99999876


No 5  
>KOG4260|consensus
Probab=98.68  E-value=1.3e-08  Score=89.64  Aligned_cols=107  Identities=24%  Similarity=0.631  Sum_probs=81.3

Q ss_pred             cCCCCCCCCCCcCCCCccccccCcCCCCCcCCcccccCCcc----cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCC
Q psy826          132 CNKEYTGELCNECNTGYFQSYKDEKTILCSKCHASCESGCS----TGGPKGCTKCKSGWAADKDIGCYDINECSDENICS  207 (329)
Q Consensus       132 C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~----~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~  207 (329)
                      |++|..|+.|..|+.|.-           .+|.++  +.|.    ..|+..| .|..||.|...+            .|.
T Consensus       132 Cp~gtyGpdCl~Cpggse-----------r~C~Gn--G~C~GdGsR~GsGkC-kC~~GY~Gp~C~------------~Cg  185 (350)
T KOG4260|consen  132 CPDGTYGPDCLQCPGGSE-----------RPCFGN--GSCHGDGSREGSGKC-KCETGYTGPLCR------------YCG  185 (350)
T ss_pred             cCCCCcCCccccCCCCCc-----------CCcCCC--CcccCCCCCCCCCcc-cccCCCCCcccc------------ccc
Confidence            888999999999988732           345544  4554    3457799 999999987643            333


Q ss_pred             CCcee-ecCCCCcccCCCcCCCC-CCcCCcccccccCcccccccCCcccccCcCCCCcc
Q psy826          208 GNQFC-VNTEGSYRCMQCDPSCN-GCHGDGPDMCEACAEGYKLQQNICINTQAKSQNTN  264 (329)
Q Consensus       208 ~~~~C-~n~~gs~~C~~C~~~C~-~C~g~~~~~C~~C~~Gy~~~~~~C~~~~~~~~~~~  264 (329)
                      .+.+= ..+.....|.+||.+|. .|+|.++..|.+|+.||.++...|+|++++...+.
T Consensus       186 ~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~  244 (350)
T KOG4260|consen  186 IEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPA  244 (350)
T ss_pred             hHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceecccccccHHHHhcCCC
Confidence            23221 12345678999999997 99999999999999999999999999999876653


No 6  
>KOG0994|consensus
Probab=98.50  E-value=4e-07  Score=93.57  Aligned_cols=188  Identities=23%  Similarity=0.514  Sum_probs=100.1

Q ss_pred             cCccchhhHHhhhHHHHHHhhhhccCCcccCC-Cccceeeccccc-CcCCCCCCCCCCccCCCC-----CCCCCCCCccc
Q psy826           47 GFLDQCHNFAADIESEIEEWWFKVQHSKAKDS-DLYTWLCINKLK-RCCPVDHYGADCKPCLGF-----PNVCFGNGKCK  119 (329)
Q Consensus        47 ~~~~~c~~~~~~~~~~~e~~~~~~~~~~~~~~-~~~~~~C~~~~~-~~C~~G~~G~~C~~C~~~-----~~~C~~~~~C~  119 (329)
                      +.++.|.+++....-.+..--..-.    ..+ .-.+..|..... +-|.|+-.|..|..|.++     +..|.. -.|.
T Consensus       742 ~~~e~C~~l~~~lsa~l~n~a~~Cn----CnptGSlS~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~-CdC~  816 (1758)
T KOG0994|consen  742 KSDEVCQNLDNSLSALLHNGASMCN----CNPTGSLSSVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKA-CDCN  816 (1758)
T ss_pred             CcchhhhhhhhhHHHHHhcCccccc----cCCCccccccccCCCceecccCccccccccccCCcccCcCCccCcc-cccc
Confidence            5677888887776555432221111    111 112234444332 226777667777766442     222210 0121


Q ss_pred             CCC-----CCCCCceeecCCCCCCCCCCcCCCCccccc------cCcCCCCCcCCcccccCCcc-cCCCCCCccCCCCce
Q psy826          120 GNG-----TRKGNGQCVCNKEYTGELCNECNTGYFQSY------KDEKTILCSKCHASCESGCS-TGGPKGCTKCKSGWA  187 (329)
Q Consensus       120 ~~~-----~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~------~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~~G~~  187 (329)
                      ..+     -..-+++|+|.+|..|..|+.|.+|||+-.      .+.-...|.+=.+.|. .|. ...++.|..|..||+
T Consensus       817 ~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi-~CqD~T~G~~CdrCl~Gyy  895 (1758)
T KOG0994|consen  817 SIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCDPITGACI-DCQDSTTGHSCDRCLDGYY  895 (1758)
T ss_pred             ccccccccccccccceeeccccchhhccccCCCccCCCcCccccccCcccccCccccccc-cccccccccchhhhhcccc
Confidence            111     014568999999999999999999999621      1111222222223444 454 455689999999999


Q ss_pred             ecCC----CCcccCCCCCCCCCCCCCc--------eeecCC--CCcccCCCcCCCCCC-cCCcccccccCccccccc---
Q psy826          188 ADKD----IGCYDINECSDENICSGNQ--------FCVNTE--GSYRCMQCDPSCNGC-HGDGPDMCEACAEGYKLQ---  249 (329)
Q Consensus       188 ~~~~----~~C~d~~eC~~~~~C~~~~--------~C~n~~--gs~~C~~C~~~C~~C-~g~~~~~C~~C~~Gy~~~---  249 (329)
                      |++.    ..|.+   |    +|+.+.        .|....  ....|.        | .|....+|..|+++|.|+   
T Consensus       896 GdP~lg~g~~CrP---C----pCP~gp~Sg~~~A~sC~~d~~t~~ivC~--------C~~GY~G~RCe~CA~~~fGnP~~  960 (1758)
T KOG0994|consen  896 GDPRLGSGIGCRP---C----PCPDGPASGRQHADSCYLDTRTQQIVCH--------CQEGYSGSRCEICADNHFGNPSE  960 (1758)
T ss_pred             CCcccCCCCCCCC---C----CCCCCCccchhccccccccccccceeee--------cccCccccchhhhcccccCCccc
Confidence            9882    22432   1    232211        111111  111222        2 366777888899999886   


Q ss_pred             CCcccc
Q psy826          250 QNICIN  255 (329)
Q Consensus       250 ~~~C~~  255 (329)
                      +++|..
T Consensus       961 GGtCq~  966 (1758)
T KOG0994|consen  961 GGTCQK  966 (1758)
T ss_pred             CCcccc
Confidence            667753


No 7  
>KOG4289|consensus
Probab=98.31  E-value=7.6e-07  Score=93.14  Aligned_cols=121  Identities=28%  Similarity=0.692  Sum_probs=81.1

Q ss_pred             cccceEEEecccCcccCcccCccchhhHHhhhHHHHHHhhhh----ccCCc----ccCCCccceeecccccCcCCCCCCC
Q psy826           28 AKSEVRLIEIQEKMCSEVSGFLDQCHNFAADIESEIEEWWFK----VQHSK----AKDSDLYTWLCINKLKRCCPVDHYG   99 (329)
Q Consensus        28 ~~se~~~~e~~e~~C~~~~~~~~~c~~~~~~~~~~~e~~~~~----~~~~~----~~~~~~~~~~C~~~~~~~C~~G~~G   99 (329)
                      +.|-.|++-+.+++|-     .+.|.++....    ...-+.    +..++    +.+.......|.      ||+||+|
T Consensus      1168 ~~sll~VlpfdDniCl-----rEPCenymkCv----svlrFdssapf~~s~s~lfRpi~pvnglrCr------CPpGFTg 1232 (2531)
T KOG4289|consen 1168 AISLLRVLPFDDNICL-----REPCENYMKCV----SVLRFDSSAPFLASDSVLFRPIHPVNGLRCR------CPPGFTG 1232 (2531)
T ss_pred             HhhheeeeeccCchhh-----cchhHHHHhhh----hheeecccCccccccceeeeeccccCceeEe------CCCCCCc
Confidence            4567888888999998     66776654332    111110    00000    011222334455      9999999


Q ss_pred             CCCc----cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC------cCCCCccccccCcCCCCCcCCcccccC
Q psy826          100 ADCK----PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN------ECNTGYFQSYKDEKTILCSKCHASCES  169 (329)
Q Consensus       100 ~~C~----~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~------~C~~g~y~~~~~~~~~~C~~C~~~C~~  169 (329)
                      ..|+    .|  ...+|.+++.|..   +.|.|+|.|.+||+|..|+      .|.+|+              |..+  +
T Consensus      1233 d~CeTeiDlC--Ys~pC~nng~C~s---rEggYtCeCrpg~tGehCEvs~~agrCvpGv--------------C~ng--g 1291 (2531)
T KOG4289|consen 1233 DYCETEIDLC--YSGPCGNNGRCRS---REGGYTCECRPGFTGEHCEVSARAGRCVPGV--------------CKNG--G 1291 (2531)
T ss_pred             ccccchhHhh--hcCCCCCCCceEE---ecCceeEEecCCccccceeeecccCccccce--------------ecCC--C
Confidence            9998    35  4478999999998   8999999999999999994      565553              4444  5


Q ss_pred             Ccc--cCCCCCCccCCCC
Q psy826          170 GCS--TGGPKGCTKCKSG  185 (329)
Q Consensus       170 ~C~--~~~~~~C~~C~~G  185 (329)
                      +|.  ..+.+.| .|+.|
T Consensus      1292 tC~~~~nggf~c-~Cp~g 1308 (2531)
T KOG4289|consen 1292 TCVNLLNGGFCC-HCPYG 1308 (2531)
T ss_pred             EEeecCCCceec-cCCCc
Confidence            676  4566778 88887


No 8  
>KOG1225|consensus
Probab=98.27  E-value=2e-06  Score=84.30  Aligned_cols=40  Identities=40%  Similarity=1.115  Sum_probs=27.3

Q ss_pred             CCCCCCCCCCc--cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826           93 CPVDHYGADCK--PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN  142 (329)
Q Consensus        93 C~~G~~G~~C~--~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~  142 (329)
                      |+.+|+|+.|.  .|+.+   |.+++.|..       +.|+|++||+|..|+
T Consensus       238 c~~~~~g~~c~~~~C~~~---c~~~g~c~~-------G~CIC~~Gf~G~dC~  279 (525)
T KOG1225|consen  238 CPEGYFGPLCSTIYCPGG---CTGRGQCVE-------GRCICPPGFTGDDCD  279 (525)
T ss_pred             cCCceeCCccccccCCCC---CcccceEeC-------CeEeCCCCCcCCCCC
Confidence            77777777776  45443   555566664       677777777777774


No 9  
>KOG0994|consensus
Probab=98.27  E-value=1.9e-06  Score=88.71  Aligned_cols=150  Identities=26%  Similarity=0.602  Sum_probs=86.8

Q ss_pred             CCCCCCCCCCccCCCC------CCCCC--CC-CcccCCCCCCCCceee-cCCCCCCCCCCcCCCCccccccCcCCCCCcC
Q psy826           93 CPVDHYGADCKPCLGF------PNVCF--GN-GKCKGNGTRKGNGQCV-CNKEYTGELCNECNTGYFQSYKDEKTILCSK  162 (329)
Q Consensus        93 C~~G~~G~~C~~C~~~------~~~C~--~~-~~C~~~~~~~gs~~C~-C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~  162 (329)
                      |.+|.||..|..|.++      +.+|.  ++ ..|.     +-++.|+ |..-.+|..|+.|.+||||+..-.....|.|
T Consensus       834 C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd-----~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrP  908 (1758)
T KOG0994|consen  834 CRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCD-----PITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRP  908 (1758)
T ss_pred             eccccchhhccccCCCccCCCcCccccccCcccccC-----ccccccccccccccccchhhhhccccCCcccCCCCCCCC
Confidence            8888888888877652      13443  22 2344     4457886 9888889999999999999877777777887


Q ss_pred             Cc--ccc------cCCcc---cCCCCCCccCCCCceecCCCCcccCCCCCC--CCCCCCCceeecCCCCcccCCCcCC--
Q psy826          163 CH--ASC------ESGCS---TGGPKGCTKCKSGWAADKDIGCYDINECSD--ENICSGNQFCVNTEGSYRCMQCDPS--  227 (329)
Q Consensus       163 C~--~~C------~~~C~---~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~--~~~C~~~~~C~n~~gs~~C~~C~~~--  227 (329)
                      |.  .+=      ..+|.   ......| .|.+||.|.....|.+-.-=..  +..|.. -.|.|+...+.=..|+..  
T Consensus       909 CpCP~gp~Sg~~~A~sC~~d~~t~~ivC-~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~-CeC~~NiD~~d~~aCD~~TG  986 (1758)
T KOG0994|consen  909 CPCPDGPASGRQHADSCYLDTRTQQIVC-HCQEGYSGSRCEICADNHFGNPSEGGTCQK-CECSNNIDLYDPGACDVATG  986 (1758)
T ss_pred             CCCCCCCccchhccccccccccccceee-ecccCccccchhhhcccccCCcccCCcccc-ccccCCcCccCCCccchhhc
Confidence            63  100      01232   1123467 7788887765433322100000  011210 123333333333344443  


Q ss_pred             -CCCC-cCCcccccccCccccccc
Q psy826          228 -CNGC-HGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       228 -C~~C-~g~~~~~C~~C~~Gy~~~  249 (329)
                       |..| ......+|..|++||+|+
T Consensus       987 ~CLkCL~hTeG~hCe~Ck~Gf~Gd 1010 (1758)
T KOG0994|consen  987 ACLKCLYHTEGDHCEHCKDGFYGD 1010 (1758)
T ss_pred             hhhhhhhcccccchhhccccchhH
Confidence             5556 345667899999999986


No 10 
>KOG1214|consensus
Probab=98.09  E-value=4.9e-06  Score=83.49  Aligned_cols=118  Identities=30%  Similarity=0.724  Sum_probs=82.4

Q ss_pred             CCCCCCCCcccCCCCCCCCceeecCCCCCC--CCC---CcCCCCccccccCcCCCCCcCCcccccCCcc-cCCCCCCccC
Q psy826          109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTG--ELC---NECNTGYFQSYKDEKTILCSKCHASCESGCS-TGGPKGCTKC  182 (329)
Q Consensus       109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g--~~C---~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C  182 (329)
                      .+.|..++.|.. + +.-.|+|.|..||.|  ..|   .+|+.++         .+|.   +.  ..|+ .+++++| .|
T Consensus       699 sh~cdt~a~C~p-g-~~~~~tcecs~g~~gdgr~c~d~~eca~~~---------~~CG---p~--s~Cin~pg~~rc-eC  761 (1289)
T KOG1214|consen  699 SHMCDTTARCHP-G-TGVDYTCECSSGYQGDGRNCVDENECATGF---------HRCG---PN--SVCINLPGSYRC-EC  761 (1289)
T ss_pred             CcccCCCccccC-C-CCcceEEEEeeccCCCCCCCCChhhhccCC---------CCCC---CC--ceeecCCCceeE-EE
Confidence            344556666763 2 234589999999985  466   3455432         2233   22  3566 7889999 99


Q ss_pred             CCCceecCC-CCcc------cCCCCCCC-CCCCCCc--eeecCC-CCcccCCCcCCCCCCcCCcccccccCcccccccCC
Q psy826          183 KSGWAADKD-IGCY------DINECSDE-NICSGNQ--FCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQN  251 (329)
Q Consensus       183 ~~G~~~~~~-~~C~------d~~eC~~~-~~C~~~~--~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~  251 (329)
                      ..||...++ .+|+      .++.|..+ +.|...+  .|+... ++|.|.                   |-+||.|++.
T Consensus       762 ~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~-------------------CLPGfsGDG~  822 (1289)
T KOG1214|consen  762 RSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCA-------------------CLPGFSGDGH  822 (1289)
T ss_pred             eecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEe-------------------ecCCccCCcc
Confidence            999998776 4586      35668877 7787544  444444 468998                   9999999999


Q ss_pred             cccccCcCCCC
Q psy826          252 ICINTQAKSQN  262 (329)
Q Consensus       252 ~C~~~~~~~~~  262 (329)
                      .|.++++|..+
T Consensus       823 ~c~dvDeC~ps  833 (1289)
T KOG1214|consen  823 QCTDVDECSPS  833 (1289)
T ss_pred             ccccccccCcc
Confidence            99999988754


No 11 
>KOG1217|consensus
Probab=98.07  E-value=2.5e-05  Score=76.22  Aligned_cols=123  Identities=29%  Similarity=0.755  Sum_probs=85.1

Q ss_pred             cCCCCCCCCCCc----cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCc--------------CCCCcccccc
Q psy826           92 CCPVDHYGADCK----PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNE--------------CNTGYFQSYK  153 (329)
Q Consensus        92 ~C~~G~~G~~C~----~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~--------------C~~g~y~~~~  153 (329)
                      .|..||.+..|.    .|.....+|.+.+.|.+   ..+++.|.|.+||.+..|..              ++.++.+...
T Consensus       155 ~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~---~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c  231 (487)
T KOG1217|consen  155 SCTEGYEGEPCETDLDECIQYSSPCQNGGTCVN---TGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPEC  231 (487)
T ss_pred             eeCCCcccccccccccccccCCCCcCCCccccc---CCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCc
Confidence            399999988776    56555567988899998   67779999999999887742              2333322111


Q ss_pred             CcCCCCCcCCcccccCCcc-cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccC
Q psy826          154 DEKTILCSKCHASCESGCS-TGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCM  222 (329)
Q Consensus       154 ~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~  222 (329)
                      .....   .|... .+.|. ..+.+.| .|++||.+.....|.++++|.....|.++..|++..++|.|.
T Consensus       232 ~~~~~---~~~~~-~~~c~~~~~~~~C-~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~  296 (487)
T KOG1217|consen  232 EVSIV---ECASG-DGTCVNTVGSYTC-RCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCT  296 (487)
T ss_pred             ccccc---cccCC-CCcccccCCceee-eCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceee
Confidence            11111   11100 02454 4566899 999999987633578899999862388889999999889997


No 12 
>KOG1225|consensus
Probab=98.02  E-value=1.1e-05  Score=79.14  Aligned_cols=96  Identities=34%  Similarity=0.936  Sum_probs=69.6

Q ss_pred             cCcCCCCCCCCCCcc--CCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC--cCCCCccccccCcCCCCCcCCcc
Q psy826           90 KRCCPVDHYGADCKP--CLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN--ECNTGYFQSYKDEKTILCSKCHA  165 (329)
Q Consensus        90 ~~~C~~G~~G~~C~~--C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~--~C~~g~y~~~~~~~~~~C~~C~~  165 (329)
                      .+.|++||+|.+|.+  ||..   |++++.|.+       +.|+|++||.|..|.  .|+.               .|++
T Consensus       266 ~CIC~~Gf~G~dC~e~~Cp~~---cs~~g~~~~-------g~CiC~~g~~G~dCs~~~cpa---------------dC~g  320 (525)
T KOG1225|consen  266 RCICPPGFTGDDCDELVCPVD---CSGGGVCVD-------GECICNPGYSGKDCSIRRCPA---------------DCSG  320 (525)
T ss_pred             eEeCCCCCcCCCCCcccCCcc---cCCCceecC-------CEeecCCCccccccccccCCc---------------cCCC
Confidence            345999999999995  7653   887787775       599999999999883  3332               2444


Q ss_pred             cccCCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccc
Q psy826          166 SCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEG  245 (329)
Q Consensus       166 ~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~G  245 (329)
                      .  +.|.   ..+| .|.+||.|..         |... .|+++..|+|    . |.                   |..|
T Consensus       321 ~--G~Ci---~G~C-~C~~Gy~G~~---------C~~~-~C~~~g~cv~----g-C~-------------------C~~G  360 (525)
T KOG1225|consen  321 H--GKCI---DGEC-LCDEGYTGEL---------CIQR-ACSGGGQCVN----G-CK-------------------CKKG  360 (525)
T ss_pred             C--Cccc---CCce-EeCCCCcCCc---------cccc-ccCCCceecc----C-ce-------------------eccC
Confidence            3  4554   3689 9999999875         3332 3766777764    2 77                   9999


Q ss_pred             ccccC
Q psy826          246 YKLQQ  250 (329)
Q Consensus       246 y~~~~  250 (329)
                      |.+.+
T Consensus       361 w~G~d  365 (525)
T KOG1225|consen  361 WRGPD  365 (525)
T ss_pred             ccCCC
Confidence            99875


No 13 
>PF14843 GF_recep_IV:  Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=97.95  E-value=4.1e-06  Score=68.22  Aligned_cols=104  Identities=27%  Similarity=0.740  Sum_probs=61.3

Q ss_pred             CCCceee-cCCCCCCCCC-CcCCCCccccccCcCCCCCcCCccccc-----CCcccCCCCCCccCCCCceecCCCCcccC
Q psy826          125 KGNGQCV-CNKEYTGELC-NECNTGYFQSYKDEKTILCSKCHASCE-----SGCSTGGPKGCTKCKSGWAADKDIGCYDI  197 (329)
Q Consensus       125 ~gs~~C~-C~~G~~g~~C-~~C~~g~y~~~~~~~~~~C~~C~~~C~-----~~C~~~~~~~C~~C~~G~~~~~~~~C~d~  197 (329)
                      ++...|+ |.....+..| ..|+.++|..........|.+||..|.     .+|.+++...|+.|.. |..  ...|   
T Consensus        14 p~~~~C~sC~~~~~~g~Cv~~C~~~~~~~~~~~~~~~C~~Ch~~C~~~~~~~~C~Gp~~~~C~~C~~-~~~--~~~C---   87 (132)
T PF14843_consen   14 PGPDQCLSCRNFSFNGTCVASCPDGYYFDGSYEDSNTCQPCHPECLPSNGSETCTGPGADQCLSCAH-FRD--GGQC---   87 (132)
T ss_dssp             SSTTCBSSBSSEEETTEEESSTSSSSSSSSEEEETTEEEE--TTB---TTSSSBSSSSTTTBSSBSS-EEE--TTEE---
T ss_pred             cChhhccCccCcccCCcchhhcccccccccccccCCcCcccChhhCccccCcCcCCCCcchhhhccc-cCC--CCcc---
Confidence            3334443 4433334555 788876442111123678999999992     4999999999999986 432  2334   


Q ss_pred             CCCCCCCCCCCCce-------eecCCCCcccCCCcCCC-CCCcCCcccccc
Q psy826          198 NECSDENICSGNQF-------CVNTEGSYRCMQCDPSC-NGCHGDGPDMCE  240 (329)
Q Consensus       198 ~eC~~~~~C~~~~~-------C~n~~gs~~C~~C~~~C-~~C~g~~~~~C~  240 (329)
                         +.  .|+.+.+       ... .....|.+||+.| ..|+||.+.+|.
T Consensus        88 ---v~--~CP~~~~~~~~~~~~~~-~~~~~C~~Ch~~C~~gC~Gp~~~~C~  132 (132)
T PF14843_consen   88 ---VS--SCPEGTYGENFPIYYYP-DSNGVCQPCHPSCTQGCTGPSASDCQ  132 (132)
T ss_dssp             ---ES--S--EEEEBTTSEEEEEE--TTSBEEE-STTSTT-BSSSSTTCC-
T ss_pred             ---cC--cCCCCCccccccceecC-CCCCccCCcCccccCcCCCCCcccCC
Confidence               43  6776631       112 3356899999999 899999988873


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.93  E-value=4.6e-06  Score=53.78  Aligned_cols=34  Identities=56%  Similarity=1.188  Sum_probs=31.4

Q ss_pred             cCCCCCCC-CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCcccccc
Q psy826          196 DINECSDE-NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKL  248 (329)
Q Consensus       196 d~~eC~~~-~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~  248 (329)
                      |||||... +.|+.++.|+|+.|+|.|.                   |++||..
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~-------------------C~~Gy~~   35 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCS-------------------CPPGYEL   35 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEE-------------------ESTTEEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEee-------------------CCCCcEE
Confidence            68999987 7898899999999999999                   9999984


No 15 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=97.72  E-value=0.00027  Score=73.51  Aligned_cols=62  Identities=29%  Similarity=0.741  Sum_probs=43.4

Q ss_pred             CCceee-cCCCCCC--------CCC---CcCCCCccccccCcCCCCCcCCc-ccccCCcccCCCCCCccCCCCceecCCC
Q psy826          126 GNGQCV-CNKEYTG--------ELC---NECNTGYFQSYKDEKTILCSKCH-ASCESGCSTGGPKGCTKCKSGWAADKDI  192 (329)
Q Consensus       126 gs~~C~-C~~G~~g--------~~C---~~C~~g~y~~~~~~~~~~C~~C~-~~C~~~C~~~~~~~C~~C~~G~~~~~~~  192 (329)
                      +...|. |..+|.-        ..|   ..|.+|||.   +  ...|.+|+ .+|+ +|..  ...|+.|..+|++..++
T Consensus       590 ~~~~C~sC~~g~yL~~~~~~~~g~Cv~~~~C~~GYY~---d--~~~C~~C~~~~C~-tC~~--~~~C~~C~~~~~ls~~~  661 (800)
T PTZ00214        590 GPGRCTRCPSDKLLKRASGAATGSCVDPGACVDGYYA---D--GDACLPCATPGCK-TCGH--ASFCTECAGELFVSLDG  661 (800)
T ss_pred             CCCcceECCCCCEeccccccccccccCcccCCCCccc---C--CCccccCCccccc-cccC--CCCcCcCCCCceecCcC
Confidence            345564 7777742        134   489999994   3  24699999 7998 8863  35899999998886644


Q ss_pred             C-cc
Q psy826          193 G-CY  195 (329)
Q Consensus       193 ~-C~  195 (329)
                      . |+
T Consensus       662 ~~C~  665 (800)
T PTZ00214        662 QSCL  665 (800)
T ss_pred             Cchh
Confidence            3 64


No 16 
>KOG4289|consensus
Probab=97.71  E-value=3.6e-05  Score=81.19  Aligned_cols=92  Identities=24%  Similarity=0.687  Sum_probs=66.7

Q ss_pred             CCCceeecCCCCCCCCCC----cCCCCccccccCcCCCCCcCCcccccCCcc-cCCCCCCccCCCCceecCCCCcccCCC
Q psy826          125 KGNGQCVCNKEYTGELCN----ECNTGYFQSYKDEKTILCSKCHASCESGCS-TGGPKGCTKCKSGWAADKDIGCYDINE  199 (329)
Q Consensus       125 ~gs~~C~C~~G~~g~~C~----~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~~G~~~~~~~~C~d~~e  199 (329)
                      .+..+|.|++||+|+.|+    .|-.              .+|.++  ++|. ..++|.| .|.+||.|..+..=...-.
T Consensus      1219 vnglrCrCPpGFTgd~CeTeiDlCYs--------------~pC~nn--g~C~srEggYtC-eCrpg~tGehCEvs~~agr 1281 (2531)
T KOG4289|consen 1219 VNGLRCRCPPGFTGDYCETEIDLCYS--------------GPCGNN--GRCRSREGGYTC-ECRPGFTGEHCEVSARAGR 1281 (2531)
T ss_pred             cCceeEeCCCCCCcccccchhHhhhc--------------CCCCCC--CceEEecCceeE-EecCCccccceeeecccCc
Confidence            567899999999999993    4433              356555  5777 7788999 9999999976431112223


Q ss_pred             CCCCCCCCCCceeecCC-CCcccCCCcCCCCCCcCCcccccccCcccccccCCccc
Q psy826          200 CSDENICSGNQFCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICI  254 (329)
Q Consensus       200 C~~~~~C~~~~~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~  254 (329)
                      |.. +.|.++..|++.. |.+.|.                   |+.| .-.+..|.
T Consensus      1282 Cvp-GvC~nggtC~~~~nggf~c~-------------------Cp~g-e~e~prC~ 1316 (2531)
T KOG4289|consen 1282 CVP-GVCKNGGTCVNLLNGGFCCH-------------------CPYG-EFEDPRCE 1316 (2531)
T ss_pred             ccc-ceecCCCEEeecCCCceecc-------------------CCCc-ccCCCceE
Confidence            555 6899999999886 778888                   9988 44455665


No 17 
>KOG1836|consensus
Probab=97.65  E-value=0.0002  Score=79.17  Aligned_cols=96  Identities=25%  Similarity=0.516  Sum_probs=63.2

Q ss_pred             CcCCCCCCCCCCccCCCC-------------CCCCCCCC---cccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccC
Q psy826           91 RCCPVDHYGADCKPCLGF-------------PNVCFGNG---KCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKD  154 (329)
Q Consensus        91 ~~C~~G~~G~~C~~C~~~-------------~~~C~~~~---~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~  154 (329)
                      +.|++||.|..|+.|.++             +-+|.-++   +|.     ..++.|.|.+...|..|+.|.+|||+....
T Consensus       697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd-----~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~  771 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICD-----PRTGQCKCKHNTFGGQCAQCVDGFYGLPDL  771 (1705)
T ss_pred             ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCcccccc-----CCCCceecccCCCCCchhhhcCCCCCcccc
Confidence            349999999999988542             01233222   343     567899999999999999999999986555


Q ss_pred             cCCCCCcCCc--c--cccCCcccCCCCCCccCCCCceecCCC
Q psy826          155 EKTILCSKCH--A--SCESGCSTGGPKGCTKCKSGWAADKDI  192 (329)
Q Consensus       155 ~~~~~C~~C~--~--~C~~~C~~~~~~~C~~C~~G~~~~~~~  192 (329)
                      .....|.+|+  .  .|..++. .....|+.|++||+|....
T Consensus       772 ~~~~dC~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gytG~rCe  812 (1705)
T KOG1836|consen  772 GTSGDCQPCPCPNGGACGQTPE-ILEVVCKNCPPGYTGLRCE  812 (1705)
T ss_pred             CCCCCCccCCCCCChhhcCcCc-ccceecCCCCCCCcccccc
Confidence            4444577664  2  2222221 3346775577777776643


No 18 
>KOG1219|consensus
Probab=97.61  E-value=5.2e-05  Score=83.00  Aligned_cols=81  Identities=30%  Similarity=0.706  Sum_probs=62.5

Q ss_pred             CCCCCCCCCCc----cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCCCCCcCCccccc
Q psy826           93 CPVDHYGADCK----PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKTILCSKCHASCE  168 (329)
Q Consensus        93 C~~G~~G~~C~----~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~  168 (329)
                      ||+-|.|..|+    +|.  ++||..+++|..   ..+.|.|.|+.||+|..|+.  .|     .++++.  .+|..+  
T Consensus      3890 CpsqysG~~CEi~~epC~--snPC~~GgtCip---~~n~f~CnC~~gyTG~~Ce~--~G-----i~eCs~--n~C~~g-- 3953 (4289)
T KOG1219|consen 3890 CPSQYSGNHCEIDLEPCA--SNPCLTGGTCIP---FYNGFLCNCPNGYTGKRCEA--RG-----ISECSK--NVCGTG-- 3953 (4289)
T ss_pred             CcccccCccccccccccc--CCCCCCCCEEEe---cCCCeeEeCCCCccCceeec--cc-----cccccc--ccccCC--
Confidence            99999999998    674  478999999997   67789999999999999942  11     122111  245555  


Q ss_pred             CCcc-cCCCCCCccCCCCceecC
Q psy826          169 SGCS-TGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       169 ~~C~-~~~~~~C~~C~~G~~~~~  190 (329)
                      +.|. .+|++.| .|-+||.|..
T Consensus      3954 g~C~n~~gsf~C-ncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3954 GQCINIPGSFHC-NCTPGILGRT 3975 (4289)
T ss_pred             ceeeccCCceEe-ccChhHhccc
Confidence            6787 7889999 9999998765


No 19 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=97.52  E-value=0.00032  Score=67.59  Aligned_cols=33  Identities=33%  Similarity=0.962  Sum_probs=29.9

Q ss_pred             CcccCCCcCCCCCCcCCcccccccCcccccccC
Q psy826          218 SYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQ  250 (329)
Q Consensus       218 s~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~  250 (329)
                      +..|.+||+.|..|+++.+.+|++|++||.+..
T Consensus       104 ~~~C~~C~~~C~~C~~~~~~~Ct~C~~g~~L~~  136 (397)
T PF03302_consen  104 GNKCVPCHESCATCSGGAPNQCTSCKPGKVLKY  136 (397)
T ss_pred             CCCCCCCCccccccCCCCCCCCcccCCCccccc
Confidence            458999999999999999999999999998763


No 20 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=97.49  E-value=0.00041  Score=72.16  Aligned_cols=106  Identities=24%  Similarity=0.625  Sum_probs=69.0

Q ss_pred             CCCcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCc
Q psy826          140 LCNECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSY  219 (329)
Q Consensus       140 ~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~  219 (329)
                      .|..|.+|||.   + ....|..|+.+|. .|.+++...|++|..+|++..... .....|.....|..+.|  .  ...
T Consensus       560 ~Ct~C~~GYyl---~-~~~~C~~Cd~~C~-~C~~~~~~~C~sC~~g~yL~~~~~-~~~g~Cv~~~~C~~GYY--~--d~~  629 (800)
T PTZ00214        560 KCTTTKKGYGI---S-PDGKLLECDPTCL-ACTAPGPGRCTRCPSDKLLKRASG-AATGSCVDPGACVDGYY--A--DGD  629 (800)
T ss_pred             cccccccCccc---C-CCCccccccchhh-hccCCCCCcceECCCCCEeccccc-cccccccCcccCCCCcc--c--CCC
Confidence            56788999984   2 2245889999998 898877889999999999875211 11112322135555555  1  234


Q ss_pred             ccCCCc-CCCCCCcCCcccccccCccccccc--CCcccccC
Q psy826          220 RCMQCD-PSCNGCHGDGPDMCEACAEGYKLQ--QNICINTQ  257 (329)
Q Consensus       220 ~C~~C~-~~C~~C~g~~~~~C~~C~~Gy~~~--~~~C~~~~  257 (329)
                      .|.+|+ +.|..|..  ...|++|..+|++.  +..|+..+
T Consensus       630 ~C~~C~~~~C~tC~~--~~~C~~C~~~~~ls~~~~~C~~~C  668 (800)
T PTZ00214        630 ACLPCATPGCKTCGH--ASFCTECAGELFVSLDGQSCLEEC  668 (800)
T ss_pred             ccccCCccccccccC--CCCcCcCCCCceecCcCCchhhcC
Confidence            788998 68888864  34788887776643  34455433


No 21 
>PF14843 GF_recep_IV:  Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=97.49  E-value=7.4e-05  Score=60.81  Aligned_cols=85  Identities=29%  Similarity=0.840  Sum_probs=54.6

Q ss_pred             Cccccc-CCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeec-CCCCcccCCCcCCC------CCCcCC
Q psy826          163 CHASCE-SGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVN-TEGSYRCMQCDPSC------NGCHGD  234 (329)
Q Consensus       163 C~~~C~-~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n-~~gs~~C~~C~~~C------~~C~g~  234 (329)
                      ||+.|. .+|+++++..|+.|+. |...  +.|+      .  .|+...+=.+ ......|.+||+.|      .+|+||
T Consensus         2 C~~~C~~~gC~Gp~~~~C~sC~~-~~~~--g~Cv------~--~C~~~~~~~~~~~~~~~C~~Ch~~C~~~~~~~~C~Gp   70 (132)
T PF14843_consen    2 CHPQCASDGCWGPGPDQCLSCRN-FSFN--GTCV------A--SCPDGYYFDGSYEDSNTCQPCHPECLPSNGSETCTGP   70 (132)
T ss_dssp             -CTTBGTT-BSSSSTTCBSSBSS-EEET--TEEE------S--STSSSSSSSSEEEETTEEEE--TTB---TTSSSBSSS
T ss_pred             cccccCCCCccCcChhhccCccC-cccC--Ccch------h--hcccccccccccccCCcCcccChhhCccccCcCcCCC
Confidence            455554 4789999999999985 4443  4454      2  5543311000 11267899999999      999999


Q ss_pred             cccccccCcccccccCCcccccCcCC
Q psy826          235 GPDMCEACAEGYKLQQNICINTQAKS  260 (329)
Q Consensus       235 ~~~~C~~C~~Gy~~~~~~C~~~~~~~  260 (329)
                      ++.+|++|+. | .+++.|+..++..
T Consensus        71 ~~~~C~~C~~-~-~~~~~Cv~~CP~~   94 (132)
T PF14843_consen   71 GADQCLSCAH-F-RDGGQCVSSCPEG   94 (132)
T ss_dssp             STTTBSSBSS-E-EETTEEESS--EE
T ss_pred             Ccchhhhccc-c-CCCCcccCcCCCC
Confidence            9999999996 3 3577898776544


No 22 
>KOG1836|consensus
Probab=97.31  E-value=0.00085  Score=74.29  Aligned_cols=92  Identities=33%  Similarity=0.764  Sum_probs=61.3

Q ss_pred             cCCCCCCCC-------CCccCCCCCCCCCCCCcccCCCCCCCCceee-cCCCCCCCCCCcCCCCccccccCcC--CCCCc
Q psy826           92 CCPVDHYGA-------DCKPCLGFPNVCFGNGKCKGNGTRKGNGQCV-CNKEYTGELCNECNTGYFQSYKDEK--TILCS  161 (329)
Q Consensus        92 ~C~~G~~G~-------~C~~C~~~~~~C~~~~~C~~~~~~~gs~~C~-C~~G~~g~~C~~C~~g~y~~~~~~~--~~~C~  161 (329)
                      .|..||||.       +|++|+     |.+.+.|.... ....+.|. |++||+|..|+.|.+|||+......  ...|.
T Consensus       760 ~C~~GfYg~~~~~~~~dC~~C~-----Cp~~~~~~~~~-~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~  833 (1705)
T KOG1836|consen  760 QCVDGFYGLPDLGTSGDCQPCP-----CPNGGACGQTP-EILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQ  833 (1705)
T ss_pred             hhcCCCCCccccCCCCCCccCC-----CCCChhhcCcC-cccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCc
Confidence            378888843       377664     44445565422 14568898 9999999999999999998554332  34455


Q ss_pred             CCc-----------------ccccCCcc-cCCCCCCccCCCCceecC
Q psy826          162 KCH-----------------ASCESGCS-TGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       162 ~C~-----------------~~C~~~C~-~~~~~~C~~C~~G~~~~~  190 (329)
                      +|.                 ..|. .|. +.....|..|.+||.+++
T Consensus       834 ~c~c~~n~dp~~~g~c~~~tg~c~-~ci~nT~g~~cd~c~~g~~gd~  879 (1705)
T KOG1836|consen  834 SCQCNFNVDPNAFGNCNRLTGECL-KCIHNTAGEYCDLCKEGYFGDP  879 (1705)
T ss_pred             cceeccccCcccccccccccccee-eccCCcccccccccccCccccc
Confidence            442                 1232 344 344568889999999876


No 23 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=97.28  E-value=0.0024  Score=61.69  Aligned_cols=106  Identities=34%  Similarity=0.826  Sum_probs=66.2

Q ss_pred             CCCcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCC-----CCcccCCCCCCC---CCCCCCce
Q psy826          140 LCNECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKD-----IGCYDINECSDE---NICSGNQF  211 (329)
Q Consensus       140 ~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~-----~~C~d~~eC~~~---~~C~~~~~  211 (329)
                      .|..|++|||.     ....|.+||.+|. +|.+.....|+.|.+||.+...     ..|.  .+|...   ..|.   .
T Consensus        92 ~c~~C~~G~y~-----~~~~C~~C~~~C~-~C~~~~~~~Ct~C~~g~~L~~~~~~~~~tC~--~~c~~~~~~g~C~---~  160 (397)
T PF03302_consen   92 CCSECPDGYYK-----NGNKCVPCHESCA-TCSGGAPNQCTSCKPGKVLKYGTDGTKGTCG--AECTEDCASGACK---A  160 (397)
T ss_pred             cccCCCCCccc-----cCCCCCCCCcccc-ccCCCCCCCCcccCCCcccccccCCCcCccc--cccccccccCCcC---e
Confidence            56899999994     3457999999998 9997777899999999988762     1232  233322   2231   2


Q ss_pred             eec-CCCCcccCCCcCC-----CCC-----------Cc--CCcccccccCcccccccCCccccc
Q psy826          212 CVN-TEGSYRCMQCDPS-----CNG-----------CH--GDGPDMCEACAEGYKLQQNICINT  256 (329)
Q Consensus       212 C~n-~~gs~~C~~C~~~-----C~~-----------C~--g~~~~~C~~C~~Gy~~~~~~C~~~  256 (329)
                      |.- ..+...|.+|...     -..           |.  ......|++|..+|+|..+-|...
T Consensus       161 C~~~i~g~~yCs~C~~~~~~PinG~C~~~~~~~~~~C~~~~~~~G~Ct~C~~~yFl~~GGCY~~  224 (397)
T PF03302_consen  161 CGATIGGTKYCSQCKTGTEVPINGVCTSNTARATATCTDGTASDGTCTSCTDGYFLYKGGCYST  224 (397)
T ss_pred             eeeEeCCccccccCCCCCcCCcCCccCCCccccccccccccCCCCcccccccchhhhcCccccc
Confidence            221 2244556666533     122           22  223467888888888765555443


No 24 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.09  E-value=0.0002  Score=44.40  Aligned_cols=31  Identities=42%  Similarity=0.885  Sum_probs=25.4

Q ss_pred             CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCcccccccCCcc
Q psy826          204 NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNIC  253 (329)
Q Consensus       204 ~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C  253 (329)
                      +.|++++.|+++.++++|.                   |++||.+++..|
T Consensus         6 ~~C~~nA~C~~~~~~~~C~-------------------C~~Gy~GdG~~C   36 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCT-------------------CKPGYEGDGFFC   36 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEE-------------------E-CEEECCSTCE
T ss_pred             CCCCCCcEeecCCCCEEeE-------------------CCCCCccCCcCC
Confidence            5788899999999999999                   999999998765


No 25 
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=97.03  E-value=0.00054  Score=45.53  Aligned_cols=38  Identities=37%  Similarity=1.039  Sum_probs=33.5

Q ss_pred             CCcCCCCCCcCCcccccccCcccccccCCcccccCcCC
Q psy826          223 QCDPSCNGCHGDGPDMCEACAEGYKLQQNICINTQAKS  260 (329)
Q Consensus       223 ~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~~~~  260 (329)
                      +||+.|.+|+++++..|++|+.||.+++++|++.++..
T Consensus         1 ~C~~~C~~C~g~~~~~C~~C~~~~~~~~~~Cv~~C~~~   38 (49)
T cd00064           1 PCHPSCATCTGPGPDQCTSCRHGFYLDGGTCVSECPEG   38 (49)
T ss_pred             CCCCccCCCcCcCcCcCccCcCccCCCCCcccccCCCC
Confidence            58999999999999999999999999899999865443


No 26 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.96  E-value=0.00042  Score=38.67  Aligned_cols=23  Identities=35%  Similarity=0.771  Sum_probs=19.1

Q ss_pred             CCCCccCCCCceecCCC-CcccCCC
Q psy826          176 PKGCTKCKSGWAADKDI-GCYDINE  199 (329)
Q Consensus       176 ~~~C~~C~~G~~~~~~~-~C~d~~e  199 (329)
                      +|.| .|++||.+.+++ .|+||||
T Consensus         1 sy~C-~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTC-SCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEe-eCCCCCcCCCCCCccccCCC
Confidence            4789 999999987654 5999986


No 27 
>smart00261 FU Furin-like repeats.
Probab=96.95  E-value=0.00085  Score=43.89  Aligned_cols=39  Identities=36%  Similarity=1.009  Sum_probs=34.7

Q ss_pred             cccCCCcCCCCCCcCCcccccccCcccccccCCcccccC
Q psy826          219 YRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICINTQ  257 (329)
Q Consensus       219 ~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~  257 (329)
                      ..|.+|++.|..|.++++..|++|..||.+..+.|+..+
T Consensus         2 ~~C~~C~~~C~~C~~~~~~~C~~C~~~~~l~~~~Cv~~C   40 (46)
T smart00261        2 GECKPCHPECATCTGPGPDDCTSCKHGFFLDGGTCVSEC   40 (46)
T ss_pred             CccccccccccccCCcCcCcCccCCcccCCCCCCccCcC
Confidence            358899999999999999999999999999888888744


No 28 
>KOG1217|consensus
Probab=96.88  E-value=0.0058  Score=59.51  Aligned_cols=133  Identities=33%  Similarity=0.727  Sum_probs=84.5

Q ss_pred             cCCCCCCCCCC------ccCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCC--CCCcCC
Q psy826           92 CCPVDHYGADC------KPCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKT--ILCSKC  163 (329)
Q Consensus        92 ~C~~G~~G~~C------~~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~--~~C~~C  163 (329)
                      .|++||.+..+      ..|..... |.+++.|.+   ..+.+.|.|++||.|..|..|...      +...  ..-.+|
T Consensus       255 ~~~~g~~~~~~~~~~~~~~C~~~~~-c~~~~~C~~---~~~~~~C~C~~g~~g~~~~~~~~~------~~C~~~~~~~~c  324 (487)
T KOG1217|consen  255 RCPEGYTGDACVTCVDVDSCALIAS-CPNGGTCVN---VPGSYRCTCPPGFTGRLCTECVDV------DECSPRNAGGPC  324 (487)
T ss_pred             eCCCCccccccceeeeccccCCCCc-cCCCCeeec---CCCcceeeCCCCCCCCCCcccccc------ccccccccCCcC
Confidence            37889987763      35655443 888899997   566699999999999987222211      0000  011224


Q ss_pred             cccccCCcc---cCCCCCCccCCCCceecCCCCcccCC-CCCCCCCCCCCceeec-CCCCcccCCCcCCCCCCcCCcccc
Q psy826          164 HASCESGCS---TGGPKGCTKCKSGWAADKDIGCYDIN-ECSDENICSGNQFCVN-TEGSYRCMQCDPSCNGCHGDGPDM  238 (329)
Q Consensus       164 ~~~C~~~C~---~~~~~~C~~C~~G~~~~~~~~C~d~~-eC~~~~~C~~~~~C~n-~~gs~~C~~C~~~C~~C~g~~~~~  238 (329)
                      ..+  ..|.   ..+.+.| .|..||.+   ..|++.+ +|.. ..|..+..|.+ ..+++.|.                
T Consensus       325 ~~g--~~C~~~~~~~~~~C-~c~~~~~g---~~C~~~~~~C~~-~~~~~~~~c~~~~~~~~~c~----------------  381 (487)
T KOG1217|consen  325 ANG--GTCNTLGSFGGFRC-ACGPGFTG---RRCEDSNDECAS-SPCCPGGTCVNETPGSYRCA----------------  381 (487)
T ss_pred             CCC--cccccCCCCCCCCc-CCCCCCCC---CccccCCccccC-CccccCCEeccCCCCCeEec----------------
Confidence            433  2442   3345678 88888544   4577774 8887 34677888998 68899998                


Q ss_pred             cccCccccccc----CCcccccCcCC
Q psy826          239 CEACAEGYKLQ----QNICINTQAKS  260 (329)
Q Consensus       239 C~~C~~Gy~~~----~~~C~~~~~~~  260 (329)
                         |+.+|.+.    ...+.+..++.
T Consensus       382 ---~~~~~~~~~~~~~~~~~~~~~c~  404 (487)
T KOG1217|consen  382 ---CPAGFAGKANGDGVGCEDIDECS  404 (487)
T ss_pred             ---CCCccccCCcccccccccccccc
Confidence               88887752    23455555554


No 29 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.56  E-value=0.0009  Score=42.97  Aligned_cols=31  Identities=32%  Similarity=0.754  Sum_probs=27.2

Q ss_pred             cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCC
Q psy826          104 PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYT  137 (329)
Q Consensus       104 ~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~  137 (329)
                      +|....+.|..++.|+|   +.|+|.|.|++||.
T Consensus         4 EC~~~~~~C~~~~~C~N---~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    4 ECAEGPHNCPENGTCVN---TEGSYSCSCPPGYE   34 (42)
T ss_dssp             TTTTTSSSSSTTSEEEE---ETTEEEEEESTTEE
T ss_pred             ccCCCCCcCCCCCEEEc---CCCCEEeeCCCCcE
Confidence            56666678988999999   99999999999997


No 30 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.48  E-value=0.0031  Score=39.20  Aligned_cols=33  Identities=55%  Similarity=1.142  Sum_probs=26.7

Q ss_pred             cCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccc
Q psy826          196 DINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYK  247 (329)
Q Consensus       196 d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~  247 (329)
                      ++++|....+|..+..|+++.++|.|.                   |++||.
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~-------------------C~~g~~   33 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCE-------------------CPPGYT   33 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeE-------------------CCCCCc
Confidence            356776434787778999999999999                   999998


No 31 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.41  E-value=0.0043  Score=37.31  Aligned_cols=27  Identities=48%  Similarity=1.206  Sum_probs=22.8

Q ss_pred             CCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826          110 NVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC  141 (329)
Q Consensus       110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C  141 (329)
                      ..|.++|+|..    . .++|+|.+||+|..|
T Consensus         6 ~~C~~~G~C~~----~-~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVS----P-CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeC----C-CCEEECCCCCcCCCC
Confidence            46999999995    2 589999999999876


No 32 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.41  E-value=0.00082  Score=40.48  Aligned_cols=28  Identities=36%  Similarity=0.778  Sum_probs=24.1

Q ss_pred             CCCCCCCCcccCCCCCC-CCceeecCCCCCCC
Q psy826          109 PNVCFGNGKCKGNGTRK-GNGQCVCNKEYTGE  139 (329)
Q Consensus       109 ~~~C~~~~~C~~~~~~~-gs~~C~C~~G~~g~  139 (329)
                      +++|.++|+|+.   .. ++|.|.|++||+|.
T Consensus         3 ~~~C~n~g~C~~---~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    3 SNPCQNGGTCID---LPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTSSTTTEEEEE---ESTSEEEEEEBTTEEST
T ss_pred             CCcCCCCeEEEe---CCCCCEEeECCCCCccC
Confidence            357999999997   55 88999999999985


No 33 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.32  E-value=0.0011  Score=41.09  Aligned_cols=29  Identities=28%  Similarity=0.700  Sum_probs=22.3

Q ss_pred             CCCCCCCCCcccCCCCCCCCceeecCCCCCCC
Q psy826          108 FPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGE  139 (329)
Q Consensus       108 ~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~  139 (329)
                      ....|..++.|++   +.+++.|.|++||.|+
T Consensus         4 ~~~~C~~nA~C~~---~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    4 NNGGCHPNATCTN---TGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGGGS-TTCEEEE----TTSEEEEE-CEEECC
T ss_pred             CCCCCCCCcEeec---CCCCEEeECCCCCccC
Confidence            3456889999998   7889999999999975


No 34 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.31  E-value=0.0052  Score=38.12  Aligned_cols=29  Identities=38%  Similarity=0.877  Sum_probs=25.1

Q ss_pred             CCCCCCCcccCCCCCCCCceeecCCCCC-CCCC
Q psy826          110 NVCFGNGKCKGNGTRKGNGQCVCNKEYT-GELC  141 (329)
Q Consensus       110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~-g~~C  141 (329)
                      .+|.+++.|.+   +.++|.|.|.+||. |..|
T Consensus         9 ~~C~~~~~C~~---~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        9 NPCQNGGTCVN---TVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             CCcCCCCEeEC---CCCCeEeECCCCCccCCcC
Confidence            56888889998   78899999999999 8776


No 35 
>KOG1226|consensus
Probab=96.29  E-value=0.017  Score=58.65  Aligned_cols=87  Identities=34%  Similarity=0.885  Sum_probs=49.7

Q ss_pred             CcCCCCCCCCCCccCCC----------------CCCCCCCCCcccCCCCCCCCceeecCCCCC----CCCCC----cCCC
Q psy826           91 RCCPVDHYGADCKPCLG----------------FPNVCFGNGKCKGNGTRKGNGQCVCNKEYT----GELCN----ECNT  146 (329)
Q Consensus        91 ~~C~~G~~G~~C~~C~~----------------~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~----g~~C~----~C~~  146 (329)
                      +.|.+||+|..|+ |+.                ...+|+++|.|.=       ++|+|.+...    |.+|+    .|+.
T Consensus       480 C~C~~G~~G~~CE-C~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-------GqC~C~~~~~~~i~G~fCECDnfsC~r  551 (783)
T KOG1226|consen  480 CRCDEGWLGKKCE-CSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-------GQCVCHKPDNGKIYGKFCECDNFSCER  551 (783)
T ss_pred             eecCCCCCCCccc-CCccccCcHhHHhhccCCCCCCCcCCCCcEeC-------CceEecCCCCCceeeeeeeccCccccc
Confidence            4599999999885 322                1125666677663       6788988766    77772    2221


Q ss_pred             CccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCC
Q psy826          147 GYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSD  202 (329)
Q Consensus       147 g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~  202 (329)
                              .....|.- |+.|.  |     .+| .|.+||.|...+-=.+.+.|..
T Consensus       552 --------~~g~lC~g-~G~C~--C-----G~C-vC~~GwtG~~C~C~~std~C~~  590 (783)
T KOG1226|consen  552 --------HKGVLCGG-HGRCE--C-----GRC-VCNPGWTGSACNCPLSTDTCES  590 (783)
T ss_pred             --------ccCcccCC-CCeEe--C-----CcE-EcCCCCccCCCCCCCCCccccC
Confidence                    01122222 22222  2     478 9999999987421133444544


No 36 
>KOG4052|consensus
Probab=96.26  E-value=0.00015  Score=59.37  Aligned_cols=58  Identities=40%  Similarity=0.839  Sum_probs=45.6

Q ss_pred             cccccceEEEecccCcccC----------------------------------------------------cccCccchh
Q psy826           26 IYAKSEVRLIEIQEKMCSE----------------------------------------------------VSGFLDQCH   53 (329)
Q Consensus        26 ~y~~se~~~~e~~e~~C~~----------------------------------------------------~~~~~~~c~   53 (329)
                      .|.+||.||.|+.|++|..                                                    +..-.-+|.
T Consensus        76 ~Y~rselrl~E~tEniCer~ley~~hker~gs~rfaraqsqt~~tlk~l~~KGvKvdlgip~elwd~psvevt~lk~qce  155 (190)
T KOG4052|consen   76 LYQRSELRLAEITENICERFLEYKIHKERTGSERFARAQSQTIETLKGLVEKGVKVDLGIPFELWDQPSVEVTALKQQCE  155 (190)
T ss_pred             eeehhHhHHHHHHHHHHHHHHHHhhhccCcchhHHHHHhhhHHHHHHHHHHcCceeeeCCchHhhcCCcHHHHHHHHHHH
Confidence            7999999999999999980                                                    001245799


Q ss_pred             hHHhhhHHHHHHhhhhccCCcccCCCccceeecc
Q psy826           54 NFAADIESEIEEWWFKVQHSKAKDSDLYTWLCIN   87 (329)
Q Consensus        54 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~C~~   87 (329)
                      ++++++|+.|+.|+++.+.    ..++..++|..
T Consensus       156 ~lleeyed~i~ewyf~hq~----e~~l~k~lce~  185 (190)
T KOG4052|consen  156 SLLEEYEDLIEEWYFNHQS----EEDLFKQLCER  185 (190)
T ss_pred             HHHHHHHHHHHHHHhcccc----HHHHHHHHHHh
Confidence            9999999999999999885    23456666653


No 37 
>smart00051 DSL delta serrate ligand.
Probab=95.94  E-value=0.0091  Score=41.92  Aligned_cols=44  Identities=30%  Similarity=0.678  Sum_probs=32.5

Q ss_pred             CcCCCCCCCCCCc-cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826           91 RCCPVDHYGADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC  141 (329)
Q Consensus        91 ~~C~~G~~G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C  141 (329)
                      +.|+++|||..|. .|... +.+.++..|..      .+.|+|.+||+|..|
T Consensus        19 v~C~~~~yG~~C~~~C~~~-~d~~~~~~Cd~------~G~~~C~~Gw~G~~C   63 (63)
T smart00051       19 VTCDENYYGEGCNKFCRPR-DDFFGHYTCDE------NGNKGCLEGWMGPYC   63 (63)
T ss_pred             eeCCCCCcCCccCCEeCcC-ccccCCccCCc------CCCEecCCCCcCCCC
Confidence            4499999999997 55431 23566777852      378999999999876


No 38 
>KOG1025|consensus
Probab=95.92  E-value=0.0064  Score=62.39  Aligned_cols=83  Identities=29%  Similarity=0.716  Sum_probs=56.2

Q ss_pred             Cccccc-CCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCccccccc
Q psy826          163 CHASCE-SGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEA  241 (329)
Q Consensus       163 C~~~C~-~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~  241 (329)
                      |+..|. ..|+++++.+|.+|+.--.+   +.|++  .|...    .+..  +-..+..|..||+.|.+|+|+++.+|..
T Consensus       489 C~~lCs~~GCWGpgp~qClsCrn~~rg---g~CVe--~C~~l----~g~~--rf~~~~~C~~ChPEC~TCnG~G~d~C~~  557 (1177)
T KOG1025|consen  489 CDKLCSSGGCWGPGPDQCLSCRNFSRG---GTCVE--KCNLL----GGEP--RFVNSRECERCHPECETCNGPGADNCLQ  557 (1177)
T ss_pred             hhhhcCCCCCcCCCCccceeccccccC---ceehh--hcccc----CCcc--cccccceecccChhhccCCCCCccchhh
Confidence            555553 36889999999888764333   34532  23221    1111  1244678999999999999999999999


Q ss_pred             CcccccccCCcccccCc
Q psy826          242 CAEGYKLQQNICINTQA  258 (329)
Q Consensus       242 C~~Gy~~~~~~C~~~~~  258 (329)
                      |+.  ..++-.|+..++
T Consensus       558 CaH--f~dgp~CV~~CP  572 (1177)
T KOG1025|consen  558 CAH--FRDGPHCVSDCP  572 (1177)
T ss_pred             hhh--cCCCcchhccCC
Confidence            986  556667876655


No 39 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=95.84  E-value=0.012  Score=39.27  Aligned_cols=27  Identities=41%  Similarity=1.051  Sum_probs=24.0

Q ss_pred             CCCceeecCCCCCCCCCCcCCCCcccc
Q psy826          125 KGNGQCVCNKEYTGELCNECNTGYFQS  151 (329)
Q Consensus       125 ~gs~~C~C~~G~~g~~C~~C~~g~y~~  151 (329)
                      ..+++|.|++++.|..|+.|++|||+.
T Consensus        16 ~~~G~C~C~~~~~G~~C~~C~~g~~~~   42 (50)
T cd00055          16 PGTGQCECKPNTTGRRCDRCAPGYYGL   42 (50)
T ss_pred             CCCCEEeCCCcCCCCCCCCCCCCCccC
Confidence            356899999999999999999999963


No 40 
>KOG0196|consensus
Probab=95.78  E-value=0.012  Score=60.20  Aligned_cols=60  Identities=30%  Similarity=0.797  Sum_probs=46.5

Q ss_pred             CCceeecCCCCC----CCCCCcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecC
Q psy826          126 GNGQCVCNKEYT----GELCNECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       126 gs~~C~C~~G~~----g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~  190 (329)
                      ..+.|.|++||.    |..|..|+.|+|-  .......|.+|...  ..-...+...| .|..||+..+
T Consensus       257 piG~C~C~aGye~~~~~~~C~aCp~G~yK--~~~~~~~C~~CP~~--S~s~~ega~~C-~C~~gyyRA~  320 (996)
T KOG0196|consen  257 PIGGCVCKAGYEEAENGKACQACPPGTYK--ASQGDSLCLPCPPN--SHSSSEGATSC-TCENGYYRAD  320 (996)
T ss_pred             EcCceeecCCCCcccCCCcceeCCCCccc--CCCCCCCCCCCCCC--CCCCCCCCCcc-cccCCcccCC
Confidence            457899999997    6799999999994  23345678998875  22236778999 9999999766


No 41 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.76  E-value=0.013  Score=35.23  Aligned_cols=30  Identities=40%  Similarity=0.956  Sum_probs=24.9

Q ss_pred             CCCCCCCCcccCCCCCCCCceeecCCCCCCC-CC
Q psy826          109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTGE-LC  141 (329)
Q Consensus       109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~-~C  141 (329)
                      ..+|.+++.|.+   ..+.+.|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~---~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVN---TPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEec---CCCCeEeECCCCCcccCCc
Confidence            356888899998   6788999999999987 54


No 42 
>KOG1025|consensus
Probab=95.70  E-value=0.024  Score=58.44  Aligned_cols=105  Identities=27%  Similarity=0.632  Sum_probs=65.0

Q ss_pred             CCCceee-cCCCCCCCCC-CcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCC
Q psy826          125 KGNGQCV-CNKEYTGELC-NECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSD  202 (329)
Q Consensus       125 ~gs~~C~-C~~G~~g~~C-~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~  202 (329)
                      +|..+|. |..--+|..| +.|..--. ...-.....|..||+.|. +|+++|...|..|..-  .++ ..|      +.
T Consensus       501 pgp~qClsCrn~~rgg~CVe~C~~l~g-~~rf~~~~~C~~ChPEC~-TCnG~G~d~C~~CaHf--~dg-p~C------V~  569 (1177)
T KOG1025|consen  501 PGPDQCLSCRNFSRGGTCVEKCNLLGG-EPRFVNSRECERCHPECE-TCNGPGADNCLQCAHF--RDG-PHC------VS  569 (1177)
T ss_pred             CCCccceeccccccCceehhhccccCC-cccccccceecccChhhc-cCCCCCccchhhhhhc--CCC-cch------hc
Confidence            4555664 5554446666 55543100 001135567999999998 9999999999888762  121 223      33


Q ss_pred             CCCCCCCce-----eecC--CCCcccCCCcCCC-CCCcCCcccccccC
Q psy826          203 ENICSGNQF-----CVNT--EGSYRCMQCDPSC-NGCHGDGPDMCEAC  242 (329)
Q Consensus       203 ~~~C~~~~~-----C~n~--~gs~~C~~C~~~C-~~C~g~~~~~C~~C  242 (329)
                        .|+++-.     -++.  .-...|.+||+.| -+|.|+++..|..|
T Consensus       570 --~CP~G~~G~~s~ii~kypd~~~~C~~Ch~nCth~c~~~~~~~c~~~  615 (1177)
T KOG1025|consen  570 --DCPDGVTGPKSTIIWKYPDPQIECRPCHPNCTHGCLGPTSPGCVLN  615 (1177)
T ss_pred             --cCCCcccCCCCceeeecCCCccccccCCCcccccccCCCCCCcccC
Confidence              4433211     1111  1234599999999 58999999999987


No 43 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.66  E-value=0.0052  Score=34.29  Aligned_cols=17  Identities=29%  Similarity=0.788  Sum_probs=14.5

Q ss_pred             Cccccccc--CCcccccCc
Q psy826          242 CAEGYKLQ--QNICINTQA  258 (329)
Q Consensus       242 C~~Gy~~~--~~~C~~~~~  258 (329)
                      |++||.+.  +..|+|++|
T Consensus         6 C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    6 CPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CCCCCcCCCCCCccccCCC
Confidence            99999964  678999986


No 44 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.57  E-value=0.017  Score=35.24  Aligned_cols=29  Identities=38%  Similarity=0.887  Sum_probs=24.6

Q ss_pred             CCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826          110 NVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC  141 (329)
Q Consensus       110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C  141 (329)
                      .+|.+++.|.+   ..+.+.|.|.+||.|..|
T Consensus         9 ~~C~~~~~C~~---~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           9 NPCQNGGTCVN---TVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             CCcCCCCEeEC---CCCCeEeECCCCCcCCcC
Confidence            45877889997   788899999999999766


No 45 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=95.55  E-value=0.0061  Score=40.42  Aligned_cols=26  Identities=46%  Similarity=1.128  Sum_probs=21.8

Q ss_pred             CCCceeecCCCCCCCCCCcCCCCccc
Q psy826          125 KGNGQCVCNKEYTGELCNECNTGYFQ  150 (329)
Q Consensus       125 ~gs~~C~C~~G~~g~~C~~C~~g~y~  150 (329)
                      ..+++|.|+++|.|..|+.|.++||+
T Consensus        15 ~~~G~C~C~~~~~G~~C~~C~~g~~~   40 (49)
T PF00053_consen   15 PSTGQCVCKPGTTGPRCDQCKPGYFG   40 (49)
T ss_dssp             ETCEEESBSTTEESTTS-EE-TTEEC
T ss_pred             CCCCEEeccccccCCcCcCCCCcccc
Confidence            35689999999999999999999996


No 46 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=95.44  E-value=0.013  Score=38.32  Aligned_cols=25  Identities=40%  Similarity=1.095  Sum_probs=23.1

Q ss_pred             CCceeecCCCCCCCCCCcCCCCccc
Q psy826          126 GNGQCVCNKEYTGELCNECNTGYFQ  150 (329)
Q Consensus       126 gs~~C~C~~G~~g~~C~~C~~g~y~  150 (329)
                      .+++|.|++++.|..|+.|++|||+
T Consensus        16 ~~G~C~C~~~~~G~~C~~C~~g~~g   40 (46)
T smart00180       16 DTGQCECKPNVTGRRCDRCAPGYYG   40 (46)
T ss_pred             CCCEEECCCCCCCCCCCcCCCCcCC
Confidence            3689999999999999999999996


No 47 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.39  E-value=0.019  Score=34.72  Aligned_cols=28  Identities=32%  Similarity=0.807  Sum_probs=23.3

Q ss_pred             CCCCCCCcccCCCCCCCCceeecCCCCCC-CCC
Q psy826          110 NVCFGNGKCKGNGTRKGNGQCVCNKEYTG-ELC  141 (329)
Q Consensus       110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~g-~~C  141 (329)
                      .+|.++ .|.+   ..+++.|.|.+||.| ..|
T Consensus         6 ~~C~~~-~C~~---~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCIN---TPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEEC---CCCCeEeECCCCCccCCcc
Confidence            468777 8987   678999999999999 655


No 48 
>smart00261 FU Furin-like repeats.
Probab=95.06  E-value=0.032  Score=36.30  Aligned_cols=35  Identities=40%  Similarity=1.003  Sum_probs=28.5

Q ss_pred             CCcCCcccccCCcccCCCCCCccCCCCceecCCCCcc
Q psy826          159 LCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCY  195 (329)
Q Consensus       159 ~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~  195 (329)
                      .|.+|+..|. +|.+++...|+.|..||++.. +.|+
T Consensus         3 ~C~~C~~~C~-~C~~~~~~~C~~C~~~~~l~~-~~Cv   37 (46)
T smart00261        3 ECKPCHPECA-TCTGPGPDDCTSCKHGFFLDG-GTCV   37 (46)
T ss_pred             cccccccccc-ccCCcCcCcCccCCcccCCCC-CCcc
Confidence            5788999998 899877889999999998765 4443


No 49 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.03  E-value=0.01  Score=36.70  Aligned_cols=28  Identities=29%  Similarity=0.685  Sum_probs=21.2

Q ss_pred             cccccCCcc-cCCCCCCccCCCCceecCCC
Q psy826          164 HASCESGCS-TGGPKGCTKCKSGWAADKDI  192 (329)
Q Consensus       164 ~~~C~~~C~-~~~~~~C~~C~~G~~~~~~~  192 (329)
                      +++|.+.|. .++++.| .|++||.+..++
T Consensus         5 NGgC~h~C~~~~g~~~C-~C~~Gy~L~~D~   33 (36)
T PF14670_consen    5 NGGCSHICVNTPGSYRC-SCPPGYKLAEDG   33 (36)
T ss_dssp             GGGSSSEEEEETTSEEE-E-STTEEE-TTS
T ss_pred             CCCcCCCCccCCCceEe-ECCCCCEECcCC
Confidence            456777787 7889999 999999998754


No 50 
>KOG3512|consensus
Probab=95.01  E-value=0.062  Score=51.67  Aligned_cols=76  Identities=22%  Similarity=0.632  Sum_probs=46.8

Q ss_pred             CCCC-CcccCCCCCCCCceeecCCCCCCCCCCcCCCCcccccc----CcCCCCCcCCc----------------------
Q psy826          112 CFGN-GKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYK----DEKTILCSKCH----------------------  164 (329)
Q Consensus       112 C~~~-~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~----~~~~~~C~~C~----------------------  164 (329)
                      |..+ +.|+-+  ..+..+|.|.-+.+|..|..|.+-||....    ......|.+|.                      
T Consensus       280 CNgHAs~Cv~d--~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~S  357 (592)
T KOG3512|consen  280 CNGHASRCVMD--ESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRS  357 (592)
T ss_pred             ecCccceeeec--cCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccc
Confidence            5544 347641  234489999999999999999998875221    11223344321                      


Q ss_pred             -ccccCCcc-cCCCCCCccCCCCceecC
Q psy826          165 -ASCESGCS-TGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       165 -~~C~~~C~-~~~~~~C~~C~~G~~~~~  190 (329)
                       ..|. .|. +..+..|..|.+||+-+.
T Consensus       358 ggvCl-nCrHnTaGrhChyCreGyyRd~  384 (592)
T KOG3512|consen  358 GGVCL-NCRHNTAGRHCHYCREGYYRDG  384 (592)
T ss_pred             cceEe-ecccCCCCcccccccCccccCC
Confidence             1122 343 334567888999998776


No 51 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.54  E-value=0.015  Score=27.54  Aligned_cols=13  Identities=38%  Similarity=1.035  Sum_probs=8.7

Q ss_pred             eeecCCCCCCCCC
Q psy826          129 QCVCNKEYTGELC  141 (329)
Q Consensus       129 ~C~C~~G~~g~~C  141 (329)
                      .|+|++||+|..|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4778888887765


No 52 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.26  E-value=0.018  Score=35.58  Aligned_cols=24  Identities=63%  Similarity=1.316  Sum_probs=19.5

Q ss_pred             CCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          205 ICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       205 ~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      .|  .++|++++++|+|.                   |++||.+.
T Consensus         7 gC--~h~C~~~~g~~~C~-------------------C~~Gy~L~   30 (36)
T PF14670_consen    7 GC--SHICVNTPGSYRCS-------------------CPPGYKLA   30 (36)
T ss_dssp             GS--SSEEEEETTSEEEE--------------------STTEEE-
T ss_pred             Cc--CCCCccCCCceEeE-------------------CCCCCEEC
Confidence            45  56999999999999                   99999986


No 53 
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=94.06  E-value=0.087  Score=34.72  Aligned_cols=32  Identities=41%  Similarity=1.011  Sum_probs=24.9

Q ss_pred             CCcccccCCcccCCCCCCccCCCCceecCCCCcc
Q psy826          162 KCHASCESGCSTGGPKGCTKCKSGWAADKDIGCY  195 (329)
Q Consensus       162 ~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~  195 (329)
                      +|+..|. +|.+++...|+.|..||.+.. +.|+
T Consensus         1 ~C~~~C~-~C~g~~~~~C~~C~~~~~~~~-~~Cv   32 (49)
T cd00064           1 PCHPSCA-TCTGPGPDQCTSCRHGFYLDG-GTCV   32 (49)
T ss_pred             CCCCccC-CCcCcCcCcCccCcCccCCCC-Cccc
Confidence            3677887 898887789999999998654 4454


No 54 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=94.03  E-value=0.056  Score=32.85  Aligned_cols=34  Identities=50%  Similarity=1.040  Sum_probs=26.2

Q ss_pred             CCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          197 INECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       197 ~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      +++|....+|..++.|.+..++|.|.                   |++||.+.
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~-------------------C~~g~~g~   35 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCS-------------------CPPGYTGR   35 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeE-------------------CCCCCcCC
Confidence            45665423687778999999999999                   99999873


No 55 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=93.94  E-value=0.024  Score=34.01  Aligned_cols=27  Identities=33%  Similarity=0.894  Sum_probs=24.0

Q ss_pred             CCCCCCceeecCC-CCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          204 NICSGNQFCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       204 ~~C~~~~~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      .+|.+++.|+... ++|.|.                   |++||.|.
T Consensus         4 ~~C~n~g~C~~~~~~~y~C~-------------------C~~G~~G~   31 (32)
T PF00008_consen    4 NPCQNGGTCIDLPGGGYTCE-------------------CPPGYTGK   31 (32)
T ss_dssp             TSSTTTEEEEEESTSEEEEE-------------------EBTTEEST
T ss_pred             CcCCCCeEEEeCCCCCEEeE-------------------CCCCCccC
Confidence            4788899999998 899999                   99999875


No 56 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=93.68  E-value=0.0094  Score=50.51  Aligned_cols=122  Identities=26%  Similarity=0.665  Sum_probs=72.3

Q ss_pred             eeecccccCcCCCCCC---CCCCc---cCCC---CCCCCCCCCcccCCC--CCCCCceeecCCCCC--CCCC--CcCCCC
Q psy826           83 WLCINKLKRCCPVDHY---GADCK---PCLG---FPNVCFGNGKCKGNG--TRKGNGQCVCNKEYT--GELC--NECNTG  147 (329)
Q Consensus        83 ~~C~~~~~~~C~~G~~---G~~C~---~C~~---~~~~C~~~~~C~~~~--~~~gs~~C~C~~G~~--g~~C--~~C~~g  147 (329)
                      +.|.      |.+||.   -.+|+   .|..   .-.+|..-+.|.+..  .....+.|.|.+||.  ...|  ..|.. 
T Consensus        20 fEC~------Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~-   92 (197)
T PF06247_consen   20 FECK------CNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNN-   92 (197)
T ss_dssp             EEEE------ESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS-
T ss_pred             eEEE------cCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCc-
Confidence            5677      999997   56776   4643   236788888998633  235679999999997  3345  33332 


Q ss_pred             ccccccCcCCCCCcCCcccccCCcc----cCCCCCCccCCCCceecCCCCccc--CCCCCCCCCCCCCceeecCCCCccc
Q psy826          148 YFQSYKDEKTILCSKCHASCESGCS----TGGPKGCTKCKSGWAADKDIGCYD--INECSDENICSGNQFCVNTEGSYRC  221 (329)
Q Consensus       148 ~y~~~~~~~~~~C~~C~~~C~~~C~----~~~~~~C~~C~~G~~~~~~~~C~d--~~eC~~~~~C~~~~~C~n~~gs~~C  221 (329)
                                   ..|..   +.|.    .+....| +|.-|+...+.++|.-  ..+|..  .|..+..|....+-|+|
T Consensus        93 -------------~~Cg~---GKCI~d~~~~~~~~C-SC~IGkV~~dn~kCtk~G~T~C~L--KCk~nE~CK~~~~~Y~C  153 (197)
T PF06247_consen   93 -------------KDCGS---GKCILDPDNPNNPTC-SCNIGKVPDDNKKCTKTGETKCSL--KCKENEECKLVDGYYKC  153 (197)
T ss_dssp             ----------------TT---EEEEEEEGGGSEEEE-EE-TEEETTTTTESEEEE----------TTTEEEEEETTEEEE
T ss_pred             -------------eecCC---CeEEecCCCCCCcee-EeeeceEeccCCcccCCCccceee--ecCCCcceeeeCcEEEe
Confidence                         11221   2343    2333489 9999999555455752  234766  78889999999999999


Q ss_pred             CCCcCCCCCCcCCcccccccCccccccc
Q psy826          222 MQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       222 ~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      .                   |.+|+.+.
T Consensus       154 ~-------------------~~~~~~~~  162 (197)
T PF06247_consen  154 V-------------------CKEGFPGD  162 (197)
T ss_dssp             E-------------------E-TT-EEE
T ss_pred             e-------------------cCCCCCCC
Confidence            9                   99998876


No 57 
>PF00594 Gla:  Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain;  InterPro: IPR000294 The GLA (gamma-carboxyglutamic acid-rich) domain contains glutamate residues that have been post-translationally modified by vitamin K-dependent carboxylation to form gamma-carboxyglutamate (Gla) [, , ]. All glutamic acid (Glu) residues present in the GLA domain are potential carboxylation sites; in coagulation proteins, all Gu residues are modified to Gla, while in osteocalcin and matrix Gla proteins only some Glu residues are modified to Gla.  The GLA domain is responsible for the high-affinity binding of calcium ions. It starts at the N-terminal extremity of the mature form of proteins and ends with a conserved aromatic residue; a conserved Gla-x(3)-Gla-x-Cys motif [] is found in the middle of the domain which seems to be important for substrate recognition by the carboxylase. The 3D structure of the GLA domain has been solved [, ]. Calcium ions induce conformational changes in the GLA domain that and are necessary for the proper folding of the GLA domain. A common structural feature of functional GLA domains is the clustering of N-terminal hydrophobic residues into a hydrophobic patch that mediates interaction with the cell surface membrane [].  Proteins known to contain a GLA domain include []:    Coagulation factor X []  Coagulation factor VII []  Coagulation factor IX []  Coagulation factor XIV (vitamin K-dependent protein C) []  Vitamin K-dependent protein S []  Vitamin K-dependent protein Z []  Prothrombin  Transthyretin  Osteocalcin (also known as bone-Gla protein, BGP)  Matrix Gla protein (MGP) []  Inter-alpha-trypsin inhibitor heavy chain H2 Growth arrest-specific protein 6 (Gas-6) []  ; GO: 0005509 calcium ion binding, 0005576 extracellular region; PDB: 1Q3M_A 1VZM_C 2PF1_A 1NL2_A 1NL1_A 2SPT_A 2PF2_A 2ZP0_L 1W0Y_L 2AEI_L ....
Probab=93.41  E-value=0.019  Score=36.76  Aligned_cols=39  Identities=26%  Similarity=0.455  Sum_probs=32.4

Q ss_pred             hhhhccCcccccceEEEecccCcccCcccCccchhhHHhhhHHHHHHhhhh
Q psy826           19 WEEEKQKIYAKSEVRLIEIQEKMCSEVSGFLDQCHNFAADIESEIEEWWFK   69 (329)
Q Consensus        19 ~ee~~~~~y~~se~~~~e~~e~~C~~~~~~~~~c~~~~~~~~~~~e~~~~~   69 (329)
                      |||.+.|+.++      ||++++|+     +.+..++++....+. .+|..
T Consensus         1 ~ee~~~g~ler------Ec~eE~C~-----~eea~E~f~~~~~t~-~fw~~   39 (42)
T PF00594_consen    1 LEELKPGNLER------ECMEEVCE-----YEEAREIFEDTEGTN-AFWKK   39 (42)
T ss_dssp             TGGGSCHHHHH------HHTSSTHT-----HHHHHHHHHSHHHHH-HHHHH
T ss_pred             CCccccCCchH------HHHHhhcC-----hHHHHHHHHChHhHH-HHHHH
Confidence            79999999999      99999999     888888888876654 55543


No 58 
>KOG1226|consensus
Probab=93.41  E-value=0.16  Score=51.88  Aligned_cols=38  Identities=45%  Similarity=1.110  Sum_probs=29.4

Q ss_pred             CCCCCc----cCCCC-CCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826           98 YGADCK----PCLGF-PNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN  142 (329)
Q Consensus        98 ~G~~C~----~C~~~-~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~  142 (329)
                      ||+.|+    .|+.. ...|.++|.|.=       ++|+|.+||+|..|+
T Consensus       538 ~G~fCECDnfsC~r~~g~lC~g~G~C~C-------G~CvC~~GwtG~~C~  580 (783)
T KOG1226|consen  538 YGKFCECDNFSCERHKGVLCGGHGRCEC-------GRCVCNPGWTGSACN  580 (783)
T ss_pred             eeeeeeccCcccccccCcccCCCCeEeC-------CcEEcCCCCccCCCC
Confidence            488885    45542 256888888874       789999999999994


No 59 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=92.75  E-value=0.13  Score=30.65  Aligned_cols=27  Identities=56%  Similarity=1.109  Sum_probs=23.2

Q ss_pred             CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          204 NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       204 ~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      ..|.++..|++..+.+.|.                   |+.||.+.
T Consensus         6 ~~C~~~~~C~~~~~~~~C~-------------------C~~g~~g~   32 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCV-------------------CPPGYTGD   32 (36)
T ss_pred             CCCCCCCEEecCCCCeEeE-------------------CCCCCccc
Confidence            4677778999999999999                   99999876


No 60 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.74  E-value=0.04  Score=34.04  Aligned_cols=32  Identities=34%  Similarity=0.746  Sum_probs=24.5

Q ss_pred             CCCCCCceeecCC-CCcccCCCcCCCCCCcCCcccccccCcccccccCCccc
Q psy826          204 NICSGNQFCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICI  254 (329)
Q Consensus       204 ~~C~~~~~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~  254 (329)
                      ..|+.++.|++.. |+..|.                   |..||..++..|+
T Consensus         5 ~~cP~NA~C~~~~dG~eecr-------------------Cllgyk~~~~~C~   37 (37)
T PF12946_consen    5 TKCPANAGCFRYDDGSEECR-------------------CLLGYKKVGGKCV   37 (37)
T ss_dssp             S---TTEEEEEETTSEEEEE-------------------E-TTEEEETTEEE
T ss_pred             ccCCCCcccEEcCCCCEEEE-------------------eeCCccccCCCcC
Confidence            4788999999877 999999                   9999999888774


No 61 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=92.48  E-value=0.15  Score=38.96  Aligned_cols=27  Identities=26%  Similarity=0.914  Sum_probs=19.4

Q ss_pred             CCCCCcCCcccccccCcccccccCCcccc
Q psy826          227 SCNGCHGDGPDMCEACAEGYKLQQNICIN  255 (329)
Q Consensus       227 ~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~  255 (329)
                      .|..|.+  +..|..|..||.++++.|+.
T Consensus        29 ~C~~C~~--~~~C~~C~~GY~~~~~~Cv~   55 (96)
T PTZ00382         29 NCKSCVV--DGVCGECNSGFSLDNGKCVS   55 (96)
T ss_pred             CCcCCCC--CCccccCcCCcccCCCcccc
Confidence            4556644  34688899999998877764


No 62 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=92.42  E-value=0.15  Score=30.71  Aligned_cols=25  Identities=48%  Similarity=1.051  Sum_probs=21.7

Q ss_pred             CCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          205 ICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       205 ~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      .|..+ .|++..+++.|.                   |++||.+.
T Consensus         7 ~C~~~-~C~~~~~~~~C~-------------------C~~g~~g~   31 (35)
T smart00181        7 PCSNG-TCINTPGSYTCS-------------------CPPGYTGD   31 (35)
T ss_pred             CCCCC-EEECCCCCeEeE-------------------CCCCCccC
Confidence            67667 899999999999                   99999885


No 63 
>smart00069 GLA Domain containing Gla (gamma-carboxyglutamate) residues. A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.
Probab=92.22  E-value=0.094  Score=37.01  Aligned_cols=38  Identities=29%  Similarity=0.387  Sum_probs=32.3

Q ss_pred             hhhhhccCcccccceEEEecccCcccCcccCccchhhHHhhhHHHHHHh
Q psy826           18 AWEEEKQKIYAKSEVRLIEIQEKMCSEVSGFLDQCHNFAADIESEIEEW   66 (329)
Q Consensus        18 ~~ee~~~~~y~~se~~~~e~~e~~C~~~~~~~~~c~~~~~~~~~~~e~~   66 (329)
                      ++||.+.|+.++      |++|++|.     +++..+++++.+.+.+.|
T Consensus        24 ~~Ee~~~~~ler------Ec~eE~C~-----~eea~E~f~~~~~t~~fw   61 (65)
T smart00069       24 LLEELRPGNLER------ECQEEICS-----LEEAREVFEDNEGTDEFY   61 (65)
T ss_pred             chhhhccCccch------HHHhhhcc-----CcHHHHHHHchHhHHHHH
Confidence            489999999998      99999999     888888888887775544


No 64 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=91.72  E-value=0.033  Score=47.26  Aligned_cols=61  Identities=30%  Similarity=0.788  Sum_probs=44.4

Q ss_pred             CCCCCccCCCCceecCCCCcccCCCCCCC----CCCCCCceeecCC-----CCcccCCCcCCCCCCcCCcccccccCccc
Q psy826          175 GPKGCTKCKSGWAADKDIGCYDINECSDE----NICSGNQFCVNTE-----GSYRCMQCDPSCNGCHGDGPDMCEACAEG  245 (329)
Q Consensus       175 ~~~~C~~C~~G~~~~~~~~C~d~~eC~~~----~~C~~~~~C~n~~-----gs~~C~~C~~~C~~C~g~~~~~C~~C~~G  245 (329)
                      ..+.| .|.+||.+....+|....+|...    .+|.+-+.|++..     ..|+|.                   |.+|
T Consensus        18 NHfEC-~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~-------------------C~~g   77 (197)
T PF06247_consen   18 NHFEC-KCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCD-------------------CING   77 (197)
T ss_dssp             SEEEE-EESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEE-------------------E-TT
T ss_pred             CceEE-EcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEe-------------------cccC
Confidence            34789 99999999887889988888762    6788888998765     468888                   9999


Q ss_pred             ccccCCcccc
Q psy826          246 YKLQQNICIN  255 (329)
Q Consensus       246 y~~~~~~C~~  255 (329)
                      |.+..+.|++
T Consensus        78 Y~~~~~vCvp   87 (197)
T PF06247_consen   78 YILKQGVCVP   87 (197)
T ss_dssp             EEESSSSEEE
T ss_pred             ceeeCCeEch
Confidence            9988777763


No 65 
>KOG3512|consensus
Probab=90.01  E-value=0.84  Score=44.18  Aligned_cols=65  Identities=23%  Similarity=0.634  Sum_probs=39.5

Q ss_pred             CCceee-cCCCCCCCCCCcCCCCccccccC--cCCCCCcCCc--cc--ccCCcccCCCCCCccCCCCceecCCC
Q psy826          126 GNGQCV-CNKEYTGELCNECNTGYFQSYKD--EKTILCSKCH--AS--CESGCSTGGPKGCTKCKSGWAADKDI  192 (329)
Q Consensus       126 gs~~C~-C~~G~~g~~C~~C~~g~y~~~~~--~~~~~C~~C~--~~--C~~~C~~~~~~~C~~C~~G~~~~~~~  192 (329)
                      ..++|. |.-...|..|..|.+|||.+...  .....|..|+  +.  =..+|+ ..+.+| .|++|-.|...+
T Consensus       357 SggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCN-q~tGqC-pCkeGvtG~tCn  428 (592)
T KOG3512|consen  357 SGGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCN-QTTGQC-PCKEGVTGLTCN  428 (592)
T ss_pred             ccceEeecccCCCCcccccccCccccCCCCCCchhhhhhhcCCccccccccccc-ccCCcc-cCCCCCcccccc
Confidence            345775 88888899999999999953322  1234566542  21  002455 123567 777777665543


No 66 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=88.67  E-value=0.74  Score=35.18  Aligned_cols=45  Identities=24%  Similarity=0.750  Sum_probs=28.4

Q ss_pred             CCCcCCCCccccccCcCCCCCcCCc-ccccCCcccCCCCCCccCCCCceecC
Q psy826          140 LCNECNTGYFQSYKDEKTILCSKCH-ASCESGCSTGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       140 ~C~~C~~g~y~~~~~~~~~~C~~C~-~~C~~~C~~~~~~~C~~C~~G~~~~~  190 (329)
                      .|+.|.+|||   .+.....|.+|. .+|. +|..  +..|..|..||.+..
T Consensus         5 ~Ct~C~~g~~---~~~~~~~C~~C~~~~C~-~C~~--~~~C~~C~~GY~~~~   50 (96)
T PTZ00382          5 VCTSCDSDKK---PNKDGSGCVLCSVGNCK-SCVV--DGVCGECNSGFSLDN   50 (96)
T ss_pred             ccCcCCCCCc---cCCCCCcCCcCCCCCCc-CCCC--CCccccCcCCcccCC
Confidence            4667777776   222334577776 4676 6753  246878888888754


No 67 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=86.61  E-value=0.15  Score=35.77  Aligned_cols=44  Identities=34%  Similarity=0.766  Sum_probs=20.0

Q ss_pred             CcCCCCCCCCCCc-cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826           91 RCCPVDHYGADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC  141 (329)
Q Consensus        91 ~~C~~G~~G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C  141 (329)
                      +.|.+.|||+.|. .|.+... -..+-+|..      .+.-+|.+||+|..|
T Consensus        19 v~C~~nyyG~~C~~~C~~~~d-~~ghy~Cd~------~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   19 VVCDENYYGPNCSKFCKPRDD-SFGHYTCDS------NGNKVCLPGWTGPNC   63 (63)
T ss_dssp             ----TTEETTTT-EE---EEE-TTEEEEE-S------S--EEE-TTEESTTS
T ss_pred             EECCCCCCCccccCCcCCCcC-CcCCcccCC------CCCCCCCCCCcCCCC
Confidence            3499999999998 5543111 122334542      244568899988766


No 68 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.83  E-value=0.5  Score=41.71  Aligned_cols=37  Identities=41%  Similarity=0.910  Sum_probs=30.4

Q ss_pred             CCcccCCCCCCC-CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826          192 IGCYDINECSDE-NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ  249 (329)
Q Consensus       192 ~~C~d~~eC~~~-~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~  249 (329)
                      ..|.++++|... +.|.  +.|.++.|+|.|.                   |.+||.+.
T Consensus       182 ~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~-------------------c~~g~~~~  219 (224)
T cd01475         182 KICVVPDLCATLSHVCQ--QVCISTPGSYLCA-------------------CTEGYALL  219 (224)
T ss_pred             ccCcCchhhcCCCCCcc--ceEEcCCCCEEeE-------------------CCCCccCC
Confidence            348888889765 5674  5899999999999                   99999875


No 69 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=84.27  E-value=1.1  Score=27.12  Aligned_cols=22  Identities=27%  Similarity=0.768  Sum_probs=15.6

Q ss_pred             CcccCCCCCCccCCCCceecCCC
Q psy826          170 GCSTGGPKGCTKCKSGWAADKDI  192 (329)
Q Consensus       170 ~C~~~~~~~C~~C~~G~~~~~~~  192 (329)
                      .|.-.....| .|++||.++.+.
T Consensus        11 ~CDpn~~~~C-~CPeGyIlde~~   32 (34)
T PF09064_consen   11 DCDPNSPGQC-FCPEGYILDEGS   32 (34)
T ss_pred             ccCCCCCCce-eCCCceEecCCc
Confidence            3442344699 999999998743


No 70 
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=83.90  E-value=3.3  Score=31.57  Aligned_cols=20  Identities=25%  Similarity=0.599  Sum_probs=13.5

Q ss_pred             CcccCCCCCCccCCCCceecC
Q psy826          170 GCSTGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       170 ~C~~~~~~~C~~C~~G~~~~~  190 (329)
                      .|+......| .|.+||+...
T Consensus        68 ~ct~t~dt~C-~C~~G~y~~~   87 (98)
T cd00185          68 PCTATRNTVC-GCKPGFYCLT   87 (98)
T ss_pred             cCCCCCCCeE-eCCCCCEecC
Confidence            3443445788 7888888765


No 71 
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=83.74  E-value=4.2  Score=30.98  Aligned_cols=18  Identities=17%  Similarity=0.534  Sum_probs=11.7

Q ss_pred             CCcccccccCcccccccCC
Q psy826          233 GDGPDMCEACAEGYKLQQN  251 (329)
Q Consensus       233 g~~~~~C~~C~~Gy~~~~~  251 (329)
                      ......|. |++||+....
T Consensus        71 ~t~dt~C~-C~~G~y~~~~   88 (98)
T cd00185          71 ATRNTVCG-CKPGFYCLTK   88 (98)
T ss_pred             CCCCCeEe-CCCCCEecCC
Confidence            34445555 9999987653


No 72 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=83.22  E-value=0.14  Score=40.61  Aligned_cols=32  Identities=22%  Similarity=0.592  Sum_probs=21.5

Q ss_pred             CCCCCCceeecCC--CCcccCCCcCCCCCCcCCcccccccCcccccccCCcccccC
Q psy826          204 NICSGNQFCVNTE--GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICINTQ  257 (329)
Q Consensus       204 ~~C~~~~~C~n~~--gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~  257 (329)
                      +.|-+| .|...+  ..+.|.                   |..||.|.  +|+..+
T Consensus        51 ~YClHG-~C~yI~dl~~~~Cr-------------------C~~GYtGe--RCEh~d   84 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCR-------------------CSHGYTGI--RCQHVV   84 (139)
T ss_pred             CEeECC-EEEeeccCCCceeE-------------------CCCCcccc--ccccee
Confidence            345544 565444  577888                   99999987  776544


No 73 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=81.98  E-value=2.1  Score=28.34  Aligned_cols=21  Identities=33%  Similarity=0.998  Sum_probs=15.8

Q ss_pred             CCCCCCCcccCCCCCCCCceeecCCCCC
Q psy826          110 NVCFGNGKCKGNGTRKGNGQCVCNKEYT  137 (329)
Q Consensus       110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~  137 (329)
                      ..|..++.|.+       ++|.|++||.
T Consensus        26 ~qC~~~s~C~~-------g~C~C~~g~~   46 (52)
T PF01683_consen   26 EQCIGGSVCVN-------GRCQCPPGYV   46 (52)
T ss_pred             CCCCCcCEEcC-------CEeECCCCCE
Confidence            44666777865       7899999885


No 74 
>PHA02887 EGF-like protein; Provisional
Probab=80.07  E-value=1.4  Score=34.46  Aligned_cols=37  Identities=38%  Similarity=0.881  Sum_probs=27.0

Q ss_pred             cCCC-CCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826          104 PCLG-FPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN  142 (329)
Q Consensus       104 ~C~~-~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~  142 (329)
                      +|+. ..+-|. +|.|.... ......|.|.+||+|..|+
T Consensus        85 pC~~eyk~YCi-HG~C~yI~-dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         85 KCKNDFNDFCI-NGECMNII-DLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             ccChHhhCEee-CCEEEccc-cCCCceeECCCCcccCCCC
Confidence            5654 335677 67998533 2566899999999999885


No 75 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=76.35  E-value=4  Score=27.00  Aligned_cols=12  Identities=33%  Similarity=0.783  Sum_probs=9.7

Q ss_pred             CcccccccCCcc
Q psy826          242 CAEGYKLQQNIC  253 (329)
Q Consensus       242 C~~Gy~~~~~~C  253 (329)
                      |++||...+++|
T Consensus        41 C~~g~~~~~~~C   52 (52)
T PF01683_consen   41 CPPGYVEVGGRC   52 (52)
T ss_pred             CCCCCEecCCCC
Confidence            999998876665


No 76 
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=74.57  E-value=5.4  Score=31.87  Aligned_cols=12  Identities=42%  Similarity=1.032  Sum_probs=6.6

Q ss_pred             CCCcCCCCcccc
Q psy826          140 LCNECNTGYFQS  151 (329)
Q Consensus       140 ~C~~C~~g~y~~  151 (329)
                      .|..|++|.|.+
T Consensus        64 ~C~PCp~GTYTe   75 (127)
T PHA02637         64 QCTPCGSGTFTS   75 (127)
T ss_pred             ccccCCCCCeec
Confidence            455666666543


No 77 
>KOG4611|consensus
Probab=68.25  E-value=5.4  Score=37.54  Aligned_cols=25  Identities=32%  Similarity=1.160  Sum_probs=17.4

Q ss_pred             CCCcCCCCccccccCcCCCCCcCCccccc
Q psy826          140 LCNECNTGYFQSYKDEKTILCSKCHASCE  168 (329)
Q Consensus       140 ~C~~C~~g~y~~~~~~~~~~C~~C~~~C~  168 (329)
                      .|..|..|||.   + ....|..|..+|.
T Consensus        99 fcgncasgfyr---n-dngyctkcetscs  123 (747)
T KOG4611|consen   99 FCGNCASGFYR---N-DNGYCTKCETSCS  123 (747)
T ss_pred             ccccccccceE---C-CCcccccccccHh
Confidence            68899999984   2 3345777776664


No 78 
>KOG3782|consensus
Probab=67.54  E-value=0.79  Score=38.07  Aligned_cols=18  Identities=50%  Similarity=0.899  Sum_probs=17.3

Q ss_pred             cccccceEEEecccCccc
Q psy826           26 IYAKSEVRLIEIQEKMCS   43 (329)
Q Consensus        26 ~y~~se~~~~e~~e~~C~   43 (329)
                      -|+|||.-|.|+||++|+
T Consensus        69 ~yarSE~hLTEl~E~iCd   86 (189)
T KOG3782|consen   69 RYARSEMHLTELMEKICD   86 (189)
T ss_pred             ccchhHHHHHHHHHHHHH
Confidence            599999999999999999


No 79 
>KOG1218|consensus
Probab=64.40  E-value=49  Score=30.28  Aligned_cols=107  Identities=24%  Similarity=0.549  Sum_probs=54.3

Q ss_pred             CCCCCCCCCc-cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCCCCCcCCcccccCCcc
Q psy826           94 PVDHYGADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKTILCSKCHASCESGCS  172 (329)
Q Consensus        94 ~~G~~G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~  172 (329)
                      ..+|.|..|. +|+-+.. |.. .+|.+    ... .|.+..+|.+..|..  .++++.   .....| .+...    +.
T Consensus        96 ~~~~~g~~C~~~~~~~~~-c~~-~~C~~----~~~-~c~~~~~~~~~~C~~--~~~~g~---~C~~~c-~~~~~----~~  158 (316)
T KOG1218|consen   96 LNGYEGPQCESPCPCGDG-CAE-KTCAN----PRR-ECRCGGGYIGEQCGE--ENLVGL---KCQRDC-QCTGG----CD  158 (316)
T ss_pred             CCCCCcccccCCCCcCCc-ccc-cccCC----Ccc-ceecCCcCccccccc--cCCCCC---CccCCC-CCccc----cC
Confidence            5677777776 3332211 333 45654    111 688888888887754  344432   122222 11111    11


Q ss_pred             cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCccc
Q psy826          173 TGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRC  221 (329)
Q Consensus       173 ~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C  221 (329)
                       .....| .|.+||.+..... ... .|.....|..+..|....+...+
T Consensus       159 -~~~~~c-~c~~g~~g~~~~~-~~~-~c~~~~~~~~g~~C~~~~~~~~~  203 (316)
T KOG1218|consen  159 -CKNGIC-TCQPGFVGVFCVE-SCS-GCSPLTACENGAKCNRSTGSCLC  203 (316)
T ss_pred             -CCCCce-eccCCcccccccc-cCC-CcCCCcccCCCCeeecccccccc
Confidence             334678 7999999876321 110 14433455555566655444333


No 80 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=61.61  E-value=5.4  Score=26.66  Aligned_cols=22  Identities=32%  Similarity=0.890  Sum_probs=16.6

Q ss_pred             CCccCCCCceecCCCCcccCCCC
Q psy826          178 GCTKCKSGWAADKDIGCYDINEC  200 (329)
Q Consensus       178 ~C~~C~~G~~~~~~~~C~d~~eC  200 (329)
                      .| .|++||.++..+.|+...+|
T Consensus        34 gC-~C~~G~v~~~~~~CV~~~~C   55 (55)
T PF01826_consen   34 GC-FCPPGYVRNDNGRCVPPSEC   55 (55)
T ss_dssp             EE-EETTTEEEETTSEEEEGGGC
T ss_pred             cC-CCCCCeeEcCCCCEEcHHHC
Confidence            48 99999998875567765544


No 81 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=58.73  E-value=8.4  Score=33.85  Aligned_cols=30  Identities=27%  Similarity=0.656  Sum_probs=23.4

Q ss_pred             cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCC
Q psy826          104 PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTG  138 (329)
Q Consensus       104 ~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g  138 (329)
                      +|+...+.|.  ..|.+   +.|+|.|.|.+||+.
T Consensus       189 ~C~~~~~~c~--~~C~~---~~g~~~c~c~~g~~~  218 (224)
T cd01475         189 LCATLSHVCQ--QVCIS---TPGSYLCACTEGYAL  218 (224)
T ss_pred             hhcCCCCCcc--ceEEc---CCCCEEeECCCCccC
Confidence            5665556675  46888   899999999999974


No 82 
>PF00757 Furin-like:  Furin-like cysteine rich region;  InterPro: IPR006211 The furin-like cysteine rich region has been found in a variety of proteins from eukaryotes that are involved in the mechanism of signal transduction by receptor tyrosine kinases, which involves receptor aggregation [].; GO: 0004714 transmembrane receptor protein tyrosine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation, 0007169 transmembrane receptor protein tyrosine kinase signaling pathway, 0016020 membrane; PDB: 3U2P_A 2AHX_B 1N8Y_C 1IGR_A 1S78_A 3MZW_A 2A91_A 1N8Z_C 3N85_A 3H3B_B ....
Probab=56.48  E-value=5.4  Score=33.03  Aligned_cols=34  Identities=29%  Similarity=0.778  Sum_probs=24.8

Q ss_pred             CcCCC-CCCcCCcccccccCcccccccCCcccccCcC
Q psy826          224 CDPSC-NGCHGDGPDMCEACAEGYKLQQNICINTQAK  259 (329)
Q Consensus       224 C~~~C-~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~~~  259 (329)
                      ||+.| ++|+|+++.+|.+|+.-+.  +++|++.++.
T Consensus        51 Ch~eClGGC~gp~~~~C~aCr~~~~--~g~Cv~~CP~   85 (149)
T PF00757_consen   51 CHEECLGGCSGPNPSDCYACRHFSY--NGTCVEQCPP   85 (149)
T ss_dssp             --TTEESEESSSSTTSESCESSEEE--TTEEESS-ST
T ss_pred             hhHHHhhcCCCCCCCcCHHHhCcCc--CCeecccCCh
Confidence            88888 6899999999999986443  5688877655


No 83 
>KOG0196|consensus
Probab=52.76  E-value=14  Score=38.81  Aligned_cols=49  Identities=22%  Similarity=0.524  Sum_probs=28.8

Q ss_pred             ccCcCCCCCC----CCCCccCCCCC-CCCCCCCc---ccCCC--CCCCCceeecCCCCC
Q psy826           89 LKRCCPVDHY----GADCKPCLGFP-NVCFGNGK---CKGNG--TRKGNGQCVCNKEYT  137 (329)
Q Consensus        89 ~~~~C~~G~~----G~~C~~C~~~~-~~C~~~~~---C~~~~--~~~gs~~C~C~~G~~  137 (329)
                      .++.|.+||.    |..|+.||.+. .+=.....   |-...  ...++-.|.|..||.
T Consensus       259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyy  317 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYY  317 (996)
T ss_pred             CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcc
Confidence            3566999996    67999998642 11111222   22111  136677888888875


No 84 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=52.28  E-value=12  Score=29.97  Aligned_cols=37  Identities=27%  Similarity=0.553  Sum_probs=25.6

Q ss_pred             cCCC-CCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826          104 PCLG-FPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN  142 (329)
Q Consensus       104 ~C~~-~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~  142 (329)
                      +|+. ..+-|.+ |.|.... ......|.|..||.|..|+
T Consensus        44 ~Cp~ey~~YClH-G~C~yI~-dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         44 LCGPEGDGYCLH-GDCIHAR-DIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             cCChhhCCEeEC-CEEEeec-cCCCceeECCCCccccccc
Confidence            4543 2345764 4887522 2567899999999999995


No 85 
>PF07699 GCC2_GCC3:  GCC2 and GCC3;  InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []:   Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction [].      Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [].   This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=41.16  E-value=46  Score=21.44  Aligned_cols=10  Identities=30%  Similarity=1.002  Sum_probs=7.0

Q ss_pred             CCCcCCCCcc
Q psy826          140 LCNECNTGYF  149 (329)
Q Consensus       140 ~C~~C~~g~y  149 (329)
                      .|..|+.|+|
T Consensus        10 ~C~~Cp~GtY   19 (48)
T PF07699_consen   10 KCQPCPKGTY   19 (48)
T ss_pred             ccCCCCCCcc
Confidence            5667777776


No 86 
>PHA02887 EGF-like protein; Provisional
Probab=40.04  E-value=20  Score=28.22  Aligned_cols=20  Identities=35%  Similarity=0.674  Sum_probs=15.7

Q ss_pred             CCccceeecccccCcCCCCCCCCCCc
Q psy826           78 SDLYTWLCINKLKRCCPVDHYGADCK  103 (329)
Q Consensus        78 ~~~~~~~C~~~~~~~C~~G~~G~~C~  103 (329)
                      .++.+..|.      |++||+|..|+
T Consensus       103 ~dL~epsCr------C~~GYtG~RCE  122 (126)
T PHA02887        103 IDLDEKFCI------CNKGYTGIRCD  122 (126)
T ss_pred             ccCCCceeE------CCCCcccCCCC
Confidence            455556666      99999999997


No 87 
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=39.52  E-value=61  Score=25.98  Aligned_cols=9  Identities=33%  Similarity=1.080  Sum_probs=5.2

Q ss_pred             CCcCCCCcc
Q psy826          141 CNECNTGYF  149 (329)
Q Consensus       141 C~~C~~g~y  149 (329)
                      |..|++|+|
T Consensus        44 C~kCPPGt~   52 (127)
T PHA02637         44 CLSCPPGTY   52 (127)
T ss_pred             cCCCCCCCE
Confidence            366666654


No 88 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=39.09  E-value=28  Score=26.81  Aligned_cols=33  Identities=24%  Similarity=0.675  Sum_probs=24.2

Q ss_pred             cCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCcccccc
Q psy826          196 DINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKL  248 (329)
Q Consensus       196 d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~  248 (329)
                      ..+.|.....|.+..+|.. ..+..|.                   |.+||.-
T Consensus        76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~-------------------Cl~GF~P  108 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS-NNSPKCS-------------------CLPGFEP  108 (110)
T ss_pred             cccCCCCccccCCccEeCC-CCCCceE-------------------CCCCcCC
Confidence            3456777678988899943 3456788                   9999964


No 89 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=36.97  E-value=13  Score=25.13  Aligned_cols=32  Identities=34%  Similarity=0.756  Sum_probs=17.1

Q ss_pred             CCCCCCcccCCCC-CCCCceeecCCCCCCCCCC
Q psy826          111 VCFGNGKCKGNGT-RKGNGQCVCNKEYTGELCN  142 (329)
Q Consensus       111 ~C~~~~~C~~~~~-~~gs~~C~C~~G~~g~~C~  142 (329)
                      +|+++|..--++. ..|.-.|.|+..|.|+.|.
T Consensus        18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS   50 (56)
T PF04863_consen   18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS   50 (56)
T ss_dssp             --TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred             CcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence            4777777654442 2566789999999999884


No 90 
>PF04706 Dickkopf_N:  Dickkopf N-terminal cysteine-rich region;  InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=34.26  E-value=66  Score=21.51  Aligned_cols=32  Identities=22%  Similarity=0.733  Sum_probs=18.9

Q ss_pred             CCCCCCCCCCceeecCCCCcccCCCcCCCCCC
Q psy826          200 CSDENICSGNQFCVNTEGSYRCMQCDPSCNGC  231 (329)
Q Consensus       200 C~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C  231 (329)
                      |.....|+.+.||......++|.+|...=..|
T Consensus         2 C~~D~dC~~g~yC~~~~~~~~C~~Cr~~~~rC   33 (52)
T PF04706_consen    2 CSSDEDCGYGKYCHSSSFESKCLPCRKRRKRC   33 (52)
T ss_pred             CcccccCCCCCCcCCCCCCccChhhccCCCCC
Confidence            33334666677887766667777655443333


No 91 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=33.90  E-value=39  Score=25.98  Aligned_cols=26  Identities=23%  Similarity=0.591  Sum_probs=19.9

Q ss_pred             CCCCCCCCcccCCCCCCCCceeecCCCCCC
Q psy826          109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTG  138 (329)
Q Consensus       109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g  138 (329)
                      ...|..++.|..    .....|.|.+||.-
T Consensus        83 y~~CG~~g~C~~----~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   83 YGFCGPNGICNS----NNSPKCSCLPGFEP  108 (110)
T ss_pred             ccccCCccEeCC----CCCCceECCCCcCC
Confidence            456989999963    44567999999963


No 92 
>PF07562 NCD3G:  Nine Cysteines Domain of family 3 GPCR;  InterPro: IPR011500 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents a conserved sequence, found in the extracellular region, that contains several highly-conserved Cys residues that are predicted to form disulphide bridges.; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway; PDB: 2E4X_B 2E4Y_A 2E4U_B 2E4V_B 2E4W_A.
Probab=33.45  E-value=11  Score=25.34  Aligned_cols=8  Identities=25%  Similarity=0.534  Sum_probs=2.2

Q ss_pred             CcCCCCcc
Q psy826          142 NECNTGYF  149 (329)
Q Consensus       142 ~~C~~g~y  149 (329)
                      ..|++|++
T Consensus         7 ~~C~pG~~   14 (54)
T PF07562_consen    7 EPCPPGQR   14 (54)
T ss_dssp             ----TTTE
T ss_pred             CCCCCCCE
Confidence            34555554


No 93 
>KOG0814|consensus
Probab=30.03  E-value=9.2  Score=32.53  Aligned_cols=26  Identities=27%  Similarity=0.553  Sum_probs=20.2

Q ss_pred             CchhhhhccC-cccccceEEEecccCc
Q psy826           16 DTAWEEEKQK-IYAKSEVRLIEIQEKM   41 (329)
Q Consensus        16 ~~~~ee~~~~-~y~~se~~~~e~~e~~   41 (329)
                      .|.|||+++. .++.|--.+++||+++
T Consensus       189 stV~EEk~~NPRLTk~~eeFv~IM~NL  215 (237)
T KOG0814|consen  189 STVWEEKNLNPRLTKSKEEFVKIMKNL  215 (237)
T ss_pred             eehhhhhccCcccccCHHHHHHHHHhc
Confidence            4689999984 6777777777888776


No 94 
>KOG3516|consensus
Probab=25.86  E-value=55  Score=35.85  Aligned_cols=32  Identities=34%  Similarity=0.839  Sum_probs=27.5

Q ss_pred             CCCCCCCCCcccCCCCCCCCceeecC-CCCCCCCCC
Q psy826          108 FPNVCFGNGKCKGNGTRKGNGQCVCN-KEYTGELCN  142 (329)
Q Consensus       108 ~~~~C~~~~~C~~~~~~~gs~~C~C~-~G~~g~~C~  142 (329)
                      .+++|..++.|..   .-..+.|.|. .||+|..|.
T Consensus       549 lPN~CehgG~C~Q---s~~~f~C~C~~TGY~GatCH  581 (1306)
T KOG3516|consen  549 LPNPCEHGGKCSQ---SWDDFECNCELTGYKGATCH  581 (1306)
T ss_pred             CCccccCCCcccc---cccceeEecccccccccccc
Confidence            5688999999997   5667999998 899999993


No 95 
>KOG4258|consensus
Probab=24.97  E-value=81  Score=33.48  Aligned_cols=88  Identities=25%  Similarity=0.580  Sum_probs=0.0

Q ss_pred             ceeecccccCcCCCCCC--CCCCc-cCCCCCCCCCCCCcccCCCCCCCCceee--cCCCC--------------------
Q psy826           82 TWLCINKLKRCCPVDHY--GADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCV--CNKEY--------------------  136 (329)
Q Consensus        82 ~~~C~~~~~~~C~~G~~--G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~--C~~G~--------------------  136 (329)
                      ...|......-|..|-.  |.-|- +|-+++..=.....|..-......+.|+  |+++|                    
T Consensus       213 ~~~CQK~Cp~~~g~gC~ang~cCH~eClGgC~~~~dptaC~aCr~~y~~G~CV~~Cp~~~Y~~e~~RCvt~~~C~~l~~~  292 (1025)
T KOG4258|consen  213 NSHCQKSCPENCGNGCTANGECCHEECLGGCSQPNDPTACVACRGLYDDGVCVEACPPDYYLFENWRCVTREECALLHSL  292 (1025)
T ss_pred             CCchhccCccccccCcccccccccHhhccCcCCCCChhHhHHhhccccCCceeccCCcchhhhccceeccHHHHHHhcCc


Q ss_pred             ---------CCCCC-CcCCCCccccccCcCCCCCcCCcccccCCcc
Q psy826          137 ---------TGELC-NECNTGYFQSYKDEKTILCSKCHASCESGCS  172 (329)
Q Consensus       137 ---------~g~~C-~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~  172 (329)
                               ++..| ..||+||-   .+..+..|.+|.+.|...|.
T Consensus       293 ~~~~i~G~~~~~~Cv~~CPsGy~---~N~~~~~C~~C~g~C~~vC~  335 (1025)
T KOG4258|consen  293 SNSVISGVIHAGQCVAKCPSGYK---RNSSSSECVKCEGPCPKVCE  335 (1025)
T ss_pred             ccccccceeccccchhhCCCcce---ecCcccceeccCCCCcceec


No 96 
>PHA02714 CD-30-like protein; Provisional
Probab=24.16  E-value=86  Score=23.84  Aligned_cols=18  Identities=28%  Similarity=0.933  Sum_probs=11.6

Q ss_pred             CcccCCCCCCccCCCCcee
Q psy826          170 GCSTGGPKGCTKCKSGWAA  188 (329)
Q Consensus       170 ~C~~~~~~~C~~C~~G~~~  188 (329)
                      .|....+..| .|.+|..-
T Consensus        52 PC~~ns~RvC-eCkpGm~C   69 (110)
T PHA02714         52 SCGPDKPRKC-QCGPGLKC   69 (110)
T ss_pred             cCCCCCCcee-cCCCCCEE
Confidence            4554455778 88887654


No 97 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=21.09  E-value=43  Score=25.88  Aligned_cols=30  Identities=23%  Similarity=0.438  Sum_probs=19.3

Q ss_pred             CCCCCCCCCCcccCCCCC--CCCceeecCCCC
Q psy826          107 GFPNVCFGNGKCKGNGTR--KGNGQCVCNKEY  136 (329)
Q Consensus       107 ~~~~~C~~~~~C~~~~~~--~gs~~C~C~~G~  136 (329)
                      ...+.|++||.|......  ..=|.|.|.+..
T Consensus        10 ~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~   41 (103)
T PF12955_consen   10 NATNNCSGHGSCVKKYGSGGGDCFACKCKPTV   41 (103)
T ss_pred             HhccCCCCCceEeeccCCCccceEEEEeeccc
Confidence            344679999999962211  133678888844


No 98 
>KOG4611|consensus
Probab=20.87  E-value=1.2e+02  Score=28.90  Aligned_cols=18  Identities=28%  Similarity=0.634  Sum_probs=12.2

Q ss_pred             CCCccCCCCceecCCCCc
Q psy826          177 KGCTKCKSGWAADKDIGC  194 (329)
Q Consensus       177 ~~C~~C~~G~~~~~~~~C  194 (329)
                      ..|..|..||+.++.+-|
T Consensus        98 afcgncasgfyrndngyc  115 (747)
T KOG4611|consen   98 AFCGNCASGFYRNDNGYC  115 (747)
T ss_pred             cccccccccceECCCccc
Confidence            456678888887765444


No 99 
>KOG3509|consensus
Probab=20.63  E-value=1e+02  Score=33.39  Aligned_cols=62  Identities=29%  Similarity=0.648  Sum_probs=37.6

Q ss_pred             ceeecCCCCCCCCCCcCCCCccccccCc-CCCCCc---------CCc---ccccCCcc-cCCCCCCccCCCCceecC
Q psy826          128 GQCVCNKEYTGELCNECNTGYFQSYKDE-KTILCS---------KCH---ASCESGCS-TGGPKGCTKCKSGWAADK  190 (329)
Q Consensus       128 ~~C~C~~G~~g~~C~~C~~g~y~~~~~~-~~~~C~---------~C~---~~C~~~C~-~~~~~~C~~C~~G~~~~~  190 (329)
                      -.|.|..||.|..|+.|.++|-...... ....+.         .|.   ..|. .|. ...+..|..|.+|++++.
T Consensus       718 ~~C~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~~~~~nt-~~q~~~~~~~~~~~~~g~~~da  793 (964)
T KOG3509|consen  718 EQCQCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCECNSHISQCEDDLGYNT-DCQNNTEGDRCELCSPGTYGDA  793 (964)
T ss_pred             cccccCccccCcccccccccccccccCCcCcccCcccccCCCcccccccccccc-cccccCccceeeecCCCccccC
Confidence            4799999999999998888764322100 000111         111   1222 343 445678889999999876


Done!