Query         psy9419
Match_columns 739
No_of_seqs    449 out of 2689
Neff          8.7 
Searched_HMMs 46136
Date          Fri Aug 16 18:45:57 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9419.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9419hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.7 3.3E-15 7.2E-20  163.0  22.1  236  195-489   700-947 (1289)
  2 KOG1214|consensus               99.7 2.3E-15   5E-20  164.2  18.4  209   27-281   693-910 (1289)
  3 KOG1217|consensus               99.7   7E-15 1.5E-19  166.7  23.5  276    5-384   106-389 (487)
  4 KOG1217|consensus               99.6 1.3E-13 2.8E-18  156.3  30.5  320  159-612    99-433 (487)
  5 KOG4289|consensus               99.6   2E-14 4.3E-19  163.7  19.5   84  131-217  1223-1308(2531)
  6 KOG4289|consensus               99.6 2.7E-14 5.8E-19  162.7  15.3  104  315-440  1180-1308(2531)
  7 KOG1219|consensus               99.3 5.8E-12 1.3E-16  148.9   8.6  109  150-282  3865-3976(4289)
  8 KOG1219|consensus               99.2 2.2E-11 4.7E-16  144.3   8.0  113  243-384  3859-3974(4289)
  9 KOG1225|consensus               99.2 3.7E-10 8.1E-15  123.6  14.3  192  432-673   160-365 (525)
 10 KOG0994|consensus               99.1   2E-09 4.3E-14  122.0  16.5  198  334-603   933-1143(1758)
 11 KOG1225|consensus               99.1 2.2E-09 4.7E-14  117.7  16.5  122  481-638   232-363 (525)
 12 KOG0994|consensus               99.0 1.3E-08 2.8E-13  115.7  17.3   30  522-562  1126-1155(1758)
 13 KOG4260|consensus               98.8 3.9E-09 8.4E-14  102.7   6.2  163   51-278   130-304 (350)
 14 KOG4260|consensus               98.6 1.1E-07 2.5E-12   92.6   6.7  127  431-602   166-305 (350)
 15 KOG1836|consensus               98.4 0.00036 7.7E-09   87.2  31.7  216  486-723   760-1033(1705)
 16 PF07645 EGF_CA:  Calcium-bindi  98.2 4.7E-07   1E-11   65.0   1.8   36   67-102     1-36  (42)
 17 KOG1836|consensus               98.0 0.00016 3.6E-09   90.1  19.0  110  426-549   903-1026(1705)
 18 KOG1226|consensus               97.9 9.8E-05 2.1E-09   83.0  12.3   99  155-284   514-621 (783)
 19 PF07645 EGF_CA:  Calcium-bindi  97.9 8.6E-06 1.9E-10   58.5   2.4   35  506-540     1-35  (42)
 20 KOG1226|consensus               97.8 7.7E-05 1.7E-09   83.8   9.9  149  515-693   467-636 (783)
 21 PF06247 Plasmod_Pvs28:  Plasmo  97.5 2.6E-05 5.7E-10   72.9   1.0  143   34-222     7-163 (197)
 22 PF12947 EGF_3:  EGF domain;  I  97.5 4.2E-05 9.1E-10   52.5   1.0   33   71-103     1-33  (36)
 23 smart00179 EGF_CA Calcium-bind  97.4 0.00021 4.5E-09   50.2   4.1   35  569-603     1-36  (39)
 24 PF12662 cEGF:  Complement Clr-  97.4 0.00012 2.6E-09   44.9   2.2   23   48-70      1-24  (24)
 25 PF00008 EGF:  EGF-like domain   97.4 8.7E-05 1.9E-09   49.7   1.7   29  575-603     2-31  (32)
 26 PF00008 EGF:  EGF-like domain   97.3 0.00012 2.7E-09   48.9   1.9   30  191-221     1-31  (32)
 27 smart00179 EGF_CA Calcium-bind  97.2 0.00047   1E-08   48.4   4.3   37  246-282     1-38  (39)
 28 PF12947 EGF_3:  EGF domain;  I  97.2 0.00018 3.9E-09   49.3   1.3   30  513-542     4-33  (36)
 29 PF12662 cEGF:  Complement Clr-  97.0 0.00063 1.4E-08   41.8   2.6   24  209-249     1-24  (24)
 30 cd00054 EGF_CA Calcium-binding  96.8  0.0016 3.5E-08   45.1   4.0   34  570-603     2-35  (38)
 31 cd00054 EGF_CA Calcium-binding  96.6  0.0031 6.7E-08   43.6   4.1   36  247-282     2-37  (38)
 32 PF06247 Plasmod_Pvs28:  Plasmo  96.3  0.0025 5.4E-08   59.9   2.6   95    4-103    65-163 (197)
 33 KOG1218|consensus               96.0     1.1 2.3E-05   47.7  21.6   84  434-540   125-209 (316)
 34 PF14670 FXa_inhibition:  Coagu  95.9  0.0035 7.6E-08   42.9   1.2   28   74-103     4-31  (36)
 35 cd00053 EGF Epidermal growth f  95.9   0.012 2.6E-07   40.0   3.9   28  576-603     5-32  (36)
 36 cd00053 EGF Epidermal growth f  95.8   0.011 2.4E-07   40.1   3.4   28   75-102     5-32  (36)
 37 smart00181 EGF Epidermal growt  95.8   0.014   3E-07   39.7   3.8   26  577-603     6-31  (35)
 38 smart00181 EGF Epidermal growt  95.4   0.016 3.5E-07   39.4   3.1   26   76-102     6-31  (35)
 39 PF07974 EGF_2:  EGF-like domai  95.3   0.016 3.5E-07   38.6   2.6   25  650-674     6-32  (32)
 40 KOG1218|consensus               95.2     0.9 1.9E-05   48.3  17.3   49  430-494    12-60  (316)
 41 PF07974 EGF_2:  EGF-like domai  94.9   0.032 6.9E-07   37.2   3.1   25   33-60      6-30  (32)
 42 PF12661 hEGF:  Human growth fa  94.0   0.021 4.5E-07   29.7   0.5   13  662-674     1-13  (13)
 43 PF14670 FXa_inhibition:  Coagu  92.8   0.078 1.7E-06   36.4   2.0   24  260-283    10-33  (36)
 44 cd01475 vWA_Matrilin VWA_Matri  91.6    0.16 3.5E-06   51.3   3.5   38   64-103   183-220 (224)
 45 smart00051 DSL delta serrate l  91.2    0.22 4.8E-06   39.0   3.1   21  654-674    42-63  (63)
 46 PF01683 EB:  EB module;  Inter  87.0     1.2 2.5E-05   33.4   4.3   43   13-60      1-48  (52)
 47 smart00051 DSL delta serrate l  84.7     1.1 2.5E-05   35.0   3.3   43  661-705    17-59  (63)
 48 PF01683 EB:  EB module;  Inter  82.5     1.9 4.2E-05   32.2   3.8   22  650-671    26-47  (52)
 49 PF12946 EGF_MSP1_1:  MSP1 EGF   82.5     1.1 2.4E-05   30.7   2.1   31   29-60      2-32  (37)
 50 PTZ00214 high cysteine membran  82.2      66  0.0014   38.8  18.1   15  269-283   682-696 (800)
 51 cd00055 EGF_Lam Laminin-type e  80.6     2.1 4.5E-05   31.8   3.3   31  522-563    13-43  (50)
 52 cd01475 vWA_Matrilin VWA_Matri  80.5     1.6 3.5E-05   43.9   3.6   37  184-222   183-220 (224)
 53 PF00053 Laminin_EGF:  Laminin   80.3     1.2 2.5E-05   32.9   1.8   31  521-562    11-41  (49)
 54 PF12946 EGF_MSP1_1:  MSP1 EGF   79.2     1.1 2.4E-05   30.7   1.2   28  576-603     4-32  (37)
 55 smart00180 EGF_Lam Laminin-typ  73.9     3.9 8.4E-05   29.8   3.0   25  522-548    12-36  (46)
 56 PF09064 Tme5_EGF_like:  Thromb  70.1     4.6 9.9E-05   27.1   2.3   23   39-63     10-32  (34)
 57 PF00954 S_locus_glycop:  S-loc  68.6     5.4 0.00012   35.1   3.4   34  569-603    76-109 (110)
 58 KOG3512|consensus               64.8      14 0.00029   40.4   5.9   28  423-450   404-431 (592)
 59 PF03302 VSP:  Giardia variant-  62.2      30 0.00064   38.2   8.3   52   52-109     3-56  (397)
 60 PHA02887 EGF-like protein; Pro  57.6     8.2 0.00018   33.7   2.3   26  325-350    96-123 (126)
 61 cd00055 EGF_Lam Laminin-type e  57.2      13 0.00027   27.6   3.0   19  427-445    13-31  (50)
 62 PF00053 Laminin_EGF:  Laminin   54.2     7.3 0.00016   28.6   1.3   20  426-445    11-30  (49)
 63 PTZ00214 high cysteine membran  51.9 4.2E+02  0.0091   32.2  16.0   13  268-280   750-762 (800)
 64 PHA03099 epidermal growth fact  49.3      16 0.00034   32.6   2.8   39  570-612    42-84  (139)
 65 PHA02887 EGF-like protein; Pro  48.5      14 0.00031   32.3   2.3   29  462-500    94-122 (126)
 66 PF00954 S_locus_glycop:  S-loc  48.4      17 0.00036   32.0   2.9   33  187-220    76-108 (110)
 67 smart00180 EGF_Lam Laminin-typ  48.0      20 0.00043   26.0   2.7   20  426-445    11-30  (46)
 68 KOG3512|consensus               47.4      32 0.00069   37.7   5.2   99    3-107   289-430 (592)
 69 PHA03099 epidermal growth fact  46.3      20 0.00042   32.0   2.9   37  247-284    42-82  (139)
 70 PF01414 DSL:  Delta serrate li  42.0     6.9 0.00015   30.7  -0.5   14  661-674    50-63  (63)
 71 PF12955 DUF3844:  Domain of un  32.2      25 0.00054   30.5   1.3   32   28-59      7-43  (103)
 72 PF12955 DUF3844:  Domain of un  28.1      36 0.00078   29.5   1.6   31  571-601     6-42  (103)
 73 PF04863 EGF_alliinase:  Alliin  27.8      31 0.00067   26.0   1.0   33  320-352    17-53  (56)
 74 KOG3516|consensus               27.3      56  0.0012   40.1   3.5   44  241-285   539-583 (1306)
 75 KOG3516|consensus               22.5      64  0.0014   39.6   2.8   37  566-603   541-578 (1306)

No 1  
>KOG1214|consensus
Probab=99.68  E-value=3.3e-15  Score=163.01  Aligned_cols=236  Identities=27%  Similarity=0.596  Sum_probs=161.3

Q ss_pred             CCCCCCCeeeccCC-ceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCCCC-CCCCCCCCCeeeecCCceEEe
Q psy9419         195 SPCASSALCVNEKG-GFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDECL-GVSPCASSALCVNEKGGFKCV  272 (739)
Q Consensus       195 ~~C~~~~~C~n~~g-~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~eC~-~~~~C~~~~~C~~~~g~~~C~  272 (739)
                      +-|..++.|....+ .|+|.|..||.|++                   ..|.|++||+ ..+.|..+++|++.+|+|+|.
T Consensus       700 h~cdt~a~C~pg~~~~~tcecs~g~~gdg-------------------r~c~d~~eca~~~~~CGp~s~Cin~pg~~rce  760 (1289)
T KOG1214|consen  700 HMCDTTARCHPGTGVDYTCECSSGYQGDG-------------------RNCVDENECATGFHRCGPNSVCINLPGSYRCE  760 (1289)
T ss_pred             cccCCCccccCCCCcceEEEEeeccCCCC-------------------CCCCChhhhccCCCCCCCCceeecCCCceeEE
Confidence            44666677776543 68999999999976                   4578999997 568899999999999999999


Q ss_pred             CCCCCCCCCCCccccCCCC--CCCccccCCCCCCCCCCCCCcccCCCCCCCCCCC--cccccC-CCCceeecCCCceeCC
Q psy9419         273 CPKGTTGDPYTLGCVGSGS--PRTECRVDKECSPSLQCRGGACVDPCRSVECGAH--ALCEPQ-DHRASCRCELGYTEGL  347 (739)
Q Consensus       273 C~~Gy~g~~c~~~c~~~~~--~~~~C~~~~~C~~~~~C~~g~C~~~C~~~~C~~~--~~C~~~-~g~~~C~C~~G~~g~~  347 (739)
                      |..||.......+|+.+..  ....|++.                   +..|...  +.|+.. .+.|+|.|.|||.|+.
T Consensus       761 C~~gy~F~dd~~tCV~i~~pap~n~Ce~g-------------------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG  821 (1289)
T KOG1214|consen  761 CRSGYEFADDRHTCVLITPPAPANPCEDG-------------------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDG  821 (1289)
T ss_pred             EeecceeccCCcceEEecCCCCCCccccC-------------------ccccCcCCceEEEecCCceEEEeecCCccCCc
Confidence            9999987766666765432  22223222                   1234433  344443 3479999999999976


Q ss_pred             CCc-cccCCCCCCCCCCCeeecCCCCCeeecCCCCccCCCCCCCccCCCCCCCCCCCCCCcccCCCccCCCCCCCCCCCc
Q psy9419         348 NGK-CVSLCEGIVCAPGAACIVTPAGPTCTCADGARGNPFPGGACYPDLCSATQPCPALSVCVAGRCKARCAGVVCGAGA  426 (739)
Q Consensus       348 ~~~-c~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~~~~g~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~  426 (739)
                      -.- .+|+|+++.|..+|.|.++++++.|+|.+||.|+.+   .|.+++- ...+|...          +-..+.|+..+
T Consensus       822 ~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf---~CVP~~~-~~T~C~~e----------r~hpl~chg~t  887 (1289)
T KOG1214|consen  822 HQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF---QCVPDTS-SLTPCEQE----------RFHPLQCHGST  887 (1289)
T ss_pred             cccccccccCccccCCCceEecCCCcceeecccCccCCCc---eecCCCc-cCCccccc----------cccceeecccc
Confidence            332 368999999999999999999999999999999944   7766421 11233221          01122344444


Q ss_pred             ee----cCCCCcccCCCCccCCCCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCC
Q psy9419         427 QC----DPALDRCVCPPFYVGDPEFNCVPPVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNS  489 (739)
Q Consensus       427 ~C----~~~~~~C~C~~g~~g~~~~~C~~~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~  489 (739)
                      .|    ++..+++.+.++=.|++...|..... .. ...|..+|.+.....     .+.++.|.|..
T Consensus       888 ~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~-~~-vp~Cd~hgh~ap~qc-----hG~~~~CwCvd  947 (1289)
T KOG1214|consen  888 GFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPE-QY-VPQCDDHGHFAPLQC-----HGKSDFCWCVD  947 (1289)
T ss_pred             ceeEeeCCCcccCCCCCCCCCCCCCCCCCccc-cc-CCCcccccccccccc-----CCCcceeEEec
Confidence            33    24567888888777776556654321 11 236778888876652     23558899976


No 2  
>KOG1214|consensus
Probab=99.66  E-value=2.3e-15  Score=164.23  Aligned_cols=209  Identities=31%  Similarity=0.680  Sum_probs=145.6

Q ss_pred             cCCCCC-CCCCCCCeeeeCCCCeeEEecCCCCccCCCCCCcccccccCCCCCCCCCCceeeCCCCceeeCCCCCcCCCC-
Q psy9419          27 ATCGTQ-GQCPGGAECVNIAGGVSYCACPKGFRPKEDGYCEDVDECAESRHLCGPGAVCINHPGSYTCQCPPNSSGDPL-  104 (739)
Q Consensus        27 ~~C~~~-~~C~~~g~C~~~~~g~~~C~C~~Gy~g~~~~~C~dideC~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~~-  104 (739)
                      ++|..+ +.|..++.|...++-.|+|+|..||.|. .++|.|++||++.++.|+++++|+|.+|+|+|.|..||.-... 
T Consensus       693 npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gd-gr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~  771 (1289)
T KOG1214|consen  693 NPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGD-GRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDR  771 (1289)
T ss_pred             ccceecCcccCCCccccCCCCcceEEEEeeccCCC-CCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCC
Confidence            355544 6777788898877777999999999986 3489999999999999999999999999999999999885431 


Q ss_pred             CCCccCCCCCCCCCCCCCCCCCccCccccCCCCccccCCCCCCccCCCCCCCCCCCc--eeccC--CCCeeeCCCCCccC
Q psy9419         105 LGCTHARVQCSRDADCDGPYERCVRAACVCPAPYYADVNDGHKCKSPCERFSCGINA--QCTPA--DPPQCTCLAGYTGE  180 (739)
Q Consensus       105 ~~C~~~~~~C~~~~~C~~~~~~C~~~~C~C~~g~~g~~~~~~~C~~~C~~~~C~~~~--~C~~~--~~~~C~C~~Gy~g~  180 (739)
                      ..|....+. ..++.|+.                              ..+.|..++  .|+..  +.|.|+|.|||.|+
T Consensus       772 ~tCV~i~~p-ap~n~Ce~------------------------------g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD  820 (1289)
T KOG1214|consen  772 HTCVLITPP-APANPCED------------------------------GSHTCAIAGQARCVHHGGSTYSCACLPGFSGD  820 (1289)
T ss_pred             cceEEecCC-CCCCcccc------------------------------CccccCcCCceEEEecCCceEEEeecCCccCC
Confidence            113222111 11122211                              123455444  45543  47999999999999


Q ss_pred             CCCCCcccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCCCCCCCCCCCCC
Q psy9419         181 ATLGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDECLGVSPCASSA  260 (739)
Q Consensus       181 ~~~~C~~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~eC~~~~~C~~~~  260 (739)
                      ... |.|+|||.. +-|...+.|+|++|+|.|+|.+||.|+.++  |+....+.+.|...+         ..+-.|+.+.
T Consensus       821 G~~-c~dvDeC~p-srChp~A~CyntpgsfsC~C~pGy~GDGf~--CVP~~~~~T~C~~er---------~hpl~chg~t  887 (1289)
T KOG1214|consen  821 GHQ-CTDVDECSP-SRCHPAATCYNTPGSFSCRCQPGYYGDGFQ--CVPDTSSLTPCEQER---------FHPLQCHGST  887 (1289)
T ss_pred             ccc-cccccccCc-cccCCCceEecCCCcceeecccCccCCCce--ecCCCccCCcccccc---------ccceeecccc
Confidence            874 899999997 999999999999999999999999999854  543222222222110         0022344443


Q ss_pred             ee---eecCCceEEeCCCCCCCCC
Q psy9419         261 LC---VNEKGGFKCVCPKGTTGDP  281 (739)
Q Consensus       261 ~C---~~~~g~~~C~C~~Gy~g~~  281 (739)
                      .+   ++. ..|++.+.++-.|+.
T Consensus       888 ~~~~~~Dp-~~~e~p~~~~ppG~~  910 (1289)
T KOG1214|consen  888 GFCWCVDP-DGHEVPGTQTPPGST  910 (1289)
T ss_pred             ceeEeeCC-CcccCCCCCCCCCCC
Confidence            22   333 457788777776654


No 3  
>KOG1217|consensus
Probab=99.66  E-value=7e-15  Score=166.65  Aligned_cols=276  Identities=33%  Similarity=0.738  Sum_probs=202.3

Q ss_pred             CCCCceeecCCCCeecCCeeecc-CCCCCC-CCCCCCeeeeCC--CCeeEEecCCCCccCCCCCCccc-ccccCCCCCCC
Q psy9419           5 QCNTLECQCRPPYQIVAGECTLA-TCGTQG-QCPGGAECVNIA--GGVSYCACPKGFRPKEDGYCEDV-DECAESRHLCG   79 (739)
Q Consensus         5 ~~~~~~C~C~~Gy~g~~~~C~~~-~C~~~~-~C~~~g~C~~~~--~g~~~C~C~~Gy~g~~~~~C~di-deC~~~~~~C~   79 (739)
                      ...+|.|.|++||.+..+  ... +|.... .+...+.|++..  ...|.|.|..||.+.   .+... ++|......|.
T Consensus       106 ~~~~~~c~c~~g~~~~~~--~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~---~~~~~~~~C~~~~~~c~  180 (487)
T KOG1217|consen  106 CVGSYECTCPPGYQGTPC--EGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGE---PCETDLDECIQYSSPCQ  180 (487)
T ss_pred             CCCCceeeCCCccccCcC--CcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccc---cccccccccccCCCCcC
Confidence            456889999999998743  222 344421 234456777642  246899999999998   66633 79987667899


Q ss_pred             CCCceeeCCCCceeeCCCCCcCCCCCCCccCCCCCCCCCCCCCCCCCccCccccCCCCccccCCCCCCccCCCCCCCCCC
Q psy9419          80 PGAVCINHPGSYTCQCPPNSSGDPLLGCTHARVQCSRDADCDGPYERCVRAACVCPAPYYADVNDGHKCKSPCERFSCGI  159 (739)
Q Consensus        80 ~~~~C~n~~gsy~C~C~~Gy~g~~~~~C~~~~~~C~~~~~C~~~~~~C~~~~C~C~~g~~g~~~~~~~C~~~C~~~~C~~  159 (739)
                      +++.|.+..++|.|.|+++|++...          ...                                        ..
T Consensus       181 ~~~~C~~~~~~~~C~c~~~~~~~~~----------~~~----------------------------------------~~  210 (487)
T KOG1217|consen  181 NGGTCVNTGGSYLCSCPPGYTGSTC----------ETT----------------------------------------GN  210 (487)
T ss_pred             CCcccccCCCCeeEeCCCCccCCcC----------cCC----------------------------------------CC
Confidence            9999999999999999999999742          110                                        11


Q ss_pred             CceeccCCCCeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCccCCCCCCccccc
Q psy9419         160 NAQCTPADPPQCTCLAGYTGEATLGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRV  239 (739)
Q Consensus       160 ~~~C~~~~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~  239 (739)
                      .+.|+..  +.|.+.+||.+..+.  .++.++.. .   + ++|+++.++|+|+|++||++...                
T Consensus       211 ~~~c~~~--~~~~~~~g~~~~~c~--~~~~~~~~-~---~-~~c~~~~~~~~C~~~~g~~~~~~----------------  265 (487)
T KOG1217|consen  211 GGTCVDS--VACSCPPGARGPECE--VSIVECAS-G---D-GTCVNTVGSYTCRCPEGYTGDAC----------------  265 (487)
T ss_pred             CceEecc--eeccCCCCCCCCCcc--cccccccC-C---C-CcccccCCceeeeCCCCcccccc----------------
Confidence            2233322  578889999987765  66777765 2   4 79999999999999999999751                


Q ss_pred             CCCCcccCCCCCCCCCCCCCCeeeecCCceEEeCCCCCCCCCCCccccCCCCCCCccccCCCCCCCCCCCCCcccCCCCC
Q psy9419         240 DKVGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKECSPSLQCRGGACVDPCRS  319 (739)
Q Consensus       240 ~~~~C~d~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~c~~~~~~~~~C~~~~~C~~~~~C~~g~C~~~C~~  319 (739)
                        ..+.++++|.....|.++++|++..+.|.|.|++||+|..+ .          .+.+..+|...           -..
T Consensus       266 --~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~-~----------~~~~~~~C~~~-----------~~~  321 (487)
T KOG1217|consen  266 --VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC-T----------ECVDVDECSPR-----------NAG  321 (487)
T ss_pred             --ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC-c----------ccccccccccc-----------ccC
Confidence              23578999984324999999999999999999999999875 1          13333333221           112


Q ss_pred             CCCCCCccc--ccCCCCceeecCCCceeCCCCccccCCCCCCCCCCCeeec-CCCCCeeecCCCCccC
Q psy9419         320 VECGAHALC--EPQDHRASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIV-TPAGPTCTCADGARGN  384 (739)
Q Consensus       320 ~~C~~~~~C--~~~~g~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~-~~~~~~C~C~~Gy~g~  384 (739)
                      .+|.+++.|  ......+.|.|..||.|..++.-.++|...++..++.|++ ..++|.|.|+.+|.+.
T Consensus       322 ~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  322 GPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGK  389 (487)
T ss_pred             CcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccC
Confidence            446667777  3334578899999999998876445788888999999999 6899999999999983


No 4  
>KOG1217|consensus
Probab=99.64  E-value=1.3e-13  Score=156.31  Aligned_cols=320  Identities=33%  Similarity=0.743  Sum_probs=214.5

Q ss_pred             CCceeccC-CCCeeeCCCCCccCCCCCCcccCCCCCCCC--CCCCCeeecc---CCceEeeCCCCCcCCCCCCCccCCCC
Q psy9419         159 INAQCTPA-DPPQCTCLAGYTGEATLGCLDVDECLGVSP--CASSALCVNE---KGGFKCVCPKGTTGDPYTLGCVGSGS  232 (739)
Q Consensus       159 ~~~~C~~~-~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~--C~~~~~C~n~---~g~~~C~C~~Gy~g~~~~~~C~~~~~  232 (739)
                      ..+.+... ..+.|.|++||.|..+..   ..+|.. .+  +...+.|...   ...|.|+|..||.+....        
T Consensus        99 ~~~~~~~~~~~~~c~c~~g~~~~~~~~---~~~C~~-~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~--------  166 (487)
T KOG1217|consen   99 LCGECVDCVGSYECTCPPGYQGTPCEG---ECECVT-GPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCE--------  166 (487)
T ss_pred             CCccccCCCCCceeeCCCccccCcCCc---ceeecC-CCCCeeCchhhcCCCCCCCceeeeeCCCccccccc--------
Confidence            34444443 578999999999987642   114544 22  3445577764   358999999999998732        


Q ss_pred             CCcccccCCCCcccCCCCC-CCCCCCCCCeeeecCCceEEeCCCCCCCCCCCccccCCCCCCCccccCCCCCCCCCCCCC
Q psy9419         233 PRTECRVDKVGCLDVDECL-GVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKECSPSLQCRGG  311 (739)
Q Consensus       233 ~~~~c~~~~~~C~d~~eC~-~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~c~~~~~~~~~C~~~~~C~~~~~C~~g  311 (739)
                                  .+.++|. ...+|.+.+.|.+..++|.|.|+++|.+..++..                          
T Consensus       167 ------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~--------------------------  208 (487)
T KOG1217|consen  167 ------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT--------------------------  208 (487)
T ss_pred             ------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC--------------------------
Confidence                        2336776 4567999999999999999999999999875421                          


Q ss_pred             cccCCCCCCCCCCCcccccCCCCceeecCCCceeCCCCccccCCCCCCCCCCCeeecCCCCCeeecCCCCccCCCCCCCc
Q psy9419         312 ACVDPCRSVECGAHALCEPQDHRASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIVTPAGPTCTCADGARGNPFPGGAC  391 (739)
Q Consensus       312 ~C~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~~~~g~~C  391 (739)
                                 ...+.|+..   +.|.+.+||.+..+...+..+...   + ++|++..++|+|.|++||++...  ..+
T Consensus       209 -----------~~~~~c~~~---~~~~~~~g~~~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~--~~~  268 (487)
T KOG1217|consen  209 -----------GNGGTCVDS---VACSCPPGARGPECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC--VTC  268 (487)
T ss_pred             -----------CCCceEecc---eeccCCCCCCCCCcccccccccCC---C-CcccccCCceeeeCCCCcccccc--cee
Confidence                       122334333   578999999988777655555433   4 89999999999999999999820  011


Q ss_pred             -cCCCCCCCCCCCCCCcccCCCccCCCCCCCCCCCceecCCCCcccCCCCccCCCCccccCCCCCCCC-----CCCCCCC
Q psy9419         392 -YPDLCSATQPCPALSVCVAGRCKARCAGVVCGAGAQCDPALDRCVCPPFYVGDPEFNCVPPVTMPVC-----IPPCGPN  465 (739)
Q Consensus       392 -~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~~~C~C~~g~~g~~~~~C~~~~~~~~C-----~~~C~~~  465 (739)
                       ..++|.....|...+.|++.                  ...+.|.|++||.|...   ........|     ..+|.++
T Consensus       269 ~~~~~C~~~~~c~~~~~C~~~------------------~~~~~C~C~~g~~g~~~---~~~~~~~~C~~~~~~~~c~~g  327 (487)
T KOG1217|consen  269 VDVDSCALIASCPNGGTCVNV------------------PGSYRCTCPPGFTGRLC---TECVDVDECSPRNAGGPCANG  327 (487)
T ss_pred             eeccccCCCCccCCCCeeecC------------------CCcceeeCCCCCCCCCC---ccccccccccccccCCcCCCC
Confidence             22445433224444444433                  23489999999999853   111112334     2357777


Q ss_pred             CccccCCCCCCCCCCCCceeecCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCeeec-CCCceEEeCCCCCccCCC
Q psy9419         466 AHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCGTAGPQDR-GSCDSGAGLCGPGAQCLE-TGGSVECQCPAGYKGNPY  543 (739)
Q Consensus       466 ~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~~~~~~~~-~~C~~~~~~C~~~g~C~~-~~g~~~C~C~~Gy~g~~c  543 (739)
                      ++|....      ....+.|.|..||.|.   .|+     .. ++|..  ..+..++.|++ ..++|.|.|+.+|.+.. 
T Consensus       328 ~~C~~~~------~~~~~~C~c~~~~~g~---~C~-----~~~~~C~~--~~~~~~~~c~~~~~~~~~c~~~~~~~~~~-  390 (487)
T KOG1217|consen  328 GTCNTLG------SFGGFRCACGPGFTGR---RCE-----DSNDECAS--SPCCPGGTCVNETPGSYRCACPAGFAGKA-  390 (487)
T ss_pred             cccccCC------CCCCCCcCCCCCCCCC---ccc-----cCCccccC--CccccCCEeccCCCCCeEecCCCccccCC-
Confidence            7883222      0146789999999998   887     44 48887  56888999999 68899999999888730 


Q ss_pred             cccCCCceeeeCCCCcccCCCCCCccCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCCCCCcceeCc
Q psy9419         544 VQCVGGSVECQCPAGYKGNPYVQCVDIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGNPFVACTPVA  612 (739)
Q Consensus       544 ~~C~~g~~~C~C~~G~~g~~~~~C~~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~  612 (739)
                                       ......+.++++|..      .+.|++..+++.|. ++ +...+. .|.+++
T Consensus       391 -----------------~~~~~~~~~~~~c~~------~~~c~~~~~~~~c~-~~-~~~~~~-~~~~~~  433 (487)
T KOG1217|consen  391 -----------------NGDGVGCEDIDECSG------CGDCVNGPGGGACT-PP-GLVSPG-TCDDID  433 (487)
T ss_pred             -----------------ccccccccccccccC------CcceeccCCCCccc-cC-cccCCc-ceeccc
Confidence                             011135677888832      56788889999999 88 433333 444443


No 5  
>KOG4289|consensus
Probab=99.61  E-value=2e-14  Score=163.75  Aligned_cols=84  Identities=32%  Similarity=0.746  Sum_probs=63.9

Q ss_pred             cccCCCCccccCCCCCCccCCCCCCCCCCCceeccC-CCCeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCeeecc-CC
Q psy9419         131 ACVCPAPYYADVNDGHKCKSPCERFSCGINAQCTPA-DPPQCTCLAGYTGEATLGCLDVDECLGVSPCASSALCVNE-KG  208 (739)
Q Consensus       131 ~C~C~~g~~g~~~~~~~C~~~C~~~~C~~~~~C~~~-~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~C~~~~~C~n~-~g  208 (739)
                      +|.||+||+|+.|+..  .+.|.+.||++|++|... ++|+|.|.+||+|..|+.=...-.|.. ..|.++++|++. +|
T Consensus      1223 rCrCPpGFTgd~CeTe--iDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvp-GvC~nggtC~~~~ng 1299 (2531)
T KOG4289|consen 1223 RCRCPPGFTGDYCETE--IDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVP-GVCKNGGTCVNLLNG 1299 (2531)
T ss_pred             eEeCCCCCCcccccch--hHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcccc-ceecCCCEEeecCCC
Confidence            4555555555443322  255678899999999986 899999999999999872222345776 889999999986 58


Q ss_pred             ceEeeCCCC
Q psy9419         209 GFKCVCPKG  217 (739)
Q Consensus       209 ~~~C~C~~G  217 (739)
                      .|.|.|+.|
T Consensus      1300 gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1300 GFCCHCPYG 1308 (2531)
T ss_pred             ceeccCCCc
Confidence            899999998


No 6  
>KOG4289|consensus
Probab=99.57  E-value=2.7e-14  Score=162.70  Aligned_cols=104  Identities=27%  Similarity=0.560  Sum_probs=83.3

Q ss_pred             CCCCCCCCCCCcccccC----------------------CCCceeecCCCceeCCCCccccCCCCCCCCCCCeeecCCCC
Q psy9419         315 DPCRSVECGAHALCEPQ----------------------DHRASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIVTPAG  372 (739)
Q Consensus       315 ~~C~~~~C~~~~~C~~~----------------------~g~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~~~~~  372 (739)
                      +.|...||.+...|+..                      .++++|+|++||+|+.|+..+|.|...||.++++|....|+
T Consensus      1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEgg 1259 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGG 1259 (2531)
T ss_pred             chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCc
Confidence            44555677777777432                      35789999999999999999999999999999999999999


Q ss_pred             CeeecCCCCccCCCCCCCccCCCCCCCCCCCCCCcccCCCccCCCCCCCCCCCceecC---CCCcccCCCC
Q psy9419         373 PTCTCADGARGNPFPGGACYPDLCSATQPCPALSVCVAGRCKARCAGVVCGAGAQCDP---ALDRCVCPPF  440 (739)
Q Consensus       373 ~~C~C~~Gy~g~~~~g~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~---~~~~C~C~~g  440 (739)
                      |+|.|.+||+|.     +||.+.        ..+.|++|.         |.++++|..   +.+.|.|+.|
T Consensus      1260 YtCeCrpg~tGe-----hCEvs~--------~agrCvpGv---------C~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1260 YTCECRPGFTGE-----HCEVSA--------RAGRCVPGV---------CKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             eeEEecCCcccc-----ceeeec--------ccCccccce---------ecCCCEEeecCCCceeccCCCc
Confidence            999999999999     887632        234556553         556777764   4678999998


No 7  
>KOG1219|consensus
Probab=99.28  E-value=5.8e-12  Score=148.94  Aligned_cols=109  Identities=30%  Similarity=0.820  Sum_probs=101.7

Q ss_pred             CCCCCCCCCCCceeccC--CCCeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCc
Q psy9419         150 SPCERFSCGINAQCTPA--DPPQCTCLAGYTGEATLGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGC  227 (739)
Q Consensus       150 ~~C~~~~C~~~~~C~~~--~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C  227 (739)
                      ++|..+||+++|+|+..  ++|+|.|++-|+|..|+  .++.+|.+ +||..+|+|+...+.|.|.|+.||+|..    |
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE--i~~epC~s-nPC~~GgtCip~~n~f~CnC~~gyTG~~----C 3937 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE--IDLEPCAS-NPCLTGGTCIPFYNGFLCNCPNGYTGKR----C 3937 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccc--cccccccC-CCCCCCCEEEecCCCeeEeCCCCccCce----e
Confidence            68999999999999985  68999999999999998  89999999 9999999999999999999999999988    5


Q ss_pred             cCCCCCCcccccCCCCccc-CCCCCCCCCCCCCCeeeecCCceEEeCCCCCCCCCC
Q psy9419         228 VGSGSPRTECRVDKVGCLD-VDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPY  282 (739)
Q Consensus       228 ~~~~~~~~~c~~~~~~C~d-~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c  282 (739)
                      +                .+ ++||. .++|.++|.|++..|+|+|.|-+||.|..|
T Consensus      3938 e----------------~~Gi~eCs-~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3938 E----------------ARGISECS-KNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred             e----------------cccccccc-cccccCCceeeccCCceEeccChhHhcccC
Confidence            4                33 89998 799999999999999999999999999875


No 8  
>KOG1219|consensus
Probab=99.20  E-value=2.2e-11  Score=144.26  Aligned_cols=113  Identities=33%  Similarity=0.780  Sum_probs=102.4

Q ss_pred             CcccC-CCCCCCCCCCCCCeeeecC-CceEEeCCCCCCCCCCCccccCCCCCCCccccCCCCCCCCCCCCCcccCCCCCC
Q psy9419         243 GCLDV-DECLGVSPCASSALCVNEK-GGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKECSPSLQCRGGACVDPCRSV  320 (739)
Q Consensus       243 ~C~d~-~eC~~~~~C~~~~~C~~~~-g~~~C~C~~Gy~g~~c~~~c~~~~~~~~~C~~~~~C~~~~~C~~g~C~~~C~~~  320 (739)
                      .|... +.|. .+||+++|+|+..+ |+|+|.|++-|.|..|+..                            +.+|.++
T Consensus      3859 gC~l~~d~C~-~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~----------------------------~epC~sn 3909 (4289)
T KOG1219|consen 3859 GCSLLTDPCN-DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID----------------------------LEPCASN 3909 (4289)
T ss_pred             cccccccccc-cCcccCCCEecCCCCCceEEeCcccccCcccccc----------------------------cccccCC
Confidence            45444 7787 69999999998765 6799999999999987632                            7889999


Q ss_pred             CCCCCcccccCCCCceeecCCCceeCCCCcc-ccCCCCCCCCCCCeeecCCCCCeeecCCCCccC
Q psy9419         321 ECGAHALCEPQDHRASCRCELGYTEGLNGKC-VSLCEGIVCAPGAACIVTPAGPTCTCADGARGN  384 (739)
Q Consensus       321 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~c-~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~  384 (739)
                      ||..+++|+...++|.|.|+.||+|..|+.. +++|+.++|.++|.|++..|+|.|.|.+||.|.
T Consensus      3910 PC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3910 PCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred             CCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhcc
Confidence            9999999999999999999999999999986 899999999999999999999999999999998


No 9  
>KOG1225|consensus
Probab=99.15  E-value=3.7e-10  Score=123.63  Aligned_cols=192  Identities=27%  Similarity=0.663  Sum_probs=113.1

Q ss_pred             CCcccCCCCccCCCCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCCCCCCCCCCCCCCCCCCC----C
Q psy9419         432 LDRCVCPPFYVGDPEFNCVPPVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCGTAGPQD----R  507 (739)
Q Consensus       432 ~~~C~C~~g~~g~~~~~C~~~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~~~~~~~----~  507 (739)
                      .++|.+.+++++...   .    ...+...+..++.+..            +.+.+..+|++.   .+......+    .
T Consensus       160 ~~~c~~~~~~~~~~~---g----~~~~~~~~~~hg~~~~------------~~~l~~~~~s~~---~~~~~~~~~~~~~~  217 (525)
T KOG1225|consen  160 NGVCSLKPNPFGAEC---G----QYKCPNDGSGHGRYYF------------GNCLSGISASGE---TCNQLGCNDDCFRT  217 (525)
T ss_pred             cccccccCCcccccc---c----eecCCcCCCCCcccee------------cccccccCcchh---hhhcccCCccceec
Confidence            456777777776531   1    1123334556666654            347888888877   443211111    0


Q ss_pred             CCCCCCCCCCCCCCeeecCCCceEEeCCCCCccCCCc--ccCCC--------ceeeeCCCCcccCCCCCCccCCCCCCCC
Q psy9419         508 GSCDSGAGLCGPGAQCLETGGSVECQCPAGYKGNPYV--QCVGG--------SVECQCPAGYKGNPYVQCVDIDECWSSN  577 (739)
Q Consensus       508 ~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~--~C~~g--------~~~C~C~~G~~g~~~~~C~~id~C~~~~  577 (739)
                      .-+..++      ..|++..-.+.|.|+.+|+|..+.  .|..+        ..+|.|++||+|.   +|.. -.|  +.
T Consensus       218 ~r~~~~~------~~~~~~~~~~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~---dC~e-~~C--p~  285 (525)
T KOG1225|consen  218 GRCREGR------CFCTAGFFDGICECPEGYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGD---DCDE-LVC--PV  285 (525)
T ss_pred             cccccCc------ccccccccCceeecCCceeCCccccccCCCCCcccceEeCCeEeCCCCCcCC---CCCc-ccC--Cc
Confidence            1111111      123333333478888888887654  12211        1377788888886   4432 235  34


Q ss_pred             CCCCCCEEeeCCCCeeeecCCCCCCCCCCcceeCccCCCCCCCCCcccCCCCCCCCCCceecCCcccCCCCCCCCCCCCe
Q psy9419         578 TCGSNAVCINTPGSYDCRCKEGNAGNPFVACTPVAVVPHSCEDPATCVCSKNAPCPSGYVCKNSRCTDLCANVRCGPRAL  657 (739)
Q Consensus       578 ~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~C~~~~~C~~~~~c~C~~G~~c~~~~C~~~C~~~~C~~~~~  657 (739)
                      +|+.++.+++.    +|+|++||+|.   .|+...+ +..|+++|.|+ .+.|.|.+||+  +..|+..    .|.+++.
T Consensus       286 ~cs~~g~~~~g----~CiC~~g~~G~---dCs~~~c-padC~g~G~Ci-~G~C~C~~Gy~--G~~C~~~----~C~~~g~  350 (525)
T KOG1225|consen  286 DCSGGGVCVDG----ECICNPGYSGK---DCSIRRC-PADCSGHGKCI-DGECLCDEGYT--GELCIQR----ACSGGGQ  350 (525)
T ss_pred             ccCCCceecCC----EeecCCCcccc---ccccccC-CccCCCCCccc-CCceEeCCCCc--CCccccc----ccCCCce
Confidence            47777777644    68888888887   5654442 46778888887 66788888887  5555544    3777788


Q ss_pred             ecCceeeCCCCCccCC
Q psy9419         658 CVQGQCLCPSDLIGNP  673 (739)
Q Consensus       658 C~~~~C~C~~Gy~G~~  673 (739)
                      |++. |+|..||.|..
T Consensus       351 cv~g-C~C~~Gw~G~d  365 (525)
T KOG1225|consen  351 CVNG-CKCKKGWRGPD  365 (525)
T ss_pred             eccC-ceeccCccCCC
Confidence            8877 88888888766


No 10 
>KOG0994|consensus
Probab=99.09  E-value=2e-09  Score=122.00  Aligned_cols=198  Identities=30%  Similarity=0.659  Sum_probs=102.4

Q ss_pred             CceeecCCCceeCCCCccccCCCCCCCCCCCeeecCCCCCeeecCCCCccCCCCCCCccCCCCCCCCCCCCCCcccC--C
Q psy9419         334 RASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIVTPAGPTCTCADGARGNPFPGGACYPDLCSATQPCPALSVCVA--G  411 (739)
Q Consensus       334 ~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~~~~g~~C~~~~C~~~~~C~~~~~C~~--~  411 (739)
                      ...|.|.+||+|.+|+.                          |.++|+|+|..|+.|+.-+|+.+..=...+.|..  |
T Consensus       933 ~ivC~C~~GY~G~RCe~--------------------------CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG  986 (1758)
T KOG0994|consen  933 QIVCHCQEGYSGSRCEI--------------------------CADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATG  986 (1758)
T ss_pred             ceeeecccCccccchhh--------------------------hcccccCCcccCCccccccccCCcCccCCCccchhhc
Confidence            35799999999987653                          5667888888888888777765433333333321  1


Q ss_pred             CccCCCCCCCCCCCceecCCCCcc-cCCCCccCCC-CccccCCCCCCCCCCCC-CCCCccccCCCCCCCCCCCCceeecC
Q psy9419         412 RCKARCAGVVCGAGAQCDPALDRC-VCPPFYVGDP-EFNCVPPVTMPVCIPPC-GPNAHCEYNSESPGSSPGSDNICVCN  488 (739)
Q Consensus       412 ~C~~~C~~~~C~~~~~C~~~~~~C-~C~~g~~g~~-~~~C~~~~~~~~C~~~C-~~~~~C~~~~~~~~~~~~~~~~C~C~  488 (739)
                      .|.+      |..+    .....| .|.+||.|+. .+.|..-    .|..-= .+-++|..          .+.+|.|.
T Consensus       987 ~CLk------CL~h----TeG~hCe~Ck~Gf~GdA~~q~CqrC----~Cn~LGTn~~~~CDr----------~tGQCpCl 1042 (1758)
T KOG0994|consen  987 ACLK------CLYH----TEGDHCEHCKDGFYGDALRQNCQRC----VCNFLGTNSTCHCDR----------FTGQCPCL 1042 (1758)
T ss_pred             hhhh------hhhc----ccccchhhccccchhHHHHhhhhhh----eccccccCCcccccc----------ccCcCCCC
Confidence            1110      1000    123345 5889999986 1222210    010000 01134443          45788898


Q ss_pred             CCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCC--CCeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCCC
Q psy9419         489 SGTHGNPYAGCGTA--GPQDRGSCDSGAGLCGP--GAQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNPY  564 (739)
Q Consensus       489 ~Gy~g~~~~~C~~~--~~~~~~~C~~~~~~C~~--~g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~~  564 (739)
                      |.-.|.....|..+  ....-.-|.+  ..|.+  +-+|..-.|  +|+|++||-|..|.+|.         .-|.|++.
T Consensus      1043 pNv~G~~CDqCA~N~w~laSG~GCe~--C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C~qCq---------el~WGdP~ 1109 (1758)
T KOG0994|consen 1043 PNVQGVRCDQCAENHWNLASGEGCEP--CNCDPIGGPQCNEFTG--QCQCKPGFGGRTCSQCQ---------ELYWGDPN 1109 (1758)
T ss_pred             cccccccccccccchhccccCCCCCc--cCCCccCCcccccccc--ceeccCCCCCcchhHHH---------HhhcCCCC
Confidence            88888822222211  0000111222  22221  224555555  78888888887776653         35666665


Q ss_pred             CCCccCCCCCCCCCCCCCC----EEeeCCCCeeeecCCCCCCC
Q psy9419         565 VQCVDIDECWSSNTCGSNA----VCINTPGSYDCRCKEGNAGN  603 (739)
Q Consensus       565 ~~C~~id~C~~~~~C~~~g----~C~~~~g~~~C~C~~G~~g~  603 (739)
                      +.|.       .-.|...|    .|....|  .|+|.+|..|.
T Consensus      1110 ~~C~-------aCdCd~rG~~tpQCdr~tG--~C~C~~Gv~G~ 1143 (1758)
T KOG0994|consen 1110 EKCR-------ACDCDPRGIETPQCDRATG--RCVCRPGVGGP 1143 (1758)
T ss_pred             CCce-------ecCCCCCCCCCCCccccCC--ceeecCCCCCc
Confidence            5442       11222222    2433333  57777777776


No 11 
>KOG1225|consensus
Probab=99.09  E-value=2.2e-09  Score=117.70  Aligned_cols=122  Identities=37%  Similarity=0.940  Sum_probs=91.8

Q ss_pred             CCceeecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecCCCceEEeCCCCCccCCCcc--c----CCC----c
Q psy9419         481 SDNICVCNSGTHGNPYAGCGTAGPQDRGSCDSGAGLCGPGAQCLETGGSVECQCPAGYKGNPYVQ--C----VGG----S  550 (739)
Q Consensus       481 ~~~~C~C~~Gy~g~~~~~C~~~~~~~~~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~~--C----~~g----~  550 (739)
                      ..++|.|+.+|+|.   .|+      ...|.   ..|..++.|++.    +|+|++||+|..|+.  |    ..+    .
T Consensus       232 ~~~ic~c~~~~~g~---~c~------~~~C~---~~c~~~g~c~~G----~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~  295 (525)
T KOG1225|consen  232 FDGICECPEGYFGP---LCS------TIYCP---GGCTGRGQCVEG----RCICPPGFTGDDCDELVCPVDCSGGGVCVD  295 (525)
T ss_pred             cCceeecCCceeCC---ccc------cccCC---CCCcccceEeCC----eEeCCCCCcCCCCCcccCCcccCCCceecC
Confidence            34578899999888   665      23344   456666778765    699999999988763  3    211    2


Q ss_pred             eeeeCCCCcccCCCCCCccCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCCCCCcceeCccCCCCCCCCCcccCCCCC
Q psy9419         551 VECQCPAGYKGNPYVQCVDIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGNPFVACTPVAVVPHSCEDPATCVCSKNA  630 (739)
Q Consensus       551 ~~C~C~~G~~g~~~~~C~~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~C~~~~~C~~~~~c  630 (739)
                      .+|.|++||+|.   .|. +..|  +..|.++|.|+  .|  +|+|.+||+|.   .|+..     .|.+++.|+..  |
T Consensus       296 g~CiC~~g~~G~---dCs-~~~c--padC~g~G~Ci--~G--~C~C~~Gy~G~---~C~~~-----~C~~~g~cv~g--C  355 (525)
T KOG1225|consen  296 GECICNPGYSGK---DCS-IRRC--PADCSGHGKCI--DG--ECLCDEGYTGE---LCIQR-----ACSGGGQCVNG--C  355 (525)
T ss_pred             CEeecCCCcccc---ccc-cccC--CccCCCCCccc--CC--ceEeCCCCcCC---ccccc-----ccCCCceeccC--c
Confidence            389999999996   553 3446  68999999999  23  79999999999   67653     39999999977  9


Q ss_pred             CCCCCcee
Q psy9419         631 PCPSGYVC  638 (739)
Q Consensus       631 ~C~~G~~c  638 (739)
                      .|..||..
T Consensus       356 ~C~~Gw~G  363 (525)
T KOG1225|consen  356 KCKKGWRG  363 (525)
T ss_pred             eeccCccC
Confidence            99999993


No 12 
>KOG0994|consensus
Probab=98.98  E-value=1.3e-08  Score=115.65  Aligned_cols=30  Identities=40%  Similarity=0.883  Sum_probs=21.7

Q ss_pred             eeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccC
Q psy9419         522 QCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGN  562 (739)
Q Consensus       522 ~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~  562 (739)
                      .|....|  +|+|.+|-.|..|++|         ..||.|.
T Consensus      1126 QCdr~tG--~C~C~~Gv~G~rCdqC---------aRgy~G~ 1155 (1758)
T KOG0994|consen 1126 QCDRATG--RCVCRPGVGGPRCDQC---------ARGYSGQ 1155 (1758)
T ss_pred             CccccCC--ceeecCCCCCcchhhh---------hhhhcCC
Confidence            3555555  7999999988877754         5678874


No 13 
>KOG4260|consensus
Probab=98.85  E-value=3.9e-09  Score=102.66  Aligned_cols=163  Identities=29%  Similarity=0.680  Sum_probs=106.7

Q ss_pred             EecCCCCccCCCCCCcccccccCCCCCCCCCCceee---CCCCceeeCCCCCcCCCCCCCccCCCCCCCCCCCCCCCCCc
Q psy9419          51 CACPKGFRPKEDGYCEDVDECAESRHLCGPGAVCIN---HPGSYTCQCPPNSSGDPLLGCTHARVQCSRDADCDGPYERC  127 (739)
Q Consensus        51 C~C~~Gy~g~~~~~C~dideC~~~~~~C~~~~~C~n---~~gsy~C~C~~Gy~g~~~~~C~~~~~~C~~~~~C~~~~~~C  127 (739)
                      =-|++|-.|.+...|...   +  ..+|..++.|.-   ..|+-.|.|.+||+|..+..|..                  
T Consensus       130 vCCp~gtyGpdCl~Cpgg---s--er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~------------------  186 (350)
T KOG4260|consen  130 VCCPDGTYGPDCLQCPGG---S--ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGI------------------  186 (350)
T ss_pred             eccCCCCcCCccccCCCC---C--cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccch------------------
Confidence            348888888832233211   1  146887888873   35778999999999987553322                  


Q ss_pred             cCccccCCCCcccc-CCC-CC---CccCCCCCCCCCCCceeccCCCCee-eCCCCCccCCCCCCcccCCCCC-CCCCCCC
Q psy9419         128 VRAACVCPAPYYAD-VND-GH---KCKSPCERFSCGINAQCTPADPPQC-TCLAGYTGEATLGCLDVDECLG-VSPCASS  200 (739)
Q Consensus       128 ~~~~C~C~~g~~g~-~~~-~~---~C~~~C~~~~C~~~~~C~~~~~~~C-~C~~Gy~g~~~~~C~~i~eC~~-~~~C~~~  200 (739)
                               +|+-. +.. ..   .|..+|       .+.|+...+-.| .|..||..+. ..|+|||||.. ..+|..+
T Consensus       187 ---------eyfes~Rne~~lvCt~Ch~~C-------~~~Csg~~~k~C~kCkkGW~lde-~gCvDvnEC~~ep~~c~~~  249 (350)
T KOG4260|consen  187 ---------EYFESSRNEQHLVCTACHEGC-------LGVCSGESSKGCSKCKKGWKLDE-EGCVDVNECQNEPAPCKAH  249 (350)
T ss_pred             ---------HHHHhhcccccchhhhhhhhh-------hcccCCCCCCChhhhcccceecc-cccccHHHHhcCCCCCChh
Confidence                     11110 000 00   111122       124443334466 7999999984 46999999986 5889999


Q ss_pred             CeeeccCCceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCCCCC-CCCC-CCCCeeeecCCceEEeCCCCCC
Q psy9419         201 ALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDECLG-VSPC-ASSALCVNEKGGFKCVCPKGTT  278 (739)
Q Consensus       201 ~~C~n~~g~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~eC~~-~~~C-~~~~~C~~~~g~~~C~C~~Gy~  278 (739)
                      ..|+|+.|||+|..++||.+.                         +|+|.. ...| ..+..|.++.++|+|+|..|+.
T Consensus       250 qfCvNteGSf~C~dk~Gy~~g-------------------------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  250 QFCVNTEGSFKCEDKEGYKKG-------------------------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             heeecCCCceEecccccccCC-------------------------hHHhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence            999999999999999999873                         233320 1222 1356899999999999999875


No 14 
>KOG4260|consensus
Probab=98.56  E-value=1.1e-07  Score=92.61  Aligned_cols=127  Identities=30%  Similarity=0.608  Sum_probs=88.6

Q ss_pred             CCCcccCCCCccCCCCccccCCCCC-------CC---CCCCCCCCCccccCCCCCCCCCCCCcee-ecCCCCCCCCCCCC
Q psy9419         431 ALDRCVCPPFYVGDPEFNCVPPVTM-------PV---CIPPCGPNAHCEYNSESPGSSPGSDNIC-VCNSGTHGNPYAGC  499 (739)
Q Consensus       431 ~~~~C~C~~g~~g~~~~~C~~~~~~-------~~---C~~~C~~~~~C~~~~~~~~~~~~~~~~C-~C~~Gy~g~~~~~C  499 (739)
                      +++.|.|.+||.|.....|....+.       ..   |...|.  +.|....         +-.| .|..||..+.. .|
T Consensus       166 GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~Csg~~---------~k~C~kCkkGW~lde~-gC  233 (350)
T KOG4260|consen  166 GSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL--GVCSGES---------SKGCSKCKKGWKLDEE-GC  233 (350)
T ss_pred             CCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--cccCCCC---------CCChhhhcccceeccc-cc
Confidence            5789999999999976666543211       11   222332  2444322         2234 78999987732 45


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCCCCCCccCCCCCC-CCC
Q psy9419         500 GTAGPQDRGSCDSGAGLCGPGAQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNPYVQCVDIDECWS-SNT  578 (739)
Q Consensus       500 ~~~~~~~~~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~~~~C~~id~C~~-~~~  578 (739)
                      .     ||+||...+.+|..+..|+|+.|||.|..++||.+.                            +|+|.. ...
T Consensus       234 v-----DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g----------------------------~d~C~~~~d~  280 (350)
T KOG4260|consen  234 V-----DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG----------------------------VDECQFCADV  280 (350)
T ss_pred             c-----cHHHHhcCCCCCChhheeecCCCceEecccccccCC----------------------------hHHhhhhhhh
Confidence            5     999999988999999999999999999999888752                            333310 122


Q ss_pred             C-CCCCEEeeCCCCeeeecCCCCCC
Q psy9419         579 C-GSNAVCINTPGSYDCRCKEGNAG  602 (739)
Q Consensus       579 C-~~~g~C~~~~g~~~C~C~~G~~g  602 (739)
                      | ..+..|.++.++|+|+|..|+.-
T Consensus       281 ~~~kn~~c~ni~~~~r~v~f~~~~~  305 (350)
T KOG4260|consen  281 CASKNRPCMNIDGQYRCVCFSGLII  305 (350)
T ss_pred             cccCCCCcccCCccEEEEeccccee
Confidence            2 34678999999999999999753


No 15 
>KOG1836|consensus
Probab=98.36  E-value=0.00036  Score=87.22  Aligned_cols=216  Identities=29%  Similarity=0.611  Sum_probs=113.0

Q ss_pred             ecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecC--CCceEEe-CCCCCccCCCcccCCCceeeeCCCCcccC
Q psy9419         486 VCNSGTHGNPYAGCGTAGPQDRGSCDSGAGLCGPGAQCLET--GGSVECQ-CPAGYKGNPYVQCVGGSVECQCPAGYKGN  562 (739)
Q Consensus       486 ~C~~Gy~g~~~~~C~~~~~~~~~~C~~~~~~C~~~g~C~~~--~g~~~C~-C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~  562 (739)
                      +|..||+|.+...       ....|..  .+|.+++.|...  .....|. |++||+|..|+.|.         .||.|+
T Consensus       760 ~C~~GfYg~~~~~-------~~~dC~~--C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~---------dgyfg~  821 (1705)
T KOG1836|consen  760 QCVDGFYGLPDLG-------TSGDCQP--CPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECA---------DGYFGN  821 (1705)
T ss_pred             hhcCCCCCccccC-------CCCCCcc--CCCCCChhhcCcCcccceecCCCCCCCcccccccCC---------CccccC
Confidence            5778888885321       1122766  677777777655  3567899 99999999988765         467766


Q ss_pred             CCCCCccCCCCC-----------CCCCCCCC-CE---EeeCCCCeee-ecCCCCCCCCCC-----cceeCccC-------
Q psy9419         563 PYVQCVDIDECW-----------SSNTCGSN-AV---CINTPGSYDC-RCKEGNAGNPFV-----ACTPVAVV-------  614 (739)
Q Consensus       563 ~~~~C~~id~C~-----------~~~~C~~~-g~---C~~~~g~~~C-~C~~G~~g~~~~-----~C~~~~~~-------  614 (739)
                      +...=.|+-.|.           ....|... +.   |+.......| .|++||.|++..     .|....+.       
T Consensus       822 p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~  901 (1705)
T KOG1836|consen  822 PLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELP  901 (1705)
T ss_pred             CCCCCCCcccCccceeccccCccccccccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccc
Confidence            532112222220           11233221 33   3433344456 699999998643     23322211       


Q ss_pred             CCCCCCC-CcccCCC------CCCCCCCceecCCcccCCCCCCCCCCC----CeecC--ceeeCCCCCccCCCCCC----
Q psy9419         615 PHSCEDP-ATCVCSK------NAPCPSGYVCKNSRCTDLCANVRCGPR----ALCVQ--GQCLCPSDLIGNPTDLT----  677 (739)
Q Consensus       615 ~~~C~~~-~~C~~~~------~c~C~~G~~c~~~~C~~~C~~~~C~~~----~~C~~--~~C~C~~Gy~G~~c~~~----  677 (739)
                      ...|... |.|.+..      -=.|..||.  +..=...|.+-.|+..    ..|..  ++|.|.+|-+|.+|+..    
T Consensus       902 ~~~c~~~tGQcec~~~v~g~~c~~c~~g~f--nl~s~~gC~~c~c~~~gs~~~~c~~~tGqc~c~~gVtgqrc~qc~~~~  979 (1705)
T KOG1836|consen  902 SLTCNPVTGQCECKPNVEGRDCLYCFKGFF--NLNSGVGCEPCNCDPTGSESSDCDVGTGQCYCRPGVTGQRCDQCETYH  979 (1705)
T ss_pred             cccCCCcccceeccCCCCcccccccccccc--ccCCCCCcccccccccccccccccccCCceeeecCccccccCccccCc
Confidence            1123221 2222111      124445554  2110123433344433    25553  79999999999999752    


Q ss_pred             -----CCCccCCCCCCCC----CCCC-CCceecCCCCCCCCCCCCCCCCCCccccc
Q psy9419         678 -----RGCQVKGQCANDL----ECKP-NEICFQEKGIEPTYPGLHSHTGAPCSSTV  723 (739)
Q Consensus       678 -----~~C~~~~~C~~~~----~C~~-~~~C~~~~g~~~c~~~~~C~~~~~C~~~~  723 (739)
                           ..|. ..+|+..+    +|.. .+.|.+..++..= -...|+.+......+
T Consensus       980 ~~~~~~gc~-~c~c~~~Gs~~~qc~~~~G~c~c~~~~~g~-~c~~c~~~~~~~~~~ 1033 (1705)
T KOG1836|consen  980 FGFQTEGCG-LCECDPLGSRGFQCDPEDGQCPCRPGFEGR-RCDQCEEGFFGNAQG 1033 (1705)
T ss_pred             ccccccCCc-ceecccCCcccceecccCCeeeecCCCCCc-ccccccCCccccccC
Confidence                 2232 12354444    5766 7778887776321 123355555444433


No 16 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.24  E-value=4.7e-07  Score=65.02  Aligned_cols=36  Identities=47%  Similarity=1.082  Sum_probs=33.5

Q ss_pred             ccccccCCCCCCCCCCceeeCCCCceeeCCCCCcCC
Q psy9419          67 DVDECAESRHLCGPGAVCINHPGSYTCQCPPNSSGD  102 (739)
Q Consensus        67 dideC~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~  102 (739)
                      |||||+..++.|..+++|+|+.|||+|.|++||+..
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence            799999988999989999999999999999999943


No 17 
>KOG1836|consensus
Probab=98.03  E-value=0.00016  Score=90.13  Aligned_cols=110  Identities=27%  Similarity=0.660  Sum_probs=67.0

Q ss_pred             ceecCCCCcccCCCCccCCCCccccCCCCCCC----CC-CCCCC----CCccccCCCCCCCCCCCCceeecCCCCCCCCC
Q psy9419         426 AQCDPALDRCVCPPFYVGDPEFNCVPPVTMPV----CI-PPCGP----NAHCEYNSESPGSSPGSDNICVCNSGTHGNPY  496 (739)
Q Consensus       426 ~~C~~~~~~C~C~~g~~g~~~~~C~~~~~~~~----C~-~~C~~----~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~  496 (739)
                      .+|++.+++|.|.+.-.|.....|....+.-.    |. -.|..    +..|..          .+.+|.|.+|-+|...
T Consensus       903 ~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~----------~tGqc~c~~gVtgqrc  972 (1705)
T KOG1836|consen  903 LTCNPVTGQCECKPNVEGRDCLYCFKGFFNLNSGVGCEPCNCDPTGSESSDCDV----------GTGQCYCRPGVTGQRC  972 (1705)
T ss_pred             ccCCCcccceeccCCCCccccccccccccccCCCCCcccccccccccccccccc----------cCCceeeecCcccccc
Confidence            45667788999998888876555555443222    21 12321    234443          5578999999999944


Q ss_pred             CCCCCCC-CCCCCCCCCCCCCCCCCC----eeecCCCceEEeCCCCCccCCCcccCCC
Q psy9419         497 AGCGTAG-PQDRGSCDSGAGLCGPGA----QCLETGGSVECQCPAGYKGNPYVQCVGG  549 (739)
Q Consensus       497 ~~C~~~~-~~~~~~C~~~~~~C~~~g----~C~~~~g~~~C~C~~Gy~g~~c~~C~~g  549 (739)
                      ..|+... -..+..|..  -.|...|    +|....|  +|.|++++.|..+.+|..+
T Consensus       973 ~qc~~~~~~~~~~gc~~--c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~c~~c~~~ 1026 (1705)
T KOG1836|consen  973 DQCETYHFGFQTEGCGL--CECDPLGSRGFQCDPEDG--QCPCRPGFEGRRCDQCEEG 1026 (1705)
T ss_pred             CccccCcccccccCCcc--eecccCCcccceecccCC--eeeecCCCCCcccccccCC
Confidence            4444210 011123333  4455555    6887666  8999999999888776654


No 18 
>KOG1226|consensus
Probab=97.89  E-value=9.8e-05  Score=83.01  Aligned_cols=99  Identities=26%  Similarity=0.573  Sum_probs=69.4

Q ss_pred             CCCCCCceeccCCCCeeeCCCCCc----cCCCCCCcccCCCCC--CCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCcc
Q psy9419         155 FSCGINAQCTPADPPQCTCLAGYT----GEATLGCLDVDECLG--VSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCV  228 (739)
Q Consensus       155 ~~C~~~~~C~~~~~~~C~C~~Gy~----g~~~~~C~~i~eC~~--~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C~  228 (739)
                      -+|+++|.|+=.   +|+|.+...    |..++ | |--.|..  ...|..+|+|.=.    +|+|.+||+|..+.  |.
T Consensus       514 ~vCSgrG~C~CG---qC~C~~~~~~~i~G~fCE-C-DnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~--C~  582 (783)
T KOG1226|consen  514 PVCSGRGDCVCG---QCVCHKPDNGKIYGKFCE-C-DNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACN--CP  582 (783)
T ss_pred             CCcCCCCcEeCC---ceEecCCCCCceeeeeee-c-cCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCC--CC
Confidence            378888888753   899988877    77765 2 2233443  3568889998754    69999999999843  22


Q ss_pred             CCCCCCcccccCCCCcccCCCCC--CCCCCCCCCeeeecCCceEEeCCCC-CCCCCCCc
Q psy9419         229 GSGSPRTECRVDKVGCLDVDECL--GVSPCASSALCVNEKGGFKCVCPKG-TTGDPYTL  284 (739)
Q Consensus       229 ~~~~~~~~c~~~~~~C~d~~eC~--~~~~C~~~~~C~~~~g~~~C~C~~G-y~g~~c~~  284 (739)
                                      .+.+.|.  ....|...|+|.=.    +|+|... |.|..|+.
T Consensus       583 ----------------~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~  621 (783)
T KOG1226|consen  583 ----------------LSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK  621 (783)
T ss_pred             ----------------CCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence                            4555564  23467778888644    6899877 99988763


No 19 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.87  E-value=8.6e-06  Score=58.47  Aligned_cols=35  Identities=37%  Similarity=0.884  Sum_probs=32.2

Q ss_pred             CCCCCCCCCCCCCCCCeeecCCCceEEeCCCCCcc
Q psy9419         506 DRGSCDSGAGLCGPGAQCLETGGSVECQCPAGYKG  540 (739)
Q Consensus       506 ~~~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g  540 (739)
                      ||+||....+.|..+++|+|+.|+|+|.|++||+.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~   35 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL   35 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence            58999998889999999999999999999999983


No 20 
>KOG1226|consensus
Probab=97.82  E-value=7.7e-05  Score=83.79  Aligned_cols=149  Identities=26%  Similarity=0.688  Sum_probs=100.3

Q ss_pred             CCCCCCCeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCCCCCCccCCCCC---CCCCCCCCCEEeeCCCC
Q psy9419         515 GLCGPGAQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNPYVQCVDIDECW---SSNTCGSNAVCINTPGS  591 (739)
Q Consensus       515 ~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~~~~C~~id~C~---~~~~C~~~g~C~~~~g~  591 (739)
                      ..|+.+|+..-.    +|.|.+||.|+          +|+|+..-....    ...+.|.   ...+|++.|.|+=.   
T Consensus       467 ~~C~g~G~~~CG----~C~C~~G~~G~----------~CEC~~~~~ss~----~~~~~Cr~~~~~~vCSgrG~C~CG---  525 (783)
T KOG1226|consen  467 ALCHGNGTFVCG----QCRCDEGWLGK----------KCECSTDELSSS----EEEDKCRENSDSPVCSGRGDCVCG---  525 (783)
T ss_pred             cccCCCCcEEec----ceecCCCCCCC----------cccCCccccCcH----hHHhhccCCCCCCCcCCCCcEeCC---
Confidence            556655655543    58999999987          456665544432    1134454   23489999999744   


Q ss_pred             eeeecCCCCC----CCCCCccee--Ccc---CCCCCCCCCcccCCCCCCCCCCceecCCcc-----cCCCCC---CCCCC
Q psy9419         592 YDCRCKEGNA----GNPFVACTP--VAV---VPHSCEDPATCVCSKNAPCPSGYVCKNSRC-----TDLCAN---VRCGP  654 (739)
Q Consensus       592 ~~C~C~~G~~----g~~~~~C~~--~~~---~~~~C~~~~~C~~~~~c~C~~G~~c~~~~C-----~~~C~~---~~C~~  654 (739)
                       +|+|.+...    |.   .|+-  ..+   ....|.++|+|.+. .|.|.+||+  |..|     .+.|.+   ..|+.
T Consensus       526 -qC~C~~~~~~~i~G~---fCECDnfsC~r~~g~lC~g~G~C~CG-~CvC~~Gwt--G~~C~C~~std~C~~~~G~iCSG  598 (783)
T KOG1226|consen  526 -QCVCHKPDNGKIYGK---FCECDNFSCERHKGVLCGGHGRCECG-RCVCNPGWT--GSACNCPLSTDTCESSDGQICSG  598 (783)
T ss_pred             -ceEecCCCCCceeee---eeeccCcccccccCcccCCCCeEeCC-cEEcCCCCc--cCCCCCCCCCccccCCCCceeCC
Confidence             699998887    44   3442  222   24578888888754 599999999  4444     355653   47999


Q ss_pred             CCeecCceeeCCCC-CccCCCCCCCCCccCCCCCCCCCCC
Q psy9419         655 RALCVQGQCLCPSD-LIGNPTDLTRGCQVKGQCANDLECK  693 (739)
Q Consensus       655 ~~~C~~~~C~C~~G-y~G~~c~~~~~C~~~~~C~~~~~C~  693 (739)
                      +|+|.=++|+|... |+|..||+-+.|.+.  |....+|+
T Consensus       599 rG~C~Cg~C~C~~~~~sG~~CE~cptc~~~--C~~~~~Cv  636 (783)
T KOG1226|consen  599 RGTCECGRCKCTDPPYSGEFCEKCPTCPDP--CAENKSCV  636 (783)
T ss_pred             CceeeCCceEcCCCCcCcchhhcCCCCCCc--ccccccch
Confidence            99999999999886 999999986666433  55544443


No 21 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.54  E-value=2.6e-05  Score=72.89  Aligned_cols=143  Identities=29%  Similarity=0.687  Sum_probs=87.1

Q ss_pred             CCCCCCeeeeCCCCeeEEecCCCCccCCCCCCcccccccC---CCCCCCCCCceeeCC-----CCceeeCCCCCcCCCCC
Q psy9419          34 QCPGGAECVNIAGGVSYCACPKGFRPKEDGYCEDVDECAE---SRHLCGPGAVCINHP-----GSYTCQCPPNSSGDPLL  105 (739)
Q Consensus        34 ~C~~~g~C~~~~~g~~~C~C~~Gy~g~~~~~C~dideC~~---~~~~C~~~~~C~n~~-----gsy~C~C~~Gy~g~~~~  105 (739)
                      .|.+ |.-+... +-|.|.|++||.......|+...+|..   ...+|...|+|++..     ..|.|.|.+||....  
T Consensus         7 ~CKN-G~LiQMS-NHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~--   82 (197)
T PF06247_consen    7 ICKN-GYLIQMS-NHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ--   82 (197)
T ss_dssp             --BT-EEEEEES-SEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS--
T ss_pred             cccC-CEEEEcc-CceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC--
Confidence            4443 6666655 469999999999987779998888865   235899999999875     459999999999763  


Q ss_pred             CCccCCCCCCCCCCCCCCCCCccCccccCCCCccccCCCCCCccCCCCCCCCCCCceeccC----CCCeeeCCCCCccCC
Q psy9419         106 GCTHARVQCSRDADCDGPYERCVRAACVCPAPYYADVNDGHKCKSPCERFSCGINAQCTPA----DPPQCTCLAGYTGEA  181 (739)
Q Consensus       106 ~C~~~~~~C~~~~~C~~~~~~C~~~~C~C~~g~~g~~~~~~~C~~~C~~~~C~~~~~C~~~----~~~~C~C~~Gy~g~~  181 (739)
                                         +.|+.                    +.|....|+ .|.|+..    ....|+|.-|+..+.
T Consensus        83 -------------------~vCvp--------------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d  122 (197)
T PF06247_consen   83 -------------------GVCVP--------------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD  122 (197)
T ss_dssp             -------------------SSEEE--------------------GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTT
T ss_pred             -------------------CeEch--------------------hhcCceecC-CCeEEecCCCCCCceeEeeeceEecc
Confidence                               11111                    112234455 6788742    244899999998433


Q ss_pred             CCCCccc--CCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCC
Q psy9419         182 TLGCLDV--DECLGVSPCASSALCVNEKGGFKCVCPKGTTGDP  222 (739)
Q Consensus       182 ~~~C~~i--~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~  222 (739)
                      ...|...  -+|+  -.|..+.+|....+-|+|.+.+||.++.
T Consensus       123 n~kCtk~G~T~C~--LKCk~nE~CK~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  123 NKKCTKTGETKCS--LKCKENEECKLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             TTESEEEE----------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred             CCcccCCCcccee--eecCCCcceeeeCcEEEeecCCCCCCCC
Confidence            3334322  2344  4567788999999999999999998765


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.47  E-value=4.2e-05  Score=52.47  Aligned_cols=33  Identities=48%  Similarity=1.010  Sum_probs=26.4

Q ss_pred             ccCCCCCCCCCCceeeCCCCceeeCCCCCcCCC
Q psy9419          71 CAESRHLCGPGAVCINHPGSYTCQCPPNSSGDP  103 (739)
Q Consensus        71 C~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~  103 (739)
                      |+.+++.|+.+|+|+++.++|.|+|++||+|+.
T Consensus         1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             TTTGGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            344567899999999999999999999999986


No 23 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.41  E-value=0.00021  Score=50.20  Aligned_cols=35  Identities=51%  Similarity=1.085  Sum_probs=29.8

Q ss_pred             cCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCC-CC
Q psy9419         569 DIDECWSSNTCGSNAVCINTPGSYDCRCKEGNA-GN  603 (739)
Q Consensus       569 ~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~-g~  603 (739)
                      ++++|....+|.++++|+++.++|+|.|++||+ |.
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~   36 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR   36 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence            467884327899999999999999999999999 65


No 24 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.40  E-value=0.00012  Score=44.89  Aligned_cols=23  Identities=48%  Similarity=1.120  Sum_probs=20.0

Q ss_pred             eeEEecCCCCccCCCC-CCccccc
Q psy9419          48 VSYCACPKGFRPKEDG-YCEDVDE   70 (739)
Q Consensus        48 ~~~C~C~~Gy~g~~~~-~C~dide   70 (739)
                      +|+|+|++||+...++ .|+||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            5899999999987665 8999987


No 25 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.38  E-value=8.7e-05  Score=49.66  Aligned_cols=29  Identities=45%  Similarity=1.090  Sum_probs=26.5

Q ss_pred             CCCCCCCCCEEeeCC-CCeeeecCCCCCCC
Q psy9419         575 SSNTCGSNAVCINTP-GSYDCRCKEGNAGN  603 (739)
Q Consensus       575 ~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~  603 (739)
                      .+++|+++|+|++.. ++|+|+|++||+|.
T Consensus         2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            456999999999998 99999999999996


No 26 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.33  E-value=0.00012  Score=48.94  Aligned_cols=30  Identities=40%  Similarity=1.157  Sum_probs=26.8

Q ss_pred             CCCCCCCCCCCeeeccC-CceEeeCCCCCcCC
Q psy9419         191 CLGVSPCASSALCVNEK-GGFKCVCPKGTTGD  221 (739)
Q Consensus       191 C~~~~~C~~~~~C~n~~-g~~~C~C~~Gy~g~  221 (739)
                      |.+ ++|.++|+|++.. ++|+|+|++||+|.
T Consensus         1 C~~-~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSS-NPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTT-TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCC-CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            344 7999999999999 99999999999985


No 27 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.24  E-value=0.00047  Score=48.38  Aligned_cols=37  Identities=43%  Similarity=1.041  Sum_probs=30.6

Q ss_pred             cCCCCCCCCCCCCCCeeeecCCceEEeCCCCCC-CCCC
Q psy9419         246 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTT-GDPY  282 (739)
Q Consensus       246 d~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~-g~~c  282 (739)
                      ++++|....+|.++++|++..++|.|.|++||. |..|
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence            467787327899889999999999999999998 6543


No 28 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.16  E-value=0.00018  Score=49.35  Aligned_cols=30  Identities=47%  Similarity=1.067  Sum_probs=23.9

Q ss_pred             CCCCCCCCCeeecCCCceEEeCCCCCccCC
Q psy9419         513 GAGLCGPGAQCLETGGSVECQCPAGYKGNP  542 (739)
Q Consensus       513 ~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~  542 (739)
                      .+..|+.+|+|+++.++|+|+|++||+|++
T Consensus         4 ~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    4 NNGGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             GGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            346789999999999999999999999974


No 29 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.02  E-value=0.00063  Score=41.76  Aligned_cols=24  Identities=42%  Similarity=0.969  Sum_probs=19.7

Q ss_pred             ceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCC
Q psy9419         209 GFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDE  249 (739)
Q Consensus       209 ~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~e  249 (739)
                      +|+|+|++||+...                 ++..|+||||
T Consensus         1 sy~C~C~~Gy~l~~-----------------d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSP-----------------DGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCC-----------------CCCccccCCC
Confidence            68999999999865                 4466789987


No 30 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.84  E-value=0.0016  Score=45.11  Aligned_cols=34  Identities=50%  Similarity=1.066  Sum_probs=29.1

Q ss_pred             CCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419         570 IDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGN  603 (739)
Q Consensus       570 id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~  603 (739)
                      +++|....+|.++++|++..++|+|.|++||.|.
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~   35 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR   35 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence            5677322689889999999999999999999996


No 31 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.61  E-value=0.0031  Score=43.62  Aligned_cols=36  Identities=42%  Similarity=1.035  Sum_probs=29.7

Q ss_pred             CCCCCCCCCCCCCCeeeecCCceEEeCCCCCCCCCC
Q psy9419         247 VDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPY  282 (739)
Q Consensus       247 ~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c  282 (739)
                      +++|....+|.+++.|++..++|.|.|++||.|..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            567762268988899999999999999999998654


No 32 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.31  E-value=0.0025  Score=59.94  Aligned_cols=95  Identities=23%  Similarity=0.609  Sum_probs=65.7

Q ss_pred             CCCCCceeecCCCCeecCCeeeccCCCCCCCCCCCCeeeeCCCC--eeEEecCCCCccCCCCCCc--ccccccCCCCCCC
Q psy9419           4 NQCNTLECQCRPPYQIVAGECTLATCGTQGQCPGGAECVNIAGG--VSYCACPKGFRPKEDGYCE--DVDECAESRHLCG   79 (739)
Q Consensus         4 ~~~~~~~C~C~~Gy~g~~~~C~~~~C~~~~~C~~~g~C~~~~~g--~~~C~C~~Gy~g~~~~~C~--dideC~~~~~~C~   79 (739)
                      ++...|+|.|.+||......|..+.|.. ..|. .|.|+-.+..  ...|+|.-|+...+...|.  .-.+|++   .|.
T Consensus        65 ~~~~~~~C~C~~gY~~~~~vCvp~~C~~-~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~L---KCk  139 (197)
T PF06247_consen   65 GEERAYKCDCINGYILKQGVCVPNKCNN-KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSL---KCK  139 (197)
T ss_dssp             TSSTSEEEEE-TTEEESSSSEEEGGGSS----T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE-----------T
T ss_pred             ccceeEEEecccCceeeCCeEchhhcCc-eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceee---ecC
Confidence            5678899999999999999999999987 7897 5999853321  3589999999955445786  2346765   688


Q ss_pred             CCCceeeCCCCceeeCCCCCcCCC
Q psy9419          80 PGAVCINHPGSYTCQCPPNSSGDP  103 (739)
Q Consensus        80 ~~~~C~n~~gsy~C~C~~Gy~g~~  103 (739)
                      .+.+|..+.+-|+|++.+||.++.
T Consensus       140 ~nE~CK~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  140 ENEECKLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             TTEEEEEETTEEEEEE-TT-EEET
T ss_pred             CCcceeeeCcEEEeecCCCCCCCC
Confidence            888999999999999999998764


No 33 
>KOG1218|consensus
Probab=96.01  E-value=1.1  Score=47.70  Aligned_cols=84  Identities=29%  Similarity=0.704  Sum_probs=48.9

Q ss_pred             cccCCCCccCCCCccccC-CCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9419         434 RCVCPPFYVGDPEFNCVP-PVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCGTAGPQDRGSCDS  512 (739)
Q Consensus       434 ~C~C~~g~~g~~~~~C~~-~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~~~~~~~~~~C~~  512 (739)
                      .|.+..+|.+.   .|.. ......|...|.....+..          ....|.|.+||.|.   .+..    ....|..
T Consensus       125 ~c~~~~~~~~~---~C~~~~~~g~~C~~~c~~~~~~~~----------~~~~c~c~~g~~g~---~~~~----~~~~c~~  184 (316)
T KOG1218|consen  125 ECRCGGGYIGE---QCGEENLVGLKCQRDCQCTGGCDC----------KNGICTCQPGFVGV---FCVE----SCSGCSP  184 (316)
T ss_pred             ceecCCcCccc---cccccCCCCCCccCCCCCccccCC----------CCCceeccCCcccc---cccc----cCCCcCC
Confidence            46667777766   3443 3344556655532222222          33579999999999   6651    1111443


Q ss_pred             CCCCCCCCCeeecCCCceEEeCCCCCcc
Q psy9419         513 GAGLCGPGAQCLETGGSVECQCPAGYKG  540 (739)
Q Consensus       513 ~~~~C~~~g~C~~~~g~~~C~C~~Gy~g  540 (739)
                       ...|.+++.|+...+  .+.+.+++.+
T Consensus       185 -~~~~~~g~~C~~~~~--~~~~~~~~~~  209 (316)
T KOG1218|consen  185 -LTACENGAKCNRSTG--SCLCYPGPSG  209 (316)
T ss_pred             -CcccCCCCeeecccc--ccccCCCCcc
Confidence             255667778888766  5666666543


No 34 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.91  E-value=0.0035  Score=42.94  Aligned_cols=28  Identities=39%  Similarity=0.993  Sum_probs=21.9

Q ss_pred             CCCCCCCCCceeeCCCCceeeCCCCCcCCC
Q psy9419          74 SRHLCGPGAVCINHPGSYTCQCPPNSSGDP  103 (739)
Q Consensus        74 ~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~  103 (739)
                      ++..|+  .+|++++++|+|.|++||++..
T Consensus         4 ~NGgC~--h~C~~~~g~~~C~C~~Gy~L~~   31 (36)
T PF14670_consen    4 NNGGCS--HICVNTPGSYRCSCPPGYKLAE   31 (36)
T ss_dssp             GGGGSS--SEEEEETTSEEEE-STTEEE-T
T ss_pred             CCCCcC--CCCccCCCceEeECCCCCEECc
Confidence            345676  5899999999999999999874


No 35 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.89  E-value=0.012  Score=39.96  Aligned_cols=28  Identities=50%  Similarity=1.178  Sum_probs=25.7

Q ss_pred             CCCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419         576 SNTCGSNAVCINTPGSYDCRCKEGNAGN  603 (739)
Q Consensus       576 ~~~C~~~g~C~~~~g~~~C~C~~G~~g~  603 (739)
                      ..+|.++++|+++.+.|+|.|++||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            5678889999999999999999999987


No 36 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.79  E-value=0.011  Score=40.10  Aligned_cols=28  Identities=50%  Similarity=1.271  Sum_probs=25.5

Q ss_pred             CCCCCCCCceeeCCCCceeeCCCCCcCC
Q psy9419          75 RHLCGPGAVCINHPGSYTCQCPPNSSGD  102 (739)
Q Consensus        75 ~~~C~~~~~C~n~~gsy~C~C~~Gy~g~  102 (739)
                      ..+|.++++|+++.++|+|.|++||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            3689888999999999999999999986


No 37 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.79  E-value=0.014  Score=39.72  Aligned_cols=26  Identities=50%  Similarity=1.229  Sum_probs=23.6

Q ss_pred             CCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419         577 NTCGSNAVCINTPGSYDCRCKEGNAGN  603 (739)
Q Consensus       577 ~~C~~~g~C~~~~g~~~C~C~~G~~g~  603 (739)
                      .+|.++ +|+++.++|+|.|++||+|.
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccC
Confidence            578887 99999999999999999993


No 38 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.44  E-value=0.016  Score=39.39  Aligned_cols=26  Identities=62%  Similarity=1.399  Sum_probs=23.6

Q ss_pred             CCCCCCCceeeCCCCceeeCCCCCcCC
Q psy9419          76 HLCGPGAVCINHPGSYTCQCPPNSSGD  102 (739)
Q Consensus        76 ~~C~~~~~C~n~~gsy~C~C~~Gy~g~  102 (739)
                      .+|.++ +|+++.++|+|.|++||+|.
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccC
Confidence            579888 99999999999999999983


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.31  E-value=0.016  Score=38.58  Aligned_cols=25  Identities=32%  Similarity=0.752  Sum_probs=21.6

Q ss_pred             CCCCCCCeec--CceeeCCCCCccCCC
Q psy9419         650 VRCGPRALCV--QGQCLCPSDLIGNPT  674 (739)
Q Consensus       650 ~~C~~~~~C~--~~~C~C~~Gy~G~~c  674 (739)
                      .+|+++++|+  .++|+|++||+|..|
T Consensus         6 ~~C~~~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCCCEEECCCCCcCCCC
Confidence            3689999999  589999999999765


No 40 
>KOG1218|consensus
Probab=95.21  E-value=0.9  Score=48.30  Aligned_cols=49  Identities=22%  Similarity=0.509  Sum_probs=27.7

Q ss_pred             CCCCcccCCCCccCCCCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCCCCCCC
Q psy9419         430 PALDRCVCPPFYVGDPEFNCVPPVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNSGTHGN  494 (739)
Q Consensus       430 ~~~~~C~C~~g~~g~~~~~C~~~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~  494 (739)
                      .....|.|.++|+|.  ..+........+...+..      ..        ....|.+..+|.+.
T Consensus        12 ~~~~~c~c~~~~~g~--~~~~~~~~~~~~~~~~~~------~~--------~~~~~~~~~~~~~~   60 (316)
T KOG1218|consen   12 GGSGQCFCDPGYTGR--LQCEHQAVTSACSGICPC------EV--------NSGECGLGYGFVGS   60 (316)
T ss_pred             CCCCceecCCCcccc--ccccCCCCCccccccCCc------cC--------CceeEecccccCCC
Confidence            457789999999996  122211111112222211      11        45678899999888


No 41 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.91  E-value=0.032  Score=37.18  Aligned_cols=25  Identities=32%  Similarity=0.883  Sum_probs=21.6

Q ss_pred             CCCCCCCeeeeCCCCeeEEecCCCCccC
Q psy9419          33 GQCPGGAECVNIAGGVSYCACPKGFRPK   60 (739)
Q Consensus        33 ~~C~~~g~C~~~~~g~~~C~C~~Gy~g~   60 (739)
                      ..|+++|+|+...+   +|+|++||+|.
T Consensus         6 ~~C~~~G~C~~~~g---~C~C~~g~~G~   30 (32)
T PF07974_consen    6 NICSGHGTCVSPCG---RCVCDSGYTGP   30 (32)
T ss_pred             CccCCCCEEeCCCC---EEECCCCCcCC
Confidence            47999999998643   89999999997


No 42 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.04  E-value=0.021  Score=29.69  Aligned_cols=13  Identities=31%  Similarity=0.833  Sum_probs=9.9

Q ss_pred             eeeCCCCCccCCC
Q psy9419         662 QCLCPSDLIGNPT  674 (739)
Q Consensus       662 ~C~C~~Gy~G~~c  674 (739)
                      +|+|++||+|..|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5889999999765


No 43 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.82  E-value=0.078  Score=36.36  Aligned_cols=24  Identities=33%  Similarity=0.826  Sum_probs=19.3

Q ss_pred             CeeeecCCceEEeCCCCCCCCCCC
Q psy9419         260 ALCVNEKGGFKCVCPKGTTGDPYT  283 (739)
Q Consensus       260 ~~C~~~~g~~~C~C~~Gy~g~~c~  283 (739)
                      .+|++.+++|+|.|++||+.....
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L~~D~   33 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKLAEDG   33 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE-TTS
T ss_pred             CCCccCCCceEeECCCCCEECcCC
Confidence            489999999999999999987644


No 44 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.63  E-value=0.16  Score=51.27  Aligned_cols=38  Identities=39%  Similarity=0.800  Sum_probs=34.0

Q ss_pred             CCcccccccCCCCCCCCCCceeeCCCCceeeCCCCCcCCC
Q psy9419          64 YCEDVDECAESRHLCGPGAVCINHPGSYTCQCPPNSSGDP  103 (739)
Q Consensus        64 ~C~dideC~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~  103 (739)
                      .|++++||...++.|.  ..|+++.|+|.|.|++||++..
T Consensus       183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE  220 (224)
T ss_pred             cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence            7999999998778897  4799999999999999999863


No 45 
>smart00051 DSL delta serrate ligand.
Probab=91.25  E-value=0.22  Score=39.05  Aligned_cols=21  Identities=19%  Similarity=0.308  Sum_probs=12.8

Q ss_pred             CCCeecC-ceeeCCCCCccCCC
Q psy9419         654 PRALCVQ-GQCLCPSDLIGNPT  674 (739)
Q Consensus       654 ~~~~C~~-~~C~C~~Gy~G~~c  674 (739)
                      .+.+|.. +.++|.+||+|..|
T Consensus        42 ~~~~Cd~~G~~~C~~Gw~G~~C   63 (63)
T smart00051       42 GHYTCDENGNKGCLEGWMGPYC   63 (63)
T ss_pred             CCccCCcCCCEecCCCCcCCCC
Confidence            3444443 56777777777654


No 46 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=87.01  E-value=1.2  Score=33.39  Aligned_cols=43  Identities=37%  Similarity=0.947  Sum_probs=34.4

Q ss_pred             cCCCCeecCCeee-----ccCCCCCCCCCCCCeeeeCCCCeeEEecCCCCccC
Q psy9419          13 CRPPYQIVAGECT-----LATCGTQGQCPGGAECVNIAGGVSYCACPKGFRPK   60 (739)
Q Consensus        13 C~~Gy~g~~~~C~-----~~~C~~~~~C~~~g~C~~~~~g~~~C~C~~Gy~g~   60 (739)
                      |++||......|.     ...|.....|..++.|++   |  +|.|++||...
T Consensus         1 C~~~~~~~~~~C~~~~~~g~~C~~~~qC~~~s~C~~---g--~C~C~~g~~~~   48 (52)
T PF01683_consen    1 CPSGQVAINGQCVPRVQPGESCESDEQCIGGSVCVN---G--RCQCPPGYVEV   48 (52)
T ss_pred             CCCCCEEECCEECccCCCCCCCCCcCCCCCcCEEcC---C--EeECCCCCEec
Confidence            6788888777786     456887788988999976   3  79999999864


No 47 
>smart00051 DSL delta serrate ligand.
Probab=84.73  E-value=1.1  Score=35.05  Aligned_cols=43  Identities=14%  Similarity=0.221  Sum_probs=35.8

Q ss_pred             ceeeCCCCCccCCCCCCCCCccCCCCCCCCCCCCCCceecCCCCC
Q psy9419         661 GQCLCPSDLIGNPTDLTRGCQVKGQCANDLECKPNEICFQEKGIE  705 (739)
Q Consensus       661 ~~C~C~~Gy~G~~c~~~~~C~~~~~C~~~~~C~~~~~C~~~~g~~  705 (739)
                      +.=.|+++|.|..|+  ..|..++....+..|...+.+.|.+||.
T Consensus        17 ~rv~C~~~~yG~~C~--~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~   59 (63)
T smart00051       17 IRVTCDENYYGEGCN--KFCRPRDDFFGHYTCDENGNKGCLEGWM   59 (63)
T ss_pred             EEeeCCCCCcCCccC--CEeCcCccccCCccCCcCCCEecCCCCc
Confidence            344688999999997  5687777778889999888899999995


No 48 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=82.55  E-value=1.9  Score=32.18  Aligned_cols=22  Identities=32%  Similarity=0.945  Sum_probs=18.1

Q ss_pred             CCCCCCCeecCceeeCCCCCcc
Q psy9419         650 VRCGPRALCVQGQCLCPSDLIG  671 (739)
Q Consensus       650 ~~C~~~~~C~~~~C~C~~Gy~G  671 (739)
                      ..|..++.|++.+|+|++||+-
T Consensus        26 ~qC~~~s~C~~g~C~C~~g~~~   47 (52)
T PF01683_consen   26 EQCIGGSVCVNGRCQCPPGYVE   47 (52)
T ss_pred             CCCCCcCEEcCCEeECCCCCEe
Confidence            3577888998999999999864


No 49 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=82.51  E-value=1.1  Score=30.71  Aligned_cols=31  Identities=29%  Similarity=0.787  Sum_probs=22.6

Q ss_pred             CCCCCCCCCCCeeeeCCCCeeEEecCCCCccC
Q psy9419          29 CGTQGQCPGGAECVNIAGGVSYCACPKGFRPK   60 (739)
Q Consensus        29 C~~~~~C~~~g~C~~~~~g~~~C~C~~Gy~g~   60 (739)
                      |.. ..|+.|+.|++...|++.|.|..||...
T Consensus         2 C~~-~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    2 CID-TKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             -SS-S---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             ccC-ccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            444 6788899999988789999999999875


No 50 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=82.20  E-value=66  Score=38.79  Aligned_cols=15  Identities=20%  Similarity=0.401  Sum_probs=11.2

Q ss_pred             eEEeCCCCCCCCCCC
Q psy9419         269 FKCVCPKGTTGDPYT  283 (739)
Q Consensus       269 ~~C~C~~Gy~g~~c~  283 (739)
                      -+|+|..||..+...
T Consensus       682 ~~C~C~~g~~p~~~~  696 (800)
T PTZ00214        682 RRCWCERGFLPALDR  696 (800)
T ss_pred             ceeEecCCcccccCC
Confidence            479999999865543


No 51 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=80.61  E-value=2.1  Score=31.81  Aligned_cols=31  Identities=35%  Similarity=0.844  Sum_probs=22.8

Q ss_pred             eeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCC
Q psy9419         522 QCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNP  563 (739)
Q Consensus       522 ~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~  563 (739)
                      .|....|  +|.|+++|+|..+++|         .+||++.+
T Consensus        13 ~C~~~~G--~C~C~~~~~G~~C~~C---------~~g~~~~~   43 (50)
T cd00055          13 QCDPGTG--QCECKPNTTGRRCDRC---------APGYYGLP   43 (50)
T ss_pred             cccCCCC--EEeCCCcCCCCCCCCC---------CCCCccCC
Confidence            3655444  8999999999988764         56777754


No 52 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=80.46  E-value=1.6  Score=43.92  Aligned_cols=37  Identities=24%  Similarity=0.637  Sum_probs=30.5

Q ss_pred             CCcccCCCCC-CCCCCCCCeeeccCCceEeeCCCCCcCCC
Q psy9419         184 GCLDVDECLG-VSPCASSALCVNEKGGFKCVCPKGTTGDP  222 (739)
Q Consensus       184 ~C~~i~eC~~-~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~  222 (739)
                      .|.++++|.. .++|.  ..|.++.|+|.|.|++||++..
T Consensus       183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE  220 (224)
T ss_pred             cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence            4788899975 35675  4899999999999999999754


No 53 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=80.31  E-value=1.2  Score=32.94  Aligned_cols=31  Identities=35%  Similarity=0.757  Sum_probs=22.4

Q ss_pred             CeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccC
Q psy9419         521 AQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGN  562 (739)
Q Consensus       521 g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~  562 (739)
                      .+|....|  +|.|+++|+|..|++|.         +||++.
T Consensus        11 ~~C~~~~G--~C~C~~~~~G~~C~~C~---------~g~~~~   41 (49)
T PF00053_consen   11 QTCDPSTG--QCVCKPGTTGPRCDQCK---------PGYFGL   41 (49)
T ss_dssp             SSEEETCE--EESBSTTEESTTS-EE----------TTEECS
T ss_pred             CcccCCCC--EEeccccccCCcCcCCC---------Cccccc
Confidence            46777554  89999999999988654         467765


No 54 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=79.22  E-value=1.1  Score=30.71  Aligned_cols=28  Identities=36%  Similarity=0.694  Sum_probs=20.6

Q ss_pred             CCCCCCCCEEeeCC-CCeeeecCCCCCCC
Q psy9419         576 SNTCGSNAVCINTP-GSYDCRCKEGNAGN  603 (739)
Q Consensus       576 ~~~C~~~g~C~~~~-g~~~C~C~~G~~g~  603 (739)
                      ...|..|+.|++.. |+++|+|.+||..+
T Consensus         4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             SS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             CccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            35677899999875 99999999999865


No 55 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=73.89  E-value=3.9  Score=29.78  Aligned_cols=25  Identities=24%  Similarity=0.695  Sum_probs=18.4

Q ss_pred             eeecCCCceEEeCCCCCccCCCcccCC
Q psy9419         522 QCLETGGSVECQCPAGYKGNPYVQCVG  548 (739)
Q Consensus       522 ~C~~~~g~~~C~C~~Gy~g~~c~~C~~  548 (739)
                      .|....|  +|.|+++|+|..+++|.+
T Consensus        12 ~C~~~~G--~C~C~~~~~G~~C~~C~~   36 (46)
T smart00180       12 TCDPDTG--QCECKPNVTGRRCDRCAP   36 (46)
T ss_pred             cccCCCC--EEECCCCCCCCCCCcCCC
Confidence            4555444  899999999988876543


No 56 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=70.13  E-value=4.6  Score=27.05  Aligned_cols=23  Identities=30%  Similarity=0.715  Sum_probs=16.7

Q ss_pred             CeeeeCCCCeeEEecCCCCccCCCC
Q psy9419          39 AECVNIAGGVSYCACPKGFRPKEDG   63 (739)
Q Consensus        39 g~C~~~~~g~~~C~C~~Gy~g~~~~   63 (739)
                      +.|.....  +.|.|++||..+...
T Consensus        10 A~CDpn~~--~~C~CPeGyIlde~~   32 (34)
T PF09064_consen   10 ADCDPNSP--GQCFCPEGYILDEGS   32 (34)
T ss_pred             CccCCCCC--CceeCCCceEecCCc
Confidence            46776543  389999999987543


No 57 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=68.57  E-value=5.4  Score=35.12  Aligned_cols=34  Identities=29%  Similarity=0.615  Sum_probs=26.8

Q ss_pred             cCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419         569 DIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGN  603 (739)
Q Consensus       569 ~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~  603 (739)
                      ..|.|.....|+.+|.|.. ..+..|.|++||+..
T Consensus        76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence            3567866789999999964 456689999999864


No 58 
>KOG3512|consensus
Probab=64.83  E-value=14  Score=40.37  Aligned_cols=28  Identities=25%  Similarity=0.545  Sum_probs=21.8

Q ss_pred             CCCceecCCCCcccCCCCccCCCCcccc
Q psy9419         423 GAGAQCDPALDRCVCPPFYVGDPEFNCV  450 (739)
Q Consensus       423 ~~~~~C~~~~~~C~C~~g~~g~~~~~C~  450 (739)
                      ..+.+|+..+++|.|.+|.+|.....|.
T Consensus       404 s~gktCNq~tGqCpCkeGvtG~tCnrCa  431 (592)
T KOG3512|consen  404 SAGKTCNQTTGQCPCKEGVTGLTCNRCA  431 (592)
T ss_pred             cccccccccCCcccCCCCCccccccccc
Confidence            4567788889999999999998544444


No 59 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=62.18  E-value=30  Score=38.17  Aligned_cols=52  Identities=25%  Similarity=0.620  Sum_probs=30.8

Q ss_pred             ecCCCCccCCCC-CCcccccccCCCCCCCCCCceeeCCCCcee-eCCCCCcCCCCCCCcc
Q psy9419          52 ACPKGFRPKEDG-YCEDVDECAESRHLCGPGAVCINHPGSYTC-QCPPNSSGDPLLGCTH  109 (739)
Q Consensus        52 ~C~~Gy~g~~~~-~C~dideC~~~~~~C~~~~~C~n~~gsy~C-~C~~Gy~g~~~~~C~~  109 (739)
                      +|.+||....+. .|....+|..  ..|.   +|.+... -.| .|..+|.+.+.+.|..
T Consensus         3 ~C~~gy~~~~~~t~C~~~~~C~~--~~C~---~Cs~~~~-~~Ct~C~~~~~lt~t~~Ci~   56 (397)
T PF03302_consen    3 ECTSGYKLSTDKTSCVSASECKT--PNCK---TCSNDKK-EVCTECNSGYYLTPTNQCIE   56 (397)
T ss_pred             cccCCceECCCCCcccccCCCCC--CCCc---cccCCCC-CccCcCCCCCcCCCCCcccc
Confidence            477788876553 6776666765  3453   4554433 245 5888887765443443


No 60 
>PHA02887 EGF-like protein; Provisional
Probab=57.60  E-value=8.2  Score=33.75  Aligned_cols=26  Identities=23%  Similarity=0.354  Sum_probs=19.4

Q ss_pred             CcccccC--CCCceeecCCCceeCCCCc
Q psy9419         325 HALCEPQ--DHRASCRCELGYTEGLNGK  350 (739)
Q Consensus       325 ~~~C~~~--~g~~~C~C~~G~~g~~~~~  350 (739)
                      ||+|...  ...+.|.|+.||+|.+|++
T Consensus        96 HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         96 NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             CCEEEccccCCCceeECCCCcccCCCCc
Confidence            4566433  3467999999999998875


No 61 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=57.23  E-value=13  Score=27.55  Aligned_cols=19  Identities=42%  Similarity=0.978  Sum_probs=16.0

Q ss_pred             eecCCCCcccCCCCccCCC
Q psy9419         427 QCDPALDRCVCPPFYVGDP  445 (739)
Q Consensus       427 ~C~~~~~~C~C~~g~~g~~  445 (739)
                      .|+..+++|.|+++|+|..
T Consensus        13 ~C~~~~G~C~C~~~~~G~~   31 (50)
T cd00055          13 QCDPGTGQCECKPNTTGRR   31 (50)
T ss_pred             cccCCCCEEeCCCcCCCCC
Confidence            4667789999999999974


No 62 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=54.23  E-value=7.3  Score=28.62  Aligned_cols=20  Identities=40%  Similarity=0.938  Sum_probs=17.4

Q ss_pred             ceecCCCCcccCCCCccCCC
Q psy9419         426 AQCDPALDRCVCPPFYVGDP  445 (739)
Q Consensus       426 ~~C~~~~~~C~C~~g~~g~~  445 (739)
                      .+|++.+++|+|+++|+|..
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~   30 (49)
T PF00053_consen   11 QTCDPSTGQCVCKPGTTGPR   30 (49)
T ss_dssp             SSEEETCEEESBSTTEESTT
T ss_pred             CcccCCCCEEeccccccCCc
Confidence            47778899999999999984


No 63 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=51.92  E-value=4.2e+02  Score=32.18  Aligned_cols=13  Identities=15%  Similarity=0.268  Sum_probs=9.9

Q ss_pred             ceEEeCCCCCCCC
Q psy9419         268 GFKCVCPKGTTGD  280 (739)
Q Consensus       268 ~~~C~C~~Gy~g~  280 (739)
                      ...|+|..||...
T Consensus       750 ~~vC~C~~g~~l~  762 (800)
T PTZ00214        750 QGVCMCELDAVLT  762 (800)
T ss_pred             CCeEEeCCcceec
Confidence            3489999999754


No 64 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=49.33  E-value=16  Score=32.58  Aligned_cols=39  Identities=31%  Similarity=0.646  Sum_probs=28.5

Q ss_pred             CCCCC--CCCCCCCCCEEeeC--CCCeeeecCCCCCCCCCCcceeCc
Q psy9419         570 IDECW--SSNTCGSNAVCINT--PGSYDCRCKEGNAGNPFVACTPVA  612 (739)
Q Consensus       570 id~C~--~~~~C~~~g~C~~~--~g~~~C~C~~G~~g~~~~~C~~~~  612 (739)
                      +.+|.  ..+-|.+ |+|.-.  ...+.|+|..||+|.   +|+..+
T Consensus        42 i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh~d   84 (139)
T PHA03099         42 IRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQHVV   84 (139)
T ss_pred             cccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc---ccccee
Confidence            34453  3567875 489854  478899999999999   787654


No 65 
>PHA02887 EGF-like protein; Provisional
Probab=48.53  E-value=14  Score=32.27  Aligned_cols=29  Identities=31%  Similarity=0.773  Sum_probs=22.8

Q ss_pred             CCCCCccccCCCCCCCCCCCCceeecCCCCCCCCCCCCC
Q psy9419         462 CGPNAHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCG  500 (739)
Q Consensus       462 C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~  500 (739)
                      |. ||+|....+      ...+.|.|++||+|.   +|+
T Consensus        94 Ci-HG~C~yI~d------L~epsCrC~~GYtG~---RCE  122 (126)
T PHA02887         94 CI-NGECMNIID------LDEKFCICNKGYTGI---RCD  122 (126)
T ss_pred             ee-CCEEEcccc------CCCceeECCCCcccC---CCC
Confidence            44 678877663      256899999999999   887


No 66 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=48.37  E-value=17  Score=31.97  Aligned_cols=33  Identities=24%  Similarity=0.539  Sum_probs=25.9

Q ss_pred             ccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcC
Q psy9419         187 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTTG  220 (739)
Q Consensus       187 ~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g  220 (739)
                      ..+.|.....|+..+.|.. ..+..|.|.+||.-
T Consensus        76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP  108 (110)
T ss_pred             cccCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence            3567887688999999954 44567999999974


No 67 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=47.98  E-value=20  Score=26.02  Aligned_cols=20  Identities=35%  Similarity=0.893  Sum_probs=16.0

Q ss_pred             ceecCCCCcccCCCCccCCC
Q psy9419         426 AQCDPALDRCVCPPFYVGDP  445 (739)
Q Consensus       426 ~~C~~~~~~C~C~~g~~g~~  445 (739)
                      ..|++.+++|.|+++|+|..
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~   30 (46)
T smart00180       11 GTCDPDTGQCECKPNVTGRR   30 (46)
T ss_pred             CcccCCCCEEECCCCCCCCC
Confidence            35666788999999999973


No 68 
>KOG3512|consensus
Probab=47.36  E-value=32  Score=37.65  Aligned_cols=99  Identities=23%  Similarity=0.569  Sum_probs=56.5

Q ss_pred             CCCCCCceeecCCCCeecCC-eee---------------ccCCCCCCCCCC-------------------CCeeee---C
Q psy9419           3 NNQCNTLECQCRPPYQIVAG-ECT---------------LATCGTQGQCPG-------------------GAECVN---I   44 (739)
Q Consensus         3 ~~~~~~~~C~C~~Gy~g~~~-~C~---------------~~~C~~~~~C~~-------------------~g~C~~---~   44 (739)
                      .++-+.++|.|+.+-.|.++ .|.               +++|.. +.|..                   +|+|+|   +
T Consensus       289 ~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~a-c~Cn~harrcrfn~Ely~lSgr~SggvClnCrHn  367 (592)
T KOG3512|consen  289 MDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVA-CNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHN  367 (592)
T ss_pred             eccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccc-cccchhhhhcccchhhhcccCccccceEeecccC
Confidence            34556699999999888764 232               334443 33322                   245664   2


Q ss_pred             CCCeeEE-ecCCCCccCCCCCCcccccccCCCCCCC----CCCceeeCCCCceeeCCCCCcCCCCCCC
Q psy9419          45 AGGVSYC-ACPKGFRPKEDGYCEDVDECAESRHLCG----PGAVCINHPGSYTCQCPPNSSGDPLLGC  107 (739)
Q Consensus        45 ~~g~~~C-~C~~Gy~g~~~~~C~dideC~~~~~~C~----~~~~C~n~~gsy~C~C~~Gy~g~~~~~C  107 (739)
                      +.|- .| .|.+||..+...-=.+...|..  ..|+    .+-+|..+.|  +|.|++|-+|..++.|
T Consensus       368 TaGr-hChyCreGyyRd~s~pl~hrkaCk~--CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnrC  430 (592)
T KOG3512|consen  368 TAGR-HCHYCREGYYRDGSKPLTHRKACKA--CDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNRC  430 (592)
T ss_pred             CCCc-ccccccCccccCCCCCCchhhhhhh--cCCcccccccccccccCC--cccCCCCCcccccccc
Confidence            2232 45 6999998764321112222222  2232    2457777777  8999999999865533


No 69 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=46.33  E-value=20  Score=32.03  Aligned_cols=37  Identities=27%  Similarity=0.571  Sum_probs=26.4

Q ss_pred             CCCCC--CCCCCCCCCeeeec--CCceEEeCCCCCCCCCCCc
Q psy9419         247 VDECL--GVSPCASSALCVNE--KGGFKCVCPKGTTGDPYTL  284 (739)
Q Consensus       247 ~~eC~--~~~~C~~~~~C~~~--~g~~~C~C~~Gy~g~~c~~  284 (739)
                      +.+|.  ..+-|.+ |+|.-.  ...+.|.|..||+|..|+.
T Consensus        42 i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         42 IRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             cccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccccccc
Confidence            44553  2355765 489544  4778999999999999875


No 70 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=42.03  E-value=6.9  Score=30.68  Aligned_cols=14  Identities=21%  Similarity=0.453  Sum_probs=8.4

Q ss_pred             ceeeCCCCCccCCC
Q psy9419         661 GQCLCPSDLIGNPT  674 (739)
Q Consensus       661 ~~C~C~~Gy~G~~c  674 (739)
                      ++=+|.+||+|..|
T Consensus        50 G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   50 GNKVCLPGWTGPNC   63 (63)
T ss_dssp             --EEE-TTEESTTS
T ss_pred             CCCCCCCCCcCCCC
Confidence            56678888888764


No 71 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=32.21  E-value=25  Score=30.46  Aligned_cols=32  Identities=25%  Similarity=0.640  Sum_probs=23.2

Q ss_pred             CCCCC-CCCCCCCeeeeCCC----CeeEEecCCCCcc
Q psy9419          28 TCGTQ-GQCPGGAECVNIAG----GVSYCACPKGFRP   59 (739)
Q Consensus        28 ~C~~~-~~C~~~g~C~~~~~----g~~~C~C~~Gy~g   59 (739)
                      .|... +.|++||.|++...    .=|.|.|.+.+..
T Consensus         7 aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~   43 (103)
T PF12955_consen    7 ACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK   43 (103)
T ss_pred             HHHHhccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence            34444 78999999998632    3589999996554


No 72 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=28.11  E-value=36  Score=29.50  Aligned_cols=31  Identities=26%  Similarity=0.800  Sum_probs=22.8

Q ss_pred             CCCC-CCCCCCCCCEEeeCC-----CCeeeecCCCCC
Q psy9419         571 DECW-SSNTCGSNAVCINTP-----GSYDCRCKEGNA  601 (739)
Q Consensus       571 d~C~-~~~~C~~~g~C~~~~-----g~~~C~C~~G~~  601 (739)
                      ++|. ..+.|..||.|+...     .=|.|.|.+.+.
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~   42 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVV   42 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeecccc
Confidence            3443 567899999999872     448999999544


No 73 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=27.75  E-value=31  Score=25.97  Aligned_cols=33  Identities=18%  Similarity=0.378  Sum_probs=17.7

Q ss_pred             CCCCCCccccc----CCCCceeecCCCceeCCCCccc
Q psy9419         320 VECGAHALCEP----QDHRASCRCELGYTEGLNGKCV  352 (739)
Q Consensus       320 ~~C~~~~~C~~----~~g~~~C~C~~G~~g~~~~~c~  352 (739)
                      .+|+.||.-..    ..|...|.|..-|.|..|.+-+
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence            45677776632    2566899999999999887644


No 74 
>KOG3516|consensus
Probab=27.26  E-value=56  Score=40.05  Aligned_cols=44  Identities=27%  Similarity=0.587  Sum_probs=37.3

Q ss_pred             CCCcccCCCCCCCCCCCCCCeeeecCCceEEeCC-CCCCCCCCCcc
Q psy9419         241 KVGCLDVDECLGVSPCASSALCVNEKGGFKCVCP-KGTTGDPYTLG  285 (739)
Q Consensus       241 ~~~C~d~~eC~~~~~C~~~~~C~~~~g~~~C~C~-~Gy~g~~c~~~  285 (739)
                      ...|.-++.|. +++|.++|.|......|+|.|. .||.|..|..+
T Consensus       539 id~C~i~drCl-PN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts  583 (1306)
T KOG3516|consen  539 IDMCGISDRCL-PNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS  583 (1306)
T ss_pred             ecccccccccC-CccccCCCcccccccceeEeccccccccccccCC
Confidence            34566778888 8999999999988889999999 89999988653


No 75 
>KOG3516|consensus
Probab=22.46  E-value=64  Score=39.61  Aligned_cols=37  Identities=27%  Similarity=0.729  Sum_probs=32.5

Q ss_pred             CCccCCCCCCCCCCCCCCEEeeCCCCeeeecC-CCCCCC
Q psy9419         566 QCVDIDECWSSNTCGSNAVCINTPGSYDCRCK-EGNAGN  603 (739)
Q Consensus       566 ~C~~id~C~~~~~C~~~g~C~~~~g~~~C~C~-~G~~g~  603 (739)
                      .|.-+|.| .+++|.++|.|.-....|.|.|. .||+|.
T Consensus       541 ~C~i~drC-lPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga  578 (1306)
T KOG3516|consen  541 MCGISDRC-LPNPCEHGGKCSQSWDDFECNCELTGYKGA  578 (1306)
T ss_pred             cccccccc-CCccccCCCcccccccceeEeccccccccc
Confidence            45556777 89999999999998889999998 999998


Done!