Query         psy620
Match_columns 1290
No_of_seqs    902 out of 4255
Neff          6.5 
Searched_HMMs 46136
Date          Fri Aug 16 20:15:49 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy620.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/620hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF05735 TSP_C:  Thrombospondin 100.0 7.5E-43 1.6E-47  365.5   7.9  138 1015-1153    1-138 (201)
  2 PF05735 TSP_C:  Thrombospondin 100.0 2.2E-39 4.8E-44  339.4   5.6  108 1132-1290    1-112 (201)
  3 KOG1214|consensus               99.9 7.1E-21 1.5E-25  225.3  17.9  325   19-377   606-951 (1289)
  4 KOG1214|consensus               99.7 5.4E-17 1.2E-21  192.9  18.2  160  242-433   699-865 (1289)
  5 KOG4289|consensus               99.7 4.2E-17 9.1E-22  200.6  16.8  105  104-225  1178-1308(2531)
  6 KOG4289|consensus               99.6 1.8E-15 3.8E-20  186.6  16.8  111  146-274  1179-1317(2531)
  7 KOG0994|consensus               99.6 1.4E-15 3.1E-20  185.4  14.2  312   54-427   763-1095(1758)
  8 KOG1219|consensus               99.6 1.1E-15 2.4E-20  193.7  10.7  111  104-234  3863-3977(4289)
  9 KOG1217|consensus               99.6 2.2E-13 4.7E-18  165.3  25.4  199  148-379    91-306 (487)
 10 KOG1219|consensus               99.5 9.2E-14   2E-18  176.9  11.7  114  142-273  3859-3977(4289)
 11 KOG1217|consensus               99.4 1.4E-12 3.1E-17  158.3  18.2  264  106-427    90-389 (487)
 12 KOG0994|consensus               99.1   5E-10 1.1E-14  138.0  14.0  221  119-400   877-1117(1758)
 13 KOG4260|consensus               98.7 1.6E-08 3.4E-13  109.5   6.5  152  128-326   130-304 (350)
 14 PF02412 TSP_3:  Thrombospondin  98.6   2E-08 4.3E-13   78.3   1.8   36  829-864     1-36  (36)
 15 KOG1836|consensus               98.6 2.1E-06 4.6E-11  115.8  21.9  227  160-428   749-1019(1705)
 16 KOG4260|consensus               98.6 9.7E-08 2.1E-12  103.5   6.8  156  172-373   132-304 (350)
 17 PF02412 TSP_3:  Thrombospondin  98.5 2.5E-08 5.4E-13   77.7   1.5   35  927-961     2-36  (36)
 18 KOG1225|consensus               98.4 8.1E-07 1.7E-11  107.6  11.5  132  127-329   234-365 (525)
 19 KOG1225|consensus               98.4 1.7E-06 3.7E-11  104.9  12.0  131  217-428   234-365 (525)
 20 KOG1836|consensus               97.8 0.00039 8.4E-09   94.7  16.9  108  260-384   697-816 (1705)
 21 PF07645 EGF_CA:  Calcium-bindi  97.6   3E-05 6.5E-10   63.1   1.9   39  290-331     1-39  (42)
 22 PF12947 EGF_3:  EGF domain;  I  97.5 9.3E-05   2E-09   58.1   2.9   31  348-379     6-36  (36)
 23 KOG1226|consensus               97.4   0.001 2.3E-08   82.5  11.7   99  259-380   479-580 (783)
 24 KOG1226|consensus               97.1  0.0024 5.1E-08   79.5  10.7  137  168-357   479-636 (783)
 25 PF00008 EGF:  EGF-like domain   97.0 0.00018   4E-09   55.0   0.2   28  593-621     3-31  (32)
 26 PF07645 EGF_CA:  Calcium-bindi  97.0 0.00096 2.1E-08   54.3   4.2   32  341-373     2-34  (42)
 27 smart00179 EGF_CA Calcium-bind  96.7  0.0014   3E-08   51.6   3.3   35  146-183     2-38  (39)
 28 PF00008 EGF:  EGF-like domain   96.6 0.00061 1.3E-08   52.1   0.4   29  108-138     1-30  (32)
 29 smart00179 EGF_CA Calcium-bind  96.6  0.0022 4.8E-08   50.5   3.5   37  105-144     2-39  (39)
 30 PF12947 EGF_3:  EGF domain;  I  96.5 0.00082 1.8E-08   52.8   0.7   32  111-143     5-36  (36)
 31 PF06247 Plasmod_Pvs28:  Plasmo  96.5 0.00065 1.4E-08   71.5  -0.3  133  215-376    18-163 (197)
 32 PF12662 cEGF:  Complement Clr-  96.2  0.0025 5.4E-08   45.4   1.5   22  608-630     1-24  (24)
 33 PF06247 Plasmod_Pvs28:  Plasmo  96.1 0.00086 1.9E-08   70.6  -1.4  141  159-332    12-166 (197)
 34 cd00054 EGF_CA Calcium-binding  96.0  0.0069 1.5E-07   47.0   3.2   35  146-183     2-37  (38)
 35 cd00054 EGF_CA Calcium-binding  95.9   0.008 1.7E-07   46.6   3.5   36  105-144     2-38  (38)
 36 PF12662 cEGF:  Complement Clr-  95.9  0.0041 8.9E-08   44.3   1.6   22  698-720     1-24  (24)
 37 PF14670 FXa_inhibition:  Coagu  95.8   0.004 8.8E-08   48.9   1.1   36  294-334     1-36  (36)
 38 cd00053 EGF Epidermal growth f  94.9   0.026 5.7E-07   43.0   3.2   28  151-179     5-32  (36)
 39 cd00053 EGF Epidermal growth f  94.6   0.037   8E-07   42.1   3.4   31  111-143     5-35  (36)
 40 smart00181 EGF Epidermal growt  94.4   0.044 9.4E-07   42.2   3.3   26  112-139     6-31  (35)
 41 smart00181 EGF Epidermal growt  94.3   0.041 8.9E-07   42.4   3.0   33  148-183     1-34  (35)
 42 KOG1218|consensus               91.5     3.1 6.8E-05   48.3  14.7  193  126-374    14-209 (316)
 43 PF14670 FXa_inhibition:  Coagu  88.4    0.49 1.1E-05   37.4   3.1   29  348-379     6-36  (36)
 44 KOG1218|consensus               88.0      11 0.00024   43.7  15.5  102  131-257    93-200 (316)
 45 PF12661 hEGF:  Human growth fa  87.3    0.36 7.9E-06   29.6   1.3   13  365-379     1-13  (13)
 46 PF07974 EGF_2:  EGF-like domai  85.4    0.81 1.8E-05   35.2   2.7   27  348-379     6-32  (32)
 47 cd01475 vWA_Matrilin VWA_Matri  85.4    0.71 1.5E-05   51.4   3.5   43  284-331   180-222 (224)
 48 KOG3514|consensus               83.2     1.8 3.9E-05   56.1   5.9   34  107-144   625-659 (1591)
 49 KOG3512|consensus               82.7     2.2 4.8E-05   51.1   6.0  116  250-380   286-428 (592)
 50 PF00683 TB:  TB domain;  Inter  82.6   0.089 1.9E-06   43.0  -3.7   22  465-486    18-39  (42)
 51 PF07974 EGF_2:  EGF-like domai  82.6     1.2 2.5E-05   34.3   2.5   24  112-138     6-29  (32)
 52 KOG3516|consensus               81.5     3.2 6.9E-05   54.9   7.3   35  106-144   956-991 (1306)
 53 PF12946 EGF_MSP1_1:  MSP1 EGF   81.4     1.1 2.4E-05   35.5   2.0   32  348-379     5-36  (37)
 54 smart00051 DSL delta serrate l  79.5     2.3 4.9E-05   38.0   3.6   44  320-379    20-63  (63)
 55 smart00682 G2F G2 nidogen doma  77.8     2.1 4.5E-05   47.6   3.6   67   21-96    153-222 (227)
 56 cd01475 vWA_Matrilin VWA_Matri  76.9     1.7 3.8E-05   48.2   2.8   38  141-179   182-219 (224)
 57 KOG3516|consensus               72.0     9.6 0.00021   50.7   7.8   39  102-144   542-581 (1306)
 58 PF01683 EB:  EB module;  Inter  69.8     7.5 0.00016   33.0   4.3   33  342-379    20-52  (52)
 59 PF12946 EGF_MSP1_1:  MSP1 EGF   69.0    0.92   2E-05   35.9  -1.3   34  149-183     2-36  (37)
 60 KOG3512|consensus               68.8     8.6 0.00019   46.4   5.9   16  164-179   368-383 (592)
 61 PTZ00214 high cysteine membran  64.8 1.2E+02  0.0025   40.6  15.6   85  119-227   366-458 (800)
 62 smart00051 DSL delta serrate l  61.6      10 0.00022   34.0   3.6   47  217-272    17-63  (63)
 63 PF03302 VSP:  Giardia variant-  60.9      40 0.00086   41.2   9.8  128  130-271     2-136 (397)
 64 cd00255 nidG2 Nidogen, G2 doma  59.9     6.7 0.00015   43.7   2.7   69   19-93    150-222 (224)
 65 PHA02887 EGF-like protein; Pro  57.6       9  0.0002   38.0   2.8   31  348-381    92-123 (126)
 66 PHA03099 epidermal growth fact  52.2      13 0.00028   37.5   3.0   31  348-381    51-82  (139)
 67 KOG3514|consensus               51.8      10 0.00023   49.6   2.8   35  588-626   624-659 (1591)
 68 PHA03099 epidermal growth fact  50.3      15 0.00031   37.2   3.0   39  104-146    41-83  (139)
 69 PF03302 VSP:  Giardia variant-  49.0 2.8E+02  0.0062   33.9  14.4   44  167-228    91-134 (397)
 70 PF00954 S_locus_glycop:  S-loc  44.3      19 0.00041   35.5   2.8   32  105-138    77-108 (110)
 71 PHA02887 EGF-like protein; Pro  43.6      18 0.00038   36.1   2.4   28  206-236    97-124 (126)
 72 cd00055 EGF_Lam Laminin-type e  34.6      41 0.00088   28.4   3.0   35  114-179     4-42  (50)
 73 PF01683 EB:  EB module;  Inter  33.8      50  0.0011   27.9   3.4   29  106-139    20-48  (52)
 74 PF00954 S_locus_glycop:  S-loc  33.6      34 0.00074   33.7   2.8   32  341-374    77-108 (110)
 75 smart00210 TSPN Thrombospondin  31.6      79  0.0017   34.1   5.4   33   29-65    112-144 (184)
 76 PTZ00214 high cysteine membran  29.9 1.8E+02  0.0039   38.9   9.1   38  314-357   681-722 (800)
 77 KOG3509|consensus               28.8 1.1E+02  0.0023   41.3   6.7  117  128-273   719-841 (964)
 78 TIGR00648 recU recombination p  25.9      32 0.00069   36.9   1.1   46 1226-1272  101-154 (169)
 79 PF00053 Laminin_EGF:  Laminin   25.4      61  0.0013   27.1   2.5   22  354-380    11-32  (49)
 80 PRK02234 recU Holliday junctio  21.7      47   0.001   36.5   1.4   47 1225-1272  123-177 (195)
 81 PF01414 DSL:  Delta serrate li  20.0      26 0.00057   31.3  -0.8   48  314-379    16-63  (63)

No 1  
>PF05735 TSP_C:  Thrombospondin C-terminal region;  InterPro: IPR008859 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimer. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers []. EGF, TSP3 repeats and the C-terminal domain are thus the hallmark of a thrombospondin. The globular C-terminal domain is a beta sandwich of two curved antiparallel beta-sheets []. The fold is an elaboration of the jelly role topology, with strand B3-B7, B11 and B14-B15 forming the eight-stranded jelly roll motif. The function of the C-terminal domain is not yet known.; GO: 0005509 calcium ion binding, 0007155 cell adhesion, 0005576 extracellular region; PDB: 1UX6_A 1YO8_A 2RHP_A 3FBY_C.
Probab=100.00  E-value=7.5e-43  Score=365.48  Aligned_cols=138  Identities=64%  Similarity=1.137  Sum_probs=106.7

Q ss_pred             CCCCceeeccCCceEEEeecCCCCcccccccccceeeecceeecccCCCCccceEeeeccCCcEEEEeccccceeeeecc
Q psy620         1015 QIDPHWVIYNHGAEILQTMNSDPGLAIGQDKFSGVDFEGTFFVDTDIDDDYAGFVFSYQSSQKFYVMMWKKNSQVYWQTT 1094 (1290)
Q Consensus      1015 ~~d~~~~v~~~g~~~~q~~~~dp~~~~g~~~~~~~d~~g~~~~~~~~d~~~~gfvf~yq~~~~f~~~~~~~~~~~~w~~~ 1094 (1290)
                      |.||+|+|.++|+||+|++||||+++||.++|.+|||+|||+|++..|||||||||+||+|+||||||||+..|+||+.+
T Consensus         1 q~dP~W~v~~~G~ev~Qt~NsdP~l~ig~~~~~~vdf~GT~~Vnt~~DDDyiGFVFGYQsn~~FYvv~WKq~~Q~y~~~~   80 (201)
T PF05735_consen    1 QIDPNWVVSNQGAEVVQTLNSDPGLAIGPDNFGGVDFSGTFFVNTTSDDDYIGFVFGYQSNRKFYVVMWKQGNQNYWESS   80 (201)
T ss_dssp             S----EEEECCCTEEEE-SS-SSEEEEEEEEESSEEEEEEEEE--SS---EEEEEEEEEETTEEEEEEEESS-EE-S--S
T ss_pred             CCCCceEEecCCeEEEEeccCCCeEEEccceecceEEEEEEEEecCCCCCEEEEEEEecCCCeEEEEEeeccccccccCC
Confidence            67999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ccccccCCccEEEEecCCCCCCcccccccccCCCcccccccCCCcCCCcccccCCCCcc
Q psy620         1095 PFRAVAEPGIQLKVVDSATGPGTMLRNSLWHTGDTENQCDLAEPCDPRVQCTNLFPGYR 1153 (1290)
Q Consensus      1095 ~f~~~~~~g~~~~~~~~~~~~~~~~~~~~w~~~~~~~q~~~~w~~~~~~~~~~~~~~~~ 1153 (1290)
                      ||||.|++|||+|+|+|+||||++|||+|||+++|.+||++||+ +|...+|+....|+
T Consensus        81 p~~~~a~~Gl~iK~V~s~tGpg~~l~nalWh~~~t~~qv~llw~-dp~~~GW~~~t~Y~  138 (201)
T PF05735_consen   81 PFRATAEPGLQIKLVDSTTGPGEMLRNALWHTGDTTNQVKLLWH-DPGNIGWKDNTAYR  138 (201)
T ss_dssp             SS--EE-SEEEEEEEE-SS-TTHHHHHHHHSSS-BTTTEEEEEE--TT-----TT-EEE
T ss_pred             CccccccceEEEEEEecCcCCchhhhhhhccCCCccceeEEEEe-CCCcCCCcCCccEE
Confidence            99999999999999999999999999999999999999999999 88777776655554


No 2  
>PF05735 TSP_C:  Thrombospondin C-terminal region;  InterPro: IPR008859 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimer. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers []. EGF, TSP3 repeats and the C-terminal domain are thus the hallmark of a thrombospondin. The globular C-terminal domain is a beta sandwich of two curved antiparallel beta-sheets []. The fold is an elaboration of the jelly role topology, with strand B3-B7, B11 and B14-B15 forming the eight-stranded jelly roll motif. The function of the C-terminal domain is not yet known.; GO: 0005509 calcium ion binding, 0007155 cell adhesion, 0005576 extracellular region; PDB: 1UX6_A 1YO8_A 2RHP_A 3FBY_C.
Probab=100.00  E-value=2.2e-39  Score=339.41  Aligned_cols=108  Identities=50%  Similarity=0.779  Sum_probs=82.4

Q ss_pred             ccccCCCc----CCCcccccCCCCcccCCCCCCCcCCCCccceeeeEEEEeeecccccccccCCCCCCccceeeeecccc
Q psy620         1132 QCDLAEPC----DPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGSALLLVRINSQA 1207 (1290)
Q Consensus      1132 q~~~~w~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1207 (1290)
                      |++|+|++    .+++|++||+||++        +|.+.|.+|||+|||.                         ||   
T Consensus         1 q~dP~W~v~~~G~ev~Qt~NsdP~l~--------ig~~~~~~vdf~GT~~-------------------------Vn---   44 (201)
T PF05735_consen    1 QIDPNWVVSNQGAEVVQTLNSDPGLA--------IGPDNFGGVDFSGTFF-------------------------VN---   44 (201)
T ss_dssp             S----EEEECCCTEEEE-SS-SSEEE--------EEEEEESSEEEEEEEE-------------------------E----
T ss_pred             CCCCceEEecCCeEEEEeccCCCeEE--------EccceecceEEEEEEE-------------------------Ee---
Confidence            67778877    78999999999998        7777999999999985                         66   


Q ss_pred             chhhchhhhhhhhhhcccccceeeEEEEeecCCcEEEEEeeecccccccccCcceecCCccEEEEEeCCCCCchhhhhcc
Q psy620         1208 WTSRELSSWIRILMTTMLDLCSGNVATFYQSSQKFYVMMWKKNSQVYWQTTPFRAVAEPGIQLKVVDSATGPGTMLRNSL 1287 (1290)
Q Consensus      1208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~yw~~~~~~~~~~~~~~~~~~~~~~~~g~~lrn~l 1287 (1290)
                                    |+.+|||+||| ||||+|++||||||||.+|+||+++||||+|++|||||+|+|+||||++|||||
T Consensus        45 --------------t~~DDDyiGFV-FGYQsn~~FYvv~WKq~~Q~y~~~~p~~~~a~~Gl~iK~V~s~tGpg~~l~nal  109 (201)
T PF05735_consen   45 --------------TTSDDDYIGFV-FGYQSNRKFYVVMWKQGNQNYWESSPFRATAEPGLQIKLVDSTTGPGEMLRNAL  109 (201)
T ss_dssp             ---------------SS---EEEEE-EEEEETTEEEEEEEESS-EE-S--SSS--EE-SEEEEEEEE-SS-TTHHHHHHH
T ss_pred             --------------cCCCCCEEEEE-EEecCCCeEEEEEeeccccccccCCCccccccceEEEEEEecCcCCchhhhhhh
Confidence                          34578899988 899999999999999999999999999999999999999999999999999999


Q ss_pred             cCC
Q psy620         1288 WHT 1290 (1290)
Q Consensus      1288 w~~ 1290 (1290)
                      |||
T Consensus       110 Wh~  112 (201)
T PF05735_consen  110 WHT  112 (201)
T ss_dssp             HSS
T ss_pred             ccC
Confidence            997


No 3  
>KOG1214|consensus
Probab=99.85  E-value=7.1e-21  Score=225.33  Aligned_cols=325  Identities=24%  Similarity=0.492  Sum_probs=228.7

Q ss_pred             ceeecccccccccccCccc--eEEEEeeeeccccceeeeecccCCCccc---chhh--hhhhcccccccceeeeeecccc
Q psy620           19 PIVEGWSVKDDLLDDGVIN--GLLLGVKQDIMGARYTLYMDCVDHGTVA---MTQS--LKKMFDSMKNPQMRLRKTDEES   91 (1290)
Q Consensus        19 ~~~~~~s~~~~~~~~~~~~--~l~~~~~~~i~G~~~~ly~~C~~~~~~~---~~~~--~~~~~~~~~~~~~~l~~~~~~~   91 (1290)
                      ..+...++++|.+..+.+.  +.+++++|+|.      |..|.+..+.+   ..++  +-++|+.|..++..|+++..+.
T Consensus       606 s~vtstssr~y~~t~ga~~S~~~sy~~hq~it------yq~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~  679 (1289)
T KOG1214|consen  606 STVTSTSSRDYSLTFGAINSQTWSYRIHQNIT------YQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQ  679 (1289)
T ss_pred             ceeecccccceeeecCcccccceeEEEeecce------eEEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhc
Confidence            3444556677888888776  78999999988      88998776654   3333  3499999999999999999999


Q ss_pred             cccccCCCcCcccCCCCC-CCCCCCCCCeeecCCC-CcccccCCCCcccCCCCCCCCCCCCC--CCCCCCCeeccCCCCc
Q psy620           92 VDEIELPAIPIVKKPTCA-TDNPCFPGVECRDTRE-GPRCMRCPDGYVGDGIHCKPGVTCNM--RPCFQGVQCFDTVEGY  167 (1290)
Q Consensus        92 ~~~~~~~~~~~~~~d~C~-~~~pC~~gg~C~~~~g-~y~C~~C~~Gy~Gdg~~CedideC~~--~pC~~gg~C~n~~g~y  167 (1290)
                      +..+.....+ ...++|- .++-|.-++.|....+ .|.| .|..||.|+++.|.++++|+.  ..|+.++.|++.+++|
T Consensus       680 igpV~E~S~~-~~~npCy~gsh~cdt~a~C~pg~~~~~tc-ecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~  757 (1289)
T KOG1214|consen  680 IGPVKEDSDP-TPVNPCYDGSHMCDTTARCHPGTGVDYTC-ECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSY  757 (1289)
T ss_pred             ccceecCCCC-cccccceecCcccCCCccccCCCCcceEE-EEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCce
Confidence            9888643332 2346665 3788998999998764 5999 999999999999999999998  4599999999999999


Q ss_pred             ccccCCCCCC--CCCCCceec-ccCCCCCCCCCCCCCC-cceeeeccCCCCCceecCCCCCCCcCCCCccccCCccCCCC
Q psy620          168 TCGPCPSGYT--GDGERCQRI-GGCSRNPCAQGKLNEK-TRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDLAE  243 (1290)
Q Consensus       168 ~C~~C~~Gy~--Gdg~~C~~i-deC~~~pC~~g~~~~~-~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~~~  243 (1290)
                      +| +|..||.  +++.+|..+ .+-..++|..+..... ...+.|+... .+.|.| +|.+||.|+|..|.+++||. ++
T Consensus       758 rc-eC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG-gs~y~C-~CLPGfsGDG~~c~dvDeC~-ps  833 (1289)
T KOG1214|consen  758 RC-ECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG-GSTYSC-ACLPGFSGDGHQCTDVDECS-PS  833 (1289)
T ss_pred             eE-EEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC-CceEEE-eecCCccCCccccccccccC-cc
Confidence            99 9999986  777789873 3222345554411110 0115666654 468999 99999999999999999998 89


Q ss_pred             CCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCC--CCCCCCCCcccc--CCCCcEEcC
Q psy620          244 PCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGR--NGGCDSNSMCTN--TEGSFTCTS  319 (1290)
Q Consensus       244 pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~--~g~C~~~g~C~n--~~gsy~C~~  319 (1290)
                      .|+..+.|.+++++|.|. |.+||.|+. .+++..+         .....|...+  .-.|+.+..|..  -+.+|.|  
T Consensus       834 rChp~A~CyntpgsfsC~-C~pGy~GDG-f~CVP~~---------~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~--  900 (1289)
T KOG1214|consen  834 RCHPAATCYNTPGSFSCR-CQPGYYGDG-FQCVPDT---------SSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEV--  900 (1289)
T ss_pred             ccCCCceEecCCCcceee-cccCccCCC-ceecCCC---------ccCCccccccccceeeccccceeEeeCCCcccC--
Confidence            999999999999999999 999999987 1111110         1122333221  122554443332  2456788  


Q ss_pred             cCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeee--cCCceEEecCCCcccCCC
Q psy620          320 LCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRI--LGNHYACKCDNGWAGDGQ  377 (1290)
Q Consensus       320 ~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~--~~gsy~C~C~~Gy~GdG~  377 (1290)
                      .|.++-.|.. ...|-...  ...|   ..|.-+|.|..+  .+.+++|.|..   +||+
T Consensus       901 p~~~~ppG~~-~~~c~~~~--~~~v---p~Cd~hgh~ap~qchG~~~~CwCvd---~dGr  951 (1289)
T KOG1214|consen  901 PGTQTPPGST-PPHCGPSP--EQYV---PQCDDHGHFAPLQCHGKSDFCWCVD---KDGR  951 (1289)
T ss_pred             CCCCCCCCCC-CCCCCCcc--cccC---CCccccccccccccCCCcceeEEec---CCCc
Confidence            7777654432 23454311  1112   246666766543  24458999987   5665


No 4  
>KOG1214|consensus
Probab=99.73  E-value=5.4e-17  Score=192.88  Aligned_cols=160  Identities=33%  Similarity=0.767  Sum_probs=129.4

Q ss_pred             CCCCCCCcccccCCC-CeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCc
Q psy620          242 AEPCDPRVQCTNLFP-GYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSL  320 (1290)
Q Consensus       242 ~~pC~~~g~C~n~~g-sy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~  320 (1290)
                      +..|..++.|....+ .|+|. |..||.|..              +.|.+++||+.. .+.|.+++.|++.+++|+|  +
T Consensus       699 sh~cdt~a~C~pg~~~~~tce-cs~g~~gdg--------------r~c~d~~eca~~-~~~CGp~s~Cin~pg~~rc--e  760 (1289)
T KOG1214|consen  699 SHMCDTTARCHPGTGVDYTCE-CSSGYQGDG--------------RNCVDENECATG-FHRCGPNSVCINLPGSYRC--E  760 (1289)
T ss_pred             CcccCCCccccCCCCcceEEE-EeeccCCCC--------------CCCCChhhhccC-CCCCCCCceeecCCCceeE--E
Confidence            345666777876544 58898 999998876              889999999998 7889999999999999999  9


Q ss_pred             CcCCccccCCCCCCCCCCC--CCCCCCCC-CCCCCCC--eEeeecCCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCC
Q psy620          321 CRNSYMVRNVSVGCQSQNF--GADVCPDG-TRCDRNA--KCTRILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLAC  395 (1290)
Q Consensus       321 C~~Gy~g~~~g~~C~~~~~--~id~C~~~-~~C~~~g--~C~~~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C  395 (1290)
                      |..||.....+..|.....  .++.|..+ +.|...+  .|+....++|.|.|.+||.|||..|.   +.|       .|
T Consensus       761 C~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~---dvD-------eC  830 (1289)
T KOG1214|consen  761 CRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT---DVD-------EC  830 (1289)
T ss_pred             EeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccc---ccc-------cc
Confidence            9999988887888976432  35778877 8898755  56666678899999999999999884   333       56


Q ss_pred             CCCCCC-CCccccCCCCCCCCccccCCCCCCCCCCcccc
Q psy620          396 PDRKCR-KDNCVHIPNSGINNHADNCPRNANPDQRMCGH  433 (1290)
Q Consensus       396 ~~~~C~-ng~C~~~~gs~~~~~~C~C~~Gy~G~~~~c~~  433 (1290)
                      +++.|. +.+|++.+|+    |.|.|.+||.|++-.|..
T Consensus       831 ~psrChp~A~Cyntpgs----fsC~C~pGy~GDGf~CVP  865 (1289)
T KOG1214|consen  831 SPSRCHPAATCYNTPGS----FSCRCQPGYYGDGFQCVP  865 (1289)
T ss_pred             CccccCCCceEecCCCc----ceeecccCccCCCceecC
Confidence            678884 4589999976    679999999999866643


No 5  
>KOG4289|consensus
Probab=99.72  E-value=4.2e-17  Score=200.61  Aligned_cols=105  Identities=33%  Similarity=0.841  Sum_probs=93.8

Q ss_pred             cCCCCCCCCCCCCCCeeec----------------------CCCCcccccCCCCcccCCCCCC-CCCCCCCCCCCCCCee
Q psy620          104 KKPTCATDNPCFPGVECRD----------------------TREGPRCMRCPDGYVGDGIHCK-PGVTCNMRPCFQGVQC  160 (1290)
Q Consensus       104 ~~d~C~~~~pC~~gg~C~~----------------------~~g~y~C~~C~~Gy~Gdg~~Ce-dideC~~~pC~~gg~C  160 (1290)
                      +.+.|. ..||.|..+|+.                      ..++++| +||+||+|  ..|+ .+++|.+.||.++++|
T Consensus      1178 dDniCl-rEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrC-rCPpGFTg--d~CeTeiDlCYs~pC~nng~C 1253 (2531)
T KOG4289|consen 1178 DDNICL-REPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRC-RCPPGFTG--DYCETEIDLCYSGPCGNNGRC 1253 (2531)
T ss_pred             cCchhh-cchhHHHHhhhhheeecccCccccccceeeeeccccCceeE-eCCCCCCc--ccccchhHhhhcCCCCCCCce
Confidence            457898 999999999975                      2357999 99999999  4999 8999999999999999


Q ss_pred             ccCCCCcccccCCCCCCCCCCCcee---cccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCC
Q psy620          161 FDTVEGYTCGPCPSGYTGDGERCQR---IGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEG  225 (1290)
Q Consensus       161 ~n~~g~y~C~~C~~Gy~Gdg~~C~~---ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~G  225 (1290)
                      ....|+|+| .|.+||+|.  +||.   ...|.+..|.++        ++|++.. .++|.| .|+.|
T Consensus      1254 ~srEggYtC-eCrpg~tGe--hCEvs~~agrCvpGvC~ng--------gtC~~~~-nggf~c-~Cp~g 1308 (2531)
T KOG4289|consen 1254 RSREGGYTC-ECRPGFTGE--HCEVSARAGRCVPGVCKNG--------GTCVNLL-NGGFCC-HCPYG 1308 (2531)
T ss_pred             EEecCceeE-EecCCcccc--ceeeecccCccccceecCC--------CEEeecC-CCceec-cCCCc
Confidence            999999999 999999998  9986   467888999999        9999886 478999 99987


No 6  
>KOG4289|consensus
Probab=99.64  E-value=1.8e-15  Score=186.64  Aligned_cols=111  Identities=34%  Similarity=0.820  Sum_probs=92.9

Q ss_pred             CCCCCCCCCCCCCeecc----------------------CCCCcccccCCCCCCCCCCCcee-cccCCCCCCCCCCCCCC
Q psy620          146 GVTCNMRPCFQGVQCFD----------------------TVEGYTCGPCPSGYTGDGERCQR-IGGCSRNPCAQGKLNEK  202 (1290)
Q Consensus       146 ideC~~~pC~~gg~C~n----------------------~~g~y~C~~C~~Gy~Gdg~~C~~-ideC~~~pC~~g~~~~~  202 (1290)
                      -+.|...||.+...|+.                      ..++++| +||+||+|+  .|+. ++.|-+.||.++     
T Consensus      1179 DniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrC-rCPpGFTgd--~CeTeiDlCYs~pC~nn----- 1250 (2531)
T KOG4289|consen 1179 DNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRC-RCPPGFTGD--YCETEIDLCYSGPCGNN----- 1250 (2531)
T ss_pred             CchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeE-eCCCCCCcc--cccchhHhhhcCCCCCC-----
Confidence            35688888888888852                      3467899 999999999  9998 999999999999     


Q ss_pred             cceeeeccCCCCCceecCCCCCCCcCCCCccc---cCCccCCCCCCCCCcccccC-CCCeecccCCCC-CccCCCCc
Q psy620          203 TRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCH---DIDECDLAEPCDPRVQCTNL-FPGYRCDPCPAG-FTGSTGVQ  274 (1290)
Q Consensus       203 ~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~---dideC~~~~pC~~~g~C~n~-~gsy~C~~C~~G-y~G~~Ce~  274 (1290)
                         ++|...+  ++|+| .|.+||+|  .+|+   ..-.|. +..|.++++|++. .++|.|. |+.| |+++.|+.
T Consensus      1251 ---g~C~srE--ggYtC-eCrpg~tG--ehCEvs~~agrCv-pGvC~nggtC~~~~nggf~c~-Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1251 ---GRCRSRE--GGYTC-ECRPGFTG--EHCEVSARAGRCV-PGVCKNGGTCVNLLNGGFCCH-CPYGEFEDPRCEV 1317 (2531)
T ss_pred             ---CceEEec--CceeE-EecCCccc--cceeeecccCccc-cceecCCCEEeecCCCceecc-CCCcccCCCceEE
Confidence               9999876  69999 99999999  8997   345676 8899999999875 5789998 9987 55666654


No 7  
>KOG0994|consensus
Probab=99.63  E-value=1.4e-15  Score=185.36  Aligned_cols=312  Identities=22%  Similarity=0.464  Sum_probs=190.7

Q ss_pred             eeecccCCCcccchhhhhhhcccccccceeeeeecccccccccCCCcCcccCCCCCC----CCCCCC-CC--eeecCCCC
Q psy620           54 LYMDCVDHGTVAMTQSLKKMFDSMKNPQMRLRKTDEESVDEIELPAIPIVKKPTCAT----DNPCFP-GV--ECRDTREG  126 (1290)
Q Consensus        54 ly~~C~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~d~C~~----~~pC~~-gg--~C~~~~g~  126 (1290)
                      +.++|.+.+++.........+.+ .+|+++.|....-.++.|.+.+..|..- +|..    .+-|.. -|  .|+...-+
T Consensus       763 ~~CnCnptGSlS~vCn~~GGqCq-CkPnVVGR~CdqCApGtyGFGPsGCk~C-dC~~~Gs~~~~Cd~~tGQC~C~~g~yg  840 (1758)
T KOG0994|consen  763 SMCNCNPTGSLSSVCNPNGGQCQ-CKPNVVGRRCDQCAPGTYGFGPSGCKAC-DCNSIGSLDKYCDKITGQCQCRPGTYG  840 (1758)
T ss_pred             cccccCCCccccccccCCCceec-ccCccccccccccCCcccCcCCccCccc-cccccccccccccccccceeeccccch
Confidence            46888888888776666666665 7888888888777777787766654321 2221    122221 12  35665556


Q ss_pred             cccccCCCCcccCCCCCC------CCCCCCCC--CCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCCCCCCCCCC
Q psy620          127 PRCMRCPDGYVGDGIHCK------PGVTCNMR--PCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPCAQGK  198 (1290)
Q Consensus       127 y~C~~C~~Gy~Gdg~~Ce------dideC~~~--pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~  198 (1290)
                      .+|.+|.+||+|. +.|.      ..++|.+.  .|.   .|.+...++.|++|..||+|+++. ..-..|.+.||+.+.
T Consensus       841 rqCnqCqpG~WgF-PeCr~CqCNgHA~~Cd~~tGaCi---~CqD~T~G~~CdrCl~GyyGdP~l-g~g~~CrPCpCP~gp  915 (1758)
T KOG0994|consen  841 RQCNQCQPGYWGF-PECRPCQCNGHADTCDPITGACI---DCQDSTTGHSCDRCLDGYYGDPRL-GSGIGCRPCPCPDGP  915 (1758)
T ss_pred             hhccccCCCccCC-CcCccccccCcccccCccccccc---cccccccccchhhhhccccCCccc-CCCCCCCCCCCCCCC
Confidence            7899999999996 4444      23444432  122   367788999999999999998643 123467888888773


Q ss_pred             CCCCcceeeeccCCCCCceecCCCCCCCcCCCCccccCCccCC---CCCCCCCcccccCCCCeecccCCCCCccCCCCcc
Q psy620          199 LNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDL---AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQG  275 (1290)
Q Consensus       199 ~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~---~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~  275 (1290)
                      ......--.|...+......| .|.+||.|  .+|+   +|..   .+|=. +++|.      .|. |.--.-       
T Consensus       916 ~Sg~~~A~sC~~d~~t~~ivC-~C~~GY~G--~RCe---~CA~~~fGnP~~-GGtCq------~Ce-C~~NiD-------  974 (1758)
T KOG0994|consen  916 ASGRQHADSCYLDTRTQQIVC-HCQEGYSG--SRCE---ICADNHFGNPSE-GGTCQ------KCE-CSNNID-------  974 (1758)
T ss_pred             ccchhccccccccccccceee-ecccCccc--cchh---hhcccccCCccc-CCccc------ccc-ccCCcC-------
Confidence            222212234544333335678 89999998  6775   3541   12222 34442      233 321100       


Q ss_pred             ccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCe
Q psy620          276 VGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAK  355 (1290)
Q Consensus       276 ~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~  355 (1290)
                                  =.+...|... ++.|.   .|.....+-+|+ .|+.||+|..-...|+.     -.|.....= +.+.
T Consensus       975 ------------~~d~~aCD~~-TG~CL---kCL~hTeG~hCe-~Ck~Gf~GdA~~q~Cqr-----C~Cn~LGTn-~~~~ 1031 (1758)
T KOG0994|consen  975 ------------LYDPGACDVA-TGACL---KCLYHTEGDHCE-HCKDGFYGDALRQNCQR-----CVCNFLGTN-STCH 1031 (1758)
T ss_pred             ------------ccCCCccchh-hchhh---hhhhcccccchh-hccccchhHHHHhhhhh-----heccccccC-Cccc
Confidence                        0244567666 67787   688777788997 99999998876667765     123321100 1134


Q ss_pred             EeeecCCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCCCCCCCCC-C--ccccCCCCCCCCccccCCCCCCCC
Q psy620          356 CTRILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLACPDRKCRK-D--NCVHIPNSGINNHADNCPRNANPD  427 (1290)
Q Consensus       356 C~~~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C~~~~C~n-g--~C~~~~gs~~~~~~C~C~~Gy~G~  427 (1290)
                      |...   +.+|.|.+...|  .+|......-+-..+...|.+-.|.. +  +|..      ...+|.|.+||-|.
T Consensus      1032 CDr~---tGQCpClpNv~G--~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~------ftGQCqCkpGfGGR 1095 (1758)
T KOG0994|consen 1032 CDRF---TGQCPCLPNVQG--VRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNE------FTGQCQCKPGFGGR 1095 (1758)
T ss_pred             cccc---cCcCCCCccccc--ccccccccchhccccCCCCCccCCCccCCccccc------cccceeccCCCCCc
Confidence            5443   459999999999  88865443333345677787766633 1  3432      24469999999443


No 8  
>KOG1219|consensus
Probab=99.61  E-value=1.1e-15  Score=193.74  Aligned_cols=111  Identities=40%  Similarity=1.015  Sum_probs=103.3

Q ss_pred             cCCCCCCCCCCCCCCeeecCC-CCcccccCCCCcccCCCCCC-CCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCC
Q psy620          104 KKPTCATDNPCFPGVECRDTR-EGPRCMRCPDGYVGDGIHCK-PGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGE  181 (1290)
Q Consensus       104 ~~d~C~~~~pC~~gg~C~~~~-g~y~C~~C~~Gy~Gdg~~Ce-dideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~  181 (1290)
                      ..+.|. .+||+++|+|...+ ++|.| .|++-|+|  .+|| ++..|+++||..|++|+...++|.| .|+.||+|.  
T Consensus      3863 ~~d~C~-~npCqhgG~C~~~~~ggy~C-kCpsqysG--~~CEi~~epC~snPC~~GgtCip~~n~f~C-nC~~gyTG~-- 3935 (4289)
T KOG1219|consen 3863 LTDPCN-DNPCQHGGTCISQPKGGYKC-KCPSQYSG--NHCEIDLEPCASNPCLTGGTCIPFYNGFLC-NCPNGYTGK-- 3935 (4289)
T ss_pred             cccccc-cCcccCCCEecCCCCCceEE-eCcccccC--cccccccccccCCCCCCCCEEEecCCCeeE-eCCCCccCc--
Confidence            349999 99999999999876 77999 99999999  8999 7889999999999999999999999 999999999  


Q ss_pred             Ccee--cccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCccc
Q psy620          182 RCQR--IGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCH  234 (1290)
Q Consensus       182 ~C~~--ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~  234 (1290)
                      +|+.  +++|..++|.++        +.|+++.  ++|.| .|.+||.|  +.|.
T Consensus      3936 ~Ce~~Gi~eCs~n~C~~g--------g~C~n~~--gsf~C-ncT~g~~g--r~c~ 3977 (4289)
T KOG1219|consen 3936 RCEARGISECSKNVCGTG--------GQCINIP--GSFHC-NCTPGILG--RTCC 3977 (4289)
T ss_pred             eeecccccccccccccCC--------ceeeccC--CceEe-ccChhHhc--ccCc
Confidence            9997  899999999999        9999986  68999 99999999  7774


No 9  
>KOG1217|consensus
Probab=99.57  E-value=2.2e-13  Score=165.35  Aligned_cols=199  Identities=35%  Similarity=0.762  Sum_probs=141.1

Q ss_pred             CCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCCCCCC--CCCCCCCCcceeeeccCC-CCCceecCCCCC
Q psy620          148 TCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPC--AQGKLNEKTRCVRCDDIP-EHPYYRCGSCPE  224 (1290)
Q Consensus       148 eC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC--~~g~~~~~~~Cg~C~~~~-~~g~y~C~~C~~  224 (1290)
                      .|...+......|......|.| .|++||.+.  .|.....|...+.  ...        +.|.... ....+.| .|..
T Consensus        91 ~~~~~~~~~~~~~~~~~~~~~c-~c~~g~~~~--~~~~~~~C~~~~~~~~~~--------~~c~~~~~~~~~~~c-~C~~  158 (487)
T KOG1217|consen   91 PCRSPCLLLCGECVDCVGSYEC-TCPPGYQGT--PCEGECECVTGPGVCCID--------GSCSNGPGSVGPFRC-SCTE  158 (487)
T ss_pred             cccCCcccCCccccCCCCCcee-eCCCccccC--cCCcceeecCCCCCeeCc--------hhhcCCCCCCCceee-eeCC
Confidence            4444455556677778889999 899999997  5554334665542  222        4566542 1347899 9999


Q ss_pred             CCcCCCCccccC-CccCC-CCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCccc-----------C
Q psy620          225 GTTGNGTRCHDI-DECDL-AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVD-----------I  291 (1290)
Q Consensus       225 Gy~Gdg~~C~di-deC~~-~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~d-----------i  291 (1290)
                      ||.+  ..|... ++|.. ..+|.+++.|.+..++|.|. |++||.+..|+..       .....|..           .
T Consensus       159 g~~~--~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~-c~~~~~~~~~~~~-------~~~~~c~~~~~~~~~~g~~~  228 (487)
T KOG1217|consen  159 GYEG--EPCETDLDECIQYSSPCQNGGTCVNTGGSYLCS-CPPGYTGSTCETT-------GNGGTCVDSVACSCPPGARG  228 (487)
T ss_pred             Cccc--ccccccccccccCCCCcCCCcccccCCCCeeEe-CCCCccCCcCcCC-------CCCceEecceeccCCCCCCC
Confidence            9999  667643 78873 55799999999999999999 9999999997753       01111211           2


Q ss_pred             CCCCCCCCCCCCCC-CccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCC
Q psy620          292 DECADGRNGGCDSN-SMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDN  370 (1290)
Q Consensus       292 deC~~~~~g~C~~~-g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~  370 (1290)
                      ..|... ...|..+ ++|++..++|+|  .|++||.+... ..|..    +++|.....|.++++|++. .+.|.|.|++
T Consensus       229 ~~c~~~-~~~~~~~~~~c~~~~~~~~C--~~~~g~~~~~~-~~~~~----~~~C~~~~~c~~~~~C~~~-~~~~~C~C~~  299 (487)
T KOG1217|consen  229 PECEVS-IVECASGDGTCVNTVGSYTC--RCPEGYTGDAC-VTCVD----VDSCALIASCPNGGTCVNV-PGSYRCTCPP  299 (487)
T ss_pred             CCcccc-cccccCCCCcccccCCceee--eCCCCcccccc-ceeee----ccccCCCCccCCCCeeecC-CCcceeeCCC
Confidence            233332 2234433 899999999999  99999986542 34555    6889865349999999998 6679999999


Q ss_pred             CcccCCCCc
Q psy620          371 GWAGDGQFC  379 (1290)
Q Consensus       371 Gy~GdG~~C  379 (1290)
                      ||+|  ..|
T Consensus       300 g~~g--~~~  306 (487)
T KOG1217|consen  300 GFTG--RLC  306 (487)
T ss_pred             CCCC--CCC
Confidence            9999  555


No 10 
>KOG1219|consensus
Probab=99.48  E-value=9.2e-14  Score=176.88  Aligned_cols=114  Identities=37%  Similarity=0.913  Sum_probs=103.3

Q ss_pred             CCCC-CCCCCCCCCCCCCeeccCC-CCcccccCCCCCCCCCCCcee-cccCCCCCCCCCCCCCCcceeeeccCCCCCcee
Q psy620          142 HCKP-GVTCNMRPCFQGVQCFDTV-EGYTCGPCPSGYTGDGERCQR-IGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYR  218 (1290)
Q Consensus       142 ~Ced-ideC~~~pC~~gg~C~n~~-g~y~C~~C~~Gy~Gdg~~C~~-ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~  218 (1290)
                      -|.- .+.|..+||+++|+|...+ ++|.| .|++-|+|.  .|+. +..|.++||..|        ++|+...  .+|.
T Consensus      3859 gC~l~~d~C~~npCqhgG~C~~~~~ggy~C-kCpsqysG~--~CEi~~epC~snPC~~G--------gtCip~~--n~f~ 3925 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDNPCQHGGTCISQPKGGYKC-KCPSQYSGN--HCEIDLEPCASNPCLTG--------GTCIPFY--NGFL 3925 (4289)
T ss_pred             cccccccccccCcccCCCEecCCCCCceEE-eCcccccCc--ccccccccccCCCCCCC--------CEEEecC--CCee
Confidence            3552 2789999999999999876 68999 999999999  9998 899999999999        9999875  5899


Q ss_pred             cCCCCCCCcCCCCccc--cCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCC
Q psy620          219 CGSCPEGTTGNGTRCH--DIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGV  273 (1290)
Q Consensus       219 C~~C~~Gy~Gdg~~C~--dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce  273 (1290)
                      | .|+.||+|  .+|+  .++||. .++|.+++.|+|..|+|+|. |.+||.|..|.
T Consensus      3926 C-nC~~gyTG--~~Ce~~Gi~eCs-~n~C~~gg~C~n~~gsf~Cn-cT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3926 C-NCPNGYTG--KRCEARGISECS-KNVCGTGGQCINIPGSFHCN-CTPGILGRTCC 3977 (4289)
T ss_pred             E-eCCCCccC--ceeecccccccc-cccccCCceeeccCCceEec-cChhHhcccCc
Confidence            9 99999999  8997  389998 89999999999999999999 99999998863


No 11 
>KOG1217|consensus
Probab=99.44  E-value=1.4e-12  Score=158.28  Aligned_cols=264  Identities=31%  Similarity=0.755  Sum_probs=191.9

Q ss_pred             CCCCCCCCCCCCCeeecCCCCcccccCCCCcccCCCCCCCCCCCCCCC--CCCCCeeccCC---CCcccccCCCCCCCCC
Q psy620          106 PTCATDNPCFPGVECRDTREGPRCMRCPDGYVGDGIHCKPGVTCNMRP--CFQGVQCFDTV---EGYTCGPCPSGYTGDG  180 (1290)
Q Consensus       106 d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~CedideC~~~p--C~~gg~C~n~~---g~y~C~~C~~Gy~Gdg  180 (1290)
                      +.|. ..+...++.|......+.| .|++||.+  ..|+...+|...+  +...+.|....   ..|+| .|..||.+. 
T Consensus        90 ~~~~-~~~~~~~~~~~~~~~~~~c-~c~~g~~~--~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c-~C~~g~~~~-  163 (487)
T KOG1217|consen   90 PPCR-SPCLLLCGECVDCVGSYEC-TCPPGYQG--TPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRC-SCTEGYEGE-  163 (487)
T ss_pred             cccc-CCcccCCccccCCCCCcee-eCCCcccc--CcCCcceeecCCCCCeeCchhhcCCCCCCCceee-eeCCCcccc-
Confidence            4444 4444556677778889999 99999998  5666433677766  35666787754   58999 999999998 


Q ss_pred             CCceec-ccCCC--CCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCcccc--------------------CC
Q psy620          181 ERCQRI-GGCSR--NPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHD--------------------ID  237 (1290)
Q Consensus       181 ~~C~~i-deC~~--~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~d--------------------id  237 (1290)
                       .|... ++|..  .+|.++        +.|.+..  ++|.| .|++||.+  ..|+.                    ..
T Consensus       164 -~~~~~~~~C~~~~~~c~~~--------~~C~~~~--~~~~C-~c~~~~~~--~~~~~~~~~~~c~~~~~~~~~~g~~~~  229 (487)
T KOG1217|consen  164 -PCETDLDECIQYSSPCQNG--------GTCVNTG--GSYLC-SCPPGYTG--STCETTGNGGTCVDSVACSCPPGARGP  229 (487)
T ss_pred             -cccccccccccCCCCcCCC--------cccccCC--CCeeE-eCCCCccC--CcCcCCCCCceEecceeccCCCCCCCC
Confidence             77764 78884  569988        8898875  46999 99999997  33331                    12


Q ss_pred             ccCC-CCCCCCC-cccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCc
Q psy620          238 ECDL-AEPCDPR-VQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSF  315 (1290)
Q Consensus       238 eC~~-~~pC~~~-g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy  315 (1290)
                      .|.. ...|... ++|++..++|+|. |++||++..+             ..|.++++|... .. |.++++|++..+.|
T Consensus       230 ~c~~~~~~~~~~~~~c~~~~~~~~C~-~~~g~~~~~~-------------~~~~~~~~C~~~-~~-c~~~~~C~~~~~~~  293 (487)
T KOG1217|consen  230 ECEVSIVECASGDGTCVNTVGSYTCR-CPEGYTGDAC-------------VTCVDVDSCALI-AS-CPNGGTCVNVPGSY  293 (487)
T ss_pred             CcccccccccCCCCcccccCCceeee-CCCCcccccc-------------ceeeeccccCCC-Cc-cCCCCeeecCCCcc
Confidence            3321 1234433 8999999999999 9999998862             346889999987 34 99999999999999


Q ss_pred             EEcCcCcCCccccCCCCCCCCCCCCCCCCC---CCCCCCCCCeEee-ecCCceEEecCCCcccCCCCcCCcCCCCCCCCC
Q psy620          316 TCTSLCRNSYMVRNVSVGCQSQNFGADVCP---DGTRCDRNAKCTR-ILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDY  391 (1290)
Q Consensus       316 ~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~---~~~~C~~~g~C~~-~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~  391 (1290)
                      +|  .|++||.+..+ ..|..    ..+|.   ....|.++++|.. ...+.+.|.|..||.|  ..|+...        
T Consensus       294 ~C--~C~~g~~g~~~-~~~~~----~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g--~~C~~~~--------  356 (487)
T KOG1217|consen  294 RC--TCPPGFTGRLC-TECVD----VDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTG--RRCEDSN--------  356 (487)
T ss_pred             ee--eCCCCCCCCCC-ccccc----cccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCC--CccccCC--------
Confidence            99  99999987665 33444    46775   2467988889932 2245788999999888  8997432        


Q ss_pred             CCCCCCCCCCCC-ccccC-CCCCCCCccccCCCCCCCC
Q psy620          392 DLACPDRKCRKD-NCVHI-PNSGINNHADNCPRNANPD  427 (1290)
Q Consensus       392 ~~~C~~~~C~ng-~C~~~-~gs~~~~~~C~C~~Gy~G~  427 (1290)
                       ..|...+|.++ .|++. .+    .+.|.|+.+|.+.
T Consensus       357 -~~C~~~~~~~~~~c~~~~~~----~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  357 -DECASSPCCPGGTCVNETPG----SYRCACPAGFAGK  389 (487)
T ss_pred             -ccccCCccccCCEeccCCCC----CeEecCCCccccC
Confidence             14555556444 78873 33    5679999999654


No 12 
>KOG0994|consensus
Probab=99.11  E-value=5e-10  Score=138.00  Aligned_cols=221  Identities=29%  Similarity=0.745  Sum_probs=132.6

Q ss_pred             eeecCCCCcccccCCCCcccCCCCCCCCCCCCCCCCCCCC--------eecc--CCCCcccccCCCCCCCCCCCceeccc
Q psy620          119 ECRDTREGPRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGV--------QCFD--TVEGYTCGPCPSGYTGDGERCQRIGG  188 (1290)
Q Consensus       119 ~C~~~~g~y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg--------~C~n--~~g~y~C~~C~~Gy~Gdg~~C~~ide  188 (1290)
                      .|.+...++.|.+|..||.|+.+. -....|.+.||..+-        .|.-  ......| .|.+||+|.  +|+.   
T Consensus       877 ~CqD~T~G~~CdrCl~GyyGdP~l-g~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC-~C~~GY~G~--RCe~---  949 (1758)
T KOG0994|consen  877 DCQDSTTGHSCDRCLDGYYGDPRL-GSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVC-HCQEGYSGS--RCEI---  949 (1758)
T ss_pred             cccccccccchhhhhccccCCccc-CCCCCCCCCCCCCCCccchhccccccccccccceee-ecccCcccc--chhh---
Confidence            366777899999999999998432 223456666775442        3432  2235678 999999998  8863   


Q ss_pred             CCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCccccCCccCC-CCCCCCCcccccCCCCeecccCCCCC
Q psy620          189 CSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDL-AEPCDPRVQCTNLFPGYRCDPCPAGF  267 (1290)
Q Consensus       189 C~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~-~~pC~~~g~C~n~~gsy~C~~C~~Gy  267 (1290)
                      |..+-=.+.  .+.   ++|.        .| .|....--     .+...|.. ...|.   +|.....+-+|..|..||
T Consensus       950 CA~~~fGnP--~~G---GtCq--------~C-eC~~NiD~-----~d~~aCD~~TG~CL---kCL~hTeG~hCe~Ck~Gf 1007 (1758)
T KOG0994|consen  950 CADNHFGNP--SEG---GTCQ--------KC-ECSNNIDL-----YDPGACDVATGACL---KCLYHTEGDHCEHCKDGF 1007 (1758)
T ss_pred             hcccccCCc--ccC---Cccc--------cc-cccCCcCc-----cCCCccchhhchhh---hhhhcccccchhhccccc
Confidence            544311111  000   3442        33 33322110     01122321 12332   466666777999999999


Q ss_pred             ccCCCCccccccccccCCCCcc-------cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCC
Q psy620          268 TGSTGVQGVGLEHAVRFRQTCV-------DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFG  340 (1290)
Q Consensus       268 ~G~~Ce~~~~~~~~~~~~~~C~-------dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~  340 (1290)
                      +|..         ...+.+.|+       ..-.|... ++.|.    |.+...+.+|. +|.+.++-...|..|+.    
T Consensus      1008 ~GdA---------~~q~CqrC~Cn~LGTn~~~~CDr~-tGQCp----ClpNv~G~~CD-qCA~N~w~laSG~GCe~---- 1068 (1758)
T KOG0994|consen 1008 YGDA---------LRQNCQRCVCNFLGTNSTCHCDRF-TGQCP----CLPNVQGVRCD-QCAENHWNLASGEGCEP---- 1068 (1758)
T ss_pred             hhHH---------HHhhhhhheccccccCCccccccc-cCcCC----CCccccccccc-ccccchhccccCCCCCc----
Confidence            9976         122223331       11345555 66676    88888889998 99998877777888886    


Q ss_pred             CCCCCCCCCCCC--CCeEeeecCCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCCCCCCC
Q psy620          341 ADVCPDGTRCDR--NAKCTRILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLACPDRKC  400 (1290)
Q Consensus       341 id~C~~~~~C~~--~g~C~~~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C~~~~C  400 (1290)
                         |.    |+.  +-+|... .  .+|.|++||-|  +.|.+.-+ =.|.+....|..-.|
T Consensus      1069 ---C~----Cd~~~~pqCN~f-t--GQCqCkpGfGG--R~C~qCqe-l~WGdP~~~C~aCdC 1117 (1758)
T KOG0994|consen 1069 ---CN----CDPIGGPQCNEF-T--GQCQCKPGFGG--RTCSQCQE-LYWGDPNEKCRACDC 1117 (1758)
T ss_pred             ---cC----CCccCCcccccc-c--cceeccCCCCC--cchhHHHH-hhcCCCCCCceecCC
Confidence               43    332  2367655 2  39999999999  88865433 245555566655444


No 13 
>KOG4260|consensus
Probab=98.72  E-value=1.6e-08  Score=109.47  Aligned_cols=152  Identities=32%  Similarity=0.836  Sum_probs=110.0

Q ss_pred             ccccCCCCcccCCCCCCCCCCC---CCCCCCCCCeecc---CCCCcccccCCCCCCCCCCCceecccCCC----------
Q psy620          128 RCMRCPDGYVGDGIHCKPGVTC---NMRPCFQGVQCFD---TVEGYTCGPCPSGYTGDGERCQRIGGCSR----------  191 (1290)
Q Consensus       128 ~C~~C~~Gy~Gdg~~CedideC---~~~pC~~gg~C~n---~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~----------  191 (1290)
                      .|  ||+|-+|  +.|.   .|   +..||..++.|.-   ..|+-.| .|.+||+|.  .|..   |..          
T Consensus       130 vC--Cp~gtyG--pdCl---~Cpggser~C~GnG~C~GdGsR~GsGkC-kC~~GY~Gp--~C~~---Cg~eyfes~Rne~  196 (350)
T KOG4260|consen  130 VC--CPDGTYG--PDCL---QCPGGSERPCFGNGSCHGDGSREGSGKC-KCETGYTGP--LCRY---CGIEYFESSRNEQ  196 (350)
T ss_pred             ec--cCCCCcC--Cccc---cCCCCCcCCcCCCCcccCCCCCCCCCcc-cccCCCCCc--cccc---cchHHHHhhcccc
Confidence            56  9999988  6776   34   2368999999973   3467899 999999998  6643   211          


Q ss_pred             ----CCCCCCCCCCCccee-eeccCCCCCceecCCCCCCCcCCCCccccCCccCC-CCCCCCCcccccCCCCeecccCCC
Q psy620          192 ----NPCAQGKLNEKTRCV-RCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDL-AEPCDPRVQCTNLFPGYRCDPCPA  265 (1290)
Q Consensus       192 ----~pC~~g~~~~~~~Cg-~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~-~~pC~~~g~C~n~~gsy~C~~C~~  265 (1290)
                          ..|+.+       |. .|...   ++-.|..|..||..+...|.||+||.. +.+|.....|+|+.|+|.|. +++
T Consensus       197 ~lvCt~Ch~~-------C~~~Csg~---~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~-dk~  265 (350)
T KOG4260|consen  197 HLVCTACHEG-------CLGVCSGE---SSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCE-DKE  265 (350)
T ss_pred             cchhhhhhhh-------hhcccCCC---CCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEec-ccc
Confidence                123322       22 45543   244687899999988889999999984 67899999999999999999 999


Q ss_pred             CCccCCCCccccccccccCCCCcccCCCCCCCCCCCC-CCCCccccCCCCcEEcCcCcCCcc
Q psy620          266 GFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGC-DSNSMCTNTEGSFTCTSLCRNSYM  326 (1290)
Q Consensus       266 Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C-~~~g~C~n~~gsy~C~~~C~~Gy~  326 (1290)
                      ||.+..                    ++|..- ...| ..+..|.|+.++|+|  .|..|+.
T Consensus       266 Gy~~g~--------------------d~C~~~-~d~~~~kn~~c~ni~~~~r~--v~f~~~~  304 (350)
T KOG4260|consen  266 GYKKGV--------------------DECQFC-ADVCASKNRPCMNIDGQYRC--VCFSGLI  304 (350)
T ss_pred             cccCCh--------------------HHhhhh-hhhcccCCCCcccCCccEEE--Eecccce
Confidence            997632                    333320 0012 135678899999999  9999985


No 14 
>PF02412 TSP_3:  Thrombospondin type 3 repeat;  InterPro: IPR003367 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimer. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers []. EGF, TSP3 repeats and the C-terminal domain are thus the hallmark of a thrombospondin. This entry represents the type 3 thrombospondin repeat, and related repeats present in other types of protein.; GO: 0005509 calcium ion binding, 0007155 cell adhesion; PDB: 1UX6_A 3FBY_C 1YO8_A 2RHP_A.
Probab=98.58  E-value=2e-08  Score=78.26  Aligned_cols=36  Identities=61%  Similarity=1.142  Sum_probs=30.5

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCC
Q psy620          829 TDTDNDGTGDACDNDMDNDGINNHADNCPRNANPDQ  864 (1290)
Q Consensus       829 ~D~D~DG~~D~~d~D~D~DGi~d~~d~c~~~~n~~~  864 (1290)
                      +|+|+|||||+|+.|.|+|||+|..|+||+++|+.|
T Consensus         1 ~D~D~dg~GD~C~~D~D~Dgi~d~~DnCP~~~n~~Q   36 (36)
T PF02412_consen    1 EDSDGDGIGDACDDDSDGDGIPDACDNCPNVPNPDQ   36 (36)
T ss_dssp             --TTSSSS-GGGSSSTTSSSS-GGGHSSTTSTTTTS
T ss_pred             CcccCCCCCcccccCCCCCcccCcccCCCCCCCCCC
Confidence            589999999999999999999999999999999876


No 15 
>KOG1836|consensus
Probab=98.58  E-value=2.1e-06  Score=115.83  Aligned_cols=227  Identities=24%  Similarity=0.533  Sum_probs=134.9

Q ss_pred             eccCCCCcccccCCCCCCCCCCCceecccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCcccc----
Q psy620          160 CFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHD----  235 (1290)
Q Consensus       160 C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~d----  235 (1290)
                      |.....+-+|..|..||+|.... .....|++.+|.++        +.|..+....+..|..|++||+|  .+|+.    
T Consensus       749 C~~~t~G~~C~~C~~GfYg~~~~-~~~~dC~~C~Cp~~--------~~~~~~~~~~~~iCk~Cp~gytG--~rCe~c~dg  817 (1705)
T KOG1836|consen  749 CKHNTFGGQCAQCVDGFYGLPDL-GTSGDCQPCPCPNG--------GACGQTPEILEVVCKNCPPGYTG--LRCEECADG  817 (1705)
T ss_pred             cccCCCCCchhhhcCCCCCcccc-CCCCCCccCCCCCC--------hhhcCcCcccceecCCCCCCCcc--cccccCCCc
Confidence            55556677899999999987321 11233899999998        77877765667889339999998  67751    


Q ss_pred             -----------CCccCCCCCCCC-------------Cc---ccccCCCCeecccCCCCCccCCCCccccccccccCCCCc
Q psy620          236 -----------IDECDLAEPCDP-------------RV---QCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTC  288 (1290)
Q Consensus       236 -----------ideC~~~~pC~~-------------~g---~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C  288 (1290)
                                 .-.|. +.+|..             .+   +|+....+.+|..|.+||.|..=. .    ........|
T Consensus       818 yfg~p~~~~~~~~~c~-~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~-~----~p~~~c~~c  891 (1705)
T KOG1836|consen  818 YFGNPLGHDGDVRPCQ-SCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLA-P----NPEDKCFAC  891 (1705)
T ss_pred             cccCCCCCCCCcccCc-cceeccccCccccccccccccceeeccCCcccccccccccCccccccC-C----CcCCccccc
Confidence                       11333 222321             11   355555667788899999887611 0    000011111


Q ss_pred             --c------cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeec
Q psy620          289 --V------DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRIL  360 (1290)
Q Consensus       289 --~------dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~  360 (1290)
                        .      ..-.|... ++.|.    |.....+-.|. .|.+||.+.+.+..|+.     ..|...  =+.+..|..  
T Consensus       892 ~c~p~gs~~~~~~c~~~-tGQce----c~~~v~g~~c~-~c~~g~fnl~s~~gC~~-----c~c~~~--gs~~~~c~~--  956 (1705)
T KOG1836|consen  892 GCVPAGSELPSLTCNPV-TGQCE----CKPNVEGRDCL-YCFKGFFNLNSGVGCEP-----CNCDPT--GSESSDCDV--  956 (1705)
T ss_pred             cCccCCcccccccCCCc-cccee----ccCCCCccccc-cccccccccCCCCCccc-----cccccc--ccccccccc--
Confidence              1      12235544 45554    67777778887 89999988877777876     223211  111235653  


Q ss_pred             CCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCCCCCCC-CCC----ccccCCCCCCCCccccCCCCCCCCC
Q psy620          361 GNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLACPDRKC-RKD----NCVHIPNSGINNHADNCPRNANPDQ  428 (1290)
Q Consensus       361 ~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C~~~~C-~ng----~C~~~~gs~~~~~~C~C~~Gy~G~~  428 (1290)
                       ++.+|.|++|.+|  .+|..... ..+......|..-.| .+|    .|...      ..+|.|.+++.|..
T Consensus       957 -~tGqc~c~~gVtg--qrc~qc~~-~~~~~~~~gc~~c~c~~~Gs~~~qc~~~------~G~c~c~~~~~g~~ 1019 (1705)
T KOG1836|consen  957 -GTGQCYCRPGVTG--QRCDQCET-YHFGFQTEGCGLCECDPLGSRGFQCDPE------DGQCPCRPGFEGRR 1019 (1705)
T ss_pred             -cCCceeeecCccc--cccCcccc-CcccccccCCcceecccCCcccceeccc------CCeeeecCCCCCcc
Confidence             3459999999999  88864322 222233345544444 223    24432      34589999986653


No 16 
>KOG4260|consensus
Probab=98.56  E-value=9.7e-08  Score=103.49  Aligned_cols=156  Identities=31%  Similarity=0.715  Sum_probs=105.1

Q ss_pred             CCCCCCCCCCCceecccCCCCCCCCCCCCCCcceeeeccCC-CCCceecCCCCCCCcCCCCccccC---------Cc---
Q psy620          172 CPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCVRCDDIP-EHPYYRCGSCPEGTTGNGTRCHDI---------DE---  238 (1290)
Q Consensus       172 C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg~C~~~~-~~g~y~C~~C~~Gy~Gdg~~C~di---------de---  238 (1290)
                      ||.|-+|.  .|..-..=...||...        +.|..-. ..++-+| .|.+||+|  ..|..-         ++   
T Consensus       132 Cp~gtyGp--dCl~Cpggser~C~Gn--------G~C~GdGsR~GsGkC-kC~~GY~G--p~C~~Cg~eyfes~Rne~~l  198 (350)
T KOG4260|consen  132 CPDGTYGP--DCLQCPGGSERPCFGN--------GSCHGDGSREGSGKC-KCETGYTG--PLCRYCGIEYFESSRNEQHL  198 (350)
T ss_pred             cCCCCcCC--ccccCCCCCcCCcCCC--------CcccCCCCCCCCCcc-cccCCCCC--ccccccchHHHHhhcccccc
Confidence            88898887  6653111123567666        7776422 2356789 99999999  666411         11   


Q ss_pred             -cCC-CCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCcE
Q psy620          239 -CDL-AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFT  316 (1290)
Q Consensus       239 -C~~-~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~  316 (1290)
                       |.. ..+|.  +.|... ++..|..|..||....              ..|.|||||... ..+|.....|+|+.|+|.
T Consensus       199 vCt~Ch~~C~--~~Csg~-~~k~C~kCkkGW~lde--------------~gCvDvnEC~~e-p~~c~~~qfCvNteGSf~  260 (350)
T KOG4260|consen  199 VCTACHEGCL--GVCSGE-SSKGCSKCKKGWKLDE--------------EGCVDVNECQNE-PAPCKAHQFCVNTEGSFK  260 (350)
T ss_pred             hhhhhhhhhh--cccCCC-CCCChhhhcccceecc--------------cccccHHHHhcC-CCCCChhheeecCCCceE
Confidence             210 12332  245432 3346877999998775              669999999988 577999999999999999


Q ss_pred             EcCcCcCCccccCCCCCCCCCCCCCCCCCCC-CCC-CCCCeEeeecCCceEEecCCCcc
Q psy620          317 CTSLCRNSYMVRNVSVGCQSQNFGADVCPDG-TRC-DRNAKCTRILGNHYACKCDNGWA  373 (1290)
Q Consensus       317 C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~-~~C-~~~g~C~~~~~gsy~C~C~~Gy~  373 (1290)
                      |  .+++||...            +++|..- ..| ..+..|.++ .++|+|+|..|+.
T Consensus       261 C--~dk~Gy~~g------------~d~C~~~~d~~~~kn~~c~ni-~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  261 C--EDKEGYKKG------------VDECQFCADVCASKNRPCMNI-DGQYRCVCFSGLI  304 (350)
T ss_pred             e--cccccccCC------------hHHhhhhhhhcccCCCCcccC-CccEEEEecccce
Confidence            9  899999752            2334310 122 245678888 8899999999875


No 17 
>PF02412 TSP_3:  Thrombospondin type 3 repeat;  InterPro: IPR003367 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimer. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers []. EGF, TSP3 repeats and the C-terminal domain are thus the hallmark of a thrombospondin. This entry represents the type 3 thrombospondin repeat, and related repeats present in other types of protein.; GO: 0005509 calcium ion binding, 0007155 cell adhesion; PDB: 1UX6_A 3FBY_C 1YO8_A 2RHP_A.
Probab=98.55  E-value=2.5e-08  Score=77.72  Aligned_cols=35  Identities=57%  Similarity=1.011  Sum_probs=21.1

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCccccCcCCCCC
Q psy620          927 DNDRDGKGDECDPDLDGDGISNDEDNCRLIYNPNQ  961 (1290)
Q Consensus       927 D~D~Dg~~D~~d~D~D~DGi~d~~d~cp~~~n~~~  961 (1290)
                      |+|+|||||+|+.|.|+|||+|..||||.++|+.|
T Consensus         2 D~D~dg~GD~C~~D~D~Dgi~d~~DnCP~~~n~~Q   36 (36)
T PF02412_consen    2 DSDGDGIGDACDDDSDGDGIPDACDNCPNVPNPDQ   36 (36)
T ss_dssp             -TTSSSS-GGGSSSTTSSSS-GGGHSSTTSTTTTS
T ss_pred             cccCCCCCcccccCCCCCcccCcccCCCCCCCCCC
Confidence            56666666666666666666666666666666654


No 18 
>KOG1225|consensus
Probab=98.45  E-value=8.1e-07  Score=107.63  Aligned_cols=132  Identities=33%  Similarity=0.869  Sum_probs=96.8

Q ss_pred             cccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCCCCCCCCCCCCCCccee
Q psy620          127 PRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCV  206 (1290)
Q Consensus       127 y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg  206 (1290)
                      ..| .|+.+|+|  ..|+. -.|. ..|..++.|++.    +| .|++||+|.  .|.. -.|... |..+        +
T Consensus       234 ~ic-~c~~~~~g--~~c~~-~~C~-~~c~~~g~c~~G----~C-IC~~Gf~G~--dC~e-~~Cp~~-cs~~--------g  291 (525)
T KOG1225|consen  234 GIC-ECPEGYFG--PLCST-IYCP-GGCTGRGQCVEG----RC-ICPPGFTGD--DCDE-LVCPVD-CSGG--------G  291 (525)
T ss_pred             cee-ecCCceeC--Ccccc-ccCC-CCCcccceEeCC----eE-eCCCCCcCC--CCCc-ccCCcc-cCCC--------c
Confidence            479 99999998  67772 2343 346666788865    69 999999998  8875 235444 7666        6


Q ss_pred             eeccCCCCCceecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCC
Q psy620          207 RCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQ  286 (1290)
Q Consensus       207 ~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~  286 (1290)
                      .|++     + +| .|++||+|  +.|+ +-+|  ...|..+++|+  .+  +|. |.+||+|..|+.            
T Consensus       292 ~~~~-----g-~C-iC~~g~~G--~dCs-~~~c--padC~g~G~Ci--~G--~C~-C~~Gy~G~~C~~------------  342 (525)
T KOG1225|consen  292 VCVD-----G-EC-ICNPGYSG--KDCS-IRRC--PADCSGHGKCI--DG--ECL-CDEGYTGELCIQ------------  342 (525)
T ss_pred             eecC-----C-Ee-ecCCCccc--cccc-cccC--CccCCCCCccc--CC--ceE-eCCCCcCCcccc------------
Confidence            6654     2 89 99999999  8886 3346  47899999999  33  698 999999998541            


Q ss_pred             CcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccC
Q psy620          287 TCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRN  329 (1290)
Q Consensus       287 ~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~  329 (1290)
                          ..         |.+++.|++.     |  .|..||.|.+
T Consensus       343 ----~~---------C~~~g~cv~g-----C--~C~~Gw~G~d  365 (525)
T KOG1225|consen  343 ----RA---------CSGGGQCVNG-----C--KCKKGWRGPD  365 (525)
T ss_pred             ----cc---------cCCCceeccC-----c--eeccCccCCC
Confidence                11         5556677653     7  8999998755


No 19 
>KOG1225|consensus
Probab=98.38  E-value=1.7e-06  Score=104.85  Aligned_cols=131  Identities=32%  Similarity=0.807  Sum_probs=94.3

Q ss_pred             eecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCC
Q psy620          217 YRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECAD  296 (1290)
Q Consensus       217 y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~  296 (1290)
                      ..| .|+.+|+|  ..|+ .-.|  ...|..++.|+..    +|. |++||+|..|..                 -.|..
T Consensus       234 ~ic-~c~~~~~g--~~c~-~~~C--~~~c~~~g~c~~G----~CI-C~~Gf~G~dC~e-----------------~~Cp~  285 (525)
T KOG1225|consen  234 GIC-ECPEGYFG--PLCS-TIYC--PGGCTGRGQCVEG----RCI-CPPGFTGDDCDE-----------------LVCPV  285 (525)
T ss_pred             cee-ecCCceeC--Cccc-cccC--CCCCcccceEeCC----eEe-CCCCCcCCCCCc-----------------ccCCc
Confidence            378 89999999  7776 3345  3566666777765    698 999999999652                 12332


Q ss_pred             CCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCCCcccCC
Q psy620          297 GRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDG  376 (1290)
Q Consensus       297 ~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG  376 (1290)
                      .    |..++.|++.    .|  .|++||+|    +.|+.     ..|.  ..|+.+|.|+..     +|.|.+||+|  
T Consensus       286 ~----cs~~g~~~~g----~C--iC~~g~~G----~dCs~-----~~cp--adC~g~G~Ci~G-----~C~C~~Gy~G--  337 (525)
T KOG1225|consen  286 D----CSGGGVCVDG----EC--ICNPGYSG----KDCSI-----RRCP--ADCSGHGKCIDG-----ECLCDEGYTG--  337 (525)
T ss_pred             c----cCCCceecCC----Ee--ecCCCccc----ccccc-----ccCC--ccCCCCCcccCC-----ceEeCCCCcC--
Confidence            2    6556666654    69  99999984    55664     4465  689999999843     8999999999  


Q ss_pred             CCcCCcCCCCCCCCCCCCCCCCCCCCC-ccccCCCCCCCCccccCCCCCCCCC
Q psy620          377 QFCGRDTDLDGWPDYDLACPDRKCRKD-NCVHIPNSGINNHADNCPRNANPDQ  428 (1290)
Q Consensus       377 ~~Ce~~~d~d~~~~~~~~C~~~~C~ng-~C~~~~gs~~~~~~C~C~~Gy~G~~  428 (1290)
                      ..|+..                .|.++ .|++        . |.|..||.|..
T Consensus       338 ~~C~~~----------------~C~~~g~cv~--------g-C~C~~Gw~G~d  365 (525)
T KOG1225|consen  338 ELCIQR----------------ACSGGGQCVN--------G-CKCKKGWRGPD  365 (525)
T ss_pred             Cccccc----------------ccCCCceecc--------C-ceeccCccCCC
Confidence            788521                25555 5664        2 99999997765


No 20 
>KOG1836|consensus
Probab=97.75  E-value=0.00039  Score=94.71  Aligned_cols=108  Identities=22%  Similarity=0.492  Sum_probs=70.1

Q ss_pred             cccCCCCCccCCCCccccc----cccccCCCCcc------cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccC
Q psy620          260 CDPCPAGFTGSTGVQGVGL----EHAVRFRQTCV------DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRN  329 (1290)
Q Consensus       260 C~~C~~Gy~G~~Ce~~~~~----~~~~~~~~~C~------dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~  329 (1290)
                      |. |+.||+|..||.....    .........|.      ..+.|... ++.|.    |+....+-+|+ +|..||++..
T Consensus       697 c~-C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~-tG~C~----C~~~t~G~~C~-~C~~GfYg~~  769 (1705)
T KOG1836|consen  697 CT-CPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPR-TGQCK----CKHNTFGGQCA-QCVDGFYGLP  769 (1705)
T ss_pred             cc-CCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCC-CCcee----cccCCCCCchh-hhcCCCCCcc
Confidence            88 9999999999875532    11111112221      13567666 56664    77777777898 9999998764


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCeEeeec-CCceEEe-cCCCcccCCCCcCCcCC
Q psy620          330 VSVGCQSQNFGADVCPDGTRCDRNAKCTRIL-GNHYACK-CDNGWAGDGQFCGRDTD  384 (1290)
Q Consensus       330 ~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~-~gsy~C~-C~~Gy~GdG~~Ce~~~d  384 (1290)
                      ....=.      + |. .-+|.+++.|..+. .....|. |++||+|  .+|+...+
T Consensus       770 ~~~~~~------d-C~-~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG--~rCe~c~d  816 (1705)
T KOG1836|consen  770 DLGTSG------D-CQ-PCPCPNGGACGQTPEILEVVCKNCPPGYTG--LRCEECAD  816 (1705)
T ss_pred             ccCCCC------C-Cc-cCCCCCChhhcCcCcccceecCCCCCCCcc--cccccCCC
Confidence            322110      1 33 24477777776654 5678999 9999999  99986544


No 21 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.60  E-value=3e-05  Score=63.06  Aligned_cols=39  Identities=46%  Similarity=0.997  Sum_probs=33.7

Q ss_pred             cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCC
Q psy620          290 DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVS  331 (1290)
Q Consensus       290 dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g  331 (1290)
                      |||||+.. .+.|..+++|+|+.|+|+|  .|++||+....+
T Consensus         1 DidEC~~~-~~~C~~~~~C~N~~Gsy~C--~C~~Gy~~~~~~   39 (42)
T PF07645_consen    1 DIDECAEG-PHNCPENGTCVNTEGSYSC--SCPPGYELNDDG   39 (42)
T ss_dssp             ESSTTTTT-SSSSSTTSEEEEETTEEEE--EESTTEEECTTS
T ss_pred             CccccCCC-CCcCCCCCEEEcCCCCEEe--eCCCCcEECCCC
Confidence            68999998 5789999999999999999  999999744433


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.45  E-value=9.3e-05  Score=58.08  Aligned_cols=31  Identities=48%  Similarity=1.195  Sum_probs=25.7

Q ss_pred             CCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620          348 TRCDRNAKCTRILGNHYACKCDNGWAGDGQFC  379 (1290)
Q Consensus       348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C  379 (1290)
                      +.|+.+|+|+++ .++|+|+|++||.|||..|
T Consensus         6 ~~C~~nA~C~~~-~~~~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    6 GGCHPNATCTNT-GGSYTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             GGS-TTCEEEE--TTSEEEEE-CEEECCSTCE
T ss_pred             CCCCCCcEeecC-CCCEEeECCCCCccCCcCC
Confidence            579999999999 6799999999999999876


No 23 
>KOG1226|consensus
Probab=97.36  E-value=0.001  Score=82.48  Aligned_cols=99  Identities=20%  Similarity=0.559  Sum_probs=64.2

Q ss_pred             ecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCC-CCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCC
Q psy620          259 RCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGR-NGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQ  337 (1290)
Q Consensus       259 ~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~-~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~  337 (1290)
                      .|. |.+||.|..||-........      ...+.|.... ...|...|.|+=.    +|  +|.+...+.--|+.|+-.
T Consensus       479 ~C~-C~~G~~G~~CEC~~~~~ss~------~~~~~Cr~~~~~~vCSgrG~C~CG----qC--~C~~~~~~~i~G~fCECD  545 (783)
T KOG1226|consen  479 QCR-CDEGWLGKKCECSTDELSSS------EEEDKCRENSDSPVCSGRGDCVCG----QC--VCHKPDNGKIYGKFCECD  545 (783)
T ss_pred             cee-cCCCCCCCcccCCccccCcH------hHHhhccCCCCCCCcCCCCcEeCC----ce--EecCCCCCceeeeeeecc
Confidence            477 99999999998433211110      1124454331 1258877888643    47  888766554456777753


Q ss_pred             CCCCCCCCCC--CCCCCCCeEeeecCCceEEecCCCcccCCCCcC
Q psy620          338 NFGADVCPDG--TRCDRNAKCTRILGNHYACKCDNGWAGDGQFCG  380 (1290)
Q Consensus       338 ~~~id~C~~~--~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~Ce  380 (1290)
                         .-.|...  ..|..+|.|.-.     +|+|.+||+|  ..|+
T Consensus       546 ---nfsC~r~~g~lC~g~G~C~CG-----~CvC~~GwtG--~~C~  580 (783)
T KOG1226|consen  546 ---NFSCERHKGVLCGGHGRCECG-----RCVCNPGWTG--SACN  580 (783)
T ss_pred             ---CcccccccCcccCCCCeEeCC-----cEEcCCCCcc--CCCC
Confidence               2235432  679999999764     8999999999  7775


No 24 
>KOG1226|consensus
Probab=97.07  E-value=0.0024  Score=79.46  Aligned_cols=137  Identities=28%  Similarity=0.724  Sum_probs=82.6

Q ss_pred             ccccCCCCCCCCCCCceec----------ccCCC----CCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcC--CCC
Q psy620          168 TCGPCPSGYTGDGERCQRI----------GGCSR----NPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTG--NGT  231 (1290)
Q Consensus       168 ~C~~C~~Gy~Gdg~~C~~i----------deC~~----~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~G--dg~  231 (1290)
                      .| .|.+||.|.  .|+-.          +.|..    .+|...        |.|.-.      +| +|.+...+  .|.
T Consensus       479 ~C-~C~~G~~G~--~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgr--------G~C~CG------qC-~C~~~~~~~i~G~  540 (783)
T KOG1226|consen  479 QC-RCDEGWLGK--KCECSTDELSSSEEEDKCRENSDSPVCSGR--------GDCVCG------QC-VCHKPDNGKIYGK  540 (783)
T ss_pred             ce-ecCCCCCCC--cccCCccccCcHhHHhhccCCCCCCCcCCC--------CcEeCC------ce-EecCCCCCceeee
Confidence            47 899999998  66531          23332    145444        555432      56 67766552  137


Q ss_pred             ccc-cCCccCC--CCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCc-ccCCCCCCCCCCCCCCCCc
Q psy620          232 RCH-DIDECDL--AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTC-VDIDECADGRNGGCDSNSM  307 (1290)
Q Consensus       232 ~C~-dideC~~--~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C-~dideC~~~~~g~C~~~g~  307 (1290)
                      .|+ +--.|..  ...|..+++|.=.    +|. |.+||+|..|+              | .+.+.|....-..|...|+
T Consensus       541 fCECDnfsC~r~~g~lC~g~G~C~CG----~Cv-C~~GwtG~~C~--------------C~~std~C~~~~G~iCSGrG~  601 (783)
T KOG1226|consen  541 FCECDNFSCERHKGVLCGGHGRCECG----RCV-CNPGWTGSACN--------------CPLSTDTCESSDGQICSGRGT  601 (783)
T ss_pred             eeeccCcccccccCcccCCCCeEeCC----cEE-cCCCCccCCCC--------------CCCCCccccCCCCceeCCCce
Confidence            886 2223432  2358888887543    698 99999999976              3 4566676652234766677


Q ss_pred             cccCCCCcEEcCcCcCC-ccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEe
Q psy620          308 CTNTEGSFTCTSLCRNS-YMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCT  357 (1290)
Q Consensus       308 C~n~~gsy~C~~~C~~G-y~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~  357 (1290)
                      |.-.    +|  +|... |.    +..|+.    -..|.  ++|..+..|+
T Consensus       602 C~Cg----~C--~C~~~~~s----G~~CE~----cptc~--~~C~~~~~Cv  636 (783)
T KOG1226|consen  602 CECG----RC--KCTDPPYS----GEFCEK----CPTCP--DPCAENKSCV  636 (783)
T ss_pred             eeCC----ce--EcCCCCcC----cchhhc----CCCCC--Ccccccccch
Confidence            7543    46  77665 75    566765    23344  4566665554


No 25 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.00  E-value=0.00018  Score=54.95  Aligned_cols=28  Identities=46%  Similarity=1.058  Sum_probs=26.1

Q ss_pred             CCCCCCCceeccCC-CCCccccCCCCccCC
Q psy620          593 DNPCFPGVECRDTR-EGPRCMRCPDGYVGD  621 (1290)
Q Consensus       593 ~nPC~~g~~C~~~~-~g~~Cg~Cp~G~~Gd  621 (1290)
                      ++||.|+++|++.. .+|+| .|++||+|.
T Consensus         3 ~~~C~n~g~C~~~~~~~y~C-~C~~G~~G~   31 (32)
T PF00008_consen    3 SNPCQNGGTCIDLPGGGYTC-ECPPGYTGK   31 (32)
T ss_dssp             TTSSTTTEEEEEESTSEEEE-EEBTTEEST
T ss_pred             CCcCCCCeEEEeCCCCCEEe-ECCCCCccC
Confidence            68999999999999 89999 999999983


No 26 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.97  E-value=0.00096  Score=54.28  Aligned_cols=32  Identities=31%  Similarity=0.941  Sum_probs=29.4

Q ss_pred             CCCCCCC-CCCCCCCeEeeecCCceEEecCCCcc
Q psy620          341 ADVCPDG-TRCDRNAKCTRILGNHYACKCDNGWA  373 (1290)
Q Consensus       341 id~C~~~-~~C~~~g~C~~~~~gsy~C~C~~Gy~  373 (1290)
                      |+||... +.|..++.|+++ .|+|+|.|++||.
T Consensus         2 idEC~~~~~~C~~~~~C~N~-~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTCVNT-EGSYSCSCPPGYE   34 (42)
T ss_dssp             SSTTTTTSSSSSTTSEEEEE-TTEEEEEESTTEE
T ss_pred             ccccCCCCCcCCCCCEEEcC-CCCEEeeCCCCcE
Confidence            7899876 789989999999 9999999999998


No 27 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.73  E-value=0.0014  Score=51.61  Aligned_cols=35  Identities=46%  Similarity=1.099  Sum_probs=30.2

Q ss_pred             CCCCCC-CCCCCCCeeccCCCCcccccCCCCCC-CCCCCc
Q psy620          146 GVTCNM-RPCFQGVQCFDTVEGYTCGPCPSGYT-GDGERC  183 (1290)
Q Consensus       146 ideC~~-~pC~~gg~C~n~~g~y~C~~C~~Gy~-Gdg~~C  183 (1290)
                      +++|.. .+|.++++|+++.++|.| .|++||. |.  .|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C-~C~~g~~~g~--~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRC-ECPPGYTDGR--NC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEe-ECCCCCccCC--cC
Confidence            678887 789999999999999999 9999998 54  65


No 28 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.60  E-value=0.00061  Score=52.13  Aligned_cols=29  Identities=48%  Similarity=1.161  Sum_probs=24.2

Q ss_pred             CCCCCCCCCCCeeecCC-CCcccccCCCCccc
Q psy620          108 CATDNPCFPGVECRDTR-EGPRCMRCPDGYVG  138 (1290)
Q Consensus       108 C~~~~pC~~gg~C~~~~-g~y~C~~C~~Gy~G  138 (1290)
                      |. ++||+++|+|++.. .+|+| .|++||+|
T Consensus         1 C~-~~~C~n~g~C~~~~~~~y~C-~C~~G~~G   30 (32)
T PF00008_consen    1 CS-SNPCQNGGTCIDLPGGGYTC-ECPPGYTG   30 (32)
T ss_dssp             TT-TTSSTTTEEEEEESTSEEEE-EEBTTEES
T ss_pred             CC-CCcCCCCeEEEeCCCCCEEe-ECCCCCcc
Confidence            44 67888888998887 88889 89999888


No 29 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.58  E-value=0.0022  Score=50.45  Aligned_cols=37  Identities=43%  Similarity=1.003  Sum_probs=31.4

Q ss_pred             CCCCCCCCCCCCCCeeecCCCCcccccCCCCcc-cCCCCCC
Q psy620          105 KPTCATDNPCFPGVECRDTREGPRCMRCPDGYV-GDGIHCK  144 (1290)
Q Consensus       105 ~d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~-Gdg~~Ce  144 (1290)
                      +++|....+|.++++|++..++|.| .|++||+ |  ..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C-~C~~g~~~g--~~C~   39 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRC-ECPPGYTDG--RNCE   39 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEe-ECCCCCccC--CcCC
Confidence            5788822799999999999999999 9999999 6  6664


No 30 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.53  E-value=0.00082  Score=52.83  Aligned_cols=32  Identities=38%  Similarity=0.999  Sum_probs=22.4

Q ss_pred             CCCCCCCCeeecCCCCcccccCCCCcccCCCCC
Q psy620          111 DNPCFPGVECRDTREGPRCMRCPDGYVGDGIHC  143 (1290)
Q Consensus       111 ~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~C  143 (1290)
                      ...|+.+++|+++.++|.| .|++||+|+|..|
T Consensus         5 ~~~C~~nA~C~~~~~~~~C-~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTC-TCKPGYEGDGFFC   36 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEE-EE-CEEECCSTCE
T ss_pred             CCCCCCCcEeecCCCCEEe-ECCCCCccCCcCC
Confidence            4567888888888888888 8888888877655


No 31 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.47  E-value=0.00065  Score=71.46  Aligned_cols=133  Identities=29%  Similarity=0.776  Sum_probs=84.9

Q ss_pred             CceecCCCCCCCcC-CCCccccCCccCC----CCCCCCCcccccCC-----CCeecccCCCCCccCCCCccccccccccC
Q psy620          215 PYYRCGSCPEGTTG-NGTRCHDIDECDL----AEPCDPRVQCTNLF-----PGYRCDPCPAGFTGSTGVQGVGLEHAVRF  284 (1290)
Q Consensus       215 g~y~C~~C~~Gy~G-dg~~C~dideC~~----~~pC~~~g~C~n~~-----gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~  284 (1290)
                      ..|.| .|.+||.. +..+|+...+|..    ..+|...++|++..     ..|.|. |.+||....             
T Consensus        18 NHfEC-~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~-C~~gY~~~~-------------   82 (197)
T PF06247_consen   18 NHFEC-KCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCD-CINGYILKQ-------------   82 (197)
T ss_dssp             SEEEE-EESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEE-E-TTEEESS-------------
T ss_pred             CceEE-EcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEe-cccCceeeC-------------
Confidence            36899 99999983 4578988888863    35799999998765     569999 999999876             


Q ss_pred             CCCcccCCCCCCCCCCCCCCCCccccC---CCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecC
Q psy620          285 RQTCVDIDECADGRNGGCDSNSMCTNT---EGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILG  361 (1290)
Q Consensus       285 ~~~C~dideC~~~~~g~C~~~g~C~n~---~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~  361 (1290)
                       ..|.. .+|...   .|. .|.|+-.   +....|  +|.-|+. ..+...|...  +.-.|+  -.|..+-+|... .
T Consensus        83 -~vCvp-~~C~~~---~Cg-~GKCI~d~~~~~~~~C--SC~IGkV-~~dn~kCtk~--G~T~C~--LKCk~nE~CK~~-~  148 (197)
T PF06247_consen   83 -GVCVP-NKCNNK---DCG-SGKCILDPDNPNNPTC--SCNIGKV-PDDNKKCTKT--GETKCS--LKCKENEECKLV-D  148 (197)
T ss_dssp             -SSEEE-GGGSS------T-TEEEEEEEGGGSEEEE--EE-TEEE-TTTTTESEEE--E----------TTTEEEEEE-T
T ss_pred             -CeEch-hhcCce---ecC-CCeEEecCCCCCCcee--EeeeceE-eccCCcccCC--Ccccee--eecCCCcceeee-C
Confidence             34533 456644   387 6789743   334599  9999997 4455667652  234576  468888899998 8


Q ss_pred             CceEEecCCCcccCC
Q psy620          362 NHYACKCDNGWAGDG  376 (1290)
Q Consensus       362 gsy~C~C~~Gy~GdG  376 (1290)
                      +-|+|.|..||.+++
T Consensus       149 ~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  149 GYYKCVCKEGFPGDG  163 (197)
T ss_dssp             TEEEEEE-TT-EEET
T ss_pred             cEEEeecCCCCCCCC
Confidence            899999999998743


No 32 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.19  E-value=0.0025  Score=45.38  Aligned_cols=22  Identities=36%  Similarity=0.777  Sum_probs=19.8

Q ss_pred             CCccccCCCCcc--CCCccccCCCC
Q psy620          608 GPRCMRCPDGYV--GDGIHCKPGVT  630 (1290)
Q Consensus       608 g~~Cg~Cp~G~~--Gdg~~C~dide  630 (1290)
                      +|+| .|++||.  .+|.+|+||||
T Consensus         1 sy~C-~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTC-SCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEe-eCCCCCcCCCCCCccccCCC
Confidence            5889 8999998  68999999997


No 33 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.15  E-value=0.00086  Score=70.56  Aligned_cols=141  Identities=19%  Similarity=0.567  Sum_probs=87.4

Q ss_pred             eeccCCCCcccccCCCCCCC-CCCCceecccCCC-----CCCCCCCCCCCcceeeeccCCC---CCceecCCCCCCCcCC
Q psy620          159 QCFDTVEGYTCGPCPSGYTG-DGERCQRIGGCSR-----NPCAQGKLNEKTRCVRCDDIPE---HPYYRCGSCPEGTTGN  229 (1290)
Q Consensus       159 ~C~n~~g~y~C~~C~~Gy~G-dg~~C~~ideC~~-----~pC~~g~~~~~~~Cg~C~~~~~---~g~y~C~~C~~Gy~Gd  229 (1290)
                      ..+...+.|.| .|.+||.- +..+|+...+|..     .+|..-        ++|+....   ...|+| .|.+||...
T Consensus        12 ~LiQMSNHfEC-~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdy--------a~C~~~~~~~~~~~~~C-~C~~gY~~~   81 (197)
T PF06247_consen   12 YLIQMSNHFEC-KCNEGFVLKNENTCEEKVECDKLENVNKPCGDY--------AKCINQANKGEERAYKC-DCINGYILK   81 (197)
T ss_dssp             EEEEESSEEEE-EESTTEEEEETTEEEE----SG-GGTTSEEETT--------EEEEE-SSTTSSTSEEE-EE-TTEEES
T ss_pred             EEEEccCceEE-EcCCCcEEccccccccceecCcccccCccccch--------hhhhcCCCcccceeEEE-ecccCceee
Confidence            55555678999 99999973 2337998777865     478887        88987652   357999 999999965


Q ss_pred             CCccccCCccCCCCCCCCCcccccC---CCCeecccCCCCCccCCCCccccccccccCCCCcc--cCCCCCCCCCCCCCC
Q psy620          230 GTRCHDIDECDLAEPCDPRVQCTNL---FPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCV--DIDECADGRNGGCDS  304 (1290)
Q Consensus       230 g~~C~dideC~~~~pC~~~g~C~n~---~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~--dideC~~~~~g~C~~  304 (1290)
                      ...|. ..+|. .-.|. .+.|+-.   .....|+ |.-|+...             +...|+  -..+|+..    |..
T Consensus        82 ~~vCv-p~~C~-~~~Cg-~GKCI~d~~~~~~~~CS-C~IGkV~~-------------dn~kCtk~G~T~C~LK----Ck~  140 (197)
T PF06247_consen   82 QGVCV-PNKCN-NKDCG-SGKCILDPDNPNNPTCS-CNIGKVPD-------------DNKKCTKTGETKCSLK----CKE  140 (197)
T ss_dssp             SSSEE-EGGGS-S---T-TEEEEEEEGGGSEEEEE-E-TEEETT-------------TTTESEEEE------------TT
T ss_pred             CCeEc-hhhcC-ceecC-CCeEEecCCCCCCceeE-eeeceEec-------------cCCcccCCCccceeee----cCC
Confidence            56776 35676 66787 5889732   2345899 99999822             234563  33567766    888


Q ss_pred             CCccccCCCCcEEcCcCcCCccccCCCC
Q psy620          305 NSMCTNTEGSFTCTSLCRNSYMVRNVSV  332 (1290)
Q Consensus       305 ~g~C~n~~gsy~C~~~C~~Gy~g~~~g~  332 (1290)
                      +..|....+-|+|  .|..||.+...+.
T Consensus       141 nE~CK~~~~~Y~C--~~~~~~~~~~~~~  166 (197)
T PF06247_consen  141 NEECKLVDGYYKC--VCKEGFPGDGEGE  166 (197)
T ss_dssp             TEEEEEETTEEEE--EE-TT-EEETTT-
T ss_pred             CcceeeeCcEEEe--ecCCCCCCCCCcc
Confidence            8999999999999  9999997665443


No 34 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.96  E-value=0.0069  Score=47.01  Aligned_cols=35  Identities=46%  Similarity=1.107  Sum_probs=29.5

Q ss_pred             CCCCCC-CCCCCCCeeccCCCCcccccCCCCCCCCCCCc
Q psy620          146 GVTCNM-RPCFQGVQCFDTVEGYTCGPCPSGYTGDGERC  183 (1290)
Q Consensus       146 ideC~~-~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C  183 (1290)
                      +++|.. .+|.+++.|++..++|+| .|++||.|.  +|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C-~C~~g~~g~--~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRC-SCPPGYTGR--NC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEe-ECCCCCcCC--cC
Confidence            567877 789888899999999999 899999886  55


No 35 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.93  E-value=0.008  Score=46.64  Aligned_cols=36  Identities=44%  Similarity=1.015  Sum_probs=31.1

Q ss_pred             CCCCCCC-CCCCCCCeeecCCCCcccccCCCCcccCCCCCC
Q psy620          105 KPTCATD-NPCFPGVECRDTREGPRCMRCPDGYVGDGIHCK  144 (1290)
Q Consensus       105 ~d~C~~~-~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~Ce  144 (1290)
                      +++|. . .+|.+++.|++..++|+| .|++||+|  ..|+
T Consensus         2 ~~~C~-~~~~C~~~~~C~~~~~~~~C-~C~~g~~g--~~C~   38 (38)
T cd00054           2 IDECA-SGNPCQNGGTCVNTVGSYRC-SCPPGYTG--RNCE   38 (38)
T ss_pred             cccCC-CCCCcCCCCEeECCCCCeEe-ECCCCCcC--CcCC
Confidence            47787 5 799999999999999999 99999998  5663


No 36 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.92  E-value=0.0041  Score=44.31  Aligned_cols=22  Identities=50%  Similarity=1.159  Sum_probs=19.3

Q ss_pred             CCccCCCCCCCC--CCCCccCCCCC
Q psy620          698 YYRCGSCPEGTT--GNGTRCHDIDE  720 (1290)
Q Consensus       698 ~y~C~~C~~Gy~--Gng~~C~~~~~  720 (1290)
                      ||+|. |++||.  .+|.+|.||+|
T Consensus         1 sy~C~-C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCS-CPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEee-CCCCCcCCCCCCccccCCC
Confidence            69997 999997  68888999886


No 37 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.77  E-value=0.004  Score=48.94  Aligned_cols=36  Identities=39%  Similarity=0.983  Sum_probs=27.9

Q ss_pred             CCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCC
Q psy620          294 CADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGC  334 (1290)
Q Consensus       294 C~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C  334 (1290)
                      |+.. ++.|.  ..|++++++|+|  .|++||++..++++|
T Consensus         1 C~~~-NGgC~--h~C~~~~g~~~C--~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    1 CSVN-NGGCS--HICVNTPGSYRC--SCPPGYKLAEDGRTC   36 (36)
T ss_dssp             CTTG-GGGSS--SEEEEETTSEEE--E-STTEEE-TTSSSE
T ss_pred             CCCC-CCCcC--CCCccCCCceEe--ECCCCCEECcCCCCC
Confidence            3444 66787  689999999999  999999998887765


No 38 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=94.89  E-value=0.026  Score=42.96  Aligned_cols=28  Identities=50%  Similarity=1.169  Sum_probs=19.8

Q ss_pred             CCCCCCCCeeccCCCCcccccCCCCCCCC
Q psy620          151 MRPCFQGVQCFDTVEGYTCGPCPSGYTGD  179 (1290)
Q Consensus       151 ~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gd  179 (1290)
                      ..+|.++++|++..++|+| .|+.||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C-~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRC-VCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEe-ECCCCCccc
Confidence            4567666777777777777 777777665


No 39 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=94.61  E-value=0.037  Score=42.13  Aligned_cols=31  Identities=48%  Similarity=1.073  Sum_probs=26.7

Q ss_pred             CCCCCCCCeeecCCCCcccccCCCCcccCCCCC
Q psy620          111 DNPCFPGVECRDTREGPRCMRCPDGYVGDGIHC  143 (1290)
Q Consensus       111 ~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~C  143 (1290)
                      ..+|.++++|++..++|.| .|+.||.|. ..|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C-~C~~g~~g~-~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRC-VCPPGYTGD-RSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEe-ECCCCCccc-CCc
Confidence            5689999999999999999 999999994 244


No 40 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.38  E-value=0.044  Score=42.24  Aligned_cols=26  Identities=46%  Similarity=1.113  Sum_probs=19.6

Q ss_pred             CCCCCCCeeecCCCCcccccCCCCcccC
Q psy620          112 NPCFPGVECRDTREGPRCMRCPDGYVGD  139 (1290)
Q Consensus       112 ~pC~~gg~C~~~~g~y~C~~C~~Gy~Gd  139 (1290)
                      .+|.++ +|++..++|+| .|++||+|.
T Consensus         6 ~~C~~~-~C~~~~~~~~C-~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINTPGSYTC-SCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECCCCCeEe-ECCCCCccC
Confidence            577777 78777777888 788888773


No 41 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.28  E-value=0.041  Score=42.38  Aligned_cols=33  Identities=55%  Similarity=1.358  Sum_probs=27.6

Q ss_pred             CCCC-CCCCCCCeeccCCCCcccccCCCCCCCCCCCc
Q psy620          148 TCNM-RPCFQGVQCFDTVEGYTCGPCPSGYTGDGERC  183 (1290)
Q Consensus       148 eC~~-~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C  183 (1290)
                      +|.. .+|.++ +|++..++|+| .|++||.|. ..|
T Consensus         1 ~C~~~~~C~~~-~C~~~~~~~~C-~C~~g~~g~-~~C   34 (35)
T smart00181        1 ECASGGPCSNG-TCINTPGSYTC-SCPPGYTGD-KRC   34 (35)
T ss_pred             CCCCcCCCCCC-EEECCCCCeEe-ECCCCCccC-Ccc
Confidence            3566 689998 99999999999 999999994 255


No 42 
>KOG1218|consensus
Probab=91.54  E-value=3.1  Score=48.29  Aligned_cols=193  Identities=24%  Similarity=0.538  Sum_probs=95.4

Q ss_pred             CcccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCC--CCCCCCCCCCCCc
Q psy620          126 GPRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCS--RNPCAQGKLNEKT  203 (1290)
Q Consensus       126 ~y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~--~~pC~~g~~~~~~  203 (1290)
                      ...| .|.++|+|. ..+.....+.  +|...  |........| .+..+|.+.  .|.....+.  ...|...      
T Consensus        14 ~~~c-~c~~~~~g~-~~~~~~~~~~--~~~~~--~~~~~~~~~~-~~~~~~~~~--~c~~~~~~~~~~~~c~~~------   78 (316)
T KOG1218|consen   14 SGQC-FCDPGYTGR-LQCEHQAVTS--ACSGI--CPCEVNSGEC-GLGYGFVGS--VCRIECVCGNAGGGCSQP------   78 (316)
T ss_pred             CCce-ecCCCcccc-ccccCCCCCc--ccccc--CCccCCceeE-ecccccCCC--ccccccccCCCCCcccCc------
Confidence            4578 899999993 2222111111  11111  1112234567 778888877  555422221  1223333      


Q ss_pred             ceeeeccCCCCCceecCCC-CCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccc
Q psy620          204 RCVRCDDIPEHPYYRCGSC-PEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAV  282 (1290)
Q Consensus       204 ~Cg~C~~~~~~g~y~C~~C-~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~  282 (1290)
                        ..|........+.. .| ..+|.+  ..|+...+|...  |.. .+|.+...  .|. |..+|.+..|..      ..
T Consensus        79 --~~c~~~~~~~~~~~-~~~~~~~~g--~~C~~~~~~~~~--c~~-~~C~~~~~--~c~-~~~~~~~~~C~~------~~  141 (316)
T KOG1218|consen   79 --CRCKNGGTCVSSTG-YCHLNGYEG--PQCESPCPCGDG--CAE-KTCANPRR--ECR-CGGGYIGEQCGE------EN  141 (316)
T ss_pred             --cccCCCCcccCCCC-cccCCCCCc--ccccCCCCcCCc--ccc-cccCCCcc--cee-cCCcCccccccc------cC
Confidence              23333221122333 45 577777  778766666422  443 45655443  576 888888877653      01


Q ss_pred             cCCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCC
Q psy620          283 RFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGN  362 (1290)
Q Consensus       283 ~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~g  362 (1290)
                      ..+..|....++        .  ..+...  .-.|  .|++||.+..    |...   ...|.....|.+++.|... . 
T Consensus       142 ~~g~~C~~~c~~--------~--~~~~~~--~~~c--~c~~g~~g~~----~~~~---~~~c~~~~~~~~g~~C~~~-~-  198 (316)
T KOG1218|consen  142 LVGLKCQRDCQC--------T--GGCDCK--NGIC--TCQPGFVGVF----CVES---CSGCSPLTACENGAKCNRS-T-  198 (316)
T ss_pred             CCCCCccCCCCC--------c--cccCCC--CCce--eccCCccccc----cccc---CCCcCCCcccCCCCeeecc-c-
Confidence            112223222111        1  111111  2257  8999998544    4331   1116655678888888765 2 


Q ss_pred             ceEEecCCCccc
Q psy620          363 HYACKCDNGWAG  374 (1290)
Q Consensus       363 sy~C~C~~Gy~G  374 (1290)
                       ..+.+.+++.+
T Consensus       199 -~~~~~~~~~~~  209 (316)
T KOG1218|consen  199 -GSCLCYPGPSG  209 (316)
T ss_pred             -cccccCCCCcc
Confidence             25566665543


No 43 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=88.43  E-value=0.49  Score=37.37  Aligned_cols=29  Identities=31%  Similarity=0.860  Sum_probs=21.2

Q ss_pred             CCCCCCCeEeeecCCceEEecCCCccc--CCCCc
Q psy620          348 TRCDRNAKCTRILGNHYACKCDNGWAG--DGQFC  379 (1290)
Q Consensus       348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~G--dG~~C  379 (1290)
                      ..|++  .|+++ .++|+|.|++||.-  |+++|
T Consensus         6 GgC~h--~C~~~-~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    6 GGCSH--ICVNT-PGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             GGSSS--EEEEE-TTSEEEE-STTEEE-TTSSSE
T ss_pred             CCcCC--CCccC-CCceEeECCCCCEECcCCCCC
Confidence            34655  79998 78999999999975  44544


No 44 
>KOG1218|consensus
Probab=87.97  E-value=11  Score=43.72  Aligned_cols=102  Identities=26%  Similarity=0.673  Sum_probs=57.3

Q ss_pred             cC-CCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCcee----cccCCCCCCCCCCCCCCcce
Q psy620          131 RC-PDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQR----IGGCSRNPCAQGKLNEKTRC  205 (1290)
Q Consensus       131 ~C-~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~----ideC~~~pC~~g~~~~~~~C  205 (1290)
                      .| ..+|.|  ..|+...+|... |.. -+|.+...  .| .|..+|.+.  .|..    ...|... |...        
T Consensus        93 ~~~~~~~~g--~~C~~~~~~~~~-c~~-~~C~~~~~--~c-~~~~~~~~~--~C~~~~~~g~~C~~~-c~~~--------  154 (316)
T KOG1218|consen   93 YCHLNGYEG--PQCESPCPCGDG-CAE-KTCANPRR--EC-RCGGGYIGE--QCGEENLVGLKCQRD-CQCT--------  154 (316)
T ss_pred             cccCCCCCc--ccccCCCCcCCc-ccc-cccCCCcc--ce-ecCCcCccc--cccccCCCCCCccCC-CCCc--------
Confidence            55 678888  788866666544 333 45554432  46 666666655  5543    1112111 1111        


Q ss_pred             eeeccCCCCCceecCCCCCCCcCCCCccccCCc-cCCCCCCCCCcccccCCCC
Q psy620          206 VRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDE-CDLAEPCDPRVQCTNLFPG  257 (1290)
Q Consensus       206 g~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dide-C~~~~pC~~~g~C~n~~gs  257 (1290)
                      ..+...    .-.| .|.+||.+  ..|+.... |.....|.+++.|....+.
T Consensus       155 ~~~~~~----~~~c-~c~~g~~g--~~~~~~~~~c~~~~~~~~g~~C~~~~~~  200 (316)
T KOG1218|consen  155 GGCDCK----NGIC-TCQPGFVG--VFCVESCSGCSPLTACENGAKCNRSTGS  200 (316)
T ss_pred             cccCCC----CCce-eccCCccc--ccccccCCCcCCCcccCCCCeeeccccc
Confidence            112211    2368 89999999  67754332 6655678887788776664


No 45 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=87.30  E-value=0.36  Score=29.56  Aligned_cols=13  Identities=46%  Similarity=1.528  Sum_probs=10.2

Q ss_pred             EEecCCCcccCCCCc
Q psy620          365 ACKCDNGWAGDGQFC  379 (1290)
Q Consensus       365 ~C~C~~Gy~GdG~~C  379 (1290)
                      +|+|++||+|  .+|
T Consensus         1 ~C~C~~G~~G--~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTG--PNC   13 (13)
T ss_dssp             EEEE-TTEET--TTT
T ss_pred             CccCcCCCcC--CCC
Confidence            5999999999  665


No 46 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=85.44  E-value=0.81  Score=35.19  Aligned_cols=27  Identities=30%  Similarity=0.806  Sum_probs=21.8

Q ss_pred             CCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620          348 TRCDRNAKCTRILGNHYACKCDNGWAGDGQFC  379 (1290)
Q Consensus       348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C  379 (1290)
                      ..|.++|+|+..   ..+|+|.+||+|  ..|
T Consensus         6 ~~C~~~G~C~~~---~g~C~C~~g~~G--~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP---CGRCVCDSGYTG--PDC   32 (32)
T ss_pred             CccCCCCEEeCC---CCEEECCCCCcC--CCC
Confidence            468899999864   359999999999  554


No 47 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.37  E-value=0.71  Score=51.36  Aligned_cols=43  Identities=30%  Similarity=0.647  Sum_probs=34.7

Q ss_pred             CCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCC
Q psy620          284 FRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVS  331 (1290)
Q Consensus       284 ~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g  331 (1290)
                      ....|.++++|... ++.|.  ..|.++.|+|.|  .|++||++...+
T Consensus       180 ~~~~C~~~~~C~~~-~~~c~--~~C~~~~g~~~c--~c~~g~~~~~~~  222 (224)
T cd01475         180 QGKICVVPDLCATL-SHVCQ--QVCISTPGSYLC--ACTEGYALLEDN  222 (224)
T ss_pred             ccccCcCchhhcCC-CCCcc--ceEEcCCCCEEe--ECCCCccCCCCC
Confidence            35668888999876 56787  589999999999  999999865433


No 48 
>KOG3514|consensus
Probab=83.22  E-value=1.8  Score=56.08  Aligned_cols=34  Identities=32%  Similarity=0.880  Sum_probs=30.7

Q ss_pred             CCCCCCCCCCCCeeecCCCCcccccCC-CCcccCCCCCC
Q psy620          107 TCATDNPCFPGVECRDTREGPRCMRCP-DGYVGDGIHCK  144 (1290)
Q Consensus       107 ~C~~~~pC~~gg~C~~~~g~y~C~~C~-~Gy~Gdg~~Ce  144 (1290)
                      .|. ++||+|+|+|......|.| .|. .||.|  +.|+
T Consensus       625 ~C~-~nPC~N~g~C~egwNrfiC-DCs~T~~~G--~~Ce  659 (1591)
T KOG3514|consen  625 ICE-SNPCQNGGKCSEGWNRFIC-DCSGTGFEG--RTCE  659 (1591)
T ss_pred             ccC-CCcccCCCCcccccccccc-ccccCcccC--cccc
Confidence            788 9999999999999999999 896 67888  8888


No 49 
>KOG3512|consensus
Probab=82.69  E-value=2.2  Score=51.11  Aligned_cols=116  Identities=20%  Similarity=0.391  Sum_probs=60.5

Q ss_pred             ccccCCCC-eecccCCCCCccCCCCcccccccccc-------------------CCCCcccCCCCCCCCCCCCCCCCccc
Q psy620          250 QCTNLFPG-YRCDPCPAGFTGSTGVQGVGLEHAVR-------------------FRQTCVDIDECADGRNGGCDSNSMCT  309 (1290)
Q Consensus       250 ~C~n~~gs-y~C~~C~~Gy~G~~Ce~~~~~~~~~~-------------------~~~~C~dideC~~~~~g~C~~~g~C~  309 (1290)
                      .|+-...+ ++|. |...-+|+.|+.+...-..++                   ..+.|.---|+-.. .+.+. +++|+
T Consensus       286 ~Cv~d~~~~ltCd-C~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~l-Sgr~S-ggvCl  362 (592)
T KOG3512|consen  286 RCVMDESSHLTCD-CEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRL-SGRRS-GGVCL  362 (592)
T ss_pred             eeeeccCCceEEe-cccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcc-cCccc-cceEe
Confidence            56544444 7888 888888888776554322111                   11111111122221 22233 45565


Q ss_pred             c---CCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCC----CCeEeeecCCceEEecCCCcccCCCCcC
Q psy620          310 N---TEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDR----NAKCTRILGNHYACKCDNGWAGDGQFCG  380 (1290)
Q Consensus       310 n---~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~----~g~C~~~~~gsy~C~C~~Gy~GdG~~Ce  380 (1290)
                      |   ...+..|. .|++||+-. .++.=..    ...|. .-.|++    +-+|..+   +.+|.|++|.+|  .+|.
T Consensus       363 nCrHnTaGrhCh-yCreGyyRd-~s~pl~h----rkaCk-~CdChpVGs~gktCNq~---tGqCpCkeGvtG--~tCn  428 (592)
T KOG3512|consen  363 NCRHNTAGRHCH-YCREGYYRD-GSKPLTH----RKACK-ACDCHPVGSAGKTCNQT---TGQCPCKEGVTG--LTCN  428 (592)
T ss_pred             ecccCCCCcccc-cccCccccC-CCCCCch----hhhhh-hcCCccccccccccccc---CCcccCCCCCcc--cccc
Confidence            4   34456786 899999633 2221111    11222 112443    4478755   349999999999  8884


No 50 
>PF00683 TB:  TB domain;  InterPro: IPR002212 Transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1[]. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs) which are localized to fibrillar structures in the extracellular matrix [].; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=82.61  E-value=0.089  Score=42.97  Aligned_cols=22  Identities=27%  Similarity=0.477  Sum_probs=15.4

Q ss_pred             cCCCCCCCCCCCCCCCCCCCCc
Q psy620          465 DSDHDGIGDACDNCPRVSNPEQ  486 (1290)
Q Consensus       465 c~~~~~~G~~C~~Cp~~~~~~~  486 (1290)
                      |+.+.+||.+|+.||...+.+|
T Consensus        18 Cs~G~aWG~~Ce~CP~~~t~ef   39 (42)
T PF00683_consen   18 CSVGRAWGSPCEPCPPPGTDEF   39 (42)
T ss_dssp             TTT-SEETTTTEE---TTSHHH
T ss_pred             CCCCCcCCCccccCCCCCChHH
Confidence            7889999999999999877655


No 51 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=82.59  E-value=1.2  Score=34.34  Aligned_cols=24  Identities=38%  Similarity=0.785  Sum_probs=19.3

Q ss_pred             CCCCCCCeeecCCCCcccccCCCCccc
Q psy620          112 NPCFPGVECRDTREGPRCMRCPDGYVG  138 (1290)
Q Consensus       112 ~pC~~gg~C~~~~g~y~C~~C~~Gy~G  138 (1290)
                      ..|.++|+|+..  ..+| .|.+||+|
T Consensus         6 ~~C~~~G~C~~~--~g~C-~C~~g~~G   29 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRC-VCDSGYTG   29 (32)
T ss_pred             CccCCCCEEeCC--CCEE-ECCCCCcC
Confidence            468888999866  3489 99999998


No 52 
>KOG3516|consensus
Probab=81.55  E-value=3.2  Score=54.93  Aligned_cols=35  Identities=31%  Similarity=0.894  Sum_probs=27.9

Q ss_pred             CCCCCCCCCCCCCeeecCCCCcccccCC-CCcccCCCCCC
Q psy620          106 PTCATDNPCFPGVECRDTREGPRCMRCP-DGYVGDGIHCK  144 (1290)
Q Consensus       106 d~C~~~~pC~~gg~C~~~~g~y~C~~C~-~Gy~Gdg~~Ce  144 (1290)
                      --|+ +.+|.|||+|+....+|.| -|. ..|.|  ..|.
T Consensus       956 GhCs-s~~C~NGG~Cvery~gytC-DCs~Tay~G--p~Cs  991 (1306)
T KOG3516|consen  956 GHCS-SYPCLNGGHCVERYDGYTC-DCSRTAYDG--PFCS  991 (1306)
T ss_pred             cccc-cccccCCCEEEEecCceee-ccccCcCCC--Cccc
Confidence            4577 7799999999999999999 886 44666  5665


No 53 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.40  E-value=1.1  Score=35.54  Aligned_cols=32  Identities=34%  Similarity=0.690  Sum_probs=23.8

Q ss_pred             CCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620          348 TRCDRNAKCTRILGNHYACKCDNGWAGDGQFC  379 (1290)
Q Consensus       348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C  379 (1290)
                      ..|-.+|.|.+...|+++|+|..||..+|..|
T Consensus         5 ~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C   36 (37)
T PF12946_consen    5 TKCPANAGCFRYDDGSEECRCLLGYKKVGGKC   36 (37)
T ss_dssp             S---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred             ccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence            56778999999867999999999998766555


No 54 
>smart00051 DSL delta serrate ligand.
Probab=79.47  E-value=2.3  Score=38.03  Aligned_cols=44  Identities=20%  Similarity=0.517  Sum_probs=30.4

Q ss_pred             cCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620          320 LCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDGQFC  379 (1290)
Q Consensus       320 ~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C  379 (1290)
                      .|.++|.|..+.+.|..          .+.+..+.+|...    ..|+|.+||+|  ..|
T Consensus        20 ~C~~~~yG~~C~~~C~~----------~~d~~~~~~Cd~~----G~~~C~~Gw~G--~~C   63 (63)
T smart00051       20 TCDENYYGEGCNKFCRP----------RDDFFGHYTCDEN----GNKGCLEGWMG--PYC   63 (63)
T ss_pred             eCCCCCcCCccCCEeCc----------CccccCCccCCcC----CCEecCCCCcC--CCC
Confidence            79999998776666654          1234456667432    37899999999  554


No 55 
>smart00682 G2F G2 nidogen domain and fibulin.
Probab=77.85  E-value=2.1  Score=47.58  Aligned_cols=67  Identities=15%  Similarity=0.171  Sum_probs=50.8

Q ss_pred             eecccccccccccCccceEEEEeeeeccccceeeeecccCCCcccc---hhhhhhhcccccccceeeeeeccccccccc
Q psy620           21 VEGWSVKDDLLDDGVINGLLLGVKQDIMGARYTLYMDCVDHGTVAM---TQSLKKMFDSMKNPQMRLRKTDEESVDEIE   96 (1290)
Q Consensus        21 ~~~~s~~~~~~~~~~~~~l~~~~~~~i~G~~~~ly~~C~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~   96 (1290)
                      ...++.+.+.+   ....++|+++|+|+      |..|++......   ..++.++|+.|...+..||+++.+.+.+..
T Consensus       153 l~s~str~~~v---~~~~~~y~~~Q~I~------y~~C~~~~~~~~~~~~l~Vs~I~~~Y~~~e~~LR~a~~n~i~~~~  222 (227)
T smart00682      153 LTTSSTREYTV---DNQTHSYTVDQTIT------FEECQHRDAFPPTTQQLHVSSVFVDYNDEERVLRFAAHNSVGPGD  222 (227)
T ss_pred             EEEEEeeEEEE---ccEEEeEEEeEEEE------ecccCCCCCCCCcceEEEEEEEEEEecCchhheeeeeeeeecCCC
Confidence            34556666655   45688999999988      999998875532   233459999999999999999988776654


No 56 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=76.95  E-value=1.7  Score=48.24  Aligned_cols=38  Identities=24%  Similarity=0.432  Sum_probs=30.4

Q ss_pred             CCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCC
Q psy620          141 IHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGD  179 (1290)
Q Consensus       141 ~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gd  179 (1290)
                      ..|+++++|...+......|.++.|+|.| .|++||+..
T Consensus       182 ~~C~~~~~C~~~~~~c~~~C~~~~g~~~c-~c~~g~~~~  219 (224)
T cd01475         182 KICVVPDLCATLSHVCQQVCISTPGSYLC-ACTEGYALL  219 (224)
T ss_pred             ccCcCchhhcCCCCCccceEEcCCCCEEe-ECCCCccCC
Confidence            67888889976443333589999999999 999999854


No 57 
>KOG3516|consensus
Probab=71.96  E-value=9.6  Score=50.73  Aligned_cols=39  Identities=31%  Similarity=0.763  Sum_probs=34.1

Q ss_pred             cccCCCCCCCCCCCCCCeeecCCCCcccccCC-CCcccCCCCCC
Q psy620          102 IVKKPTCATDNPCFPGVECRDTREGPRCMRCP-DGYVGDGIHCK  144 (1290)
Q Consensus       102 ~~~~d~C~~~~pC~~gg~C~~~~g~y~C~~C~-~Gy~Gdg~~Ce  144 (1290)
                      |.-++.|. ++||+++|.|..+...|.| .|. .||+|  .+|+
T Consensus       542 C~i~drCl-PN~CehgG~C~Qs~~~f~C-~C~~TGY~G--atCH  581 (1306)
T KOG3516|consen  542 CGISDRCL-PNPCEHGGKCSQSWDDFEC-NCELTGYKG--ATCH  581 (1306)
T ss_pred             cccccccC-CccccCCCcccccccceeE-ecccccccc--cccc
Confidence            44567888 9999999999998899999 999 99999  6777


No 58 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=69.80  E-value=7.5  Score=32.97  Aligned_cols=33  Identities=24%  Similarity=0.725  Sum_probs=23.2

Q ss_pred             CCCCCCCCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620          342 DVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDGQFC  379 (1290)
Q Consensus       342 d~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C  379 (1290)
                      ..|.....|..++.|++.     +|.|++||+-.+.+|
T Consensus        20 ~~C~~~~qC~~~s~C~~g-----~C~C~~g~~~~~~~C   52 (52)
T PF01683_consen   20 ESCESDEQCIGGSVCVNG-----RCQCPPGYVEVGGRC   52 (52)
T ss_pred             CCCCCcCCCCCcCEEcCC-----EeECCCCCEecCCCC
Confidence            346555677788899654     999999997643443


No 59 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=68.98  E-value=0.92  Score=35.94  Aligned_cols=34  Identities=29%  Similarity=0.689  Sum_probs=19.6

Q ss_pred             CCCCCCCCCCeeccCC-CCcccccCCCCCCCCCCCc
Q psy620          149 CNMRPCFQGVQCFDTV-EGYTCGPCPSGYTGDGERC  183 (1290)
Q Consensus       149 C~~~pC~~gg~C~n~~-g~y~C~~C~~Gy~Gdg~~C  183 (1290)
                      |...+|..++.|.+.. |++.| +|..||...+..|
T Consensus         2 C~~~~cP~NA~C~~~~dG~eec-rCllgyk~~~~~C   36 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEEC-RCLLGYKKVGGKC   36 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEE-EE-TTEEEETTEE
T ss_pred             ccCccCCCCcccEEcCCCCEEE-EeeCCccccCCCc
Confidence            4556677777887765 77888 8888886443344


No 60 
>KOG3512|consensus
Probab=68.78  E-value=8.6  Score=46.37  Aligned_cols=16  Identities=38%  Similarity=0.802  Sum_probs=11.9

Q ss_pred             CCCcccccCCCCCCCC
Q psy620          164 VEGYTCGPCPSGYTGD  179 (1290)
Q Consensus       164 ~g~y~C~~C~~Gy~Gd  179 (1290)
                      ..+-.|..|.+||+-+
T Consensus       368 TaGrhChyCreGyyRd  383 (592)
T KOG3512|consen  368 TAGRHCHYCREGYYRD  383 (592)
T ss_pred             CCCcccccccCccccC
Confidence            4456787899999855


No 61 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=64.80  E-value=1.2e+02  Score=40.56  Aligned_cols=85  Identities=26%  Similarity=0.693  Sum_probs=43.1

Q ss_pred             eeecCCCCcccccCCCCcccC-CCCCCCCCCCCCCCCCCCCeeccC-------CCCcccccCCCCCCCCCCCceecccCC
Q psy620          119 ECRDTREGPRCMRCPDGYVGD-GIHCKPGVTCNMRPCFQGVQCFDT-------VEGYTCGPCPSGYTGDGERCQRIGGCS  190 (1290)
Q Consensus       119 ~C~~~~g~y~C~~C~~Gy~Gd-g~~CedideC~~~pC~~gg~C~n~-------~g~y~C~~C~~Gy~Gdg~~C~~ideC~  190 (1290)
                      +|....+...|..|..||... +..|.  ..|....   .+.|..-       .++=.| .|++||+.....|.      
T Consensus       366 tC~~~~~~~tCt~C~~gyl~~~g~sC~--~~C~~~~---~~~Ct~c~~g~~~~~~~C~c-~C~~G~y~~~g~C~------  433 (800)
T PTZ00214        366 TCGYNSGAVTCTRCSAGYLGVDGKSCS--ESCSGDT---RGVCTKVAEGSESTEVSCRC-VCKPTFYNSSGTCT------  433 (800)
T ss_pred             cccCCCCCcccccccCCcCcCCCCccc--ccCCCCC---CCcccccccccccccCcccc-cCCCCcccCCCCcc------
Confidence            444333335688888888642 23453  2332211   1223211       112245 68999885433453      


Q ss_pred             CCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCc
Q psy620          191 RNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTT  227 (1290)
Q Consensus       191 ~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~  227 (1290)
                        +|+..       |.+|...   ....|..|++||.
T Consensus       434 --~C~~s-------Ca~C~~~---~~~~CtsC~~g~~  458 (800)
T PTZ00214        434 --PCTDS-------CAVCKDG---TPTGCQQCSPGKI  458 (800)
T ss_pred             --CCCCc-------ccccCCC---CcCcCccCCCCcE
Confidence              34332       4666543   2446878999985


No 62 
>smart00051 DSL delta serrate ligand.
Probab=61.63  E-value=10  Score=33.95  Aligned_cols=47  Identities=21%  Similarity=0.437  Sum_probs=32.2

Q ss_pred             eecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCC
Q psy620          217 YRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTG  272 (1290)
Q Consensus       217 y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~C  272 (1290)
                      ++- .|.++|.|  ..|.  ..|...+.+..+.+|.. .|  .|. |.+||+|..|
T Consensus        17 ~rv-~C~~~~yG--~~C~--~~C~~~~d~~~~~~Cd~-~G--~~~-C~~Gw~G~~C   63 (63)
T smart00051       17 IRV-TCDENYYG--EGCN--KFCRPRDDFFGHYTCDE-NG--NKG-CLEGWMGPYC   63 (63)
T ss_pred             EEe-eCCCCCcC--CccC--CEeCcCccccCCccCCc-CC--CEe-cCCCCcCCCC
Confidence            344 79999999  7775  34542344566777854 34  577 9999999863


No 63 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=60.88  E-value=40  Score=41.16  Aligned_cols=128  Identities=27%  Similarity=0.627  Sum_probs=68.0

Q ss_pred             ccCCCCccc--CCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCC-CCcee-cccCCCCCCCCCCCCCCcce
Q psy620          130 MRCPDGYVG--DGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDG-ERCQR-IGGCSRNPCAQGKLNEKTRC  205 (1290)
Q Consensus       130 ~~C~~Gy~G--dg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg-~~C~~-ideC~~~pC~~g~~~~~~~C  205 (1290)
                      ..|..||.-  +...|....+|....|.   +|.+... -.|..|..+|+... +.|.. -.+|....+... ......|
T Consensus         2 ~~C~~gy~~~~~~t~C~~~~~C~~~~C~---~Cs~~~~-~~Ct~C~~~~~lt~t~~Ci~~C~~c~~~~~~t~-~~~~~~C   76 (397)
T PF03302_consen    2 TECTSGYKLSTDKTSCVSASECKTPNCK---TCSNDKK-EVCTECNSGYYLTPTNQCIEDCAKCSNYYCSTC-GNDKKTC   76 (397)
T ss_pred             ccccCCceECCCCCcccccCCCCCCCCc---cccCCCC-CccCcCCCCCcCCCCCccccCcccccccccccc-ccccccc
Confidence            468889873  44577766677766664   4655433 56878999987542 23432 111222111111 0012234


Q ss_pred             eeeccCC---CCCceecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCC
Q psy620          206 VRCDDIP---EHPYYRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGST  271 (1290)
Q Consensus       206 g~C~~~~---~~g~y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~  271 (1290)
                      ..|....   -.+.-.|..|+.||+-++..|.   .|.  ..|   ..|... ....|..|++||....
T Consensus        77 ~~C~~~~~~~~~~~~~c~~C~~G~y~~~~~C~---~C~--~~C---~~C~~~-~~~~Ct~C~~g~~L~~  136 (397)
T PF03302_consen   77 KKCSIGNCLTCSGDACCSECPDGYYKNGNKCV---PCH--ESC---ATCSGG-APNQCTSCKPGKVLKY  136 (397)
T ss_pred             cccccccccccccCccccCCCCCccccCCCCC---CCC--ccc---cccCCC-CCCCCcccCCCccccc
Confidence            4444211   0122356689999996556664   221  223   234432 3457888999997665


No 64 
>cd00255 nidG2 Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an extracellular sheet-like matrix. Nidogen is a multifunctional protein that interacts with many other basement membrane proteins, like collagen, perlecan, lamin, and has a potential role in the assembly and connection of networks. Nidogen consists of 3 globular domains (G1-G3), G3 is the lamin-binding domain, while G2 binds collagen IV and perlecan. Also found in hemicentin, a protein which functions at various cell-cell and cell-matrix junctions and might assist in refining broad regions of cell contact into oriented, line-shaped junctions. Nidogen G2 consists of an N-terminal EGF-like domain (excluded from this alignment model) and an 11-stranded beta-barrel with a central helix, a topology that exhibits high structural similarity to the green flourescent proteins of Cnidaria.
Probab=59.86  E-value=6.7  Score=43.68  Aligned_cols=69  Identities=17%  Similarity=0.243  Sum_probs=49.6

Q ss_pred             ceeeccccccccccc-CccceEEEEeeeeccccceeeeecccCCCcc---cchhhhhhhcccccccceeeeeecccccc
Q psy620           19 PIVEGWSVKDDLLDD-GVINGLLLGVKQDIMGARYTLYMDCVDHGTV---AMTQSLKKMFDSMKNPQMRLRKTDEESVD   93 (1290)
Q Consensus        19 ~~~~~~s~~~~~~~~-~~~~~l~~~~~~~i~G~~~~ly~~C~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~   93 (1290)
                      ......+.|.+++.. +...+++|+++|+|+      |..|.+....   .....+.++|..|...+..||+++.+.+.
T Consensus       150 g~l~s~str~~~v~~~~~~~~~~y~~~Q~I~------y~~c~~~~~~~p~~~~l~v~~i~~~Y~~~e~~lrf~~~~~i~  222 (224)
T cd00255         150 GVLTSSSTREYTVDEGGESQTLSYQWNQTIT------YEECPHDDEAAPDLQQLLVARIFALYNPEEEILRFAITNSIG  222 (224)
T ss_pred             CEEEEEEeeEEEEecCCCceEEeEEeeeEEE------EeecCCCCcCCCceEEEEEEEEEEEecChHHheeeeeeeccc
Confidence            445556667676654 335589999999988      9999985422   22333449999999999999998776654


No 65 
>PHA02887 EGF-like protein; Provisional
Probab=57.60  E-value=9  Score=38.03  Aligned_cols=31  Identities=29%  Similarity=0.758  Sum_probs=24.2

Q ss_pred             CCCCCCCeEeeec-CCceEEecCCCcccCCCCcCC
Q psy620          348 TRCDRNAKCTRIL-GNHYACKCDNGWAGDGQFCGR  381 (1290)
Q Consensus       348 ~~C~~~g~C~~~~-~gsy~C~C~~Gy~GdG~~Ce~  381 (1290)
                      +.|- +|+|.... ...+.|.|..||+|  .+|+.
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG--~RCE~  123 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTG--IRCDE  123 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCccc--CCCCc
Confidence            5677 47997542 45689999999999  89973


No 66 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=52.19  E-value=13  Score=37.53  Aligned_cols=31  Identities=23%  Similarity=0.643  Sum_probs=24.5

Q ss_pred             CCCCCCCeEeeec-CCceEEecCCCcccCCCCcCC
Q psy620          348 TRCDRNAKCTRIL-GNHYACKCDNGWAGDGQFCGR  381 (1290)
Q Consensus       348 ~~C~~~g~C~~~~-~gsy~C~C~~Gy~GdG~~Ce~  381 (1290)
                      +.|-++ +|.... ...+.|.|..||+|  .+||.
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtG--eRCEh   82 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTG--IRCQH   82 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCccc--ccccc
Confidence            567764 897642 46799999999999  89974


No 67 
>KOG3514|consensus
Probab=51.82  E-value=10  Score=49.59  Aligned_cols=35  Identities=31%  Similarity=0.830  Sum_probs=30.1

Q ss_pred             CCCCCCCCCCCCceeccCCCCCccccC-CCCccCCCcccc
Q psy620          588 PTCATDNPCFPGVECRDTREGPRCMRC-PDGYVGDGIHCK  626 (1290)
Q Consensus       588 ~~C~~~nPC~~g~~C~~~~~g~~Cg~C-p~G~~Gdg~~C~  626 (1290)
                      ..|. +|||.||++|.+.-+.|.| .| ..||.|  +.|+
T Consensus       624 ~~C~-~nPC~N~g~C~egwNrfiC-DCs~T~~~G--~~Ce  659 (1591)
T KOG3514|consen  624 KICE-SNPCQNGGKCSEGWNRFIC-DCSGTGFEG--RTCE  659 (1591)
T ss_pred             cccC-CCcccCCCCcccccccccc-ccccCcccC--cccc
Confidence            3797 8999999999999999999 68 567877  6776


No 68 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=50.28  E-value=15  Score=37.21  Aligned_cols=39  Identities=38%  Similarity=0.971  Sum_probs=27.9

Q ss_pred             cCCCCCC--CCCCCCCCeeecCC--CCcccccCCCCcccCCCCCCCC
Q psy620          104 KKPTCAT--DNPCFPGVECRDTR--EGPRCMRCPDGYVGDGIHCKPG  146 (1290)
Q Consensus       104 ~~d~C~~--~~pC~~gg~C~~~~--g~y~C~~C~~Gy~Gdg~~Cedi  146 (1290)
                      ++-+|..  .+=|.|| +|.--.  ..+.| .|..||+|  .+||..
T Consensus        41 ~i~~Cp~ey~~YClHG-~C~yI~dl~~~~C-rC~~GYtG--eRCEh~   83 (139)
T PHA03099         41 AIRLCGPEGDGYCLHG-DCIHARDIDGMYC-RCSHGYTG--IRCQHV   83 (139)
T ss_pred             ccccCChhhCCEeECC-EEEeeccCCCcee-ECCCCccc--ccccce
Confidence            4556663  5668876 886543  67889 99999999  688743


No 69 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=49.03  E-value=2.8e+02  Score=33.88  Aligned_cols=44  Identities=34%  Similarity=0.939  Sum_probs=28.3

Q ss_pred             cccccCCCCCCCCCCCceecccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcC
Q psy620          167 YTCGPCPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTG  228 (1290)
Q Consensus       167 y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~G  228 (1290)
                      -.|..|+.||+-++..|.        ||+..       |.+|...   ....|..|++||..
T Consensus        91 ~~c~~C~~G~y~~~~~C~--------~C~~~-------C~~C~~~---~~~~Ct~C~~g~~L  134 (397)
T PF03302_consen   91 ACCSECPDGYYKNGNKCV--------PCHES-------CATCSGG---APNQCTSCKPGKVL  134 (397)
T ss_pred             ccccCCCCCccccCCCCC--------CCCcc-------ccccCCC---CCCCCcccCCCccc
Confidence            356689999986544553        44443       4566543   24578889999874


No 70 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=44.29  E-value=19  Score=35.51  Aligned_cols=32  Identities=28%  Similarity=0.710  Sum_probs=26.2

Q ss_pred             CCCCCCCCCCCCCCeeecCCCCcccccCCCCccc
Q psy620          105 KPTCATDNPCFPGVECRDTREGPRCMRCPDGYVG  138 (1290)
Q Consensus       105 ~d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~G  138 (1290)
                      .+.|.....|.+.+.|... ....| .|.+||.-
T Consensus        77 ~d~Cd~y~~CG~~g~C~~~-~~~~C-~Cl~GF~P  108 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNSN-NSPKC-SCLPGFEP  108 (110)
T ss_pred             ccCCCCccccCCccEeCCC-CCCce-ECCCCcCC
Confidence            4789878999999999543 56689 99999974


No 71 
>PHA02887 EGF-like protein; Provisional
Probab=43.61  E-value=18  Score=36.06  Aligned_cols=28  Identities=32%  Similarity=0.695  Sum_probs=22.6

Q ss_pred             eeeccCCCCCceecCCCCCCCcCCCCccccC
Q psy620          206 VRCDDIPEHPYYRCGSCPEGTTGNGTRCHDI  236 (1290)
Q Consensus       206 g~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~di  236 (1290)
                      |+|.-+.+...+.| .|..||+|  .+|+.+
T Consensus        97 G~C~yI~dL~epsC-rC~~GYtG--~RCE~v  124 (126)
T PHA02887         97 GECMNIIDLDEKFC-ICNKGYTG--IRCDEV  124 (126)
T ss_pred             CEEEccccCCCcee-ECCCCccc--CCCCcc
Confidence            47877766677899 99999999  888743


No 72 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=34.57  E-value=41  Score=28.39  Aligned_cols=35  Identities=29%  Similarity=0.724  Sum_probs=0.0

Q ss_pred             CCCCCe----eecCCCCcccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCC
Q psy620          114 CFPGVE----CRDTREGPRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGD  179 (1290)
Q Consensus       114 C~~gg~----C~~~~g~y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gd  179 (1290)
                      |.+++.    |....+  +| .|+++|+|  ..|+                          .|++||++.
T Consensus         4 C~~~g~~~~~C~~~~G--~C-~C~~~~~G--~~C~--------------------------~C~~g~~~~   42 (50)
T cd00055           4 CNGHGSLSGQCDPGTG--QC-ECKPNTTG--RRCD--------------------------RCAPGYYGL   42 (50)
T ss_pred             CcCCCCCCccccCCCC--EE-eCCCcCCC--CCCC--------------------------CCCCCCccC


No 73 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=33.84  E-value=50  Score=27.93  Aligned_cols=29  Identities=41%  Similarity=1.088  Sum_probs=20.3

Q ss_pred             CCCCCCCCCCCCCeeecCCCCcccccCCCCcccC
Q psy620          106 PTCATDNPCFPGVECRDTREGPRCMRCPDGYVGD  139 (1290)
Q Consensus       106 d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gd  139 (1290)
                      ..|.....|..++.|++.    +| .|++||+-.
T Consensus        20 ~~C~~~~qC~~~s~C~~g----~C-~C~~g~~~~   48 (52)
T PF01683_consen   20 ESCESDEQCIGGSVCVNG----RC-QCPPGYVEV   48 (52)
T ss_pred             CCCCCcCCCCCcCEEcCC----Ee-ECCCCCEec
Confidence            456666677777788653    78 888888743


No 74 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=33.62  E-value=34  Score=33.67  Aligned_cols=32  Identities=28%  Similarity=0.567  Sum_probs=25.6

Q ss_pred             CCCCCCCCCCCCCCeEeeecCCceEEecCCCccc
Q psy620          341 ADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAG  374 (1290)
Q Consensus       341 id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~G  374 (1290)
                      .+.|.....|..+|.|..  ..+..|.|.+||.-
T Consensus        77 ~d~Cd~y~~CG~~g~C~~--~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNS--NNSPKCSCLPGFEP  108 (110)
T ss_pred             ccCCCCccccCCccEeCC--CCCCceECCCCcCC
Confidence            467886789999999954  34568999999974


No 75 
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=31.65  E-value=79  Score=34.14  Aligned_cols=33  Identities=33%  Similarity=0.441  Sum_probs=27.5

Q ss_pred             cccccCccceEEEEeeeeccccceeeeecccCCCccc
Q psy620           29 DLLDDGVINGLLLGVKQDIMGARYTLYMDCVDHGTVA   65 (1290)
Q Consensus        29 ~~~~~~~~~~l~~~~~~~i~G~~~~ly~~C~~~~~~~   65 (1290)
                      ..+.+++||++.+.+...    .++||++|.......
T Consensus       112 ~~l~dg~WH~lal~V~~~----~v~LyvDC~~~~~~~  144 (184)
T smart00210      112 LPLADGQWHKLALSVSGS----SATLYVDCNEIDSRP  144 (184)
T ss_pred             CccccCCceEEEEEEeCC----EEEEEECCcccccee
Confidence            457899999999998776    579999999877664


No 76 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=29.90  E-value=1.8e+02  Score=38.86  Aligned_cols=38  Identities=24%  Similarity=0.699  Sum_probs=24.4

Q ss_pred             CcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCC----CCCCCCCeEe
Q psy620          314 SFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDG----TRCDRNAKCT  357 (1290)
Q Consensus       314 sy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~----~~C~~~g~C~  357 (1290)
                      ..+|  +|..||.....+..|..    ...|...    ..|...++|+
T Consensus       681 ~~~C--~C~~g~~p~~~~~~C~~----~~~C~~~~~gC~~C~~~g~C~  722 (800)
T PTZ00214        681 VRRC--WCERGFLPALDRSGCVL----PTECPPDMPSCAACDESGRCL  722 (800)
T ss_pred             ccee--EecCCcccccCCCcccc----ccCCCcccccccccCCCCcee
Confidence            4589  99999987777778876    2345421    2455555554


No 77 
>KOG3509|consensus
Probab=28.77  E-value=1.1e+02  Score=41.29  Aligned_cols=117  Identities=25%  Similarity=0.432  Sum_probs=0.0

Q ss_pred             ccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCcee-cccCCCCCCCCCCCCCCccee
Q psy620          128 RCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQR-IGGCSRNPCAQGKLNEKTRCV  206 (1290)
Q Consensus       128 ~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~-ideC~~~pC~~g~~~~~~~Cg  206 (1290)
                      .| .|++||.|  ..|++-.++...++.  +.|....   .+ .|.-+....  .|+. .-.|                .
T Consensus       719 ~C-~c~~g~~G--~~ce~c~e~~~ls~t--~~~~~~~---~~-~c~~~~h~~--~c~~~~~~n----------------t  771 (964)
T KOG3509|consen  719 QC-QCPKGLVG--TSCEDCAEGYTLSTT--GGLYPGL---CE-DCECNSHIS--QCEDDLGYN----------------T  771 (964)
T ss_pred             cc-ccCccccC--ccccccccccccccc--CCcCccc---Cc-ccccCCCcc--ccccccccc----------------c


Q ss_pred             eeccCCCCCceecCCCCCCCcCC---CCccccCCccCCCCC--CCCCcccccCCCCeecccCCCCCccCCCC
Q psy620          207 RCDDIPEHPYYRCGSCPEGTTGN---GTRCHDIDECDLAEP--CDPRVQCTNLFPGYRCDPCPAGFTGSTGV  273 (1290)
Q Consensus       207 ~C~~~~~~g~y~C~~C~~Gy~Gd---g~~C~dideC~~~~p--C~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce  273 (1290)
                      .|.+...  +++|..|++||.++   +..+.....|.+..+  +.+...-.-...++.|..|+++++|..|+
T Consensus       772 ~~q~~~~--~~~~~~~~~g~~~da~~g~~~D~~p~~~l~~~~~~~~r~~l~~~~~~~~~~~~p~~~~g~~~~  841 (964)
T KOG3509|consen  772 DCQNNTE--GDRCELCSPGTYGDARRGTPEDCRPATALTIQCSCNNRSPLSCDGFGPGCLLCPHNTEGTTCE  841 (964)
T ss_pred             cccccCc--cceeeecCCCccccCccCCcccCCccchhhhhhhhcccCccccccCCCCcccCCCCccccchh


No 78 
>TIGR00648 recU recombination protein U. The Bacillus protein has been shown to be required for DNA recombination and repair. RJD 11/20/00
Probab=25.88  E-value=32  Score=36.88  Aligned_cols=46  Identities=11%  Similarity=0.172  Sum_probs=36.6

Q ss_pred             ccceeeEEEEeecCCcEEEEEeeecccccc--------cccCcceecCCccEEEE
Q psy620         1226 DLCSGNVATFYQSSQKFYVMMWKKNSQVYW--------QTTPFRAVAEPGIQLKV 1272 (1290)
Q Consensus      1226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~yw--------~~~~~~~~~~~~~~~~~ 1272 (1290)
                      -...+||++.+....+||+|.|++..+ ||        .+-|+.-..+-|++|++
T Consensus       101 ~gGiaF~iI~F~~~~e~y~v~~~~l~~-~w~~~~~~GrKSi~~~~i~~~g~~i~~  154 (169)
T TIGR00648       101 QDGICFLIISFQTFDQVYFLEADKLFY-FWKRKEKNGRKSIRKDEIEETAYPIPL  154 (169)
T ss_pred             CCCEEEEEEEEeecCeEEEEEHHHHHH-HHHHHhhCCCCcccHHHHHHhCEEecc
Confidence            456789999999999999999988754 78        44577777788888865


No 79 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=25.40  E-value=61  Score=27.07  Aligned_cols=22  Identities=27%  Similarity=0.661  Sum_probs=16.9

Q ss_pred             CeEeeecCCceEEecCCCcccCCCCcC
Q psy620          354 AKCTRILGNHYACKCDNGWAGDGQFCG  380 (1290)
Q Consensus       354 g~C~~~~~gsy~C~C~~Gy~GdG~~Ce  380 (1290)
                      .+|...   ..+|.|+++|+|  .+|+
T Consensus        11 ~~C~~~---~G~C~C~~~~~G--~~C~   32 (49)
T PF00053_consen   11 QTCDPS---TGQCVCKPGTTG--PRCD   32 (49)
T ss_dssp             SSEEET---CEEESBSTTEES--TTS-
T ss_pred             CcccCC---CCEEeccccccC--CcCc
Confidence            367653   469999999999  8885


No 80 
>PRK02234 recU Holliday junction-specific endonuclease; Reviewed
Probab=21.73  E-value=47  Score=36.46  Aligned_cols=47  Identities=13%  Similarity=0.274  Sum_probs=35.1

Q ss_pred             cccceeeEEEEeecCCcEEEEEeeecccccc--------cccCcceecCCccEEEE
Q psy620         1225 LDLCSGNVATFYQSSQKFYVMMWKKNSQVYW--------QTTPFRAVAEPGIQLKV 1272 (1290)
Q Consensus      1225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~yw--------~~~~~~~~~~~~~~~~~ 1272 (1290)
                      .-...+||++.+-.-.+||+|.|++.. .||        .+-|+.-..+-|++|++
T Consensus       123 ~~gGiaF~iI~F~~~~e~y~vp~~~l~-~~w~~~~~~grKSI~~e~i~~~~~~i~~  177 (195)
T PRK02234        123 KQGGICFVIIRFSTLDETYLLPASKLI-KFWERQKDGGRKSIPLEEIKKNGYEIPL  177 (195)
T ss_pred             HCCCEEEEEEEEEeCCeEEEEEHHHHH-HHHHHHHhCCCCcccHHHHHHcCEEecc
Confidence            345688999999999999999999874 488        34455555666777754


No 81 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=20.02  E-value=26  Score=31.35  Aligned_cols=48  Identities=25%  Similarity=0.523  Sum_probs=19.7

Q ss_pred             CcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620          314 SFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDGQFC  379 (1290)
Q Consensus       314 sy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C  379 (1290)
                      .++-  .|.+.|.|..+...|.+.    +.-      ..+-+|...  |  .=+|.+||+|  ..|
T Consensus        16 ~~rv--~C~~nyyG~~C~~~C~~~----~d~------~ghy~Cd~~--G--~~~C~~Gw~G--~~C   63 (63)
T PF01414_consen   16 RIRV--VCDENYYGPNCSKFCKPR----DDS------FGHYTCDSN--G--NKVCLPGWTG--PNC   63 (63)
T ss_dssp             ---------TTEETTTT-EE---E----EET------TEEEEE-SS------EEE-TTEES--TTS
T ss_pred             EEEE--ECCCCCCCccccCCcCCC----cCC------cCCcccCCC--C--CCCCCCCCcC--CCC
Confidence            4566  899999987776666551    110      112245432  2  4579999999  555


Done!