Query psy9568
Match_columns 248
No_of_seqs 223 out of 2401
Neff 9.3
Searched_HMMs 46136
Date Fri Aug 16 22:55:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9568.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9568hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1225|consensus 99.7 4.4E-16 9.5E-21 137.8 12.2 130 36-184 235-365 (525)
2 KOG1225|consensus 99.7 3E-16 6.6E-21 138.8 11.1 118 17-149 248-365 (525)
3 KOG1226|consensus 99.4 3.3E-12 7.1E-17 115.6 12.9 178 1-188 433-653 (783)
4 KOG1226|consensus 99.2 1.1E-10 2.4E-15 105.8 11.2 135 55-194 467-627 (783)
5 KOG1219|consensus 99.1 1.4E-10 3E-15 114.0 7.3 102 84-189 3866-3980(4289)
6 KOG1219|consensus 99.1 1.7E-10 3.6E-15 113.5 6.5 95 55-152 3870-3978(4289)
7 KOG4289|consensus 98.9 5.4E-10 1.2E-14 106.4 2.9 108 7-114 1168-1317(2531)
8 KOG4289|consensus 98.9 1.5E-09 3.2E-14 103.6 4.8 87 99-188 1222-1318(2531)
9 KOG1217|consensus 98.5 2.4E-06 5.2E-11 76.6 13.6 158 35-196 152-363 (487)
10 KOG1217|consensus 98.4 3.3E-06 7.2E-11 75.7 10.4 120 66-185 151-306 (487)
11 KOG1214|consensus 98.2 5.6E-06 1.2E-10 76.3 8.0 126 57-183 702-859 (1289)
12 KOG1214|consensus 98.1 1.1E-05 2.5E-10 74.4 8.5 120 25-149 702-860 (1289)
13 KOG4260|consensus 98.1 5.6E-06 1.2E-10 67.1 5.1 133 38-181 131-304 (350)
14 KOG0994|consensus 97.9 6.5E-05 1.4E-09 71.6 8.5 94 93-187 1030-1147(1758)
15 PF07974 EGF_2: EGF-like domai 97.8 2.6E-05 5.6E-10 42.8 3.1 25 24-48 7-32 (32)
16 KOG4260|consensus 97.8 2.7E-05 5.9E-10 63.2 3.7 107 71-182 132-269 (350)
17 KOG0994|consensus 97.5 0.00092 2E-08 64.2 10.3 54 28-81 774-844 (1758)
18 smart00051 DSL delta serrate l 97.3 0.00035 7.7E-09 44.7 3.6 43 38-80 20-63 (63)
19 PF07974 EGF_2: EGF-like domai 97.2 0.00034 7.5E-09 38.3 2.8 26 55-80 6-32 (32)
20 PF00008 EGF: EGF-like domain 97.0 0.00027 5.9E-09 38.8 0.7 25 23-47 4-32 (32)
21 PF12661 hEGF: Human growth fa 96.9 0.00039 8.5E-09 29.9 0.7 12 37-48 2-13 (13)
22 PF12661 hEGF: Human growth fa 96.8 0.00053 1.2E-08 29.5 0.9 13 173-185 1-13 (13)
23 PF00008 EGF: EGF-like domain 96.7 0.00053 1.2E-08 37.6 0.6 22 162-183 5-31 (32)
24 smart00051 DSL delta serrate l 96.6 0.0035 7.7E-08 40.1 3.7 45 138-185 18-63 (63)
25 smart00179 EGF_CA Calcium-bind 96.3 0.0059 1.3E-07 34.6 3.2 25 24-48 10-38 (39)
26 KOG1836|consensus 96.1 0.081 1.8E-06 54.4 12.2 16 35-50 695-710 (1705)
27 KOG1218|consensus 96.1 0.23 4.9E-06 42.2 13.3 151 34-193 14-183 (316)
28 smart00179 EGF_CA Calcium-bind 95.9 0.011 2.3E-07 33.5 2.9 25 161-185 9-38 (39)
29 cd00054 EGF_CA Calcium-binding 95.8 0.012 2.6E-07 32.8 3.1 25 24-48 10-37 (38)
30 cd00053 EGF Epidermal growth f 95.4 0.019 4.2E-07 31.4 2.8 26 23-48 6-35 (36)
31 KOG1218|consensus 95.1 0.53 1.1E-05 40.0 12.1 144 35-180 49-207 (316)
32 cd00054 EGF_CA Calcium-binding 94.8 0.038 8.3E-07 30.7 2.9 24 162-185 10-37 (38)
33 PF12662 cEGF: Complement Clr- 94.5 0.032 7E-07 28.2 1.8 11 171-181 1-11 (24)
34 PF12947 EGF_3: EGF domain; I 94.5 0.014 3E-07 32.8 0.5 23 24-46 7-32 (36)
35 smart00181 EGF Epidermal growt 94.3 0.06 1.3E-06 29.6 2.9 24 24-48 7-34 (35)
36 cd00055 EGF_Lam Laminin-type e 94.1 0.062 1.3E-06 32.6 2.8 15 35-49 19-33 (50)
37 PF01414 DSL: Delta serrate li 93.4 0.026 5.6E-07 36.1 0.3 44 36-80 18-63 (63)
38 cd00053 EGF Epidermal growth f 93.3 0.1 2.2E-06 28.4 2.7 23 56-78 7-32 (36)
39 KOG1836|consensus 93.2 0.19 4.2E-06 51.8 6.1 14 68-81 696-709 (1705)
40 smart00181 EGF Epidermal growt 93.1 0.088 1.9E-06 28.9 2.2 16 170-185 18-34 (35)
41 PF07645 EGF_CA: Calcium-bindi 92.9 0.044 9.5E-07 31.9 0.7 21 161-181 10-34 (42)
42 PF00053 Laminin_EGF: Laminin 92.8 0.062 1.4E-06 32.3 1.4 20 30-49 12-32 (49)
43 PF07645 EGF_CA: Calcium-bindi 92.6 0.072 1.6E-06 31.0 1.4 21 24-44 11-34 (42)
44 PHA02887 EGF-like protein; Pro 92.3 0.094 2E-06 37.3 1.9 28 160-187 91-123 (126)
45 smart00180 EGF_Lam Laminin-typ 91.9 0.17 3.8E-06 30.0 2.5 15 35-49 18-32 (46)
46 PHA03099 epidermal growth fact 91.0 0.14 3E-06 37.2 1.6 29 160-188 50-83 (139)
47 PF01414 DSL: Delta serrate li 91.0 0.066 1.4E-06 34.2 -0.0 15 100-114 18-32 (63)
48 PF06247 Plasmod_Pvs28: Plasmo 90.5 0.047 1E-06 42.4 -1.2 79 29-108 11-119 (197)
49 PF12947 EGF_3: EGF domain; I 89.6 0.13 2.9E-06 28.8 0.5 23 56-78 7-32 (36)
50 PHA02887 EGF-like protein; Pro 85.6 0.73 1.6E-05 32.9 2.3 23 28-50 96-123 (126)
51 KOG3607|consensus 85.5 0.65 1.4E-05 44.2 2.7 54 24-83 605-658 (716)
52 cd00055 EGF_Lam Laminin-type e 80.6 1.6 3.6E-05 26.2 2.2 16 171-186 18-33 (50)
53 PF01683 EB: EB module; Inter 80.2 2.6 5.6E-05 25.5 3.1 29 44-76 18-46 (52)
54 PHA03099 epidermal growth fact 79.8 1.6 3.4E-05 31.8 2.2 15 35-49 67-81 (139)
55 KOG0196|consensus 72.7 11 0.00025 36.3 6.3 14 97-110 306-319 (996)
56 KOG3607|consensus 72.7 4.4 9.5E-05 38.7 3.8 32 85-116 628-659 (716)
57 PF14670 FXa_inhibition: Coagu 71.7 1.8 3.9E-05 24.2 0.6 13 170-182 17-29 (36)
58 PF09064 Tme5_EGF_like: Thromb 70.1 3.6 7.9E-05 22.6 1.6 10 35-44 18-27 (34)
59 KOG3512|consensus 65.9 29 0.00063 31.3 7.1 20 167-186 408-428 (592)
60 PF12955 DUF3844: Domain of un 59.2 6.9 0.00015 27.6 1.7 20 23-42 13-40 (103)
61 KOG3512|consensus 56.5 14 0.0003 33.3 3.5 16 66-81 413-428 (592)
62 PF04863 EGF_alliinase: Alliin 55.9 4.6 0.0001 24.8 0.4 16 173-188 37-52 (56)
63 PF12946 EGF_MSP1_1: MSP1 EGF 46.3 7.8 0.00017 21.8 0.3 21 24-44 6-30 (37)
64 KOG3516|consensus 44.2 21 0.00045 35.8 2.9 36 156-191 545-586 (1306)
65 PF00954 S_locus_glycop: S-loc 43.0 42 0.00091 23.6 3.8 26 19-44 78-107 (110)
66 KOG3516|consensus 40.5 18 0.00039 36.3 1.9 35 17-51 544-583 (1306)
67 KOG3514|consensus 23.0 56 0.0012 32.8 2.0 31 158-188 625-661 (1591)
68 KOG3514|consensus 21.6 56 0.0012 32.8 1.7 30 20-49 625-659 (1591)
No 1
>KOG1225|consensus
Probab=99.68 E-value=4.4e-16 Score=137.81 Aligned_cols=130 Identities=35% Similarity=0.987 Sum_probs=66.2
Q ss_pred ceeeCCCCcCCCCCcCCCCCCCCCCCeecCCCceeeCCCCccCCCCcCCCCCCCCCCceecCCCceeCCCCcccCCCCcc
Q psy9568 36 TCECQKGFYGLRCEFCICTEKCLNGGKCVQKDTCECQKGFYGLRCEFSKCIIPCLNGGRCKGVNKCRCPPGFLGDYCEIW 115 (248)
Q Consensus 36 ~C~C~~G~~G~~C~~~~c~~~C~~~g~C~~~~~C~C~~g~~G~~C~~~~C~~~C~~~g~C~~~~~C~C~~g~~g~~C~~~ 115 (248)
.|.|+.+|+|+.|....|...|.++|.|++ ++|+|++||.|..|+...|...|++++.+.. +.|+|++||.|..|++.
T Consensus 235 ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~-G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~-g~CiC~~g~~G~dCs~~ 312 (525)
T KOG1225|consen 235 ICECPEGYFGPLCSTIYCPGGCTGRGQCVE-GRCICPPGFTGDDCDELVCPVDCSGGGVCVD-GECICNPGYSGKDCSIR 312 (525)
T ss_pred eeecCCceeCCccccccCCCCCcccceEeC-CeEeCCCCCcCCCCCcccCCcccCCCceecC-CEeecCCCccccccccc
Confidence 455555555555554445555555555554 5555555555555554445544555555543 35555555555555432
Q ss_pred CCCCCCCCCCCCCCCceeeCCCeeeCCCCCCCCCCCcCCCCCCCCCCCCC-CccCCCCeeecCCCCccCC
Q psy9568 116 QRPYISKCIIPCLNGGRCKGVNKCRCPPGFLGDYCEIWQRPYICPKPCKQ-GVCSAARTCACYEGWFGRT 184 (248)
Q Consensus 116 ~~~~~~~c~~~C~~~g~C~~~~~C~C~~g~~g~~C~~~~~~~~c~~~C~~-g~C~~~~~C~C~~g~~G~~ 184 (248)
. |...|.++|.|+ .++|.|.+||+|..|+.. . |.+ |.|+ .+ |.|..||.|.+
T Consensus 313 ~------cpadC~g~G~Ci-~G~C~C~~Gy~G~~C~~~-------~-C~~~g~cv-~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 313 R------CPADCSGHGKCI-DGECLCDEGYTGELCIQR-------A-CSGGGQCV-NG-CKCKKGWRGPD 365 (525)
T ss_pred c------CCccCCCCCccc-CCceEeCCCCcCCccccc-------c-cCCCceec-cC-ceeccCccCCC
Confidence 1 223455555555 355555555555555541 1 444 3444 23 55555555554
No 2
>KOG1225|consensus
Probab=99.67 E-value=3e-16 Score=138.80 Aligned_cols=118 Identities=33% Similarity=0.932 Sum_probs=107.5
Q ss_pred ccCCCCcCCCCCCEeecCCceeeCCCCcCCCCCcCCCCCCCCCCCeecCCCceeeCCCCccCCCCcCCCCCCCCCCceec
Q psy9568 17 VSGICTEKCLNGGKCVQKDTCECQKGFYGLRCEFCICTEKCLNGGKCVQKDTCECQKGFYGLRCEFSKCIIPCLNGGRCK 96 (248)
Q Consensus 17 ~~~~C~~~C~~~g~C~~~~~C~C~~G~~G~~C~~~~c~~~C~~~g~C~~~~~C~C~~g~~G~~C~~~~C~~~C~~~g~C~ 96 (248)
....|+..|.++|.|+. +.|+|++||+|.+|++..|...|++++.+++ +.|.|++||+|..|++..|...|.++|.|+
T Consensus 248 ~~~~C~~~c~~~g~c~~-G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~-g~CiC~~g~~G~dCs~~~cpadC~g~G~Ci 325 (525)
T KOG1225|consen 248 STIYCPGGCTGRGQCVE-GRCICPPGFTGDDCDELVCPVDCSGGGVCVD-GECICNPGYSGKDCSIRRCPADCSGHGKCI 325 (525)
T ss_pred ccccCCCCCcccceEeC-CeEeCCCCCcCCCCCcccCCcccCCCceecC-CEeecCCCccccccccccCCccCCCCCccc
Confidence 46788888999999998 8999999999999998889888999999998 599999999999999988999999999999
Q ss_pred CCCceeCCCCcccCCCCccCCCCCCCCCCCCCCCceeeCCCeeeCCCCCCCCC
Q psy9568 97 GVNKCRCPPGFLGDYCEIWQRPYISKCIIPCLNGGRCKGVNKCRCPPGFLGDY 149 (248)
Q Consensus 97 ~~~~C~C~~g~~g~~C~~~~~~~~~~c~~~C~~~g~C~~~~~C~C~~g~~g~~ 149 (248)
.++|.|.+||+|..|... .|.+++.|+. + |.|..||.|.+
T Consensus 326 -~G~C~C~~Gy~G~~C~~~----------~C~~~g~cv~-g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 326 -DGECLCDEGYTGELCIQR----------ACSGGGQCVN-G-CKCKKGWRGPD 365 (525)
T ss_pred -CCceEeCCCCcCCccccc----------ccCCCceecc-C-ceeccCccCCC
Confidence 689999999999999862 3888888987 4 99999999998
No 3
>KOG1226|consensus
Probab=99.41 E-value=3.3e-12 Score=115.57 Aligned_cols=178 Identities=30% Similarity=0.714 Sum_probs=123.1
Q ss_pred CceeEEEEEeEEE-eeeccCCCCcCC-----------CCCCEeecCCceeeCCCCcCCCCCc-----------CCC----
Q psy9568 1 MRLNVAVVSLMTI-LLFVSGICTEKC-----------LNGGKCVQKDTCECQKGFYGLRCEF-----------CIC---- 53 (248)
Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~C~~~C-----------~~~g~C~~~~~C~C~~G~~G~~C~~-----------~~c---- 53 (248)
++|.|++|.|..+ .+.+...|.-+| +.+|...= |.|.|.+||.|+.|+- ..|
T Consensus 433 ~~~~i~pvgf~e~l~v~v~~~C~C~C~~~~e~~s~~C~g~G~~~C-G~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~ 511 (783)
T KOG1226|consen 433 GSFIIRPVGFTETLEVIVQYNCECDCQDQGEPNSALCHGNGTFVC-GQCRCDEGWLGKKCECSTDELSSSEEEDKCRENS 511 (783)
T ss_pred ceEEEccCCCCcceEEEeeccccccccccCCCCccccCCCCcEEe-cceecCCCCCCCcccCCccccCcHhHHhhccCCC
Confidence 4689999988555 456667775444 33444433 6899999999999971 112
Q ss_pred -CCCCCCCCeecCCCceeeCCCCc----cCCCCcCC--CC----CCCCCCceecCCCceeCCCCcccCCCCccCCC--CC
Q psy9568 54 -TEKCLNGGKCVQKDTCECQKGFY----GLRCEFSK--CI----IPCLNGGRCKGVNKCRCPPGFLGDYCEIWQRP--YI 120 (248)
Q Consensus 54 -~~~C~~~g~C~~~~~C~C~~g~~----G~~C~~~~--C~----~~C~~~g~C~~~~~C~C~~g~~g~~C~~~~~~--~~ 120 (248)
..+|++.|.|+- ++|.|.+... |..|+.+. |. ..|.++|.|.- ++|+|.+||+|+.|+..... +.
T Consensus 512 ~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~C~~std~C~ 589 (783)
T KOG1226|consen 512 DSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACNCPLSTDTCE 589 (783)
T ss_pred CCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeC-CcEEcCCCCccCCCCCCCCCcccc
Confidence 236999999997 8999998766 89998653 44 44999999874 79999999999998754322 22
Q ss_pred CCCCCCCCCCceeeCCCeeeCCCC-CCCCCCCcCCCCCCCCCCCCC-CccCCCCee-ecCCCCccCCCCCC
Q psy9568 121 SKCIIPCLNGGRCKGVNKCRCPPG-FLGDYCEIWQRPYICPKPCKQ-GVCSAARTC-ACYEGWFGRTCSQR 188 (248)
Q Consensus 121 ~~c~~~C~~~g~C~~~~~C~C~~g-~~g~~C~~~~~~~~c~~~C~~-g~C~~~~~C-~C~~g~~G~~C~~~ 188 (248)
..-...|+..|+|.- ++|.|... |.|..|+. .+.|+++|.. ..|+ +| ....|+.+..|...
T Consensus 590 ~~~G~iCSGrG~C~C-g~C~C~~~~~sG~~CE~---cptc~~~C~~~~~Cv---eC~~~~~g~~~~~C~~~ 653 (783)
T KOG1226|consen 590 SSDGQICSGRGTCEC-GRCKCTDPPYSGEFCEK---CPTCPDPCAENKSCV---ECQAFETGPVGDTCVEE 653 (783)
T ss_pred CCCCceeCCCceeeC-CceEcCCCCcCcchhhc---CCCCCCcccccccch---hhcccccccccchHHHH
Confidence 211245777788774 67888766 99999997 3566777753 5554 11 22345666666543
No 4
>KOG1226|consensus
Probab=99.21 E-value=1.1e-10 Score=105.78 Aligned_cols=135 Identities=29% Similarity=0.700 Sum_probs=101.3
Q ss_pred CCCCCCCeecCCCceeeCCCCccCCCCcC-----------CCC-----CCCCCCceecCCCceeCCCCcc----cCCCCc
Q psy9568 55 EKCLNGGKCVQKDTCECQKGFYGLRCEFS-----------KCI-----IPCLNGGRCKGVNKCRCPPGFL----GDYCEI 114 (248)
Q Consensus 55 ~~C~~~g~C~~~~~C~C~~g~~G~~C~~~-----------~C~-----~~C~~~g~C~~~~~C~C~~g~~----g~~C~~ 114 (248)
..|+++|+.+- +.|.|.+||.|..|+.+ .|. .+|+++|.|.- ++|+|.+... |..|+-
T Consensus 467 ~~C~g~G~~~C-G~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCEC 544 (783)
T KOG1226|consen 467 ALCHGNGTFVC-GQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCEC 544 (783)
T ss_pred cccCCCCcEEe-cceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeec
Confidence 45888888886 89999999999999732 243 36999999984 6899988766 899987
Q ss_pred cCCCCCCCCCCCCCCCceeeCCCeeeCCCCCCCCCCCcCCCCCCCC----CCCCC-CccCCCCeeecCCC-CccCCCCCC
Q psy9568 115 WQRPYISKCIIPCLNGGRCKGVNKCRCPPGFLGDYCEIWQRPYICP----KPCKQ-GVCSAARTCACYEG-WFGRTCSQR 188 (248)
Q Consensus 115 ~~~~~~~~c~~~C~~~g~C~~~~~C~C~~g~~g~~C~~~~~~~~c~----~~C~~-g~C~~~~~C~C~~g-~~G~~C~~~ 188 (248)
+...+...-...|.++|.|.- ++|.|.+||+|..|+-..+.+.|. ..|.. |+|. -++|.|... |.|..|+.
T Consensus 545 DnfsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~-Cg~C~C~~~~~sG~~CE~- 621 (783)
T KOG1226|consen 545 DNFSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCE-CGRCKCTDPPYSGEFCEK- 621 (783)
T ss_pred cCcccccccCcccCCCCeEeC-CcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceee-CCceEcCCCCcCcchhhc-
Confidence 665544433357999999875 899999999999998876666662 23543 6776 568888655 99999975
Q ss_pred CCCCCC
Q psy9568 189 SDSGED 194 (248)
Q Consensus 189 ~~~~~~ 194 (248)
.+.+..
T Consensus 622 cptc~~ 627 (783)
T KOG1226|consen 622 CPTCPD 627 (783)
T ss_pred CCCCCC
Confidence 344433
No 5
>KOG1219|consensus
Probab=99.11 E-value=1.4e-10 Score=114.00 Aligned_cols=102 Identities=40% Similarity=0.935 Sum_probs=85.3
Q ss_pred CCC-CCCCCCceec----CCCceeCCCCcccCCCCccCCCCCCCCCCCCCCCceee---CCCeeeCCCCCCCCCCCcCCC
Q psy9568 84 KCI-IPCLNGGRCK----GVNKCRCPPGFLGDYCEIWQRPYISKCIIPCLNGGRCK---GVNKCRCPPGFLGDYCEIWQR 155 (248)
Q Consensus 84 ~C~-~~C~~~g~C~----~~~~C~C~~g~~g~~C~~~~~~~~~~c~~~C~~~g~C~---~~~~C~C~~g~~g~~C~~~~~ 155 (248)
.|. .||+++|.|. +.|.|.|++-|.|..|++....+.. .||.++|+|. ..+.|.|+.||+|..|+..-
T Consensus 3866 ~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~s---nPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~G- 3941 (4289)
T KOG1219|consen 3866 PCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCAS---NPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARG- 3941 (4289)
T ss_pred ccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccC---CCCCCCCEEEecCCCeeEeCCCCccCceeeccc-
Confidence 444 7899999998 4589999999999999998765544 5999999998 44689999999999999852
Q ss_pred CCCC-CCCCCC-CccC---CCCeeecCCCCccCCCCCCC
Q psy9568 156 PYIC-PKPCKQ-GVCS---AARTCACYEGWFGRTCSQRS 189 (248)
Q Consensus 156 ~~~c-~~~C~~-g~C~---~~~~C~C~~g~~G~~C~~~~ 189 (248)
..+| .++|.+ |.|+ ++|.|.|-+||.|..|....
T Consensus 3942 i~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~~ 3980 (4289)
T KOG1219|consen 3942 ISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCCAEK 3980 (4289)
T ss_pred ccccccccccCCceeeccCCceEeccChhHhcccCcccc
Confidence 4567 688987 8898 78999999999999985433
No 6
>KOG1219|consensus
Probab=99.08 E-value=1.7e-10 Score=113.45 Aligned_cols=95 Identities=40% Similarity=1.006 Sum_probs=72.8
Q ss_pred CCCCCCCeecC----CCceeeCCCCccCCCCc--CCCC-CCCCCCceec---CCCceeCCCCcccCCCCccCCCCCCCCC
Q psy9568 55 EKCLNGGKCVQ----KDTCECQKGFYGLRCEF--SKCI-IPCLNGGRCK---GVNKCRCPPGFLGDYCEIWQRPYISKCI 124 (248)
Q Consensus 55 ~~C~~~g~C~~----~~~C~C~~g~~G~~C~~--~~C~-~~C~~~g~C~---~~~~C~C~~g~~g~~C~~~~~~~~~~c~ 124 (248)
.+|+++|+|.. .|.|.|++-|.|.+|++ ..|. .||.+||+|+ +.+.|.|+.||+|..|+... +++|.
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~G---i~eCs 3946 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARG---ISECS 3946 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeeccc---ccccc
Confidence 57888888875 46888888888888875 3566 7888888887 45788888888888888652 34554
Q ss_pred -CCCCCCceee---CCCeeeCCCCCCCCCCCc
Q psy9568 125 -IPCLNGGRCK---GVNKCRCPPGFLGDYCEI 152 (248)
Q Consensus 125 -~~C~~~g~C~---~~~~C~C~~g~~g~~C~~ 152 (248)
.+|.++|.|. +++.|.|.+||.|+.|..
T Consensus 3947 ~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3947 KNVCGTGGQCINIPGSFHCNCTPGILGRTCCA 3978 (4289)
T ss_pred cccccCCceeeccCCceEeccChhHhcccCcc
Confidence 5788888887 557888888888887754
No 7
>KOG4289|consensus
Probab=98.91 E-value=5.4e-10 Score=106.37 Aligned_cols=108 Identities=44% Similarity=0.932 Sum_probs=85.2
Q ss_pred EEEeEEEeeeccCCCCc-CCCCCCEeec-------------------------CCceeeCCCCcCCCCCcC--CC-CCCC
Q psy9568 7 VVSLMTILLFVSGICTE-KCLNGGKCVQ-------------------------KDTCECQKGFYGLRCEFC--IC-TEKC 57 (248)
Q Consensus 7 ~~~~~~~~~~~~~~C~~-~C~~~g~C~~-------------------------~~~C~C~~G~~G~~C~~~--~c-~~~C 57 (248)
.+++...+..+.++|.. +|.+...|++ +..|+|++||+|..|+.. .| ..+|
T Consensus 1168 ~~sll~VlpfdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC 1247 (2531)
T KOG4289|consen 1168 AISLLRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPC 1247 (2531)
T ss_pred HhhheeeeeccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCC
Confidence 34566666677788987 8888777763 126999999999999853 46 4689
Q ss_pred CCCCeecC---CCceeeCCCCccCCCCcC----CCC-CCCCCCceec----CCCceeCCCC-cccCCCCc
Q psy9568 58 LNGGKCVQ---KDTCECQKGFYGLRCEFS----KCI-IPCLNGGRCK----GVNKCRCPPG-FLGDYCEI 114 (248)
Q Consensus 58 ~~~g~C~~---~~~C~C~~g~~G~~C~~~----~C~-~~C~~~g~C~----~~~~C~C~~g-~~g~~C~~ 114 (248)
.++|.|.. .|+|.|.+||.|.+|+++ .|. ..|+++|+|. +.+.|.|+.| |.++.|+.
T Consensus 1248 ~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1248 GNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred CCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCcccCCCceEE
Confidence 99999985 789999999999999964 354 4599999998 4578999987 55777764
No 8
>KOG4289|consensus
Probab=98.89 E-value=1.5e-09 Score=103.57 Aligned_cols=87 Identities=45% Similarity=1.007 Sum_probs=74.1
Q ss_pred CceeCCCCcccCCCCccCCCCCCCCCCCCCCCceee---CCCeeeCCCCCCCCCCCcCCCCCCC-CCCCCC-CccC----
Q psy9568 99 NKCRCPPGFLGDYCEIWQRPYISKCIIPCLNGGRCK---GVNKCRCPPGFLGDYCEIWQRPYIC-PKPCKQ-GVCS---- 169 (248)
Q Consensus 99 ~~C~C~~g~~g~~C~~~~~~~~~~c~~~C~~~g~C~---~~~~C~C~~g~~g~~C~~~~~~~~c-~~~C~~-g~C~---- 169 (248)
..|.|++||+|+.|+...+.+.. .+|.++|.|. +.|+|.|.+||+|.+|+.......| +..|.| |+|+
T Consensus 1222 lrCrCPpGFTgd~CeTeiDlCYs---~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~n 1298 (2531)
T KOG4289|consen 1222 LRCRCPPGFTGDYCETEIDLCYS---GPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLN 1298 (2531)
T ss_pred eeEeCCCCCCcccccchhHhhhc---CCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCC
Confidence 68999999999999987765544 5999999998 6689999999999999998877888 888988 8898
Q ss_pred CCCeeecCCC-CccCCCCCC
Q psy9568 170 AARTCACYEG-WFGRTCSQR 188 (248)
Q Consensus 170 ~~~~C~C~~g-~~G~~C~~~ 188 (248)
+.+.|+|+.| |+++.|+..
T Consensus 1299 ggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1299 GGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred CceeccCCCcccCCCceEEE
Confidence 6788999877 567888754
No 9
>KOG1217|consensus
Probab=98.51 E-value=2.4e-06 Score=76.63 Aligned_cols=158 Identities=39% Similarity=0.966 Sum_probs=88.8
Q ss_pred CceeeCCCCcCCCCCcC--CCC---CCCCCCCeecC---CCceeeCCCCccCCCCcC---------------------CC
Q psy9568 35 DTCECQKGFYGLRCEFC--ICT---EKCLNGGKCVQ---KDTCECQKGFYGLRCEFS---------------------KC 85 (248)
Q Consensus 35 ~~C~C~~G~~G~~C~~~--~c~---~~C~~~g~C~~---~~~C~C~~g~~G~~C~~~---------------------~C 85 (248)
+.|.|..||.+..+... .|. ..|.+++.|.+ .+.|.|.++|.+..++.. .|
T Consensus 152 ~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c 231 (487)
T KOG1217|consen 152 FRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPEC 231 (487)
T ss_pred eeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCc
Confidence 55666666666655532 332 23555566654 235666666665555422 11
Q ss_pred C---CCCCCC-ceec---CCCceeCCCCcccCCC--CccCCCCCCCCC-C-CCCCCceeeC---CCeeeCCCCCCCCCCC
Q psy9568 86 I---IPCLNG-GRCK---GVNKCRCPPGFLGDYC--EIWQRPYISKCI-I-PCLNGGRCKG---VNKCRCPPGFLGDYCE 151 (248)
Q Consensus 86 ~---~~C~~~-g~C~---~~~~C~C~~g~~g~~C--~~~~~~~~~~c~-~-~C~~~g~C~~---~~~C~C~~g~~g~~C~ 151 (248)
. ..+..+ +.|. +.+.|.|.+||.+..+ .... +.|. . +|.++++|.. .+.|.|++||+|..+.
T Consensus 232 ~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~----~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~ 307 (487)
T KOG1217|consen 232 EVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDV----DSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCT 307 (487)
T ss_pred ccccccccCCCCcccccCCceeeeCCCCccccccceeeec----cccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCc
Confidence 1 112222 5565 3468888888888763 1111 2222 1 3777888873 2788999999988871
Q ss_pred cCCCCCCC-----CCCCCC-CccC-----CCCeeecCCCCccCCCCCCCCCCCCCC
Q psy9568 152 IWQRPYIC-----PKPCKQ-GVCS-----AARTCACYEGWFGRTCSQRSDSGEDKS 196 (248)
Q Consensus 152 ~~~~~~~c-----~~~C~~-g~C~-----~~~~C~C~~g~~G~~C~~~~~~~~~~~ 196 (248)
.......| ...|.+ +.|. ..+.|.|..+|.|..|+...+++....
T Consensus 308 ~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~ 363 (487)
T KOG1217|consen 308 ECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSP 363 (487)
T ss_pred cccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCc
Confidence 11122333 344665 4663 356788988888988875544444433
No 10
>KOG1217|consensus
Probab=98.35 E-value=3.3e-06 Score=75.69 Aligned_cols=120 Identities=39% Similarity=0.982 Sum_probs=82.1
Q ss_pred CCceeeCCCCccCCCCc--CCCC---CCCCCCceec---CCCceeCCCCcccCCCCccC--CCC-------------CCC
Q psy9568 66 KDTCECQKGFYGLRCEF--SKCI---IPCLNGGRCK---GVNKCRCPPGFLGDYCEIWQ--RPY-------------ISK 122 (248)
Q Consensus 66 ~~~C~C~~g~~G~~C~~--~~C~---~~C~~~g~C~---~~~~C~C~~g~~g~~C~~~~--~~~-------------~~~ 122 (248)
.+.|.|..||.+..+.. +.|. .+|.+++.|. ..+.|.|.++|.+..++... ..+ ...
T Consensus 151 ~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~ 230 (487)
T KOG1217|consen 151 PFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPE 230 (487)
T ss_pred ceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCC
Confidence 58899999999999874 3675 3588888887 34789999999998887540 000 011
Q ss_pred CC---CCCCCC-ceee---CCCeeeCCCCCCCCCCCcCCCCCCC-CC-CCCC-CccC---CCCeeecCCCCccCCC
Q psy9568 123 CI---IPCLNG-GRCK---GVNKCRCPPGFLGDYCEIWQRPYIC-PK-PCKQ-GVCS---AARTCACYEGWFGRTC 185 (248)
Q Consensus 123 c~---~~C~~~-g~C~---~~~~C~C~~g~~g~~C~~~~~~~~c-~~-~C~~-g~C~---~~~~C~C~~g~~G~~C 185 (248)
+. ..+... +.|. +.+.|.|.+||++..+....+.+.| .. .|.+ ++|. +.+.|.|++||+|..|
T Consensus 231 c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 231 CEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred cccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence 11 123322 5665 3468999999999874222234455 22 3766 8888 3499999999999998
No 11
>KOG1214|consensus
Probab=98.19 E-value=5.6e-06 Score=76.33 Aligned_cols=126 Identities=29% Similarity=0.762 Sum_probs=80.7
Q ss_pred CCCCCeecC----CCceeeCCCCcc--CCC-CcCCCC---CCCCCCceec---CCCceeCCCCcc--c--CCCCccCCC-
Q psy9568 57 CLNGGKCVQ----KDTCECQKGFYG--LRC-EFSKCI---IPCLNGGRCK---GVNKCRCPPGFL--G--DYCEIWQRP- 118 (248)
Q Consensus 57 C~~~g~C~~----~~~C~C~~g~~G--~~C-~~~~C~---~~C~~~g~C~---~~~~C~C~~g~~--g--~~C~~~~~~- 118 (248)
|..++.|.. .++|.|..||.| ..| +.++|. ..|..+..|+ ++++|.|..+|. + ..|-.....
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pa 781 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPA 781 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCC
Confidence 445556665 358999999975 556 344554 4588888897 678888888875 2 234322111
Q ss_pred CCCCCC---CCCCCCcee--e----CCCeeeCCCCCCCCCCCcCCCCCCC-CCCCC-CCccC---CCCeeecCCCCccC
Q psy9568 119 YISKCI---IPCLNGGRC--K----GVNKCRCPPGFLGDYCEIWQRPYIC-PKPCK-QGVCS---AARTCACYEGWFGR 183 (248)
Q Consensus 119 ~~~~c~---~~C~~~g~C--~----~~~~C~C~~g~~g~~C~~~~~~~~c-~~~C~-~g~C~---~~~~C~C~~g~~G~ 183 (248)
....|. ..|.-.|++ + +.|.|.|.|||.|+.-. ..+.++| ++.|. +.+|. +++.|.|.+||.|.
T Consensus 782 p~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 782 PANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-CTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred CCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccc-cccccccCccccCCCceEecCCCcceeecccCccCC
Confidence 112221 235444443 3 45789999999986422 1234677 77786 48887 78999999999974
No 12
>KOG1214|consensus
Probab=98.12 E-value=1.1e-05 Score=74.37 Aligned_cols=120 Identities=32% Similarity=0.785 Sum_probs=80.5
Q ss_pred CCCCCEeecC----CceeeCCCCcC--CCCCc-CC---CCCCCCCCCeecC---CCceeeCCCCc--c--CCCC------
Q psy9568 25 CLNGGKCVQK----DTCECQKGFYG--LRCEF-CI---CTEKCLNGGKCVQ---KDTCECQKGFY--G--LRCE------ 81 (248)
Q Consensus 25 C~~~g~C~~~----~~C~C~~G~~G--~~C~~-~~---c~~~C~~~g~C~~---~~~C~C~~g~~--G--~~C~------ 81 (248)
|..++.|.+. ++|.|..||.| +.|.. .. +...|..+..|++ +++|+|..||. + -.|-
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pa 781 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPA 781 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCC
Confidence 4455667652 58999999985 56652 22 3467999999998 56888888864 3 3442
Q ss_pred -cCCCC---CCCCCCceec------CCCceeCCCCcccC--CCCccCCCCCCCCC-CCCCCCceee---CCCeeeCCCCC
Q psy9568 82 -FSKCI---IPCLNGGRCK------GVNKCRCPPGFLGD--YCEIWQRPYISKCI-IPCLNGGRCK---GVNKCRCPPGF 145 (248)
Q Consensus 82 -~~~C~---~~C~~~g~C~------~~~~C~C~~g~~g~--~C~~~~~~~~~~c~-~~C~~~g~C~---~~~~C~C~~g~ 145 (248)
.+.|. ..|...|.+. +.|.|.|.+||.|+ .|... ++|. ..|...++|. +++.|.|.+||
T Consensus 782 p~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dv-----DeC~psrChp~A~CyntpgsfsC~C~pGy 856 (1289)
T KOG1214|consen 782 PANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDV-----DECSPSRCHPAATCYNTPGSFSCRCQPGY 856 (1289)
T ss_pred CCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccc-----cccCccccCCCceEecCCCcceeecccCc
Confidence 12232 2244444443 45899999999986 34332 3343 4688888887 56899999999
Q ss_pred CCCC
Q psy9568 146 LGDY 149 (248)
Q Consensus 146 ~g~~ 149 (248)
.|+.
T Consensus 857 ~GDG 860 (1289)
T KOG1214|consen 857 YGDG 860 (1289)
T ss_pred cCCC
Confidence 9863
No 13
>KOG4260|consensus
Probab=98.08 E-value=5.6e-06 Score=67.06 Aligned_cols=133 Identities=29% Similarity=0.788 Sum_probs=83.1
Q ss_pred eeCCCCcCCCCCcCCC--CCCCCCCCeecC------CCceeeCCCCccCCCCc----------C-------CCCCCCCCC
Q psy9568 38 ECQKGFYGLRCEFCIC--TEKCLNGGKCVQ------KDTCECQKGFYGLRCEF----------S-------KCIIPCLNG 92 (248)
Q Consensus 38 ~C~~G~~G~~C~~~~c--~~~C~~~g~C~~------~~~C~C~~g~~G~~C~~----------~-------~C~~~C~~~ 92 (248)
-|++|-+|++|..+.- ..+|.++|.|.. ++.|.|.+||.|..|.. + .|..+|.
T Consensus 131 CCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~-- 208 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL-- 208 (350)
T ss_pred ccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--
Confidence 4889999999986642 356888888875 56899999999999851 1 1112333
Q ss_pred ceecC--CCce-eCCCCcccCCCCccCCCCCCCCC---CCCCCCceee---CCCeeeCCCCCCCC--CCCcCCCCCCCCC
Q psy9568 93 GRCKG--VNKC-RCPPGFLGDYCEIWQRPYISKCI---IPCLNGGRCK---GVNKCRCPPGFLGD--YCEIWQRPYICPK 161 (248)
Q Consensus 93 g~C~~--~~~C-~C~~g~~g~~C~~~~~~~~~~c~---~~C~~~g~C~---~~~~C~C~~g~~g~--~C~~~~~~~~c~~ 161 (248)
+.|.+ +-.| .|..||.-+. .....+++|. .+|..+-.|+ ++|.|...+||.+. .|+. |.+
T Consensus 209 ~~Csg~~~k~C~kCkkGW~lde---~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~------~~d 279 (350)
T KOG4260|consen 209 GVCSGESSKGCSKCKKGWKLDE---EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQF------CAD 279 (350)
T ss_pred cccCCCCCCChhhhcccceecc---cccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhh------hhh
Confidence 24542 2344 4888997541 1112223332 3566556666 67899999999763 2222 122
Q ss_pred CCC--CCccC---CCCeeecCCCCc
Q psy9568 162 PCK--QGVCS---AARTCACYEGWF 181 (248)
Q Consensus 162 ~C~--~g~C~---~~~~C~C~~g~~ 181 (248)
.|. |+.|. +.|+|.|..|+.
T Consensus 280 ~~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 280 VCASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred hcccCCCCcccCCccEEEEecccce
Confidence 232 35554 578999998876
No 14
>KOG0994|consensus
Probab=97.86 E-value=6.5e-05 Score=71.62 Aligned_cols=94 Identities=33% Similarity=0.809 Sum_probs=57.2
Q ss_pred ceec-CCCceeCCCCcccCCCCccCCCC-----CCCCCCCC----CCCceeeC-CCeeeCCCCCCCCCCCcCCC-----C
Q psy9568 93 GRCK-GVNKCRCPPGFLGDYCEIWQRPY-----ISKCIIPC----LNGGRCKG-VNKCRCPPGFLGDYCEIWQR-----P 156 (248)
Q Consensus 93 g~C~-~~~~C~C~~g~~g~~C~~~~~~~-----~~~c~~~C----~~~g~C~~-~~~C~C~~g~~g~~C~~~~~-----~ 156 (248)
+.|+ .+++|.|.+...|..|+...... ...|. +| ..+..|.. .++|+|.|||-|..|+...+ +
T Consensus 1030 ~~CDr~tGQCpClpNv~G~~CDqCA~N~w~laSG~GCe-~C~Cd~~~~pqCN~ftGQCqCkpGfGGR~C~qCqel~WGdP 1108 (1758)
T KOG0994|consen 1030 CHCDRFTGQCPCLPNVQGVRCDQCAENHWNLASGEGCE-PCNCDPIGGPQCNEFTGQCQCKPGFGGRTCSQCQELYWGDP 1108 (1758)
T ss_pred cccccccCcCCCCcccccccccccccchhccccCCCCC-ccCCCccCCccccccccceeccCCCCCcchhHHHHhhcCCC
Confidence 3444 45799999999999887533211 11111 11 22334542 36899999999988876211 1
Q ss_pred -CCC-CCCCCC-C----ccC-CCCeeecCCCCccCCCCC
Q psy9568 157 -YIC-PKPCKQ-G----VCS-AARTCACYEGWFGRTCSQ 187 (248)
Q Consensus 157 -~~c-~~~C~~-g----~C~-~~~~C~C~~g~~G~~C~~ 187 (248)
..| .-.|.. | .|. .+++|.|.+|..|..|++
T Consensus 1109 ~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCdq 1147 (1758)
T KOG0994|consen 1109 NEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCDQ 1147 (1758)
T ss_pred CCCceecCCCCCCCCCCCccccCCceeecCCCCCcchhh
Confidence 112 222321 2 455 688999999999998864
No 15
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.80 E-value=2.6e-05 Score=42.76 Aligned_cols=25 Identities=36% Similarity=0.996 Sum_probs=18.6
Q ss_pred CCCCCCEeecC-CceeeCCCCcCCCC
Q psy9568 24 KCLNGGKCVQK-DTCECQKGFYGLRC 48 (248)
Q Consensus 24 ~C~~~g~C~~~-~~C~C~~G~~G~~C 48 (248)
.|+++|+|+.. +.|.|++||+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 57777888754 77888888887765
No 16
>KOG4260|consensus
Probab=97.75 E-value=2.7e-05 Score=63.16 Aligned_cols=107 Identities=36% Similarity=0.866 Sum_probs=69.3
Q ss_pred eCCCCccCCCCcCCCC----CCCCCCceec------CCCceeCCCCcccCCCCccCCCCC----C-------CCCCCCCC
Q psy9568 71 CQKGFYGLRCEFSKCI----IPCLNGGRCK------GVNKCRCPPGFLGDYCEIWQRPYI----S-------KCIIPCLN 129 (248)
Q Consensus 71 C~~g~~G~~C~~~~C~----~~C~~~g~C~------~~~~C~C~~g~~g~~C~~~~~~~~----~-------~c~~~C~~ 129 (248)
|++|-+|+.|. .|+ .+|..+|.|. +++.|.|.+||.|+.|..-.+.+- + .|...|.
T Consensus 132 Cp~gtyGpdCl--~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~- 208 (350)
T KOG4260|consen 132 CPDGTYGPDCL--QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL- 208 (350)
T ss_pred cCCCCcCCccc--cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh-
Confidence 78899999997 343 6788889997 468999999999998864322110 0 0111222
Q ss_pred CceeeCC--Cee-eCCCCCCCCCCCcCCCCCCC---CCCCCC-CccC---CCCeeecCCCCcc
Q psy9568 130 GGRCKGV--NKC-RCPPGFLGDYCEIWQRPYIC---PKPCKQ-GVCS---AARTCACYEGWFG 182 (248)
Q Consensus 130 ~g~C~~~--~~C-~C~~g~~g~~C~~~~~~~~c---~~~C~~-g~C~---~~~~C~C~~g~~G 182 (248)
+.|.+. -.| .|..||.-.. ...+++++| +.+|.. -.|+ ++|+|.+.+||.+
T Consensus 209 -~~Csg~~~k~C~kCkkGW~lde-~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 209 -GVCSGESSKGCSKCKKGWKLDE-EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred -cccCCCCCCChhhhcccceecc-cccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 234421 234 6888996651 223456666 455543 4565 7999999999986
No 17
>KOG0994|consensus
Probab=97.49 E-value=0.00092 Score=64.15 Aligned_cols=54 Identities=30% Similarity=0.826 Sum_probs=31.1
Q ss_pred CCEeec-CCceeeCCCCcCCCCCcCCC----------C-CCCCCCC----eecC-CCceeeCCCCccCCCC
Q psy9568 28 GGKCVQ-KDTCECQKGFYGLRCEFCIC----------T-EKCLNGG----KCVQ-KDTCECQKGFYGLRCE 81 (248)
Q Consensus 28 ~g~C~~-~~~C~C~~G~~G~~C~~~~c----------~-~~C~~~g----~C~~-~~~C~C~~g~~G~~C~ 81 (248)
.++|.. .|+|.|.|+..|+.|+.+.- . -.|..-| .|.. +++|.|.+|-+|..|+
T Consensus 774 S~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCn 844 (1758)
T KOG0994|consen 774 SSVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCN 844 (1758)
T ss_pred cccccCCCceecccCccccccccccCCcccCcCCccCccccccccccccccccccccceeeccccchhhcc
Confidence 345653 47899999988888874321 0 0122222 2222 5677777777776664
No 18
>smart00051 DSL delta serrate ligand.
Probab=97.29 E-value=0.00035 Score=44.74 Aligned_cols=43 Identities=23% Similarity=0.676 Sum_probs=26.0
Q ss_pred eeCCCCcCCCCCc-CCCCCCCCCCCeecCCCceeeCCCCccCCC
Q psy9568 38 ECQKGFYGLRCEF-CICTEKCLNGGKCVQKDTCECQKGFYGLRC 80 (248)
Q Consensus 38 ~C~~G~~G~~C~~-~~c~~~C~~~g~C~~~~~C~C~~g~~G~~C 80 (248)
.|+++|+|..|+. +.+.....++..|...+.+.|.+||.|..|
T Consensus 20 ~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 20 TCDENYYGEGCNKFCRPRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred eCCCCCcCCccCCEeCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence 5667777777752 112223455666666666777777777654
No 19
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.25 E-value=0.00034 Score=38.29 Aligned_cols=26 Identities=35% Similarity=0.970 Sum_probs=21.2
Q ss_pred CCCCCCCeecCC-CceeeCCCCccCCC
Q psy9568 55 EKCLNGGKCVQK-DTCECQKGFYGLRC 80 (248)
Q Consensus 55 ~~C~~~g~C~~~-~~C~C~~g~~G~~C 80 (248)
..|+++|+|+.. ++|.|.+||+|..|
T Consensus 6 ~~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 358888999874 88999999988775
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.98 E-value=0.00027 Score=38.78 Aligned_cols=25 Identities=48% Similarity=1.148 Sum_probs=18.3
Q ss_pred cCCCCCCEeec----CCceeeCCCCcCCC
Q psy9568 23 EKCLNGGKCVQ----KDTCECQKGFYGLR 47 (248)
Q Consensus 23 ~~C~~~g~C~~----~~~C~C~~G~~G~~ 47 (248)
++|.++|.|++ .+.|.|++||+|++
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 37888888874 25788888888763
No 21
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.89 E-value=0.00039 Score=29.90 Aligned_cols=12 Identities=42% Similarity=1.340 Sum_probs=7.1
Q ss_pred eeeCCCCcCCCC
Q psy9568 37 CECQKGFYGLRC 48 (248)
Q Consensus 37 C~C~~G~~G~~C 48 (248)
|+|++||+|++|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 666666666655
No 22
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.85 E-value=0.00053 Score=29.48 Aligned_cols=13 Identities=54% Similarity=1.485 Sum_probs=8.5
Q ss_pred eeecCCCCccCCC
Q psy9568 173 TCACYEGWFGRTC 185 (248)
Q Consensus 173 ~C~C~~g~~G~~C 185 (248)
+|.|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4677777777765
No 23
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.73 E-value=0.00053 Score=37.60 Aligned_cols=22 Identities=41% Similarity=1.158 Sum_probs=11.7
Q ss_pred CCCC-CccC----CCCeeecCCCCccC
Q psy9568 162 PCKQ-GVCS----AARTCACYEGWFGR 183 (248)
Q Consensus 162 ~C~~-g~C~----~~~~C~C~~g~~G~ 183 (248)
+|.| |+|+ ..|.|.|++||+|.
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4444 4554 34566666666654
No 24
>smart00051 DSL delta serrate ligand.
Probab=96.58 E-value=0.0035 Score=40.11 Aligned_cols=45 Identities=24% Similarity=0.670 Sum_probs=29.5
Q ss_pred eeeCCCCCCCCCCCcCCCCCCCCCCCC-CCccCCCCeeecCCCCccCCC
Q psy9568 138 KCRCPPGFLGDYCEIWQRPYICPKPCK-QGVCSAARTCACYEGWFGRTC 185 (248)
Q Consensus 138 ~C~C~~g~~g~~C~~~~~~~~c~~~C~-~g~C~~~~~C~C~~g~~G~~C 185 (248)
+-.|.++|.|..|+.... ..+... +..|...+.+.|.+||+|..|
T Consensus 18 rv~C~~~~yG~~C~~~C~---~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCNKFCR---PRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccCCEeC---cCccccCCccCCcCCCEecCCCCcCCCC
Confidence 456888888888875322 112222 356766677888899988776
No 25
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.27 E-value=0.0059 Score=34.60 Aligned_cols=25 Identities=48% Similarity=1.210 Sum_probs=16.6
Q ss_pred CCCCCCEeecC---CceeeCCCCc-CCCC
Q psy9568 24 KCLNGGKCVQK---DTCECQKGFY-GLRC 48 (248)
Q Consensus 24 ~C~~~g~C~~~---~~C~C~~G~~-G~~C 48 (248)
+|.++|.|++. +.|.|++||. |..|
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 57666777642 4677777777 6655
No 26
>KOG1836|consensus
Probab=96.13 E-value=0.081 Score=54.41 Aligned_cols=16 Identities=38% Similarity=0.991 Sum_probs=13.7
Q ss_pred CceeeCCCCcCCCCCc
Q psy9568 35 DTCECQKGFYGLRCEF 50 (248)
Q Consensus 35 ~~C~C~~G~~G~~C~~ 50 (248)
..|.|+.||+|..|+.
T Consensus 695 e~c~C~~g~tG~~Ce~ 710 (1705)
T KOG1836|consen 695 EQCTCPVGYTGQFCES 710 (1705)
T ss_pred hhccCCCCcccchhhh
Confidence 3599999999999973
No 27
>KOG1218|consensus
Probab=96.06 E-value=0.23 Score=42.21 Aligned_cols=151 Identities=27% Similarity=0.621 Sum_probs=72.9
Q ss_pred CCceeeCCCCcCC-CCCcCC----CCCCCCCCCeecCCCceeeCCCCccCCCCcCC-C--C-CCCCCCceec-----CCC
Q psy9568 34 KDTCECQKGFYGL-RCEFCI----CTEKCLNGGKCVQKDTCECQKGFYGLRCEFSK-C--I-IPCLNGGRCK-----GVN 99 (248)
Q Consensus 34 ~~~C~C~~G~~G~-~C~~~~----c~~~C~~~g~C~~~~~C~C~~g~~G~~C~~~~-C--~-~~C~~~g~C~-----~~~ 99 (248)
.+.|.|.++|+|. .+.... +...+.. -.+...|.+..+|.+..|.+.. . . ..|...+.|. ...
T Consensus 14 ~~~c~c~~~~~g~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~~~~~~~~ 90 (316)
T KOG1218|consen 14 SGQCFCDPGYTGRLQCEHQAVTSACSGICPC---EVNSGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKNGGTCVSS 90 (316)
T ss_pred CCceecCCCccccccccCCCCCccccccCCc---cCCceeEecccccCCCccccccccCCCCCcccCccccCCCCcccCC
Confidence 4678888888885 222100 1111111 1224567777888887775321 0 0 1122222222 112
Q ss_pred ceeC-CCCcccCCCCccCCCCCCCCCCCCCCCceeeCCC-eeeCCCCCCCCCCCc-CCCCCCCCCCCCC-Ccc-CCCCee
Q psy9568 100 KCRC-PPGFLGDYCEIWQRPYISKCIIPCLNGGRCKGVN-KCRCPPGFLGDYCEI-WQRPYICPKPCKQ-GVC-SAARTC 174 (248)
Q Consensus 100 ~C~C-~~g~~g~~C~~~~~~~~~~c~~~C~~~g~C~~~~-~C~C~~g~~g~~C~~-~~~~~~c~~~C~~-g~C-~~~~~C 174 (248)
...+ ..+|.|..|+...... ..|.. .+|.... .|.+..+|.+..|.. ......|...|.+ ..+ .....|
T Consensus 91 ~~~~~~~~~~g~~C~~~~~~~-----~~c~~-~~C~~~~~~c~~~~~~~~~~C~~~~~~g~~C~~~c~~~~~~~~~~~~c 164 (316)
T KOG1218|consen 91 TGYCHLNGYEGPQCESPCPCG-----DGCAE-KTCANPRRECRCGGGYIGEQCGEENLVGLKCQRDCQCTGGCDCKNGIC 164 (316)
T ss_pred CCcccCCCCCcccccCCCCcC-----Ccccc-cccCCCccceecCCcCccccccccCCCCCCccCCCCCccccCCCCCce
Confidence 2233 5677777776532110 11222 3344323 366666666666655 2222334333422 222 256778
Q ss_pred ecCCCCccCCCCCCCCCCC
Q psy9568 175 ACYEGWFGRTCSQRSDSGE 193 (248)
Q Consensus 175 ~C~~g~~G~~C~~~~~~~~ 193 (248)
.|.+||.|..+......+.
T Consensus 165 ~c~~g~~g~~~~~~~~~c~ 183 (316)
T KOG1218|consen 165 TCQPGFVGVFCVESCSGCS 183 (316)
T ss_pred eccCCcccccccccCCCcC
Confidence 8888888888766544333
No 28
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.86 E-value=0.011 Score=33.50 Aligned_cols=25 Identities=40% Similarity=1.198 Sum_probs=15.9
Q ss_pred CCCCC-CccC---CCCeeecCCCCc-cCCC
Q psy9568 161 KPCKQ-GVCS---AARTCACYEGWF-GRTC 185 (248)
Q Consensus 161 ~~C~~-g~C~---~~~~C~C~~g~~-G~~C 185 (248)
.+|.+ +.|. ++|.|.|++||+ |..|
T Consensus 9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 34544 4565 566777777777 6665
No 29
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.84 E-value=0.012 Score=32.82 Aligned_cols=25 Identities=44% Similarity=1.165 Sum_probs=16.3
Q ss_pred CCCCCCEeecC---CceeeCCCCcCCCC
Q psy9568 24 KCLNGGKCVQK---DTCECQKGFYGLRC 48 (248)
Q Consensus 24 ~C~~~g~C~~~---~~C~C~~G~~G~~C 48 (248)
+|.+++.|++. +.|.|++||.|..|
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 56666777642 46777777777655
No 30
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.39 E-value=0.019 Score=31.44 Aligned_cols=26 Identities=42% Similarity=1.069 Sum_probs=19.2
Q ss_pred cCCCCCCEeec---CCceeeCCCCcCC-CC
Q psy9568 23 EKCLNGGKCVQ---KDTCECQKGFYGL-RC 48 (248)
Q Consensus 23 ~~C~~~g~C~~---~~~C~C~~G~~G~-~C 48 (248)
.+|.+++.|++ .+.|.|+.||.|. .|
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 46777788876 3678888888877 54
No 31
>KOG1218|consensus
Probab=95.12 E-value=0.53 Score=39.97 Aligned_cols=144 Identities=28% Similarity=0.647 Sum_probs=71.7
Q ss_pred CceeeCCCCcCCCCCcCCC----CCCCCCCCeecC-----CCceee-CCCCccCCCCcC-CCCCCCCCCceecCCC-cee
Q psy9568 35 DTCECQKGFYGLRCEFCIC----TEKCLNGGKCVQ-----KDTCEC-QKGFYGLRCEFS-KCIIPCLNGGRCKGVN-KCR 102 (248)
Q Consensus 35 ~~C~C~~G~~G~~C~~~~c----~~~C~~~g~C~~-----~~~C~C-~~g~~G~~C~~~-~C~~~C~~~g~C~~~~-~C~ 102 (248)
+.|.+..+|.|..|.+..- ...|...+.|.. .....+ ..+|.|..|+.+ .|...|.. -+|.... .|.
T Consensus 49 ~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~g~~C~~~~~~~~~c~~-~~C~~~~~~c~ 127 (316)
T KOG1218|consen 49 GECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKNGGTCVSSTGYCHLNGYEGPQCESPCPCGDGCAE-KTCANPRRECR 127 (316)
T ss_pred eeEecccccCCCccccccccCCCCCcccCccccCCCCcccCCCCcccCCCCCcccccCCCCcCCcccc-cccCCCcccee
Confidence 5678888888888764221 112333333332 123344 578888888642 22211222 3444333 466
Q ss_pred CCCCcccCCCCccCCCCCCCCCCCCCCCceee-CCCeeeCCCCCCCCCCCcCCCCCCCCCCCCC-CccC-CCCeeecCCC
Q psy9568 103 CPPGFLGDYCEIWQRPYISKCIIPCLNGGRCK-GVNKCRCPPGFLGDYCEIWQRPYICPKPCKQ-GVCS-AARTCACYEG 179 (248)
Q Consensus 103 C~~g~~g~~C~~~~~~~~~~c~~~C~~~g~C~-~~~~C~C~~g~~g~~C~~~~~~~~c~~~C~~-g~C~-~~~~C~C~~g 179 (248)
+..+|.+..|... ......|...|.+...+. ....|.|.+||.|..+........-...|.+ +.|. ....+.+.++
T Consensus 128 ~~~~~~~~~C~~~-~~~g~~C~~~c~~~~~~~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~~~~~~~~ 206 (316)
T KOG1218|consen 128 CGGGYIGEQCGEE-NLVGLKCQRDCQCTGGCDCKNGICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTGSCLCYPG 206 (316)
T ss_pred cCCcCcccccccc-CCCCCCccCCCCCccccCCCCCceeccCCcccccccccCCCcCCCcccCCCCeeeccccccccCCC
Confidence 6677777666641 111122222332222222 3467889999999888764332111344445 3665 3334444444
Q ss_pred C
Q psy9568 180 W 180 (248)
Q Consensus 180 ~ 180 (248)
+
T Consensus 207 ~ 207 (316)
T KOG1218|consen 207 P 207 (316)
T ss_pred C
Confidence 4
No 32
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=94.75 E-value=0.038 Score=30.69 Aligned_cols=24 Identities=42% Similarity=1.252 Sum_probs=14.9
Q ss_pred CCCC-CccC---CCCeeecCCCCccCCC
Q psy9568 162 PCKQ-GVCS---AARTCACYEGWFGRTC 185 (248)
Q Consensus 162 ~C~~-g~C~---~~~~C~C~~g~~G~~C 185 (248)
+|.+ +.|. +.+.|.|++||.|..|
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 4443 4554 4567777777777665
No 33
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.51 E-value=0.032 Score=28.24 Aligned_cols=11 Identities=36% Similarity=0.920 Sum_probs=7.0
Q ss_pred CCeeecCCCCc
Q psy9568 171 ARTCACYEGWF 181 (248)
Q Consensus 171 ~~~C~C~~g~~ 181 (248)
+|+|.|++||.
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 35666666665
No 34
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.46 E-value=0.014 Score=32.83 Aligned_cols=23 Identities=30% Similarity=0.811 Sum_probs=16.2
Q ss_pred CCCCCCEeecC---CceeeCCCCcCC
Q psy9568 24 KCLNGGKCVQK---DTCECQKGFYGL 46 (248)
Q Consensus 24 ~C~~~g~C~~~---~~C~C~~G~~G~ 46 (248)
.|+.++.|++. +.|.|++||.|+
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC
Confidence 68888899863 678888888875
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.29 E-value=0.06 Score=29.61 Aligned_cols=24 Identities=46% Similarity=1.211 Sum_probs=14.5
Q ss_pred CCCCCCEeec---CCceeeCCCCcC-CCC
Q psy9568 24 KCLNGGKCVQ---KDTCECQKGFYG-LRC 48 (248)
Q Consensus 24 ~C~~~g~C~~---~~~C~C~~G~~G-~~C 48 (248)
+|.++ .|++ .+.|.|++||.| ..|
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 46555 6664 256777777776 544
No 36
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=94.07 E-value=0.062 Score=32.57 Aligned_cols=15 Identities=40% Similarity=1.048 Sum_probs=8.8
Q ss_pred CceeeCCCCcCCCCC
Q psy9568 35 DTCECQKGFYGLRCE 49 (248)
Q Consensus 35 ~~C~C~~G~~G~~C~ 49 (248)
++|.|.++|+|..|+
T Consensus 19 G~C~C~~~~~G~~C~ 33 (50)
T cd00055 19 GQCECKPNTTGRRCD 33 (50)
T ss_pred CEEeCCCcCCCCCCC
Confidence 556666666665554
No 37
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=93.37 E-value=0.026 Score=36.12 Aligned_cols=44 Identities=23% Similarity=0.658 Sum_probs=16.8
Q ss_pred ceeeCCCCcCCCCCcCCCCC--CCCCCCeecCCCceeeCCCCccCCC
Q psy9568 36 TCECQKGFYGLRCEFCICTE--KCLNGGKCVQKDTCECQKGFYGLRC 80 (248)
Q Consensus 36 ~C~C~~G~~G~~C~~~~c~~--~C~~~g~C~~~~~C~C~~g~~G~~C 80 (248)
.-.|.+.|+|+.|.. .|.+ .-..+-+|...+.-.|.+||.|+.|
T Consensus 18 rv~C~~nyyG~~C~~-~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 18 RVVCDENYYGPNCSK-FCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp -----TTEETTTT-E-E---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEECCCCCCCccccC-CcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 445666677776652 1211 1123345555555666666666554
No 38
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=93.34 E-value=0.1 Score=28.36 Aligned_cols=23 Identities=43% Similarity=1.032 Sum_probs=14.7
Q ss_pred CCCCCCeecC---CCceeeCCCCccC
Q psy9568 56 KCLNGGKCVQ---KDTCECQKGFYGL 78 (248)
Q Consensus 56 ~C~~~g~C~~---~~~C~C~~g~~G~ 78 (248)
+|.+++.|++ .+.|.|+.||.|.
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCccc
Confidence 4555666665 4567777777666
No 39
>KOG1836|consensus
Probab=93.17 E-value=0.19 Score=51.78 Aligned_cols=14 Identities=43% Similarity=1.134 Sum_probs=12.3
Q ss_pred ceeeCCCCccCCCC
Q psy9568 68 TCECQKGFYGLRCE 81 (248)
Q Consensus 68 ~C~C~~g~~G~~C~ 81 (248)
.|.|+.||.|..|+
T Consensus 696 ~c~C~~g~tG~~Ce 709 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCE 709 (1705)
T ss_pred hccCCCCcccchhh
Confidence 58999999999987
No 40
>smart00181 EGF Epidermal growth factor-like domain.
Probab=93.07 E-value=0.088 Score=28.91 Aligned_cols=16 Identities=38% Similarity=1.080 Sum_probs=9.5
Q ss_pred CCCeeecCCCCcc-CCC
Q psy9568 170 AARTCACYEGWFG-RTC 185 (248)
Q Consensus 170 ~~~~C~C~~g~~G-~~C 185 (248)
+.+.|.|++||.| ..|
T Consensus 18 ~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 18 GSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCeEeECCCCCccCCcc
Confidence 3566666666666 443
No 41
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=92.88 E-value=0.044 Score=31.89 Aligned_cols=21 Identities=29% Similarity=0.904 Sum_probs=16.5
Q ss_pred CCCC-CCccC---CCCeeecCCCCc
Q psy9568 161 KPCK-QGVCS---AARTCACYEGWF 181 (248)
Q Consensus 161 ~~C~-~g~C~---~~~~C~C~~g~~ 181 (248)
..|. ++.|+ ++|.|.|++||.
T Consensus 10 ~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 10 HNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4565 37777 789999999998
No 42
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=92.83 E-value=0.062 Score=32.35 Aligned_cols=20 Identities=35% Similarity=0.939 Sum_probs=11.9
Q ss_pred Eeec-CCceeeCCCCcCCCCC
Q psy9568 30 KCVQ-KDTCECQKGFYGLRCE 49 (248)
Q Consensus 30 ~C~~-~~~C~C~~G~~G~~C~ 49 (248)
.|.. +++|.|.++|+|+.|+
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 12 TCDPSTGQCVCKPGTTGPRCD 32 (49)
T ss_dssp SEEETCEEESBSTTEESTTS-
T ss_pred cccCCCCEEeccccccCCcCc
Confidence 4553 3566666666666665
No 43
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=92.61 E-value=0.072 Score=30.97 Aligned_cols=21 Identities=33% Similarity=1.012 Sum_probs=12.6
Q ss_pred CCCCCCEeecC---CceeeCCCCc
Q psy9568 24 KCLNGGKCVQK---DTCECQKGFY 44 (248)
Q Consensus 24 ~C~~~g~C~~~---~~C~C~~G~~ 44 (248)
.|..++.|+++ |.|.|++||+
T Consensus 11 ~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 11 NCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSTTSEEEEETTEEEEEESTTEE
T ss_pred cCCCCCEEEcCCCCEEeeCCCCcE
Confidence 45556666652 5666666665
No 44
>PHA02887 EGF-like protein; Provisional
Probab=92.29 E-value=0.094 Score=37.34 Aligned_cols=28 Identities=29% Similarity=0.822 Sum_probs=22.0
Q ss_pred CCCCCCCccC-----CCCeeecCCCCccCCCCC
Q psy9568 160 PKPCKQGVCS-----AARTCACYEGWFGRTCSQ 187 (248)
Q Consensus 160 ~~~C~~g~C~-----~~~~C~C~~g~~G~~C~~ 187 (248)
.+-|.+|+|. ....|.|++||+|.+|+.
T Consensus 91 k~YCiHG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 91 NDFCINGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred hCEeeCCEEEccccCCCceeECCCCcccCCCCc
Confidence 3557778886 567899999999999975
No 45
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=91.94 E-value=0.17 Score=30.01 Aligned_cols=15 Identities=40% Similarity=1.099 Sum_probs=8.3
Q ss_pred CceeeCCCCcCCCCC
Q psy9568 35 DTCECQKGFYGLRCE 49 (248)
Q Consensus 35 ~~C~C~~G~~G~~C~ 49 (248)
++|.|+++|+|+.|+
T Consensus 18 G~C~C~~~~~G~~C~ 32 (46)
T smart00180 18 GQCECKPNVTGRRCD 32 (46)
T ss_pred CEEECCCCCCCCCCC
Confidence 455555555555554
No 46
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=90.99 E-value=0.14 Score=37.17 Aligned_cols=29 Identities=28% Similarity=0.690 Sum_probs=23.5
Q ss_pred CCCCCCCccC-----CCCeeecCCCCccCCCCCC
Q psy9568 160 PKPCKQGVCS-----AARTCACYEGWFGRTCSQR 188 (248)
Q Consensus 160 ~~~C~~g~C~-----~~~~C~C~~g~~G~~C~~~ 188 (248)
.+.|.||+|. ..+.|.|..||+|.+|+..
T Consensus 50 ~~YClHG~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 50 DGYCLHGDCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred CCEeECCEEEeeccCCCceeECCCCcccccccce
Confidence 3557778886 6789999999999999754
No 47
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=90.98 E-value=0.066 Score=34.23 Aligned_cols=15 Identities=20% Similarity=0.680 Sum_probs=4.8
Q ss_pred ceeCCCCcccCCCCc
Q psy9568 100 KCRCPPGFLGDYCEI 114 (248)
Q Consensus 100 ~C~C~~g~~g~~C~~ 114 (248)
+-.|...|.|+.|..
T Consensus 18 rv~C~~nyyG~~C~~ 32 (63)
T PF01414_consen 18 RVVCDENYYGPNCSK 32 (63)
T ss_dssp -----TTEETTTT-E
T ss_pred EEECCCCCCCccccC
Confidence 334555555555543
No 48
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=90.46 E-value=0.047 Score=42.37 Aligned_cols=79 Identities=30% Similarity=0.690 Sum_probs=43.0
Q ss_pred CEeec---CCceeeCCCCcC---CCCCcC-CCC------CCCCCCCeecC--------CCceeeCCCCcc--CCCCcCCC
Q psy9568 29 GKCVQ---KDTCECQKGFYG---LRCEFC-ICT------EKCLNGGKCVQ--------KDTCECQKGFYG--LRCEFSKC 85 (248)
Q Consensus 29 g~C~~---~~~C~C~~G~~G---~~C~~~-~c~------~~C~~~g~C~~--------~~~C~C~~g~~G--~~C~~~~C 85 (248)
|..++ .++|.|.+||.. ..|+.- .|. -+|...+.|.+ .+.|.|.+||.- ..|....|
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C 90 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKC 90 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGG
T ss_pred CEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhc
Confidence 55553 378999999963 344421 132 24666677764 347888888853 34444455
Q ss_pred C-CCCCCCceec------CCCceeCCCCcc
Q psy9568 86 I-IPCLNGGRCK------GVNKCRCPPGFL 108 (248)
Q Consensus 86 ~-~~C~~~g~C~------~~~~C~C~~g~~ 108 (248)
. ..|. .|.|+ ....|+|.-|+.
T Consensus 91 ~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV 119 (197)
T PF06247_consen 91 NNKDCG-SGKCILDPDNPNNPTCSCNIGKV 119 (197)
T ss_dssp SS---T-TEEEEEEEGGGSEEEEEE-TEEE
T ss_pred CceecC-CCeEEecCCCCCCceeEeeeceE
Confidence 4 3355 46675 113677777766
No 49
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=89.64 E-value=0.13 Score=28.84 Aligned_cols=23 Identities=30% Similarity=0.811 Sum_probs=14.8
Q ss_pred CCCCCCeecC---CCceeeCCCCccC
Q psy9568 56 KCLNGGKCVQ---KDTCECQKGFYGL 78 (248)
Q Consensus 56 ~C~~~g~C~~---~~~C~C~~g~~G~ 78 (248)
.|+.+++|.+ .+.|.|.+||.|+
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC
Confidence 4667777776 4578888888764
No 50
>PHA02887 EGF-like protein; Provisional
Probab=85.57 E-value=0.73 Score=32.92 Aligned_cols=23 Identities=39% Similarity=1.159 Sum_probs=14.9
Q ss_pred CCEee-----cCCceeeCCCCcCCCCCc
Q psy9568 28 GGKCV-----QKDTCECQKGFYGLRCEF 50 (248)
Q Consensus 28 ~g~C~-----~~~~C~C~~G~~G~~C~~ 50 (248)
||.|. +...|.|+.||+|.+|+.
T Consensus 96 HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 96 NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred CCEEEccccCCCceeECCCCcccCCCCc
Confidence 46664 124677777777777763
No 51
>KOG3607|consensus
Probab=85.48 E-value=0.65 Score=44.17 Aligned_cols=54 Identities=24% Similarity=0.643 Sum_probs=38.2
Q ss_pred CCCCCCEeecCCceeeCCCCcCCCCCcCCCCCCCCCCCeecCCCceeeCCCCccCCCCcC
Q psy9568 24 KCLNGGKCVQKDTCECQKGFYGLRCEFCICTEKCLNGGKCVQKDTCECQKGFYGLRCEFS 83 (248)
Q Consensus 24 ~C~~~g~C~~~~~C~C~~G~~G~~C~~~~c~~~C~~~g~C~~~~~C~C~~g~~G~~C~~~ 83 (248)
.|..+-+|++ ..|.=.. ..+..+ |...|+++|+|.+...|.|.+||.++.|++.
T Consensus 605 ~Cg~~~vC~~-~~C~~~~-v~~~~~----~~~~C~g~GVCnn~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 605 SCGPGMICIN-HRCLSAS-VLNSSC----CPTTCNGHGVCNNELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred ccCCCceecC-Ccchhhh-hhcccc----cccccCCCcccCCCcceeeCCCCCCCccccc
Confidence 3666667776 4554333 444333 3445999999999999999999999999853
No 52
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=80.57 E-value=1.6 Score=26.19 Aligned_cols=16 Identities=31% Similarity=0.787 Sum_probs=13.4
Q ss_pred CCeeecCCCCccCCCC
Q psy9568 171 ARTCACYEGWFGRTCS 186 (248)
Q Consensus 171 ~~~C~C~~g~~G~~C~ 186 (248)
+++|.|.++|+|..|+
T Consensus 18 ~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 18 TGQCECKPNTTGRRCD 33 (50)
T ss_pred CCEEeCCCcCCCCCCC
Confidence 6788888888888886
No 53
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=80.19 E-value=2.6 Score=25.47 Aligned_cols=29 Identities=38% Similarity=0.973 Sum_probs=16.3
Q ss_pred cCCCCCcCCCCCCCCCCCeecCCCceeeCCCCc
Q psy9568 44 YGLRCEFCICTEKCLNGGKCVQKDTCECQKGFY 76 (248)
Q Consensus 44 ~G~~C~~~~c~~~C~~~g~C~~~~~C~C~~g~~ 76 (248)
.|..|... ..|..+..|++ +.|.|++||.
T Consensus 18 ~g~~C~~~---~qC~~~s~C~~-g~C~C~~g~~ 46 (52)
T PF01683_consen 18 PGESCESD---EQCIGGSVCVN-GRCQCPPGYV 46 (52)
T ss_pred CCCCCCCc---CCCCCcCEEcC-CEeECCCCCE
Confidence 45555532 23445666654 6777777764
No 54
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=79.77 E-value=1.6 Score=31.80 Aligned_cols=15 Identities=40% Similarity=1.252 Sum_probs=10.8
Q ss_pred CceeeCCCCcCCCCC
Q psy9568 35 DTCECQKGFYGLRCE 49 (248)
Q Consensus 35 ~~C~C~~G~~G~~C~ 49 (248)
..|.|..||+|.+|+
T Consensus 67 ~~CrC~~GYtGeRCE 81 (139)
T PHA03099 67 MYCRCSHGYTGIRCQ 81 (139)
T ss_pred ceeECCCCccccccc
Confidence 567777777777776
No 55
>KOG0196|consensus
Probab=72.70 E-value=11 Score=36.26 Aligned_cols=14 Identities=29% Similarity=0.930 Sum_probs=8.8
Q ss_pred CCCceeCCCCcccC
Q psy9568 97 GVNKCRCPPGFLGD 110 (248)
Q Consensus 97 ~~~~C~C~~g~~g~ 110 (248)
++-.|.|..||...
T Consensus 306 ga~~C~C~~gyyRA 319 (996)
T KOG0196|consen 306 GATSCTCENGYYRA 319 (996)
T ss_pred CCCcccccCCcccC
Confidence 34567777777643
No 56
>KOG3607|consensus
Probab=72.67 E-value=4.4 Score=38.75 Aligned_cols=32 Identities=31% Similarity=0.893 Sum_probs=27.1
Q ss_pred CCCCCCCCceecCCCceeCCCCcccCCCCccC
Q psy9568 85 CIIPCLNGGRCKGVNKCRCPPGFLGDYCEIWQ 116 (248)
Q Consensus 85 C~~~C~~~g~C~~~~~C~C~~g~~g~~C~~~~ 116 (248)
|...|..+|+|...+.|.|.+||.+++|+...
T Consensus 628 ~~~~C~g~GVCnn~~~ChC~~gwapp~C~~~~ 659 (716)
T KOG3607|consen 628 CPTTCNGHGVCNNELNCHCEPGWAPPFCFIFG 659 (716)
T ss_pred cccccCCCcccCCCcceeeCCCCCCCcccccc
Confidence 34558999999988999999999999998753
No 57
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=71.69 E-value=1.8 Score=24.23 Aligned_cols=13 Identities=23% Similarity=0.593 Sum_probs=10.2
Q ss_pred CCCeeecCCCCcc
Q psy9568 170 AARTCACYEGWFG 182 (248)
Q Consensus 170 ~~~~C~C~~g~~G 182 (248)
++++|.|++||+-
T Consensus 17 g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 17 GSYRCSCPPGYKL 29 (36)
T ss_dssp TSEEEE-STTEEE
T ss_pred CceEeECCCCCEE
Confidence 5799999999984
No 58
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=70.10 E-value=3.6 Score=22.56 Aligned_cols=10 Identities=30% Similarity=1.039 Sum_probs=6.6
Q ss_pred CceeeCCCCc
Q psy9568 35 DTCECQKGFY 44 (248)
Q Consensus 35 ~~C~C~~G~~ 44 (248)
++|.|+.||.
T Consensus 18 ~~C~CPeGyI 27 (34)
T PF09064_consen 18 GQCFCPEGYI 27 (34)
T ss_pred CceeCCCceE
Confidence 5667777765
No 59
>KOG3512|consensus
Probab=65.94 E-value=29 Score=31.29 Aligned_cols=20 Identities=40% Similarity=0.974 Sum_probs=17.2
Q ss_pred ccC-CCCeeecCCCCccCCCC
Q psy9568 167 VCS-AARTCACYEGWFGRTCS 186 (248)
Q Consensus 167 ~C~-~~~~C~C~~g~~G~~C~ 186 (248)
+|. .+++|.|.+|.+|..|.
T Consensus 408 tCNq~tGqCpCkeGvtG~tCn 428 (592)
T KOG3512|consen 408 TCNQTTGQCPCKEGVTGLTCN 428 (592)
T ss_pred cccccCCcccCCCCCcccccc
Confidence 566 68999999999999885
No 60
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=59.15 E-value=6.9 Score=27.57 Aligned_cols=20 Identities=35% Similarity=0.941 Sum_probs=13.7
Q ss_pred cCCCCCCEeecC--------CceeeCCC
Q psy9568 23 EKCLNGGKCVQK--------DTCECQKG 42 (248)
Q Consensus 23 ~~C~~~g~C~~~--------~~C~C~~G 42 (248)
+.|++||.|.+. +.|.|.+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T 40 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPT 40 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeecc
Confidence 468999999863 34666653
No 61
>KOG3512|consensus
Probab=56.51 E-value=14 Score=33.26 Aligned_cols=16 Identities=38% Similarity=1.103 Sum_probs=10.1
Q ss_pred CCceeeCCCCccCCCC
Q psy9568 66 KDTCECQKGFYGLRCE 81 (248)
Q Consensus 66 ~~~C~C~~g~~G~~C~ 81 (248)
+++|.|.+|-+|..|.
T Consensus 413 tGqCpCkeGvtG~tCn 428 (592)
T KOG3512|consen 413 TGQCPCKEGVTGLTCN 428 (592)
T ss_pred CCcccCCCCCcccccc
Confidence 4666666666666664
No 62
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=55.91 E-value=4.6 Score=24.77 Aligned_cols=16 Identities=31% Similarity=0.748 Sum_probs=6.1
Q ss_pred eeecCCCCccCCCCCC
Q psy9568 173 TCACYEGWFGRTCSQR 188 (248)
Q Consensus 173 ~C~C~~g~~G~~C~~~ 188 (248)
.|.|..-|.|++|++.
T Consensus 37 ~CECn~Cy~GpdCS~~ 52 (56)
T PF04863_consen 37 VCECNSCYGGPDCSTL 52 (56)
T ss_dssp --EE-TTEESTTS-EE
T ss_pred cccccCCcCCCCcccC
Confidence 3455555555555443
No 63
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=46.30 E-value=7.8 Score=21.80 Aligned_cols=21 Identities=29% Similarity=0.869 Sum_probs=12.1
Q ss_pred CCCCCCEeecC----CceeeCCCCc
Q psy9568 24 KCLNGGKCVQK----DTCECQKGFY 44 (248)
Q Consensus 24 ~C~~~g~C~~~----~~C~C~~G~~ 44 (248)
.|..|+.|.+. .+|+|.+||.
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEE
T ss_pred cCCCCcccEEcCCCCEEEEeeCCcc
Confidence 56777888742 4688888886
No 64
>KOG3516|consensus
Probab=44.19 E-value=21 Score=35.84 Aligned_cols=36 Identities=36% Similarity=0.868 Sum_probs=28.9
Q ss_pred CCCC-CCCCCC-CccC---CCCeeecC-CCCccCCCCCCCCC
Q psy9568 156 PYIC-PKPCKQ-GVCS---AARTCACY-EGWFGRTCSQRSDS 191 (248)
Q Consensus 156 ~~~c-~~~C~~-g~C~---~~~~C~C~-~g~~G~~C~~~~~~ 191 (248)
.+.| |++|.+ |.|. ..+.|.|. .||.|..|...+.+
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e 586 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE 586 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence 3566 888987 7887 67899996 99999999876543
No 65
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=42.96 E-value=42 Score=23.60 Aligned_cols=26 Identities=31% Similarity=0.733 Sum_probs=15.3
Q ss_pred CCCCc--CCCCCCEeecC--CceeeCCCCc
Q psy9568 19 GICTE--KCLNGGKCVQK--DTCECQKGFY 44 (248)
Q Consensus 19 ~~C~~--~C~~~g~C~~~--~~C~C~~G~~ 44 (248)
+.|.. .|..+|.|... ..|.|.+||.
T Consensus 78 d~Cd~y~~CG~~g~C~~~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNSNNSPKCSCLPGFE 107 (110)
T ss_pred cCCCCccccCCccEeCCCCCCceECCCCcC
Confidence 45553 56666777532 3577777765
No 66
>KOG3516|consensus
Probab=40.53 E-value=18 Score=36.26 Aligned_cols=35 Identities=37% Similarity=0.927 Sum_probs=28.6
Q ss_pred ccCCCCc-CCCCCCEeecC---CceeeC-CCCcCCCCCcC
Q psy9568 17 VSGICTE-KCLNGGKCVQK---DTCECQ-KGFYGLRCEFC 51 (248)
Q Consensus 17 ~~~~C~~-~C~~~g~C~~~---~~C~C~-~G~~G~~C~~~ 51 (248)
..+.|.+ +|.++|.|.++ +.|.|. .||+|..|...
T Consensus 544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred cccccCCccccCCCcccccccceeEeccccccccccccCC
Confidence 4456665 89999999874 789998 99999999843
No 67
>KOG3514|consensus
Probab=23.04 E-value=56 Score=32.80 Aligned_cols=31 Identities=45% Similarity=1.119 Sum_probs=25.6
Q ss_pred CC-CCCCCC-CccC---CCCeeec-CCCCccCCCCCC
Q psy9568 158 IC-PKPCKQ-GVCS---AARTCAC-YEGWFGRTCSQR 188 (248)
Q Consensus 158 ~c-~~~C~~-g~C~---~~~~C~C-~~g~~G~~C~~~ 188 (248)
.| ++||.| |+|. ..+.|.| ..+|.|..|+..
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccce
Confidence 45 789988 8998 6789999 478999999764
No 68
>KOG3514|consensus
Probab=21.56 E-value=56 Score=32.82 Aligned_cols=30 Identities=50% Similarity=1.233 Sum_probs=25.2
Q ss_pred CCCc-CCCCCCEeec---CCceeeC-CCCcCCCCC
Q psy9568 20 ICTE-KCLNGGKCVQ---KDTCECQ-KGFYGLRCE 49 (248)
Q Consensus 20 ~C~~-~C~~~g~C~~---~~~C~C~-~G~~G~~C~ 49 (248)
+|.. +|.|+|.|.. .+.|.|. .||.|+.|+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence 6665 8999999985 4789996 789999997
Done!