Query psy9819
Match_columns 377
No_of_seqs 260 out of 1572
Neff 8.8
Searched_HMMs 46136
Date Fri Aug 16 20:31:13 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9819.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9819hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0994|consensus 99.8 4.3E-18 9.3E-23 172.7 13.9 147 47-193 828-999 (1758)
2 KOG0994|consensus 99.7 7.4E-17 1.6E-21 163.8 16.1 171 2-172 204-471 (1758)
3 KOG1225|consensus 99.4 3E-12 6.6E-17 125.6 12.8 62 287-353 299-365 (525)
4 KOG1225|consensus 99.3 1.5E-11 3.3E-16 120.7 12.6 85 287-376 268-362 (525)
5 KOG1836|consensus 99.3 5.9E-11 1.3E-15 130.4 17.3 116 50-167 696-839 (1705)
6 KOG3512|consensus 99.1 2.8E-10 6.1E-15 107.4 9.5 117 39-156 282-439 (592)
7 KOG1219|consensus 99.0 5.2E-10 1.1E-14 120.8 6.5 99 32-145 3865-3977(4289)
8 KOG1836|consensus 98.9 6.2E-08 1.3E-12 107.1 17.0 110 80-194 696-811 (1705)
9 KOG1226|consensus 98.7 6.1E-08 1.3E-12 97.4 9.4 46 305-351 596-644 (783)
10 KOG3512|consensus 98.7 5.9E-08 1.3E-12 92.0 7.4 127 66-194 278-427 (592)
11 KOG4289|consensus 98.6 3.3E-08 7.2E-13 103.8 4.0 112 20-145 1168-1316(2531)
12 KOG1226|consensus 98.5 2.5E-07 5.4E-12 93.1 8.7 69 295-363 543-628 (783)
13 KOG4289|consensus 98.5 9.7E-08 2.1E-12 100.4 3.8 78 284-374 1222-1308(2531)
14 KOG1219|consensus 98.3 6.3E-07 1.4E-11 98.0 5.2 78 299-376 3865-3972(4289)
15 KOG1217|consensus 98.3 5.5E-05 1.2E-09 75.4 18.1 91 40-146 100-207 (487)
16 KOG4260|consensus 98.3 2.1E-06 4.5E-11 76.3 6.5 65 74-156 124-193 (350)
17 KOG4260|consensus 98.0 7.4E-06 1.6E-10 72.9 4.7 120 52-191 131-269 (350)
18 cd00055 EGF_Lam Laminin-type e 97.8 3.8E-05 8.3E-10 51.7 4.9 43 113-156 2-44 (50)
19 PF07974 EGF_2: EGF-like domai 97.7 2.8E-05 6E-10 46.9 2.7 24 304-327 7-32 (32)
20 cd00055 EGF_Lam Laminin-type e 97.7 5.2E-05 1.1E-09 51.1 3.5 38 66-104 2-43 (50)
21 smart00180 EGF_Lam Laminin-typ 97.6 8.5E-05 1.8E-09 49.0 3.8 29 125-153 12-40 (46)
22 PF00053 Laminin_EGF: Laminin 97.5 4.3E-05 9.3E-10 51.3 1.4 41 115-156 3-43 (49)
23 PF00053 Laminin_EGF: Laminin 97.4 4.3E-05 9.3E-10 51.2 0.8 38 67-105 2-43 (49)
24 cd00041 CUB CUB domain; extrac 97.4 0.00057 1.2E-08 53.8 7.4 82 196-297 25-112 (113)
25 smart00180 EGF_Lam Laminin-typ 97.3 0.00022 4.8E-09 47.1 3.2 29 73-102 12-40 (46)
26 KOG4586|consensus 97.3 0.00023 4.9E-09 56.4 3.1 85 195-298 63-153 (156)
27 PF00431 CUB: CUB domain CUB d 97.1 0.00027 5.9E-09 55.4 2.5 80 196-295 24-109 (110)
28 PF00008 EGF: EGF-like domain 97.1 0.00036 7.9E-09 42.1 1.8 25 353-377 5-30 (32)
29 PF00008 EGF: EGF-like domain 97.0 0.00018 4E-09 43.4 0.5 26 36-61 4-32 (32)
30 smart00051 DSL delta serrate l 97.0 0.00061 1.3E-08 48.1 2.9 43 49-93 17-63 (63)
31 KOG1217|consensus 97.0 0.0092 2E-07 59.3 12.4 97 37-144 178-306 (487)
32 smart00042 CUB Domain first fo 96.8 0.0019 4.1E-08 50.0 4.5 81 196-295 15-101 (102)
33 PF07974 EGF_2: EGF-like domai 96.7 0.0016 3.4E-08 39.3 2.6 25 37-62 7-32 (32)
34 PF12661 hEGF: Human growth fa 96.7 0.00077 1.7E-08 31.8 1.0 13 50-62 1-13 (13)
35 PF12661 hEGF: Human growth fa 96.5 0.00055 1.2E-08 32.3 -0.2 12 316-327 2-13 (13)
36 smart00051 DSL delta serrate l 96.2 0.0058 1.3E-07 43.1 3.3 42 96-144 21-63 (63)
37 KOG1214|consensus 96.1 0.011 2.4E-07 60.7 6.4 89 37-143 743-860 (1289)
38 smart00179 EGF_CA Calcium-bind 96.1 0.0082 1.8E-07 37.5 3.6 28 36-63 9-39 (39)
39 KOG1388|consensus 96.0 0.0036 7.8E-08 54.4 2.1 86 57-149 44-130 (217)
40 KOG3509|consensus 95.9 0.017 3.7E-07 61.3 6.5 107 48-157 717-853 (964)
41 PF07645 EGF_CA: Calcium-bindi 95.6 0.011 2.4E-07 38.0 2.7 25 352-376 10-34 (42)
42 smart00179 EGF_CA Calcium-bind 95.6 0.014 3.1E-07 36.3 3.0 31 298-328 2-39 (39)
43 KOG1214|consensus 95.5 0.014 3E-07 60.0 4.0 92 286-377 718-858 (1289)
44 smart00181 EGF Epidermal growt 95.2 0.026 5.7E-07 34.3 3.1 28 36-63 6-35 (35)
45 cd00054 EGF_CA Calcium-binding 95.1 0.033 7.1E-07 34.2 3.5 28 36-63 9-38 (38)
46 cd00054 EGF_CA Calcium-binding 94.9 0.03 6.6E-07 34.4 2.9 30 299-328 3-38 (38)
47 PF14670 FXa_inhibition: Coagu 94.4 0.031 6.6E-07 34.6 1.9 22 354-377 8-29 (36)
48 cd00053 EGF Epidermal growth f 94.4 0.057 1.2E-06 32.5 3.2 27 36-62 6-35 (36)
49 smart00181 EGF Epidermal growt 94.3 0.05 1.1E-06 33.1 2.8 25 352-377 6-30 (35)
50 cd00053 EGF Epidermal growth f 93.0 0.098 2.1E-06 31.4 2.5 26 352-377 6-31 (36)
51 PHA02887 EGF-like protein; Pro 93.0 0.079 1.7E-06 41.5 2.4 28 37-64 93-123 (126)
52 KOG1218|consensus 92.8 0.38 8.1E-06 45.3 7.5 82 51-146 92-177 (316)
53 PF01414 DSL: Delta serrate li 92.3 0.027 5.9E-07 39.7 -0.8 44 48-93 16-63 (63)
54 PHA03099 epidermal growth fact 92.1 0.11 2.3E-06 41.5 2.2 37 28-64 39-82 (139)
55 PF12947 EGF_3: EGF domain; I 90.9 0.19 4.2E-06 31.0 2.1 25 353-377 7-31 (36)
56 KOG3509|consensus 90.5 0.81 1.8E-05 49.1 7.4 78 75-157 714-795 (964)
57 PHA02887 EGF-like protein; Pro 88.7 0.3 6.6E-06 38.3 2.0 25 305-330 94-124 (126)
58 PF12947 EGF_3: EGF domain; I 88.5 0.21 4.6E-06 30.9 0.9 24 37-60 7-32 (36)
59 PF12662 cEGF: Complement Clr- 88.0 0.33 7.1E-06 27.1 1.3 11 366-376 1-11 (24)
60 PF07645 EGF_CA: Calcium-bindi 87.3 0.21 4.6E-06 32.0 0.3 26 298-323 2-34 (42)
61 KOG1218|consensus 85.9 8.6 0.00019 36.0 10.7 126 48-194 48-175 (316)
62 PF01414 DSL: Delta serrate li 85.1 0.26 5.6E-06 34.7 -0.1 39 315-354 18-63 (63)
63 PF09064 Tme5_EGF_like: Thromb 82.7 1.2 2.5E-05 26.9 2.0 24 326-349 2-26 (34)
64 PF04863 EGF_alliinase: Alliin 82.3 0.37 8E-06 32.4 -0.3 28 303-330 17-52 (56)
65 PHA03099 epidermal growth fact 80.2 1.3 2.8E-05 35.4 2.1 25 305-330 53-83 (139)
66 KOG3516|consensus 79.4 1.6 3.5E-05 47.3 3.2 60 5-64 511-582 (1306)
67 KOG1388|consensus 77.1 1.7 3.6E-05 38.2 2.1 75 111-194 50-125 (217)
68 PF12955 DUF3844: Domain of un 73.8 1.9 4.1E-05 33.4 1.4 27 303-329 13-61 (103)
69 PF01683 EB: EB module; Inter 69.4 4.2 9E-05 27.1 2.2 20 304-323 27-46 (52)
70 KOG3607|consensus 66.7 4 8.8E-05 43.0 2.4 32 299-330 626-658 (716)
71 KOG0196|consensus 64.6 8.9 0.00019 40.5 4.3 56 78-140 258-317 (996)
72 PF00954 S_locus_glycop: S-loc 63.2 17 0.00036 28.4 4.9 28 32-59 78-108 (110)
73 cd00185 TNFR Tumor necrosis fa 51.6 37 0.00079 26.0 4.9 48 91-139 33-83 (98)
74 KOG3607|consensus 41.4 19 0.00041 38.1 2.4 27 67-95 631-657 (716)
75 cd01475 vWA_Matrilin VWA_Matri 40.4 19 0.00042 32.0 2.1 23 353-377 196-218 (224)
76 KOG3514|consensus 35.4 24 0.00051 38.5 2.0 31 300-330 625-661 (1591)
77 KOG3516|consensus 31.7 29 0.00062 38.3 1.9 40 299-338 546-591 (1306)
78 KOG0196|consensus 28.8 1.9E+02 0.0041 31.1 7.1 33 130-162 258-294 (996)
79 PF12946 EGF_MSP1_1: MSP1 EGF 25.1 50 0.0011 20.5 1.4 25 352-376 5-30 (37)
80 PF02468 PsbN: Photosystem II 21.8 38 0.00083 21.7 0.5 16 4-19 23-39 (43)
No 1
>KOG0994|consensus
Probab=99.76 E-value=4.3e-18 Score=172.67 Aligned_cols=147 Identities=35% Similarity=0.683 Sum_probs=113.2
Q ss_pred CCeeeecCCCCccCCCCC------------CCCCCCCc-eeecCCCcccCCCCCCCCCCCCCCCCCcccCCCCC--CCcc
Q psy9819 47 PDYSCQCELGWTGVDCSV------------NCLCNNHS-TCVHGIGICDECHDWTTGDHCQYCRAGSYGNATTQ--EGCR 111 (377)
Q Consensus 47 ~~~~C~C~~G~~G~~C~~------------~C~C~~~g-~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~--~~C~ 111 (377)
.+++|.|.+|.+|..|.+ +|.|++|+ +|++.+|.|+.|...++|.+|+.|.+||+|+.--. ..|.
T Consensus 828 ~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~Cr 907 (1758)
T KOG0994|consen 828 ITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCR 907 (1758)
T ss_pred cccceeeccccchhhccccCCCccCCCcCccccccCcccccCccccccccccccccccchhhhhccccCCcccCCCCCCC
Confidence 356888888888888874 57899998 89999999999999999999999999999997543 6899
Q ss_pred CCCCCCCCCc---CcccccCCC----cceecCCCCccCCCccCCCCccCCCCCCCCCCC-CCCCCCC--CCCCCCCCCCC
Q psy9819 112 KCDCNSHGNS---VLGVCDSIT----GECICQDNTQGKNCERCLPGYYGDPTDGGTCYY-QCMARGM--LTGPGPQGLGS 181 (377)
Q Consensus 112 ~~~C~~~g~~---~~g~C~~~~----g~C~C~~g~~G~~C~~C~~G~~g~~~~~~~C~~-~C~~~g~--~~~~~~~~~g~ 181 (377)
+|+|..+... ...+|...+ -.|+|.+||+|.+|+.|.++|+|+|.++++|.. +|+++.. -...|...+|.
T Consensus 908 PCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~ 987 (1758)
T KOG0994|consen 908 PCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGA 987 (1758)
T ss_pred CCCCCCCCccchhccccccccccccceeeecccCccccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhch
Confidence 9999765422 112454322 389999999999999999999999999999987 8877652 23455555554
Q ss_pred cccCCCCCccCC
Q psy9819 182 GLAERNAWEGKD 193 (377)
Q Consensus 182 ~~~c~~G~~G~~ 193 (377)
+..|...-+|.+
T Consensus 988 CLkCL~hTeG~h 999 (1758)
T KOG0994|consen 988 CLKCLYHTEGDH 999 (1758)
T ss_pred hhhhhhcccccc
Confidence 444444334443
No 2
>KOG0994|consensus
Probab=99.73 E-value=7.4e-17 Score=163.83 Aligned_cols=171 Identities=30% Similarity=0.726 Sum_probs=135.0
Q ss_pred eeEEEecCCCCCcccccc-------cceeeeEeeecCCCcCC------------------------C--C-CC-CeecC-
Q psy9819 2 SIIFRISGLTTAKDDALS-------RCTVLLLYIFNASLCYN------------------------K--C-IY-GYCKG- 45 (377)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~C~~------------------------~--C-~~-G~C~~- 45 (377)
.||||+|+|.....|||+ ||||||+.+.+...++. . | .| ..|..
T Consensus 204 EVifrvl~P~~~iedPYs~~IQ~~LKITNLRvn~tklhtlgdnllD~r~E~~ekyyYAiy~~vVrGsCfCyGHAs~C~P~ 283 (1758)
T KOG0994|consen 204 EVIFRVLDPAIDIEDPYSAKIQELLKITNLRVNFTKLHTLGDNLLDSREEIREKYYYAIYDLVVRGSCFCYGHASQCAPV 283 (1758)
T ss_pred eEEEEecCCCCCCCCchhHHHHHHhhhhheeeeeEeeccccccccccccccccchhheeeeeeeecceeecCchhhcccC
Confidence 499999999999999999 99999999988887764 1 2 23 33542
Q ss_pred --------CC-----CeeeecCCCCccCCCCC----------------------CCCCCCCc-eee-----------cCC
Q psy9819 46 --------PP-----DYSCQCELGWTGVDCSV----------------------NCLCNNHS-TCV-----------HGI 78 (377)
Q Consensus 46 --------~~-----~~~C~C~~G~~G~~C~~----------------------~C~C~~~g-~C~-----------~~~ 78 (377)
++ -+.|.|.....|.+|+. .|.|++|. +|+ ...
T Consensus 284 ~g~~s~~~~~ta~mVHG~C~C~HNT~G~nCE~C~~fYnDlPWrpAeG~~~neCrkC~CNgHa~sCHFD~aV~~ASG~vSG 363 (1758)
T KOG0994|consen 284 DGARSAKAPGTAHMVHGRCMCKHNTAGLNCEHCAPFYNDLPWRPAEGKTSNECRKCECNGHADTCHFDMAVYEASGNVSG 363 (1758)
T ss_pred CCCCcccCCCccceecceeEeccCCCCCChHHhhHhhcCCCCCccCCCCcccccccCCCCCcccccccHHHHhhcCCccc
Confidence 11 13799999999999993 25899998 787 235
Q ss_pred CcccCCCCCCCCCCCCCCCCCcccCCCC----CCCccCCCCCCCCCcCcccccC----CC----cceecCCCCccCCCcc
Q psy9819 79 GICDECHDWTTGDHCQYCRAGSYGNATT----QEGCRKCDCNSHGNSVLGVCDS----IT----GECICQDNTQGKNCER 146 (377)
Q Consensus 79 ~~C~~C~~g~~G~~C~~C~~g~~g~~c~----~~~C~~~~C~~~g~~~~g~C~~----~~----g~C~C~~g~~G~~C~~ 146 (377)
|+|+.|.+++.|.+||.|+|.||-+.-. +..|.+|.|...|+...|.|+. .+ |.|.|+++..|.+|++
T Consensus 364 GVCDdCqHNT~G~~CE~CkP~fYRdprr~i~~p~vC~pC~CdP~GS~~~g~cds~~Dp~~GlvaGqC~CK~~V~G~RCd~ 443 (1758)
T KOG0994|consen 364 GVCDDCQHNTEGQNCERCKPFFYRDPRRDISDPDVCKPCECDPAGSQDGGICDSFCDPSTGLVAGQCRCKEHVAGRRCDR 443 (1758)
T ss_pred ccCccccccccccchhhcCcccccCCCCCCCCccccccccCCCCcCcCCCccccccCccccccccccccccCcCccccch
Confidence 8999999999999999999999987643 3789999999999888777743 33 7999999999999999
Q ss_pred CCCCccCCCC-CCCCCCC-CCCCCCCCC
Q psy9819 147 CLPGYYGDPT-DGGTCYY-QCMARGMLT 172 (377)
Q Consensus 147 C~~G~~g~~~-~~~~C~~-~C~~~g~~~ 172 (377)
|++||+|... +...|.. .|+..|+..
T Consensus 444 Ck~Gywgl~~~dp~GC~~C~CN~lGT~~ 471 (1758)
T KOG0994|consen 444 CKDGYWGLTSADPYGCRPCDCNPLGTRN 471 (1758)
T ss_pred hccCcccCccCCCCCccccccccccccC
Confidence 9999998753 2344554 666666444
No 3
>KOG1225|consensus
Probab=99.39 E-value=3e-12 Score=125.64 Aligned_cols=62 Identities=24% Similarity=0.570 Sum_probs=52.0
Q ss_pred CCCCCCcc-cccccCCCCCCCCCCeecCCcccCCCCCCCCCCCCCCCCCCccCC----CceEEeCCCCCCCC
Q psy9819 287 KPSEGFNA-TYQIFSCPDKCPENRTCINNQCVCPPRRTGPDCQEEICPNECHEF----LNHGTCDLLLTGVH 353 (377)
Q Consensus 287 ~c~~GF~g-~~~~~~C~~~C~~~g~C~~g~C~C~~G~~G~~C~~~~C~~~C~~~----~~~c~C~~g~~G~~ 353 (377)
.|.+||.| ++++..|+.+|+++|.|++++|+|.+||+|..|+++. |+++ ++ |+|.+||.|.+
T Consensus 299 iC~~g~~G~dCs~~~cpadC~g~G~Ci~G~C~C~~Gy~G~~C~~~~----C~~~g~cv~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 299 ICNPGYSGKDCSIRRCPADCSGHGKCIDGECLCDEGYTGELCIQRA----CSGGGQCVNG-CKCKKGWRGPD 365 (525)
T ss_pred ecCCCccccccccccCCccCCCCCcccCCceEeCCCCcCCcccccc----cCCCceeccC-ceeccCccCCC
Confidence 34567888 7888889999999999999999999999999999873 5554 35 99999999988
No 4
>KOG1225|consensus
Probab=99.32 E-value=1.5e-11 Score=120.73 Aligned_cols=85 Identities=26% Similarity=0.646 Sum_probs=73.2
Q ss_pred CCCCCCcc-cccccCCCCCCCCCCeecCCcccCCCCCCCCCCCCCCCCCCccCC----CceEEeCCCCCCCCCCC-----
Q psy9819 287 KPSEGFNA-TYQIFSCPDKCPENRTCINNQCVCPPRRTGPDCQEEICPNECHEF----LNHGTCDLLLTGVHITH----- 356 (377)
Q Consensus 287 ~c~~GF~g-~~~~~~C~~~C~~~g~C~~g~C~C~~G~~G~~C~~~~C~~~C~~~----~~~c~C~~g~~G~~C~~----- 356 (377)
.|.+||.| ++++..|+..|++|+.+++++|+|++||+|.+|++..||.+|+++ +++|.|.+||+|..|+.
T Consensus 268 IC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~Ci~G~C~C~~Gy~G~~C~~~~C~~ 347 (525)
T KOG1225|consen 268 ICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKCIDGECLCDEGYTGELCIQRACSG 347 (525)
T ss_pred eCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccCCccCCCCCcccCCceEeCCCCcCCcccccccCC
Confidence 45678998 777888988899999999999999999999999999999999998 59999999999999885
Q ss_pred CCeecccccCeeeecCCCee
Q psy9819 357 GRTLHYQVDLIRCTCRQVYL 376 (377)
Q Consensus 357 g~~~~~~~~~~~c~~~~~~~ 376 (377)
++.|++ . |.|..||-
T Consensus 348 ~g~cv~---g--C~C~~Gw~ 362 (525)
T KOG1225|consen 348 GGQCVN---G--CKCKKGWR 362 (525)
T ss_pred Cceecc---C--ceeccCcc
Confidence 444443 3 88888874
No 5
>KOG1836|consensus
Probab=99.30 E-value=5.9e-11 Score=130.39 Aligned_cols=116 Identities=33% Similarity=0.801 Sum_probs=95.1
Q ss_pred eeecCCCCccCCCCC-------------------CCCCCCC-ceeecCCCcccCCCCCCCCCCCCCCCCCcccCCCCC--
Q psy9819 50 SCQCELGWTGVDCSV-------------------NCLCNNH-STCVHGIGICDECHDWTTGDHCQYCRAGSYGNATTQ-- 107 (377)
Q Consensus 50 ~C~C~~G~~G~~C~~-------------------~C~C~~~-g~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~-- 107 (377)
.|.|++||+|..|+. +|+|++| .+|++.++.| .|.++..|.+|+.|.+||||..-..
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C-~C~~~t~G~~C~~C~~GfYg~~~~~~~ 774 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQC-KCKHNTFGGQCAQCVDGFYGLPDLGTS 774 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCce-ecccCCCCCchhhhcCCCCCccccCCC
Confidence 599999999999983 3578887 4899999999 6999999999999999999987654
Q ss_pred CCccCCCCCCCCCcCcccccCCCccee-cCCCCccCCCccCCCCccCCCCCCC----CCCC-CCCC
Q psy9819 108 EGCRKCDCNSHGNSVLGVCDSITGECI-CQDNTQGKNCERCLPGYYGDPTDGG----TCYY-QCMA 167 (377)
Q Consensus 108 ~~C~~~~C~~~g~~~~g~C~~~~g~C~-C~~g~~G~~C~~C~~G~~g~~~~~~----~C~~-~C~~ 167 (377)
.+|++|+|.+.+.... ++....++|. |+++|+|.+|+.|..||++++.... .|.. +|..
T Consensus 775 ~dC~~C~Cp~~~~~~~-~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~ 839 (1705)
T KOG1836|consen 775 GDCQPCPCPNGGACGQ-TPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNF 839 (1705)
T ss_pred CCCccCCCCCChhhcC-cCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceecc
Confidence 4599999998864332 4444567998 9999999999999999999987544 5554 4444
No 6
>KOG3512|consensus
Probab=99.11 E-value=2.8e-10 Score=107.44 Aligned_cols=117 Identities=35% Similarity=0.913 Sum_probs=96.6
Q ss_pred CC-CeecCCC--CeeeecCCCCccCCCCC----------------------CCCCCCCce-ee-----------cCCCcc
Q psy9819 39 IY-GYCKGPP--DYSCQCELGWTGVDCSV----------------------NCLCNNHST-CV-----------HGIGIC 81 (377)
Q Consensus 39 ~~-G~C~~~~--~~~C~C~~G~~G~~C~~----------------------~C~C~~~g~-C~-----------~~~~~C 81 (377)
.| ..|+-.. .++|.|..+.+|++|.. +|.|+.|+. |. ...++|
T Consensus 282 gHAs~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvC 361 (592)
T KOG3512|consen 282 GHASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVC 361 (592)
T ss_pred CccceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceE
Confidence 46 4587433 47999999999999993 357888773 54 124689
Q ss_pred cCCCCCCCCCCCCCCCCCcccCCCCC----CCccCCCCCCCCCcCcccccCCCcceecCCCCccCCCccCCCCccCCCC
Q psy9819 82 DECHDWTTGDHCQYCRAGSYGNATTQ----EGCRKCDCNSHGNSVLGVCDSITGECICQDNTQGKNCERCLPGYYGDPT 156 (377)
Q Consensus 82 ~~C~~g~~G~~C~~C~~g~~g~~c~~----~~C~~~~C~~~g~~~~g~C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~ 156 (377)
.+|.++++|.+|+.|++||+-+...+ ..|..|.|+..|+... +|+..+|+|.|++|.+|..|..|.+||+....
T Consensus 362 lnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gk-tCNq~tGqCpCkeGvtG~tCnrCa~gyqqsrs 439 (592)
T KOG3512|consen 362 LNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGK-TCNQTTGQCPCKEGVTGLTCNRCAPGYQQSRS 439 (592)
T ss_pred eecccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccc-cccccCCcccCCCCCcccccccccchhhcccC
Confidence 99999999999999999999887654 7899999998886544 89988999999999999999999999997643
No 7
>KOG1219|consensus
Probab=98.99 E-value=5.2e-10 Score=120.83 Aligned_cols=99 Identities=32% Similarity=0.828 Sum_probs=82.2
Q ss_pred CCcC-CCCCC-CeecCCC--CeeeecCCCCccCCCCC---CC---CCCCCceee--cCCCcccCCCCCCCCCCCCCCCCC
Q psy9819 32 SLCY-NKCIY-GYCKGPP--DYSCQCELGWTGVDCSV---NC---LCNNHSTCV--HGIGICDECHDWTTGDHCQYCRAG 99 (377)
Q Consensus 32 ~~C~-~~C~~-G~C~~~~--~~~C~C~~G~~G~~C~~---~C---~C~~~g~C~--~~~~~C~~C~~g~~G~~C~~C~~g 99 (377)
.-|. +|||| |+|+..+ .|+|.|++-|+|..|++ +| +|..+|+|+ ...+.| +|+.+|+|.+||. .
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~C-nC~~gyTG~~Ce~--~- 3940 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLC-NCPNGYTGKRCEA--R- 3940 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeE-eCCCCccCceeec--c-
Confidence 4465 69999 8999643 79999999999999996 56 899999998 457899 9999999999975 1
Q ss_pred cccCCCCCCCccCCCCCCCCCcCcccccCCCc--ceecCCCCccCCCc
Q psy9819 100 SYGNATTQEGCRKCDCNSHGNSVLGVCDSITG--ECICQDNTQGKNCE 145 (377)
Q Consensus 100 ~~g~~c~~~~C~~~~C~~~g~~~~g~C~~~~g--~C~C~~g~~G~~C~ 145 (377)
| .++|...+|.++| .|....| .|.|.+||.|+.|.
T Consensus 3941 --G----i~eCs~n~C~~gg-----~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3941 --G----ISECSKNVCGTGG-----QCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred --c----ccccccccccCCc-----eeeccCCceEeccChhHhcccCc
Confidence 1 3567777888877 8876555 99999999999987
No 8
>KOG1836|consensus
Probab=98.86 E-value=6.2e-08 Score=107.14 Aligned_cols=110 Identities=30% Similarity=0.611 Sum_probs=89.7
Q ss_pred cccCCCCCCCCCCCCCCCCCcccCCCCC---CCccCCCCCCCCCcCcccccCCCcceecCCCCccCCCccCCCCccCCCC
Q psy9819 80 ICDECHDWTTGDHCQYCRAGSYGNATTQ---EGCRKCDCNSHGNSVLGVCDSITGECICQDNTQGKNCERCLPGYYGDPT 156 (377)
Q Consensus 80 ~C~~C~~g~~G~~C~~C~~g~~g~~c~~---~~C~~~~C~~~g~~~~g~C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~ 156 (377)
.| .|++||+|..||.|+++|+...-.. ..|.+|.|+++. .+|++.+|.|.|.+...|..|++|.+||||++.
T Consensus 696 ~c-~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~----~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~ 770 (1705)
T KOG1836|consen 696 QC-TCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHS----NICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPD 770 (1705)
T ss_pred hc-cCCCCcccchhhhcchhhhcccccCCCCCcccccccCCcc----ccccCCCCceecccCCCCCchhhhcCCCCCccc
Confidence 48 7999999999999999998765432 456677777762 389999999999999999999999999999986
Q ss_pred CCC--CCCC-CCCCCCCCCCCCCCCCCCcccCCCCCccCCC
Q psy9819 157 DGG--TCYY-QCMARGMLTGPGPQGLGSGLAERNAWEGKDT 194 (377)
Q Consensus 157 ~~~--~C~~-~C~~~g~~~~~~~~~~g~~~~c~~G~~G~~C 194 (377)
.+. .|.. .|...+.+........+.++.|++||+|..|
T Consensus 771 ~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC 811 (1705)
T KOG1836|consen 771 LGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRC 811 (1705)
T ss_pred cCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence 432 2766 7877777776666666777789999999988
No 9
>KOG1226|consensus
Probab=98.70 E-value=6.1e-08 Score=97.43 Aligned_cols=46 Identities=30% Similarity=0.769 Sum_probs=35.2
Q ss_pred CCCCCeecCCcccCCCC-CCCCCCCC-CCCCCCccCCCceE-EeCCCCCC
Q psy9819 305 CPENRTCINNQCVCPPR-RTGPDCQE-EICPNECHEFLNHG-TCDLLLTG 351 (377)
Q Consensus 305 C~~~g~C~~g~C~C~~G-~~G~~C~~-~~C~~~C~~~~~~c-~C~~g~~G 351 (377)
|+++|+|.-|+|+|... |.|..||+ +.|++.|... ..| .|..--+|
T Consensus 596 CSGrG~C~Cg~C~C~~~~~sG~~CE~cptc~~~C~~~-~~CveC~~~~~g 644 (783)
T KOG1226|consen 596 CSGRGTCECGRCKCTDPPYSGEFCEKCPTCPDPCAEN-KSCVECQAFETG 644 (783)
T ss_pred eCCCceeeCCceEcCCCCcCcchhhcCCCCCCccccc-ccchhhcccccc
Confidence 99999999999999877 99999997 4588888765 234 44444444
No 10
>KOG3512|consensus
Probab=98.65 E-value=5.9e-08 Score=92.03 Aligned_cols=127 Identities=28% Similarity=0.640 Sum_probs=97.9
Q ss_pred CCCCCCc-eee---cCCCcccCCCCCCCCCCCCCCCCCcccCCCCC------CCccCCCCCCCCCc--CcccccCC----
Q psy9819 66 CLCNNHS-TCV---HGIGICDECHDWTTGDHCQYCRAGSYGNATTQ------EGCRKCDCNSHGNS--VLGVCDSI---- 129 (377)
Q Consensus 66 C~C~~~g-~C~---~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~------~~C~~~~C~~~g~~--~~g~C~~~---- 129 (377)
|.|++|+ .|+ ..+.+| .|.++++|+.|+.|++-|+.+.... ..|..+.|+.++.. .+..+...
T Consensus 278 CKCNgHAs~Cv~d~~~~ltC-dC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~ 356 (592)
T KOG3512|consen 278 CKCNGHASRCVMDESSHLTC-DCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRR 356 (592)
T ss_pred eeecCccceeeeccCCceEE-ecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCcc
Confidence 6788887 587 234689 7999999999999999998776543 67899999888762 22222222
Q ss_pred -Cccee-cCCCCccCCCccCCCCccCCCC----CCCCCCC-CCCCCCCCCCCCCCCCCCcccCCCCCccCCC
Q psy9819 130 -TGECI-CQDNTQGKNCERCLPGYYGDPT----DGGTCYY-QCMARGMLTGPGPQGLGSGLAERNAWEGKDT 194 (377)
Q Consensus 130 -~g~C~-C~~g~~G~~C~~C~~G~~g~~~----~~~~C~~-~C~~~g~~~~~~~~~~g~~~~c~~G~~G~~C 194 (377)
.|+|. |.....|.+|..|++|||-+.. ....|.. .|.+.|.....|+..+|.+ .|.+|-+|..|
T Consensus 357 SggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tGqC-pCkeGvtG~tC 427 (592)
T KOG3512|consen 357 SGGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTGQC-PCKEGVTGLTC 427 (592)
T ss_pred ccceEeecccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccccCCcc-cCCCCCccccc
Confidence 24774 9999999999999999997654 2345655 7888888888888887766 58899999987
No 11
>KOG4289|consensus
Probab=98.58 E-value=3.3e-08 Score=103.82 Aligned_cols=112 Identities=29% Similarity=0.681 Sum_probs=85.4
Q ss_pred cceeeeEeeecCCCcC-CCCCC-CeecC----------------------C-CCeeeecCCCCccCCCCC---CC---CC
Q psy9819 20 RCTVLLLYIFNASLCY-NKCIY-GYCKG----------------------P-PDYSCQCELGWTGVDCSV---NC---LC 68 (377)
Q Consensus 20 ~~~~~~~~~~~~~~C~-~~C~~-G~C~~----------------------~-~~~~C~C~~G~~G~~C~~---~C---~C 68 (377)
+++.|++.-|.-++|. .||.| -.|+. | ++++|+||+||+|+.|+. .| +|
T Consensus 1168 ~~sll~VlpfdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC 1247 (2531)
T KOG4289|consen 1168 AISLLRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPC 1247 (2531)
T ss_pred HhhheeeeeccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCC
Confidence 7778898889999998 58988 67873 1 257899999999999996 35 89
Q ss_pred CCCceee--cCCCcccCCCCCCCCCCCCCCCCCcccCCCCCCCccCCCCCCCCCcCcccccCC---CcceecCCC-CccC
Q psy9819 69 NNHSTCV--HGIGICDECHDWTTGDHCQYCRAGSYGNATTQEGCRKCDCNSHGNSVLGVCDSI---TGECICQDN-TQGK 142 (377)
Q Consensus 69 ~~~g~C~--~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~~~C~~~~C~~~g~~~~g~C~~~---~g~C~C~~g-~~G~ 142 (377)
.++|+|. .+.+.| .|.++|+|.+||.=. ....|.+-.|.++| +|... ...|+|+.| |+++
T Consensus 1248 ~nng~C~srEggYtC-eCrpg~tGehCEvs~--------~agrCvpGvC~ngg-----tC~~~~nggf~c~Cp~ge~e~p 1313 (2531)
T KOG4289|consen 1248 GNNGRCRSREGGYTC-ECRPGFTGEHCEVSA--------RAGRCVPGVCKNGG-----TCVNLLNGGFCCHCPYGEFEDP 1313 (2531)
T ss_pred CCCCceEEecCceeE-EecCCccccceeeec--------ccCccccceecCCC-----EEeecCCCceeccCCCcccCCC
Confidence 9999998 567899 799999999887300 01335555577777 77542 237889986 7889
Q ss_pred CCc
Q psy9819 143 NCE 145 (377)
Q Consensus 143 ~C~ 145 (377)
+|+
T Consensus 1314 rC~ 1316 (2531)
T KOG4289|consen 1314 RCE 1316 (2531)
T ss_pred ceE
Confidence 998
No 12
>KOG1226|consensus
Probab=98.54 E-value=2.5e-07 Score=93.12 Aligned_cols=69 Identities=23% Similarity=0.466 Sum_probs=51.2
Q ss_pred cccccCCCCC----CCCCCeecCCcccCCCCCCCCCCCCCCCCCC--------ccCCC----ceEEeCCC-CCCCCCCCC
Q psy9819 295 TYQIFSCPDK----CPENRTCINNQCVCPPRRTGPDCQEEICPNE--------CHEFL----NHGTCDLL-LTGVHITHG 357 (377)
Q Consensus 295 ~~~~~~C~~~----C~~~g~C~~g~C~C~~G~~G~~C~~~~C~~~--------C~~~~----~~c~C~~g-~~G~~C~~g 357 (377)
+++...|+.. |.+||+|.-|+|+|.+||+|..|+-+.-.+. |+.++ ++|.|.+. |.|..||+-
T Consensus 543 ECDnfsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 543 ECDNFSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECGRCKCTDPPYSGEFCEKC 622 (783)
T ss_pred eccCcccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCCceEcCCCCcCcchhhcC
Confidence 4444555432 9999999999999999999999986543333 44432 77899777 999999976
Q ss_pred Ceeccc
Q psy9819 358 RTLHYQ 363 (377)
Q Consensus 358 ~~~~~~ 363 (377)
-||-+-
T Consensus 623 ptc~~~ 628 (783)
T KOG1226|consen 623 PTCPDP 628 (783)
T ss_pred CCCCCc
Confidence 666554
No 13
>KOG4289|consensus
Probab=98.47 E-value=9.7e-08 Score=100.44 Aligned_cols=78 Identities=26% Similarity=0.500 Sum_probs=62.4
Q ss_pred CCCCCCCCCcccc---cccCC-CCCCCCCCeec---CC-cccCCCCCCCCCCCCCCCCCCccCCCceEEeCCCCCCCCCC
Q psy9819 284 KQGKPSEGFNATY---QIFSC-PDKCPENRTCI---NN-QCVCPPRRTGPDCQEEICPNECHEFLNHGTCDLLLTGVHIT 355 (377)
Q Consensus 284 ~s~~c~~GF~g~~---~~~~C-~~~C~~~g~C~---~g-~C~C~~G~~G~~C~~~~C~~~C~~~~~~c~C~~g~~G~~C~ 355 (377)
..++|++||++++ .++.| ..+|.++|+|. +| +|.|.+||+|++||+.. -...|.+|+ |.
T Consensus 1222 lrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~---------~agrCvpGv----C~ 1288 (2531)
T KOG4289|consen 1222 LRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSA---------RAGRCVPGV----CK 1288 (2531)
T ss_pred eeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeec---------ccCccccce----ec
Confidence 4457889999944 45778 67799999993 33 89999999999999862 135677776 99
Q ss_pred CCCeecc-cccCeeeecCCC
Q psy9819 356 HGRTLHY-QVDLIRCTCRQV 374 (377)
Q Consensus 356 ~g~~~~~-~~~~~~c~~~~~ 374 (377)
+|+||++ .+++|.|.|+.+
T Consensus 1289 nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1289 NGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred CCCEEeecCCCceeccCCCc
Confidence 9999995 578999999986
No 14
>KOG1219|consensus
Probab=98.30 E-value=6.3e-07 Score=97.99 Aligned_cols=78 Identities=26% Similarity=0.561 Sum_probs=66.8
Q ss_pred cCC-CCCCCCCCeecCC-----cccCCCCCCCCCCCCCC--C-CCCccCC--------CceEEeCCCCCCCCCC------
Q psy9819 299 FSC-PDKCPENRTCINN-----QCVCPPRRTGPDCQEEI--C-PNECHEF--------LNHGTCDLLLTGVHIT------ 355 (377)
Q Consensus 299 ~~C-~~~C~~~g~C~~g-----~C~C~~G~~G~~C~~~~--C-~~~C~~~--------~~~c~C~~g~~G~~C~------ 355 (377)
+.| .++|+++|+|... .|+|++-|+|.+||+.+ | +++|.+. ...|.|+.||+|.+||
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~e 3944 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISE 3944 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccc
Confidence 567 5679999999432 89999999999999865 5 4788775 3779999999999887
Q ss_pred -------CCCeecccccCeeeecCCCee
Q psy9819 356 -------HGRTLHYQVDLIRCTCRQVYL 376 (377)
Q Consensus 356 -------~g~~~~~~~~~~~c~~~~~~~ 376 (377)
+|+.|++..++|-|-|-++|+
T Consensus 3945 Cs~n~C~~gg~C~n~~gsf~CncT~g~~ 3972 (4289)
T KOG1219|consen 3945 CSKNVCGTGGQCINIPGSFHCNCTPGIL 3972 (4289)
T ss_pred cccccccCCceeeccCCceEeccChhHh
Confidence 799999999999999999987
No 15
>KOG1217|consensus
Probab=98.27 E-value=5.5e-05 Score=75.37 Aligned_cols=91 Identities=26% Similarity=0.682 Sum_probs=55.2
Q ss_pred CCeecCC-CCeeeecCCCCccCCCCCC--CCC-----CCCceeecC-----CCcccCCCCCCCCCCCCCCCCCcccCCCC
Q psy9819 40 YGYCKGP-PDYSCQCELGWTGVDCSVN--CLC-----NNHSTCVHG-----IGICDECHDWTTGDHCQYCRAGSYGNATT 106 (377)
Q Consensus 40 ~G~C~~~-~~~~C~C~~G~~G~~C~~~--C~C-----~~~g~C~~~-----~~~C~~C~~g~~G~~C~~C~~g~~g~~c~ 106 (377)
++.+... ..+.|.|++||.|..|+.. |.- ..++.|... .+.| .|..+|.+..|+...
T Consensus 100 ~~~~~~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c-~C~~g~~~~~~~~~~--------- 169 (487)
T KOG1217|consen 100 CGECVDCVGSYECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRC-SCTEGYEGEPCETDL--------- 169 (487)
T ss_pred CccccCCCCCceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceee-eeCCCcccccccccc---------
Confidence 3555543 3789999999999999973 521 233444432 3455 455555555553210
Q ss_pred CCCcc--CCCCCCCCCcCcccccCCC--cceecCCCCccCCCcc
Q psy9819 107 QEGCR--KCDCNSHGNSVLGVCDSIT--GECICQDNTQGKNCER 146 (377)
Q Consensus 107 ~~~C~--~~~C~~~g~~~~g~C~~~~--g~C~C~~g~~G~~C~~ 146 (377)
+.|. ...|.+.+ .|.... ..|.|.++|.|..|+.
T Consensus 170 -~~C~~~~~~c~~~~-----~C~~~~~~~~C~c~~~~~~~~~~~ 207 (487)
T KOG1217|consen 170 -DECIQYSSPCQNGG-----TCVNTGGSYLCSCPPGYTGSTCET 207 (487)
T ss_pred -cccccCCCCcCCCc-----ccccCCCCeeEeCCCCccCCcCcC
Confidence 2444 22366655 665433 4799999999998873
No 16
>KOG4260|consensus
Probab=98.25 E-value=2.1e-06 Score=76.34 Aligned_cols=65 Identities=32% Similarity=0.798 Sum_probs=46.2
Q ss_pred eecCCCcccCCCCCCCCCCCCCCCCCcccCCCCCCCccCCCCCCCCCcCcccccC-----CCcceecCCCCccCCCccCC
Q psy9819 74 CVHGIGICDECHDWTTGDHCQYCRAGSYGNATTQEGCRKCDCNSHGNSVLGVCDS-----ITGECICQDNTQGKNCERCL 148 (377)
Q Consensus 74 C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~~~C~~~~C~~~g~~~~g~C~~-----~~g~C~C~~g~~G~~C~~C~ 148 (377)
|+..--+| |++|+.|+.|..|+-|..- +|.++| .|.. .+|.|.|.+||+|+.|..|.
T Consensus 124 CvdqLkvC--Cp~gtyGpdCl~Cpggser-----------~C~GnG-----~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg 185 (350)
T KOG4260|consen 124 CVDQLKVC--CPDGTYGPDCLQCPGGSER-----------PCFGNG-----SCHGDGSREGSGKCKCETGYTGPLCRYCG 185 (350)
T ss_pred hhhhheec--cCCCCcCCccccCCCCCcC-----------CcCCCC-----cccCCCCCCCCCcccccCCCCCccccccc
Confidence 44333456 8899999999888754421 355554 3322 25799999999999999999
Q ss_pred CCccCCCC
Q psy9819 149 PGYYGDPT 156 (377)
Q Consensus 149 ~G~~g~~~ 156 (377)
++|+-...
T Consensus 186 ~eyfes~R 193 (350)
T KOG4260|consen 186 IEYFESSR 193 (350)
T ss_pred hHHHHhhc
Confidence 99987643
No 17
>KOG4260|consensus
Probab=98.00 E-value=7.4e-06 Score=72.89 Aligned_cols=120 Identities=27% Similarity=0.601 Sum_probs=80.9
Q ss_pred ecCCCCccCCCCCCC------CCCCCceee-----cCCCcccCCCCCCCCCCCCCCCCCcccCCCCC--CCccCCCCCCC
Q psy9819 52 QCELGWTGVDCSVNC------LCNNHSTCV-----HGIGICDECHDWTTGDHCQYCRAGSYGNATTQ--EGCRKCDCNSH 118 (377)
Q Consensus 52 ~C~~G~~G~~C~~~C------~C~~~g~C~-----~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~--~~C~~~~C~~~ 118 (377)
-||+|.+|++|.. | +|.++|.|. .++++| .|.+||+|..|..|.++|+-..-+. ..|..| +..
T Consensus 131 CCp~gtyGpdCl~-Cpggser~C~GnG~C~GdGsR~GsGkC-kC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~C--h~~ 206 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQ-CPGGSERPCFGNGSCHGDGSREGSGKC-KCETGYTGPLCRYCGIEYFESSRNEQHLVCTAC--HEG 206 (350)
T ss_pred ccCCCCcCCcccc-CCCCCcCCcCCCCcccCCCCCCCCCcc-cccCCCCCccccccchHHHHhhcccccchhhhh--hhh
Confidence 3999999999984 5 699999998 457999 8999999999999999998765443 334332 110
Q ss_pred CCcCcccccCCCcceecCCCCccCCCccCCCCccCCCCCCCCCCC--CCCCCCC---CCCCCCCCCCCcc-cCCCCCcc
Q psy9819 119 GNSVLGVCDSITGECICQDNTQGKNCERCLPGYYGDPTDGGTCYY--QCMARGM---LTGPGPQGLGSGL-AERNAWEG 191 (377)
Q Consensus 119 g~~~~g~C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~~~~~C~~--~C~~~g~---~~~~~~~~~g~~~-~c~~G~~G 191 (377)
- .+.|. |-.-..|..|..||..+. ..|.+ +|...+. -..+|.|..|+|. .+++||.+
T Consensus 207 C---~~~Cs----------g~~~k~C~kCkkGW~lde---~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 207 C---LGVCS----------GESSKGCSKCKKGWKLDE---EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred h---hcccC----------CCCCCChhhhcccceecc---cccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 0 01232 222345666777777652 24544 5554442 2247889999998 66788876
No 18
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.83 E-value=3.8e-05 Score=51.73 Aligned_cols=43 Identities=58% Similarity=1.378 Sum_probs=34.9
Q ss_pred CCCCCCCCcCcccccCCCcceecCCCCccCCCccCCCCccCCCC
Q psy9819 113 CDCNSHGNSVLGVCDSITGECICQDNTQGKNCERCLPGYYGDPT 156 (377)
Q Consensus 113 ~~C~~~g~~~~g~C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~ 156 (377)
+.|+.+++.. ..|+..+|+|.|+++|+|.+|++|++||++.+.
T Consensus 2 C~C~~~g~~~-~~C~~~~G~C~C~~~~~G~~C~~C~~g~~~~~~ 44 (50)
T cd00055 2 CDCNGHGSLS-GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPS 44 (50)
T ss_pred CcCcCCCCCC-ccccCCCCEEeCCCcCCCCCCCCCCCCCccCCC
Confidence 4566665433 368888899999999999999999999999864
No 19
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=97.73 E-value=2.8e-05 Score=46.89 Aligned_cols=24 Identities=46% Similarity=1.146 Sum_probs=22.1
Q ss_pred CCCCCCeec--CCcccCCCCCCCCCC
Q psy9819 304 KCPENRTCI--NNQCVCPPRRTGPDC 327 (377)
Q Consensus 304 ~C~~~g~C~--~g~C~C~~G~~G~~C 327 (377)
.|++||+|+ .++|+|++||+|.+|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 599999998 689999999999987
No 20
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies.
Probab=97.66 E-value=5.2e-05 Score=51.08 Aligned_cols=38 Identities=47% Similarity=1.072 Sum_probs=32.6
Q ss_pred CCCCCCce----eecCCCcccCCCCCCCCCCCCCCCCCcccCC
Q psy9819 66 CLCNNHST----CVHGIGICDECHDWTTGDHCQYCRAGSYGNA 104 (377)
Q Consensus 66 C~C~~~g~----C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~ 104 (377)
|.|+.+++ |+..+++| .|+++|+|.+|++|+++|++..
T Consensus 2 C~C~~~g~~~~~C~~~~G~C-~C~~~~~G~~C~~C~~g~~~~~ 43 (50)
T cd00055 2 CDCNGHGSLSGQCDPGTGQC-ECKPNTTGRRCDRCAPGYYGLP 43 (50)
T ss_pred CcCcCCCCCCccccCCCCEE-eCCCcCCCCCCCCCCCCCccCC
Confidence 45666554 88889999 7999999999999999999975
No 21
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=97.60 E-value=8.5e-05 Score=49.04 Aligned_cols=29 Identities=55% Similarity=1.420 Sum_probs=27.4
Q ss_pred cccCCCcceecCCCCccCCCccCCCCccC
Q psy9819 125 VCDSITGECICQDNTQGKNCERCLPGYYG 153 (377)
Q Consensus 125 ~C~~~~g~C~C~~g~~G~~C~~C~~G~~g 153 (377)
.|+..+|+|.|+++++|.+|++|++||++
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 12 TCDPDTGQCECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred cccCCCCEEECCCCCCCCCCCcCCCCcCC
Confidence 78877899999999999999999999998
No 22
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. Consensus pattern of the LE domain, which contains four disulphide bonds (schematic of the bond topology omitted): xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx, where 'C': conserved cysteine involved in a disulphide bond; 'a': conserved aromatic residue; 'G': conserved glycine (lower case = less conserved). In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.51 E-value=4.3e-05 Score=51.26 Aligned_cols=41 Identities=49% Similarity=1.258 Sum_probs=31.3
Q ss_pred CCCCCCcCcccccCCCcceecCCCCccCCCccCCCCccCCCC
Q psy9819 115 CNSHGNSVLGVCDSITGECICQDNTQGKNCERCLPGYYGDPT 156 (377)
Q Consensus 115 C~~~g~~~~g~C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~ 156 (377)
|+.++... ..|+..+|+|.|+++|+|++|++|.++|++.+.
T Consensus 3 C~~~~~~~-~~C~~~~G~C~C~~~~~G~~C~~C~~g~~~~~~ 43 (49)
T PF00053_consen 3 CNPHGSSS-QTCDPSTGQCVCKPGTTGPRCDQCKPGYFGLPS 43 (49)
T ss_dssp STTCCBCC-SSEEETCEEESBSTTEESTTS-EE-TTEECSTT
T ss_pred CcCCCCCC-CcccCCCCEEeccccccCCcCcCCCCccccccC
Confidence 44444332 388888999999999999999999999999854
No 23
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. Consensus pattern of the LE domain, which contains four disulphide bonds (schematic of the bond topology omitted): xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx, where 'C': conserved cysteine involved in a disulphide bond; 'a': conserved aromatic residue; 'G': conserved glycine (lower case = less conserved). In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.44 E-value=4.3e-05 Score=51.24 Aligned_cols=38 Identities=39% Similarity=0.878 Sum_probs=30.7
Q ss_pred CCCCCc----eeecCCCcccCCCCCCCCCCCCCCCCCcccCCC
Q psy9819 67 LCNNHS----TCVHGIGICDECHDWTTGDHCQYCRAGSYGNAT 105 (377)
Q Consensus 67 ~C~~~g----~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c 105 (377)
.|+.++ +|++.+++| .|+++|+|.+|+.|+++|++...
T Consensus 2 ~C~~~~~~~~~C~~~~G~C-~C~~~~~G~~C~~C~~g~~~~~~ 43 (49)
T PF00053_consen 2 DCNPHGSSSQTCDPSTGQC-VCKPGTTGPRCDQCKPGYFGLPS 43 (49)
T ss_dssp SSTTCCBCCSSEEETCEEE-SBSTTEESTTS-EE-TTEECSTT
T ss_pred cCcCCCCCCCcccCCCCEE-eccccccCCcCcCCCCccccccC
Confidence 455555 799999999 69999999999999999999854
No 24
>cd00041 CUB CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
Probab=97.43 E-value=0.00057 Score=53.82 Aligned_cols=82 Identities=28% Similarity=0.601 Sum_probs=63.4
Q ss_pred CCccccccccccCCCCCCCCceeEEeecCCCCCc----CCCCceEeeCCCCccccccCcccCCCcccCCcCCCCcCCCcc
Q psy9819 196 SRECLWIIGQSLDSNSTAPADIILLRLQPDINVP----CNENAVYIYDGLPDFVTSVGGTHQSTTLGVFCTEDMHRGYEV 271 (377)
Q Consensus 196 ~~~C~w~i~~~~~~~~~~~~~~~~l~~~~~~~~~----C~~d~~~v~dG~~~~~~~~g~~~~~~~~g~~C~~~~~~~p~~ 271 (377)
..+|.|.|.+ +.+..|.|.|. .++++ |..|++.++||... ....++.+|+.. . |..
T Consensus 25 ~~~C~w~i~~-------~~g~~i~l~f~-~~~l~~~~~C~~d~l~i~~g~~~---------~~~~~~~~Cg~~--~-~~~ 84 (113)
T cd00041 25 NLNCVWTIEA-------PPGYRIRLTFE-DFDLESSPNCSYDYLEIYDGPST---------SSPLLGRFCGST--L-PPP 84 (113)
T ss_pred CCcEEEEEEc-------CCCCEEEEEEe-CcccccCCCCCCcEEEEEcCCCC---------ccccceeeECCC--C-CCC
Confidence 5679999999 55678999998 67766 99999999998752 134567888876 3 567
Q ss_pred eecCCCeeEeccCCCCC--CCCCccccc
Q psy9819 272 LEAKSGVMTIHYKQGKP--SEGFNATYQ 297 (377)
Q Consensus 272 c~~~sG~~~v~~~s~~c--~~GF~g~~~ 297 (377)
..+..+.+.|.|.++.. ..||.+.+.
T Consensus 85 ~~s~~~~~~i~f~s~~~~~~~GF~~~y~ 112 (113)
T cd00041 85 IISSGNSLTVRFRSDSSVTGRGFKATYS 112 (113)
T ss_pred EEecCCEEEEEEEeCCCCCCCCEEEEEE
Confidence 88888899999988765 388887553
No 25
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=97.34 E-value=0.00022 Score=47.05 Aligned_cols=29 Identities=45% Similarity=1.004 Sum_probs=27.1
Q ss_pred eeecCCCcccCCCCCCCCCCCCCCCCCccc
Q psy9819 73 TCVHGIGICDECHDWTTGDHCQYCRAGSYG 102 (377)
Q Consensus 73 ~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g 102 (377)
.|+..+++| .|+++++|.+|++|++||+|
T Consensus 12 ~C~~~~G~C-~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 12 TCDPDTGQC-ECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred cccCCCCEE-ECCCCCCCCCCCcCCCCcCC
Confidence 688889999 79999999999999999999
No 26
>KOG4586|consensus
Probab=97.27 E-value=0.00023 Score=56.44 Aligned_cols=85 Identities=21% Similarity=0.461 Sum_probs=66.1
Q ss_pred CCCccccccccccCCCCCCCCceeEEeecCC----CCCcCCCCceEeeCCCCccccccCcccCCCcccCCcCCCCcCCCc
Q psy9819 195 PSRECLWIIGQSLDSNSTAPADIILLRLQPD----INVPCNENAVYIYDGLPDFVTSVGGTHQSTTLGVFCTEDMHRGYE 270 (377)
Q Consensus 195 p~~~C~w~i~~~~~~~~~~~~~~~~l~~~~~----~~~~C~~d~~~v~dG~~~~~~~~g~~~~~~~~g~~C~~~~~~~p~ 270 (377)
|+++|+.+|.. .+-..+++.|+.. .+-+|+.|++.|.||.-.+ +++++.+|+.. .|.
T Consensus 63 p~r~cv~vi~~-------~p~~~ve~~Fde~y~IEps~EC~fD~iEvrDGpfGF---------SPlI~rfCG~~---nPp 123 (156)
T KOG4586|consen 63 PNRDCVRVIHS-------RPQHDVEVKFDEVYHIEPSYECPFDFIEVRDGPFGF---------SPLIARFCGDR---NPP 123 (156)
T ss_pred CCcceEEeEec-------ccccceEEeeeeeEEecccccCCCCcccccCCCcCc---------cHHHHHHhccC---CCh
Confidence 36789888877 4444566666522 2347999999999988766 89999999987 456
Q ss_pred ceecCCCeeEeccCCCCCC--CCCcccccc
Q psy9819 271 VLEAKSGVMTIHYKQGKPS--EGFNATYQI 298 (377)
Q Consensus 271 ~c~~~sG~~~v~~~s~~c~--~GF~g~~~~ 298 (377)
.+.+....|++.|.++.-. .||+++|.+
T Consensus 124 ~Irs~grFlWIkF~sD~ele~~gfsa~y~~ 153 (156)
T KOG4586|consen 124 EIRSVGRFLWIKFRSDSELEYQGFSAEYAI 153 (156)
T ss_pred hheecCcEEEEEEcccchhhhcccceeeec
Confidence 8999999999999999654 999998765
No 27
>PF00431 CUB: CUB domain; InterPro: IPR000859 The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Proteins containing a CUB domain include: Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system. Cricetidae sp. (Hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium. Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement. Vertebrate enteropeptidase (EC 3.4.21.9), a type II membrane protein of the intestinal brush border, which activates trypsinogen. Vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity. Sea urchin blastula proteins BP10 and SpAN. Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5. Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits. Fibropellins I and III from Strongylocentrotus purpuratus (Purple sea urchin). Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein. Mammalian spermadhesins. Xenopus laevis embryonic protein UVS.2, which is expressed during dorsoanterior development. Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains. Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s). The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites.; PDB: 1SFP_A 3KQ4_B 2WNO_A 2QQK_A 2QQL_A 2QQO_B 2QQM_A 3POJ_A 3POB_A 3POG_B ....
Probab=97.14 E-value=0.00027 Score=55.43 Aligned_cols=80 Identities=26% Similarity=0.554 Sum_probs=59.6
Q ss_pred CCccccccccccCCCCCCCCceeEEeecCCCCCc----CCCCceEeeCCCCccccccCcccCCCcccCCcCCCCcCCCcc
Q psy9819 196 SRECLWIIGQSLDSNSTAPADIILLRLQPDINVP----CNENAVYIYDGLPDFVTSVGGTHQSTTLGVFCTEDMHRGYEV 271 (377)
Q Consensus 196 ~~~C~w~i~~~~~~~~~~~~~~~~l~~~~~~~~~----C~~d~~~v~dG~~~~~~~~g~~~~~~~~g~~C~~~~~~~p~~ 271 (377)
..+|.|.|.+ +++..|.|+|. .++++ |..|++.|+||.... ...++.+|+.. .+..
T Consensus 24 ~~~C~w~i~~-------~~~~~I~l~f~-~~~~~~~~~c~~d~l~v~~g~~~~---------~~~~~~~cg~~---~~~~ 83 (110)
T PF00431_consen 24 NSDCTWTITA-------PPGHRIRLTFL-SFDLESSDSCCQDYLEVYDGNDES---------SPLLGRFCGSS---PPPS 83 (110)
T ss_dssp SEEEEEEEE--------STTEEEEEEEE-EEEB--TTTSTSSEEEEESSSSTT---------SEEEEEESSSS---CCEE
T ss_pred CCcEeEEEEe-------cccceeeeccc-cccceeeeeecccceeEEeecccc---------ceeeeeccCCc---CCcc
Confidence 4679999999 66678999887 56666 899999999977622 45678888743 4567
Q ss_pred eecCCCeeEeccCCCCCC--CCCccc
Q psy9819 272 LEAKSGVMTIHYKQGKPS--EGFNAT 295 (377)
Q Consensus 272 c~~~sG~~~v~~~s~~c~--~GF~g~ 295 (377)
+.+.++.+.|.|.++.-. .||.+.
T Consensus 84 i~s~~~~l~i~f~s~~~~~~~gF~~~ 109 (110)
T PF00431_consen 84 IISSSNSLFIRFHSDSSNSSRGFKAT 109 (110)
T ss_dssp EEESSSEEEEEEEESSSSTTSEEEEE
T ss_pred EEECCCEEEEEEEECCCCCCccEEEE
Confidence 888999999999886543 777654
No 28
>PF00008 EGF: EGF-like domain. This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.05 E-value=0.00036 Score=42.14 Aligned_cols=25 Identities=20% Similarity=0.169 Sum_probs=23.6
Q ss_pred CCCCCCeecccc-cCeeeecCCCeeC
Q psy9819 353 HITHGRTLHYQV-DLIRCTCRQVYLI 377 (377)
Q Consensus 353 ~C~~g~~~~~~~-~~~~c~~~~~~~~ 377 (377)
.|.|+++|++.+ +.|+|+|+++|++
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 799999999999 9999999999974
No 29
>PF00008 EGF: EGF-like domain. This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.05 E-value=0.00018 Score=43.45 Aligned_cols=26 Identities=38% Similarity=0.907 Sum_probs=21.5
Q ss_pred CCCCC-CeecCC--CCeeeecCCCCccCC
Q psy9819 36 NKCIY-GYCKGP--PDYSCQCELGWTGVD 61 (377)
Q Consensus 36 ~~C~~-G~C~~~--~~~~C~C~~G~~G~~ 61 (377)
+||+| |+|+.. .+|.|+|++||+|+.
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 58988 899863 479999999999963
No 30
>smart00051 DSL delta serrate ligand.
Probab=97.00 E-value=0.00061 Score=48.09 Aligned_cols=43 Identities=26% Similarity=0.477 Sum_probs=34.5
Q ss_pred eeeecCCCCccCCCCCCCCC----CCCceeecCCCcccCCCCCCCCCCC
Q psy9819 49 YSCQCELGWTGVDCSVNCLC----NNHSTCVHGIGICDECHDWTTGDHC 93 (377)
Q Consensus 49 ~~C~C~~G~~G~~C~~~C~C----~~~g~C~~~~~~C~~C~~g~~G~~C 93 (377)
+.=.|+++|+|..|+..|.+ .+|.+|+. .|.+ .|.+||+|..|
T Consensus 17 ~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~-~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNK-GCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCE-ecCCCCcCCCC
Confidence 34579999999999988854 67789986 5888 68888888766
No 31
>KOG1217|consensus
Probab=96.99 E-value=0.0092 Score=59.35 Aligned_cols=97 Identities=31% Similarity=0.767 Sum_probs=56.0
Q ss_pred CCCC-CeecCCC-CeeeecCCCCccCCCCCCCCCCCCceeecCCCcccCCCCCCCC-----------------------C
Q psy9819 37 KCIY-GYCKGPP-DYSCQCELGWTGVDCSVNCLCNNHSTCVHGIGICDECHDWTTG-----------------------D 91 (377)
Q Consensus 37 ~C~~-G~C~~~~-~~~C~C~~G~~G~~C~~~C~C~~~g~C~~~~~~C~~C~~g~~G-----------------------~ 91 (377)
+|++ ++|.+.. +|.|.|+++|.|..|+.. .+.++|... ..| .+..++.+ .
T Consensus 178 ~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---~~~~~c~~~-~~~-~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~ 252 (487)
T KOG1217|consen 178 PCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---GNGGTCVDS-VAC-SCPPGARGPECEVSIVECASGDGTCVNTVGSY 252 (487)
T ss_pred CcCCCcccccCCCCeeEeCCCCccCCcCcCC---CCCceEecc-eec-cCCCCCCCCCcccccccccCCCCcccccCCce
Confidence 4766 6787654 588999999998888753 122233221 112 22222222 2
Q ss_pred CCCCCCCCcccCCC----CCCCccCCC-CCCCCCcCcccccCCC--cceecCCCCccCCC
Q psy9819 92 HCQYCRAGSYGNAT----TQEGCRKCD-CNSHGNSVLGVCDSIT--GECICQDNTQGKNC 144 (377)
Q Consensus 92 ~C~~C~~g~~g~~c----~~~~C~~~~-C~~~g~~~~g~C~~~~--g~C~C~~g~~G~~C 144 (377)
.| .|++||.+..+ ..+.|.... |.+++ +|.... ..|.|++||.|..|
T Consensus 253 ~C-~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~-----~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 253 TC-RCPEGYTGDACVTCVDVDSCALIASCPNGG-----TCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred ee-eCCCCccccccceeeeccccCCCCccCCCC-----eeecCCCcceeeCCCCCCCCCC
Confidence 33 24666666652 225555532 55544 776533 58889999999888
No 32
>smart00042 CUB Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
Probab=96.80 E-value=0.0019 Score=50.02 Aligned_cols=81 Identities=30% Similarity=0.578 Sum_probs=57.2
Q ss_pred CCccccccccccCCCCCCCCceeEEeecCCCCC----cCCCCceEeeCCCCccccccCcccCCCcccCCcCCCCcCCCcc
Q psy9819 196 SRECLWIIGQSLDSNSTAPADIILLRLQPDINV----PCNENAVYIYDGLPDFVTSVGGTHQSTTLGVFCTEDMHRGYEV 271 (377)
Q Consensus 196 ~~~C~w~i~~~~~~~~~~~~~~~~l~~~~~~~~----~C~~d~~~v~dG~~~~~~~~g~~~~~~~~g~~C~~~~~~~p~~ 271 (377)
...|.|.|.+ +.+..+.|.|. .+++ .|..|++.++||.... ...++.+|+.. ..+..
T Consensus 15 ~~~C~w~i~~-------~~g~~i~l~f~-~~~l~~~~~C~~d~l~i~~g~~~~---------~~~~~~~Cg~~--~~~~~ 75 (102)
T smart00042 15 NLDCVWTIRA-------PPGYRIELQFT-DFDLESSDNCEYDYVEIYDGPSAS---------SPLLGRFCGSE--LPPPV 75 (102)
T ss_pred CCcEEEEEEC-------CCCeEEEEEEE-EEeccCCCCeeEeEEEEEeCCCCC---------CceeEEEecCc--CCCCe
Confidence 4679999999 45567888886 4443 3778999999976411 34566888876 33445
Q ss_pred eecCCCeeEeccCCCCCC--CCCccc
Q psy9819 272 LEAKSGVMTIHYKQGKPS--EGFNAT 295 (377)
Q Consensus 272 c~~~sG~~~v~~~s~~c~--~GF~g~ 295 (377)
..+..+.+.|.|.++... .||.+.
T Consensus 76 ~~s~~n~~~i~f~s~~~~~~~GF~~~ 101 (102)
T smart00042 76 ISSSSNSLTVTFVSDSSVQKRGFSAR 101 (102)
T ss_pred EEcCCCEEEEEEEeCCCCCCCCeEEE
Confidence 667788899999887644 688754
No 33
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=96.70 E-value=0.0016 Score=39.25 Aligned_cols=25 Identities=44% Similarity=1.023 Sum_probs=16.5
Q ss_pred CCC-CCeecCCCCeeeecCCCCccCCC
Q psy9819 37 KCI-YGYCKGPPDYSCQCELGWTGVDC 62 (377)
Q Consensus 37 ~C~-~G~C~~~~~~~C~C~~G~~G~~C 62 (377)
.|. ||+|+.+ .++|+|++||+|++|
T Consensus 7 ~C~~~G~C~~~-~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP-CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCC-CCEEECCCCCcCCCC
Confidence 353 5777754 347777777777765
No 34
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.70 E-value=0.00077 Score=31.83 Aligned_cols=13 Identities=62% Similarity=1.694 Sum_probs=9.9
Q ss_pred eeecCCCCccCCC
Q psy9819 50 SCQCELGWTGVDC 62 (377)
Q Consensus 50 ~C~C~~G~~G~~C 62 (377)
+|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4889999998876
No 35
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.51 E-value=0.00055 Score=32.32 Aligned_cols=12 Identities=67% Similarity=1.788 Sum_probs=7.0
Q ss_pred ccCCCCCCCCCC
Q psy9819 316 CVCPPRRTGPDC 327 (377)
Q Consensus 316 C~C~~G~~G~~C 327 (377)
|+|++||+|.+|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 566666666655
No 36
>smart00051 DSL delta serrate ligand.
Probab=96.18 E-value=0.0058 Score=43.13 Aligned_cols=42 Identities=31% Similarity=0.626 Sum_probs=26.3
Q ss_pred CCCCcccCCCCCCCccC-CCCCCCCCcCcccccCCCcceecCCCCccCCC
Q psy9819 96 CRAGSYGNATTQEGCRK-CDCNSHGNSVLGVCDSITGECICQDNTQGKNC 144 (377)
Q Consensus 96 C~~g~~g~~c~~~~C~~-~~C~~~g~~~~g~C~~~~g~C~C~~g~~G~~C 144 (377)
|+++|+|..|+ ..|.+ ....++. +|+. .|.++|.+||+|..|
T Consensus 21 C~~~~yG~~C~-~~C~~~~d~~~~~-----~Cd~-~G~~~C~~Gw~G~~C 63 (63)
T smart00051 21 CDENYYGEGCN-KFCRPRDDFFGHY-----TCDE-NGNKGCLEGWMGPYC 63 (63)
T ss_pred CCCCCcCCccC-CEeCcCccccCCc-----cCCc-CCCEecCCCCcCCCC
Confidence 45555555554 22321 1123333 8876 699999999999886
No 37
>KOG1214|consensus
Probab=96.14 E-value=0.011 Score=60.66 Aligned_cols=89 Identities=25% Similarity=0.674 Sum_probs=54.1
Q ss_pred CC-CCCeecC-CCCeeeecCCCCcc----CCCCC--------CC-----CCC--CCceee---cCCCcccCCCCCCCCCC
Q psy9819 37 KC-IYGYCKG-PPDYSCQCELGWTG----VDCSV--------NC-----LCN--NHSTCV---HGIGICDECHDWTTGDH 92 (377)
Q Consensus 37 ~C-~~G~C~~-~~~~~C~C~~G~~G----~~C~~--------~C-----~C~--~~g~C~---~~~~~C~~C~~g~~G~~ 92 (377)
.| .|..|++ +++|+|+|..||.- .+|-. +| .|. ++..|+ .+++.| .|.+||
T Consensus 743 ~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C-~CLPGf---- 817 (1289)
T KOG1214|consen 743 RCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSC-ACLPGF---- 817 (1289)
T ss_pred CCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEE-eecCCc----
Confidence 35 3577886 45999999999873 34442 11 222 233444 235667 455555
Q ss_pred CCCCCCCcccCCC---CCCCccCCCCCCCCCcCcccccCC--CcceecCCCCccCC
Q psy9819 93 CQYCRAGSYGNAT---TQEGCRKCDCNSHGNSVLGVCDSI--TGECICQDNTQGKN 143 (377)
Q Consensus 93 C~~C~~g~~g~~c---~~~~C~~~~C~~~g~~~~g~C~~~--~g~C~C~~g~~G~~ 143 (377)
.|+.- +.++|.+..|.... +|... ...|.|.+||.|.-
T Consensus 818 --------sGDG~~c~dvDeC~psrChp~A-----~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 818 --------SGDGHQCTDVDECSPSRCHPAA-----TCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred --------cCCccccccccccCccccCCCc-----eEecCCCcceeecccCccCCC
Confidence 44432 22677777777665 77643 45999999999853
No 38
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.12 E-value=0.0082 Score=37.51 Aligned_cols=28 Identities=39% Similarity=1.029 Sum_probs=22.4
Q ss_pred CCCCC-CeecCC-CCeeeecCCCCc-cCCCC
Q psy9819 36 NKCIY-GYCKGP-PDYSCQCELGWT-GVDCS 63 (377)
Q Consensus 36 ~~C~~-G~C~~~-~~~~C~C~~G~~-G~~C~ 63 (377)
.+|.+ |+|++. ++|+|.|++||. |..|+
T Consensus 9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 47876 699864 478999999999 88774
No 39
>KOG1388|consensus
Probab=96.04 E-value=0.0036 Score=54.44 Aligned_cols=86 Identities=38% Similarity=0.962 Sum_probs=64.6
Q ss_pred CccCCCCCCCCCCCCceeecCCCcccCCCCCCCCCCCCCCCCCcccCCCCCCCccCCCCCCCCCcCcccccCCCcceec-
Q psy9819 57 WTGVDCSVNCLCNNHSTCVHGIGICDECHDWTTGDHCQYCRAGSYGNATTQEGCRKCDCNSHGNSVLGVCDSITGECIC- 135 (377)
Q Consensus 57 ~~G~~C~~~C~C~~~g~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~~~C~~~~C~~~g~~~~g~C~~~~g~C~C- 135 (377)
|.-..|- .|.|++|+.|.... +|-.|..+.+|.+|+.|..||+|+ .....|.++.|..+.. -|...+++|.|
T Consensus 44 W~fl~cP-~~~cNGh~~c~t~~-v~~~~~N~~~g~~c~kc~~g~~Gd-tN~g~c~~~~~~g~~~----~~~~~~~~c~c~ 116 (217)
T KOG1388|consen 44 WRFLFCP-LCQCNGHSDCNTQH-VCWRCENGTTGAHCEKCIVGFYGD-TNGGKCQPCDCNGGAS----ACVTLTGKCFCT 116 (217)
T ss_pred hhhhcCh-HHHhcCCCCcccce-eeeeccCccccccCCceEEEEEec-CCCCccCHhhhcCCee----eeeccCCccccc
Confidence 4455554 46788888887432 343688899999999999999998 3447788888877653 56667899999
Q ss_pred CCCCccCCCccCCC
Q psy9819 136 QDNTQGKNCERCLP 149 (377)
Q Consensus 136 ~~g~~G~~C~~C~~ 149 (377)
.-++.|..|++|..
T Consensus 117 ~kgvvgd~c~~~e~ 130 (217)
T KOG1388|consen 117 TKGVVGDLCPKCEV 130 (217)
T ss_pred cceEecccCccccc
Confidence 45899999987654
No 40
>KOG3509|consensus
Probab=95.87 E-value=0.017 Score=61.30 Aligned_cols=107 Identities=42% Similarity=1.003 Sum_probs=76.2
Q ss_pred CeeeecCCCCccCCCCC-------------------CCCCCCCc-eeecCCCcccCCCCCCCCCCCCCCCCCcccCCCCC
Q psy9819 48 DYSCQCELGWTGVDCSV-------------------NCLCNNHS-TCVHGIGICDECHDWTTGDHCQYCRAGSYGNATTQ 107 (377)
Q Consensus 48 ~~~C~C~~G~~G~~C~~-------------------~C~C~~~g-~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~ 107 (377)
.-+|.|++|+.|..|+. .|.|+.|+ .|....++|..|..++.|.+|+.|.+||+++.-..
T Consensus 717 ~~~C~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~~~~~nt~~q~~~~~~~~~~~~~g~~~da~~g 796 (964)
T KOG3509|consen 717 VEQCQCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCECNSHISQCEDDLGYNTDCQNNTEGDRCELCSPGTYGDARRG 796 (964)
T ss_pred ccccccCccccCcccccccccccccccCCcCcccCcccccCCCcccccccccccccccccCccceeeecCCCccccCccC
Confidence 34899999999988883 24577776 68888889989999999999999999999987543
Q ss_pred --CCccC-------CCCCCCCCcCcccccCCCcce-ecCCCCccCCCccCCCCccCCCCC
Q psy9819 108 --EGCRK-------CDCNSHGNSVLGVCDSITGEC-ICQDNTQGKNCERCLPGYYGDPTD 157 (377)
Q Consensus 108 --~~C~~-------~~C~~~g~~~~g~C~~~~g~C-~C~~g~~G~~C~~C~~G~~g~~~~ 157 (377)
..+.+ ..+.++.. . .+......| .|+++++|..|+.+..+|++...+
T Consensus 797 ~~~D~~p~~~l~~~~~~~~r~~--l-~~~~~~~~~~~~p~~~~g~~~~~~~~~~~~~atd 853 (964)
T KOG3509|consen 797 TPEDCRPATALTIQCSCNNRSP--L-SCDGFGPGCLLCPHNTEGTTCERVKAGYYGFATD 853 (964)
T ss_pred CcccCCccchhhhhhhhcccCc--c-ccccCCCCcccCCCCccccchhhhccccccccCc
Confidence 22222 01222111 0 111112244 599999999999999999988654
No 41
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern (schematic of the disulphide-bond topology omitted): nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.65 E-value=0.011 Score=38.00 Aligned_cols=25 Identities=16% Similarity=0.214 Sum_probs=23.1
Q ss_pred CCCCCCCeecccccCeeeecCCCee
Q psy9819 352 VHITHGRTLHYQVDLIRCTCRQVYL 376 (377)
Q Consensus 352 ~~C~~g~~~~~~~~~~~c~~~~~~~ 376 (377)
..|....+|+|.+++|+|.|++||.
T Consensus 10 ~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 10 HNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 5788889999999999999999996
No 42
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.56 E-value=0.014 Score=36.34 Aligned_cols=31 Identities=42% Similarity=1.124 Sum_probs=23.2
Q ss_pred ccCCC--CCCCCCCeecCC----cccCCCCCC-CCCCC
Q psy9819 298 IFSCP--DKCPENRTCINN----QCVCPPRRT-GPDCQ 328 (377)
Q Consensus 298 ~~~C~--~~C~~~g~C~~g----~C~C~~G~~-G~~C~ 328 (377)
+++|. .+|.++++|++. +|.|++||+ |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 34564 468888999644 699999998 88875
No 43
>KOG1214|consensus
Probab=95.48 E-value=0.014 Score=60.00 Aligned_cols=92 Identities=21% Similarity=0.356 Sum_probs=61.5
Q ss_pred CCCCCCCccc----ccccCC---CCCCCCCCeecCC----cccCCCCCC--C--CCCCCCC---CCCCccCC--------
Q psy9819 286 GKPSEGFNAT----YQIFSC---PDKCPENRTCINN----QCVCPPRRT--G--PDCQEEI---CPNECHEF-------- 339 (377)
Q Consensus 286 ~~c~~GF~g~----~~~~~C---~~~C~~~g~C~~g----~C~C~~G~~--G--~~C~~~~---C~~~C~~~-------- 339 (377)
+.|..||.+. +++++| ...|..+..|++. +|.|..||. + ..|-... =++.|...
T Consensus 718 cecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g 797 (1289)
T KOG1214|consen 718 CECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAG 797 (1289)
T ss_pred EEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCC
Confidence 3445566651 233445 5669999999654 788877773 3 3553211 12333322
Q ss_pred ----------CceEEeCCCCCCC-------------CCCCCCeecccccCeeeecCCCeeC
Q psy9819 340 ----------LNHGTCDLLLTGV-------------HITHGRTLHYQVDLIRCTCRQVYLI 377 (377)
Q Consensus 340 ----------~~~c~C~~g~~G~-------------~C~~g~~~~~~~~~~~c~~~~~~~~ 377 (377)
.+.|.|.+||.|. .|-.-++|.+..++|.|+|.+||.+
T Consensus 798 ~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~G 858 (1289)
T KOG1214|consen 798 QARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYG 858 (1289)
T ss_pred ceEEEecCCceEEEeecCCccCCccccccccccCccccCCCceEecCCCcceeecccCccC
Confidence 3789999999875 3557889999999999999999963
No 44
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.16 E-value=0.026 Score=34.34 Aligned_cols=28 Identities=36% Similarity=0.961 Sum_probs=21.6
Q ss_pred CCCCCCeecCC-CCeeeecCCCCcc-CCCC
Q psy9819 36 NKCIYGYCKGP-PDYSCQCELGWTG-VDCS 63 (377)
Q Consensus 36 ~~C~~G~C~~~-~~~~C~C~~G~~G-~~C~ 63 (377)
.+|.++.|++. ++|+|.|++||.| ..|+
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCccCCccC
Confidence 36766688764 4889999999999 6663
No 45
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.10 E-value=0.033 Score=34.24 Aligned_cols=28 Identities=39% Similarity=1.011 Sum_probs=21.6
Q ss_pred CCCCC-CeecCC-CCeeeecCCCCccCCCC
Q psy9819 36 NKCIY-GYCKGP-PDYSCQCELGWTGVDCS 63 (377)
Q Consensus 36 ~~C~~-G~C~~~-~~~~C~C~~G~~G~~C~ 63 (377)
.+|.+ +.|.+. +.|+|.|++||.|..|+
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 36875 789764 47899999999998774
No 46
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=94.87 E-value=0.03 Score=34.38 Aligned_cols=30 Identities=40% Similarity=1.128 Sum_probs=22.1
Q ss_pred cCCC--CCCCCCCeecCC----cccCCCCCCCCCCC
Q psy9819 299 FSCP--DKCPENRTCINN----QCVCPPRRTGPDCQ 328 (377)
Q Consensus 299 ~~C~--~~C~~~g~C~~g----~C~C~~G~~G~~C~ 328 (377)
++|. .+|.+++.|++. .|.|++||.|..|+
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 4453 368888899544 69999999998774
No 47
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.38 E-value=0.031 Score=34.62 Aligned_cols=22 Identities=23% Similarity=0.451 Sum_probs=18.9
Q ss_pred CCCCCeecccccCeeeecCCCeeC
Q psy9819 354 ITHGRTLHYQVDLIRCTCRQVYLI 377 (377)
Q Consensus 354 C~~g~~~~~~~~~~~c~~~~~~~~ 377 (377)
|+| .|++..++|+|.|++||.+
T Consensus 8 C~h--~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 8 CSH--ICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp SSS--EEEEETTSEEEE-STTEEE
T ss_pred cCC--CCccCCCceEeECCCCCEE
Confidence 677 9999999999999999974
No 48
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.35 E-value=0.057 Score=32.53 Aligned_cols=27 Identities=41% Similarity=0.994 Sum_probs=21.5
Q ss_pred CCCCC-CeecCC-CCeeeecCCCCccC-CC
Q psy9819 36 NKCIY-GYCKGP-PDYSCQCELGWTGV-DC 62 (377)
Q Consensus 36 ~~C~~-G~C~~~-~~~~C~C~~G~~G~-~C 62 (377)
.+|.+ +.|+.. +.|+|.|++||.|. .|
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 56765 889864 37899999999998 55
No 49
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.26 E-value=0.05 Score=33.07 Aligned_cols=25 Identities=20% Similarity=0.250 Sum_probs=20.8
Q ss_pred CCCCCCCeecccccCeeeecCCCeeC
Q psy9819 352 VHITHGRTLHYQVDLIRCTCRQVYLI 377 (377)
Q Consensus 352 ~~C~~g~~~~~~~~~~~c~~~~~~~~ 377 (377)
..|.++ +|++..++|+|+|+.+|.+
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCcc
Confidence 468888 8998889999999998863
No 50
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=92.98 E-value=0.098 Score=31.45 Aligned_cols=26 Identities=23% Similarity=0.287 Sum_probs=22.7
Q ss_pred CCCCCCCeecccccCeeeecCCCeeC
Q psy9819 352 VHITHGRTLHYQVDLIRCTCRQVYLI 377 (377)
Q Consensus 352 ~~C~~g~~~~~~~~~~~c~~~~~~~~ 377 (377)
..|.+++.|++..+.|+|+|+.+|..
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcc
Confidence 46888899999999999999999963
No 51
>PHA02887 EGF-like protein; Provisional
Probab=92.96 E-value=0.079 Score=41.47 Aligned_cols=28 Identities=36% Similarity=0.860 Sum_probs=22.9
Q ss_pred CCCCCeecCC---CCeeeecCCCCccCCCCC
Q psy9819 37 KCIYGYCKGP---PDYSCQCELGWTGVDCSV 64 (377)
Q Consensus 37 ~C~~G~C~~~---~~~~C~C~~G~~G~~C~~ 64 (377)
-|.||+|.-. ....|.|++||+|..|+.
T Consensus 93 YCiHG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCINGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred EeeCCEEEccccCCCceeECCCCcccCCCCc
Confidence 3789999742 367999999999999984
No 52
>KOG1218|consensus
Probab=92.84 E-value=0.38 Score=45.32 Aligned_cols=82 Identities=29% Similarity=0.669 Sum_probs=59.1
Q ss_pred eec-CCCCccCCCCCCCCCCCC---ceeecCCCcccCCCCCCCCCCCCCCCCCcccCCCCCCCccCCCCCCCCCcCcccc
Q psy9819 51 CQC-ELGWTGVDCSVNCLCNNH---STCVHGIGICDECHDWTTGDHCQYCRAGSYGNATTQEGCRKCDCNSHGNSVLGVC 126 (377)
Q Consensus 51 C~C-~~G~~G~~C~~~C~C~~~---g~C~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c~~~~C~~~~C~~~g~~~~g~C 126 (377)
..| ..+|.|..|..+++|..+ -+|......| .+..++.+..|.. ++++|..|.. .|.+.. .+
T Consensus 92 ~~~~~~~~~g~~C~~~~~~~~~c~~~~C~~~~~~c-~~~~~~~~~~C~~--~~~~g~~C~~------~c~~~~-----~~ 157 (316)
T KOG1218|consen 92 GYCHLNGYEGPQCESPCPCGDGCAEKTCANPRREC-RCGGGYIGEQCGE--ENLVGLKCQR------DCQCTG-----GC 157 (316)
T ss_pred CcccCCCCCcccccCCCCcCCcccccccCCCccce-ecCCcCccccccc--cCCCCCCccC------CCCCcc-----cc
Confidence 344 789999999998877655 5666443357 6888888888865 6888888773 222211 45
Q ss_pred cCCCcceecCCCCccCCCcc
Q psy9819 127 DSITGECICQDNTQGKNCER 146 (377)
Q Consensus 127 ~~~~g~C~C~~g~~G~~C~~ 146 (377)
....+.|.|.+||.|.++..
T Consensus 158 ~~~~~~c~c~~g~~g~~~~~ 177 (316)
T KOG1218|consen 158 DCKNGICTCQPGFVGVFCVE 177 (316)
T ss_pred CCCCCceeccCCcccccccc
Confidence 55578999999999999984
No 53
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=92.33 E-value=0.027 Score=39.71 Aligned_cols=44 Identities=30% Similarity=0.514 Sum_probs=21.5
Q ss_pred CeeeecCCCCccCCCCCCCC----CCCCceeecCCCcccCCCCCCCCCCC
Q psy9819 48 DYSCQCELGWTGVDCSVNCL----CNNHSTCVHGIGICDECHDWTTGDHC 93 (377)
Q Consensus 48 ~~~C~C~~G~~G~~C~~~C~----C~~~g~C~~~~~~C~~C~~g~~G~~C 93 (377)
.++-.|.+.|+|..|++.|. -.+|-+|+. .|.= .|.+||+|+.|
T Consensus 16 ~~rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~-~G~~-~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSKFCKPRDDSFGHYTCDS-NGNK-VCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETTTT-EE---EEETTEEEEE-S-S--E-EE-TTEESTTS
T ss_pred EEEEECCCCCCCccccCCcCCCcCCcCCcccCC-CCCC-CCCCCCcCCCC
Confidence 45778999999999998762 234557874 4554 46777777765
No 54
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=92.10 E-value=0.11 Score=41.47 Aligned_cols=37 Identities=32% Similarity=0.731 Sum_probs=28.3
Q ss_pred eecCCCcCC----CCCCCeecC---CCCeeeecCCCCccCCCCC
Q psy9819 28 IFNASLCYN----KCIYGYCKG---PPDYSCQCELGWTGVDCSV 64 (377)
Q Consensus 28 ~~~~~~C~~----~C~~G~C~~---~~~~~C~C~~G~~G~~C~~ 64 (377)
.-....|+. -|.||+|.- ...+.|.|+.||+|.+|+.
T Consensus 39 ~~~i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 39 IPAIRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CcccccCChhhCCEeECCEEEeeccCCCceeECCCCcccccccc
Confidence 344566764 488999973 3477999999999999984
No 55
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=90.94 E-value=0.19 Score=31.05 Aligned_cols=25 Identities=20% Similarity=0.191 Sum_probs=19.2
Q ss_pred CCCCCCeecccccCeeeecCCCeeC
Q psy9819 353 HITHGRTLHYQVDLIRCTCRQVYLI 377 (377)
Q Consensus 353 ~C~~g~~~~~~~~~~~c~~~~~~~~ 377 (377)
.|..-++|++..++|+|+|++||.+
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~G 31 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEG 31 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEEC
T ss_pred CCCCCcEeecCCCCEEeECCCCCcc
Confidence 4667789999999999999999974
No 56
>KOG3509|consensus
Probab=90.46 E-value=0.81 Score=49.08 Aligned_cols=78 Identities=36% Similarity=0.777 Sum_probs=60.6
Q ss_pred ecCCCcccCCCCCCCCCCCCCCCCCcccCCC---CCCCccCCCCCCCCCcCcccccCCCccee-cCCCCccCCCccCCCC
Q psy9819 75 VHGIGICDECHDWTTGDHCQYCRAGSYGNAT---TQEGCRKCDCNSHGNSVLGVCDSITGECI-CQDNTQGKNCERCLPG 150 (377)
Q Consensus 75 ~~~~~~C~~C~~g~~G~~C~~C~~g~~g~~c---~~~~C~~~~C~~~g~~~~g~C~~~~g~C~-C~~g~~G~~C~~C~~G 150 (377)
....-+| +|+.++.|.+|+.|.++|.-... ....+..+.|..|.. .|....+.|. |...+.|.+|+.|.+|
T Consensus 714 ~~~~~~C-~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~~~~h~~----~c~~~~~~nt~~q~~~~~~~~~~~~~g 788 (964)
T KOG3509|consen 714 AAEVEQC-QCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCECNSHIS----QCEDDLGYNTDCQNNTEGDRCELCSPG 788 (964)
T ss_pred hhhcccc-ccCccccCcccccccccccccccCCcCcccCcccccCCCcc----cccccccccccccccCccceeeecCCC
Confidence 3456689 79999999999999999865542 224555667776663 6766667775 8899999999999999
Q ss_pred ccCCCCC
Q psy9819 151 YYGDPTD 157 (377)
Q Consensus 151 ~~g~~~~ 157 (377)
++++..-
T Consensus 789 ~~~da~~ 795 (964)
T KOG3509|consen 789 TYGDARR 795 (964)
T ss_pred ccccCcc
Confidence 9998764
No 57
>PHA02887 EGF-like protein; Provisional
Probab=88.69 E-value=0.3 Score=38.27 Aligned_cols=25 Identities=36% Similarity=0.841 Sum_probs=19.7
Q ss_pred CCCCCeec--C----CcccCCCCCCCCCCCCC
Q psy9819 305 CPENRTCI--N----NQCVCPPRRTGPDCQEE 330 (377)
Q Consensus 305 C~~~g~C~--~----g~C~C~~G~~G~~C~~~ 330 (377)
|. ||+|. . -.|.|+.||+|.+|+..
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCCcc
Confidence 77 67992 1 26999999999999863
No 58
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.48 E-value=0.21 Score=30.86 Aligned_cols=24 Identities=29% Similarity=0.854 Sum_probs=17.2
Q ss_pred CC-CCCeecCC-CCeeeecCCCCccC
Q psy9819 37 KC-IYGYCKGP-PDYSCQCELGWTGV 60 (377)
Q Consensus 37 ~C-~~G~C~~~-~~~~C~C~~G~~G~ 60 (377)
.| .|.+|++. ++|+|+|++||.|+
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC
Confidence 35 46888864 38999999999985
No 59
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=87.98 E-value=0.33 Score=27.06 Aligned_cols=11 Identities=27% Similarity=0.706 Sum_probs=9.4
Q ss_pred CeeeecCCCee
Q psy9819 366 LIRCTCRQVYL 376 (377)
Q Consensus 366 ~~~c~~~~~~~ 376 (377)
+|+|+|++||.
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 58899999986
No 60
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx (disulphide bonds: C1-C3, C2-C4, C5-C6), where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=87.27 E-value=0.21 Score=31.95 Aligned_cols=26 Identities=50% Similarity=1.205 Sum_probs=19.7
Q ss_pred ccCCC---CCCCCCCeecCC----cccCCCCCC
Q psy9819 298 IFSCP---DKCPENRTCINN----QCVCPPRRT 323 (377)
Q Consensus 298 ~~~C~---~~C~~~g~C~~g----~C~C~~G~~ 323 (377)
+++|. ..|..+++|++. +|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 56672 358889999654 799999997
No 61
>KOG1218|consensus
Probab=85.89 E-value=8.6 Score=35.99 Aligned_cols=126 Identities=21% Similarity=0.477 Sum_probs=61.5
Q ss_pred CeeeecCCCCccCCCCCCCCCCCC-ceeecCCCcccCCCCCCCCCCCCCC-CCCcccCCCCCCCccCCCCCCCCCcCccc
Q psy9819 48 DYSCQCELGWTGVDCSVNCLCNNH-STCVHGIGICDECHDWTTGDHCQYC-RAGSYGNATTQEGCRKCDCNSHGNSVLGV 125 (377)
Q Consensus 48 ~~~C~C~~G~~G~~C~~~C~C~~~-g~C~~~~~~C~~C~~g~~G~~C~~C-~~g~~g~~c~~~~C~~~~C~~~g~~~~g~ 125 (377)
..+|.+..+|.|..|.+++..... +.|... ..| .....+..... .| ..+|.|..|.. .++|... ... .+
T Consensus 48 ~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~-~~c-~~~~~~~~~~~-~~~~~~~~g~~C~~----~~~~~~~-c~~-~~ 118 (316)
T KOG1218|consen 48 SGECGLGYGFVGSVCRIECVCGNAGGGCSQP-CRC-KNGGTCVSSTG-YCHLNGYEGPQCES----PCPCGDG-CAE-KT 118 (316)
T ss_pred ceeEecccccCCCccccccccCCCCCcccCc-ccc-CCCCcccCCCC-cccCCCCCcccccC----CCCcCCc-ccc-cc
Confidence 568899999999988875533222 222211 111 11111111111 12 34455544442 2222211 000 15
Q ss_pred ccCCCcceecCCCCccCCCccCCCCccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCCCCccCCC
Q psy9819 126 CDSITGECICQDNTQGKNCERCLPGYYGDPTDGGTCYYQCMARGMLTGPGPQGLGSGLAERNAWEGKDT 194 (377)
Q Consensus 126 C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~~~~~C~~~C~~~g~~~~~~~~~~g~~~~c~~G~~G~~C 194 (377)
|......|.+..+|.+..|.. ++++|. .|...|.+.. .+.-..+.+ .+.+||.|..+
T Consensus 119 C~~~~~~c~~~~~~~~~~C~~--~~~~g~-----~C~~~c~~~~----~~~~~~~~c-~c~~g~~g~~~ 175 (316)
T KOG1218|consen 119 CANPRRECRCGGGYIGEQCGE--ENLVGL-----KCQRDCQCTG----GCDCKNGIC-TCQPGFVGVFC 175 (316)
T ss_pred cCCCccceecCCcCccccccc--cCCCCC-----CccCCCCCcc----ccCCCCCce-eccCCcccccc
Confidence 554223688899999998875 677776 4544442222 111111223 37889999887
No 62
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=85.11 E-value=0.26 Score=34.73 Aligned_cols=39 Identities=26% Similarity=0.498 Sum_probs=16.1
Q ss_pred cccCCCCCCCCCCCCCCCCC-------CccCCCceEEeCCCCCCCCC
Q psy9819 315 QCVCPPRRTGPDCQEEICPN-------ECHEFLNHGTCDLLLTGVHI 354 (377)
Q Consensus 315 ~C~C~~G~~G~~C~~~~C~~-------~C~~~~~~c~C~~g~~G~~C 354 (377)
+-.|.+.|.|.+|++.--|. .|.. .|+=+|.+||+|..|
T Consensus 18 rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~-~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 18 RVVCDENYYGPNCSKFCKPRDDSFGHYTCDS-NGNKVCLPGWTGPNC 63 (63)
T ss_dssp -----TTEETTTT-EE---EEETTEEEEE-S-S--EEE-TTEESTTS
T ss_pred EEECCCCCCCccccCCcCCCcCCcCCcccCC-CCCCCCCCCCcCCCC
Confidence 34566777777776531121 2332 355677777777765
No 63
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=82.71 E-value=1.2 Score=26.94 Aligned_cols=24 Identities=21% Similarity=0.488 Sum_probs=19.2
Q ss_pred CCCCCCCCCCccCC-CceEEeCCCC
Q psy9819 326 DCQEEICPNECHEF-LNHGTCDLLL 349 (377)
Q Consensus 326 ~C~~~~C~~~C~~~-~~~c~C~~g~ 349 (377)
.|++..||..|..+ .+.|.|++||
T Consensus 2 fCn~t~CpA~CDpn~~~~C~CPeGy 26 (34)
T PF09064_consen 2 FCNQTECPADCDPNSPGQCFCPEGY 26 (34)
T ss_pred ccccccCCCccCCCCCCceeCCCce
Confidence 57777888888775 5789999988
No 64
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system.; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=82.25 E-value=0.37 Score=32.41 Aligned_cols=28 Identities=25% Similarity=0.655 Sum_probs=17.2
Q ss_pred CCCCCCCee-cCC-------cccCCCCCCCCCCCCC
Q psy9819 303 DKCPENRTC-INN-------QCVCPPRRTGPDCQEE 330 (377)
Q Consensus 303 ~~C~~~g~C-~~g-------~C~C~~G~~G~~C~~~ 330 (377)
-+|++||+- +++ .|.|..-|.|.+|++.
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~ 52 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTL 52 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcccC
Confidence 359999988 344 6999999999999874
No 65
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.24 E-value=1.3 Score=35.43 Aligned_cols=25 Identities=36% Similarity=0.777 Sum_probs=18.4
Q ss_pred CCCCCeec--C----CcccCCCCCCCCCCCCC
Q psy9819 305 CPENRTCI--N----NQCVCPPRRTGPDCQEE 330 (377)
Q Consensus 305 C~~~g~C~--~----g~C~C~~G~~G~~C~~~ 330 (377)
|.++ +|. . -.|+|+.||+|.+||..
T Consensus 53 ClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 53 CLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred eECC-EEEeeccCCCceeECCCCcccccccce
Confidence 7764 782 1 16999999999999863
No 66
>KOG3516|consensus
Probab=79.45 E-value=1.6 Score=47.31 Aligned_cols=60 Identities=23% Similarity=0.574 Sum_probs=38.9
Q ss_pred EEecCCCCCcccccc-------cceeeeEeeecC-CCc-CCCCCC-CeecCCC-CeeeecC-CCCccCCCCC
Q psy9819 5 FRISGLTTAKDDALS-------RCTVLLLYIFNA-SLC-YNKCIY-GYCKGPP-DYSCQCE-LGWTGVDCSV 64 (377)
Q Consensus 5 ~~~~~~~~~~~~~~~-------~~~~~~~~~~~~-~~C-~~~C~~-G~C~~~~-~~~C~C~-~G~~G~~C~~ 64 (377)
||++..+.+.+++.. -+.++++-+..+ ..| ||+|+| |.|.... .|.|.|. .||+|..|..
T Consensus 511 mrli~vd~~~~~l~~v~~~~~g~~~~v~id~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 511 MRLIKVDGQLKDLIDVKQGSLGNFSDVQIDMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred eEEEEECCeEeeeeeeeccccccccceeecccccccccCCccccCCCcccccccceeEeccccccccccccC
Confidence 566655555555544 122233332222 344 379999 7798754 8999999 9999999995
No 67
>KOG1388|consensus
Probab=77.15 E-value=1.7 Score=38.17 Aligned_cols=75 Identities=29% Similarity=0.664 Sum_probs=46.9
Q ss_pred cCCCCCCCCCcCcccccCCCcceecCCCCccCCCccCCCCccCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCcccCCCCC
Q psy9819 111 RKCDCNSHGNSVLGVCDSITGECICQDNTQGKNCERCLPGYYGDPTDGGTCYY-QCMARGMLTGPGPQGLGSGLAERNAW 189 (377)
Q Consensus 111 ~~~~C~~~g~~~~g~C~~~~g~C~C~~g~~G~~C~~C~~G~~g~~~~~~~C~~-~C~~~g~~~~~~~~~~g~~~~c~~G~ 189 (377)
..+.|++++ .|.....--.|+.+-+|..|+.|.+||+|+ .+++.|.. +|+.... .+..-++.+.--..|.
T Consensus 50 P~~~cNGh~-----~c~t~~v~~~~~N~~~g~~c~kc~~g~~Gd-tN~g~c~~~~~~g~~~---~~~~~~~~c~c~~kgv 120 (217)
T KOG1388|consen 50 PLCQCNGHS-----DCNTQHVCWRCENGTTGAHCEKCIVGFYGD-TNGGKCQPCDCNGGAS---ACVTLTGKCFCTTKGV 120 (217)
T ss_pred hHHHhcCCC-----CcccceeeeeccCccccccCCceEEEEEec-CCCCccCHhhhcCCee---eeeccCCccccccceE
Confidence 345566555 565433333588999999999999999998 77777776 4444331 1222233333224577
Q ss_pred ccCCC
Q psy9819 190 EGKDT 194 (377)
Q Consensus 190 ~G~~C 194 (377)
.|+.|
T Consensus 121 vgd~c 125 (217)
T KOG1388|consen 121 VGDLC 125 (217)
T ss_pred ecccC
Confidence 77777
No 68
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=73.83 E-value=1.9 Score=33.43 Aligned_cols=27 Identities=33% Similarity=1.077 Sum_probs=20.2
Q ss_pred CCCCCCCeecCC---------cccCCC-------------CCCCCCCCC
Q psy9819 303 DKCPENRTCINN---------QCVCPP-------------RRTGPDCQE 329 (377)
Q Consensus 303 ~~C~~~g~C~~g---------~C~C~~-------------G~~G~~C~~ 329 (377)
++|++||.|+.. .|+|.+ .|.|..|+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqK 61 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQK 61 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccc
Confidence 569999999654 588877 566777765
No 69
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=69.37 E-value=4.2 Score=27.12 Aligned_cols=20 Identities=35% Similarity=1.066 Sum_probs=17.6
Q ss_pred CCCCCCeecCCcccCCCCCC
Q psy9819 304 KCPENRTCINNQCVCPPRRT 323 (377)
Q Consensus 304 ~C~~~g~C~~g~C~C~~G~~ 323 (377)
.|..+..|++++|.|++||.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcCCEeECCCCCE
Confidence 47788999999999999985
No 70
>KOG3607|consensus
Probab=66.68 E-value=4 Score=42.98 Aligned_cols=32 Identities=31% Similarity=0.734 Sum_probs=26.9
Q ss_pred cCCCCCCCCCCeecCC-cccCCCCCCCCCCCCC
Q psy9819 299 FSCPDKCPENRTCINN-QCVCPPRRTGPDCQEE 330 (377)
Q Consensus 299 ~~C~~~C~~~g~C~~g-~C~C~~G~~G~~C~~~ 330 (377)
..|+..|++||+|.+. .|.|.+||.+.+|++.
T Consensus 626 ~~~~~~C~g~GVCnn~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 626 SCCPTTCNGHGVCNNELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred cccccccCCCcccCCCcceeeCCCCCCCccccc
Confidence 4566779999999665 8999999999999875
No 71
>KOG0196|consensus
Probab=64.63 E-value=8.9 Score=40.48 Aligned_cols=56 Identities=30% Similarity=0.659 Sum_probs=37.4
Q ss_pred CCcccCCCCCCC----CCCCCCCCCCcccCCCCCCCccCCCCCCCCCcCcccccCCCcceecCCCCc
Q psy9819 78 IGICDECHDWTT----GDHCQYCRAGSYGNATTQEGCRKCDCNSHGNSVLGVCDSITGECICQDNTQ 140 (377)
Q Consensus 78 ~~~C~~C~~g~~----G~~C~~C~~g~~g~~c~~~~C~~~~C~~~g~~~~g~C~~~~g~C~C~~g~~ 140 (377)
.|.| .|.+||. |..|+.|++|+|-..-....|.+|+-+.+.. ....-.|.|..||.
T Consensus 258 iG~C-~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~------~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 258 IGGC-VCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSS------SEGATSCTCENGYY 317 (996)
T ss_pred cCce-eecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCC------CCCCCcccccCCcc
Confidence 5789 7999995 5789999999988766556777665444321 11123677666663
No 72
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=63.23 E-value=17 Score=28.43 Aligned_cols=28 Identities=25% Similarity=0.632 Sum_probs=19.0
Q ss_pred CCcC--CCC-CCCeecCCCCeeeecCCCCcc
Q psy9819 32 SLCY--NKC-IYGYCKGPPDYSCQCELGWTG 59 (377)
Q Consensus 32 ~~C~--~~C-~~G~C~~~~~~~C~C~~G~~G 59 (377)
..|. ..| .+|.|+......|.|.+||.-
T Consensus 78 d~Cd~y~~CG~~g~C~~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNSNNSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCccccCCccEeCCCCCCceECCCCcCC
Confidence 3555 356 358887655567999999863
No 73
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways; they are involved in the inflammatory response, apoptosis, autoimmunity and organogenesis. TNFR domains are elongated, generally with three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=51.64 E-value=37 Score=25.97 Aligned_cols=48 Identities=23% Similarity=0.640 Sum_probs=21.2
Q ss_pred CCCCCCCCCcccCCCCC-CCccCCC-CCCCCCcCcccccCC-CcceecCCCC
Q psy9819 91 DHCQYCRAGSYGNATTQ-EGCRKCD-CNSHGNSVLGVCDSI-TGECICQDNT 139 (377)
Q Consensus 91 ~~C~~C~~g~~g~~c~~-~~C~~~~-C~~~g~~~~g~C~~~-~g~C~C~~g~ 139 (377)
..|+.|++|+|-+.-.. ..|.++. |. .+......|... +.+|.|.+||
T Consensus 33 t~C~~C~~g~ys~~~~~~~~C~~c~~C~-~g~~~~~~ct~t~dt~C~C~~G~ 83 (98)
T cd00185 33 TVCEPCPPGTYTDSWNHLPKCLSCRTCD-SGLVEKAPCTATRNTVCGCKPGF 83 (98)
T ss_pred CeecCCCCCCcccCCCCCCcCCcCccCC-CCCEEEccCCCCCCCeEeCCCCC
Confidence 34556777766554332 2344332 33 222222233322 2356666655
No 74
>KOG3607|consensus
Probab=41.40 E-value=19 Score=38.13 Aligned_cols=27 Identities=26% Similarity=0.536 Sum_probs=19.3
Q ss_pred CCCCCceeecCCCcccCCCCCCCCCCCCC
Q psy9819 67 LCNNHSTCVHGIGICDECHDWTTGDHCQY 95 (377)
Q Consensus 67 ~C~~~g~C~~~~~~C~~C~~g~~G~~C~~ 95 (377)
.|++||.|+. ...| .|.++|+++.|+.
T Consensus 631 ~C~g~GVCnn-~~~C-hC~~gwapp~C~~ 657 (716)
T KOG3607|consen 631 TCNGHGVCNN-ELNC-HCEPGWAPPFCFI 657 (716)
T ss_pred ccCCCcccCC-Ccce-eeCCCCCCCcccc
Confidence 4777777764 4567 6888888888864
No 75
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=40.39 E-value=19 Score=31.98 Aligned_cols=23 Identities=13% Similarity=0.248 Sum_probs=18.8
Q ss_pred CCCCCCeecccccCeeeecCCCeeC
Q psy9819 353 HITHGRTLHYQVDLIRCTCRQVYLI 377 (377)
Q Consensus 353 ~C~~g~~~~~~~~~~~c~~~~~~~~ 377 (377)
.|.+ +|++.+++|.|.|+.||+.
T Consensus 196 ~c~~--~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 196 VCQQ--VCISTPGSYLCACTEGYAL 218 (224)
T ss_pred Cccc--eEEcCCCCEEeECCCCccC
Confidence 3554 7999999999999999964
No 76
>KOG3514|consensus
Probab=35.41 E-value=24 Score=38.47 Aligned_cols=31 Identities=26% Similarity=0.771 Sum_probs=25.5
Q ss_pred CC-CCCCCCCCeecCC----cccCC-CCCCCCCCCCC
Q psy9819 300 SC-PDKCPENRTCINN----QCVCP-PRRTGPDCQEE 330 (377)
Q Consensus 300 ~C-~~~C~~~g~C~~g----~C~C~-~G~~G~~C~~~ 330 (377)
.| +++|.|+|+|.++ .|.|. .+|.|..||.+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccce
Confidence 46 5779999999877 68885 68999999875
No 77
>KOG3516|consensus
Probab=31.74 E-value=29 Score=38.27 Aligned_cols=40 Identities=25% Similarity=0.745 Sum_probs=30.7
Q ss_pred cCC-CCCCCCCCeecCC----cccCC-CCCCCCCCCCCCCCCCccC
Q psy9819 299 FSC-PDKCPENRTCINN----QCVCP-PRRTGPDCQEEICPNECHE 338 (377)
Q Consensus 299 ~~C-~~~C~~~g~C~~g----~C~C~-~G~~G~~C~~~~C~~~C~~ 338 (377)
+.| |++|.++|.|... .|.|. .||.|..|...+=+..|+.
T Consensus 546 drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e~SCea 591 (1306)
T KOG3516|consen 546 DRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYELSCEA 591 (1306)
T ss_pred cccCCccccCCCcccccccceeEeccccccccccccCCCcchhhHH
Confidence 356 6779999999433 79998 9999999998665555543
No 78
>KOG0196|consensus
Probab=28.83 E-value=1.9e+02 Score=31.10 Aligned_cols=33 Identities=33% Similarity=0.938 Sum_probs=24.8
Q ss_pred CcceecCCCC----ccCCCccCCCCccCCCCCCCCCC
Q psy9819 130 TGECICQDNT----QGKNCERCLPGYYGDPTDGGTCY 162 (377)
Q Consensus 130 ~g~C~C~~g~----~G~~C~~C~~G~~g~~~~~~~C~ 162 (377)
.|.|.|++|| .|..|+.|++|+|-.......|.
T Consensus 258 iG~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~ 294 (996)
T KOG0196|consen 258 IGGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCL 294 (996)
T ss_pred cCceeecCCCCcccCCCcceeCCCCcccCCCCCCCCC
Confidence 4799999998 46889999999996644333443
No 79
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=25.15 E-value=50 Score=20.51 Aligned_cols=25 Identities=12% Similarity=0.090 Sum_probs=16.0
Q ss_pred CCCCCCCeecccc-cCeeeecCCCee
Q psy9819 352 VHITHGRTLHYQV-DLIRCTCRQVYL 376 (377)
Q Consensus 352 ~~C~~g~~~~~~~-~~~~c~~~~~~~ 376 (377)
..|..-+.|+..- +.+.|+|..||.
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 3455566788666 788899999884
No 80
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability; however, its actual function is unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=21.79 E-value=38 Score=21.74 Aligned_cols=16 Identities=19% Similarity=0.326 Sum_probs=13.4
Q ss_pred EEEecCCCCC-cccccc
Q psy9819 4 IFRISGLTTA-KDDALS 19 (377)
Q Consensus 4 ~~~~~~~~~~-~~~~~~ 19 (377)
||++.||++. +.|||-
T Consensus 23 iYtaFGppSk~LrDPfe 39 (43)
T PF02468_consen 23 IYTAFGPPSKELRDPFE 39 (43)
T ss_pred hhheeCCCccccCCccc
Confidence 7899999888 888873
Done!