Query psy9419
Match_columns 739
No_of_seqs 449 out of 2689
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 18:45:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9419.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9419hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.7 3.3E-15 7.2E-20 163.0 22.1 236 195-489 700-947 (1289)
2 KOG1214|consensus 99.7 2.3E-15 5E-20 164.2 18.4 209 27-281 693-910 (1289)
3 KOG1217|consensus 99.7 7E-15 1.5E-19 166.7 23.5 276 5-384 106-389 (487)
4 KOG1217|consensus 99.6 1.3E-13 2.8E-18 156.3 30.5 320 159-612 99-433 (487)
5 KOG4289|consensus 99.6 2E-14 4.3E-19 163.7 19.5 84 131-217 1223-1308(2531)
6 KOG4289|consensus 99.6 2.7E-14 5.8E-19 162.7 15.3 104 315-440 1180-1308(2531)
7 KOG1219|consensus 99.3 5.8E-12 1.3E-16 148.9 8.6 109 150-282 3865-3976(4289)
8 KOG1219|consensus 99.2 2.2E-11 4.7E-16 144.3 8.0 113 243-384 3859-3974(4289)
9 KOG1225|consensus 99.2 3.7E-10 8.1E-15 123.6 14.3 192 432-673 160-365 (525)
10 KOG0994|consensus 99.1 2E-09 4.3E-14 122.0 16.5 198 334-603 933-1143(1758)
11 KOG1225|consensus 99.1 2.2E-09 4.7E-14 117.7 16.5 122 481-638 232-363 (525)
12 KOG0994|consensus 99.0 1.3E-08 2.8E-13 115.7 17.3 30 522-562 1126-1155(1758)
13 KOG4260|consensus 98.8 3.9E-09 8.4E-14 102.7 6.2 163 51-278 130-304 (350)
14 KOG4260|consensus 98.6 1.1E-07 2.5E-12 92.6 6.7 127 431-602 166-305 (350)
15 KOG1836|consensus 98.4 0.00036 7.7E-09 87.2 31.7 216 486-723 760-1033(1705)
16 PF07645 EGF_CA: Calcium-bindi 98.2 4.7E-07 1E-11 65.0 1.8 36 67-102 1-36 (42)
17 KOG1836|consensus 98.0 0.00016 3.6E-09 90.1 19.0 110 426-549 903-1026(1705)
18 KOG1226|consensus 97.9 9.8E-05 2.1E-09 83.0 12.3 99 155-284 514-621 (783)
19 PF07645 EGF_CA: Calcium-bindi 97.9 8.6E-06 1.9E-10 58.5 2.4 35 506-540 1-35 (42)
20 KOG1226|consensus 97.8 7.7E-05 1.7E-09 83.8 9.9 149 515-693 467-636 (783)
21 PF06247 Plasmod_Pvs28: Plasmo 97.5 2.6E-05 5.7E-10 72.9 1.0 143 34-222 7-163 (197)
22 PF12947 EGF_3: EGF domain; I 97.5 4.2E-05 9.1E-10 52.5 1.0 33 71-103 1-33 (36)
23 smart00179 EGF_CA Calcium-bind 97.4 0.00021 4.5E-09 50.2 4.1 35 569-603 1-36 (39)
24 PF12662 cEGF: Complement Clr- 97.4 0.00012 2.6E-09 44.9 2.2 23 48-70 1-24 (24)
25 PF00008 EGF: EGF-like domain 97.4 8.7E-05 1.9E-09 49.7 1.7 29 575-603 2-31 (32)
26 PF00008 EGF: EGF-like domain 97.3 0.00012 2.7E-09 48.9 1.9 30 191-221 1-31 (32)
27 smart00179 EGF_CA Calcium-bind 97.2 0.00047 1E-08 48.4 4.3 37 246-282 1-38 (39)
28 PF12947 EGF_3: EGF domain; I 97.2 0.00018 3.9E-09 49.3 1.3 30 513-542 4-33 (36)
29 PF12662 cEGF: Complement Clr- 97.0 0.00063 1.4E-08 41.8 2.6 24 209-249 1-24 (24)
30 cd00054 EGF_CA Calcium-binding 96.8 0.0016 3.5E-08 45.1 4.0 34 570-603 2-35 (38)
31 cd00054 EGF_CA Calcium-binding 96.6 0.0031 6.7E-08 43.6 4.1 36 247-282 2-37 (38)
32 PF06247 Plasmod_Pvs28: Plasmo 96.3 0.0025 5.4E-08 59.9 2.6 95 4-103 65-163 (197)
33 KOG1218|consensus 96.0 1.1 2.3E-05 47.7 21.6 84 434-540 125-209 (316)
34 PF14670 FXa_inhibition: Coagu 95.9 0.0035 7.6E-08 42.9 1.2 28 74-103 4-31 (36)
35 cd00053 EGF Epidermal growth f 95.9 0.012 2.6E-07 40.0 3.9 28 576-603 5-32 (36)
36 cd00053 EGF Epidermal growth f 95.8 0.011 2.4E-07 40.1 3.4 28 75-102 5-32 (36)
37 smart00181 EGF Epidermal growt 95.8 0.014 3E-07 39.7 3.8 26 577-603 6-31 (35)
38 smart00181 EGF Epidermal growt 95.4 0.016 3.5E-07 39.4 3.1 26 76-102 6-31 (35)
39 PF07974 EGF_2: EGF-like domai 95.3 0.016 3.5E-07 38.6 2.6 25 650-674 6-32 (32)
40 KOG1218|consensus 95.2 0.9 1.9E-05 48.3 17.3 49 430-494 12-60 (316)
41 PF07974 EGF_2: EGF-like domai 94.9 0.032 6.9E-07 37.2 3.1 25 33-60 6-30 (32)
42 PF12661 hEGF: Human growth fa 94.0 0.021 4.5E-07 29.7 0.5 13 662-674 1-13 (13)
43 PF14670 FXa_inhibition: Coagu 92.8 0.078 1.7E-06 36.4 2.0 24 260-283 10-33 (36)
44 cd01475 vWA_Matrilin VWA_Matri 91.6 0.16 3.5E-06 51.3 3.5 38 64-103 183-220 (224)
45 smart00051 DSL delta serrate l 91.2 0.22 4.8E-06 39.0 3.1 21 654-674 42-63 (63)
46 PF01683 EB: EB module; Inter 87.0 1.2 2.5E-05 33.4 4.3 43 13-60 1-48 (52)
47 smart00051 DSL delta serrate l 84.7 1.1 2.5E-05 35.0 3.3 43 661-705 17-59 (63)
48 PF01683 EB: EB module; Inter 82.5 1.9 4.2E-05 32.2 3.8 22 650-671 26-47 (52)
49 PF12946 EGF_MSP1_1: MSP1 EGF 82.5 1.1 2.4E-05 30.7 2.1 31 29-60 2-32 (37)
50 PTZ00214 high cysteine membran 82.2 66 0.0014 38.8 18.1 15 269-283 682-696 (800)
51 cd00055 EGF_Lam Laminin-type e 80.6 2.1 4.5E-05 31.8 3.3 31 522-563 13-43 (50)
52 cd01475 vWA_Matrilin VWA_Matri 80.5 1.6 3.5E-05 43.9 3.6 37 184-222 183-220 (224)
53 PF00053 Laminin_EGF: Laminin 80.3 1.2 2.5E-05 32.9 1.8 31 521-562 11-41 (49)
54 PF12946 EGF_MSP1_1: MSP1 EGF 79.2 1.1 2.4E-05 30.7 1.2 28 576-603 4-32 (37)
55 smart00180 EGF_Lam Laminin-typ 73.9 3.9 8.4E-05 29.8 3.0 25 522-548 12-36 (46)
56 PF09064 Tme5_EGF_like: Thromb 70.1 4.6 9.9E-05 27.1 2.3 23 39-63 10-32 (34)
57 PF00954 S_locus_glycop: S-loc 68.6 5.4 0.00012 35.1 3.4 34 569-603 76-109 (110)
58 KOG3512|consensus 64.8 14 0.00029 40.4 5.9 28 423-450 404-431 (592)
59 PF03302 VSP: Giardia variant- 62.2 30 0.00064 38.2 8.3 52 52-109 3-56 (397)
60 PHA02887 EGF-like protein; Pro 57.6 8.2 0.00018 33.7 2.3 26 325-350 96-123 (126)
61 cd00055 EGF_Lam Laminin-type e 57.2 13 0.00027 27.6 3.0 19 427-445 13-31 (50)
62 PF00053 Laminin_EGF: Laminin 54.2 7.3 0.00016 28.6 1.3 20 426-445 11-30 (49)
63 PTZ00214 high cysteine membran 51.9 4.2E+02 0.0091 32.2 16.0 13 268-280 750-762 (800)
64 PHA03099 epidermal growth fact 49.3 16 0.00034 32.6 2.8 39 570-612 42-84 (139)
65 PHA02887 EGF-like protein; Pro 48.5 14 0.00031 32.3 2.3 29 462-500 94-122 (126)
66 PF00954 S_locus_glycop: S-loc 48.4 17 0.00036 32.0 2.9 33 187-220 76-108 (110)
67 smart00180 EGF_Lam Laminin-typ 48.0 20 0.00043 26.0 2.7 20 426-445 11-30 (46)
68 KOG3512|consensus 47.4 32 0.00069 37.7 5.2 99 3-107 289-430 (592)
69 PHA03099 epidermal growth fact 46.3 20 0.00042 32.0 2.9 37 247-284 42-82 (139)
70 PF01414 DSL: Delta serrate li 42.0 6.9 0.00015 30.7 -0.5 14 661-674 50-63 (63)
71 PF12955 DUF3844: Domain of un 32.2 25 0.00054 30.5 1.3 32 28-59 7-43 (103)
72 PF12955 DUF3844: Domain of un 28.1 36 0.00078 29.5 1.6 31 571-601 6-42 (103)
73 PF04863 EGF_alliinase: Alliin 27.8 31 0.00067 26.0 1.0 33 320-352 17-53 (56)
74 KOG3516|consensus 27.3 56 0.0012 40.1 3.5 44 241-285 539-583 (1306)
75 KOG3516|consensus 22.5 64 0.0014 39.6 2.8 37 566-603 541-578 (1306)
No 1
>KOG1214|consensus
Probab=99.68 E-value=3.3e-15 Score=163.01 Aligned_cols=236 Identities=27% Similarity=0.596 Sum_probs=161.3
Q ss_pred CCCCCCCeeeccCC-ceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCCCC-CCCCCCCCCeeeecCCceEEe
Q psy9419 195 SPCASSALCVNEKG-GFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDECL-GVSPCASSALCVNEKGGFKCV 272 (739)
Q Consensus 195 ~~C~~~~~C~n~~g-~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~eC~-~~~~C~~~~~C~~~~g~~~C~ 272 (739)
+-|..++.|....+ .|+|.|..||.|++ ..|.|++||+ ..+.|..+++|++.+|+|+|.
T Consensus 700 h~cdt~a~C~pg~~~~~tcecs~g~~gdg-------------------r~c~d~~eca~~~~~CGp~s~Cin~pg~~rce 760 (1289)
T KOG1214|consen 700 HMCDTTARCHPGTGVDYTCECSSGYQGDG-------------------RNCVDENECATGFHRCGPNSVCINLPGSYRCE 760 (1289)
T ss_pred cccCCCccccCCCCcceEEEEeeccCCCC-------------------CCCCChhhhccCCCCCCCCceeecCCCceeEE
Confidence 44666677776543 68999999999976 4578999997 568899999999999999999
Q ss_pred CCCCCCCCCCCccccCCCC--CCCccccCCCCCCCCCCCCCcccCCCCCCCCCCC--cccccC-CCCceeecCCCceeCC
Q psy9419 273 CPKGTTGDPYTLGCVGSGS--PRTECRVDKECSPSLQCRGGACVDPCRSVECGAH--ALCEPQ-DHRASCRCELGYTEGL 347 (739)
Q Consensus 273 C~~Gy~g~~c~~~c~~~~~--~~~~C~~~~~C~~~~~C~~g~C~~~C~~~~C~~~--~~C~~~-~g~~~C~C~~G~~g~~ 347 (739)
|..||.......+|+.+.. ....|++. +..|... +.|+.. .+.|+|.|.|||.|+.
T Consensus 761 C~~gy~F~dd~~tCV~i~~pap~n~Ce~g-------------------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG 821 (1289)
T KOG1214|consen 761 CRSGYEFADDRHTCVLITPPAPANPCEDG-------------------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDG 821 (1289)
T ss_pred EeecceeccCCcceEEecCCCCCCccccC-------------------ccccCcCCceEEEecCCceEEEeecCCccCCc
Confidence 9999987766666765432 22223222 1234433 344443 3479999999999976
Q ss_pred CCc-cccCCCCCCCCCCCeeecCCCCCeeecCCCCccCCCCCCCccCCCCCCCCCCCCCCcccCCCccCCCCCCCCCCCc
Q psy9419 348 NGK-CVSLCEGIVCAPGAACIVTPAGPTCTCADGARGNPFPGGACYPDLCSATQPCPALSVCVAGRCKARCAGVVCGAGA 426 (739)
Q Consensus 348 ~~~-c~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~~~~g~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~ 426 (739)
-.- .+|+|+++.|..+|.|.++++++.|+|.+||.|+.+ .|.+++- ...+|... +-..+.|+..+
T Consensus 822 ~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf---~CVP~~~-~~T~C~~e----------r~hpl~chg~t 887 (1289)
T KOG1214|consen 822 HQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF---QCVPDTS-SLTPCEQE----------RFHPLQCHGST 887 (1289)
T ss_pred cccccccccCccccCCCceEecCCCcceeecccCccCCCc---eecCCCc-cCCccccc----------cccceeecccc
Confidence 332 368999999999999999999999999999999944 7766421 11233221 01122344444
Q ss_pred ee----cCCCCcccCCCCccCCCCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCC
Q psy9419 427 QC----DPALDRCVCPPFYVGDPEFNCVPPVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNS 489 (739)
Q Consensus 427 ~C----~~~~~~C~C~~g~~g~~~~~C~~~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~ 489 (739)
.| ++..+++.+.++=.|++...|..... .. ...|..+|.+..... .+.++.|.|..
T Consensus 888 ~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~-~~-vp~Cd~hgh~ap~qc-----hG~~~~CwCvd 947 (1289)
T KOG1214|consen 888 GFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPE-QY-VPQCDDHGHFAPLQC-----HGKSDFCWCVD 947 (1289)
T ss_pred ceeEeeCCCcccCCCCCCCCCCCCCCCCCccc-cc-CCCcccccccccccc-----CCCcceeEEec
Confidence 33 24567888888777776556654321 11 236778888876652 23558899976
No 2
>KOG1214|consensus
Probab=99.66 E-value=2.3e-15 Score=164.23 Aligned_cols=209 Identities=31% Similarity=0.680 Sum_probs=145.6
Q ss_pred cCCCCC-CCCCCCCeeeeCCCCeeEEecCCCCccCCCCCCcccccccCCCCCCCCCCceeeCCCCceeeCCCCCcCCCC-
Q psy9419 27 ATCGTQ-GQCPGGAECVNIAGGVSYCACPKGFRPKEDGYCEDVDECAESRHLCGPGAVCINHPGSYTCQCPPNSSGDPL- 104 (739)
Q Consensus 27 ~~C~~~-~~C~~~g~C~~~~~g~~~C~C~~Gy~g~~~~~C~dideC~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~~- 104 (739)
++|..+ +.|..++.|...++-.|+|+|..||.|. .++|.|++||++.++.|+++++|+|.+|+|+|.|..||.-...
T Consensus 693 npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gd-gr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~ 771 (1289)
T KOG1214|consen 693 NPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGD-GRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDR 771 (1289)
T ss_pred ccceecCcccCCCccccCCCCcceEEEEeeccCCC-CCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCC
Confidence 355544 6777788898877777999999999986 3489999999999999999999999999999999999885431
Q ss_pred CCCccCCCCCCCCCCCCCCCCCccCccccCCCCccccCCCCCCccCCCCCCCCCCCc--eeccC--CCCeeeCCCCCccC
Q psy9419 105 LGCTHARVQCSRDADCDGPYERCVRAACVCPAPYYADVNDGHKCKSPCERFSCGINA--QCTPA--DPPQCTCLAGYTGE 180 (739)
Q Consensus 105 ~~C~~~~~~C~~~~~C~~~~~~C~~~~C~C~~g~~g~~~~~~~C~~~C~~~~C~~~~--~C~~~--~~~~C~C~~Gy~g~ 180 (739)
..|....+. ..++.|+. ..+.|..++ .|+.. +.|.|+|.|||.|+
T Consensus 772 ~tCV~i~~p-ap~n~Ce~------------------------------g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD 820 (1289)
T KOG1214|consen 772 HTCVLITPP-APANPCED------------------------------GSHTCAIAGQARCVHHGGSTYSCACLPGFSGD 820 (1289)
T ss_pred cceEEecCC-CCCCcccc------------------------------CccccCcCCceEEEecCCceEEEeecCCccCC
Confidence 113222111 11122211 123455444 45543 47999999999999
Q ss_pred CCCCCcccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCCCCCCCCCCCCC
Q psy9419 181 ATLGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDECLGVSPCASSA 260 (739)
Q Consensus 181 ~~~~C~~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~eC~~~~~C~~~~ 260 (739)
... |.|+|||.. +-|...+.|+|++|+|.|+|.+||.|+.++ |+....+.+.|...+ ..+-.|+.+.
T Consensus 821 G~~-c~dvDeC~p-srChp~A~CyntpgsfsC~C~pGy~GDGf~--CVP~~~~~T~C~~er---------~hpl~chg~t 887 (1289)
T KOG1214|consen 821 GHQ-CTDVDECSP-SRCHPAATCYNTPGSFSCRCQPGYYGDGFQ--CVPDTSSLTPCEQER---------FHPLQCHGST 887 (1289)
T ss_pred ccc-cccccccCc-cccCCCceEecCCCcceeecccCccCCCce--ecCCCccCCcccccc---------ccceeecccc
Confidence 874 899999997 999999999999999999999999999854 543222222222110 0022344443
Q ss_pred ee---eecCCceEEeCCCCCCCCC
Q psy9419 261 LC---VNEKGGFKCVCPKGTTGDP 281 (739)
Q Consensus 261 ~C---~~~~g~~~C~C~~Gy~g~~ 281 (739)
.+ ++. ..|++.+.++-.|+.
T Consensus 888 ~~~~~~Dp-~~~e~p~~~~ppG~~ 910 (1289)
T KOG1214|consen 888 GFCWCVDP-DGHEVPGTQTPPGST 910 (1289)
T ss_pred ceeEeeCC-CcccCCCCCCCCCCC
Confidence 22 333 457788777776654
No 3
>KOG1217|consensus
Probab=99.66 E-value=7e-15 Score=166.65 Aligned_cols=276 Identities=33% Similarity=0.738 Sum_probs=202.3
Q ss_pred CCCCceeecCCCCeecCCeeecc-CCCCCC-CCCCCCeeeeCC--CCeeEEecCCCCccCCCCCCccc-ccccCCCCCCC
Q psy9419 5 QCNTLECQCRPPYQIVAGECTLA-TCGTQG-QCPGGAECVNIA--GGVSYCACPKGFRPKEDGYCEDV-DECAESRHLCG 79 (739)
Q Consensus 5 ~~~~~~C~C~~Gy~g~~~~C~~~-~C~~~~-~C~~~g~C~~~~--~g~~~C~C~~Gy~g~~~~~C~di-deC~~~~~~C~ 79 (739)
...+|.|.|++||.+..+ ... +|.... .+...+.|++.. ...|.|.|..||.+. .+... ++|......|.
T Consensus 106 ~~~~~~c~c~~g~~~~~~--~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~---~~~~~~~~C~~~~~~c~ 180 (487)
T KOG1217|consen 106 CVGSYECTCPPGYQGTPC--EGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGE---PCETDLDECIQYSSPCQ 180 (487)
T ss_pred CCCCceeeCCCccccCcC--CcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccc---cccccccccccCCCCcC
Confidence 456889999999998743 222 344421 234456777642 246899999999998 66633 79987667899
Q ss_pred CCCceeeCCCCceeeCCCCCcCCCCCCCccCCCCCCCCCCCCCCCCCccCccccCCCCccccCCCCCCccCCCCCCCCCC
Q psy9419 80 PGAVCINHPGSYTCQCPPNSSGDPLLGCTHARVQCSRDADCDGPYERCVRAACVCPAPYYADVNDGHKCKSPCERFSCGI 159 (739)
Q Consensus 80 ~~~~C~n~~gsy~C~C~~Gy~g~~~~~C~~~~~~C~~~~~C~~~~~~C~~~~C~C~~g~~g~~~~~~~C~~~C~~~~C~~ 159 (739)
+++.|.+..++|.|.|+++|++... ... ..
T Consensus 181 ~~~~C~~~~~~~~C~c~~~~~~~~~----------~~~----------------------------------------~~ 210 (487)
T KOG1217|consen 181 NGGTCVNTGGSYLCSCPPGYTGSTC----------ETT----------------------------------------GN 210 (487)
T ss_pred CCcccccCCCCeeEeCCCCccCCcC----------cCC----------------------------------------CC
Confidence 9999999999999999999999742 110 11
Q ss_pred CceeccCCCCeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCccCCCCCCccccc
Q psy9419 160 NAQCTPADPPQCTCLAGYTGEATLGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRV 239 (739)
Q Consensus 160 ~~~C~~~~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~ 239 (739)
.+.|+.. +.|.+.+||.+..+. .++.++.. . + ++|+++.++|+|+|++||++...
T Consensus 211 ~~~c~~~--~~~~~~~g~~~~~c~--~~~~~~~~-~---~-~~c~~~~~~~~C~~~~g~~~~~~---------------- 265 (487)
T KOG1217|consen 211 GGTCVDS--VACSCPPGARGPECE--VSIVECAS-G---D-GTCVNTVGSYTCRCPEGYTGDAC---------------- 265 (487)
T ss_pred CceEecc--eeccCCCCCCCCCcc--cccccccC-C---C-CcccccCCceeeeCCCCcccccc----------------
Confidence 2233322 578889999987765 66777765 2 4 79999999999999999999751
Q ss_pred CCCCcccCCCCCCCCCCCCCCeeeecCCceEEeCCCCCCCCCCCccccCCCCCCCccccCCCCCCCCCCCCCcccCCCCC
Q psy9419 240 DKVGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKECSPSLQCRGGACVDPCRS 319 (739)
Q Consensus 240 ~~~~C~d~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~c~~~~~~~~~C~~~~~C~~~~~C~~g~C~~~C~~ 319 (739)
..+.++++|.....|.++++|++..+.|.|.|++||+|..+ . .+.+..+|... -..
T Consensus 266 --~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~-~----------~~~~~~~C~~~-----------~~~ 321 (487)
T KOG1217|consen 266 --VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC-T----------ECVDVDECSPR-----------NAG 321 (487)
T ss_pred --ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC-c----------ccccccccccc-----------ccC
Confidence 23578999984324999999999999999999999999875 1 13333333221 112
Q ss_pred CCCCCCccc--ccCCCCceeecCCCceeCCCCccccCCCCCCCCCCCeeec-CCCCCeeecCCCCccC
Q psy9419 320 VECGAHALC--EPQDHRASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIV-TPAGPTCTCADGARGN 384 (739)
Q Consensus 320 ~~C~~~~~C--~~~~g~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~-~~~~~~C~C~~Gy~g~ 384 (739)
.+|.+++.| ......+.|.|..||.|..++.-.++|...++..++.|++ ..++|.|.|+.+|.+.
T Consensus 322 ~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 322 GPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred CcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccC
Confidence 446667777 3334578899999999998876445788888999999999 6899999999999983
No 4
>KOG1217|consensus
Probab=99.64 E-value=1.3e-13 Score=156.31 Aligned_cols=320 Identities=33% Similarity=0.743 Sum_probs=214.5
Q ss_pred CCceeccC-CCCeeeCCCCCccCCCCCCcccCCCCCCCC--CCCCCeeecc---CCceEeeCCCCCcCCCCCCCccCCCC
Q psy9419 159 INAQCTPA-DPPQCTCLAGYTGEATLGCLDVDECLGVSP--CASSALCVNE---KGGFKCVCPKGTTGDPYTLGCVGSGS 232 (739)
Q Consensus 159 ~~~~C~~~-~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~--C~~~~~C~n~---~g~~~C~C~~Gy~g~~~~~~C~~~~~ 232 (739)
..+.+... ..+.|.|++||.|..+.. ..+|.. .+ +...+.|... ...|.|+|..||.+....
T Consensus 99 ~~~~~~~~~~~~~c~c~~g~~~~~~~~---~~~C~~-~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~-------- 166 (487)
T KOG1217|consen 99 LCGECVDCVGSYECTCPPGYQGTPCEG---ECECVT-GPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCE-------- 166 (487)
T ss_pred CCccccCCCCCceeeCCCccccCcCCc---ceeecC-CCCCeeCchhhcCCCCCCCceeeeeCCCccccccc--------
Confidence 34444443 578999999999987642 114544 22 3445577764 358999999999998732
Q ss_pred CCcccccCCCCcccCCCCC-CCCCCCCCCeeeecCCceEEeCCCCCCCCCCCccccCCCCCCCccccCCCCCCCCCCCCC
Q psy9419 233 PRTECRVDKVGCLDVDECL-GVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKECSPSLQCRGG 311 (739)
Q Consensus 233 ~~~~c~~~~~~C~d~~eC~-~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~c~~~~~~~~~C~~~~~C~~~~~C~~g 311 (739)
.+.++|. ...+|.+.+.|.+..++|.|.|+++|.+..++..
T Consensus 167 ------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~-------------------------- 208 (487)
T KOG1217|consen 167 ------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT-------------------------- 208 (487)
T ss_pred ------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC--------------------------
Confidence 2336776 4567999999999999999999999999875421
Q ss_pred cccCCCCCCCCCCCcccccCCCCceeecCCCceeCCCCccccCCCCCCCCCCCeeecCCCCCeeecCCCCccCCCCCCCc
Q psy9419 312 ACVDPCRSVECGAHALCEPQDHRASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIVTPAGPTCTCADGARGNPFPGGAC 391 (739)
Q Consensus 312 ~C~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~~~~g~~C 391 (739)
...+.|+.. +.|.+.+||.+..+...+..+... + ++|++..++|+|.|++||++... ..+
T Consensus 209 -----------~~~~~c~~~---~~~~~~~g~~~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~--~~~ 268 (487)
T KOG1217|consen 209 -----------GNGGTCVDS---VACSCPPGARGPECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC--VTC 268 (487)
T ss_pred -----------CCCceEecc---eeccCCCCCCCCCcccccccccCC---C-CcccccCCceeeeCCCCcccccc--cee
Confidence 122334333 578999999988777655555433 4 89999999999999999999820 011
Q ss_pred -cCCCCCCCCCCCCCCcccCCCccCCCCCCCCCCCceecCCCCcccCCCCccCCCCccccCCCCCCCC-----CCCCCCC
Q psy9419 392 -YPDLCSATQPCPALSVCVAGRCKARCAGVVCGAGAQCDPALDRCVCPPFYVGDPEFNCVPPVTMPVC-----IPPCGPN 465 (739)
Q Consensus 392 -~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~~~C~C~~g~~g~~~~~C~~~~~~~~C-----~~~C~~~ 465 (739)
..++|.....|...+.|++. ...+.|.|++||.|... ........| ..+|.++
T Consensus 269 ~~~~~C~~~~~c~~~~~C~~~------------------~~~~~C~C~~g~~g~~~---~~~~~~~~C~~~~~~~~c~~g 327 (487)
T KOG1217|consen 269 VDVDSCALIASCPNGGTCVNV------------------PGSYRCTCPPGFTGRLC---TECVDVDECSPRNAGGPCANG 327 (487)
T ss_pred eeccccCCCCccCCCCeeecC------------------CCcceeeCCCCCCCCCC---ccccccccccccccCCcCCCC
Confidence 22445433224444444433 23489999999999853 111112334 2357777
Q ss_pred CccccCCCCCCCCCCCCceeecCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCeeec-CCCceEEeCCCCCccCCC
Q psy9419 466 AHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCGTAGPQDR-GSCDSGAGLCGPGAQCLE-TGGSVECQCPAGYKGNPY 543 (739)
Q Consensus 466 ~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~~~~~~~~-~~C~~~~~~C~~~g~C~~-~~g~~~C~C~~Gy~g~~c 543 (739)
++|.... ....+.|.|..||.|. .|+ .. ++|.. ..+..++.|++ ..++|.|.|+.+|.+..
T Consensus 328 ~~C~~~~------~~~~~~C~c~~~~~g~---~C~-----~~~~~C~~--~~~~~~~~c~~~~~~~~~c~~~~~~~~~~- 390 (487)
T KOG1217|consen 328 GTCNTLG------SFGGFRCACGPGFTGR---RCE-----DSNDECAS--SPCCPGGTCVNETPGSYRCACPAGFAGKA- 390 (487)
T ss_pred cccccCC------CCCCCCcCCCCCCCCC---ccc-----cCCccccC--CccccCCEeccCCCCCeEecCCCccccCC-
Confidence 7883222 0146789999999998 887 44 48887 56888999999 68899999999888730
Q ss_pred cccCCCceeeeCCCCcccCCCCCCccCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCCCCCcceeCc
Q psy9419 544 VQCVGGSVECQCPAGYKGNPYVQCVDIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGNPFVACTPVA 612 (739)
Q Consensus 544 ~~C~~g~~~C~C~~G~~g~~~~~C~~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~ 612 (739)
......+.++++|.. .+.|++..+++.|. ++ +...+. .|.+++
T Consensus 391 -----------------~~~~~~~~~~~~c~~------~~~c~~~~~~~~c~-~~-~~~~~~-~~~~~~ 433 (487)
T KOG1217|consen 391 -----------------NGDGVGCEDIDECSG------CGDCVNGPGGGACT-PP-GLVSPG-TCDDID 433 (487)
T ss_pred -----------------ccccccccccccccC------CcceeccCCCCccc-cC-cccCCc-ceeccc
Confidence 011135677888832 56788889999999 88 433333 444443
No 5
>KOG4289|consensus
Probab=99.61 E-value=2e-14 Score=163.75 Aligned_cols=84 Identities=32% Similarity=0.746 Sum_probs=63.9
Q ss_pred cccCCCCccccCCCCCCccCCCCCCCCCCCceeccC-CCCeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCeeecc-CC
Q psy9419 131 ACVCPAPYYADVNDGHKCKSPCERFSCGINAQCTPA-DPPQCTCLAGYTGEATLGCLDVDECLGVSPCASSALCVNE-KG 208 (739)
Q Consensus 131 ~C~C~~g~~g~~~~~~~C~~~C~~~~C~~~~~C~~~-~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~C~~~~~C~n~-~g 208 (739)
+|.||+||+|+.|+.. .+.|.+.||++|++|... ++|+|.|.+||+|..|+.=...-.|.. ..|.++++|++. +|
T Consensus 1223 rCrCPpGFTgd~CeTe--iDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvp-GvC~nggtC~~~~ng 1299 (2531)
T KOG4289|consen 1223 RCRCPPGFTGDYCETE--IDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVP-GVCKNGGTCVNLLNG 1299 (2531)
T ss_pred eEeCCCCCCcccccch--hHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcccc-ceecCCCEEeecCCC
Confidence 4555555555443322 255678899999999986 899999999999999872222345776 889999999986 58
Q ss_pred ceEeeCCCC
Q psy9419 209 GFKCVCPKG 217 (739)
Q Consensus 209 ~~~C~C~~G 217 (739)
.|.|.|+.|
T Consensus 1300 gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1300 GFCCHCPYG 1308 (2531)
T ss_pred ceeccCCCc
Confidence 899999998
No 6
>KOG4289|consensus
Probab=99.57 E-value=2.7e-14 Score=162.70 Aligned_cols=104 Identities=27% Similarity=0.560 Sum_probs=83.3
Q ss_pred CCCCCCCCCCCcccccC----------------------CCCceeecCCCceeCCCCccccCCCCCCCCCCCeeecCCCC
Q psy9419 315 DPCRSVECGAHALCEPQ----------------------DHRASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIVTPAG 372 (739)
Q Consensus 315 ~~C~~~~C~~~~~C~~~----------------------~g~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~~~~~ 372 (739)
+.|...||.+...|+.. .++++|+|++||+|+.|+..+|.|...||.++++|....|+
T Consensus 1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEgg 1259 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGG 1259 (2531)
T ss_pred chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCc
Confidence 44555677777777432 35789999999999999999999999999999999999999
Q ss_pred CeeecCCCCccCCCCCCCccCCCCCCCCCCCCCCcccCCCccCCCCCCCCCCCceecC---CCCcccCCCC
Q psy9419 373 PTCTCADGARGNPFPGGACYPDLCSATQPCPALSVCVAGRCKARCAGVVCGAGAQCDP---ALDRCVCPPF 440 (739)
Q Consensus 373 ~~C~C~~Gy~g~~~~g~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~---~~~~C~C~~g 440 (739)
|+|.|.+||+|. +||.+. ..+.|++|. |.++++|.. +.+.|.|+.|
T Consensus 1260 YtCeCrpg~tGe-----hCEvs~--------~agrCvpGv---------C~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1260 YTCECRPGFTGE-----HCEVSA--------RAGRCVPGV---------CKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred eeEEecCCcccc-----ceeeec--------ccCccccce---------ecCCCEEeecCCCceeccCCCc
Confidence 999999999999 887632 234556553 556777764 4678999998
No 7
>KOG1219|consensus
Probab=99.28 E-value=5.8e-12 Score=148.94 Aligned_cols=109 Identities=30% Similarity=0.820 Sum_probs=101.7
Q ss_pred CCCCCCCCCCCceeccC--CCCeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCc
Q psy9419 150 SPCERFSCGINAQCTPA--DPPQCTCLAGYTGEATLGCLDVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGC 227 (739)
Q Consensus 150 ~~C~~~~C~~~~~C~~~--~~~~C~C~~Gy~g~~~~~C~~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C 227 (739)
++|..+||+++|+|+.. ++|+|.|++-|+|..|+ .++.+|.+ +||..+|+|+...+.|.|.|+.||+|.. |
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE--i~~epC~s-nPC~~GgtCip~~n~f~CnC~~gyTG~~----C 3937 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE--IDLEPCAS-NPCLTGGTCIPFYNGFLCNCPNGYTGKR----C 3937 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccc--cccccccC-CCCCCCCEEEecCCCeeEeCCCCccCce----e
Confidence 68999999999999985 68999999999999998 89999999 9999999999999999999999999988 5
Q ss_pred cCCCCCCcccccCCCCccc-CCCCCCCCCCCCCCeeeecCCceEEeCCCCCCCCCC
Q psy9419 228 VGSGSPRTECRVDKVGCLD-VDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPY 282 (739)
Q Consensus 228 ~~~~~~~~~c~~~~~~C~d-~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c 282 (739)
+ .+ ++||. .++|.++|.|++..|+|+|.|-+||.|..|
T Consensus 3938 e----------------~~Gi~eCs-~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3938 E----------------ARGISECS-KNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred e----------------cccccccc-cccccCCceeeccCCceEeccChhHhcccC
Confidence 4 33 89998 799999999999999999999999999875
No 8
>KOG1219|consensus
Probab=99.20 E-value=2.2e-11 Score=144.26 Aligned_cols=113 Identities=33% Similarity=0.780 Sum_probs=102.4
Q ss_pred CcccC-CCCCCCCCCCCCCeeeecC-CceEEeCCCCCCCCCCCccccCCCCCCCccccCCCCCCCCCCCCCcccCCCCCC
Q psy9419 243 GCLDV-DECLGVSPCASSALCVNEK-GGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKECSPSLQCRGGACVDPCRSV 320 (739)
Q Consensus 243 ~C~d~-~eC~~~~~C~~~~~C~~~~-g~~~C~C~~Gy~g~~c~~~c~~~~~~~~~C~~~~~C~~~~~C~~g~C~~~C~~~ 320 (739)
.|... +.|. .+||+++|+|+..+ |+|+|.|++-|.|..|+.. +.+|.++
T Consensus 3859 gC~l~~d~C~-~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~----------------------------~epC~sn 3909 (4289)
T KOG1219|consen 3859 GCSLLTDPCN-DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID----------------------------LEPCASN 3909 (4289)
T ss_pred cccccccccc-cCcccCCCEecCCCCCceEEeCcccccCcccccc----------------------------cccccCC
Confidence 45444 7787 69999999998765 6799999999999987632 7889999
Q ss_pred CCCCCcccccCCCCceeecCCCceeCCCCcc-ccCCCCCCCCCCCeeecCCCCCeeecCCCCccC
Q psy9419 321 ECGAHALCEPQDHRASCRCELGYTEGLNGKC-VSLCEGIVCAPGAACIVTPAGPTCTCADGARGN 384 (739)
Q Consensus 321 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~c-~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~ 384 (739)
||..+++|+...++|.|.|+.||+|..|+.. +++|+.++|.++|.|++..|+|.|.|.+||.|.
T Consensus 3910 PC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3910 PCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred CCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhcc
Confidence 9999999999999999999999999999986 899999999999999999999999999999998
No 9
>KOG1225|consensus
Probab=99.15 E-value=3.7e-10 Score=123.63 Aligned_cols=192 Identities=27% Similarity=0.663 Sum_probs=113.1
Q ss_pred CCcccCCCCccCCCCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCCCCCCCCCCCCCCCCCCC----C
Q psy9419 432 LDRCVCPPFYVGDPEFNCVPPVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCGTAGPQD----R 507 (739)
Q Consensus 432 ~~~C~C~~g~~g~~~~~C~~~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~~~~~~~----~ 507 (739)
.++|.+.+++++... . ...+...+..++.+.. +.+.+..+|++. .+......+ .
T Consensus 160 ~~~c~~~~~~~~~~~---g----~~~~~~~~~~hg~~~~------------~~~l~~~~~s~~---~~~~~~~~~~~~~~ 217 (525)
T KOG1225|consen 160 NGVCSLKPNPFGAEC---G----QYKCPNDGSGHGRYYF------------GNCLSGISASGE---TCNQLGCNDDCFRT 217 (525)
T ss_pred cccccccCCcccccc---c----eecCCcCCCCCcccee------------cccccccCcchh---hhhcccCCccceec
Confidence 456777777776531 1 1123334556666654 347888888877 443211111 0
Q ss_pred CCCCCCCCCCCCCCeeecCCCceEEeCCCCCccCCCc--ccCCC--------ceeeeCCCCcccCCCCCCccCCCCCCCC
Q psy9419 508 GSCDSGAGLCGPGAQCLETGGSVECQCPAGYKGNPYV--QCVGG--------SVECQCPAGYKGNPYVQCVDIDECWSSN 577 (739)
Q Consensus 508 ~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~--~C~~g--------~~~C~C~~G~~g~~~~~C~~id~C~~~~ 577 (739)
.-+..++ ..|++..-.+.|.|+.+|+|..+. .|..+ ..+|.|++||+|. +|.. -.| +.
T Consensus 218 ~r~~~~~------~~~~~~~~~~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~---dC~e-~~C--p~ 285 (525)
T KOG1225|consen 218 GRCREGR------CFCTAGFFDGICECPEGYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGD---DCDE-LVC--PV 285 (525)
T ss_pred cccccCc------ccccccccCceeecCCceeCCccccccCCCCCcccceEeCCeEeCCCCCcCC---CCCc-ccC--Cc
Confidence 1111111 123333333478888888887654 12211 1377788888886 4432 235 34
Q ss_pred CCCCCCEEeeCCCCeeeecCCCCCCCCCCcceeCccCCCCCCCCCcccCCCCCCCCCCceecCCcccCCCCCCCCCCCCe
Q psy9419 578 TCGSNAVCINTPGSYDCRCKEGNAGNPFVACTPVAVVPHSCEDPATCVCSKNAPCPSGYVCKNSRCTDLCANVRCGPRAL 657 (739)
Q Consensus 578 ~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~C~~~~~C~~~~~c~C~~G~~c~~~~C~~~C~~~~C~~~~~ 657 (739)
+|+.++.+++. +|+|++||+|. .|+...+ +..|+++|.|+ .+.|.|.+||+ +..|+.. .|.+++.
T Consensus 286 ~cs~~g~~~~g----~CiC~~g~~G~---dCs~~~c-padC~g~G~Ci-~G~C~C~~Gy~--G~~C~~~----~C~~~g~ 350 (525)
T KOG1225|consen 286 DCSGGGVCVDG----ECICNPGYSGK---DCSIRRC-PADCSGHGKCI-DGECLCDEGYT--GELCIQR----ACSGGGQ 350 (525)
T ss_pred ccCCCceecCC----EeecCCCcccc---ccccccC-CccCCCCCccc-CCceEeCCCCc--CCccccc----ccCCCce
Confidence 47777777644 68888888887 5654442 46778888887 66788888887 5555544 3777788
Q ss_pred ecCceeeCCCCCccCC
Q psy9419 658 CVQGQCLCPSDLIGNP 673 (739)
Q Consensus 658 C~~~~C~C~~Gy~G~~ 673 (739)
|++. |+|..||.|..
T Consensus 351 cv~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 351 CVNG-CKCKKGWRGPD 365 (525)
T ss_pred eccC-ceeccCccCCC
Confidence 8877 88888888766
No 10
>KOG0994|consensus
Probab=99.09 E-value=2e-09 Score=122.00 Aligned_cols=198 Identities=30% Similarity=0.659 Sum_probs=102.4
Q ss_pred CceeecCCCceeCCCCccccCCCCCCCCCCCeeecCCCCCeeecCCCCccCCCCCCCccCCCCCCCCCCCCCCcccC--C
Q psy9419 334 RASCRCELGYTEGLNGKCVSLCEGIVCAPGAACIVTPAGPTCTCADGARGNPFPGGACYPDLCSATQPCPALSVCVA--G 411 (739)
Q Consensus 334 ~~~C~C~~G~~g~~~~~c~~~C~~~~C~~~~~C~~~~~~~~C~C~~Gy~g~~~~g~~C~~~~C~~~~~C~~~~~C~~--~ 411 (739)
...|.|.+||+|.+|+. |.++|+|+|..|+.|+.-+|+.+..=...+.|.. |
T Consensus 933 ~ivC~C~~GY~G~RCe~--------------------------CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG 986 (1758)
T KOG0994|consen 933 QIVCHCQEGYSGSRCEI--------------------------CADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATG 986 (1758)
T ss_pred ceeeecccCccccchhh--------------------------hcccccCCcccCCccccccccCCcCccCCCccchhhc
Confidence 35799999999987653 5667888888888888777765433333333321 1
Q ss_pred CccCCCCCCCCCCCceecCCCCcc-cCCCCccCCC-CccccCCCCCCCCCCCC-CCCCccccCCCCCCCCCCCCceeecC
Q psy9419 412 RCKARCAGVVCGAGAQCDPALDRC-VCPPFYVGDP-EFNCVPPVTMPVCIPPC-GPNAHCEYNSESPGSSPGSDNICVCN 488 (739)
Q Consensus 412 ~C~~~C~~~~C~~~~~C~~~~~~C-~C~~g~~g~~-~~~C~~~~~~~~C~~~C-~~~~~C~~~~~~~~~~~~~~~~C~C~ 488 (739)
.|.+ |..+ .....| .|.+||.|+. .+.|..- .|..-= .+-++|.. .+.+|.|.
T Consensus 987 ~CLk------CL~h----TeG~hCe~Ck~Gf~GdA~~q~CqrC----~Cn~LGTn~~~~CDr----------~tGQCpCl 1042 (1758)
T KOG0994|consen 987 ACLK------CLYH----TEGDHCEHCKDGFYGDALRQNCQRC----VCNFLGTNSTCHCDR----------FTGQCPCL 1042 (1758)
T ss_pred hhhh------hhhc----ccccchhhccccchhHHHHhhhhhh----eccccccCCcccccc----------ccCcCCCC
Confidence 1110 1000 123345 5889999986 1222210 010000 01134443 45788898
Q ss_pred CCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCC--CCeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCCC
Q psy9419 489 SGTHGNPYAGCGTA--GPQDRGSCDSGAGLCGP--GAQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNPY 564 (739)
Q Consensus 489 ~Gy~g~~~~~C~~~--~~~~~~~C~~~~~~C~~--~g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~~ 564 (739)
|.-.|.....|..+ ....-.-|.+ ..|.+ +-+|..-.| +|+|++||-|..|.+|. .-|.|++.
T Consensus 1043 pNv~G~~CDqCA~N~w~laSG~GCe~--C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C~qCq---------el~WGdP~ 1109 (1758)
T KOG0994|consen 1043 PNVQGVRCDQCAENHWNLASGEGCEP--CNCDPIGGPQCNEFTG--QCQCKPGFGGRTCSQCQ---------ELYWGDPN 1109 (1758)
T ss_pred cccccccccccccchhccccCCCCCc--cCCCccCCcccccccc--ceeccCCCCCcchhHHH---------HhhcCCCC
Confidence 88888822222211 0000111222 22221 224555555 78888888887776653 35666665
Q ss_pred CCCccCCCCCCCCCCCCCC----EEeeCCCCeeeecCCCCCCC
Q psy9419 565 VQCVDIDECWSSNTCGSNA----VCINTPGSYDCRCKEGNAGN 603 (739)
Q Consensus 565 ~~C~~id~C~~~~~C~~~g----~C~~~~g~~~C~C~~G~~g~ 603 (739)
+.|. .-.|...| .|....| .|+|.+|..|.
T Consensus 1110 ~~C~-------aCdCd~rG~~tpQCdr~tG--~C~C~~Gv~G~ 1143 (1758)
T KOG0994|consen 1110 EKCR-------ACDCDPRGIETPQCDRATG--RCVCRPGVGGP 1143 (1758)
T ss_pred CCce-------ecCCCCCCCCCCCccccCC--ceeecCCCCCc
Confidence 5442 11222222 2433333 57777777776
No 11
>KOG1225|consensus
Probab=99.09 E-value=2.2e-09 Score=117.70 Aligned_cols=122 Identities=37% Similarity=0.940 Sum_probs=91.8
Q ss_pred CCceeecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecCCCceEEeCCCCCccCCCcc--c----CCC----c
Q psy9419 481 SDNICVCNSGTHGNPYAGCGTAGPQDRGSCDSGAGLCGPGAQCLETGGSVECQCPAGYKGNPYVQ--C----VGG----S 550 (739)
Q Consensus 481 ~~~~C~C~~Gy~g~~~~~C~~~~~~~~~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~~--C----~~g----~ 550 (739)
..++|.|+.+|+|. .|+ ...|. ..|..++.|++. +|+|++||+|..|+. | ..+ .
T Consensus 232 ~~~ic~c~~~~~g~---~c~------~~~C~---~~c~~~g~c~~G----~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~ 295 (525)
T KOG1225|consen 232 FDGICECPEGYFGP---LCS------TIYCP---GGCTGRGQCVEG----RCICPPGFTGDDCDELVCPVDCSGGGVCVD 295 (525)
T ss_pred cCceeecCCceeCC---ccc------cccCC---CCCcccceEeCC----eEeCCCCCcCCCCCcccCCcccCCCceecC
Confidence 34578899999888 665 23344 456666778765 699999999988763 3 211 2
Q ss_pred eeeeCCCCcccCCCCCCccCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCCCCCcceeCccCCCCCCCCCcccCCCCC
Q psy9419 551 VECQCPAGYKGNPYVQCVDIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGNPFVACTPVAVVPHSCEDPATCVCSKNA 630 (739)
Q Consensus 551 ~~C~C~~G~~g~~~~~C~~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~C~~~~~C~~~~~c 630 (739)
.+|.|++||+|. .|. +..| +..|.++|.|+ .| +|+|.+||+|. .|+.. .|.+++.|+.. |
T Consensus 296 g~CiC~~g~~G~---dCs-~~~c--padC~g~G~Ci--~G--~C~C~~Gy~G~---~C~~~-----~C~~~g~cv~g--C 355 (525)
T KOG1225|consen 296 GECICNPGYSGK---DCS-IRRC--PADCSGHGKCI--DG--ECLCDEGYTGE---LCIQR-----ACSGGGQCVNG--C 355 (525)
T ss_pred CEeecCCCcccc---ccc-cccC--CccCCCCCccc--CC--ceEeCCCCcCC---ccccc-----ccCCCceeccC--c
Confidence 389999999996 553 3446 68999999999 23 79999999999 67653 39999999977 9
Q ss_pred CCCCCcee
Q psy9419 631 PCPSGYVC 638 (739)
Q Consensus 631 ~C~~G~~c 638 (739)
.|..||..
T Consensus 356 ~C~~Gw~G 363 (525)
T KOG1225|consen 356 KCKKGWRG 363 (525)
T ss_pred eeccCccC
Confidence 99999993
No 12
>KOG0994|consensus
Probab=98.98 E-value=1.3e-08 Score=115.65 Aligned_cols=30 Identities=40% Similarity=0.883 Sum_probs=21.7
Q ss_pred eeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccC
Q psy9419 522 QCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGN 562 (739)
Q Consensus 522 ~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~ 562 (739)
.|....| +|+|.+|-.|..|++| ..||.|.
T Consensus 1126 QCdr~tG--~C~C~~Gv~G~rCdqC---------aRgy~G~ 1155 (1758)
T KOG0994|consen 1126 QCDRATG--RCVCRPGVGGPRCDQC---------ARGYSGQ 1155 (1758)
T ss_pred CccccCC--ceeecCCCCCcchhhh---------hhhhcCC
Confidence 3555555 7999999988877754 5678874
No 13
>KOG4260|consensus
Probab=98.85 E-value=3.9e-09 Score=102.66 Aligned_cols=163 Identities=29% Similarity=0.680 Sum_probs=106.7
Q ss_pred EecCCCCccCCCCCCcccccccCCCCCCCCCCceee---CCCCceeeCCCCCcCCCCCCCccCCCCCCCCCCCCCCCCCc
Q psy9419 51 CACPKGFRPKEDGYCEDVDECAESRHLCGPGAVCIN---HPGSYTCQCPPNSSGDPLLGCTHARVQCSRDADCDGPYERC 127 (739)
Q Consensus 51 C~C~~Gy~g~~~~~C~dideC~~~~~~C~~~~~C~n---~~gsy~C~C~~Gy~g~~~~~C~~~~~~C~~~~~C~~~~~~C 127 (739)
=-|++|-.|.+...|... + ..+|..++.|.- ..|+-.|.|.+||+|..+..|..
T Consensus 130 vCCp~gtyGpdCl~Cpgg---s--er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~------------------ 186 (350)
T KOG4260|consen 130 VCCPDGTYGPDCLQCPGG---S--ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGI------------------ 186 (350)
T ss_pred eccCCCCcCCccccCCCC---C--cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccch------------------
Confidence 348888888832233211 1 146887888873 35778999999999987553322
Q ss_pred cCccccCCCCcccc-CCC-CC---CccCCCCCCCCCCCceeccCCCCee-eCCCCCccCCCCCCcccCCCCC-CCCCCCC
Q psy9419 128 VRAACVCPAPYYAD-VND-GH---KCKSPCERFSCGINAQCTPADPPQC-TCLAGYTGEATLGCLDVDECLG-VSPCASS 200 (739)
Q Consensus 128 ~~~~C~C~~g~~g~-~~~-~~---~C~~~C~~~~C~~~~~C~~~~~~~C-~C~~Gy~g~~~~~C~~i~eC~~-~~~C~~~ 200 (739)
+|+-. +.. .. .|..+| .+.|+...+-.| .|..||..+. ..|+|||||.. ..+|..+
T Consensus 187 ---------eyfes~Rne~~lvCt~Ch~~C-------~~~Csg~~~k~C~kCkkGW~lde-~gCvDvnEC~~ep~~c~~~ 249 (350)
T KOG4260|consen 187 ---------EYFESSRNEQHLVCTACHEGC-------LGVCSGESSKGCSKCKKGWKLDE-EGCVDVNECQNEPAPCKAH 249 (350)
T ss_pred ---------HHHHhhcccccchhhhhhhhh-------hcccCCCCCCChhhhcccceecc-cccccHHHHhcCCCCCChh
Confidence 11110 000 00 111122 124443334466 7999999984 46999999986 5889999
Q ss_pred CeeeccCCceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCCCCC-CCCC-CCCCeeeecCCceEEeCCCCCC
Q psy9419 201 ALCVNEKGGFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDECLG-VSPC-ASSALCVNEKGGFKCVCPKGTT 278 (739)
Q Consensus 201 ~~C~n~~g~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~eC~~-~~~C-~~~~~C~~~~g~~~C~C~~Gy~ 278 (739)
..|+|+.|||+|..++||.+. +|+|.. ...| ..+..|.++.++|+|+|..|+.
T Consensus 250 qfCvNteGSf~C~dk~Gy~~g-------------------------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 250 QFCVNTEGSFKCEDKEGYKKG-------------------------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred heeecCCCceEecccccccCC-------------------------hHHhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence 999999999999999999873 233320 1222 1356899999999999999875
No 14
>KOG4260|consensus
Probab=98.56 E-value=1.1e-07 Score=92.61 Aligned_cols=127 Identities=30% Similarity=0.608 Sum_probs=88.6
Q ss_pred CCCcccCCCCccCCCCccccCCCCC-------CC---CCCCCCCCCccccCCCCCCCCCCCCcee-ecCCCCCCCCCCCC
Q psy9419 431 ALDRCVCPPFYVGDPEFNCVPPVTM-------PV---CIPPCGPNAHCEYNSESPGSSPGSDNIC-VCNSGTHGNPYAGC 499 (739)
Q Consensus 431 ~~~~C~C~~g~~g~~~~~C~~~~~~-------~~---C~~~C~~~~~C~~~~~~~~~~~~~~~~C-~C~~Gy~g~~~~~C 499 (739)
+++.|.|.+||.|.....|....+. .. |...|. +.|.... +-.| .|..||..+.. .|
T Consensus 166 GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~Csg~~---------~k~C~kCkkGW~lde~-gC 233 (350)
T KOG4260|consen 166 GSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL--GVCSGES---------SKGCSKCKKGWKLDEE-GC 233 (350)
T ss_pred CCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--cccCCCC---------CCChhhhcccceeccc-cc
Confidence 5789999999999976666543211 11 222332 2444322 2234 78999987732 45
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCCCCCCccCCCCCC-CCC
Q psy9419 500 GTAGPQDRGSCDSGAGLCGPGAQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNPYVQCVDIDECWS-SNT 578 (739)
Q Consensus 500 ~~~~~~~~~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~~~~C~~id~C~~-~~~ 578 (739)
. ||+||...+.+|..+..|+|+.|||.|..++||.+. +|+|.. ...
T Consensus 234 v-----DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g----------------------------~d~C~~~~d~ 280 (350)
T KOG4260|consen 234 V-----DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG----------------------------VDECQFCADV 280 (350)
T ss_pred c-----cHHHHhcCCCCCChhheeecCCCceEecccccccCC----------------------------hHHhhhhhhh
Confidence 5 999999988999999999999999999999888752 333310 122
Q ss_pred C-CCCCEEeeCCCCeeeecCCCCCC
Q psy9419 579 C-GSNAVCINTPGSYDCRCKEGNAG 602 (739)
Q Consensus 579 C-~~~g~C~~~~g~~~C~C~~G~~g 602 (739)
| ..+..|.++.++|+|+|..|+.-
T Consensus 281 ~~~kn~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 281 CASKNRPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred cccCCCCcccCCccEEEEeccccee
Confidence 2 34678999999999999999753
No 15
>KOG1836|consensus
Probab=98.36 E-value=0.00036 Score=87.22 Aligned_cols=216 Identities=29% Similarity=0.611 Sum_probs=113.0
Q ss_pred ecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecC--CCceEEe-CCCCCccCCCcccCCCceeeeCCCCcccC
Q psy9419 486 VCNSGTHGNPYAGCGTAGPQDRGSCDSGAGLCGPGAQCLET--GGSVECQ-CPAGYKGNPYVQCVGGSVECQCPAGYKGN 562 (739)
Q Consensus 486 ~C~~Gy~g~~~~~C~~~~~~~~~~C~~~~~~C~~~g~C~~~--~g~~~C~-C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~ 562 (739)
+|..||+|.+... ....|.. .+|.+++.|... .....|. |++||+|..|+.|. .||.|+
T Consensus 760 ~C~~GfYg~~~~~-------~~~dC~~--C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~---------dgyfg~ 821 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLG-------TSGDCQP--CPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECA---------DGYFGN 821 (1705)
T ss_pred hhcCCCCCccccC-------CCCCCcc--CCCCCChhhcCcCcccceecCCCCCCCcccccccCC---------CccccC
Confidence 5778888885321 1122766 677777777655 3567899 99999999988765 467766
Q ss_pred CCCCCccCCCCC-----------CCCCCCCC-CE---EeeCCCCeee-ecCCCCCCCCCC-----cceeCccC-------
Q psy9419 563 PYVQCVDIDECW-----------SSNTCGSN-AV---CINTPGSYDC-RCKEGNAGNPFV-----ACTPVAVV------- 614 (739)
Q Consensus 563 ~~~~C~~id~C~-----------~~~~C~~~-g~---C~~~~g~~~C-~C~~G~~g~~~~-----~C~~~~~~------- 614 (739)
+...=.|+-.|. ....|... +. |+.......| .|++||.|++.. .|....+.
T Consensus 822 p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~ 901 (1705)
T KOG1836|consen 822 PLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELP 901 (1705)
T ss_pred CCCCCCCcccCccceeccccCccccccccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccc
Confidence 532112222220 11233221 33 3433344456 699999998643 23322211
Q ss_pred CCCCCCC-CcccCCC------CCCCCCCceecCCcccCCCCCCCCCCC----CeecC--ceeeCCCCCccCCCCCC----
Q psy9419 615 PHSCEDP-ATCVCSK------NAPCPSGYVCKNSRCTDLCANVRCGPR----ALCVQ--GQCLCPSDLIGNPTDLT---- 677 (739)
Q Consensus 615 ~~~C~~~-~~C~~~~------~c~C~~G~~c~~~~C~~~C~~~~C~~~----~~C~~--~~C~C~~Gy~G~~c~~~---- 677 (739)
...|... |.|.+.. -=.|..||. +..=...|.+-.|+.. ..|.. ++|.|.+|-+|.+|+..
T Consensus 902 ~~~c~~~tGQcec~~~v~g~~c~~c~~g~f--nl~s~~gC~~c~c~~~gs~~~~c~~~tGqc~c~~gVtgqrc~qc~~~~ 979 (1705)
T KOG1836|consen 902 SLTCNPVTGQCECKPNVEGRDCLYCFKGFF--NLNSGVGCEPCNCDPTGSESSDCDVGTGQCYCRPGVTGQRCDQCETYH 979 (1705)
T ss_pred cccCCCcccceeccCCCCcccccccccccc--ccCCCCCcccccccccccccccccccCCceeeecCccccccCccccCc
Confidence 1123221 2222111 124445554 2110123433344433 25553 79999999999999752
Q ss_pred -----CCCccCCCCCCCC----CCCC-CCceecCCCCCCCCCCCCCCCCCCccccc
Q psy9419 678 -----RGCQVKGQCANDL----ECKP-NEICFQEKGIEPTYPGLHSHTGAPCSSTV 723 (739)
Q Consensus 678 -----~~C~~~~~C~~~~----~C~~-~~~C~~~~g~~~c~~~~~C~~~~~C~~~~ 723 (739)
..|. ..+|+..+ +|.. .+.|.+..++..= -...|+.+......+
T Consensus 980 ~~~~~~gc~-~c~c~~~Gs~~~qc~~~~G~c~c~~~~~g~-~c~~c~~~~~~~~~~ 1033 (1705)
T KOG1836|consen 980 FGFQTEGCG-LCECDPLGSRGFQCDPEDGQCPCRPGFEGR-RCDQCEEGFFGNAQG 1033 (1705)
T ss_pred ccccccCCc-ceecccCCcccceecccCCeeeecCCCCCc-ccccccCCccccccC
Confidence 2232 12354444 5766 7778887776321 123355555444433
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.24 E-value=4.7e-07 Score=65.02 Aligned_cols=36 Identities=47% Similarity=1.082 Sum_probs=33.5
Q ss_pred ccccccCCCCCCCCCCceeeCCCCceeeCCCCCcCC
Q psy9419 67 DVDECAESRHLCGPGAVCINHPGSYTCQCPPNSSGD 102 (739)
Q Consensus 67 dideC~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~ 102 (739)
|||||+..++.|..+++|+|+.|||+|.|++||+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 799999988999989999999999999999999943
No 17
>KOG1836|consensus
Probab=98.03 E-value=0.00016 Score=90.13 Aligned_cols=110 Identities=27% Similarity=0.660 Sum_probs=67.0
Q ss_pred ceecCCCCcccCCCCccCCCCccccCCCCCCC----CC-CCCCC----CCccccCCCCCCCCCCCCceeecCCCCCCCCC
Q psy9419 426 AQCDPALDRCVCPPFYVGDPEFNCVPPVTMPV----CI-PPCGP----NAHCEYNSESPGSSPGSDNICVCNSGTHGNPY 496 (739)
Q Consensus 426 ~~C~~~~~~C~C~~g~~g~~~~~C~~~~~~~~----C~-~~C~~----~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~ 496 (739)
.+|++.+++|.|.+.-.|.....|....+.-. |. -.|.. +..|.. .+.+|.|.+|-+|...
T Consensus 903 ~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~----------~tGqc~c~~gVtgqrc 972 (1705)
T KOG1836|consen 903 LTCNPVTGQCECKPNVEGRDCLYCFKGFFNLNSGVGCEPCNCDPTGSESSDCDV----------GTGQCYCRPGVTGQRC 972 (1705)
T ss_pred ccCCCcccceeccCCCCccccccccccccccCCCCCcccccccccccccccccc----------cCCceeeecCcccccc
Confidence 45667788999998888876555555443222 21 12321 234443 5578999999999944
Q ss_pred CCCCCCC-CCCCCCCCCCCCCCCCCC----eeecCCCceEEeCCCCCccCCCcccCCC
Q psy9419 497 AGCGTAG-PQDRGSCDSGAGLCGPGA----QCLETGGSVECQCPAGYKGNPYVQCVGG 549 (739)
Q Consensus 497 ~~C~~~~-~~~~~~C~~~~~~C~~~g----~C~~~~g~~~C~C~~Gy~g~~c~~C~~g 549 (739)
..|+... -..+..|.. -.|...| +|....| +|.|++++.|..+.+|..+
T Consensus 973 ~qc~~~~~~~~~~gc~~--c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~c~~c~~~ 1026 (1705)
T KOG1836|consen 973 DQCETYHFGFQTEGCGL--CECDPLGSRGFQCDPEDG--QCPCRPGFEGRRCDQCEEG 1026 (1705)
T ss_pred CccccCcccccccCCcc--eecccCCcccceecccCC--eeeecCCCCCcccccccCC
Confidence 4444210 011123333 4455555 6887666 8999999999888776654
No 18
>KOG1226|consensus
Probab=97.89 E-value=9.8e-05 Score=83.01 Aligned_cols=99 Identities=26% Similarity=0.573 Sum_probs=69.4
Q ss_pred CCCCCCceeccCCCCeeeCCCCCc----cCCCCCCcccCCCCC--CCCCCCCCeeeccCCceEeeCCCCCcCCCCCCCcc
Q psy9419 155 FSCGINAQCTPADPPQCTCLAGYT----GEATLGCLDVDECLG--VSPCASSALCVNEKGGFKCVCPKGTTGDPYTLGCV 228 (739)
Q Consensus 155 ~~C~~~~~C~~~~~~~C~C~~Gy~----g~~~~~C~~i~eC~~--~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~~~~~C~ 228 (739)
-+|+++|.|+=. +|+|.+... |..++ | |--.|.. ...|..+|+|.=. +|+|.+||+|..+. |.
T Consensus 514 ~vCSgrG~C~CG---qC~C~~~~~~~i~G~fCE-C-DnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~--C~ 582 (783)
T KOG1226|consen 514 PVCSGRGDCVCG---QCVCHKPDNGKIYGKFCE-C-DNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACN--CP 582 (783)
T ss_pred CCcCCCCcEeCC---ceEecCCCCCceeeeeee-c-cCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCC--CC
Confidence 378888888753 899988877 77765 2 2233443 3568889998754 69999999999843 22
Q ss_pred CCCCCCcccccCCCCcccCCCCC--CCCCCCCCCeeeecCCceEEeCCCC-CCCCCCCc
Q psy9419 229 GSGSPRTECRVDKVGCLDVDECL--GVSPCASSALCVNEKGGFKCVCPKG-TTGDPYTL 284 (739)
Q Consensus 229 ~~~~~~~~c~~~~~~C~d~~eC~--~~~~C~~~~~C~~~~g~~~C~C~~G-y~g~~c~~ 284 (739)
.+.+.|. ....|...|+|.=. +|+|... |.|..|+.
T Consensus 583 ----------------~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~ 621 (783)
T KOG1226|consen 583 ----------------LSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK 621 (783)
T ss_pred ----------------CCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence 4555564 23467778888644 6899877 99988763
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.87 E-value=8.6e-06 Score=58.47 Aligned_cols=35 Identities=37% Similarity=0.884 Sum_probs=32.2
Q ss_pred CCCCCCCCCCCCCCCCeeecCCCceEEeCCCCCcc
Q psy9419 506 DRGSCDSGAGLCGPGAQCLETGGSVECQCPAGYKG 540 (739)
Q Consensus 506 ~~~~C~~~~~~C~~~g~C~~~~g~~~C~C~~Gy~g 540 (739)
||+||....+.|..+++|+|+.|+|+|.|++||+.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 58999998889999999999999999999999983
No 20
>KOG1226|consensus
Probab=97.82 E-value=7.7e-05 Score=83.79 Aligned_cols=149 Identities=26% Similarity=0.688 Sum_probs=100.3
Q ss_pred CCCCCCCeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCCCCCCccCCCCC---CCCCCCCCCEEeeCCCC
Q psy9419 515 GLCGPGAQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNPYVQCVDIDECW---SSNTCGSNAVCINTPGS 591 (739)
Q Consensus 515 ~~C~~~g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~~~~C~~id~C~---~~~~C~~~g~C~~~~g~ 591 (739)
..|+.+|+..-. +|.|.+||.|+ +|+|+..-.... ...+.|. ...+|++.|.|+=.
T Consensus 467 ~~C~g~G~~~CG----~C~C~~G~~G~----------~CEC~~~~~ss~----~~~~~Cr~~~~~~vCSgrG~C~CG--- 525 (783)
T KOG1226|consen 467 ALCHGNGTFVCG----QCRCDEGWLGK----------KCECSTDELSSS----EEEDKCRENSDSPVCSGRGDCVCG--- 525 (783)
T ss_pred cccCCCCcEEec----ceecCCCCCCC----------cccCCccccCcH----hHHhhccCCCCCCCcCCCCcEeCC---
Confidence 556655655543 58999999987 456665544432 1134454 23489999999744
Q ss_pred eeeecCCCCC----CCCCCccee--Ccc---CCCCCCCCCcccCCCCCCCCCCceecCCcc-----cCCCCC---CCCCC
Q psy9419 592 YDCRCKEGNA----GNPFVACTP--VAV---VPHSCEDPATCVCSKNAPCPSGYVCKNSRC-----TDLCAN---VRCGP 654 (739)
Q Consensus 592 ~~C~C~~G~~----g~~~~~C~~--~~~---~~~~C~~~~~C~~~~~c~C~~G~~c~~~~C-----~~~C~~---~~C~~ 654 (739)
+|+|.+... |. .|+- ..+ ....|.++|+|.+. .|.|.+||+ |..| .+.|.+ ..|+.
T Consensus 526 -qC~C~~~~~~~i~G~---fCECDnfsC~r~~g~lC~g~G~C~CG-~CvC~~Gwt--G~~C~C~~std~C~~~~G~iCSG 598 (783)
T KOG1226|consen 526 -QCVCHKPDNGKIYGK---FCECDNFSCERHKGVLCGGHGRCECG-RCVCNPGWT--GSACNCPLSTDTCESSDGQICSG 598 (783)
T ss_pred -ceEecCCCCCceeee---eeeccCcccccccCcccCCCCeEeCC-cEEcCCCCc--cCCCCCCCCCccccCCCCceeCC
Confidence 699998887 44 3442 222 24578888888754 599999999 4444 355653 47999
Q ss_pred CCeecCceeeCCCC-CccCCCCCCCCCccCCCCCCCCCCC
Q psy9419 655 RALCVQGQCLCPSD-LIGNPTDLTRGCQVKGQCANDLECK 693 (739)
Q Consensus 655 ~~~C~~~~C~C~~G-y~G~~c~~~~~C~~~~~C~~~~~C~ 693 (739)
+|+|.=++|+|... |+|..||+-+.|.+. |....+|+
T Consensus 599 rG~C~Cg~C~C~~~~~sG~~CE~cptc~~~--C~~~~~Cv 636 (783)
T KOG1226|consen 599 RGTCECGRCKCTDPPYSGEFCEKCPTCPDP--CAENKSCV 636 (783)
T ss_pred CceeeCCceEcCCCCcCcchhhcCCCCCCc--ccccccch
Confidence 99999999999886 999999986666433 55544443
No 21
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.54 E-value=2.6e-05 Score=72.89 Aligned_cols=143 Identities=29% Similarity=0.687 Sum_probs=87.1
Q ss_pred CCCCCCeeeeCCCCeeEEecCCCCccCCCCCCcccccccC---CCCCCCCCCceeeCC-----CCceeeCCCCCcCCCCC
Q psy9419 34 QCPGGAECVNIAGGVSYCACPKGFRPKEDGYCEDVDECAE---SRHLCGPGAVCINHP-----GSYTCQCPPNSSGDPLL 105 (739)
Q Consensus 34 ~C~~~g~C~~~~~g~~~C~C~~Gy~g~~~~~C~dideC~~---~~~~C~~~~~C~n~~-----gsy~C~C~~Gy~g~~~~ 105 (739)
.|.+ |.-+... +-|.|.|++||.......|+...+|.. ...+|...|+|++.. ..|.|.|.+||....
T Consensus 7 ~CKN-G~LiQMS-NHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~-- 82 (197)
T PF06247_consen 7 ICKN-GYLIQMS-NHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ-- 82 (197)
T ss_dssp --BT-EEEEEES-SEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS--
T ss_pred cccC-CEEEEcc-CceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC--
Confidence 4443 6666655 469999999999987779998888865 235899999999875 459999999999763
Q ss_pred CCccCCCCCCCCCCCCCCCCCccCccccCCCCccccCCCCCCccCCCCCCCCCCCceeccC----CCCeeeCCCCCccCC
Q psy9419 106 GCTHARVQCSRDADCDGPYERCVRAACVCPAPYYADVNDGHKCKSPCERFSCGINAQCTPA----DPPQCTCLAGYTGEA 181 (739)
Q Consensus 106 ~C~~~~~~C~~~~~C~~~~~~C~~~~C~C~~g~~g~~~~~~~C~~~C~~~~C~~~~~C~~~----~~~~C~C~~Gy~g~~ 181 (739)
+.|+. +.|....|+ .|.|+.. ....|+|.-|+..+.
T Consensus 83 -------------------~vCvp--------------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d 122 (197)
T PF06247_consen 83 -------------------GVCVP--------------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD 122 (197)
T ss_dssp -------------------SSEEE--------------------GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTT
T ss_pred -------------------CeEch--------------------hhcCceecC-CCeEEecCCCCCCceeEeeeceEecc
Confidence 11111 112234455 6788742 244899999998433
Q ss_pred CCCCccc--CCCCCCCCCCCCCeeeccCCceEeeCCCCCcCCC
Q psy9419 182 TLGCLDV--DECLGVSPCASSALCVNEKGGFKCVCPKGTTGDP 222 (739)
Q Consensus 182 ~~~C~~i--~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~ 222 (739)
...|... -+|+ -.|..+.+|....+-|+|.+.+||.++.
T Consensus 123 n~kCtk~G~T~C~--LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 123 NKKCTKTGETKCS--LKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp TTESEEEE----------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred CCcccCCCcccee--eecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 3334322 2344 4567788999999999999999998765
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.47 E-value=4.2e-05 Score=52.47 Aligned_cols=33 Identities=48% Similarity=1.010 Sum_probs=26.4
Q ss_pred ccCCCCCCCCCCceeeCCCCceeeCCCCCcCCC
Q psy9419 71 CAESRHLCGPGAVCINHPGSYTCQCPPNSSGDP 103 (739)
Q Consensus 71 C~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~ 103 (739)
|+.+++.|+.+|+|+++.++|.|+|++||+|+.
T Consensus 1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 344567899999999999999999999999986
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.41 E-value=0.00021 Score=50.20 Aligned_cols=35 Identities=51% Similarity=1.085 Sum_probs=29.8
Q ss_pred cCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCC-CC
Q psy9419 569 DIDECWSSNTCGSNAVCINTPGSYDCRCKEGNA-GN 603 (739)
Q Consensus 569 ~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~-g~ 603 (739)
++++|....+|.++++|+++.++|+|.|++||+ |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 467884327899999999999999999999999 65
No 24
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.40 E-value=0.00012 Score=44.89 Aligned_cols=23 Identities=48% Similarity=1.120 Sum_probs=20.0
Q ss_pred eeEEecCCCCccCCCC-CCccccc
Q psy9419 48 VSYCACPKGFRPKEDG-YCEDVDE 70 (739)
Q Consensus 48 ~~~C~C~~Gy~g~~~~-~C~dide 70 (739)
+|+|+|++||+...++ .|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 5899999999987665 8999987
No 25
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.38 E-value=8.7e-05 Score=49.66 Aligned_cols=29 Identities=45% Similarity=1.090 Sum_probs=26.5
Q ss_pred CCCCCCCCCEEeeCC-CCeeeecCCCCCCC
Q psy9419 575 SSNTCGSNAVCINTP-GSYDCRCKEGNAGN 603 (739)
Q Consensus 575 ~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~ 603 (739)
.+++|+++|+|++.. ++|+|+|++||+|.
T Consensus 2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 456999999999998 99999999999996
No 26
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.33 E-value=0.00012 Score=48.94 Aligned_cols=30 Identities=40% Similarity=1.157 Sum_probs=26.8
Q ss_pred CCCCCCCCCCCeeeccC-CceEeeCCCCCcCC
Q psy9419 191 CLGVSPCASSALCVNEK-GGFKCVCPKGTTGD 221 (739)
Q Consensus 191 C~~~~~C~~~~~C~n~~-g~~~C~C~~Gy~g~ 221 (739)
|.+ ++|.++|+|++.. ++|+|+|++||+|.
T Consensus 1 C~~-~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSS-NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTT-TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCC-CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 344 7999999999999 99999999999985
No 27
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.24 E-value=0.00047 Score=48.38 Aligned_cols=37 Identities=43% Similarity=1.041 Sum_probs=30.6
Q ss_pred cCCCCCCCCCCCCCCeeeecCCceEEeCCCCCC-CCCC
Q psy9419 246 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTT-GDPY 282 (739)
Q Consensus 246 d~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~-g~~c 282 (739)
++++|....+|.++++|++..++|.|.|++||. |..|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 467787327899889999999999999999998 6543
No 28
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.16 E-value=0.00018 Score=49.35 Aligned_cols=30 Identities=47% Similarity=1.067 Sum_probs=23.9
Q ss_pred CCCCCCCCCeeecCCCceEEeCCCCCccCC
Q psy9419 513 GAGLCGPGAQCLETGGSVECQCPAGYKGNP 542 (739)
Q Consensus 513 ~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~ 542 (739)
.+..|+.+|+|+++.++|+|+|++||+|++
T Consensus 4 ~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 4 NNGGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 346789999999999999999999999974
No 29
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.02 E-value=0.00063 Score=41.76 Aligned_cols=24 Identities=42% Similarity=0.969 Sum_probs=19.7
Q ss_pred ceEeeCCCCCcCCCCCCCccCCCCCCcccccCCCCcccCCC
Q psy9419 209 GFKCVCPKGTTGDPYTLGCVGSGSPRTECRVDKVGCLDVDE 249 (739)
Q Consensus 209 ~~~C~C~~Gy~g~~~~~~C~~~~~~~~~c~~~~~~C~d~~e 249 (739)
+|+|+|++||+... ++..|+||||
T Consensus 1 sy~C~C~~Gy~l~~-----------------d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP-----------------DGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCC-----------------CCCccccCCC
Confidence 68999999999865 4466789987
No 30
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.84 E-value=0.0016 Score=45.11 Aligned_cols=34 Identities=50% Similarity=1.066 Sum_probs=29.1
Q ss_pred CCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419 570 IDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGN 603 (739)
Q Consensus 570 id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~ 603 (739)
+++|....+|.++++|++..++|+|.|++||.|.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 5677322689889999999999999999999996
No 31
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.61 E-value=0.0031 Score=43.62 Aligned_cols=36 Identities=42% Similarity=1.035 Sum_probs=29.7
Q ss_pred CCCCCCCCCCCCCCeeeecCCceEEeCCCCCCCCCC
Q psy9419 247 VDECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPY 282 (739)
Q Consensus 247 ~~eC~~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c 282 (739)
+++|....+|.+++.|++..++|.|.|++||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 567762268988899999999999999999998654
No 32
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.31 E-value=0.0025 Score=59.94 Aligned_cols=95 Identities=23% Similarity=0.609 Sum_probs=65.7
Q ss_pred CCCCCceeecCCCCeecCCeeeccCCCCCCCCCCCCeeeeCCCC--eeEEecCCCCccCCCCCCc--ccccccCCCCCCC
Q psy9419 4 NQCNTLECQCRPPYQIVAGECTLATCGTQGQCPGGAECVNIAGG--VSYCACPKGFRPKEDGYCE--DVDECAESRHLCG 79 (739)
Q Consensus 4 ~~~~~~~C~C~~Gy~g~~~~C~~~~C~~~~~C~~~g~C~~~~~g--~~~C~C~~Gy~g~~~~~C~--dideC~~~~~~C~ 79 (739)
++...|+|.|.+||......|..+.|.. ..|. .|.|+-.+.. ...|+|.-|+...+...|. .-.+|++ .|.
T Consensus 65 ~~~~~~~C~C~~gY~~~~~vCvp~~C~~-~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~L---KCk 139 (197)
T PF06247_consen 65 GEERAYKCDCINGYILKQGVCVPNKCNN-KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSL---KCK 139 (197)
T ss_dssp TSSTSEEEEE-TTEEESSSSEEEGGGSS----T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE-----------T
T ss_pred ccceeEEEecccCceeeCCeEchhhcCc-eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceee---ecC
Confidence 5678899999999999999999999987 7897 5999853321 3589999999955445786 2346765 688
Q ss_pred CCCceeeCCCCceeeCCCCCcCCC
Q psy9419 80 PGAVCINHPGSYTCQCPPNSSGDP 103 (739)
Q Consensus 80 ~~~~C~n~~gsy~C~C~~Gy~g~~ 103 (739)
.+.+|..+.+-|+|++.+||.++.
T Consensus 140 ~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 140 ENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp TTEEEEEETTEEEEEE-TT-EEET
T ss_pred CCcceeeeCcEEEeecCCCCCCCC
Confidence 888999999999999999998764
No 33
>KOG1218|consensus
Probab=96.01 E-value=1.1 Score=47.70 Aligned_cols=84 Identities=29% Similarity=0.704 Sum_probs=48.9
Q ss_pred cccCCCCccCCCCccccC-CCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9419 434 RCVCPPFYVGDPEFNCVP-PVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCGTAGPQDRGSCDS 512 (739)
Q Consensus 434 ~C~C~~g~~g~~~~~C~~-~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~~~~~~~~~~C~~ 512 (739)
.|.+..+|.+. .|.. ......|...|.....+.. ....|.|.+||.|. .+.. ....|..
T Consensus 125 ~c~~~~~~~~~---~C~~~~~~g~~C~~~c~~~~~~~~----------~~~~c~c~~g~~g~---~~~~----~~~~c~~ 184 (316)
T KOG1218|consen 125 ECRCGGGYIGE---QCGEENLVGLKCQRDCQCTGGCDC----------KNGICTCQPGFVGV---FCVE----SCSGCSP 184 (316)
T ss_pred ceecCCcCccc---cccccCCCCCCccCCCCCccccCC----------CCCceeccCCcccc---cccc----cCCCcCC
Confidence 46667777766 3443 3344556655532222222 33579999999999 6651 1111443
Q ss_pred CCCCCCCCCeeecCCCceEEeCCCCCcc
Q psy9419 513 GAGLCGPGAQCLETGGSVECQCPAGYKG 540 (739)
Q Consensus 513 ~~~~C~~~g~C~~~~g~~~C~C~~Gy~g 540 (739)
...|.+++.|+...+ .+.+.+++.+
T Consensus 185 -~~~~~~g~~C~~~~~--~~~~~~~~~~ 209 (316)
T KOG1218|consen 185 -LTACENGAKCNRSTG--SCLCYPGPSG 209 (316)
T ss_pred -CcccCCCCeeecccc--ccccCCCCcc
Confidence 255667778888766 5666666543
No 34
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.91 E-value=0.0035 Score=42.94 Aligned_cols=28 Identities=39% Similarity=0.993 Sum_probs=21.9
Q ss_pred CCCCCCCCCceeeCCCCceeeCCCCCcCCC
Q psy9419 74 SRHLCGPGAVCINHPGSYTCQCPPNSSGDP 103 (739)
Q Consensus 74 ~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~ 103 (739)
++..|+ .+|++++++|+|.|++||++..
T Consensus 4 ~NGgC~--h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 4 NNGGCS--HICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp GGGGSS--SEEEEETTSEEEE-STTEEE-T
T ss_pred CCCCcC--CCCccCCCceEeECCCCCEECc
Confidence 345676 5899999999999999999874
No 35
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.89 E-value=0.012 Score=39.96 Aligned_cols=28 Identities=50% Similarity=1.178 Sum_probs=25.7
Q ss_pred CCCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419 576 SNTCGSNAVCINTPGSYDCRCKEGNAGN 603 (739)
Q Consensus 576 ~~~C~~~g~C~~~~g~~~C~C~~G~~g~ 603 (739)
..+|.++++|+++.+.|+|.|++||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5678889999999999999999999987
No 36
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.79 E-value=0.011 Score=40.10 Aligned_cols=28 Identities=50% Similarity=1.271 Sum_probs=25.5
Q ss_pred CCCCCCCCceeeCCCCceeeCCCCCcCC
Q psy9419 75 RHLCGPGAVCINHPGSYTCQCPPNSSGD 102 (739)
Q Consensus 75 ~~~C~~~~~C~n~~gsy~C~C~~Gy~g~ 102 (739)
..+|.++++|+++.++|+|.|++||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 3689888999999999999999999986
No 37
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.79 E-value=0.014 Score=39.72 Aligned_cols=26 Identities=50% Similarity=1.229 Sum_probs=23.6
Q ss_pred CCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419 577 NTCGSNAVCINTPGSYDCRCKEGNAGN 603 (739)
Q Consensus 577 ~~C~~~g~C~~~~g~~~C~C~~G~~g~ 603 (739)
.+|.++ +|+++.++|+|.|++||+|.
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccC
Confidence 578887 99999999999999999993
No 38
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.44 E-value=0.016 Score=39.39 Aligned_cols=26 Identities=62% Similarity=1.399 Sum_probs=23.6
Q ss_pred CCCCCCCceeeCCCCceeeCCCCCcCC
Q psy9419 76 HLCGPGAVCINHPGSYTCQCPPNSSGD 102 (739)
Q Consensus 76 ~~C~~~~~C~n~~gsy~C~C~~Gy~g~ 102 (739)
.+|.++ +|+++.++|+|.|++||+|.
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccC
Confidence 579888 99999999999999999983
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.31 E-value=0.016 Score=38.58 Aligned_cols=25 Identities=32% Similarity=0.752 Sum_probs=21.6
Q ss_pred CCCCCCCeec--CceeeCCCCCccCCC
Q psy9419 650 VRCGPRALCV--QGQCLCPSDLIGNPT 674 (739)
Q Consensus 650 ~~C~~~~~C~--~~~C~C~~Gy~G~~c 674 (739)
.+|+++++|+ .++|+|++||+|..|
T Consensus 6 ~~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 3689999999 589999999999765
No 40
>KOG1218|consensus
Probab=95.21 E-value=0.9 Score=48.30 Aligned_cols=49 Identities=22% Similarity=0.509 Sum_probs=27.7
Q ss_pred CCCCcccCCCCccCCCCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCceeecCCCCCCC
Q psy9419 430 PALDRCVCPPFYVGDPEFNCVPPVTMPVCIPPCGPNAHCEYNSESPGSSPGSDNICVCNSGTHGN 494 (739)
Q Consensus 430 ~~~~~C~C~~g~~g~~~~~C~~~~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~ 494 (739)
.....|.|.++|+|. ..+........+...+.. .. ....|.+..+|.+.
T Consensus 12 ~~~~~c~c~~~~~g~--~~~~~~~~~~~~~~~~~~------~~--------~~~~~~~~~~~~~~ 60 (316)
T KOG1218|consen 12 GGSGQCFCDPGYTGR--LQCEHQAVTSACSGICPC------EV--------NSGECGLGYGFVGS 60 (316)
T ss_pred CCCCceecCCCcccc--ccccCCCCCccccccCCc------cC--------CceeEecccccCCC
Confidence 457789999999996 122211111112222211 11 45678899999888
No 41
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.91 E-value=0.032 Score=37.18 Aligned_cols=25 Identities=32% Similarity=0.883 Sum_probs=21.6
Q ss_pred CCCCCCCeeeeCCCCeeEEecCCCCccC
Q psy9419 33 GQCPGGAECVNIAGGVSYCACPKGFRPK 60 (739)
Q Consensus 33 ~~C~~~g~C~~~~~g~~~C~C~~Gy~g~ 60 (739)
..|+++|+|+...+ +|+|++||+|.
T Consensus 6 ~~C~~~G~C~~~~g---~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSPCG---RCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCCCC---EEECCCCCcCC
Confidence 47999999998643 89999999997
No 42
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.04 E-value=0.021 Score=29.69 Aligned_cols=13 Identities=31% Similarity=0.833 Sum_probs=9.9
Q ss_pred eeeCCCCCccCCC
Q psy9419 662 QCLCPSDLIGNPT 674 (739)
Q Consensus 662 ~C~C~~Gy~G~~c 674 (739)
+|+|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5889999999765
No 43
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.82 E-value=0.078 Score=36.36 Aligned_cols=24 Identities=33% Similarity=0.826 Sum_probs=19.3
Q ss_pred CeeeecCCceEEeCCCCCCCCCCC
Q psy9419 260 ALCVNEKGGFKCVCPKGTTGDPYT 283 (739)
Q Consensus 260 ~~C~~~~g~~~C~C~~Gy~g~~c~ 283 (739)
.+|++.+++|+|.|++||+.....
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~D~ 33 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAEDG 33 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-TTS
T ss_pred CCCccCCCceEeECCCCCEECcCC
Confidence 489999999999999999987644
No 44
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.63 E-value=0.16 Score=51.27 Aligned_cols=38 Identities=39% Similarity=0.800 Sum_probs=34.0
Q ss_pred CCcccccccCCCCCCCCCCceeeCCCCceeeCCCCCcCCC
Q psy9419 64 YCEDVDECAESRHLCGPGAVCINHPGSYTCQCPPNSSGDP 103 (739)
Q Consensus 64 ~C~dideC~~~~~~C~~~~~C~n~~gsy~C~C~~Gy~g~~ 103 (739)
.|++++||...++.|. ..|+++.|+|.|.|++||++..
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence 7999999998778897 4799999999999999999863
No 45
>smart00051 DSL delta serrate ligand.
Probab=91.25 E-value=0.22 Score=39.05 Aligned_cols=21 Identities=19% Similarity=0.308 Sum_probs=12.8
Q ss_pred CCCeecC-ceeeCCCCCccCCC
Q psy9419 654 PRALCVQ-GQCLCPSDLIGNPT 674 (739)
Q Consensus 654 ~~~~C~~-~~C~C~~Gy~G~~c 674 (739)
.+.+|.. +.++|.+||+|..|
T Consensus 42 ~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 42 GHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred CCccCCcCCCEecCCCCcCCCC
Confidence 3444443 56777777777654
No 46
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=87.01 E-value=1.2 Score=33.39 Aligned_cols=43 Identities=37% Similarity=0.947 Sum_probs=34.4
Q ss_pred cCCCCeecCCeee-----ccCCCCCCCCCCCCeeeeCCCCeeEEecCCCCccC
Q psy9419 13 CRPPYQIVAGECT-----LATCGTQGQCPGGAECVNIAGGVSYCACPKGFRPK 60 (739)
Q Consensus 13 C~~Gy~g~~~~C~-----~~~C~~~~~C~~~g~C~~~~~g~~~C~C~~Gy~g~ 60 (739)
|++||......|. ...|.....|..++.|++ | +|.|++||...
T Consensus 1 C~~~~~~~~~~C~~~~~~g~~C~~~~qC~~~s~C~~---g--~C~C~~g~~~~ 48 (52)
T PF01683_consen 1 CPSGQVAINGQCVPRVQPGESCESDEQCIGGSVCVN---G--RCQCPPGYVEV 48 (52)
T ss_pred CCCCCEEECCEECccCCCCCCCCCcCCCCCcCEEcC---C--EeECCCCCEec
Confidence 6788888777786 456887788988999976 3 79999999864
No 47
>smart00051 DSL delta serrate ligand.
Probab=84.73 E-value=1.1 Score=35.05 Aligned_cols=43 Identities=14% Similarity=0.221 Sum_probs=35.8
Q ss_pred ceeeCCCCCccCCCCCCCCCccCCCCCCCCCCCCCCceecCCCCC
Q psy9419 661 GQCLCPSDLIGNPTDLTRGCQVKGQCANDLECKPNEICFQEKGIE 705 (739)
Q Consensus 661 ~~C~C~~Gy~G~~c~~~~~C~~~~~C~~~~~C~~~~~C~~~~g~~ 705 (739)
+.=.|+++|.|..|+ ..|..++....+..|...+.+.|.+||.
T Consensus 17 ~rv~C~~~~yG~~C~--~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~ 59 (63)
T smart00051 17 IRVTCDENYYGEGCN--KFCRPRDDFFGHYTCDENGNKGCLEGWM 59 (63)
T ss_pred EEeeCCCCCcCCccC--CEeCcCccccCCccCCcCCCEecCCCCc
Confidence 344688999999997 5687777778889999888899999995
No 48
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=82.55 E-value=1.9 Score=32.18 Aligned_cols=22 Identities=32% Similarity=0.945 Sum_probs=18.1
Q ss_pred CCCCCCCeecCceeeCCCCCcc
Q psy9419 650 VRCGPRALCVQGQCLCPSDLIG 671 (739)
Q Consensus 650 ~~C~~~~~C~~~~C~C~~Gy~G 671 (739)
..|..++.|++.+|+|++||+-
T Consensus 26 ~qC~~~s~C~~g~C~C~~g~~~ 47 (52)
T PF01683_consen 26 EQCIGGSVCVNGRCQCPPGYVE 47 (52)
T ss_pred CCCCCcCEEcCCEeECCCCCEe
Confidence 3577888998999999999864
No 49
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=82.51 E-value=1.1 Score=30.71 Aligned_cols=31 Identities=29% Similarity=0.787 Sum_probs=22.6
Q ss_pred CCCCCCCCCCCeeeeCCCCeeEEecCCCCccC
Q psy9419 29 CGTQGQCPGGAECVNIAGGVSYCACPKGFRPK 60 (739)
Q Consensus 29 C~~~~~C~~~g~C~~~~~g~~~C~C~~Gy~g~ 60 (739)
|.. ..|+.|+.|++...|++.|.|..||...
T Consensus 2 C~~-~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 2 CID-TKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp -SS-S---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred ccC-ccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 444 6788899999988789999999999875
No 50
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=82.20 E-value=66 Score=38.79 Aligned_cols=15 Identities=20% Similarity=0.401 Sum_probs=11.2
Q ss_pred eEEeCCCCCCCCCCC
Q psy9419 269 FKCVCPKGTTGDPYT 283 (739)
Q Consensus 269 ~~C~C~~Gy~g~~c~ 283 (739)
-+|+|..||..+...
T Consensus 682 ~~C~C~~g~~p~~~~ 696 (800)
T PTZ00214 682 RRCWCERGFLPALDR 696 (800)
T ss_pred ceeEecCCcccccCC
Confidence 479999999865543
No 51
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=80.61 E-value=2.1 Score=31.81 Aligned_cols=31 Identities=35% Similarity=0.844 Sum_probs=22.8
Q ss_pred eeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccCC
Q psy9419 522 QCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGNP 563 (739)
Q Consensus 522 ~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~~ 563 (739)
.|....| +|.|+++|+|..+++| .+||++.+
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~~C---------~~g~~~~~ 43 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCDRC---------APGYYGLP 43 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCCCC---------CCCCccCC
Confidence 3655444 8999999999988764 56777754
No 52
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=80.46 E-value=1.6 Score=43.92 Aligned_cols=37 Identities=24% Similarity=0.637 Sum_probs=30.5
Q ss_pred CCcccCCCCC-CCCCCCCCeeeccCCceEeeCCCCCcCCC
Q psy9419 184 GCLDVDECLG-VSPCASSALCVNEKGGFKCVCPKGTTGDP 222 (739)
Q Consensus 184 ~C~~i~eC~~-~~~C~~~~~C~n~~g~~~C~C~~Gy~g~~ 222 (739)
.|.++++|.. .++|. ..|.++.|+|.|.|++||++..
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence 4788899975 35675 4899999999999999999754
No 53
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=80.31 E-value=1.2 Score=32.94 Aligned_cols=31 Identities=35% Similarity=0.757 Sum_probs=22.4
Q ss_pred CeeecCCCceEEeCCCCCccCCCcccCCCceeeeCCCCcccC
Q psy9419 521 AQCLETGGSVECQCPAGYKGNPYVQCVGGSVECQCPAGYKGN 562 (739)
Q Consensus 521 g~C~~~~g~~~C~C~~Gy~g~~c~~C~~g~~~C~C~~G~~g~ 562 (739)
.+|....| +|.|+++|+|..|++|. +||++.
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~C~---------~g~~~~ 41 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQCK---------PGYFGL 41 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-EE----------TTEECS
T ss_pred CcccCCCC--EEeccccccCCcCcCCC---------Cccccc
Confidence 46777554 89999999999988654 467765
No 54
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=79.22 E-value=1.1 Score=30.71 Aligned_cols=28 Identities=36% Similarity=0.694 Sum_probs=20.6
Q ss_pred CCCCCCCCEEeeCC-CCeeeecCCCCCCC
Q psy9419 576 SNTCGSNAVCINTP-GSYDCRCKEGNAGN 603 (739)
Q Consensus 576 ~~~C~~~g~C~~~~-g~~~C~C~~G~~g~ 603 (739)
...|..|+.|++.. |+++|+|.+||..+
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 35677899999875 99999999999865
No 55
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=73.89 E-value=3.9 Score=29.78 Aligned_cols=25 Identities=24% Similarity=0.695 Sum_probs=18.4
Q ss_pred eeecCCCceEEeCCCCCccCCCcccCC
Q psy9419 522 QCLETGGSVECQCPAGYKGNPYVQCVG 548 (739)
Q Consensus 522 ~C~~~~g~~~C~C~~Gy~g~~c~~C~~ 548 (739)
.|....| +|.|+++|+|..+++|.+
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C~~ 36 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDRCAP 36 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCcCCC
Confidence 4555444 899999999988876543
No 56
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=70.13 E-value=4.6 Score=27.05 Aligned_cols=23 Identities=30% Similarity=0.715 Sum_probs=16.7
Q ss_pred CeeeeCCCCeeEEecCCCCccCCCC
Q psy9419 39 AECVNIAGGVSYCACPKGFRPKEDG 63 (739)
Q Consensus 39 g~C~~~~~g~~~C~C~~Gy~g~~~~ 63 (739)
+.|..... +.|.|++||..+...
T Consensus 10 A~CDpn~~--~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 10 ADCDPNSP--GQCFCPEGYILDEGS 32 (34)
T ss_pred CccCCCCC--CceeCCCceEecCCc
Confidence 46776543 389999999987543
No 57
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=68.57 E-value=5.4 Score=35.12 Aligned_cols=34 Identities=29% Similarity=0.615 Sum_probs=26.8
Q ss_pred cCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCCCC
Q psy9419 569 DIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGN 603 (739)
Q Consensus 569 ~id~C~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~ 603 (739)
..|.|.....|+.+|.|.. ..+..|.|++||+..
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 3567866789999999964 456689999999864
No 58
>KOG3512|consensus
Probab=64.83 E-value=14 Score=40.37 Aligned_cols=28 Identities=25% Similarity=0.545 Sum_probs=21.8
Q ss_pred CCCceecCCCCcccCCCCccCCCCcccc
Q psy9419 423 GAGAQCDPALDRCVCPPFYVGDPEFNCV 450 (739)
Q Consensus 423 ~~~~~C~~~~~~C~C~~g~~g~~~~~C~ 450 (739)
..+.+|+..+++|.|.+|.+|.....|.
T Consensus 404 s~gktCNq~tGqCpCkeGvtG~tCnrCa 431 (592)
T KOG3512|consen 404 SAGKTCNQTTGQCPCKEGVTGLTCNRCA 431 (592)
T ss_pred cccccccccCCcccCCCCCccccccccc
Confidence 4567788889999999999998544444
No 59
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=62.18 E-value=30 Score=38.17 Aligned_cols=52 Identities=25% Similarity=0.620 Sum_probs=30.8
Q ss_pred ecCCCCccCCCC-CCcccccccCCCCCCCCCCceeeCCCCcee-eCCCCCcCCCCCCCcc
Q psy9419 52 ACPKGFRPKEDG-YCEDVDECAESRHLCGPGAVCINHPGSYTC-QCPPNSSGDPLLGCTH 109 (739)
Q Consensus 52 ~C~~Gy~g~~~~-~C~dideC~~~~~~C~~~~~C~n~~gsy~C-~C~~Gy~g~~~~~C~~ 109 (739)
+|.+||....+. .|....+|.. ..|. +|.+... -.| .|..+|.+.+.+.|..
T Consensus 3 ~C~~gy~~~~~~t~C~~~~~C~~--~~C~---~Cs~~~~-~~Ct~C~~~~~lt~t~~Ci~ 56 (397)
T PF03302_consen 3 ECTSGYKLSTDKTSCVSASECKT--PNCK---TCSNDKK-EVCTECNSGYYLTPTNQCIE 56 (397)
T ss_pred cccCCceECCCCCcccccCCCCC--CCCc---cccCCCC-CccCcCCCCCcCCCCCcccc
Confidence 477788876553 6776666765 3453 4554433 245 5888887765443443
No 60
>PHA02887 EGF-like protein; Provisional
Probab=57.60 E-value=8.2 Score=33.75 Aligned_cols=26 Identities=23% Similarity=0.354 Sum_probs=19.4
Q ss_pred CcccccC--CCCceeecCCCceeCCCCc
Q psy9419 325 HALCEPQ--DHRASCRCELGYTEGLNGK 350 (739)
Q Consensus 325 ~~~C~~~--~g~~~C~C~~G~~g~~~~~ 350 (739)
||+|... ...+.|.|+.||+|.+|++
T Consensus 96 HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 96 NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred CCEEEccccCCCceeECCCCcccCCCCc
Confidence 4566433 3467999999999998875
No 61
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=57.23 E-value=13 Score=27.55 Aligned_cols=19 Identities=42% Similarity=0.978 Sum_probs=16.0
Q ss_pred eecCCCCcccCCCCccCCC
Q psy9419 427 QCDPALDRCVCPPFYVGDP 445 (739)
Q Consensus 427 ~C~~~~~~C~C~~g~~g~~ 445 (739)
.|+..+++|.|+++|+|..
T Consensus 13 ~C~~~~G~C~C~~~~~G~~ 31 (50)
T cd00055 13 QCDPGTGQCECKPNTTGRR 31 (50)
T ss_pred cccCCCCEEeCCCcCCCCC
Confidence 4667789999999999974
No 62
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=54.23 E-value=7.3 Score=28.62 Aligned_cols=20 Identities=40% Similarity=0.938 Sum_probs=17.4
Q ss_pred ceecCCCCcccCCCCccCCC
Q psy9419 426 AQCDPALDRCVCPPFYVGDP 445 (739)
Q Consensus 426 ~~C~~~~~~C~C~~g~~g~~ 445 (739)
.+|++.+++|+|+++|+|..
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~ 30 (49)
T PF00053_consen 11 QTCDPSTGQCVCKPGTTGPR 30 (49)
T ss_dssp SSEEETCEEESBSTTEESTT
T ss_pred CcccCCCCEEeccccccCCc
Confidence 47778899999999999984
No 63
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=51.92 E-value=4.2e+02 Score=32.18 Aligned_cols=13 Identities=15% Similarity=0.268 Sum_probs=9.9
Q ss_pred ceEEeCCCCCCCC
Q psy9419 268 GFKCVCPKGTTGD 280 (739)
Q Consensus 268 ~~~C~C~~Gy~g~ 280 (739)
...|+|..||...
T Consensus 750 ~~vC~C~~g~~l~ 762 (800)
T PTZ00214 750 QGVCMCELDAVLT 762 (800)
T ss_pred CCeEEeCCcceec
Confidence 3489999999754
No 64
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=49.33 E-value=16 Score=32.58 Aligned_cols=39 Identities=31% Similarity=0.646 Sum_probs=28.5
Q ss_pred CCCCC--CCCCCCCCCEEeeC--CCCeeeecCCCCCCCCCCcceeCc
Q psy9419 570 IDECW--SSNTCGSNAVCINT--PGSYDCRCKEGNAGNPFVACTPVA 612 (739)
Q Consensus 570 id~C~--~~~~C~~~g~C~~~--~g~~~C~C~~G~~g~~~~~C~~~~ 612 (739)
+.+|. ..+-|.+ |+|.-. ...+.|+|..||+|. +|+..+
T Consensus 42 i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh~d 84 (139)
T PHA03099 42 IRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQHVV 84 (139)
T ss_pred cccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc---ccccee
Confidence 34453 3567875 489854 478899999999999 787654
No 65
>PHA02887 EGF-like protein; Provisional
Probab=48.53 E-value=14 Score=32.27 Aligned_cols=29 Identities=31% Similarity=0.773 Sum_probs=22.8
Q ss_pred CCCCCccccCCCCCCCCCCCCceeecCCCCCCCCCCCCC
Q psy9419 462 CGPNAHCEYNSESPGSSPGSDNICVCNSGTHGNPYAGCG 500 (739)
Q Consensus 462 C~~~~~C~~~~~~~~~~~~~~~~C~C~~Gy~g~~~~~C~ 500 (739)
|. ||+|....+ ...+.|.|++||+|. +|+
T Consensus 94 Ci-HG~C~yI~d------L~epsCrC~~GYtG~---RCE 122 (126)
T PHA02887 94 CI-NGECMNIID------LDEKFCICNKGYTGI---RCD 122 (126)
T ss_pred ee-CCEEEcccc------CCCceeECCCCcccC---CCC
Confidence 44 678877663 256899999999999 887
No 66
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=48.37 E-value=17 Score=31.97 Aligned_cols=33 Identities=24% Similarity=0.539 Sum_probs=25.9
Q ss_pred ccCCCCCCCCCCCCCeeeccCCceEeeCCCCCcC
Q psy9419 187 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTTG 220 (739)
Q Consensus 187 ~i~eC~~~~~C~~~~~C~n~~g~~~C~C~~Gy~g 220 (739)
..+.|.....|+..+.|.. ..+..|.|.+||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence 3567887688999999954 44567999999974
No 67
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=47.98 E-value=20 Score=26.02 Aligned_cols=20 Identities=35% Similarity=0.893 Sum_probs=16.0
Q ss_pred ceecCCCCcccCCCCccCCC
Q psy9419 426 AQCDPALDRCVCPPFYVGDP 445 (739)
Q Consensus 426 ~~C~~~~~~C~C~~g~~g~~ 445 (739)
..|++.+++|.|+++|+|..
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~ 30 (46)
T smart00180 11 GTCDPDTGQCECKPNVTGRR 30 (46)
T ss_pred CcccCCCCEEECCCCCCCCC
Confidence 35666788999999999973
No 68
>KOG3512|consensus
Probab=47.36 E-value=32 Score=37.65 Aligned_cols=99 Identities=23% Similarity=0.569 Sum_probs=56.5
Q ss_pred CCCCCCceeecCCCCeecCC-eee---------------ccCCCCCCCCCC-------------------CCeeee---C
Q psy9419 3 NNQCNTLECQCRPPYQIVAG-ECT---------------LATCGTQGQCPG-------------------GAECVN---I 44 (739)
Q Consensus 3 ~~~~~~~~C~C~~Gy~g~~~-~C~---------------~~~C~~~~~C~~-------------------~g~C~~---~ 44 (739)
.++-+.++|.|+.+-.|.++ .|. +++|.. +.|.. +|+|+| +
T Consensus 289 ~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~a-c~Cn~harrcrfn~Ely~lSgr~SggvClnCrHn 367 (592)
T KOG3512|consen 289 MDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVA-CNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHN 367 (592)
T ss_pred eccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccc-cccchhhhhcccchhhhcccCccccceEeecccC
Confidence 34556699999999888764 232 334443 33322 245664 2
Q ss_pred CCCeeEE-ecCCCCccCCCCCCcccccccCCCCCCC----CCCceeeCCCCceeeCCCCCcCCCCCCC
Q psy9419 45 AGGVSYC-ACPKGFRPKEDGYCEDVDECAESRHLCG----PGAVCINHPGSYTCQCPPNSSGDPLLGC 107 (739)
Q Consensus 45 ~~g~~~C-~C~~Gy~g~~~~~C~dideC~~~~~~C~----~~~~C~n~~gsy~C~C~~Gy~g~~~~~C 107 (739)
+.|- .| .|.+||..+...-=.+...|.. ..|+ .+-+|..+.| +|.|++|-+|..++.|
T Consensus 368 TaGr-hChyCreGyyRd~s~pl~hrkaCk~--CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnrC 430 (592)
T KOG3512|consen 368 TAGR-HCHYCREGYYRDGSKPLTHRKACKA--CDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNRC 430 (592)
T ss_pred CCCc-ccccccCccccCCCCCCchhhhhhh--cCCcccccccccccccCC--cccCCCCCcccccccc
Confidence 2232 45 6999998764321112222222 2232 2457777777 8999999999865533
No 69
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=46.33 E-value=20 Score=32.03 Aligned_cols=37 Identities=27% Similarity=0.571 Sum_probs=26.4
Q ss_pred CCCCC--CCCCCCCCCeeeec--CCceEEeCCCCCCCCCCCc
Q psy9419 247 VDECL--GVSPCASSALCVNE--KGGFKCVCPKGTTGDPYTL 284 (739)
Q Consensus 247 ~~eC~--~~~~C~~~~~C~~~--~g~~~C~C~~Gy~g~~c~~ 284 (739)
+.+|. ..+-|.+ |+|.-. ...+.|.|..||+|..|+.
T Consensus 42 i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 42 IRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred cccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccccccc
Confidence 44553 2355765 489544 4778999999999999875
No 70
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=42.03 E-value=6.9 Score=30.68 Aligned_cols=14 Identities=21% Similarity=0.453 Sum_probs=8.4
Q ss_pred ceeeCCCCCccCCC
Q psy9419 661 GQCLCPSDLIGNPT 674 (739)
Q Consensus 661 ~~C~C~~Gy~G~~c 674 (739)
++=+|.+||+|..|
T Consensus 50 G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 50 GNKVCLPGWTGPNC 63 (63)
T ss_dssp --EEE-TTEESTTS
T ss_pred CCCCCCCCCcCCCC
Confidence 56678888888764
No 71
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=32.21 E-value=25 Score=30.46 Aligned_cols=32 Identities=25% Similarity=0.640 Sum_probs=23.2
Q ss_pred CCCCC-CCCCCCCeeeeCCC----CeeEEecCCCCcc
Q psy9419 28 TCGTQ-GQCPGGAECVNIAG----GVSYCACPKGFRP 59 (739)
Q Consensus 28 ~C~~~-~~C~~~g~C~~~~~----g~~~C~C~~Gy~g 59 (739)
.|... +.|++||.|++... .=|.|.|.+.+..
T Consensus 7 aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 7 ACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred HHHHhccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 34444 78999999998632 3589999996554
No 72
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=28.11 E-value=36 Score=29.50 Aligned_cols=31 Identities=26% Similarity=0.800 Sum_probs=22.8
Q ss_pred CCCC-CCCCCCCCCEEeeCC-----CCeeeecCCCCC
Q psy9419 571 DECW-SSNTCGSNAVCINTP-----GSYDCRCKEGNA 601 (739)
Q Consensus 571 d~C~-~~~~C~~~g~C~~~~-----g~~~C~C~~G~~ 601 (739)
++|. ..+.|..||.|+... .=|.|.|.+.+.
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~ 42 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVV 42 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeecccc
Confidence 3443 567899999999872 448999999544
No 73
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=27.75 E-value=31 Score=25.97 Aligned_cols=33 Identities=18% Similarity=0.378 Sum_probs=17.7
Q ss_pred CCCCCCccccc----CCCCceeecCCCceeCCCCccc
Q psy9419 320 VECGAHALCEP----QDHRASCRCELGYTEGLNGKCV 352 (739)
Q Consensus 320 ~~C~~~~~C~~----~~g~~~C~C~~G~~g~~~~~c~ 352 (739)
.+|+.||.-.. ..|...|.|..-|.|..|.+-+
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence 45677776632 2566899999999999887644
No 74
>KOG3516|consensus
Probab=27.26 E-value=56 Score=40.05 Aligned_cols=44 Identities=27% Similarity=0.587 Sum_probs=37.3
Q ss_pred CCCcccCCCCCCCCCCCCCCeeeecCCceEEeCC-CCCCCCCCCcc
Q psy9419 241 KVGCLDVDECLGVSPCASSALCVNEKGGFKCVCP-KGTTGDPYTLG 285 (739)
Q Consensus 241 ~~~C~d~~eC~~~~~C~~~~~C~~~~g~~~C~C~-~Gy~g~~c~~~ 285 (739)
...|.-++.|. +++|.++|.|......|+|.|. .||.|..|..+
T Consensus 539 id~C~i~drCl-PN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 539 IDMCGISDRCL-PNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred ecccccccccC-CccccCCCcccccccceeEeccccccccccccCC
Confidence 34566778888 8999999999988889999999 89999988653
No 75
>KOG3516|consensus
Probab=22.46 E-value=64 Score=39.61 Aligned_cols=37 Identities=27% Similarity=0.729 Sum_probs=32.5
Q ss_pred CCccCCCCCCCCCCCCCCEEeeCCCCeeeecC-CCCCCC
Q psy9419 566 QCVDIDECWSSNTCGSNAVCINTPGSYDCRCK-EGNAGN 603 (739)
Q Consensus 566 ~C~~id~C~~~~~C~~~g~C~~~~g~~~C~C~-~G~~g~ 603 (739)
.|.-+|.| .+++|.++|.|.-....|.|.|. .||+|.
T Consensus 541 ~C~i~drC-lPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 541 MCGISDRC-LPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred cccccccc-CCccccCCCcccccccceeEeccccccccc
Confidence 45556777 89999999999998889999998 999998
Done!