Query psy13157
Match_columns 1434
No_of_seqs 543 out of 3767
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 21:36:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13157.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13157hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1217|consensus 99.6 4.6E-14 9.9E-19 176.7 29.8 326 261-733 98-435 (487)
2 KOG1214|consensus 99.6 3E-14 6.5E-19 169.2 24.8 213 512-782 692-915 (1289)
3 KOG4289|consensus 99.6 4.1E-15 9E-20 183.3 16.2 106 149-280 1180-1308(2531)
4 KOG1217|consensus 99.6 2.9E-14 6.3E-19 178.5 23.2 296 77-541 99-422 (487)
5 KOG4289|consensus 99.6 1.3E-14 2.7E-19 179.1 17.2 91 481-597 1217-1308(2531)
6 KOG1214|consensus 99.5 6.7E-13 1.5E-17 158.0 15.5 211 148-399 692-914 (1289)
7 KOG0994|consensus 99.3 8.3E-11 1.8E-15 144.3 18.9 170 17-227 754-945 (1758)
8 KOG1219|consensus 99.2 3E-11 6.5E-16 154.9 8.1 112 148-286 3864-3976(4289)
9 KOG1219|consensus 99.2 4.8E-11 1E-15 153.1 8.3 105 574-719 3869-3974(4289)
10 KOG0994|consensus 99.0 5.2E-09 1.1E-13 129.0 18.3 64 477-544 1030-1095(1758)
11 KOG1225|consensus 99.0 3E-09 6.6E-14 128.2 14.9 132 1175-1364 233-365 (525)
12 KOG1225|consensus 98.8 1.9E-08 4.1E-13 121.4 13.6 185 45-337 160-364 (525)
13 KOG4260|consensus 98.5 1.1E-07 2.4E-12 101.6 4.9 164 172-392 131-304 (350)
14 KOG4260|consensus 98.5 2.2E-07 4.8E-12 99.4 6.5 122 1007-1157 150-304 (350)
15 PF07645 EGF_CA: Calcium-bindi 97.8 1.3E-05 2.8E-10 64.1 2.7 33 1208-1240 1-35 (42)
16 KOG1836|consensus 97.8 0.00074 1.6E-08 92.7 20.1 103 46-181 695-810 (1705)
17 KOG1836|consensus 97.6 0.001 2.2E-08 91.4 18.5 63 477-544 952-1018(1705)
18 KOG1226|consensus 97.6 0.00017 3.8E-09 88.9 8.8 131 1277-1418 468-628 (783)
19 PF12947 EGF_3: EGF domain; I 97.6 3.2E-05 6.9E-10 59.2 1.6 32 1214-1245 5-36 (36)
20 PF07645 EGF_CA: Calcium-bindi 97.5 6.2E-05 1.3E-09 60.3 2.5 34 1069-1102 1-36 (42)
21 PF06247 Plasmod_Pvs28: Plasmo 97.4 3.4E-05 7.4E-10 79.6 0.2 144 576-778 7-164 (197)
22 PF00008 EGF: EGF-like domain 97.4 8.8E-05 1.9E-09 55.4 2.3 30 1212-1241 1-31 (32)
23 smart00179 EGF_CA Calcium-bind 97.4 0.00016 3.4E-09 56.7 3.9 36 1208-1245 1-38 (39)
24 PF00008 EGF: EGF-like domain 97.4 9.7E-05 2.1E-09 55.2 2.4 31 151-181 1-32 (32)
25 PF06247 Plasmod_Pvs28: Plasmo 97.3 0.00011 2.4E-09 75.9 2.0 153 951-1160 7-163 (197)
26 smart00179 EGF_CA Calcium-bind 97.1 0.00051 1.1E-08 53.8 3.9 34 686-719 1-36 (39)
27 cd00054 EGF_CA Calcium-binding 96.9 0.001 2.3E-08 51.5 3.8 34 1208-1241 1-35 (38)
28 PF12947 EGF_3: EGF domain; I 96.8 0.00076 1.6E-08 51.7 1.9 30 1076-1105 6-35 (36)
29 KOG1226|consensus 96.7 0.0072 1.6E-07 75.1 10.4 137 1214-1368 466-622 (783)
30 cd00054 EGF_CA Calcium-binding 96.5 0.0034 7.3E-08 48.5 3.9 33 687-719 2-35 (38)
31 cd00053 EGF Epidermal growth f 95.9 0.0092 2E-07 45.3 3.8 28 1214-1241 5-32 (36)
32 smart00181 EGF Epidermal growt 95.6 0.013 2.9E-07 44.6 3.5 29 1212-1241 2-31 (35)
33 cd00053 EGF Epidermal growth f 95.5 0.017 3.8E-07 43.8 3.9 28 153-180 5-32 (36)
34 smart00181 EGF Epidermal growt 95.2 0.025 5.4E-07 43.1 3.8 27 154-181 6-33 (35)
35 PF12662 cEGF: Complement Clr- 95.1 0.018 3.9E-07 39.8 2.3 24 707-732 1-24 (24)
36 PF12662 cEGF: Complement Clr- 94.7 0.02 4.4E-07 39.5 1.8 24 325-350 1-24 (24)
37 KOG1218|consensus 94.1 4.2 9E-05 48.0 21.0 123 1221-1364 117-251 (316)
38 PF07974 EGF_2: EGF-like domai 92.8 0.099 2.1E-06 39.1 2.6 24 1277-1300 7-32 (32)
39 KOG1218|consensus 92.1 2.4 5.2E-05 50.0 14.9 142 1228-1386 47-196 (316)
40 PF12661 hEGF: Human growth fa 91.7 0.094 2E-06 30.8 1.1 13 1288-1300 1-13 (13)
41 PF07974 EGF_2: EGF-like domai 91.1 0.26 5.7E-06 36.9 3.2 26 154-181 6-31 (32)
42 PF12946 EGF_MSP1_1: MSP1 EGF 90.3 0.13 2.7E-06 39.4 0.9 34 1212-1245 2-36 (37)
43 PF14670 FXa_inhibition: Coagu 90.2 0.17 3.7E-06 38.9 1.6 26 74-101 7-32 (36)
44 smart00051 DSL delta serrate l 88.9 0.32 7E-06 42.6 2.6 22 1344-1365 41-63 (63)
45 PF14670 FXa_inhibition: Coagu 88.1 0.32 6.9E-06 37.4 1.8 26 154-181 6-31 (36)
46 smart00051 DSL delta serrate l 86.6 0.72 1.6E-05 40.5 3.3 46 1230-1300 17-63 (63)
47 PF12946 EGF_MSP1_1: MSP1 EGF 81.8 0.52 1.1E-05 36.2 0.4 32 1073-1104 2-34 (37)
48 cd01475 vWA_Matrilin VWA_Matri 80.0 1.5 3.3E-05 49.1 3.5 39 1201-1241 179-219 (224)
49 cd00055 EGF_Lam Laminin-type e 79.2 2.1 4.7E-05 35.6 3.3 31 80-122 13-43 (50)
50 PF00053 Laminin_EGF: Laminin 77.7 1.6 3.5E-05 36.1 2.1 32 79-122 11-42 (49)
51 KOG3512|consensus 73.0 21 0.00045 42.9 10.1 161 264-439 285-476 (592)
52 PF00053 Laminin_EGF: Laminin 72.7 3.3 7.1E-05 34.3 2.7 32 1139-1188 11-42 (49)
53 cd00055 EGF_Lam Laminin-type e 72.4 4 8.6E-05 34.0 3.1 26 1147-1188 18-43 (50)
54 smart00180 EGF_Lam Laminin-typ 71.8 3.6 7.7E-05 33.7 2.7 29 80-120 12-40 (46)
55 PF01683 EB: EB module; Inter 71.3 3.6 7.8E-05 34.5 2.7 21 1277-1297 27-47 (52)
56 cd01475 vWA_Matrilin VWA_Matri 69.2 4 8.6E-05 45.7 3.3 35 683-719 183-219 (224)
57 smart00180 EGF_Lam Laminin-typ 63.3 6.3 0.00014 32.2 2.5 24 1147-1186 17-40 (46)
58 PF01683 EB: EB module; Inter 61.4 11 0.00023 31.6 3.7 22 1342-1363 27-48 (52)
59 KOG3516|consensus 46.3 14 0.0003 49.3 2.7 36 148-186 545-581 (1306)
60 KOG3512|consensus 42.4 69 0.0015 38.8 7.3 53 581-633 285-338 (592)
61 PHA03099 epidermal growth fact 37.3 30 0.00066 34.4 2.9 29 750-779 51-81 (139)
62 PF01414 DSL: Delta serrate li 35.7 13 0.00027 32.8 0.1 49 706-778 15-63 (63)
63 KOG3516|consensus 33.0 34 0.00073 45.9 3.2 35 193-227 543-578 (1306)
64 PF00954 S_locus_glycop: S-loc 31.4 41 0.00089 33.0 2.9 26 1133-1159 84-109 (110)
65 PHA02887 EGF-like protein; Pro 31.3 41 0.00088 33.0 2.7 28 751-779 93-122 (126)
66 PF00954 S_locus_glycop: S-loc 31.3 41 0.00089 33.0 2.9 33 1208-1241 76-109 (110)
67 PHA03099 epidermal growth fact 29.4 50 0.0011 32.9 2.9 36 1208-1246 41-81 (139)
68 PF01414 DSL: Delta serrate li 28.2 28 0.0006 30.7 1.0 47 1229-1300 16-63 (63)
69 KOG3514|consensus 28.0 35 0.00076 44.9 2.1 36 1211-1248 625-661 (1591)
70 KOG3514|consensus 24.7 45 0.00097 44.0 2.2 36 689-726 625-661 (1591)
71 PHA02887 EGF-like protein; Pro 23.3 67 0.0015 31.5 2.6 29 472-504 93-123 (126)
72 PF09064 Tme5_EGF_like: Thromb 21.5 68 0.0015 24.5 1.7 16 1395-1410 17-32 (34)
No 1
>KOG1217|consensus
Probab=99.64 E-value=4.6e-14 Score=176.72 Aligned_cols=326 Identities=28% Similarity=0.620 Sum_probs=229.0
Q ss_pred CCCCceeecCCCceeeCCCCcccCCccccccCCCCCCCCCCCCcCCCCCCCC--CCCCCccccC---CCCCcccCCCCcc
Q psy13157 261 GQNANCRVINHSPICTCKPGFTGDALVYCNRIPPSRPLESPPEYVNPCVPSP--CGPYAQCRDI---NGSPSCSCLPNYI 335 (1434)
Q Consensus 261 ~~~~~C~~~~g~y~C~C~~Gf~G~~c~~C~~~~~~~~~~~~~~dideC~~~~--C~~~g~C~n~---~gsy~C~C~~Gy~ 335 (1434)
...+.+.....+|.|.|++||.|..+.. ..+|...+ +...+.|.+. ...|.|.|..||.
T Consensus 98 ~~~~~~~~~~~~~~c~c~~g~~~~~~~~----------------~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~ 161 (487)
T KOG1217|consen 98 LLCGECVDCVGSYECTCPPGYQGTPCEG----------------ECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYE 161 (487)
T ss_pred cCCccccCCCCCceeeCCCccccCcCCc----------------ceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcc
Confidence 3445666678899999999999987531 11465544 3456777764 4589999999999
Q ss_pred CCCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCCCCCeeeecCCCccccCCCCCccCCCcccCCCCCCCCCCCCCCCC
Q psy13157 336 GAPPNCRPECVQNSECPHDKACINEKCADPCLGSCGYGAVCTVINHSPICTCPEGFIGDAFSSCYPKPPEPIEPVIQEDT 415 (1434)
Q Consensus 336 g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~~C~~~~gsy~C~C~~G~~G~~c~~C~~~~~~~~~~c~~~~~ 415 (1434)
+. .++. ..++|.... +.|.+++.|.+..++|.|.|++||++..++.-
T Consensus 162 ~~--~~~~---~~~~C~~~~------------~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------------- 208 (487)
T KOG1217|consen 162 GE--PCET---DLDECIQYS------------SPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------------- 208 (487)
T ss_pred cc--cccc---cccccccCC------------CCcCCCcccccCCCCeeEeCCCCccCCcCcCC----------------
Confidence 98 5542 224555331 35778899999999999999999999987620
Q ss_pred CCCCCCCeeec-ceeccCCCcccCCCccCCCccccCCCCCCCcccccCCCCCCCCCCCCCCCCEEeccCCceEeeCCCCC
Q psy13157 416 CNCVPNAECRD-GVCLCLPDYYGDGYVSCRPECVQNSDCPRNKACIRNKCKNPCTPGTCGEGAICDVVNHAVSCTCPPGT 494 (1434)
Q Consensus 416 c~C~~~~~C~~-~~C~C~~Gy~G~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~y~C~C~~G~ 494 (1434)
.+.+.|++ ..|.+.+||.+..+... +.+|... + ++|.+..++|+|.|++||
T Consensus 209 ---~~~~~c~~~~~~~~~~g~~~~~c~~~---------------------~~~~~~~---~-~~c~~~~~~~~C~~~~g~ 260 (487)
T KOG1217|consen 209 ---GNGGTCVDSVACSCPPGARGPECEVS---------------------IVECASG---D-GTCVNTVGSYTCRCPEGY 260 (487)
T ss_pred ---CCCceEecceeccCCCCCCCCCcccc---------------------cccccCC---C-CcccccCCceeeeCCCCc
Confidence 23455654 36889999987763321 1222222 3 789999999999999999
Q ss_pred cCCCCccccccccCCCCCCCCCCC-CCCCCceeeccCCCeeeeCCCCCcCCCCCCcCCCccCCCCCCCCcccCCcccCCC
Q psy13157 495 TGSPFVQCKTIQYEPVYTNPCQPS-PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKACVNQKCVDPC 573 (1434)
Q Consensus 495 ~G~~~~~C~~~~~~~~~~d~C~~~-~C~~~g~C~~~~g~y~C~C~~Gy~G~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C 573 (1434)
++.....+. ++++|+.. +|.++++|+++.+.|.|.|++||+|. .| ..+.....|... .-
T Consensus 261 ~~~~~~~~~-------~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~------~~---~~~~~~~~C~~~----~~ 320 (487)
T KOG1217|consen 261 TGDACVTCV-------DVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR------LC---TECVDVDECSPR----NA 320 (487)
T ss_pred cccccceee-------eccccCCCCccCCCCeeecCCCcceeeCCCCCCCC------CC---cccccccccccc----cc
Confidence 998611233 47899865 39999999999999999999999999 33 112222344211 00
Q ss_pred CCCCCCCcee--eecCCCceeeCCCCCccCCCCcccCCCCCCCCCCCCCCCC-CCCCCCCCCCCCeeEe-cCCCceeeCC
Q psy13157 574 PGSCGQNANC--RVINHSPVCSCKPGFTGEPRIRCNKIPPRPPPQEDVPEPV-NPCYPSPCGPYSQCRD-IGGSPSCSCL 649 (1434)
Q Consensus 574 ~~~C~~~~~C--~~~~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~c~~di-deC~~~~C~~~~~C~n-~~gsy~C~C~ 649 (1434)
...|.++++| ....+.+.|.|..||.|.. |+ .+ ++|...++.+++.|++ ..++|.|.++
T Consensus 321 ~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~---C~--------------~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~ 383 (487)
T KOG1217|consen 321 GGPCANGGTCNTLGSFGGFRCACGPGFTGRR---CE--------------DSNDECASSPCCPGGTCVNETPGSYRCACP 383 (487)
T ss_pred CCcCCCCcccccCCCCCCCCcCCCCCCCCCc---cc--------------cCCccccCCccccCCEeccCCCCCeEecCC
Confidence 3457777788 3344678899999998887 85 34 5898888999999999 7899999999
Q ss_pred CCCcCC-CCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeeccCCCceeeCCCCcccCCCCCccCCc
Q psy13157 650 PNYIGS-PPNCRPECVMNSECPSHEASRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGSPPNCRPECV 728 (1434)
Q Consensus 650 ~Gy~g~-~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~~ideC~~~~C~~~~~C~n~~gsy~C~C~~Gy~G~~~~C~~~C~ 728 (1434)
.+|.+. .... ..+.++++|.. .+.|++..++|.|. ++ + ... .. .|.
T Consensus 384 ~~~~~~~~~~~----------------------~~~~~~~~c~~-----~~~c~~~~~~~~c~-~~-~-~~~-~~--~~~ 430 (487)
T KOG1217|consen 384 AGFAGKANGDG----------------------VGCEDIDECSG-----CGDCVNGPGGGACT-PP-G-LVS-PG--TCD 430 (487)
T ss_pred CccccCCcccc----------------------ccccccccccC-----CcceeccCCCCccc-cC-c-ccC-Cc--cee
Confidence 999984 1111 12256677755 56788889999999 88 5 332 22 455
Q ss_pred cCCCC
Q psy13157 729 MNSEC 733 (1434)
Q Consensus 729 ~~~eC 733 (1434)
+++++
T Consensus 431 ~~~~~ 435 (487)
T KOG1217|consen 431 DIDEC 435 (487)
T ss_pred ccccc
Confidence 55544
No 2
>KOG1214|consensus
Probab=99.63 E-value=3e-14 Score=169.22 Aligned_cols=213 Identities=29% Similarity=0.620 Sum_probs=148.0
Q ss_pred CCCCC--CCCCCCCceeeccC-CCeeeeCCCCCcCCCCCCcCCCccCCCCCCCCcccCCcccCCCCCCCCCCceeeecCC
Q psy13157 512 TNPCQ--PSPCGPNSQCREVN-HQAVCSCLPNYFGSPPACRPECTVNSDCPLDKACVNQKCVDPCPGSCGQNANCRVINH 588 (1434)
Q Consensus 512 ~d~C~--~~~C~~~g~C~~~~-g~y~C~C~~Gy~G~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g 588 (1434)
+++|- +.-|..++.|+... -.|+|.|..||.|++..| .+.++|... ++.|.+++.|++.++
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c----------~d~~eca~~------~~~CGp~s~Cin~pg 755 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRNC----------VDENECATG------FHRCGPNSVCINLPG 755 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCCCCC----------CChhhhccC------CCCCCCCceeecCCC
Confidence 44553 56678888898764 469999999999997543 233455543 678999999999999
Q ss_pred CceeeCCCCCccC-CCCcccCCCCCCCCCCCCCCCCCCCCC--CCCCCCCe--eEecC-CCceeeCCCCCcCCCCCCccc
Q psy13157 589 SPVCSCKPGFTGE-PRIRCNKIPPRPPPQEDVPEPVNPCYP--SPCGPYSQ--CRDIG-GSPSCSCLPNYIGSPPNCRPE 662 (1434)
Q Consensus 589 ~~~C~C~~Gy~G~-~~~~C~~~~~~~~~~~~c~~dideC~~--~~C~~~~~--C~n~~-gsy~C~C~~Gy~g~~~~C~~~ 662 (1434)
+|+|.|..||.-. .+..|..+.+. ..++.|.. +.|...+. |+... ++|+|.|.|||.|++..|
T Consensus 756 ~~rceC~~gy~F~dd~~tCV~i~~p--------ap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c--- 824 (1289)
T KOG1214|consen 756 SYRCECRSGYEFADDRHTCVLITPP--------APANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQC--- 824 (1289)
T ss_pred ceeEEEeecceeccCCcceEEecCC--------CCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcccc---
Confidence 9999999998622 12458655432 24677763 67877655 45444 569999999999998665
Q ss_pred cccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeeccCCCceeeCCCCcccCCCCCccCCccCCCCCCcchhccc
Q psy13157 663 CVMNSECPSHEASRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGSPPNCRPECVMNSECPSHEACINE 742 (1434)
Q Consensus 663 C~~~~~C~~~~~~~~~~~~~~~~~ideC~~~~C~~~~~C~n~~gsy~C~C~~Gy~G~~~~C~~~C~~~~eC~~~~~C~~~ 742 (1434)
.++|||+++.|+++|+|.+++|+|.|.|.+||.|++..|.+.=.....|....
T Consensus 825 ----------------------~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er----- 877 (1289)
T KOG1214|consen 825 ----------------------TDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQER----- 877 (1289)
T ss_pred ----------------------ccccccCccccCCCceEecCCCcceeecccCccCCCceecCCCccCCcccccc-----
Confidence 57899999999999999999999999999999999877754311222332210
Q ss_pred ccCCCCCCCCCCCCeee--ecCCcceeeCCCCCccCCCCCCC
Q psy13157 743 KCQDPCPGSCGYNAECK--VINHTPICTCPQGFIGDAFSGCY 782 (1434)
Q Consensus 743 ~C~~~C~~~C~~~~~C~--~~~g~y~C~C~~Gy~G~~c~~C~ 782 (1434)
.. +-.|+.++.|. ..+..|.+.+.++=.|+.-..|-
T Consensus 878 --~h--pl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~ 915 (1289)
T KOG1214|consen 878 --FH--PLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCG 915 (1289)
T ss_pred --cc--ceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCC
Confidence 00 13455444332 12345788877776666544443
No 3
>KOG4289|consensus
Probab=99.62 E-value=4.1e-15 Score=183.26 Aligned_cols=106 Identities=32% Similarity=0.830 Sum_probs=89.1
Q ss_pred CCCCCCCCCCCCEEe----------------------ecCCceeeeCCCCCccCCCCccccCCCCCCCCCCCCCCCCCCC
Q psy13157 149 NPCVPGTCGEGAICN----------------------VENHAVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPN 206 (1434)
Q Consensus 149 n~C~~~~C~~~g~C~----------------------~~~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~C~~~ 206 (1434)
|.|+..||.|.++|+ +..++++|+||+||+|+. |+ +.+|+|-+.||.++
T Consensus 1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~---Ce------TeiDlCYs~pC~nn 1250 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDY---CE------TEIDLCYSGPCGNN 1250 (2531)
T ss_pred chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCccc---cc------chhHhhhcCCCCCC
Confidence 456677888888884 346789999999999998 98 78999999999999
Q ss_pred CceeccCCceeeccCCCCcCCCCCCcCCCCcCCCcCCCCccCCCcccCCCCCCCCCCCceeec-CCCceeeCCCC
Q psy13157 207 SQCREINSQAVCSCLPNYFGSPPACRPECTVNSDCLQSKACFNQKCVDPCPGTCGQNANCRVI-NHSPICTCKPG 280 (1434)
Q Consensus 207 g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~c~~~~~C~~~~C~~~C~~~C~~~~~C~~~-~g~y~C~C~~G 280 (1434)
|+|....|+|+|.|.+||+|. .|+... ....|. ++.|.++++|++. .|+|.|.|+.|
T Consensus 1251 g~C~srEggYtCeCrpg~tGe------hCEvs~---~agrCv--------pGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1251 GRCRSREGGYTCECRPGFTGE------HCEVSA---RAGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred CceEEecCceeEEecCCcccc------ceeeec---ccCccc--------cceecCCCEEeecCCCceeccCCCc
Confidence 999999999999999999999 777653 122333 3788899999995 47799999987
No 4
>KOG1217|consensus
Probab=99.61 E-value=2.9e-14 Score=178.51 Aligned_cols=296 Identities=30% Similarity=0.677 Sum_probs=196.8
Q ss_pred CCCceeecCCCCeeeCCCCCccCCCCc---cccC-----------------CCceeecCCCccCCCcccCCCccccCCCC
Q psy13157 77 QNANCRVINHSPVCSCKPGFTGEPRIR---CNKI-----------------PHGVCVCLPDYYGDGYVSCRPECVLNSDC 136 (1434)
Q Consensus 77 ~~g~C~n~~g~~~C~C~~G~~g~~~~~---C~~~-----------------~~~~C~C~~Gy~g~~~~~c~~eC~~~~~C 136 (1434)
..+.++...++|.|.|++||.+..+.. |... ..+.|.|..||.+..+....++|...
T Consensus 99 ~~~~~~~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~--- 175 (487)
T KOG1217|consen 99 LCGECVDCVGSYECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQY--- 175 (487)
T ss_pred CCccccCCCCCceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccC---
Confidence 345566677788899999988875432 3211 24677777777776543222222210
Q ss_pred CCccccccCCccCCCCCCCCCCCCEEeecCCceeeeCCCCCccCCCCccccCCCCCCCCCCCCCCCCCCCCceeccCCce
Q psy13157 137 PSNKACIRNKCKNPCVPGTCGEGAICNVENHAVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPNSQCREINSQA 216 (1434)
Q Consensus 137 ~~~~~C~~~~C~n~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~y 216 (1434)
..+|.+++.|.+..++|.|.|++||++.. |+. . .+++.|++. +
T Consensus 176 ----------------~~~c~~~~~C~~~~~~~~C~c~~~~~~~~---~~~------~---------~~~~~c~~~---~ 218 (487)
T KOG1217|consen 176 ----------------SSPCQNGGTCVNTGGSYLCSCPPGYTGST---CET------T---------GNGGTCVDS---V 218 (487)
T ss_pred ----------------CCCcCCCcccccCCCCeeEeCCCCccCCc---CcC------C---------CCCceEecc---e
Confidence 33577777777777777788888887776 531 0 344555554 5
Q ss_pred eeccCCCCcCCCCCCcCCCCcCCCcCCCCccCCCcccCCCCCCCCCC-CceeecCCCceeeCCCCcccCCccccccCCCC
Q psy13157 217 VCSCLPNYFGSPPACRPECTVNSDCLQSKACFNQKCVDPCPGTCGQN-ANCRVINHSPICTCKPGFTGDALVYCNRIPPS 295 (1434)
Q Consensus 217 ~C~C~~Gy~g~~~~C~~~C~~~~~c~~~~~C~~~~C~~~C~~~C~~~-~~C~~~~g~y~C~C~~Gf~G~~c~~C~~~~~~ 295 (1434)
.|.+.+||.+. .|.... ..|... ++|+++.++|+|.|++||++..+..+
T Consensus 219 ~~~~~~g~~~~------~c~~~~------------------~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~------ 268 (487)
T KOG1217|consen 219 ACSCPPGARGP------ECEVSI------------------VECASGDGTCVNTVGSYTCRCPEGYTGDACVTC------ 268 (487)
T ss_pred eccCCCCCCCC------Cccccc------------------ccccCCCCcccccCCceeeeCCCCcccccccee------
Confidence 67777777765 344321 122222 78999999999999999999863223
Q ss_pred CCCCCCCCcCCCCCCCC-CCCCCccccCCCCCcccCCCCccCCCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCCCCC
Q psy13157 296 RPLESPPEYVNPCVPSP-CGPYAQCRDINGSPSCSCLPNYIGAPPNCRPECVQNSECPHDKACINEKCADPCLGSCGYGA 374 (1434)
Q Consensus 296 ~~~~~~~~dideC~~~~-C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~ 374 (1434)
+++++|+... |.++++|++..++|.|.|++||+|. .+ ..+.+..+|.... . .+.|.+++
T Consensus 269 -------~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~--~~-~~~~~~~~C~~~~--~--------~~~c~~g~ 328 (487)
T KOG1217|consen 269 -------VDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR--LC-TECVDVDECSPRN--A--------GGPCANGG 328 (487)
T ss_pred -------eeccccCCCCccCCCCeeecCCCcceeeCCCCCCCC--CC-ccccccccccccc--c--------CCcCCCCc
Confidence 5789999864 9999999999999999999999999 55 4455556664321 0 13466677
Q ss_pred ee--eecCCCccccCCCCCccCCCcccCCCCCCCCCCCCCCCCCCCCCCCeeecceeccCCCcccCCCccCCCccccCCC
Q psy13157 375 VC--TVINHSPICTCPEGFIGDAFSSCYPKPPEPIEPVIQEDTCNCVPNAECRDGVCLCLPDYYGDGYVSCRPECVQNSD 452 (1434)
Q Consensus 375 ~C--~~~~gsy~C~C~~G~~G~~c~~C~~~~~~~~~~c~~~~~c~C~~~~~C~~~~C~C~~Gy~G~~~~~c~~~C~~~~~ 452 (1434)
+| ....+.|.|.|..||.|..|+. .
T Consensus 329 ~C~~~~~~~~~~C~c~~~~~g~~C~~--------~--------------------------------------------- 355 (487)
T KOG1217|consen 329 TCNTLGSFGGFRCACGPGFTGRRCED--------S--------------------------------------------- 355 (487)
T ss_pred ccccCCCCCCCCcCCCCCCCCCcccc--------C---------------------------------------------
Confidence 77 2334567788888877776641 0
Q ss_pred CCCCcccccCCCCCCCCCCCCCCCCEEec-cCCceEeeCCCCCcCC-C--CccccccccCCCCCCCCCCCCCCCCceeec
Q psy13157 453 CPRNKACIRNKCKNPCTPGTCGEGAICDV-VNHAVSCTCPPGTTGS-P--FVQCKTIQYEPVYTNPCQPSPCGPNSQCRE 528 (1434)
Q Consensus 453 C~~~~~C~~~~C~~~C~~~~C~~~~~C~~-~~g~y~C~C~~G~~G~-~--~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~ 528 (1434)
.++|...++..++.|.+ ..++|.|.|+.+|.+. . ...+. ++++|.. .+.|++
T Consensus 356 ------------~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~-------~~~~c~~-----~~~c~~ 411 (487)
T KOG1217|consen 356 ------------NDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCE-------DIDECSG-----CGDCVN 411 (487)
T ss_pred ------------CccccCCccccCCEeccCCCCCeEecCCCccccCCcccccccc-------ccccccC-----Ccceec
Confidence 01333445677889988 6899999999999984 1 12233 3566654 557888
Q ss_pred cCCCeeeeCCCCC
Q psy13157 529 VNHQAVCSCLPNY 541 (1434)
Q Consensus 529 ~~g~y~C~C~~Gy 541 (1434)
..++|.|. ++ +
T Consensus 412 ~~~~~~c~-~~-~ 422 (487)
T KOG1217|consen 412 GPGGGACT-PP-G 422 (487)
T ss_pred cCCCCccc-cC-c
Confidence 89999999 87 5
No 5
>KOG4289|consensus
Probab=99.59 E-value=1.3e-14 Score=179.11 Aligned_cols=91 Identities=34% Similarity=0.849 Sum_probs=78.2
Q ss_pred ccCCceEeeCCCCCcCCCCccccccccCCCCCCCCCCCCCCCCceeeccCCCeeeeCCCCCcCCCCCCcCCCccCCCCCC
Q psy13157 481 VVNHAVSCTCPPGTTGSPFVQCKTIQYEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPL 560 (1434)
Q Consensus 481 ~~~g~y~C~C~~G~~G~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~G~~~~c~~~C~~~~~C~~ 560 (1434)
+..++++|.||+||+|+. |+. .||+|-+.||.++|+|....|+|+|+|.+||+|. +||....
T Consensus 1217 ~pvnglrCrCPpGFTgd~---CeT------eiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe------hCEvs~~--- 1278 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDY---CET------EIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE------HCEVSAR--- 1278 (2531)
T ss_pred cccCceeEeCCCCCCccc---ccc------hhHhhhcCCCCCCCceEEecCceeEEecCCcccc------ceeeecc---
Confidence 346789999999999997 887 5999999999999999999999999999999999 8887531
Q ss_pred CCcccCCcccCCCCCCCCCCceeeecC-CCceeeCCCC
Q psy13157 561 DKACVNQKCVDPCPGSCGQNANCRVIN-HSPVCSCKPG 597 (1434)
Q Consensus 561 ~~~C~~~~C~~~C~~~C~~~~~C~~~~-g~~~C~C~~G 597 (1434)
...|+ |+.|.++++|++.. |+|.|.|+.|
T Consensus 1279 agrCv--------pGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1279 AGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred cCccc--------cceecCCCEEeecCCCceeccCCCc
Confidence 12333 57899999999864 8899999988
No 6
>KOG1214|consensus
Probab=99.45 E-value=6.7e-13 Score=157.96 Aligned_cols=211 Identities=28% Similarity=0.605 Sum_probs=149.0
Q ss_pred cCCCC--CCCCCCCCEEeecC-CceeeeCCCCCccCCCCccccCCCCCCCCCCCCC--CCCCCCCceeccCCceeeccCC
Q psy13157 148 KNPCV--PGTCGEGAICNVEN-HAVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQP--SPCGPNSQCREINSQAVCSCLP 222 (1434)
Q Consensus 148 ~n~C~--~~~C~~~g~C~~~~-g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~--~~C~~~g~C~~~~g~y~C~C~~ 222 (1434)
+|+|. +..|..++.|.... -.|+|.|..||.|+.. .| .++++|+. +.|..+++|++.+++|+|.|..
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr-~c-------~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~ 763 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGR-NC-------VDENECATGFHRCGPNSVCINLPGSYRCECRS 763 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCCC-CC-------CChhhhccCCCCCCCCceeecCCCceeEEEee
Confidence 45664 56788889998754 4699999999999873 36 57889974 5699999999999999999999
Q ss_pred CCc--CCCCCCcCCCCcCCCcCCCCccCCCcccCCCCCCCCCCCc--eeecC-CCceeeCCCCcccCCccccccCCCCCC
Q psy13157 223 NYF--GSPPACRPECTVNSDCLQSKACFNQKCVDPCPGTCGQNAN--CRVIN-HSPICTCKPGFTGDALVYCNRIPPSRP 297 (1434)
Q Consensus 223 Gy~--g~~~~C~~~C~~~~~c~~~~~C~~~~C~~~C~~~C~~~~~--C~~~~-g~y~C~C~~Gf~G~~c~~C~~~~~~~~ 297 (1434)
||. +++. +|..+..-...++|..+ .+.|.-++. |+... ++|+|.|.|||.|++-. |
T Consensus 764 gy~F~dd~~----tCV~i~~pap~n~Ce~g------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c-------- 824 (1289)
T KOG1214|consen 764 GYEFADDRH----TCVLITPPAPANPCEDG------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-C-------- 824 (1289)
T ss_pred cceeccCCc----ceEEecCCCCCCccccC------ccccCcCCceEEEecCCceEEEeecCCccCCccc-c--------
Confidence 875 4433 34433333344555544 366765544 44433 46999999999999843 4
Q ss_pred CCCCCCcCCCCCCCCCCCCCccccCCCCCcccCCCCccCCCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCCCCCeee
Q psy13157 298 LESPPEYVNPCVPSPCGPYAQCRDINGSPSCSCLPNYIGAPPNCRPECVQNSECPHDKACINEKCADPCLGSCGYGAVCT 377 (1434)
Q Consensus 298 ~~~~~~dideC~~~~C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~~C~ 377 (1434)
.|+|||.++.|+.+|+|.|++|+|.|+|.+||.|+++.|.+.=.....|.... ..| -.|+.++.|.
T Consensus 825 -----~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er-------~hp--l~chg~t~~~ 890 (1289)
T KOG1214|consen 825 -----TDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQER-------FHP--LQCHGSTGFC 890 (1289)
T ss_pred -----ccccccCccccCCCceEecCCCcceeecccCccCCCceecCCCccCCcccccc-------ccc--eeecccccee
Confidence 58899999999999999999999999999999999988864311122233221 011 1365555443
Q ss_pred e--cCCCccccCCCCCccCCCccc
Q psy13157 378 V--INHSPICTCPEGFIGDAFSSC 399 (1434)
Q Consensus 378 ~--~~gsy~C~C~~G~~G~~c~~C 399 (1434)
. .+..|++.+.++-.|+.-..|
T Consensus 891 ~~~Dp~~~e~p~~~~ppG~~~~~c 914 (1289)
T KOG1214|consen 891 WCVDPDGHEVPGTQTPPGSTPPHC 914 (1289)
T ss_pred EeeCCCcccCCCCCCCCCCCCCCC
Confidence 2 345678988887777665444
No 7
>KOG0994|consensus
Probab=99.28 E-value=8.3e-11 Score=144.33 Aligned_cols=170 Identities=30% Similarity=0.664 Sum_probs=100.8
Q ss_pred cccccccCCcccccc-ccCCCCceeecCCCceeeCCCCCccCCCCCCCCCCCC--------CCCCCCCCCCCceeecCCC
Q psy13157 17 LDTLGILGSTVTKYL-LEKLITACRVINHTPICTCPQGYVGDAFSGCYPKPPE--------HPCPGSCGQNANCRVINHS 87 (1434)
Q Consensus 17 ~~~~~~~~~~~~~~~-~c~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~~--------~~C~~~C~~~g~C~n~~g~ 87 (1434)
+..+.+.+....+.. +-++.++|. ..+.+|+|+|+.+|..+..|.+.... -+|...=.-+..|..+.|
T Consensus 754 lsa~l~n~a~~CnCnptGSlS~vCn--~~GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tG- 830 (1758)
T KOG0994|consen 754 LSALLHNGASMCNCNPTGSLSSVCN--PNGGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITG- 830 (1758)
T ss_pred HHHHHhcCccccccCCCcccccccc--CCCceecccCccccccccccCCcccCcCCccCcccccccccccccccccccc-
Confidence 344444433333222 344555665 45678999999999988888764211 122222223445666655
Q ss_pred CeeeCCCCCccCCCCccccCCCceeecCCCccCCCcccCCCccccCCCCCCccccccCCccCCCCC--CCCCCCCEEeec
Q psy13157 88 PVCSCKPGFTGEPRIRCNKIPHGVCVCLPDYYGDGYVSCRPECVLNSDCPSNKACIRNKCKNPCVP--GTCGEGAICNVE 165 (1434)
Q Consensus 88 ~~C~C~~G~~g~~~~~C~~~~~~~C~C~~Gy~g~~~~~c~~eC~~~~~C~~~~~C~~~~C~n~C~~--~~C~~~g~C~~~ 165 (1434)
+|.|.+|-.|. +|+. |.+||||.+ +|. .|+-+..-+.|.+ +.|.+ |.+.
T Consensus 831 -QC~C~~g~ygr---qCnq-------CqpG~WgFP------eCr---------~CqCNgHA~~Cd~~tGaCi~---CqD~ 881 (1758)
T KOG0994|consen 831 -QCQCRPGTYGR---QCNQ-------CQPGYWGFP------ECR---------PCQCNGHADTCDPITGACID---CQDS 881 (1758)
T ss_pred -ceeeccccchh---hccc-------cCCCccCCC------cCc---------cccccCcccccCcccccccc---cccc
Confidence 99999999998 6764 999999974 332 1111112344432 33442 5567
Q ss_pred CCceee-eCCCCCccCCCCccccCCCCCCCCCCCCCCCCCCCC--------ceec--cCCceeeccCCCCcCC
Q psy13157 166 NHAVMC-TCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPNS--------QCRE--INSQAVCSCLPNYFGS 227 (1434)
Q Consensus 166 ~g~~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~C~~~g--------~C~~--~~g~y~C~C~~Gy~g~ 227 (1434)
.+++.| +|..||.|++.. .....|.+-||..+- .|.. ....-.|.|.+||.|.
T Consensus 882 T~G~~CdrCl~GyyGdP~l---------g~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~ 945 (1758)
T KOG0994|consen 882 TTGHSCDRCLDGYYGDPRL---------GSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGS 945 (1758)
T ss_pred ccccchhhhhccccCCccc---------CCCCCCCCCCCCCCCccchhccccccccccccceeeecccCcccc
Confidence 778889 699999999832 123556666665421 2432 1223568888888887
No 8
>KOG1219|consensus
Probab=99.17 E-value=3e-11 Score=154.90 Aligned_cols=112 Identities=28% Similarity=0.764 Sum_probs=98.9
Q ss_pred cCCCCCCCCCCCCEEeec-CCceeeeCCCCCccCCCCccccCCCCCCCCCCCCCCCCCCCCceeccCCceeeccCCCCcC
Q psy13157 148 KNPCVPGTCGEGAICNVE-NHAVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPNSQCREINSQAVCSCLPNYFG 226 (1434)
Q Consensus 148 ~n~C~~~~C~~~g~C~~~-~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~g 226 (1434)
.++|..+||+|+|+|+.. .|+|+|.|++-|+|.. || .++.+|+++||..+|+|+...++|.|.|+.||+|
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~---CE------i~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG 3934 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNH---CE------IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTG 3934 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccCcc---cc------cccccccCCCCCCCCEEEecCCCeeEeCCCCccC
Confidence 378999999999999986 5679999999999999 98 6899999999999999999999999999999999
Q ss_pred CCCCCcCCCCcCCCcCCCCccCCCcccCCCCCCCCCCCceeecCCCceeeCCCCcccCCc
Q psy13157 227 SPPACRPECTVNSDCLQSKACFNQKCVDPCPGTCGQNANCRVINHSPICTCKPGFTGDAL 286 (1434)
Q Consensus 227 ~~~~C~~~C~~~~~c~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~y~C~C~~Gf~G~~c 286 (1434)
. +|+... .++|.. +.|.++|.|+|+.|+|.|.|.+||.|..|
T Consensus 3935 ~------~Ce~~G----i~eCs~--------n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3935 K------RCEARG----ISECSK--------NVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred c------eeeccc----cccccc--------ccccCCceeeccCCceEeccChhHhcccC
Confidence 9 787641 122222 67888999999999999999999999986
No 9
>KOG1219|consensus
Probab=99.15 E-value=4.8e-11 Score=153.14 Aligned_cols=105 Identities=27% Similarity=0.704 Sum_probs=96.9
Q ss_pred CCCCCCCceeeecC-CCceeeCCCCCccCCCCcccCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeEecCCCceeeCCCCC
Q psy13157 574 PGSCGQNANCRVIN-HSPVCSCKPGFTGEPRIRCNKIPPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNY 652 (1434)
Q Consensus 574 ~~~C~~~~~C~~~~-g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~c~~dideC~~~~C~~~~~C~n~~gsy~C~C~~Gy 652 (1434)
.++|+++|+|+..+ |+|.|.|++-|+|.+ |+ .++..|+++||..+++|+...++|.|.|+.||
T Consensus 3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~~---CE-------------i~~epC~snPC~~GgtCip~~n~f~CnC~~gy 3932 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGNH---CE-------------IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGY 3932 (4289)
T ss_pred cCcccCCCEecCCCCCceEEeCcccccCcc---cc-------------cccccccCCCCCCCCEEEecCCCeeEeCCCCc
Confidence 36899999999876 789999999999998 86 68999999999999999999999999999999
Q ss_pred cCCCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeeccCCCceeeCCCCcccC
Q psy13157 653 IGSPPNCRPECVMNSECPSHEASRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGS 719 (1434)
Q Consensus 653 ~g~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~~ideC~~~~C~~~~~C~n~~gsy~C~C~~Gy~G~ 719 (1434)
+|. .|+. ..|+||+.++|.++|.|+|+.|+|.|.|.+||.|.
T Consensus 3933 TG~--~Ce~-----------------------~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3933 TGK--RCEA-----------------------RGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred cCc--eeec-----------------------ccccccccccccCCceeeccCCceEeccChhHhcc
Confidence 999 6751 13899999999999999999999999999999999
No 10
>KOG0994|consensus
Probab=99.03 E-value=5.2e-09 Score=129.03 Aligned_cols=64 Identities=30% Similarity=0.750 Sum_probs=39.1
Q ss_pred CEEeccCCceEeeCCCCCcCCCCccccccccCCCCCCCCCCCCCCC--CceeeccCCCeeeeCCCCCcCC
Q psy13157 477 AICDVVNHAVSCTCPPGTTGSPFVQCKTIQYEPVYTNPCQPSPCGP--NSQCREVNHQAVCSCLPNYFGS 544 (1434)
Q Consensus 477 ~~C~~~~g~y~C~C~~G~~G~~~~~C~~~~~~~~~~d~C~~~~C~~--~g~C~~~~g~y~C~C~~Gy~G~ 544 (1434)
+.|+...| +|-|.|...|....+|.+..|.-.-...|.+..|.. +-+|....| .|.|.|||-|.
T Consensus 1030 ~~CDr~tG--QCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR 1095 (1758)
T KOG0994|consen 1030 CHCDRFTG--QCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGR 1095 (1758)
T ss_pred cccccccC--cCCCCcccccccccccccchhccccCCCCCccCCCccCCcccccccc--ceeccCCCCCc
Confidence 45777777 999999999998666665444322233344333322 114544444 67888888777
No 11
>KOG1225|consensus
Probab=99.01 E-value=3e-09 Score=128.20 Aligned_cols=132 Identities=29% Similarity=0.735 Sum_probs=106.5
Q ss_pred CCccccCCCcccCCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCeEeecCCcceeEcCCCcccCCCCCcc-cccccC
Q psy13157 1175 EPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNPCYPSPCGLYSECRNVNGAPSCSCLINYIGSPPNCRP-ECIQNS 1253 (1434)
Q Consensus 1175 ~~~C~C~~Gy~g~~c~~C~~~~~~~~~~~~~~~dineC~~~~C~~~~~C~~~~gs~~C~C~~Gy~G~~~~C~~-eC~~~~ 1253 (1434)
.+.|.|+.||+|..++ .--| +..|...+.|++. +|.|++||+|+ +|.. .|+.
T Consensus 233 ~~ic~c~~~~~g~~c~------------------~~~C-~~~c~~~g~c~~G----~CIC~~Gf~G~--dC~e~~Cp~-- 285 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCS------------------TIYC-PGGCTGRGQCVEG----RCICPPGFTGD--DCDELVCPV-- 285 (525)
T ss_pred CceeecCCceeCCccc------------------cccC-CCCCcccceEeCC----eEeCCCCCcCC--CCCcccCCc--
Confidence 3478999999987653 1111 3456666788865 89999999999 7763 3432
Q ss_pred cccccccccccCCccccccCCCCCCCCCCeecCceEecCCCccCCCCccCCccCCCCCCCCCCCccccCccCCCCCCCCc
Q psy13157 1254 LLLGQSLLRTHSAVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCRPECVLNNDCPRNKACIKYKCKNPCVSAVQ 1333 (1434)
Q Consensus 1254 ~~~g~~~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~g~~ 1333 (1434)
.|+.++.++++.|+|++||+|..|+ +.+|. .+|..++.|++.+|. |..||+
T Consensus 286 -----------------------~cs~~g~~~~g~CiC~~g~~G~dCs--~~~cp--adC~g~G~Ci~G~C~--C~~Gy~ 336 (525)
T KOG1225|consen 286 -----------------------DCSGGGVCVDGECICNPGYSGKDCS--IRRCP--ADCSGHGKCIDGECL--CDEGYT 336 (525)
T ss_pred -----------------------ccCCCceecCCEeecCCCccccccc--cccCC--ccCCCCCcccCCceE--eCCCCc
Confidence 4778899999999999999999986 55564 789999999999998 999999
Q ss_pred CcccCCCcCCCCCCcccCceecCCCCccCCC
Q psy13157 1334 PVIQEDTCNCVPNAECRDGVCVCLPEYYGDG 1364 (1434)
Q Consensus 1334 ~~~~~~~c~C~~~~~C~~~~C~C~~Gy~g~~ 1364 (1434)
|..|... .|.+++.|.++ |.|..||.|..
T Consensus 337 G~~C~~~-~C~~~g~cv~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 337 GELCIQR-ACSGGGQCVNG-CKCKKGWRGPD 365 (525)
T ss_pred CCccccc-ccCCCceeccC-ceeccCccCCC
Confidence 9998876 59999999999 99999999887
No 12
>KOG1225|consensus
Probab=98.84 E-value=1.9e-08 Score=121.41 Aligned_cols=185 Identities=28% Similarity=0.713 Sum_probs=126.2
Q ss_pred CceeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCceeecCCCCeeeCCCCCccCCCC------------cccc------
Q psy13157 45 TPICTCPQGYVGDAFSGCYPKPPEHPCPGSCGQNANCRVINHSPVCSCKPGFTGEPRI------------RCNK------ 106 (1434)
Q Consensus 45 ~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~C~~~g~C~n~~g~~~C~C~~G~~g~~~~------------~C~~------ 106 (1434)
.++|.+.+++.+.... .-.++..+..++.+. .+.+.+..+|++.... ++..
T Consensus 160 ~~~c~~~~~~~~~~~g-------~~~~~~~~~~hg~~~----~~~~l~~~~~s~~~~~~~~~~~~~~~~~r~~~~~~~~~ 228 (525)
T KOG1225|consen 160 NGVCSLKPNPFGAECG-------QYKCPNDGSGHGRYY----FGNCLSGISASGETCNQLGCNDDCFRTGRCREGRCFCT 228 (525)
T ss_pred cccccccCCccccccc-------eecCCcCCCCCccce----ecccccccCcchhhhhcccCCccceeccccccCccccc
Confidence 4566666666665543 122333455555555 4577777777766320 0100
Q ss_pred --CCCceeecCCCccCCCcccCCCccccCCCCCCccccccCCccCCCCCCCCCCCCEEeecCCceeeeCCCCCccCCCCc
Q psy13157 107 --IPHGVCVCLPDYYGDGYVSCRPECVLNSDCPSNKACIRNKCKNPCVPGTCGEGAICNVENHAVMCTCPPGTTGSPFIQ 184 (1434)
Q Consensus 107 --~~~~~C~C~~Gy~g~~~~~c~~eC~~~~~C~~~~~C~~~~C~n~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~~~~ 184 (1434)
.-.+.|.|..+|+|..+. ...| +.-|.+++.|++. +|+|++||+|.+
T Consensus 229 ~~~~~~ic~c~~~~~g~~c~--------~~~C----------------~~~c~~~g~c~~G----~CIC~~Gf~G~d--- 277 (525)
T KOG1225|consen 229 AGFFDGICECPEGYFGPLCS--------TIYC----------------PGGCTGRGQCVEG----RCICPPGFTGDD--- 277 (525)
T ss_pred ccccCceeecCCceeCCccc--------cccC----------------CCCCcccceEeCC----eEeCCCCCcCCC---
Confidence 012478888888887542 1111 2346666788865 899999999999
Q ss_pred cccCCCCCCCCCCCCCCCCCCCCceeccCCceeeccCCCCcCCCCCCcCCCCcCCCcCCCCccCCCcccCCCCCCCCCCC
Q psy13157 185 CKPVQNEPVYTNPCQPSPCGPNSQCREINSQAVCSCLPNYFGSPPACRPECTVNSDCLQSKACFNQKCVDPCPGTCGQNA 264 (1434)
Q Consensus 185 C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~c~~~~~C~~~~C~~~C~~~C~~~~ 264 (1434)
|. +-.|... |+.++.+++. +|.|++||+|. .|+... |+..|..+|
T Consensus 278 C~--------e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~------dCs~~~----------------cpadC~g~G 322 (525)
T KOG1225|consen 278 CD--------ELVCPVD-CSGGGVCVDG----ECICNPGYSGK------DCSIRR----------------CPADCSGHG 322 (525)
T ss_pred CC--------cccCCcc-cCCCceecCC----EeecCCCcccc------cccccc----------------CCccCCCCC
Confidence 74 3346444 8888888865 89999999999 666532 357888899
Q ss_pred ceeecCCCceeeCCCCcccCCccccccCCCCCCCCCCCCcCCCCCCCCCCCCCccccCCCCCcccCCCCccCC
Q psy13157 265 NCRVINHSPICTCKPGFTGDALVYCNRIPPSRPLESPPEYVNPCVPSPCGPYAQCRDINGSPSCSCLPNYIGA 337 (1434)
Q Consensus 265 ~C~~~~g~y~C~C~~Gf~G~~c~~C~~~~~~~~~~~~~~dideC~~~~C~~~g~C~n~~gsy~C~C~~Gy~g~ 337 (1434)
.|+ ..+|.|.+||+|..|. +. .|.+++.|++. |.|..||.|.
T Consensus 323 ~Ci----~G~C~C~~Gy~G~~C~---------------~~-------~C~~~g~cv~g-----C~C~~Gw~G~ 364 (525)
T KOG1225|consen 323 KCI----DGECLCDEGYTGELCI---------------QR-------ACSGGGQCVNG-----CKCKKGWRGP 364 (525)
T ss_pred ccc----CCceEeCCCCcCCccc---------------cc-------ccCCCceeccC-----ceeccCccCC
Confidence 998 3479999999999863 11 38888999874 9999999998
No 13
>KOG4260|consensus
Probab=98.48 E-value=1.1e-07 Score=101.64 Aligned_cols=164 Identities=31% Similarity=0.642 Sum_probs=104.0
Q ss_pred eCCCCCccCCCCccccCCCCCCCCCCCCCCCCCCCCceec---cCCceeeccCCCCcCCCCCCcCCCCcCCCcCCC----
Q psy13157 172 TCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPNSQCRE---INSQAVCSCLPNYFGSPPACRPECTVNSDCLQS---- 244 (1434)
Q Consensus 172 ~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~---~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~c~~~---- 244 (1434)
-|++|..|+...+|.- =+..||..+|.|.- ..|+..|.|.+||.|.. |. .|.+..-.+..
T Consensus 131 CCp~gtyGpdCl~Cpg----------gser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~--C~-~Cg~eyfes~Rne~~ 197 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQCPG----------GSERPCFGNGSCHGDGSREGSGKCKCETGYTGPL--CR-YCGIEYFESSRNEQH 197 (350)
T ss_pred ccCCCCcCCccccCCC----------CCcCCcCCCCcccCCCCCCCCCcccccCCCCCcc--cc-ccchHHHHhhccccc
Confidence 4899999998333320 02457888899973 34678999999999982 21 22211000000
Q ss_pred CccCCCcccCCCCCCCCCCCceeecCCCcee-eCCCCcccCCccccccCCCCCCCCCCCCcCCCCCC--CCCCCCCcccc
Q psy13157 245 KACFNQKCVDPCPGTCGQNANCRVINHSPIC-TCKPGFTGDALVYCNRIPPSRPLESPPEYVNPCVP--SPCGPYAQCRD 321 (1434)
Q Consensus 245 ~~C~~~~C~~~C~~~C~~~~~C~~~~g~y~C-~C~~Gf~G~~c~~C~~~~~~~~~~~~~~dideC~~--~~C~~~g~C~n 321 (1434)
..|. +|..+|...|+. .++-.| .|+.||..+.- .| +|||||+. .||.....|+|
T Consensus 198 lvCt--~Ch~~C~~~Csg-------~~~k~C~kCkkGW~lde~-gC-------------vDvnEC~~ep~~c~~~qfCvN 254 (350)
T KOG4260|consen 198 LVCT--ACHEGCLGVCSG-------ESSKGCSKCKKGWKLDEE-GC-------------VDVNECQNEPAPCKAHQFCVN 254 (350)
T ss_pred chhh--hhhhhhhcccCC-------CCCCChhhhcccceeccc-cc-------------ccHHHHhcCCCCCChhheeec
Confidence 0010 122223223332 223345 69999987631 13 79999984 78999999999
Q ss_pred CCCCCcccCCCCccCCCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCCCCCeeeecCCCccccCCCCCc
Q psy13157 322 INGSPSCSCLPNYIGAPPNCRPECVQNSECPHDKACINEKCADPCLGSCGYGAVCTVINHSPICTCPEGFI 392 (1434)
Q Consensus 322 ~~gsy~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~~C~~~~gsy~C~C~~G~~ 392 (1434)
+.|||+|..++||.+.. |+|+.- .+.|. ..+..|.++.++|+|+|..|+.
T Consensus 255 teGSf~C~dk~Gy~~g~----------d~C~~~--------~d~~~---~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 255 TEGSFKCEDKEGYKKGV----------DECQFC--------ADVCA---SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred CCCceEecccccccCCh----------HHhhhh--------hhhcc---cCCCCcccCCccEEEEecccce
Confidence 99999999999998852 233220 01111 2367899999999999999886
No 14
>KOG4260|consensus
Probab=98.47 E-value=2.2e-07 Score=99.44 Aligned_cols=122 Identities=28% Similarity=0.568 Sum_probs=87.2
Q ss_pred CCCCCCCeeee---cCCCCcccCCCCCcCCCCceec---------------------------CCCceee-eCCCCCccC
Q psy13157 1007 GSCGQNANCRV---INHSPVCSCKPGFTGEPRIRCN---------------------------RIHAVMC-TCPPGTTGS 1055 (1434)
Q Consensus 1007 ~~C~~~a~C~~---~~g~~~C~C~~Gy~g~~~~~C~---------------------------~~~~~~C-~C~~Gy~G~ 1055 (1434)
.+|..++.|.- ..|+..|.|.+||+|..+..|. ...+-.| .|..||..+
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~ld 229 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLD 229 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceec
Confidence 35777777774 4578899999999999875442 1111222 567777766
Q ss_pred CCcccccCCCCCCCCCCCC--CCCCCCCCceeccCCceEEecCCCCcCCCCCCcCcccccCCCCCCcccCCCcccCCCCC
Q psy13157 1056 PFVQCKPIQNEPVYTNPCQ--PSPCGPNSQCREVNKQAVCSCLPNYFGSPPACRPECTVNSDCPLNKACQNQKCVDPCPG 1133 (1434)
Q Consensus 1056 ~~~~C~~~~~~~~~~~eC~--~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~ 1133 (1434)
. ..| +|||||. ++||..+..|+|+.|||+|...+||.+..+ +|+.- .+.|+
T Consensus 230 e-~gC-------vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d----~C~~~--------------~d~~~- 282 (350)
T KOG4260|consen 230 E-EGC-------VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVD----ECQFC--------------ADVCA- 282 (350)
T ss_pred c-ccc-------ccHHHHhcCCCCCChhheeecCCCceEecccccccCChH----Hhhhh--------------hhhcc-
Confidence 3 225 5899995 688999999999999999999999998633 34321 01111
Q ss_pred CCCCCCeeeecCCCceeeCCCCCc
Q psy13157 1134 TCGQNANCKVINHSPICTCKPGYT 1157 (1434)
Q Consensus 1134 ~C~~~~~C~~~~g~~~C~C~~Gy~ 1157 (1434)
..+..|.|++++|+|+|..|+.
T Consensus 283 --~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 283 --SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred --cCCCCcccCCccEEEEecccce
Confidence 1266788999999999999986
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.80 E-value=1.3e-05 Score=64.12 Aligned_cols=33 Identities=33% Similarity=0.690 Sum_probs=30.0
Q ss_pred CCCCCC--CCCCCCCCeEeecCCcceeEcCCCccc
Q psy13157 1208 PVNPCY--PSPCGLYSECRNVNGAPSCSCLINYIG 1240 (1434)
Q Consensus 1208 dineC~--~~~C~~~~~C~~~~gs~~C~C~~Gy~G 1240 (1434)
|||||+ .++|..+++|+|+.|||+|.|++||+.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 689997 467998999999999999999999984
No 16
>KOG1836|consensus
Probab=97.77 E-value=0.00074 Score=92.69 Aligned_cols=103 Identities=34% Similarity=0.778 Sum_probs=69.2
Q ss_pred ceeeCCCCCccCCCCCCCCCCC------CCCCC-CCCCCC---CceeecCCCCeeeCCCCCccCCCCccccCCCceeecC
Q psy13157 46 PICTCPQGYVGDAFSGCYPKPP------EHPCP-GSCGQN---ANCRVINHSPVCSCKPGFTGEPRIRCNKIPHGVCVCL 115 (1434)
Q Consensus 46 ~~C~C~~G~~g~~~~~C~~~~~------~~~C~-~~C~~~---g~C~n~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~C~ 115 (1434)
..|.|+.||+|..++.|.+..+ ..-++ -+|.-+ .+|... +..|.|++--.|. +|+ +|.
T Consensus 695 e~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~--tG~C~C~~~t~G~---~C~-------~C~ 762 (1705)
T KOG1836|consen 695 EQCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPR--TGQCKCKHNTFGG---QCA-------QCV 762 (1705)
T ss_pred hhccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCC--CCceecccCCCCC---chh-------hhc
Confidence 3499999999999988877321 11122 134333 345543 4599999988888 566 488
Q ss_pred CCccCCCcccCCCccccCCCCCCccccccCCccCCCCCCCCCCCCEEeec--CCceeee-CCCCCccCC
Q psy13157 116 PDYYGDGYVSCRPECVLNSDCPSNKACIRNKCKNPCVPGTCGEGAICNVE--NHAVMCT-CPPGTTGSP 181 (1434)
Q Consensus 116 ~Gy~g~~~~~c~~eC~~~~~C~~~~~C~~~~C~n~C~~~~C~~~g~C~~~--~g~~~C~-C~~Gy~G~~ 181 (1434)
.||+|+....- .+.|.+-+|.+++.|..+ .....|. |++||+|..
T Consensus 763 ~GfYg~~~~~~---------------------~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 763 DGFYGLPDLGT---------------------SGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred CCCCCccccCC---------------------CCCCccCCCCCChhhcCcCcccceecCCCCCCCcccc
Confidence 99999854211 111345567777777764 4678898 999999998
No 17
>KOG1836|consensus
Probab=97.65 E-value=0.001 Score=91.42 Aligned_cols=63 Identities=35% Similarity=0.757 Sum_probs=42.8
Q ss_pred CEEeccCCceEeeCCCCCcCCCCccccccccCCCCCCCCCCCCCCCCc----eeeccCCCeeeeCCCCCcCC
Q psy13157 477 AICDVVNHAVSCTCPPGTTGSPFVQCKTIQYEPVYTNPCQPSPCGPNS----QCREVNHQAVCSCLPNYFGS 544 (1434)
Q Consensus 477 ~~C~~~~g~y~C~C~~G~~G~~~~~C~~~~~~~~~~d~C~~~~C~~~g----~C~~~~g~y~C~C~~Gy~G~ 544 (1434)
..|+...| +|.|.+|.+|.+..+|+..++.- -+..|..-.|.+.| .|....| +|.|.++|.|.
T Consensus 952 ~~c~~~tG--qc~c~~gVtgqrc~qc~~~~~~~-~~~gc~~c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~ 1018 (1705)
T KOG1836|consen 952 SDCDVGTG--QCYCRPGVTGQRCDQCETYHFGF-QTEGCGLCECDPLGSRGFQCDPEDG--QCPCRPGFEGR 1018 (1705)
T ss_pred ccccccCC--ceeeecCccccccCccccCcccc-cccCCcceecccCCcccceecccCC--eeeecCCCCCc
Confidence 36766666 99999999999866666533221 12445544555555 5776555 89999999997
No 18
>KOG1226|consensus
Probab=97.57 E-value=0.00017 Score=88.86 Aligned_cols=131 Identities=24% Similarity=0.519 Sum_probs=92.0
Q ss_pred CCCCCCeecCceEecCCCccCCCCccCC---------ccCCCC---CCCCCCCccccCccCCCCCCC----CcCcccC--
Q psy13157 1277 NCVPNAECRDGVCVCLPDYYGDGYVSCR---------PECVLN---NDCPRNKACIKYKCKNPCVSA----VQPVIQE-- 1338 (1434)
Q Consensus 1277 ~C~~~~~C~~~~C~C~~G~~G~~c~~c~---------~~C~~~---~~C~~~~~C~~~~C~~~C~~g----~~~~~~~-- 1338 (1434)
.|+-+|+.+-++|.|.+||.|+.|+-.. +.|+.. ..|...|.|.=.+|. |-.. ++|..|+
T Consensus 468 ~C~g~G~~~CG~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~--C~~~~~~~i~G~fCECD 545 (783)
T KOG1226|consen 468 LCHGNGTFVCGQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCV--CHKPDNGKIYGKFCECD 545 (783)
T ss_pred ccCCCCcEEecceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceE--ecCCCCCceeeeeeecc
Confidence 5777788888999999999999996221 345542 268888887767776 5322 3465554
Q ss_pred -CCc------CCCCCCcccCceecCCCCccCCCCc--cccccccccC--CCCCCCccccCcccCCCCCCeeeCCCC-Ccc
Q psy13157 1339 -DTC------NCVPNAECRDGVCVCLPEYYGDGYV--SCRPECVLNN--DCPRNKACIKYKCKNPCVHPICSCPQG-YIG 1406 (1434)
Q Consensus 1339 -~~c------~C~~~~~C~~~~C~C~~Gy~g~~~~--~c~~eC~~~~--~C~~~~~C~~~~C~n~~gs~~C~C~~G-y~g 1406 (1434)
..| .|..+++|.-++|+|.+||+|..|. ...+.|+..+ .|...|+|.- .+|.|... |+|
T Consensus 546 nfsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~C---------g~C~C~~~~~sG 616 (783)
T KOG1226|consen 546 NFSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCEC---------GRCKCTDPPYSG 616 (783)
T ss_pred CcccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeC---------CceEcCCCCcCc
Confidence 334 4888899988999999999999987 2356776532 5555555532 45777766 999
Q ss_pred CCCCCcccCCCC
Q psy13157 1407 DGFNGCYPKPPE 1418 (1434)
Q Consensus 1407 ~~~~~c~~~~~~ 1418 (1434)
..+..|+.-+..
T Consensus 617 ~~CE~cptc~~~ 628 (783)
T KOG1226|consen 617 EFCEKCPTCPDP 628 (783)
T ss_pred chhhcCCCCCCc
Confidence 998888754443
No 19
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.56 E-value=3.2e-05 Score=59.18 Aligned_cols=32 Identities=28% Similarity=0.649 Sum_probs=26.5
Q ss_pred CCCCCCCCeEeecCCcceeEcCCCcccCCCCC
Q psy13157 1214 PSPCGLYSECRNVNGAPSCSCLINYIGSPPNC 1245 (1434)
Q Consensus 1214 ~~~C~~~~~C~~~~gs~~C~C~~Gy~G~~~~C 1245 (1434)
+..|+.+|+|+++.++|+|+|++||+|+|..|
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCcCC
Confidence 46799999999999999999999999998765
No 20
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.50 E-value=6.2e-05 Score=60.25 Aligned_cols=34 Identities=26% Similarity=0.590 Sum_probs=30.0
Q ss_pred CCCCCCC--CCCCCCCceeccCCceEEecCCCCcCC
Q psy13157 1069 YTNPCQP--SPCGPNSQCREVNKQAVCSCLPNYFGS 1102 (1434)
Q Consensus 1069 ~~~eC~~--~~C~~~~~C~~~~g~~~C~C~~G~~g~ 1102 (1434)
|||||+. ++|..+++|+|+.|+|+|.|++||+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 5899974 579989999999999999999999943
No 21
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.43 E-value=3.4e-05 Score=79.64 Aligned_cols=144 Identities=28% Similarity=0.649 Sum_probs=90.5
Q ss_pred CCCCCceeeecCCCceeeCCCCCccCCCCcccCCCCCCCCCCCCCCCCCCCCC-----CCCCCCCeeEecC-----CCce
Q psy13157 576 SCGQNANCRVINHSPVCSCKPGFTGEPRIRCNKIPPRPPPQEDVPEPVNPCYP-----SPCGPYSQCRDIG-----GSPS 645 (1434)
Q Consensus 576 ~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~c~~dideC~~-----~~C~~~~~C~n~~-----gsy~ 645 (1434)
.|. +|.-+...+.|+|.|.+||...+...|+ ...+|.. .+|...|+|++.. ..|+
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~EntCE--------------~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~ 71 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLKNENTCE--------------EKVECDKLENVNKPCGDYAKCINQANKGEERAYK 71 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEEETTEEE--------------E----SG-GGTTSEEETTEEEEE-SSTTSSTSEE
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEccccccc--------------cceecCcccccCccccchhhhhcCCCcccceeEE
Confidence 344 5677788889999999999876655686 3445653 4799999999865 5699
Q ss_pred eeCCCCCcCCCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCeeec---cCCCceeeCCCCcccC-CC
Q psy13157 646 CSCLPNYIGSPPNCRPECVMNSECPSHEASRPPPQEDVPEPVNPCYPSPCGPYSQCRD---IGGSPSCSCLPNYIGS-PP 721 (1434)
Q Consensus 646 C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~~~~~~~~~~~~~ideC~~~~C~~~~~C~n---~~gsy~C~C~~Gy~G~-~~ 721 (1434)
|.|.+||+.....|. .++|....|+ .|.|+- .+....|+|.-|++.+ ..
T Consensus 72 C~C~~gY~~~~~vCv--------------------------p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~ 124 (197)
T PF06247_consen 72 CDCINGYILKQGVCV--------------------------PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNK 124 (197)
T ss_dssp EEE-TTEEESSSSEE--------------------------EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTT
T ss_pred EecccCceeeCCeEc--------------------------hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCC
Confidence 999999998855553 2345555788 799972 3345599999999922 11
Q ss_pred CCccCCccCCCCCCcchhcccccCCCCCCCCCCCCeeeecCCcceeeCCCCCccCCC
Q psy13157 722 NCRPECVMNSECPSHEACINEKCQDPCPGSCGYNAECKVINHTPICTCPQGFIGDAF 778 (1434)
Q Consensus 722 ~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~y~C~C~~Gy~G~~c 778 (1434)
.|... -..+| .-.|..+..|..+.+-|+|.+..||.++.-
T Consensus 125 kCtk~--G~T~C---------------~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 125 KCTKT--GETKC---------------SLKCKENEECKLVDGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp ESEEE--E-----------------------TTTEEEEEETTEEEEEE-TT-EEETT
T ss_pred cccCC--Cccce---------------eeecCCCcceeeeCcEEEeecCCCCCCCCC
Confidence 22211 01122 235677899999999999999999987763
No 22
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.42 E-value=8.8e-05 Score=55.44 Aligned_cols=30 Identities=30% Similarity=0.806 Sum_probs=27.5
Q ss_pred CCCCCCCCCCeEeecC-CcceeEcCCCcccC
Q psy13157 1212 CYPSPCGLYSECRNVN-GAPSCSCLINYIGS 1241 (1434)
Q Consensus 1212 C~~~~C~~~~~C~~~~-gs~~C~C~~Gy~G~ 1241 (1434)
|.++||.++|+|++.. ++|+|+|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4567999999999999 99999999999996
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.41 E-value=0.00016 Score=56.70 Aligned_cols=36 Identities=33% Similarity=0.826 Sum_probs=32.0
Q ss_pred CCCCCCC-CCCCCCCeEeecCCcceeEcCCCcc-cCCCCC
Q psy13157 1208 PVNPCYP-SPCGLYSECRNVNGAPSCSCLINYI-GSPPNC 1245 (1434)
Q Consensus 1208 dineC~~-~~C~~~~~C~~~~gs~~C~C~~Gy~-G~~~~C 1245 (1434)
++|||.. .+|.++++|+++.|+|+|.|++||+ |. .|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~--~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR--NC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC--cC
Confidence 4789987 8999999999999999999999999 65 55
No 24
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.41 E-value=9.7e-05 Score=55.23 Aligned_cols=31 Identities=35% Similarity=0.859 Sum_probs=27.6
Q ss_pred CCCCCCCCCCEEeecC-CceeeeCCCCCccCC
Q psy13157 151 CVPGTCGEGAICNVEN-HAVMCTCPPGTTGSP 181 (1434)
Q Consensus 151 C~~~~C~~~g~C~~~~-g~~~C~C~~Gy~G~~ 181 (1434)
|.++||+|+|+|++.. ++|+|+|++||+|+.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 3467999999999998 999999999999973
No 25
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.29 E-value=0.00011 Score=75.95 Aligned_cols=153 Identities=24% Similarity=0.560 Sum_probs=88.6
Q ss_pred CCCCCCcccccCCcceeecCCCCcCCCCCCCCCCccCCCCCCCccccCCcccCCCCCCCCCCCeeeecCCCCcccCCCCC
Q psy13157 951 PCGPNSQCREVNKQSVCSCLPNYFGSPPACRPECTVNSDCPLDKACVNQKCVDPCPGSCGQNANCRVINHSPVCSCKPGF 1030 (1434)
Q Consensus 951 ~C~~~g~C~n~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~a~C~~~~g~~~C~C~~Gy 1030 (1434)
.|.| |.-+...+.|.|.|.+||..... ..|+...+|..... + ..+|+..|+|++....
T Consensus 7 ~CKN-G~LiQMSNHfEC~Cnegfvl~~E---ntCE~kv~C~~~e~------~---~K~Cgdya~C~~~~~~--------- 64 (197)
T PF06247_consen 7 ICKN-GYLIQMSNHFECKCNEGFVLKNE---NTCEEKVECDKLEN------V---NKPCGDYAKCINQANK--------- 64 (197)
T ss_dssp --BT-EEEEEESSEEEEEESTTEEEEET---TEEEE----SG-GG------T---TSEEETTEEEEE-SST---------
T ss_pred cccC-CEEEEccCceEEEcCCCcEEccc---cccccceecCcccc------c---CccccchhhhhcCCCc---------
Confidence 3554 67777778999999999987632 36777766654210 1 2478889999987642
Q ss_pred cCCCCceecCCCceeeeCCCCCccCCCcccccCCCCCCCCCCCCCCCCCCCCceec---cCCceEEecCCCCc-CCCCCC
Q psy13157 1031 TGEPRIRCNRIHAVMCTCPPGTTGSPFVQCKPIQNEPVYTNPCQPSPCGPNSQCRE---VNKQAVCSCLPNYF-GSPPAC 1106 (1434)
Q Consensus 1031 ~g~~~~~C~~~~~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~eC~~~~C~~~~~C~~---~~g~~~C~C~~G~~-g~~~~c 1106 (1434)
.....|.|.|.+||..... .|.+ ++|..-.|+ .|.|+- .....+|+|.-|+. .+..
T Consensus 65 --------~~~~~~~C~C~~gY~~~~~-vCvp--------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~-- 124 (197)
T PF06247_consen 65 --------GEERAYKCDCINGYILKQG-VCVP--------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNK-- 124 (197)
T ss_dssp --------TSSTSEEEEE-TTEEESSS-SEEE--------GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTT--
T ss_pred --------ccceeEEEecccCceeeCC-eEch--------hhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCC--
Confidence 1223567777777765432 3542 466666787 789982 33455999999999 2322
Q ss_pred cCcccccCCCCCCcccCCCcccCCCCCCCCCCCeeeecCCCceeeCCCCCccCC
Q psy13157 1107 RPECTVNSDCPLNKACQNQKCVDPCPGTCGQNANCKVINHSPICTCKPGYTGDA 1160 (1434)
Q Consensus 1107 ~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~ 1160 (1434)
.|...-+ -+|+-.|..+..|..+++-|+|.+.+||.++.
T Consensus 125 --kCtk~G~-------------T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 125 --KCTKTGE-------------TKCSLKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp --ESEEEE---------------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred --cccCCCc-------------cceeeecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 2322110 12333577799999999999999999999875
No 26
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.13 E-value=0.00051 Score=53.78 Aligned_cols=34 Identities=32% Similarity=0.824 Sum_probs=30.4
Q ss_pred CCCCCCC-CCCCCCCeeeccCCCceeeCCCCcc-cC
Q psy13157 686 PVNPCYP-SPCGPYSQCRDIGGSPSCSCLPNYI-GS 719 (1434)
Q Consensus 686 ~ideC~~-~~C~~~~~C~n~~gsy~C~C~~Gy~-G~ 719 (1434)
++|+|.. .+|.++++|+++.++|+|.|++||+ |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 3678887 7999999999999999999999999 65
No 27
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.91 E-value=0.001 Score=51.46 Aligned_cols=34 Identities=32% Similarity=0.744 Sum_probs=30.7
Q ss_pred CCCCCCC-CCCCCCCeEeecCCcceeEcCCCcccC
Q psy13157 1208 PVNPCYP-SPCGLYSECRNVNGAPSCSCLINYIGS 1241 (1434)
Q Consensus 1208 dineC~~-~~C~~~~~C~~~~gs~~C~C~~Gy~G~ 1241 (1434)
++++|.. .+|.++++|+++.++|+|.|++||+|.
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 3688877 799989999999999999999999996
No 28
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.76 E-value=0.00076 Score=51.72 Aligned_cols=30 Identities=30% Similarity=0.718 Sum_probs=24.4
Q ss_pred CCCCCCCceeccCCceEEecCCCCcCCCCC
Q psy13157 1076 SPCGPNSQCREVNKQAVCSCLPNYFGSPPA 1105 (1434)
Q Consensus 1076 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~ 1105 (1434)
..|+.+|+|+++.++|+|+|++||+|++..
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 468999999999999999999999999753
No 29
>KOG1226|consensus
Probab=96.68 E-value=0.0072 Score=75.12 Aligned_cols=137 Identities=23% Similarity=0.445 Sum_probs=81.5
Q ss_pred CCCCCCCCeEeecCCcceeEcCCCcccCCCCCc--ccccccCcccccccccccCCccccccCCCCCCCCCCeecCceEec
Q psy13157 1214 PSPCGLYSECRNVNGAPSCSCLINYIGSPPNCR--PECIQNSLLLGQSLLRTHSAVQPVIQEDTCNCVPNAECRDGVCVC 1291 (1434)
Q Consensus 1214 ~~~C~~~~~C~~~~gs~~C~C~~Gy~G~~~~C~--~eC~~~~~~~g~~~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C 1291 (1434)
+..|+-+|+.+-. +|.|.+||.|+ .|+ ......... ...|+ ...+.-.|...|.|+=+.|+|
T Consensus 466 s~~C~g~G~~~CG----~C~C~~G~~G~--~CEC~~~~~ss~~~--~~~Cr--------~~~~~~vCSgrG~C~CGqC~C 529 (783)
T KOG1226|consen 466 SALCHGNGTFVCG----QCRCDEGWLGK--KCECSTDELSSSEE--EDKCR--------ENSDSPVCSGRGDCVCGQCVC 529 (783)
T ss_pred ccccCCCCcEEec----ceecCCCCCCC--cccCCccccCcHhH--Hhhcc--------CCCCCCCcCCCCcEeCCceEe
Confidence 5567656666543 79999999999 664 222111000 00111 111111567777777777777
Q ss_pred CCCcc----CCCCccCCccCCCC--CCCCCCCccccCccCCCCCCCCcCcccC-----CCc------CCCCCCcccCcee
Q psy13157 1292 LPDYY----GDGYVSCRPECVLN--NDCPRNKACIKYKCKNPCVSAVQPVIQE-----DTC------NCVPNAECRDGVC 1354 (1434)
Q Consensus 1292 ~~G~~----G~~c~~c~~~C~~~--~~C~~~~~C~~~~C~~~C~~g~~~~~~~-----~~c------~C~~~~~C~~~~C 1354 (1434)
.+... |..|+-+--.|... ..|..+++|.=.+|. |..||.|..|+ +.| .|...++|.=++|
T Consensus 530 ~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG~Cv--C~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg~C 607 (783)
T KOG1226|consen 530 HKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECGRCV--CNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECGRC 607 (783)
T ss_pred cCCCCCceeeeeeeccCcccccccCcccCCCCeEeCCcEE--cCCCCccCCCCCCCCCccccCCCCceeCCCceeeCCce
Confidence 77665 66665332334322 346666666666666 77777776554 334 3666666766889
Q ss_pred cCCCC-ccCCCCccc
Q psy13157 1355 VCLPE-YYGDGYVSC 1368 (1434)
Q Consensus 1355 ~C~~G-y~g~~~~~c 1368 (1434)
.|... |.|..|+.|
T Consensus 608 ~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 608 KCTDPPYSGEFCEKC 622 (783)
T ss_pred EcCCCCcCcchhhcC
Confidence 99887 999999744
No 30
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.47 E-value=0.0034 Score=48.55 Aligned_cols=33 Identities=36% Similarity=0.876 Sum_probs=29.7
Q ss_pred CCCCCC-CCCCCCCeeeccCCCceeeCCCCcccC
Q psy13157 687 VNPCYP-SPCGPYSQCRDIGGSPSCSCLPNYIGS 719 (1434)
Q Consensus 687 ideC~~-~~C~~~~~C~n~~gsy~C~C~~Gy~G~ 719 (1434)
+++|.. .+|.++++|++..++|+|.|++||.|.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 577776 789989999999999999999999986
No 31
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.92 E-value=0.0092 Score=45.32 Aligned_cols=28 Identities=32% Similarity=0.746 Sum_probs=26.1
Q ss_pred CCCCCCCCeEeecCCcceeEcCCCcccC
Q psy13157 1214 PSPCGLYSECRNVNGAPSCSCLINYIGS 1241 (1434)
Q Consensus 1214 ~~~C~~~~~C~~~~gs~~C~C~~Gy~G~ 1241 (1434)
..+|.++++|+++.++|+|.|+.||.|+
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5789889999999999999999999987
No 32
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.62 E-value=0.013 Score=44.58 Aligned_cols=29 Identities=38% Similarity=0.868 Sum_probs=25.5
Q ss_pred CCC-CCCCCCCeEeecCCcceeEcCCCcccC
Q psy13157 1212 CYP-SPCGLYSECRNVNGAPSCSCLINYIGS 1241 (1434)
Q Consensus 1212 C~~-~~C~~~~~C~~~~gs~~C~C~~Gy~G~ 1241 (1434)
|.. .+|.++ +|+++.++|+|.|++||.|+
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence 445 689888 99999999999999999994
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.54 E-value=0.017 Score=43.78 Aligned_cols=28 Identities=36% Similarity=0.901 Sum_probs=25.7
Q ss_pred CCCCCCCCEEeecCCceeeeCCCCCccC
Q psy13157 153 PGTCGEGAICNVENHAVMCTCPPGTTGS 180 (1434)
Q Consensus 153 ~~~C~~~g~C~~~~g~~~C~C~~Gy~G~ 180 (1434)
..+|.++++|+++.++|+|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4578889999999999999999999998
No 34
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.20 E-value=0.025 Score=43.06 Aligned_cols=27 Identities=41% Similarity=1.008 Sum_probs=24.3
Q ss_pred CCCCCCCEEeecCCceeeeCCCCCcc-CC
Q psy13157 154 GTCGEGAICNVENHAVMCTCPPGTTG-SP 181 (1434)
Q Consensus 154 ~~C~~~g~C~~~~g~~~C~C~~Gy~G-~~ 181 (1434)
.+|.++ +|+++.++|+|.|++||+| ..
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~ 33 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKR 33 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCc
Confidence 578888 9999999999999999999 54
No 35
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.08 E-value=0.018 Score=39.76 Aligned_cols=24 Identities=38% Similarity=0.519 Sum_probs=19.1
Q ss_pred CceeeCCCCcccCCCCCccCCccCCC
Q psy13157 707 SPSCSCLPNYIGSPPNCRPECVMNSE 732 (1434)
Q Consensus 707 sy~C~C~~Gy~G~~~~C~~~C~~~~e 732 (1434)
||+|.|++||+.. .-.+.|+||||
T Consensus 1 sy~C~C~~Gy~l~--~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLS--PDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCC--CCCCccccCCC
Confidence 6999999999976 34567788875
No 36
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.74 E-value=0.02 Score=39.51 Aligned_cols=24 Identities=33% Similarity=0.557 Sum_probs=18.7
Q ss_pred CCcccCCCCccCCCCCCCCCCccCCC
Q psy13157 325 SPSCSCLPNYIGAPPNCRPECVQNSE 350 (1434)
Q Consensus 325 sy~C~C~~Gy~g~~~~C~~~C~~~~e 350 (1434)
||+|.|++||+.. .-...|+||||
T Consensus 1 sy~C~C~~Gy~l~--~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLS--PDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCC--CCCCccccCCC
Confidence 6999999999976 33456778875
No 37
>KOG1218|consensus
Probab=94.09 E-value=4.2 Score=47.99 Aligned_cols=123 Identities=24% Similarity=0.619 Sum_probs=72.7
Q ss_pred CeEeecCCcceeEcCCCcccCCCCCcccccccCcccccccccccCCccccccCCCCCCCCCCeecCceEecCCCccCCCC
Q psy13157 1221 SECRNVNGAPSCSCLINYIGSPPNCRPECIQNSLLLGQSLLRTHSAVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGY 1300 (1434)
Q Consensus 1221 ~~C~~~~gs~~C~C~~Gy~G~~~~C~~eC~~~~~~~g~~~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c 1300 (1434)
.+|.+... .|.+..+|.+. .|..+- +.+. . ....+.+..+..+.+..|.|++||+|..+
T Consensus 117 ~~C~~~~~--~c~~~~~~~~~--~C~~~~-----~~g~---------~---C~~~c~~~~~~~~~~~~c~c~~g~~g~~~ 175 (316)
T KOG1218|consen 117 KTCANPRR--ECRCGGGYIGE--QCGEEN-----LVGL---------K---CQRDCQCTGGCDCKNGICTCQPGFVGVFC 175 (316)
T ss_pred cccCCCcc--ceecCCcCccc--cccccC-----CCCC---------C---ccCCCCCccccCCCCCceeccCCcccccc
Confidence 46665533 67888888776 665411 1111 0 01111223334445688999999999998
Q ss_pred ccCCccCCCCCCCCCCCccccCccCC--------CCCCCCcCcccCCCcCCCCCCcccC----ceecCCCCccCCC
Q psy13157 1301 VSCRPECVLNNDCPRNKACIKYKCKN--------PCVSAVQPVIQEDTCNCVPNAECRD----GVCVCLPEYYGDG 1364 (1434)
Q Consensus 1301 ~~c~~~C~~~~~C~~~~~C~~~~C~~--------~C~~g~~~~~~~~~c~C~~~~~C~~----~~C~C~~Gy~g~~ 1364 (1434)
+.-...|.....|.+++.|+...-.. .|..||++..+...+.|..+..+.+ +.+.+..++.+..
T Consensus 176 ~~~~~~c~~~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (316)
T KOG1218|consen 176 VESCSGCSPLTACENGAKCNRSTGSCLCYPGPSGACKGGFHGCACLRMCDCNEGYPCVNDCGPGICGCVLGEGETV 251 (316)
T ss_pred cccCCCcCCCcccCCCCeeeccccccccCCCCcccccCCccCCcCcccccccCCCcccCCcCCceeEeCccccccc
Confidence 74333377678888888888633222 2444566666666667776666653 4666666665443
No 38
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=92.79 E-value=0.099 Score=39.09 Aligned_cols=24 Identities=33% Similarity=0.735 Sum_probs=21.6
Q ss_pred CCCCCCeec--CceEecCCCccCCCC
Q psy13157 1277 NCVPNAECR--DGVCVCLPDYYGDGY 1300 (1434)
Q Consensus 1277 ~C~~~~~C~--~~~C~C~~G~~G~~c 1300 (1434)
.|.++++|+ .++|+|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 589999999 589999999999864
No 39
>KOG1218|consensus
Probab=92.13 E-value=2.4 Score=50.03 Aligned_cols=142 Identities=23% Similarity=0.539 Sum_probs=74.6
Q ss_pred CcceeEcCCCcccCCCCCcccccccCcccccccccccCCccccccCCCCCCCCCCeecCceEec-CCCccCCCCccCCcc
Q psy13157 1228 GAPSCSCLINYIGSPPNCRPECIQNSLLLGQSLLRTHSAVQPVIQEDTCNCVPNAECRDGVCVC-LPDYYGDGYVSCRPE 1306 (1434)
Q Consensus 1228 gs~~C~C~~Gy~G~~~~C~~eC~~~~~~~g~~~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C-~~G~~G~~c~~c~~~ 1306 (1434)
-+.+|.+..+|.|. .|+.++..+.. +. .....+.|..+.......-.| ..||.|..|... .+
T Consensus 47 ~~~~~~~~~~~~~~--~c~~~~~~~~~--~~------------~c~~~~~c~~~~~~~~~~~~~~~~~~~g~~C~~~-~~ 109 (316)
T KOG1218|consen 47 NSGECGLGYGFVGS--VCRIECVCGNA--GG------------GCSQPCRCKNGGTCVSSTGYCHLNGYEGPQCESP-CP 109 (316)
T ss_pred CceeEecccccCCC--ccccccccCCC--CC------------cccCccccCCCCcccCCCCcccCCCCCcccccCC-CC
Confidence 34578888888887 66655432210 00 011222355555555544444 688888877531 12
Q ss_pred CCCC---CCCCCCCc-cccCccC--CCCCC-CCcCcccCCCcCCCCCCcccCceecCCCCccCCCCccccccccccCCCC
Q psy13157 1307 CVLN---NDCPRNKA-CIKYKCK--NPCVS-AVQPVIQEDTCNCVPNAECRDGVCVCLPEYYGDGYVSCRPECVLNNDCP 1379 (1434)
Q Consensus 1307 C~~~---~~C~~~~~-C~~~~C~--~~C~~-g~~~~~~~~~c~C~~~~~C~~~~C~C~~Gy~g~~~~~c~~eC~~~~~C~ 1379 (1434)
|... ..|.+... |...... -.|.. ++++..+...|.+..+..+.++.|.|++||+|..+..-...|.....+.
T Consensus 110 ~~~~c~~~~C~~~~~~c~~~~~~~~~~C~~~~~~g~~C~~~c~~~~~~~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~ 189 (316)
T KOG1218|consen 110 CGDGCAEKTCANPRRECRCGGGYIGEQCGEENLVGLKCQRDCQCTGGCDCKNGICTCQPGFVGVFCVESCSGCSPLTACE 189 (316)
T ss_pred cCCcccccccCCCccceecCCcCccccccccCCCCCCccCCCCCccccCCCCCceeccCCcccccccccCCCcCCCcccC
Confidence 2111 23333332 2211000 01222 5666666666655555556668899999999998762222255545566
Q ss_pred CCCcccc
Q psy13157 1380 RNKACIK 1386 (1434)
Q Consensus 1380 ~~~~C~~ 1386 (1434)
+++.|..
T Consensus 190 ~g~~C~~ 196 (316)
T KOG1218|consen 190 NGAKCNR 196 (316)
T ss_pred CCCeeec
Confidence 5556654
No 40
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=91.69 E-value=0.094 Score=30.77 Aligned_cols=13 Identities=31% Similarity=0.894 Sum_probs=10.4
Q ss_pred eEecCCCccCCCC
Q psy13157 1288 VCVCLPDYYGDGY 1300 (1434)
Q Consensus 1288 ~C~C~~G~~G~~c 1300 (1434)
+|+|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999875
No 41
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=91.11 E-value=0.26 Score=36.86 Aligned_cols=26 Identities=27% Similarity=0.627 Sum_probs=22.4
Q ss_pred CCCCCCCEEeecCCceeeeCCCCCccCC
Q psy13157 154 GTCGEGAICNVENHAVMCTCPPGTTGSP 181 (1434)
Q Consensus 154 ~~C~~~g~C~~~~g~~~C~C~~Gy~G~~ 181 (1434)
..|+++|+|+.. ..+|.|.+||+|+.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPD 31 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCC
Confidence 368999999976 56899999999986
No 42
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.28 E-value=0.13 Score=39.39 Aligned_cols=34 Identities=26% Similarity=0.768 Sum_probs=24.0
Q ss_pred CCCCCCCCCCeEeecC-CcceeEcCCCcccCCCCC
Q psy13157 1212 CYPSPCGLYSECRNVN-GAPSCSCLINYIGSPPNC 1245 (1434)
Q Consensus 1212 C~~~~C~~~~~C~~~~-gs~~C~C~~Gy~G~~~~C 1245 (1434)
|...+|..||.|++.. |+++|.|.+||..++..|
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 3456788899999887 999999999999775444
No 43
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.20 E-value=0.17 Score=38.95 Aligned_cols=26 Identities=31% Similarity=0.654 Sum_probs=20.4
Q ss_pred CCCCCCceeecCCCCeeeCCCCCccCCC
Q psy13157 74 SCGQNANCRVINHSPVCSCKPGFTGEPR 101 (1434)
Q Consensus 74 ~C~~~g~C~n~~g~~~C~C~~G~~g~~~ 101 (1434)
.|.+ .|++++++|+|.|++||+...+
T Consensus 7 gC~h--~C~~~~g~~~C~C~~Gy~L~~D 32 (36)
T PF14670_consen 7 GCSH--ICVNTPGSYRCSCPPGYKLAED 32 (36)
T ss_dssp GSSS--EEEEETTSEEEE-STTEEE-TT
T ss_pred CcCC--CCccCCCceEeECCCCCEECcC
Confidence 4554 8999999999999999998854
No 44
>smart00051 DSL delta serrate ligand.
Probab=88.89 E-value=0.32 Score=42.60 Aligned_cols=22 Identities=23% Similarity=0.442 Sum_probs=16.0
Q ss_pred CCCCcccC-ceecCCCCccCCCC
Q psy13157 1344 VPNAECRD-GVCVCLPEYYGDGY 1365 (1434)
Q Consensus 1344 ~~~~~C~~-~~C~C~~Gy~g~~~ 1365 (1434)
..+.+|+. +.++|++||+|..|
T Consensus 41 ~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 41 FGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred cCCccCCcCCCEecCCCCcCCCC
Confidence 34555654 78999999998764
No 45
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=88.14 E-value=0.32 Score=37.45 Aligned_cols=26 Identities=35% Similarity=0.839 Sum_probs=20.9
Q ss_pred CCCCCCCEEeecCCceeeeCCCCCccCC
Q psy13157 154 GTCGEGAICNVENHAVMCTCPPGTTGSP 181 (1434)
Q Consensus 154 ~~C~~~g~C~~~~g~~~C~C~~Gy~G~~ 181 (1434)
+.|++ +|++++++|+|.|++||+...
T Consensus 6 GgC~h--~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 6 GGCSH--ICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp GGSSS--EEEEETTSEEEE-STTEEE-T
T ss_pred CCcCC--CCccCCCceEeECCCCCEECc
Confidence 45666 899999999999999999765
No 46
>smart00051 DSL delta serrate ligand.
Probab=86.59 E-value=0.72 Score=40.47 Aligned_cols=46 Identities=26% Similarity=0.538 Sum_probs=32.5
Q ss_pred ceeEcCCCcccCCCCCcccccccCcccccccccccCCccccccCCCCCCCCCCeecC-ceEecCCCccCCCC
Q psy13157 1230 PSCSCLINYIGSPPNCRPECIQNSLLLGQSLLRTHSAVQPVIQEDTCNCVPNAECRD-GVCVCLPDYYGDGY 1300 (1434)
Q Consensus 1230 ~~C~C~~Gy~G~~~~C~~eC~~~~~~~g~~~~~~~~~~~~~~~~~~c~C~~~~~C~~-~~C~C~~G~~G~~c 1300 (1434)
++=.|.++|.|. .|...|... + ....+.+|.. +.++|+|||+|..|
T Consensus 17 ~rv~C~~~~yG~--~C~~~C~~~---------------------~--d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGE--GCNKFCRPR---------------------D--DFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCC--ccCCEeCcC---------------------c--cccCCccCCcCCCEecCCCCcCCCC
Confidence 345789999998 777656321 0 2345677764 78999999999864
No 47
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.82 E-value=0.52 Score=36.18 Aligned_cols=32 Identities=25% Similarity=0.573 Sum_probs=22.6
Q ss_pred CCCCCCCCCCceeccC-CceEEecCCCCcCCCC
Q psy13157 1073 CQPSPCGPNSQCREVN-KQAVCSCLPNYFGSPP 1104 (1434)
Q Consensus 1073 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~~ 1104 (1434)
|....|..||.|++.. |++.|.|.+||..++.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence 3445788899999876 9999999999997743
No 48
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=80.00 E-value=1.5 Score=49.13 Aligned_cols=39 Identities=21% Similarity=0.369 Sum_probs=31.7
Q ss_pred CCCCCCCCCCCCC--CCCCCCCCeEeecCCcceeEcCCCcccC
Q psy13157 1201 PQDDVPEPVNPCY--PSPCGLYSECRNVNGAPSCSCLINYIGS 1241 (1434)
Q Consensus 1201 ~~~~~~~dineC~--~~~C~~~~~C~~~~gs~~C~C~~Gy~G~ 1241 (1434)
.+.+.|++++||. +++|. ..|.++.|+|.|.|++||+..
T Consensus 179 l~~~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 179 FQGKICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 3455678899996 45676 579999999999999999874
No 49
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=79.16 E-value=2.1 Score=35.60 Aligned_cols=31 Identities=45% Similarity=1.039 Sum_probs=23.7
Q ss_pred ceeecCCCCeeeCCCCCccCCCCccccCCCceeecCCCccCCC
Q psy13157 80 NCRVINHSPVCSCKPGFTGEPRIRCNKIPHGVCVCLPDYYGDG 122 (1434)
Q Consensus 80 ~C~n~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~C~~Gy~g~~ 122 (1434)
.|... +.+|.|++||+|. .|+ .|.+||++..
T Consensus 13 ~C~~~--~G~C~C~~~~~G~---~C~-------~C~~g~~~~~ 43 (50)
T cd00055 13 QCDPG--TGQCECKPNTTGR---RCD-------RCAPGYYGLP 43 (50)
T ss_pred cccCC--CCEEeCCCcCCCC---CCC-------CCCCCCccCC
Confidence 36544 3499999999999 555 4899999874
No 50
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=77.66 E-value=1.6 Score=36.14 Aligned_cols=32 Identities=44% Similarity=1.032 Sum_probs=24.1
Q ss_pred CceeecCCCCeeeCCCCCccCCCCccccCCCceeecCCCccCCC
Q psy13157 79 ANCRVINHSPVCSCKPGFTGEPRIRCNKIPHGVCVCLPDYYGDG 122 (1434)
Q Consensus 79 g~C~n~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~C~~Gy~g~~ 122 (1434)
.+|....| +|.|+++|+|. .|+ +|.+|||+..
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~---~C~-------~C~~g~~~~~ 42 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGP---RCD-------QCKPGYFGLP 42 (49)
T ss_dssp SSEEETCE--EESBSTTEEST---TS--------EE-TTEECST
T ss_pred CcccCCCC--EEeccccccCC---cCc-------CCCCcccccc
Confidence 47777544 99999999999 566 4889999874
No 51
>KOG3512|consensus
Probab=73.04 E-value=21 Score=42.91 Aligned_cols=161 Identities=24% Similarity=0.420 Sum_probs=87.6
Q ss_pred CceeecCCC-ceeeCCCCcccCCccccccCCCCCC-CCCCCCcCCCCCCCCCCC-------------------CCccc--
Q psy13157 264 ANCRVINHS-PICTCKPGFTGDALVYCNRIPPSRP-LESPPEYVNPCVPSPCGP-------------------YAQCR-- 320 (1434)
Q Consensus 264 ~~C~~~~g~-y~C~C~~Gf~G~~c~~C~~~~~~~~-~~~~~~dideC~~~~C~~-------------------~g~C~-- 320 (1434)
..|+-...+ ++|.|+.+-+|..|..|.+.---+| .+..-.++++|....|.. +++|.
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnC 364 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNC 364 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeec
Confidence 468776666 9999999999999876753211111 011113566666544432 34454
Q ss_pred --cCCCCCcccCCCCccCCCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCCCCCeeeecCCCccccCCCCCccCCCcc
Q psy13157 321 --DINGSPSCSCLPNYIGAPPNCRPECVQNSECPHDKACINEKCADPCLGSCGYGAVCTVINHSPICTCPEGFIGDAFSS 398 (1434)
Q Consensus 321 --n~~gsy~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~~C~~~~gsy~C~C~~G~~G~~c~~ 398 (1434)
|+.|.+-=.|++||.-++.. +=.+.+.|+.-.|.. .=..+.+|..+.| +|.|.+|.+|..|+.
T Consensus 365 rHnTaGrhChyCreGyyRd~s~---------pl~hrkaCk~CdChp----VGs~gktCNq~tG--qCpCkeGvtG~tCnr 429 (592)
T KOG3512|consen 365 RHNTAGRHCHYCREGYYRDGSK---------PLTHRKACKACDCHP----VGSAGKTCNQTTG--QCPCKEGVTGLTCNR 429 (592)
T ss_pred ccCCCCcccccccCccccCCCC---------CCchhhhhhhcCCcc----cccccccccccCC--cccCCCCCccccccc
Confidence 45555444699999866321 001112222111100 0012446665555 699999999999998
Q ss_pred cCCCCCC---CCCCCCCCCCC---CCCCCCeeecceeccCCCcccCC
Q psy13157 399 CYPKPPE---PIEPVIQEDTC---NCVPNAECRDGVCLCLPDYYGDG 439 (1434)
Q Consensus 399 C~~~~~~---~~~~c~~~~~c---~C~~~~~C~~~~C~C~~Gy~G~~ 439 (1434)
|.+..-+ .+-+|+.+..- .++++.+=.+..+.|++++.|..
T Consensus 430 Ca~gyqqsrs~vapcik~p~~~~~~~~s~ve~qd~~s~Ck~~~~~~r 476 (592)
T KOG3512|consen 430 CAPGYQQSRSPVAPCIKIPTDAPTLGSSGVEPQDQCSKCKASPGGKR 476 (592)
T ss_pred ccchhhcccCCCcCceecCCCCccccCCCCcchhccccCCCCCccee
Confidence 8664332 23344333221 13444442344577888887765
No 52
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=72.67 E-value=3.3 Score=34.26 Aligned_cols=32 Identities=44% Similarity=0.931 Sum_probs=24.0
Q ss_pred CeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCCCccccCCCcccCC
Q psy13157 1139 ANCKVINHSPICTCKPGYTGDALSYCNRIPPPPPPQEPICTCKPGYTGDA 1188 (1434)
Q Consensus 1139 ~~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~~~C~C~~Gy~g~~ 1188 (1434)
++|... +.+|.|+++|+|..++ .|.+||++..
T Consensus 11 ~~C~~~--~G~C~C~~~~~G~~C~----------------~C~~g~~~~~ 42 (49)
T PF00053_consen 11 QTCDPS--TGQCVCKPGTTGPRCD----------------QCKPGYFGLP 42 (49)
T ss_dssp SSEEET--CEEESBSTTEESTTS-----------------EE-TTEECST
T ss_pred CcccCC--CCEEeccccccCCcCc----------------CCCCcccccc
Confidence 467664 4589999999999887 6788998864
No 53
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=72.44 E-value=4 Score=34.01 Aligned_cols=26 Identities=42% Similarity=0.913 Sum_probs=21.0
Q ss_pred CceeeCCCCCccCCCCCcccCCCCCCCCCCccccCCCcccCC
Q psy13157 1147 SPICTCKPGYTGDALSYCNRIPPPPPPQEPICTCKPGYTGDA 1188 (1434)
Q Consensus 1147 ~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~~~C~C~~Gy~g~~ 1188 (1434)
+.+|.|++||+|..++ .|++||+|..
T Consensus 18 ~G~C~C~~~~~G~~C~----------------~C~~g~~~~~ 43 (50)
T cd00055 18 TGQCECKPNTTGRRCD----------------RCAPGYYGLP 43 (50)
T ss_pred CCEEeCCCcCCCCCCC----------------CCCCCCccCC
Confidence 4579999999999876 5788888753
No 54
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=71.84 E-value=3.6 Score=33.66 Aligned_cols=29 Identities=48% Similarity=1.206 Sum_probs=22.4
Q ss_pred ceeecCCCCeeeCCCCCccCCCCccccCCCceeecCCCccC
Q psy13157 80 NCRVINHSPVCSCKPGFTGEPRIRCNKIPHGVCVCLPDYYG 120 (1434)
Q Consensus 80 ~C~n~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~C~~Gy~g 120 (1434)
.|... +.+|.|+++|+|. .|+ .|++||+|
T Consensus 12 ~C~~~--~G~C~C~~~~~G~---~C~-------~C~~g~~g 40 (46)
T smart00180 12 TCDPD--TGQCECKPNVTGR---RCD-------RCAPGYYG 40 (46)
T ss_pred cccCC--CCEEECCCCCCCC---CCC-------cCCCCcCC
Confidence 45444 3499999999998 565 48999998
No 55
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=71.28 E-value=3.6 Score=34.50 Aligned_cols=21 Identities=33% Similarity=0.990 Sum_probs=18.5
Q ss_pred CCCCCCeecCceEecCCCccC
Q psy13157 1277 NCVPNAECRDGVCVCLPDYYG 1297 (1434)
Q Consensus 1277 ~C~~~~~C~~~~C~C~~G~~G 1297 (1434)
.|..++.|++++|+|++||+-
T Consensus 27 qC~~~s~C~~g~C~C~~g~~~ 47 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYVE 47 (52)
T ss_pred CCCCcCEEcCCEeECCCCCEe
Confidence 577899999999999999873
No 56
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=69.17 E-value=4 Score=45.75 Aligned_cols=35 Identities=23% Similarity=0.413 Sum_probs=28.3
Q ss_pred CCCCCCCCCCC--CCCCCCeeeccCCCceeeCCCCcccC
Q psy13157 683 VPEPVNPCYPS--PCGPYSQCRDIGGSPSCSCLPNYIGS 719 (1434)
Q Consensus 683 ~~~~ideC~~~--~C~~~~~C~n~~gsy~C~C~~Gy~G~ 719 (1434)
.|.++++|... +|. ..|.++.|+|.|.|++||+..
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 34678888643 565 589999999999999999876
No 57
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=63.31 E-value=6.3 Score=32.23 Aligned_cols=24 Identities=46% Similarity=1.052 Sum_probs=20.0
Q ss_pred CceeeCCCCCccCCCCCcccCCCCCCCCCCccccCCCccc
Q psy13157 1147 SPICTCKPGYTGDALSYCNRIPPPPPPQEPICTCKPGYTG 1186 (1434)
Q Consensus 1147 ~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~~~C~C~~Gy~g 1186 (1434)
+..|.|+++|+|..++ .|++||+|
T Consensus 17 ~G~C~C~~~~~G~~C~----------------~C~~g~~g 40 (46)
T smart00180 17 TGQCECKPNVTGRRCD----------------RCAPGYYG 40 (46)
T ss_pred CCEEECCCCCCCCCCC----------------cCCCCcCC
Confidence 3479999999998876 57889988
No 58
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=61.37 E-value=11 Score=31.61 Aligned_cols=22 Identities=32% Similarity=0.887 Sum_probs=17.8
Q ss_pred CCCCCCcccCceecCCCCccCC
Q psy13157 1342 NCVPNAECRDGVCVCLPEYYGD 1363 (1434)
Q Consensus 1342 ~C~~~~~C~~~~C~C~~Gy~g~ 1363 (1434)
.|..++.|.+++|.|++||+-.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~~~ 48 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYVEV 48 (52)
T ss_pred CCCCcCEEcCCEeECCCCCEec
Confidence 4557889999999999999643
No 59
>KOG3516|consensus
Probab=46.28 E-value=14 Score=49.31 Aligned_cols=36 Identities=28% Similarity=0.855 Sum_probs=33.5
Q ss_pred cCCCCCCCCCCCCEEeecCCceeeeCC-CCCccCCCCccc
Q psy13157 148 KNPCVPGTCGEGAICNVENHAVMCTCP-PGTTGSPFIQCK 186 (1434)
Q Consensus 148 ~n~C~~~~C~~~g~C~~~~g~~~C~C~-~Gy~G~~~~~C~ 186 (1434)
++.|++++|+++|.|.-+...|.|.|. .||.|.. |.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Gat---CH 581 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGAT---CH 581 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccccc---cc
Confidence 688999999999999999999999999 9999998 76
No 60
>KOG3512|consensus
Probab=42.36 E-value=69 Score=38.75 Aligned_cols=53 Identities=21% Similarity=0.431 Sum_probs=32.0
Q ss_pred ceeeecCC-CceeeCCCCCccCCCCcccCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy13157 581 ANCRVINH-SPVCSCKPGFTGEPRIRCNKIPPRPPPQEDVPEPVNPCYPSPCGP 633 (1434)
Q Consensus 581 ~~C~~~~g-~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~c~~dideC~~~~C~~ 633 (1434)
..|+-... .++|.|..+-+|....+|..-=...++++.-..++++|....|..
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~ 338 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNG 338 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccch
Confidence 45765554 499999999999884445322222334444445677776655544
No 61
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=37.33 E-value=30 Score=34.36 Aligned_cols=29 Identities=24% Similarity=0.574 Sum_probs=22.7
Q ss_pred CCCCCCCeeeec--CCcceeeCCCCCccCCCC
Q psy13157 750 GSCGYNAECKVI--NHTPICTCPQGFIGDAFS 779 (1434)
Q Consensus 750 ~~C~~~~~C~~~--~g~y~C~C~~Gy~G~~c~ 779 (1434)
+-|-+ |+|.-. ...+.|.|+.||+|.+|+
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCccccccc
Confidence 34654 488754 467999999999999997
No 62
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=35.75 E-value=13 Score=32.77 Aligned_cols=49 Identities=24% Similarity=0.405 Sum_probs=20.6
Q ss_pred CCceeeCCCCcccCCCCCccCCccCCCCCCcchhcccccCCCCCCCCCCCCeeeecCCcceeeCCCCCccCCC
Q psy13157 706 GSPSCSCLPNYIGSPPNCRPECVMNSECPSHEACINEKCQDPCPGSCGYNAECKVINHTPICTCPQGFIGDAF 778 (1434)
Q Consensus 706 gsy~C~C~~Gy~G~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~y~C~C~~Gy~G~~c 778 (1434)
-+++-.|.+.|.|. .|...|...+.= ..+-+|.. .| .=.|.+||+|..|
T Consensus 15 ~~~rv~C~~nyyG~--~C~~~C~~~~d~-------------------~ghy~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 15 YRIRVVCDENYYGP--NCSKFCKPRDDS-------------------FGHYTCDS-NG--NKVCLPGWTGPNC 63 (63)
T ss_dssp --------TTEETT--TT-EE---EEET-------------------TEEEEE-S-S----EEE-TTEESTTS
T ss_pred EEEEEECCCCCCCc--cccCCcCCCcCC-------------------cCCcccCC-CC--CCCCCCCCcCCCC
Confidence 35667899999999 787766543210 01233442 33 2368899999865
No 63
>KOG3516|consensus
Probab=33.01 E-value=34 Score=45.88 Aligned_cols=35 Identities=26% Similarity=0.717 Sum_probs=32.2
Q ss_pred CCCCCCCCCCCCCCCceeccCCceeeccC-CCCcCC
Q psy13157 193 VYTNPCQPSPCGPNSQCREINSQAVCSCL-PNYFGS 227 (1434)
Q Consensus 193 ~~~~~C~~~~C~~~g~C~~~~g~y~C~C~-~Gy~g~ 227 (1434)
..++.|.+++|+++|.|......|.|.|. .||+|.
T Consensus 543 ~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 543 GISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred ccccccCCccccCCCcccccccceeEeccccccccc
Confidence 35788999999999999998889999998 999998
No 64
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=31.44 E-value=41 Score=33.02 Aligned_cols=26 Identities=46% Similarity=1.131 Sum_probs=21.0
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCccC
Q psy13157 1133 GTCGQNANCKVINHSPICTCKPGYTGD 1159 (1434)
Q Consensus 1133 ~~C~~~~~C~~~~g~~~C~C~~Gy~g~ 1159 (1434)
+.|+.+|.|.. ..+..|.|++||+-.
T Consensus 84 ~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeCC-CCCCceECCCCcCCC
Confidence 68999999954 456789999999853
No 65
>PHA02887 EGF-like protein; Provisional
Probab=31.30 E-value=41 Score=32.97 Aligned_cols=28 Identities=32% Similarity=0.687 Sum_probs=21.8
Q ss_pred CCCCCCeeeec--CCcceeeCCCCCccCCCC
Q psy13157 751 SCGYNAECKVI--NHTPICTCPQGFIGDAFS 779 (1434)
Q Consensus 751 ~C~~~~~C~~~--~g~y~C~C~~Gy~G~~c~ 779 (1434)
-|- +|+|.-. ...+.|.|+.||+|.+|+
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 455 5688744 456899999999999996
No 66
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=31.29 E-value=41 Score=33.01 Aligned_cols=33 Identities=33% Similarity=0.766 Sum_probs=25.8
Q ss_pred CCCCCC-CCCCCCCCeEeecCCcceeEcCCCcccC
Q psy13157 1208 PVNPCY-PSPCGLYSECRNVNGAPSCSCLINYIGS 1241 (1434)
Q Consensus 1208 dineC~-~~~C~~~~~C~~~~gs~~C~C~~Gy~G~ 1241 (1434)
..++|. -..|+.+|.|.. ..+..|.|.+||+-.
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 456886 588999999964 456789999999743
No 67
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=29.36 E-value=50 Score=32.94 Aligned_cols=36 Identities=22% Similarity=0.602 Sum_probs=27.7
Q ss_pred CCCCCC---CCCCCCCCeEeecC--CcceeEcCCCcccCCCCCc
Q psy13157 1208 PVNPCY---PSPCGLYSECRNVN--GAPSCSCLINYIGSPPNCR 1246 (1434)
Q Consensus 1208 dineC~---~~~C~~~~~C~~~~--gs~~C~C~~Gy~G~~~~C~ 1246 (1434)
++-+|. .+=|-| |+|.-.+ ..+.|.|..||+|. +|+
T Consensus 41 ~i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe--RCE 81 (139)
T PHA03099 41 AIRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI--RCQ 81 (139)
T ss_pred ccccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc--ccc
Confidence 455663 455776 4899664 78999999999999 887
No 68
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=28.21 E-value=28 Score=30.66 Aligned_cols=47 Identities=30% Similarity=0.555 Sum_probs=21.3
Q ss_pred cceeEcCCCcccCCCCCcccccccCcccccccccccCCccccccCCCCCCCCCCeecC-ceEecCCCccCCCC
Q psy13157 1229 APSCSCLINYIGSPPNCRPECIQNSLLLGQSLLRTHSAVQPVIQEDTCNCVPNAECRD-GVCVCLPDYYGDGY 1300 (1434)
Q Consensus 1229 s~~C~C~~Gy~G~~~~C~~eC~~~~~~~g~~~~~~~~~~~~~~~~~~c~C~~~~~C~~-~~C~C~~G~~G~~c 1300 (1434)
+++=.|...|.|. .|..-|...... ..+-+|.. +.=+|.+||+|..|
T Consensus 16 ~~rv~C~~nyyG~--~C~~~C~~~~d~-----------------------~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGP--NCSKFCKPRDDS-----------------------FGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETT--TT-EE---EEET-----------------------TEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEEECCCCCCCc--cccCCcCCCcCC-----------------------cCCcccCCCCCCCCCCCCcCCCC
Confidence 4577899999999 888767432110 11334543 67889999999865
No 69
>KOG3514|consensus
Probab=28.00 E-value=35 Score=44.94 Aligned_cols=36 Identities=25% Similarity=0.601 Sum_probs=32.5
Q ss_pred CCCCCCCCCCCeEeecCCcceeEcC-CCcccCCCCCccc
Q psy13157 1211 PCYPSPCGLYSECRNVNGAPSCSCL-INYIGSPPNCRPE 1248 (1434)
Q Consensus 1211 eC~~~~C~~~~~C~~~~gs~~C~C~-~Gy~G~~~~C~~e 1248 (1434)
.|.++||.|+|+|....+.|.|.|. .||.|. .|+.|
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccce
Confidence 6889999999999999999999995 899998 89854
No 70
>KOG3514|consensus
Probab=24.74 E-value=45 Score=44.02 Aligned_cols=36 Identities=25% Similarity=0.635 Sum_probs=32.3
Q ss_pred CCCCCCCCCCCeeeccCCCceeeCC-CCcccCCCCCccC
Q psy13157 689 PCYPSPCGPYSQCRDIGGSPSCSCL-PNYIGSPPNCRPE 726 (1434)
Q Consensus 689 eC~~~~C~~~~~C~n~~gsy~C~C~-~Gy~G~~~~C~~~ 726 (1434)
.|.++||.|+|+|....++|.|.|. .||.|. .|+++
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccce
Confidence 6889999999999999999999997 589998 78764
No 71
>PHA02887 EGF-like protein; Provisional
Probab=23.34 E-value=67 Score=31.53 Aligned_cols=29 Identities=28% Similarity=0.497 Sum_probs=21.8
Q ss_pred CCCCCCEEec--cCCceEeeCCCCCcCCCCccccc
Q psy13157 472 TCGEGAICDV--VNHAVSCTCPPGTTGSPFVQCKT 504 (1434)
Q Consensus 472 ~C~~~~~C~~--~~g~y~C~C~~G~~G~~~~~C~~ 504 (1434)
-|. +|+|.- ....+.|.|++||+|.+ |+.
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~R---CE~ 123 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIR---CDE 123 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCC---CCc
Confidence 465 368843 34568999999999997 875
No 72
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=21.50 E-value=68 Score=24.48 Aligned_cols=16 Identities=44% Similarity=0.802 Sum_probs=13.0
Q ss_pred CCeeeCCCCCccCCCC
Q psy13157 1395 HPICSCPQGYIGDGFN 1410 (1434)
Q Consensus 1395 s~~C~C~~Gy~g~~~~ 1410 (1434)
.+.|.||+||+.+..+
T Consensus 17 ~~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 17 PGQCFCPEGYILDEGS 32 (34)
T ss_pred CCceeCCCceEecCCc
Confidence 4689999999987644
Done!