Query psy11799
Match_columns 483
No_of_seqs 326 out of 2758
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 19:17:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11799.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11799hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.7 3.5E-15 7.6E-20 152.0 17.7 190 192-396 693-910 (1289)
2 KOG1217|consensus 99.5 3.1E-13 6.7E-18 141.0 20.6 270 113-473 155-436 (487)
3 KOG1217|consensus 99.5 8.1E-13 1.8E-17 137.8 20.2 269 140-474 98-405 (487)
4 KOG1214|consensus 99.4 4.8E-13 1E-17 136.7 11.4 155 276-468 694-865 (1289)
5 KOG1219|consensus 99.3 9.5E-12 2E-16 137.6 8.6 109 315-468 3865-3978(4289)
6 KOG4289|consensus 99.1 1.5E-10 3.4E-15 124.1 10.5 88 289-391 1217-1308(2531)
7 KOG4289|consensus 99.1 7.3E-10 1.6E-14 119.1 14.1 87 206-301 1216-1308(2531)
8 KOG0994|consensus 99.0 1.4E-09 3.1E-14 115.0 12.5 73 334-421 1077-1160(1758)
9 KOG1219|consensus 99.0 5.1E-10 1.1E-14 124.5 8.4 106 275-396 3865-3975(4289)
10 KOG0994|consensus 98.7 3.8E-07 8.2E-12 97.2 15.1 41 25-72 757-797 (1758)
11 KOG1225|consensus 98.5 5.9E-07 1.3E-11 91.6 10.5 76 293-396 233-308 (525)
12 KOG4260|consensus 98.4 3.1E-07 6.7E-12 83.5 5.3 164 112-304 130-305 (350)
13 KOG4260|consensus 98.4 6.6E-07 1.4E-11 81.4 6.2 156 216-395 132-306 (350)
14 PF07645 EGF_CA: Calcium-bindi 98.4 2.8E-07 6.1E-12 61.3 2.8 42 422-466 1-42 (42)
15 KOG1225|consensus 98.3 1.3E-06 2.8E-11 89.1 7.6 102 31-170 234-341 (525)
16 PF07645 EGF_CA: Calcium-bindi 98.3 7.3E-07 1.6E-11 59.3 3.6 40 125-167 1-40 (42)
17 PF12662 cEGF: Complement Clr- 98.1 3.2E-06 7E-11 47.9 2.6 24 448-471 1-24 (24)
18 PF14670 FXa_inhibition: Coagu 97.7 3.2E-05 6.9E-10 49.0 2.8 33 197-229 4-36 (36)
19 KOG1226|consensus 97.7 0.00038 8.3E-09 72.9 12.0 146 286-473 467-626 (783)
20 PF12662 cEGF: Complement Clr- 97.7 3.8E-05 8.1E-10 43.6 2.3 24 211-234 1-24 (24)
21 PF12947 EGF_3: EGF domain; I 97.6 3.9E-05 8.5E-10 48.7 2.2 29 139-169 8-36 (36)
22 smart00179 EGF_CA Calcium-bind 97.6 9.8E-05 2.1E-09 47.9 4.1 38 423-467 2-39 (39)
23 PF00008 EGF: EGF-like domain 97.6 4E-05 8.7E-10 47.4 1.9 26 435-460 5-31 (32)
24 PF12947 EGF_3: EGF domain; I 97.5 6.2E-05 1.4E-09 47.8 2.0 30 435-466 7-36 (36)
25 KOG1836|consensus 97.5 0.0023 5.1E-08 74.4 15.6 57 382-459 957-1017(1705)
26 PF06247 Plasmod_Pvs28: Plasmo 97.4 0.00019 4.2E-09 62.4 4.5 138 290-464 16-166 (197)
27 PF00008 EGF: EGF-like domain 97.2 0.0002 4.2E-09 44.3 2.0 30 317-354 1-31 (32)
28 cd00054 EGF_CA Calcium-binding 97.1 0.00086 1.9E-08 42.9 4.0 28 435-466 10-37 (38)
29 PF06247 Plasmod_Pvs28: Plasmo 97.0 0.0012 2.6E-08 57.6 5.5 138 203-357 11-165 (197)
30 PF14670 FXa_inhibition: Coagu 97.0 0.00068 1.5E-08 43.0 2.8 28 142-169 9-36 (36)
31 smart00179 EGF_CA Calcium-bind 97.0 0.0012 2.5E-08 42.7 4.0 31 314-352 2-33 (39)
32 KOG1226|consensus 96.9 0.0024 5.2E-08 67.1 7.7 94 57-164 478-578 (783)
33 cd00053 EGF Epidermal growth f 96.3 0.0066 1.4E-07 38.0 3.9 26 435-460 7-32 (36)
34 smart00181 EGF Epidermal growt 96.2 0.0073 1.6E-07 38.0 3.8 24 435-459 7-30 (35)
35 cd00054 EGF_CA Calcium-binding 96.2 0.0075 1.6E-07 38.4 3.9 32 314-353 2-34 (38)
36 PF12661 hEGF: Human growth fa 95.9 0.0017 3.7E-08 31.2 -0.2 13 32-44 1-13 (13)
37 KOG1836|consensus 95.1 0.24 5.2E-06 58.3 12.9 67 192-263 738-808 (1705)
38 cd00053 EGF Epidermal growth f 95.1 0.04 8.7E-07 34.3 4.0 22 333-354 11-32 (36)
39 PF07974 EGF_2: EGF-like domai 94.9 0.032 7E-07 34.3 3.0 26 435-466 7-32 (32)
40 smart00181 EGF Epidermal growt 94.8 0.05 1.1E-06 34.0 3.8 20 333-353 11-30 (35)
41 PF07974 EGF_2: EGF-like domai 94.6 0.048 1E-06 33.6 3.1 23 139-163 8-30 (32)
42 cd01475 vWA_Matrilin VWA_Matri 94.0 0.06 1.3E-06 50.2 4.0 42 186-227 182-223 (224)
43 smart00051 DSL delta serrate l 92.9 0.09 1.9E-06 38.1 2.5 40 30-70 16-63 (63)
44 cd01475 vWA_Matrilin VWA_Matri 91.9 0.18 3.8E-06 47.0 3.9 42 120-166 181-222 (224)
45 PF12946 EGF_MSP1_1: MSP1 EGF 89.4 0.23 4.9E-06 31.4 1.4 25 139-163 7-32 (37)
46 KOG1218|consensus 86.2 29 0.00062 33.7 15.1 15 382-396 160-174 (316)
47 smart00051 DSL delta serrate l 85.1 1.2 2.5E-05 32.3 3.3 44 294-353 17-60 (63)
48 PF12946 EGF_MSP1_1: MSP1 EGF 82.9 1.1 2.5E-05 28.3 2.1 22 332-353 9-31 (37)
49 KOG3512|consensus 79.2 7.6 0.00016 39.3 7.5 112 333-467 358-478 (592)
50 PF09064 Tme5_EGF_like: Thromb 78.1 2.5 5.4E-05 26.1 2.4 25 370-396 6-30 (34)
51 PF01414 DSL: Delta serrate li 77.0 0.34 7.4E-06 35.0 -1.9 39 31-70 17-63 (63)
52 KOG1218|consensus 75.6 62 0.0013 31.3 13.2 16 293-308 48-63 (316)
53 PHA03099 epidermal growth fact 75.0 3 6.4E-05 34.3 2.8 41 422-469 41-83 (139)
54 PF00053 Laminin_EGF: Laminin 72.7 1.1 2.4E-05 30.4 -0.1 21 29-49 16-36 (49)
55 smart00180 EGF_Lam Laminin-typ 72.0 2.6 5.7E-05 28.2 1.6 20 29-48 16-35 (46)
56 cd00055 EGF_Lam Laminin-type e 69.6 2.9 6.3E-05 28.6 1.4 20 29-48 17-36 (50)
57 PHA02887 EGF-like protein; Pro 68.9 5.4 0.00012 32.3 3.0 30 435-469 93-124 (126)
58 PF00954 S_locus_glycop: S-loc 57.5 9.2 0.0002 31.0 2.5 32 125-161 76-107 (110)
59 PHA02887 EGF-like protein; Pro 55.4 5.9 0.00013 32.0 1.0 17 30-46 107-123 (126)
60 PF01683 EB: EB module; Inter 53.0 38 0.00083 23.0 4.8 17 333-353 31-47 (52)
61 PF01724 DUF29: Domain of unkn 49.4 7.7 0.00017 33.1 0.9 19 3-21 16-34 (139)
62 PHA03099 epidermal growth fact 48.1 17 0.00037 30.0 2.6 36 125-163 41-78 (139)
63 PF00954 S_locus_glycop: S-loc 42.4 20 0.00043 29.0 2.3 23 435-458 85-107 (110)
64 PF12955 DUF3844: Domain of un 39.4 21 0.00045 28.5 1.8 44 424-470 6-63 (103)
65 PF12955 DUF3844: Domain of un 38.7 23 0.00051 28.3 2.0 33 127-162 6-43 (103)
66 PTZ00214 high cysteine membran 36.9 2.5E+02 0.0054 31.5 10.2 120 254-403 647-769 (800)
67 KOG3516|consensus 35.2 29 0.00063 39.5 2.7 40 420-468 542-582 (1306)
68 KOG3512|consensus 34.1 1.5E+02 0.0033 30.4 7.1 53 294-353 371-424 (592)
69 KOG3516|consensus 33.7 34 0.00073 39.0 2.9 41 310-362 541-582 (1306)
70 KOG0196|consensus 30.4 54 0.0012 36.0 3.6 58 384-458 259-317 (996)
71 KOG3514|consensus 28.9 35 0.00076 38.4 2.0 35 316-362 625-660 (1591)
72 PF04863 EGF_alliinase: Alliin 25.4 24 0.00052 24.5 0.1 25 22-46 26-51 (56)
73 PTZ00214 high cysteine membran 24.2 3.1E+02 0.0068 30.8 8.3 25 212-236 682-706 (800)
74 PF01826 TIL: Trypsin Inhibito 23.4 60 0.0013 22.3 1.8 18 386-403 35-52 (55)
No 1
>KOG1214|consensus
Probab=99.66 E-value=3.5e-15 Score=152.04 Aligned_cols=190 Identities=32% Similarity=0.758 Sum_probs=137.8
Q ss_pred chhccCCCCCC--CeEEeCCC-CeEEeCCCCceeCCCCCcccccCcccCCCCCC--CCeeeeCCCCceEeCCCCceecCC
Q psy11799 192 NECLELSNQCA--FRCHNVPG-SFRCICPYGYALAPDGRHCIDINECEQFPGLC--AHTCTNTEGSYTCGCHPGFELSSD 266 (483)
Q Consensus 192 ~~C~~~~~~C~--~~C~~~~g-s~~C~C~~G~~~~~~g~~C~~~~~C~~~~~~~--~~~C~~~~g~~~C~C~~G~~~~~~ 266 (483)
+.|....+.|. +.|...++ .|+|.|..||. .++++|.++++|+..+..+ +..|++.+++|+|+|..||.+..+
T Consensus 693 npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~--gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd 770 (1289)
T KOG1214|consen 693 NPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQ--GDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADD 770 (1289)
T ss_pred ccceecCcccCCCccccCCCCcceEEEEeeccC--CCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccC
Confidence 33443334443 45655544 58999999997 4489999999998655444 789999999999999999998876
Q ss_pred CCCcccc------CccccCCCCccc--c--cc--cCCCCeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCC
Q psy11799 267 GSGTCLD------SDECATGRHICQ--Q--IC--NLLGGYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGG 334 (483)
Q Consensus 267 g~~~C~~------~~~C~~~~~~c~--~--~C--~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g 334 (483)
+. +|+. .+.|..+.+.|. + .| ...+.|.|.|.|||.|+ +..|.++|+|..+.|.+ .+
T Consensus 771 ~~-tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD--G~~c~dvDeC~psrChp--------~A 839 (1289)
T KOG1214|consen 771 RH-TCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGD--GHQCTDVDECSPSRCHP--------AA 839 (1289)
T ss_pred Cc-ceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCC--ccccccccccCccccCC--------Cc
Confidence 65 7763 467888777775 3 34 45568999999999998 46788999999999999 89
Q ss_pred EEeeCCCCeeEECCCCCeeccCCCceeeCC---CccCC-----CCCCCC---cEECCCCCeeEeCCCCeeecC
Q psy11799 335 KCFNTAGSYRCGCPDGFQFEAALQLCTQLS---SSCLS-----SPCSFG---CTPLGISGFSCECPSGYQRIG 396 (483)
Q Consensus 335 ~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~---~~C~~-----~~C~~~---c~~~~~~~~~C~C~~Gy~g~~ 396 (483)
+|.+++++|.|.|.+||.| +|..|.... ..|.. ..|... |.+.+.+.+.+.|.++-.|..
T Consensus 840 ~CyntpgsfsC~C~pGy~G--DGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~ 910 (1289)
T KOG1214|consen 840 TCYNTPGSFSCRCQPGYYG--DGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGST 910 (1289)
T ss_pred eEecCCCcceeecccCccC--CCceecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCCCC
Confidence 9999999999999999998 577775431 13331 123321 344444666777666655544
No 2
>KOG1217|consensus
Probab=99.55 E-value=3.1e-13 Score=140.98 Aligned_cols=270 Identities=36% Similarity=0.772 Sum_probs=184.1
Q ss_pred eecccCcccccccc-cccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCCCccccCCCCCCCCCccCCCcccccc
Q psy11799 113 SLKKKKFYSVFLDM-DECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLDQDECTDDSKCREGCQTCEDI 191 (483)
Q Consensus 113 ~~~~~~~~~~C~~~-~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g~~C~~~~~C~~~~~C~~g~~~C~~~ 191 (483)
.|..++.+..+... ++|... ... |.+++.|.+..++|.|.|++||.+. .|+..
T Consensus 155 ~C~~g~~~~~~~~~~~~C~~~-~~~--c~~~~~C~~~~~~~~C~c~~~~~~~----~~~~~------------------- 208 (487)
T KOG1217|consen 155 SCTEGYEGEPCETDLDECIQY-SSP--CQNGGTCVNTGGSYLCSCPPGYTGS----TCETT------------------- 208 (487)
T ss_pred eeCCCcccccccccccccccC-CCC--cCCCcccccCCCCeeEeCCCCccCC----cCcCC-------------------
Confidence 44455555555532 677743 334 9899999999999999999999844 33211
Q ss_pred chhccCCCCCCCeEEeCCCCeEEeCCCCceeCCCCCcc-cccCcccCCCCCCCCeeeeCCCCceEeCCCCceecCCCCCc
Q psy11799 192 NECLELSNQCAFRCHNVPGSFRCICPYGYALAPDGRHC-IDINECEQFPGLCAHTCTNTEGSYTCGCHPGFELSSDGSGT 270 (483)
Q Consensus 192 ~~C~~~~~~C~~~C~~~~gs~~C~C~~G~~~~~~g~~C-~~~~~C~~~~~~~~~~C~~~~g~~~C~C~~G~~~~~~g~~~ 270 (483)
...+.|++. +.|.+.+++. +..| ..+.++... . +.|++..+.++|.|++||.+... . .
T Consensus 209 --------~~~~~c~~~---~~~~~~~g~~----~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~-~-~ 267 (487)
T KOG1217|consen 209 --------GNGGTCVDS---VACSCPPGAR----GPECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC-V-T 267 (487)
T ss_pred --------CCCceEecc---eeccCCCCCC----CCCcccccccccCC---C-CcccccCCceeeeCCCCcccccc-c-e
Confidence 011345544 5788889887 6666 334444433 3 78999988899999999996532 1 5
Q ss_pred cccCccccCCCC-cccccc-cCCCCeeEeCCCCceeCCCCCccccCCcCC----CCCCCCCCCCCCCCCCEE--eeCCCC
Q psy11799 271 CLDSDECATGRH-ICQQIC-NLLGGYKCGCPEGYVRHPFSNECIDDNECA----RNPCEPSSNTTGERGGKC--FNTAGS 342 (483)
Q Consensus 271 C~~~~~C~~~~~-~c~~~C-~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~----~~~C~~~~~~~c~~~g~C--~~~~g~ 342 (483)
+.++++|....+ ...+.| +..+.|.|.|++||++..+ ..+.+..+|. ..+|.+ ++.| ....+.
T Consensus 268 ~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~--------g~~C~~~~~~~~ 338 (487)
T KOG1217|consen 268 CVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCAN--------GGTCNTLGSFGG 338 (487)
T ss_pred eeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCC--------CcccccCCCCCC
Confidence 677888888753 334678 6666699999999999887 4556666774 346666 7788 344557
Q ss_pred eeEECCCCCeeccCCCceeeCCCccCCCCCCCC--cEECCCCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCc
Q psy11799 343 YRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFG--CTPLGISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNL 420 (483)
Q Consensus 343 ~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~--c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C 420 (483)
+.|.|..+| .++.|+...+.|...++.++ |.......+.|.|+.+|.+... .....+
T Consensus 339 ~~C~c~~~~----~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~-----------------~~~~~~ 397 (487)
T KOG1217|consen 339 FRCACGPGF----TGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKAN-----------------GDGVGC 397 (487)
T ss_pred CCcCCCCCC----CCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCc-----------------cccccc
Confidence 889999998 77888876447887777766 4441237899999999987410 001235
Q ss_pred ccccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCCCCcCCC
Q psy11799 421 GDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLDQDECT 473 (483)
Q Consensus 421 ~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~C~ 473 (483)
.++++|.. .+.|++..+++.|. ++ + ..... .|.+++++.
T Consensus 398 ~~~~~c~~----------~~~c~~~~~~~~c~-~~-~-~~~~~-~~~~~~~~~ 436 (487)
T KOG1217|consen 398 EDIDECSG----------CGDCVNGPGGGACT-PP-G-LVSPG-TCDDIDECP 436 (487)
T ss_pred cccccccC----------CcceeccCCCCccc-cC-c-ccCCc-ceecccccc
Confidence 56666642 46778788999999 88 5 44344 777777664
No 3
>KOG1217|consensus
Probab=99.51 E-value=8.1e-13 Score=137.85 Aligned_cols=269 Identities=40% Similarity=0.882 Sum_probs=182.8
Q ss_pred CCCCeEeeCCCceEeeCCCCCeeCCCCCc--cccCCCCCCCCCccCCCccccccchhccCCCCCCCeEEe---CCCCeEE
Q psy11799 140 PKPGTCINTMGSFRCLCPRGFKLDRTGTF--CLDQDECTDDSKCREGCQTCEDINECLELSNQCAFRCHN---VPGSFRC 214 (483)
Q Consensus 140 ~~~g~C~~~~g~~~C~C~~G~~g~~~g~~--C~~~~~C~~~~~C~~g~~~C~~~~~C~~~~~~C~~~C~~---~~gs~~C 214 (483)
...+.+.....+|.|.|.+||.+..+... |..... .+.. .+.|.. ....+.|
T Consensus 98 ~~~~~~~~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~------------~~~~-----------~~~c~~~~~~~~~~~c 154 (487)
T KOG1217|consen 98 LLCGECVDCVGSYECTCPPGYQGTPCEGECECVTGPG------------VCCI-----------DGSCSNGPGSVGPFRC 154 (487)
T ss_pred cCCccccCCCCCceeeCCCccccCcCCcceeecCCCC------------CeeC-----------chhhcCCCCCCCceee
Confidence 35667777888999999999996655442 321110 0011 134444 3457899
Q ss_pred eCCCCceeCCCCCccccc-CcccC--CCCCCCCeeeeCCCCceEeCCCCceecCCCCC----cccc-----------Ccc
Q psy11799 215 ICPYGYALAPDGRHCIDI-NECEQ--FPGLCAHTCTNTEGSYTCGCHPGFELSSDGSG----TCLD-----------SDE 276 (483)
Q Consensus 215 ~C~~G~~~~~~g~~C~~~-~~C~~--~~~~~~~~C~~~~g~~~C~C~~G~~~~~~g~~----~C~~-----------~~~ 276 (483)
.|..||. +..+... ++|.. .++..++.|.+..+.|.|.|+++|.+...... .|.. ...
T Consensus 155 ~C~~g~~----~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~ 230 (487)
T KOG1217|consen 155 SCTEGYE----GEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPE 230 (487)
T ss_pred eeCCCcc----cccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCC
Confidence 9999999 6666533 67773 34777889999999999999999986644321 2221 122
Q ss_pred ccCCCCccc---ccc-cCCCCeeEeCCCCceeCCCCCccccCCcCCCCC-CCCCCCCCCCCCCEEeeCCCCeeEECCCCC
Q psy11799 277 CATGRHICQ---QIC-NLLGGYKCGCPEGYVRHPFSNECIDDNECARNP-CEPSSNTTGERGGKCFNTAGSYRCGCPDGF 351 (483)
Q Consensus 277 C~~~~~~c~---~~C-~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~-C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~ 351 (483)
|......+. +.| +..++|.|.|++||.+... ..+.++++|.... |.+ +++|++..+.|.|.|++||
T Consensus 231 c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~~c~~--------~~~C~~~~~~~~C~C~~g~ 301 (487)
T KOG1217|consen 231 CEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALIASCPN--------GGTCVNVPGSYRCTCPPGF 301 (487)
T ss_pred cccccccccCCCCcccccCCceeeeCCCCcccccc-ceeeeccccCCCCccCC--------CCeeecCCCcceeeCCCCC
Confidence 222222222 456 6677899999999999753 3567788888753 777 8999999999999999999
Q ss_pred eeccCCCceeeCCCccC----CCCCCCCcEE--CC-CCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccc-
Q psy11799 352 QFEAALQLCTQLSSSCL----SSPCSFGCTP--LG-ISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDI- 423 (483)
Q Consensus 352 ~~~~~g~~C~~~~~~C~----~~~C~~~c~~--~~-~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~- 423 (483)
.+... ..+.... .|. ..+|.++..+ .. ...+.|.|.+||.|.. |+..
T Consensus 302 ~g~~~-~~~~~~~-~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~-----------------------C~~~~ 356 (487)
T KOG1217|consen 302 TGRLC-TECVDVD-ECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRR-----------------------CEDSN 356 (487)
T ss_pred CCCCC-ccccccc-cccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCc-----------------------cccCC
Confidence 66443 2232223 563 4557665222 11 1356788888876654 6666
Q ss_pred cccccCCCCccCCCCCCeEee-CCCCeeeeCCCCcccC--CCCCCCCCCcCCCC
Q psy11799 424 DECKEEINGVAVCPKPGTCIN-TMGSFRCLCPRGFKLD--RTGTFCLDQDECTD 474 (483)
Q Consensus 424 ~~C~~~~~~~~~C~~~g~C~~-~~g~~~C~C~~G~~~~--~~g~~C~~~~~C~~ 474 (483)
++|... ++.+++.|++ ..++|.|.|+.+|.+. ..+..+.++++|..
T Consensus 357 ~~C~~~-----~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~ 405 (487)
T KOG1217|consen 357 DECASS-----PCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG 405 (487)
T ss_pred ccccCC-----ccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC
Confidence 378765 6889999999 7899999999999864 57788888888863
No 4
>KOG1214|consensus
Probab=99.44 E-value=4.8e-13 Score=136.72 Aligned_cols=155 Identities=38% Similarity=0.919 Sum_probs=120.6
Q ss_pred cccCCCCccc--ccc--cCCCCeeEeCCCCceeCCCCCccccCCcCCC--CCCCCCCCCCCCCCCEEeeCCCCeeEECCC
Q psy11799 276 ECATGRHICQ--QIC--NLLGGYKCGCPEGYVRHPFSNECIDDNECAR--NPCEPSSNTTGERGGKCFNTAGSYRCGCPD 349 (483)
Q Consensus 276 ~C~~~~~~c~--~~C--~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~--~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~ 349 (483)
+|-...+.|. +.| ...-.|+|.|..||.|+ ...|.++++|+. ..|.+ ++.|++.+++|+|.|..
T Consensus 694 pCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gd--gr~c~d~~eca~~~~~CGp--------~s~Cin~pg~~rceC~~ 763 (1289)
T KOG1214|consen 694 PCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGD--GRNCVDENECATGFHRCGP--------NSVCINLPGSYRCECRS 763 (1289)
T ss_pred cceecCcccCCCccccCCCCcceEEEEeeccCCC--CCCCCChhhhccCCCCCCC--------CceeecCCCceeEEEee
Confidence 3444444444 445 33457999999999987 467889999987 36766 89999999999999999
Q ss_pred CCeeccCCCceeeCCC-----ccC--CCCCCC-C---cEECCCCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCC
Q psy11799 350 GFQFEAALQLCTQLSS-----SCL--SSPCSF-G---CTPLGISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERP 418 (483)
Q Consensus 350 G~~~~~~g~~C~~~~~-----~C~--~~~C~~-~---c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~ 418 (483)
||.....+-+|..+.. .|+ .+.|.. + |...+.+.|+|.|.+||.|++ .
T Consensus 764 gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG---------------------~ 822 (1289)
T KOG1214|consen 764 GYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG---------------------H 822 (1289)
T ss_pred cceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCc---------------------c
Confidence 9998878788975432 455 345642 2 445555789999999999976 3
Q ss_pred CcccccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCCC
Q psy11799 419 NLGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLD 468 (483)
Q Consensus 419 ~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~ 468 (483)
.|.++|+|..+ -|...+.|++++++|.|.|.+||+|| |.+|..
T Consensus 823 ~c~dvDeC~ps-----rChp~A~CyntpgsfsC~C~pGy~GD--Gf~CVP 865 (1289)
T KOG1214|consen 823 QCTDVDECSPS-----RCHPAATCYNTPGSFSCRCQPGYYGD--GFQCVP 865 (1289)
T ss_pred ccccccccCcc-----ccCCCceEecCCCcceeecccCccCC--CceecC
Confidence 47888999865 79999999999999999999999986 777864
No 5
>KOG1219|consensus
Probab=99.26 E-value=9.5e-12 Score=137.65 Aligned_cols=109 Identities=40% Similarity=1.010 Sum_probs=94.9
Q ss_pred CcCCCCCCCCCCCCCCCCCCEEeeCC-CCeeEECCCCCeeccCCCceeeCCCccCCCCCCCC--cEECCCCCeeEeCCCC
Q psy11799 315 NECARNPCEPSSNTTGERGGKCFNTA-GSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFG--CTPLGISGFSCECPSG 391 (483)
Q Consensus 315 ~~C~~~~C~~~~~~~c~~~g~C~~~~-g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~--c~~~~~~~~~C~C~~G 391 (483)
+.|..+||++ +|+|...+ ++|.|.|+.-| .|++|+....+|.++||.+| |.... ++|.|.|+.|
T Consensus 3865 d~C~~npCqh--------gG~C~~~~~ggy~CkCpsqy----sG~~CEi~~epC~snPC~~GgtCip~~-n~f~CnC~~g 3931 (4289)
T KOG1219|consen 3865 DPCNDNPCQH--------GGTCISQPKGGYKCKCPSQY----SGNHCEIDLEPCASNPCLTGGTCIPFY-NGFLCNCPNG 3931 (4289)
T ss_pred cccccCcccC--------CCEecCCCCCceEEeCcccc----cCcccccccccccCCCCCCCCEEEecC-CCeeEeCCCC
Confidence 7899999999 99999866 67999999999 88999987779999999988 55555 8999999999
Q ss_pred eeecCCcceecccCCCCCCCcccCCCCCcc--cccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCCC
Q psy11799 392 YQRIGQGHCLATISSGLYPGFRSYERPNLG--DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLD 468 (483)
Q Consensus 392 y~g~~~g~C~~~~~~~C~~g~~g~~~~~C~--~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~ 468 (483)
|+|.+ |+ .|++|..+ +|.++|.|++..|+|.|.|.+|| -|..|..
T Consensus 3932 yTG~~-----------------------Ce~~Gi~eCs~n-----~C~~gg~C~n~~gsf~CncT~g~----~gr~c~~ 3978 (4289)
T KOG1219|consen 3932 YTGKR-----------------------CEARGISECSKN-----VCGTGGQCINIPGSFHCNCTPGI----LGRTCCA 3978 (4289)
T ss_pred ccCce-----------------------eecccccccccc-----cccCCceeeccCCceEeccChhH----hcccCcc
Confidence 99986 32 27889765 89999999999999999999999 6777753
No 6
>KOG4289|consensus
Probab=99.15 E-value=1.5e-10 Score=124.11 Aligned_cols=88 Identities=32% Similarity=0.866 Sum_probs=76.7
Q ss_pred cCCCCeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCeeccCCCceeeCC--Cc
Q psy11799 289 NLLGGYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLS--SS 366 (483)
Q Consensus 289 ~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~--~~ 366 (483)
+..++++|+|++||+|+.|+ ..+|.|-+.||.+ +|+|....|+|+|.|.+|| +|++|+... ..
T Consensus 1217 ~pvnglrCrCPpGFTgd~Ce---TeiDlCYs~pC~n--------ng~C~srEggYtCeCrpg~----tGehCEvs~~agr 1281 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCE---TEIDLCYSGPCGN--------NGRCRSREGGYTCECRPGF----TGEHCEVSARAGR 1281 (2531)
T ss_pred cccCceeEeCCCCCCccccc---chhHhhhcCCCCC--------CCceEEecCceeEEecCCc----cccceeeecccCc
Confidence 45578899999999999888 7799999999998 9999999999999999999 778897532 26
Q ss_pred cCCCCCCCC--cEECCCCCeeEeCCCC
Q psy11799 367 CLSSPCSFG--CTPLGISGFSCECPSG 391 (483)
Q Consensus 367 C~~~~C~~~--c~~~~~~~~~C~C~~G 391 (483)
|.+..|.++ |+....+.+.|.|+.|
T Consensus 1282 CvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1282 CVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred cccceecCCCEEeecCCCceeccCCCc
Confidence 889999988 6777778899999998
No 7
>KOG4289|consensus
Probab=99.12 E-value=7.3e-10 Score=119.08 Aligned_cols=87 Identities=37% Similarity=0.852 Sum_probs=70.2
Q ss_pred EeCCCCeEEeCCCCceeCCCCCcc-cccCcccCCCCCCCCeeeeCCCCceEeCCCCceecCCCCCccc---cCccccCCC
Q psy11799 206 HNVPGSFRCICPYGYALAPDGRHC-IDINECEQFPGLCAHTCTNTEGSYTCGCHPGFELSSDGSGTCL---DSDECATGR 281 (483)
Q Consensus 206 ~~~~gs~~C~C~~G~~~~~~g~~C-~~~~~C~~~~~~~~~~C~~~~g~~~C~C~~G~~~~~~g~~~C~---~~~~C~~~~ 281 (483)
++..+.++|.|++||+ |..| .+++.|-..||.+++.|....++|+|.|.+||+ |. .|+ ....|.++-
T Consensus 1216 i~pvnglrCrCPpGFT----gd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~t----Ge-hCEvs~~agrCvpGv 1286 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFT----GDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFT----GE-HCEVSARAGRCVPGV 1286 (2531)
T ss_pred ccccCceeEeCCCCCC----cccccchhHhhhcCCCCCCCceEEecCceeEEecCCcc----cc-ceeeecccCccccce
Confidence 3455678999999999 8899 579999999999999999999999999999999 44 443 234566665
Q ss_pred Ccccccc--cCCCCeeEeCCCC
Q psy11799 282 HICQQIC--NLLGGYKCGCPEG 301 (483)
Q Consensus 282 ~~c~~~C--~~~g~~~C~C~~G 301 (483)
+...+.| ...+++.|.|+.|
T Consensus 1287 C~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1287 CKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred ecCCCEEeecCCCceeccCCCc
Confidence 5555788 5667889999876
No 8
>KOG0994|consensus
Probab=99.05 E-value=1.4e-09 Score=115.02 Aligned_cols=73 Identities=21% Similarity=0.383 Sum_probs=47.1
Q ss_pred CEEeeCCCCeeEECCCCCeeccCCCceee--------CCCccCCCCCCCCcEECC---CCCeeEeCCCCeeecCCcceec
Q psy11799 334 GKCFNTAGSYRCGCPDGFQFEAALQLCTQ--------LSSSCLSSPCSFGCTPLG---ISGFSCECPSGYQRIGQGHCLA 402 (483)
Q Consensus 334 g~C~~~~g~~~C~C~~G~~~~~~g~~C~~--------~~~~C~~~~C~~~c~~~~---~~~~~C~C~~Gy~g~~~g~C~~ 402 (483)
-+|....| +|+|.+|| .|+.|.. .+..|..-.|....+.++ ....+|+|.+|-.|....
T Consensus 1077 pqCN~ftG--QCqCkpGf----GGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd---- 1146 (1758)
T KOG0994|consen 1077 PQCNEFTG--QCQCKPGF----GGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCD---- 1146 (1758)
T ss_pred cccccccc--ceeccCCC----CCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchh----
Confidence 36666666 99999999 6777641 222455555554322221 256789999998887644
Q ss_pred ccCCCCCCCcccCCCCCcc
Q psy11799 403 TISSGLYPGFRSYERPNLG 421 (483)
Q Consensus 403 ~~~~~C~~g~~g~~~~~C~ 421 (483)
.|.+||+|.. +.|.
T Consensus 1147 ----qCaRgy~G~f-P~C~ 1160 (1758)
T KOG0994|consen 1147 ----QCARGYSGQF-PVCV 1160 (1758)
T ss_pred ----hhhhhhcCCC-CCCc
Confidence 4778999876 4444
No 9
>KOG1219|consensus
Probab=99.03 E-value=5.1e-10 Score=124.48 Aligned_cols=106 Identities=33% Similarity=0.801 Sum_probs=95.0
Q ss_pred ccccCCCCcccccc--cCCCCeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCe
Q psy11799 275 DECATGRHICQQIC--NLLGGYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352 (483)
Q Consensus 275 ~~C~~~~~~c~~~C--~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~ 352 (483)
+.|...++...+.| ...++|.|.|++-|+|..|+ .++..|..+||.. ||+|+...++|.|.|+.||
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE---i~~epC~snPC~~--------GgtCip~~n~f~CnC~~gy- 3932 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE---IDLEPCASNPCLT--------GGTCIPFYNGFLCNCPNGY- 3932 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccc---cccccccCCCCCC--------CCEEEecCCCeeEeCCCCc-
Confidence 78888877777999 56678999999999999998 7788999999998 9999999999999999999
Q ss_pred eccCCCceeeC-CCccCCCCCCCC--cEECCCCCeeEeCCCCeeecC
Q psy11799 353 FEAALQLCTQL-SSSCLSSPCSFG--CTPLGISGFSCECPSGYQRIG 396 (483)
Q Consensus 353 ~~~~g~~C~~~-~~~C~~~~C~~~--c~~~~~~~~~C~C~~Gy~g~~ 396 (483)
+|++|+.. .++|+.++|.++ |.+.. ++|+|.|.+||.|..
T Consensus 3933 ---TG~~Ce~~Gi~eCs~n~C~~gg~C~n~~-gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3933 ---TGKRCEARGISECSKNVCGTGGQCINIP-GSFHCNCTPGILGRT 3975 (4289)
T ss_pred ---cCceeecccccccccccccCCceeeccC-CceEeccChhHhccc
Confidence 88899875 568999999987 77777 899999999999876
No 10
>KOG0994|consensus
Probab=98.68 E-value=3.8e-07 Score=97.20 Aligned_cols=41 Identities=10% Similarity=0.136 Sum_probs=31.1
Q ss_pred ccCCCceEEEeCCCCeeeEeeecCCCCcccCCceeeeccCcceeeecc
Q psy11799 25 NMLAGIFVWSLASQWTGEILTNVSPYPTSVNMASVSTRKGLTGVSVTR 72 (483)
Q Consensus 25 ~~~~g~~~C~C~~g~~G~~C~~c~~~~~~~~~~~C~c~~g~~g~~C~~ 72 (483)
-|+.|...|.|.|. |.. +.-|..+++.|.|.++..|.+|+.
T Consensus 757 ~l~n~a~~CnCnpt--GSl-----S~vCn~~GGqCqCkPnVVGR~Cdq 797 (1758)
T KOG0994|consen 757 LLHNGASMCNCNPT--GSL-----SSVCNPNGGQCQCKPNVVGRRCDQ 797 (1758)
T ss_pred HHhcCccccccCCC--ccc-----cccccCCCceecccCccccccccc
Confidence 35566677888875 443 445677888999999999999985
No 11
>KOG1225|consensus
Probab=98.50 E-value=5.9e-07 Score=91.61 Aligned_cols=76 Identities=36% Similarity=0.749 Sum_probs=51.5
Q ss_pred CeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCeeccCCCceeeCCCccCCCCC
Q psy11799 293 GYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPC 372 (483)
Q Consensus 293 ~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C 372 (483)
.+.|.|..+|.+..+. ...|.. -|.+ ++.|++. +|+|++|| +|..|..+. |... |
T Consensus 233 ~~ic~c~~~~~g~~c~-----~~~C~~-~c~~--------~g~c~~G----~CIC~~Gf----~G~dC~e~~--Cp~~-c 287 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCS-----TIYCPG-GCTG--------RGQCVEG----RCICPPGF----TGDDCDELV--CPVD-C 287 (525)
T ss_pred CceeecCCceeCCccc-----cccCCC-CCcc--------cceEeCC----eEeCCCCC----cCCCCCccc--CCcc-c
Confidence 3489999999986443 112221 2333 5788876 99999999 666776543 5544 7
Q ss_pred CCCcEECCCCCeeEeCCCCeeecC
Q psy11799 373 SFGCTPLGISGFSCECPSGYQRIG 396 (483)
Q Consensus 373 ~~~c~~~~~~~~~C~C~~Gy~g~~ 396 (483)
+.+....+ ..|+|++||.|..
T Consensus 288 s~~g~~~~---g~CiC~~g~~G~d 308 (525)
T KOG1225|consen 288 SGGGVCVD---GECICNPGYSGKD 308 (525)
T ss_pred CCCceecC---CEeecCCCccccc
Confidence 66655554 3999999999976
No 12
>KOG4260|consensus
Probab=98.42 E-value=3.1e-07 Score=83.48 Aligned_cols=164 Identities=29% Similarity=0.661 Sum_probs=100.7
Q ss_pred EeecccCcccccccccccccCCCCccCCCCCCeEeeC---CCceEeeCCCCCeeCCCCCccccC--CCCCCCCCccCCCc
Q psy11799 112 ISLKKKKFYSVFLDMDECKEEINGVAVCPKPGTCINT---MGSFRCLCPRGFKLDRTGTFCLDQ--DECTDDSKCREGCQ 186 (483)
Q Consensus 112 ~~~~~~~~~~~C~~~~~C~~~~~~~~~C~~~g~C~~~---~g~~~C~C~~G~~g~~~g~~C~~~--~~C~~~~~C~~g~~ 186 (483)
..|+.+.||.+|. .|...... ||..+|+|.-. .|+-.|.|.+||+ |..|..- ....... ..-..
T Consensus 130 vCCp~gtyGpdCl---~Cpggser--~C~GnG~C~GdGsR~GsGkCkC~~GY~----Gp~C~~Cg~eyfes~R--ne~~l 198 (350)
T KOG4260|consen 130 VCCPDGTYGPDCL---QCPGGSER--PCFGNGSCHGDGSREGSGKCKCETGYT----GPLCRYCGIEYFESSR--NEQHL 198 (350)
T ss_pred eccCCCCcCCccc---cCCCCCcC--CcCCCCcccCCCCCCCCCcccccCCCC----CccccccchHHHHhhc--ccccc
Confidence 3567888899987 34332122 38888999743 5677999999999 5555311 0000000 00000
Q ss_pred cccccchhccCCCCCCCeEEeCCCCeEE-eCCCCceeCCCCCcccccCcccC--CCCCCCCeeeeCCCCceEeCCCCcee
Q psy11799 187 TCEDINECLELSNQCAFRCHNVPGSFRC-ICPYGYALAPDGRHCIDINECEQ--FPGLCAHTCTNTEGSYTCGCHPGFEL 263 (483)
Q Consensus 187 ~C~~~~~C~~~~~~C~~~C~~~~gs~~C-~C~~G~~~~~~g~~C~~~~~C~~--~~~~~~~~C~~~~g~~~C~C~~G~~~ 263 (483)
.|+. | ...|.+.|... ++-.| .|..||.++ ...|+||++|.. .|+.....|+|+.|+|.|...+||..
T Consensus 199 vCt~---C---h~~C~~~Csg~-~~k~C~kCkkGW~ld--e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 199 VCTA---C---HEGCLGVCSGE-SSKGCSKCKKGWKLD--EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred hhhh---h---hhhhhcccCCC-CCCChhhhcccceec--ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 1110 0 11122344332 22345 599999987 556999999985 45555779999999999999999973
Q ss_pred cCCCCCccccCccccCCCCccc---ccc-cCCCCeeEeCCCCcee
Q psy11799 264 SSDGSGTCLDSDECATGRHICQ---QIC-NLLGGYKCGCPEGYVR 304 (483)
Q Consensus 264 ~~~g~~~C~~~~~C~~~~~~c~---~~C-~~~g~~~C~C~~G~~g 304 (483)
. +|+|+.-...+. ..| +..+.|+|.|..|+.-
T Consensus 270 g---------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 270 G---------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred C---------hHHhhhhhhhcccCCCCcccCCccEEEEeccccee
Confidence 1 345543222232 456 8889999999988863
No 13
>KOG4260|consensus
Probab=98.37 E-value=6.6e-07 Score=81.41 Aligned_cols=156 Identities=27% Similarity=0.628 Sum_probs=101.7
Q ss_pred CCCCceeCCCCCcccccCcccCCCCCCCCeeee---CCCCceEeCCCCceecCCCCCcccc------C----ccccCCCC
Q psy11799 216 CPYGYALAPDGRHCIDINECEQFPGLCAHTCTN---TEGSYTCGCHPGFELSSDGSGTCLD------S----DECATGRH 282 (483)
Q Consensus 216 C~~G~~~~~~g~~C~~~~~C~~~~~~~~~~C~~---~~g~~~C~C~~G~~~~~~g~~~C~~------~----~~C~~~~~ 282 (483)
|++|-. |+.|..-.--...||.-++.|.- ..|+-+|.|.+||.+..+.. |.. . -.|..-..
T Consensus 132 Cp~gty----GpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~--Cg~eyfes~Rne~~lvCt~Ch~ 205 (350)
T KOG4260|consen 132 CPDGTY----GPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY--CGIEYFESSRNEQHLVCTACHE 205 (350)
T ss_pred cCCCCc----CCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc--cchHHHHhhcccccchhhhhhh
Confidence 777776 88774321112356666777753 24667899999999554321 211 0 11221112
Q ss_pred cccccccCCCCeeEe-CCCCceeCCCCCccccCCcCCC--CCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCeeccCCCc
Q psy11799 283 ICQQICNLLGGYKCG-CPEGYVRHPFSNECIDDNECAR--NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQL 359 (483)
Q Consensus 283 ~c~~~C~~~g~~~C~-C~~G~~g~~~~~~C~~~~~C~~--~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~~~~g~~ 359 (483)
.|.+.|...++..|. |..||..+ +..|.|+++|.. .+|.. +-.|+|+.|+|.|...+||... -+.
T Consensus 206 ~C~~~Csg~~~k~C~kCkkGW~ld--e~gCvDvnEC~~ep~~c~~--------~qfCvNteGSf~C~dk~Gy~~g--~d~ 273 (350)
T KOG4260|consen 206 GCLGVCSGESSKGCSKCKKGWKLD--EEGCVDVNECQNEPAPCKA--------HQFCVNTEGSFKCEDKEGYKKG--VDE 273 (350)
T ss_pred hhhcccCCCCCCChhhhcccceec--ccccccHHHHhcCCCCCCh--------hheeecCCCceEecccccccCC--hHH
Confidence 334456556666675 99999988 678999999986 47776 7899999999999999999652 222
Q ss_pred eeeCCCccCCCCCC--CC-cEECCCCCeeEeCCCCeeec
Q psy11799 360 CTQLSSSCLSSPCS--FG-CTPLGISGFSCECPSGYQRI 395 (483)
Q Consensus 360 C~~~~~~C~~~~C~--~~-c~~~~~~~~~C~C~~Gy~g~ 395 (483)
|+.. ...|. +. |.+++ +.|+|.|..|+.-.
T Consensus 274 C~~~-----~d~~~~kn~~c~ni~-~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 274 CQFC-----ADVCASKNRPCMNID-GQYRCVCFSGLIII 306 (350)
T ss_pred hhhh-----hhhcccCCCCcccCC-ccEEEEecccceee
Confidence 3221 13332 22 67777 89999999997643
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.36 E-value=2.8e-07 Score=61.27 Aligned_cols=42 Identities=55% Similarity=1.214 Sum_probs=34.7
Q ss_pred cccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 466 (483)
Q Consensus 422 ~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C 466 (483)
|||||....+ +|..+++|+|+.|+|+|+|++||+....+..|
T Consensus 1 DidEC~~~~~---~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 1 DIDECAEGPH---NCPENGTCVNTEGSYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp ESSTTTTTSS---SSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred CccccCCCCC---cCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence 6889987654 89999999999999999999999855555443
No 15
>KOG1225|consensus
Probab=98.31 E-value=1.3e-06 Score=89.12 Aligned_cols=102 Identities=17% Similarity=0.273 Sum_probs=70.7
Q ss_pred eEEEeCCCCeeeEeee--cCCC----CcccCCceeeeccCcceeeecccCcceeeeeccccccCceEEEeeCceeEEEEE
Q psy11799 31 FVWSLASQWTGEILTN--VSPY----PTSVNMASVSTRKGLTGVSVTRDTSYTMAILNVEQDKQGILIFSMLQTRYFNLF 104 (483)
Q Consensus 31 ~~C~C~~g~~G~~C~~--c~~~----~~~~~~~~C~c~~g~~g~~C~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 104 (483)
+.|.|..+|.|+.|+. |.++ ..++. +.|.|++||+|..|+....... ......+.+|
T Consensus 234 ~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~-G~CIC~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g--------------- 296 (525)
T KOG1225|consen 234 GICECPEGYFGPLCSTIYCPGGCTGRGQCVE-GRCICPPGFTGDDCDELVCPVD-CSGGGVCVDG--------------- 296 (525)
T ss_pred ceeecCCceeCCccccccCCCCCcccceEeC-CeEeCCCCCcCCCCCcccCCcc-cCCCceecCC---------------
Confidence 4799999999999994 4433 22333 5999999999999997433333 1111222222
Q ss_pred EEEEEEEEeecccCcccccccccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCCCccc
Q psy11799 105 YVIIFYVISLKKKKFYSVFLDMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCL 170 (483)
Q Consensus 105 ~~~~~~~~~~~~~~~~~~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g~~C~ 170 (483)
++.|..++++..|+. ..|..+ |.++|.|++. +|.|.+||+ |..|.
T Consensus 297 ------~CiC~~g~~G~dCs~-~~cpad------C~g~G~Ci~G----~C~C~~Gy~----G~~C~ 341 (525)
T KOG1225|consen 297 ------ECICNPGYSGKDCSI-RRCPAD------CSGHGKCIDG----ECLCDEGYT----GELCI 341 (525)
T ss_pred ------EeecCCCcccccccc-ccCCcc------CCCCCcccCC----ceEeCCCCc----CCccc
Confidence 235678888999973 347654 9999999933 699999999 55554
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.29 E-value=7.3e-07 Score=59.27 Aligned_cols=40 Identities=53% Similarity=1.178 Sum_probs=33.4
Q ss_pred cccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCCC
Q psy11799 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGT 167 (483)
Q Consensus 125 ~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g~ 167 (483)
|||||... ++. |..+++|+|+.|+|+|.|++||.....+.
T Consensus 1 DidEC~~~-~~~--C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~ 40 (42)
T PF07645_consen 1 DIDECAEG-PHN--CPENGTCVNTEGSYSCSCPPGYELNDDGT 40 (42)
T ss_dssp ESSTTTTT-SSS--SSTTSEEEEETTEEEEEESTTEEECTTSS
T ss_pred CccccCCC-CCc--CCCCCEEEcCCCCEEeeCCCCcEECCCCC
Confidence 68999976 555 98899999999999999999998554443
No 17
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=98.07 E-value=3.2e-06 Score=47.94 Aligned_cols=24 Identities=46% Similarity=1.039 Sum_probs=22.8
Q ss_pred CeeeeCCCCcccCCCCCCCCCCcC
Q psy11799 448 SFRCLCPRGFKLDRTGTFCLDQDE 471 (483)
Q Consensus 448 ~~~C~C~~G~~~~~~g~~C~~~~~ 471 (483)
||+|+|++||++++++++|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 699999999999999999999986
No 18
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.70 E-value=3.2e-05 Score=49.00 Aligned_cols=33 Identities=55% Similarity=1.207 Sum_probs=27.4
Q ss_pred CCCCCCCeEEeCCCCeEEeCCCCceeCCCCCcc
Q psy11799 197 LSNQCAFRCHNVPGSFRCICPYGYALAPDGRHC 229 (483)
Q Consensus 197 ~~~~C~~~C~~~~gs~~C~C~~G~~~~~~g~~C 229 (483)
.+..|...|++.+++|+|.|++||.|.+|+++|
T Consensus 4 ~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 4 NNGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence 356678999999999999999999999998876
No 19
>KOG1226|consensus
Probab=97.69 E-value=0.00038 Score=72.86 Aligned_cols=146 Identities=22% Similarity=0.506 Sum_probs=89.6
Q ss_pred ccccCCCCe---eEeCCCCceeCCCCCc--cc----cCCcCCCCCCCCCCCC-CCCCCCEEeeCCCCeeEECCCCCeecc
Q psy11799 286 QICNLLGGY---KCGCPEGYVRHPFSNE--CI----DDNECARNPCEPSSNT-TGERGGKCFNTAGSYRCGCPDGFQFEA 355 (483)
Q Consensus 286 ~~C~~~g~~---~C~C~~G~~g~~~~~~--C~----~~~~C~~~~C~~~~~~-~c~~~g~C~~~~g~~~C~C~~G~~~~~ 355 (483)
..|+..|.+ .|.|.+||.|..|+-. .. ..+.|.. .... .|...|.|+=. +|+|.+...+..
T Consensus 467 ~~C~g~G~~~CG~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~-----~~~~~vCSgrG~C~CG----qC~C~~~~~~~i 537 (783)
T KOG1226|consen 467 ALCHGNGTFVCGQCRCDEGWLGKKCECSTDELSSSEEEDKCRE-----NSDSPVCSGRGDCVCG----QCVCHKPDNGKI 537 (783)
T ss_pred cccCCCCcEEecceecCCCCCCCcccCCccccCcHhHHhhccC-----CCCCCCcCCCCcEeCC----ceEecCCCCCce
Confidence 445544444 5789999999887611 10 1123322 2221 44448888766 899998887666
Q ss_pred CCCceeeCCCccC---CCCCCCCcEECCCCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccccccccCCCC
Q psy11799 356 ALQLCTQLSSSCL---SSPCSFGCTPLGISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDIDECKEEING 432 (483)
Q Consensus 356 ~g~~C~~~~~~C~---~~~C~~~c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~ 432 (483)
.|+.|+-.+-.|. ...|.....+.- ..|.|.+||+|...+ |+ .+.+.|.....
T Consensus 538 ~G~fCECDnfsC~r~~g~lC~g~G~C~C---G~CvC~~GwtG~~C~---------C~-----------~std~C~~~~G- 593 (783)
T KOG1226|consen 538 YGKFCECDNFSCERHKGVLCGGHGRCEC---GRCVCNPGWTGSACN---------CP-----------LSTDTCESSDG- 593 (783)
T ss_pred eeeeeeccCcccccccCcccCCCCeEeC---CcEEcCCCCccCCCC---------CC-----------CCCccccCCCC-
Confidence 7899976554555 335664433322 469999999987621 11 24566655322
Q ss_pred ccCCCCCCeEeeCCCCeeeeCCCC-cccCCCCCCCCCCcCCC
Q psy11799 433 VAVCPKPGTCINTMGSFRCLCPRG-FKLDRTGTFCLDQDECT 473 (483)
Q Consensus 433 ~~~C~~~g~C~~~~g~~~C~C~~G-~~~~~~g~~C~~~~~C~ 473 (483)
. .|...|+|.=. +|+|... | .|..|+.-..|.
T Consensus 594 ~-iCSGrG~C~Cg----~C~C~~~~~----sG~~CE~cptc~ 626 (783)
T KOG1226|consen 594 Q-ICSGRGTCECG----RCKCTDPPY----SGEFCEKCPTCP 626 (783)
T ss_pred c-eeCCCceeeCC----ceEcCCCCc----CcchhhcCCCCC
Confidence 2 68888888644 3888877 8 788888544443
No 20
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.66 E-value=3.8e-05 Score=43.61 Aligned_cols=24 Identities=63% Similarity=1.370 Sum_probs=22.1
Q ss_pred CeEEeCCCCceeCCCCCcccccCc
Q psy11799 211 SFRCICPYGYALAPDGRHCIDINE 234 (483)
Q Consensus 211 s~~C~C~~G~~~~~~g~~C~~~~~ 234 (483)
+|+|+|++||.+.++++.|++|++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 589999999999999999999875
No 21
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.61 E-value=3.9e-05 Score=48.70 Aligned_cols=29 Identities=48% Similarity=1.098 Sum_probs=23.1
Q ss_pred CCCCCeEeeCCCceEeeCCCCCeeCCCCCcc
Q psy11799 139 CPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 169 (483)
Q Consensus 139 C~~~g~C~~~~g~~~C~C~~G~~g~~~g~~C 169 (483)
|+.+++|+++.++|.|+|++||.|+ |..|
T Consensus 8 C~~nA~C~~~~~~~~C~C~~Gy~Gd--G~~C 36 (36)
T PF12947_consen 8 CHPNATCTNTGGSYTCTCKPGYEGD--GFFC 36 (36)
T ss_dssp S-TTCEEEE-TTSEEEEE-CEEECC--STCE
T ss_pred CCCCcEeecCCCCEEeECCCCCccC--CcCC
Confidence 9999999999999999999999987 5543
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.59 E-value=9.8e-05 Score=47.94 Aligned_cols=38 Identities=50% Similarity=1.157 Sum_probs=29.9
Q ss_pred ccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCC
Q psy11799 423 IDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCL 467 (483)
Q Consensus 423 ~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~ 467 (483)
+++|... .+|.++++|++..++|+|.|++||. .|..|+
T Consensus 2 ~~~C~~~----~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C~ 39 (39)
T smart00179 2 IDECASG----NPCQNGGTCVNTVGSYRCECPPGYT---DGRNCE 39 (39)
T ss_pred cccCcCC----CCcCCCCEeECCCCCeEeECCCCCc---cCCcCC
Confidence 5667642 2899999999999999999999994 266653
No 23
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.58 E-value=4e-05 Score=47.43 Aligned_cols=26 Identities=38% Similarity=1.005 Sum_probs=23.7
Q ss_pred CCCCCCeEeeCC-CCeeeeCCCCcccC
Q psy11799 435 VCPKPGTCINTM-GSFRCLCPRGFKLD 460 (483)
Q Consensus 435 ~C~~~g~C~~~~-g~~~C~C~~G~~~~ 460 (483)
+|.++|+|++.. ++|+|+|++||+|.
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 899999999988 99999999999553
No 24
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.50 E-value=6.2e-05 Score=47.77 Aligned_cols=30 Identities=47% Similarity=1.025 Sum_probs=23.6
Q ss_pred CCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799 435 VCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 466 (483)
Q Consensus 435 ~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C 466 (483)
.|+.+|+|+++.++|+|+|++||.|+ |..|
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd--G~~C 36 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD--GFFC 36 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC--STCE
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC--CcCC
Confidence 79999999999999999999999886 5543
No 25
>KOG1836|consensus
Probab=97.47 E-value=0.0023 Score=74.40 Aligned_cols=57 Identities=25% Similarity=0.524 Sum_probs=34.2
Q ss_pred CCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccccccccCCCCccCCCCCC----eEeeCCCCeeeeCCCCc
Q psy11799 382 SGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDIDECKEEINGVAVCPKPG----TCINTMGSFRCLCPRGF 457 (483)
Q Consensus 382 ~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~~~~C~~~g----~C~~~~g~~~C~C~~G~ 457 (483)
....|.|.+|-+|.+.++| ..++++.....|.. | .|...| +|....| +|.|.++|
T Consensus 957 ~tGqc~c~~gVtgqrc~qc--------~~~~~~~~~~gc~~---c--------~c~~~Gs~~~qc~~~~G--~c~c~~~~ 1015 (1705)
T KOG1836|consen 957 GTGQCYCRPGVTGQRCDQC--------ETYHFGFQTEGCGL---C--------ECDPLGSRGFQCDPEDG--QCPCRPGF 1015 (1705)
T ss_pred cCCceeeecCccccccCcc--------ccCcccccccCCcc---e--------ecccCCcccceecccCC--eeeecCCC
Confidence 5678999999988876654 44444433222322 2 233333 6766555 58888888
Q ss_pred cc
Q psy11799 458 KL 459 (483)
Q Consensus 458 ~~ 459 (483)
.|
T Consensus 1016 ~g 1017 (1705)
T KOG1836|consen 1016 EG 1017 (1705)
T ss_pred CC
Confidence 43
No 26
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.40 E-value=0.00019 Score=62.40 Aligned_cols=138 Identities=28% Similarity=0.700 Sum_probs=85.5
Q ss_pred CCCCeeEeCCCCceeCCCCCccccCCcCCC-----CCCCCCCCCCCCCCCEEeeCC-----CCeeEECCCCCeeccCCCc
Q psy11799 290 LLGGYKCGCPEGYVRHPFSNECIDDNECAR-----NPCEPSSNTTGERGGKCFNTA-----GSYRCGCPDGFQFEAALQL 359 (483)
Q Consensus 290 ~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~-----~~C~~~~~~~c~~~g~C~~~~-----g~~~C~C~~G~~~~~~g~~ 359 (483)
....|.|.|.+||.... +.+|+...+|.. .+|.. -+.|++.. ..|.|.|.+||.+. ...
T Consensus 16 MSNHfEC~Cnegfvl~~-EntCE~kv~C~~~e~~~K~Cgd--------ya~C~~~~~~~~~~~~~C~C~~gY~~~--~~v 84 (197)
T PF06247_consen 16 MSNHFECKCNEGFVLKN-ENTCEEKVECDKLENVNKPCGD--------YAKCINQANKGEERAYKCDCINGYILK--QGV 84 (197)
T ss_dssp ESSEEEEEESTTEEEEE-TTEEEE----SG-GGTTSEEET--------TEEEEE-SSTTSSTSEEEEE-TTEEES--SSS
T ss_pred ccCceEEEcCCCcEEcc-ccccccceecCcccccCccccc--------hhhhhcCCCcccceeEEEecccCceee--CCe
Confidence 34568999999999864 567877777754 25554 88998876 46999999999884 346
Q ss_pred eeeCCCccCCCCCCCC-cEECCC--CCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccccccccCCCCccCC
Q psy11799 360 CTQLSSSCLSSPCSFG-CTPLGI--SGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDIDECKEEINGVAVC 436 (483)
Q Consensus 360 C~~~~~~C~~~~C~~~-c~~~~~--~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~~~~C 436 (483)
|... .|....|..| |..++. ...+|+|.-|+.-...+.|.-.. -..|+. -|
T Consensus 85 Cvp~--~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G------------------~T~C~L------KC 138 (197)
T PF06247_consen 85 CVPN--KCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTG------------------ETKCSL------KC 138 (197)
T ss_dssp EEEG--GGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------------------------------
T ss_pred Echh--hcCceecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCC------------------ccceee------ec
Confidence 7644 4777777777 775542 34599999999844444443211 124544 47
Q ss_pred CCCCeEeeCCCCeeeeCCCCcccCCCCC
Q psy11799 437 PKPGTCINTMGSFRCLCPRGFKLDRTGT 464 (483)
Q Consensus 437 ~~~g~C~~~~g~~~C~C~~G~~~~~~g~ 464 (483)
..+-+|....+-|+|+|.+||.++..+.
T Consensus 139 k~nE~CK~~~~~Y~C~~~~~~~~~~~~~ 166 (197)
T PF06247_consen 139 KENEECKLVDGYYKCVCKEGFPGDGEGE 166 (197)
T ss_dssp TTTEEEEEETTEEEEEE-TT-EEETTT-
T ss_pred CCCcceeeeCcEEEeecCCCCCCCCCcc
Confidence 7788999999999999999998776554
No 27
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.24 E-value=0.0002 Score=44.35 Aligned_cols=30 Identities=43% Similarity=1.270 Sum_probs=25.3
Q ss_pred CCCCCCCCCCCCCCCCCCEEeeCC-CCeeEECCCCCeec
Q psy11799 317 CARNPCEPSSNTTGERGGKCFNTA-GSYRCGCPDGFQFE 354 (483)
Q Consensus 317 C~~~~C~~~~~~~c~~~g~C~~~~-g~~~C~C~~G~~~~ 354 (483)
|..++|.+ +|+|++.. ++|.|.|++||.|.
T Consensus 1 C~~~~C~n--------~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQN--------GGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTT--------TEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCC--------CeEEEeCCCCCEEeECCCCCccC
Confidence 44567887 99999999 99999999999653
No 28
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.07 E-value=0.00086 Score=42.94 Aligned_cols=28 Identities=57% Similarity=1.373 Sum_probs=24.7
Q ss_pred CCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799 435 VCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 466 (483)
Q Consensus 435 ~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C 466 (483)
+|.+++.|++..++|+|.|++|| .|..|
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~----~g~~C 37 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGY----TGRNC 37 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCC----cCCcC
Confidence 79889999999999999999999 55555
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.04 E-value=0.0012 Score=57.61 Aligned_cols=138 Identities=25% Similarity=0.619 Sum_probs=83.6
Q ss_pred CeEEeCCCCeEEeCCCCceeCCCCCcccccCcccC-----CCCCCCCeeeeCC-----CCceEeCCCCceecCCCCCccc
Q psy11799 203 FRCHNVPGSFRCICPYGYALAPDGRHCIDINECEQ-----FPGLCAHTCTNTE-----GSYTCGCHPGFELSSDGSGTCL 272 (483)
Q Consensus 203 ~~C~~~~gs~~C~C~~G~~~~~~g~~C~~~~~C~~-----~~~~~~~~C~~~~-----g~~~C~C~~G~~~~~~g~~~C~ 272 (483)
|.-+.....|.|.|.+||.+. +..+|+...+|.. .+|...+.|++.. ..|.|.|.+||.+... .|.
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~-~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~---vCv 86 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLK-NENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG---VCV 86 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEE-ETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS---SEE
T ss_pred CEEEEccCceEEEcCCCcEEc-cccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC---eEc
Confidence 566777788999999999875 3567876666653 4566678898765 5789999999997532 455
Q ss_pred cCccccCCCCccc-ccc----cCCCCeeEeCCCCceeCCCCCcccc--CCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeE
Q psy11799 273 DSDECATGRHICQ-QIC----NLLGGYKCGCPEGYVRHPFSNECID--DNECARNPCEPSSNTTGERGGKCFNTAGSYRC 345 (483)
Q Consensus 273 ~~~~C~~~~~~c~-~~C----~~~g~~~C~C~~G~~g~~~~~~C~~--~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C 345 (483)
. ..|... .|. +.| .......|.|.-|++ ......|.. ..+|.. .|.. +..|....+-|+|
T Consensus 87 p-~~C~~~--~Cg~GKCI~d~~~~~~~~CSC~IGkV-~~dn~kCtk~G~T~C~L-KCk~--------nE~CK~~~~~Y~C 153 (197)
T PF06247_consen 87 P-NKCNNK--DCGSGKCILDPDNPNNPTCSCNIGKV-PDDNKKCTKTGETKCSL-KCKE--------NEECKLVDGYYKC 153 (197)
T ss_dssp E-GGGSS-----TTEEEEEEEGGGSEEEEEE-TEEE-TTTTTESEEEE---------TT--------TEEEEEETTEEEE
T ss_pred h-hhcCce--ecCCCeEEecCCCCCCceeEeeeceE-eccCCcccCCCccceee-ecCC--------CcceeeeCcEEEe
Confidence 3 345443 343 666 333355999999999 333344432 122322 3333 7899999999999
Q ss_pred ECCCCCeeccCC
Q psy11799 346 GCPDGFQFEAAL 357 (483)
Q Consensus 346 ~C~~G~~~~~~g 357 (483)
.+.+||.++..+
T Consensus 154 ~~~~~~~~~~~~ 165 (197)
T PF06247_consen 154 VCKEGFPGDGEG 165 (197)
T ss_dssp EE-TT-EEETTT
T ss_pred ecCCCCCCCCCc
Confidence 999999876544
No 30
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.01 E-value=0.00068 Score=42.96 Aligned_cols=28 Identities=50% Similarity=1.163 Sum_probs=23.2
Q ss_pred CCeEeeCCCceEeeCCCCCeeCCCCCcc
Q psy11799 142 PGTCINTMGSFRCLCPRGFKLDRTGTFC 169 (483)
Q Consensus 142 ~g~C~~~~g~~~C~C~~G~~g~~~g~~C 169 (483)
...|++.+++|+|.|++||++..+++.|
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCCccCCCceEeECCCCCEECcCCCCC
Confidence 5699999999999999999999888754
No 31
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.99 E-value=0.0012 Score=42.75 Aligned_cols=31 Identities=61% Similarity=1.426 Sum_probs=26.5
Q ss_pred CCcCCC-CCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCe
Q psy11799 314 DNECAR-NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352 (483)
Q Consensus 314 ~~~C~~-~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~ 352 (483)
+++|.. .+|.+ +++|++..++|.|.|++||.
T Consensus 2 ~~~C~~~~~C~~--------~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 2 IDECASGNPCQN--------GGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred cccCcCCCCcCC--------CCEeECCCCCeEeECCCCCc
Confidence 466776 67877 88999999999999999995
No 32
>KOG1226|consensus
Probab=96.94 E-value=0.0024 Score=67.10 Aligned_cols=94 Identities=14% Similarity=0.154 Sum_probs=53.4
Q ss_pred ceeeeccCcceeeecccCcceeeeecccccc--CceEEEeeCceeEEEEEEEEEEEEEeecccCc----ccccc-ccccc
Q psy11799 57 ASVSTRKGLTGVSVTRDTSYTMAILNVEQDK--QGILIFSMLQTRYFNLFYVIIFYVISLKKKKF----YSVFL-DMDEC 129 (483)
Q Consensus 57 ~~C~c~~g~~g~~C~~~~~~~~~~~~~~~~~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~C~-~~~~C 129 (483)
+.|.|.+||.|..|+++..-.........|. ++...+ .+.+...| ..+.|....+ |..|+ |...|
T Consensus 478 G~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vC-SgrG~C~C-------GqC~C~~~~~~~i~G~fCECDnfsC 549 (783)
T KOG1226|consen 478 GQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVC-SGRGDCVC-------GQCVCHKPDNGKIYGKFCECDNFSC 549 (783)
T ss_pred cceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCc-CCCCcEeC-------CceEecCCCCCceeeeeeeccCccc
Confidence 6999999999999999755433311111121 111000 11111111 1223444444 77787 55566
Q ss_pred ccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCC
Q psy11799 130 KEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDR 164 (483)
Q Consensus 130 ~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~ 164 (483)
... ....|..+|+|.=. +|+|.+||+|..
T Consensus 550 ~r~--~g~lC~g~G~C~CG----~CvC~~GwtG~~ 578 (783)
T KOG1226|consen 550 ERH--KGVLCGGHGRCECG----RCVCNPGWTGSA 578 (783)
T ss_pred ccc--cCcccCCCCeEeCC----cEEcCCCCccCC
Confidence 653 33349999999765 599999999543
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.30 E-value=0.0066 Score=38.03 Aligned_cols=26 Identities=54% Similarity=1.269 Sum_probs=23.3
Q ss_pred CCCCCCeEeeCCCCeeeeCCCCcccC
Q psy11799 435 VCPKPGTCINTMGSFRCLCPRGFKLD 460 (483)
Q Consensus 435 ~C~~~g~C~~~~g~~~C~C~~G~~~~ 460 (483)
+|.++++|++..++|+|.|+.||.+.
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCccc
Confidence 78899999999999999999999543
No 34
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.24 E-value=0.0073 Score=37.96 Aligned_cols=24 Identities=50% Similarity=1.108 Sum_probs=21.8
Q ss_pred CCCCCCeEeeCCCCeeeeCCCCccc
Q psy11799 435 VCPKPGTCINTMGSFRCLCPRGFKL 459 (483)
Q Consensus 435 ~C~~~g~C~~~~g~~~C~C~~G~~~ 459 (483)
+|.++ +|++..++|+|.|++||.+
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCcc
Confidence 78888 9999999999999999954
No 35
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.23 E-value=0.0075 Score=38.40 Aligned_cols=32 Identities=59% Similarity=1.340 Sum_probs=25.9
Q ss_pred CCcCCC-CCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCee
Q psy11799 314 DNECAR-NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQF 353 (483)
Q Consensus 314 ~~~C~~-~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~ 353 (483)
+++|.. .+|.+ ++.|++..++|.|.|++||.|
T Consensus 2 ~~~C~~~~~C~~--------~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 2 IDECASGNPCQN--------GGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred cccCCCCCCcCC--------CCEeECCCCCeEeECCCCCcC
Confidence 455665 57766 789999999999999999954
No 36
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.94 E-value=0.0017 Score=31.18 Aligned_cols=13 Identities=23% Similarity=0.409 Sum_probs=8.1
Q ss_pred EEEeCCCCeeeEe
Q psy11799 32 VWSLASQWTGEIL 44 (483)
Q Consensus 32 ~C~C~~g~~G~~C 44 (483)
+|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4777777777654
No 37
>KOG1836|consensus
Probab=95.10 E-value=0.24 Score=58.34 Aligned_cols=67 Identities=27% Similarity=0.545 Sum_probs=36.2
Q ss_pred chhccCCCCCCCeEEeCCCCeEE-eCCCCceeCCCCCcccccCcccCCCCCCCCeeeeCC--CCceEe-CCCCcee
Q psy11799 192 NECLELSNQCAFRCHNVPGSFRC-ICPYGYALAPDGRHCIDINECEQFPGLCAHTCTNTE--GSYTCG-CHPGFEL 263 (483)
Q Consensus 192 ~~C~~~~~~C~~~C~~~~gs~~C-~C~~G~~~~~~g~~C~~~~~C~~~~~~~~~~C~~~~--g~~~C~-C~~G~~~ 263 (483)
+.|.+....|. |......-+| +|..||...++... .+.|..-+|..++.|..+. ....|. |++||+|
T Consensus 738 ~~Cd~~tG~C~--C~~~t~G~~C~~C~~GfYg~~~~~~---~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG 808 (1705)
T KOG1836|consen 738 NICDPRTGQCK--CKHNTFGGQCAQCVDGFYGLPDLGT---SGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTG 808 (1705)
T ss_pred ccccCCCCcee--cccCCCCCchhhhcCCCCCccccCC---CCCCccCCCCCChhhcCcCcccceecCCCCCCCcc
Confidence 44444444332 4444444455 58888874332211 1225555555566665543 445688 9999983
No 38
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.08 E-value=0.04 Score=34.28 Aligned_cols=22 Identities=59% Similarity=1.340 Sum_probs=19.9
Q ss_pred CCEEeeCCCCeeEECCCCCeec
Q psy11799 333 GGKCFNTAGSYRCGCPDGFQFE 354 (483)
Q Consensus 333 ~g~C~~~~g~~~C~C~~G~~~~ 354 (483)
++.|++..++|.|.|+.||.+.
T Consensus 11 ~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 11 GGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCEEecCCCCeEeECCCCCccc
Confidence 7899999999999999999653
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.94 E-value=0.032 Score=34.30 Aligned_cols=26 Identities=42% Similarity=1.125 Sum_probs=21.0
Q ss_pred CCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799 435 VCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 466 (483)
Q Consensus 435 ~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C 466 (483)
.|.++|+|+... .+|+|.+|| .|..|
T Consensus 7 ~C~~~G~C~~~~--g~C~C~~g~----~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPC--GRCVCDSGY----TGPDC 32 (32)
T ss_pred ccCCCCEEeCCC--CEEECCCCC----cCCCC
Confidence 699999999763 479999999 56554
No 40
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.82 E-value=0.05 Score=34.01 Aligned_cols=20 Identities=55% Similarity=1.315 Sum_probs=18.5
Q ss_pred CCEEeeCCCCeeEECCCCCee
Q psy11799 333 GGKCFNTAGSYRCGCPDGFQF 353 (483)
Q Consensus 333 ~g~C~~~~g~~~C~C~~G~~~ 353 (483)
+ .|++..++|.|.|++||.+
T Consensus 11 ~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 11 G-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred C-EEECCCCCeEeECCCCCcc
Confidence 6 9999999999999999965
No 41
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.58 E-value=0.048 Score=33.55 Aligned_cols=23 Identities=39% Similarity=1.091 Sum_probs=19.4
Q ss_pred CCCCCeEeeCCCceEeeCCCCCeeC
Q psy11799 139 CPKPGTCINTMGSFRCLCPRGFKLD 163 (483)
Q Consensus 139 C~~~g~C~~~~g~~~C~C~~G~~g~ 163 (483)
|+++|+|+...+ +|+|.+||+|.
T Consensus 8 C~~~G~C~~~~g--~C~C~~g~~G~ 30 (32)
T PF07974_consen 8 CSGHGTCVSPCG--RCVCDSGYTGP 30 (32)
T ss_pred cCCCCEEeCCCC--EEECCCCCcCC
Confidence 999999997633 89999999953
No 42
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.99 E-value=0.06 Score=50.16 Aligned_cols=42 Identities=38% Similarity=0.819 Sum_probs=34.6
Q ss_pred ccccccchhccCCCCCCCeEEeCCCCeEEeCCCCceeCCCCC
Q psy11799 186 QTCEDINECLELSNQCAFRCHNVPGSFRCICPYGYALAPDGR 227 (483)
Q Consensus 186 ~~C~~~~~C~~~~~~C~~~C~~~~gs~~C~C~~G~~~~~~g~ 227 (483)
..|.++++|...++.|...|.+..|+|.|.|.+||++..+++
T Consensus 182 ~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~ 223 (224)
T cd01475 182 KICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNK 223 (224)
T ss_pred ccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCC
Confidence 456677778777777888999999999999999999766654
No 43
>smart00051 DSL delta serrate ligand.
Probab=92.93 E-value=0.09 Score=38.07 Aligned_cols=40 Identities=13% Similarity=0.063 Sum_probs=30.3
Q ss_pred ceEEEeCCCCeeeEee-ecCCC-------CcccCCceeeeccCcceeee
Q psy11799 30 IFVWSLASQWTGEILT-NVSPY-------PTSVNMASVSTRKGLTGVSV 70 (483)
Q Consensus 30 ~~~C~C~~g~~G~~C~-~c~~~-------~~~~~~~~C~c~~g~~g~~C 70 (483)
.+.-.|.++|.|..|+ .|.+. .|.. .+.+.|.+||+|..|
T Consensus 16 ~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCEecCCCCcCCCC
Confidence 3568899999999998 36542 1433 468999999999876
No 44
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.94 E-value=0.18 Score=46.98 Aligned_cols=42 Identities=31% Similarity=0.641 Sum_probs=34.1
Q ss_pred ccccccccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCC
Q psy11799 120 YSVFLDMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTG 166 (483)
Q Consensus 120 ~~~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g 166 (483)
+..|.++++|... ++. |. ..|.++.|+|.|.|++||++.+.+
T Consensus 181 ~~~C~~~~~C~~~-~~~--c~--~~C~~~~g~~~c~c~~g~~~~~~~ 222 (224)
T cd01475 181 GKICVVPDLCATL-SHV--CQ--QVCISTPGSYLCACTEGYALLEDN 222 (224)
T ss_pred cccCcCchhhcCC-CCC--cc--ceEEcCCCCEEeECCCCccCCCCC
Confidence 5678889999865 544 83 589999999999999999876554
No 45
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.35 E-value=0.23 Score=31.40 Aligned_cols=25 Identities=36% Similarity=0.754 Sum_probs=19.7
Q ss_pred CCCCCeEeeCC-CceEeeCCCCCeeC
Q psy11799 139 CPKPGTCINTM-GSFRCLCPRGFKLD 163 (483)
Q Consensus 139 C~~~g~C~~~~-g~~~C~C~~G~~g~ 163 (483)
|..++.|++.. |++.|+|..||..+
T Consensus 7 cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 7 CPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp --TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred CCCCcccEEcCCCCEEEEeeCCcccc
Confidence 88899999885 99999999999854
No 46
>KOG1218|consensus
Probab=86.20 E-value=29 Score=33.74 Aligned_cols=15 Identities=20% Similarity=0.527 Sum_probs=11.8
Q ss_pred CCeeEeCCCCeeecC
Q psy11799 382 SGFSCECPSGYQRIG 396 (483)
Q Consensus 382 ~~~~C~C~~Gy~g~~ 396 (483)
....|.|++||.+..
T Consensus 160 ~~~~c~c~~g~~g~~ 174 (316)
T KOG1218|consen 160 KNGICTCQPGFVGVF 174 (316)
T ss_pred CCCceeccCCccccc
Confidence 456788999998875
No 47
>smart00051 DSL delta serrate ligand.
Probab=85.12 E-value=1.2 Score=32.31 Aligned_cols=44 Identities=20% Similarity=0.413 Sum_probs=30.3
Q ss_pred eeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCee
Q psy11799 294 YKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQF 353 (483)
Q Consensus 294 ~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~ 353 (483)
+.-.|.++|.|..|.+.|...+... . +.+|.. .| .++|.+||.|
T Consensus 17 ~rv~C~~~~yG~~C~~~C~~~~d~~-----~--------~~~Cd~-~G--~~~C~~Gw~G 60 (63)
T smart00051 17 IRVTCDENYYGEGCNKFCRPRDDFF-----G--------HYTCDE-NG--NKGCLEGWMG 60 (63)
T ss_pred EEeeCCCCCcCCccCCEeCcCcccc-----C--------CccCCc-CC--CEecCCCCcC
Confidence 4456899999998887776544332 2 456654 34 7899999954
No 48
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=82.94 E-value=1.1 Score=28.28 Aligned_cols=22 Identities=32% Similarity=0.726 Sum_probs=18.4
Q ss_pred CCCEEeeCC-CCeeEECCCCCee
Q psy11799 332 RGGKCFNTA-GSYRCGCPDGFQF 353 (483)
Q Consensus 332 ~~g~C~~~~-g~~~C~C~~G~~~ 353 (483)
.++.|++.. |++.|.|..||..
T Consensus 9 ~NA~C~~~~dG~eecrCllgyk~ 31 (37)
T PF12946_consen 9 ANAGCFRYDDGSEECRCLLGYKK 31 (37)
T ss_dssp TTEEEEEETTSEEEEEE-TTEEE
T ss_pred CCcccEEcCCCCEEEEeeCCccc
Confidence 388999877 8999999999976
No 49
>KOG3512|consensus
Probab=79.19 E-value=7.6 Score=39.27 Aligned_cols=112 Identities=20% Similarity=0.292 Sum_probs=59.0
Q ss_pred CCEEee----CCCCeeEECCCCCeeccCCCceeeCCCccCCCCCCCC----cEECCCCCeeEeCCCCeeecCCcceeccc
Q psy11799 333 GGKCFN----TAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFG----CTPLGISGFSCECPSGYQRIGQGHCLATI 404 (483)
Q Consensus 333 ~g~C~~----~~g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~----c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~ 404 (483)
+|+|++ +.|.+-=.|.+||.-+.... -.... .|..-.|+.. =++.. .+..|.|.+|-+|....
T Consensus 358 ggvClnCrHnTaGrhChyCreGyyRd~s~p-l~hrk-aCk~CdChpVGs~gktCNq-~tGqCpCkeGvtG~tCn------ 428 (592)
T KOG3512|consen 358 GGVCLNCRHNTAGRHCHYCREGYYRDGSKP-LTHRK-ACKACDCHPVGSAGKTCNQ-TTGQCPCKEGVTGLTCN------ 428 (592)
T ss_pred cceEeecccCCCCcccccccCccccCCCCC-Cchhh-hhhhcCCcccccccccccc-cCCcccCCCCCcccccc------
Confidence 456654 33433335899996542211 11111 3444455542 23333 66789999999987643
Q ss_pred CCCCCCCcccCCCCCcccccccccCC-CCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCC
Q psy11799 405 SSGLYPGFRSYERPNLGDIDECKEEI-NGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCL 467 (483)
Q Consensus 405 ~~~C~~g~~g~~~~~C~~~~~C~~~~-~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~ 467 (483)
.|.+||.....+++ .|...+ ..+..++++.+ ...+.+.|+.++ .++.+.
T Consensus 429 --rCa~gyqqsrs~va----pcik~p~~~~~~~~s~ve----~qd~~s~Ck~~~----~~~r~n 478 (592)
T KOG3512|consen 429 --RCAPGYQQSRSPVA----PCIKIPTDAPTLGSSGVE----PQDQCSKCKASP----GGKRLN 478 (592)
T ss_pred --cccchhhcccCCCc----CceecCCCCccccCCCCc----chhccccCCCCC----cceecc
Confidence 46778876553322 222111 11124555554 233457888888 555554
No 50
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=78.09 E-value=2.5 Score=26.08 Aligned_cols=25 Identities=36% Similarity=0.733 Sum_probs=16.6
Q ss_pred CCCCCCcEECCCCCeeEeCCCCeeecC
Q psy11799 370 SPCSFGCTPLGISGFSCECPSGYQRIG 396 (483)
Q Consensus 370 ~~C~~~c~~~~~~~~~C~C~~Gy~g~~ 396 (483)
..|...|... ..+.|.||+||..+.
T Consensus 6 t~CpA~CDpn--~~~~C~CPeGyIlde 30 (34)
T PF09064_consen 6 TECPADCDPN--SPGQCFCPEGYILDE 30 (34)
T ss_pred ccCCCccCCC--CCCceeCCCceEecC
Confidence 3344434332 567999999998765
No 51
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=76.97 E-value=0.34 Score=35.04 Aligned_cols=39 Identities=13% Similarity=0.107 Sum_probs=19.1
Q ss_pred eEEEeCCCCeeeEeee-cCCCC-------cccCCceeeeccCcceeee
Q psy11799 31 FVWSLASQWTGEILTN-VSPYP-------TSVNMASVSTRKGLTGVSV 70 (483)
Q Consensus 31 ~~C~C~~g~~G~~C~~-c~~~~-------~~~~~~~C~c~~g~~g~~C 70 (483)
+...|...|.|..|++ |.+.- |.. .+.=.|.+||+|..|
T Consensus 17 ~rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~-~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSKFCKPRDDSFGHYTCDS-NGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-EE---EEETTEEEEE-S-S--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccCCcCCCcCCcCCcccCC-CCCCCCCCCCcCCCC
Confidence 5578999999999994 66541 332 246678999999876
No 52
>KOG1218|consensus
Probab=75.60 E-value=62 Score=31.34 Aligned_cols=16 Identities=25% Similarity=0.439 Sum_probs=11.2
Q ss_pred CeeEeCCCCceeCCCC
Q psy11799 293 GYKCGCPEGYVRHPFS 308 (483)
Q Consensus 293 ~~~C~C~~G~~g~~~~ 308 (483)
...|.+..+|.+..+.
T Consensus 48 ~~~~~~~~~~~~~~c~ 63 (316)
T KOG1218|consen 48 SGECGLGYGFVGSVCR 63 (316)
T ss_pred ceeEecccccCCCccc
Confidence 4467778888877654
No 53
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.01 E-value=3 Score=34.29 Aligned_cols=41 Identities=29% Similarity=0.703 Sum_probs=28.1
Q ss_pred cccccccCCCCccCCCCCCeEee--CCCCeeeeCCCCcccCCCCCCCCCC
Q psy11799 422 DIDECKEEINGVAVCPKPGTCIN--TMGSFRCLCPRGFKLDRTGTFCLDQ 469 (483)
Q Consensus 422 ~~~~C~~~~~~~~~C~~~g~C~~--~~g~~~C~C~~G~~~~~~g~~C~~~ 469 (483)
++..|...-. +-|.+| +|.- ....+.|.|..|| .|..|+..
T Consensus 41 ~i~~Cp~ey~--~YClHG-~C~yI~dl~~~~CrC~~GY----tGeRCEh~ 83 (139)
T PHA03099 41 AIRLCGPEGD--GYCLHG-DCIHARDIDGMYCRCSHGY----TGIRCQHV 83 (139)
T ss_pred ccccCChhhC--CEeECC-EEEeeccCCCceeECCCCc----ccccccce
Confidence 3455544321 268764 8973 4467899999999 88888854
No 54
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=72.69 E-value=1.1 Score=30.43 Aligned_cols=21 Identities=14% Similarity=-0.026 Sum_probs=12.5
Q ss_pred CceEEEeCCCCeeeEeeecCC
Q psy11799 29 GIFVWSLASQWTGEILTNVSP 49 (483)
Q Consensus 29 g~~~C~C~~g~~G~~C~~c~~ 49 (483)
...+|.|+++|+|+.|+.|.+
T Consensus 16 ~~G~C~C~~~~~G~~C~~C~~ 36 (49)
T PF00053_consen 16 STGQCVCKPGTTGPRCDQCKP 36 (49)
T ss_dssp TCEEESBSTTEESTTS-EE-T
T ss_pred CCCEEeccccccCCcCcCCCC
Confidence 456677777777777766553
No 55
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=71.96 E-value=2.6 Score=28.25 Aligned_cols=20 Identities=10% Similarity=-0.071 Sum_probs=13.7
Q ss_pred CceEEEeCCCCeeeEeeecC
Q psy11799 29 GIFVWSLASQWTGEILTNVS 48 (483)
Q Consensus 29 g~~~C~C~~g~~G~~C~~c~ 48 (483)
...+|.|+++|+|+.|+.|.
T Consensus 16 ~~G~C~C~~~~~G~~C~~C~ 35 (46)
T smart00180 16 DTGQCECKPNVTGRRCDRCA 35 (46)
T ss_pred CCCEEECCCCCCCCCCCcCC
Confidence 35677777777777776554
No 56
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=69.57 E-value=2.9 Score=28.55 Aligned_cols=20 Identities=15% Similarity=0.022 Sum_probs=12.2
Q ss_pred CceEEEeCCCCeeeEeeecC
Q psy11799 29 GIFVWSLASQWTGEILTNVS 48 (483)
Q Consensus 29 g~~~C~C~~g~~G~~C~~c~ 48 (483)
...+|.|+++|+|..|+.|.
T Consensus 17 ~~G~C~C~~~~~G~~C~~C~ 36 (50)
T cd00055 17 GTGQCECKPNTTGRRCDRCA 36 (50)
T ss_pred CCCEEeCCCcCCCCCCCCCC
Confidence 44566666666666665543
No 57
>PHA02887 EGF-like protein; Provisional
Probab=68.91 E-value=5.4 Score=32.27 Aligned_cols=30 Identities=30% Similarity=0.841 Sum_probs=23.3
Q ss_pred CCCCCCeEe--eCCCCeeeeCCCCcccCCCCCCCCCC
Q psy11799 435 VCPKPGTCI--NTMGSFRCLCPRGFKLDRTGTFCLDQ 469 (483)
Q Consensus 435 ~C~~~g~C~--~~~g~~~C~C~~G~~~~~~g~~C~~~ 469 (483)
-|. +|+|. .....+.|.|+.|| +|..|+.+
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GY----tG~RCE~v 124 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGY----TGIRCDEV 124 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCc----ccCCCCcc
Confidence 577 57997 34456899999999 78888754
No 58
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=57.51 E-value=9.2 Score=30.97 Aligned_cols=32 Identities=34% Similarity=0.786 Sum_probs=23.8
Q ss_pred cccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCe
Q psy11799 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK 161 (483)
Q Consensus 125 ~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~ 161 (483)
..+.|.. .+. |..+|.|.. ..+..|.|.+||.
T Consensus 76 p~d~Cd~--y~~--CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 76 PKDQCDV--YGF--CGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred cccCCCC--ccc--cCCccEeCC-CCCCceECCCCcC
Confidence 3467775 344 999999964 3456799999996
No 59
>PHA02887 EGF-like protein; Provisional
Probab=55.35 E-value=5.9 Score=32.04 Aligned_cols=17 Identities=12% Similarity=-0.074 Sum_probs=14.1
Q ss_pred ceEEEeCCCCeeeEeee
Q psy11799 30 IFVWSLASQWTGEILTN 46 (483)
Q Consensus 30 ~~~C~C~~g~~G~~C~~ 46 (483)
...|.|..||+|.+|+.
T Consensus 107 epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 107 EKFCICNKGYTGIRCDE 123 (126)
T ss_pred CceeECCCCcccCCCCc
Confidence 37899999999998873
No 60
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=53.00 E-value=38 Score=23.01 Aligned_cols=17 Identities=47% Similarity=1.165 Sum_probs=14.0
Q ss_pred CCEEeeCCCCeeEECCCCCee
Q psy11799 333 GGKCFNTAGSYRCGCPDGFQF 353 (483)
Q Consensus 333 ~g~C~~~~g~~~C~C~~G~~~ 353 (483)
++.|++. +|.|++||..
T Consensus 31 ~s~C~~g----~C~C~~g~~~ 47 (52)
T PF01683_consen 31 GSVCVNG----RCQCPPGYVE 47 (52)
T ss_pred cCEEcCC----EeECCCCCEe
Confidence 7788664 9999999965
No 61
>PF01724 DUF29: Domain of unknown function DUF29; InterPro: IPR002636 This entry is represented by Ralstonia phage RSS1, Orf11. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of various hypothetical proteins from cyanobacteria, none of which are functionally described. The aligned region is approximately 120-140 amino acids long corresponding to almost the entire length of the proteins in the family.; PDB: 3FCN_A.
Probab=49.43 E-value=7.7 Score=33.06 Aligned_cols=19 Identities=37% Similarity=0.576 Sum_probs=15.4
Q ss_pred ccccccccchhHHHHHhhh
Q psy11799 3 LLKARKLETFDRELLIELL 21 (483)
Q Consensus 3 ~~~~~~~~~~~~~~~~~~~ 21 (483)
|||+|++..+|.|+|+|+.
T Consensus 16 lLr~~~~~~lD~enLiEEi 34 (139)
T PF01724_consen 16 LLREGDFDALDWENLIEEI 34 (139)
T ss_dssp HHHHSSSTT--HHHHHHHH
T ss_pred HHHcCCChHhhHHHHHHHH
Confidence 7999999999999998875
No 62
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=48.12 E-value=17 Score=30.01 Aligned_cols=36 Identities=28% Similarity=0.704 Sum_probs=23.1
Q ss_pred cccccccCCCCccCCCCCCeEeeC--CCceEeeCCCCCeeC
Q psy11799 125 DMDECKEEINGVAVCPKPGTCINT--MGSFRCLCPRGFKLD 163 (483)
Q Consensus 125 ~~~~C~~~~~~~~~C~~~g~C~~~--~g~~~C~C~~G~~g~ 163 (483)
++.+|.....+- |- ||+|.-. ...+.|.|..||+|.
T Consensus 41 ~i~~Cp~ey~~Y--Cl-HG~C~yI~dl~~~~CrC~~GYtGe 78 (139)
T PHA03099 41 AIRLCGPEGDGY--CL-HGDCIHARDIDGMYCRCSHGYTGI 78 (139)
T ss_pred ccccCChhhCCE--eE-CCEEEeeccCCCceeECCCCcccc
Confidence 345565432233 63 4588754 467899999999944
No 63
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=42.39 E-value=20 Score=28.98 Aligned_cols=23 Identities=35% Similarity=0.810 Sum_probs=18.9
Q ss_pred CCCCCCeEeeCCCCeeeeCCCCcc
Q psy11799 435 VCPKPGTCINTMGSFRCLCPRGFK 458 (483)
Q Consensus 435 ~C~~~g~C~~~~g~~~C~C~~G~~ 458 (483)
.|+..|.|.. ..+..|.|.+||+
T Consensus 85 ~CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 85 FCGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred ccCCccEeCC-CCCCceECCCCcC
Confidence 7999999963 4566799999994
No 64
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=39.38 E-value=21 Score=28.54 Aligned_cols=44 Identities=27% Similarity=0.696 Sum_probs=29.0
Q ss_pred cccccCCCCccCCCCCCeEeeCC-----CCeeeeCCCCcccC---------CCCCCCCCCc
Q psy11799 424 DECKEEINGVAVCPKPGTCINTM-----GSFRCLCPRGFKLD---------RTGTFCLDQD 470 (483)
Q Consensus 424 ~~C~~~~~~~~~C~~~g~C~~~~-----g~~~C~C~~G~~~~---------~~g~~C~~~~ 470 (483)
+.|....+ .|..||.|++.. .=|.|.|.+.+... ..|..|+..|
T Consensus 6 ~aC~~~Tn---~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkD 63 (103)
T PF12955_consen 6 DACENATN---NCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKD 63 (103)
T ss_pred HHHHHhcc---CCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccccc
Confidence 44554433 799999999762 34999999965332 2566776544
No 65
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=38.69 E-value=23 Score=28.26 Aligned_cols=33 Identities=24% Similarity=0.720 Sum_probs=23.9
Q ss_pred cccccCCCCccCCCCCCeEeeCC-----CceEeeCCCCCee
Q psy11799 127 DECKEEINGVAVCPKPGTCINTM-----GSFRCLCPRGFKL 162 (483)
Q Consensus 127 ~~C~~~~~~~~~C~~~g~C~~~~-----g~~~C~C~~G~~g 162 (483)
+.|... .+. |+.+|.|+... .=|.|.|.+.+..
T Consensus 6 ~aC~~~-Tn~--CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 6 DACENA-TNN--CSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred HHHHHh-ccC--CCCCceEeeccCCCccceEEEEeeccccc
Confidence 445544 555 99999999872 3499999996653
No 66
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=36.86 E-value=2.5e+02 Score=31.52 Aligned_cols=120 Identities=23% Similarity=0.571 Sum_probs=58.3
Q ss_pred eEe-CCCCceecCCCCCccccCccccCCCCcccccccCCCCeeEeCCCCceeCCCCCccccCCcCCCC--CCCCCCCCCC
Q psy11799 254 TCG-CHPGFELSSDGSGTCLDSDECATGRHICQQICNLLGGYKCGCPEGYVRHPFSNECIDDNECARN--PCEPSSNTTG 330 (483)
Q Consensus 254 ~C~-C~~G~~~~~~g~~~C~~~~~C~~~~~~c~~~C~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~--~C~~~~~~~c 330 (483)
.|. |..+|.+...+. .|+. +|.... . .+. ...+..+|.|..||.....+..|.....|... -|.. |
T Consensus 647 ~C~~C~~~~~ls~~~~-~C~~--~C~~~~-~-~~~-~~~~~~~C~C~~g~~p~~~~~~C~~~~~C~~~~~gC~~-----C 715 (800)
T PTZ00214 647 FCTECAGELFVSLDGQ-SCLE--ECTGDK-V-VGE-VSGGVRRCWCERGFLPALDRSGCVLPTECPPDMPSCAA-----C 715 (800)
T ss_pred CcCcCCCCceecCcCC-chhh--cCCCcc-c-ccc-ccCCcceeEecCCcccccCCCccccccCCCcccccccc-----c
Confidence 464 777776655444 4543 332110 0 000 00124589999999977766666654445321 1211 1
Q ss_pred CCCCEEeeCCCCeeEECCCCCeeccCCCceeeCCCccCCCCCCCCcEECCCCCeeEeCCCCeeecCCcceecc
Q psy11799 331 ERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFGCTPLGISGFSCECPSGYQRIGQGHCLAT 403 (483)
Q Consensus 331 ~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~c~~~~~~~~~C~C~~Gy~g~~~g~C~~~ 403 (483)
...++|+ .|. ..++.....++.|.. . |..+- . .....|+|..||... .+.|++.
T Consensus 716 ~~~g~C~------~C~-~~~~~vq~~~~~C~~---~-----C~~~~--~-~~~~vC~C~~g~~l~-~~~c~~~ 769 (800)
T PTZ00214 716 DESGRCL------LCV-TSGHNVQVDQRTCAE---G-----CGARA--S-SNQGVCMCELDAVLT-KGVCVPA 769 (800)
T ss_pred CCCCcee------ecc-ccCcccccCCCcccc---C-----CCCCc--c-ccCCeEEeCCcceec-CCeeEec
Confidence 1244554 232 223322224444422 1 11111 1 256799999999764 4567653
No 67
>KOG3516|consensus
Probab=35.24 E-value=29 Score=39.46 Aligned_cols=40 Identities=30% Similarity=0.782 Sum_probs=32.6
Q ss_pred cccccccccCCCCccCCCCCCeEeeCCCCeeeeCC-CCcccCCCCCCCCC
Q psy11799 420 LGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCP-RGFKLDRTGTFCLD 468 (483)
Q Consensus 420 C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~-~G~~~~~~g~~C~~ 468 (483)
|.-++.|..+ +|.++|.|.-.-..|.|.|. .|| .|.+|..
T Consensus 542 C~i~drClPN-----~CehgG~C~Qs~~~f~C~C~~TGY----~GatCHt 582 (1306)
T KOG3516|consen 542 CGISDRCLPN-----PCEHGGKCSQSWDDFECNCELTGY----KGATCHT 582 (1306)
T ss_pred cccccccCCc-----cccCCCcccccccceeEecccccc----ccccccC
Confidence 5555666654 99999999987788999999 999 7888864
No 68
>KOG3512|consensus
Probab=34.09 E-value=1.5e+02 Score=30.43 Aligned_cols=53 Identities=32% Similarity=0.755 Sum_probs=31.3
Q ss_pred eeEe-CCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCee
Q psy11799 294 YKCG-CPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQF 353 (483)
Q Consensus 294 ~~C~-C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~ 353 (483)
..|. |++||..+.-. .-.+...|..-.|++ ....+.+|..+.| +|.|.+|-+|
T Consensus 371 rhChyCreGyyRd~s~-pl~hrkaCk~CdChp----VGs~gktCNq~tG--qCpCkeGvtG 424 (592)
T KOG3512|consen 371 RHCHYCREGYYRDGSK-PLTHRKACKACDCHP----VGSAGKTCNQTTG--QCPCKEGVTG 424 (592)
T ss_pred cccccccCccccCCCC-CCchhhhhhhcCCcc----cccccccccccCC--cccCCCCCcc
Confidence 3465 99999876432 222333454444554 2223567877766 8999998833
No 69
>KOG3516|consensus
Probab=33.72 E-value=34 Score=38.96 Aligned_cols=41 Identities=34% Similarity=0.881 Sum_probs=34.0
Q ss_pred ccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECC-CCCeeccCCCceee
Q psy11799 310 ECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCP-DGFQFEAALQLCTQ 362 (483)
Q Consensus 310 ~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~-~G~~~~~~g~~C~~ 362 (483)
.|...+.|.+++|++ +|.|......|.|.|. .|| .|.+|..
T Consensus 541 ~C~i~drClPN~Ceh--------gG~C~Qs~~~f~C~C~~TGY----~GatCHt 582 (1306)
T KOG3516|consen 541 MCGISDRCLPNPCEH--------GGKCSQSWDDFECNCELTGY----KGATCHT 582 (1306)
T ss_pred ccccccccCCccccC--------CCcccccccceeEecccccc----ccccccC
Confidence 355677889999999 9999998899999998 899 5666653
No 70
>KOG0196|consensus
Probab=30.35 E-value=54 Score=36.04 Aligned_cols=58 Identities=24% Similarity=0.564 Sum_probs=37.3
Q ss_pred eeEeCCCCeeecC-CcceecccCCCCCCCcccCCCCCcccccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcc
Q psy11799 384 FSCECPSGYQRIG-QGHCLATISSGLYPGFRSYERPNLGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK 458 (483)
Q Consensus 384 ~~C~C~~Gy~g~~-~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~ 458 (483)
..|.|.+||.-.. ...|. .|++|+.-.. .....|. +|+.+..= ...++-.|.|..||+
T Consensus 259 G~C~C~aGye~~~~~~~C~-----aCp~G~yK~~----~~~~~C~-------~CP~~S~s-~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQ-----ACPPGTYKAS----QGDSLCL-------PCPPNSHS-SSEGATSCTCENGYY 317 (996)
T ss_pred CceeecCCCCcccCCCcce-----eCCCCcccCC----CCCCCCC-------CCCCCCCC-CCCCCCcccccCCcc
Confidence 4699999998632 33343 6888876433 1223343 67666552 356778899999995
No 71
>KOG3514|consensus
Probab=28.90 E-value=35 Score=38.38 Aligned_cols=35 Identities=37% Similarity=0.973 Sum_probs=29.7
Q ss_pred cCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECC-CCCeeccCCCceee
Q psy11799 316 ECARNPCEPSSNTTGERGGKCFNTAGSYRCGCP-DGFQFEAALQLCTQ 362 (483)
Q Consensus 316 ~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~-~G~~~~~~g~~C~~ 362 (483)
.|..+||+| +|+|....+.|.|.|. .+| .|+.|++
T Consensus 625 ~C~~nPC~N--------~g~C~egwNrfiCDCs~T~~----~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQN--------GGKCSEGWNRFICDCSGTGF----EGRTCER 660 (1591)
T ss_pred ccCCCcccC--------CCCccccccccccccccCcc----cCccccc
Confidence 588889998 9999999999999997 688 6677763
No 72
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=25.40 E-value=24 Score=24.48 Aligned_cols=25 Identities=24% Similarity=0.272 Sum_probs=13.7
Q ss_pred cccccC-CCceEEEeCCCCeeeEeee
Q psy11799 22 FLINML-AGIFVWSLASQWTGEILTN 46 (483)
Q Consensus 22 ~~~~~~-~g~~~C~C~~g~~G~~C~~ 46 (483)
||..+. .|...|.|..-|.|++|++
T Consensus 26 flDg~~~dG~p~CECn~Cy~GpdCS~ 51 (56)
T PF04863_consen 26 FLDGLIADGSPVCECNSCYGGPDCST 51 (56)
T ss_dssp -TTS-EETTEE--EE-TTEESTTS-E
T ss_pred eeccccccCCccccccCCcCCCCccc
Confidence 444444 4557799999999998874
No 73
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=24.21 E-value=3.1e+02 Score=30.75 Aligned_cols=25 Identities=36% Similarity=0.719 Sum_probs=18.4
Q ss_pred eEEeCCCCceeCCCCCcccccCccc
Q psy11799 212 FRCICPYGYALAPDGRHCIDINECE 236 (483)
Q Consensus 212 ~~C~C~~G~~~~~~g~~C~~~~~C~ 236 (483)
-+|+|..||....++..|+....|.
T Consensus 682 ~~C~C~~g~~p~~~~~~C~~~~~C~ 706 (800)
T PTZ00214 682 RRCWCERGFLPALDRSGCVLPTECP 706 (800)
T ss_pred ceeEecCCcccccCCCccccccCCC
Confidence 4789999998777787886555554
No 74
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=23.44 E-value=60 Score=22.33 Aligned_cols=18 Identities=44% Similarity=1.176 Sum_probs=15.1
Q ss_pred EeCCCCeeecCCcceecc
Q psy11799 386 CECPSGYQRIGQGHCLAT 403 (483)
Q Consensus 386 C~C~~Gy~g~~~g~C~~~ 403 (483)
|.|++||..+..++||..
T Consensus 35 C~C~~G~v~~~~~~CV~~ 52 (55)
T PF01826_consen 35 CFCPPGYVRNDNGRCVPP 52 (55)
T ss_dssp EEETTTEEEETTSEEEEG
T ss_pred CCCCCCeeEcCCCCEEcH
Confidence 999999998877788764
Done!