Query psy9424
Match_columns 535
No_of_seqs 287 out of 2382
Neff 8.6
Searched_HMMs 46136
Date Fri Aug 16 18:59:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9424.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9424hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1217|consensus 99.4 1.7E-11 3.7E-16 131.2 24.0 275 41-483 97-389 (487)
2 KOG4289|consensus 99.4 8.6E-12 1.9E-16 137.1 17.7 110 144-315 1180-1316(2531)
3 KOG1214|consensus 99.4 6.2E-12 1.3E-16 132.5 15.3 123 143-315 734-862 (1289)
4 KOG1214|consensus 99.3 1.1E-11 2.3E-16 130.8 12.1 136 10-176 713-860 (1289)
5 KOG4289|consensus 99.3 2.8E-11 6.1E-16 133.2 13.5 85 257-345 1218-1303(2531)
6 KOG1217|consensus 99.2 2.5E-10 5.4E-15 122.2 18.8 246 12-345 151-418 (487)
7 KOG1219|consensus 99.1 1.4E-10 3E-15 132.2 8.3 107 245-402 3869-3976(4289)
8 KOG1219|consensus 99.1 1.6E-10 3.4E-15 131.8 8.1 110 144-315 3865-3977(4289)
9 KOG1225|consensus 99.0 3.4E-09 7.4E-14 111.0 12.2 105 14-178 235-341 (525)
10 KOG1225|consensus 98.9 1.7E-08 3.6E-13 105.9 13.9 121 130-340 267-387 (525)
11 KOG4260|consensus 98.6 5.7E-08 1.2E-12 91.0 5.4 71 219-310 231-304 (350)
12 KOG4260|consensus 98.4 4.4E-07 9.6E-12 85.1 5.7 137 246-400 150-306 (350)
13 KOG0994|consensus 98.3 2.8E-05 6.2E-10 85.8 18.3 106 165-315 831-948 (1758)
14 KOG0994|consensus 98.0 0.00018 3.9E-09 79.7 15.2 26 381-408 1078-1103(1758)
15 KOG1226|consensus 97.8 0.00025 5.4E-09 76.4 12.1 62 246-316 555-621 (783)
16 PF07645 EGF_CA: Calcium-bindi 97.7 2E-05 4.3E-10 54.1 2.0 33 32-64 2-35 (42)
17 PF00008 EGF: EGF-like domain 97.7 1.4E-05 3.1E-10 51.2 1.2 30 454-483 1-31 (32)
18 smart00179 EGF_CA Calcium-bind 97.6 9.6E-05 2.1E-09 49.6 3.9 35 451-485 2-38 (39)
19 PF07645 EGF_CA: Calcium-bindi 97.6 3.7E-05 8.1E-10 52.8 1.7 31 451-481 2-34 (42)
20 PF00008 EGF: EGF-like domain 97.5 5E-05 1.1E-09 48.7 1.3 28 38-65 3-31 (32)
21 KOG1226|consensus 97.4 0.0015 3.3E-08 70.5 12.0 60 246-315 514-580 (783)
22 smart00179 EGF_CA Calcium-bind 97.3 0.00037 8E-09 46.7 3.8 35 143-177 2-38 (39)
23 cd00054 EGF_CA Calcium-binding 97.2 0.0005 1.1E-08 45.5 3.8 35 451-485 2-37 (38)
24 PF12947 EGF_3: EGF domain; I 97.0 0.00026 5.7E-09 46.5 1.0 29 373-401 5-33 (36)
25 KOG1836|consensus 97.0 0.048 1E-06 65.7 20.2 49 131-179 760-813 (1705)
26 PF12947 EGF_3: EGF domain; I 97.0 0.0003 6.6E-09 46.3 1.0 29 38-66 5-33 (36)
27 cd00054 EGF_CA Calcium-binding 96.9 0.0014 3E-08 43.3 3.6 35 143-177 2-37 (38)
28 cd00053 EGF Epidermal growth f 96.8 0.0021 4.5E-08 41.7 3.8 30 456-485 5-35 (36)
29 smart00181 EGF Epidermal growt 96.7 0.0024 5.1E-08 41.6 3.7 28 457-485 6-34 (35)
30 PF12662 cEGF: Complement Clr- 96.6 0.0021 4.6E-08 37.9 2.6 24 300-323 1-24 (24)
31 cd00053 EGF Epidermal growth f 96.3 0.005 1.1E-07 39.9 3.6 28 148-175 5-32 (36)
32 PF06247 Plasmod_Pvs28: Plasmo 96.2 0.00096 2.1E-08 60.1 -0.7 106 149-315 50-165 (197)
33 PF12662 cEGF: Complement Clr- 96.2 0.0045 9.8E-08 36.5 2.4 24 53-87 1-24 (24)
34 PF06247 Plasmod_Pvs28: Plasmo 96.1 0.00087 1.9E-08 60.3 -1.4 146 154-401 10-163 (197)
35 smart00181 EGF Epidermal growt 96.1 0.0079 1.7E-07 39.1 3.4 28 149-177 6-34 (35)
36 PF07974 EGF_2: EGF-like domai 95.9 0.009 1.9E-07 38.1 2.9 26 39-66 6-31 (32)
37 PF07974 EGF_2: EGF-like domai 95.8 0.012 2.5E-07 37.6 3.1 27 457-485 6-32 (32)
38 PF12661 hEGF: Human growth fa 94.1 0.02 4.3E-07 28.6 0.5 13 473-485 1-13 (13)
39 smart00051 DSL delta serrate l 94.1 0.064 1.4E-06 40.2 3.5 45 14-66 18-62 (63)
40 PF14670 FXa_inhibition: Coagu 92.2 0.083 1.8E-06 34.7 1.4 21 293-313 11-31 (36)
41 PF14670 FXa_inhibition: Coagu 92.1 0.08 1.7E-06 34.8 1.3 25 39-65 6-30 (36)
42 KOG1836|consensus 90.8 0.46 9.9E-06 57.7 6.6 53 262-316 757-813 (1705)
43 smart00051 DSL delta serrate l 82.1 1.8 3.9E-05 32.4 3.3 43 131-177 20-63 (63)
44 PHA02887 EGF-like protein; Pro 82.0 1.4 2.9E-05 36.8 2.9 36 32-68 83-122 (126)
45 PF12946 EGF_MSP1_1: MSP1 EGF 78.2 0.72 1.6E-05 30.3 0.1 31 146-176 2-33 (37)
46 cd01475 vWA_Matrilin VWA_Matri 77.8 2.3 5E-05 40.6 3.6 38 442-484 181-220 (224)
47 cd01475 vWA_Matrilin VWA_Matri 76.6 2.8 6.1E-05 40.0 3.8 39 276-314 183-221 (224)
48 KOG1218|consensus 71.6 30 0.00065 34.6 10.1 56 2-66 81-136 (316)
49 PF00954 S_locus_glycop: S-loc 71.1 4.2 9E-05 34.1 3.1 32 32-64 77-108 (110)
50 PHA02887 EGF-like protein; Pro 69.5 4.8 0.0001 33.7 2.9 30 456-486 91-122 (126)
51 PHA03099 epidermal growth fact 69.5 4.7 0.0001 34.3 2.9 36 32-68 42-81 (139)
52 PF12946 EGF_MSP1_1: MSP1 EGF 67.7 2.3 5.1E-05 27.9 0.6 29 456-484 4-33 (37)
53 cd00055 EGF_Lam Laminin-type e 66.0 7 0.00015 27.6 2.9 17 390-406 20-36 (50)
54 PF00053 Laminin_EGF: Laminin 65.1 3.8 8.2E-05 28.7 1.4 26 380-407 11-36 (49)
55 PHA03099 epidermal growth fact 64.9 6.4 0.00014 33.5 2.8 30 456-486 50-81 (139)
56 cd00055 EGF_Lam Laminin-type e 64.9 8.5 0.00019 27.1 3.2 21 464-486 13-33 (50)
57 PF00954 S_locus_glycop: S-loc 63.9 7.8 0.00017 32.4 3.3 32 368-400 78-109 (110)
58 KOG1218|consensus 63.4 1.6E+02 0.0035 29.2 20.4 40 302-344 163-202 (316)
59 PF00053 Laminin_EGF: Laminin 62.1 5.7 0.00012 27.8 1.8 22 463-486 11-32 (49)
60 PF01414 DSL: Delta serrate li 55.2 2.2 4.9E-05 31.9 -1.3 48 11-66 15-62 (63)
61 smart00180 EGF_Lam Laminin-typ 52.6 15 0.00032 25.5 2.5 16 390-405 19-34 (46)
62 PF01683 EB: EB module; Inter 47.9 23 0.00051 25.0 3.1 24 456-483 25-48 (52)
63 KOG3516|consensus 46.5 15 0.00032 42.8 2.7 36 451-486 545-581 (1306)
64 KOG3516|consensus 46.0 15 0.00033 42.6 2.8 36 32-68 545-581 (1306)
65 PF01683 EB: EB module; Inter 44.9 24 0.00053 24.9 2.8 24 38-65 25-48 (52)
66 KOG3512|consensus 41.3 78 0.0017 33.3 6.7 28 380-407 285-313 (592)
67 KOG3514|consensus 39.7 20 0.00043 41.3 2.3 34 453-486 625-659 (1591)
68 PF09064 Tme5_EGF_like: Thromb 33.9 47 0.001 21.4 2.4 13 389-401 18-30 (34)
69 KOG3514|consensus 32.8 30 0.00065 39.9 2.4 36 144-179 624-660 (1591)
70 PF12955 DUF3844: Domain of un 29.7 20 0.00044 29.6 0.4 32 33-64 6-43 (103)
71 KOG3512|consensus 26.4 2.8E+02 0.006 29.5 7.7 27 156-182 286-313 (592)
72 PF04863 EGF_alliinase: Alliin 20.8 40 0.00087 24.3 0.5 31 456-486 16-50 (56)
No 1
>KOG1217|consensus
Probab=99.44 E-value=1.7e-11 Score=131.20 Aligned_cols=275 Identities=28% Similarity=0.602 Sum_probs=177.9
Q ss_pred CCCCCeeeecCCCceeeCCCCCcCCCCCCCCCCCCCCCCCCCCCCCccccCccCCCCCCCCCCCCCCCccCCCCCCCCcc
Q psy9424 41 CGRNAECAVVNHTPRCTCVAGTVGDPKYQSGVGTSCTSSRDCIGEQQCISGLCQPTCRSNTTCPAQHYCNSGLCVLEMQC 120 (535)
Q Consensus 41 C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~~~~~C~~~~c~~~C~~~~~C~~~~~c~~~~C~~g~~C 120 (535)
...++.++.....+.|.|++||.|..+.. . ..|+.... .+.....|.....
T Consensus 97 ~~~~~~~~~~~~~~~c~c~~g~~~~~~~~---~------~~C~~~~~--------~~~~~~~c~~~~~------------ 147 (487)
T KOG1217|consen 97 LLLCGECVDCVGSYECTCPPGYQGTPCEG---E------CECVTGPG--------VCCIDGSCSNGPG------------ 147 (487)
T ss_pred ccCCccccCCCCCceeeCCCccccCcCCc---c------eeecCCCC--------CeeCchhhcCCCC------------
Confidence 34456667777889999999999985541 0 01222111 0001111111110
Q ss_pred ccCCCCCCCCccCCCCCCCCccc--CCCCC--CCCCCCCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9424 121 TTHDQCSATEQCRSNDMGQMQCR--PACEG--ILCGRNALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHG 196 (535)
Q Consensus 121 ~~~~~c~~~~~C~~~~~G~~~c~--~~C~~--~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~ 196 (535)
........|..+|.+..... ++|.. .+|.+.+.|.+..++|.|.|++||.+..++.
T Consensus 148 ---~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~----------------- 207 (487)
T KOG1217|consen 148 ---SVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET----------------- 207 (487)
T ss_pred ---CCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC-----------------
Confidence 00123346666777665443 58873 5699999999999889999999999988431
Q ss_pred CCCCCCCCCCCCCCCCCCccccccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCC
Q psy9424 197 PGLSPGATSHSSHSGGPVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFG 276 (535)
Q Consensus 197 ~~c~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~ 276 (535)
. .+++.|++. +.|.+.++|.+..+.
T Consensus 208 -------------------------------------------------~--~~~~~c~~~---~~~~~~~g~~~~~c~- 232 (487)
T KOG1217|consen 208 -------------------------------------------------T--GNGGTCVDS---VACSCPPGARGPECE- 232 (487)
T ss_pred -------------------------------------------------C--CCCceEecc---eeccCCCCCCCCCcc-
Confidence 0 122344333 568888998876432
Q ss_pred CeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCCCCCccCCCCccCcccC
Q psy9424 277 CHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTDGVPKCKASCQSDEECG 356 (535)
Q Consensus 277 C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~C~~~C~~~~eC~ 356 (535)
..+.++... . ++|.+..++|+|.+++||++... ..+.++++|...
T Consensus 233 -~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~----------------------------- 277 (487)
T KOG1217|consen 233 -VSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALI----------------------------- 277 (487)
T ss_pred -cccccccCC---C-CcccccCCceeeeCCCCcccccc-ceeeeccccCCC-----------------------------
Confidence 334444333 4 78999999999999999988762 112233333221
Q ss_pred CCCcccCCCcCCCCCCCCCCCCCceeeecCCCeeeeCCCCCcCCCC------CCCC----ccCCCCCCcccCCC---CCc
Q psy9424 357 LGEKCLQGQCNNPCERQGACGVNSLCNVLTHRKVCFCPRGFTGDPE------TECV----RITCLSHADCYPGG---GSL 423 (535)
Q Consensus 357 ~~~~C~~~~C~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~------~~C~----~~~C~~~~~C~~~~---g~~ 423 (535)
. .|.++++|+++.+.|.|.|++||+|..+ .+|. ..+|.++++|.... .+.
T Consensus 278 ----------------~-~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~ 340 (487)
T KOG1217|consen 278 ----------------A-SCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFR 340 (487)
T ss_pred ----------------C-ccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCC
Confidence 1 1455667777777799999999999986 2453 22577777883322 245
Q ss_pred cCCCCCCCCCcCCCCCCCCCCcCCCCCcCCCCCCCCCCCCeeee-CCCCceeeCCCCCcCC
Q psy9424 424 CLANLCTRGCSADTDCPAALSCRSAECVDPCSPAPCGPNAQCSV-ANHRPLCSCPAGLMGL 483 (535)
Q Consensus 424 C~~g~C~~g~~~~~~C~~g~~C~~~~c~d~C~~~~C~~~~~C~~-~~g~~~C~C~~G~~G~ 483 (535)
|. +..++ .|..|+.. .++|...++.+++.|++ ..++|.|.|+.+|.+.
T Consensus 341 C~---c~~~~-------~g~~C~~~--~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 341 CA---CGPGF-------TGRRCEDS--NDECASSPCCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred cC---CCCCC-------CCCccccC--CccccCCccccCCEeccCCCCCeEecCCCccccC
Confidence 76 66665 78888873 25898778999999999 6899999999999985
No 2
>KOG4289|consensus
Probab=99.39 E-value=8.6e-12 Score=137.13 Aligned_cols=110 Identities=28% Similarity=0.709 Sum_probs=83.1
Q ss_pred CCCCCCCCCCCCeec----------------------cCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9424 144 PACEGILCGRNALCT----------------------ASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSP 201 (535)
Q Consensus 144 ~~C~~~~C~~~g~C~----------------------~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~ 201 (535)
+.|+..||.+..+|+ +..+++.|.|++||+|+.|++.
T Consensus 1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTe--------------------- 1238 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETE--------------------- 1238 (2531)
T ss_pred chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccch---------------------
Confidence 566667777766663 3346788999999999997632
Q ss_pred CCCCCCCCCCCCCccccccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCee--
Q psy9424 202 GATSHSSHSGGPVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHL-- 279 (535)
Q Consensus 202 ~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~-- 279 (535)
++.|. +.+|.++++|....|.|+|.|.+||+|+ .|+.
T Consensus 1239 ----------------------iDlCY----------------s~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs~ 1277 (2531)
T KOG4289|consen 1239 ----------------------IDLCY----------------SGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVSA 1277 (2531)
T ss_pred ----------------------hHhhh----------------cCCCCCCCceEEecCceeEEecCCcccc---ceeeec
Confidence 34443 5789999999999999999999999999 4543
Q ss_pred -CCcCCCCCCCCCCeeecCC-CCeeeeCCCC-CccCCCC
Q psy9424 280 -IDFCAAKPCGPGARCDNSR-GSYKCLCPLG-LVGDPYG 315 (535)
Q Consensus 280 -~~~C~~~~C~~~~~C~~~~-g~~~C~C~~G-y~g~~c~ 315 (535)
.-.|.+..|.++++|++.. +.+.|.|+.| |++..|+
T Consensus 1278 ~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1278 RAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred ccCccccceecCCCEEeecCCCceeccCCCcccCCCceE
Confidence 3457777899999998875 7888999887 4444443
No 3
>KOG1214|consensus
Probab=99.38 E-value=6.2e-12 Score=132.48 Aligned_cols=123 Identities=28% Similarity=0.682 Sum_probs=88.4
Q ss_pred cCCCCC--CCCCCCCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc
Q psy9424 143 RPACEG--ILCGRNALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVE 220 (535)
Q Consensus 143 ~~~C~~--~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~ 220 (535)
+++|+. +.|+.+++|++.+++|+|.|..||.-.. .+ .+|..+.
T Consensus 734 ~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~d---------------------------d~--------~tCV~i~ 778 (1289)
T KOG1214|consen 734 ENECATGFHRCGPNSVCINLPGSYRCECRSGYEFAD---------------------------DR--------HTCVLIT 778 (1289)
T ss_pred hhhhccCCCCCCCCceeecCCCceeEEEeecceecc---------------------------CC--------cceEEec
Confidence 455654 6789999999999999999999875433 00 1222221
Q ss_pred C-CCCCCCCCCCcccCCCcccCCCCCCCCCCC--CccccCC-CCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeec
Q psy9424 221 C-NSHADCSGDKVCEDHRCKISCLANNPCGPN--ALCSAEK-HKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDN 296 (535)
Q Consensus 221 C-~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~--~~C~~~~-g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~ 296 (535)
= .....|..+ +..|... ++|+... ++|.|.|.|||.|++.. |.++|+|.++.|...++|.+
T Consensus 779 ~pap~n~Ce~g--------------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c~dvDeC~psrChp~A~Cyn 843 (1289)
T KOG1214|consen 779 PPAPANPCEDG--------------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-CTDVDECSPSRCHPAATCYN 843 (1289)
T ss_pred CCCCCCccccC--------------ccccCcCCceEEEecCCceEEEeecCCccCCccc-cccccccCccccCCCceEec
Confidence 0 011122111 2445444 3555544 56999999999999875 88899999999999999999
Q ss_pred CCCCeeeeCCCCCccCCCC
Q psy9424 297 SRGSYKCLCPLGLVGDPYG 315 (535)
Q Consensus 297 ~~g~~~C~C~~Gy~g~~c~ 315 (535)
+++++.|+|.+||.|+...
T Consensus 844 tpgsfsC~C~pGy~GDGf~ 862 (1289)
T KOG1214|consen 844 TPGSFSCRCQPGYYGDGFQ 862 (1289)
T ss_pred CCCcceeecccCccCCCce
Confidence 9999999999999998765
No 4
>KOG1214|consensus
Probab=99.32 E-value=1.1e-11 Score=130.76 Aligned_cols=136 Identities=24% Similarity=0.503 Sum_probs=102.4
Q ss_pred ccCCCccCCCCCC--CcccCcCCcCCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcCCCCCCCCCCCCCCCCCCCC---
Q psy9424 10 QISQHLQQVPGLA--SAACVDGRCRNPCEA-DEVCGRNAECAVVNHTPRCTCVAGTVGDPKYQSGVGTSCTSSRDCI--- 83 (535)
Q Consensus 10 ~~~~~c~c~~g~~--g~~C~~~~~~d~C~~-~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~--- 83 (535)
.++..|+|..||. |++|++ +++|+. ...|..+.+|++..++|+|.|..||.-.+ +...|+
T Consensus 713 ~~~~tcecs~g~~gdgr~c~d---~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~d-----------d~~tCV~i~ 778 (1289)
T KOG1214|consen 713 GVDYTCECSSGYQGDGRNCVD---ENECATGFHRCGPNSVCINLPGSYRCECRSGYEFAD-----------DRHTCVLIT 778 (1289)
T ss_pred CcceEEEEeeccCCCCCCCCC---hhhhccCCCCCCCCceeecCCCceeEEEeecceecc-----------CCcceEEec
Confidence 3556799999997 788998 899997 77899999999999999999999986432 112343
Q ss_pred ---CCCccccCccCCCCCCCCCC--CC-CCCccCCCCCCCCccccCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCee
Q psy9424 84 ---GEQQCISGLCQPTCRSNTTC--PA-QHYCNSGLCVLEMQCTTHDQCSATEQCRSNDMGQMQCRPACEGILCGRNALC 157 (535)
Q Consensus 84 ---~~~~C~~~~c~~~C~~~~~C--~~-~~~c~~~~C~~g~~C~~~~~c~~~~~C~~~~~G~~~c~~~C~~~~C~~~g~C 157 (535)
.++.|..+. |.|...+.+ +. +..-|.+.|.+||. +++..|. ++|+|+.+.|..++.|
T Consensus 779 ~pap~n~Ce~g~--h~C~i~g~a~c~~hGgs~y~C~CLPGfs-------GDG~~c~--------dvDeC~psrChp~A~C 841 (1289)
T KOG1214|consen 779 PPAPANPCEDGS--HTCAIAGQARCVHHGGSTYSCACLPGFS-------GDGHQCT--------DVDECSPSRCHPAATC 841 (1289)
T ss_pred CCCCCCccccCc--cccCcCCceEEEecCCceEEEeecCCcc-------CCccccc--------cccccCccccCCCceE
Confidence 344566666 677765544 22 23344555555555 4444443 4899999999999999
Q ss_pred ccCCCCceecCCCCCCCCC
Q psy9424 158 TASDHHATCSCKPGYVGHP 176 (535)
Q Consensus 158 ~~~~~~~~C~C~~Gf~g~~ 176 (535)
++++++|.|+|.+||.|++
T Consensus 842 yntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 842 YNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred ecCCCcceeecccCccCCC
Confidence 9999999999999999998
No 5
>KOG4289|consensus
Probab=99.28 E-value=2.8e-11 Score=133.21 Aligned_cols=85 Identities=34% Similarity=0.802 Sum_probs=69.1
Q ss_pred CCCCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEE
Q psy9424 257 EKHKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHC 336 (535)
Q Consensus 257 ~~g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C 336 (535)
+.+.++|.|+|||+|+.++ +.+|.|.+.||.++++|...+|+|+|.|++||+|..|+..- ..-.|. +..|.++++|
T Consensus 1218 pvnglrCrCPpGFTgd~Ce--TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~-~agrCv-pGvC~nggtC 1293 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCE--TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSA-RAGRCV-PGVCKNGGTC 1293 (2531)
T ss_pred ccCceeEeCCCCCCccccc--chhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeec-ccCccc-cceecCCCEE
Confidence 3467899999999999554 45999999999999999999999999999999999988421 123454 3689999999
Q ss_pred ec-CCCCCcc
Q psy9424 337 VK-TDGVPKC 345 (535)
Q Consensus 337 ~~-~~g~~~C 345 (535)
++ ..|.|.|
T Consensus 1294 ~~~~nggf~c 1303 (2531)
T KOG4289|consen 1294 VNLLNGGFCC 1303 (2531)
T ss_pred eecCCCceec
Confidence 98 4555555
No 6
>KOG1217|consensus
Probab=99.25 E-value=2.5e-10 Score=122.22 Aligned_cols=246 Identities=28% Similarity=0.644 Sum_probs=160.1
Q ss_pred CCCccCCCCCCCcccCcCCcCCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcCCCCCCCCCCCCCCCCCCCCCCC--cc
Q psy9424 12 SQHLQQVPGLASAACVDGRCRNPCEA-DEVCGRNAECAVVNHTPRCTCVAGTVGDPKYQSGVGTSCTSSRDCIGEQ--QC 88 (535)
Q Consensus 12 ~~~c~c~~g~~g~~C~~~~~~d~C~~-~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~~~~--~C 88 (535)
...++|..||.+..+... .++|.. ..+|.+++.|.+..++|.|.|++||++..+.. . .....|++.. .+
T Consensus 151 ~~~c~C~~g~~~~~~~~~--~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~---~---~~~~~c~~~~~~~~ 222 (487)
T KOG1217|consen 151 PFRCSCTEGYEGEPCETD--LDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET---T---GNGGTCVDSVACSC 222 (487)
T ss_pred ceeeeeCCCccccccccc--ccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC---C---CCCceEecceeccC
Confidence 445999999999999862 379984 66799999999999999999999999985541 1 0111222210 00
Q ss_pred ccCccCCCCC-------CC-CCCCCCCCccCCCCCCCCccccCCCCCCCCccCCCCCCCC----cccCCCCCCC-CCCCC
Q psy9424 89 ISGLCQPTCR-------SN-TTCPAQHYCNSGLCVLEMQCTTHDQCSATEQCRSNDMGQM----QCRPACEGIL-CGRNA 155 (535)
Q Consensus 89 ~~~~c~~~C~-------~~-~~C~~~~~c~~~~C~~g~~C~~~~~c~~~~~C~~~~~G~~----~c~~~C~~~~-C~~~g 155 (535)
..+.-...|. .+ ..|.+..+.+ .+.|.++|.+.. ..+++|+... |.+++
T Consensus 223 ~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~------------------~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~ 284 (487)
T KOG1217|consen 223 PPGARGPECEVSIVECASGDGTCVNTVGSY------------------TCRCPEGYTGDACVTCVDVDSCALIASCPNGG 284 (487)
T ss_pred CCCCCCCCcccccccccCCCCcccccCCce------------------eeeCCCCccccccceeeeccccCCCCccCCCC
Confidence 0000000111 11 2222222222 234455666654 2478898753 99999
Q ss_pred eeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCCCCCCCCCCcccC
Q psy9424 156 LCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNSHADCSGDKVCED 235 (535)
Q Consensus 156 ~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C~~ 235 (535)
+|++..+.|.|.|++||.|..+ ..+.+..+|....
T Consensus 285 ~C~~~~~~~~C~C~~g~~g~~~-----------------------------------------~~~~~~~~C~~~~---- 319 (487)
T KOG1217|consen 285 TCVNVPGSYRCTCPPGFTGRLC-----------------------------------------TECVDVDECSPRN---- 319 (487)
T ss_pred eeecCCCcceeeCCCCCCCCCC-----------------------------------------ccccccccccccc----
Confidence 9999988899999999999983 1122223332100
Q ss_pred CCcccCCCCCCCCCCCCcc--ccCCCCeeeeCCCCCccCCCCCCeeC-CcCCCCCCCCCCeeec-CCCCeeeeCCCCCcc
Q psy9424 236 HRCKISCLANNPCGPNALC--SAEKHKQICYCQPGYTGDAYFGCHLI-DFCAAKPCGPGARCDN-SRGSYKCLCPLGLVG 311 (535)
Q Consensus 236 ~~c~~~c~~~~~C~~~~~C--~~~~g~~~C~C~~G~~G~~~~~C~~~-~~C~~~~C~~~~~C~~-~~g~~~C~C~~Gy~g 311 (535)
...+|.+++.| ......+.|.|.++|.|. .|+.. ++|...++..++.|++ ..+.|.|.++.+|.+
T Consensus 320 --------~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~---~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 320 --------AGGPCANGGTCNTLGSFGGFRCACGPGFTGR---RCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred --------cCCcCCCCcccccCCCCCCCCcCCCCCCCCC---ccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence 13557777777 233456789999999999 67776 4898888999999999 689999999999997
Q ss_pred C--CCCCCCCCCCCCCCCCCCCCCCEEecCCCCCcc
Q psy9424 312 D--PYGAGCVSASQCTRDDQCPPGAHCVKTDGVPKC 345 (535)
Q Consensus 312 ~--~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~C 345 (535)
. .....+.++++|.. .+.|++..+++.|
T Consensus 389 ~~~~~~~~~~~~~~c~~------~~~c~~~~~~~~c 418 (487)
T KOG1217|consen 389 KANGDGVGCEDIDECSG------CGDCVNGPGGGAC 418 (487)
T ss_pred CCccccccccccccccC------CcceeccCCCCcc
Confidence 4 33344555666642 4456666666554
No 7
>KOG1219|consensus
Probab=99.10 E-value=1.4e-10 Score=132.22 Aligned_cols=107 Identities=27% Similarity=0.642 Sum_probs=85.5
Q ss_pred CCCCCCCCccccCC-CCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCC
Q psy9424 245 NNPCGPNALCSAEK-HKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQ 323 (535)
Q Consensus 245 ~~~C~~~~~C~~~~-g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~ 323 (535)
.+||+++|+|...+ +.|.|.|++.|+|..|+ .++++|.++||..+++|+...++|.|.|+.||+|..|+.. .+++
T Consensus 3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE--i~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~--Gi~e 3944 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE--IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR--GISE 3944 (4289)
T ss_pred cCcccCCCEecCCCCCceEEeCcccccCcccc--cccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc--cccc
Confidence 58999999998876 67999999999999555 4689999999999999999999999999999999988731 2444
Q ss_pred CCCCCCCCCCCEEecCCCCCccCCCCccCcccCCCCcccCCCcCCCCCCCCCCCCCceeeecCCCeeeeCCCCCcCCCC
Q psy9424 324 CTRDDQCPPGAHCVKTDGVPKCKASCQSDEECGLGEKCLQGQCNNPCERQGACGVNSLCNVLTHRKVCFCPRGFTGDPE 402 (535)
Q Consensus 324 C~~~~~C~~~~~C~~~~g~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 402 (535)
|+. ..|..+|+|++..|+|.|.|-+||.|..+
T Consensus 3945 Cs~-----------------------------------------------n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3945 CSK-----------------------------------------------NVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred ccc-----------------------------------------------ccccCCceeeccCCceEeccChhHhcccC
Confidence 432 34555666677777777777777776653
No 8
>KOG1219|consensus
Probab=99.09 E-value=1.6e-10 Score=131.84 Aligned_cols=110 Identities=29% Similarity=0.712 Sum_probs=96.7
Q ss_pred CCCCCCCCCCCCeeccCC-CCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCC
Q psy9424 144 PACEGILCGRNALCTASD-HHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECN 222 (535)
Q Consensus 144 ~~C~~~~C~~~g~C~~~~-~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~ 222 (535)
+.|..+||+++|+|.... ++|.|.|++-|+|..||+.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~------------------------------------------ 3902 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID------------------------------------------ 3902 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccc------------------------------------------
Confidence 789999999999999876 6799999999999997632
Q ss_pred CCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCee--CCcCCCCCCCCCCeeecCCCC
Q psy9424 223 SHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHL--IDFCAAKPCGPGARCDNSRGS 300 (535)
Q Consensus 223 ~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~--~~~C~~~~C~~~~~C~~~~g~ 300 (535)
...|. ++||..+++|+...+.|.|.|+.||+|. +|+. +++|..++|.++|.|++..|+
T Consensus 3903 -~epC~----------------snPC~~GgtCip~~n~f~CnC~~gyTG~---~Ce~~Gi~eCs~n~C~~gg~C~n~~gs 3962 (4289)
T KOG1219|consen 3903 -LEPCA----------------SNPCLTGGTCIPFYNGFLCNCPNGYTGK---RCEARGISECSKNVCGTGGQCINIPGS 3962 (4289)
T ss_pred -ccccc----------------CCCCCCCCEEEecCCCeeEeCCCCccCc---eeecccccccccccccCCceeeccCCc
Confidence 22333 5899999999999999999999999999 5553 899999999999999999999
Q ss_pred eeeeCCCCCccCCCC
Q psy9424 301 YKCLCPLGLVGDPYG 315 (535)
Q Consensus 301 ~~C~C~~Gy~g~~c~ 315 (535)
|+|.|.+||.|..|.
T Consensus 3963 f~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3963 FHCNCTPGILGRTCC 3977 (4289)
T ss_pred eEeccChhHhcccCc
Confidence 999999999988764
No 9
>KOG1225|consensus
Probab=98.98 E-value=3.4e-09 Score=111.00 Aligned_cols=105 Identities=25% Similarity=0.522 Sum_probs=74.2
Q ss_pred CccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCCCCC--CCCCCCCCCCCCCCCCCccccC
Q psy9424 14 HLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDPKYQ--SGVGTSCTSSRDCIGEQQCISG 91 (535)
Q Consensus 14 ~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~--C~~~~~C~~~~~C~~~~~C~~~ 91 (535)
.|+++.+|+|..|+.. .|+ +.|..++.|++. +|+|++||+|.+|++ |+ . .|.....+.++
T Consensus 235 ic~c~~~~~g~~c~~~----~C~--~~c~~~g~c~~G----~CIC~~Gf~G~dC~e~~Cp------~--~cs~~g~~~~g 296 (525)
T KOG1225|consen 235 ICECPEGYFGPLCSTI----YCP--GGCTGRGQCVEG----RCICPPGFTGDDCDELVCP------V--DCSGGGVCVDG 296 (525)
T ss_pred eeecCCceeCCccccc----cCC--CCCcccceEeCC----eEeCCCCCcCCCCCcccCC------c--ccCCCceecCC
Confidence 6999999999999964 554 556666888865 799999999997662 21 0 01111111111
Q ss_pred ccCCCCCCCCCCCCCCCccCCCCCCCCccccCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCeeccCCCCceecCCCC
Q psy9424 92 LCQPTCRSNTTCPAQHYCNSGLCVLEMQCTTHDQCSATEQCRSNDMGQMQCRPACEGILCGRNALCTASDHHATCSCKPG 171 (535)
Q Consensus 92 ~c~~~C~~~~~C~~~~~c~~~~C~~g~~C~~~~~c~~~~~C~~~~~G~~~c~~~C~~~~C~~~g~C~~~~~~~~C~C~~G 171 (535)
.+.|.++|+|+.+.+..|. .+|..+|.|++ ..|.|.+|
T Consensus 297 -------------------------------------~CiC~~g~~G~dCs~~~cp-adC~g~G~Ci~----G~C~C~~G 334 (525)
T KOG1225|consen 297 -------------------------------------ECICNPGYSGKDCSIRRCP-ADCSGHGKCID----GECLCDEG 334 (525)
T ss_pred -------------------------------------EeecCCCccccccccccCC-ccCCCCCcccC----CceEeCCC
Confidence 1367788888887777776 68888999984 37999999
Q ss_pred CCCCCCC
Q psy9424 172 YVGHPGP 178 (535)
Q Consensus 172 f~g~~c~ 178 (535)
|+|..|+
T Consensus 335 y~G~~C~ 341 (525)
T KOG1225|consen 335 YTGELCI 341 (525)
T ss_pred CcCCccc
Confidence 9998853
No 10
>KOG1225|consensus
Probab=98.91 E-value=1.7e-08 Score=105.89 Aligned_cols=121 Identities=26% Similarity=0.620 Sum_probs=86.4
Q ss_pred CccCCCCCCCCcccCCCCCCCCCCCCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9424 130 EQCRSNDMGQMQCRPACEGILCGRNALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSH 209 (535)
Q Consensus 130 ~~C~~~~~G~~~c~~~C~~~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~ 209 (535)
+.|+++|+|.++.+-.|... |+.++.+++. .|+|++||+|..|+
T Consensus 267 CIC~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs------------------------------- 310 (525)
T KOG1225|consen 267 CICPPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPGYSGKDCS------------------------------- 310 (525)
T ss_pred EeCCCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCCccccccc-------------------------------
Confidence 47788999998888778755 8888888874 89999999999853
Q ss_pred CCCCCccccccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCeeCCcCCCCCCC
Q psy9424 210 SGGPVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHLIDFCAAKPCG 289 (535)
Q Consensus 210 ~~~~~~C~~~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~ 289 (535)
...| +..|.+++.|+.. +|.|.+||+|. .|... .|.
T Consensus 311 --------~~~c-----------------------padC~g~G~Ci~G----~C~C~~Gy~G~---~C~~~------~C~ 346 (525)
T KOG1225|consen 311 --------IRRC-----------------------PADCSGHGKCIDG----ECLCDEGYTGE---LCIQR------ACS 346 (525)
T ss_pred --------cccC-----------------------CccCCCCCcccCC----ceEeCCCCcCC---ccccc------ccC
Confidence 1222 3668889999833 59999999999 44432 388
Q ss_pred CCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCC
Q psy9424 290 PGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTD 340 (535)
Q Consensus 290 ~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~ 340 (535)
+++.|++. |+|..||.|.. . .-+.+.....|.....++...
T Consensus 347 ~~g~cv~g-----C~C~~Gw~G~d-~----~~~~~~~~~~cs~~~~~~~~~ 387 (525)
T KOG1225|consen 347 GGGQCVNG-----CKCKKGWRGPD-V----ADPSLLLITECSPPSLCIAGV 387 (525)
T ss_pred CCceeccC-----ceeccCccCCC-c----CCchhhcccccCCCceeeccc
Confidence 88888763 99999999987 1 222333233455555555443
No 11
>KOG4260|consensus
Probab=98.59 E-value=5.7e-08 Score=90.98 Aligned_cols=71 Identities=25% Similarity=0.651 Sum_probs=55.5
Q ss_pred ccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCeeCCcCCC--CC-CCCCCeee
Q psy9424 219 VECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHLIDFCAA--KP-CGPGARCD 295 (535)
Q Consensus 219 ~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~C~~--~~-C~~~~~C~ 295 (535)
..|.++++|... +.+|..+..|+|+.|+|.|...+||.+. +++|.. .. -..+..|+
T Consensus 231 ~gCvDvnEC~~e--------------p~~c~~~qfCvNteGSf~C~dk~Gy~~g-------~d~C~~~~d~~~~kn~~c~ 289 (350)
T KOG4260|consen 231 EGCVDVNECQNE--------------PAPCKAHQFCVNTEGSFKCEDKEGYKKG-------VDECQFCADVCASKNRPCM 289 (350)
T ss_pred cccccHHHHhcC--------------CCCCChhheeecCCCceEecccccccCC-------hHHhhhhhhhcccCCCCcc
Confidence 347888888754 5889999999999999999999999863 344432 22 23456889
Q ss_pred cCCCCeeeeCCCCCc
Q psy9424 296 NSRGSYKCLCPLGLV 310 (535)
Q Consensus 296 ~~~g~~~C~C~~Gy~ 310 (535)
++++.|+|.|..|+.
T Consensus 290 ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 290 NIDGQYRCVCFSGLI 304 (350)
T ss_pred cCCccEEEEecccce
Confidence 999999999999975
No 12
>KOG4260|consensus
Probab=98.39 E-value=4.4e-07 Score=85.12 Aligned_cols=137 Identities=28% Similarity=0.669 Sum_probs=88.4
Q ss_pred CCCCCCCccccC---CCCeeeeCCCCCccCCCCCCeeC------Cc----CCC--CCCCCCCeeecCCCCeee-eCCCCC
Q psy9424 246 NPCGPNALCSAE---KHKQICYCQPGYTGDAYFGCHLI------DF----CAA--KPCGPGARCDNSRGSYKC-LCPLGL 309 (535)
Q Consensus 246 ~~C~~~~~C~~~---~g~~~C~C~~G~~G~~~~~C~~~------~~----C~~--~~C~~~~~C~~~~g~~~C-~C~~Gy 309 (535)
.+|..++.|... .|+-+|.|.+||+|+.+..|.+- ++ |.. .+|. +.|..... -.| +|+.||
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~Csg~~~-k~C~kCkkGW 226 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL--GVCSGESS-KGCSKCKKGW 226 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--cccCCCCC-CChhhhcccc
Confidence 567777777532 35678999999999976555420 00 100 1222 24543322 234 799999
Q ss_pred ccCCCCCCCCCCCCCCCC-CCCCCCCEEecCCCCCccCC--CCc-cCcccCCCCcccCCCcCCCCCCCCCCCCCceeeec
Q psy9424 310 VGDPYGAGCVSASQCTRD-DQCPPGAHCVKTDGVPKCKA--SCQ-SDEECGLGEKCLQGQCNNPCERQGACGVNSLCNVL 385 (535)
Q Consensus 310 ~g~~c~~~C~~~~~C~~~-~~C~~~~~C~~~~g~~~C~~--~C~-~~~eC~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~ 385 (535)
..+. ..|+|+++|... .+|.....|+|+.|+|.|.. ++. .+++|+. |.+.|. ..+..|+++
T Consensus 227 ~lde--~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~--------~~d~~~-----~kn~~c~ni 291 (350)
T KOG4260|consen 227 KLDE--EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQF--------CADVCA-----SKNRPCMNI 291 (350)
T ss_pred eecc--cccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhh--------hhhhcc-----cCCCCcccC
Confidence 9884 459999999876 68999999999999999852 121 1233321 011111 235678899
Q ss_pred CCCeeeeCCCCCcCC
Q psy9424 386 THRKVCFCPRGFTGD 400 (535)
Q Consensus 386 ~g~~~C~C~~G~~g~ 400 (535)
+++|+|+|..|+.-.
T Consensus 292 ~~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 292 DGQYRCVCFSGLIII 306 (350)
T ss_pred CccEEEEecccceee
Confidence 999999999887644
No 13
>KOG0994|consensus
Probab=98.32 E-value=2.8e-05 Score=85.76 Aligned_cols=106 Identities=27% Similarity=0.603 Sum_probs=65.7
Q ss_pred eecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCCC-CCCCCCCcccCCCcccCCC
Q psy9424 165 TCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNSH-ADCSGDKVCEDHRCKISCL 243 (535)
Q Consensus 165 ~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~~-~~C~~~~~C~~~~c~~~c~ 243 (535)
.|+|.+|-+|..|. -|+|||+|.. .|+.-.|..+ ++|...
T Consensus 831 QC~C~~g~ygrqCn-------------------qCqpG~WgFP-------eCr~CqCNgHA~~Cd~~------------- 871 (1758)
T KOG0994|consen 831 QCQCRPGTYGRQCN-------------------QCQPGYWGFP-------ECRPCQCNGHADTCDPI------------- 871 (1758)
T ss_pred ceeeccccchhhcc-------------------ccCCCccCCC-------cCccccccCcccccCcc-------------
Confidence 78898998888864 4788888853 4665566544 344322
Q ss_pred CCCCCCCCCccccCCCCeee-eCCCCCccCCCCCCeeCCcCCCCCCCCCC--------eeecC--CCCeeeeCCCCCccC
Q psy9424 244 ANNPCGPNALCSAEKHKQIC-YCQPGYTGDAYFGCHLIDFCAAKPCGPGA--------RCDNS--RGSYKCLCPLGLVGD 312 (535)
Q Consensus 244 ~~~~C~~~~~C~~~~g~~~C-~C~~G~~G~~~~~C~~~~~C~~~~C~~~~--------~C~~~--~g~~~C~C~~Gy~g~ 312 (535)
...|. .|.+....+.| +|..||.|++.. -.-..|.+-||..+- .|... .....|.|.+||+|.
T Consensus 872 -tGaCi---~CqD~T~G~~CdrCl~GyyGdP~l--g~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~ 945 (1758)
T KOG0994|consen 872 -TGACI---DCQDSTTGHSCDRCLDGYYGDPRL--GSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGS 945 (1758)
T ss_pred -ccccc---cccccccccchhhhhccccCCccc--CCCCCCCCCCCCCCCccchhccccccccccccceeeecccCcccc
Confidence 12222 24455567788 899999998754 122345444443321 23222 234579999999999
Q ss_pred CCC
Q psy9424 313 PYG 315 (535)
Q Consensus 313 ~c~ 315 (535)
.|+
T Consensus 946 RCe 948 (1758)
T KOG0994|consen 946 RCE 948 (1758)
T ss_pred chh
Confidence 877
No 14
>KOG0994|consensus
Probab=97.95 E-value=0.00018 Score=79.68 Aligned_cols=26 Identities=35% Similarity=0.828 Sum_probs=18.5
Q ss_pred eeeecCCCeeeeCCCCCcCCCCCCCCcc
Q psy9424 381 LCNVLTHRKVCFCPRGFTGDPETECVRI 408 (535)
Q Consensus 381 ~C~~~~g~~~C~C~~G~~g~~~~~C~~~ 408 (535)
+|..-.| +|+|.+||-|..+++|...
T Consensus 1078 qCN~ftG--QCqCkpGfGGR~C~qCqel 1103 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGRTCSQCQEL 1103 (1758)
T ss_pred ccccccc--ceeccCCCCCcchhHHHHh
Confidence 3443344 8999999999988777643
No 15
>KOG1226|consensus
Probab=97.75 E-value=0.00025 Score=76.41 Aligned_cols=62 Identities=29% Similarity=0.717 Sum_probs=45.3
Q ss_pred CCCCCCCccccCCCCeeeeCCCCCccCCCCCCe-eCCcCCCC---CCCCCCeeecCCCCeeeeCCCC-CccCCCCC
Q psy9424 246 NPCGPNALCSAEKHKQICYCQPGYTGDAYFGCH-LIDFCAAK---PCGPGARCDNSRGSYKCLCPLG-LVGDPYGA 316 (535)
Q Consensus 246 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~-~~~~C~~~---~C~~~~~C~~~~g~~~C~C~~G-y~g~~c~~ 316 (535)
..|..+|+|.=. +|+|.+||+|..+. |. +.+.|.+. .|...|+|.=. +|+|... |+|..|++
T Consensus 555 ~lC~g~G~C~CG----~CvC~~GwtG~~C~-C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~ 621 (783)
T KOG1226|consen 555 VLCGGHGRCECG----RCVCNPGWTGSACN-CPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK 621 (783)
T ss_pred cccCCCCeEeCC----cEEcCCCCccCCCC-CCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence 568888887543 49999999999774 54 35667542 37777777654 6888776 99998874
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx; 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.71 E-value=2e-05 Score=54.14 Aligned_cols=33 Identities=21% Similarity=0.519 Sum_probs=30.6
Q ss_pred CCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcC
Q psy9424 32 RNPCEA-DEVCGRNAECAVVNHTPRCTCVAGTVG 64 (535)
Q Consensus 32 ~d~C~~-~~~C~~~g~C~~~~~~~~C~C~~Gf~G 64 (535)
||||.. .+.|..++.|+++.|+|+|+|++||..
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 899997 678999999999999999999999983
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long, found in the sequence of epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.70 E-value=1.4e-05 Score=51.22 Aligned_cols=30 Identities=33% Similarity=0.793 Sum_probs=27.0
Q ss_pred CCCCCCCCCCeeeeCC-CCceeeCCCCCcCC
Q psy9424 454 CSPAPCGPNAQCSVAN-HRPLCSCPAGLMGL 483 (535)
Q Consensus 454 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~G~ 483 (535)
|.+++|.++|+|++.. ++|+|+|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 3467999999999998 99999999999996
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.57 E-value=9.6e-05 Score=49.57 Aligned_cols=35 Identities=29% Similarity=0.748 Sum_probs=31.2
Q ss_pred cCCCCC-CCCCCCCeeeeCCCCceeeCCCCCc-CCCC
Q psy9424 451 VDPCSP-APCGPNAQCSVANHRPLCSCPAGLM-GLPS 485 (535)
Q Consensus 451 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~-G~~c 485 (535)
+|+|.. .+|.++++|+++.++|.|.|++||+ |..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 678876 7899999999999999999999999 7765
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx; 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.56 E-value=3.7e-05 Score=52.76 Aligned_cols=31 Identities=35% Similarity=0.862 Sum_probs=29.0
Q ss_pred cCCCC--CCCCCCCCeeeeCCCCceeeCCCCCc
Q psy9424 451 VDPCS--PAPCGPNAQCSVANHRPLCSCPAGLM 481 (535)
Q Consensus 451 ~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~ 481 (535)
||||+ +++|..++.|+|+.|+|+|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 78998 56899999999999999999999998
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long, found in the sequence of epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.47 E-value=5e-05 Score=48.72 Aligned_cols=28 Identities=21% Similarity=0.564 Sum_probs=25.9
Q ss_pred CCCCCCCCeeeecC-CCceeeCCCCCcCC
Q psy9424 38 DEVCGRNAECAVVN-HTPRCTCVAGTVGD 65 (535)
Q Consensus 38 ~~~C~~~g~C~~~~-~~~~C~C~~Gf~G~ 65 (535)
.++|.++|+|++.. ++|+|+|++||+|.
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 46899999999998 89999999999996
No 21
>KOG1226|consensus
Probab=97.39 E-value=0.0015 Score=70.52 Aligned_cols=60 Identities=25% Similarity=0.616 Sum_probs=42.5
Q ss_pred CCCCCCCccccCCCCeeeeCCCCCc----cCCCCCCeeCCcCCC---CCCCCCCeeecCCCCeeeeCCCCCccCCCC
Q psy9424 246 NPCGPNALCSAEKHKQICYCQPGYT----GDAYFGCHLIDFCAA---KPCGPGARCDNSRGSYKCLCPLGLVGDPYG 315 (535)
Q Consensus 246 ~~C~~~~~C~~~~g~~~C~C~~G~~----G~~~~~C~~~~~C~~---~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~ 315 (535)
.+|+..|.|+=. +|+|.+... |..++ |.+ -.|.. ..|..+|+|.=. +|+|.+||+|..|+
T Consensus 514 ~vCSgrG~C~CG----qC~C~~~~~~~i~G~fCE-CDn-fsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~ 580 (783)
T KOG1226|consen 514 PVCSGRGDCVCG----QCVCHKPDNGKIYGKFCE-CDN-FSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACN 580 (783)
T ss_pred CCcCCCCcEeCC----ceEecCCCCCceeeeeee-ccC-cccccccCcccCCCCeEeCC----cEEcCCCCccCCCC
Confidence 478888888644 488887766 66443 322 23433 348889998765 79999999999987
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.26 E-value=0.00037 Score=46.66 Aligned_cols=35 Identities=26% Similarity=0.656 Sum_probs=30.4
Q ss_pred cCCCCC-CCCCCCCeeccCCCCceecCCCCCC-CCCC
Q psy9424 143 RPACEG-ILCGRNALCTASDHHATCSCKPGYV-GHPG 177 (535)
Q Consensus 143 ~~~C~~-~~C~~~g~C~~~~~~~~C~C~~Gf~-g~~c 177 (535)
+++|.. .+|.++++|+++.++|.|.|++||+ |..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 567776 7899899999999999999999999 7664
No 23
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.19 E-value=0.0005 Score=45.51 Aligned_cols=35 Identities=31% Similarity=0.767 Sum_probs=30.9
Q ss_pred cCCCCC-CCCCCCCeeeeCCCCceeeCCCCCcCCCC
Q psy9424 451 VDPCSP-APCGPNAQCSVANHRPLCSCPAGLMGLPS 485 (535)
Q Consensus 451 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~G~~c 485 (535)
+++|.. .+|.+++.|++..++|+|.|++||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 577765 78999999999999999999999999765
No 24
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.02 E-value=0.00026 Score=46.55 Aligned_cols=29 Identities=31% Similarity=0.736 Sum_probs=24.1
Q ss_pred CCCCCCCceeeecCCCeeeeCCCCCcCCC
Q psy9424 373 QGACGVNSLCNVLTHRKVCFCPRGFTGDP 401 (535)
Q Consensus 373 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 401 (535)
.+.|+.+|+|+++.++|+|+|++||.|++
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 46899999999999999999999999986
No 25
>KOG1836|consensus
Probab=97.00 E-value=0.048 Score=65.70 Aligned_cols=49 Identities=24% Similarity=0.558 Sum_probs=35.7
Q ss_pred ccCCCCCCCCcc--cCCCCCCCCCCCCeeccCC--CCceec-CCCCCCCCCCCC
Q psy9424 131 QCRSNDMGQMQC--RPACEGILCGRNALCTASD--HHATCS-CKPGYVGHPGPS 179 (535)
Q Consensus 131 ~C~~~~~G~~~c--~~~C~~~~C~~~g~C~~~~--~~~~C~-C~~Gf~g~~c~~ 179 (535)
.|.++|+|...- ...|..-+|...+.|..+. ....|. |++||+|..|+.
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence 566777776422 1227777888888887765 457898 999999999864
No 26
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.97 E-value=0.0003 Score=46.26 Aligned_cols=29 Identities=34% Similarity=0.657 Sum_probs=23.8
Q ss_pred CCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424 38 DEVCGRNAECAVVNHTPRCTCVAGTVGDP 66 (535)
Q Consensus 38 ~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~ 66 (535)
.+.|..+++|+++.++|.|+|++||+|+.
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 46799999999999999999999999984
No 27
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.85 E-value=0.0014 Score=43.28 Aligned_cols=35 Identities=29% Similarity=0.674 Sum_probs=30.1
Q ss_pred cCCCCC-CCCCCCCeeccCCCCceecCCCCCCCCCC
Q psy9424 143 RPACEG-ILCGRNALCTASDHHATCSCKPGYVGHPG 177 (535)
Q Consensus 143 ~~~C~~-~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c 177 (535)
+++|.. .+|.++++|+++.+.|.|.|++||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 466766 68988899999999999999999999764
No 28
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.75 E-value=0.0021 Score=41.75 Aligned_cols=30 Identities=27% Similarity=0.664 Sum_probs=26.7
Q ss_pred CCCCCCCCeeeeCCCCceeeCCCCCcCC-CC
Q psy9424 456 PAPCGPNAQCSVANHRPLCSCPAGLMGL-PS 485 (535)
Q Consensus 456 ~~~C~~~~~C~~~~g~~~C~C~~G~~G~-~c 485 (535)
..+|.+++.|++..+.|+|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 5678889999999999999999999998 44
No 29
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.69 E-value=0.0024 Score=41.64 Aligned_cols=28 Identities=32% Similarity=0.756 Sum_probs=24.7
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCcC-CCC
Q psy9424 457 APCGPNAQCSVANHRPLCSCPAGLMG-LPS 485 (535)
Q Consensus 457 ~~C~~~~~C~~~~g~~~C~C~~G~~G-~~c 485 (535)
.+|.++ +|++..++|+|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 578888 9999999999999999999 544
No 30
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.61 E-value=0.0021 Score=37.92 Aligned_cols=24 Identities=38% Similarity=0.836 Sum_probs=20.7
Q ss_pred CeeeeCCCCCccCCCCCCCCCCCC
Q psy9424 300 SYKCLCPLGLVGDPYGAGCVSASQ 323 (535)
Q Consensus 300 ~~~C~C~~Gy~g~~c~~~C~~~~~ 323 (535)
+|+|+|++||......++|++|+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 589999999998887888888875
No 31
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.34 E-value=0.005 Score=39.90 Aligned_cols=28 Identities=29% Similarity=0.780 Sum_probs=25.4
Q ss_pred CCCCCCCCeeccCCCCceecCCCCCCCC
Q psy9424 148 GILCGRNALCTASDHHATCSCKPGYVGH 175 (535)
Q Consensus 148 ~~~C~~~g~C~~~~~~~~C~C~~Gf~g~ 175 (535)
..+|.++++|+++.++|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4678889999999999999999999987
No 32
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.20 E-value=0.00096 Score=60.05 Aligned_cols=106 Identities=29% Similarity=0.663 Sum_probs=66.0
Q ss_pred CCCCCCCeeccCC-----CCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCC
Q psy9424 149 ILCGRNALCTASD-----HHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNS 223 (535)
Q Consensus 149 ~~C~~~g~C~~~~-----~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~ 223 (535)
.+|+..++|++.. ..|.|.|.+||+... ..|.+..|.+
T Consensus 50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~-------------------------------------~vCvp~~C~~ 92 (197)
T PF06247_consen 50 KPCGDYAKCINQANKGEERAYKCDCINGYILKQ-------------------------------------GVCVPNKCNN 92 (197)
T ss_dssp SEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS-------------------------------------SSEEEGGGSS
T ss_pred ccccchhhhhcCCCcccceeEEEecccCceeeC-------------------------------------CeEchhhcCc
Confidence 5677778887755 469999999999887 5677777764
Q ss_pred CCCCCCCCcccCCCcccCCCCCCCCCCCCccccCC---CCeeeeCCCCCccCCCCCCee--CCcCCCCCCCCCCeeecCC
Q psy9424 224 HADCSGDKVCEDHRCKISCLANNPCGPNALCSAEK---HKQICYCQPGYTGDAYFGCHL--IDFCAAKPCGPGARCDNSR 298 (535)
Q Consensus 224 ~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~---g~~~C~C~~G~~G~~~~~C~~--~~~C~~~~C~~~~~C~~~~ 298 (535)
..|. .|.|+..+ ....|+|.-|+.-+....|+. ..+|. -.|..+-.|....
T Consensus 93 ----------------------~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~ 148 (197)
T PF06247_consen 93 ----------------------KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVD 148 (197)
T ss_dssp -------------------------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEET
T ss_pred ----------------------eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeC
Confidence 3355 56886432 345899999998333334553 23443 3477778999999
Q ss_pred CCeeeeCCCCCccCCCC
Q psy9424 299 GSYKCLCPLGLVGDPYG 315 (535)
Q Consensus 299 g~~~C~C~~Gy~g~~c~ 315 (535)
+-|+|.+..||.+..-+
T Consensus 149 ~~Y~C~~~~~~~~~~~~ 165 (197)
T PF06247_consen 149 GYYKCVCKEGFPGDGEG 165 (197)
T ss_dssp TEEEEEE-TT-EEETTT
T ss_pred cEEEeecCCCCCCCCCc
Confidence 99999999999876544
No 33
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.20 E-value=0.0045 Score=36.52 Aligned_cols=24 Identities=21% Similarity=0.494 Sum_probs=18.7
Q ss_pred CceeeCCCCCcCCCCCCCCCCCCCCCCCCCCCCCc
Q psy9424 53 TPRCTCVAGTVGDPKYQSGVGTSCTSSRDCIGEQQ 87 (535)
Q Consensus 53 ~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~~~~~ 87 (535)
+|+|.|++||+.. .+...|.||+|
T Consensus 1 sy~C~C~~Gy~l~-----------~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLS-----------PDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCC-----------CCCCccccCCC
Confidence 5899999999976 33467888765
No 34
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.12 E-value=0.00087 Score=60.31 Aligned_cols=146 Identities=29% Similarity=0.653 Sum_probs=83.9
Q ss_pred CCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCCCCCCCCCCcc
Q psy9424 154 NALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNSHADCSGDKVC 233 (535)
Q Consensus 154 ~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C 233 (535)
+|..+...+.|.|.|.+||.... ..+|+...+|....
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~-----------------------------------------EntCE~kv~C~~~e-- 46 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKN-----------------------------------------ENTCEEKVECDKLE-- 46 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEE-----------------------------------------TTEEEE----SG-G--
T ss_pred CCEEEEccCceEEEcCCCcEEcc-----------------------------------------ccccccceecCccc--
Confidence 57777777889999999998765 12233233332100
Q ss_pred cCCCcccCCCCCCCCCCCCccccCC-----CCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCC---CCeeeeC
Q psy9424 234 EDHRCKISCLANNPCGPNALCSAEK-----HKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSR---GSYKCLC 305 (535)
Q Consensus 234 ~~~~c~~~c~~~~~C~~~~~C~~~~-----g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~---g~~~C~C 305 (535)
....+|...++|++.. ..|.|.|.+||+.... .|. .+.|....|. .|.|+..+ ....|+|
T Consensus 47 ---------~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-vCv-p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC 114 (197)
T PF06247_consen 47 ---------NVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG-VCV-PNKCNNKDCG-SGKCILDPDNPNNPTCSC 114 (197)
T ss_dssp ---------GTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-SEE-EGGGSS---T-TEEEEEEEGGGSEEEEEE
T ss_pred ---------ccCccccchhhhhcCCCcccceeEEEecccCceeeCC-eEc-hhhcCceecC-CCeEEecCCCCCCceeEe
Confidence 0136788899998765 4699999999997654 354 3566666677 56887433 3458999
Q ss_pred CCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCCCCCccCCCCccCcccCCCCcccCCCcCCCCCCCCCCCCCceeeec
Q psy9424 306 PLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTDGVPKCKASCQSDEECGLGEKCLQGQCNNPCERQGACGVNSLCNVL 385 (535)
Q Consensus 306 ~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~ 385 (535)
.-|+..+. .. .|+. +|.-.| + -.|..+-.|..+
T Consensus 115 ~IGkV~~d-n~------------------kCtk-~G~T~C-------------------------~--LKCk~nE~CK~~ 147 (197)
T PF06247_consen 115 NIGKVPDD-NK------------------KCTK-TGETKC-------------------------S--LKCKENEECKLV 147 (197)
T ss_dssp -TEEETTT-TT------------------ESEE-EE-----------------------------------TTTEEEEEE
T ss_pred eeceEecc-CC------------------cccC-CCccce-------------------------e--eecCCCcceeee
Confidence 99998221 11 1211 011111 1 134456678889
Q ss_pred CCCeeeeCCCCCcCCC
Q psy9424 386 THRKVCFCPRGFTGDP 401 (535)
Q Consensus 386 ~g~~~C~C~~G~~g~~ 401 (535)
.+-|+|++.+||.++.
T Consensus 148 ~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 148 DGYYKCVCKEGFPGDG 163 (197)
T ss_dssp TTEEEEEE-TT-EEET
T ss_pred CcEEEeecCCCCCCCC
Confidence 9999999999998775
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.07 E-value=0.0079 Score=39.09 Aligned_cols=28 Identities=36% Similarity=0.780 Sum_probs=24.2
Q ss_pred CCCCCCCeeccCCCCceecCCCCCCC-CCC
Q psy9424 149 ILCGRNALCTASDHHATCSCKPGYVG-HPG 177 (535)
Q Consensus 149 ~~C~~~g~C~~~~~~~~C~C~~Gf~g-~~c 177 (535)
.+|..+ +|+++.++|+|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 578777 9999999999999999999 553
No 36
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long, found in the sequence of epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=95.90 E-value=0.009 Score=38.14 Aligned_cols=26 Identities=27% Similarity=0.756 Sum_probs=22.4
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424 39 EVCGRNAECAVVNHTPRCTCVAGTVGDP 66 (535)
Q Consensus 39 ~~C~~~g~C~~~~~~~~C~C~~Gf~G~~ 66 (535)
..|+++|+|+.. .++|+|.+||+|..
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPD 31 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCC
Confidence 469999999976 56899999999984
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long, found in the sequence of epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=95.77 E-value=0.012 Score=37.62 Aligned_cols=27 Identities=22% Similarity=0.564 Sum_probs=22.8
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCcCCCC
Q psy9424 457 APCGPNAQCSVANHRPLCSCPAGLMGLPS 485 (535)
Q Consensus 457 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~c 485 (535)
..|+++|+|+...+ +|+|.+||+|..|
T Consensus 6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPCG--RCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence 46899999997644 9999999999875
No 38
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.08 E-value=0.02 Score=28.59 Aligned_cols=13 Identities=38% Similarity=0.989 Sum_probs=10.4
Q ss_pred eeeCCCCCcCCCC
Q psy9424 473 LCSCPAGLMGLPS 485 (535)
Q Consensus 473 ~C~C~~G~~G~~c 485 (535)
+|+|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999875
No 39
>smart00051 DSL delta serrate ligand.
Probab=94.07 E-value=0.064 Score=40.18 Aligned_cols=45 Identities=13% Similarity=0.229 Sum_probs=34.4
Q ss_pred CccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424 14 HLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDP 66 (535)
Q Consensus 14 ~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~ 66 (535)
.-.|.++|.|..|.. .|...+....+.+|.. .+.++|.+||+|..
T Consensus 18 rv~C~~~~yG~~C~~-----~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~ 62 (63)
T smart00051 18 RVTCDENYYGEGCNK-----FCRPRDDFFGHYTCDE---NGNKGCLEGWMGPY 62 (63)
T ss_pred EeeCCCCCcCCccCC-----EeCcCccccCCccCCc---CCCEecCCCCcCCC
Confidence 456889999999976 5654455677788854 35799999999984
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.20 E-value=0.083 Score=34.69 Aligned_cols=21 Identities=43% Similarity=0.885 Sum_probs=10.1
Q ss_pred eeecCCCCeeeeCCCCCccCC
Q psy9424 293 RCDNSRGSYKCLCPLGLVGDP 313 (535)
Q Consensus 293 ~C~~~~g~~~C~C~~Gy~g~~ 313 (535)
+|++.+++|+|.|++||+...
T Consensus 11 ~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp EEEEETTSEEEE-STTEEE-T
T ss_pred CCccCCCceEeECCCCCEECc
Confidence 455555555555555555444
No 41
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.12 E-value=0.08 Score=34.77 Aligned_cols=25 Identities=24% Similarity=0.497 Sum_probs=19.4
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCcCC
Q psy9424 39 EVCGRNAECAVVNHTPRCTCVAGTVGD 65 (535)
Q Consensus 39 ~~C~~~g~C~~~~~~~~C~C~~Gf~G~ 65 (535)
+.|++ .|+++.++|+|.|++||+..
T Consensus 6 GgC~h--~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 6 GGCSH--ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp GGSSS--EEEEETTSEEEE-STTEEE-
T ss_pred CCcCC--CCccCCCceEeECCCCCEEC
Confidence 34554 79999999999999999876
No 42
>KOG1836|consensus
Probab=90.78 E-value=0.46 Score=57.70 Aligned_cols=53 Identities=26% Similarity=0.574 Sum_probs=38.8
Q ss_pred ee-eCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCC--CCeeee-CCCCCccCCCCC
Q psy9424 262 IC-YCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSR--GSYKCL-CPLGLVGDPYGA 316 (535)
Q Consensus 262 ~C-~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~--g~~~C~-C~~Gy~g~~c~~ 316 (535)
+| +|..||.|.+-. -....|.+-+|.+++.|.... ....|+ |++||+|..|+.
T Consensus 757 ~C~~C~~GfYg~~~~--~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 757 QCAQCVDGFYGLPDL--GTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred chhhhcCCCCCcccc--CCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence 56 899999987532 112338777888888776654 567898 999999998773
No 43
>smart00051 DSL delta serrate ligand.
Probab=82.08 E-value=1.8 Score=32.43 Aligned_cols=43 Identities=19% Similarity=0.305 Sum_probs=28.0
Q ss_pred ccCCCCCCCCcccCCCCC-CCCCCCCeeccCCCCceecCCCCCCCCCC
Q psy9424 131 QCRSNDMGQMQCRPACEG-ILCGRNALCTASDHHATCSCKPGYVGHPG 177 (535)
Q Consensus 131 ~C~~~~~G~~~c~~~C~~-~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c 177 (535)
.|.++|.|..+ ...|.. +....+.+|.. . ..++|.+||+|+.|
T Consensus 20 ~C~~~~yG~~C-~~~C~~~~d~~~~~~Cd~-~--G~~~C~~Gw~G~~C 63 (63)
T smart00051 20 TCDENYYGEGC-NKFCRPRDDFFGHYTCDE-N--GNKGCLEGWMGPYC 63 (63)
T ss_pred eCCCCCcCCcc-CCEeCcCccccCCccCCc-C--CCEecCCCCcCCCC
Confidence 56677777763 345543 33456777754 2 36889999999863
No 44
>PHA02887 EGF-like protein; Provisional
Probab=81.97 E-value=1.4 Score=36.79 Aligned_cols=36 Identities=25% Similarity=0.516 Sum_probs=28.2
Q ss_pred CCCCCC--CCCCCCCCeeeecC--CCceeeCCCCCcCCCCC
Q psy9424 32 RNPCEA--DEVCGRNAECAVVN--HTPRCTCVAGTVGDPKY 68 (535)
Q Consensus 32 ~d~C~~--~~~C~~~g~C~~~~--~~~~C~C~~Gf~G~~c~ 68 (535)
..+|.. .+-|- ||+|.-.. ..+.|.|++||+|.+|.
T Consensus 83 f~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 83 FEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred ccccChHhhCEee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 667775 56788 57997754 46899999999999654
No 45
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=78.25 E-value=0.72 Score=30.26 Aligned_cols=31 Identities=32% Similarity=0.576 Sum_probs=21.9
Q ss_pred CCCCCCCCCCeeccCC-CCceecCCCCCCCCC
Q psy9424 146 CEGILCGRNALCTASD-HHATCSCKPGYVGHP 176 (535)
Q Consensus 146 C~~~~C~~~g~C~~~~-~~~~C~C~~Gf~g~~ 176 (535)
|....|..|+.|++.. |+++|.|..||....
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 4446788899999876 899999999998655
No 46
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=77.81 E-value=2.3 Score=40.61 Aligned_cols=38 Identities=26% Similarity=0.543 Sum_probs=29.5
Q ss_pred CCCcCCCCCcCCCC--CCCCCCCCeeeeCCCCceeeCCCCCcCCC
Q psy9424 442 ALSCRSAECVDPCS--PAPCGPNAQCSVANHRPLCSCPAGLMGLP 484 (535)
Q Consensus 442 g~~C~~~~c~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~G~~ 484 (535)
+..|.+ +++|. +++|. ..|.++.|+|.|.|++||+...
T Consensus 181 ~~~C~~---~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 181 GKICVV---PDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cccCcC---chhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence 455655 67786 45565 5799999999999999998754
No 47
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=76.59 E-value=2.8 Score=40.04 Aligned_cols=39 Identities=28% Similarity=0.433 Sum_probs=29.4
Q ss_pred CCeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCC
Q psy9424 276 GCHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPY 314 (535)
Q Consensus 276 ~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c 314 (535)
.|+++++|...+......|.+..|+|.|.|++||+....
T Consensus 183 ~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~ 221 (224)
T cd01475 183 ICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLED 221 (224)
T ss_pred cCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCC
Confidence 577788886533222358999999999999999987543
No 48
>KOG1218|consensus
Probab=71.64 E-value=30 Score=34.56 Aligned_cols=56 Identities=21% Similarity=0.585 Sum_probs=27.5
Q ss_pred cccccchhccCCCccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424 2 CREQVQWQQISQHLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDP 66 (535)
Q Consensus 2 ~~~~~~~~~~~~~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~ 66 (535)
|+.++..+...+.+. ..+|.|..|.. +.+|... |.. -+|.+... .|.+..+|.+..
T Consensus 81 c~~~~~~~~~~~~~~-~~~~~g~~C~~---~~~~~~~--c~~-~~C~~~~~--~c~~~~~~~~~~ 136 (316)
T KOG1218|consen 81 CKNGGTCVSSTGYCH-LNGYEGPQCES---PCPCGDG--CAE-KTCANPRR--ECRCGGGYIGEQ 136 (316)
T ss_pred cCCCCcccCCCCccc-CCCCCcccccC---CCCcCCc--ccc-cccCCCcc--ceecCCcCcccc
Confidence 344555555555554 56666666665 3333211 222 33443321 466666666553
No 49
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=71.14 E-value=4.2 Score=34.08 Aligned_cols=32 Identities=31% Similarity=0.924 Sum_probs=26.1
Q ss_pred CCCCCCCCCCCCCCeeeecCCCceeeCCCCCcC
Q psy9424 32 RNPCEADEVCGRNAECAVVNHTPRCTCVAGTVG 64 (535)
Q Consensus 32 ~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G 64 (535)
.|+|...+.|+.+|.|.. .....|.|.+||.-
T Consensus 77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred ccCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence 568877789999999954 45678999999974
No 50
>PHA02887 EGF-like protein; Provisional
Probab=69.52 E-value=4.8 Score=33.65 Aligned_cols=30 Identities=23% Similarity=0.540 Sum_probs=24.0
Q ss_pred CCCCCCCCeeeeC--CCCceeeCCCCCcCCCCC
Q psy9424 456 PAPCGPNAQCSVA--NHRPLCSCPAGLMGLPSA 486 (535)
Q Consensus 456 ~~~C~~~~~C~~~--~g~~~C~C~~G~~G~~c~ 486 (535)
.+.|- +|+|... ...+.|.|+.||+|..|+
T Consensus 91 k~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 91 NDFCI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred hCEee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 45676 5799765 356899999999999986
No 51
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=69.48 E-value=4.7 Score=34.26 Aligned_cols=36 Identities=22% Similarity=0.441 Sum_probs=27.5
Q ss_pred CCCCCC--CCCCCCCCeeeecC--CCceeeCCCCCcCCCCC
Q psy9424 32 RNPCEA--DEVCGRNAECAVVN--HTPRCTCVAGTVGDPKY 68 (535)
Q Consensus 32 ~d~C~~--~~~C~~~g~C~~~~--~~~~C~C~~Gf~G~~c~ 68 (535)
+-+|.. .+-|-++ +|.-.. ..+.|.|..||+|.+|+
T Consensus 42 i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 42 IRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred cccCChhhCCEeECC-EEEeeccCCCceeECCCCccccccc
Confidence 556664 5678764 997754 58899999999999654
No 52
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=67.74 E-value=2.3 Score=27.92 Aligned_cols=29 Identities=24% Similarity=0.462 Sum_probs=20.6
Q ss_pred CCCCCCCCeeeeCC-CCceeeCCCCCcCCC
Q psy9424 456 PAPCGPNAQCSVAN-HRPLCSCPAGLMGLP 484 (535)
Q Consensus 456 ~~~C~~~~~C~~~~-g~~~C~C~~G~~G~~ 484 (535)
...|..++.|++.. |++.|.|..||....
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 45677889998876 999999999997643
No 53
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=66.04 E-value=7 Score=27.57 Aligned_cols=17 Identities=29% Similarity=0.612 Sum_probs=14.4
Q ss_pred eeeCCCCCcCCCCCCCC
Q psy9424 390 VCFCPRGFTGDPETECV 406 (535)
Q Consensus 390 ~C~C~~G~~g~~~~~C~ 406 (535)
+|.|+++|+|..++.|.
T Consensus 20 ~C~C~~~~~G~~C~~C~ 36 (50)
T cd00055 20 QCECKPNTTGRRCDRCA 36 (50)
T ss_pred EEeCCCcCCCCCCCCCC
Confidence 89999999999876553
No 54
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. [Schematic: topology of the four disulphide bonds in the LE domain.] Consensus: xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx, where 'C' is a conserved cysteine involved in a disulphide bond, 'a' a conserved aromatic residue, and 'G' a conserved glycine (lower case = less conserved); the N-terminal portion is the region similar to the EGF-like domain. In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=65.15 E-value=3.8 Score=28.73 Aligned_cols=26 Identities=31% Similarity=0.634 Sum_probs=18.1
Q ss_pred ceeeecCCCeeeeCCCCCcCCCCCCCCc
Q psy9424 380 SLCNVLTHRKVCFCPRGFTGDPETECVR 407 (535)
Q Consensus 380 ~~C~~~~g~~~C~C~~G~~g~~~~~C~~ 407 (535)
.+|....| +|.|+++|+|..+++|.+
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~C~~ 36 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQCKP 36 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-EE-T
T ss_pred CcccCCCC--EEeccccccCCcCcCCCC
Confidence 35665444 999999999999776543
No 55
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=64.95 E-value=6.4 Score=33.50 Aligned_cols=30 Identities=23% Similarity=0.536 Sum_probs=24.3
Q ss_pred CCCCCCCCeeeeCC--CCceeeCCCCCcCCCCC
Q psy9424 456 PAPCGPNAQCSVAN--HRPLCSCPAGLMGLPSA 486 (535)
Q Consensus 456 ~~~C~~~~~C~~~~--g~~~C~C~~G~~G~~c~ 486 (535)
.+-|-++ +|.... ..+.|.|..||+|..|+
T Consensus 50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred CCEeECC-EEEeeccCCCceeECCCCccccccc
Confidence 4567764 897653 68999999999999997
No 56
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=64.91 E-value=8.5 Score=27.11 Aligned_cols=21 Identities=24% Similarity=0.566 Sum_probs=16.6
Q ss_pred eeeeCCCCceeeCCCCCcCCCCC
Q psy9424 464 QCSVANHRPLCSCPAGLMGLPSA 486 (535)
Q Consensus 464 ~C~~~~g~~~C~C~~G~~G~~c~ 486 (535)
.|....| +|.|+++|.|..|+
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~ 33 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCD 33 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCC
Confidence 3655555 89999999999985
No 57
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=63.87 E-value=7.8 Score=32.41 Aligned_cols=32 Identities=34% Similarity=0.864 Sum_probs=24.2
Q ss_pred CCCCCCCCCCCCceeeecCCCeeeeCCCCCcCC
Q psy9424 368 NPCERQGACGVNSLCNVLTHRKVCFCPRGFTGD 400 (535)
Q Consensus 368 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 400 (535)
+.|...+.|++++.|.. .....|.|++||...
T Consensus 78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 35555789999999954 346689999999753
No 58
>KOG1218|consensus
Probab=63.40 E-value=1.6e+02 Score=29.18 Aligned_cols=40 Identities=30% Similarity=0.650 Sum_probs=25.8
Q ss_pred eeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCCCCCc
Q psy9424 302 KCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTDGVPK 344 (535)
Q Consensus 302 ~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~ 344 (535)
.|.|.+||.+..+...+.. |.....+.+++.|....+...
T Consensus 163 ~c~c~~g~~g~~~~~~~~~---c~~~~~~~~g~~C~~~~~~~~ 202 (316)
T KOG1218|consen 163 ICTCQPGFVGVFCVESCSG---CSPLTACENGAKCNRSTGSCL 202 (316)
T ss_pred ceeccCCcccccccccCCC---cCCCcccCCCCeeeccccccc
Confidence 6889999999887743221 544456666667776555433
No 59
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. [Schematic: topology of the four disulphide bonds in the LE domain.] Consensus: xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx, where 'C' is a conserved cysteine involved in a disulphide bond, 'a' a conserved aromatic residue, and 'G' a conserved glycine (lower case = less conserved); the N-terminal portion is the region similar to the EGF-like domain. In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=62.08 E-value=5.7 Score=27.83 Aligned_cols=22 Identities=23% Similarity=0.579 Sum_probs=18.0
Q ss_pred CeeeeCCCCceeeCCCCCcCCCCC
Q psy9424 463 AQCSVANHRPLCSCPAGLMGLPSA 486 (535)
Q Consensus 463 ~~C~~~~g~~~C~C~~G~~G~~c~ 486 (535)
.+|....+ +|+|+++|+|..|+
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCD 32 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-
T ss_pred CcccCCCC--EEeccccccCCcCc
Confidence 47877666 99999999999995
No 60
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=55.24 E-value=2.2 Score=31.88 Aligned_cols=48 Identities=13% Similarity=0.203 Sum_probs=21.0
Q ss_pred cCCCccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424 11 ISQHLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDP 66 (535)
Q Consensus 11 ~~~~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~ 66 (535)
+.-...|.+.|.|..|.. .|.....=..+-+|... +.=+|.+||+|..
T Consensus 15 ~~~rv~C~~nyyG~~C~~-----~C~~~~d~~ghy~Cd~~---G~~~C~~Gw~G~~ 62 (63)
T PF01414_consen 15 YRIRVVCDENYYGPNCSK-----FCKPRDDSFGHYTCDSN---GNKVCLPGWTGPN 62 (63)
T ss_dssp --------TTEETTTT-E-----E---EEETTEEEEE-SS-----EEE-TTEESTT
T ss_pred EEEEEECCCCCCCccccC-----CcCCCcCCcCCcccCCC---CCCCCCCCCcCCC
Confidence 344578899999999986 55422111223355532 3558999999984
No 61
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=52.60 E-value=15 Score=25.47 Aligned_cols=16 Identities=31% Similarity=0.696 Sum_probs=13.6
Q ss_pred eeeCCCCCcCCCCCCC
Q psy9424 390 VCFCPRGFTGDPETEC 405 (535)
Q Consensus 390 ~C~C~~G~~g~~~~~C 405 (535)
+|.|+++|+|..++.|
T Consensus 19 ~C~C~~~~~G~~C~~C 34 (46)
T smart00180 19 QCECKPNVTGRRCDRC 34 (46)
T ss_pred EEECCCCCCCCCCCcC
Confidence 8999999999886644
No 62
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=47.92 E-value=23 Score=24.96 Aligned_cols=24 Identities=25% Similarity=0.714 Sum_probs=17.8
Q ss_pred CCCCCCCCeeeeCCCCceeeCCCCCcCC
Q psy9424 456 PAPCGPNAQCSVANHRPLCSCPAGLMGL 483 (535)
Q Consensus 456 ~~~C~~~~~C~~~~g~~~C~C~~G~~G~ 483 (535)
...|..++.|++. +|+|++||+-.
T Consensus 25 ~~qC~~~s~C~~g----~C~C~~g~~~~ 48 (52)
T PF01683_consen 25 DEQCIGGSVCVNG----RCQCPPGYVEV 48 (52)
T ss_pred cCCCCCcCEEcCC----EeECCCCCEec
Confidence 4456678889653 99999998643
No 63
>KOG3516|consensus
Probab=46.52 E-value=15 Score=42.77 Aligned_cols=36 Identities=31% Similarity=0.707 Sum_probs=33.3
Q ss_pred cCCCCCCCCCCCCeeeeCCCCceeeCC-CCCcCCCCC
Q psy9424 451 VDPCSPAPCGPNAQCSVANHRPLCSCP-AGLMGLPSA 486 (535)
Q Consensus 451 ~d~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~G~~c~ 486 (535)
+|.|.+++|.++|.|.-....|.|.|. .||.|..|.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH 581 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCH 581 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccccccc
Confidence 688889999999999988889999998 899999986
No 64
>KOG3516|consensus
Probab=46.03 E-value=15 Score=42.64 Aligned_cols=36 Identities=19% Similarity=0.480 Sum_probs=31.9
Q ss_pred CCCCCCCCCCCCCCeeeecCCCceeeCC-CCCcCCCCC
Q psy9424 32 RNPCEADEVCGRNAECAVVNHTPRCTCV-AGTVGDPKY 68 (535)
Q Consensus 32 ~d~C~~~~~C~~~g~C~~~~~~~~C~C~-~Gf~G~~c~ 68 (535)
+|.|. +++|.++|.|.-....|.|.|. .||+|..|.
T Consensus 545 ~drCl-PN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH 581 (1306)
T KOG3516|consen 545 SDRCL-PNPCEHGGKCSQSWDDFECNCELTGYKGATCH 581 (1306)
T ss_pred ccccC-CccccCCCcccccccceeEecccccccccccc
Confidence 67777 8999999999998889999998 999999654
No 65
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=44.94 E-value=24 Score=24.88 Aligned_cols=24 Identities=38% Similarity=0.682 Sum_probs=17.6
Q ss_pred CCCCCCCCeeeecCCCceeeCCCCCcCC
Q psy9424 38 DEVCGRNAECAVVNHTPRCTCVAGTVGD 65 (535)
Q Consensus 38 ~~~C~~~g~C~~~~~~~~C~C~~Gf~G~ 65 (535)
...|..++.|++. +|+|++||+-.
T Consensus 25 ~~qC~~~s~C~~g----~C~C~~g~~~~ 48 (52)
T PF01683_consen 25 DEQCIGGSVCVNG----RCQCPPGYVEV 48 (52)
T ss_pred cCCCCCcCEEcCC----EeECCCCCEec
Confidence 3456677888653 89999999744
No 66
>KOG3512|consensus
Probab=41.34 E-value=78 Score=33.31 Aligned_cols=28 Identities=21% Similarity=0.383 Sum_probs=20.4
Q ss_pred ceeeecCCC-eeeeCCCCCcCCCCCCCCc
Q psy9424 380 SLCNVLTHR-KVCFCPRGFTGDPETECVR 407 (535)
Q Consensus 380 ~~C~~~~g~-~~C~C~~G~~g~~~~~C~~ 407 (535)
+.|+-...+ ++|.|..+-+|..|..|.+
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKp 313 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKP 313 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccc
Confidence 457665544 8999999999988755543
No 67
>KOG3514|consensus
Probab=39.74 E-value=20 Score=41.27 Aligned_cols=34 Identities=29% Similarity=0.766 Sum_probs=31.3
Q ss_pred CCCCCCCCCCCeeeeCCCCceeeCC-CCCcCCCCC
Q psy9424 453 PCSPAPCGPNAQCSVANHRPLCSCP-AGLMGLPSA 486 (535)
Q Consensus 453 ~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~G~~c~ 486 (535)
.|.++||.|+|.|.....+|.|.|. .||.|..|+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence 6889999999999999999999997 599999887
No 68
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=33.88 E-value=47 Score=21.40 Aligned_cols=13 Identities=46% Similarity=1.186 Sum_probs=11.0
Q ss_pred eeeeCCCCCcCCC
Q psy9424 389 KVCFCPRGFTGDP 401 (535)
Q Consensus 389 ~~C~C~~G~~g~~ 401 (535)
+.|.|++||..+.
T Consensus 18 ~~C~CPeGyIlde 30 (34)
T PF09064_consen 18 GQCFCPEGYILDE 30 (34)
T ss_pred CceeCCCceEecC
Confidence 3899999998775
No 69
>KOG3514|consensus
Probab=32.80 E-value=30 Score=39.92 Aligned_cols=36 Identities=22% Similarity=0.591 Sum_probs=32.8
Q ss_pred CCCCCCCCCCCCeeccCCCCceecCC-CCCCCCCCCC
Q psy9424 144 PACEGILCGRNALCTASDHHATCSCK-PGYVGHPGPS 179 (535)
Q Consensus 144 ~~C~~~~C~~~g~C~~~~~~~~C~C~-~Gf~g~~c~~ 179 (535)
..|.++||.++|+|....+.|.|.|. .||.|..|+.
T Consensus 624 ~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 624 KICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred cccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 47888999999999999999999995 8999999985
No 70
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=29.71 E-value=20 Score=29.56 Aligned_cols=32 Identities=22% Similarity=0.563 Sum_probs=24.0
Q ss_pred CCCCC-CCCCCCCCeeeecC-----CCceeeCCCCCcC
Q psy9424 33 NPCEA-DEVCGRNAECAVVN-----HTPRCTCVAGTVG 64 (535)
Q Consensus 33 d~C~~-~~~C~~~g~C~~~~-----~~~~C~C~~Gf~G 64 (535)
+.|.. .+.|+.||.|+... .=|.|.|.+.+..
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 35554 67899999999872 4588999996653
No 71
>KOG3512|consensus
Probab=26.41 E-value=2.8e+02 Score=29.46 Aligned_cols=27 Identities=19% Similarity=0.421 Sum_probs=20.7
Q ss_pred eeccCCCC-ceecCCCCCCCCCCCCCCC
Q psy9424 156 LCTASDHH-ATCSCKPGYVGHPGPSMGT 182 (535)
Q Consensus 156 ~C~~~~~~-~~C~C~~Gf~g~~c~~~~~ 182 (535)
+|+....+ .+|.|+..-.|+.|+.+.+
T Consensus 286 ~Cv~d~~~~ltCdC~HNTaGPdCgrCKp 313 (592)
T KOG3512|consen 286 RCVMDESSHLTCDCEHNTAGPDCGRCKP 313 (592)
T ss_pred eeeeccCCceEEecccCCCCCCcccccc
Confidence 57765544 9999999999999875433
No 72
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system.; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=20.85 E-value=40 Score=24.26 Aligned_cols=31 Identities=19% Similarity=0.517 Sum_probs=17.7
Q ss_pred CCCCCCCCeeee----CCCCceeeCCCCCcCCCCC
Q psy9424 456 PAPCGPNAQCSV----ANHRPLCSCPAGLMGLPSA 486 (535)
Q Consensus 456 ~~~C~~~~~C~~----~~g~~~C~C~~G~~G~~c~ 486 (535)
..+|+.||+-.. ..|...|.|..-|.|..|+
T Consensus 16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 16 AISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS 50 (56)
T ss_dssp TS--TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred cCCcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence 346777777642 3567899999999999986
Done!