Query psy18237
Match_columns 1050
No_of_seqs 580 out of 3591
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 17:11:55 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18237.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18237hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.8 7.5E-18 1.6E-22 193.0 26.8 253 478-844 638-911 (1289)
2 KOG1214|consensus 99.6 1E-15 2.2E-20 175.7 12.6 157 86-350 704-866 (1289)
3 KOG1217|consensus 99.6 5E-14 1.1E-18 168.9 23.2 245 715-1022 178-436 (487)
4 KOG4289|consensus 99.5 1.1E-13 2.3E-18 165.9 18.2 90 165-337 1216-1308(2531)
5 KOG1217|consensus 99.5 4.4E-13 9.5E-18 160.8 23.2 259 550-932 150-421 (487)
6 KOG0994|consensus 99.5 3.5E-13 7.7E-18 159.4 15.9 329 460-896 780-1156(1758)
7 KOG4289|consensus 99.4 1.7E-12 3.6E-17 155.9 16.8 97 549-686 1219-1317(2531)
8 KOG0994|consensus 99.0 5.1E-09 1.1E-13 125.0 14.7 169 766-943 864-1054(1758)
9 KOG1219|consensus 99.0 9.5E-10 2.1E-14 137.4 8.4 110 899-1050 3858-3973(4289)
10 KOG4260|consensus 99.0 7.3E-10 1.6E-14 113.9 6.0 166 67-340 130-305 (350)
11 KOG1219|consensus 99.0 1.1E-09 2.4E-14 136.8 8.6 107 863-1012 3865-3976(4289)
12 KOG4260|consensus 98.8 2.6E-09 5.7E-14 109.9 5.5 167 674-894 132-305 (350)
13 PF07645 EGF_CA: Calcium-bindi 98.5 1E-07 2.2E-12 73.1 3.9 40 307-346 1-41 (42)
14 PF07645 EGF_CA: Calcium-bindi 98.4 1.8E-07 3.9E-12 71.7 2.3 39 148-186 1-40 (42)
15 KOG1225|consensus 98.1 8.2E-06 1.8E-10 95.3 10.5 73 831-938 234-308 (525)
16 KOG1225|consensus 98.1 2.1E-05 4.6E-10 91.9 12.2 98 669-842 264-364 (525)
17 PF12662 cEGF: Complement Clr- 97.8 1.2E-05 2.6E-10 52.8 2.5 24 170-193 1-24 (24)
18 PF12947 EGF_3: EGF domain; I 97.7 2E-05 4.3E-10 57.9 1.8 35 311-347 1-36 (36)
19 PF12947 EGF_3: EGF domain; I 97.6 3E-05 6.4E-10 56.9 1.5 31 237-267 1-32 (36)
20 PF12662 cEGF: Complement Clr- 97.5 6.7E-05 1.4E-09 49.3 2.4 24 994-1020 1-24 (24)
21 PF14670 FXa_inhibition: Coagu 97.5 7.5E-05 1.6E-09 54.7 2.9 34 313-347 3-36 (36)
22 PF06247 Plasmod_Pvs28: Plasmo 97.5 0.00014 3.1E-09 72.3 5.5 144 775-939 6-164 (197)
23 KOG1836|consensus 97.5 0.002 4.3E-08 85.4 17.5 240 665-943 740-1024(1705)
24 PF06247 Plasmod_Pvs28: Plasmo 97.4 0.00015 3.3E-09 72.2 3.9 146 585-842 6-162 (197)
25 smart00179 EGF_CA Calcium-bind 97.3 0.00034 7.4E-09 52.4 4.0 33 307-340 1-35 (39)
26 PF14670 FXa_inhibition: Coagu 97.2 0.0003 6.5E-09 51.6 3.0 31 585-616 6-36 (36)
27 smart00179 EGF_CA Calcium-bind 97.1 0.00066 1.4E-08 50.8 4.1 37 148-188 1-38 (39)
28 PF00683 TB: TB domain; Inter 97.1 1.9E-05 4.2E-10 59.9 -4.4 40 363-402 2-41 (42)
29 PF00008 EGF: EGF-like domain 97.0 0.00053 1.1E-08 49.1 2.3 28 713-740 3-31 (32)
30 KOG1226|consensus 96.7 0.0088 1.9E-07 71.7 10.6 96 832-940 479-580 (783)
31 PF00008 EGF: EGF-like domain 96.6 0.0012 2.6E-08 47.3 1.9 26 316-341 4-31 (32)
32 KOG1226|consensus 96.5 0.015 3.2E-07 69.9 11.3 49 553-617 526-580 (783)
33 cd00054 EGF_CA Calcium-binding 96.4 0.0035 7.6E-08 46.3 3.5 33 307-340 1-34 (38)
34 cd00054 EGF_CA Calcium-binding 96.4 0.0045 9.8E-08 45.7 3.8 33 148-181 1-34 (38)
35 KOG1836|consensus 96.0 0.089 1.9E-06 70.4 15.6 112 782-898 749-881 (1705)
36 smart00181 EGF Epidermal growt 95.3 0.024 5.1E-07 41.3 3.8 25 157-181 6-30 (35)
37 smart00181 EGF Epidermal growt 95.2 0.021 4.5E-07 41.6 3.3 24 585-608 6-29 (35)
38 cd00053 EGF Epidermal growth f 95.2 0.022 4.8E-07 41.2 3.5 27 315-341 5-32 (36)
39 cd00053 EGF Epidermal growth f 95.1 0.03 6.5E-07 40.5 3.8 27 156-182 5-32 (36)
40 cd01475 vWA_Matrilin VWA_Matri 94.4 0.037 8E-07 59.5 4.1 44 301-345 180-223 (224)
41 cd01475 vWA_Matrilin VWA_Matri 93.0 0.084 1.8E-06 56.7 3.8 43 571-614 181-223 (224)
42 PF00683 TB: TB domain; Inter 93.0 0.0045 9.8E-08 47.2 -4.3 39 658-696 2-40 (42)
43 smart00051 DSL delta serrate l 92.4 0.25 5.5E-06 41.4 4.9 38 68-108 20-61 (63)
44 PF07974 EGF_2: EGF-like domai 89.5 0.43 9.3E-06 34.1 3.1 23 243-267 7-30 (32)
45 PF07974 EGF_2: EGF-like domai 88.9 0.56 1.2E-05 33.6 3.3 23 157-181 6-29 (32)
46 PF12661 hEGF: Human growth fa 86.9 0.47 1E-05 26.6 1.5 13 600-616 1-13 (13)
47 PF12946 EGF_MSP1_1: MSP1 EGF 86.4 0.25 5.4E-06 36.3 0.3 26 157-182 5-32 (37)
48 smart00051 DSL delta serrate l 82.8 1.2 2.5E-05 37.5 2.8 45 554-616 19-63 (63)
49 PF12946 EGF_MSP1_1: MSP1 EGF 74.9 2.6 5.6E-05 31.1 2.2 25 914-938 7-33 (37)
50 PHA02887 EGF-like protein; Pro 72.6 3 6.4E-05 38.8 2.6 39 577-619 83-124 (126)
51 KOG1218|consensus 72.5 68 0.0015 35.9 14.4 44 790-840 163-208 (316)
52 PHA03099 epidermal growth fact 70.9 2.9 6.4E-05 39.5 2.2 39 577-619 42-83 (139)
53 KOG3512|consensus 59.1 25 0.00055 40.7 7.0 122 720-851 285-434 (592)
54 PHA02887 EGF-like protein; Pro 56.8 9 0.00019 35.8 2.5 32 156-191 91-124 (126)
55 KOG3512|consensus 56.5 28 0.0006 40.3 6.8 116 822-943 285-431 (592)
56 PHA03099 epidermal growth fact 55.5 8.7 0.00019 36.4 2.3 41 147-191 40-83 (139)
57 cd00055 EGF_Lam Laminin-type e 44.7 30 0.00065 27.4 3.6 30 823-854 13-42 (50)
58 PF01683 EB: EB module; Inter 43.7 36 0.00077 27.1 3.9 30 803-842 19-48 (52)
59 PF01414 DSL: Delta serrate li 40.9 9.6 0.00021 32.0 0.2 40 66-108 18-61 (63)
60 KOG1218|consensus 40.2 87 0.0019 35.1 7.9 63 67-136 135-199 (316)
61 PF09064 Tme5_EGF_like: Thromb 39.6 26 0.00056 25.4 2.1 22 590-612 10-31 (34)
62 PF03302 VSP: Giardia variant- 39.3 80 0.0017 37.0 7.5 52 887-943 3-55 (397)
63 PF00053 Laminin_EGF: Laminin 38.0 22 0.00049 27.9 1.9 31 822-854 11-41 (49)
64 PF01683 EB: EB module; Inter 37.9 53 0.0011 26.1 4.0 20 1026-1049 26-46 (52)
65 PF00954 S_locus_glycop: S-loc 34.0 33 0.00072 32.1 2.7 30 974-1004 77-107 (110)
66 smart00180 EGF_Lam Laminin-typ 31.3 71 0.0015 24.8 3.6 29 823-853 12-40 (46)
67 PF00954 S_locus_glycop: S-loc 27.4 50 0.0011 30.9 2.6 31 308-340 77-108 (110)
68 PTZ00214 high cysteine membran 25.0 2.4E+02 0.0051 36.3 8.6 38 256-294 682-722 (800)
69 KOG3516|consensus 22.0 64 0.0014 41.9 2.8 30 156-189 550-581 (1306)
70 KOG4291|consensus 21.5 2.7E+02 0.0058 36.8 8.2 156 99-340 369-533 (1043)
No 1
>KOG1214|consensus
Probab=99.80 E-value=7.5e-18 Score=192.95 Aligned_cols=253 Identities=24% Similarity=0.446 Sum_probs=165.1
Q ss_pred ceeEeeCCCCccCCCCCCCCCCCCCCccccchhcccCCCccC--ccccccccCCCcceeee--ccccCCccccccCC-cc
Q psy18237 478 RMDCCCTMGMAWGPQCQLCPTRGSQTCEDINECLELSNQCAF--RCHNISMSVSNSLACAL--TPVLTRKISMSVSN-SL 552 (1050)
Q Consensus 478 ~~~C~C~~Gy~g~~~~~~c~~~~~~~C~~~~eC~~~~~~C~~--~C~n~~~~~~~~~~C~C--G~~g~~~~C~~~~~-~~ 552 (1050)
-+.|.+.+-|.--+..+.+-......=....++. ..-.+++ .++.....-...+.|.+ ++.++...|....+ .|
T Consensus 638 yq~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~-lr~a~Sn~igpV~E~S~~~~~npCy~gsh~cdt~a~C~pg~~~~~ 716 (1289)
T KOG1214|consen 638 YQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERV-LRFAVSNQIGPVKEDSDPTPVNPCYDGSHMCDTTARCHPGTGVDY 716 (1289)
T ss_pred eEEeecCCCCCCCCCceEeecccceeccCccccc-hhhhhhhcccceecCCCCcccccceecCcccCCCccccCCCCcce
Confidence 6778888877632332222111101001111222 2223333 44433322224578888 88888888988754 68
Q ss_pred ccccCCCCcccccccccccCCcccCccccCCCCCCC-CCeeecCCCceEEecCCCeEeCCCCCccccCccccccCCCCCc
Q psy18237 553 ACALTPVLTRKVATPVAVINDCIDLDECRMMSYLCR-NGRCRNNIGSFFCECLQGYTLASEGQYCRDVDECKEVNKRESR 631 (1050)
Q Consensus 553 ~C~C~~G~~G~~C~~~~~~~~C~dideC~~~~~~C~-ng~C~n~~gsy~C~C~~Gy~~~~~G~~C~~~~eC~~~~~~~~~ 631 (1050)
+|.|..||.| +++.|.|++||+.....|. |..|+|.+|+|+|+|..||.+..++.+|..+-.=+
T Consensus 717 tcecs~g~~g-------dgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pa-------- 781 (1289)
T KOG1214|consen 717 TCECSSGYQG-------DGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPA-------- 781 (1289)
T ss_pred EEEEeeccCC-------CCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCC--------
Confidence 9999999987 4588999999999889997 78999999999999999999999999998542100
Q ss_pred cccCcccchhhhhcccCCcCCCCCccccccCCCCCCCcceeecCCCCccCCCCCCCCCCCCCCCccccCCCCCCCCCCCC
Q psy18237 632 CYLDTEEEEEEEEEEEGGYGGGSRRVTCTKEIAGSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYRPNSAT 711 (1050)
Q Consensus 632 C~~~~~~c~~~~c~~~g~~~~~~~~~~C~~~~~~~~~~~~C~C~~G~~~G~~C~~c~~~~~~~~~~~Cp~g~g~~~~~~~ 711 (1050)
--..|+.
T Consensus 782 ------------------------------------------------p~n~Ce~------------------------- 788 (1289)
T KOG1214|consen 782 ------------------------------------------------PANPCED------------------------- 788 (1289)
T ss_pred ------------------------------------------------CCCcccc-------------------------
Confidence 0001111
Q ss_pred CCCCCCC--CCEEeecC-CCeEEeCCCCcccCCCCCcccccccccCCCCCCCCCCCCCCCccCCCCCCCC-CCeEEeCCC
Q psy18237 712 REGICPS--PGKCQNVM-GSFICTCPPGYRLSPDKNSCQEDFAKLCPEGVGRGDKGEDLNECALMPSACQ-GGECINTDG 787 (1050)
Q Consensus 712 ~~~~C~~--~g~C~~~~-gsy~C~C~~Gy~g~~~~~~C~~~~~c~C~~g~~~~~~~~~ideC~~~~~~C~-~g~C~~~~g 787 (1050)
..+.|.- ++.|+... ++|.|.|.|||.|+ +..|. ++|||. |..|+ +++|.++++
T Consensus 789 g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD--G~~c~------------------dvDeC~--psrChp~A~Cyntpg 846 (1289)
T KOG1214|consen 789 GSHTCAIAGQARCVHHGGSTYSCACLPGFSGD--GHQCT------------------DVDECS--PSRCHPAATCYNTPG 846 (1289)
T ss_pred CccccCcCCceEEEecCCceEEEeecCCccCC--ccccc------------------cccccC--ccccCCCceEecCCC
Confidence 0123433 34566554 56999999999985 34444 789998 56687 579999999
Q ss_pred CeeeeCCCCceeCCCCCccccC----CccCCC---CCCC-CCC-e--EecCCCceEEecCCCCcCCCC
Q psy18237 788 SYRCECPAGYVKDETGKICIDD----NECLSI---PNIC-GNG-T--CTNLNGGFECTCSEGYAPGPL 844 (1050)
Q Consensus 788 sy~C~C~~Gy~~~~~g~~C~~~----~~C~~~---~~~C-~~g-~--C~~~~g~~~C~C~~Gy~g~~~ 844 (1050)
+|.|+|.+||.|| |..|+.. ..|... +..| .+. . |++ +.+|.+.|.++-.|+..
T Consensus 847 sfsC~C~pGy~GD--Gf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~D-p~~~e~p~~~~ppG~~~ 911 (1289)
T KOG1214|consen 847 SFSCRCQPGYYGD--GFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVD-PDGHEVPGTQTPPGSTP 911 (1289)
T ss_pred cceeecccCccCC--CceecCCCccCCccccccccceeeccccceeEeeC-CCcccCCCCCCCCCCCC
Confidence 9999999999998 8888643 344433 3445 222 1 233 45578888777766533
No 2
>KOG1214|consensus
Probab=99.63 E-value=1e-15 Score=175.73 Aligned_cols=157 Identities=32% Similarity=0.802 Sum_probs=116.1
Q ss_pred CCcccCCCCC-CceeecCCCcccCCCCCCccccCCCCCCCCCCCCeeccCCCCcccCCCCCccccccccCCCCCCCCCee
Q psy18237 86 NEYVTGDNVI-TRWCVCDEGFRGDGYSCEDIDECTDNTNYCDYILLCGSKPGEFMNPMTNKTEEIDECNLMPNMCNHGTC 164 (1050)
Q Consensus 86 n~~~c~~~~g-~~~C~C~~G~~g~~~~C~d~~eC~~~~~~C~~~~~C~n~~gs~~~~~~g~~~di~eC~~~~~~C~~~~C 164 (1050)
.++.|-+.+| .|+|.|..||.|+++.|.|++||++..+.|.. |+.|
T Consensus 704 t~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp---------------------------------~s~C 750 (1289)
T KOG1214|consen 704 TTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGP---------------------------------NSVC 750 (1289)
T ss_pred CCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCC---------------------------------Ccee
Confidence 3444554444 58999999999999999999999975444433 6788
Q ss_pred ecCCCCeEeeCCCCCeeCCCCCCCccCCccCCCCCCCCCCcccccCCCCccccCCCccccCCCceeecccccccCCCCCC
Q psy18237 165 MNTPGSFHCQCNRGFLYDSDTHQCIDINECEEMPEICGSGTYINECEEMPEICGSGTCENNIGSFSCRYINECEEMPEIC 244 (1050)
Q Consensus 165 ~~~~g~y~C~C~~Gy~~~~~g~~C~d~~eC~~~~~~C~~g~C~~~C~~g~~~c~~~~C~~~~g~~~c~~~~eC~~~~~~C 244 (1050)
+|.+|+|+|.|..||+-..++.+|..|-.=.. ++.|......|
T Consensus 751 in~pg~~rceC~~gy~F~dd~~tCV~i~~pap-------------------------------------~n~Ce~g~h~C 793 (1289)
T KOG1214|consen 751 INLPGSYRCECRSGYEFADDRHTCVLITPPAP-------------------------------------ANPCEDGSHTC 793 (1289)
T ss_pred ecCCCceeEEEeecceeccCCcceEEecCCCC-------------------------------------CCccccCcccc
Confidence 88999999999999998888889984421010 12222222334
Q ss_pred C---CCceecC-CCCeEEEcCCCCccCCCCCCCcCCCccccCCCCCCCCCCceecCCCCCCCCCcccCccCcCCCCCCCC
Q psy18237 245 G---SGTCENN-IGSFSCRCEDGYSVKPAEGPACTDENECTMRTHNCDDNADCINNPVNKTGTRCVDIDECATSIQRCGE 320 (1050)
Q Consensus 245 ~---~~~C~~~-~g~y~C~C~~Gy~g~~~~~~~C~~ideC~~~~~~C~~~~~C~n~~g~~~g~~C~dideC~~~~~~C~~ 320 (1050)
. ++.|+.. .++|+|.|.+||.|| |. .|.|+|||. +..|+.
T Consensus 794 ~i~g~a~c~~hGgs~y~C~CLPGfsGD---G~-------------------------------~c~dvDeC~--psrChp 837 (1289)
T KOG1214|consen 794 AIAGQARCVHHGGSTYSCACLPGFSGD---GH-------------------------------QCTDVDECS--PSRCHP 837 (1289)
T ss_pred CcCCceEEEecCCceEEEeecCCccCC---cc-------------------------------ccccccccC--ccccCC
Confidence 3 3567654 567999999999997 44 455555555 457988
Q ss_pred -CeEeecCCceEEeCCCCCeeCCCCCccccc
Q psy18237 321 -GFCVNDVGTYHCVCPDGYMLLPSGKECIDM 350 (1050)
Q Consensus 321 -~~C~n~~Gsy~C~C~~G~~g~~~~~~C~d~ 350 (1050)
|.|+|++|+|.|.|.+||.| ||++|++-
T Consensus 838 ~A~CyntpgsfsC~C~pGy~G--DGf~CVP~ 866 (1289)
T KOG1214|consen 838 AATCYNTPGSFSCRCQPGYYG--DGFQCVPD 866 (1289)
T ss_pred CceEecCCCcceeecccCccC--CCceecCC
Confidence 99999999999999999999 89998853
No 3
>KOG1217|consensus
Probab=99.60 E-value=5e-14 Score=168.90 Aligned_cols=245 Identities=44% Similarity=0.977 Sum_probs=171.1
Q ss_pred CCCCCCEEeecCCCeEEeCCCCcccCCCCC-----cccccccccCCCCCCCCCCCCCCCccCCCCCCCCCCeEEeCCCCe
Q psy18237 715 ICPSPGKCQNVMGSFICTCPPGYRLSPDKN-----SCQEDFAKLCPEGVGRGDKGEDLNECALMPSACQGGECINTDGSY 789 (1050)
Q Consensus 715 ~C~~~g~C~~~~gsy~C~C~~Gy~g~~~~~-----~C~~~~~c~C~~g~~~~~~~~~ideC~~~~~~C~~g~C~~~~gsy 789 (1050)
+|.+++.|.+..++|.|.|++||.+..... .|.+.+.+.++.++.+..+...+.+|... . ++|++..++|
T Consensus 178 ~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~----~-~~c~~~~~~~ 252 (487)
T KOG1217|consen 178 PCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASG----D-GTCVNTVGSY 252 (487)
T ss_pred CcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCC----C-CcccccCCce
Confidence 366667777777777788888877765433 24444456666666666677777777643 3 6899999999
Q ss_pred eeeCCCCceeCCCCCccccCCccCCCCCCC-CCCeEecCCCceEEecCCCCcCCCCCCccccccCCCCCCCccCccccC-
Q psy18237 790 RCECPAGYVKDETGKICIDDNECLSIPNIC-GNGTCTNLNGGFECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYE- 867 (1050)
Q Consensus 790 ~C~C~~Gy~~~~~g~~C~~~~~C~~~~~~C-~~g~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~C~~~~eC~~- 867 (1050)
.|.|++||.+.. ...+.++++|..... | ++++|++..+.|.|.|++||+|... ..+.+..+|..
T Consensus 253 ~C~~~~g~~~~~-~~~~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~~~------------~~~~~~~~C~~~ 318 (487)
T KOG1217|consen 253 TCRCPEGYTGDA-CVTCVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGRLC------------TECVDVDECSPR 318 (487)
T ss_pred eeeCCCCccccc-cceeeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCCCC------------cccccccccccc
Confidence 999999997552 135778899998643 8 7899999999999999999999722 12345566742
Q ss_pred -CCCCCCC-Ccc--ccCCCceeeecCCCceeCCCCCCcccC-CccCCCCCCCC-CCEEee-CCCCeEEecCCCceeCCCC
Q psy18237 868 -RPGICAN-GDC--ANFQGSFQCTCANGYTLNTARDSCVDI-DECARHPNICN-NGTCVN-AIGSFKCHCYAGFKLSHNN 940 (1050)
Q Consensus 868 -~~~~C~n-g~C--~~~~gsy~C~C~~Gy~g~~~~~~C~~i-deC~~~~~~C~-~g~C~n-~~g~y~C~C~~Gy~g~~~~ 940 (1050)
....|.+ +.| ....+.+.|.|.+||.| ..|+.. ++|...+ +. ++.|++ ..++|.|.|+.+|.+....
T Consensus 319 ~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g----~~C~~~~~~C~~~~--~~~~~~c~~~~~~~~~c~~~~~~~~~~~~ 392 (487)
T KOG1217|consen 319 NAGGPCANGGTCNTLGSFGGFRCACGPGFTG----RRCEDSNDECASSP--CCPGGTCVNETPGSYRCACPAGFAGKANG 392 (487)
T ss_pred ccCCcCCCCcccccCCCCCCCCcCCCCCCCC----CccccCCccccCCc--cccCCEeccCCCCCeEecCCCccccCCcc
Confidence 2344775 577 33455788999999776 456666 4888766 55 789999 7899999999999974110
Q ss_pred CCccccccCCCCCCCCCcCCCeeccCCCCCCccCCCCCCCCCCCCCeEeecCCceeeeCCCCCeeCCCCCcccccccCCc
Q psy18237 941 DCIGNCTDINECESPQACLYGNCTNTLGSNCTDINECESPQACLYGNCTNTLGSFSCTCPPNYQLTPAGNACVVLEDINE 1020 (1050)
Q Consensus 941 ~C~~~C~d~~eC~~~~~C~~g~C~~~~g~~C~di~eC~~~~~C~~g~C~~~~g~y~C~C~~Gy~~~~~g~~C~~~~dide 1020 (1050)
. +..+.++++|.. .+.|++..++|.|. ++ + ..... .| .++++
T Consensus 393 ~--------------------------~~~~~~~~~c~~-----~~~c~~~~~~~~c~-~~-~-~~~~~-~~---~~~~~ 434 (487)
T KOG1217|consen 393 D--------------------------GVGCEDIDECSG-----CGDCVNGPGGGACT-PP-G-LVSPG-TC---DDIDE 434 (487)
T ss_pred c--------------------------cccccccccccC-----CcceeccCCCCccc-cC-c-ccCCc-ce---ecccc
Confidence 0 112334556532 44678888999999 88 5 43334 66 67777
Q ss_pred cc
Q psy18237 1021 CE 1022 (1050)
Q Consensus 1021 C~ 1022 (1050)
+.
T Consensus 435 ~~ 436 (487)
T KOG1217|consen 435 CP 436 (487)
T ss_pred cc
Confidence 74
No 4
>KOG4289|consensus
Probab=99.54 E-value=1.1e-13 Score=165.92 Aligned_cols=90 Identities=30% Similarity=0.827 Sum_probs=68.2
Q ss_pred ecCCCCeEeeCCCCCeeCCCCCCCc-cCCccCCCCCCCCCCcccccCCCCccccCCCccccCCCceeecccccccCCCCC
Q psy18237 165 MNTPGSFHCQCNRGFLYDSDTHQCI-DINECEEMPEICGSGTYINECEEMPEICGSGTCENNIGSFSCRYINECEEMPEI 243 (1050)
Q Consensus 165 ~~~~g~y~C~C~~Gy~~~~~g~~C~-d~~eC~~~~~~C~~g~C~~~C~~g~~~c~~~~C~~~~g~~~c~~~~eC~~~~~~ 243 (1050)
++..++++|.|+|||+ |..|+ .||+|.+ .+|.|
T Consensus 1216 i~pvnglrCrCPpGFT----gd~CeTeiDlCYs--~pC~n---------------------------------------- 1249 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFT----GDYCETEIDLCYS--GPCGN---------------------------------------- 1249 (2531)
T ss_pred ccccCceeEeCCCCCC----cccccchhHhhhc--CCCCC----------------------------------------
Confidence 3567889999999999 55888 7888865 22332
Q ss_pred CCCCceecCCCCeEEEcCCCCccCCCCCCCcCCCccccCCCCCCCCCCceecCCCCCCCCCcccCccCcCCCCCCCC-Ce
Q psy18237 244 CGSGTCENNIGSFSCRCEDGYSVKPAEGPACTDENECTMRTHNCDDNADCINNPVNKTGTRCVDIDECATSIQRCGE-GF 322 (1050)
Q Consensus 244 C~~~~C~~~~g~y~C~C~~Gy~g~~~~~~~C~~ideC~~~~~~C~~~~~C~n~~g~~~g~~C~dideC~~~~~~C~~-~~ 322 (1050)
|+.|....|+|+|.|.+||+|.+++ ... .---|. ++.|.+ |+
T Consensus 1250 --ng~C~srEggYtCeCrpg~tGehCE-----------vs~----------------------~agrCv--pGvC~nggt 1292 (2531)
T KOG4289|consen 1250 --NGRCRSREGGYTCECRPGFTGEHCE-----------VSA----------------------RAGRCV--PGVCKNGGT 1292 (2531)
T ss_pred --CCceEEecCceeEEecCCcccccee-----------eec----------------------ccCccc--cceecCCCE
Confidence 5899999999999999999998311 100 111233 678999 99
Q ss_pred Eeec-CCceEEeCCCC
Q psy18237 323 CVND-VGTYHCVCPDG 337 (1050)
Q Consensus 323 C~n~-~Gsy~C~C~~G 337 (1050)
|+|. .|+|.|.||.|
T Consensus 1293 C~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1293 CVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred EeecCCCceeccCCCc
Confidence 9988 78999999998
No 5
>KOG1217|consensus
Probab=99.53 E-value=4.4e-13 Score=160.78 Aligned_cols=259 Identities=39% Similarity=0.867 Sum_probs=183.2
Q ss_pred CccccccCCCCcccccccccccCCcccCccccCCCCCCCC-CeeecCCCceEEecCCCeEeCCCCCccccCccccccCCC
Q psy18237 550 NSLACALTPVLTRKVATPVAVINDCIDLDECRMMSYLCRN-GRCRNNIGSFFCECLQGYTLASEGQYCRDVDECKEVNKR 628 (1050)
Q Consensus 550 ~~~~C~C~~G~~G~~C~~~~~~~~C~dideC~~~~~~C~n-g~C~n~~gsy~C~C~~Gy~~~~~G~~C~~~~eC~~~~~~ 628 (1050)
..+.|.|..||.+..+. .+.++|.....+|.+ +.|.+..++|.|.|++||+ +..|+..
T Consensus 150 ~~~~c~C~~g~~~~~~~--------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~----~~~~~~~--------- 208 (487)
T KOG1217|consen 150 GPFRCSCTEGYEGEPCE--------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYT----GSTCETT--------- 208 (487)
T ss_pred CceeeeeCCCccccccc--------ccccccccCCCCcCCCcccccCCCCeeEeCCCCcc----CCcCcCC---------
Confidence 45678888999886665 334789865677986 5999999999999999999 7666522
Q ss_pred CCccccCcccchhhhhcccCCcCCCCCccccccCCCCCCCcceeecCCCCccCCCCCCCCCCCCCCCccccCCCCCCCCC
Q psy18237 629 ESRCYLDTEEEEEEEEEEEGGYGGGSRRVTCTKEIAGSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYRPN 708 (1050)
Q Consensus 629 ~~~C~~~~~~c~~~~c~~~g~~~~~~~~~~C~~~~~~~~~~~~C~C~~G~~~G~~C~~c~~~~~~~~~~~Cp~g~g~~~~ 708 (1050)
.....|... ..|.+.+|+ .+..|+.-
T Consensus 209 -------------------------~~~~~c~~~-------~~~~~~~g~-~~~~c~~~--------------------- 234 (487)
T KOG1217|consen 209 -------------------------GNGGTCVDS-------VACSCPPGA-RGPECEVS--------------------- 234 (487)
T ss_pred -------------------------CCCceEecc-------eeccCCCCC-CCCCcccc---------------------
Confidence 011222211 345566666 35555531
Q ss_pred CCCCCCCCCCC-CEEeecCCCeEEeCCCCcccCCCCCcccccccccCCCCCCCCCCCCCCCccCCCCCCCCC-CeEEeCC
Q psy18237 709 SATREGICPSP-GKCQNVMGSFICTCPPGYRLSPDKNSCQEDFAKLCPEGVGRGDKGEDLNECALMPSACQG-GECINTD 786 (1050)
Q Consensus 709 ~~~~~~~C~~~-g~C~~~~gsy~C~C~~Gy~g~~~~~~C~~~~~c~C~~g~~~~~~~~~ideC~~~~~~C~~-g~C~~~~ 786 (1050)
...|..+ ++|++..++|+|.|++||.+... . ...++++|..... |.+ ++|++..
T Consensus 235 ----~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~-~------------------~~~~~~~C~~~~~-c~~~~~C~~~~ 290 (487)
T KOG1217|consen 235 ----IVECASGDGTCVNTVGSYTCRCPEGYTGDAC-V------------------TCVDVDSCALIAS-CPNGGTCVNVP 290 (487)
T ss_pred ----cccccCCCCcccccCCceeeeCCCCcccccc-c------------------eeeeccccCCCCc-cCCCCeeecCC
Confidence 1123322 89999999999999999998541 1 2237888987654 775 6899999
Q ss_pred CCeeeeCCCCceeCCCCCccccCCccCC--CCCCC-CCCeE--ecCCCceEEecCCCCcCCCCCCccccccCCCCCCCcc
Q psy18237 787 GSYRCECPAGYVKDETGKICIDDNECLS--IPNIC-GNGTC--TNLNGGFECTCSEGYAPGPLGSCAILLTLPPISPSTD 861 (1050)
Q Consensus 787 gsy~C~C~~Gy~~~~~g~~C~~~~~C~~--~~~~C-~~g~C--~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~C~~ 861 (1050)
++|.|.|++||.+... ..+.+..+|.. ...+| .++.| ....+.+.|.|.+||.|. .|+.
T Consensus 291 ~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~---------------~C~~ 354 (487)
T KOG1217|consen 291 GSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR---------------RCED 354 (487)
T ss_pred CcceeeCCCCCCCCCC-ccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC---------------cccc
Confidence 9999999999975432 23455567752 33668 56688 334457889999998776 4555
Q ss_pred C-ccccCCCCCCCC-Ccccc-CCCceeeecCCCceeC--CCCCCcccCCccCCCCCCCCCCEEeeCCCCeEEecCC
Q psy18237 862 I-DECYERPGICAN-GDCAN-FQGSFQCTCANGYTLN--TARDSCVDIDECARHPNICNNGTCVNAIGSFKCHCYA 932 (1050)
Q Consensus 862 ~-~eC~~~~~~C~n-g~C~~-~~gsy~C~C~~Gy~g~--~~~~~C~~ideC~~~~~~C~~g~C~n~~g~y~C~C~~ 932 (1050)
. ++|...+ +.+ +.|++ ..++|.|.|+.+|.+. .....+.++++|.. .+.|++..+++.|. ++
T Consensus 355 ~~~~C~~~~--~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~------~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 355 SNDECASSP--CCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG------CGDCVNGPGGGACT-PP 421 (487)
T ss_pred CCccccCCc--cccCCEeccCCCCCeEecCCCccccCCccccccccccccccC------CcceeccCCCCccc-cC
Confidence 5 4887654 544 88999 7899999999999974 34556778888865 25688889999999 87
No 6
>KOG0994|consensus
Probab=99.48 E-value=3.5e-13 Score=159.37 Aligned_cols=329 Identities=24% Similarity=0.510 Sum_probs=188.7
Q ss_pred CCceeeCCCCCcccccccceeE-eeCCCCccCCCCCCCCCCCCCCccccchhcccCCCccC-ccccc----cccCCCcce
Q psy18237 460 TNGRCVLPTGPALLMEVTRMDC-CCTMGMAWGPQCQLCPTRGSQTCEDINECLELSNQCAF-RCHNI----SMSVSNSLA 533 (1050)
Q Consensus 460 ~~g~C~~~~g~~~~~~~~~~~C-~C~~Gy~g~~~~~~c~~~~~~~C~~~~eC~~~~~~C~~-~C~n~----~~~~~~~~~ 533 (1050)
++|.|.-.|+ |.+.+| +|++|+.|... ..| .. .|... ..+...+..
T Consensus 780 ~GGqCqCkPn------VVGR~CdqCApGtyGFGP---------sGC-------------k~CdC~~~Gs~~~~Cd~~tGQ 831 (1758)
T KOG0994|consen 780 NGGQCQCKPN------VVGRRCDQCAPGTYGFGP---------SGC-------------KACDCNSIGSLDKYCDKITGQ 831 (1758)
T ss_pred CCceecccCc------cccccccccCCcccCcCC---------ccC-------------ccccccccccccccccccccc
Confidence 7888887777 567788 59999988543 222 11 11111 112335667
Q ss_pred eee--ccccCCccccccCCccccc-cCCCCcc-cccccccccCCcc---cCccccCCCCCCCCCeeecCCCceEE-ecCC
Q psy18237 534 CAL--TPVLTRKISMSVSNSLACA-LTPVLTR-KVATPVAVINDCI---DLDECRMMSYLCRNGRCRNNIGSFFC-ECLQ 605 (1050)
Q Consensus 534 C~C--G~~g~~~~C~~~~~~~~C~-C~~G~~G-~~C~~~~~~~~C~---dideC~~~~~~C~ng~C~n~~gsy~C-~C~~ 605 (1050)
|.| |-+|.. |. |.+|||| +.| +.|+ ..|+|......|. .|.+..+++.| +|..
T Consensus 832 C~C~~g~ygrq-----------CnqCqpG~WgFPeC------r~CqCNgHA~~Cd~~tGaCi--~CqD~T~G~~CdrCl~ 892 (1758)
T KOG0994|consen 832 CQCRPGTYGRQ-----------CNQCQPGYWGFPEC------RPCQCNGHADTCDPITGACI--DCQDSTTGHSCDRCLD 892 (1758)
T ss_pred eeeccccchhh-----------ccccCCCccCCCcC------ccccccCcccccCccccccc--cccccccccchhhhhc
Confidence 777 555554 64 8899998 334 3332 2455554333333 46677778889 7999
Q ss_pred CeEeCC---CCCccccCccccccCCCCCccccCcccchhhhhcccCCcCCC-CCccccccCCCCCCCcceeecCCCCccC
Q psy18237 606 GYTLAS---EGQYCRDVDECKEVNKRESRCYLDTEEEEEEEEEEEGGYGGG-SRRVTCTKEIAGSTTRSTCCCSIGKAWG 681 (1050)
Q Consensus 606 Gy~~~~---~G~~C~~~~eC~~~~~~~~~C~~~~~~c~~~~c~~~g~~~~~-~~~~~C~~~~~~~~~~~~C~C~~G~~~G 681 (1050)
||.+++ .|..|. ||+.|++-... +..-.|... .....-.|.|.+|| .|
T Consensus 893 GyyGdP~lg~g~~Cr-------------------------PCpCP~gp~Sg~~~A~sC~~d--~~t~~ivC~C~~GY-~G 944 (1758)
T KOG0994|consen 893 GYYGDPRLGSGIGCR-------------------------PCPCPDGPASGRQHADSCYLD--TRTQQIVCHCQEGY-SG 944 (1758)
T ss_pred cccCCcccCCCCCCC-------------------------CCCCCCCCccchhcccccccc--ccccceeeecccCc-cc
Confidence 999775 344443 22223322211 222234321 23345589999999 89
Q ss_pred CCCCCCCCCCCCCCccccCCCCCCCCCCCC----------------CCCCCCC-CC---EEeecCCCeEEe-CCCCcccC
Q psy18237 682 PQCEECPAVGSDEHKTLCPGGSGYRPNSAT----------------REGICPS-PG---KCQNVMGSFICT-CPPGYRLS 740 (1050)
Q Consensus 682 ~~C~~c~~~~~~~~~~~Cp~g~g~~~~~~~----------------~~~~C~~-~g---~C~~~~gsy~C~-C~~Gy~g~ 740 (1050)
..|+.|.+. |++++.. .+..|.. -| +|.....+.+|. |.+||.|+
T Consensus 945 ~RCe~CA~~--------------~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~Gd 1010 (1758)
T KOG0994|consen 945 SRCEICADN--------------HFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGD 1010 (1758)
T ss_pred cchhhhccc--------------ccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhH
Confidence 999987643 3333211 1222321 12 233333344674 99999998
Q ss_pred CCCCcccccccccCCCCCCCCCCCCCCCccCCCCCCCCCCeEEeCCCCeeee-CCCCceeCCCCCccccCCccCCCCCCC
Q psy18237 741 PDKNSCQEDFAKLCPEGVGRGDKGEDLNECALMPSACQGGECINTDGSYRCE-CPAGYVKDETGKICIDDNECLSIPNIC 819 (1050)
Q Consensus 741 ~~~~~C~~~~~c~C~~g~~~~~~~~~ideC~~~~~~C~~g~C~~~~gsy~C~-C~~Gy~~~~~g~~C~~~~~C~~~~~~C 819 (1050)
.-.+.|+ +|+|. +-|++ .+ -.|....+.| .|.+.+-+..|. |.+.+|--.+|.-|+.- .|...
T Consensus 1011 A~~q~Cq---rC~Cn--~LGTn--~~-~~CDr~tGQC---pClpNv~G~~CDqCA~N~w~laSG~GCe~C-~Cd~~---- 1074 (1758)
T KOG0994|consen 1011 ALRQNCQ---RCVCN--FLGTN--ST-CHCDRFTGQC---PCLPNVQGVRCDQCAENHWNLASGEGCEPC-NCDPI---- 1074 (1758)
T ss_pred HHHhhhh---hhecc--ccccC--Cc-cccccccCcC---CCCcccccccccccccchhccccCCCCCcc-CCCcc----
Confidence 8777776 45564 33333 11 3444444444 366666667784 88888866678777632 12211
Q ss_pred CCCeEecCCCceEEecCCCCcCCCCCCccccccCCCCCCCccCccccCC---CCCCCC--Cc--cccCCCceee-ecCCC
Q psy18237 820 GNGTCTNLNGGFECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYER---PGICAN--GD--CANFQGSFQC-TCANG 891 (1050)
Q Consensus 820 ~~g~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~C~~~~eC~~~---~~~C~n--g~--C~~~~gsy~C-~C~~G 891 (1050)
..-.|....| +|.|+|||-|..+..|+..||+.|+..|..- +|... ...|+. |. |+...++++| .|..|
T Consensus 1075 ~~pqCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~~C~aC-dCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCdqCaRg 1151 (1758)
T KOG0994|consen 1075 GGPQCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNEKCRAC-DCDPRGIETPQCDRATGRCVCRPGVGGPRCDQCARG 1151 (1758)
T ss_pred CCcccccccc--ceeccCCCCCcchhHHHHhhcCCCCCCceec-CCCCCCCCCCCccccCCceeecCCCCCcchhhhhhh
Confidence 1236766666 7999999999988889999999888655321 12211 112431 33 3455556666 56666
Q ss_pred ceeCC
Q psy18237 892 YTLNT 896 (1050)
Q Consensus 892 y~g~~ 896 (1050)
|.|..
T Consensus 1152 y~G~f 1156 (1758)
T KOG0994|consen 1152 YSGQF 1156 (1758)
T ss_pred hcCCC
Confidence 66643
No 7
>KOG4289|consensus
Probab=99.43 E-value=1.7e-12 Score=155.90 Aligned_cols=97 Identities=30% Similarity=0.615 Sum_probs=75.3
Q ss_pred CCccccccCCCCcccccccccccCCcccCccccCCCCCCC-CCeeecCCCceEEecCCCeEeCCCCCccc-cCccccccC
Q psy18237 549 SNSLACALTPVLTRKVATPVAVINDCIDLDECRMMSYLCR-NGRCRNNIGSFFCECLQGYTLASEGQYCR-DVDECKEVN 626 (1050)
Q Consensus 549 ~~~~~C~C~~G~~G~~C~~~~~~~~C~dideC~~~~~~C~-ng~C~n~~gsy~C~C~~Gy~~~~~G~~C~-~~~eC~~~~ 626 (1050)
.++++|.|++||+|..|+ +.||+|.. .+|. ||+|....|+|+|+|.+||+ |.+|+ +..
T Consensus 1219 vnglrCrCPpGFTgd~Ce--------TeiDlCYs--~pC~nng~C~srEggYtCeCrpg~t----GehCEvs~~------ 1278 (2531)
T KOG4289|consen 1219 VNGLRCRCPPGFTGDYCE--------TEIDLCYS--GPCGNNGRCRSREGGYTCECRPGFT----GEHCEVSAR------ 1278 (2531)
T ss_pred cCceeEeCCCCCCccccc--------chhHhhhc--CCCCCCCceEEecCceeEEecCCcc----ccceeeecc------
Confidence 678899999999999998 78999998 8998 58999999999999999999 99998 221
Q ss_pred CCCCccccCcccchhhhhcccCCcCCCCCccccccCCCCCCCcceeecCCCCccCCCCCC
Q psy18237 627 KRESRCYLDTEEEEEEEEEEEGGYGGGSRRVTCTKEIAGSTTRSTCCCSIGKAWGPQCEE 686 (1050)
Q Consensus 627 ~~~~~C~~~~~~c~~~~c~~~g~~~~~~~~~~C~~~~~~~~~~~~C~C~~G~~~G~~C~~ 686 (1050)
...|.+..| +++++|.+... ..+.|.|+.|...++.|+.
T Consensus 1279 ---------agrCvpGvC---------~nggtC~~~~n---ggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1279 ---------AGRCVPGVC---------KNGGTCVNLLN---GGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred ---------cCcccccee---------cCCCEEeecCC---CceeccCCCcccCCCceEE
Confidence 112222233 56677765432 4567889888656788875
No 8
>KOG0994|consensus
Probab=98.98 E-value=5.1e-09 Score=125.00 Aligned_cols=169 Identities=24% Similarity=0.535 Sum_probs=95.6
Q ss_pred CCCccCCCCCCCCCCeEEeCCCCeeee-CCCCceeCC---CCCccccCCccCCCCCCC-CCC-eEe--cCCCceEEecCC
Q psy18237 766 DLNECALMPSACQGGECINTDGSYRCE-CPAGYVKDE---TGKICIDDNECLSIPNIC-GNG-TCT--NLNGGFECTCSE 837 (1050)
Q Consensus 766 ~ideC~~~~~~C~~g~C~~~~gsy~C~-C~~Gy~~~~---~g~~C~~~~~C~~~~~~C-~~g-~C~--~~~g~~~C~C~~ 837 (1050)
..+.|....+.|. .|.+...++.|. |..||.|++ .|..|.. -+|...+..= .++ .|. +....-.|.|.+
T Consensus 864 HA~~Cd~~tGaCi--~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrP-CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~ 940 (1758)
T KOG0994|consen 864 HADTCDPITGACI--DCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRP-CPCPDGPASGRQHADSCYLDTRTQQIVCHCQE 940 (1758)
T ss_pred cccccCccccccc--cccccccccchhhhhccccCCcccCCCCCCCC-CCCCCCCccchhccccccccccccceeeeccc
Confidence 4567766555554 377777888995 999999884 4555542 1232221111 111 342 223346899999
Q ss_pred CCcCCCCCCccccccCCCC--CCCc------cCccccCCCCCCCC--Ccc---ccCCCceee-ecCCCceeCCCCCCccc
Q psy18237 838 GYAPGPLGSCAILLTLPPI--SPST------DIDECYERPGICAN--GDC---ANFQGSFQC-TCANGYTLNTARDSCVD 903 (1050)
Q Consensus 838 Gy~g~~~~~C~~~~~~~~~--~~C~------~~~eC~~~~~~C~n--g~C---~~~~gsy~C-~C~~Gy~g~~~~~~C~~ 903 (1050)
||+|..+..|..++++.|. ..|+ .||. ..++.|.. |.| ..-..+-+| .|.+||.|+.-...|..
T Consensus 941 GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~--~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~Cqr 1018 (1758)
T KOG0994|consen 941 GYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDL--YDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQR 1018 (1758)
T ss_pred CccccchhhhcccccCCcccCCccccccccCCcCc--cCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhh
Confidence 9999988888888888775 4442 1221 12344542 444 333334566 79999999876555541
Q ss_pred CCccCCCCCCCCCCEEeeCCCCeEEecCCCceeCCCCCCc
Q psy18237 904 IDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHNNDCI 943 (1050)
Q Consensus 904 ideC~~~~~~C~~g~C~n~~g~y~C~C~~Gy~g~~~~~C~ 943 (1050)
=.|..... =..+.|....| +|-|.+-..|..+..|.
T Consensus 1019 -C~Cn~LGT-n~~~~CDr~tG--QCpClpNv~G~~CDqCA 1054 (1758)
T KOG0994|consen 1019 -CVCNFLGT-NSTCHCDRFTG--QCPCLPNVQGVRCDQCA 1054 (1758)
T ss_pred -hecccccc-CCccccccccC--cCCCCcccccccccccc
Confidence 01110000 00123444444 78888888877655444
No 9
>KOG1219|consensus
Probab=98.96 E-value=9.5e-10 Score=137.36 Aligned_cols=110 Identities=36% Similarity=0.966 Sum_probs=93.6
Q ss_pred CCcccC-CccCCCCCCCC-CCEEeeC-CCCeEEecCCCceeCCCCCCccccccCCCCCCCCCcCCCeeccCCCCCCccCC
Q psy18237 899 DSCVDI-DECARHPNICN-NGTCVNA-IGSFKCHCYAGFKLSHNNDCIGNCTDINECESPQACLYGNCTNTLGSNCTDIN 975 (1050)
Q Consensus 899 ~~C~~i-deC~~~~~~C~-~g~C~n~-~g~y~C~C~~Gy~g~~~~~C~~~C~d~~eC~~~~~C~~g~C~~~~g~~C~di~ 975 (1050)
..|... +.|..+| |+ +|+|... .|+|+|.|++-|+|.. |+ .++.
T Consensus 3858 pgC~l~~d~C~~np--CqhgG~C~~~~~ggy~CkCpsqysG~~---CE----------------------------i~~e 3904 (4289)
T KOG1219|consen 3858 PGCSLLTDPCNDNP--CQHGGTCISQPKGGYKCKCPSQYSGNH---CE----------------------------IDLE 3904 (4289)
T ss_pred ccccccccccccCc--ccCCCEecCCCCCceEEeCcccccCcc---cc----------------------------cccc
Confidence 345533 8999988 88 7899977 5779999999999987 66 3456
Q ss_pred CCCCCCCC-CCCeEeecCCceeeeCCCCCeeCCCCCccccccc-CCcccCCCCCCC-CCeEEecCCCeEeecCCCCcC
Q psy18237 976 ECESPQAC-LYGNCTNTLGSFSCTCPPNYQLTPAGNACVVLED-INECEEHDNICE-NGHCTNTFGSFMCSCQDGFKL 1050 (1050)
Q Consensus 976 eC~~~~~C-~~g~C~~~~g~y~C~C~~Gy~~~~~g~~C~~~~d-ideC~~~~~~C~-~g~C~n~~gsy~C~C~~Gy~g 1050 (1050)
.| .++|| .+|+|+...++|.|.|+.||+ |++|+ .+ |+||+. ++|. .|+|+|+.|+|.|-|.+||.|
T Consensus 3905 pC-~snPC~~GgtCip~~n~f~CnC~~gyT----G~~Ce--~~Gi~eCs~--n~C~~gg~C~n~~gsf~CncT~g~~g 3973 (4289)
T KOG1219|consen 3905 PC-ASNPCLTGGTCIPFYNGFLCNCPNGYT----GKRCE--ARGISECSK--NVCGTGGQCINIPGSFHCNCTPGILG 3973 (4289)
T ss_pred cc-cCCCCCCCCEEEecCCCeeEeCCCCcc----Cceee--ccccccccc--ccccCCceeeccCCceEeccChhHhc
Confidence 67 57889 778999999999999999999 88996 45 999986 6899 789999999999999999864
No 10
>KOG4260|consensus
Probab=98.96 E-value=7.3e-10 Score=113.91 Aligned_cols=166 Identities=30% Similarity=0.654 Sum_probs=102.2
Q ss_pred CCCCCCcCcccccc------cCCCCCCcccCCC--CCCceeecCCCcccCCCCCCccccCCCCCCCCCCCCeeccCCCCc
Q psy18237 67 TAWLPTFTNVTTLT------NVTSTNEYVTGDN--VITRWCVCDEGFRGDGYSCEDIDECTDNTNYCDYILLCGSKPGEF 138 (1050)
Q Consensus 67 ~~c~~g~~g~~c~~------~~~~~n~~~c~~~--~g~~~C~C~~G~~g~~~~C~d~~eC~~~~~~C~~~~~C~n~~gs~ 138 (1050)
.=|+.|.+|++|.. .+|..||.+.++. .|+..|.|..||+|. .|.+ |. . +.|
T Consensus 130 vCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp--~C~~---Cg-------~--------eyf 189 (350)
T KOG4260|consen 130 VCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGP--LCRY---CG-------I--------EYF 189 (350)
T ss_pred eccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCc--cccc---cc-------h--------HHH
Confidence 34899999999995 4688999988875 588999999999997 4531 21 1 111
Q ss_pred ccCCCCCccccccccCCCCCCCCCeeecCCCCeEe-eCCCCCeeCCCCCCCccCCccCCCCCCCCCCcccccCCCCcccc
Q psy18237 139 MNPMTNKTEEIDECNLMPNMCNHGTCMNTPGSFHC-QCNRGFLYDSDTHQCIDINECEEMPEICGSGTYINECEEMPEIC 217 (1050)
Q Consensus 139 ~~~~~g~~~di~eC~~~~~~C~~~~C~~~~g~y~C-~C~~Gy~~~~~g~~C~d~~eC~~~~~~C~~g~C~~~C~~g~~~c 217 (1050)
....+ +..-.|..=...|. ++|.- .++-.| .|+.||.++ ..-|.|||||..
T Consensus 190 es~Rn---e~~lvCt~Ch~~C~-~~Csg-~~~k~C~kCkkGW~ld--e~gCvDvnEC~~--------------------- 241 (350)
T KOG4260|consen 190 ESSRN---EQHLVCTACHEGCL-GVCSG-ESSKGCSKCKKGWKLD--EEGCVDVNECQN--------------------- 241 (350)
T ss_pred Hhhcc---cccchhhhhhhhhh-cccCC-CCCCChhhhcccceec--ccccccHHHHhc---------------------
Confidence 10000 00000110000010 12321 112234 477888765 345666666654
Q ss_pred CCCccccCCCceeecccccccCCCCCCC-CCceecCCCCeEEEcCCCCccCCCCCCCcCCCccccCCCCCCCCCCceecC
Q psy18237 218 GSGTCENNIGSFSCRYINECEEMPEICG-SGTCENNIGSFSCRCEDGYSVKPAEGPACTDENECTMRTHNCDDNADCINN 296 (1050)
Q Consensus 218 ~~~~C~~~~g~~~c~~~~eC~~~~~~C~-~~~C~~~~g~y~C~C~~Gy~g~~~~~~~C~~ideC~~~~~~C~~~~~C~n~ 296 (1050)
.+..|. +-.|+|+.|||+|.+++||.+. +|||.
T Consensus 242 ----------------------ep~~c~~~qfCvNteGSf~C~dk~Gy~~g---------~d~C~--------------- 275 (350)
T KOG4260|consen 242 ----------------------EPAPCKAHQFCVNTEGSFKCEDKEGYKKG---------VDECQ--------------- 275 (350)
T ss_pred ----------------------CCCCCChhheeecCCCceEecccccccCC---------hHHhh---------------
Confidence 345565 4689999999999999999863 55664
Q ss_pred CCCCCCCCcccCccCcCCCCCCCCCeEeecCCceEEeCCCCCee
Q psy18237 297 PVNKTGTRCVDIDECATSIQRCGEGFCVNDVGTYHCVCPDGYML 340 (1050)
Q Consensus 297 ~g~~~g~~C~dideC~~~~~~C~~~~C~n~~Gsy~C~C~~G~~g 340 (1050)
.|+|+ |... +..|.|++|+|+|+|..|+.-
T Consensus 276 -------~~~d~--~~~k-----n~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 276 -------FCADV--CASK-----NRPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred -------hhhhh--cccC-----CCCcccCCccEEEEeccccee
Confidence 23332 2221 268899999999999999864
No 11
>KOG1219|consensus
Probab=98.96 E-value=1.1e-09 Score=136.85 Aligned_cols=107 Identities=37% Similarity=0.991 Sum_probs=92.9
Q ss_pred ccccCCCCCCCC-CccccCC-CceeeecCCCceeCCCCCCcccCCccCCCCCCCC-CCEEeeCCCCeEEecCCCceeCCC
Q psy18237 863 DECYERPGICAN-GDCANFQ-GSFQCTCANGYTLNTARDSCVDIDECARHPNICN-NGTCVNAIGSFKCHCYAGFKLSHN 939 (1050)
Q Consensus 863 ~eC~~~~~~C~n-g~C~~~~-gsy~C~C~~Gy~g~~~~~~C~~ideC~~~~~~C~-~g~C~n~~g~y~C~C~~Gy~g~~~ 939 (1050)
+.|..+| |+| |+|...+ ++|+|.|++-|.|..++ .++.+|..+| |. +|+|+...++|.|.|+.||+|..
T Consensus 3865 d~C~~np--CqhgG~C~~~~~ggy~CkCpsqysG~~CE---i~~epC~snP--C~~GgtCip~~n~f~CnC~~gyTG~~- 3936 (4289)
T KOG1219|consen 3865 DPCNDNP--CQHGGTCISQPKGGYKCKCPSQYSGNHCE---IDLEPCASNP--CLTGGTCIPFYNGFLCNCPNGYTGKR- 3936 (4289)
T ss_pred cccccCc--ccCCCEecCCCCCceEEeCcccccCcccc---cccccccCCC--CCCCCEEEecCCCeeEeCCCCccCce-
Confidence 7888765 998 9999874 67999999999996655 4899999998 98 89999999999999999999987
Q ss_pred CCCccccccCCCCCCCCCcCCCeeccCCCCCCcc-CCCCCCCCCC-CCCeEeecCCceeeeCCCCCeeCCCCCcc
Q psy18237 940 NDCIGNCTDINECESPQACLYGNCTNTLGSNCTD-INECESPQAC-LYGNCTNTLGSFSCTCPPNYQLTPAGNAC 1012 (1050)
Q Consensus 940 ~~C~~~C~d~~eC~~~~~C~~g~C~~~~g~~C~d-i~eC~~~~~C-~~g~C~~~~g~y~C~C~~Gy~~~~~g~~C 1012 (1050)
|+ .+ |+|| +.++| .+|.|+|.+|+|+|.|.+||. |++|
T Consensus 3937 --Ce----------------------------~~Gi~eC-s~n~C~~gg~C~n~~gsf~CncT~g~~----gr~c 3976 (4289)
T KOG1219|consen 3937 --CE----------------------------ARGISEC-SKNVCGTGGQCINIPGSFHCNCTPGIL----GRTC 3976 (4289)
T ss_pred --ee----------------------------ccccccc-ccccccCCceeeccCCceEeccChhHh----cccC
Confidence 55 12 6788 67889 778999999999999999999 6777
No 12
>KOG4260|consensus
Probab=98.85 E-value=2.6e-09 Score=109.87 Aligned_cols=167 Identities=34% Similarity=0.765 Sum_probs=112.2
Q ss_pred cCCCCccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCCCCCEEeec---CCCeEEeCCCCcccCCCCCcccccc
Q psy18237 674 CSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYRPNSATREGICPSPGKCQNV---MGSFICTCPPGYRLSPDKNSCQEDF 750 (1050)
Q Consensus 674 C~~G~~~G~~C~~c~~~~~~~~~~~Cp~g~g~~~~~~~~~~~C~~~g~C~~~---~gsy~C~C~~Gy~g~~~~~~C~~~~ 750 (1050)
|++|. +|+.|..||-.. ..+|..+|.|.-. .|+..|.|.+||.|..+ +.
T Consensus 132 Cp~gt-yGpdCl~Cpggs---------------------er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C-~~----- 183 (350)
T KOG4260|consen 132 CPDGT-YGPDCLQCPGGS---------------------ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLC-RY----- 183 (350)
T ss_pred cCCCC-cCCccccCCCCC---------------------cCCcCCCCcccCCCCCCCCCcccccCCCCCccc-cc-----
Confidence 77776 899999877443 3468888888732 47788999999999643 12
Q ss_pred cccCCCCCCCCCCCCCCCccCCCCCCCCCCeEEeCCCCeee-eCCCCceeCCCCCccccCCccCCCCCCC-CCCeEecCC
Q psy18237 751 AKLCPEGVGRGDKGEDLNECALMPSACQGGECINTDGSYRC-ECPAGYVKDETGKICIDDNECLSIPNIC-GNGTCTNLN 828 (1050)
Q Consensus 751 ~c~C~~g~~~~~~~~~ideC~~~~~~C~~g~C~~~~gsy~C-~C~~Gy~~~~~g~~C~~~~~C~~~~~~C-~~g~C~~~~ 828 (1050)
|..+|+-..-.+..-.|..-...|. +.|.. .++-.| .|..||.++ ...|.||++|...+.+| .+..|+|+.
T Consensus 184 ---Cg~eyfes~Rne~~lvCt~Ch~~C~-~~Csg-~~~k~C~kCkkGW~ld--e~gCvDvnEC~~ep~~c~~~qfCvNte 256 (350)
T KOG4260|consen 184 ---CGIEYFESSRNEQHLVCTACHEGCL-GVCSG-ESSKGCSKCKKGWKLD--EEGCVDVNECQNEPAPCKAHQFCVNTE 256 (350)
T ss_pred ---cchHHHHhhcccccchhhhhhhhhh-cccCC-CCCCChhhhcccceec--ccccccHHHHhcCCCCCChhheeecCC
Confidence 2223322111111112221011121 13432 233346 499999988 66799999999998999 788999999
Q ss_pred CceEEecCCCCcCCCCCCccccccCCCCCCCccCccccCCCCCCC--CCccccCCCceeeecCCCcee
Q psy18237 829 GGFECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYERPGICA--NGDCANFQGSFQCTCANGYTL 894 (1050)
Q Consensus 829 g~~~C~C~~Gy~g~~~~~C~~~~~~~~~~~C~~~~eC~~~~~~C~--ng~C~~~~gsy~C~C~~Gy~g 894 (1050)
|+|.|.+++||.+. +|+|..-...|. +..|.++.++|+|+|..|+.-
T Consensus 257 GSf~C~dk~Gy~~g-------------------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 257 GSFKCEDKEGYKKG-------------------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred CceEecccccccCC-------------------hHHhhhhhhhcccCCCCcccCCccEEEEeccccee
Confidence 99999999999863 345543222343 478999999999999999864
No 13
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n' is a negatively charged or polar residue [DEQN], 'b' a possibly beta-hydroxylated residue [DN], 'a' an aromatic amino acid, 'C' a cysteine involved in a disulphide bond, and 'x' any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.50 E-value=1e-07 Score=73.08 Aligned_cols=40 Identities=53% Similarity=1.076 Sum_probs=34.1
Q ss_pred cCccCcCCCCCCCC-CeEeecCCceEEeCCCCCeeCCCCCc
Q psy18237 307 DIDECATSIQRCGE-GFCVNDVGTYHCVCPDGYMLLPSGKE 346 (1050)
Q Consensus 307 dideC~~~~~~C~~-~~C~n~~Gsy~C~C~~G~~g~~~~~~ 346 (1050)
|||||+..++.|.. ++|+|+.|||+|.|++||+....+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~ 41 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTT 41 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCc
Confidence 78999988889986 99999999999999999996444433
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n' is a negatively charged or polar residue [DEQN], 'b' a possibly beta-hydroxylated residue [DN], 'a' an aromatic amino acid, 'C' a cysteine involved in a disulphide bond, and 'x' any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.37 E-value=1.8e-07 Score=71.73 Aligned_cols=39 Identities=44% Similarity=0.993 Sum_probs=32.9
Q ss_pred ccccccCCCCCCC-CCeeecCCCCeEeeCCCCCeeCCCCC
Q psy18237 148 EIDECNLMPNMCN-HGTCMNTPGSFHCQCNRGFLYDSDTH 186 (1050)
Q Consensus 148 di~eC~~~~~~C~-~~~C~~~~g~y~C~C~~Gy~~~~~g~ 186 (1050)
|||||+...+.|. +++|+|+.|+|+|.|++||+....+.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~ 40 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGT 40 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSS
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCC
Confidence 6888887778897 79999999999999999999554443
No 15
>KOG1225|consensus
Probab=98.15 E-value=8.2e-06 Score=95.34 Aligned_cols=73 Identities=30% Similarity=0.740 Sum_probs=46.6
Q ss_pred eEEecCCCCcCCCCCCccccccCCCCCCCccCccccCCCCCCCC-CccccCCCceeeecCCCceeCCCCCCcccCCccCC
Q psy18237 831 FECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYERPGICAN-GDCANFQGSFQCTCANGYTLNTARDSCVDIDECAR 909 (1050)
Q Consensus 831 ~~C~C~~Gy~g~~~~~C~~~~~~~~~~~C~~~~eC~~~~~~C~n-g~C~~~~gsy~C~C~~Gy~g~~~~~~C~~ideC~~ 909 (1050)
++|.|..+|.|.. |. ...|. +.|.+ +.|++. +|+|++||+|..+.. -.|..
T Consensus 234 ~ic~c~~~~~g~~---------------c~-~~~C~---~~c~~~g~c~~G----~CIC~~Gf~G~dC~e-----~~Cp~ 285 (525)
T KOG1225|consen 234 GICECPEGYFGPL---------------CS-TIYCP---GGCTGRGQCVEG----RCICPPGFTGDDCDE-----LVCPV 285 (525)
T ss_pred ceeecCCceeCCc---------------cc-cccCC---CCCcccceEeCC----eEeCCCCCcCCCCCc-----ccCCc
Confidence 4789999998862 22 12232 23443 667665 899999999876542 22322
Q ss_pred CCCCCC-CCEEeeCCCCeEEecCCCceeCC
Q psy18237 910 HPNICN-NGTCVNAIGSFKCHCYAGFKLSH 938 (1050)
Q Consensus 910 ~~~~C~-~g~C~n~~g~y~C~C~~Gy~g~~ 938 (1050)
.|+ ++.+++. +|+|++||+|..
T Consensus 286 ---~cs~~g~~~~g----~CiC~~g~~G~d 308 (525)
T KOG1225|consen 286 ---DCSGGGVCVDG----ECICNPGYSGKD 308 (525)
T ss_pred ---ccCCCceecCC----EeecCCCccccc
Confidence 255 4556544 799999999886
No 16
>KOG1225|consensus
Probab=98.08 E-value=2.1e-05 Score=91.95 Aligned_cols=98 Identities=38% Similarity=1.013 Sum_probs=70.8
Q ss_pred cceeecCCCCccCCCCCC--CCCCCCCCCccccCCCCCCCCCCCCCCCCCCCCCEEeecCCCeEEeCCCCcccCCCCCcc
Q psy18237 669 RSTCCCSIGKAWGPQCEE--CPAVGSDEHKTLCPGGSGYRPNSATREGICPSPGKCQNVMGSFICTCPPGYRLSPDKNSC 746 (1050)
Q Consensus 669 ~~~C~C~~G~~~G~~C~~--c~~~~~~~~~~~Cp~g~g~~~~~~~~~~~C~~~g~C~~~~gsy~C~C~~Gy~g~~~~~~C 746 (1050)
..+|+|++|| .|..|+. || ..|+.++.+++. .|+|++||.|..+
T Consensus 264 ~G~CIC~~Gf-~G~dC~e~~Cp-------------------------~~cs~~g~~~~g----~CiC~~g~~G~dC---- 309 (525)
T KOG1225|consen 264 EGRCICPPGF-TGDDCDELVCP-------------------------VDCSGGGVCVDG----ECICNPGYSGKDC---- 309 (525)
T ss_pred CCeEeCCCCC-cCCCCCcccCC-------------------------cccCCCceecCC----EeecCCCcccccc----
Confidence 4589999999 8999986 43 337777877754 6999999998532
Q ss_pred cccccccCCCCCCCCCCCCCCCccCCCCCCCC-CCeEEeCCCCeeeeCCCCceeCCCCCccccCCccCCCCCCCCCCeEe
Q psy18237 747 QEDFAKLCPEGVGRGDKGEDLNECALMPSACQ-GGECINTDGSYRCECPAGYVKDETGKICIDDNECLSIPNICGNGTCT 825 (1050)
Q Consensus 747 ~~~~~c~C~~g~~~~~~~~~ideC~~~~~~C~-~g~C~~~~gsy~C~C~~Gy~~~~~g~~C~~~~~C~~~~~~C~~g~C~ 825 (1050)
.+.+| +..|. +|.|++. +|.|.+||+ |..|.... |. +++.|+
T Consensus 310 -------------------s~~~c---padC~g~G~Ci~G----~C~C~~Gy~----G~~C~~~~-C~------~~g~cv 352 (525)
T KOG1225|consen 310 -------------------SIRRC---PADCSGHGKCIDG----ECLCDEGYT----GELCIQRA-CS------GGGQCV 352 (525)
T ss_pred -------------------ccccC---CccCCCCCcccCC----ceEeCCCCc----CCcccccc-cC------CCceec
Confidence 22334 35577 5789843 499999995 88886442 33 456776
Q ss_pred cCCCceEEecCCCCcCC
Q psy18237 826 NLNGGFECTCSEGYAPG 842 (1050)
Q Consensus 826 ~~~g~~~C~C~~Gy~g~ 842 (1050)
+. |.|..||.|.
T Consensus 353 ~g-----C~C~~Gw~G~ 364 (525)
T KOG1225|consen 353 NG-----CKCKKGWRGP 364 (525)
T ss_pred cC-----ceeccCccCC
Confidence 53 8999999997
No 17
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.85 E-value=1.2e-05 Score=52.79 Aligned_cols=24 Identities=38% Similarity=0.931 Sum_probs=22.8
Q ss_pred CeEeeCCCCCeeCCCCCCCccCCc
Q psy18237 170 SFHCQCNRGFLYDSDTHQCIDINE 193 (1050)
Q Consensus 170 ~y~C~C~~Gy~~~~~g~~C~d~~e 193 (1050)
||+|.|++||++.+++.+|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 699999999999999999999997
No 18
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.68 E-value=2e-05 Score=57.85 Aligned_cols=35 Identities=34% Similarity=0.789 Sum_probs=26.3
Q ss_pred CcCCCCCCCC-CeEeecCCceEEeCCCCCeeCCCCCcc
Q psy18237 311 CATSIQRCGE-GFCVNDVGTYHCVCPDGYMLLPSGKEC 347 (1050)
Q Consensus 311 C~~~~~~C~~-~~C~n~~Gsy~C~C~~G~~g~~~~~~C 347 (1050)
|+.+++.|+. |+|+++.++|+|+|++||+| +|..|
T Consensus 1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~G--dG~~C 36 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEG--DGFFC 36 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TTSEEEEE-CEEEC--CSTCE
T ss_pred CCCCCCCCCCCcEeecCCCCEEeECCCCCcc--CCcCC
Confidence 4455678888 99999999999999999999 56543
No 19
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.58 E-value=3e-05 Score=56.92 Aligned_cols=31 Identities=39% Similarity=0.856 Sum_probs=24.2
Q ss_pred ccCCCCCCC-CCceecCCCCeEEEcCCCCccC
Q psy18237 237 CEEMPEICG-SGTCENNIGSFSCRCEDGYSVK 267 (1050)
Q Consensus 237 C~~~~~~C~-~~~C~~~~g~y~C~C~~Gy~g~ 267 (1050)
|+.++..|+ ||+|+++.++|+|+|++||+|+
T Consensus 1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCCCCCCcEeecCCCCEEeECCCCCccC
Confidence 345567787 7999999999999999999998
No 20
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.52 E-value=6.7e-05 Score=49.31 Aligned_cols=24 Identities=63% Similarity=1.522 Sum_probs=22.6
Q ss_pred ceeeeCCCCCeeCCCCCcccccccCCc
Q psy18237 994 SFSCTCPPNYQLTPAGNACVVLEDINE 1020 (1050)
Q Consensus 994 ~y~C~C~~Gy~~~~~g~~C~~~~dide 1020 (1050)
||+|.|++||++..++++| +||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C---~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSC---EDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCcc---ccCCC
Confidence 6999999999999999999 89997
No 21
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.52 E-value=7.5e-05 Score=54.71 Aligned_cols=34 Identities=41% Similarity=1.060 Sum_probs=27.1
Q ss_pred CCCCCCCCCeEeecCCceEEeCCCCCeeCCCCCcc
Q psy18237 313 TSIQRCGEGFCVNDVGTYHCVCPDGYMLLPSGKEC 347 (1050)
Q Consensus 313 ~~~~~C~~~~C~n~~Gsy~C~C~~G~~g~~~~~~C 347 (1050)
.+++.|++ .|++++|+|+|.|++||++.+|+++|
T Consensus 3 ~~NGgC~h-~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 3 VNNGGCSH-ICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp TGGGGSSS-EEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCCCcCC-CCccCCCceEeECCCCCEECcCCCCC
Confidence 34566774 99999999999999999999888775
No 22
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.50 E-value=0.00014 Score=72.32 Aligned_cols=144 Identities=24% Similarity=0.644 Sum_probs=91.0
Q ss_pred CCCCCCeEEeCCCCeeeeCCCCceeCCCCCccccCCccCC---CCCCC-CCCeEecCC-----CceEEecCCCCcCCCCC
Q psy18237 775 SACQGGECINTDGSYRCECPAGYVKDETGKICIDDNECLS---IPNIC-GNGTCTNLN-----GGFECTCSEGYAPGPLG 845 (1050)
Q Consensus 775 ~~C~~g~C~~~~gsy~C~C~~Gy~~~~~g~~C~~~~~C~~---~~~~C-~~g~C~~~~-----g~~~C~C~~Gy~g~~~~ 845 (1050)
..|.||.-+...+.|.|.|.+||.+. +...|+...+|.. ...+| .-+.|++.+ ..|.|.|.+||....
T Consensus 6 T~CKNG~LiQMSNHfEC~Cnegfvl~-~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~-- 82 (197)
T PF06247_consen 6 TICKNGYLIQMSNHFECKCNEGFVLK-NENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ-- 82 (197)
T ss_dssp ---BTEEEEEESSEEEEEESTTEEEE-ETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS--
T ss_pred ccccCCEEEEccCceEEEcCCCcEEc-cccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC--
Confidence 34788888888889999999999877 5778987777764 23578 779998765 479999999998752
Q ss_pred CccccccCCCCCCCccCccccCCCCCCCCCccccC---CCceeeecCCCceeCCCCCCcc--cCCccCCCCCCCC-CCEE
Q psy18237 846 SCAILLTLPPISPSTDIDECYERPGICANGDCANF---QGSFQCTCANGYTLNTARDSCV--DIDECARHPNICN-NGTC 919 (1050)
Q Consensus 846 ~C~~~~~~~~~~~C~~~~eC~~~~~~C~ng~C~~~---~gsy~C~C~~Gy~g~~~~~~C~--~ideC~~~~~~C~-~g~C 919 (1050)
..|. .++|.. ..|.+|.|+-. +....|.|.-|+. ......|. --.+|+.. |. +-.|
T Consensus 83 -----------~vCv-p~~C~~--~~Cg~GKCI~d~~~~~~~~CSC~IGkV-~~dn~kCtk~G~T~C~LK---Ck~nE~C 144 (197)
T PF06247_consen 83 -----------GVCV-PNKCNN--KDCGSGKCILDPDNPNNPTCSCNIGKV-PDDNKKCTKTGETKCSLK---CKENEEC 144 (197)
T ss_dssp -----------SSEE-EGGGSS-----TTEEEEEEEGGGSEEEEEE-TEEE-TTTTTESEEEE-----------TTTEEE
T ss_pred -----------CeEc-hhhcCc--eecCCCeEEecCCCCCCceeEeeeceE-eccCCcccCCCccceeee---cCCCcce
Confidence 1221 135543 23777888743 3346999999999 33445665 22355542 66 7799
Q ss_pred eeCCCCeEEecCCCceeCCC
Q psy18237 920 VNAIGSFKCHCYAGFKLSHN 939 (1050)
Q Consensus 920 ~n~~g~y~C~C~~Gy~g~~~ 939 (1050)
..+.+-|+|.+..||.++..
T Consensus 145 K~~~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 145 KLVDGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp EEETTEEEEEE-TT-EEETT
T ss_pred eeeCcEEEeecCCCCCCCCC
Confidence 99999999999999998763
No 23
>KOG1836|consensus
Probab=97.49 E-value=0.002 Score=85.40 Aligned_cols=240 Identities=24% Similarity=0.549 Sum_probs=124.9
Q ss_pred CCCCcceeecCCCCccCCCCCCCCCCCCCCCccccCCCCCCCCCC------CCCCCCCCCCCEEeec--CCCeEEe-CCC
Q psy18237 665 GSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYRPNS------ATREGICPSPGKCQNV--MGSFICT-CPP 735 (1050)
Q Consensus 665 ~~~~~~~C~C~~G~~~G~~C~~c~~~~~~~~~~~Cp~g~g~~~~~------~~~~~~C~~~g~C~~~--~gsy~C~-C~~ 735 (1050)
.+..++.|.|..-. .|..|+.|. .||++++ .+.+.+|.+++.|... .....|. |++
T Consensus 740 Cd~~tG~C~C~~~t-~G~~C~~C~--------------~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~ 804 (1705)
T KOG1836|consen 740 CDPRTGQCKCKHNT-FGGQCAQCV--------------DGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPP 804 (1705)
T ss_pred ccCCCCceecccCC-CCCchhhhc--------------CCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCC
Confidence 34455667766654 678887765 3444432 1556778888888755 3567899 999
Q ss_pred CcccCCCCCcccccccccCCCCCCCCCC--CCCCCccCCC----------CCCCCC--Ce---EEeCCCCeeee-CCCCc
Q psy18237 736 GYRLSPDKNSCQEDFAKLCPEGVGRGDK--GEDLNECALM----------PSACQG--GE---CINTDGSYRCE-CPAGY 797 (1050)
Q Consensus 736 Gy~g~~~~~~C~~~~~c~C~~g~~~~~~--~~~ideC~~~----------~~~C~~--g~---C~~~~gsy~C~-C~~Gy 797 (1050)
||+|.. |+ .|..||.|..- ..+.-.|..- .+.|.- +. |+....+.+|. |.+||
T Consensus 805 gytG~r----Ce-----~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~ 875 (1705)
T KOG1836|consen 805 GYTGLR----CE-----ECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGY 875 (1705)
T ss_pred CCcccc----cc-----cCCCccccCCCCCCCCcccCccceeccccCccccccccccccceeeccCCcccccccccccCc
Confidence 999954 43 45566665211 1222223210 122331 22 44444455674 99999
Q ss_pred eeCCCCCccccCCccCCCCCCC-------CCCeEecCCCceEEecCCCCcCCCCCCccccccCCCC-CCCccCccccCCC
Q psy18237 798 VKDETGKICIDDNECLSIPNIC-------GNGTCTNLNGGFECTCSEGYAPGPLGSCAILLTLPPI-SPSTDIDECYERP 869 (1050)
Q Consensus 798 ~~~~~g~~C~~~~~C~~~~~~C-------~~g~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~~~-~~C~~~~eC~~~~ 869 (1050)
++++.-. ...+.|.. ..| ...+|....| .|.|.+.-.|..+..|..++..... ..|.. ..|....
T Consensus 876 ~gd~l~~--~p~~~c~~--c~c~p~gs~~~~~~c~~~tG--Qcec~~~v~g~~c~~c~~g~fnl~s~~gC~~-c~c~~~g 948 (1705)
T KOG1836|consen 876 FGDPLAP--NPEDKCFA--CGCVPAGSELPSLTCNPVTG--QCECKPNVEGRDCLYCFKGFFNLNSGVGCEP-CNCDPTG 948 (1705)
T ss_pred cccccCC--CcCCcccc--ccCccCCcccccccCCCccc--ceeccCCCCccccccccccccccCCCCCccc-ccccccc
Confidence 8774321 01111111 111 0134555555 6777777777666556554433321 22211 1111110
Q ss_pred CCCCCCccccCCCceeeecCCCceeCCCCCCccc-----CCccCCCCCCCC-C----CEEeeCCCCeEEecCCCceeCCC
Q psy18237 870 GICANGDCANFQGSFQCTCANGYTLNTARDSCVD-----IDECARHPNICN-N----GTCVNAIGSFKCHCYAGFKLSHN 939 (1050)
Q Consensus 870 ~~C~ng~C~~~~gsy~C~C~~Gy~g~~~~~~C~~-----ideC~~~~~~C~-~----g~C~n~~g~y~C~C~~Gy~g~~~ 939 (1050)
=.+..|....| +|.|.+|-+|..+.+-+.. +.-|. +..|. . .+|....| +|.|++++.|..+
T Consensus 949 --s~~~~c~~~tG--qc~c~~gVtgqrc~qc~~~~~~~~~~gc~--~c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~c 1020 (1705)
T KOG1836|consen 949 --SESSDCDVGTG--QCYCRPGVTGQRCDQCETYHFGFQTEGCG--LCECDPLGSRGFQCDPEDG--QCPCRPGFEGRRC 1020 (1705)
T ss_pred --cccccccccCC--ceeeecCccccccCccccCcccccccCCc--ceecccCCcccceecccCC--eeeecCCCCCccc
Confidence 01134544433 7888888888766432221 11121 11233 2 36776566 8999999999876
Q ss_pred CCCc
Q psy18237 940 NDCI 943 (1050)
Q Consensus 940 ~~C~ 943 (1050)
..|.
T Consensus 1021 ~~c~ 1024 (1705)
T KOG1836|consen 1021 DQCE 1024 (1705)
T ss_pred cccc
Confidence 5554
No 24
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.36 E-value=0.00015 Score=72.16 Aligned_cols=146 Identities=29% Similarity=0.729 Sum_probs=90.7
Q ss_pred CCCCCCeeecCCCceEEecCCCeEeCCCCCccccCccccccCCCCCccccCcccchhhhhcccCCcCCCCCccccccCCC
Q psy18237 585 YLCRNGRCRNNIGSFFCECLQGYTLASEGQYCRDVDECKEVNKRESRCYLDTEEEEEEEEEEEGGYGGGSRRVTCTKEIA 664 (1050)
Q Consensus 585 ~~C~ng~C~n~~gsy~C~C~~Gy~~~~~G~~C~~~~eC~~~~~~~~~C~~~~~~c~~~~c~~~g~~~~~~~~~~C~~~~~ 664 (1050)
..|.||.-+...+-|.|.|.+||.+.. ..+|+...+|....
T Consensus 6 T~CKNG~LiQMSNHfEC~Cnegfvl~~-EntCE~kv~C~~~e-------------------------------------- 46 (197)
T PF06247_consen 6 TICKNGYLIQMSNHFECKCNEGFVLKN-ENTCEEKVECDKLE-------------------------------------- 46 (197)
T ss_dssp ---BTEEEEEESSEEEEEESTTEEEEE-TTEEEE----SG-G--------------------------------------
T ss_pred ccccCCEEEEccCceEEEcCCCcEEcc-ccccccceecCccc--------------------------------------
Confidence 468899999999999999999999764 56787665554310
Q ss_pred CCCCcceeecCCCCccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCCCCCEEeecC-----CCeEEeCCCCccc
Q psy18237 665 GSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYRPNSATREGICPSPGKCQNVM-----GSFICTCPPGYRL 739 (1050)
Q Consensus 665 ~~~~~~~C~C~~G~~~G~~C~~c~~~~~~~~~~~Cp~g~g~~~~~~~~~~~C~~~g~C~~~~-----gsy~C~C~~Gy~g 739 (1050)
....+|...++|++.. ..|.|.|.+||..
T Consensus 47 ----------------------------------------------~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~ 80 (197)
T PF06247_consen 47 ----------------------------------------------NVNKPCGDYAKCINQANKGEERAYKCDCINGYIL 80 (197)
T ss_dssp ----------------------------------------------GTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE
T ss_pred ----------------------------------------------ccCccccchhhhhcCCCcccceeEEEecccCcee
Confidence 0123566678888764 4699999999998
Q ss_pred CCCCCcccccccccCCCCCCCCCCCCCCCccCCCCCCCCCCeEEeCC---CCeeeeCCCCceeCCCCCcccc--CCccCC
Q psy18237 740 SPDKNSCQEDFAKLCPEGVGRGDKGEDLNECALMPSACQGGECINTD---GSYRCECPAGYVKDETGKICID--DNECLS 814 (1050)
Q Consensus 740 ~~~~~~C~~~~~c~C~~g~~~~~~~~~ideC~~~~~~C~~g~C~~~~---gsy~C~C~~Gy~~~~~g~~C~~--~~~C~~ 814 (1050)
..+ .|. .++|.. -.|..|.|+-.+ ....|+|.-|++.+ +...|.. ..+|+.
T Consensus 81 ~~~--vCv-------------------p~~C~~--~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~-dn~kCtk~G~T~C~L 136 (197)
T PF06247_consen 81 KQG--VCV-------------------PNKCNN--KDCGSGKCILDPDNPNNPTCSCNIGKVPD-DNKKCTKTGETKCSL 136 (197)
T ss_dssp SSS--SEE-------------------EGGGSS-----TTEEEEEEEGGGSEEEEEE-TEEETT-TTTESEEEE------
T ss_pred eCC--eEc-------------------hhhcCc--eecCCCeEEecCCCCCCceeEeeeceEec-cCCcccCCCccceee
Confidence 532 443 244543 337778897643 34689999999843 4667742 334553
Q ss_pred CCCCC-CCCeEecCCCceEEecCCCCcCC
Q psy18237 815 IPNIC-GNGTCTNLNGGFECTCSEGYAPG 842 (1050)
Q Consensus 815 ~~~~C-~~g~C~~~~g~~~C~C~~Gy~g~ 842 (1050)
.| .+-.|....+-|+|.|.+||.+.
T Consensus 137 ---KCk~nE~CK~~~~~Y~C~~~~~~~~~ 162 (197)
T PF06247_consen 137 ---KCKENEECKLVDGYYKCVCKEGFPGD 162 (197)
T ss_dssp -----TTTEEEEEETTEEEEEE-TT-EEE
T ss_pred ---ecCCCcceeeeCcEEEeecCCCCCCC
Confidence 46 67899999999999999999875
No 25
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.27 E-value=0.00034 Score=52.43 Aligned_cols=33 Identities=58% Similarity=1.255 Sum_probs=28.5
Q ss_pred cCccCcCCCCCCCC-CeEeecCCceEEeCCCCCe-e
Q psy18237 307 DIDECATSIQRCGE-GFCVNDVGTYHCVCPDGYM-L 340 (1050)
Q Consensus 307 dideC~~~~~~C~~-~~C~n~~Gsy~C~C~~G~~-g 340 (1050)
|||||... .+|.+ ++|+++.|+|+|.|++||+ |
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~~g 35 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYTDG 35 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCccC
Confidence 57888763 57988 8999999999999999998 5
No 26
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.21 E-value=0.0003 Score=51.57 Aligned_cols=31 Identities=42% Similarity=0.894 Sum_probs=24.8
Q ss_pred CCCCCCeeecCCCceEEecCCCeEeCCCCCcc
Q psy18237 585 YLCRNGRCRNNIGSFFCECLQGYTLASEGQYC 616 (1050)
Q Consensus 585 ~~C~ng~C~n~~gsy~C~C~~Gy~~~~~G~~C 616 (1050)
..|+ ..|++++++|+|.|++||+|.+|+++|
T Consensus 6 GgC~-h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 6 GGCS-HICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGSS-SEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCcC-CCCccCCCceEeECCCCCEECcCCCCC
Confidence 3454 389999999999999999999999876
No 27
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.10 E-value=0.00066 Score=50.83 Aligned_cols=37 Identities=49% Similarity=1.113 Sum_probs=29.1
Q ss_pred ccccccCCCCCCC-CCeeecCCCCeEeeCCCCCeeCCCCCCC
Q psy18237 148 EIDECNLMPNMCN-HGTCMNTPGSFHCQCNRGFLYDSDTHQC 188 (1050)
Q Consensus 148 di~eC~~~~~~C~-~~~C~~~~g~y~C~C~~Gy~~~~~g~~C 188 (1050)
++|||... ++|. +++|+++.|+|+|.|++||+ +|..|
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C 38 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT---DGRNC 38 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCc---cCCcC
Confidence 46788732 6787 57999999999999999998 24555
No 28
>PF00683 TB: TB domain; InterPro: IPR002212 The transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs), which are localized to fibrillar structures in the extracellular matrix.; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=97.08 E-value=1.9e-05 Score=59.94 Aligned_cols=40 Identities=48% Similarity=1.359 Sum_probs=31.2
Q ss_pred CcCCCcCCCCcceeeecCCCCcCCCCCCCCCCCCCcceee
Q psy18237 363 LCSLPMSNEQTRMVCCCSMGQSWGKPCQPCPPPGSRDYIL 402 (1050)
Q Consensus 363 ~C~~~~~~~~~~~~C~C~~g~~~~~~C~~Cp~~~~~~~~~ 402 (1050)
.|+++....+++.+|+|+.|.+||.+|+.||..++.+|+.
T Consensus 2 ~C~~~l~~~~tk~~CCCs~G~aWG~~Ce~CP~~~t~ef~~ 41 (42)
T PF00683_consen 2 QCSNPLPGNVTKSECCCSVGRAWGSPCEPCPPPGTDEFNR 41 (42)
T ss_dssp CEEEEEEEEEEHHHHHTTT-SEETTTTEE---TTSHHHHH
T ss_pred cCCCcCCCCeeccccCCCCCCcCCCccccCCCCCChHHhc
Confidence 4777777788899999999999999999999999888753
No 29
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.96 E-value=0.00053 Score=49.11 Aligned_cols=28 Identities=36% Similarity=1.023 Sum_probs=25.6
Q ss_pred CCCCCCCCEEeecC-CCeEEeCCCCcccC
Q psy18237 713 EGICPSPGKCQNVM-GSFICTCPPGYRLS 740 (1050)
Q Consensus 713 ~~~C~~~g~C~~~~-gsy~C~C~~Gy~g~ 740 (1050)
+++|+++|+|++.. ++|+|+|++||+|.
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 56899999999998 99999999999984
No 30
>KOG1226|consensus
Probab=96.66 E-value=0.0088 Score=71.71 Aligned_cols=96 Identities=30% Similarity=0.655 Sum_probs=54.7
Q ss_pred EEecCCCCcCCCCCCccccccCCCCCCCccCccccCCC--CCCCC-CccccCCCceeeecCCCceeCCCCCCcc-cCCcc
Q psy18237 832 ECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYERP--GICAN-GDCANFQGSFQCTCANGYTLNTARDSCV-DIDEC 907 (1050)
Q Consensus 832 ~C~C~~Gy~g~~~~~C~~~~~~~~~~~C~~~~eC~~~~--~~C~n-g~C~~~~gsy~C~C~~Gy~g~~~~~~C~-~ideC 907 (1050)
+|.|.+||.|. .|+........ -...+.|.... ..|.+ |.|+=. +|+|.+...+...++.|+ |--.|
T Consensus 479 ~C~C~~G~~G~---~CEC~~~~~ss--~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~fCECDnfsC 549 (783)
T KOG1226|consen 479 QCRCDEGWLGK---KCECSTDELSS--SEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGKFCECDNFSC 549 (783)
T ss_pred ceecCCCCCCC---cccCCccccCc--HhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeeeeeeccCccc
Confidence 58999999998 45421111100 01123443221 24664 666543 788887776544445555 22234
Q ss_pred CCC-CCCCC-CCEEeeCCCCeEEecCCCceeCCCC
Q psy18237 908 ARH-PNICN-NGTCVNAIGSFKCHCYAGFKLSHNN 940 (1050)
Q Consensus 908 ~~~-~~~C~-~g~C~n~~g~y~C~C~~Gy~g~~~~ 940 (1050)
... ...|. +|+|.=. +|+|.+||+|..++
T Consensus 550 ~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~ 580 (783)
T KOG1226|consen 550 ERHKGVLCGGHGRCECG----RCVCNPGWTGSACN 580 (783)
T ss_pred ccccCcccCCCCeEeCC----cEEcCCCCccCCCC
Confidence 432 22476 7777644 79999999999854
No 31
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.60 E-value=0.0012 Score=47.27 Aligned_cols=26 Identities=38% Similarity=0.959 Sum_probs=23.8
Q ss_pred CCCCC-CeEeecC-CceEEeCCCCCeeC
Q psy18237 316 QRCGE-GFCVNDV-GTYHCVCPDGYMLL 341 (1050)
Q Consensus 316 ~~C~~-~~C~n~~-Gsy~C~C~~G~~g~ 341 (1050)
.+|++ |+|++.. ++|+|.|++||+|.
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 47998 9999998 99999999999983
No 32
>KOG1226|consensus
Probab=96.52 E-value=0.015 Score=69.87 Aligned_cols=49 Identities=33% Similarity=0.716 Sum_probs=31.4
Q ss_pred ccccCCCCc----ccccccccccCCcccCccccC-CCCCCC-CCeeecCCCceEEecCCCeEeCCCCCccc
Q psy18237 553 ACALTPVLT----RKVATPVAVINDCIDLDECRM-MSYLCR-NGRCRNNIGSFFCECLQGYTLASEGQYCR 617 (1050)
Q Consensus 553 ~C~C~~G~~----G~~C~~~~~~~~C~dideC~~-~~~~C~-ng~C~n~~gsy~C~C~~Gy~~~~~G~~C~ 617 (1050)
+|.|.+... |..|+ | |--.|.. ....|. ||.|.=. +|.|.+||+ |..|.
T Consensus 526 qC~C~~~~~~~i~G~fCE-------C-DnfsC~r~~g~lC~g~G~C~CG----~CvC~~Gwt----G~~C~ 580 (783)
T KOG1226|consen 526 QCVCHKPDNGKIYGKFCE-------C-DNFSCERHKGVLCGGHGRCECG----RCVCNPGWT----GSACN 580 (783)
T ss_pred ceEecCCCCCceeeeeee-------c-cCcccccccCcccCCCCeEeCC----cEEcCCCCc----cCCCC
Confidence 477777776 66665 1 1112222 134576 6777654 499999999 99887
No 33
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.40 E-value=0.0035 Score=46.31 Aligned_cols=33 Identities=58% Similarity=1.264 Sum_probs=28.0
Q ss_pred cCccCcCCCCCCCC-CeEeecCCceEEeCCCCCee
Q psy18237 307 DIDECATSIQRCGE-GFCVNDVGTYHCVCPDGYML 340 (1050)
Q Consensus 307 dideC~~~~~~C~~-~~C~n~~Gsy~C~C~~G~~g 340 (1050)
++|+|... .+|.+ ++|++..++|+|.|++||.|
T Consensus 1 ~~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 1 DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred CcccCCCC-CCcCCCCEeECCCCCeEeECCCCCcC
Confidence 46778652 46887 89999999999999999998
No 34
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.36 E-value=0.0045 Score=45.72 Aligned_cols=33 Identities=48% Similarity=1.027 Sum_probs=26.4
Q ss_pred ccccccCCCCCCC-CCeeecCCCCeEeeCCCCCee
Q psy18237 148 EIDECNLMPNMCN-HGTCMNTPGSFHCQCNRGFLY 181 (1050)
Q Consensus 148 di~eC~~~~~~C~-~~~C~~~~g~y~C~C~~Gy~~ 181 (1050)
++++|... .+|. +++|++..++|+|.|++||+|
T Consensus 1 ~~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 1 DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred CcccCCCC-CCcCCCCEeECCCCCeEeECCCCCcC
Confidence 35677632 5787 579999999999999999983
No 35
>KOG1836|consensus
Probab=96.02 E-value=0.089 Score=70.38 Aligned_cols=112 Identities=22% Similarity=0.477 Sum_probs=65.8
Q ss_pred EEeCCCCeee-eCCCCceeCCCCCccccCCccCCCCCCC-CCCeEecC--CCceEEe-cCCCCcCCCCCCccccccCCCC
Q psy18237 782 CINTDGSYRC-ECPAGYVKDETGKICIDDNECLSIPNIC-GNGTCTNL--NGGFECT-CSEGYAPGPLGSCAILLTLPPI 856 (1050)
Q Consensus 782 C~~~~gsy~C-~C~~Gy~~~~~g~~C~~~~~C~~~~~~C-~~g~C~~~--~g~~~C~-C~~Gy~g~~~~~C~~~~~~~~~ 856 (1050)
|+....+-.| +|..||.+++.... ..+ |.. .+| ..+.|..+ .....|. |++||+|..+..|..++.+.+.
T Consensus 749 C~~~t~G~~C~~C~~GfYg~~~~~~--~~d-C~~--C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~ 823 (1705)
T KOG1836|consen 749 CKHNTFGGQCAQCVDGFYGLPDLGT--SGD-CQP--CPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPL 823 (1705)
T ss_pred cccCCCCCchhhhcCCCCCccccCC--CCC-Ccc--CCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCC
Confidence 5555545567 49999987643221 112 544 666 55666544 3567898 9999999988888888877665
Q ss_pred CCCccCccccCC----------CCCCCC--Cc---cccCCCceee-ecCCCceeCCCC
Q psy18237 857 SPSTDIDECYER----------PGICAN--GD---CANFQGSFQC-TCANGYTLNTAR 898 (1050)
Q Consensus 857 ~~C~~~~eC~~~----------~~~C~n--g~---C~~~~gsy~C-~C~~Gy~g~~~~ 898 (1050)
..=.++..|.+- .+.|.. +. |+....+.+| .|.+||.|+.-.
T Consensus 824 ~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~ 881 (1705)
T KOG1836|consen 824 GHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLA 881 (1705)
T ss_pred CCCCCcccCccceeccccCccccccccccccceeeccCCcccccccccccCccccccC
Confidence 222333333221 123431 33 3333334455 799999887643
No 36
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.32 E-value=0.024 Score=41.30 Aligned_cols=25 Identities=48% Similarity=1.198 Sum_probs=21.8
Q ss_pred CCCCCCeeecCCCCeEeeCCCCCee
Q psy18237 157 NMCNHGTCMNTPGSFHCQCNRGFLY 181 (1050)
Q Consensus 157 ~~C~~~~C~~~~g~y~C~C~~Gy~~ 181 (1050)
++|.+++|+++.++|+|.|++||+|
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCcc
Confidence 5787449999999999999999985
No 37
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.25 E-value=0.021 Score=41.60 Aligned_cols=24 Identities=50% Similarity=1.045 Sum_probs=21.3
Q ss_pred CCCCCCeeecCCCceEEecCCCeE
Q psy18237 585 YLCRNGRCRNNIGSFFCECLQGYT 608 (1050)
Q Consensus 585 ~~C~ng~C~n~~gsy~C~C~~Gy~ 608 (1050)
.+|.++.|++..++|+|.|++||.
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~ 29 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYT 29 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCc
Confidence 467755999999999999999999
No 38
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.22 E-value=0.022 Score=41.24 Aligned_cols=27 Identities=48% Similarity=1.077 Sum_probs=23.8
Q ss_pred CCCCCC-CeEeecCCceEEeCCCCCeeC
Q psy18237 315 IQRCGE-GFCVNDVGTYHCVCPDGYMLL 341 (1050)
Q Consensus 315 ~~~C~~-~~C~n~~Gsy~C~C~~G~~g~ 341 (1050)
..+|.+ ++|+++.++|+|.|++||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 356877 999999999999999999983
No 39
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.09 E-value=0.03 Score=40.53 Aligned_cols=27 Identities=52% Similarity=1.161 Sum_probs=23.0
Q ss_pred CCCCC-CCeeecCCCCeEeeCCCCCeeC
Q psy18237 156 PNMCN-HGTCMNTPGSFHCQCNRGFLYD 182 (1050)
Q Consensus 156 ~~~C~-~~~C~~~~g~y~C~C~~Gy~~~ 182 (1050)
..+|. +++|+++.++|+|.|++||.+.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 45677 5899999999999999999843
No 40
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=94.44 E-value=0.037 Score=59.47 Aligned_cols=44 Identities=41% Similarity=0.877 Sum_probs=35.5
Q ss_pred CCCCcccCccCcCCCCCCCCCeEeecCCceEEeCCCCCeeCCCCC
Q psy18237 301 TGTRCVDIDECATSIQRCGEGFCVNDVGTYHCVCPDGYMLLPSGK 345 (1050)
Q Consensus 301 ~g~~C~dideC~~~~~~C~~~~C~n~~Gsy~C~C~~G~~g~~~~~ 345 (1050)
.+..|++++||...++.|. ..|.++.|+|.|.|++||++..+++
T Consensus 180 ~~~~C~~~~~C~~~~~~c~-~~C~~~~g~~~c~c~~g~~~~~~~~ 223 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQ-QVCISTPGSYLCACTEGYALLEDNK 223 (224)
T ss_pred ccccCcCchhhcCCCCCcc-ceEEcCCCCEEeECCCCccCCCCCC
Confidence 4567778888876666777 4899999999999999999865543
No 41
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.04 E-value=0.084 Score=56.68 Aligned_cols=43 Identities=30% Similarity=0.763 Sum_probs=36.6
Q ss_pred cCCcccCccccCCCCCCCCCeeecCCCceEEecCCCeEeCCCCC
Q psy18237 571 INDCIDLDECRMMSYLCRNGRCRNNIGSFFCECLQGYTLASEGQ 614 (1050)
Q Consensus 571 ~~~C~dideC~~~~~~C~ng~C~n~~gsy~C~C~~Gy~~~~~G~ 614 (1050)
+..|.+++||...++.|. ..|.++.|+|.|.|++||++.++++
T Consensus 181 ~~~C~~~~~C~~~~~~c~-~~C~~~~g~~~c~c~~g~~~~~~~~ 223 (224)
T cd01475 181 GKICVVPDLCATLSHVCQ-QVCISTPGSYLCACTEGYALLEDNK 223 (224)
T ss_pred cccCcCchhhcCCCCCcc-ceEEcCCCCEEeECCCCccCCCCCC
Confidence 367889999987667787 4799999999999999999877664
No 42
>PF00683 TB: TB domain; InterPro: IPR002212 Transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs) which are localized to fibrillar structures in the extracellular matrix.; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=93.02 E-value=0.0045 Score=47.16 Aligned_cols=39 Identities=49% Similarity=1.306 Sum_probs=27.0
Q ss_pred ccccCCCCCCCcceeecCCCCccCCCCCCCCCCCCCCCc
Q psy18237 658 TCTKEIAGSTTRSTCCCSIGKAWGPQCEECPAVGSDEHK 696 (1050)
Q Consensus 658 ~C~~~~~~~~~~~~C~C~~G~~~G~~C~~c~~~~~~~~~ 696 (1050)
.|..++....++..|.|..|.+||..|+.||..++.+|+
T Consensus 2 ~C~~~l~~~~tk~~CCCs~G~aWG~~Ce~CP~~~t~ef~ 40 (42)
T PF00683_consen 2 QCSNPLPGNVTKSECCCSVGRAWGSPCEPCPPPGTDEFN 40 (42)
T ss_dssp CEEEEEEEEEEHHHHHTTT-SEETTTTEE---TTSHHHH
T ss_pred cCCCcCCCCeeccccCCCCCCcCCCccccCCCCCChHHh
Confidence 455555566778899999999999999999987765543
No 43
>smart00051 DSL delta serrate ligand.
Probab=92.37 E-value=0.25 Score=41.41 Aligned_cols=38 Identities=13% Similarity=0.106 Sum_probs=30.3
Q ss_pred CCCCCcCcccccccCCC----CCCcccCCCCCCceeecCCCcccC
Q psy18237 68 AWLPTFTNVTTLTNVTS----TNEYVTGDNVITRWCVCDEGFRGD 108 (1050)
Q Consensus 68 ~c~~g~~g~~c~~~~~~----~n~~~c~~~~g~~~C~C~~G~~g~ 108 (1050)
.|.++|+|..|...+.. .....|+. .|. ++|.+||+|.
T Consensus 20 ~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~--~~C~~Gw~G~ 61 (63)
T smart00051 20 TCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGN--KGCLEGWMGP 61 (63)
T ss_pred eCCCCCcCCccCCEeCcCccccCCccCCc-CCC--EecCCCCcCC
Confidence 79999999999866543 56677865 455 9999999996
No 44
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=89.51 E-value=0.43 Score=34.15 Aligned_cols=23 Identities=43% Similarity=1.099 Sum_probs=19.3
Q ss_pred CCC-CCceecCCCCeEEEcCCCCccC
Q psy18237 243 ICG-SGTCENNIGSFSCRCEDGYSVK 267 (1050)
Q Consensus 243 ~C~-~~~C~~~~g~y~C~C~~Gy~g~ 267 (1050)
+|+ +|+|+...+ +|+|.+||+|+
T Consensus 7 ~C~~~G~C~~~~g--~C~C~~g~~G~ 30 (32)
T PF07974_consen 7 ICSGHGTCVSPCG--RCVCDSGYTGP 30 (32)
T ss_pred ccCCCCEEeCCCC--EEECCCCCcCC
Confidence 576 599987745 99999999997
No 45
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=88.92 E-value=0.56 Score=33.59 Aligned_cols=23 Identities=39% Similarity=1.061 Sum_probs=18.0
Q ss_pred CCCC-CCeeecCCCCeEeeCCCCCee
Q psy18237 157 NMCN-HGTCMNTPGSFHCQCNRGFLY 181 (1050)
Q Consensus 157 ~~C~-~~~C~~~~g~y~C~C~~Gy~~ 181 (1050)
..|. ||+|+.. ..+|.|++||+|
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G 29 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTG 29 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcC
Confidence 3576 7899876 348999999993
No 46
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=86.87 E-value=0.47 Score=26.62 Aligned_cols=13 Identities=46% Similarity=1.273 Sum_probs=10.2
Q ss_pred EEecCCCeEeCCCCCcc
Q psy18237 600 FCECLQGYTLASEGQYC 616 (1050)
Q Consensus 600 ~C~C~~Gy~~~~~G~~C 616 (1050)
.|+|++||+ |.+|
T Consensus 1 ~C~C~~G~~----G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWT----GPNC 13 (13)
T ss_dssp EEEE-TTEE----TTTT
T ss_pred CccCcCCCc----CCCC
Confidence 499999999 8766
No 47
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=86.42 E-value=0.25 Score=36.25 Aligned_cols=26 Identities=27% Similarity=0.645 Sum_probs=19.3
Q ss_pred CCCC-CCeeecCC-CCeEeeCCCCCeeC
Q psy18237 157 NMCN-HGTCMNTP-GSFHCQCNRGFLYD 182 (1050)
Q Consensus 157 ~~C~-~~~C~~~~-g~y~C~C~~Gy~~~ 182 (1050)
..|. ||.|++.. |+++|.|..||..+
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 4464 89998876 99999999999865
No 48
>smart00051 DSL delta serrate ligand.
Probab=82.77 E-value=1.2 Score=37.46 Aligned_cols=45 Identities=22% Similarity=0.431 Sum_probs=30.4
Q ss_pred cccCCCCcccccccccccCCcccCccccCCCCCCCCCeeecCCCceEEecCCCeEeCCCCCcc
Q psy18237 554 CALTPVLTRKVATPVAVINDCIDLDECRMMSYLCRNGRCRNNIGSFFCECLQGYTLASEGQYC 616 (1050)
Q Consensus 554 C~C~~G~~G~~C~~~~~~~~C~dideC~~~~~~C~ng~C~n~~gsy~C~C~~Gy~~~~~G~~C 616 (1050)
=.|.++|.|..|+ ..|...+.... +.+|.. .|. ++|++||+ |..|
T Consensus 19 v~C~~~~yG~~C~-----~~C~~~~d~~~------~~~Cd~-~G~--~~C~~Gw~----G~~C 63 (63)
T smart00051 19 VTCDENYYGEGCN-----KFCRPRDDFFG------HYTCDE-NGN--KGCLEGWM----GPYC 63 (63)
T ss_pred eeCCCCCcCCccC-----CEeCcCccccC------CccCCc-CCC--EecCCCCc----CCCC
Confidence 3588999998887 55654333322 457854 454 89999999 7665
No 49
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=74.92 E-value=2.6 Score=31.10 Aligned_cols=25 Identities=36% Similarity=0.781 Sum_probs=19.4
Q ss_pred CC-CCEEeeCC-CCeEEecCCCceeCC
Q psy18237 914 CN-NGTCVNAI-GSFKCHCYAGFKLSH 938 (1050)
Q Consensus 914 C~-~g~C~n~~-g~y~C~C~~Gy~g~~ 938 (1050)
|+ |+.|++.. |+++|.|.+||....
T Consensus 7 cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 7 CPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp --TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred CCCCcccEEcCCCCEEEEeeCCccccC
Confidence 77 89999876 999999999998754
No 50
>PHA02887 EGF-like protein; Provisional
Probab=72.59 E-value=3 Score=38.79 Aligned_cols=39 Identities=41% Similarity=0.975 Sum_probs=28.9
Q ss_pred CccccC-CCCCCCCCeeecCC--CceEEecCCCeEeCCCCCccccC
Q psy18237 577 LDECRM-MSYLCRNGRCRNNI--GSFFCECLQGYTLASEGQYCRDV 619 (1050)
Q Consensus 577 ideC~~-~~~~C~ng~C~n~~--gsy~C~C~~Gy~~~~~G~~C~~~ 619 (1050)
..+|.. ..+-|.||+|.-.. ..+.|.|++||+ |.+|+.+
T Consensus 83 f~pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GYt----G~RCE~v 124 (126)
T PHA02887 83 FEKCKNDFNDFCINGECMNIIDLDEKFCICNKGYT----GIRCDEV 124 (126)
T ss_pred ccccChHhhCEeeCCEEEccccCCCceeECCCCcc----cCCCCcc
Confidence 344543 34678899997544 668999999999 9999854
No 51
>KOG1218|consensus
Probab=72.50 E-value=68 Score=35.95 Aligned_cols=44 Identities=30% Similarity=0.942 Sum_probs=26.4
Q ss_pred eeeCCCCceeCCCCCccccCCc-cCCCCCCC-CCCeEecCCCceEEecCCCCc
Q psy18237 790 RCECPAGYVKDETGKICIDDNE-CLSIPNIC-GNGTCTNLNGGFECTCSEGYA 840 (1050)
Q Consensus 790 ~C~C~~Gy~~~~~g~~C~~~~~-C~~~~~~C-~~g~C~~~~g~~~C~C~~Gy~ 840 (1050)
.|.|.+||+ +..|..... |... ..+ .++.|....+ .+.+.+++.
T Consensus 163 ~c~c~~g~~----g~~~~~~~~~c~~~-~~~~~g~~C~~~~~--~~~~~~~~~ 208 (316)
T KOG1218|consen 163 ICTCQPGFV----GVFCVESCSGCSPL-TACENGAKCNRSTG--SCLCYPGPS 208 (316)
T ss_pred ceeccCCcc----cccccccCCCcCCC-cccCCCCeeecccc--ccccCCCCc
Confidence 488999996 666643322 4432 556 4458877765 455555554
No 52
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=70.90 E-value=2.9 Score=39.48 Aligned_cols=39 Identities=31% Similarity=0.782 Sum_probs=30.1
Q ss_pred CccccC-CCCCCCCCeeecCC--CceEEecCCCeEeCCCCCccccC
Q psy18237 577 LDECRM-MSYLCRNGRCRNNI--GSFFCECLQGYTLASEGQYCRDV 619 (1050)
Q Consensus 577 ideC~~-~~~~C~ng~C~n~~--gsy~C~C~~Gy~~~~~G~~C~~~ 619 (1050)
+-+|.. ..+-|.||.|.-.+ ..+.|.|..||+ |.+|+..
T Consensus 42 i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYt----GeRCEh~ 83 (139)
T PHA03099 42 IRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYT----GIRCQHV 83 (139)
T ss_pred cccCChhhCCEeECCEEEeeccCCCceeECCCCcc----cccccce
Confidence 455543 35678899997544 789999999999 9999844
No 53
>KOG3512|consensus
Probab=59.07 E-value=25 Score=40.66 Aligned_cols=122 Identities=24% Similarity=0.503 Sum_probs=66.2
Q ss_pred CEEeecCCC-eEEeCCCCcccCCCCCcccccccccCCCCCCCCCCCCCCCccCC------------------CCCCCCCC
Q psy18237 720 GKCQNVMGS-FICTCPPGYRLSPDKNSCQEDFAKLCPEGVGRGDKGEDLNECAL------------------MPSACQGG 780 (1050)
Q Consensus 720 g~C~~~~gs-y~C~C~~Gy~g~~~~~~C~~~~~c~C~~g~~~~~~~~~ideC~~------------------~~~~C~~g 780 (1050)
..|+....+ ++|.|.-+-.|..++ .|...|. ..+++...-.++++|.. .++.+..|
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCg-rCKpfy~----dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~Sgg 359 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCG-RCKPFYY----DRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGG 359 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcc-ccccccc----CCCccccccCCCccccccccchhhhhcccchhhhcccCccccc
Confidence 378766544 999999999887653 3432110 00111112223333321 23334445
Q ss_pred eEEe---CCCCeeee-CCCCceeCCCCCccccCCccCCCCCCC-----CCCeEecCCCceEEecCCCCcCCCCCCccccc
Q psy18237 781 ECIN---TDGSYRCE-CPAGYVKDETGKICIDDNECLSIPNIC-----GNGTCTNLNGGFECTCSEGYAPGPLGSCAILL 851 (1050)
Q Consensus 781 ~C~~---~~gsy~C~-C~~Gy~~~~~g~~C~~~~~C~~~~~~C-----~~g~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~ 851 (1050)
+|+| ...+-+|. |.+||..|.+ .-=.+...|.. ..| .+-+|..+.| +|.|++|-+|..+..|..++
T Consensus 360 vClnCrHnTaGrhChyCreGyyRd~s-~pl~hrkaCk~--CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnrCa~gy 434 (592)
T KOG3512|consen 360 VCLNCRHNTAGRHCHYCREGYYRDGS-KPLTHRKACKA--CDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNRCAPGY 434 (592)
T ss_pred eEeecccCCCCcccccccCccccCCC-CCCchhhhhhh--cCCcccccccccccccCC--cccCCCCCcccccccccchh
Confidence 6654 22344576 9999987732 11122222322 223 2457877777 79999999998666666443
No 54
>PHA02887 EGF-like protein; Provisional
Probab=56.77 E-value=9 Score=35.76 Aligned_cols=32 Identities=31% Similarity=0.857 Sum_probs=24.4
Q ss_pred CCCCCCCeeecCC--CCeEeeCCCCCeeCCCCCCCccC
Q psy18237 156 PNMCNHGTCMNTP--GSFHCQCNRGFLYDSDTHQCIDI 191 (1050)
Q Consensus 156 ~~~C~~~~C~~~~--g~y~C~C~~Gy~~~~~g~~C~d~ 191 (1050)
.+-|-||+|.-.+ ..+.|.|.+||+ |..|+.+
T Consensus 91 k~YCiHG~C~yI~dL~epsCrC~~GYt----G~RCE~v 124 (126)
T PHA02887 91 NDFCINGECMNIIDLDEKFCICNKGYT----GIRCDEV 124 (126)
T ss_pred hCEeeCCEEEccccCCCceeECCCCcc----cCCCCcc
Confidence 4567788896554 468999999999 7778743
No 55
>KOG3512|consensus
Probab=56.46 E-value=28 Score=40.32 Aligned_cols=116 Identities=24% Similarity=0.530 Sum_probs=63.5
Q ss_pred CeEecCCCc-eEEecCCCCcCCCCCCccccccCCCC--CCCccCccccC------------------CCCCCCCCccc--
Q psy18237 822 GTCTNLNGG-FECTCSEGYAPGPLGSCAILLTLPPI--SPSTDIDECYE------------------RPGICANGDCA-- 878 (1050)
Q Consensus 822 g~C~~~~g~-~~C~C~~Gy~g~~~~~C~~~~~~~~~--~~C~~~~eC~~------------------~~~~C~ng~C~-- 878 (1050)
..|+-...+ ++|.|...-.|..+..|..-+...|- ..-.++++|.. ..+.+..+.|.
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnC 364 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNC 364 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeec
Confidence 356555444 89999999999888888753322221 11122233321 11222223443
Q ss_pred --cCCCceee-ecCCCceeCCCCCCcccCCccCCCCCCCC-----CCEEeeCCCCeEEecCCCceeCCCCCCc
Q psy18237 879 --NFQGSFQC-TCANGYTLNTARDSCVDIDECARHPNICN-----NGTCVNAIGSFKCHCYAGFKLSHNNDCI 943 (1050)
Q Consensus 879 --~~~gsy~C-~C~~Gy~g~~~~~~C~~ideC~~~~~~C~-----~g~C~n~~g~y~C~C~~Gy~g~~~~~C~ 943 (1050)
|+.| -+| .|++||..+.... =.+...|.. ..|+ +-+|..+.| +|.|.+|-+|..++.|.
T Consensus 365 rHnTaG-rhChyCreGyyRd~s~p-l~hrkaCk~--CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnrCa 431 (592)
T KOG3512|consen 365 RHNTAG-RHCHYCREGYYRDGSKP-LTHRKACKA--CDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNRCA 431 (592)
T ss_pred ccCCCC-cccccccCccccCCCCC-Cchhhhhhh--cCCcccccccccccccCC--cccCCCCCccccccccc
Confidence 2333 345 6999998765321 112233332 1143 347877777 89999999998765554
No 56
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=55.48 E-value=8.7 Score=36.43 Aligned_cols=41 Identities=24% Similarity=0.728 Sum_probs=30.4
Q ss_pred cccccccC-CCCCCCCCeeecCC--CCeEeeCCCCCeeCCCCCCCccC
Q psy18237 147 EEIDECNL-MPNMCNHGTCMNTP--GSFHCQCNRGFLYDSDTHQCIDI 191 (1050)
Q Consensus 147 ~di~eC~~-~~~~C~~~~C~~~~--g~y~C~C~~Gy~~~~~g~~C~d~ 191 (1050)
.+|-+|.. ..+-|.||+|...+ ..+.|.|..||+ |..|+-.
T Consensus 40 ~~i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYt----GeRCEh~ 83 (139)
T PHA03099 40 PAIRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYT----GIRCQHV 83 (139)
T ss_pred cccccCChhhCCEeECCEEEeeccCCCceeECCCCcc----cccccce
Confidence 34566763 34678889996554 689999999999 7788743
No 57
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies.
Probab=44.69 E-value=30 Score=27.40 Aligned_cols=30 Identities=20% Similarity=0.332 Sum_probs=20.4
Q ss_pred eEecCCCceEEecCCCCcCCCCCCccccccCC
Q psy18237 823 TCTNLNGGFECTCSEGYAPGPLGSCAILLTLP 854 (1050)
Q Consensus 823 ~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~ 854 (1050)
.|....| +|.|+++|+|..+..|..+++..
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~~~~ 42 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCDRCAPGYYGL 42 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCCCCCCCCccC
Confidence 3554444 79999999998666666555443
No 58
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=43.72 E-value=36 Score=27.11 Aligned_cols=30 Identities=37% Similarity=0.990 Sum_probs=19.9
Q ss_pred CCccccCCccCCCCCCCCCCeEecCCCceEEecCCCCcCC
Q psy18237 803 GKICIDDNECLSIPNICGNGTCTNLNGGFECTCSEGYAPG 842 (1050)
Q Consensus 803 g~~C~~~~~C~~~~~~C~~g~C~~~~g~~~C~C~~Gy~g~ 842 (1050)
|..|.....|. .++.|++. +|.|++||.-.
T Consensus 19 g~~C~~~~qC~------~~s~C~~g----~C~C~~g~~~~ 48 (52)
T PF01683_consen 19 GESCESDEQCI------GGSVCVNG----RCQCPPGYVEV 48 (52)
T ss_pred CCCCCCcCCCC------CcCEEcCC----EeECCCCCEec
Confidence 45565444444 46888654 89999998743
No 59
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=40.88 E-value=9.6 Score=32.02 Aligned_cols=40 Identities=13% Similarity=0.150 Sum_probs=20.4
Q ss_pred CCCCCCCcCcccccccCCCC----CCcccCCCCCCceeecCCCcccC
Q psy18237 66 DTAWLPTFTNVTTLTNVTST----NEYVTGDNVITRWCVCDEGFRGD 108 (1050)
Q Consensus 66 ~~~c~~g~~g~~c~~~~~~~----n~~~c~~~~g~~~C~C~~G~~g~ 108 (1050)
...|.+.|+|+.|..-+-.. -...|+. .|. =+|.+||+|+
T Consensus 18 rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~-~G~--~~C~~Gw~G~ 61 (63)
T PF01414_consen 18 RVVCDENYYGPNCSKFCKPRDDSFGHYTCDS-NGN--KVCLPGWTGP 61 (63)
T ss_dssp -----TTEETTTT-EE---EEETTEEEEE-S-S----EEE-TTEEST
T ss_pred EEECCCCCCCccccCCcCCCcCCcCCcccCC-CCC--CCCCCCCcCC
Confidence 34799999999999766442 2345653 444 5899999996
No 60
>KOG1218|consensus
Probab=40.25 E-value=87 Score=35.06 Aligned_cols=63 Identities=16% Similarity=0.288 Sum_probs=36.6
Q ss_pred CCCCC-CcCcccccccCCCCCCcccCCCCCCceeecCCCcccCCCCCCcccc-CCCCCCCCCCCCeeccCCC
Q psy18237 67 TAWLP-TFTNVTTLTNVTSTNEYVTGDNVITRWCVCDEGFRGDGYSCEDIDE-CTDNTNYCDYILLCGSKPG 136 (1050)
Q Consensus 67 ~~c~~-g~~g~~c~~~~~~~n~~~c~~~~g~~~C~C~~G~~g~~~~C~d~~e-C~~~~~~C~~~~~C~n~~g 136 (1050)
..|.. +|+|..|.... .+...+....+ .|.|.+||+|. .|..... |.. ...|.+++.|+...+
T Consensus 135 ~~C~~~~~~g~~C~~~c--~~~~~~~~~~~--~c~c~~g~~g~--~~~~~~~~c~~-~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 135 EQCGEENLVGLKCQRDC--QCTGGCDCKNG--ICTCQPGFVGV--FCVESCSGCSP-LTACENGAKCNRSTG 199 (316)
T ss_pred ccccccCCCCCCccCCC--CCccccCCCCC--ceeccCCcccc--cccccCCCcCC-CcccCCCCeeecccc
Confidence 44555 77777777666 33333333344 48899999997 4533222 553 345666666665544
No 61
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=39.63 E-value=26 Score=25.38 Aligned_cols=22 Identities=32% Similarity=0.584 Sum_probs=15.2
Q ss_pred CeeecCCCceEEecCCCeEeCCC
Q psy18237 590 GRCRNNIGSFFCECLQGYTLASE 612 (1050)
Q Consensus 590 g~C~n~~gsy~C~C~~Gy~~~~~ 612 (1050)
+.|.+... +.|.|++||.++..
T Consensus 10 A~CDpn~~-~~C~CPeGyIlde~ 31 (34)
T PF09064_consen 10 ADCDPNSP-GQCFCPEGYILDEG 31 (34)
T ss_pred CccCCCCC-CceeCCCceEecCC
Confidence 35655332 26999999998764
No 62
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=39.32 E-value=80 Score=37.04 Aligned_cols=52 Identities=40% Similarity=0.928 Sum_probs=33.5
Q ss_pred ecCCCceeCCCCCCcccCCccCCCCCCCCCCEEeeCCCCeEE-ecCCCceeCCCCCCc
Q psy18237 887 TCANGYTLNTARDSCVDIDECARHPNICNNGTCVNAIGSFKC-HCYAGFKLSHNNDCI 943 (1050)
Q Consensus 887 ~C~~Gy~g~~~~~~C~~ideC~~~~~~C~~g~C~n~~g~y~C-~C~~Gy~g~~~~~C~ 943 (1050)
.|..||.+......|+...+|.... |. +|.+... -.| .|..+|.+.....|.
T Consensus 3 ~C~~gy~~~~~~t~C~~~~~C~~~~--C~--~Cs~~~~-~~Ct~C~~~~~lt~t~~Ci 55 (397)
T PF03302_consen 3 ECTSGYKLSTDKTSCVSASECKTPN--CK--TCSNDKK-EVCTECNSGYYLTPTNQCI 55 (397)
T ss_pred cccCCceECCCCCcccccCCCCCCC--Cc--cccCCCC-CccCcCCCCCcCCCCCccc
Confidence 5778998887777888777777543 43 4544332 345 388888766533555
No 63
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=38.03 E-value=22 Score=27.90 Aligned_cols=31 Identities=23% Similarity=0.395 Sum_probs=20.9
Q ss_pred CeEecCCCceEEecCCCCcCCCCCCccccccCC
Q psy18237 822 GTCTNLNGGFECTCSEGYAPGPLGSCAILLTLP 854 (1050)
Q Consensus 822 g~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~~ 854 (1050)
..|....| +|.|+++|+|..+..|..+++..
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~C~~g~~~~ 41 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQCKPGYFGL 41 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-EE-TTEECS
T ss_pred CcccCCCC--EEeccccccCCcCcCCCCccccc
Confidence 46766544 89999999999766666555543
No 64
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=37.88 E-value=53 Score=26.13 Aligned_cols=20 Identities=30% Similarity=0.856 Sum_probs=15.0
Q ss_pred CCCC-CCeEEecCCCeEeecCCCCc
Q psy18237 1026 NICE-NGHCTNTFGSFMCSCQDGFK 1049 (1050)
Q Consensus 1026 ~~C~-~g~C~n~~gsy~C~C~~Gy~ 1049 (1050)
..|. ++.|++. +|+|++||+
T Consensus 26 ~qC~~~s~C~~g----~C~C~~g~~ 46 (52)
T PF01683_consen 26 EQCIGGSVCVNG----RCQCPPGYV 46 (52)
T ss_pred CCCCCcCEEcCC----EeECCCCCE
Confidence 4566 6788654 799999985
No 65
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=34.03 E-value=33 Score=32.13 Aligned_cols=30 Identities=27% Similarity=0.733 Sum_probs=22.4
Q ss_pred CCCCCCCCCC-CCCeEeecCCceeeeCCCCCe
Q psy18237 974 INECESPQAC-LYGNCTNTLGSFSCTCPPNYQ 1004 (1050)
Q Consensus 974 i~eC~~~~~C-~~g~C~~~~g~y~C~C~~Gy~ 1004 (1050)
.++|...+.| .+|.|.. .....|.|++||+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred ccCCCCccccCCccEeCC-CCCCceECCCCcC
Confidence 3566667788 7889954 3556799999997
No 66
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=31.27 E-value=71 Score=24.85 Aligned_cols=29 Identities=24% Similarity=0.413 Sum_probs=18.9
Q ss_pred eEecCCCceEEecCCCCcCCCCCCccccccC
Q psy18237 823 TCTNLNGGFECTCSEGYAPGPLGSCAILLTL 853 (1050)
Q Consensus 823 ~C~~~~g~~~C~C~~Gy~g~~~~~C~~~~~~ 853 (1050)
.|....| +|.|+++|+|..+..|..+++.
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCcCCCCcCC
Confidence 4544444 7999999999855555544443
No 67
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=27.37 E-value=50 Score=30.94 Aligned_cols=31 Identities=29% Similarity=0.748 Sum_probs=22.6
Q ss_pred CccCcCCCCCCCC-CeEeecCCceEEeCCCCCee
Q psy18237 308 IDECATSIQRCGE-GFCVNDVGTYHCVCPDGYML 340 (1050)
Q Consensus 308 ideC~~~~~~C~~-~~C~n~~Gsy~C~C~~G~~g 340 (1050)
.|+|.. .+.|.. |.|.. ..+-.|.|.+||..
T Consensus 77 ~d~Cd~-y~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 77 KDQCDV-YGFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred ccCCCC-ccccCCccEeCC-CCCCceECCCCcCC
Confidence 456654 467888 99943 44557999999975
No 68
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=24.96 E-value=2.4e+02 Score=36.33 Aligned_cols=38 Identities=26% Similarity=0.733 Sum_probs=25.3
Q ss_pred eEEEcCCCCccCCCCCCCcCCCccccCCCCC---CCCCCcee
Q psy18237 256 FSCRCEDGYSVKPAEGPACTDENECTMRTHN---CDDNADCI 294 (1050)
Q Consensus 256 y~C~C~~Gy~g~~~~~~~C~~ideC~~~~~~---C~~~~~C~ 294 (1050)
-+|+|..||.-+ .++..|+...+|...... |+..++|+
T Consensus 682 ~~C~C~~g~~p~-~~~~~C~~~~~C~~~~~gC~~C~~~g~C~ 722 (800)
T PTZ00214 682 RRCWCERGFLPA-LDRSGCVLPTECPPDMPSCAACDESGRCL 722 (800)
T ss_pred ceeEecCCcccc-cCCCccccccCCCcccccccccCCCCcee
Confidence 379999999976 567788777677654433 34444544
No 69
>KOG3516|consensus
Probab=22.02 E-value=64 Score=41.88 Aligned_cols=30 Identities=37% Similarity=1.008 Sum_probs=27.0
Q ss_pred CCCCCC-CeeecCCCCeEeeCC-CCCeeCCCCCCCc
Q psy18237 156 PNMCNH-GTCMNTPGSFHCQCN-RGFLYDSDTHQCI 189 (1050)
Q Consensus 156 ~~~C~~-~~C~~~~g~y~C~C~-~Gy~~~~~g~~C~ 189 (1050)
+|+|+| |.|..+...|.|.|. .||. |.+|.
T Consensus 550 PN~CehgG~C~Qs~~~f~C~C~~TGY~----GatCH 581 (1306)
T KOG3516|consen 550 PNPCEHGGKCSQSWDDFECNCELTGYK----GATCH 581 (1306)
T ss_pred CccccCCCcccccccceeEeccccccc----ccccc
Confidence 799996 899998889999999 9998 77887
No 70
>KOG4291|consensus
Probab=21.46 E-value=2.7e+02 Score=36.77 Aligned_cols=156 Identities=17% Similarity=0.278 Sum_probs=97.5
Q ss_pred eecCCCcccCCCCCCccccCCCCCCCCCCCCeeccCCCCcccC---CCCCc-cccccccCCCCCCC-C---CeeecCCCC
Q psy18237 99 CVCDEGFRGDGYSCEDIDECTDNTNYCDYILLCGSKPGEFMNP---MTNKT-EEIDECNLMPNMCN-H---GTCMNTPGS 170 (1050)
Q Consensus 99 C~C~~G~~g~~~~C~d~~eC~~~~~~C~~~~~C~n~~gs~~~~---~~g~~-~di~eC~~~~~~C~-~---~~C~~~~g~ 170 (1050)
|....|.++. ..+...+.+-.....|.-.+.|.+...++... +...+ ++++|+......+. + ...+...+.
T Consensus 369 c~~~~g~~~~-~~~~p~~~n~~~g~~v~d~~~C~~~~~a~~~~~e~~~~~~ct~~~~~~~~~~~~~~~~~g~~~~~~~~~ 447 (1043)
T KOG4291|consen 369 CTNDIGGTYP-MTCAPVCGNMLGGRTVNDCRICLDPICAVICGFEQREVATCTDVVRCRARCEQPALTDWGTKARQSDGG 447 (1043)
T ss_pred cccCcCCccc-eeecCCCCcccCCccccccccccCccccceecccccCCceeEecccceeeeccccccccccceeecCCc
Confidence 7777777766 46666666655555566666788888777632 22222 77777774444553 2 356666777
Q ss_pred eEeeCCCCCeeCCCCCCCccCCccCCCCCCCCCCcccccCCCCccccCCCccccCCCceeecccccccCCCCCCCCCcee
Q psy18237 171 FHCQCNRGFLYDSDTHQCIDINECEEMPEICGSGTYINECEEMPEICGSGTCENNIGSFSCRYINECEEMPEICGSGTCE 250 (1050)
Q Consensus 171 y~C~C~~Gy~~~~~g~~C~d~~eC~~~~~~C~~g~C~~~C~~g~~~c~~~~C~~~~g~~~c~~~~eC~~~~~~C~~~~C~ 250 (1050)
.+|.+..||.+++ -..+.+.+++..+ +..+.
T Consensus 448 ~q~~~~~G~~~~~-~~~~~~~~~~~~n-------------------------------------------s~~~~----- 478 (1043)
T KOG4291|consen 448 NQCFCFRGYIYDV-PPECEPVSECKTN-------------------------------------------SDACK----- 478 (1043)
T ss_pred ccceeccCccccc-Ccccccccccccc-------------------------------------------hhhcc-----
Confidence 8999999998764 2345555555431 11111
Q ss_pred cCCCCeEEEcCCCCccCCCCCCCcCCCccccCCCCCCCCCCceecCCCCCCCCCcccCccCcCCCCCCCC-CeEeecCCc
Q psy18237 251 NNIGSFSCRCEDGYSVKPAEGPACTDENECTMRTHNCDDNADCINNPVNKTGTRCVDIDECATSIQRCGE-GFCVNDVGT 329 (1050)
Q Consensus 251 ~~~g~y~C~C~~Gy~g~~~~~~~C~~ideC~~~~~~C~~~~~C~n~~g~~~g~~C~dideC~~~~~~C~~-~~C~n~~Gs 329 (1050)
+.+.+.|.+..|+..+- .+.. .+.+++.. -..++ +..+.+-+.
T Consensus 479 -~n~~~~~~~~~~~~~~~--------------------------------~~~~-~~r~~~~v--~~~~~~~~~~~~~~~ 522 (1043)
T KOG4291|consen 479 -KNGRWYCRNFEGFSITW--------------------------------QGDN-QVRMFDDV--TYGTQARIMISLYGY 522 (1043)
T ss_pred -CCceecccccccccccc--------------------------------cccc-cccccccc--cccccceeEeeeccc
Confidence 11567788888887751 1222 33334332 24666 889999999
Q ss_pred eEEeCCCCCee
Q psy18237 330 YHCVCPDGYML 340 (1050)
Q Consensus 330 y~C~C~~G~~g 340 (1050)
|+....+||..
T Consensus 523 ~~~~~~~~f~~ 533 (1043)
T KOG4291|consen 523 YEDKVRKKFRE 533 (1043)
T ss_pred eeeccccCCcc
Confidence 99999999965
Done!