Query psy5613
Match_columns 1010
No_of_seqs 553 out of 3653
Neff 8.4
Searched_HMMs 46136
Date Fri Aug 16 22:02:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5613.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5613hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.8 3.5E-19 7.5E-24 209.6 17.9 112 89-226 1180-1315(2531)
2 KOG4289|consensus 99.7 6.7E-16 1.5E-20 182.4 19.0 99 207-336 1216-1316(2531)
3 KOG1217|consensus 99.6 1.3E-14 2.9E-19 173.0 23.0 317 89-546 90-421 (487)
4 KOG1217|consensus 99.6 2.4E-14 5.2E-19 170.8 24.1 316 136-658 90-422 (487)
5 KOG1214|consensus 99.6 4.4E-14 9.4E-19 160.8 22.6 237 472-758 692-947 (1289)
6 KOG1214|consensus 99.5 1.2E-13 2.6E-18 157.3 16.3 278 50-376 639-947 (1289)
7 KOG0994|consensus 99.5 1.8E-12 3.9E-17 151.9 26.1 256 643-937 878-1165(1758)
8 KOG1219|consensus 99.3 2.1E-12 4.6E-17 158.6 7.8 111 243-383 3864-3976(4289)
9 KOG1219|consensus 99.2 9.3E-12 2E-16 153.1 8.2 111 89-226 3865-3976(4289)
10 KOG1225|consensus 99.2 3.3E-10 7.2E-15 129.8 16.3 212 425-764 152-365 (525)
11 KOG0994|consensus 99.1 2.2E-09 4.8E-14 126.6 17.3 114 206-336 878-1013(1758)
12 KOG1225|consensus 99.0 1E-09 2.2E-14 125.9 11.6 132 649-869 233-364 (525)
13 KOG4260|consensus 98.6 4.6E-08 9.9E-13 99.5 5.5 166 111-331 130-304 (350)
14 KOG1836|consensus 98.6 8.9E-05 1.9E-09 96.7 35.0 113 698-829 903-1025(1705)
15 KOG1836|consensus 98.5 0.00012 2.7E-09 95.4 35.2 209 693-933 781-1025(1705)
16 KOG4260|consensus 98.5 1E-07 2.2E-12 97.0 3.9 158 22-222 137-304 (350)
17 KOG1226|consensus 98.1 2E-05 4.3E-10 92.3 11.4 140 692-869 467-617 (783)
18 KOG1226|consensus 97.9 5.9E-05 1.3E-09 88.5 11.0 146 582-768 467-622 (783)
19 PF00008 EGF: EGF-like domain 97.7 1.6E-05 3.6E-10 55.8 2.1 31 91-121 1-32 (32)
20 PF07645 EGF_CA: Calcium-bindi 97.6 4.8E-05 1E-09 57.3 3.3 32 348-379 1-34 (42)
21 PF07645 EGF_CA: Calcium-bindi 97.6 3.4E-05 7.3E-10 58.2 2.4 32 242-273 1-34 (42)
22 PF00008 EGF: EGF-like domain 97.6 3.9E-05 8.4E-10 54.0 2.2 30 246-275 1-31 (32)
23 smart00179 EGF_CA Calcium-bind 97.5 0.00011 2.3E-09 54.4 3.8 36 242-277 1-38 (39)
24 smart00179 EGF_CA Calcium-bind 97.3 0.00028 6E-09 52.1 3.9 34 88-121 2-37 (39)
25 PF06247 Plasmod_Pvs28: Plasmo 97.3 7.5E-05 1.6E-09 73.2 0.6 143 95-276 7-163 (197)
26 cd00054 EGF_CA Calcium-binding 97.1 0.00064 1.4E-08 49.7 3.8 35 243-277 2-37 (38)
27 PF12947 EGF_3: EGF domain; I 96.9 0.00051 1.1E-08 49.5 1.7 29 844-872 7-35 (36)
28 PF12947 EGF_3: EGF domain; I 96.8 0.00062 1.3E-08 49.1 1.8 30 795-824 5-34 (36)
29 PF06247 Plasmod_Pvs28: Plasmo 96.8 0.00043 9.3E-09 68.0 1.2 140 203-381 10-162 (197)
30 cd00054 EGF_CA Calcium-binding 96.8 0.0015 3.1E-08 47.7 3.8 33 89-121 3-36 (38)
31 cd00053 EGF Epidermal growth f 96.3 0.0048 1E-07 44.2 3.8 28 93-120 5-32 (36)
32 cd00053 EGF Epidermal growth f 96.3 0.005 1.1E-07 44.1 3.6 28 248-275 5-32 (36)
33 smart00181 EGF Epidermal growt 96.2 0.006 1.3E-07 43.7 3.7 27 94-121 6-33 (35)
34 smart00181 EGF Epidermal growt 96.1 0.0066 1.4E-07 43.5 3.5 31 246-277 2-34 (35)
35 KOG1218|consensus 96.1 0.5 1.1E-05 52.9 20.4 193 492-762 14-209 (316)
36 PF07974 EGF_2: EGF-like domai 95.4 0.012 2.6E-07 41.2 2.4 23 40-62 7-31 (32)
37 KOG1218|consensus 95.2 3 6.4E-05 46.7 22.2 47 651-717 163-209 (316)
38 PF07974 EGF_2: EGF-like domai 94.8 0.037 8.1E-07 38.8 3.4 26 693-720 7-32 (32)
39 PF12662 cEGF: Complement Clr- 94.2 0.033 7.1E-07 36.1 1.9 22 369-390 1-24 (24)
40 PF12662 cEGF: Complement Clr- 94.1 0.045 9.7E-07 35.5 2.3 24 649-674 1-24 (24)
41 PF12661 hEGF: Human growth fa 92.9 0.059 1.3E-06 29.5 1.1 13 753-765 1-13 (13)
42 PF12661 hEGF: Human growth fa 92.8 0.062 1.3E-06 29.4 1.1 13 265-277 1-13 (13)
43 smart00051 DSL delta serrate l 90.9 0.27 5.8E-06 40.6 3.4 48 649-720 16-63 (63)
44 PF14670 FXa_inhibition: Coagu 90.1 0.16 3.5E-06 36.6 1.4 25 95-121 7-31 (36)
45 PF12946 EGF_MSP1_1: MSP1 EGF 88.8 0.36 7.8E-06 34.7 2.2 31 352-382 2-33 (37)
46 PF14670 FXa_inhibition: Coagu 87.8 0.3 6.5E-06 35.3 1.4 23 254-276 9-31 (36)
47 smart00051 DSL delta serrate l 83.4 1.7 3.6E-05 35.9 3.9 46 857-926 16-61 (63)
48 PF12946 EGF_MSP1_1: MSP1 EGF 82.4 0.67 1.5E-05 33.4 1.1 29 93-121 4-33 (37)
49 PF00053 Laminin_EGF: Laminin 82.0 0.97 2.1E-05 35.2 2.0 24 698-723 11-34 (49)
50 smart00180 EGF_Lam Laminin-typ 76.9 2.3 4.9E-05 32.7 2.6 23 699-723 12-34 (46)
51 KOG3512|consensus 75.6 20 0.00043 41.0 10.2 67 697-767 406-479 (592)
52 cd01475 vWA_Matrilin VWA_Matri 75.3 2.6 5.6E-05 44.7 3.5 36 240-275 184-219 (224)
53 cd01475 vWA_Matrilin VWA_Matri 74.9 2.6 5.7E-05 44.7 3.4 34 345-380 183-218 (224)
54 cd00055 EGF_Lam Laminin-type e 74.5 3.1 6.8E-05 32.5 2.9 26 39-64 2-33 (50)
55 cd00055 EGF_Lam Laminin-type e 73.5 3.6 7.9E-05 32.1 3.0 25 699-725 13-37 (50)
56 PF01683 EB: EB module; Inter 71.6 4.9 0.00011 31.6 3.4 22 424-445 27-48 (52)
57 PF00053 Laminin_EGF: Laminin 71.3 1.7 3.8E-05 33.7 0.7 28 39-66 1-34 (49)
58 PF01414 DSL: Delta serrate li 68.9 1.6 3.5E-05 36.0 0.0 48 649-720 16-63 (63)
59 PHA03099 epidermal growth fact 66.3 5.3 0.00012 37.2 2.8 30 692-722 51-82 (139)
60 KOG3512|consensus 62.7 24 0.00053 40.3 7.6 53 203-255 284-340 (592)
61 PHA02887 EGF-like protein; Pro 60.7 8 0.00017 35.5 2.8 31 900-934 92-124 (126)
62 PF01683 EB: EB module; Inter 59.7 14 0.0003 29.0 3.8 22 40-61 27-48 (52)
63 PF01414 DSL: Delta serrate li 57.5 5.3 0.00011 33.0 1.1 16 212-227 16-31 (63)
64 KOG3516|consensus 56.3 9.5 0.00021 48.3 3.4 42 240-281 542-584 (1306)
65 PHA02887 EGF-like protein; Pro 53.1 12 0.00025 34.5 2.6 29 692-721 92-122 (126)
66 PHA03099 epidermal growth fact 51.3 13 0.00028 34.7 2.6 32 900-935 51-84 (139)
67 PF00954 S_locus_glycop: S-loc 50.4 15 0.00033 34.0 3.1 32 88-120 77-109 (110)
68 KOG3514|consensus 44.2 15 0.00031 46.0 2.3 35 245-279 625-660 (1591)
69 smart00180 EGF_Lam Laminin-typ 40.3 26 0.00057 26.8 2.5 19 803-823 12-30 (46)
70 KOG3516|consensus 39.1 21 0.00044 45.5 2.6 36 88-126 545-581 (1306)
71 PF00954 S_locus_glycop: S-loc 38.0 30 0.00065 32.0 3.0 33 242-275 76-109 (110)
72 PF09064 Tme5_EGF_like: Thromb 29.6 35 0.00075 24.3 1.4 20 101-121 11-30 (34)
73 KOG3514|consensus 27.3 37 0.00081 42.7 2.1 34 90-126 625-659 (1591)
No 1
>KOG4289|consensus
Probab=99.81 E-value=3.5e-19 Score=209.59 Aligned_cols=112 Identities=31% Similarity=0.774 Sum_probs=89.7
Q ss_pred CCCCCCCCCCCCeee----------------------ecCCceeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCC
Q psy5613 89 NPCVPGTCGEGAICD----------------------VVNHAVMCTCPPGTTGSPFIQCKPIQNEPVYTNPCQPSPCGPN 146 (1010)
Q Consensus 89 ~~C~~~~C~~~~~C~----------------------~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~ 146 (1010)
+.|...||.|...|+ +..++++|+|||||+|+. |+ ..+|+|.+.||+++
T Consensus 1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~---Ce------TeiDlCYs~pC~nn 1250 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDY---CE------TEIDLCYSGPCGNN 1250 (2531)
T ss_pred chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCccc---cc------chhHhhhcCCCCCC
Confidence 345556777777772 344789999999999998 98 67899999999999
Q ss_pred CcceecCCCeeeecCCCCcCCCCCCCCCCccCCCCCCCCcccCCcccCCCCCCCCCCceEeec-CCCccccCCCC-CccC
Q psy5613 147 SQCREINHQAVCSCLPNYFGSPPGCRPECTVNSDCPLDRACQNQKCVDPCPGSCGYRARCQVY-NHNPVCSCPPG-YTGN 224 (1010)
Q Consensus 147 g~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~-~g~~~C~C~~G-y~g~ 224 (1010)
|+|....|+|+|+|.+||+|. .||.... ...|. |+.|.++++|++. .|.|.|.|+.| |++.
T Consensus 1251 g~C~srEggYtCeCrpg~tGe------hCEvs~~---agrCv--------pGvC~nggtC~~~~nggf~c~Cp~ge~e~p 1313 (2531)
T KOG4289|consen 1251 GRCRSREGGYTCECRPGFTGE------HCEVSAR---AGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYGEFEDP 1313 (2531)
T ss_pred CceEEecCceeEEecCCcccc------ceeeecc---cCccc--------cceecCCCEEeecCCCceeccCCCcccCCC
Confidence 999999999999999999999 8886532 12334 4578889999995 47899999998 4455
Q ss_pred CC
Q psy5613 225 PF 226 (1010)
Q Consensus 225 ~c 226 (1010)
.|
T Consensus 1314 rC 1315 (2531)
T KOG4289|consen 1314 RC 1315 (2531)
T ss_pred ce
Confidence 54
No 2
>KOG4289|consensus
Probab=99.68 E-value=6.7e-16 Score=182.45 Aligned_cols=99 Identities=34% Similarity=0.752 Sum_probs=82.7
Q ss_pred eecCCCccccCCCCCccCCCccccCCCCCCCCCCCCCCCCCCCCCCCCCeecccCCceeeeeCCCCccCCCCcCCCCCcc
Q psy5613 207 QVYNHNPVCSCPPGYTGNPFSQCLLPPTPTPTQATPTDPCFPSPCGSNARCRVQNEHALCECLPDYYGNPYEGCRPECLI 286 (1010)
Q Consensus 207 ~~~~g~~~C~C~~Gy~g~~c~~C~~~~~~~~~~~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~Gf~G~~c~~~~~eC~~ 286 (1010)
++..+.++|.||+||+|+.| ++.||+|.+.||.++++|....|+|+|.|.+||+|+.|+.+..
T Consensus 1216 i~pvnglrCrCPpGFTgd~C-------------eTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~---- 1278 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYC-------------ETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSAR---- 1278 (2531)
T ss_pred ccccCceeEeCCCCCCcccc-------------cchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecc----
Confidence 34567899999999999987 4789999999999999999999999999999999999986431
Q ss_pred CCCCCCccccccCCCCCCCCCCCCCCeeccCC-CCCceecCCCC-cccCCcc
Q psy5613 287 NSDCPLSLACIKNHCRDPCPGTCGVQAICSVS-NHIPICYCPAG-FTGDAFR 336 (1010)
Q Consensus 287 ~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~-~g~~~C~C~~G-y~G~~c~ 336 (1010)
...|. +|.|.++++|++. .|+|.|.|+.| |++..|+
T Consensus 1279 ------agrCv--------pGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1279 ------AGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred ------cCccc--------cceecCCCEEeecCCCceeccCCCcccCCCceE
Confidence 01222 5788889999976 68899999998 5566665
No 3
>KOG1217|consensus
Probab=99.63 E-value=1.3e-14 Score=172.99 Aligned_cols=317 Identities=26% Similarity=0.577 Sum_probs=220.5
Q ss_pred CCCCCCCCCCCCeeeecCCceeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCC--CCCCCcceecC---CCeeeecCCC
Q psy5613 89 NPCVPGTCGEGAICDVVNHAVMCTCPPGTTGSPFIQCKPIQNEPVYTNPCQPSP--CGPNSQCREIN---HQAVCSCLPN 163 (1010)
Q Consensus 89 ~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~--C~~~g~C~~~~---g~~~C~C~~G 163 (1010)
++|...+....+.+....++|.|.|++||.|.. |+.. .+|...+ +...+.|...+ ..|.|.|..|
T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~~---~~~~-------~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g 159 (487)
T KOG1217|consen 90 PPCRSPCLLLCGECVDCVGSYECTCPPGYQGTP---CEGE-------CECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEG 159 (487)
T ss_pred ccccCCcccCCccccCCCCCceeeCCCccccCc---CCcc-------eeecCCCCCeeCchhhcCCCCCCCceeeeeCCC
Confidence 444444555566777788899999999999987 5411 1465544 34457777643 5899999999
Q ss_pred CcCCCCCCCCCCccC-CCCCCCCcccCCcccCCCCCCCCCCceEeecCCCccccCCCCCccCCCccccCCCCCCCCCCCC
Q psy5613 164 YFGSPPGCRPECTVN-SDCPLDRACQNQKCVDPCPGSCGYRARCQVYNHNPVCSCPPGYTGNPFSQCLLPPTPTPTQATP 242 (1010)
Q Consensus 164 ~~g~~~~C~~~C~~~-~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~C~~~~~~~~~~~~~ 242 (1010)
|.+. .+... +.|... ...|.+.+.|.+..++|.|.|++||++..++ ..
T Consensus 160 ~~~~------~~~~~~~~C~~~------------~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~-------------~~ 208 (487)
T KOG1217|consen 160 YEGE------PCETDLDECIQY------------SSPCQNGGTCVNTGGSYLCSCPPGYTGSTCE-------------TT 208 (487)
T ss_pred cccc------cccccccccccC------------CCCcCCCcccccCCCCeeEeCCCCccCCcCc-------------CC
Confidence 9998 44433 233211 2346778899999999999999999999862 11
Q ss_pred CCCCCCCCCCCCCeecccCCceeeeeCCCCccCCCCcCCCCCccCCCCCCccccccCCCCCCCCCCCCCCeeccCCCCCc
Q psy5613 243 TDPCFPSPCGSNARCRVQNEHALCECLPDYYGNPYEGCRPECLINSDCPLSLACIKNHCRDPCPGTCGVQAICSVSNHIP 322 (1010)
Q Consensus 243 ~d~C~~~~C~~~~~C~~~~g~~~C~C~~Gf~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~ 322 (1010)
.+.+.|++. +.|.+.+||.+..+...+.+|.. + + ++|.+..++|
T Consensus 209 ---------~~~~~c~~~---~~~~~~~g~~~~~c~~~~~~~~~--------------------~---~-~~c~~~~~~~ 252 (487)
T KOG1217|consen 209 ---------GNGGTCVDS---VACSCPPGARGPECEVSIVECAS--------------------G---D-GTCVNTVGSY 252 (487)
T ss_pred ---------CCCceEecc---eeccCCCCCCCCCcccccccccC--------------------C---C-CcccccCCce
Confidence 455677766 78999999998887755444332 1 3 6888888899
Q ss_pred eecCCCCcccCCcccCCCCCCCCCCCCCCCCCCC-CCCCCeEeecCCceeeeecCccccccc-cccCCcccccCCccccc
Q psy5613 323 ICYCPAGFTGDAFRQCSPIPQREPEYRDPCSTTQ-CGLNAICTVINGAAQCACLLLLQHHIH-KNQDMDQYISLGYMLCH 400 (1010)
Q Consensus 323 ~C~C~~Gy~G~~c~~C~~i~~~~~~~~deC~~~~-C~~~~~C~n~~g~~~C~C~~G~~g~~~-~~~~~~~~~~~g~~~C~ 400 (1010)
+|.|++||++..+..+ .++++|+... |.++++|++..++|.|.|++||.|... .+.+..+|....
T Consensus 253 ~C~~~~g~~~~~~~~~--------~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~----- 319 (487)
T KOG1217|consen 253 TCRCPEGYTGDACVTC--------VDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRN----- 319 (487)
T ss_pred eeeCCCCcccccccee--------eeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccc-----
Confidence 9999999999974222 4669999764 999999999999999999999998764 222223331100
Q ss_pred cccccccccccccccccCCCCccCCCCCCccc------CceeecCCCceeCCcccCCCCCCCCCCCCCcchhccCCccCC
Q psy5613 401 MDILSSEYIQVYTVQPVIQEDTCNCVPNAECR------DGVCVCLPDYYGDGYVSCRPECVQNSDCPRNKACIRNKCKNP 474 (1010)
Q Consensus 401 ~~~~~~~~~~~~~~~p~~~~~~c~C~~~~~C~------~~~C~C~~G~~G~~~~~~~~~C~~~~~C~~~~~C~~~~C~~~ 474 (1010)
... .|.+++.|. .+.|.|.+||.|..|+.. .++
T Consensus 320 ------------~~~--------~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~---------------------~~~ 358 (487)
T KOG1217|consen 320 ------------AGG--------PCANGGTCNTLGSFGGFRCACGPGFTGRRCEDS---------------------NDE 358 (487)
T ss_pred ------------cCC--------cCCCCcccccCCCCCCCCcCCCCCCCCCccccC---------------------Ccc
Confidence 001 244555552 246999999998875411 025
Q ss_pred CCCCCCCCCCeeec-cCCceeeeCCCCCccCCCcccCCCCCCCCCCCCCcCCCCCCCCcccccCCCeeeecCC
Q psy5613 475 CVPGTCGEGAICDV-INHAVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPNSQCREVHKQAVCSCLP 546 (1010)
Q Consensus 475 C~~~~C~~~~~C~~-~~g~~~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~ 546 (1010)
|...++..++.|++ ..++|.|.|+.+|.+... .......++++|.. .+.|++..+++.|. .+
T Consensus 359 C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~----~~~~~~~~~~~c~~-----~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 359 CASSPCCPGGTCVNETPGSYRCACPAGFAGKAN----GDGVGCEDIDECSG-----CGDCVNGPGGGACT-PP 421 (487)
T ss_pred ccCCccccCCEeccCCCCCeEecCCCccccCCc----cccccccccccccC-----CcceeccCCCCccc-cC
Confidence 55666888999998 789999999999998410 00111235566654 56688888889999 77
No 4
>KOG1217|consensus
Probab=99.63 E-value=2.4e-14 Score=170.80 Aligned_cols=316 Identities=28% Similarity=0.627 Sum_probs=218.5
Q ss_pred CCCCCCCCCCCCcceecCCCeeeecCCCCcCCCCCCCCCCccCCCCCCCCcccCCcccCCCCCCCCCCceEeec---CCC
Q psy5613 136 NPCQPSPCGPNSQCREINHQAVCSCLPNYFGSPPGCRPECTVNSDCPLDRACQNQKCVDPCPGSCGYRARCQVY---NHN 212 (1010)
Q Consensus 136 d~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~---~g~ 212 (1010)
+.+...+....+.+.....+|.|.|++||.+. .|+....|..... .+...+.|.+. ...
T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~------~~~~~~~C~~~~~------------~~~~~~~c~~~~~~~~~ 151 (487)
T KOG1217|consen 90 PPCRSPCLLLCGECVDCVGSYECTCPPGYQGT------PCEGECECVTGPG------------VCCIDGSCSNGPGSVGP 151 (487)
T ss_pred ccccCCcccCCccccCCCCCceeeCCCccccC------cCCcceeecCCCC------------CeeCchhhcCCCCCCCc
Confidence 33444445556777788889999999999998 3332212222111 11224455553 357
Q ss_pred ccccCCCCCccCCCccccCCCCCCCCCCCCCCCCC--CCCCCCCCeecccCCceeeeeCCCCccCCCCcCCCCCccCCCC
Q psy5613 213 PVCSCPPGYTGNPFSQCLLPPTPTPTQATPTDPCF--PSPCGSNARCRVQNEHALCECLPDYYGNPYEGCRPECLINSDC 290 (1010)
Q Consensus 213 ~~C~C~~Gy~g~~c~~C~~~~~~~~~~~~~~d~C~--~~~C~~~~~C~~~~g~~~C~C~~Gf~G~~c~~~~~eC~~~~~C 290 (1010)
|.|+|..||.+..+. .+.++|. ..+|.+++.|.+..++|.|.|++||.|..++..
T Consensus 152 ~~c~C~~g~~~~~~~-------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------- 208 (487)
T KOG1217|consen 152 FRCSCTEGYEGEPCE-------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------- 208 (487)
T ss_pred eeeeeCCCccccccc-------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC----------
Confidence 999999999999863 3347898 456999999999999999999999999976532
Q ss_pred CCccccccCCCCCCCCCCCCCCeeccCCCCCceecCCCCcccCCcccCCCCCCCCCCCCCCCCCCCCCCCCeEeecCCce
Q psy5613 291 PLSLACIKNHCRDPCPGTCGVQAICSVSNHIPICYCPAGFTGDAFRQCSPIPQREPEYRDPCSTTQCGLNAICTVINGAA 370 (1010)
Q Consensus 291 ~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c~~C~~i~~~~~~~~deC~~~~C~~~~~C~n~~g~~ 370 (1010)
.+.+.|++. +.|.+..||.+..+. ..+.+|... . ++|+++.++|
T Consensus 209 -------------------~~~~~c~~~---~~~~~~~g~~~~~c~----------~~~~~~~~~---~-~~c~~~~~~~ 252 (487)
T KOG1217|consen 209 -------------------GNGGTCVDS---VACSCPPGARGPECE----------VSIVECASG---D-GTCVNTVGSY 252 (487)
T ss_pred -------------------CCCceEecc---eeccCCCCCCCCCcc----------cccccccCC---C-CcccccCCce
Confidence 123344444 689999999988875 455666654 4 8999999999
Q ss_pred eeeecCccccccc-cccCCcccccCCccccccccccccccccccccccCCCCccCCCCCCcccCceeecCCCceeCCccc
Q psy5613 371 QCACLLLLQHHIH-KNQDMDQYISLGYMLCHMDILSSEYIQVYTVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVS 449 (1010)
Q Consensus 371 ~C~C~~G~~g~~~-~~~~~~~~~~~g~~~C~~~~~~~~~~~~~~~~p~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~~~~ 449 (1010)
+|.|++||.+... .+.+
T Consensus 253 ~C~~~~g~~~~~~~~~~~-------------------------------------------------------------- 270 (487)
T KOG1217|consen 253 TCRCPEGYTGDACVTCVD-------------------------------------------------------------- 270 (487)
T ss_pred eeeCCCCccccccceeee--------------------------------------------------------------
Confidence 9998888765320 0000
Q ss_pred CCCCCCCCCCCCCcchhccCCccCCCCCC-CCCCCCeeeccCCceeeeCCCCCccCCCcccCCCCCCCCCCCCCc----C
Q psy5613 450 CRPECVQNSDCPRNKACIRNKCKNPCVPG-TCGEGAICDVINHAVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQ----P 524 (1010)
Q Consensus 450 ~~~~C~~~~~C~~~~~C~~~~C~~~C~~~-~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~----~ 524 (1010)
++.|... +|.++++|++..++|.|.|++||+|.. + ....+..+|. .
T Consensus 271 ----------------------~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~---~----~~~~~~~~C~~~~~~ 321 (487)
T KOG1217|consen 271 ----------------------VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRL---C----TECVDVDECSPRNAG 321 (487)
T ss_pred ----------------------ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCC---C----ccccccccccccccC
Confidence 2333333 277788999988889999999999997 4 1123456773 4
Q ss_pred CCCCCCCccc--ccCCCeeeecCCCccCCCCCCcCCCccCCCCCCCccccCCcccCCCCCCCCCCceeec-cCCCceeec
Q psy5613 525 SPCGPNSQCR--EVHKQAVCSCLPNYFGSPPNCRPECTVNSDCPLDKACFNQKCVDPCPGTCGQNANCRV-INHNPSCTC 601 (1010)
Q Consensus 525 ~~C~~~g~C~--~~~g~~~C~C~~G~~G~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~-~~~~~~C~C 601 (1010)
.+|.+++.|. ...+.+.|.|..+|.|. .|+ ... ..|.. ..+..++.|++ ..++|.|.|
T Consensus 322 ~~c~~g~~C~~~~~~~~~~C~c~~~~~g~--~C~----~~~-----~~C~~--------~~~~~~~~c~~~~~~~~~c~~ 382 (487)
T KOG1217|consen 322 GPCANGGTCNTLGSFGGFRCACGPGFTGR--RCE----DSN-----DECAS--------SPCCPGGTCVNETPGSYRCAC 382 (487)
T ss_pred CcCCCCcccccCCCCCCCCcCCCCCCCCC--ccc----cCC-----ccccC--------CccccCCEeccCCCCCeEecC
Confidence 5688888883 44457889999999988 543 321 01111 24677889998 688999999
Q ss_pred CCCCccC-C--ccccccCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCceeeCCCCc
Q psy5613 602 KAGFTGD-P--RVFCSRIPPPPPQESPPEYVNPCIPSPCGPYSQCRDINGSPSCSCLPNY 658 (1010)
Q Consensus 602 ~~Gy~G~-~--~~~C~~~~~~~~~~~~~~~id~C~~~~C~~~g~C~~~~g~y~C~C~~G~ 658 (1010)
+.+|.+. . ...+ .++++|.. .+.|++..++|.|. .+ +
T Consensus 383 ~~~~~~~~~~~~~~~-------------~~~~~c~~-----~~~c~~~~~~~~c~-~~-~ 422 (487)
T KOG1217|consen 383 PAGFAGKANGDGVGC-------------EDIDECSG-----CGDCVNGPGGGACT-PP-G 422 (487)
T ss_pred CCccccCCccccccc-------------cccccccC-----CcceeccCCCCccc-cC-c
Confidence 9999874 1 1113 35677754 56788889999999 88 5
No 5
>KOG1214|consensus
Probab=99.61 E-value=4.4e-14 Score=160.77 Aligned_cols=237 Identities=27% Similarity=0.559 Sum_probs=156.0
Q ss_pred cCCCC--CCCCCCCCeeeccCC-ceeeeCCCCCccCCCcccCCCCCCCCCCCCCc--CCCCCCCCcccccCCCeeeecCC
Q psy5613 472 KNPCV--PGTCGEGAICDVINH-AVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQ--PSPCGPNSQCREVHKQAVCSCLP 546 (1010)
Q Consensus 472 ~~~C~--~~~C~~~~~C~~~~g-~~~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~ 546 (1010)
+++|. ++.|..++.|....+ .|.|.|..||.|+.. .| .++++|+ ...|++++.|++.+++|+|.|..
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr-~c-------~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~ 763 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGR-NC-------VDENECATGFHRCGPNSVCINLPGSYRCECRS 763 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCCC-CC-------CChhhhccCCCCCCCCceeecCCCceeEEEee
Confidence 45554 455888888976544 699999999999863 35 5788998 45699999999999999999998
Q ss_pred Ccc--CCCCCCcCCCccCCCCCCCccccCCcccCCCCCCCCCCceee--ccC-CCceeecCCCCccCCccccccCCCCCC
Q psy5613 547 NYF--GSPPNCRPECTVNSDCPLDKACFNQKCVDPCPGTCGQNANCR--VIN-HNPSCTCKAGFTGDPRVFCSRIPPPPP 621 (1010)
Q Consensus 547 G~~--G~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~--~~~-~~~~C~C~~Gy~G~~~~~C~~~~~~~~ 621 (1010)
||. +++.+|.. ...=...+.|..+ .+.|...++++ ... ++|.|.|.+||.|++.. |.
T Consensus 764 gy~F~dd~~tCV~----i~~pap~n~Ce~g------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c~------- 825 (1289)
T KOG1214|consen 764 GYEFADDRHTCVL----ITPPAPANPCEDG------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-CT------- 825 (1289)
T ss_pred cceeccCCcceEE----ecCCCCCCccccC------ccccCcCCceEEEecCCceEEEeecCCccCCccc-cc-------
Confidence 875 44445531 1110111122221 25666555544 333 46999999999999865 53
Q ss_pred CCCCCCCCCCCCCCCCCCCCccccCCCCceeeCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeee
Q psy5613 622 QESPPEYVNPCIPSPCGPYSQCRDINGSPSCSCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCR 701 (1010)
Q Consensus 622 ~~~~~~~id~C~~~~C~~~g~C~~~~g~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~ 701 (1010)
++|+|.++-|..++.|.+++|+|.|.|.+||.|+++.|.+.=...+.|..... . +-.|+.++.|.
T Consensus 826 ------dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er~-------h--pl~chg~t~~~ 890 (1289)
T KOG1214|consen 826 ------DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQERF-------H--PLQCHGSTGFC 890 (1289)
T ss_pred ------cccccCccccCCCceEecCCCcceeecccCccCCCceecCCCccCCccccccc-------c--ceeecccccee
Confidence 78999999999999999999999999999999998766532122233332100 0 12465555443
Q ss_pred --eeCCcceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCCCcccC-------ceeecCC
Q psy5613 702 --VINHSPVCYCPDGFIGDAFSSCYPKPIEPIQAPEQQADPCICAPNAVCRD-------NVCVCLP 758 (1010)
Q Consensus 702 --~~~g~~~C~C~~G~~G~~C~~C~~~~~~~~~~~~c~~~~c~C~~~g~C~~-------~~C~C~~ 758 (1010)
....+|.+.|.++-.|+.-..|.+. +...+. .|..+|.+.. +.|+|..
T Consensus 891 ~~~Dp~~~e~p~~~~ppG~~~~~c~~~--~~~~vp-------~Cd~hgh~ap~qchG~~~~CwCvd 947 (1289)
T KOG1214|consen 891 WCVDPDGHEVPGTQTPPGSTPPHCGPS--PEQYVP-------QCDDHGHFAPLQCHGKSDFCWCVD 947 (1289)
T ss_pred EeeCCCcccCCCCCCCCCCCCCCCCCc--ccccCC-------CccccccccccccCCCcceeEEec
Confidence 2345689998888777654434322 111122 2555665542 5788876
No 6
>KOG1214|consensus
Probab=99.53 E-value=1.2e-13 Score=157.26 Aligned_cols=278 Identities=24% Similarity=0.511 Sum_probs=177.6
Q ss_pred cceecCCCCc--cCCCCcCC---------CCCccCCCCCCCcccccCC----ccCCCC--CCCCCCCCeeeecCC-ceee
Q psy5613 50 EVCVCLPDFY--GDGYVSCR---------PECVLNSDCPSNKACIRNK----CKNPCV--PGTCGEGAICDVVNH-AVMC 111 (1010)
Q Consensus 50 ~~C~C~~G~~--g~~~~~~~---------~eC~~~~~C~~~~~C~~~~----C~~~C~--~~~C~~~~~C~~~~g-~~~C 111 (1010)
++|.+.+-|. +.+.+..+ .|+.+-..+.+...++... =+++|. ++.|..++.|....+ .|+|
T Consensus 639 q~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~igpV~E~S~~~~~npCy~gsh~cdt~a~C~pg~~~~~tc 718 (1289)
T KOG1214|consen 639 QVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQIGPVKEDSDPTPVNPCYDGSHMCDTTARCHPGTGVDYTC 718 (1289)
T ss_pred EEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhcccceecCCCCcccccceecCcccCCCccccCCCCcceEE
Confidence 4699988886 43322111 2444433444433333211 256674 678999999998876 4999
Q ss_pred eCCCCCccCCCCCcccCCCCCCCCCCCCC--CCCCCCCcceecCCCeeeecCCCCc--CCCCCCCCCCccCCCCCCCCcc
Q psy5613 112 TCPPGTTGSPFIQCKPIQNEPVYTNPCQP--SPCGPNSQCREINHQAVCSCLPNYF--GSPPGCRPECTVNSDCPLDRAC 187 (1010)
Q Consensus 112 ~C~~G~~g~~~~~C~~~~~~~~~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~G~~--g~~~~C~~~C~~~~~C~~~~~C 187 (1010)
.|..||.|+... |. ++++|+. ..|+.+++|++.+++|+|+|..||. +++.+|+ ....=...++|
T Consensus 719 ecs~g~~gdgr~-c~-------d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV----~i~~pap~n~C 786 (1289)
T KOG1214|consen 719 ECSSGYQGDGRN-CV-------DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCV----LITPPAPANPC 786 (1289)
T ss_pred EEeeccCCCCCC-CC-------ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceE----EecCCCCCCcc
Confidence 999999988754 75 5688975 4599999999999999999999875 4444443 21111223344
Q ss_pred cCCcccCCCCCCCCCCc--eEeecC-CCccccCCCCCccCCCccccCCCCCCCCCCCCCCCCCCCCCCCCCeecccCCce
Q psy5613 188 QNQKCVDPCPGSCGYRA--RCQVYN-HNPVCSCPPGYTGNPFSQCLLPPTPTPTQATPTDPCFPSPCGSNARCRVQNEHA 264 (1010)
Q Consensus 188 ~~~~C~~~C~~~C~~~~--~C~~~~-g~~~C~C~~Gy~g~~c~~C~~~~~~~~~~~~~~d~C~~~~C~~~~~C~~~~g~~ 264 (1010)
..+ .+.|...+ .|+... ++|.|.|.+||+|+.-. +.|+|+|.++-|..+++|.+++++|
T Consensus 787 e~g------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~------------c~dvDeC~psrChp~A~Cyntpgsf 848 (1289)
T KOG1214|consen 787 EDG------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ------------CTDVDECSPSRCHPAATCYNTPGSF 848 (1289)
T ss_pred ccC------ccccCcCCceEEEecCCceEEEeecCCccCCccc------------cccccccCccccCCCceEecCCCcc
Confidence 443 23444444 455543 57999999999999742 3789999999999999999999999
Q ss_pred eeeeCCCCccCCCCcCCCCCccCCCCCCccccccCCCCCCCCCCCCCCe---eccCCCCCceecCCCCcccCCcccCCCC
Q psy5613 265 LCECLPDYYGNPYEGCRPECLINSDCPLSLACIKNHCRDPCPGTCGVQA---ICSVSNHIPICYCPAGFTGDAFRQCSPI 341 (1010)
Q Consensus 265 ~C~C~~Gf~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~c~~~C~~~~---~C~~~~g~~~C~C~~Gy~G~~c~~C~~i 341 (1010)
.|+|.+||+|+... ++..=.....| .... .. +-.|+.++ .|++. ..|.+.|.++-.|+.-.+|.++
T Consensus 849 sC~C~pGy~GDGf~-CVP~~~~~T~C------~~er-~h--pl~chg~t~~~~~~Dp-~~~e~p~~~~ppG~~~~~c~~~ 917 (1289)
T KOG1214|consen 849 SCRCQPGYYGDGFQ-CVPDTSSLTPC------EQER-FH--PLQCHGSTGFCWCVDP-DGHEVPGTQTPPGSTPPHCGPS 917 (1289)
T ss_pred eeecccCccCCCce-ecCCCccCCcc------cccc-cc--ceeeccccceeEeeCC-CcccCCCCCCCCCCCCCCCCCc
Confidence 99999999999754 22110111111 1100 00 22455444 33443 4689999998888877778776
Q ss_pred CCCCCCCCCCCCCCCCCCCCeEeec--CCc-eeeeecC
Q psy5613 342 PQREPEYRDPCSTTQCGLNAICTVI--NGA-AQCACLL 376 (1010)
Q Consensus 342 ~~~~~~~~deC~~~~C~~~~~C~n~--~g~-~~C~C~~ 376 (1010)
.+. .+-+| ..+|.+..+ .|+ +.|.|..
T Consensus 918 ~~~---~vp~C-----d~hgh~ap~qchG~~~~CwCvd 947 (1289)
T KOG1214|consen 918 PEQ---YVPQC-----DDHGHFAPLQCHGKSDFCWCVD 947 (1289)
T ss_pred ccc---cCCCc-----cccccccccccCCCcceeEEec
Confidence 442 12233 333334322 233 7788875
No 7
>KOG0994|consensus
Probab=99.53 E-value=1.8e-12 Score=151.91 Aligned_cols=256 Identities=25% Similarity=0.576 Sum_probs=130.2
Q ss_pred cccCCCCcee-eCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeee--eCCcceeeCCCCCccCC
Q psy5613 643 CRDINGSPSC-SCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRV--INHSPVCYCPDGFIGDA 719 (1010)
Q Consensus 643 C~~~~g~y~C-~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~--~~g~~~C~C~~G~~G~~ 719 (1010)
|.+...++.| .|..||.|++..-. +..|..-.|.+.=.+.=++...|.- ......|.|.+||+|.+
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg~-----------g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~R 946 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLGS-----------GIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSR 946 (1758)
T ss_pred ccccccccchhhhhccccCCcccCC-----------CCCCCCCCCCCCCccchhccccccccccccceeeecccCccccc
Confidence 4566778888 79999999853210 0011100010000000011123432 22345899999999999
Q ss_pred CccCCCCCCCCCCCCCCCCCCCCCCCCC------cccC--c------------ee-ecCCCccCCCCcccCCCCCCCCCC
Q psy5613 720 FSSCYPKPIEPIQAPEQQADPCICAPNA------VCRD--N------------VC-VCLPDYYGDGYTVCRPECVRNSDC 778 (1010)
Q Consensus 720 C~~C~~~~~~~~~~~~c~~~~c~C~~~g------~C~~--~------------~C-~C~~G~~G~~c~~~~~~C~~~~~C 778 (1010)
|+.|.+..+...+. --...+|.|++|- .|.. + .| .|++||+|+.-......|+ |
T Consensus 947 Ce~CA~~~fGnP~~-GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~----C 1021 (1758)
T KOG0994|consen 947 CEICADNHFGNPSE-GGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCV----C 1021 (1758)
T ss_pred hhhhcccccCCccc-CCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhhe----c
Confidence 99998776643222 1122333444441 2221 1 23 4777777764221100010 0
Q ss_pred CCCcccccCCccCCCCCCCCCCCCeeeecCCeeeeeCCCCCccCCCccccCCccCCCCCCCCCCCCCC--CCCceeecCC
Q psy5613 779 ANNKACIRNKCKNPCVPGTCGEGAICDVINHSVVCSCPPGTTGSPFIQCKPVIQEPVYTNPCQPSPCG--PNSQCREVNK 856 (1010)
Q Consensus 779 ~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~y~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~~~~C~--~~~~C~~~~g 856 (1010)
.. -+. .+.+.|+..+| +|-|.|...|..+.+|.+.-+.-..+..|.+-.|. .+-+|...+|
T Consensus 1022 n~--------------LGT-n~~~~CDr~tG--QCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG 1084 (1758)
T KOG0994|consen 1022 NF--------------LGT-NSTCHCDRFTG--QCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG 1084 (1758)
T ss_pred cc--------------ccc-CCccccccccC--cCCCCcccccccccccccchhccccCCCCCccCCCccCCcccccccc
Confidence 00 000 01245777777 99999999999766666654444445556554442 3346887777
Q ss_pred ceeeecCCCCcCCCCCCCCCCccCCCCCCcccccCCcccCCC--CCCCC-CCCe--eeecCCCcee-eCCCCCcCCCCCC
Q psy5613 857 QAVCSCLPNYFGSPPNCRPECTVNTDCPLDKACVNQKCVDPC--PGSCG-QNAN--CRVINHSPIC-TCRPGFTGEPRIR 930 (1010)
Q Consensus 857 ~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C--~~~C~-~~~~--C~~~~g~~~C-~C~~G~~G~~~~~ 930 (1010)
+|+|+|||-|. .|. +|+...+-.+.-.|..-.|-.-= ...|+ ..|. |...+++++| +|..||+|.-- .
T Consensus 1085 --QCqCkpGfGGR--~C~-qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCdqCaRgy~G~fP-~ 1158 (1758)
T KOG0994|consen 1085 --QCQCKPGFGGR--TCS-QCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCDQCARGYSGQFP-V 1158 (1758)
T ss_pred --ceeccCCCCCc--chh-HHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchhhhhhhhcCCCC-C
Confidence 89999999999 443 34444333333344332220000 11232 2233 3344556666 46666666522 4
Q ss_pred cccCCCC
Q psy5613 931 CSPIPRK 937 (1010)
Q Consensus 931 C~~~~~~ 937 (1010)
|.+-.++
T Consensus 1159 C~PCh~C 1165 (1758)
T KOG0994|consen 1159 CVPCHEC 1165 (1758)
T ss_pred CcchHHH
Confidence 5544433
No 8
>KOG1219|consensus
Probab=99.31 E-value=2.1e-12 Score=158.58 Aligned_cols=111 Identities=30% Similarity=0.598 Sum_probs=99.1
Q ss_pred CCCCCCCCCCCCCeecccC-CceeeeeCCCCccCCCCcCCCCCccCCCCCCccccccCCCCCCCCCCCCCCeeccCCCCC
Q psy5613 243 TDPCFPSPCGSNARCRVQN-EHALCECLPDYYGNPYEGCRPECLINSDCPLSLACIKNHCRDPCPGTCGVQAICSVSNHI 321 (1010)
Q Consensus 243 ~d~C~~~~C~~~~~C~~~~-g~~~C~C~~Gf~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~ 321 (1010)
.+.|..+||+++|+|+.++ ++|+|.|++-|.|..|+.++..|.. + +|..+++|+...++
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~s------------n--------PC~~GgtCip~~n~ 3923 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCAS------------N--------PCLTGGTCIPFYNG 3923 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccC------------C--------CCCCCCEEEecCCC
Confidence 4899999999999999887 7899999999999999987765543 3 56678999999999
Q ss_pred ceecCCCCcccCCcccCCCCCCCCCCC-CCCCCCCCCCCCCeEeecCCceeeeecCccccccc
Q psy5613 322 PICYCPAGFTGDAFRQCSPIPQREPEY-RDPCSTTQCGLNAICTVINGAAQCACLLLLQHHIH 383 (1010)
Q Consensus 322 ~~C~C~~Gy~G~~c~~C~~i~~~~~~~-~deC~~~~C~~~~~C~n~~g~~~C~C~~G~~g~~~ 383 (1010)
|.|.|+.||+|..|+ .+ ++||+.++|.++|.|+|..|+|.|.|-+||.|..+
T Consensus 3924 f~CnC~~gyTG~~Ce----------~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3924 FLCNCPNGYTGKRCE----------ARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred eeEeCCCCccCceee----------cccccccccccccCCceeeccCCceEeccChhHhcccC
Confidence 999999999999998 34 89999999999999999999999999999987543
No 9
>KOG1219|consensus
Probab=99.24 E-value=9.3e-12 Score=153.14 Aligned_cols=111 Identities=26% Similarity=0.724 Sum_probs=98.4
Q ss_pred CCCCCCCCCCCCeeeecC-CceeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCCCcceecCCCeeeecCCCCcCC
Q psy5613 89 NPCVPGTCGEGAICDVVN-HAVMCTCPPGTTGSPFIQCKPIQNEPVYTNPCQPSPCGPNSQCREINHQAVCSCLPNYFGS 167 (1010)
Q Consensus 89 ~~C~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~ 167 (1010)
++|..+||+++|+|+..+ |+|+|+|++-|+|+. || .++.+|+++||..+|+|+...++|.|.|+.||+|.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~---CE------i~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNH---CE------IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcc---cc------cccccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence 789899999999999975 569999999999999 98 67899999999999999999999999999999999
Q ss_pred CCCCCCCCccCCCCCCCCcccCCcccCCCCCCCCCCceEeecCCCccccCCCCCccCCC
Q psy5613 168 PPGCRPECTVNSDCPLDRACQNQKCVDPCPGSCGYRARCQVYNHNPVCSCPPGYTGNPF 226 (1010)
Q Consensus 168 ~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c 226 (1010)
+|+... .+.|. .+.|.++|.|+|..|+|.|.|-+||.|..|
T Consensus 3936 ------~Ce~~G----i~eCs--------~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3936 ------RCEARG----ISECS--------KNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred ------eeeccc----ccccc--------cccccCCceeeccCCceEeccChhHhcccC
Confidence 887651 12222 247888999999999999999999999986
No 10
>KOG1225|consensus
Probab=99.18 E-value=3.3e-10 Score=129.83 Aligned_cols=212 Identities=28% Similarity=0.728 Sum_probs=139.8
Q ss_pred CCCCCcccCceeecCCCceeCCcccCCCCCCCCCCCCCcchhccCCccCCCCCCCCCCCCeeeccCCceeeeCCCCCccC
Q psy5613 425 CVPNAECRDGVCVCLPDYYGDGYVSCRPECVQNSDCPRNKACIRNKCKNPCVPGTCGEGAICDVINHAVMCTCPPGTTGS 504 (1010)
Q Consensus 425 C~~~~~C~~~~C~C~~G~~G~~~~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~ 504 (1010)
+.....+.+++|.+.+++.+..+. +. .+ ..-+..++.+.+ +.+.+..+|++.
T Consensus 152 ~~~~~~~~~~~c~~~~~~~~~~~g-------~~------------~~-----~~~~~~hg~~~~----~~~l~~~~~s~~ 203 (525)
T KOG1225|consen 152 CLVRILCKNGVCSLKPNPFGAECG-------QY------------KC-----PNDGSGHGRYYF----GNCLSGISASGE 203 (525)
T ss_pred hcchhhhhcccccccCCccccccc-------ee------------cC-----CcCCCCCcccee----cccccccCcchh
Confidence 456677788899999998887432 11 00 111223333332 357888888877
Q ss_pred CCcccCCCCCCCCCCCCCcCC-CCC-CCCcccccCCCeeeecCCCccCCCCCCcCCCccCCCCCCCccccCCcccCCCCC
Q psy5613 505 PFIQCKPVQNEPVYTNPCQPS-PCG-PNSQCREVHKQAVCSCLPNYFGSPPNCRPECTVNSDCPLDKACFNQKCVDPCPG 582 (1010)
Q Consensus 505 ~~~~C~~~~~~~~~~~~C~~~-~C~-~~g~C~~~~g~~~C~C~~G~~G~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~ 582 (1010)
. +.... ..+.+..+ .+. ....|+...-.+.|.|+.+|+|. .|. . ..|++
T Consensus 204 ~---~~~~~----~~~~~~~~~r~~~~~~~~~~~~~~~ic~c~~~~~g~--~c~----~----------------~~C~~ 254 (525)
T KOG1225|consen 204 T---CNQLG----CNDDCFRTGRCREGRCFCTAGFFDGICECPEGYFGP--LCS----T----------------IYCPG 254 (525)
T ss_pred h---hhccc----CCccceeccccccCcccccccccCceeecCCceeCC--ccc----c----------------ccCCC
Confidence 5 42110 00111100 010 01123333334589999999998 432 1 12345
Q ss_pred CCCCCceeeccCCCceeecCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCceeeCCCCcccCC
Q psy5613 583 TCGQNANCRVINHNPSCTCKAGFTGDPRVFCSRIPPPPPQESPPEYVNPCIPSPCGPYSQCRDINGSPSCSCLPNYIGAP 662 (1010)
Q Consensus 583 ~C~~~~~C~~~~~~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~id~C~~~~C~~~g~C~~~~g~y~C~C~~G~~g~~ 662 (1010)
.|..++.|+ ..+|+|++||+|.+ |. +-.|... |+.++.+++. .|+|++||+|.
T Consensus 255 ~c~~~g~c~----~G~CIC~~Gf~G~d---C~--------------e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~- 307 (525)
T KOG1225|consen 255 GCTGRGQCV----EGRCICPPGFTGDD---CD--------------ELVCPVD-CSGGGVCVDG----ECICNPGYSGK- 307 (525)
T ss_pred CCcccceEe----CCeEeCCCCCcCCC---CC--------------cccCCcc-cCCCceecCC----EeecCCCcccc-
Confidence 666677887 56999999999997 85 3456443 8888888765 89999999999
Q ss_pred CCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCC
Q psy5613 663 PNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIGDAFSSCYPKPIEPIQAPEQQADPCI 742 (1010)
Q Consensus 663 ~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~~~~~~~~c~~~~c~ 742 (1010)
.|+. ..|+..|+++|.|+ .| +|.|.+||+|..|+. .+
T Consensus 308 -dCs~--------------------~~cpadC~g~G~Ci--~G--~C~C~~Gy~G~~C~~----------------~~-- 344 (525)
T KOG1225|consen 308 -DCSI--------------------RRCPADCSGHGKCI--DG--ECLCDEGYTGELCIQ----------------RA-- 344 (525)
T ss_pred -cccc--------------------ccCCccCCCCCccc--CC--ceEeCCCCcCCcccc----------------cc--
Confidence 6651 12356799999999 44 999999999999972 12
Q ss_pred CCCCCcccCceeecCCCccCCC
Q psy5613 743 CAPNAVCRDNVCVCLPDYYGDG 764 (1010)
Q Consensus 743 C~~~g~C~~~~C~C~~G~~G~~ 764 (1010)
|.+++.|+++ |.|..||.|..
T Consensus 345 C~~~g~cv~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 345 CSGGGQCVNG-CKCKKGWRGPD 365 (525)
T ss_pred cCCCceeccC-ceeccCccCCC
Confidence 8999999999 99999999987
No 11
>KOG0994|consensus
Probab=99.08 E-value=2.2e-09 Score=126.64 Aligned_cols=114 Identities=32% Similarity=0.720 Sum_probs=62.7
Q ss_pred EeecCCCcccc-CCCCCccCCCccccCCCCCCCCCCCCCCCCCCCCCCCCC--------eeccc--CCceeeeeCCCCcc
Q psy5613 206 CQVYNHNPVCS-CPPGYTGNPFSQCLLPPTPTPTQATPTDPCFPSPCGSNA--------RCRVQ--NEHALCECLPDYYG 274 (1010)
Q Consensus 206 C~~~~g~~~C~-C~~Gy~g~~c~~C~~~~~~~~~~~~~~d~C~~~~C~~~~--------~C~~~--~g~~~C~C~~Gf~G 274 (1010)
|.+..+++.|. |..||.|++- + -....|.+.||..+- .|.-. .....|.|.+||+|
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~----l---------g~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G 944 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPR----L---------GSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSG 944 (1758)
T ss_pred ccccccccchhhhhccccCCcc----c---------CCCCCCCCCCCCCCCccchhccccccccccccceeeecccCccc
Confidence 55666778886 9999999873 1 123456666665431 34322 24568999999999
Q ss_pred CCCCcCCCCCccCC--CCCCccccccCCCCCCC----CCCCCCC-e---eccCCCCCcee-cCCCCcccCCcc
Q psy5613 275 NPYEGCRPECLINS--DCPLSLACIKNHCRDPC----PGTCGVQ-A---ICSVSNHIPIC-YCPAGFTGDAFR 336 (1010)
Q Consensus 275 ~~c~~~~~eC~~~~--~C~~~~~C~~~~C~~~c----~~~C~~~-~---~C~~~~g~~~C-~C~~Gy~G~~c~ 336 (1010)
..|+.+.+ +. +-..+..|..-.|.+.- ++.|... | .|+-...+-+| .|.+||.|+.-.
T Consensus 945 ~RCe~CA~----~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~ 1013 (1758)
T KOG0994|consen 945 SRCEICAD----NHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALR 1013 (1758)
T ss_pred cchhhhcc----cccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHH
Confidence 99985332 11 00011222222222100 2223221 1 34444445577 699999988643
No 12
>KOG1225|consensus
Probab=99.03 E-value=1e-09 Score=125.88 Aligned_cols=132 Identities=33% Similarity=0.820 Sum_probs=103.2
Q ss_pred CceeeCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCccCCCccCCCCCC
Q psy5613 649 SPSCSCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIGDAFSSCYPKPI 728 (1010)
Q Consensus 649 ~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~ 728 (1010)
.+.|.|..+|+|. .|.. ..|++.|..++.|++. +|+|++||+|+.|+.
T Consensus 233 ~~ic~c~~~~~g~--~c~~--------------------~~C~~~c~~~g~c~~G----~CIC~~Gf~G~dC~e------ 280 (525)
T KOG1225|consen 233 DGICECPEGYFGP--LCST--------------------IYCPGGCTGRGQCVEG----RCICPPGFTGDDCDE------ 280 (525)
T ss_pred CceeecCCceeCC--cccc--------------------ccCCCCCcccceEeCC----eEeCCCCCcCCCCCc------
Confidence 3389999999998 5441 1234567777888854 999999999999972
Q ss_pred CCCCCCCCCCCCCCCCCCCcccCceeecCCCccCCCCcccCCCCCCCCCCCCCcccccCCccCCCCCCCCCCCCeeeecC
Q psy5613 729 EPIQAPEQQADPCICAPNAVCRDNVCVCLPDYYGDGYTVCRPECVRNSDCANNKACIRNKCKNPCVPGTCGEGAICDVIN 808 (1010)
Q Consensus 729 ~~~~~~~c~~~~c~C~~~g~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~ 808 (1010)
-.| |..|+.++.+++++|+|++||+|..|+. .+| +..|.++|.|+ .
T Consensus 281 -----~~C---p~~cs~~g~~~~g~CiC~~g~~G~dCs~-----------------------~~c-padC~g~G~Ci--~ 326 (525)
T KOG1225|consen 281 -----LVC---PVDCSGGGVCVDGECICNPGYSGKDCSI-----------------------RRC-PADCSGHGKCI--D 326 (525)
T ss_pred -----ccC---CcccCCCceecCCEeecCCCcccccccc-----------------------ccC-CccCCCCCccc--C
Confidence 112 2237889999999999999999998874 123 56788899999 4
Q ss_pred CeeeeeCCCCCccCCCccccCCccCCCCCCCCCCCCCCCCCceeecCCceeeecCCCCcCC
Q psy5613 809 HSVVCSCPPGTTGSPFIQCKPVIQEPVYTNPCQPSPCGPNSQCREVNKQAVCSCLPNYFGS 869 (1010)
Q Consensus 809 g~y~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 869 (1010)
| +|.|.+||+|.. |+. . +|.+++.|++. |+|..||.|.
T Consensus 327 G--~C~C~~Gy~G~~---C~~--------~-----~C~~~g~cv~g-----C~C~~Gw~G~ 364 (525)
T KOG1225|consen 327 G--ECLCDEGYTGEL---CIQ--------R-----ACSGGGQCVNG-----CKCKKGWRGP 364 (525)
T ss_pred C--ceEeCCCCcCCc---ccc--------c-----ccCCCceeccC-----ceeccCccCC
Confidence 4 999999999994 764 1 48888999864 9999999999
No 13
>KOG4260|consensus
Probab=98.60 E-value=4.6e-08 Score=99.50 Aligned_cols=166 Identities=25% Similarity=0.567 Sum_probs=106.4
Q ss_pred eeCCCCCccCCCCCcccCCCCCCCCCCC---CCCCCCCCCccee---cCCCeeeecCCCCcCCCCCCCCCCccCCCCCCC
Q psy5613 111 CTCPPGTTGSPFIQCKPIQNEPVYTNPC---QPSPCGPNSQCRE---INHQAVCSCLPNYFGSPPGCRPECTVNSDCPLD 184 (1010)
Q Consensus 111 C~C~~G~~g~~~~~C~~~~~~~~~~d~C---~~~~C~~~g~C~~---~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~ 184 (1010)
=-|++|..|++ |. .| +..||..+|.|.- ..|+..|.|.+||+|.. |. .|.....-+..
T Consensus 130 vCCp~gtyGpd---Cl----------~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~--C~-~Cg~eyfes~R 193 (350)
T KOG4260|consen 130 VCCPDGTYGPD---CL----------QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPL--CR-YCGIEYFESSR 193 (350)
T ss_pred eccCCCCcCCc---cc----------cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcc--cc-ccchHHHHhhc
Confidence 34999999998 64 34 3468999999984 45789999999999982 21 22211000000
Q ss_pred CcccCCcccCCCCCCCCCCceEeecCCCccc-cCCCCCccCCCccccCCCCCCCCCCCCCCCCC--CCCCCCCCeecccC
Q psy5613 185 RACQNQKCVDPCPGSCGYRARCQVYNHNPVC-SCPPGYTGNPFSQCLLPPTPTPTQATPTDPCF--PSPCGSNARCRVQN 261 (1010)
Q Consensus 185 ~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C-~C~~Gy~g~~c~~C~~~~~~~~~~~~~~d~C~--~~~C~~~~~C~~~~ 261 (1010)
+. .+..|. .|...|. +.|.-. ++-.| .|..||..+.- . ++|||||. +.||.....|+|+.
T Consensus 194 ne-~~lvCt-~Ch~~C~--~~Csg~-~~k~C~kCkkGW~lde~-g-----------CvDvnEC~~ep~~c~~~qfCvNte 256 (350)
T KOG4260|consen 194 NE-QHLVCT-ACHEGCL--GVCSGE-SSKGCSKCKKGWKLDEE-G-----------CVDVNECQNEPAPCKAHQFCVNTE 256 (350)
T ss_pred cc-ccchhh-hhhhhhh--cccCCC-CCCChhhhcccceeccc-c-----------cccHHHHhcCCCCCChhheeecCC
Confidence 00 001110 1111221 234322 22334 49999987742 2 48999998 68899999999999
Q ss_pred CceeeeeCCCCccCCCCcCCCCCccCCCCCCccccccCCCCCCCCCCCCCCeeccCCCCCceecCCCCcc
Q psy5613 262 EHALCECLPDYYGNPYEGCRPECLINSDCPLSLACIKNHCRDPCPGTCGVQAICSVSNHIPICYCPAGFT 331 (1010)
Q Consensus 262 g~~~C~C~~Gf~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~ 331 (1010)
|+|+|..++||.+. +|+|+.-.+ .|. ..+..|.|+.++|+|+|..|+.
T Consensus 257 GSf~C~dk~Gy~~g-----~d~C~~~~d-----~~~------------~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 257 GSFKCEDKEGYKKG-----VDECQFCAD-----VCA------------SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred CceEecccccccCC-----hHHhhhhhh-----hcc------------cCCCCcccCCccEEEEecccce
Confidence 99999999999873 456654110 011 1245778999999999999976
No 14
>KOG1836|consensus
Probab=98.56 E-value=8.9e-05 Score=96.68 Aligned_cols=113 Identities=23% Similarity=0.518 Sum_probs=67.6
Q ss_pred CeeeeeCCcceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCCC----cccC--ceeecCCCccCCCCcccCCC
Q psy5613 698 AQCRVINHSPVCYCPDGFIGDAFSSCYPKPIEPIQAPEQQADPCICAPNA----VCRD--NVCVCLPDYYGDGYTVCRPE 771 (1010)
Q Consensus 698 ~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~~~~~~~~c~~~~c~C~~~g----~C~~--~~C~C~~G~~G~~c~~~~~~ 771 (1010)
.+|....| .|.|.+.-.|..|..|.+..+.-..-.-| .++.|..-| .|.. +.|.|.+|-+|..|..+.+
T Consensus 903 ~~c~~~tG--Qcec~~~v~g~~c~~c~~g~fnl~s~~gC--~~c~c~~~gs~~~~c~~~tGqc~c~~gVtgqrc~qc~~- 977 (1705)
T KOG1836|consen 903 LTCNPVTG--QCECKPNVEGRDCLYCFKGFFNLNSGVGC--EPCNCDPTGSESSDCDVGTGQCYCRPGVTGQRCDQCET- 977 (1705)
T ss_pred ccCCCccc--ceeccCCCCccccccccccccccCCCCCc--ccccccccccccccccccCCceeeecCccccccCcccc-
Confidence 34666666 89999999999988777654432111111 222344333 5654 5899999999999875322
Q ss_pred CCCCCCCCCCcccccCCccCCCCCCCCCCCC----eeeecCCeeeeeCCCCCccCCCccccC
Q psy5613 772 CVRNSDCANNKACIRNKCKNPCVPGTCGEGA----ICDVINHSVVCSCPPGTTGSPFIQCKP 829 (1010)
Q Consensus 772 C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~----~C~~~~g~y~C~C~~G~~G~~~~~C~~ 829 (1010)
+..=... ..|..-.|...| .|+...| +|.|++||.|.....|++
T Consensus 978 ---~~~~~~~---------~gc~~c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~c~~c~~ 1025 (1705)
T KOG1836|consen 978 ---YHFGFQT---------EGCGLCECDPLGSRGFQCDPEDG--QCPCRPGFEGRRCDQCEE 1025 (1705)
T ss_pred ---Ccccccc---------cCCcceecccCCcccceecccCC--eeeecCCCCCcccccccC
Confidence 1100000 111122344444 6887777 999999999986444544
No 15
>KOG1836|consensus
Probab=98.54 E-value=0.00012 Score=95.43 Aligned_cols=209 Identities=24% Similarity=0.586 Sum_probs=106.4
Q ss_pred CCCCCCeeeee--CCcceee-CCCCCccCCCccCCCCCCCCCCCCC---CCCCCCCCCCC------CcccC--cee----
Q psy5613 693 SCGQGAQCRVI--NHSPVCY-CPDGFIGDAFSSCYPKPIEPIQAPE---QQADPCICAPN------AVCRD--NVC---- 754 (1010)
Q Consensus 693 ~C~~~~~C~~~--~g~~~C~-C~~G~~G~~C~~C~~~~~~~~~~~~---c~~~~c~C~~~------g~C~~--~~C---- 754 (1010)
+|.+++.|..+ .....|+ |++||+|.+|+.|....+....... -...++.|..+ +.|.. +.|
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci 860 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCI 860 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCccccccccccccceeecc
Confidence 45566666644 3567898 9999999999988654433211111 12222233321 33432 334
Q ss_pred ---------ecCCCccCCCCccc-CCCCCCCCCCCCCcccccCCccCCCCCCCCCCCCeeeecCCeeeeeCCCCCccCCC
Q psy5613 755 ---------VCLPDYYGDGYTVC-RPECVRNSDCANNKACIRNKCKNPCVPGTCGEGAICDVINHSVVCSCPPGTTGSPF 824 (1010)
Q Consensus 755 ---------~C~~G~~G~~c~~~-~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~y~C~C~~G~~G~~~ 824 (1010)
.|.+||.|+..... .+.| ..-.|.. .+.=....+|.+.+| +|.|.+.-.|...
T Consensus 861 ~nT~g~~cd~c~~g~~gd~l~~~p~~~c------------~~c~c~p---~gs~~~~~~c~~~tG--Qcec~~~v~g~~c 923 (1705)
T KOG1836|consen 861 HNTAGEYCDLCKEGYFGDPLAPNPEDKC------------FACGCVP---AGSELPSLTCNPVTG--QCECKPNVEGRDC 923 (1705)
T ss_pred CCcccccccccccCccccccCCCcCCcc------------ccccCcc---CCcccccccCCCccc--ceeccCCCCcccc
Confidence 45666666543310 1111 1101100 111111345777777 9999999998864
Q ss_pred ccccCCccCCCCCCCCCCCCCCCC----CceeecCCceeeecCCCCcCCCCCCCCCCccCCCCCCcccccCCcccCCCCC
Q psy5613 825 IQCKPVIQEPVYTNPCQPSPCGPN----SQCREVNKQAVCSCLPNYFGSPPNCRPECTVNTDCPLDKACVNQKCVDPCPG 900 (1010)
Q Consensus 825 ~~C~~~~~~~~~~~~C~~~~C~~~----~~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~ 900 (1010)
..|.+....-.....|...+|..- ..|....| +|.|.+|-+|. +|. .|.....-....-|.. -
T Consensus 924 ~~c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~~tG--qc~c~~gVtgq--rc~-qc~~~~~~~~~~gc~~--------c 990 (1705)
T KOG1836|consen 924 LYCFKGFFNLNSGVGCEPCNCDPTGSESSDCDVGTG--QCYCRPGVTGQ--RCD-QCETYHFGFQTEGCGL--------C 990 (1705)
T ss_pred ccccccccccCCCCCcccccccccccccccccccCC--ceeeecCcccc--ccC-ccccCcccccccCCcc--------e
Confidence 444433211111224554455322 36776666 89999999998 333 2322210000011111 1
Q ss_pred CCCCCC----eeeecCCCceeeCCCCCcCCCCCCccc
Q psy5613 901 SCGQNA----NCRVINHSPICTCRPGFTGEPRIRCSP 933 (1010)
Q Consensus 901 ~C~~~~----~C~~~~g~~~C~C~~G~~G~~~~~C~~ 933 (1010)
.|...| .|... +++|.|+++|.|.....|.+
T Consensus 991 ~c~~~Gs~~~qc~~~--~G~c~c~~~~~g~~c~~c~~ 1025 (1705)
T KOG1836|consen 991 ECDPLGSRGFQCDPE--DGQCPCRPGFEGRRCDQCEE 1025 (1705)
T ss_pred ecccCCcccceeccc--CCeeeecCCCCCcccccccC
Confidence 233333 56655 67889999998874444543
No 16
>KOG4260|consensus
Probab=98.47 E-value=1e-07 Score=96.98 Aligned_cols=158 Identities=22% Similarity=0.453 Sum_probs=98.6
Q ss_pred ccccccCCCCCCCCCCCCCCCCCCeecC-------cceecCCCCccCCCCcCCCCCccCCCCCCCcccccCCccCCCCCC
Q psy5613 22 FTYFCVNSVPPPVQQDTCNCVPNAVCKD-------EVCVCLPDFYGDGYVSCRPECVLNSDCPSNKACIRNKCKNPCVPG 94 (1010)
Q Consensus 22 ~~~~~~~~~~~~~~~~~c~C~~~~~C~~-------~~C~C~~G~~g~~~~~~~~eC~~~~~C~~~~~C~~~~C~~~C~~~ 94 (1010)
+|+.|.. +....+-+|..||.|.- +.|.|.+||+|..|..|.++=.....=...-.|. .|...
T Consensus 137 yGpdCl~----Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt--~Ch~~---- 206 (350)
T KOG4260|consen 137 YGPDCLQ----CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCT--ACHEG---- 206 (350)
T ss_pred cCCcccc----CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhh--hhhhh----
Confidence 4555544 44455567999999996 5799999999998764432100000000000111 11222
Q ss_pred CCCCCCeeeecCCceee-eCCCCCccCCCCCcccCCCCCCCCCCCC--CCCCCCCCcceecCCCeeeecCCCCcCCCCCC
Q psy5613 95 TCGEGAICDVVNHAVMC-TCPPGTTGSPFIQCKPIQNEPVYTNPCQ--PSPCGPNSQCREINHQAVCSCLPNYFGSPPGC 171 (1010)
Q Consensus 95 ~C~~~~~C~~~~g~~~C-~C~~G~~g~~~~~C~~~~~~~~~~d~C~--~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~C 171 (1010)
|. ++|.... +-.| .|..||..+... |. |||||+ +.||.....|+|+.|+|+|..++||.+...
T Consensus 207 -C~--~~Csg~~-~k~C~kCkkGW~lde~g-Cv-------DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d-- 272 (350)
T KOG4260|consen 207 -CL--GVCSGES-SKGCSKCKKGWKLDEEG-CV-------DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVD-- 272 (350)
T ss_pred -hh--cccCCCC-CCChhhhcccceecccc-cc-------cHHHHhcCCCCCChhheeecCCCceEecccccccCChH--
Confidence 22 1343322 2244 599999987543 74 668996 568999999999999999999999987522
Q ss_pred CCCCccCCCCCCCCcccCCcccCCCCCCCCCCceEeecCCCccccCCCCCc
Q psy5613 172 RPECTVNSDCPLDRACQNQKCVDPCPGSCGYRARCQVYNHNPVCSCPPGYT 222 (1010)
Q Consensus 172 ~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~ 222 (1010)
+|+.-. +.|. ..+..|.|+++.|+|+|..|+.
T Consensus 273 --~C~~~~-----d~~~------------~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 273 --ECQFCA-----DVCA------------SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred --Hhhhhh-----hhcc------------cCCCCcccCCccEEEEecccce
Confidence 333200 0111 1256788999999999999885
No 17
>KOG1226|consensus
Probab=98.06 E-value=2e-05 Score=92.34 Aligned_cols=140 Identities=24% Similarity=0.551 Sum_probs=95.3
Q ss_pred CCCCCCCeeeeeCCcceeeCCCCCccCCCccCCCCCCCCC-CCCCCCCCCC--CCCCCCcccCceeecCCCcc----CCC
Q psy5613 692 GSCGQGAQCRVINHSPVCYCPDGFIGDAFSSCYPKPIEPI-QAPEQQADPC--ICAPNAVCRDNVCVCLPDYY----GDG 764 (1010)
Q Consensus 692 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~~~~-~~~~c~~~~c--~C~~~g~C~~~~C~C~~G~~----G~~ 764 (1010)
..|+.+|+.+-. +|.|.+||.|+.|+ |........ ..+.|..... .|...|.|+=+.|+|.+... |..
T Consensus 467 ~~C~g~G~~~CG----~C~C~~G~~G~~CE-C~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~f 541 (783)
T KOG1226|consen 467 ALCHGNGTFVCG----QCRCDEGWLGKKCE-CSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKF 541 (783)
T ss_pred cccCCCCcEEec----ceecCCCCCCCccc-CCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeee
Confidence 356655555432 79999999999998 432221111 1234443322 59999999999999998877 888
Q ss_pred CcccCCCCCCCCCCCCCcccccCCccCCCCCCCCCCCCeeeecCCeeeeeCCCCCccCCCccccCCccCCCCCCCCCCC-
Q psy5613 765 YTVCRPECVRNSDCANNKACIRNKCKNPCVPGTCGEGAICDVINHSVVCSCPPGTTGSPFIQCKPVIQEPVYTNPCQPS- 843 (1010)
Q Consensus 765 c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~y~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~~~- 843 (1010)
|+.+.-.|.... ...|.++|+|.-. +|+|.+||+|.. |+-. .+.+.|...
T Consensus 542 CECDnfsC~r~~------------------g~lC~g~G~C~CG----~CvC~~GwtG~~---C~C~----~std~C~~~~ 592 (783)
T KOG1226|consen 542 CECDNFSCERHK------------------GVLCGGHGRCECG----RCVCNPGWTGSA---CNCP----LSTDTCESSD 592 (783)
T ss_pred eeccCccccccc------------------CcccCCCCeEeCC----cEEcCCCCccCC---CCCC----CCCccccCCC
Confidence 875433333210 4568889998776 999999999996 4432 245777543
Q ss_pred --CCCCCCceeecCCceeeecCCC-CcCC
Q psy5613 844 --PCGPNSQCREVNKQAVCSCLPN-YFGS 869 (1010)
Q Consensus 844 --~C~~~~~C~~~~g~~~C~C~~G-~~g~ 869 (1010)
-|...|+|.=. +|+|... |.|.
T Consensus 593 G~iCSGrG~C~Cg----~C~C~~~~~sG~ 617 (783)
T KOG1226|consen 593 GQICSGRGTCECG----RCKCTDPPYSGE 617 (783)
T ss_pred CceeCCCceeeCC----ceEcCCCCcCcc
Confidence 47777888755 5899776 9998
No 18
>KOG1226|consensus
Probab=97.89 E-value=5.9e-05 Score=88.55 Aligned_cols=146 Identities=25% Similarity=0.582 Sum_probs=95.4
Q ss_pred CCCCCCceeeccCCCceeecCCCCccCCccccccCCCCCCCCCCCCCCCCCCC----CCCCCCCccccCCCCceeeCCCC
Q psy5613 582 GTCGQNANCRVINHNPSCTCKAGFTGDPRVFCSRIPPPPPQESPPEYVNPCIP----SPCGPYSQCRDINGSPSCSCLPN 657 (1010)
Q Consensus 582 ~~C~~~~~C~~~~~~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~id~C~~----~~C~~~g~C~~~~g~y~C~C~~G 657 (1010)
..|+.+|+.+ =+.|.|.+||.|.. |+-... ........+.|.. .+|.+.|.|.=. +|+|.+.
T Consensus 467 ~~C~g~G~~~----CG~C~C~~G~~G~~---CEC~~~---~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~ 532 (783)
T KOG1226|consen 467 ALCHGNGTFV----CGQCRCDEGWLGKK---CECSTD---ELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKP 532 (783)
T ss_pred cccCCCCcEE----ecceecCCCCCCCc---ccCCcc---ccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCC
Confidence 4666556554 36899999999997 751110 0000012456652 279999988744 7999887
Q ss_pred cc----cCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCccCCCccCCCCCCCCCCC
Q psy5613 658 YI----GAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIGDAFSSCYPKPIEPIQA 733 (1010)
Q Consensus 658 ~~----g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~~~~~~ 733 (1010)
.. |. .|+ | |...|.... ...|..+|+|.-. +|+|.+||+|..|+ |. .+.
T Consensus 533 ~~~~i~G~--fCE--C-DnfsC~r~~-----------g~lC~g~G~C~CG----~CvC~~GwtG~~C~-C~------~st 585 (783)
T KOG1226|consen 533 DNGKIYGK--FCE--C-DNFSCERHK-----------GVLCGGHGRCECG----RCVCNPGWTGSACN-CP------LST 585 (783)
T ss_pred CCCceeee--eee--c-cCccccccc-----------CcccCCCCeEeCC----cEEcCCCCccCCCC-CC------CCC
Confidence 76 55 565 1 111222110 2358888888743 99999999999997 32 334
Q ss_pred CCCCCCC-CCCCCCCcccCceeecCCC-ccCCCCccc
Q psy5613 734 PEQQADP-CICAPNAVCRDNVCVCLPD-YYGDGYTVC 768 (1010)
Q Consensus 734 ~~c~~~~-c~C~~~g~C~~~~C~C~~G-~~G~~c~~~ 768 (1010)
+.|.... -+|...|+|.=++|.|... |+|..|+.+
T Consensus 586 d~C~~~~G~iCSGrG~C~Cg~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 586 DTCESSDGQICSGRGTCECGRCKCTDPPYSGEFCEKC 622 (783)
T ss_pred ccccCCCCceeCCCceeeCCceEcCCCCcCcchhhcC
Confidence 4443322 2588889999999999877 999999863
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.74 E-value=1.6e-05 Score=55.84 Aligned_cols=31 Identities=35% Similarity=0.897 Sum_probs=27.7
Q ss_pred CCCCCCCCCCeeeecC-CceeeeCCCCCccCC
Q psy5613 91 CVPGTCGEGAICDVVN-HAVMCTCPPGTTGSP 121 (1010)
Q Consensus 91 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~ 121 (1010)
|.++||+++|+|++.. ++|+|+|++||+|+.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 4567999999999998 899999999999973
No 20
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.63 E-value=4.8e-05 Score=57.35 Aligned_cols=32 Identities=25% Similarity=0.571 Sum_probs=28.9
Q ss_pred CCCCCCC--CCCCCCCeEeecCCceeeeecCccc
Q psy5613 348 YRDPCST--TQCGLNAICTVINGAAQCACLLLLQ 379 (1010)
Q Consensus 348 ~~deC~~--~~C~~~~~C~n~~g~~~C~C~~G~~ 379 (1010)
|||||+. +.|..+++|+|+.|+|+|.|++||+
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4699985 5798899999999999999999997
No 21
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.62 E-value=3.4e-05 Score=58.17 Aligned_cols=32 Identities=28% Similarity=0.639 Sum_probs=29.3
Q ss_pred CCCCCCC--CCCCCCCeecccCCceeeeeCCCCc
Q psy5613 242 PTDPCFP--SPCGSNARCRVQNEHALCECLPDYY 273 (1010)
Q Consensus 242 ~~d~C~~--~~C~~~~~C~~~~g~~~C~C~~Gf~ 273 (1010)
|||||+. +.|..+++|+|+.|+|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 5899984 5699899999999999999999998
No 22
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.60 E-value=3.9e-05 Score=53.96 Aligned_cols=30 Identities=33% Similarity=0.811 Sum_probs=27.7
Q ss_pred CCCCCCCCCCeecccC-CceeeeeCCCCccC
Q psy5613 246 CFPSPCGSNARCRVQN-EHALCECLPDYYGN 275 (1010)
Q Consensus 246 C~~~~C~~~~~C~~~~-g~~~C~C~~Gf~G~ 275 (1010)
|.++||.++|+|++.. ++|+|+|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5567999999999999 99999999999996
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.51 E-value=0.00011 Score=54.36 Aligned_cols=36 Identities=31% Similarity=0.706 Sum_probs=32.1
Q ss_pred CCCCCCC-CCCCCCCeecccCCceeeeeCCCCc-cCCC
Q psy5613 242 PTDPCFP-SPCGSNARCRVQNEHALCECLPDYY-GNPY 277 (1010)
Q Consensus 242 ~~d~C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~-G~~c 277 (1010)
++++|.. .+|.++++|+++.++|+|.|++||. |..|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 4688887 7999999999999999999999999 7755
No 24
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.31 E-value=0.00028 Score=52.13 Aligned_cols=34 Identities=32% Similarity=0.803 Sum_probs=29.4
Q ss_pred cCCCCC-CCCCCCCeeeecCCceeeeCCCCCc-cCC
Q psy5613 88 KNPCVP-GTCGEGAICDVVNHAVMCTCPPGTT-GSP 121 (1010)
Q Consensus 88 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~-g~~ 121 (1010)
+|+|.. .+|.++++|+++.|+|+|.|++||+ |..
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~ 37 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN 37 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCc
Confidence 356665 6899999999999999999999999 775
No 25
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.28 E-value=7.5e-05 Score=73.18 Aligned_cols=143 Identities=24% Similarity=0.591 Sum_probs=87.7
Q ss_pred CCCCCCeeeecCCceeeeCCCCCccCCCCCcccCCCCCCCCCCCC-----CCCCCCCCcceecC-----CCeeeecCCCC
Q psy5613 95 TCGEGAICDVVNHAVMCTCPPGTTGSPFIQCKPIQNEPVYTNPCQ-----PSPCGPNSQCREIN-----HQAVCSCLPNY 164 (1010)
Q Consensus 95 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~~d~C~-----~~~C~~~g~C~~~~-----g~~~C~C~~G~ 164 (1010)
.|.| |.-+.+.+-|.|.|.+||......+||.+ .+|. ..+|+..++|++.. ..|.|.|.+||
T Consensus 7 ~CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE~k-------v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY 78 (197)
T PF06247_consen 7 ICKN-GYLIQMSNHFECKCNEGFVLKNENTCEEK-------VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGY 78 (197)
T ss_dssp --BT-EEEEEESSEEEEEESTTEEEEETTEEEE-----------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTE
T ss_pred cccC-CEEEEccCceEEEcCCCcEEccccccccc-------eecCcccccCccccchhhhhcCCCcccceeEEEecccCc
Confidence 4554 57788888899999999998776679865 4553 35899999999866 48999999999
Q ss_pred cCCCCCCC-CCCccCCCCCCCCcccCCcccCCCCCCCCCCceEeecC---CCccccCCCCCccCCCccccCCCCCCCCCC
Q psy5613 165 FGSPPGCR-PECTVNSDCPLDRACQNQKCVDPCPGSCGYRARCQVYN---HNPVCSCPPGYTGNPFSQCLLPPTPTPTQA 240 (1010)
Q Consensus 165 ~g~~~~C~-~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~---g~~~C~C~~Gy~g~~c~~C~~~~~~~~~~~ 240 (1010)
+.....|+ .+|. +- .|+ .|+|+-.+ ....|+|.-|+.-+.-..|..
T Consensus 79 ~~~~~vCvp~~C~------------~~--------~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk--------- 128 (197)
T PF06247_consen 79 ILKQGVCVPNKCN------------NK--------DCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTK--------- 128 (197)
T ss_dssp EESSSSEEEGGGS------------S-----------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEE---------
T ss_pred eeeCCeEchhhcC------------ce--------ecC-CCeEEecCCCCCCceeEeeeceEeccCCcccC---------
Confidence 87743333 1222 21 344 67887533 345899999998222122210
Q ss_pred CCCCCCCCCCCCCCCeecccCCceeeeeCCCCccCC
Q psy5613 241 TPTDPCFPSPCGSNARCRVQNEHALCECLPDYYGNP 276 (1010)
Q Consensus 241 ~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~Gf~G~~ 276 (1010)
.-..+|+ --|..+..|....+-|+|++.+||.++.
T Consensus 129 ~G~T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 129 TGETKCS-LKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp EE---------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred CCcccee-eecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 0111232 2367788999999999999999998764
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.08 E-value=0.00064 Score=49.66 Aligned_cols=35 Identities=29% Similarity=0.700 Sum_probs=31.3
Q ss_pred CCCCCC-CCCCCCCeecccCCceeeeeCCCCccCCC
Q psy5613 243 TDPCFP-SPCGSNARCRVQNEHALCECLPDYYGNPY 277 (1010)
Q Consensus 243 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~G~~c 277 (1010)
+++|.. .+|.++++|++..++|+|.|++||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 677877 78999999999999999999999999765
No 27
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.89 E-value=0.00051 Score=49.50 Aligned_cols=29 Identities=31% Similarity=0.729 Sum_probs=23.6
Q ss_pred CCCCCCceeecCCceeeecCCCCcCCCCC
Q psy5613 844 PCGPNSQCREVNKQAVCSCLPNYFGSPPN 872 (1010)
Q Consensus 844 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~ 872 (1010)
.|+.+|+|+++.++|+|+|++||+|++..
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 68888999999999999999999999754
No 28
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.83 E-value=0.00062 Score=49.09 Aligned_cols=30 Identities=37% Similarity=0.820 Sum_probs=24.4
Q ss_pred CCCCCCCCeeeecCCeeeeeCCCCCccCCC
Q psy5613 795 PGTCGEGAICDVINHSVVCSCPPGTTGSPF 824 (1010)
Q Consensus 795 ~~~C~~~~~C~~~~g~y~C~C~~G~~G~~~ 824 (1010)
.+.|+.+|+|.++.++|+|+|++||+|++.
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 457889999999999999999999999973
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.83 E-value=0.00043 Score=67.98 Aligned_cols=140 Identities=24% Similarity=0.599 Sum_probs=83.5
Q ss_pred CceEeecCCCccccCCCCCccCCCccccCCCCCCCCCCCCCCCCCC-----CCCCCCCeecccC-----CceeeeeCCCC
Q psy5613 203 RARCQVYNHNPVCSCPPGYTGNPFSQCLLPPTPTPTQATPTDPCFP-----SPCGSNARCRVQN-----EHALCECLPDY 272 (1010)
Q Consensus 203 ~~~C~~~~g~~~C~C~~Gy~g~~c~~C~~~~~~~~~~~~~~d~C~~-----~~C~~~~~C~~~~-----g~~~C~C~~Gf 272 (1010)
+|.-+...+.|.|.|.+||....-..| +...+|.. .+|.+-++|++.. ..|+|.|.+||
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~EntC-----------E~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY 78 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKNENTC-----------EEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGY 78 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEETTEE-----------EE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTE
T ss_pred CCEEEEccCceEEEcCCCcEEcccccc-----------ccceecCcccccCccccchhhhhcCCCcccceeEEEecccCc
Confidence 466677778899999999986543334 45567763 5799999999876 57999999999
Q ss_pred ccCCCCcCCCCCccCCCCCCccccccCCCCCCCCCCCCCCeeccCC---CCCceecCCCCcccCCcccCCCCCCCCCCCC
Q psy5613 273 YGNPYEGCRPECLINSDCPLSLACIKNHCRDPCPGTCGVQAICSVS---NHIPICYCPAGFTGDAFRQCSPIPQREPEYR 349 (1010)
Q Consensus 273 ~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~---~g~~~C~C~~Gy~G~~c~~C~~i~~~~~~~~ 349 (1010)
+...- .|....|.+ -.|. .|.|+-. +....|+|.-|+.-+.-..|.-. -.
T Consensus 79 ~~~~~-----------------vCvp~~C~~---~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~------G~ 131 (197)
T PF06247_consen 79 ILKQG-----------------VCVPNKCNN---KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKT------GE 131 (197)
T ss_dssp EESSS-----------------SEEEGGGSS------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEE------E-
T ss_pred eeeCC-----------------eEchhhcCc---eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCC------Cc
Confidence 87641 233333332 2565 6788633 23459999999982221122211 11
Q ss_pred CCCCCCCCCCCCeEeecCCceeeeecCccccc
Q psy5613 350 DPCSTTQCGLNAICTVINGAAQCACLLLLQHH 381 (1010)
Q Consensus 350 deC~~~~C~~~~~C~n~~g~~~C~C~~G~~g~ 381 (1010)
.+|+ -.|..+..|..+.+-|+|.+.+||.++
T Consensus 132 T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~ 162 (197)
T PF06247_consen 132 TKCS-LKCKENEECKLVDGYYKCVCKEGFPGD 162 (197)
T ss_dssp --------TTTEEEEEETTEEEEEE-TT-EEE
T ss_pred ccee-eecCCCcceeeeCcEEEeecCCCCCCC
Confidence 2342 356778999999999999999999764
No 30
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.82 E-value=0.0015 Score=47.72 Aligned_cols=33 Identities=33% Similarity=0.864 Sum_probs=28.8
Q ss_pred CCCCC-CCCCCCCeeeecCCceeeeCCCCCccCC
Q psy5613 89 NPCVP-GTCGEGAICDVVNHAVMCTCPPGTTGSP 121 (1010)
Q Consensus 89 ~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 121 (1010)
++|.. .+|.+++.|+++.++|+|.|++||.|+.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCc
Confidence 56655 6898899999999999999999999976
No 31
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.34 E-value=0.0048 Score=44.19 Aligned_cols=28 Identities=36% Similarity=0.894 Sum_probs=25.7
Q ss_pred CCCCCCCCeeeecCCceeeeCCCCCccC
Q psy5613 93 PGTCGEGAICDVVNHAVMCTCPPGTTGS 120 (1010)
Q Consensus 93 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 120 (1010)
..+|.++++|+++.++|+|.|++||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4678889999999999999999999998
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.28 E-value=0.005 Score=44.10 Aligned_cols=28 Identities=29% Similarity=0.769 Sum_probs=25.9
Q ss_pred CCCCCCCCeecccCCceeeeeCCCCccC
Q psy5613 248 PSPCGSNARCRVQNEHALCECLPDYYGN 275 (1010)
Q Consensus 248 ~~~C~~~~~C~~~~g~~~C~C~~Gf~G~ 275 (1010)
..+|.++++|++..++|+|.|++||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5688889999999999999999999988
No 33
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.21 E-value=0.006 Score=43.75 Aligned_cols=27 Identities=41% Similarity=0.997 Sum_probs=24.3
Q ss_pred CCCCCCCeeeecCCceeeeCCCCCcc-CC
Q psy5613 94 GTCGEGAICDVVNHAVMCTCPPGTTG-SP 121 (1010)
Q Consensus 94 ~~C~~~~~C~~~~g~~~C~C~~G~~g-~~ 121 (1010)
.+|.++ +|+++.++|+|.|++||+| ..
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~ 33 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKR 33 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCc
Confidence 578888 9999999999999999999 54
No 34
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.11 E-value=0.0066 Score=43.53 Aligned_cols=31 Identities=29% Similarity=0.780 Sum_probs=26.2
Q ss_pred CCC-CCCCCCCeecccCCceeeeeCCCCcc-CCC
Q psy5613 246 CFP-SPCGSNARCRVQNEHALCECLPDYYG-NPY 277 (1010)
Q Consensus 246 C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~G-~~c 277 (1010)
|.. .+|.++ +|+++.++|+|.|++||.| ..|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 444 588888 9999999999999999999 543
No 35
>KOG1218|consensus
Probab=96.07 E-value=0.5 Score=52.94 Aligned_cols=193 Identities=30% Similarity=0.722 Sum_probs=96.3
Q ss_pred ceeeeCCCCCccCCCcccCCCCCCCCCCCCCcCCCCCCCCcccccCCCeeeecCCCccCCCCCCcCCCccCCCCCCCccc
Q psy5613 492 AVMCTCPPGTTGSPFIQCKPVQNEPVYTNPCQPSPCGPNSQCREVHKQAVCSCLPNYFGSPPNCRPECTVNSDCPLDKAC 571 (1010)
Q Consensus 492 ~~~C~C~~G~~G~~~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~~~C~~~C~~~~~C~~~~~C 571 (1010)
+..|.|.++|+|.. ++.. .. ....+. .++. + ......|.+..+|.+. .|+..+....
T Consensus 14 ~~~c~c~~~~~g~~--~~~~-~~---~~~~~~-~~~~----~--~~~~~~~~~~~~~~~~--~c~~~~~~~~-------- 70 (316)
T KOG1218|consen 14 SGQCFCDPGYTGRL--QCEH-QA---VTSACS-GICP----C--EVNSGECGLGYGFVGS--VCRIECVCGN-------- 70 (316)
T ss_pred CCceecCCCccccc--cccC-CC---CCcccc-ccCC----c--cCCceeEecccccCCC--ccccccccCC--------
Confidence 34799999999961 2331 10 011111 1111 1 3345788899999988 5543322221
Q ss_pred cCCcccCCCCCCCCCCceeeccCCCceeec-CCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCc
Q psy5613 572 FNQKCVDPCPGTCGQNANCRVINHNPSCTC-KAGFTGDPRVFCSRIPPPPPQESPPEYVNPCIPSPCGPYSQCRDINGSP 650 (1010)
Q Consensus 572 ~~~~C~~~C~~~C~~~~~C~~~~~~~~C~C-~~Gy~G~~~~~C~~~~~~~~~~~~~~~id~C~~~~C~~~g~C~~~~g~y 650 (1010)
.+..|+..+ .|..+.... .++..+ ..+|.|.. |+. +.+|... |.. .+|.+...
T Consensus 71 ~~~~c~~~~--~c~~~~~~~----~~~~~~~~~~~~g~~---C~~-------------~~~~~~~-c~~-~~C~~~~~-- 124 (316)
T KOG1218|consen 71 AGGGCSQPC--RCKNGGTCV----SSTGYCHLNGYEGPQ---CES-------------PCPCGDG-CAE-KTCANPRR-- 124 (316)
T ss_pred CCCcccCcc--ccCCCCccc----CCCCcccCCCCCccc---ccC-------------CCCcCCc-ccc-cccCCCcc--
Confidence 222233322 243433333 233344 57888776 752 3333221 333 45555443
Q ss_pred eeeCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCccCCCccCCCCCCCC
Q psy5613 651 SCSCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIGDAFSSCYPKPIEP 730 (1010)
Q Consensus 651 ~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~~~ 730 (1010)
.|.+..+|.+. .|..+-. ....|.. .|.....+....+ .|.|.+||.|..+..-
T Consensus 125 ~c~~~~~~~~~--~C~~~~~-----------~g~~C~~----~c~~~~~~~~~~~--~c~c~~g~~g~~~~~~------- 178 (316)
T KOG1218|consen 125 ECRCGGGYIGE--QCGEENL-----------VGLKCQR----DCQCTGGCDCKNG--ICTCQPGFVGVFCVES------- 178 (316)
T ss_pred ceecCCcCccc--cccccCC-----------CCCCccC----CCCCccccCCCCC--ceeccCCccccccccc-------
Confidence 57888888877 6653111 1112221 2222222332333 8899999999998620
Q ss_pred CCCCCCCCCCCCCCCCCcccC--ceeecCCCccC
Q psy5613 731 IQAPEQQADPCICAPNAVCRD--NVCVCLPDYYG 762 (1010)
Q Consensus 731 ~~~~~c~~~~c~C~~~g~C~~--~~C~C~~G~~G 762 (1010)
.. . ......+.+++.|+. ..+.+.+++.+
T Consensus 179 -~~-~-c~~~~~~~~g~~C~~~~~~~~~~~~~~~ 209 (316)
T KOG1218|consen 179 -CS-G-CSPLTACENGAKCNRSTGSCLCYPGPSG 209 (316)
T ss_pred -CC-C-cCCCcccCCCCeeeccccccccCCCCcc
Confidence 00 0 122334677778875 35666666654
No 36
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.43 E-value=0.012 Score=41.21 Aligned_cols=23 Identities=26% Similarity=0.634 Sum_probs=20.9
Q ss_pred CCCCCCeec--CcceecCCCCccCC
Q psy5613 40 NCVPNAVCK--DEVCVCLPDFYGDG 62 (1010)
Q Consensus 40 ~C~~~~~C~--~~~C~C~~G~~g~~ 62 (1010)
.|++||+|+ .++|+|++||+|+.
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPD 31 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCC
Confidence 599999999 58999999999984
No 37
>KOG1218|consensus
Probab=95.16 E-value=3 Score=46.65 Aligned_cols=47 Identities=32% Similarity=0.874 Sum_probs=29.4
Q ss_pred eeeCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCcc
Q psy5613 651 SCSCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIG 717 (1010)
Q Consensus 651 ~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G 717 (1010)
.|.|.+||.|. ++...+.. |. . ...+.+++.|+...+ .+.+.+++.+
T Consensus 163 ~c~c~~g~~g~--~~~~~~~~---c~-----------~--~~~~~~g~~C~~~~~--~~~~~~~~~~ 209 (316)
T KOG1218|consen 163 ICTCQPGFVGV--FCVESCSG---CS-----------P--LTACENGAKCNRSTG--SCLCYPGPSG 209 (316)
T ss_pred ceeccCCcccc--cccccCCC---cC-----------C--CcccCCCCeeecccc--ccccCCCCcc
Confidence 78999999999 66533211 11 1 135667778887765 5666666654
No 38
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.80 E-value=0.037 Score=38.76 Aligned_cols=26 Identities=23% Similarity=0.597 Sum_probs=22.5
Q ss_pred CCCCCCeeeeeCCcceeeCCCCCccCCC
Q psy5613 693 SCGQGAQCRVINHSPVCYCPDGFIGDAF 720 (1010)
Q Consensus 693 ~C~~~~~C~~~~g~~~C~C~~G~~G~~C 720 (1010)
.|+++|+|+...+ +|+|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~~g--~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCG--RCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence 5889999997744 9999999999875
No 39
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.23 E-value=0.033 Score=36.07 Aligned_cols=22 Identities=23% Similarity=0.379 Sum_probs=18.1
Q ss_pred ceeeeecCcccc--ccccccCCcc
Q psy5613 369 AAQCACLLLLQH--HIHKNQDMDQ 390 (1010)
Q Consensus 369 ~~~C~C~~G~~g--~~~~~~~~~~ 390 (1010)
||+|.|++||+. +...|.||+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 699999999985 5677888775
No 40
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.14 E-value=0.045 Score=35.46 Aligned_cols=24 Identities=33% Similarity=0.551 Sum_probs=17.5
Q ss_pred CceeeCCCCcccCCCCCccCCcCCCC
Q psy5613 649 SPSCSCLPNYIGAPPNCRPECVQNTE 674 (1010)
Q Consensus 649 ~y~C~C~~G~~g~~~~C~~~C~~~~~ 674 (1010)
||+|.|++||... .-...|+|++|
T Consensus 1 sy~C~C~~Gy~l~--~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLS--PDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCC--CCCCccccCCC
Confidence 6899999999976 22355666653
No 41
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=92.85 E-value=0.059 Score=29.50 Aligned_cols=13 Identities=31% Similarity=0.894 Sum_probs=10.4
Q ss_pred eeecCCCccCCCC
Q psy5613 753 VCVCLPDYYGDGY 765 (1010)
Q Consensus 753 ~C~C~~G~~G~~c 765 (1010)
+|+|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999875
No 42
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=92.79 E-value=0.062 Score=29.41 Aligned_cols=13 Identities=31% Similarity=0.904 Sum_probs=10.3
Q ss_pred eeeeCCCCccCCC
Q psy5613 265 LCECLPDYYGNPY 277 (1010)
Q Consensus 265 ~C~C~~Gf~G~~c 277 (1010)
+|+|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999864
No 43
>smart00051 DSL delta serrate ligand.
Probab=90.87 E-value=0.27 Score=40.57 Aligned_cols=48 Identities=23% Similarity=0.423 Sum_probs=34.2
Q ss_pred CceeeCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCccCCC
Q psy5613 649 SPSCSCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIGDAF 720 (1010)
Q Consensus 649 ~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C 720 (1010)
.++=.|.++|.|. .|...|...+ ....+.+|.. .| .++|.+||+|..|
T Consensus 16 ~~rv~C~~~~yG~--~C~~~C~~~~-------------------d~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGE--GCNKFCRPRD-------------------DFFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCC--ccCCEeCcCc-------------------cccCCccCCc-CC--CEecCCCCcCCCC
Confidence 3455899999999 8887665432 2345566754 34 8999999999875
No 44
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.10 E-value=0.16 Score=36.62 Aligned_cols=25 Identities=32% Similarity=0.777 Sum_probs=19.6
Q ss_pred CCCCCCeeeecCCceeeeCCCCCccCC
Q psy5613 95 TCGEGAICDVVNHAVMCTCPPGTTGSP 121 (1010)
Q Consensus 95 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 121 (1010)
.|++ .|++++++|+|.|++||+...
T Consensus 7 gC~h--~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 7 GCSH--ICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp GSSS--EEEEETTSEEEE-STTEEE-T
T ss_pred CcCC--CCccCCCceEeECCCCCEECc
Confidence 4554 899999999999999998764
No 45
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=88.78 E-value=0.36 Score=34.73 Aligned_cols=31 Identities=35% Similarity=0.687 Sum_probs=21.8
Q ss_pred CCCCCCCCCCeEeecC-CceeeeecCcccccc
Q psy5613 352 CSTTQCGLNAICTVIN-GAAQCACLLLLQHHI 382 (1010)
Q Consensus 352 C~~~~C~~~~~C~n~~-g~~~C~C~~G~~g~~ 382 (1010)
|....|..++.|++.. |+++|.|..||..+.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 4455788899999987 999999999997643
No 46
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=87.76 E-value=0.3 Score=35.26 Aligned_cols=23 Identities=22% Similarity=0.404 Sum_probs=18.7
Q ss_pred CCeecccCCceeeeeCCCCccCC
Q psy5613 254 NARCRVQNEHALCECLPDYYGNP 276 (1010)
Q Consensus 254 ~~~C~~~~g~~~C~C~~Gf~G~~ 276 (1010)
..+|++++++|+|.|++||+...
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE-T
T ss_pred CCCCccCCCceEeECCCCCEECc
Confidence 36899999999999999998764
No 47
>smart00051 DSL delta serrate ligand.
Probab=83.43 E-value=1.7 Score=35.92 Aligned_cols=46 Identities=24% Similarity=0.500 Sum_probs=31.0
Q ss_pred ceeeecCCCCcCCCCCCCCCCccCCCCCCcccccCCcccCCCCCCCCCCCeeeecCCCceeeCCCCCcCC
Q psy5613 857 QAVCSCLPNYFGSPPNCRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHSPICTCRPGFTGE 926 (1010)
Q Consensus 857 ~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~ 926 (1010)
.|+=.|.++|.|. .|...|... .....+.+|.. .+.++|.+||+|.
T Consensus 16 ~~rv~C~~~~yG~--~C~~~C~~~-------------------~d~~~~~~Cd~---~G~~~C~~Gw~G~ 61 (63)
T smart00051 16 QIRVTCDENYYGE--GCNKFCRPR-------------------DDFFGHYTCDE---NGNKGCLEGWMGP 61 (63)
T ss_pred EEEeeCCCCCcCC--ccCCEeCcC-------------------ccccCCccCCc---CCCEecCCCCcCC
Confidence 3456799999999 343333211 12456677854 4788999999998
No 48
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=82.41 E-value=0.67 Score=33.36 Aligned_cols=29 Identities=21% Similarity=0.320 Sum_probs=21.1
Q ss_pred CCCCCCCCeeeecC-CceeeeCCCCCccCC
Q psy5613 93 PGTCGEGAICDVVN-HAVMCTCPPGTTGSP 121 (1010)
Q Consensus 93 ~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~ 121 (1010)
...|..+|.|.+.. |++.|.|..||....
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 45678899999876 999999999998654
No 49
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=81.97 E-value=0.97 Score=35.16 Aligned_cols=24 Identities=25% Similarity=0.641 Sum_probs=18.8
Q ss_pred CeeeeeCCcceeeCCCCCccCCCccC
Q psy5613 698 AQCRVINHSPVCYCPDGFIGDAFSSC 723 (1010)
Q Consensus 698 ~~C~~~~g~~~C~C~~G~~G~~C~~C 723 (1010)
.+|....| +|.|+++|+|..|+.|
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~C 34 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQC 34 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-EE
T ss_pred CcccCCCC--EEeccccccCCcCcCC
Confidence 46777555 9999999999999843
No 50
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=76.91 E-value=2.3 Score=32.67 Aligned_cols=23 Identities=22% Similarity=0.563 Sum_probs=17.8
Q ss_pred eeeeeCCcceeeCCCCCccCCCccC
Q psy5613 699 QCRVINHSPVCYCPDGFIGDAFSSC 723 (1010)
Q Consensus 699 ~C~~~~g~~~C~C~~G~~G~~C~~C 723 (1010)
.|....| +|.|+++|+|..|+.|
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C 34 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDRC 34 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCcC
Confidence 4554445 9999999999999743
No 51
>KOG3512|consensus
Probab=75.64 E-value=20 Score=40.97 Aligned_cols=67 Identities=21% Similarity=0.425 Sum_probs=44.2
Q ss_pred CCeeeeeCCcceeeCCCCCccCCCccCCCCCCC-------CCCCCCCCCCCCCCCCCCcccCceeecCCCccCCCCcc
Q psy5613 697 GAQCRVINHSPVCYCPDGFIGDAFSSCYPKPIE-------PIQAPEQQADPCICAPNAVCRDNVCVCLPDYYGDGYTV 767 (1010)
Q Consensus 697 ~~~C~~~~g~~~C~C~~G~~G~~C~~C~~~~~~-------~~~~~~c~~~~c~C~~~g~C~~~~C~C~~G~~G~~c~~ 767 (1010)
+-+|..+.| +|.|.+|-+|..|..|.+.... ++.++.- .+-.+.++.+=.+..+.|++++.|..++.
T Consensus 406 gktCNq~tG--qCpCkeGvtG~tCnrCa~gyqqsrs~vapcik~p~~--~~~~~~s~ve~qd~~s~Ck~~~~~~r~n~ 479 (592)
T KOG3512|consen 406 GKTCNQTTG--QCPCKEGVTGLTCNRCAPGYQQSRSPVAPCIKIPTD--APTLGSSGVEPQDQCSKCKASPGGKRLNQ 479 (592)
T ss_pred cccccccCC--cccCCCCCcccccccccchhhcccCCCcCceecCCC--CccccCCCCcchhccccCCCCCcceeccc
Confidence 456877777 9999999999999988764321 1111111 11235556664556789999998887653
No 52
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=75.32 E-value=2.6 Score=44.68 Aligned_cols=36 Identities=19% Similarity=0.282 Sum_probs=28.2
Q ss_pred CCCCCCCCCCCCCCCCeecccCCceeeeeCCCCccC
Q psy5613 240 ATPTDPCFPSPCGSNARCRVQNEHALCECLPDYYGN 275 (1010)
Q Consensus 240 ~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~Gf~G~ 275 (1010)
+.++++|...+......|+++.|+|.|.|++||+..
T Consensus 184 C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred CcCchhhcCCCCCccceEEcCCCCEEeECCCCccCC
Confidence 467888974433234689999999999999999875
No 53
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=74.94 E-value=2.6 Score=44.66 Aligned_cols=34 Identities=26% Similarity=0.457 Sum_probs=27.3
Q ss_pred CCCCCCCCCC--CCCCCCCeEeecCCceeeeecCcccc
Q psy5613 345 EPEYRDPCST--TQCGLNAICTVINGAAQCACLLLLQH 380 (1010)
Q Consensus 345 ~~~~~deC~~--~~C~~~~~C~n~~g~~~C~C~~G~~g 380 (1010)
.|.++++|.. ++|. ..|.++.|+|.|.|.+||++
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence 4456788974 4454 58999999999999999976
No 54
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=74.52 E-value=3.1 Score=32.49 Aligned_cols=26 Identities=27% Similarity=0.740 Sum_probs=20.5
Q ss_pred CCCCCCCe----ecC--cceecCCCCccCCCC
Q psy5613 39 CNCVPNAV----CKD--EVCVCLPDFYGDGYV 64 (1010)
Q Consensus 39 c~C~~~~~----C~~--~~C~C~~G~~g~~~~ 64 (1010)
|.|.++|. |.. ++|.|.+||+|..|+
T Consensus 2 C~C~~~g~~~~~C~~~~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 2 CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCD 33 (50)
T ss_pred CcCcCCCCCCccccCCCCEEeCCCcCCCCCCC
Confidence 56666665 766 789999999999764
No 55
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=73.54 E-value=3.6 Score=32.12 Aligned_cols=25 Identities=28% Similarity=0.586 Sum_probs=19.1
Q ss_pred eeeeeCCcceeeCCCCCccCCCccCCC
Q psy5613 699 QCRVINHSPVCYCPDGFIGDAFSSCYP 725 (1010)
Q Consensus 699 ~C~~~~g~~~C~C~~G~~G~~C~~C~~ 725 (1010)
.|....| +|.|+++|+|..|+.|.+
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~~C~~ 37 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCDRCAP 37 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCCCCCC
Confidence 4655555 999999999999985443
No 56
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=71.61 E-value=4.9 Score=31.61 Aligned_cols=22 Identities=32% Similarity=0.901 Sum_probs=18.9
Q ss_pred CCCCCCcccCceeecCCCceeC
Q psy5613 424 NCVPNAECRDGVCVCLPDYYGD 445 (1010)
Q Consensus 424 ~C~~~~~C~~~~C~C~~G~~G~ 445 (1010)
.|..++.|++++|.|++||+-.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~~~ 48 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYVEV 48 (52)
T ss_pred CCCCcCEEcCCEeECCCCCEec
Confidence 4668899999999999999754
No 57
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=71.30 E-value=1.7 Score=33.74 Aligned_cols=28 Identities=36% Similarity=0.980 Sum_probs=20.6
Q ss_pred CCCCCCC----eecC--cceecCCCCccCCCCcC
Q psy5613 39 CNCVPNA----VCKD--EVCVCLPDFYGDGYVSC 66 (1010)
Q Consensus 39 c~C~~~~----~C~~--~~C~C~~G~~g~~~~~~ 66 (1010)
|.|..++ +|.. ++|.|.++|+|..|+.|
T Consensus 1 C~C~~~~~~~~~C~~~~G~C~C~~~~~G~~C~~C 34 (49)
T PF00053_consen 1 CDCNPHGSSSQTCDPSTGQCVCKPGTTGPRCDQC 34 (49)
T ss_dssp ESSTTCCBCCSSEEETCEEESBSTTEESTTS-EE
T ss_pred CcCcCCCCCCCcccCCCCEEeccccccCCcCcCC
Confidence 3455555 7776 78999999999987543
No 58
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=68.90 E-value=1.6 Score=36.01 Aligned_cols=48 Identities=25% Similarity=0.443 Sum_probs=21.8
Q ss_pred CceeeCCCCcccCCCCCccCCcCCCCCCCCccccCCccCCCCCCCCCCCCeeeeeCCcceeeCCCCCccCCC
Q psy5613 649 SPSCSCLPNYIGAPPNCRPECVQNTECPYDKACINEKCRDPCPGSCGQGAQCRVINHSPVCYCPDGFIGDAF 720 (1010)
Q Consensus 649 ~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C 720 (1010)
+++-.|.+.|.|. .|...|.+.+.= ..+-+|.. .| .=+|.+||+|..|
T Consensus 16 ~~rv~C~~nyyG~--~C~~~C~~~~d~-------------------~ghy~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGP--NCSKFCKPRDDS-------------------FGHYTCDS-NG--NKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETT--TT-EE---EEET-------------------TEEEEE-S-S----EEE-TTEESTTS
T ss_pred EEEEECCCCCCCc--cccCCcCCCcCC-------------------cCCcccCC-CC--CCCCCCCCcCCCC
Confidence 4577899999999 888777654310 12234542 34 4578999999876
No 59
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=66.34 E-value=5.3 Score=37.20 Aligned_cols=30 Identities=27% Similarity=0.620 Sum_probs=24.3
Q ss_pred CCCCCCCeeeee--CCcceeeCCCCCccCCCcc
Q psy5613 692 GSCGQGAQCRVI--NHSPVCYCPDGFIGDAFSS 722 (1010)
Q Consensus 692 ~~C~~~~~C~~~--~g~~~C~C~~G~~G~~C~~ 722 (1010)
+.|.+| +|.-. ...+.|.|..||+|.+|+.
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccccccc
Confidence 467775 89854 4679999999999999973
No 60
>KOG3512|consensus
Probab=62.67 E-value=24 Score=40.27 Aligned_cols=53 Identities=21% Similarity=0.470 Sum_probs=36.1
Q ss_pred CceEeecCCC-ccccCCCCCccCCCccccCCCCC---CCCCCCCCCCCCCCCCCCCC
Q psy5613 203 RARCQVYNHN-PVCSCPPGYTGNPFSQCLLPPTP---TPTQATPTDPCFPSPCGSNA 255 (1010)
Q Consensus 203 ~~~C~~~~g~-~~C~C~~Gy~g~~c~~C~~~~~~---~~~~~~~~d~C~~~~C~~~~ 255 (1010)
...|+-...+ ++|.|..+-+|..|..|..-+.. ...+-.++++|..+.|..++
T Consensus 284 As~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~ha 340 (592)
T KOG3512|consen 284 ASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHA 340 (592)
T ss_pred cceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhh
Confidence 3468876655 99999999999999888543311 12344567888877665443
No 61
>PHA02887 EGF-like protein; Provisional
Probab=60.73 E-value=8 Score=35.46 Aligned_cols=31 Identities=35% Similarity=0.847 Sum_probs=24.5
Q ss_pred CCCCCCCeeeecC--CCceeeCCCCCcCCCCCCcccC
Q psy5613 900 GSCGQNANCRVIN--HSPICTCRPGFTGEPRIRCSPI 934 (1010)
Q Consensus 900 ~~C~~~~~C~~~~--g~~~C~C~~G~~G~~~~~C~~~ 934 (1010)
+.|. +|+|.... ..+.|.|+.||+|. +|+..
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~v 124 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGI---RCDEV 124 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccC---CCCcc
Confidence 4677 46998664 57999999999999 88753
No 62
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=59.70 E-value=14 Score=29.03 Aligned_cols=22 Identities=27% Similarity=0.833 Sum_probs=19.1
Q ss_pred CCCCCCeecCcceecCCCCccC
Q psy5613 40 NCVPNAVCKDEVCVCLPDFYGD 61 (1010)
Q Consensus 40 ~C~~~~~C~~~~C~C~~G~~g~ 61 (1010)
.|..++.|++++|+|++||.-.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~~~ 48 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYVEV 48 (52)
T ss_pred CCCCcCEEcCCEeECCCCCEec
Confidence 4669999999999999998654
No 63
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=57.48 E-value=5.3 Score=32.98 Aligned_cols=16 Identities=25% Similarity=0.366 Sum_probs=7.5
Q ss_pred CccccCCCCCccCCCc
Q psy5613 212 NPVCSCPPGYTGNPFS 227 (1010)
Q Consensus 212 ~~~C~C~~Gy~g~~c~ 227 (1010)
+++-.|.+.|.|..|.
T Consensus 16 ~~rv~C~~nyyG~~C~ 31 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCS 31 (63)
T ss_dssp -------TTEETTTT-
T ss_pred EEEEECCCCCCCcccc
Confidence 4667899999999974
No 64
>KOG3516|consensus
Probab=56.30 E-value=9.5 Score=48.32 Aligned_cols=42 Identities=24% Similarity=0.636 Sum_probs=36.9
Q ss_pred CCCCCCCCCCCCCCCCeecccCCceeeeeC-CCCccCCCCcCC
Q psy5613 240 ATPTDPCFPSPCGSNARCRVQNEHALCECL-PDYYGNPYEGCR 281 (1010)
Q Consensus 240 ~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~-~Gf~G~~c~~~~ 281 (1010)
+.-+|.|.+++|.++|.|.-....|.|.|. .||+|..|...+
T Consensus 542 C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi 584 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSI 584 (1306)
T ss_pred cccccccCCccccCCCcccccccceeEeccccccccccccCCC
Confidence 355788889999999999999999999998 999999987544
No 65
>PHA02887 EGF-like protein; Provisional
Probab=53.09 E-value=12 Score=34.46 Aligned_cols=29 Identities=24% Similarity=0.584 Sum_probs=22.9
Q ss_pred CCCCCCCeeeee--CCcceeeCCCCCccCCCc
Q psy5613 692 GSCGQGAQCRVI--NHSPVCYCPDGFIGDAFS 721 (1010)
Q Consensus 692 ~~C~~~~~C~~~--~g~~~C~C~~G~~G~~C~ 721 (1010)
+.|-+ |+|.-. ...+.|.|++||+|.+|+
T Consensus 92 ~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 92 DFCIN-GECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred CEeeC-CEEEccccCCCceeECCCCcccCCCC
Confidence 45664 689744 456899999999999997
No 66
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=51.30 E-value=13 Score=34.74 Aligned_cols=32 Identities=31% Similarity=0.792 Sum_probs=26.0
Q ss_pred CCCCCCCeeeecC--CCceeeCCCCCcCCCCCCcccCC
Q psy5613 900 GSCGQNANCRVIN--HSPICTCRPGFTGEPRIRCSPIP 935 (1010)
Q Consensus 900 ~~C~~~~~C~~~~--g~~~C~C~~G~~G~~~~~C~~~~ 935 (1010)
+.|.+| +|.... ..+.|.|..||+|. +|+..+
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGe---RCEh~d 84 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGI---RCQHVV 84 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccc---ccccee
Confidence 467775 898764 68999999999999 998544
No 67
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=50.42 E-value=15 Score=33.99 Aligned_cols=32 Identities=34% Similarity=0.831 Sum_probs=24.7
Q ss_pred cCCCC-CCCCCCCCeeeecCCceeeeCCCCCccC
Q psy5613 88 KNPCV-PGTCGEGAICDVVNHAVMCTCPPGTTGS 120 (1010)
Q Consensus 88 ~~~C~-~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 120 (1010)
.|.|+ -+.|+.+|.|... .+..|.|++||+-+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSN-NSPKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCCC-CCCceECCCCcCCC
Confidence 46775 4789999999643 45689999999854
No 68
>KOG3514|consensus
Probab=44.15 E-value=15 Score=46.01 Aligned_cols=35 Identities=23% Similarity=0.676 Sum_probs=32.6
Q ss_pred CCCCCCCCCCCeecccCCceeeeeC-CCCccCCCCc
Q psy5613 245 PCFPSPCGSNARCRVQNEHALCECL-PDYYGNPYEG 279 (1010)
Q Consensus 245 ~C~~~~C~~~~~C~~~~g~~~C~C~-~Gf~G~~c~~ 279 (1010)
.|.++||.|+|+|...+.+|.|.|. .||.|..|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 6899999999999999999999995 6999999985
No 69
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=40.31 E-value=26 Score=26.77 Aligned_cols=19 Identities=37% Similarity=0.810 Sum_probs=15.2
Q ss_pred eeeecCCeeeeeCCCCCccCC
Q psy5613 803 ICDVINHSVVCSCPPGTTGSP 823 (1010)
Q Consensus 803 ~C~~~~g~y~C~C~~G~~G~~ 823 (1010)
.|+...| +|.|+++|+|..
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~ 30 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRR 30 (46)
T ss_pred cccCCCC--EEECCCCCCCCC
Confidence 4655566 999999999984
No 70
>KOG3516|consensus
Probab=39.08 E-value=21 Score=45.54 Aligned_cols=36 Identities=28% Similarity=0.840 Sum_probs=33.2
Q ss_pred cCCCCCCCCCCCCeeeecCCceeeeCC-CCCccCCCCCcc
Q psy5613 88 KNPCVPGTCGEGAICDVVNHAVMCTCP-PGTTGSPFIQCK 126 (1010)
Q Consensus 88 ~~~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~g~~~~~C~ 126 (1010)
+|+|.+++|+++|.|.-....|.|.|. .||+|.. |.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Gat---CH 581 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGAT---CH 581 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccccc---cc
Confidence 688999999999999998888999999 8999998 76
No 71
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=38.00 E-value=30 Score=32.00 Aligned_cols=33 Identities=36% Similarity=0.839 Sum_probs=25.7
Q ss_pred CCCCCC-CCCCCCCCeecccCCceeeeeCCCCccC
Q psy5613 242 PTDPCF-PSPCGSNARCRVQNEHALCECLPDYYGN 275 (1010)
Q Consensus 242 ~~d~C~-~~~C~~~~~C~~~~g~~~C~C~~Gf~G~ 275 (1010)
..|+|. ...|+..+.|.. .....|.|.+||.-.
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 456787 478999999964 456679999999753
No 72
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=29.55 E-value=35 Score=24.27 Aligned_cols=20 Identities=30% Similarity=0.549 Sum_probs=14.1
Q ss_pred eeeecCCceeeeCCCCCccCC
Q psy5613 101 ICDVVNHAVMCTCPPGTTGSP 121 (1010)
Q Consensus 101 ~C~~~~g~~~C~C~~G~~g~~ 121 (1010)
.|++.. .++|.||.||..+.
T Consensus 11 ~CDpn~-~~~C~CPeGyIlde 30 (34)
T PF09064_consen 11 DCDPNS-PGQCFCPEGYILDE 30 (34)
T ss_pred ccCCCC-CCceeCCCceEecC
Confidence 555433 24999999998765
No 73
>KOG3514|consensus
Probab=27.34 E-value=37 Score=42.66 Aligned_cols=34 Identities=26% Similarity=0.745 Sum_probs=31.0
Q ss_pred CCCCCCCCCCCeeeecCCceeeeCCC-CCccCCCCCcc
Q psy5613 90 PCVPGTCGEGAICDVVNHAVMCTCPP-GTTGSPFIQCK 126 (1010)
Q Consensus 90 ~C~~~~C~~~~~C~~~~g~~~C~C~~-G~~g~~~~~C~ 126 (1010)
.|.++||+|+|.|......|.|.|.. ||.|+. |+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~---Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRT---CE 659 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcc---cc
Confidence 68899999999999999999999975 899997 87
Done!