Query psy826
Match_columns 329
No_of_seqs 366 out of 2268
Neff 8.3
Searched_HMMs 46136
Date Fri Aug 16 22:45:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy826.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/826hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4260|consensus 100.0 1.7E-61 3.6E-66 418.2 7.2 272 2-284 42-317 (350)
2 KOG1214|consensus 99.2 8.6E-12 1.9E-16 123.4 6.5 127 93-256 720-866 (1289)
3 PF11938 DUF3456: TLR4 regulat 98.9 1.6E-10 3.4E-15 96.4 -1.2 88 2-93 16-151 (151)
4 KOG1219|consensus 98.9 1.3E-09 2.8E-14 116.8 3.6 101 109-249 3869-3974(4289)
5 KOG4260|consensus 98.7 1.3E-08 2.8E-13 89.6 3.6 107 132-264 132-244 (350)
6 KOG0994|consensus 98.5 4E-07 8.6E-12 93.6 9.0 188 47-255 742-966 (1758)
7 KOG4289|consensus 98.3 7.6E-07 1.7E-11 93.1 5.8 121 28-185 1168-1308(2531)
8 KOG1225|consensus 98.3 2E-06 4.3E-11 84.3 7.5 40 93-142 238-279 (525)
9 KOG0994|consensus 98.3 1.9E-06 4.2E-11 88.7 7.6 150 93-249 834-1010(1758)
10 KOG1214|consensus 98.1 4.9E-06 1.1E-10 83.5 5.9 118 109-262 699-833 (1289)
11 KOG1217|consensus 98.1 2.5E-05 5.5E-10 76.2 10.7 123 92-222 155-296 (487)
12 KOG1225|consensus 98.0 1.1E-05 2.4E-10 79.1 6.9 96 90-250 266-365 (525)
13 PF14843 GF_recep_IV: Growth f 98.0 4.1E-06 8.8E-11 68.2 2.2 104 125-240 14-132 (132)
14 PF07645 EGF_CA: Calcium-bindi 97.9 4.6E-06 1E-10 53.8 1.8 34 196-248 1-35 (42)
15 PTZ00214 high cysteine membran 97.7 0.00027 5.8E-09 73.5 11.5 62 126-195 590-665 (800)
16 KOG4289|consensus 97.7 3.6E-05 7.7E-10 81.2 4.8 92 125-254 1219-1316(2531)
17 KOG1836|consensus 97.6 0.0002 4.2E-09 79.2 9.6 96 91-192 697-812 (1705)
18 KOG1219|consensus 97.6 5.2E-05 1.1E-09 83.0 4.5 81 93-190 3890-3975(4289)
19 PF03302 VSP: Giardia variant- 97.5 0.00032 7E-09 67.6 8.3 33 218-250 104-136 (397)
20 PTZ00214 high cysteine membran 97.5 0.00041 8.9E-09 72.2 9.1 106 140-257 560-668 (800)
21 PF14843 GF_recep_IV: Growth f 97.5 7.4E-05 1.6E-09 60.8 2.9 85 163-260 2-94 (132)
22 KOG1836|consensus 97.3 0.00085 1.9E-08 74.3 9.1 92 92-190 760-879 (1705)
23 PF03302 VSP: Giardia variant- 97.3 0.0024 5.1E-08 61.7 11.0 106 140-256 92-224 (397)
24 PF12947 EGF_3: EGF domain; I 97.1 0.0002 4.3E-09 44.4 1.0 31 204-253 6-36 (36)
25 cd00064 FU Furin-like repeats. 97.0 0.00054 1.2E-08 45.5 2.7 38 223-260 1-38 (49)
26 PF12662 cEGF: Complement Clr- 97.0 0.00042 9.2E-09 38.7 1.4 23 176-199 1-24 (24)
27 smart00261 FU Furin-like repea 96.9 0.00085 1.8E-08 43.9 3.0 39 219-257 2-40 (46)
28 KOG1217|consensus 96.9 0.0058 1.3E-07 59.5 9.8 133 92-260 255-404 (487)
29 PF07645 EGF_CA: Calcium-bindi 96.6 0.0009 2E-08 43.0 0.9 31 104-137 4-34 (42)
30 smart00179 EGF_CA Calcium-bind 96.5 0.0031 6.7E-08 39.2 3.0 33 196-247 1-33 (39)
31 PF07974 EGF_2: EGF-like domai 96.4 0.0043 9.3E-08 37.3 3.2 27 110-141 6-32 (32)
32 PF00008 EGF: EGF-like domain 96.4 0.00082 1.8E-08 40.5 0.0 28 109-139 3-31 (32)
33 PF12947 EGF_3: EGF domain; I 96.3 0.0011 2.3E-08 41.1 0.2 29 108-139 4-32 (36)
34 smart00179 EGF_CA Calcium-bind 96.3 0.0052 1.1E-07 38.1 3.4 29 110-141 9-38 (39)
35 KOG1226|consensus 96.3 0.017 3.6E-07 58.7 8.4 87 91-202 480-590 (783)
36 KOG4052|consensus 96.3 0.00015 3.3E-09 59.4 -4.9 58 26-87 76-185 (190)
37 smart00051 DSL delta serrate l 95.9 0.0091 2E-07 41.9 3.4 44 91-141 19-63 (63)
38 KOG1025|consensus 95.9 0.0064 1.4E-07 62.4 3.6 83 163-258 489-572 (1177)
39 cd00055 EGF_Lam Laminin-type e 95.8 0.012 2.5E-07 39.3 3.5 27 125-151 16-42 (50)
40 KOG0196|consensus 95.8 0.012 2.5E-07 60.2 4.7 60 126-190 257-320 (996)
41 cd00053 EGF Epidermal growth f 95.8 0.013 2.8E-07 35.2 3.2 30 109-141 5-35 (36)
42 KOG1025|consensus 95.7 0.024 5.1E-07 58.4 6.5 105 125-242 501-615 (1177)
43 PF12662 cEGF: Complement Clr- 95.7 0.0052 1.1E-07 34.3 1.0 17 242-258 6-24 (24)
44 cd00054 EGF_CA Calcium-binding 95.6 0.017 3.7E-07 35.2 3.3 29 110-141 9-37 (38)
45 PF00053 Laminin_EGF: Laminin 95.5 0.0061 1.3E-07 40.4 1.2 26 125-150 15-40 (49)
46 smart00180 EGF_Lam Laminin-typ 95.4 0.013 2.9E-07 38.3 2.5 25 126-150 16-40 (46)
47 smart00181 EGF Epidermal growt 95.4 0.019 4.2E-07 34.7 3.0 28 110-141 6-34 (35)
48 smart00261 FU Furin-like repea 95.1 0.032 6.8E-07 36.3 3.4 35 159-195 3-37 (46)
49 PF14670 FXa_inhibition: Coagu 95.0 0.01 2.2E-07 36.7 0.9 28 164-192 5-33 (36)
50 KOG3512|consensus 95.0 0.062 1.3E-06 51.7 6.5 76 112-190 280-384 (592)
51 PF12661 hEGF: Human growth fa 94.5 0.015 3.2E-07 27.5 0.6 13 129-141 1-13 (13)
52 PF14670 FXa_inhibition: Coagu 94.3 0.018 3.9E-07 35.6 0.7 24 205-249 7-30 (36)
53 cd00064 FU Furin-like repeats. 94.1 0.087 1.9E-06 34.7 3.8 32 162-195 1-32 (49)
54 cd00054 EGF_CA Calcium-binding 94.0 0.056 1.2E-06 32.9 2.7 34 197-249 2-35 (38)
55 PF00008 EGF: EGF-like domain 93.9 0.024 5.2E-07 34.0 0.8 27 204-249 4-31 (32)
56 PF06247 Plasmod_Pvs28: Plasmo 93.7 0.0094 2E-07 50.5 -1.8 122 83-249 20-162 (197)
57 PF00594 Gla: Vitamin K-depend 93.4 0.019 4.2E-07 36.8 -0.3 39 19-69 1-39 (42)
58 KOG1226|consensus 93.4 0.16 3.4E-06 51.9 5.9 38 98-142 538-580 (783)
59 cd00053 EGF Epidermal growth f 92.8 0.13 2.7E-06 30.6 2.8 27 204-249 6-32 (36)
60 PF12946 EGF_MSP1_1: MSP1 EGF 92.7 0.04 8.6E-07 34.0 0.4 32 204-254 5-37 (37)
61 PTZ00382 Variant-specific surf 92.5 0.15 3.3E-06 39.0 3.4 27 227-255 29-55 (96)
62 smart00181 EGF Epidermal growt 92.4 0.15 3.2E-06 30.7 2.7 25 205-249 7-31 (35)
63 smart00069 GLA Domain containi 92.2 0.094 2E-06 37.0 1.8 38 18-66 24-61 (65)
64 PF06247 Plasmod_Pvs28: Plasmo 91.7 0.033 7.2E-07 47.3 -1.1 61 175-255 18-87 (197)
65 KOG3512|consensus 90.0 0.84 1.8E-05 44.2 6.4 65 126-192 357-428 (592)
66 PTZ00382 Variant-specific surf 88.7 0.74 1.6E-05 35.2 4.1 45 140-190 5-50 (96)
67 PF01414 DSL: Delta serrate li 86.6 0.15 3.2E-06 35.8 -0.7 44 91-141 19-63 (63)
68 cd01475 vWA_Matrilin VWA_Matri 85.8 0.5 1.1E-05 41.7 2.1 37 192-249 182-219 (224)
69 PF09064 Tme5_EGF_like: Thromb 84.3 1.1 2.3E-05 27.1 2.3 22 170-192 11-32 (34)
70 cd00185 TNFR Tumor necrosis fa 83.9 3.3 7.1E-05 31.6 5.6 20 170-190 68-87 (98)
71 cd00185 TNFR Tumor necrosis fa 83.7 4.2 9E-05 31.0 6.1 18 233-251 71-88 (98)
72 PHA03099 epidermal growth fact 83.2 0.14 3.1E-06 40.6 -2.3 32 204-257 51-84 (139)
73 PF01683 EB: EB module; Inter 82.0 2.1 4.6E-05 28.3 3.4 21 110-137 26-46 (52)
74 PHA02887 EGF-like protein; Pro 80.1 1.4 3.1E-05 34.5 2.3 37 104-142 85-122 (126)
75 PF01683 EB: EB module; Inter 76.3 4 8.7E-05 27.0 3.4 12 242-253 41-52 (52)
76 PHA02637 TNF-alpha-receptor-li 74.6 5.4 0.00012 31.9 4.2 12 140-151 64-75 (127)
77 KOG4611|consensus 68.2 5.4 0.00012 37.5 3.3 25 140-168 99-123 (747)
78 KOG3782|consensus 67.5 0.79 1.7E-05 38.1 -2.0 18 26-43 69-86 (189)
79 KOG1218|consensus 64.4 49 0.0011 30.3 9.1 107 94-221 96-203 (316)
80 PF01826 TIL: Trypsin Inhibito 61.6 5.4 0.00012 26.7 1.6 22 178-200 34-55 (55)
81 cd01475 vWA_Matrilin VWA_Matri 58.7 8.4 0.00018 33.9 2.7 30 104-138 189-218 (224)
82 PF00757 Furin-like: Furin-lik 56.5 5.4 0.00012 33.0 1.0 34 224-259 51-85 (149)
83 KOG0196|consensus 52.8 14 0.00029 38.8 3.3 49 89-137 259-317 (996)
84 PHA03099 epidermal growth fact 52.3 12 0.00026 30.0 2.2 37 104-142 44-81 (139)
85 PF07699 GCC2_GCC3: GCC2 and G 41.2 46 0.001 21.4 3.5 10 140-149 10-19 (48)
86 PHA02887 EGF-like protein; Pro 40.0 20 0.00043 28.2 1.8 20 78-103 103-122 (126)
87 PHA02637 TNF-alpha-receptor-li 39.5 61 0.0013 26.0 4.5 9 141-149 44-52 (127)
88 PF00954 S_locus_glycop: S-loc 39.1 28 0.00061 26.8 2.6 33 196-248 76-108 (110)
89 PF04863 EGF_alliinase: Alliin 37.0 13 0.00028 25.1 0.2 32 111-142 18-50 (56)
90 PF04706 Dickkopf_N: Dickkopf 34.3 66 0.0014 21.5 3.4 32 200-231 2-33 (52)
91 PF00954 S_locus_glycop: S-loc 33.9 39 0.00085 26.0 2.6 26 109-138 83-108 (110)
92 PF07562 NCD3G: Nine Cysteines 33.5 11 0.00023 25.3 -0.5 8 142-149 7-14 (54)
93 KOG0814|consensus 30.0 9.2 0.0002 32.5 -1.6 26 16-41 189-215 (237)
94 KOG3516|consensus 25.9 55 0.0012 35.8 2.8 32 108-142 549-581 (1306)
95 KOG4258|consensus 25.0 81 0.0018 33.5 3.7 88 82-172 213-335 (1025)
96 PHA02714 CD-30-like protein; P 24.2 86 0.0019 23.8 2.8 18 170-188 52-69 (110)
97 PF12955 DUF3844: Domain of un 21.1 43 0.00093 25.9 0.7 30 107-136 10-41 (103)
98 KOG4611|consensus 20.9 1.2E+02 0.0025 28.9 3.6 18 177-194 98-115 (747)
99 KOG3509|consensus 20.6 1E+02 0.0022 33.4 3.5 62 128-190 718-793 (964)
No 1
>KOG4260|consensus
Probab=100.00 E-value=1.7e-61 Score=418.15 Aligned_cols=272 Identities=43% Similarity=0.952 Sum_probs=235.8
Q ss_pred cchhhccCCCCCCCCchhhhhccCcccccceEEEecccCcccCcc--cCccchhhHHhhhHHHHHHhhhhccCCcccCCC
Q psy826 2 SGIEKTAKGNFAGGDTAWEEEKQKIYAKSEVRLIEIQEKMCSEVS--GFLDQCHNFAADIESEIEEWWFKVQHSKAKDSD 79 (329)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~ee~~~~~y~~se~~~~e~~e~~C~~~~--~~~~~c~~~~~~~~~~~e~~~~~~~~~~~~~~~ 79 (329)
+||+||+|+||+||||||||++|++|+.||+||+||+|++|+..+ ..||+||.+++.+|++||.||++.|+ ..||
T Consensus 42 ~GlerT~r~hfaGGdTAWEEknL~kYk~SE~RLvEilEglCsks~~~n~DfeCh~lle~hEellE~w~~hkq~---e~Pd 118 (350)
T KOG4260|consen 42 EGLERTARHHFAGGDTAWEEKNLSKYKTSETRLVEILEGLCSKSSLPNMDFECHTLLEKHEELLEEWWYHKQH---ESPD 118 (350)
T ss_pred HHHHHHhhhccCCCchhhhhhhhhhccccchhHHHHHHHhhhccCCCCCChHHHHHHHHHHHHHHHHHHHhhc---CCch
Confidence 699999999999999999999999999999999999999999321 23899999999999999999999999 8899
Q ss_pred ccceeecccccCcCCCCCCCCCCccCCCCC-CCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCCC
Q psy826 80 LYTWLCINKLKRCCPVDHYGADCKPCLGFP-NVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKTI 158 (329)
Q Consensus 80 ~~~~~C~~~~~~~C~~G~~G~~C~~C~~~~-~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~~ 158 (329)
+.+|+|++..+++||+|.||++|.+||+++ .+|+++|+|.+++++.|++.|.|.+||.|+.|..|.++||....++...
T Consensus 119 l~~WlCvdqLkvCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~l 198 (350)
T KOG4260|consen 119 LFNWLCVDQLKVCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHL 198 (350)
T ss_pred HHhHhhhhhheeccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccc
Confidence 999999999999999999999999999866 7999999999999999999999999999999999999999999999999
Q ss_pred CCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCCC-CCCCCCceeecCCCCcccCCCcCCCCCCcCCccc
Q psy826 159 LCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDE-NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPD 237 (329)
Q Consensus 159 ~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~-~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~ 237 (329)
+|..||..|...|++.++..|..|+.||.++. ..|+|||||... .+|...++|+|+.|||+|.. .+++.. ++.
T Consensus 199 vCt~Ch~~C~~~Csg~~~k~C~kCkkGW~lde-~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~d-k~Gy~~----g~d 272 (350)
T KOG4260|consen 199 VCTACHEGCLGVCSGESSKGCSKCKKGWKLDE-EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCED-KEGYKK----GVD 272 (350)
T ss_pred hhhhhhhhhhcccCCCCCCChhhhcccceecc-cccccHHHHhcCCCCCChhheeecCCCceEecc-cccccC----ChH
Confidence 99999999998999999999999999999995 789999999987 89999999999999999982 433332 466
Q ss_pred ccccCcccccccCCcccccCcCCCCccccceeeeeeeceeeeEEEee
Q psy826 238 MCEACAEGYKLQQNICINTQAKSQNTNENLYRYGVYVGLCVATYIIF 284 (329)
Q Consensus 238 ~C~~C~~Gy~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (329)
.|..|..++.+.+..|.+++.... .+.+..+.+++..++...+.+
T Consensus 273 ~C~~~~d~~~~kn~~c~ni~~~~r--~v~f~~~~~~~g~cV~~~~p~ 317 (350)
T KOG4260|consen 273 ECQFCADVCASKNRPCMNIDGQYR--CVCFSGLIIIEGFCVWHGSPV 317 (350)
T ss_pred HhhhhhhhcccCCCCcccCCccEE--EEecccceeeeeeeeccCCch
Confidence 777777777777777766653322 334445555555554444333
No 2
>KOG1214|consensus
Probab=99.25 E-value=8.6e-12 Score=123.37 Aligned_cols=127 Identities=29% Similarity=0.689 Sum_probs=98.7
Q ss_pred CCCCCC--CCCCc---cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCC----CCCCC---------cCCCCccccccC
Q psy826 93 CPVDHY--GADCK---PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYT----GELCN---------ECNTGYFQSYKD 154 (329)
Q Consensus 93 C~~G~~--G~~C~---~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~----g~~C~---------~C~~g~y~~~~~ 154 (329)
|..||. |..|. +|......|..++.|.+ .+++++|.|..||. +-+|. .|.+|
T Consensus 720 cs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin---~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g------- 789 (1289)
T KOG1214|consen 720 CSSGYQGDGRNCVDENECATGFHRCGPNSVCIN---LPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDG------- 789 (1289)
T ss_pred EeeccCCCCCCCCChhhhccCCCCCCCCceeec---CCCceeEEEeecceeccCCcceEEecCCCCCCccccC-------
Confidence 666776 55665 56666677999999999 89999999999986 34662 33332
Q ss_pred cCCCCCcCCcccccCCcc--cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCc
Q psy826 155 EKTILCSKCHASCESGCS--TGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCH 232 (329)
Q Consensus 155 ~~~~~C~~C~~~C~~~C~--~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~ 232 (329)
...|.. ... ..|+ +.+.|.| .|.+||.|++ ..|.|+|||.. ..|.++++|.|++|++.|+
T Consensus 790 --~h~C~i-~g~--a~c~~hGgs~y~C-~CLPGfsGDG-~~c~dvDeC~p-srChp~A~CyntpgsfsC~---------- 851 (1289)
T KOG1214|consen 790 --SHTCAI-AGQ--ARCVHHGGSTYSC-ACLPGFSGDG-HQCTDVDECSP-SRCHPAATCYNTPGSFSCR---------- 851 (1289)
T ss_pred --ccccCc-CCc--eEEEecCCceEEE-eecCCccCCc-cccccccccCc-cccCCCceEecCCCcceee----------
Confidence 222222 112 2455 5678999 9999999987 67999999996 7899999999999999999
Q ss_pred CCcccccccCcccccccCCccccc
Q psy826 233 GDGPDMCEACAEGYKLQQNICINT 256 (329)
Q Consensus 233 g~~~~~C~~C~~Gy~~~~~~C~~~ 256 (329)
|.+||.|++.+|++.
T Consensus 852 ---------C~pGy~GDGf~CVP~ 866 (1289)
T KOG1214|consen 852 ---------CQPGYYGDGFQCVPD 866 (1289)
T ss_pred ---------cccCccCCCceecCC
Confidence 999999999999865
No 3
>PF11938 DUF3456: TLR4 regulator and MIR-interacting MSAP; InterPro: IPR021852 This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 182 to 356 amino acids in length.
Probab=98.89 E-value=1.6e-10 Score=96.37 Aligned_cols=88 Identities=38% Similarity=0.684 Sum_probs=73.8
Q ss_pred cchhhccCCCC-----CCCCchhhhhccCcc-cccceEEEecccCcccCcc-----------------------------
Q psy826 2 SGIEKTAKGNF-----AGGDTAWEEEKQKIY-AKSEVRLIEIQEKMCSEVS----------------------------- 46 (329)
Q Consensus 2 ~~~~~~~~~~~-----~~~~~~~ee~~~~~y-~~se~~~~e~~e~~C~~~~----------------------------- 46 (329)
.+|++|..+++ -|+|++|+++++.+| ++||+||+|++|++|....
T Consensus 16 ~~l~~~~~~~~v~~~~~r~d~~g~~~~~~~~y~~SE~~l~EvlE~vC~~~~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (151)
T PF11938_consen 16 HRLAKTDPKKFVDVGGFRLDPDGERKKKKIPYARSELRLVEVLENVCEKMKDYVLHKDKSTGKRVYKRMSTTGGTNPELS 95 (151)
T ss_pred HHHHHhCCcCceeccccccccccccccccCccccchhHHHHHHHHHccCcccceeeecCCCCceEEEecCCCcCcccccc
Confidence 36889999998 389999998888765 9999999999999999211
Q ss_pred -------------cCccchhhHHhhhHHHHHHhhhhccCCcccCCCccceeecccccCcC
Q psy826 47 -------------GFLDQCHNFAADIESEIEEWWFKVQHSKAKDSDLYTWLCINKLKRCC 93 (329)
Q Consensus 47 -------------~~~~~c~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~C~~~~~~~C 93 (329)
.-.+.|+.+++++|+.|+.||++.+. . .++..|+|.+..+.||
T Consensus 96 ~~~~~~~~~~~~~~L~~~C~~ilEe~Ed~i~~~~~~~~~---~-~~l~~~LC~~~~~~C~ 151 (151)
T PF11938_consen 96 EKDVKVDPSDENKRLKFYCERILEEYEDEIEEWFFSEQD---E-DDLEEWLCSETSKVCC 151 (151)
T ss_pred cccceeecHHHHhHHHHHHHHHHHHhHHHHHHHHHhcCC---c-cCHHHHhhhcccccCC
Confidence 12467999999999999999998876 4 6889999998877654
No 4
>KOG1219|consensus
Probab=98.86 E-value=1.3e-09 Score=116.78 Aligned_cols=101 Identities=30% Similarity=0.734 Sum_probs=82.4
Q ss_pred CCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC----cCCCCccccccCcCCCCCcCCcccccCCcc-cCCCCCCccCC
Q psy826 109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN----ECNTGYFQSYKDEKTILCSKCHASCESGCS-TGGPKGCTKCK 183 (329)
Q Consensus 109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~----~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~ 183 (329)
.+||.++|+|... -.+.|+|.|++-|+|+.|+ .|.. .||..+ ++|. ..+.+.| .|+
T Consensus 3869 ~npCqhgG~C~~~--~~ggy~CkCpsqysG~~CEi~~epC~s--------------nPC~~G--gtCip~~n~f~C-nC~ 3929 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQ--PKGGYKCKCPSQYSGNHCEIDLEPCAS--------------NPCLTG--GTCIPFYNGFLC-NCP 3929 (4289)
T ss_pred cCcccCCCEecCC--CCCceEEeCcccccCcccccccccccC--------------CCCCCC--CEEEecCCCeeE-eCC
Confidence 3789999999862 2566899999999999993 4544 577777 7888 6778999 999
Q ss_pred CCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 184 SGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 184 ~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
.||.|..+.. ..++||.. +.|..++.|+|.+|+|.|. |-+||.+.
T Consensus 3930 ~gyTG~~Ce~-~Gi~eCs~-n~C~~gg~C~n~~gsf~Cn-------------------cT~g~~gr 3974 (4289)
T KOG1219|consen 3930 NGYTGKRCEA-RGISECSK-NVCGTGGQCINIPGSFHCN-------------------CTPGILGR 3974 (4289)
T ss_pred CCccCceeec-cccccccc-ccccCCceeeccCCceEec-------------------cChhHhcc
Confidence 9999976321 12788986 7999999999999999999 99999876
No 5
>KOG4260|consensus
Probab=98.68 E-value=1.3e-08 Score=89.64 Aligned_cols=107 Identities=24% Similarity=0.631 Sum_probs=81.3
Q ss_pred cCCCCCCCCCCcCCCCccccccCcCCCCCcCCcccccCCcc----cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCC
Q psy826 132 CNKEYTGELCNECNTGYFQSYKDEKTILCSKCHASCESGCS----TGGPKGCTKCKSGWAADKDIGCYDINECSDENICS 207 (329)
Q Consensus 132 C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~----~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~ 207 (329)
|++|..|+.|..|+.|.- .+|.++ +.|. ..|+..| .|..||.|...+ .|.
T Consensus 132 Cp~gtyGpdCl~Cpggse-----------r~C~Gn--G~C~GdGsR~GsGkC-kC~~GY~Gp~C~------------~Cg 185 (350)
T KOG4260|consen 132 CPDGTYGPDCLQCPGGSE-----------RPCFGN--GSCHGDGSREGSGKC-KCETGYTGPLCR------------YCG 185 (350)
T ss_pred cCCCCcCCccccCCCCCc-----------CCcCCC--CcccCCCCCCCCCcc-cccCCCCCcccc------------ccc
Confidence 888999999999988732 345544 4554 3457799 999999987643 333
Q ss_pred CCcee-ecCCCCcccCCCcCCCC-CCcCCcccccccCcccccccCCcccccCcCCCCcc
Q psy826 208 GNQFC-VNTEGSYRCMQCDPSCN-GCHGDGPDMCEACAEGYKLQQNICINTQAKSQNTN 264 (329)
Q Consensus 208 ~~~~C-~n~~gs~~C~~C~~~C~-~C~g~~~~~C~~C~~Gy~~~~~~C~~~~~~~~~~~ 264 (329)
.+.+= ..+.....|.+||.+|. .|+|.++..|.+|+.||.++...|+|++++...+.
T Consensus 186 ~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~ 244 (350)
T KOG4260|consen 186 IEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPA 244 (350)
T ss_pred hHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceecccccccHHHHhcCCC
Confidence 23221 12345678999999997 99999999999999999999999999999876653
No 6
>KOG0994|consensus
Probab=98.50 E-value=4e-07 Score=93.57 Aligned_cols=188 Identities=23% Similarity=0.514 Sum_probs=100.1
Q ss_pred cCccchhhHHhhhHHHHHHhhhhccCCcccCC-Cccceeeccccc-CcCCCCCCCCCCccCCCC-----CCCCCCCCccc
Q psy826 47 GFLDQCHNFAADIESEIEEWWFKVQHSKAKDS-DLYTWLCINKLK-RCCPVDHYGADCKPCLGF-----PNVCFGNGKCK 119 (329)
Q Consensus 47 ~~~~~c~~~~~~~~~~~e~~~~~~~~~~~~~~-~~~~~~C~~~~~-~~C~~G~~G~~C~~C~~~-----~~~C~~~~~C~ 119 (329)
+.++.|.+++....-.+..--..-. ..+ .-.+..|..... +-|.|+-.|..|..|.++ +..|.. -.|.
T Consensus 742 ~~~e~C~~l~~~lsa~l~n~a~~Cn----CnptGSlS~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~-CdC~ 816 (1758)
T KOG0994|consen 742 KSDEVCQNLDNSLSALLHNGASMCN----CNPTGSLSSVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKA-CDCN 816 (1758)
T ss_pred CcchhhhhhhhhHHHHHhcCccccc----cCCCccccccccCCCceecccCccccccccccCCcccCcCCccCcc-cccc
Confidence 5677888887776555432221111 111 112234444332 226777667777766442 222210 0121
Q ss_pred CCC-----CCCCCceeecCCCCCCCCCCcCCCCccccc------cCcCCCCCcCCcccccCCcc-cCCCCCCccCCCCce
Q psy826 120 GNG-----TRKGNGQCVCNKEYTGELCNECNTGYFQSY------KDEKTILCSKCHASCESGCS-TGGPKGCTKCKSGWA 187 (329)
Q Consensus 120 ~~~-----~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~------~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~~G~~ 187 (329)
..+ -..-+++|+|.+|..|..|+.|.+|||+-. .+.-...|.+=.+.|. .|. ...++.|..|..||+
T Consensus 817 ~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi-~CqD~T~G~~CdrCl~Gyy 895 (1758)
T KOG0994|consen 817 SIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCDPITGACI-DCQDSTTGHSCDRCLDGYY 895 (1758)
T ss_pred ccccccccccccccceeeccccchhhccccCCCccCCCcCccccccCcccccCccccccc-cccccccccchhhhhcccc
Confidence 111 014568999999999999999999999621 1111222222223444 454 455689999999999
Q ss_pred ecCC----CCcccCCCCCCCCCCCCCc--------eeecCC--CCcccCCCcCCCCCC-cCCcccccccCccccccc---
Q psy826 188 ADKD----IGCYDINECSDENICSGNQ--------FCVNTE--GSYRCMQCDPSCNGC-HGDGPDMCEACAEGYKLQ--- 249 (329)
Q Consensus 188 ~~~~----~~C~d~~eC~~~~~C~~~~--------~C~n~~--gs~~C~~C~~~C~~C-~g~~~~~C~~C~~Gy~~~--- 249 (329)
|++. ..|.+ | +|+.+. .|.... ....|. | .|....+|..|+++|.|+
T Consensus 896 GdP~lg~g~~CrP---C----pCP~gp~Sg~~~A~sC~~d~~t~~ivC~--------C~~GY~G~RCe~CA~~~fGnP~~ 960 (1758)
T KOG0994|consen 896 GDPRLGSGIGCRP---C----PCPDGPASGRQHADSCYLDTRTQQIVCH--------CQEGYSGSRCEICADNHFGNPSE 960 (1758)
T ss_pred CCcccCCCCCCCC---C----CCCCCCccchhccccccccccccceeee--------cccCccccchhhhcccccCCccc
Confidence 9882 22432 1 232211 111111 111222 2 366777888899999886
Q ss_pred CCcccc
Q psy826 250 QNICIN 255 (329)
Q Consensus 250 ~~~C~~ 255 (329)
+++|..
T Consensus 961 GGtCq~ 966 (1758)
T KOG0994|consen 961 GGTCQK 966 (1758)
T ss_pred CCcccc
Confidence 667753
No 7
>KOG4289|consensus
Probab=98.31 E-value=7.6e-07 Score=93.14 Aligned_cols=121 Identities=28% Similarity=0.692 Sum_probs=81.1
Q ss_pred cccceEEEecccCcccCcccCccchhhHHhhhHHHHHHhhhh----ccCCc----ccCCCccceeecccccCcCCCCCCC
Q psy826 28 AKSEVRLIEIQEKMCSEVSGFLDQCHNFAADIESEIEEWWFK----VQHSK----AKDSDLYTWLCINKLKRCCPVDHYG 99 (329)
Q Consensus 28 ~~se~~~~e~~e~~C~~~~~~~~~c~~~~~~~~~~~e~~~~~----~~~~~----~~~~~~~~~~C~~~~~~~C~~G~~G 99 (329)
+.|-.|++-+.+++|- .+.|.++.... ...-+. +..++ +.+.......|. ||+||+|
T Consensus 1168 ~~sll~VlpfdDniCl-----rEPCenymkCv----svlrFdssapf~~s~s~lfRpi~pvnglrCr------CPpGFTg 1232 (2531)
T KOG4289|consen 1168 AISLLRVLPFDDNICL-----REPCENYMKCV----SVLRFDSSAPFLASDSVLFRPIHPVNGLRCR------CPPGFTG 1232 (2531)
T ss_pred HhhheeeeeccCchhh-----cchhHHHHhhh----hheeecccCccccccceeeeeccccCceeEe------CCCCCCc
Confidence 4567888888999998 66776654332 111110 00000 011222334455 9999999
Q ss_pred CCCc----cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC------cCCCCccccccCcCCCCCcCCcccccC
Q psy826 100 ADCK----PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN------ECNTGYFQSYKDEKTILCSKCHASCES 169 (329)
Q Consensus 100 ~~C~----~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~------~C~~g~y~~~~~~~~~~C~~C~~~C~~ 169 (329)
..|+ .| ...+|.+++.|.. +.|.|+|.|.+||+|..|+ .|.+|+ |..+ +
T Consensus 1233 d~CeTeiDlC--Ys~pC~nng~C~s---rEggYtCeCrpg~tGehCEvs~~agrCvpGv--------------C~ng--g 1291 (2531)
T KOG4289|consen 1233 DYCETEIDLC--YSGPCGNNGRCRS---REGGYTCECRPGFTGEHCEVSARAGRCVPGV--------------CKNG--G 1291 (2531)
T ss_pred ccccchhHhh--hcCCCCCCCceEE---ecCceeEEecCCccccceeeecccCccccce--------------ecCC--C
Confidence 9998 35 4478999999998 8999999999999999994 565553 4444 5
Q ss_pred Ccc--cCCCCCCccCCCC
Q psy826 170 GCS--TGGPKGCTKCKSG 185 (329)
Q Consensus 170 ~C~--~~~~~~C~~C~~G 185 (329)
+|. ..+.+.| .|+.|
T Consensus 1292 tC~~~~nggf~c-~Cp~g 1308 (2531)
T KOG4289|consen 1292 TCVNLLNGGFCC-HCPYG 1308 (2531)
T ss_pred EEeecCCCceec-cCCCc
Confidence 676 4566778 88887
No 8
>KOG1225|consensus
Probab=98.27 E-value=2e-06 Score=84.30 Aligned_cols=40 Identities=40% Similarity=1.115 Sum_probs=27.3
Q ss_pred CCCCCCCCCCc--cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826 93 CPVDHYGADCK--PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN 142 (329)
Q Consensus 93 C~~G~~G~~C~--~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~ 142 (329)
|+.+|+|+.|. .|+.+ |.+++.|.. +.|+|++||+|..|+
T Consensus 238 c~~~~~g~~c~~~~C~~~---c~~~g~c~~-------G~CIC~~Gf~G~dC~ 279 (525)
T KOG1225|consen 238 CPEGYFGPLCSTIYCPGG---CTGRGQCVE-------GRCICPPGFTGDDCD 279 (525)
T ss_pred cCCceeCCccccccCCCC---CcccceEeC-------CeEeCCCCCcCCCCC
Confidence 77777777776 45443 555566664 677777777777774
No 9
>KOG0994|consensus
Probab=98.27 E-value=1.9e-06 Score=88.71 Aligned_cols=150 Identities=26% Similarity=0.602 Sum_probs=86.8
Q ss_pred CCCCCCCCCCccCCCC------CCCCC--CC-CcccCCCCCCCCceee-cCCCCCCCCCCcCCCCccccccCcCCCCCcC
Q psy826 93 CPVDHYGADCKPCLGF------PNVCF--GN-GKCKGNGTRKGNGQCV-CNKEYTGELCNECNTGYFQSYKDEKTILCSK 162 (329)
Q Consensus 93 C~~G~~G~~C~~C~~~------~~~C~--~~-~~C~~~~~~~gs~~C~-C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~ 162 (329)
|.+|.||..|..|.++ +.+|. ++ ..|. +-++.|+ |..-.+|..|+.|.+||||+..-.....|.|
T Consensus 834 C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd-----~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrP 908 (1758)
T KOG0994|consen 834 CRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCD-----PITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRP 908 (1758)
T ss_pred eccccchhhccccCCCccCCCcCccccccCcccccC-----ccccccccccccccccchhhhhccccCCcccCCCCCCCC
Confidence 8888888888877652 13443 22 2344 4457886 9888889999999999999877777777887
Q ss_pred Cc--ccc------cCCcc---cCCCCCCccCCCCceecCCCCcccCCCCCC--CCCCCCCceeecCCCCcccCCCcCC--
Q psy826 163 CH--ASC------ESGCS---TGGPKGCTKCKSGWAADKDIGCYDINECSD--ENICSGNQFCVNTEGSYRCMQCDPS-- 227 (329)
Q Consensus 163 C~--~~C------~~~C~---~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~--~~~C~~~~~C~n~~gs~~C~~C~~~-- 227 (329)
|. .+= ..+|. ......| .|.+||.|.....|.+-.-=.. +..|.. -.|.|+...+.=..|+..
T Consensus 909 CpCP~gp~Sg~~~A~sC~~d~~t~~ivC-~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~-CeC~~NiD~~d~~aCD~~TG 986 (1758)
T KOG0994|consen 909 CPCPDGPASGRQHADSCYLDTRTQQIVC-HCQEGYSGSRCEICADNHFGNPSEGGTCQK-CECSNNIDLYDPGACDVATG 986 (1758)
T ss_pred CCCCCCCccchhccccccccccccceee-ecccCccccchhhhcccccCCcccCCcccc-ccccCCcCccCCCccchhhc
Confidence 63 100 01232 1123467 7788887765433322100000 011210 123333333333344443
Q ss_pred -CCCC-cCCcccccccCccccccc
Q psy826 228 -CNGC-HGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 228 -C~~C-~g~~~~~C~~C~~Gy~~~ 249 (329)
|..| ......+|..|++||+|+
T Consensus 987 ~CLkCL~hTeG~hCe~Ck~Gf~Gd 1010 (1758)
T KOG0994|consen 987 ACLKCLYHTEGDHCEHCKDGFYGD 1010 (1758)
T ss_pred hhhhhhhcccccchhhccccchhH
Confidence 5556 345667899999999986
No 10
>KOG1214|consensus
Probab=98.09 E-value=4.9e-06 Score=83.49 Aligned_cols=118 Identities=30% Similarity=0.724 Sum_probs=82.4
Q ss_pred CCCCCCCCcccCCCCCCCCceeecCCCCCC--CCC---CcCCCCccccccCcCCCCCcCCcccccCCcc-cCCCCCCccC
Q psy826 109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTG--ELC---NECNTGYFQSYKDEKTILCSKCHASCESGCS-TGGPKGCTKC 182 (329)
Q Consensus 109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g--~~C---~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C 182 (329)
.+.|..++.|.. + +.-.|+|.|..||.| ..| .+|+.++ .+|. +. ..|+ .+++++| .|
T Consensus 699 sh~cdt~a~C~p-g-~~~~~tcecs~g~~gdgr~c~d~~eca~~~---------~~CG---p~--s~Cin~pg~~rc-eC 761 (1289)
T KOG1214|consen 699 SHMCDTTARCHP-G-TGVDYTCECSSGYQGDGRNCVDENECATGF---------HRCG---PN--SVCINLPGSYRC-EC 761 (1289)
T ss_pred CcccCCCccccC-C-CCcceEEEEeeccCCCCCCCCChhhhccCC---------CCCC---CC--ceeecCCCceeE-EE
Confidence 344556666763 2 234589999999985 466 3455432 2233 22 3566 7889999 99
Q ss_pred CCCceecCC-CCcc------cCCCCCCC-CCCCCCc--eeecCC-CCcccCCCcCCCCCCcCCcccccccCcccccccCC
Q psy826 183 KSGWAADKD-IGCY------DINECSDE-NICSGNQ--FCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQN 251 (329)
Q Consensus 183 ~~G~~~~~~-~~C~------d~~eC~~~-~~C~~~~--~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~ 251 (329)
..||...++ .+|+ .++.|..+ +.|...+ .|+... ++|.|. |-+||.|++.
T Consensus 762 ~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~-------------------CLPGfsGDG~ 822 (1289)
T KOG1214|consen 762 RSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCA-------------------CLPGFSGDGH 822 (1289)
T ss_pred eecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEe-------------------ecCCccCCcc
Confidence 999998776 4586 35668877 7787544 444444 468998 9999999999
Q ss_pred cccccCcCCCC
Q psy826 252 ICINTQAKSQN 262 (329)
Q Consensus 252 ~C~~~~~~~~~ 262 (329)
.|.++++|..+
T Consensus 823 ~c~dvDeC~ps 833 (1289)
T KOG1214|consen 823 QCTDVDECSPS 833 (1289)
T ss_pred ccccccccCcc
Confidence 99999988754
No 11
>KOG1217|consensus
Probab=98.07 E-value=2.5e-05 Score=76.22 Aligned_cols=123 Identities=29% Similarity=0.755 Sum_probs=85.1
Q ss_pred cCCCCCCCCCCc----cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCc--------------CCCCcccccc
Q psy826 92 CCPVDHYGADCK----PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNE--------------CNTGYFQSYK 153 (329)
Q Consensus 92 ~C~~G~~G~~C~----~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~--------------C~~g~y~~~~ 153 (329)
.|..||.+..|. .|.....+|.+.+.|.+ ..+++.|.|.+||.+..|.. ++.++.+...
T Consensus 155 ~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~---~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c 231 (487)
T KOG1217|consen 155 SCTEGYEGEPCETDLDECIQYSSPCQNGGTCVN---TGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPEC 231 (487)
T ss_pred eeCCCcccccccccccccccCCCCcCCCccccc---CCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCc
Confidence 399999988776 56555567988899998 67779999999999887742 2333322111
Q ss_pred CcCCCCCcCCcccccCCcc-cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccC
Q psy826 154 DEKTILCSKCHASCESGCS-TGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCM 222 (329)
Q Consensus 154 ~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~ 222 (329)
..... .|... .+.|. ..+.+.| .|++||.+.....|.++++|.....|.++..|++..++|.|.
T Consensus 232 ~~~~~---~~~~~-~~~c~~~~~~~~C-~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~ 296 (487)
T KOG1217|consen 232 EVSIV---ECASG-DGTCVNTVGSYTC-RCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCT 296 (487)
T ss_pred ccccc---cccCC-CCcccccCCceee-eCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceee
Confidence 11111 11100 02454 4566899 999999987633578899999862388889999999889997
No 12
>KOG1225|consensus
Probab=98.02 E-value=1.1e-05 Score=79.14 Aligned_cols=96 Identities=34% Similarity=0.936 Sum_probs=69.6
Q ss_pred cCcCCCCCCCCCCcc--CCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC--cCCCCccccccCcCCCCCcCCcc
Q psy826 90 KRCCPVDHYGADCKP--CLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN--ECNTGYFQSYKDEKTILCSKCHA 165 (329)
Q Consensus 90 ~~~C~~G~~G~~C~~--C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~--~C~~g~y~~~~~~~~~~C~~C~~ 165 (329)
.+.|++||+|.+|.+ ||.. |++++.|.+ +.|+|++||.|..|. .|+. .|++
T Consensus 266 ~CIC~~Gf~G~dC~e~~Cp~~---cs~~g~~~~-------g~CiC~~g~~G~dCs~~~cpa---------------dC~g 320 (525)
T KOG1225|consen 266 RCICPPGFTGDDCDELVCPVD---CSGGGVCVD-------GECICNPGYSGKDCSIRRCPA---------------DCSG 320 (525)
T ss_pred eEeCCCCCcCCCCCcccCCcc---cCCCceecC-------CEeecCCCccccccccccCCc---------------cCCC
Confidence 345999999999995 7653 887787775 599999999999883 3332 2444
Q ss_pred cccCCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccc
Q psy826 166 SCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEG 245 (329)
Q Consensus 166 ~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~G 245 (329)
. +.|. ..+| .|.+||.|.. |... .|+++..|+| . |. |..|
T Consensus 321 ~--G~Ci---~G~C-~C~~Gy~G~~---------C~~~-~C~~~g~cv~----g-C~-------------------C~~G 360 (525)
T KOG1225|consen 321 H--GKCI---DGEC-LCDEGYTGEL---------CIQR-ACSGGGQCVN----G-CK-------------------CKKG 360 (525)
T ss_pred C--Cccc---CCce-EeCCCCcCCc---------cccc-ccCCCceecc----C-ce-------------------eccC
Confidence 3 4554 3689 9999999875 3332 3766777764 2 77 9999
Q ss_pred ccccC
Q psy826 246 YKLQQ 250 (329)
Q Consensus 246 y~~~~ 250 (329)
|.+.+
T Consensus 361 w~G~d 365 (525)
T KOG1225|consen 361 WRGPD 365 (525)
T ss_pred ccCCC
Confidence 99875
No 13
>PF14843 GF_recep_IV: Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=97.95 E-value=4.1e-06 Score=68.22 Aligned_cols=104 Identities=27% Similarity=0.740 Sum_probs=61.3
Q ss_pred CCCceee-cCCCCCCCCC-CcCCCCccccccCcCCCCCcCCccccc-----CCcccCCCCCCccCCCCceecCCCCcccC
Q psy826 125 KGNGQCV-CNKEYTGELC-NECNTGYFQSYKDEKTILCSKCHASCE-----SGCSTGGPKGCTKCKSGWAADKDIGCYDI 197 (329)
Q Consensus 125 ~gs~~C~-C~~G~~g~~C-~~C~~g~y~~~~~~~~~~C~~C~~~C~-----~~C~~~~~~~C~~C~~G~~~~~~~~C~d~ 197 (329)
++...|+ |.....+..| ..|+.++|..........|.+||..|. .+|.+++...|+.|.. |.. ...|
T Consensus 14 p~~~~C~sC~~~~~~g~Cv~~C~~~~~~~~~~~~~~~C~~Ch~~C~~~~~~~~C~Gp~~~~C~~C~~-~~~--~~~C--- 87 (132)
T PF14843_consen 14 PGPDQCLSCRNFSFNGTCVASCPDGYYFDGSYEDSNTCQPCHPECLPSNGSETCTGPGADQCLSCAH-FRD--GGQC--- 87 (132)
T ss_dssp SSTTCBSSBSSEEETTEEESSTSSSSSSSSEEEETTEEEE--TTB---TTSSSBSSSSTTTBSSBSS-EEE--TTEE---
T ss_pred cChhhccCccCcccCCcchhhcccccccccccccCCcCcccChhhCccccCcCcCCCCcchhhhccc-cCC--CCcc---
Confidence 3334443 4433334555 788876442111123678999999992 4999999999999986 432 2334
Q ss_pred CCCCCCCCCCCCce-------eecCCCCcccCCCcCCC-CCCcCCcccccc
Q psy826 198 NECSDENICSGNQF-------CVNTEGSYRCMQCDPSC-NGCHGDGPDMCE 240 (329)
Q Consensus 198 ~eC~~~~~C~~~~~-------C~n~~gs~~C~~C~~~C-~~C~g~~~~~C~ 240 (329)
+. .|+.+.+ ... .....|.+||+.| ..|+||.+.+|.
T Consensus 88 ---v~--~CP~~~~~~~~~~~~~~-~~~~~C~~Ch~~C~~gC~Gp~~~~C~ 132 (132)
T PF14843_consen 88 ---VS--SCPEGTYGENFPIYYYP-DSNGVCQPCHPSCTQGCTGPSASDCQ 132 (132)
T ss_dssp ---ES--S--EEEEBTTSEEEEEE--TTSBEEE-STTSTT-BSSSSTTCC-
T ss_pred ---cC--cCCCCCccccccceecC-CCCCccCCcCccccCcCCCCCcccCC
Confidence 43 6776631 112 3356899999999 899999988873
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.93 E-value=4.6e-06 Score=53.78 Aligned_cols=34 Identities=56% Similarity=1.188 Sum_probs=31.4
Q ss_pred cCCCCCCC-CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCcccccc
Q psy826 196 DINECSDE-NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKL 248 (329)
Q Consensus 196 d~~eC~~~-~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~ 248 (329)
|||||... +.|+.++.|+|+.|+|.|. |++||..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~-------------------C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCS-------------------CPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEE-------------------ESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEee-------------------CCCCcEE
Confidence 68999987 7898899999999999999 9999984
No 15
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=97.72 E-value=0.00027 Score=73.51 Aligned_cols=62 Identities=29% Similarity=0.741 Sum_probs=43.4
Q ss_pred CCceee-cCCCCCC--------CCC---CcCCCCccccccCcCCCCCcCCc-ccccCCcccCCCCCCccCCCCceecCCC
Q psy826 126 GNGQCV-CNKEYTG--------ELC---NECNTGYFQSYKDEKTILCSKCH-ASCESGCSTGGPKGCTKCKSGWAADKDI 192 (329)
Q Consensus 126 gs~~C~-C~~G~~g--------~~C---~~C~~g~y~~~~~~~~~~C~~C~-~~C~~~C~~~~~~~C~~C~~G~~~~~~~ 192 (329)
+...|. |..+|.- ..| ..|.+|||. + ...|.+|+ .+|+ +|.. ...|+.|..+|++..++
T Consensus 590 ~~~~C~sC~~g~yL~~~~~~~~g~Cv~~~~C~~GYY~---d--~~~C~~C~~~~C~-tC~~--~~~C~~C~~~~~ls~~~ 661 (800)
T PTZ00214 590 GPGRCTRCPSDKLLKRASGAATGSCVDPGACVDGYYA---D--GDACLPCATPGCK-TCGH--ASFCTECAGELFVSLDG 661 (800)
T ss_pred CCCcceECCCCCEeccccccccccccCcccCCCCccc---C--CCccccCCccccc-cccC--CCCcCcCCCCceecCcC
Confidence 345564 7777742 134 489999994 3 24699999 7998 8863 35899999998886644
Q ss_pred C-cc
Q psy826 193 G-CY 195 (329)
Q Consensus 193 ~-C~ 195 (329)
. |+
T Consensus 662 ~~C~ 665 (800)
T PTZ00214 662 QSCL 665 (800)
T ss_pred Cchh
Confidence 3 64
No 16
>KOG4289|consensus
Probab=97.71 E-value=3.6e-05 Score=81.19 Aligned_cols=92 Identities=24% Similarity=0.687 Sum_probs=66.7
Q ss_pred CCCceeecCCCCCCCCCC----cCCCCccccccCcCCCCCcCCcccccCCcc-cCCCCCCccCCCCceecCCCCcccCCC
Q psy826 125 KGNGQCVCNKEYTGELCN----ECNTGYFQSYKDEKTILCSKCHASCESGCS-TGGPKGCTKCKSGWAADKDIGCYDINE 199 (329)
Q Consensus 125 ~gs~~C~C~~G~~g~~C~----~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~-~~~~~~C~~C~~G~~~~~~~~C~d~~e 199 (329)
.+..+|.|++||+|+.|+ .|-. .+|.++ ++|. ..++|.| .|.+||.|..+..=...-.
T Consensus 1219 vnglrCrCPpGFTgd~CeTeiDlCYs--------------~pC~nn--g~C~srEggYtC-eCrpg~tGehCEvs~~agr 1281 (2531)
T KOG4289|consen 1219 VNGLRCRCPPGFTGDYCETEIDLCYS--------------GPCGNN--GRCRSREGGYTC-ECRPGFTGEHCEVSARAGR 1281 (2531)
T ss_pred cCceeEeCCCCCCcccccchhHhhhc--------------CCCCCC--CceEEecCceeE-EecCCccccceeeecccCc
Confidence 567899999999999993 4433 356555 5777 7788999 9999999976431112223
Q ss_pred CCCCCCCCCCceeecCC-CCcccCCCcCCCCCCcCCcccccccCcccccccCCccc
Q psy826 200 CSDENICSGNQFCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICI 254 (329)
Q Consensus 200 C~~~~~C~~~~~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~ 254 (329)
|.. +.|.++..|++.. |.+.|. |+.| .-.+..|.
T Consensus 1282 Cvp-GvC~nggtC~~~~nggf~c~-------------------Cp~g-e~e~prC~ 1316 (2531)
T KOG4289|consen 1282 CVP-GVCKNGGTCVNLLNGGFCCH-------------------CPYG-EFEDPRCE 1316 (2531)
T ss_pred ccc-ceecCCCEEeecCCCceecc-------------------CCCc-ccCCCceE
Confidence 555 6899999999886 778888 9988 44455665
No 17
>KOG1836|consensus
Probab=97.65 E-value=0.0002 Score=79.17 Aligned_cols=96 Identities=25% Similarity=0.516 Sum_probs=63.2
Q ss_pred CcCCCCCCCCCCccCCCC-------------CCCCCCCC---cccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccC
Q psy826 91 RCCPVDHYGADCKPCLGF-------------PNVCFGNG---KCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKD 154 (329)
Q Consensus 91 ~~C~~G~~G~~C~~C~~~-------------~~~C~~~~---~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~ 154 (329)
+.|++||.|..|+.|.++ +-+|.-++ +|. ..++.|.|.+...|..|+.|.+|||+....
T Consensus 697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd-----~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~ 771 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICD-----PRTGQCKCKHNTFGGQCAQCVDGFYGLPDL 771 (1705)
T ss_pred ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCcccccc-----CCCCceecccCCCCCchhhhcCCCCCcccc
Confidence 349999999999988542 01233222 343 567899999999999999999999986555
Q ss_pred cCCCCCcCCc--c--cccCCcccCCCCCCccCCCCceecCCC
Q psy826 155 EKTILCSKCH--A--SCESGCSTGGPKGCTKCKSGWAADKDI 192 (329)
Q Consensus 155 ~~~~~C~~C~--~--~C~~~C~~~~~~~C~~C~~G~~~~~~~ 192 (329)
.....|.+|+ . .|..++. .....|+.|++||+|....
T Consensus 772 ~~~~dC~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gytG~rCe 812 (1705)
T KOG1836|consen 772 GTSGDCQPCPCPNGGACGQTPE-ILEVVCKNCPPGYTGLRCE 812 (1705)
T ss_pred CCCCCCccCCCCCChhhcCcCc-ccceecCCCCCCCcccccc
Confidence 4444577664 2 2222221 3346775577777776643
No 18
>KOG1219|consensus
Probab=97.61 E-value=5.2e-05 Score=83.00 Aligned_cols=81 Identities=30% Similarity=0.706 Sum_probs=62.5
Q ss_pred CCCCCCCCCCc----cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCCCCCcCCccccc
Q psy826 93 CPVDHYGADCK----PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKTILCSKCHASCE 168 (329)
Q Consensus 93 C~~G~~G~~C~----~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~ 168 (329)
||+-|.|..|+ +|. ++||..+++|.. ..+.|.|.|+.||+|..|+. .| .++++. .+|..+
T Consensus 3890 CpsqysG~~CEi~~epC~--snPC~~GgtCip---~~n~f~CnC~~gyTG~~Ce~--~G-----i~eCs~--n~C~~g-- 3953 (4289)
T KOG1219|consen 3890 CPSQYSGNHCEIDLEPCA--SNPCLTGGTCIP---FYNGFLCNCPNGYTGKRCEA--RG-----ISECSK--NVCGTG-- 3953 (4289)
T ss_pred CcccccCccccccccccc--CCCCCCCCEEEe---cCCCeeEeCCCCccCceeec--cc-----cccccc--ccccCC--
Confidence 99999999998 674 478999999997 67789999999999999942 11 122111 245555
Q ss_pred CCcc-cCCCCCCccCCCCceecC
Q psy826 169 SGCS-TGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 169 ~~C~-~~~~~~C~~C~~G~~~~~ 190 (329)
+.|. .+|++.| .|-+||.|..
T Consensus 3954 g~C~n~~gsf~C-ncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3954 GQCINIPGSFHC-NCTPGILGRT 3975 (4289)
T ss_pred ceeeccCCceEe-ccChhHhccc
Confidence 6787 7889999 9999998765
No 19
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=97.52 E-value=0.00032 Score=67.59 Aligned_cols=33 Identities=33% Similarity=0.962 Sum_probs=29.9
Q ss_pred CcccCCCcCCCCCCcCCcccccccCcccccccC
Q psy826 218 SYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQ 250 (329)
Q Consensus 218 s~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~ 250 (329)
+..|.+||+.|..|+++.+.+|++|++||.+..
T Consensus 104 ~~~C~~C~~~C~~C~~~~~~~Ct~C~~g~~L~~ 136 (397)
T PF03302_consen 104 GNKCVPCHESCATCSGGAPNQCTSCKPGKVLKY 136 (397)
T ss_pred CCCCCCCCccccccCCCCCCCCcccCCCccccc
Confidence 458999999999999999999999999998763
No 20
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=97.49 E-value=0.00041 Score=72.16 Aligned_cols=106 Identities=24% Similarity=0.625 Sum_probs=69.0
Q ss_pred CCCcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCc
Q psy826 140 LCNECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSY 219 (329)
Q Consensus 140 ~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~ 219 (329)
.|..|.+|||. + ....|..|+.+|. .|.+++...|++|..+|++..... .....|.....|..+.| . ...
T Consensus 560 ~Ct~C~~GYyl---~-~~~~C~~Cd~~C~-~C~~~~~~~C~sC~~g~yL~~~~~-~~~g~Cv~~~~C~~GYY--~--d~~ 629 (800)
T PTZ00214 560 KCTTTKKGYGI---S-PDGKLLECDPTCL-ACTAPGPGRCTRCPSDKLLKRASG-AATGSCVDPGACVDGYY--A--DGD 629 (800)
T ss_pred cccccccCccc---C-CCCccccccchhh-hccCCCCCcceECCCCCEeccccc-cccccccCcccCCCCcc--c--CCC
Confidence 56788999984 2 2245889999998 898877889999999999875211 11112322135555555 1 234
Q ss_pred ccCCCc-CCCCCCcCCcccccccCccccccc--CCcccccC
Q psy826 220 RCMQCD-PSCNGCHGDGPDMCEACAEGYKLQ--QNICINTQ 257 (329)
Q Consensus 220 ~C~~C~-~~C~~C~g~~~~~C~~C~~Gy~~~--~~~C~~~~ 257 (329)
.|.+|+ +.|..|.. ...|++|..+|++. +..|+..+
T Consensus 630 ~C~~C~~~~C~tC~~--~~~C~~C~~~~~ls~~~~~C~~~C 668 (800)
T PTZ00214 630 ACLPCATPGCKTCGH--ASFCTECAGELFVSLDGQSCLEEC 668 (800)
T ss_pred ccccCCccccccccC--CCCcCcCCCCceecCcCCchhhcC
Confidence 788998 68888864 34788887776643 34455433
No 21
>PF14843 GF_recep_IV: Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=97.49 E-value=7.4e-05 Score=60.81 Aligned_cols=85 Identities=29% Similarity=0.840 Sum_probs=54.6
Q ss_pred Cccccc-CCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeec-CCCCcccCCCcCCC------CCCcCC
Q psy826 163 CHASCE-SGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVN-TEGSYRCMQCDPSC------NGCHGD 234 (329)
Q Consensus 163 C~~~C~-~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n-~~gs~~C~~C~~~C------~~C~g~ 234 (329)
||+.|. .+|+++++..|+.|+. |... +.|+ . .|+...+=.+ ......|.+||+.| .+|+||
T Consensus 2 C~~~C~~~gC~Gp~~~~C~sC~~-~~~~--g~Cv------~--~C~~~~~~~~~~~~~~~C~~Ch~~C~~~~~~~~C~Gp 70 (132)
T PF14843_consen 2 CHPQCASDGCWGPGPDQCLSCRN-FSFN--GTCV------A--SCPDGYYFDGSYEDSNTCQPCHPECLPSNGSETCTGP 70 (132)
T ss_dssp -CTTBGTT-BSSSSTTCBSSBSS-EEET--TEEE------S--STSSSSSSSSEEEETTEEEE--TTB---TTSSSBSSS
T ss_pred cccccCCCCccCcChhhccCccC-cccC--Ccch------h--hcccccccccccccCCcCcccChhhCccccCcCcCCC
Confidence 455554 4789999999999985 4443 4454 2 5543311000 11267899999999 999999
Q ss_pred cccccccCcccccccCCcccccCcCC
Q psy826 235 GPDMCEACAEGYKLQQNICINTQAKS 260 (329)
Q Consensus 235 ~~~~C~~C~~Gy~~~~~~C~~~~~~~ 260 (329)
++.+|++|+. | .+++.|+..++..
T Consensus 71 ~~~~C~~C~~-~-~~~~~Cv~~CP~~ 94 (132)
T PF14843_consen 71 GADQCLSCAH-F-RDGGQCVSSCPEG 94 (132)
T ss_dssp STTTBSSBSS-E-EETTEEESS--EE
T ss_pred Ccchhhhccc-c-CCCCcccCcCCCC
Confidence 9999999996 3 3577898776544
No 22
>KOG1836|consensus
Probab=97.31 E-value=0.00085 Score=74.29 Aligned_cols=92 Identities=33% Similarity=0.764 Sum_probs=61.3
Q ss_pred cCCCCCCCC-------CCccCCCCCCCCCCCCcccCCCCCCCCceee-cCCCCCCCCCCcCCCCccccccCcC--CCCCc
Q psy826 92 CCPVDHYGA-------DCKPCLGFPNVCFGNGKCKGNGTRKGNGQCV-CNKEYTGELCNECNTGYFQSYKDEK--TILCS 161 (329)
Q Consensus 92 ~C~~G~~G~-------~C~~C~~~~~~C~~~~~C~~~~~~~gs~~C~-C~~G~~g~~C~~C~~g~y~~~~~~~--~~~C~ 161 (329)
.|..||||. +|++|+ |.+.+.|.... ....+.|. |++||+|..|+.|.+|||+...... ...|.
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~-----Cp~~~~~~~~~-~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~ 833 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCP-----CPNGGACGQTP-EILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQ 833 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCC-----CCCChhhcCcC-cccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCc
Confidence 378888843 377664 44445565422 14568898 9999999999999999998554332 34455
Q ss_pred CCc-----------------ccccCCcc-cCCCCCCccCCCCceecC
Q psy826 162 KCH-----------------ASCESGCS-TGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 162 ~C~-----------------~~C~~~C~-~~~~~~C~~C~~G~~~~~ 190 (329)
+|. ..|. .|. +.....|..|.+||.+++
T Consensus 834 ~c~c~~n~dp~~~g~c~~~tg~c~-~ci~nT~g~~cd~c~~g~~gd~ 879 (1705)
T KOG1836|consen 834 SCQCNFNVDPNAFGNCNRLTGECL-KCIHNTAGEYCDLCKEGYFGDP 879 (1705)
T ss_pred cceeccccCcccccccccccccee-eccCCcccccccccccCccccc
Confidence 442 1232 344 344568889999999876
No 23
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=97.28 E-value=0.0024 Score=61.69 Aligned_cols=106 Identities=34% Similarity=0.826 Sum_probs=66.2
Q ss_pred CCCcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCC-----CCcccCCCCCCC---CCCCCCce
Q psy826 140 LCNECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKD-----IGCYDINECSDE---NICSGNQF 211 (329)
Q Consensus 140 ~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~-----~~C~d~~eC~~~---~~C~~~~~ 211 (329)
.|..|++|||. ....|.+||.+|. +|.+.....|+.|.+||.+... ..|. .+|... ..|. .
T Consensus 92 ~c~~C~~G~y~-----~~~~C~~C~~~C~-~C~~~~~~~Ct~C~~g~~L~~~~~~~~~tC~--~~c~~~~~~g~C~---~ 160 (397)
T PF03302_consen 92 CCSECPDGYYK-----NGNKCVPCHESCA-TCSGGAPNQCTSCKPGKVLKYGTDGTKGTCG--AECTEDCASGACK---A 160 (397)
T ss_pred cccCCCCCccc-----cCCCCCCCCcccc-ccCCCCCCCCcccCCCcccccccCCCcCccc--cccccccccCCcC---e
Confidence 56899999994 3457999999998 9997777899999999988762 1232 233322 2231 2
Q ss_pred eec-CCCCcccCCCcCC-----CCC-----------Cc--CCcccccccCcccccccCCccccc
Q psy826 212 CVN-TEGSYRCMQCDPS-----CNG-----------CH--GDGPDMCEACAEGYKLQQNICINT 256 (329)
Q Consensus 212 C~n-~~gs~~C~~C~~~-----C~~-----------C~--g~~~~~C~~C~~Gy~~~~~~C~~~ 256 (329)
|.- ..+...|.+|... -.. |. ......|++|..+|+|..+-|...
T Consensus 161 C~~~i~g~~yCs~C~~~~~~PinG~C~~~~~~~~~~C~~~~~~~G~Ct~C~~~yFl~~GGCY~~ 224 (397)
T PF03302_consen 161 CGATIGGTKYCSQCKTGTEVPINGVCTSNTARATATCTDGTASDGTCTSCTDGYFLYKGGCYST 224 (397)
T ss_pred eeeEeCCccccccCCCCCcCCcCCccCCCccccccccccccCCCCcccccccchhhhcCccccc
Confidence 221 2244556666533 122 22 223467888888888765555443
No 24
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.09 E-value=0.0002 Score=44.40 Aligned_cols=31 Identities=42% Similarity=0.885 Sum_probs=25.4
Q ss_pred CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCcccccccCCcc
Q psy826 204 NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNIC 253 (329)
Q Consensus 204 ~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C 253 (329)
+.|++++.|+++.++++|. |++||.+++..|
T Consensus 6 ~~C~~nA~C~~~~~~~~C~-------------------C~~Gy~GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCT-------------------CKPGYEGDGFFC 36 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEE-------------------E-CEEECCSTCE
T ss_pred CCCCCCcEeecCCCCEEeE-------------------CCCCCccCCcCC
Confidence 5788899999999999999 999999998765
No 25
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=97.03 E-value=0.00054 Score=45.53 Aligned_cols=38 Identities=37% Similarity=1.039 Sum_probs=33.5
Q ss_pred CCcCCCCCCcCCcccccccCcccccccCCcccccCcCC
Q psy826 223 QCDPSCNGCHGDGPDMCEACAEGYKLQQNICINTQAKS 260 (329)
Q Consensus 223 ~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~~~~ 260 (329)
+||+.|.+|+++++..|++|+.||.+++++|++.++..
T Consensus 1 ~C~~~C~~C~g~~~~~C~~C~~~~~~~~~~Cv~~C~~~ 38 (49)
T cd00064 1 PCHPSCATCTGPGPDQCTSCRHGFYLDGGTCVSECPEG 38 (49)
T ss_pred CCCCccCCCcCcCcCcCccCcCccCCCCCcccccCCCC
Confidence 58999999999999999999999999899999865443
No 26
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.96 E-value=0.00042 Score=38.67 Aligned_cols=23 Identities=35% Similarity=0.771 Sum_probs=19.1
Q ss_pred CCCCccCCCCceecCCC-CcccCCC
Q psy826 176 PKGCTKCKSGWAADKDI-GCYDINE 199 (329)
Q Consensus 176 ~~~C~~C~~G~~~~~~~-~C~d~~e 199 (329)
+|.| .|++||.+.+++ .|+||||
T Consensus 1 sy~C-~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTC-SCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEe-eCCCCCcCCCCCCccccCCC
Confidence 4789 999999987654 5999986
No 27
>smart00261 FU Furin-like repeats.
Probab=96.95 E-value=0.00085 Score=43.89 Aligned_cols=39 Identities=36% Similarity=1.009 Sum_probs=34.7
Q ss_pred cccCCCcCCCCCCcCCcccccccCcccccccCCcccccC
Q psy826 219 YRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICINTQ 257 (329)
Q Consensus 219 ~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~ 257 (329)
..|.+|++.|..|.++++..|++|..||.+..+.|+..+
T Consensus 2 ~~C~~C~~~C~~C~~~~~~~C~~C~~~~~l~~~~Cv~~C 40 (46)
T smart00261 2 GECKPCHPECATCTGPGPDDCTSCKHGFFLDGGTCVSEC 40 (46)
T ss_pred CccccccccccccCCcCcCcCccCCcccCCCCCCccCcC
Confidence 358899999999999999999999999999888888744
No 28
>KOG1217|consensus
Probab=96.88 E-value=0.0058 Score=59.51 Aligned_cols=133 Identities=33% Similarity=0.727 Sum_probs=84.5
Q ss_pred cCCCCCCCCCC------ccCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCC--CCCcCC
Q psy826 92 CCPVDHYGADC------KPCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKT--ILCSKC 163 (329)
Q Consensus 92 ~C~~G~~G~~C------~~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~--~~C~~C 163 (329)
.|++||.+..+ ..|..... |.+++.|.+ ..+.+.|.|++||.|..|..|... +... ..-.+|
T Consensus 255 ~~~~g~~~~~~~~~~~~~~C~~~~~-c~~~~~C~~---~~~~~~C~C~~g~~g~~~~~~~~~------~~C~~~~~~~~c 324 (487)
T KOG1217|consen 255 RCPEGYTGDACVTCVDVDSCALIAS-CPNGGTCVN---VPGSYRCTCPPGFTGRLCTECVDV------DECSPRNAGGPC 324 (487)
T ss_pred eCCCCccccccceeeeccccCCCCc-cCCCCeeec---CCCcceeeCCCCCCCCCCcccccc------ccccccccCCcC
Confidence 37889987763 35655443 888899997 566699999999999987222211 0000 011224
Q ss_pred cccccCCcc---cCCCCCCccCCCCceecCCCCcccCC-CCCCCCCCCCCceeec-CCCCcccCCCcCCCCCCcCCcccc
Q psy826 164 HASCESGCS---TGGPKGCTKCKSGWAADKDIGCYDIN-ECSDENICSGNQFCVN-TEGSYRCMQCDPSCNGCHGDGPDM 238 (329)
Q Consensus 164 ~~~C~~~C~---~~~~~~C~~C~~G~~~~~~~~C~d~~-eC~~~~~C~~~~~C~n-~~gs~~C~~C~~~C~~C~g~~~~~ 238 (329)
..+ ..|. ..+.+.| .|..||.+ ..|++.+ +|.. ..|..+..|.+ ..+++.|.
T Consensus 325 ~~g--~~C~~~~~~~~~~C-~c~~~~~g---~~C~~~~~~C~~-~~~~~~~~c~~~~~~~~~c~---------------- 381 (487)
T KOG1217|consen 325 ANG--GTCNTLGSFGGFRC-ACGPGFTG---RRCEDSNDECAS-SPCCPGGTCVNETPGSYRCA---------------- 381 (487)
T ss_pred CCC--cccccCCCCCCCCc-CCCCCCCC---CccccCCccccC-CccccCCEeccCCCCCeEec----------------
Confidence 433 2442 3345678 88888544 4577774 8887 34677888998 68899998
Q ss_pred cccCccccccc----CCcccccCcCC
Q psy826 239 CEACAEGYKLQ----QNICINTQAKS 260 (329)
Q Consensus 239 C~~C~~Gy~~~----~~~C~~~~~~~ 260 (329)
|+.+|.+. ...+.+..++.
T Consensus 382 ---~~~~~~~~~~~~~~~~~~~~~c~ 404 (487)
T KOG1217|consen 382 ---CPAGFAGKANGDGVGCEDIDECS 404 (487)
T ss_pred ---CCCccccCCcccccccccccccc
Confidence 88887752 23455555554
No 29
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.56 E-value=0.0009 Score=42.97 Aligned_cols=31 Identities=32% Similarity=0.754 Sum_probs=27.2
Q ss_pred cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCC
Q psy826 104 PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYT 137 (329)
Q Consensus 104 ~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~ 137 (329)
+|....+.|..++.|+| +.|+|.|.|++||.
T Consensus 4 EC~~~~~~C~~~~~C~N---~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 4 ECAEGPHNCPENGTCVN---TEGSYSCSCPPGYE 34 (42)
T ss_dssp TTTTTSSSSSTTSEEEE---ETTEEEEEESTTEE
T ss_pred ccCCCCCcCCCCCEEEc---CCCCEEeeCCCCcE
Confidence 56666678988999999 99999999999997
No 30
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.48 E-value=0.0031 Score=39.20 Aligned_cols=33 Identities=55% Similarity=1.142 Sum_probs=26.7
Q ss_pred cCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccc
Q psy826 196 DINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYK 247 (329)
Q Consensus 196 d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~ 247 (329)
++++|....+|..+..|+++.++|.|. |++||.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~-------------------C~~g~~ 33 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCE-------------------CPPGYT 33 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeE-------------------CCCCCc
Confidence 356776434787778999999999999 999998
No 31
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.41 E-value=0.0043 Score=37.31 Aligned_cols=27 Identities=48% Similarity=1.206 Sum_probs=22.8
Q ss_pred CCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826 110 NVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC 141 (329)
Q Consensus 110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C 141 (329)
..|.++|+|.. . .++|+|.+||+|..|
T Consensus 6 ~~C~~~G~C~~----~-~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVS----P-CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeC----C-CCEEECCCCCcCCCC
Confidence 46999999995 2 589999999999876
No 32
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.41 E-value=0.00082 Score=40.48 Aligned_cols=28 Identities=36% Similarity=0.778 Sum_probs=24.1
Q ss_pred CCCCCCCCcccCCCCCC-CCceeecCCCCCCC
Q psy826 109 PNVCFGNGKCKGNGTRK-GNGQCVCNKEYTGE 139 (329)
Q Consensus 109 ~~~C~~~~~C~~~~~~~-gs~~C~C~~G~~g~ 139 (329)
+++|.++|+|+. .. ++|.|.|++||+|.
T Consensus 3 ~~~C~n~g~C~~---~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCID---LPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEE---ESTSEEEEEEBTTEEST
T ss_pred CCcCCCCeEEEe---CCCCCEEeECCCCCccC
Confidence 357999999997 55 88999999999985
No 33
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.32 E-value=0.0011 Score=41.09 Aligned_cols=29 Identities=28% Similarity=0.700 Sum_probs=22.3
Q ss_pred CCCCCCCCCcccCCCCCCCCceeecCCCCCCC
Q psy826 108 FPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGE 139 (329)
Q Consensus 108 ~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~ 139 (329)
....|..++.|++ +.+++.|.|++||.|+
T Consensus 4 ~~~~C~~nA~C~~---~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 4 NNGGCHPNATCTN---TGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGGGS-TTCEEEE----TTSEEEEE-CEEECC
T ss_pred CCCCCCCCcEeec---CCCCEEeECCCCCccC
Confidence 3456889999998 7889999999999975
No 34
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.31 E-value=0.0052 Score=38.12 Aligned_cols=29 Identities=38% Similarity=0.877 Sum_probs=25.1
Q ss_pred CCCCCCCcccCCCCCCCCceeecCCCCC-CCCC
Q psy826 110 NVCFGNGKCKGNGTRKGNGQCVCNKEYT-GELC 141 (329)
Q Consensus 110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~-g~~C 141 (329)
.+|.+++.|.+ +.++|.|.|.+||. |..|
T Consensus 9 ~~C~~~~~C~~---~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 9 NPCQNGGTCVN---TVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CCcCCCCEeEC---CCCCeEeECCCCCccCCcC
Confidence 56888889998 78899999999999 8776
No 35
>KOG1226|consensus
Probab=96.29 E-value=0.017 Score=58.65 Aligned_cols=87 Identities=34% Similarity=0.885 Sum_probs=49.7
Q ss_pred CcCCCCCCCCCCccCCC----------------CCCCCCCCCcccCCCCCCCCceeecCCCCC----CCCCC----cCCC
Q psy826 91 RCCPVDHYGADCKPCLG----------------FPNVCFGNGKCKGNGTRKGNGQCVCNKEYT----GELCN----ECNT 146 (329)
Q Consensus 91 ~~C~~G~~G~~C~~C~~----------------~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~----g~~C~----~C~~ 146 (329)
+.|.+||+|..|+ |+. ...+|+++|.|.= ++|+|.+... |.+|+ .|+.
T Consensus 480 C~C~~G~~G~~CE-C~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-------GqC~C~~~~~~~i~G~fCECDnfsC~r 551 (783)
T KOG1226|consen 480 CRCDEGWLGKKCE-CSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-------GQCVCHKPDNGKIYGKFCECDNFSCER 551 (783)
T ss_pred eecCCCCCCCccc-CCccccCcHhHHhhccCCCCCCCcCCCCcEeC-------CceEecCCCCCceeeeeeeccCccccc
Confidence 4599999999885 322 1125666677663 6788988766 77772 2221
Q ss_pred CccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCC
Q psy826 147 GYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSD 202 (329)
Q Consensus 147 g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~ 202 (329)
.....|.- |+.|. | .+| .|.+||.|...+-=.+.+.|..
T Consensus 552 --------~~g~lC~g-~G~C~--C-----G~C-vC~~GwtG~~C~C~~std~C~~ 590 (783)
T KOG1226|consen 552 --------HKGVLCGG-HGRCE--C-----GRC-VCNPGWTGSACNCPLSTDTCES 590 (783)
T ss_pred --------ccCcccCC-CCeEe--C-----CcE-EcCCCCccCCCCCCCCCccccC
Confidence 01122222 22222 2 478 9999999987421133444544
No 36
>KOG4052|consensus
Probab=96.26 E-value=0.00015 Score=59.37 Aligned_cols=58 Identities=40% Similarity=0.839 Sum_probs=45.6
Q ss_pred cccccceEEEecccCcccC----------------------------------------------------cccCccchh
Q psy826 26 IYAKSEVRLIEIQEKMCSE----------------------------------------------------VSGFLDQCH 53 (329)
Q Consensus 26 ~y~~se~~~~e~~e~~C~~----------------------------------------------------~~~~~~~c~ 53 (329)
.|.+||.||.|+.|++|.. +..-.-+|.
T Consensus 76 ~Y~rselrl~E~tEniCer~ley~~hker~gs~rfaraqsqt~~tlk~l~~KGvKvdlgip~elwd~psvevt~lk~qce 155 (190)
T KOG4052|consen 76 LYQRSELRLAEITENICERFLEYKIHKERTGSERFARAQSQTIETLKGLVEKGVKVDLGIPFELWDQPSVEVTALKQQCE 155 (190)
T ss_pred eeehhHhHHHHHHHHHHHHHHHHhhhccCcchhHHHHHhhhHHHHHHHHHHcCceeeeCCchHhhcCCcHHHHHHHHHHH
Confidence 7999999999999999980 001245799
Q ss_pred hHHhhhHHHHHHhhhhccCCcccCCCccceeecc
Q psy826 54 NFAADIESEIEEWWFKVQHSKAKDSDLYTWLCIN 87 (329)
Q Consensus 54 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~C~~ 87 (329)
++++++|+.|+.|+++.+. ..++..++|..
T Consensus 156 ~lleeyed~i~ewyf~hq~----e~~l~k~lce~ 185 (190)
T KOG4052|consen 156 SLLEEYEDLIEEWYFNHQS----EEDLFKQLCER 185 (190)
T ss_pred HHHHHHHHHHHHHHhcccc----HHHHHHHHHHh
Confidence 9999999999999999885 23456666653
No 37
>smart00051 DSL delta serrate ligand.
Probab=95.94 E-value=0.0091 Score=41.92 Aligned_cols=44 Identities=30% Similarity=0.678 Sum_probs=32.5
Q ss_pred CcCCCCCCCCCCc-cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826 91 RCCPVDHYGADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC 141 (329)
Q Consensus 91 ~~C~~G~~G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C 141 (329)
+.|+++|||..|. .|... +.+.++..|.. .+.|+|.+||+|..|
T Consensus 19 v~C~~~~yG~~C~~~C~~~-~d~~~~~~Cd~------~G~~~C~~Gw~G~~C 63 (63)
T smart00051 19 VTCDENYYGEGCNKFCRPR-DDFFGHYTCDE------NGNKGCLEGWMGPYC 63 (63)
T ss_pred eeCCCCCcCCccCCEeCcC-ccccCCccCCc------CCCEecCCCCcCCCC
Confidence 4499999999997 55431 23566777852 378999999999876
No 38
>KOG1025|consensus
Probab=95.92 E-value=0.0064 Score=62.39 Aligned_cols=83 Identities=29% Similarity=0.716 Sum_probs=56.2
Q ss_pred Cccccc-CCcccCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCccccccc
Q psy826 163 CHASCE-SGCSTGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEA 241 (329)
Q Consensus 163 C~~~C~-~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~ 241 (329)
|+..|. ..|+++++.+|.+|+.--.+ +.|++ .|... .+.. +-..+..|..||+.|.+|+|+++.+|..
T Consensus 489 C~~lCs~~GCWGpgp~qClsCrn~~rg---g~CVe--~C~~l----~g~~--rf~~~~~C~~ChPEC~TCnG~G~d~C~~ 557 (1177)
T KOG1025|consen 489 CDKLCSSGGCWGPGPDQCLSCRNFSRG---GTCVE--KCNLL----GGEP--RFVNSRECERCHPECETCNGPGADNCLQ 557 (1177)
T ss_pred hhhhcCCCCCcCCCCccceeccccccC---ceehh--hcccc----CCcc--cccccceecccChhhccCCCCCccchhh
Confidence 555553 36889999999888764333 34532 23221 1111 1244678999999999999999999999
Q ss_pred CcccccccCCcccccCc
Q psy826 242 CAEGYKLQQNICINTQA 258 (329)
Q Consensus 242 C~~Gy~~~~~~C~~~~~ 258 (329)
|+. ..++-.|+..++
T Consensus 558 CaH--f~dgp~CV~~CP 572 (1177)
T KOG1025|consen 558 CAH--FRDGPHCVSDCP 572 (1177)
T ss_pred hhh--cCCCcchhccCC
Confidence 986 556667876655
No 39
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=95.84 E-value=0.012 Score=39.27 Aligned_cols=27 Identities=41% Similarity=1.051 Sum_probs=24.0
Q ss_pred CCCceeecCCCCCCCCCCcCCCCcccc
Q psy826 125 KGNGQCVCNKEYTGELCNECNTGYFQS 151 (329)
Q Consensus 125 ~gs~~C~C~~G~~g~~C~~C~~g~y~~ 151 (329)
..+++|.|++++.|..|+.|++|||+.
T Consensus 16 ~~~G~C~C~~~~~G~~C~~C~~g~~~~ 42 (50)
T cd00055 16 PGTGQCECKPNTTGRRCDRCAPGYYGL 42 (50)
T ss_pred CCCCEEeCCCcCCCCCCCCCCCCCccC
Confidence 356899999999999999999999963
No 40
>KOG0196|consensus
Probab=95.78 E-value=0.012 Score=60.20 Aligned_cols=60 Identities=30% Similarity=0.797 Sum_probs=46.5
Q ss_pred CCceeecCCCCC----CCCCCcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecC
Q psy826 126 GNGQCVCNKEYT----GELCNECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 126 gs~~C~C~~G~~----g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~ 190 (329)
..+.|.|++||. |..|..|+.|+|- .......|.+|... ..-...+...| .|..||+..+
T Consensus 257 piG~C~C~aGye~~~~~~~C~aCp~G~yK--~~~~~~~C~~CP~~--S~s~~ega~~C-~C~~gyyRA~ 320 (996)
T KOG0196|consen 257 PIGGCVCKAGYEEAENGKACQACPPGTYK--ASQGDSLCLPCPPN--SHSSSEGATSC-TCENGYYRAD 320 (996)
T ss_pred EcCceeecCCCCcccCCCcceeCCCCccc--CCCCCCCCCCCCCC--CCCCCCCCCcc-cccCCcccCC
Confidence 457899999997 6799999999994 23345678998875 22236778999 9999999766
No 41
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.76 E-value=0.013 Score=35.23 Aligned_cols=30 Identities=40% Similarity=0.956 Sum_probs=24.9
Q ss_pred CCCCCCCCcccCCCCCCCCceeecCCCCCCC-CC
Q psy826 109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTGE-LC 141 (329)
Q Consensus 109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~-~C 141 (329)
..+|.+++.|.+ ..+.+.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~---~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVN---TPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEec---CCCCeEeECCCCCcccCCc
Confidence 356888899998 6788999999999987 54
No 42
>KOG1025|consensus
Probab=95.70 E-value=0.024 Score=58.44 Aligned_cols=105 Identities=27% Similarity=0.632 Sum_probs=65.0
Q ss_pred CCCceee-cCCCCCCCCC-CcCCCCccccccCcCCCCCcCCcccccCCcccCCCCCCccCCCCceecCCCCcccCCCCCC
Q psy826 125 KGNGQCV-CNKEYTGELC-NECNTGYFQSYKDEKTILCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCYDINECSD 202 (329)
Q Consensus 125 ~gs~~C~-C~~G~~g~~C-~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~ 202 (329)
+|..+|. |..--+|..| +.|..--. ...-.....|..||+.|. +|+++|...|..|..- .++ ..| +.
T Consensus 501 pgp~qClsCrn~~rgg~CVe~C~~l~g-~~rf~~~~~C~~ChPEC~-TCnG~G~d~C~~CaHf--~dg-p~C------V~ 569 (1177)
T KOG1025|consen 501 PGPDQCLSCRNFSRGGTCVEKCNLLGG-EPRFVNSRECERCHPECE-TCNGPGADNCLQCAHF--RDG-PHC------VS 569 (1177)
T ss_pred CCCccceeccccccCceehhhccccCC-cccccccceecccChhhc-cCCCCCccchhhhhhc--CCC-cch------hc
Confidence 4555664 5554446666 55543100 001135567999999998 9999999999888762 121 223 33
Q ss_pred CCCCCCCce-----eecC--CCCcccCCCcCCC-CCCcCCcccccccC
Q psy826 203 ENICSGNQF-----CVNT--EGSYRCMQCDPSC-NGCHGDGPDMCEAC 242 (329)
Q Consensus 203 ~~~C~~~~~-----C~n~--~gs~~C~~C~~~C-~~C~g~~~~~C~~C 242 (329)
.|+++-. -++. .-...|.+||+.| -+|.|+++..|..|
T Consensus 570 --~CP~G~~G~~s~ii~kypd~~~~C~~Ch~nCth~c~~~~~~~c~~~ 615 (1177)
T KOG1025|consen 570 --DCPDGVTGPKSTIIWKYPDPQIECRPCHPNCTHGCLGPTSPGCVLN 615 (1177)
T ss_pred --cCCCcccCCCCceeeecCCCccccccCCCcccccccCCCCCCcccC
Confidence 4433211 1111 1234599999999 58999999999987
No 43
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.66 E-value=0.0052 Score=34.29 Aligned_cols=17 Identities=29% Similarity=0.788 Sum_probs=14.5
Q ss_pred Cccccccc--CCcccccCc
Q psy826 242 CAEGYKLQ--QNICINTQA 258 (329)
Q Consensus 242 C~~Gy~~~--~~~C~~~~~ 258 (329)
|++||.+. +..|+|++|
T Consensus 6 C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 6 CPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CCCCCcCCCCCCccccCCC
Confidence 99999964 678999986
No 44
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.57 E-value=0.017 Score=35.24 Aligned_cols=29 Identities=38% Similarity=0.887 Sum_probs=24.6
Q ss_pred CCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826 110 NVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC 141 (329)
Q Consensus 110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C 141 (329)
.+|.+++.|.+ ..+.+.|.|.+||.|..|
T Consensus 9 ~~C~~~~~C~~---~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 9 NPCQNGGTCVN---TVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred CCcCCCCEeEC---CCCCeEeECCCCCcCCcC
Confidence 45877889997 788899999999999766
No 45
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=95.55 E-value=0.0061 Score=40.42 Aligned_cols=26 Identities=46% Similarity=1.128 Sum_probs=21.8
Q ss_pred CCCceeecCCCCCCCCCCcCCCCccc
Q psy826 125 KGNGQCVCNKEYTGELCNECNTGYFQ 150 (329)
Q Consensus 125 ~gs~~C~C~~G~~g~~C~~C~~g~y~ 150 (329)
..+++|.|+++|.|..|+.|.++||+
T Consensus 15 ~~~G~C~C~~~~~G~~C~~C~~g~~~ 40 (49)
T PF00053_consen 15 PSTGQCVCKPGTTGPRCDQCKPGYFG 40 (49)
T ss_dssp ETCEEESBSTTEESTTS-EE-TTEEC
T ss_pred CCCCEEeccccccCCcCcCCCCcccc
Confidence 35689999999999999999999996
No 46
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=95.44 E-value=0.013 Score=38.32 Aligned_cols=25 Identities=40% Similarity=1.095 Sum_probs=23.1
Q ss_pred CCceeecCCCCCCCCCCcCCCCccc
Q psy826 126 GNGQCVCNKEYTGELCNECNTGYFQ 150 (329)
Q Consensus 126 gs~~C~C~~G~~g~~C~~C~~g~y~ 150 (329)
.+++|.|++++.|..|+.|++|||+
T Consensus 16 ~~G~C~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 16 DTGQCECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred CCCEEECCCCCCCCCCCcCCCCcCC
Confidence 3689999999999999999999996
No 47
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.39 E-value=0.019 Score=34.72 Aligned_cols=28 Identities=32% Similarity=0.807 Sum_probs=23.3
Q ss_pred CCCCCCCcccCCCCCCCCceeecCCCCCC-CCC
Q psy826 110 NVCFGNGKCKGNGTRKGNGQCVCNKEYTG-ELC 141 (329)
Q Consensus 110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~g-~~C 141 (329)
.+|.++ .|.+ ..+++.|.|.+||.| ..|
T Consensus 6 ~~C~~~-~C~~---~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCIN---TPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEEC---CCCCeEeECCCCCccCCcc
Confidence 468777 8987 678999999999999 655
No 48
>smart00261 FU Furin-like repeats.
Probab=95.06 E-value=0.032 Score=36.30 Aligned_cols=35 Identities=40% Similarity=1.003 Sum_probs=28.5
Q ss_pred CCcCCcccccCCcccCCCCCCccCCCCceecCCCCcc
Q psy826 159 LCSKCHASCESGCSTGGPKGCTKCKSGWAADKDIGCY 195 (329)
Q Consensus 159 ~C~~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~ 195 (329)
.|.+|+..|. +|.+++...|+.|..||++.. +.|+
T Consensus 3 ~C~~C~~~C~-~C~~~~~~~C~~C~~~~~l~~-~~Cv 37 (46)
T smart00261 3 ECKPCHPECA-TCTGPGPDDCTSCKHGFFLDG-GTCV 37 (46)
T ss_pred cccccccccc-ccCCcCcCcCccCCcccCCCC-CCcc
Confidence 5788999998 899877889999999998765 4443
No 49
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.03 E-value=0.01 Score=36.70 Aligned_cols=28 Identities=29% Similarity=0.685 Sum_probs=21.2
Q ss_pred cccccCCcc-cCCCCCCccCCCCceecCCC
Q psy826 164 HASCESGCS-TGGPKGCTKCKSGWAADKDI 192 (329)
Q Consensus 164 ~~~C~~~C~-~~~~~~C~~C~~G~~~~~~~ 192 (329)
+++|.+.|. .++++.| .|++||.+..++
T Consensus 5 NGgC~h~C~~~~g~~~C-~C~~Gy~L~~D~ 33 (36)
T PF14670_consen 5 NGGCSHICVNTPGSYRC-SCPPGYKLAEDG 33 (36)
T ss_dssp GGGSSSEEEEETTSEEE-E-STTEEE-TTS
T ss_pred CCCcCCCCccCCCceEe-ECCCCCEECcCC
Confidence 456777787 7889999 999999998754
No 50
>KOG3512|consensus
Probab=95.01 E-value=0.062 Score=51.67 Aligned_cols=76 Identities=22% Similarity=0.632 Sum_probs=46.8
Q ss_pred CCCC-CcccCCCCCCCCceeecCCCCCCCCCCcCCCCcccccc----CcCCCCCcCCc----------------------
Q psy826 112 CFGN-GKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYK----DEKTILCSKCH---------------------- 164 (329)
Q Consensus 112 C~~~-~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~----~~~~~~C~~C~---------------------- 164 (329)
|..+ +.|+-+ ..+..+|.|.-+.+|..|..|.+-||.... ......|.+|.
T Consensus 280 CNgHAs~Cv~d--~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~S 357 (592)
T KOG3512|consen 280 CNGHASRCVMD--ESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRS 357 (592)
T ss_pred ecCccceeeec--cCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccc
Confidence 5544 347641 234489999999999999999998875221 11223344321
Q ss_pred -ccccCCcc-cCCCCCCccCCCCceecC
Q psy826 165 -ASCESGCS-TGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 165 -~~C~~~C~-~~~~~~C~~C~~G~~~~~ 190 (329)
..|. .|. +..+..|..|.+||+-+.
T Consensus 358 ggvCl-nCrHnTaGrhChyCreGyyRd~ 384 (592)
T KOG3512|consen 358 GGVCL-NCRHNTAGRHCHYCREGYYRDG 384 (592)
T ss_pred cceEe-ecccCCCCcccccccCccccCC
Confidence 1122 343 334567888999998776
No 51
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.54 E-value=0.015 Score=27.54 Aligned_cols=13 Identities=38% Similarity=1.035 Sum_probs=8.7
Q ss_pred eeecCCCCCCCCC
Q psy826 129 QCVCNKEYTGELC 141 (329)
Q Consensus 129 ~C~C~~G~~g~~C 141 (329)
.|+|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4778888887765
No 52
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.26 E-value=0.018 Score=35.58 Aligned_cols=24 Identities=63% Similarity=1.316 Sum_probs=19.5
Q ss_pred CCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 205 ICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 205 ~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
.| .++|++++++|+|. |++||.+.
T Consensus 7 gC--~h~C~~~~g~~~C~-------------------C~~Gy~L~ 30 (36)
T PF14670_consen 7 GC--SHICVNTPGSYRCS-------------------CPPGYKLA 30 (36)
T ss_dssp GS--SSEEEEETTSEEEE--------------------STTEEE-
T ss_pred Cc--CCCCccCCCceEeE-------------------CCCCCEEC
Confidence 45 56999999999999 99999986
No 53
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=94.06 E-value=0.087 Score=34.72 Aligned_cols=32 Identities=41% Similarity=1.011 Sum_probs=24.9
Q ss_pred CCcccccCCcccCCCCCCccCCCCceecCCCCcc
Q psy826 162 KCHASCESGCSTGGPKGCTKCKSGWAADKDIGCY 195 (329)
Q Consensus 162 ~C~~~C~~~C~~~~~~~C~~C~~G~~~~~~~~C~ 195 (329)
+|+..|. +|.+++...|+.|..||.+.. +.|+
T Consensus 1 ~C~~~C~-~C~g~~~~~C~~C~~~~~~~~-~~Cv 32 (49)
T cd00064 1 PCHPSCA-TCTGPGPDQCTSCRHGFYLDG-GTCV 32 (49)
T ss_pred CCCCccC-CCcCcCcCcCccCcCccCCCC-Cccc
Confidence 3677887 898887789999999998654 4454
No 54
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=94.03 E-value=0.056 Score=32.85 Aligned_cols=34 Identities=50% Similarity=1.040 Sum_probs=26.2
Q ss_pred CCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 197 INECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 197 ~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
+++|....+|..++.|.+..++|.|. |++||.+.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~-------------------C~~g~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCS-------------------CPPGYTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeE-------------------CCCCCcCC
Confidence 45665423687778999999999999 99999873
No 55
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=93.94 E-value=0.024 Score=34.01 Aligned_cols=27 Identities=33% Similarity=0.894 Sum_probs=24.0
Q ss_pred CCCCCCceeecCC-CCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 204 NICSGNQFCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 204 ~~C~~~~~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
.+|.+++.|+... ++|.|. |++||.|.
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~-------------------C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCE-------------------CPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEE-------------------EBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeE-------------------CCCCCccC
Confidence 4788899999998 899999 99999875
No 56
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=93.68 E-value=0.0094 Score=50.51 Aligned_cols=122 Identities=26% Similarity=0.665 Sum_probs=72.3
Q ss_pred eeecccccCcCCCCCC---CCCCc---cCCC---CCCCCCCCCcccCCC--CCCCCceeecCCCCC--CCCC--CcCCCC
Q psy826 83 WLCINKLKRCCPVDHY---GADCK---PCLG---FPNVCFGNGKCKGNG--TRKGNGQCVCNKEYT--GELC--NECNTG 147 (329)
Q Consensus 83 ~~C~~~~~~~C~~G~~---G~~C~---~C~~---~~~~C~~~~~C~~~~--~~~gs~~C~C~~G~~--g~~C--~~C~~g 147 (329)
+.|. |.+||. -.+|+ .|.. .-.+|..-+.|.+.. .....+.|.|.+||. ...| ..|..
T Consensus 20 fEC~------Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~- 92 (197)
T PF06247_consen 20 FECK------CNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNN- 92 (197)
T ss_dssp EEEE------ESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS-
T ss_pred eEEE------cCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCc-
Confidence 5677 999997 56776 4643 236788888998633 235679999999997 3345 33332
Q ss_pred ccccccCcCCCCCcCCcccccCCcc----cCCCCCCccCCCCceecCCCCccc--CCCCCCCCCCCCCceeecCCCCccc
Q psy826 148 YFQSYKDEKTILCSKCHASCESGCS----TGGPKGCTKCKSGWAADKDIGCYD--INECSDENICSGNQFCVNTEGSYRC 221 (329)
Q Consensus 148 ~y~~~~~~~~~~C~~C~~~C~~~C~----~~~~~~C~~C~~G~~~~~~~~C~d--~~eC~~~~~C~~~~~C~n~~gs~~C 221 (329)
..|.. +.|. .+....| +|.-|+...+.++|.- ..+|.. .|..+..|....+-|+|
T Consensus 93 -------------~~Cg~---GKCI~d~~~~~~~~C-SC~IGkV~~dn~kCtk~G~T~C~L--KCk~nE~CK~~~~~Y~C 153 (197)
T PF06247_consen 93 -------------KDCGS---GKCILDPDNPNNPTC-SCNIGKVPDDNKKCTKTGETKCSL--KCKENEECKLVDGYYKC 153 (197)
T ss_dssp ----------------TT---EEEEEEEGGGSEEEE-EE-TEEETTTTTESEEEE----------TTTEEEEEETTEEEE
T ss_pred -------------eecCC---CeEEecCCCCCCcee-EeeeceEeccCCcccCCCccceee--ecCCCcceeeeCcEEEe
Confidence 11221 2343 2333489 9999999555455752 234766 78889999999999999
Q ss_pred CCCcCCCCCCcCCcccccccCccccccc
Q psy826 222 MQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 222 ~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
. |.+|+.+.
T Consensus 154 ~-------------------~~~~~~~~ 162 (197)
T PF06247_consen 154 V-------------------CKEGFPGD 162 (197)
T ss_dssp E-------------------E-TT-EEE
T ss_pred e-------------------cCCCCCCC
Confidence 9 99998876
No 57
>PF00594 Gla: Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain; InterPro: IPR000294 The GLA (gamma-carboxyglutamic acid-rich) domain contains glutamate residues that have been post-translationally modified by vitamin K-dependent carboxylation to form gamma-carboxyglutamate (Gla) [, , ]. All glutamic acid (Glu) residues present in the GLA domain are potential carboxylation sites; in coagulation proteins, all Gu residues are modified to Gla, while in osteocalcin and matrix Gla proteins only some Glu residues are modified to Gla. The GLA domain is responsible for the high-affinity binding of calcium ions. It starts at the N-terminal extremity of the mature form of proteins and ends with a conserved aromatic residue; a conserved Gla-x(3)-Gla-x-Cys motif [] is found in the middle of the domain which seems to be important for substrate recognition by the carboxylase. The 3D structure of the GLA domain has been solved [, ]. Calcium ions induce conformational changes in the GLA domain that and are necessary for the proper folding of the GLA domain. A common structural feature of functional GLA domains is the clustering of N-terminal hydrophobic residues into a hydrophobic patch that mediates interaction with the cell surface membrane []. Proteins known to contain a GLA domain include []: Coagulation factor X [] Coagulation factor VII [] Coagulation factor IX [] Coagulation factor XIV (vitamin K-dependent protein C) [] Vitamin K-dependent protein S [] Vitamin K-dependent protein Z [] Prothrombin Transthyretin Osteocalcin (also known as bone-Gla protein, BGP) Matrix Gla protein (MGP) [] Inter-alpha-trypsin inhibitor heavy chain H2 Growth arrest-specific protein 6 (Gas-6) [] ; GO: 0005509 calcium ion binding, 0005576 extracellular region; PDB: 1Q3M_A 1VZM_C 2PF1_A 1NL2_A 1NL1_A 2SPT_A 2PF2_A 2ZP0_L 1W0Y_L 2AEI_L ....
Probab=93.41 E-value=0.019 Score=36.76 Aligned_cols=39 Identities=26% Similarity=0.455 Sum_probs=32.4
Q ss_pred hhhhccCcccccceEEEecccCcccCcccCccchhhHHhhhHHHHHHhhhh
Q psy826 19 WEEEKQKIYAKSEVRLIEIQEKMCSEVSGFLDQCHNFAADIESEIEEWWFK 69 (329)
Q Consensus 19 ~ee~~~~~y~~se~~~~e~~e~~C~~~~~~~~~c~~~~~~~~~~~e~~~~~ 69 (329)
|||.+.|+.++ ||++++|+ +.+..++++....+. .+|..
T Consensus 1 ~ee~~~g~ler------Ec~eE~C~-----~eea~E~f~~~~~t~-~fw~~ 39 (42)
T PF00594_consen 1 LEELKPGNLER------ECMEEVCE-----YEEAREIFEDTEGTN-AFWKK 39 (42)
T ss_dssp TGGGSCHHHHH------HHTSSTHT-----HHHHHHHHHSHHHHH-HHHHH
T ss_pred CCccccCCchH------HHHHhhcC-----hHHHHHHHHChHhHH-HHHHH
Confidence 79999999999 99999999 888888888876654 55543
No 58
>KOG1226|consensus
Probab=93.41 E-value=0.16 Score=51.88 Aligned_cols=38 Identities=45% Similarity=1.110 Sum_probs=29.4
Q ss_pred CCCCCc----cCCCC-CCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826 98 YGADCK----PCLGF-PNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN 142 (329)
Q Consensus 98 ~G~~C~----~C~~~-~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~ 142 (329)
||+.|+ .|+.. ...|.++|.|.= ++|+|.+||+|..|+
T Consensus 538 ~G~fCECDnfsC~r~~g~lC~g~G~C~C-------G~CvC~~GwtG~~C~ 580 (783)
T KOG1226|consen 538 YGKFCECDNFSCERHKGVLCGGHGRCEC-------GRCVCNPGWTGSACN 580 (783)
T ss_pred eeeeeeccCcccccccCcccCCCCeEeC-------CcEEcCCCCccCCCC
Confidence 488885 45542 256888888874 789999999999994
No 59
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=92.75 E-value=0.13 Score=30.65 Aligned_cols=27 Identities=56% Similarity=1.109 Sum_probs=23.2
Q ss_pred CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 204 NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 204 ~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
..|.++..|++..+.+.|. |+.||.+.
T Consensus 6 ~~C~~~~~C~~~~~~~~C~-------------------C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCV-------------------CPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCCeEeE-------------------CCCCCccc
Confidence 4677778999999999999 99999876
No 60
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.74 E-value=0.04 Score=34.04 Aligned_cols=32 Identities=34% Similarity=0.746 Sum_probs=24.5
Q ss_pred CCCCCCceeecCC-CCcccCCCcCCCCCCcCCcccccccCcccccccCCccc
Q psy826 204 NICSGNQFCVNTE-GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICI 254 (329)
Q Consensus 204 ~~C~~~~~C~n~~-gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~ 254 (329)
..|+.++.|++.. |+..|. |..||..++..|+
T Consensus 5 ~~cP~NA~C~~~~dG~eecr-------------------Cllgyk~~~~~C~ 37 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECR-------------------CLLGYKKVGGKCV 37 (37)
T ss_dssp S---TTEEEEEETTSEEEEE-------------------E-TTEEEETTEEE
T ss_pred ccCCCCcccEEcCCCCEEEE-------------------eeCCccccCCCcC
Confidence 4788999999877 999999 9999999888774
No 61
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=92.48 E-value=0.15 Score=38.96 Aligned_cols=27 Identities=26% Similarity=0.914 Sum_probs=19.4
Q ss_pred CCCCCcCCcccccccCcccccccCCcccc
Q psy826 227 SCNGCHGDGPDMCEACAEGYKLQQNICIN 255 (329)
Q Consensus 227 ~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~ 255 (329)
.|..|.+ +..|..|..||.++++.|+.
T Consensus 29 ~C~~C~~--~~~C~~C~~GY~~~~~~Cv~ 55 (96)
T PTZ00382 29 NCKSCVV--DGVCGECNSGFSLDNGKCVS 55 (96)
T ss_pred CCcCCCC--CCccccCcCCcccCCCcccc
Confidence 4556644 34688899999998877764
No 62
>smart00181 EGF Epidermal growth factor-like domain.
Probab=92.42 E-value=0.15 Score=30.71 Aligned_cols=25 Identities=48% Similarity=1.051 Sum_probs=21.7
Q ss_pred CCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 205 ICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 205 ~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
.|..+ .|++..+++.|. |++||.+.
T Consensus 7 ~C~~~-~C~~~~~~~~C~-------------------C~~g~~g~ 31 (35)
T smart00181 7 PCSNG-TCINTPGSYTCS-------------------CPPGYTGD 31 (35)
T ss_pred CCCCC-EEECCCCCeEeE-------------------CCCCCccC
Confidence 67667 899999999999 99999885
No 63
>smart00069 GLA Domain containing Gla (gamma-carboxyglutamate) residues. A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.
Probab=92.22 E-value=0.094 Score=37.01 Aligned_cols=38 Identities=29% Similarity=0.387 Sum_probs=32.3
Q ss_pred hhhhhccCcccccceEEEecccCcccCcccCccchhhHHhhhHHHHHHh
Q psy826 18 AWEEEKQKIYAKSEVRLIEIQEKMCSEVSGFLDQCHNFAADIESEIEEW 66 (329)
Q Consensus 18 ~~ee~~~~~y~~se~~~~e~~e~~C~~~~~~~~~c~~~~~~~~~~~e~~ 66 (329)
++||.+.|+.++ |++|++|. +++..+++++.+.+.+.|
T Consensus 24 ~~Ee~~~~~ler------Ec~eE~C~-----~eea~E~f~~~~~t~~fw 61 (65)
T smart00069 24 LLEELRPGNLER------ECQEEICS-----LEEAREVFEDNEGTDEFY 61 (65)
T ss_pred chhhhccCccch------HHHhhhcc-----CcHHHHHHHchHhHHHHH
Confidence 489999999998 99999999 888888888887775544
No 64
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=91.72 E-value=0.033 Score=47.26 Aligned_cols=61 Identities=30% Similarity=0.788 Sum_probs=44.4
Q ss_pred CCCCCccCCCCceecCCCCcccCCCCCCC----CCCCCCceeecCC-----CCcccCCCcCCCCCCcCCcccccccCccc
Q psy826 175 GPKGCTKCKSGWAADKDIGCYDINECSDE----NICSGNQFCVNTE-----GSYRCMQCDPSCNGCHGDGPDMCEACAEG 245 (329)
Q Consensus 175 ~~~~C~~C~~G~~~~~~~~C~d~~eC~~~----~~C~~~~~C~n~~-----gs~~C~~C~~~C~~C~g~~~~~C~~C~~G 245 (329)
..+.| .|.+||.+....+|....+|... .+|.+-+.|++.. ..|+|. |.+|
T Consensus 18 NHfEC-~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~-------------------C~~g 77 (197)
T PF06247_consen 18 NHFEC-KCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCD-------------------CING 77 (197)
T ss_dssp SEEEE-EESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEE-------------------E-TT
T ss_pred CceEE-EcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEe-------------------cccC
Confidence 34789 99999999887889988888762 6788888998765 468888 9999
Q ss_pred ccccCCcccc
Q psy826 246 YKLQQNICIN 255 (329)
Q Consensus 246 y~~~~~~C~~ 255 (329)
|.+..+.|++
T Consensus 78 Y~~~~~vCvp 87 (197)
T PF06247_consen 78 YILKQGVCVP 87 (197)
T ss_dssp EEESSSSEEE
T ss_pred ceeeCCeEch
Confidence 9988777763
No 65
>KOG3512|consensus
Probab=90.01 E-value=0.84 Score=44.18 Aligned_cols=65 Identities=23% Similarity=0.634 Sum_probs=39.5
Q ss_pred CCceee-cCCCCCCCCCCcCCCCccccccC--cCCCCCcCCc--cc--ccCCcccCCCCCCccCCCCceecCCC
Q psy826 126 GNGQCV-CNKEYTGELCNECNTGYFQSYKD--EKTILCSKCH--AS--CESGCSTGGPKGCTKCKSGWAADKDI 192 (329)
Q Consensus 126 gs~~C~-C~~G~~g~~C~~C~~g~y~~~~~--~~~~~C~~C~--~~--C~~~C~~~~~~~C~~C~~G~~~~~~~ 192 (329)
..++|. |.-...|..|..|.+|||.+... .....|..|+ +. =..+|+ ..+.+| .|++|-.|...+
T Consensus 357 SggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCN-q~tGqC-pCkeGvtG~tCn 428 (592)
T KOG3512|consen 357 SGGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCN-QTTGQC-PCKEGVTGLTCN 428 (592)
T ss_pred ccceEeecccCCCCcccccccCccccCCCCCCchhhhhhhcCCccccccccccc-ccCCcc-cCCCCCcccccc
Confidence 345775 88888899999999999953322 1234566542 21 002455 123567 777777665543
No 66
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=88.67 E-value=0.74 Score=35.18 Aligned_cols=45 Identities=24% Similarity=0.750 Sum_probs=28.4
Q ss_pred CCCcCCCCccccccCcCCCCCcCCc-ccccCCcccCCCCCCccCCCCceecC
Q psy826 140 LCNECNTGYFQSYKDEKTILCSKCH-ASCESGCSTGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 140 ~C~~C~~g~y~~~~~~~~~~C~~C~-~~C~~~C~~~~~~~C~~C~~G~~~~~ 190 (329)
.|+.|.+||| .+.....|.+|. .+|. +|.. +..|..|..||.+..
T Consensus 5 ~Ct~C~~g~~---~~~~~~~C~~C~~~~C~-~C~~--~~~C~~C~~GY~~~~ 50 (96)
T PTZ00382 5 VCTSCDSDKK---PNKDGSGCVLCSVGNCK-SCVV--DGVCGECNSGFSLDN 50 (96)
T ss_pred ccCcCCCCCc---cCCCCCcCCcCCCCCCc-CCCC--CCccccCcCCcccCC
Confidence 4667777776 222334577776 4676 6753 246878888888754
No 67
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=86.61 E-value=0.15 Score=35.77 Aligned_cols=44 Identities=34% Similarity=0.766 Sum_probs=20.0
Q ss_pred CcCCCCCCCCCCc-cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCC
Q psy826 91 RCCPVDHYGADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELC 141 (329)
Q Consensus 91 ~~C~~G~~G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C 141 (329)
+.|.+.|||+.|. .|.+... -..+-+|.. .+.-+|.+||+|..|
T Consensus 19 v~C~~nyyG~~C~~~C~~~~d-~~ghy~Cd~------~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 19 VVCDENYYGPNCSKFCKPRDD-SFGHYTCDS------NGNKVCLPGWTGPNC 63 (63)
T ss_dssp ----TTEETTTT-EE---EEE-TTEEEEE-S------S--EEE-TTEESTTS
T ss_pred EECCCCCCCccccCCcCCCcC-CcCCcccCC------CCCCCCCCCCcCCCC
Confidence 3499999999998 5543111 122334542 244568899988766
No 68
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.83 E-value=0.5 Score=41.71 Aligned_cols=37 Identities=41% Similarity=0.910 Sum_probs=30.4
Q ss_pred CCcccCCCCCCC-CCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCccccccc
Q psy826 192 IGCYDINECSDE-NICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQ 249 (329)
Q Consensus 192 ~~C~d~~eC~~~-~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~ 249 (329)
..|.++++|... +.|. +.|.++.|+|.|. |.+||.+.
T Consensus 182 ~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~-------------------c~~g~~~~ 219 (224)
T cd01475 182 KICVVPDLCATLSHVCQ--QVCISTPGSYLCA-------------------CTEGYALL 219 (224)
T ss_pred ccCcCchhhcCCCCCcc--ceEEcCCCCEEeE-------------------CCCCccCC
Confidence 348888889765 5674 5899999999999 99999875
No 69
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=84.27 E-value=1.1 Score=27.12 Aligned_cols=22 Identities=27% Similarity=0.768 Sum_probs=15.6
Q ss_pred CcccCCCCCCccCCCCceecCCC
Q psy826 170 GCSTGGPKGCTKCKSGWAADKDI 192 (329)
Q Consensus 170 ~C~~~~~~~C~~C~~G~~~~~~~ 192 (329)
.|.-.....| .|++||.++.+.
T Consensus 11 ~CDpn~~~~C-~CPeGyIlde~~ 32 (34)
T PF09064_consen 11 DCDPNSPGQC-FCPEGYILDEGS 32 (34)
T ss_pred ccCCCCCCce-eCCCceEecCCc
Confidence 3442344699 999999998743
No 70
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=83.90 E-value=3.3 Score=31.57 Aligned_cols=20 Identities=25% Similarity=0.599 Sum_probs=13.5
Q ss_pred CcccCCCCCCccCCCCceecC
Q psy826 170 GCSTGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 170 ~C~~~~~~~C~~C~~G~~~~~ 190 (329)
.|+......| .|.+||+...
T Consensus 68 ~ct~t~dt~C-~C~~G~y~~~ 87 (98)
T cd00185 68 PCTATRNTVC-GCKPGFYCLT 87 (98)
T ss_pred cCCCCCCCeE-eCCCCCEecC
Confidence 3443445788 7888888765
No 71
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=83.74 E-value=4.2 Score=30.98 Aligned_cols=18 Identities=17% Similarity=0.534 Sum_probs=11.7
Q ss_pred CCcccccccCcccccccCC
Q psy826 233 GDGPDMCEACAEGYKLQQN 251 (329)
Q Consensus 233 g~~~~~C~~C~~Gy~~~~~ 251 (329)
......|. |++||+....
T Consensus 71 ~t~dt~C~-C~~G~y~~~~ 88 (98)
T cd00185 71 ATRNTVCG-CKPGFYCLTK 88 (98)
T ss_pred CCCCCeEe-CCCCCEecCC
Confidence 34445555 9999987653
No 72
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=83.22 E-value=0.14 Score=40.61 Aligned_cols=32 Identities=22% Similarity=0.592 Sum_probs=21.5
Q ss_pred CCCCCCceeecCC--CCcccCCCcCCCCCCcCCcccccccCcccccccCCcccccC
Q psy826 204 NICSGNQFCVNTE--GSYRCMQCDPSCNGCHGDGPDMCEACAEGYKLQQNICINTQ 257 (329)
Q Consensus 204 ~~C~~~~~C~n~~--gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~ 257 (329)
+.|-+| .|...+ ..+.|. |..||.|. +|+..+
T Consensus 51 ~YClHG-~C~yI~dl~~~~Cr-------------------C~~GYtGe--RCEh~d 84 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCR-------------------CSHGYTGI--RCQHVV 84 (139)
T ss_pred CEeECC-EEEeeccCCCceeE-------------------CCCCcccc--ccccee
Confidence 345544 565444 577888 99999987 776544
No 73
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=81.98 E-value=2.1 Score=28.34 Aligned_cols=21 Identities=33% Similarity=0.998 Sum_probs=15.8
Q ss_pred CCCCCCCcccCCCCCCCCceeecCCCCC
Q psy826 110 NVCFGNGKCKGNGTRKGNGQCVCNKEYT 137 (329)
Q Consensus 110 ~~C~~~~~C~~~~~~~gs~~C~C~~G~~ 137 (329)
..|..++.|.+ ++|.|++||.
T Consensus 26 ~qC~~~s~C~~-------g~C~C~~g~~ 46 (52)
T PF01683_consen 26 EQCIGGSVCVN-------GRCQCPPGYV 46 (52)
T ss_pred CCCCCcCEEcC-------CEeECCCCCE
Confidence 44666777865 7899999885
No 74
>PHA02887 EGF-like protein; Provisional
Probab=80.07 E-value=1.4 Score=34.46 Aligned_cols=37 Identities=38% Similarity=0.881 Sum_probs=27.0
Q ss_pred cCCC-CCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826 104 PCLG-FPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN 142 (329)
Q Consensus 104 ~C~~-~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~ 142 (329)
+|+. ..+-|. +|.|.... ......|.|.+||+|..|+
T Consensus 85 pC~~eyk~YCi-HG~C~yI~-dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 85 KCKNDFNDFCI-NGECMNII-DLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred ccChHhhCEee-CCEEEccc-cCCCceeECCCCcccCCCC
Confidence 5654 335677 67998533 2566899999999999885
No 75
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=76.35 E-value=4 Score=27.00 Aligned_cols=12 Identities=33% Similarity=0.783 Sum_probs=9.7
Q ss_pred CcccccccCCcc
Q psy826 242 CAEGYKLQQNIC 253 (329)
Q Consensus 242 C~~Gy~~~~~~C 253 (329)
|++||...+++|
T Consensus 41 C~~g~~~~~~~C 52 (52)
T PF01683_consen 41 CPPGYVEVGGRC 52 (52)
T ss_pred CCCCCEecCCCC
Confidence 999998876665
No 76
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=74.57 E-value=5.4 Score=31.87 Aligned_cols=12 Identities=42% Similarity=1.032 Sum_probs=6.6
Q ss_pred CCCcCCCCcccc
Q psy826 140 LCNECNTGYFQS 151 (329)
Q Consensus 140 ~C~~C~~g~y~~ 151 (329)
.|..|++|.|.+
T Consensus 64 ~C~PCp~GTYTe 75 (127)
T PHA02637 64 QCTPCGSGTFTS 75 (127)
T ss_pred ccccCCCCCeec
Confidence 455666666543
No 77
>KOG4611|consensus
Probab=68.25 E-value=5.4 Score=37.54 Aligned_cols=25 Identities=32% Similarity=1.160 Sum_probs=17.4
Q ss_pred CCCcCCCCccccccCcCCCCCcCCccccc
Q psy826 140 LCNECNTGYFQSYKDEKTILCSKCHASCE 168 (329)
Q Consensus 140 ~C~~C~~g~y~~~~~~~~~~C~~C~~~C~ 168 (329)
.|..|..|||. + ....|..|..+|.
T Consensus 99 fcgncasgfyr---n-dngyctkcetscs 123 (747)
T KOG4611|consen 99 FCGNCASGFYR---N-DNGYCTKCETSCS 123 (747)
T ss_pred ccccccccceE---C-CCcccccccccHh
Confidence 68899999984 2 3345777776664
No 78
>KOG3782|consensus
Probab=67.54 E-value=0.79 Score=38.07 Aligned_cols=18 Identities=50% Similarity=0.899 Sum_probs=17.3
Q ss_pred cccccceEEEecccCccc
Q psy826 26 IYAKSEVRLIEIQEKMCS 43 (329)
Q Consensus 26 ~y~~se~~~~e~~e~~C~ 43 (329)
-|+|||.-|.|+||++|+
T Consensus 69 ~yarSE~hLTEl~E~iCd 86 (189)
T KOG3782|consen 69 RYARSEMHLTELMEKICD 86 (189)
T ss_pred ccchhHHHHHHHHHHHHH
Confidence 599999999999999999
No 79
>KOG1218|consensus
Probab=64.40 E-value=49 Score=30.28 Aligned_cols=107 Identities=24% Similarity=0.549 Sum_probs=54.3
Q ss_pred CCCCCCCCCc-cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCCcCCCCccccccCcCCCCCcCCcccccCCcc
Q psy826 94 PVDHYGADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCNECNTGYFQSYKDEKTILCSKCHASCESGCS 172 (329)
Q Consensus 94 ~~G~~G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~ 172 (329)
..+|.|..|. +|+-+.. |.. .+|.+ ... .|.+..+|.+..|.. .++++. .....| .+... +.
T Consensus 96 ~~~~~g~~C~~~~~~~~~-c~~-~~C~~----~~~-~c~~~~~~~~~~C~~--~~~~g~---~C~~~c-~~~~~----~~ 158 (316)
T KOG1218|consen 96 LNGYEGPQCESPCPCGDG-CAE-KTCAN----PRR-ECRCGGGYIGEQCGE--ENLVGL---KCQRDC-QCTGG----CD 158 (316)
T ss_pred CCCCCcccccCCCCcCCc-ccc-cccCC----Ccc-ceecCCcCccccccc--cCCCCC---CccCCC-CCccc----cC
Confidence 5677777776 3332211 333 45654 111 688888888887754 344432 122222 11111 11
Q ss_pred cCCCCCCccCCCCceecCCCCcccCCCCCCCCCCCCCceeecCCCCccc
Q psy826 173 TGGPKGCTKCKSGWAADKDIGCYDINECSDENICSGNQFCVNTEGSYRC 221 (329)
Q Consensus 173 ~~~~~~C~~C~~G~~~~~~~~C~d~~eC~~~~~C~~~~~C~n~~gs~~C 221 (329)
.....| .|.+||.+..... ... .|.....|..+..|....+...+
T Consensus 159 -~~~~~c-~c~~g~~g~~~~~-~~~-~c~~~~~~~~g~~C~~~~~~~~~ 203 (316)
T KOG1218|consen 159 -CKNGIC-TCQPGFVGVFCVE-SCS-GCSPLTACENGAKCNRSTGSCLC 203 (316)
T ss_pred -CCCCce-eccCCcccccccc-cCC-CcCCCcccCCCCeeecccccccc
Confidence 334678 7999999876321 110 14433455555566655444333
No 80
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=61.61 E-value=5.4 Score=26.66 Aligned_cols=22 Identities=32% Similarity=0.890 Sum_probs=16.6
Q ss_pred CCccCCCCceecCCCCcccCCCC
Q psy826 178 GCTKCKSGWAADKDIGCYDINEC 200 (329)
Q Consensus 178 ~C~~C~~G~~~~~~~~C~d~~eC 200 (329)
.| .|++||.++..+.|+...+|
T Consensus 34 gC-~C~~G~v~~~~~~CV~~~~C 55 (55)
T PF01826_consen 34 GC-FCPPGYVRNDNGRCVPPSEC 55 (55)
T ss_dssp EE-EETTTEEEETTSEEEEGGGC
T ss_pred cC-CCCCCeeEcCCCCEEcHHHC
Confidence 48 99999998875567765544
No 81
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=58.73 E-value=8.4 Score=33.85 Aligned_cols=30 Identities=27% Similarity=0.656 Sum_probs=23.4
Q ss_pred cCCCCCCCCCCCCcccCCCCCCCCceeecCCCCCC
Q psy826 104 PCLGFPNVCFGNGKCKGNGTRKGNGQCVCNKEYTG 138 (329)
Q Consensus 104 ~C~~~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g 138 (329)
+|+...+.|. ..|.+ +.|+|.|.|.+||+.
T Consensus 189 ~C~~~~~~c~--~~C~~---~~g~~~c~c~~g~~~ 218 (224)
T cd01475 189 LCATLSHVCQ--QVCIS---TPGSYLCACTEGYAL 218 (224)
T ss_pred hhcCCCCCcc--ceEEc---CCCCEEeECCCCccC
Confidence 5665556675 46888 899999999999974
No 82
>PF00757 Furin-like: Furin-like cysteine rich region; InterPro: IPR006211 The furin-like cysteine rich region has been found in a variety of proteins from eukaryotes that are involved in the mechanism of signal transduction by receptor tyrosine kinases, which involves receptor aggregation [].; GO: 0004714 transmembrane receptor protein tyrosine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation, 0007169 transmembrane receptor protein tyrosine kinase signaling pathway, 0016020 membrane; PDB: 3U2P_A 2AHX_B 1N8Y_C 1IGR_A 1S78_A 3MZW_A 2A91_A 1N8Z_C 3N85_A 3H3B_B ....
Probab=56.48 E-value=5.4 Score=33.03 Aligned_cols=34 Identities=29% Similarity=0.778 Sum_probs=24.8
Q ss_pred CcCCC-CCCcCCcccccccCcccccccCCcccccCcC
Q psy826 224 CDPSC-NGCHGDGPDMCEACAEGYKLQQNICINTQAK 259 (329)
Q Consensus 224 C~~~C-~~C~g~~~~~C~~C~~Gy~~~~~~C~~~~~~ 259 (329)
||+.| ++|+|+++.+|.+|+.-+. +++|++.++.
T Consensus 51 Ch~eClGGC~gp~~~~C~aCr~~~~--~g~Cv~~CP~ 85 (149)
T PF00757_consen 51 CHEECLGGCSGPNPSDCYACRHFSY--NGTCVEQCPP 85 (149)
T ss_dssp --TTEESEESSSSTTSESCESSEEE--TTEEESS-ST
T ss_pred hhHHHhhcCCCCCCCcCHHHhCcCc--CCeecccCCh
Confidence 88888 6899999999999986443 5688877655
No 83
>KOG0196|consensus
Probab=52.76 E-value=14 Score=38.81 Aligned_cols=49 Identities=22% Similarity=0.524 Sum_probs=28.8
Q ss_pred ccCcCCCCCC----CCCCccCCCCC-CCCCCCCc---ccCCC--CCCCCceeecCCCCC
Q psy826 89 LKRCCPVDHY----GADCKPCLGFP-NVCFGNGK---CKGNG--TRKGNGQCVCNKEYT 137 (329)
Q Consensus 89 ~~~~C~~G~~----G~~C~~C~~~~-~~C~~~~~---C~~~~--~~~gs~~C~C~~G~~ 137 (329)
.++.|.+||. |..|+.||.+. .+=..... |-... ...++-.|.|..||.
T Consensus 259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYY 317 (996)
T ss_pred CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcc
Confidence 3566999996 67999998642 11111222 22111 136677888888875
No 84
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=52.28 E-value=12 Score=29.97 Aligned_cols=37 Identities=27% Similarity=0.553 Sum_probs=25.6
Q ss_pred cCCC-CCCCCCCCCcccCCCCCCCCceeecCCCCCCCCCC
Q psy826 104 PCLG-FPNVCFGNGKCKGNGTRKGNGQCVCNKEYTGELCN 142 (329)
Q Consensus 104 ~C~~-~~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g~~C~ 142 (329)
+|+. ..+-|.+ |.|.... ......|.|..||.|..|+
T Consensus 44 ~Cp~ey~~YClH-G~C~yI~-dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 44 LCGPEGDGYCLH-GDCIHAR-DIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred cCChhhCCEeEC-CEEEeec-cCCCceeECCCCccccccc
Confidence 4543 2345764 4887522 2567899999999999995
No 85
>PF07699 GCC2_GCC3: GCC2 and GCC3; InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []: Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction []. Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases []. This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=41.16 E-value=46 Score=21.44 Aligned_cols=10 Identities=30% Similarity=1.002 Sum_probs=7.0
Q ss_pred CCCcCCCCcc
Q psy826 140 LCNECNTGYF 149 (329)
Q Consensus 140 ~C~~C~~g~y 149 (329)
.|..|+.|+|
T Consensus 10 ~C~~Cp~GtY 19 (48)
T PF07699_consen 10 KCQPCPKGTY 19 (48)
T ss_pred ccCCCCCCcc
Confidence 5667777776
No 86
>PHA02887 EGF-like protein; Provisional
Probab=40.04 E-value=20 Score=28.22 Aligned_cols=20 Identities=35% Similarity=0.674 Sum_probs=15.7
Q ss_pred CCccceeecccccCcCCCCCCCCCCc
Q psy826 78 SDLYTWLCINKLKRCCPVDHYGADCK 103 (329)
Q Consensus 78 ~~~~~~~C~~~~~~~C~~G~~G~~C~ 103 (329)
.++.+..|. |++||+|..|+
T Consensus 103 ~dL~epsCr------C~~GYtG~RCE 122 (126)
T PHA02887 103 IDLDEKFCI------CNKGYTGIRCD 122 (126)
T ss_pred ccCCCceeE------CCCCcccCCCC
Confidence 455556666 99999999997
No 87
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=39.52 E-value=61 Score=25.98 Aligned_cols=9 Identities=33% Similarity=1.080 Sum_probs=5.2
Q ss_pred CCcCCCCcc
Q psy826 141 CNECNTGYF 149 (329)
Q Consensus 141 C~~C~~g~y 149 (329)
|..|++|+|
T Consensus 44 C~kCPPGt~ 52 (127)
T PHA02637 44 CLSCPPGTY 52 (127)
T ss_pred cCCCCCCCE
Confidence 366666654
No 88
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=39.09 E-value=28 Score=26.81 Aligned_cols=33 Identities=24% Similarity=0.675 Sum_probs=24.2
Q ss_pred cCCCCCCCCCCCCCceeecCCCCcccCCCcCCCCCCcCCcccccccCcccccc
Q psy826 196 DINECSDENICSGNQFCVNTEGSYRCMQCDPSCNGCHGDGPDMCEACAEGYKL 248 (329)
Q Consensus 196 d~~eC~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C~g~~~~~C~~C~~Gy~~ 248 (329)
..+.|.....|.+..+|.. ..+..|. |.+||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~-------------------Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCS-------------------CLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceE-------------------CCCCcCC
Confidence 3456777678988899943 3456788 9999964
No 89
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=36.97 E-value=13 Score=25.13 Aligned_cols=32 Identities=34% Similarity=0.756 Sum_probs=17.1
Q ss_pred CCCCCCcccCCCC-CCCCceeecCCCCCCCCCC
Q psy826 111 VCFGNGKCKGNGT-RKGNGQCVCNKEYTGELCN 142 (329)
Q Consensus 111 ~C~~~~~C~~~~~-~~gs~~C~C~~G~~g~~C~ 142 (329)
+|+++|..--++. ..|.-.|.|+..|.|+.|.
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS 50 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence 4777777654442 2566789999999999884
No 90
>PF04706 Dickkopf_N: Dickkopf N-terminal cysteine-rich region; InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=34.26 E-value=66 Score=21.51 Aligned_cols=32 Identities=22% Similarity=0.733 Sum_probs=18.9
Q ss_pred CCCCCCCCCCceeecCCCCcccCCCcCCCCCC
Q psy826 200 CSDENICSGNQFCVNTEGSYRCMQCDPSCNGC 231 (329)
Q Consensus 200 C~~~~~C~~~~~C~n~~gs~~C~~C~~~C~~C 231 (329)
|.....|+.+.||......++|.+|...=..|
T Consensus 2 C~~D~dC~~g~yC~~~~~~~~C~~Cr~~~~rC 33 (52)
T PF04706_consen 2 CSSDEDCGYGKYCHSSSFESKCLPCRKRRKRC 33 (52)
T ss_pred CcccccCCCCCCcCCCCCCccChhhccCCCCC
Confidence 33334666677887766667777655443333
No 91
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=33.90 E-value=39 Score=25.98 Aligned_cols=26 Identities=23% Similarity=0.591 Sum_probs=19.9
Q ss_pred CCCCCCCCcccCCCCCCCCceeecCCCCCC
Q psy826 109 PNVCFGNGKCKGNGTRKGNGQCVCNKEYTG 138 (329)
Q Consensus 109 ~~~C~~~~~C~~~~~~~gs~~C~C~~G~~g 138 (329)
...|..++.|.. .....|.|.+||.-
T Consensus 83 y~~CG~~g~C~~----~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 83 YGFCGPNGICNS----NNSPKCSCLPGFEP 108 (110)
T ss_pred ccccCCccEeCC----CCCCceECCCCcCC
Confidence 456989999963 44567999999963
No 92
>PF07562 NCD3G: Nine Cysteines Domain of family 3 GPCR; InterPro: IPR011500 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents a conserved sequence, found in the extracellular region, that contains several highly-conserved Cys residues that are predicted to form disulphide bridges.; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway; PDB: 2E4X_B 2E4Y_A 2E4U_B 2E4V_B 2E4W_A.
Probab=33.45 E-value=11 Score=25.34 Aligned_cols=8 Identities=25% Similarity=0.534 Sum_probs=2.2
Q ss_pred CcCCCCcc
Q psy826 142 NECNTGYF 149 (329)
Q Consensus 142 ~~C~~g~y 149 (329)
..|++|++
T Consensus 7 ~~C~pG~~ 14 (54)
T PF07562_consen 7 EPCPPGQR 14 (54)
T ss_dssp ----TTTE
T ss_pred CCCCCCCE
Confidence 34555554
No 93
>KOG0814|consensus
Probab=30.03 E-value=9.2 Score=32.53 Aligned_cols=26 Identities=27% Similarity=0.553 Sum_probs=20.2
Q ss_pred CchhhhhccC-cccccceEEEecccCc
Q psy826 16 DTAWEEEKQK-IYAKSEVRLIEIQEKM 41 (329)
Q Consensus 16 ~~~~ee~~~~-~y~~se~~~~e~~e~~ 41 (329)
.|.|||+++. .++.|--.+++||+++
T Consensus 189 stV~EEk~~NPRLTk~~eeFv~IM~NL 215 (237)
T KOG0814|consen 189 STVWEEKNLNPRLTKSKEEFVKIMKNL 215 (237)
T ss_pred eehhhhhccCcccccCHHHHHHHHHhc
Confidence 4689999984 6777777777888776
No 94
>KOG3516|consensus
Probab=25.86 E-value=55 Score=35.85 Aligned_cols=32 Identities=34% Similarity=0.839 Sum_probs=27.5
Q ss_pred CCCCCCCCCcccCCCCCCCCceeecC-CCCCCCCCC
Q psy826 108 FPNVCFGNGKCKGNGTRKGNGQCVCN-KEYTGELCN 142 (329)
Q Consensus 108 ~~~~C~~~~~C~~~~~~~gs~~C~C~-~G~~g~~C~ 142 (329)
.+++|..++.|.. .-..+.|.|. .||+|..|.
T Consensus 549 lPN~CehgG~C~Q---s~~~f~C~C~~TGY~GatCH 581 (1306)
T KOG3516|consen 549 LPNPCEHGGKCSQ---SWDDFECNCELTGYKGATCH 581 (1306)
T ss_pred CCccccCCCcccc---cccceeEecccccccccccc
Confidence 5688999999997 5667999998 899999993
No 95
>KOG4258|consensus
Probab=24.97 E-value=81 Score=33.48 Aligned_cols=88 Identities=25% Similarity=0.580 Sum_probs=0.0
Q ss_pred ceeecccccCcCCCCCC--CCCCc-cCCCCCCCCCCCCcccCCCCCCCCceee--cCCCC--------------------
Q psy826 82 TWLCINKLKRCCPVDHY--GADCK-PCLGFPNVCFGNGKCKGNGTRKGNGQCV--CNKEY-------------------- 136 (329)
Q Consensus 82 ~~~C~~~~~~~C~~G~~--G~~C~-~C~~~~~~C~~~~~C~~~~~~~gs~~C~--C~~G~-------------------- 136 (329)
...|......-|..|-. |.-|- +|-+++..=.....|..-......+.|+ |+++|
T Consensus 213 ~~~CQK~Cp~~~g~gC~ang~cCH~eClGgC~~~~dptaC~aCr~~y~~G~CV~~Cp~~~Y~~e~~RCvt~~~C~~l~~~ 292 (1025)
T KOG4258|consen 213 NSHCQKSCPENCGNGCTANGECCHEECLGGCSQPNDPTACVACRGLYDDGVCVEACPPDYYLFENWRCVTREECALLHSL 292 (1025)
T ss_pred CCchhccCccccccCcccccccccHhhccCcCCCCChhHhHHhhccccCCceeccCCcchhhhccceeccHHHHHHhcCc
Q ss_pred ---------CCCCC-CcCCCCccccccCcCCCCCcCCcccccCCcc
Q psy826 137 ---------TGELC-NECNTGYFQSYKDEKTILCSKCHASCESGCS 172 (329)
Q Consensus 137 ---------~g~~C-~~C~~g~y~~~~~~~~~~C~~C~~~C~~~C~ 172 (329)
++..| ..||+||- .+..+..|.+|.+.|...|.
T Consensus 293 ~~~~i~G~~~~~~Cv~~CPsGy~---~N~~~~~C~~C~g~C~~vC~ 335 (1025)
T KOG4258|consen 293 SNSVISGVIHAGQCVAKCPSGYK---RNSSSSECVKCEGPCPKVCE 335 (1025)
T ss_pred ccccccceeccccchhhCCCcce---ecCcccceeccCCCCcceec
No 96
>PHA02714 CD-30-like protein; Provisional
Probab=24.16 E-value=86 Score=23.84 Aligned_cols=18 Identities=28% Similarity=0.933 Sum_probs=11.6
Q ss_pred CcccCCCCCCccCCCCcee
Q psy826 170 GCSTGGPKGCTKCKSGWAA 188 (329)
Q Consensus 170 ~C~~~~~~~C~~C~~G~~~ 188 (329)
.|....+..| .|.+|..-
T Consensus 52 PC~~ns~RvC-eCkpGm~C 69 (110)
T PHA02714 52 SCGPDKPRKC-QCGPGLKC 69 (110)
T ss_pred cCCCCCCcee-cCCCCCEE
Confidence 4554455778 88887654
No 97
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=21.09 E-value=43 Score=25.88 Aligned_cols=30 Identities=23% Similarity=0.438 Sum_probs=19.3
Q ss_pred CCCCCCCCCCcccCCCCC--CCCceeecCCCC
Q psy826 107 GFPNVCFGNGKCKGNGTR--KGNGQCVCNKEY 136 (329)
Q Consensus 107 ~~~~~C~~~~~C~~~~~~--~gs~~C~C~~G~ 136 (329)
...+.|++||.|...... ..=|.|.|.+..
T Consensus 10 ~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~ 41 (103)
T PF12955_consen 10 NATNNCSGHGSCVKKYGSGGGDCFACKCKPTV 41 (103)
T ss_pred HhccCCCCCceEeeccCCCccceEEEEeeccc
Confidence 344679999999962211 133678888844
No 98
>KOG4611|consensus
Probab=20.87 E-value=1.2e+02 Score=28.90 Aligned_cols=18 Identities=28% Similarity=0.634 Sum_probs=12.2
Q ss_pred CCCccCCCCceecCCCCc
Q psy826 177 KGCTKCKSGWAADKDIGC 194 (329)
Q Consensus 177 ~~C~~C~~G~~~~~~~~C 194 (329)
..|..|..||+.++.+-|
T Consensus 98 afcgncasgfyrndngyc 115 (747)
T KOG4611|consen 98 AFCGNCASGFYRNDNGYC 115 (747)
T ss_pred cccccccccceECCCccc
Confidence 456678888887765444
No 99
>KOG3509|consensus
Probab=20.63 E-value=1e+02 Score=33.39 Aligned_cols=62 Identities=29% Similarity=0.648 Sum_probs=37.6
Q ss_pred ceeecCCCCCCCCCCcCCCCccccccCc-CCCCCc---------CCc---ccccCCcc-cCCCCCCccCCCCceecC
Q psy826 128 GQCVCNKEYTGELCNECNTGYFQSYKDE-KTILCS---------KCH---ASCESGCS-TGGPKGCTKCKSGWAADK 190 (329)
Q Consensus 128 ~~C~C~~G~~g~~C~~C~~g~y~~~~~~-~~~~C~---------~C~---~~C~~~C~-~~~~~~C~~C~~G~~~~~ 190 (329)
-.|.|..||.|..|+.|.++|-...... ....+. .|. ..|. .|. ...+..|..|.+|++++.
T Consensus 718 ~~C~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~~~~~nt-~~q~~~~~~~~~~~~~g~~~da 793 (964)
T KOG3509|consen 718 EQCQCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCECNSHISQCEDDLGYNT-DCQNNTEGDRCELCSPGTYGDA 793 (964)
T ss_pred cccccCccccCcccccccccccccccCCcCcccCcccccCCCcccccccccccc-cccccCccceeeecCCCccccC
Confidence 4799999999999998888764322100 000111 111 1222 343 445678889999999876
Done!