Query psy5025
Match_columns 533
No_of_seqs 194 out of 2040
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 23:54:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5025.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5025hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.8E-50 3.8E-55 429.8 30.6 397 35-509 1227-1687(2531)
2 KOG3516|consensus 100.0 1.2E-48 2.7E-53 416.4 42.7 375 86-513 790-1179(1306)
3 KOG3514|consensus 100.0 3.9E-49 8.4E-54 412.0 37.2 445 4-532 98-631 (1591)
4 KOG3514|consensus 100.0 2E-47 4.2E-52 399.3 34.6 343 107-511 860-1217(1591)
5 KOG3516|consensus 100.0 1.8E-40 3.9E-45 354.2 40.3 459 2-530 257-966 (1306)
6 KOG4289|consensus 100.0 8.8E-33 1.9E-37 295.5 24.0 272 5-295 1392-1754(2531)
7 KOG1219|consensus 100.0 2E-27 4.3E-32 262.5 20.8 245 41-296 3641-3940(4289)
8 PF00054 Laminin_G_1: Laminin 99.9 9.2E-23 2E-27 178.1 18.3 129 345-507 1-131 (131)
9 PF00054 Laminin_G_1: Laminin 99.9 3.8E-22 8.3E-27 174.1 17.3 130 114-245 1-131 (131)
10 smart00282 LamG Laminin G doma 99.9 1.6E-21 3.4E-26 171.6 18.8 132 338-501 2-133 (135)
11 cd00110 LamG Laminin G domain; 99.9 6.7E-21 1.5E-25 170.7 20.2 150 299-501 2-151 (151)
12 cd00110 LamG Laminin G domain; 99.8 5.1E-20 1.1E-24 165.0 18.5 150 87-240 2-151 (151)
13 smart00282 LamG Laminin G doma 99.8 9.9E-20 2.1E-24 160.1 18.1 133 107-242 2-135 (135)
14 PF02210 Laminin_G_2: Laminin 99.8 3.1E-19 6.8E-24 155.0 16.6 126 345-503 1-127 (128)
15 KOG1219|consensus 99.8 5.6E-19 1.2E-23 196.7 19.1 195 261-512 3647-3844(4289)
16 PF02210 Laminin_G_2: Laminin 99.8 9.4E-18 2E-22 145.7 16.1 126 114-242 1-128 (128)
17 smart00210 TSPN Thrombospondin 99.1 7.3E-09 1.6E-13 95.8 16.7 125 336-500 51-180 (184)
18 KOG3509|consensus 98.7 2.8E-08 6E-13 109.4 8.8 250 40-295 168-442 (964)
19 smart00210 TSPN Thrombospondin 98.7 6.2E-07 1.3E-11 82.9 15.7 124 106-240 52-181 (184)
20 PF00008 EGF: EGF-like domain 98.7 5.9E-09 1.3E-13 66.3 1.4 31 47-77 1-32 (32)
21 PF13385 Laminin_G_3: Concanav 98.7 5.4E-07 1.2E-11 80.2 14.0 149 298-508 2-153 (157)
22 smart00159 PTX Pentraxin / C-r 98.5 5.5E-06 1.2E-10 78.0 15.2 140 336-513 30-172 (206)
23 PF00008 EGF: EGF-like domain 98.4 2E-07 4.4E-12 59.2 2.3 30 263-292 1-32 (32)
24 cd00152 PTX Pentraxins are pla 98.2 5.1E-05 1.1E-09 71.2 16.2 139 336-512 30-171 (201)
25 smart00179 EGF_CA Calcium-bind 98.2 1.5E-06 3.2E-11 58.2 3.7 36 44-79 2-39 (39)
26 PF13385 Laminin_G_3: Concanav 98.2 3.3E-05 7.2E-10 68.5 13.6 142 89-245 6-152 (157)
27 smart00159 PTX Pentraxin / C-r 98.2 8.6E-05 1.9E-09 69.9 16.7 148 90-245 15-166 (206)
28 PF00354 Pentaxin: Pentaxin fa 98.2 6.5E-05 1.4E-09 69.9 15.2 140 336-513 24-166 (195)
29 cd00152 PTX Pentraxins are pla 98.1 0.0002 4.3E-09 67.2 16.7 146 90-245 15-166 (201)
30 cd00054 EGF_CA Calcium-binding 98.0 6E-06 1.3E-10 54.6 3.7 36 44-79 2-38 (38)
31 PF00354 Pentaxin: Pentaxin fa 98.0 0.00031 6.7E-09 65.4 15.1 148 90-245 9-160 (195)
32 cd00053 EGF Epidermal growth f 97.7 3.5E-05 7.6E-10 50.1 3.6 33 47-79 2-36 (36)
33 PF02973 Sialidase: Sialidase, 97.7 0.0007 1.5E-08 61.8 13.1 136 337-508 33-179 (190)
34 smart00181 EGF Epidermal growt 97.6 6.2E-05 1.3E-09 48.9 3.5 32 47-79 2-35 (35)
35 PF07645 EGF_CA: Calcium-bindi 97.5 6.1E-05 1.3E-09 51.2 2.0 31 44-74 2-34 (42)
36 PF02973 Sialidase: Sialidase, 97.5 0.0033 7.1E-08 57.5 13.6 133 106-245 33-178 (190)
37 smart00179 EGF_CA Calcium-bind 97.4 0.00018 4E-09 47.8 3.7 34 261-294 3-39 (39)
38 cd00054 EGF_CA Calcium-binding 97.3 0.00036 7.7E-09 45.9 3.6 34 261-294 3-38 (38)
39 KOG3509|consensus 97.2 0.00097 2.1E-08 74.4 8.8 209 41-281 403-612 (964)
40 smart00560 LamGL LamG-like jel 97.1 0.012 2.7E-07 51.1 13.2 68 398-506 61-130 (133)
41 PF07974 EGF_2: EGF-like domai 97.1 0.00058 1.3E-08 43.1 3.1 27 50-78 6-32 (32)
42 cd00053 EGF Epidermal growth f 97.1 0.00077 1.7E-08 43.5 3.7 31 263-293 2-35 (36)
43 smart00181 EGF Epidermal growt 96.8 0.0017 3.8E-08 41.9 3.6 31 262-293 1-34 (35)
44 PF12661 hEGF: Human growth fa 96.5 0.00083 1.8E-08 33.1 0.2 13 66-78 1-13 (13)
45 smart00560 LamGL LamG-like jel 96.3 0.12 2.7E-06 44.7 13.2 69 168-245 61-131 (133)
46 KOG1834|consensus 96.1 0.084 1.8E-06 55.8 12.6 135 335-501 364-515 (952)
47 PF07645 EGF_CA: Calcium-bindi 95.9 0.0089 1.9E-07 40.5 3.1 29 261-289 3-34 (42)
48 PF07974 EGF_2: EGF-like domai 95.7 0.013 2.8E-07 37.0 2.9 26 266-293 6-32 (32)
49 KOG1836|consensus 95.2 0.0062 1.4E-07 72.6 0.8 135 338-512 1557-1692(1705)
50 PF12947 EGF_3: EGF domain; I 95.0 0.011 2.3E-07 38.5 1.0 27 50-76 6-32 (36)
51 PF12947 EGF_3: EGF domain; I 94.4 0.029 6.2E-07 36.5 2.0 26 266-291 6-32 (36)
52 PHA03099 epidermal growth fact 94.4 0.027 5.9E-07 46.9 2.3 33 49-82 50-84 (139)
53 PHA02887 EGF-like protein; Pro 94.0 0.048 1E-06 44.7 2.9 36 45-81 84-124 (126)
54 KOG1834|consensus 93.3 1.5 3.2E-05 46.9 12.8 156 88-244 346-519 (952)
55 PHA03099 epidermal growth fact 93.1 0.095 2.1E-06 43.8 3.2 35 266-301 51-88 (139)
56 PHA02887 EGF-like protein; Pro 93.0 0.092 2E-06 43.1 2.9 35 262-296 85-124 (126)
57 PF12662 cEGF: Complement Clr- 92.3 0.091 2E-06 30.7 1.5 17 64-80 1-21 (24)
58 PF06439 DUF1080: Domain of Un 91.2 0.29 6.3E-06 44.8 4.5 91 338-460 54-157 (185)
59 PF12946 EGF_MSP1_1: MSP1 EGF 90.9 0.093 2E-06 34.0 0.6 32 47-78 2-36 (37)
60 KOG1214|consensus 90.2 0.37 7.9E-06 52.6 4.7 42 35-76 814-859 (1289)
61 smart00051 DSL delta serrate l 90.1 0.49 1.1E-05 35.1 4.0 36 39-78 27-63 (63)
62 PF14670 FXa_inhibition: Coagu 90.1 0.21 4.7E-06 32.4 1.8 18 56-73 10-27 (36)
63 KOG3546|consensus 88.8 1 2.2E-05 47.8 6.5 127 336-501 86-223 (1167)
64 PF06439 DUF1080: Domain of Un 87.0 0.69 1.5E-05 42.3 3.8 101 92-196 43-155 (185)
65 PF14099 Polysacc_lyase: Polys 86.5 10 0.00022 35.9 11.6 67 363-456 113-183 (224)
66 KOG3546|consensus 86.0 2.7 5.9E-05 44.7 7.7 129 106-243 87-226 (1167)
67 KOG1225|consensus 84.7 0.67 1.5E-05 49.2 2.7 40 35-79 270-310 (525)
68 KOG1217|consensus 84.2 0.84 1.8E-05 48.4 3.3 35 44-78 271-306 (487)
69 KOG1217|consensus 83.6 0.94 2E-05 48.0 3.3 47 35-81 157-208 (487)
70 cd01475 vWA_Matrilin VWA_Matri 82.4 1.1 2.5E-05 42.5 3.1 35 39-75 181-218 (224)
71 KOG1225|consensus 81.0 1.3 2.9E-05 47.0 3.2 41 35-80 239-280 (525)
72 KOG4260|consensus 79.6 1.3 2.7E-05 42.3 2.2 59 23-81 119-184 (350)
73 PF00053 Laminin_EGF: Laminin 78.5 1.6 3.6E-05 30.3 2.0 24 56-81 11-34 (49)
74 KOG1214|consensus 76.9 1.9 4E-05 47.4 2.8 31 261-291 828-859 (1289)
75 PF14670 FXa_inhibition: Coagu 76.9 1.9 4.2E-05 28.0 1.8 20 271-290 10-29 (36)
76 KOG1836|consensus 76.7 2.8 6.1E-05 51.0 4.4 105 134-245 1581-1687(1705)
77 PF00954 S_locus_glycop: S-loc 75.3 10 0.00022 31.4 6.4 32 44-76 77-109 (110)
78 KOG0994|consensus 68.3 16 0.00034 42.2 7.3 136 45-180 508-690 (1758)
79 cd00055 EGF_Lam Laminin-type e 64.8 6 0.00013 27.7 2.3 22 58-81 14-35 (50)
80 PF09064 Tme5_EGF_like: Thromb 62.8 6.6 0.00014 24.9 1.9 13 278-290 16-28 (34)
81 PF02057 Glyco_hydro_59: Glyco 59.4 1.9E+02 0.0041 32.1 13.4 60 363-456 578-637 (669)
82 PF12955 DUF3844: Domain of un 59.2 7 0.00015 31.9 2.0 37 45-81 6-62 (103)
83 cd01475 vWA_Matrilin VWA_Matri 56.2 9.7 0.00021 36.1 2.9 29 261-290 188-218 (224)
84 PF14099 Polysacc_lyase: Polys 54.1 53 0.0012 30.9 7.6 59 134-192 113-181 (224)
85 PF09264 Sial-lect-inser: Vibr 53.0 98 0.0021 28.2 8.3 83 339-455 34-119 (198)
86 cd01951 lectin_L-type legume l 52.4 2E+02 0.0044 26.9 12.1 10 398-407 154-163 (223)
87 KOG4260|consensus 51.5 7.7 0.00017 37.2 1.3 35 41-75 232-269 (350)
88 PF00954 S_locus_glycop: S-loc 49.9 15 0.00033 30.4 2.8 29 261-290 78-108 (110)
89 cd00413 Glyco_hydrolase_16 gly 49.4 2.1E+02 0.0046 26.3 12.3 30 396-457 140-169 (210)
90 PF12946 EGF_MSP1_1: MSP1 EGF 48.3 16 0.00034 23.9 1.9 28 263-290 2-31 (37)
91 PF04863 EGF_alliinase: Alliin 47.7 6.6 0.00014 27.8 0.2 32 51-82 18-53 (56)
92 smart00180 EGF_Lam Laminin-typ 47.7 13 0.00027 25.5 1.6 18 64-81 17-34 (46)
93 PF07622 DUF1583: Protein of u 43.7 28 0.00061 35.4 3.9 31 4-34 83-113 (399)
94 PF01683 EB: EB module; Inter 39.9 38 0.00083 23.6 3.1 26 45-74 20-46 (52)
95 PF12955 DUF3844: Domain of un 37.3 29 0.00062 28.5 2.3 22 266-287 13-40 (103)
96 cd06899 lectin_legume_LecRK_Ar 36.3 3.9E+02 0.0084 25.4 12.0 13 395-407 160-172 (236)
97 PF07953 Toxin_R_bind_N: Clost 35.4 3.5E+02 0.0077 24.7 12.2 88 336-457 54-156 (195)
98 PTZ00334 trans-sialidase; Prov 31.9 68 0.0015 36.2 4.9 78 393-507 638-715 (780)
99 KOG1226|consensus 31.3 43 0.00094 37.1 3.2 27 51-81 556-582 (783)
100 PF02057 Glyco_hydro_59: Glyco 29.1 5.3E+02 0.011 28.8 10.9 56 135-192 579-635 (669)
101 cd01951 lectin_L-type legume l 28.7 4.1E+02 0.0088 24.8 9.2 23 168-190 154-178 (223)
102 PF05910 DUF868: Plant protein 27.5 1.3E+02 0.0027 29.5 5.4 38 441-480 173-210 (274)
103 PF03425 CBM_11: Carbohydrate 26.5 1.7E+02 0.0037 26.5 5.9 49 350-409 73-121 (178)
104 PTZ00334 trans-sialidase; Prov 24.8 5.7E+02 0.012 29.2 10.4 74 165-245 640-715 (780)
105 smart00289 WR1 Worm-specific r 23.0 78 0.0017 20.2 2.2 23 46-70 15-38 (38)
106 cd02178 GH16_beta_agarase Beta 22.7 1.9E+02 0.0042 27.9 5.9 29 397-457 178-207 (258)
107 PF08787 Alginate_lyase2: Algi 22.2 6.8E+02 0.015 23.8 12.4 52 371-455 135-189 (236)
108 PF09313 DUF1971: Domain of un 20.7 4.2E+02 0.0091 20.7 7.1 47 361-407 22-74 (82)
109 KOG0994|consensus 20.2 1E+02 0.0023 36.0 3.6 38 44-81 903-950 (1758)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.8e-50 Score=429.83 Aligned_cols=397 Identities=25% Similarity=0.533 Sum_probs=314.2
Q ss_pred ceee-cCCCC--CCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccccccc-------c-------------------
Q psy5025 35 IKGR-SKECL--EDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDL-------Q------------------- 85 (533)
Q Consensus 35 ~~G~-~~~C~--~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~-------~------------------- 85 (533)
++|| +.-|+ +|.|-+.||.|+|+|....++|+|.|.++|+|.+||.+... |
T Consensus 1227 PpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp 1306 (2531)
T KOG4289|consen 1227 PPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCP 1306 (2531)
T ss_pred CCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCC
Confidence 7888 88888 79999999999999999999999999999999999987421 1
Q ss_pred ------------ccccCCceeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcC
Q psy5025 86 ------------VPQFNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLG 153 (533)
Q Consensus 86 ------------~~~F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g 153 (533)
+.+|.+.||+.++.....+ +++++|+|.|...+|+|+|.|+. +.||++|++.++++++.|+.|
T Consensus 1307 ~ge~e~prC~v~trSFp~~sfv~frglrqRf--h~TlslsfaT~~~nGlL~ynGne---khDFvalevVd~qvqltfS~G 1381 (2531)
T KOG4289|consen 1307 YGEFEDPRCEVTTRSFPPESFVTFRGLRQRF--HFTLSLSFATIERNGLLLYNGNE---KHDFVALEVVDEQVQLTFSAG 1381 (2531)
T ss_pred CcccCCCceEEEeeccCchheEEEeccccce--EEEEEEEEEEeeecceEEecCCc---ccceEeeeeeeeeEEEEEecc
Confidence 5579999999999876655 78999999999999999999954 499999999999999999999
Q ss_pred CCceEEE--eeccccCCCcEEEEEEEECCEEEEEEcCccCcee-------------eCCCCceeeecCCCEEEcCcCCCC
Q psy5025 154 TGAATLR--SSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQI-------------LSPGPFTQLSLSLSLYLGGVPDYN 218 (533)
Q Consensus 154 ~~~~~v~--s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~-------------~~~~~~~~l~~~~~l~vGG~p~~~ 218 (533)
....++. .+..++||+||+|.++..++.+.++||++..... ...+....+++.+++++||+|...
T Consensus 1382 es~t~v~p~Vp~gvsDGqWHtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~f 1461 (2531)
T KOG4289|consen 1382 ESTTTVSPDVPGGVSDGQWHTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETF 1461 (2531)
T ss_pred cccceecCCCCCCcccCceeEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcc
Confidence 6665554 3578999999999999999999999999875432 223456678888999999999654
Q ss_pred ccCCCcccccCceeeeeeEEECcEEcccccccccC-CCccCCC---CCCCCCCC-CCCeeeecCCceEEecCCCCCcCcc
Q psy5025 219 IVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQG-VNVVNCA---HPCITQPC-GSGSCVPLYDGYKCVCTQSCKESHL 293 (533)
Q Consensus 219 ~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~-~~~~~C~---~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~~c 293 (533)
. .....|.|||+++.++++.+| ++..... ....+|+ +.|.+.+| |+|+|++.|++|.|+||.++.|..|
T Consensus 1462 p-----v~~k~FvGCmrdLsvD~~~VD-ma~fianngt~eGC~ark~fCdsg~C~n~g~CvnrWg~~~C~CP~~fggk~c 1535 (2531)
T KOG4289|consen 1462 P-----VIEKQFVGCMRDLSVDGRDVD-MATFIANNGTHEGCKARKNFCDSGQCSNGGTCVNRWGGFSCECPLGFGGKGC 1535 (2531)
T ss_pred h-----hhHhHhhhhhhhccccccccc-HHHHHhhcCcccCchhhhcccCCCccCCCCeeecccCcEeecCccccCCcch
Confidence 2 233789999999999999999 4443332 2357894 78999999 9999999999999999999999888
Q ss_pred cce--eeeeecCCCeeEEecCccccccccccceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEE
Q psy5025 294 TEV--MCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIH 371 (533)
Q Consensus 294 ~~~--~~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~ 371 (533)
... .+-.| .|.|.+.+...+. ..+..+.++|.|||++.+|+|+.+...+. .-+.+.|.
T Consensus 1536 ~~~m~~pq~f-rG~sl~sw~~~~~------------------~vSvPwylsl~FRTr~ad~vl~~~~~~~r-st~~lqld 1595 (2531)
T KOG4289|consen 1536 CQGMAHPQHF-RGHSLVSWEGLPS------------------QVSVPWYLSLMFRTRRADGVLMQAEFGGR-STYNLQLD 1595 (2531)
T ss_pred hhccCCchhc-cccceeeecCCCc------------------ceecceEEEEEEEeeccccEEEEEEeCCC-ceEEEEEc
Confidence 653 33455 7788777764433 23567899999999999999999875554 44889999
Q ss_pred CCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCc
Q psy5025 372 NGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTD 451 (533)
Q Consensus 372 ~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~ 451 (533)
+|+|.+.+ +... +.+...+++||+||++.+.. ... ..++.|..
T Consensus 1596 ~g~l~~~v--~~s~--v~L~~~~vtdg~Wh~~~i~l--------------------------------~~d-~~~t~d~g 1638 (2531)
T KOG4289|consen 1596 DGTLKYNV--GDSS--VELPAPRVTDGHWHHLVIEL--------------------------------EAD-SVATLDYG 1638 (2531)
T ss_pred CCEEEEEe--cCce--EEccCccccCCchhheeeee--------------------------------ccC-eEEEEech
Confidence 99987765 3333 34478899999999999998 433 45555543
Q ss_pred -eeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEEEEEcCceEEeec
Q psy5025 452 -YSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSNDYQVKLP 509 (533)
Q Consensus 452 -~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~~~ 509 (533)
....... + ...++..+ ||+||+|... ...+|.|||+++++ ++...-.+
T Consensus 1639 ~~~aea~~-g-l~gl~l~s-l~vGgap~~g------~p~gf~GCiqgV~v-~g~~~l~~ 1687 (2531)
T KOG4289|consen 1639 IYQAEAKA-G-LSGLNLES-LYVGGAPATG------VPRGFRGCIQGVRV-GGVSILVP 1687 (2531)
T ss_pred hhhhhhhc-C-CCCceeeE-EEEccccCCC------ccccchhhhhceEE-CCEeeccc
Confidence 2222111 1 33455554 9999999542 46799999999999 66655443
No 2
>KOG3516|consensus
Probab=100.00 E-value=1.2e-48 Score=416.42 Aligned_cols=375 Identities=24% Similarity=0.416 Sum_probs=289.2
Q ss_pred ccccC-CceeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECC-EEEEEEEcCCCceEEE--e
Q psy5025 86 VPQFN-GSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDR-YVDFTFDLGTGAATLR--S 161 (533)
Q Consensus 86 ~~~F~-g~syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G-~l~~~~~~g~~~~~v~--s 161 (533)
.++|. +.+|+.+++..... ...|+|.|||+.+.|++|..-. ..||+.|+|... .+.|.++.|+++..++ +
T Consensus 790 svSF~~~~syL~fp~f~~~~--saDIsf~FrTt~~~gvflen~g----~~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s 863 (1306)
T KOG3516|consen 790 SVSFHTGASYLHFPPFHNEL--SADISFFFRTTASSGVFLENHG----INDFIRLELSSPVEVTFAFDVGNGPSQLTVRS 863 (1306)
T ss_pred ceEeecCcceeecCcccCcc--cccEEEEEEecCCceEeeeccC----CCceEEEEEcCCCceEEEEEcCCCceeEEEcC
Confidence 44565 45799998876654 6799999999999999999854 489999999875 8999999999886554 6
Q ss_pred eccccCCCcEEEEEEEECCEEEEEEcCccCceeeCC-CCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEEC
Q psy5025 162 SNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSP-GPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN 240 (533)
Q Consensus 162 ~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~-~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in 240 (533)
+..+||++||+|+++++.+.+.|+||+........+ .....+.+.+++||||.... ..+|.||||.+++|
T Consensus 864 ~t~~nD~qWH~V~~Ern~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vGgt~~~---------~~gF~GCIRsl~LN 934 (1306)
T KOG3516|consen 864 PTELNDNQWHQVRAERNSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVGGTVSR---------QRGFLGCIRSLQLN 934 (1306)
T ss_pred CcccCCCceEEEEEEeccccceEEEcCcccceecCCCCCEEEEEeccceeccccccC---------cCcceeeeeeeeec
Confidence 678999999999999999999999999765554433 34667888899999996643 37999999999999
Q ss_pred cEEcccccccccCC-CccCCCCCCCCCCC-CCCeeeecCCceEEecCC-CCCcCcccceeeeeecCCCeeEEecCccccc
Q psy5025 241 NHELNILGEAIQGV-NVVNCAHPCITQPC-GSGSCVPLYDGYKCVCTQ-SCKESHLTEVMCASFDHGASFLHYLDPEIIH 317 (533)
Q Consensus 241 ~~~l~~~~~~~~~~-~~~~C~~~C~~~pC-ngg~C~~~~~~~~C~C~~-g~~G~~c~~~~~~~f~~~~s~~~~~~~~~~~ 317 (533)
|+.+++...+.... ...+|+..|++.|| |||+|++.|++|+|+|.. .|.|+.|.....+.| ..++++.|+.....+
T Consensus 935 Gv~ldLe~ra~~~~gv~~GC~GhCss~~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~eig~~f-e~gs~i~y~fq~~~~ 1013 (1306)
T KOG3516|consen 935 GVMLDLEYRAYGTAGVSPGCEGHCSSYPCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEIGVFF-ERGSSIRYNFQKPMR 1013 (1306)
T ss_pred ceeeeehhhhccCCcccCCCccccccccccCCCEEEEecCceeeccccCcCCCCccccccceEe-cCCceEEEeccchHH
Confidence 99999643332222 24689999999999 999999999999999997 799999998888888 677888887765431
Q ss_pred cccc--cceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEE-CCEEEEEEEcCC-ceEEEeeCce
Q psy5025 318 SSGL--LHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIH-NGYAHVMYNLGN-GEVLLENNQT 393 (533)
Q Consensus 318 ~~~~--~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~-~g~l~~~~~~g~-~~~~~~~~~~ 393 (533)
.... .+..+ .. .-.....-.|.|.|+|+...++|||..+ ..++|+.|-|. ||.|.+++.+|. .+..+.+...
T Consensus 1014 ~a~~~~~~~~~-~~--~~~~~~~e~i~~sftTt~~ps~LLfvss-F~~~y~~V~v~~nGsLq~ry~lg~~e~~~~~~~~k 1089 (1306)
T KOG3516|consen 1014 SAVFESSRVKQ-KL--EIEINPNEEINFSFTTTRAPSDLLFVSS-FTDDYLAVLVKDNGSLQTRYMLGFREPFEYQFKDK 1089 (1306)
T ss_pred Hhhhhhhhhhh-cc--ccccCccceEEEEEEeccCceEEEEeec-cccceEEEEEeCCCceEEEEecCCcCceEEecccc
Confidence 1100 00000 00 1112344689999999999999998764 45789999886 899999999987 5667777888
Q ss_pred eecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEE
Q psy5025 394 KINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYV 473 (533)
Q Consensus 394 ~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~l 473 (533)
++.||+.|+|.+.| ..+.+.|+||+.......... ...++....|++
T Consensus 1090 n~~~gq~H~i~i~r--------------------------------~~~~~~i~vD~y~~~~y~~~~-~~~~~~~ksl~l 1136 (1306)
T KOG3516|consen 1090 NIALGQPHDINITR--------------------------------GPRTVFLEVDGYLKVEYTFSI-DVDFQSPKSLTL 1136 (1306)
T ss_pred cccCCCceEEEEec--------------------------------CCceEEEEecCccceeeeccc-ceeecccchhhc
Confidence 99999999999999 889999999997766544322 233444444666
Q ss_pred cCCCCcccc---cccccccCceeeEEEEEEcCceEEeecccCc
Q psy5025 474 GGVEDIEEL---TRSKYHHSFKGWISNLILSNDYQVKLPLSRS 513 (533)
Q Consensus 474 Gg~~~~~~~---~~~~~~~~f~GCi~~l~i~~~~~l~~~~~~~ 513 (533)
|-+-..... .......+|.||++.|++++-.||+.+....
T Consensus 1137 g~v~e~~~~d~~~~k~~t~gF~GClS~Vqf~~vaPLK~~~~~~ 1179 (1306)
T KOG3516|consen 1137 GPVTETANIDHEISKYNTPGFGGCLSRVQFNDVAPLKAAFRTE 1179 (1306)
T ss_pred cceeeccCCChhHHhhcCCCccceeeEEEccccccchhccccC
Confidence 654333221 1123567899999999998888998877655
No 3
>KOG3514|consensus
Probab=100.00 E-value=3.9e-49 Score=412.04 Aligned_cols=445 Identities=22% Similarity=0.405 Sum_probs=346.8
Q ss_pred CCccccCCCceEEEEEEeCceeEEEEcCCCC-------------------------------------------------
Q psy5025 4 SDNQVILDDWNTVRVFRHRWDAWIQLNDGNK------------------------------------------------- 34 (533)
Q Consensus 4 s~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~------------------------------------------------- 34 (533)
-..+|+|++||+|++.|+-..+.|.||+...
T Consensus 98 ~~~~i~D~~WH~v~i~r~~e~t~L~vDgv~~~~~~~~~~f~fg~iasdvfVGGlP~~~~la~l~lp~v~yep~frg~~rn 177 (1591)
T KOG3514|consen 98 RRVRIDDDKWHTVTIFRSWENTKLEVDGVLVFKILNQRSFVFGNIASDVFVGGLPNMHMLAVLSLPLVRYEPRFRGNVRN 177 (1591)
T ss_pred ecceecCCceeEEEEEeccccceEEechhhhhhhhhcceeeeeeeehheeecCCChHHhhhhhcCcccccccccCcccee
Confidence 3468999999999999999999999999443
Q ss_pred ------ceee----------cCCCCC------CCCC-CCCCCCCCeeeeCCCCceecCCCCCCCCCccccccccccccCC
Q psy5025 35 ------IKGR----------SKECLE------DKCN-KTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDLQVPQFNG 91 (533)
Q Consensus 35 ------~~G~----------~~~C~~------d~C~-~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g 91 (533)
+.|. ..+|.. ..|. ..+|+|+|.|.....+.+|.|...+.|++||+.-....+.|.|
T Consensus 178 l~y~~~p~g~t~~q~l~~~~d~~c~d~~~~~~~~~~~~~~c~~~g~c~s~d~gp~c~c~~~~dgq~cekeK~~~eaTF~G 257 (1591)
T KOG3514|consen 178 LMYRQYPQGVTSPQLLEVGTDTNCDDHCKSKSMSSREQFVCLNDGECYSSDDGPHCDCQFDHDGQNCEKEKNDGEATFGG 257 (1591)
T ss_pred eeeecCCCCcCChhhhhcccCCCCcCCCCCccccccccceeccCCeEecCCCCCccccccccCccccccccCcceEEecC
Confidence 1111 123331 2233 3689999999999999999999999999999988777889999
Q ss_pred ceeEEEcCCcccc-cceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE---eeccccC
Q psy5025 92 SSYLRYQGLGEQY-LSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR---SSNPISL 167 (533)
Q Consensus 92 ~syl~~~~~~~~~-~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~---s~~~lnD 167 (533)
+.|+.|+....+. +....|+|.|||.+++|+|||++.. .||+.|.|++|.+.+...++++..++. ...+++|
T Consensus 258 ~ef~~YDls~npI~s~~d~itl~FrT~q~ngllfytG~~----~dYlnlaL~dGaV~l~~~l~~g~~e~~~~p~~~rfdD 333 (1591)
T KOG3514|consen 258 DEFVGYDLSQNPIRSKKDNITLTFRTVQGNGLLFYTGDE----KDYLNLALQDGAVSLSSKLDGGDAEIIRMPNSFRFDD 333 (1591)
T ss_pred ceEEEeeccCCcccccccceEEEEEEecCceeEEEccCC----cceeeEeecCCcEEEEEecCCccceeEEccccccccC
Confidence 9999998655443 3567899999999999999999875 799999999999999999987764432 4578999
Q ss_pred CCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcccc
Q psy5025 168 GEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNIL 247 (533)
Q Consensus 168 g~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~ 247 (533)
..||.|.++|+-.-+.+.||+.........+.+.++..+.-+|+||.|....++.. ...|.||++++.+....+.+.
T Consensus 334 ~~WH~V~v~R~~~m~t~~VDg~~t~~~~~a~~~tmlsss~~fyvgg~~~~~~l~gs---rVsF~GClkkV~y~~d~~rl~ 410 (1591)
T KOG3514|consen 334 DSWHTVIVERSLQMMTLIVDGRRTEIRQYAPELTMLSSSDFFYVGGSPNTADLPGS---RVSFMGCLKKVVYKNDDTRLE 410 (1591)
T ss_pred CcceEEEEEeeeEEEEEEEccEEecccccccceeEeeccceEEecCCCCccccCCC---ceeeeeeeeeeEeccCceeeh
Confidence 99999999999999999999988777777778888887777999999987665432 234999999999976654431
Q ss_pred cc--cccCCCccCCCCCCCCCCCCCCeeeecCCceEEecCCCCCcCcccceeeeeecCCCeeEEecCcccccccccccee
Q psy5025 248 GE--AIQGVNVVNCAHPCITQPCGSGSCVPLYDGYKCVCTQSCKESHLTEVMCASFDHGASFLHYLDPEIIHSSGLLHYK 325 (533)
Q Consensus 248 ~~--~~~~~~~~~C~~~C~~~pCngg~C~~~~~~~~C~C~~g~~G~~c~~~~~~~f~~~~s~~~~~~~~~~~~~~~~~~~ 325 (533)
-. +.++..... +...-+|.| +.+ ....+++|....||+.++.|+..
T Consensus 411 L~~LAk~g~~~~k---------------~~G~l~y~C--~n~------~~~DpvtFtt~es~l~LP~Wnt~--------- 458 (1591)
T KOG3514|consen 411 LSRLAKQGDSKMK---------------TEGDLSYSC--ENV------AQLDPVTFTTPESYLTLPRWNTK--------- 458 (1591)
T ss_pred hhHHhhcCCceeE---------------eeceEEEec--CCC------CccCceeeecccceeeccccccC---------
Confidence 11 111111000 000112333 211 12356899889999999988753
Q ss_pred eeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCC--CCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEE
Q psy5025 326 FQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQ--SKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRL 403 (533)
Q Consensus 326 ~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~--~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v 403 (533)
..-+|+|.|||+++||+|||.+..+ ..+|++++|.||+|.+.+++|++.+++..+..+++||+||+|
T Consensus 459 -----------~~gSiSf~FRTtepnGlil~~~g~~~~~~d~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv 527 (1591)
T KOG3514|consen 459 -----------KSGSISFDFRTTEPNGLILFHGGPQANATDYFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHV 527 (1591)
T ss_pred -----------CcceeEEEEeecCCCceEEEccCcccccccEEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEE
Confidence 3468999999999999999987554 379999999999999999999999999988999999999999
Q ss_pred EEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccc-
Q psy5025 404 TVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEEL- 482 (533)
Q Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~- 482 (533)
.+.| +++...+.||...... ..++..+.|+.++++|||-++.....
T Consensus 528 ~l~R--------------------------------~gR~gsvsVd~~~~df-~tpG~s~iL~ld~~mylG~~~n~l~~P 574 (1591)
T KOG3514|consen 528 DLQR--------------------------------DGRTGSVSVDAIKTDF-STPGDSEILDLDDPMYLGEVPNNLVYP 574 (1591)
T ss_pred Eeec--------------------------------cCccceEEEeeeecCc-cCCCcceeEeecCceeeccCCCCccCc
Confidence 9999 9999999999987776 35677788999999999966655322
Q ss_pred ---cccccccCceeeEEEEEEcCceEEeecccCc--CCCCcccceee---ecCCceee
Q psy5025 483 ---TRSKYHHSFKGWISNLILSNDYQVKLPLSRS--QGSSCQLTCET---LNTGQVTC 532 (533)
Q Consensus 483 ---~~~~~~~~f~GCi~~l~i~~~~~l~~~~~~~--~~~~c~~~~~~---~~~~~~~~ 532 (533)
|......+|.||||+|+| +++..|+.+.+. .+.+...+|.- +.|..++|
T Consensus 575 ~~vWta~L~~GyvGCirdl~i-~G~s~di~q~ae~q~sagvkpsCs~~~~~~C~~nPC 631 (1591)
T KOG3514|consen 575 SEVWTAALRKGYVGCIRDLFI-DGVSTDIRQEAEAQNSAGVKPSCSLSNEKICESNPC 631 (1591)
T ss_pred HHHHHHHHhccchheehhhee-cceehhhHHHhhhccccccCcccchhhccccCCCcc
Confidence 344567899999999999 899999877654 33344444432 24666665
No 4
>KOG3514|consensus
Probab=100.00 E-value=2e-47 Score=399.32 Aligned_cols=343 Identities=24% Similarity=0.480 Sum_probs=278.6
Q ss_pred eeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE--eeccccCCCcEEEEEEEECC-EEE
Q psy5025 107 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR--SSNPISLGEWHKLRLTRTGR-HAY 183 (533)
Q Consensus 107 ~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~--s~~~lnDg~Wh~V~v~~~~~-~~~ 183 (533)
...|-|+|+|++++|+|+|.+... +||++|+|.+|+|.+.|++|+++..++ +...+||++||.|.|.|... .-.
T Consensus 860 s~~l~Fqfkt~sp~gll~fn~gd~---ndfi~velvnG~ihYtfdlg~gp~~~k~~sr~hlnDnrWHnV~I~rd~~~~Ht 936 (1591)
T KOG3514|consen 860 SMHLFFQFKTTSPDGLLLFNSGDG---NDFIAVELVNGYIHYTFDLGNGPTSMKGPSRQHLNDNRWHNVLIYRDKTNTHT 936 (1591)
T ss_pred EEEEEEEEeecCCCeEEEecCCCC---CceEEEEEeCcEEEEEEEcCCCcccccCcccCcCccccceeEEEEcCCCCceE
Confidence 456778899999999999987653 899999999999999999999987776 45889999999999999865 458
Q ss_pred EEEcCccCceeeCCCCceeeecCCCEEEcCcCCCC--ccCCCcccccCceeeeeeEEECcEEcccccccccCC-C-ccCC
Q psy5025 184 LQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYN--IVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQGV-N-VVNC 259 (533)
Q Consensus 184 L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~--~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~~-~-~~~C 259 (533)
|.||...... ...+ ...+++.+.+|+||+-... .++.......+|.||+..+-+++...+++..++... . ..+|
T Consensus 937 L~vD~s~~t~-~~~g-~~~l~l~g~LyiGGv~k~m~~~~p~~~asR~g~~g~~~s~dl~~r~p~L~~~a~~~s~lv~~~~ 1014 (1591)
T KOG3514|consen 937 LKVDNSSTTQ-IIDG-AVNLDLKGKLYIGGVSKPMYSFLPKLVASRSGFQGCLASLDLGGRLPDLISDALFESGLVEVGC 1014 (1591)
T ss_pred EEecCceEEE-EecC-ccccccccceecccccccccccccceeeccCCCCCCcCccCccccchhHHHHhhhhccceeeec
Confidence 9999865333 3333 6678888999999997654 334455567899999999999988766544433221 2 2356
Q ss_pred --C-CCCCCCCC-CCCeeeecCCceEEecCC-CCCcCcccce-eeeeecCCCeeEEecCccccccccccceeeeeeeeee
Q psy5025 260 --A-HPCITQPC-GSGSCVPLYDGYKCVCTQ-SCKESHLTEV-MCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFR 333 (533)
Q Consensus 260 --~-~~C~~~pC-ngg~C~~~~~~~~C~C~~-g~~G~~c~~~-~~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (533)
| ..|..+-| |.|.|+..|+.|.|+|++ .|.|+.|... ..+-|..+.+.+.|..++. .|
T Consensus 1015 sgpst~c~~~acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~gtTYiFgk~gglI~YtwPpN----------------dR 1078 (1591)
T KOG3514|consen 1015 SGPSTTCSEDACANHGVCIQQWNGIACDCSMTSYSGPRCNDPGTTYIFGKSGGLITYTWPPN----------------DR 1078 (1591)
T ss_pred cCCCcccchhhhhccceeeeeecceeeeccccccCCCccCCCceEEEECCCCceEEEecCCC----------------CC
Confidence 3 57999999 999999999999999998 7999999753 4578877777777766554 34
Q ss_pred ecceeeEEEEEEEeCCCCeEEEEeccCC-CCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025 334 ITGYEFNIDMRIKTSSSSGLILWTGSSQ-SKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV 412 (533)
Q Consensus 334 ~~~~~~~i~~~~rt~~~~g~ll~~~~~~-~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~ 412 (533)
....+.++.+.|+|.+++|+|+.+.+.. ..+||+|+|..|++-+.||+|...+.+......+|||++|.|++.|
T Consensus 1079 psTr~DrlAvGFsTtq~daVLvRVdSAsglgDYlqLhI~qG~igvvfNiGt~Diti~E~~~ivNDgkYHVVRFtR----- 1153 (1591)
T KOG3514|consen 1079 PSTRKDRLAVGFSTTQPDAVLVRVDSASGLGDYLQLHINQGKIGVVFNIGTDDITISEHNAIVNDGKYHVVRFTR----- 1153 (1591)
T ss_pred CCcccceEEEEEEeccCceEEEEEeccCCCCceEEEEEeccEEEEEEeccCcccccccccccccCCceEEEEEEe-----
Confidence 5567789999999999999999986554 4899999999999999999998888887788999999999999999
Q ss_pred ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEec-CCcceeeccCCceEEcCCCCcccccccccccCc
Q psy5025 413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSIS-GGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSF 491 (533)
Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~-~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f 491 (533)
.+..++|+||...+..... ......++....|-|||.... ..|
T Consensus 1154 ---------------------------~GGNATLQVD~wpV~e~yP~grqlTiFNtqa~I~IGGk~qG---------RpF 1197 (1591)
T KOG3514|consen 1154 ---------------------------SGGNATLQVDSWPVNERYPAGRQLTIFNSQARISIGGKFQG---------RPF 1197 (1591)
T ss_pred ---------------------------cCCceEEEecccchhhccCCCceeEEeeccceEEecccccC---------Ccc
Confidence 9999999999988765433 344666788888999997632 379
Q ss_pred eeeEEEEEEcCceEEeeccc
Q psy5025 492 KGWISNLILSNDYQVKLPLS 511 (533)
Q Consensus 492 ~GCi~~l~i~~~~~l~~~~~ 511 (533)
+|=|+.|.+|+.+.|++...
T Consensus 1198 qGqiSGLyyNgLkVLdlAaE 1217 (1591)
T KOG3514|consen 1198 QGQISGLYYNGLKVLDLAAE 1217 (1591)
T ss_pred cceecceEEcceeeeehhhc
Confidence 99999999988788888654
No 5
>KOG3516|consensus
Probab=100.00 E-value=1.8e-40 Score=354.23 Aligned_cols=459 Identities=21% Similarity=0.352 Sum_probs=331.6
Q ss_pred ccCCccccCCCceEEEEEEeCceeEEEEcCCCC------------------ceee---c--CCCC--CCCCCCCCCCCCC
Q psy5025 2 VKSDNQVILDDWNTVRVFRHRWDAWIQLNDGNK------------------IKGR---S--KECL--EDKCNKTPCEHGG 56 (533)
Q Consensus 2 ~~s~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~------------------~~G~---~--~~C~--~d~C~~~pC~Ngg 56 (533)
|.-.+.|+|++||.|.++|.++.+.|+||+.-. +.|. + ..-+ .-.|..|.-.|+.
T Consensus 257 is~GslLdD~hWHsV~i~r~~~~vnftvD~~~~~fr~~Ge~~~Ldld~e~~~GGiP~~~~~~~~~~nF~GCienly~N~v 336 (1306)
T KOG3516|consen 257 ISAGSLLDDQHWHSVRIERQGRQVNFTVDGVVHHFRATGEFDALDLDTEISFGGIPNDGKSVGFEKNFTGCLENLYYNGV 336 (1306)
T ss_pred eecccccCCCcceEEEEEecCcEEEEEEccceEeecccCccceeecceEEEECCccCCCcccceeeeeeeeeeeeeecCc
Confidence 445678999999999999999999999999544 1111 0 0000 2346666666766
Q ss_pred eeeeCCCCce-ecCCCCCCCCCccccccccccccCCc-eeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCC
Q psy5025 57 KCITTPGSSY-CLCPLGFTGDLCQTRLDLQVPQFNGS-SYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVG 134 (533)
Q Consensus 57 tC~~~~~~y~-C~C~~g~~G~~Ce~~~~~~~~~F~g~-syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~ 134 (533)
.=+++.-.-. =.=..|-.--.|+....+ ...|.+. +|+++|...... ...++|+|||...+|+|++...... .
T Consensus 337 diidLa~~~~~~~~~~gnv~f~C~~P~~~-pvtF~~sss~~~lpg~~~~~--~l~vSF~FRtw~~~G~ll~~~~~e~--~ 411 (1306)
T KOG3516|consen 337 DIIDLAKRRKSQISAMGNVSFSCSDPQII-PVTFGNSSSYLRLPGNPNPD--RLSVSFQFRTWNKTGLLLFSELKEG--S 411 (1306)
T ss_pred eeEeeecccccceecccceeEeccCCCCC-CeEecccceeEEcCCCCCCC--ceeeEEEEEeccccCceeeeeeccC--C
Confidence 6555431110 000001122245554332 3356544 599987655443 7899999999999999999876543 6
Q ss_pred CeEEEEEECCEEEEEEEc-CCCceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcC
Q psy5025 135 DFIALYLNDRYVDFTFDL-GTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGG 213 (533)
Q Consensus 135 ~fl~l~L~~G~l~~~~~~-g~~~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG 213 (533)
..+.|.|++|++.+.+.. +.....+.....+|||+||.|.+.+..+.+.+.||+......... ....+......|+||
T Consensus 412 g~v~~fl~eg~~~~~i~~~~r~~~~~~~g~~lnDG~WHsv~~~ak~n~~~~~iDd~~~~~~~~~-~p~~V~tg~tY~fgg 490 (1306)
T KOG3516|consen 412 GEVLLFLKEGKKFLQITQIGRSKADAYAGLKLNDGAWHSVSFNAKKNRLVLMIDDGEAEIAPDS-KPLQVYTGTTYYFGG 490 (1306)
T ss_pred ceEEEEEeCCeEEEEEeccccchhhhcccccCCCCceEEEEEEeecceeEEEEcCcccccccCC-ccEEEEeCCeeEecc
Confidence 899999999998887764 222344556688999999999999999999999999654322221 123344456899999
Q ss_pred cCCCCccCCCcccccCceeeeeeEEECcEEccccccccc-------CCCccCC--CCCCCCCCC-CCCeeeecCCceEEe
Q psy5025 214 VPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQ-------GVNVVNC--AHPCITQPC-GSGSCVPLYDGYKCV 283 (533)
Q Consensus 214 ~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~-------~~~~~~C--~~~C~~~pC-ngg~C~~~~~~~~C~ 283 (533)
.|.....-.-.....+|.||++-++++++..++.. ... .+.+..| .+.|.+||| +||.|...|+.|.|.
T Consensus 491 ~~~~~~~~~~~~~~~~f~GCmrli~vd~~~~~l~~-v~~~~~g~~~~v~id~C~i~drClPN~CehgG~C~Qs~~~f~C~ 569 (1306)
T KOG3516|consen 491 CPDKFNSWQCASPIKGFQGCMRLIKVDGQLKDLID-VKQGSLGNFSDVQIDMCGISDRCLPNPCEHGGKCSQSWDDFECN 569 (1306)
T ss_pred ccccccchhhccccccccceeEEEEECCeEeeeee-eeccccccccceeecccccccccCCccccCCCcccccccceeEe
Confidence 98774322222345789999999999999877322 111 1224567 589999999 999999999999999
Q ss_pred cC-CCCCcCcccce------------------------------------------------------------------
Q psy5025 284 CT-QSCKESHLTEV------------------------------------------------------------------ 296 (533)
Q Consensus 284 C~-~g~~G~~c~~~------------------------------------------------------------------ 296 (533)
|. .||.|..|...
T Consensus 570 C~~TGY~GatCHtsi~e~SCeay~~~~~t~~~~~iD~DGsGpl~Pl~v~C~~~ed~awTvv~H~~~~~t~V~g~n~~g~~ 649 (1306)
T KOG3516|consen 570 CELTGYKGATCHTSIYELSCEAYKNIGQTSGNFLIDSDGSGPLEPLQVYCNITEDRAWTVVQHDNLGTTRVRGSNPEGPV 649 (1306)
T ss_pred ccccccccccccCCCcchhhHHhhhhccccceEEEccCCCCcccceEEEEecccCceEEEEEeCCccceEEeccCCCCce
Confidence 99 47777211100
Q ss_pred --------------------------------------------------------------------------------
Q psy5025 297 -------------------------------------------------------------------------------- 296 (533)
Q Consensus 297 -------------------------------------------------------------------------------- 296 (533)
T Consensus 650 ~~s~~y~as~eQ~~al~n~se~CeQ~i~y~C~~sRllnt~~g~P~Swwigr~ne~~~yWGGs~Pg~qkC~Cgi~~nC~d~ 729 (1306)
T KOG3516|consen 650 AISLFYAASMEQLQALLNRSEHCEQEIEYSCRESRLLNTPDGTPFSWWIGRSNEGHVYWGGSGPGLQKCECGLLGNCLDP 729 (1306)
T ss_pred eEeeehhccHHHHHHHhhhhhhhheeeeeeeccceeeeCCCCCeeEEEecccCCccceecCCCCccceeeccccccccCc
Confidence
Q ss_pred -----------------------------------------------------------eeeeecCCCeeEEecCccccc
Q psy5025 297 -----------------------------------------------------------MCASFDHGASFLHYLDPEIIH 317 (533)
Q Consensus 297 -----------------------------------------------------------~~~~f~~~~s~~~~~~~~~~~ 317 (533)
.+++|..+++|++|+.-..
T Consensus 730 ~~~CNCDa~~~ewt~Dtg~l~~k~hLPVt~vv~gdTg~~~sea~~~lgPLrC~gDr~~wnsvSF~~~~syL~fp~f~~-- 807 (1306)
T KOG3516|consen 730 QLYCNCDADEKEWTTDTGCLAYKDHLPVTQVVIGDTGRSQSEAPYVLGPLRCEGDRNFWNSVSFHTGASYLHFPPFHN-- 807 (1306)
T ss_pred ceeeeccCCCccccccccccchhhcCCeeEEEEccCCCcccccceeecceEeecccccccceEeecCcceeecCcccC--
Confidence 1177766777888875433
Q ss_pred cccccceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEEC-CEEEEEEEcCCceEEEee-Cceee
Q psy5025 318 SSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHN-GYAHVMYNLGNGEVLLEN-NQTKI 395 (533)
Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~-g~l~~~~~~g~~~~~~~~-~~~~~ 395 (533)
.....|+|.|||+.++|++|..... .+|+.|+|.. -.+.+.++.|+++..+.. +++.+
T Consensus 808 ------------------~~saDIsf~FrTt~~~gvflen~g~--~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s~t~~ 867 (1306)
T KOG3516|consen 808 ------------------ELSADISFFFRTTASSGVFLENHGI--NDFIRLELSSPVEVTFAFDVGNGPSQLTVRSPTEL 867 (1306)
T ss_pred ------------------cccccEEEEEEecCCceEeeeccCC--CceEEEEEcCCCceEEEEEcCCCceeEEEcCCccc
Confidence 3357899999999999999998654 8999999984 479999999998765554 55799
Q ss_pred cCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecC-CcceeeccCCceEEc
Q psy5025 396 NNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISG-GKFEQLNTDSGLYVG 474 (533)
Q Consensus 396 ~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~-~~~~~l~~~~~l~lG 474 (533)
||++||+|++.| +.+...|+||+.......++ ....-|...+.+|+|
T Consensus 868 nD~qWH~V~~Er--------------------------------n~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vG 915 (1306)
T KOG3516|consen 868 NDNQWHQVRAER--------------------------------NSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVG 915 (1306)
T ss_pred CCCceEEEEEEe--------------------------------ccccceEEEcCcccceecCCCCCEEEEEeccceecc
Confidence 999999999999 99999999999876554443 334556778889999
Q ss_pred CCCCcccccccccccCceeeEEEEEEcCceEEeecccCc----CCCCcccceeeecCCce
Q psy5025 475 GVEDIEELTRSKYHHSFKGWISNLILSNDYQVKLPLSRS----QGSSCQLTCETLNTGQV 530 (533)
Q Consensus 475 g~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~~~~~~~----~~~~c~~~~~~~~~~~~ 530 (533)
|... ...+|.||||.|++ |+..|||...+. ...+|.-.|++.+|.++
T Consensus 916 gt~~--------~~~gF~GCIRsl~L-NGv~ldLe~ra~~~~gv~~GC~GhCss~~C~NG 966 (1306)
T KOG3516|consen 916 GTVS--------RQRGFLGCIRSLQL-NGVMLDLEYRAYGTAGVSPGCEGHCSSYPCLNG 966 (1306)
T ss_pred cccc--------CcCcceeeeeeeee-cceeeeehhhhccCCcccCCCccccccccccCC
Confidence 9553 34699999999999 899999976653 33589999999987654
No 6
>KOG4289|consensus
Probab=100.00 E-value=8.8e-33 Score=295.53 Aligned_cols=272 Identities=25% Similarity=0.515 Sum_probs=210.8
Q ss_pred CccccCCCceEEEEEEeCceeEEEEcCCCC--------------------------------------------------
Q psy5025 5 DNQVILDDWNTVRVFRHRWDAWIQLNDGNK-------------------------------------------------- 34 (533)
Q Consensus 5 ~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~-------------------------------------------------- 34 (533)
++.|+||+||+|.++...|.+.++||++++
T Consensus 1392 p~gvsDGqWHtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~fpv~~k~FvGC 1471 (2531)
T KOG4289|consen 1392 PGGVSDGQWHTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETFPVIEKQFVGC 1471 (2531)
T ss_pred CCCcccCceeEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcchhhHhHhhhh
Confidence 568999999999999999999999999988
Q ss_pred ------------------ceeecCCCC--CCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccccccccccccCCcee
Q psy5025 35 ------------------IKGRSKECL--EDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDLQVPQFNGSSY 94 (533)
Q Consensus 35 ------------------~~G~~~~C~--~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~sy 94 (533)
..|...+|. .+.|+..+|+|+|+|++.|++|.|.||.+|.|+.|++.... .-.|.|.|-
T Consensus 1472 mrdLsvD~~~VDma~fianngt~eGC~ark~fCdsg~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~-pq~frG~sl 1550 (2531)
T KOG4289|consen 1472 MRDLSVDGRDVDMATFIANNGTHEGCKARKNFCDSGQCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAH-PQHFRGHSL 1550 (2531)
T ss_pred hhhcccccccccHHHHHhhcCcccCchhhhcccCCCccCCCCeeecccCcEeecCccccCCcchhhccCC-chhccccce
Confidence 122256888 68999999999999999999999999999999999987431 236999999
Q ss_pred EEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE-eeccccCCCcEEE
Q psy5025 95 LRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR-SSNPISLGEWHKL 173 (533)
Q Consensus 95 l~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~-s~~~lnDg~Wh~V 173 (533)
+.+...+...+..+.++|.|||++.||+||.+.... ..-+.|+|.+|.|.+.+.. ..+. ....++||+||++
T Consensus 1551 ~sw~~~~~~vSvPwylsl~FRTr~ad~vl~~~~~~~---rst~~lqld~g~l~~~v~~----s~v~L~~~~vtdg~Wh~~ 1623 (2531)
T KOG4289|consen 1551 VSWEGLPSQVSVPWYLSLMFRTRRADGVLMQAEFGG---RSTYNLQLDDGTLKYNVGD----SSVELPAPRVTDGHWHHL 1623 (2531)
T ss_pred eeecCCCcceecceEEEEEEEeeccccEEEEEEeCC---CceEEEEEcCCEEEEEecC----ceEEccCccccCCchhhe
Confidence 999877766667899999999999999999986543 4559999999999987743 3333 3578999999999
Q ss_pred EEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEccccccccc-
Q psy5025 174 RLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQ- 252 (533)
Q Consensus 174 ~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~- 252 (533)
.|+.... ..+.+|...-+.....+ ...+.+ ..+|+||.|... ...+|.|||+++.++|..+. .++...
T Consensus 1624 ~i~l~~d-~~~t~d~g~~~aea~~g-l~gl~l-~sl~vGgap~~g-------~p~gf~GCiqgV~v~g~~~l-~~~kv~~ 1692 (2531)
T KOG4289|consen 1624 VIELEAD-SVATLDYGIYQAEAKAG-LSGLNL-ESLYVGGAPATG-------VPRGFRGCIQGVRVGGVSIL-VPKKVNV 1692 (2531)
T ss_pred eeeeccC-eEEEEechhhhhhhhcC-CCCcee-eEEEEccccCCC-------ccccchhhhhceEECCEeec-ccccccc
Confidence 9999875 45566653322211111 334444 369999999442 34899999999999998864 221110
Q ss_pred --CCCc--------------cCCCCCCCCCCC-CCCeeeecC--CceEEecCCCCCcCcccc
Q psy5025 253 --GVNV--------------VNCAHPCITQPC-GSGSCVPLY--DGYKCVCTQSCKESHLTE 295 (533)
Q Consensus 253 --~~~~--------------~~C~~~C~~~pC-ngg~C~~~~--~~~~C~C~~g~~G~~c~~ 295 (533)
+-.+ ..|-+.|.-+|| |.|+|+... .+|+|+|++||.|..|+.
T Consensus 1693 ~~GCvvpn~C~~d~sC~c~~~~C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~ 1754 (2531)
T KOG4289|consen 1693 EAGCVVPNPCSVDSSCPCDPYNCVDVCSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCEL 1754 (2531)
T ss_pred ccCcccCCccccCCcccCCCCCccchhcccccccCceeecCCCCCceeEECCCcccCcchhh
Confidence 0000 123356778999 999999876 579999999999998875
No 7
>KOG1219|consensus
Probab=99.95 E-value=2e-27 Score=262.48 Aligned_cols=245 Identities=27% Similarity=0.613 Sum_probs=198.9
Q ss_pred CCC--CCCCCCCCCCCCCeeeeCC--CCceecCCCCCCCCCccccccccccccCCceeEEEcCCcccccceeEEEEEEEe
Q psy5025 41 ECL--EDKCNKTPCEHGGKCITTP--GSSYCLCPLGFTGDLCQTRLDLQVPQFNGSSYLRYQGLGEQYLSLLDLTIVFKA 116 (533)
Q Consensus 41 ~C~--~d~C~~~pC~NggtC~~~~--~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~syl~~~~~~~~~~~~~~ls~~FrT 116 (533)
.|+ .+.|...||+.+..|+... ..|.|.||.|..| .|+... ..++.|+||++|....... ..+.+.|+.||
T Consensus 3641 ~Cp~~~~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~~---elS~tGnSYveyrlse~~n-~~~kl~frLkT 3715 (4289)
T KOG1219|consen 3641 FCPVETNQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGDF---ELSSTGNSYVEYRLSENQN-TRMKLGFRLKT 3715 (4289)
T ss_pred cCCcccCccccCCCcccCcccccccccceeEeccCcccc-cCCCcc---eEeecCceeEEEEcccccc-cceEEEEEEEe
Confidence 476 6899999999999999886 7899999998555 488774 5789999999999755443 34899999999
Q ss_pred cCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEee-ccccCCCcEEEEEEEECCEEEEEEcCccCceee
Q psy5025 117 IEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWHKLRLTRTGRHAYLQVDRFPSSQIL 195 (533)
Q Consensus 117 ~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~-~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~ 195 (533)
..++|+++|... .++..|.|.+|.+.+.++.|+++-.+.+. ..++||+||.|.+.+.++++.|.||+....+..
T Consensus 3716 ~~sngIiM~tr~-----~d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsialerrr~~irlsvDd~~~~~at 3790 (4289)
T KOG1219|consen 3716 LQSNGIIMYTRK-----TDLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIALERRRNHIRLSVDDDTYDSAT 3790 (4289)
T ss_pred cccCcEEEEEcC-----CceEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEeeccCCceEEEEcccCceeee
Confidence 999999999852 79999999999999999999988666665 789999999999999999999999998888888
Q ss_pred CCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEccccccc------ccCCC-ccCC---------
Q psy5025 196 SPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEA------IQGVN-VVNC--------- 259 (533)
Q Consensus 196 ~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~------~~~~~-~~~C--------- 259 (533)
.|+....++.+..||+|+.-..... .......+|.|||+.+.+|+..+++..+. +.... ..+|
T Consensus 3791 vPg~~~tln~d~hiy~Ga~vrlr~~-~~tqvs~Gf~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~ 3869 (4289)
T KOG1219|consen 3791 VPGMKSTLNLDTHIYLGALVRLRHQ-RSTQVSYGFDGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCND 3869 (4289)
T ss_pred cccceeeccccceEEEeeEeeeccC-CCccccccccceeeeEEEccccccccCCCchhhhhhhhhccccccccccccccc
Confidence 9998888988899999998752211 22245689999999999999998864332 11000 1122
Q ss_pred ---------------------C------------CCCCCCCC-CCCeeeecCCceEEecCCCCCcCcccce
Q psy5025 260 ---------------------A------------HPCITQPC-GSGSCVPLYDGYKCVCTQSCKESHLTEV 296 (533)
Q Consensus 260 ---------------------~------------~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~~c~~~ 296 (533)
| .+|.++|| +||+|+...++|.|.||.||+|.+|+..
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~ 3940 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR 3940 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc
Confidence 1 26888888 8888888888888888888888888764
No 8
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.90 E-value=9.2e-23 Score=178.06 Aligned_cols=129 Identities=35% Similarity=0.595 Sum_probs=106.8
Q ss_pred EEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCc
Q psy5025 345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKS 424 (533)
Q Consensus 345 ~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~ 424 (533)
|||.+++|+|||.+.....+|++|+|++|+|+++++.|+++..+... .+++||+||+|.+.|
T Consensus 1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~~~~~~~~-~~i~dg~wh~v~~~r----------------- 62 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSGPASLRSP-QKINDGKWHTVSVSR----------------- 62 (131)
T ss_dssp EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSEEEEEEES-SETTSSSEEEEEEEE-----------------
T ss_pred CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCccceecCC-CccCCCcceEEEEEE-----------------
Confidence 89999999999998777789999999999999999999998777744 459999999999999
Q ss_pred chhhhhhhhhccccccCcEEEEEECCceeeeEecCCccee-eccCCceEEcCCCCc-ccccccccccCceeeEEEEEEcC
Q psy5025 425 SKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQ-LNTDSGLYVGGVEDI-EELTRSKYHHSFKGWISNLILSN 502 (533)
Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~-l~~~~~l~lGg~~~~-~~~~~~~~~~~f~GCi~~l~i~~ 502 (533)
.++.+.|+||+.......++..... ++.+..|||||+|.. ..........+|+|||+++.| +
T Consensus 63 ---------------~~~~~~L~Vd~~~~~~~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~i-n 126 (131)
T PF00054_consen 63 ---------------NGRNGSLSVDGEEVVTGESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSI-N 126 (131)
T ss_dssp ---------------ETTEEEEEETTSEEEEEEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEE-T
T ss_pred ---------------cCcEEEEEECCccceeeecCCccccccccccCEEEccCCchhhcccccccCCCeeEEEEEeEE-C
Confidence 9999999999998855555544334 788888999999933 333445577899999999999 7
Q ss_pred ceEEe
Q psy5025 503 DYQVK 507 (533)
Q Consensus 503 ~~~l~ 507 (533)
+.++|
T Consensus 127 ~~~ld 131 (131)
T PF00054_consen 127 GKPLD 131 (131)
T ss_dssp TEEC-
T ss_pred CEECc
Confidence 77765
No 9
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.89 E-value=3.8e-22 Score=174.14 Aligned_cols=130 Identities=40% Similarity=0.704 Sum_probs=108.8
Q ss_pred EEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCce
Q psy5025 114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQ 193 (533)
Q Consensus 114 FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~ 193 (533)
|||..++|+|||.+.... .||++|+|.+|+|+++++.|.++..+.++..++||+||+|.+.|....+.|.||+.....
T Consensus 1 frT~~~~Gllly~g~~~~--~dfial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~~~ 78 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDG--KDFIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGEEVVT 78 (131)
T ss_dssp EEESSSSEEEEEEESSTT--SSEEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTSEEEE
T ss_pred CccCCCCceEEECCcCCC--CCEEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCcccee
Confidence 899999999999987654 599999999999999999999988888888899999999999999999999999977634
Q ss_pred eeCCCCcee-eecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025 194 ILSPGPFTQ-LSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN 245 (533)
Q Consensus 194 ~~~~~~~~~-l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~ 245 (533)
...+..... ++....+||||+|.....+.......+|.|||+++.+|+++++
T Consensus 79 ~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in~~~ld 131 (131)
T PF00054_consen 79 GESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSINGKPLD 131 (131)
T ss_dssp EEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred eecCCccccccccccCEEEccCCchhhcccccccCCCeeEEEEEeEECCEECc
Confidence 444444444 7777889999999444444455667899999999999999875
No 10
>smart00282 LamG Laminin G domain.
Probab=99.88 E-value=1.6e-21 Score=171.60 Aligned_cols=132 Identities=33% Similarity=0.628 Sum_probs=110.6
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCc
Q psy5025 338 EFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSS 417 (533)
Q Consensus 338 ~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~ 417 (533)
.++|+|.|||.+++|+||+..+....+|+.|+|.+|++++.++.+++...+......++||+||+|.+.+
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~---------- 71 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVER---------- 71 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEE----------
Confidence 3689999999999999999976445899999999999999999988777766555899999999999999
Q ss_pred ccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEE
Q psy5025 418 HTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISN 497 (533)
Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~ 497 (533)
.++.+.|+||+........++....++....+||||+|+...........+|+|||++
T Consensus 72 ----------------------~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~F~GCi~~ 129 (135)
T smart00282 72 ----------------------NGRRVTLSVDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLLVTPGFRGCIRN 129 (135)
T ss_pred ----------------------eCCEEEEEECCCccccEECCCCceEEecCCCcEEccCCchhcccccccCCCCeeEeeE
Confidence 8999999999976554444444566777788999999987544344567899999999
Q ss_pred EEEc
Q psy5025 498 LILS 501 (533)
Q Consensus 498 l~i~ 501 (533)
|+||
T Consensus 130 v~in 133 (135)
T smart00282 130 LKVN 133 (135)
T ss_pred EEEC
Confidence 9994
No 11
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.87 E-value=6.7e-21 Score=170.74 Aligned_cols=150 Identities=31% Similarity=0.558 Sum_probs=119.9
Q ss_pred eeecCCCeeEEecCccccccccccceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEE
Q psy5025 299 ASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVM 378 (533)
Q Consensus 299 ~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~ 378 (533)
+.| .+.+|+.++.+... ...++|+|+|||.+++|+||+.+.....+|+.|+|.+|++++.
T Consensus 2 ~~F-~g~~~i~~~~~~~~-------------------~~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~ 61 (151)
T cd00110 2 VSF-SGSSYVRLPTLPAP-------------------RTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLR 61 (151)
T ss_pred eEe-CCCceEEecCCCCC-------------------cceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEE
Confidence 567 66799999976541 4568999999999999999999876568999999999999999
Q ss_pred EEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEec
Q psy5025 379 YNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSIS 458 (533)
Q Consensus 379 ~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~ 458 (533)
++.+.....+. ...+++||+||+|.+.+ .++.+.|+||+........
T Consensus 62 ~~~g~~~~~~~-~~~~v~dg~Wh~v~i~~--------------------------------~~~~~~l~VD~~~~~~~~~ 108 (151)
T cd00110 62 YDLGSGSLVLS-SKTPLNDGQWHSVSVER--------------------------------NGRSVTLSVDGERVVESGS 108 (151)
T ss_pred EcCCcccEEEE-ccCccCCCCEEEEEEEE--------------------------------CCCEEEEEECCccEEeeeC
Confidence 99985555555 33479999999999999 8899999999985444433
Q ss_pred CCcceeeccCCceEEcCCCCcccccccccccCceeeEEEEEEc
Q psy5025 459 GGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILS 501 (533)
Q Consensus 459 ~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~ 501 (533)
+.....+.....+||||+|............+|+|||++|+||
T Consensus 109 ~~~~~~~~~~~~~~iGg~~~~~~~~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 109 PGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred CCCceeecCCCCeEEcCCCCchhcccccccCCCceEeeEeEeC
Confidence 3332246677889999999865433345678999999999983
No 12
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.85 E-value=5.1e-20 Score=165.01 Aligned_cols=150 Identities=37% Similarity=0.691 Sum_probs=120.5
Q ss_pred cccCCceeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeecccc
Q psy5025 87 PQFNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPIS 166 (533)
Q Consensus 87 ~~F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~ln 166 (533)
..|+|.+|+.++.+... ...++|+|+|||.+++|+||+.+... ..+|+.|+|.+|+|.+.++.+.....+.+...++
T Consensus 2 ~~F~g~~~i~~~~~~~~-~~~~~i~~~frt~~~~g~l~~~~~~~--~~~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~ 78 (151)
T cd00110 2 VSFSGSSYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQN--GGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLN 78 (151)
T ss_pred eEeCCCceEEecCCCCC-cceeEEEEEEEeCCCCeEEEEecCCC--CCCEEEEEEECCEEEEEEcCCcccEEEEccCccC
Confidence 46899999999986654 45799999999999999999998753 3899999999999999999986666777666899
Q ss_pred CCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEEC
Q psy5025 167 LGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN 240 (533)
Q Consensus 167 Dg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in 240 (533)
||+||+|.+.+.++.+.|.||+........+.....+.....+||||.|..... .......+|.|||+++++|
T Consensus 79 dg~Wh~v~i~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~-~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKS-PGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred CCCEEEEEEEECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhc-ccccccCCCceEeeEeEeC
Confidence 999999999999999999999974333333322223556688999999976432 2234568999999999987
No 13
>smart00282 LamG Laminin G domain.
Probab=99.84 E-value=9.9e-20 Score=160.10 Aligned_cols=133 Identities=38% Similarity=0.719 Sum_probs=111.4
Q ss_pred eeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEee-ccccCCCcEEEEEEEECCEEEEE
Q psy5025 107 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWHKLRLTRTGRHAYLQ 185 (533)
Q Consensus 107 ~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~-~~lnDg~Wh~V~v~~~~~~~~L~ 185 (533)
.++|+|+|||.+++|+|||.++.. ..+|+.|+|.+|+|.+.++.+++...+... ..++||+||+|.+.+.++.+.|.
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~--~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~~~l~ 79 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKN--GGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRRVTLS 79 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCC--CCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCEEEEE
Confidence 468999999999999999997642 389999999999999999998877777776 89999999999999999999999
Q ss_pred EcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcE
Q psy5025 186 VDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNH 242 (533)
Q Consensus 186 VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~ 242 (533)
||+........++....++....+||||.|..... .......+|.|||+++++|+.
T Consensus 80 VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~-~~~~~~~~F~GCi~~v~in~~ 135 (135)
T smart00282 80 VDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKL-PPLLVTPGFRGCIRNLKVNGK 135 (135)
T ss_pred ECCCccccEECCCCceEEecCCCcEEccCCchhcc-cccccCCCCeeEeeEEEECCC
Confidence 99976555555555566777789999999976432 223456899999999999973
No 14
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.82 E-value=3.1e-19 Score=154.97 Aligned_cols=126 Identities=29% Similarity=0.570 Sum_probs=104.4
Q ss_pred EEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCc
Q psy5025 345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKS 424 (533)
Q Consensus 345 ~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~ 424 (533)
|||.+++|+|||.......+|+.|+|.+|+|++.++.++...........++||+||.|.+.+
T Consensus 1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~----------------- 63 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISR----------------- 63 (128)
T ss_dssp EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEE-----------------
T ss_pred CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEE-----------------
Confidence 899999999999987654579999999999999999996666666688889999999999999
Q ss_pred chhhhhhhhhccccccCcEEEEEECCceeeeEecCCcce-eeccCCceEEcCCCCcccccccccccCceeeEEEEEEcCc
Q psy5025 425 SKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFE-QLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSND 503 (533)
Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~-~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~ 503 (533)
.+..++|.||+............. .++....+||||.|............+|+|||++|+| |+
T Consensus 64 ---------------~~~~~~l~Vd~~~~~~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~~~~f~Gci~~l~v-ng 127 (128)
T PF02210_consen 64 ---------------DGNRVTLTVDGQSVSSESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVDTPGFVGCIRDLRV-NG 127 (128)
T ss_dssp ---------------ETTEEEEEETTSEEEEEESSSTTHHCBESEEEEEESSTTTTCTCTTSSTTSB-EEEEEEEEE-TT
T ss_pred ---------------eeeeEEEEecCccceEEeccccceecccCCCCEEEecccCccccccccCCCCcEEEcCeEEE-CC
Confidence 889999999999888766554433 5666667999999988654443348899999999999 54
No 15
>KOG1219|consensus
Probab=99.81 E-value=5.6e-19 Score=196.73 Aligned_cols=195 Identities=23% Similarity=0.451 Sum_probs=161.0
Q ss_pred CCCCCCCC-CCCeeeec-C-CceEEecCCCCCcCcccceeeeeecCCCeeEEecCccccccccccceeeeeeeeeeecce
Q psy5025 261 HPCITQPC-GSGSCVPL-Y-DGYKCVCTQSCKESHLTEVMCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGY 337 (533)
Q Consensus 261 ~~C~~~pC-ngg~C~~~-~-~~~~C~C~~g~~G~~c~~~~~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (533)
++|...|| .+..|+.. | ..|.|.||.|-.|. |+.-...++ .|.||+++...+.. ..
T Consensus 3647 ~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g~-C~g~~elS~-tGnSYveyrlse~~-------------------n~ 3705 (4289)
T KOG1219|consen 3647 NQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFGF-CQGDFELSS-TGNSYVEYRLSENQ-------------------NT 3705 (4289)
T ss_pred CccccCCCcccCcccccccccceeEeccCccccc-CCCcceEee-cCceeEEEEccccc-------------------cc
Confidence 68999999 88999976 4 45999999985554 554456777 89999999987653 23
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCc
Q psy5025 338 EFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSS 417 (533)
Q Consensus 338 ~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~ 417 (533)
.+.+.|++||.+++|+++|+.. .++..|.|.+|.+.+.++-|+++-.+....+.++||+||.|.+.|
T Consensus 3706 ~~kl~frLkT~~sngIiM~tr~---~d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsialer---------- 3772 (4289)
T KOG1219|consen 3706 RMKLGFRLKTLQSNGIIMYTRK---TDLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIALER---------- 3772 (4289)
T ss_pred ceEEEEEEEecccCcEEEEEcC---CceEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEeec----------
Confidence 4889999999999999999862 689999999999999999999886666455899999999999999
Q ss_pred ccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEE
Q psy5025 418 HTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISN 497 (533)
Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~ 497 (533)
.++.++|.||+........++....++.+..||+||+-.....+.-....+|.|||..
T Consensus 3773 ----------------------rr~~irlsvDd~~~~~atvPg~~~tln~d~hiy~Ga~vrlr~~~~tqvs~Gf~GClds 3830 (4289)
T KOG1219|consen 3773 ----------------------RRNHIRLSVDDDTYDSATVPGMKSTLNLDTHIYLGALVRLRHQRSTQVSYGFDGCLDS 3830 (4289)
T ss_pred ----------------------cCCceEEEEcccCceeeecccceeeccccceEEEeeEeeeccCCCccccccccceeee
Confidence 8899999999999888888888888998999999998762221222467899999999
Q ss_pred EEEcCceEEeecccC
Q psy5025 498 LILSNDYQVKLPLSR 512 (533)
Q Consensus 498 l~i~~~~~l~~~~~~ 512 (533)
+.+ |+..|++....
T Consensus 3831 iyL-ng~el~l~~k~ 3844 (4289)
T KOG1219|consen 3831 IYL-NGMELPLTRKG 3844 (4289)
T ss_pred EEE-ccccccccCCC
Confidence 999 77777775543
No 16
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.77 E-value=9.4e-18 Score=145.66 Aligned_cols=126 Identities=35% Similarity=0.635 Sum_probs=102.4
Q ss_pred EEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCc-eEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCc
Q psy5025 114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-ATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSS 192 (533)
Q Consensus 114 FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~-~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~ 192 (533)
|||++++|+|||.+.... .+|+.|+|.+|+|++.++.+... .....+..++||+||.|.+.+....+.|.||+....
T Consensus 1 Frt~~~~g~Ll~~~~~~~--~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~~~~ 78 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDN--GDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDGQSVS 78 (128)
T ss_dssp EEESSSSEEEEEEEESTT--SEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETTSEEE
T ss_pred CccCCCCEeEEEEcCCCC--CEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecCccce
Confidence 899999999999988753 68999999999999999999433 445567889999999999999999999999998766
Q ss_pred eeeCCCCce-eeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcE
Q psy5025 193 QILSPGPFT-QLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNH 242 (533)
Q Consensus 193 ~~~~~~~~~-~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~ 242 (533)
......... .++....+||||.|.....+.... ..+|.|||+++++|++
T Consensus 79 ~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~-~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 79 SESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVD-TPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp EEESSSTTHHCBESEEEEEESSTTTTCTCTTSST-TSB-EEEEEEEEETTE
T ss_pred EEeccccceecccCCCCEEEecccCccccccccC-CCCcEEEcCeEEECCC
Confidence 655554443 555667799999998765333222 6899999999999985
No 17
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=99.06 E-value=7.3e-09 Score=95.76 Aligned_cols=125 Identities=15% Similarity=0.281 Sum_probs=88.7
Q ss_pred ceeeEEEEEEEeC-CCCeEEEEeccCCCCCeEEEEEECCEEEEEEEc--CCc-eEEEeeCceeecCCCeEEEEEEEeeee
Q psy5025 336 GYEFNIDMRIKTS-SSSGLILWTGSSQSKDFLALGIHNGYAHVMYNL--GNG-EVLLENNQTKINNGVWHRLTVHRNFMV 411 (533)
Q Consensus 336 ~~~~~i~~~~rt~-~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~--g~~-~~~~~~~~~~~~dg~wH~v~~~~~~~~ 411 (533)
..++.|.+++|+. +..|.||...+.+...+|.|.+..+...+.+.. ..+ .....+....+.||+||+|.+..
T Consensus 51 ~~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal~V---- 126 (184)
T smart00210 51 PEDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLALSV---- 126 (184)
T ss_pred CCCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEEEE----
Confidence 4578999999997 678899988765567789998876664555442 222 33444455789999999999999
Q ss_pred eecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcc-eeeccCCceEEcCCCCcccccccccccC
Q psy5025 412 VMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKF-EQLNTDSGLYVGGVEDIEELTRSKYHHS 490 (533)
Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~-~~l~~~~~l~lGg~~~~~~~~~~~~~~~ 490 (533)
.+..++|+||............. ..+..++ ++++|.... ....
T Consensus 127 ----------------------------~~~~v~LyvDC~~~~~~~l~~~~~~~~~~~g-~~~~g~~~~-------~~~~ 170 (184)
T smart00210 127 ----------------------------SGSSATLYVDCNEIDSRPLDRPGQPPIDTDG-IEVRGAQAA-------DRKP 170 (184)
T ss_pred ----------------------------eCCEEEEEECCccccceecCCcccccccccc-eEEEeeccC-------CCCc
Confidence 88999999999987665432222 1344444 556554432 1348
Q ss_pred ceeeEEEEEE
Q psy5025 491 FKGWISNLIL 500 (533)
Q Consensus 491 f~GCi~~l~i 500 (533)
|+|+|++|+|
T Consensus 171 f~G~lq~l~i 180 (184)
T smart00210 171 FQGDLQQLKI 180 (184)
T ss_pred ceEEeEEEEE
Confidence 9999999998
No 18
>KOG3509|consensus
Probab=98.73 E-value=2.8e-08 Score=109.38 Aligned_cols=250 Identities=19% Similarity=0.296 Sum_probs=159.9
Q ss_pred CCCCCCCCCCCCCCCCCeeeeCC-CCceecCCCCCCCCCccccccccccccCCc-----eeEEEcC-------------C
Q psy5025 40 KECLEDKCNKTPCEHGGKCITTP-GSSYCLCPLGFTGDLCQTRLDLQVPQFNGS-----SYLRYQG-------------L 100 (533)
Q Consensus 40 ~~C~~d~C~~~pC~NggtC~~~~-~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~-----syl~~~~-------------~ 100 (533)
..++...|...+|+-|++|.... -+|.+.|+..-.+..|+.....++..|... ....+.. +
T Consensus 168 ~~~~~~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g~~~~~~~~vp~~~e~S~~~~~~~ 247 (964)
T KOG3509|consen 168 PGTEPIHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEGCLKCFCFGVPRPSESSLHAFRAI 247 (964)
T ss_pred CccccccccCcccccceeEEecCCcceeeeccccccceehhhccCceecccccccccccceeecCCCccccchhhhHhhh
Confidence 34556678889999999999986 788888888777777888776555544321 1111111 0
Q ss_pred cccccceeEEEEEEEec--CCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccCCCcEEEEEEEE
Q psy5025 101 GEQYLSLLDLTIVFKAI--EPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRT 178 (533)
Q Consensus 101 ~~~~~~~~~ls~~FrT~--~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnDg~Wh~V~v~~~ 178 (533)
.........+.. |++. ..+.++.+.... ...+|+++.+..|-+.+.+..+.+...+......-.++|+.+.+.|
T Consensus 248 h~~~~~~~~~~~-~~~~s~~~~~~~~~~~~~--~~~~f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r- 323 (964)
T KOG3509|consen 248 HGATLHVDSLGV-FFSDSEHRDILGNFLFSS--FKDGFRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFR- 323 (964)
T ss_pred ccchhccchhee-eccccccccccccccccc--cccceeeeccCCCCccccccccCcchhhhhccccccceeeeeEeee-
Confidence 111111112222 2232 234444444332 2378888888777777777766665555566777889999999998
Q ss_pred CCEEEEEEcCcc-CceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcccccccccCCCc-
Q psy5025 179 GRHAYLQVDRFP-SSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQGVNV- 256 (533)
Q Consensus 179 ~~~~~L~VD~~~-~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~~~~- 256 (533)
.-.+.+++.. ............+.....+|+|++-....+........+|.|||+++.++.+.++...+....+..
T Consensus 324 --~s~~~~~g~~~~l~g~~~~~~~~i~~ee~v~lg~i~ni~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~ 401 (964)
T KOG3509|consen 324 --GSGLSVSGHKGVLQGNSNILVSRITNEESVFLGGIINIETLQHNLPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQ 401 (964)
T ss_pred --cccccccCcceeecccccccccceeecccccCCceeeeccccccCCCccCccceehhhhhhccccccccccccccccc
Confidence 2233444421 111222233334445556888885555445555566789999999999999988754433333322
Q ss_pred cCC-CCCCCCCCC-CCCeeeecCCceEEecCCCCCcCcccc
Q psy5025 257 VNC-AHPCITQPC-GSGSCVPLYDGYKCVCTQSCKESHLTE 295 (533)
Q Consensus 257 ~~C-~~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~~c~~ 295 (533)
..| .+.|...|| ..+.|........|.|+++|.|..|+.
T Consensus 402 ~~c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d 442 (964)
T KOG3509|consen 402 GTCLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCED 442 (964)
T ss_pred cccCCCccccccCCCCccccccccccceeccccccCchhhc
Confidence 245 578999999 999999999999999999999976654
No 19
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.70 E-value=6.2e-07 Score=82.91 Aligned_cols=124 Identities=21% Similarity=0.262 Sum_probs=85.8
Q ss_pred ceeEEEEEEEec-CCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEc--CCC-ceEEEee-ccccCCCcEEEEEEEECC
Q psy5025 106 SLLDLTIVFKAI-EPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDL--GTG-AATLRSS-NPISLGEWHKLRLTRTGR 180 (533)
Q Consensus 106 ~~~~ls~~FrT~-~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~--g~~-~~~v~s~-~~lnDg~Wh~V~v~~~~~ 180 (533)
..+.|.++||+. ...|.||-..+.. ...++.|.+..++..+.+.. ..+ ...+... ..+.||+||+|.+...+.
T Consensus 52 ~~fsi~~~~r~~~~~~g~L~si~~~~--~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal~V~~~ 129 (184)
T smart00210 52 EDFSLLTTFRQTPKSRGVLFAIYDAQ--NVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLALSVSGS 129 (184)
T ss_pred CCeEEEEEEEeCCCCCeEEEEEEcCC--CcEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEEEEeCC
Confidence 468999999986 7788888886642 26789999987765555553 222 2333444 779999999999999999
Q ss_pred EEEEEEcCccCceeeCCCCc-eeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEEC
Q psy5025 181 HAYLQVDRFPSSQILSPGPF-TQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN 240 (533)
Q Consensus 181 ~~~L~VD~~~~~~~~~~~~~-~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in 240 (533)
.++|+||............. ..+.. ..++++|..... ...|.|+|++++|-
T Consensus 130 ~v~LyvDC~~~~~~~l~~~~~~~~~~-~g~~~~g~~~~~--------~~~f~G~lq~l~i~ 181 (184)
T smart00210 130 SATLYVDCNEIDSRPLDRPGQPPIDT-DGIEVRGAQAAD--------RKPFQGDLQQLKIV 181 (184)
T ss_pred EEEEEECCccccceecCCcccccccc-cceEEEeeccCC--------CCcceEEeEEEEEe
Confidence 99999999776554433222 12333 335555544321 25899999999884
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.69 E-value=5.9e-09 Score=66.33 Aligned_cols=31 Identities=48% Similarity=1.266 Sum_probs=29.2
Q ss_pred CCCCCCCCCCeeeeCC-CCceecCCCCCCCCC
Q psy5025 47 CNKTPCEHGGKCITTP-GSSYCLCPLGFTGDL 77 (533)
Q Consensus 47 C~~~pC~NggtC~~~~-~~y~C~C~~g~~G~~ 77 (533)
|.++||+|+|+|++.. ++|+|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 7789999999999999 999999999999974
No 21
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.67 E-value=5.4e-07 Score=80.19 Aligned_cols=149 Identities=19% Similarity=0.350 Sum_probs=89.7
Q ss_pred eeeecCCCeeEEecCccccccccccceeeeeeeeeeecceeeEEEEEEEeCCCCe--EEEEeccCCCCCeEEEEEE-CCE
Q psy5025 298 CASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSG--LILWTGSSQSKDFLALGIH-NGY 374 (533)
Q Consensus 298 ~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g--~ll~~~~~~~~~~l~l~l~-~g~ 374 (533)
++.|+.+.+|+.++.... ....++|+++||.....+ .++.. .....+.+.|.+. +|.
T Consensus 2 a~~f~g~~~~i~~~~~~~-------------------~~~~fTi~~w~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~ 61 (157)
T PF13385_consen 2 ALYFDGSNDYISIPNSDF-------------------PSGSFTISFWVKPDSPSSSQSFVFM-DSSGSGGFGLFINNNGR 61 (157)
T ss_dssp EEEE-STT-EEEEESGGG-------------------GGTEEEEEEEEEESS--SSEEEEEE-SSSSSEEEEEEEETTSE
T ss_pred EEEECCCCCEEEECCcCC-------------------CCCCEEEEEEEEeCCCCCCceEEEE-ecCCCCEEEEEEECCCE
Confidence 467767889999886332 145789999999986332 23322 2222457777776 577
Q ss_pred EEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceee
Q psy5025 375 AHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSM 454 (533)
Q Consensus 375 l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~ 454 (533)
+.+.+..+.+..........+.+++||+|.+.+ .+..+.|+|||....
T Consensus 62 l~~~~~~~~~~~~~~~~~~~~~~~~W~~l~~~~--------------------------------~~~~~~lyvnG~~~~ 109 (157)
T PF13385_consen 62 LRFYIGNGGGGNYSFSSDSNLPDNKWHHLALTY--------------------------------DGSTVTLYVNGELVG 109 (157)
T ss_dssp EEEEETTSEEESS-EE-BS---TT-EEEEEEEE--------------------------------ETTEEEEEETTEEET
T ss_pred EEEEEeCCCceeEEEecCcccCCCCEEEEEEEE--------------------------------ECCeEEEEECCEEEE
Confidence 777766553211122256789999999999999 888999999999776
Q ss_pred eEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEEEEEcCceEEee
Q psy5025 455 QSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSNDYQVKL 508 (533)
Q Consensus 455 ~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~~ 508 (533)
....... ........++||+... ....|.|-|.+|+| ..+.|.-
T Consensus 110 ~~~~~~~-~~~~~~~~~~iG~~~~--------~~~~~~g~i~~~~i-~~~aLt~ 153 (157)
T PF13385_consen 110 SSTIPSN-ISLNSNGPLFIGGSGG--------GSSPFNGYIDDLRI-YNRALTA 153 (157)
T ss_dssp TCTEESS-SSTTSCCEEEESS-ST--------T--B-EEEEEEEEE-ESS---H
T ss_pred eEeccCC-cCCCCcceEEEeecCC--------CCCceEEEEEEEEE-ECccCCH
Confidence 5433222 1124556799998662 24589999999999 6666543
No 22
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.46 E-value=5.5e-06 Score=78.02 Aligned_cols=140 Identities=14% Similarity=0.117 Sum_probs=89.1
Q ss_pred ceeeEEEEEEEeCC--CCeEEEEeccCCCCCeEEEEE-ECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025 336 GYEFNIDMRIKTSS--SSGLILWTGSSQSKDFLALGI-HNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV 412 (533)
Q Consensus 336 ~~~~~i~~~~rt~~--~~g~ll~~~~~~~~~~l~l~l-~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~ 412 (533)
...++|.+++|+.. ..+.||........+-+.+.. .++.+.+.+ ++.... ....+.||+||+|.+.++.
T Consensus 30 l~~fTvc~W~k~~~~~~~~~ifSy~~~~~~ne~~~~~~~~~~~~l~i--~g~~~~---~~~~~~~g~W~hvc~tw~~--- 101 (206)
T smart00159 30 LQAFTVCLWFYSDLSPRGYSLFSYATKGQDNELLLYKEKQGEYSLYI--GGKKVQ---FPVPESDGKWHHICTTWES--- 101 (206)
T ss_pred hhHEEEEEEEEecCCCCceEEEEEeCCCCCCeEEEEEcCCcEEEEEE--cCeEEE---ecccccCCceEEEEEEEEC---
Confidence 45789999999975 456676554443344444444 355555554 333222 2356899999999999932
Q ss_pred ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCce
Q psy5025 413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFK 492 (533)
Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~ 492 (533)
..+.+.|+|||..... ........+...+.|.||.-.+..- ........|.
T Consensus 102 ---------------------------~~g~~~lyvnG~~~~~-~~~~~g~~i~~~G~lvlGq~qd~~g-g~f~~~~~f~ 152 (206)
T smart00159 102 ---------------------------SSGIAELWVDGKPGVR-KGLAKGYTVKPGGSIILGQEQDSYG-GGFDATQSFV 152 (206)
T ss_pred ---------------------------CCCcEEEEECCEEccc-ccccCCcEECCCCEEEEEecccCCC-CCCCCCccee
Confidence 4667999999987622 2222233455666788887543211 0112456899
Q ss_pred eeEEEEEEcCceEEeecccCc
Q psy5025 493 GWISNLILSNDYQVKLPLSRS 513 (533)
Q Consensus 493 GCi~~l~i~~~~~l~~~~~~~ 513 (533)
|-|.+++| +++.|...+...
T Consensus 153 G~i~~v~i-w~~~Ls~~eI~~ 172 (206)
T smart00159 153 GEIGDLNM-WDSVLSPEEIKS 172 (206)
T ss_pred EEEeeeEE-ecccCCHHHHHH
Confidence 99999999 888887766543
No 23
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.38 E-value=2e-07 Score=59.23 Aligned_cols=30 Identities=33% Similarity=1.001 Sum_probs=27.6
Q ss_pred CCCCCC-CCCeeeecC-CceEEecCCCCCcCc
Q psy5025 263 CITQPC-GSGSCVPLY-DGYKCVCTQSCKESH 292 (533)
Q Consensus 263 C~~~pC-ngg~C~~~~-~~~~C~C~~g~~G~~ 292 (533)
|.++|| |+|+|++.. ++|.|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 788999 999999999 999999999999863
No 24
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.24 E-value=5.1e-05 Score=71.19 Aligned_cols=139 Identities=16% Similarity=0.141 Sum_probs=89.3
Q ss_pred ceeeEEEEEEEeCC--CCeEEEEeccCCCCCeEEEEEE-CCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025 336 GYEFNIDMRIKTSS--SSGLILWTGSSQSKDFLALGIH-NGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV 412 (533)
Q Consensus 336 ~~~~~i~~~~rt~~--~~g~ll~~~~~~~~~~l~l~l~-~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~ 412 (533)
...++|.+++|+.. ..+.||........+.+.+... +|.+.+.+. +....+. ....||+||+|.+.+..
T Consensus 30 l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i~--~~~~~~~---~~~~~g~W~hv~~t~d~--- 101 (201)
T cd00152 30 LQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYIG--GKEVTFK---VPESDGAWHHICVTWES--- 101 (201)
T ss_pred hhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEEc--CEEEEEe---ccCCCCCEEEEEEEEEC---
Confidence 45789999999874 4556665443333445555553 567766653 3333332 34599999999999922
Q ss_pred ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCce
Q psy5025 413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFK 492 (533)
Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~ 492 (533)
..+.+.|+|||....... ......+...+.|.||...+... ........|.
T Consensus 102 ---------------------------~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~~g-g~~~~~~~f~ 152 (201)
T cd00152 102 ---------------------------TSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDSYG-GGFDATQSFV 152 (201)
T ss_pred ---------------------------CCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccCCC-CCCCCCcceE
Confidence 456799999998765543 22233455666788886543210 0112346899
Q ss_pred eeEEEEEEcCceEEeecccC
Q psy5025 493 GWISNLILSNDYQVKLPLSR 512 (533)
Q Consensus 493 GCi~~l~i~~~~~l~~~~~~ 512 (533)
|-|.+++| +++.|...+..
T Consensus 153 G~I~~v~i-w~~~Ls~~eI~ 171 (201)
T cd00152 153 GEISDVNM-WDSVLSPEEIK 171 (201)
T ss_pred EEEceeEE-EcccCCHHHHH
Confidence 99999999 88878766654
No 25
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.21 E-value=1.5e-06 Score=58.18 Aligned_cols=36 Identities=47% Similarity=1.178 Sum_probs=33.1
Q ss_pred CCCCCC-CCCCCCCeeeeCCCCceecCCCCCC-CCCcc
Q psy5025 44 EDKCNK-TPCEHGGKCITTPGSSYCLCPLGFT-GDLCQ 79 (533)
Q Consensus 44 ~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~-G~~Ce 79 (533)
+++|.. +||.++|+|++..++|.|.|+.||. |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 477887 8999999999999999999999999 99885
No 26
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.21 E-value=3.3e-05 Score=68.52 Aligned_cols=142 Identities=20% Similarity=0.308 Sum_probs=84.4
Q ss_pred cCCceeEEEcCCcccccceeEEEEEEEecCCCe---eEEEccccCCCCCCeEEEEEE-CCEEEEEEEcCC-CceEEEeec
Q psy5025 89 FNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNG---ILLYNGHRADGVGDFIALYLN-DRYVDFTFDLGT-GAATLRSSN 163 (533)
Q Consensus 89 F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~dG---lLl~~~~~~~~~~~fl~l~L~-~G~l~~~~~~g~-~~~~v~s~~ 163 (533)
+.+.+|+.++..... ...++|+++||.....+ .++. .. ...+.+.+.+. ++++.+.+..+. ....+....
T Consensus 6 ~g~~~~i~~~~~~~~-~~~fTi~~w~~~~~~~~~~~~~~~-~~---~~~~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~ 80 (157)
T PF13385_consen 6 DGSNDYISIPNSDFP-SGSFTISFWVKPDSPSSSQSFVFM-DS---SGSGGFGLFINNNGRLRFYIGNGGGGNYSFSSDS 80 (157)
T ss_dssp -STT-EEEEESGGGG-GTEEEEEEEEEESS--SSEEEEEE-SS---SSSEEEEEEEETTSEEEEEETTSEEESS-EE-BS
T ss_pred CCCCCEEEECCcCCC-CCCEEEEEEEEeCCCCCCceEEEE-ec---CCCCEEEEEEECCCEEEEEEeCCCceeEEEecCc
Confidence 345789998863333 56899999999865332 3333 11 11346777776 477777766543 233555677
Q ss_pred cccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEE
Q psy5025 164 PISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHE 243 (533)
Q Consensus 164 ~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~ 243 (533)
.+.+++||+|.+...+..+.|+||+............ .......++||+.... ...|.|-|.+++|.+..
T Consensus 81 ~~~~~~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~~-~~~~~~~~~iG~~~~~---------~~~~~g~i~~~~i~~~a 150 (157)
T PF13385_consen 81 NLPDNKWHHLALTYDGSTVTLYVNGELVGSSTIPSNI-SLNSNGPLFIGGSGGG---------SSPFNGYIDDLRIYNRA 150 (157)
T ss_dssp ---TT-EEEEEEEEETTEEEEEETTEEETTCTEESSS-STTSCCEEEESS-STT-----------B-EEEEEEEEEESS-
T ss_pred ccCCCCEEEEEEEEECCeEEEEECCEEEEeEeccCCc-CCCCcceEEEeecCCC---------CCceEEEEEEEEEECcc
Confidence 8889999999999999999999999764433222111 1123457899987622 37899999999998877
Q ss_pred cc
Q psy5025 244 LN 245 (533)
Q Consensus 244 l~ 245 (533)
+.
T Consensus 151 Lt 152 (157)
T PF13385_consen 151 LT 152 (157)
T ss_dssp --
T ss_pred CC
Confidence 64
No 27
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.20 E-value=8.6e-05 Score=69.92 Aligned_cols=148 Identities=17% Similarity=0.136 Sum_probs=91.2
Q ss_pred CCceeEEEcCCcccccceeEEEEEEEecC--CCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccC
Q psy5025 90 NGSSYLRYQGLGEQYLSLLDLTIVFKAIE--PNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISL 167 (533)
Q Consensus 90 ~g~syl~~~~~~~~~~~~~~ls~~FrT~~--~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnD 167 (533)
+...|+++.+........+++.+++|+.. .++.||-...... .+.++...-.++.+.+.+. .. .+..+..+.+
T Consensus 15 s~~~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~-~ne~~~~~~~~~~~~l~i~--g~--~~~~~~~~~~ 89 (206)
T smart00159 15 SDTSYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQ-DNELLLYKEKQGEYSLYIG--GK--KVQFPVPESD 89 (206)
T ss_pred CCCCeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCCC-CCeEEEEEcCCcEEEEEEc--Ce--EEEecccccC
Confidence 45689998765443567899999999864 4555553332211 1333322223455555543 22 4445567899
Q ss_pred CCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025 168 GEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN 245 (533)
Q Consensus 168 g~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~ 245 (533)
|+||+|.+++.. +.+.|++|+.... .........+...+.+.||....... ..+.....|.|-|.++.+.++.+.
T Consensus 90 g~W~hvc~tw~~~~g~~~lyvnG~~~~-~~~~~~g~~i~~~G~lvlGq~qd~~g--g~f~~~~~f~G~i~~v~iw~~~Ls 166 (206)
T smart00159 90 GKWHHICTTWESSSGIAELWVDGKPGV-RKGLAKGYTVKPGGSIILGQEQDSYG--GGFDATQSFVGEIGDLNMWDSVLS 166 (206)
T ss_pred CceEEEEEEEECCCCcEEEEECCEEcc-cccccCCcEECCCCEEEEEecccCCC--CCCCCCcceeEEEeeeEEecccCC
Confidence 999999999985 4678999997642 11111122233445677886543221 223345789999999999999886
No 28
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=98.18 E-value=6.5e-05 Score=69.88 Aligned_cols=140 Identities=19% Similarity=0.185 Sum_probs=82.5
Q ss_pred ceeeEEEEEEEeCCC--CeEEEEeccCCCCCeEEEEEE-CCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025 336 GYEFNIDMRIKTSSS--SGLILWTGSSQSKDFLALGIH-NGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV 412 (533)
Q Consensus 336 ~~~~~i~~~~rt~~~--~g~ll~~~~~~~~~~l~l~l~-~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~ 412 (533)
..+++|.|++|+... .+.||....+...+-|.+... .+.+.+.+ ++....+ ...+.||+||++.+.++.
T Consensus 24 L~~fTvC~w~k~~~~~~~~tifSYat~~~~nell~~~~~~~~~~l~i--~~~~~~~---~~~~~~~~Whh~C~tW~s--- 95 (195)
T PF00354_consen 24 LSAFTVCFWVKTDDSSNDGTIFSYATSSQDNELLLFGSSSGSLRLYI--NGSSVSF---SGPIRDGQWHHICVTWDS--- 95 (195)
T ss_dssp BSEEEEEEEEEESGSGS-EEEEEEEETTEEEEEEEEEETTTEEEEEE--TTEEEEE---EECS-TSS-EEEEEEEET---
T ss_pred cccEEEEEEEEeccCCCceEEEEEccCCCCccEEEEEeCCceEEEEE--CCeEeEe---ccccCCCCcEEEEEEEec---
Confidence 346899999999864 788886654433343444443 46665554 3333333 246799999999999943
Q ss_pred ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCce
Q psy5025 413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFK 492 (533)
Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~ 492 (533)
....+.|++||...... .......+...+.+.||--.+... -......+|.
T Consensus 96 ---------------------------~~G~~~ly~dG~~~~~~-~~~~g~~i~~gG~~vlGQeQd~~g-G~fd~~q~F~ 146 (195)
T PF00354_consen 96 ---------------------------STGRWQLYVDGVRLSST-GLATGHSIPGGGTLVLGQEQDSYG-GGFDESQAFV 146 (195)
T ss_dssp ---------------------------TTTEEEEEETTEEEEEE-ESSTT--B-SSEEEEESS-BSBTT-BTCSGGGB--
T ss_pred ---------------------------CCcEEEEEECCEecccc-cccCCceECCCCEEEECccccccC-CCcCCccEee
Confidence 56889999999954433 233445666667777775433211 1122456899
Q ss_pred eeEEEEEEcCceEEeecccCc
Q psy5025 493 GWISNLILSNDYQVKLPLSRS 513 (533)
Q Consensus 493 GCi~~l~i~~~~~l~~~~~~~ 513 (533)
|=|.++++ +++.|...+...
T Consensus 147 G~i~~~~i-Wd~vLs~~eI~~ 166 (195)
T PF00354_consen 147 GEISDFNI-WDRVLSPEEIRA 166 (195)
T ss_dssp EEEEEEEE-ESS---HHHHHH
T ss_pred EEEeceEE-EeeeCCHHHHHH
Confidence 99999999 888887665543
No 29
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.09 E-value=0.0002 Score=67.23 Aligned_cols=146 Identities=20% Similarity=0.139 Sum_probs=91.0
Q ss_pred CCceeEEEcCCcccccceeEEEEEEEecC--CCeeEE-EccccCCCCCCeEEEEE-ECCEEEEEEEcCCCceEEEeeccc
Q psy5025 90 NGSSYLRYQGLGEQYLSLLDLTIVFKAIE--PNGILL-YNGHRADGVGDFIALYL-NDRYVDFTFDLGTGAATLRSSNPI 165 (533)
Q Consensus 90 ~g~syl~~~~~~~~~~~~~~ls~~FrT~~--~dGlLl-~~~~~~~~~~~fl~l~L-~~G~l~~~~~~g~~~~~v~s~~~l 165 (533)
+...|++++.........+++.+++|+.. ..+.+| |.... ..+.+.+.. ..|++.+.+.. . ........
T Consensus 15 s~~~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~---~~~~~~l~~~~~g~~~~~i~~--~--~~~~~~~~ 87 (201)
T cd00152 15 SDTSYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKG---QDNELLLYKEKDGGYSLYIGG--K--EVTFKVPE 87 (201)
T ss_pred CCCceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCC---CCCeEEEEEcCCCeEEEEEcC--E--EEEEeccC
Confidence 34578998765444567899999999864 444455 33332 133444444 34677776642 2 22233455
Q ss_pred cCCCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEE
Q psy5025 166 SLGEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHE 243 (533)
Q Consensus 166 nDg~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~ 243 (533)
.+|+||+|.+++.. +.+.|+||+....... ......+.....+.||....... ..+.....|.|-|.++++.+..
T Consensus 88 ~~g~W~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~~g--g~~~~~~~f~G~I~~v~iw~~~ 164 (201)
T cd00152 88 SDGAWHHICVTWESTSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDSYG--GGFDATQSFVGEISDVNMWDSV 164 (201)
T ss_pred CCCCEEEEEEEEECCCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccCCC--CCCCCCcceEEEEceeEEEccc
Confidence 99999999999984 4679999997643332 11122233445678886543211 1122347899999999999988
Q ss_pred cc
Q psy5025 244 LN 245 (533)
Q Consensus 244 l~ 245 (533)
++
T Consensus 165 Ls 166 (201)
T cd00152 165 LS 166 (201)
T ss_pred CC
Confidence 86
No 30
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.02 E-value=6e-06 Score=54.64 Aligned_cols=36 Identities=47% Similarity=1.185 Sum_probs=32.8
Q ss_pred CCCCCC-CCCCCCCeeeeCCCCceecCCCCCCCCCcc
Q psy5025 44 EDKCNK-TPCEHGGKCITTPGSSYCLCPLGFTGDLCQ 79 (533)
Q Consensus 44 ~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce 79 (533)
.+.|.. +||.+++.|++..++|.|.|+.+|.|+.|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 467887 799999999999999999999999999885
No 31
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=97.95 E-value=0.00031 Score=65.36 Aligned_cols=148 Identities=18% Similarity=0.198 Sum_probs=86.0
Q ss_pred CCceeEEEcCCcccccceeEEEEEEEecCC--CeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccC
Q psy5025 90 NGSSYLRYQGLGEQYLSLLDLTIVFKAIEP--NGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISL 167 (533)
Q Consensus 90 ~g~syl~~~~~~~~~~~~~~ls~~FrT~~~--dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnD 167 (533)
+...|+++.+........+++.+++|+... .+.||--..... .++++.+.-..+.+.+.+.. . .+.....+.+
T Consensus 9 s~~~yv~l~~~~~~pL~~fTvC~w~k~~~~~~~~tifSYat~~~-~nell~~~~~~~~~~l~i~~--~--~~~~~~~~~~ 83 (195)
T PF00354_consen 9 STTDYVRLKPSVPLPLSAFTVCFWVKTDDSSNDGTIFSYATSSQ-DNELLLFGSSSGSLRLYING--S--SVSFSGPIRD 83 (195)
T ss_dssp BSSBEEEEEESS-S-BSEEEEEEEEEESGSGS-EEEEEEEETTE-EEEEEEEEETTTEEEEEETT--E--EEEEEECS-T
T ss_pred CCcceEEEecCCCCCcccEEEEEEEEeccCCCceEEEEEccCCC-CccEEEEEeCCceEEEEECC--e--EeEeccccCC
Confidence 346788887654333678999999999764 666666543321 13333332234666655532 2 3344556789
Q ss_pred CCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025 168 GEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN 245 (533)
Q Consensus 168 g~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~ 245 (533)
++||++.+++.. +...|++||....... ......+...+.+.||.-.... ...+...+.|.|=|.++.+.++.++
T Consensus 84 ~~Whh~C~tW~s~~G~~~ly~dG~~~~~~~-~~~g~~i~~gG~~vlGQeQd~~--gG~fd~~q~F~G~i~~~~iWd~vLs 160 (195)
T PF00354_consen 84 GQWHHICVTWDSSTGRWQLYVDGVRLSSTG-LATGHSIPGGGTLVLGQEQDSY--GGGFDESQAFVGEISDFNIWDRVLS 160 (195)
T ss_dssp SS-EEEEEEEETTTTEEEEEETTEEEEEEE-SSTT--B-SSEEEEESS-BSBT--TBTCSGGGB--EEEEEEEEESS---
T ss_pred CCcEEEEEEEecCCcEEEEEECCEeccccc-ccCCceECCCCEEEECcccccc--CCCcCCccEeeEEEeceEEEeeeCC
Confidence 999999999987 5789999997432222 2223334445567788655432 2344566899999999999998886
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.74 E-value=3.5e-05 Score=50.09 Aligned_cols=33 Identities=55% Similarity=1.438 Sum_probs=29.9
Q ss_pred CC-CCCCCCCCeeeeCCCCceecCCCCCCCC-Ccc
Q psy5025 47 CN-KTPCEHGGKCITTPGSSYCLCPLGFTGD-LCQ 79 (533)
Q Consensus 47 C~-~~pC~NggtC~~~~~~y~C~C~~g~~G~-~Ce 79 (533)
|. .++|.+++.|++..++|.|.|+.||.|. .|+
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence 55 7899999999999999999999999998 774
No 33
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.73 E-value=0.0007 Score=61.81 Aligned_cols=136 Identities=18% Similarity=0.228 Sum_probs=85.4
Q ss_pred eeeEEEEEEEeCCCCeE--EEEeccCC-CCCeEEEEEECCEEEEEEEcCCc--eEEEe----eCceeecCCCeEEEEEEE
Q psy5025 337 YEFNIDMRIKTSSSSGL--ILWTGSSQ-SKDFLALGIHNGYAHVMYNLGNG--EVLLE----NNQTKINNGVWHRLTVHR 407 (533)
Q Consensus 337 ~~~~i~~~~rt~~~~g~--ll~~~~~~-~~~~l~l~l~~g~l~~~~~~g~~--~~~~~----~~~~~~~dg~wH~v~~~~ 407 (533)
...+|-++|++...+++ ||.+.+.. ...||.|++.++.+-+.++...+ .+... ..+...++-.||.|++..
T Consensus 33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~a 112 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFVA 112 (190)
T ss_dssp SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEEE
T ss_pred cccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEEE
Confidence 45789999999776664 56554332 36999999999999999987655 22221 122345777899999998
Q ss_pred eeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecC-Ccc-eeeccCCceEEcCCCCccccccc
Q psy5025 408 NFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISG-GKF-EQLNTDSGLYVGGVEDIEELTRS 485 (533)
Q Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~-~~~-~~l~~~~~l~lGg~~~~~~~~~~ 485 (533)
.. ......|++||......... ..+ ..++.-..++|||.....
T Consensus 113 d~------------------------------~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g----- 157 (190)
T PF02973_consen 113 DS------------------------------KNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAG----- 157 (190)
T ss_dssp ET------------------------------TTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETT-----
T ss_pred ec------------------------------CCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCC-----
Confidence 11 47899999999766654332 222 223223359999975432
Q ss_pred ccccCceeeEEEEEEcCceEEee
Q psy5025 486 KYHHSFKGWISNLILSNDYQVKL 508 (533)
Q Consensus 486 ~~~~~f~GCi~~l~i~~~~~l~~ 508 (533)
....+|.|-|++|+| -+..|+-
T Consensus 158 ~~~y~f~G~I~~l~i-Yn~aLsd 179 (190)
T PF02973_consen 158 SNAYPFNGTIDNLKI-YNRALSD 179 (190)
T ss_dssp EEES--EEEEEEEEE-ESS---H
T ss_pred CceecccceEEEEEE-EcCcCCH
Confidence 245689999999999 5555543
No 34
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.63 E-value=6.2e-05 Score=48.89 Aligned_cols=32 Identities=53% Similarity=1.321 Sum_probs=29.1
Q ss_pred CCC-CCCCCCCeeeeCCCCceecCCCCCCC-CCcc
Q psy5025 47 CNK-TPCEHGGKCITTPGSSYCLCPLGFTG-DLCQ 79 (533)
Q Consensus 47 C~~-~pC~NggtC~~~~~~y~C~C~~g~~G-~~Ce 79 (533)
|.. ++|.++ .|++..++|.|.|+.||.| +.|+
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence 666 799999 9999999999999999999 8875
No 35
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.49 E-value=6.1e-05 Score=51.21 Aligned_cols=31 Identities=39% Similarity=0.997 Sum_probs=27.7
Q ss_pred CCCCCC--CCCCCCCeeeeCCCCceecCCCCCC
Q psy5025 44 EDKCNK--TPCEHGGKCITTPGSSYCLCPLGFT 74 (533)
Q Consensus 44 ~d~C~~--~pC~NggtC~~~~~~y~C~C~~g~~ 74 (533)
+|+|.. ++|.+++.|+++.++|.|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 577874 5799999999999999999999996
No 36
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.47 E-value=0.0033 Score=57.47 Aligned_cols=133 Identities=17% Similarity=0.201 Sum_probs=86.0
Q ss_pred ceeEEEEEEEecCCCee--EEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE--ee-----ccccCCCcEEEEEE
Q psy5025 106 SLLDLTIVFKAIEPNGI--LLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR--SS-----NPISLGEWHKLRLT 176 (533)
Q Consensus 106 ~~~~ls~~FrT~~~dGl--Ll~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~--s~-----~~lnDg~Wh~V~v~ 176 (533)
...+|.++|++...+++ ||-..+... ...|+.|++.++++-+.+....+..... .+ ...++-.||.|.+.
T Consensus 33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~~-~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~ 111 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSGIQSLFSISNSTK-GNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFV 111 (190)
T ss_dssp SSEEEEEEEEESS-SSEEEEEEEE-TST-TSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEE
T ss_pred cccEEEEEEecCCCcceeEEEEecCCCC-ccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEE
Confidence 45789999999776666 666665543 3699999999999999998765533322 22 22456689999999
Q ss_pred EE--CCEEEEEEcCccCceeeCCCCce--eeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025 177 RT--GRHAYLQVDRFPSSQILSPGPFT--QLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN 245 (533)
Q Consensus 177 ~~--~~~~~L~VD~~~~~~~~~~~~~~--~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~ 245 (533)
.. ....+|++||............. .+.--..++||+..... ...-+|.|-|+.+.+-+++++
T Consensus 112 ad~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g------~~~y~f~G~I~~l~iYn~aLs 178 (190)
T PF02973_consen 112 ADSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAG------SNAYPFNGTIDNLKIYNRALS 178 (190)
T ss_dssp EETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETT------EEES--EEEEEEEEEESS---
T ss_pred EecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCC------CceecccceEEEEEEEcCcCC
Confidence 97 78999999996544332222111 11112469999986532 234789999999999888775
No 37
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.42 E-value=0.00018 Score=47.77 Aligned_cols=34 Identities=26% Similarity=0.813 Sum_probs=29.4
Q ss_pred CCCCC-CCC-CCCeeeecCCceEEecCCCCC-cCccc
Q psy5025 261 HPCIT-QPC-GSGSCVPLYDGYKCVCTQSCK-ESHLT 294 (533)
Q Consensus 261 ~~C~~-~pC-ngg~C~~~~~~~~C~C~~g~~-G~~c~ 294 (533)
+.|.. +|| ++|+|++...+|.|.|+++|. |..|+
T Consensus 3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred ccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 45776 799 999999999999999999998 87763
No 38
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.26 E-value=0.00036 Score=45.88 Aligned_cols=34 Identities=26% Similarity=0.822 Sum_probs=29.7
Q ss_pred CCCCC-CCC-CCCeeeecCCceEEecCCCCCcCccc
Q psy5025 261 HPCIT-QPC-GSGSCVPLYDGYKCVCTQSCKESHLT 294 (533)
Q Consensus 261 ~~C~~-~pC-ngg~C~~~~~~~~C~C~~g~~G~~c~ 294 (533)
++|.. .|| +++.|.+....|.|.|+++|.|..|+
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 46776 799 99999999999999999999997763
No 39
>KOG3509|consensus
Probab=97.24 E-value=0.00097 Score=74.37 Aligned_cols=209 Identities=22% Similarity=0.272 Sum_probs=125.8
Q ss_pred CCCCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccccccccccccCCceeEEEcCCcccccceeEEEEEEEecCCC
Q psy5025 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDLQVPQFNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPN 120 (533)
Q Consensus 41 ~C~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~d 120 (533)
-|..+.|.+.||+..+.|.+......|.|+.+|+|+.|+...+.+...+.+.......+...
T Consensus 403 ~c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~------------------ 464 (964)
T KOG3509|consen 403 TCLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQG------------------ 464 (964)
T ss_pred ccCCCccccccCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCC------------------
Confidence 44467899999999999999999999999999999999998766555555443333322111
Q ss_pred eeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCc
Q psy5025 121 GILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPF 200 (533)
Q Consensus 121 GlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~ 200 (533)
..+.|-+.. . +++.+..| ...+.....+.......+||+..+.+......+.++-............
T Consensus 465 ~~~~~c~pg-~--g~~~~~~l----------g~~~~~a~~s~~s~~~~~~~~f~~~~t~~~~~~~c~~~~~~~s~s~~~~ 531 (964)
T KOG3509|consen 465 KRCEYCGPG-A--GAPTAGAL----------GQCGCKASASGPSTSRCMPHHFGISPTNHQGCLPCFCSELTLSISSSTY 531 (964)
T ss_pred CcceeecCC-C--CCccchhc----------cccCccceeccccccccccccCCcccccccccccccchhcccccccccc
Confidence 233333222 1 33333311 1111122223344556789999998888777777666443333333333
Q ss_pred eeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcccccccccCCCccCCCCCCCCCCC-CCCeeeecCCc
Q psy5025 201 TQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQGVNVVNCAHPCITQPC-GSGSCVPLYDG 279 (533)
Q Consensus 201 ~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~~~~~~C~~~C~~~pC-ngg~C~~~~~~ 279 (533)
........+.+|+......+.+.......+.|+.+.+-. ...+....+.........+|..+...++ -.|.++..|..
T Consensus 532 ~~~s~~~~~~~~~~~~i~~~~~~l~~r~~~l~~~~~i~~-~~s~~v~a~~~~~~~~~~~P~~~vad~vs~~g~~~~~~~s 610 (964)
T KOG3509|consen 532 RCSSVSIDYARGQFDLIELLRPDLRQRYASLGRDQLINQ-AISFSVSAEAGGLLLRWSLPAKPVADKVSPLGGALSQTHS 610 (964)
T ss_pred cccchheecccchhhhhhhccccccccccccceeeeccc-cceeeeeeeccceEEEecCCCCCCcceeccccceeeeeec
Confidence 333344456677776665555666666677777654433 1111112222222334567888888899 88888888877
Q ss_pred eE
Q psy5025 280 YK 281 (533)
Q Consensus 280 ~~ 281 (533)
|.
T Consensus 611 ~~ 612 (964)
T KOG3509|consen 611 YK 612 (964)
T ss_pred cC
Confidence 65
No 40
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=97.14 E-value=0.012 Score=51.07 Aligned_cols=68 Identities=16% Similarity=0.202 Sum_probs=45.6
Q ss_pred CCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccC--cEEEEEECCceeeeEecCCcceeeccCCceEEcC
Q psy5025 398 GVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTG--KQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGG 475 (533)
Q Consensus 398 g~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg 475 (533)
|+||+|.+.. ++ ..++|+|||......... .......+.||.
T Consensus 61 ~~W~hva~v~--------------------------------d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~ 104 (133)
T smart00560 61 GVWVHLAGVY--------------------------------DGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGG 104 (133)
T ss_pred CCEEEEEEEE--------------------------------ECCCCeEEEEECCEEccccccC----CcccCCceEEee
Confidence 9999999999 55 789999999876543221 112345688884
Q ss_pred CCCcccccccccccCceeeEEEEEEcCceEE
Q psy5025 476 VEDIEELTRSKYHHSFKGWISNLILSNDYQV 506 (533)
Q Consensus 476 ~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l 506 (533)
.... .......|.|.|.+++| .+..|
T Consensus 105 ~~~~----~~~~~~~f~G~Idevri-y~~aL 130 (133)
T smart00560 105 RILL----GGAGGENFSGRLDEVRV-YNRAL 130 (133)
T ss_pred eccC----CCCCCCCceEEeeEEEE-ecccc
Confidence 1111 11245689999999999 55544
No 41
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.10 E-value=0.00058 Score=43.13 Aligned_cols=27 Identities=33% Similarity=0.962 Sum_probs=24.3
Q ss_pred CCCCCCCeeeeCCCCceecCCCCCCCCCc
Q psy5025 50 TPCEHGGKCITTPGSSYCLCPLGFTGDLC 78 (533)
Q Consensus 50 ~pC~NggtC~~~~~~y~C~C~~g~~G~~C 78 (533)
..|.+.|+|+.. ...|.|..+|+|+.|
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 479999999976 679999999999987
No 42
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.07 E-value=0.00077 Score=43.54 Aligned_cols=31 Identities=32% Similarity=0.922 Sum_probs=27.6
Q ss_pred CC-CCCC-CCCeeeecCCceEEecCCCCCcC-cc
Q psy5025 263 CI-TQPC-GSGSCVPLYDGYKCVCTQSCKES-HL 293 (533)
Q Consensus 263 C~-~~pC-ngg~C~~~~~~~~C~C~~g~~G~-~c 293 (533)
|. ..+| +++.|++....|.|.|+.||.|. .|
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 55 7899 99999999999999999999987 54
No 43
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.79 E-value=0.0017 Score=41.90 Aligned_cols=31 Identities=23% Similarity=0.754 Sum_probs=26.7
Q ss_pred CCCC-CCC-CCCeeeecCCceEEecCCCCCc-Ccc
Q psy5025 262 PCIT-QPC-GSGSCVPLYDGYKCVCTQSCKE-SHL 293 (533)
Q Consensus 262 ~C~~-~pC-ngg~C~~~~~~~~C~C~~g~~G-~~c 293 (533)
+|.. .+| ++ .|++...+|.|.|++||.| ..|
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 3666 699 77 9999999999999999998 555
No 44
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.47 E-value=0.00083 Score=33.11 Aligned_cols=13 Identities=54% Similarity=1.329 Sum_probs=11.0
Q ss_pred eecCCCCCCCCCc
Q psy5025 66 YCLCPLGFTGDLC 78 (533)
Q Consensus 66 ~C~C~~g~~G~~C 78 (533)
.|.|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999988
No 45
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=96.30 E-value=0.12 Score=44.75 Aligned_cols=69 Identities=22% Similarity=0.179 Sum_probs=46.4
Q ss_pred CCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025 168 GEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN 245 (533)
Q Consensus 168 g~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~ 245 (533)
++||+|.+..+. ..++|+||+......... .......+.||..... .......|.|.|.+++|.+..+.
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~~~~~-----~~~~~~~f~G~Idevriy~~aLs 131 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGGRILL-----GGAGGENFSGRLDEVRVYNRALT 131 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEccccccC----CcccCCceEEeeeccC-----CCCCCCCceEEeeEEEEeccccC
Confidence 899999999998 789999999654322211 1122356788841111 11123689999999999887764
No 46
>KOG1834|consensus
Probab=96.10 E-value=0.084 Score=55.82 Aligned_cols=135 Identities=13% Similarity=0.197 Sum_probs=90.0
Q ss_pred cceeeEEEEEEEeCC-------CCeEEEEeccC--CCCCeEEEEEECCEEEEEEEcCCce------EEEeeCceeecCCC
Q psy5025 335 TGYEFNIDMRIKTSS-------SSGLILWTGSS--QSKDFLALGIHNGYAHVMYNLGNGE------VLLENNQTKINNGV 399 (533)
Q Consensus 335 ~~~~~~i~~~~rt~~-------~~g~ll~~~~~--~~~~~l~l~l~~g~l~~~~~~g~~~------~~~~~~~~~~~dg~ 399 (533)
....|+|+|+.|-.. ..-.||-.... .+...++|++++=+|.|.++...+. ....-.-..+.|.+
T Consensus 364 l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~E 443 (952)
T KOG1834|consen 364 LPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDNE 443 (952)
T ss_pred CCCceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhhh
Confidence 356789999998643 22366654322 2367889999999999998764321 11111345699999
Q ss_pred eEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeE--ecCCcceeeccCCceEEcCCC
Q psy5025 400 WHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQS--ISGGKFEQLNTDSGLYVGGVE 477 (533)
Q Consensus 400 wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~--~~~~~~~~l~~~~~l~lGg~~ 477 (533)
||+-.+.. ....++|++||...... ......+.-.....|.||.-.
T Consensus 444 WH~Y~ln~--------------------------------efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW 491 (952)
T KOG1834|consen 444 WHHYVLNV--------------------------------EFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACW 491 (952)
T ss_pred hheeEEee--------------------------------cCceEEEEEcCcccCCceeccCCccCcccccceeEEeeec
Confidence 99999999 77889999999864321 122222333345568898765
Q ss_pred CcccccccccccCceeeEEEEEEc
Q psy5025 478 DIEELTRSKYHHSFKGWISNLILS 501 (533)
Q Consensus 478 ~~~~~~~~~~~~~f~GCi~~l~i~ 501 (533)
.............|.|-+..|.+-
T Consensus 492 ~g~~~~~l~~aqfFrG~Lasltlr 515 (952)
T KOG1834|consen 492 QGRQQKPLKLAQFFRGQLASLTLR 515 (952)
T ss_pred cCccccchhHHHHhhcccceeEEe
Confidence 544333345677899999999883
No 47
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.87 E-value=0.0089 Score=40.48 Aligned_cols=29 Identities=28% Similarity=0.826 Sum_probs=24.9
Q ss_pred CCCCC--CCC-CCCeeeecCCceEEecCCCCC
Q psy5025 261 HPCIT--QPC-GSGSCVPLYDGYKCVCTQSCK 289 (533)
Q Consensus 261 ~~C~~--~pC-ngg~C~~~~~~~~C~C~~g~~ 289 (533)
++|.. ++| .++.|++...+|.|.|++||.
T Consensus 3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp STTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 35664 569 899999999999999999996
No 48
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.69 E-value=0.013 Score=37.04 Aligned_cols=26 Identities=31% Similarity=0.755 Sum_probs=22.7
Q ss_pred CCC-CCCeeeecCCceEEecCCCCCcCcc
Q psy5025 266 QPC-GSGSCVPLYDGYKCVCTQSCKESHL 293 (533)
Q Consensus 266 ~pC-ngg~C~~~~~~~~C~C~~g~~G~~c 293 (533)
..| ++|+|+.. ...|.|+++|.|+.|
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 479 99999976 789999999999865
No 49
>KOG1836|consensus
Probab=95.24 E-value=0.0062 Score=72.58 Aligned_cols=135 Identities=17% Similarity=0.149 Sum_probs=89.1
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEe-eCceeecCCCeEEEEEEEeeeeeecCC
Q psy5025 338 EFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLE-NNQTKINNGVWHRLTVHRNFMVVMHGS 416 (533)
Q Consensus 338 ~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~-~~~~~~~dg~wH~v~~~~~~~~~~~~~ 416 (533)
.+.+.+..+..+..|.+-...+ ....+..+....+.....+..+-...... .....+-++.||.+.+.+
T Consensus 1557 ~~~~~~~~~~~~~~~~l~~~~s-~~~~~~~~~~~~~~~~~~~~~gi~~~~~s~~~~~~~~~~~~~~~~~~~--------- 1626 (1705)
T KOG1836|consen 1557 AFALVFSERNVSSTGGLTHHLS-KLGTELLVQENPIGVTEKFESGITDLSTSSTPIVSLLPGGCHSVTSST--------- 1626 (1705)
T ss_pred hHHhhhcccccccCCCcccccc-ccchHHhhhhcccccchhhhhhhhhhhhcchhhhhhcCCcceeeeeec---------
Confidence 3455566666665555544332 22567777777777777665543211111 144678999999999999
Q ss_pred cccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEE
Q psy5025 417 SHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWIS 496 (533)
Q Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~ 496 (533)
....+.+..|. .+. .+..........++++||.|.....+......+|.|||
T Consensus 1627 -----------------------~~~v~~~~~~~-~~~---~~~~~~~~~~~~p~~~~~~~~s~~~~~~~~~~~~~~~~- 1678 (1705)
T KOG1836|consen 1627 -----------------------DPGVVQLEDDT-YTV---GEIPPPPADTQEPIKLGGYPSSLTTLRIAVLKSFTGCI- 1678 (1705)
T ss_pred -----------------------CCccccccccc-eec---ccCCCCchhccCCcccCCccccccceeeecccccccce-
Confidence 77778888887 222 22222344567789999999887666666778999999
Q ss_pred EEEEcCceEEeecccC
Q psy5025 497 NLILSNDYQVKLPLSR 512 (533)
Q Consensus 497 ~l~i~~~~~l~~~~~~ 512 (533)
+.+ .+.++++..+.
T Consensus 1679 -~~~-~~~~~~~~~a~ 1692 (1705)
T KOG1836|consen 1679 -FVV-MGIRVDVTLAG 1692 (1705)
T ss_pred -EEe-cCCCCcHHHHH
Confidence 777 77888776654
No 50
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.99 E-value=0.011 Score=38.51 Aligned_cols=27 Identities=37% Similarity=0.840 Sum_probs=21.3
Q ss_pred CCCCCCCeeeeCCCCceecCCCCCCCC
Q psy5025 50 TPCEHGGKCITTPGSSYCLCPLGFTGD 76 (533)
Q Consensus 50 ~pC~NggtC~~~~~~y~C~C~~g~~G~ 76 (533)
..|...++|++..++|.|.|++||.|+
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 467778999999999999999999775
No 51
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.45 E-value=0.029 Score=36.53 Aligned_cols=26 Identities=19% Similarity=0.611 Sum_probs=21.4
Q ss_pred CCC-CCCeeeecCCceEEecCCCCCcC
Q psy5025 266 QPC-GSGSCVPLYDGYKCVCTQSCKES 291 (533)
Q Consensus 266 ~pC-ngg~C~~~~~~~~C~C~~g~~G~ 291 (533)
..| ..++|++...+|+|.|.+||.|+
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 358 88999999999999999999885
No 52
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=94.43 E-value=0.027 Score=46.91 Aligned_cols=33 Identities=39% Similarity=0.885 Sum_probs=28.8
Q ss_pred CCCCCCCCeeeeCC--CCceecCCCCCCCCCccccc
Q psy5025 49 KTPCEHGGKCITTP--GSSYCLCPLGFTGDLCQTRL 82 (533)
Q Consensus 49 ~~pC~NggtC~~~~--~~y~C~C~~g~~G~~Ce~~~ 82 (533)
.+-|.|| +|.-.. +.+.|.|+.||+|..||...
T Consensus 50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred CCEeECC-EEEeeccCCCceeECCCCccccccccee
Confidence 4689997 898765 99999999999999999863
No 53
>PHA02887 EGF-like protein; Provisional
Probab=94.01 E-value=0.048 Score=44.72 Aligned_cols=36 Identities=31% Similarity=0.904 Sum_probs=29.2
Q ss_pred CCCC---CCCCCCCCeeeeCC--CCceecCCCCCCCCCcccc
Q psy5025 45 DKCN---KTPCEHGGKCITTP--GSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 45 d~C~---~~pC~NggtC~~~~--~~y~C~C~~g~~G~~Ce~~ 81 (533)
..|. .+-|.| |+|.-.. ..+.|.|+.||+|..||..
T Consensus 84 ~pC~~eyk~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 84 EKCKNDFNDFCIN-GECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred cccChHhhCEeeC-CEEEccccCCCceeECCCCcccCCCCcc
Confidence 4565 367886 6998765 8899999999999999974
No 54
>KOG1834|consensus
Probab=93.26 E-value=1.5 Score=46.91 Aligned_cols=156 Identities=17% Similarity=0.234 Sum_probs=94.9
Q ss_pred ccCCceeEEEcCCcc--cccceeEEEEEEEecC-------CCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCc-e
Q psy5025 88 QFNGSSYLRYQGLGE--QYLSLLDLTIVFKAIE-------PNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-A 157 (533)
Q Consensus 88 ~F~g~syl~~~~~~~--~~~~~~~ls~~FrT~~-------~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~-~ 157 (533)
.|+|..-+.++.-.. .....|+|+|++|-.. ..-.|+-..+.......-++|++.+.||.|.+....+. .
T Consensus 346 eFdG~qgv~vpdg~~~g~l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~ 425 (952)
T KOG1834|consen 346 EFDGTQGVTVPDGNVSGSLPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATS 425 (952)
T ss_pred EEcCceeeEccCCCCCCCCCCceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccc
Confidence 466666666553221 2335789999998321 23356665554444567899999999999999864332 1
Q ss_pred EEE-----ee-ccccCCCcEEEEEEEECCEEEEEEcCccCce--eeCCCCceeeecCCCEEEcCcCCCCccCCCcccccC
Q psy5025 158 TLR-----SS-NPISLGEWHKLRLTRTGRHAYLQVDRFPSSQ--ILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSS 229 (533)
Q Consensus 158 ~v~-----s~-~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~--~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~ 229 (533)
..+ -. ..+.|.+||+-.+....-.+.|++||..-.. .....+.........+.||.-=.... ...+...+.
T Consensus 426 ~fRpaef~Wkl~qVCD~EWH~Y~ln~efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW~g~~-~~~l~~aqf 504 (952)
T KOG1834|consen 426 DFRPAEFHWKLPQVCDNEWHHYVLNVEFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACWQGRQ-QKPLKLAQF 504 (952)
T ss_pred cccchheeccchhhhhhhhheeEEeecCceEEEEEcCcccCCceeccCCccCcccccceeEEeeeccCcc-ccchhHHHH
Confidence 222 12 4578999999999999999999999954211 11111222222345677774322211 011234578
Q ss_pred ceeeeeeEEECcEEc
Q psy5025 230 FIGCIQKVLINNHEL 244 (533)
Q Consensus 230 F~GCI~~l~in~~~l 244 (533)
|.|-+..+.+....+
T Consensus 505 FrG~LasltlrsGkl 519 (952)
T KOG1834|consen 505 FRGQLASLTLRSGKL 519 (952)
T ss_pred hhcccceeEEecccc
Confidence 999998888865544
No 55
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=93.11 E-value=0.095 Score=43.78 Aligned_cols=35 Identities=20% Similarity=0.522 Sum_probs=27.7
Q ss_pred CCC-CCCeee--ecCCceEEecCCCCCcCcccceeeeee
Q psy5025 266 QPC-GSGSCV--PLYDGYKCVCTQSCKESHLTEVMCASF 301 (533)
Q Consensus 266 ~pC-ngg~C~--~~~~~~~C~C~~g~~G~~c~~~~~~~f 301 (533)
+-| || +|. ...+.+.|.|+.||.|.+|+...-..+
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~dLl~~ 88 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVVLVDY 88 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccccccceeeeee
Confidence 569 86 896 445789999999999999997654333
No 56
>PHA02887 EGF-like protein; Provisional
Probab=92.95 E-value=0.092 Score=43.12 Aligned_cols=35 Identities=26% Similarity=0.550 Sum_probs=26.6
Q ss_pred CCCC---CCCCCCeee--ecCCceEEecCCCCCcCcccce
Q psy5025 262 PCIT---QPCGSGSCV--PLYDGYKCVCTQSCKESHLTEV 296 (533)
Q Consensus 262 ~C~~---~pCngg~C~--~~~~~~~C~C~~g~~G~~c~~~ 296 (533)
+|.+ +-|-+|+|. ...+.+.|.|+.||.|.+|+..
T Consensus 85 pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 85 KCKNDFNDFCINGECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred ccChHhhCEeeCCEEEccccCCCceeECCCCcccCCCCcc
Confidence 4553 568236996 4557899999999999999864
No 57
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=92.30 E-value=0.091 Score=30.68 Aligned_cols=17 Identities=41% Similarity=1.052 Sum_probs=13.6
Q ss_pred CceecCCCCCC----CCCccc
Q psy5025 64 SSYCLCPLGFT----GDLCQT 80 (533)
Q Consensus 64 ~y~C~C~~g~~----G~~Ce~ 80 (533)
+|+|.|++||. |+.|+.
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCCCCCcccc
Confidence 69999999994 667764
No 58
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=91.20 E-value=0.29 Score=44.83 Aligned_cols=91 Identities=13% Similarity=0.236 Sum_probs=51.2
Q ss_pred eeEEEEEEE--eCCCCeEEEEecc----CCCCCeEEEEEECCEEEEEEEcCCceEE-------EeeCceeecCCCeEEEE
Q psy5025 338 EFNIDMRIK--TSSSSGLILWTGS----SQSKDFLALGIHNGYAHVMYNLGNGEVL-------LENNQTKINNGVWHRLT 404 (533)
Q Consensus 338 ~~~i~~~~r--t~~~~g~ll~~~~----~~~~~~l~l~l~~g~l~~~~~~g~~~~~-------~~~~~~~~~dg~wH~v~ 404 (533)
.+.+++.|| .....|++|.... ........+.|.++.-........+... .......+..|+||+++
T Consensus 54 df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~W~~~~ 133 (185)
T PF06439_consen 54 DFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVNVAIPPGEWNTVR 133 (185)
T ss_dssp SEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS--S--TTSEEEEE
T ss_pred cEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccccccccCCCCceEEEE
Confidence 455666666 3345577777651 1124556676664332211111112211 22244669999999999
Q ss_pred EEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCC
Q psy5025 405 VHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGG 460 (533)
Q Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~ 460 (533)
|+. .+.++++.||+..+.....+.
T Consensus 134 I~~--------------------------------~g~~i~v~vnG~~v~~~~d~~ 157 (185)
T PF06439_consen 134 IVV--------------------------------KGNRITVWVNGKPVADFTDPS 157 (185)
T ss_dssp EEE--------------------------------ETTEEEEEETTEEEEEEETTS
T ss_pred EEE--------------------------------ECCEEEEEECCEEEEEEEcCC
Confidence 999 999999999999888765543
No 59
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.92 E-value=0.093 Score=33.98 Aligned_cols=32 Identities=38% Similarity=0.910 Sum_probs=23.2
Q ss_pred CCCCCCCCCCeeeeCC-CCceecCCCCC--CCCCc
Q psy5025 47 CNKTPCEHGGKCITTP-GSSYCLCPLGF--TGDLC 78 (533)
Q Consensus 47 C~~~pC~NggtC~~~~-~~y~C~C~~g~--~G~~C 78 (533)
|...+|+.++.|+... +.+.|.|-+|| .|..|
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 7778999999999998 99999999999 45555
No 60
>KOG1214|consensus
Probab=90.23 E-value=0.37 Score=52.61 Aligned_cols=42 Identities=36% Similarity=0.865 Sum_probs=36.1
Q ss_pred ceee---cCCC-CCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCC
Q psy5025 35 IKGR---SKEC-LEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGD 76 (533)
Q Consensus 35 ~~G~---~~~C-~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~ 76 (533)
++|| +--| ..|+|.++-|...++|.++.+++.|.|.+||.|+
T Consensus 814 LPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 814 LPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred cCCccCCccccccccccCccccCCCceEecCCCcceeecccCccCC
Confidence 6788 3334 4799999999999999999999999999999764
No 61
>smart00051 DSL delta serrate ligand.
Probab=90.12 E-value=0.49 Score=35.12 Aligned_cols=36 Identities=19% Similarity=0.365 Sum_probs=26.9
Q ss_pred cCCCCCCCCCC-CCCCCCCeeeeCCCCceecCCCCCCCCCc
Q psy5025 39 SKECLEDKCNK-TPCEHGGKCITTPGSSYCLCPLGFTGDLC 78 (533)
Q Consensus 39 ~~~C~~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~G~~C 78 (533)
+..|. ..|.+ +.+.++.+|.. .-.|.|++||+|++|
T Consensus 27 G~~C~-~~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 27 GEGCN-KFCRPRDDFFGHYTCDE---NGNKGCLEGWMGPYC 63 (63)
T ss_pred CCccC-CEeCcCccccCCccCCc---CCCEecCCCCcCCCC
Confidence 77786 56653 45778888843 356899999999988
No 62
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.06 E-value=0.21 Score=32.41 Aligned_cols=18 Identities=50% Similarity=1.196 Sum_probs=15.8
Q ss_pred CeeeeCCCCceecCCCCC
Q psy5025 56 GKCITTPGSSYCLCPLGF 73 (533)
Q Consensus 56 gtC~~~~~~y~C~C~~g~ 73 (533)
-.|++..++|+|.|+.||
T Consensus 10 h~C~~~~g~~~C~C~~Gy 27 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGY 27 (36)
T ss_dssp SEEEEETTSEEEE-STTE
T ss_pred CCCccCCCceEeECCCCC
Confidence 489999999999999999
No 63
>KOG3546|consensus
Probab=88.76 E-value=1 Score=47.76 Aligned_cols=127 Identities=17% Similarity=0.302 Sum_probs=77.2
Q ss_pred ceeeEEEEEEEeCCCC-eEEEEeccCC-CCCeEEEEE---ECCEEEEE--EEc-CCceE-EEeeCceeecCCCeEEEEEE
Q psy5025 336 GYEFNIDMRIKTSSSS-GLILWTGSSQ-SKDFLALGI---HNGYAHVM--YNL-GNGEV-LLENNQTKINNGVWHRLTVH 406 (533)
Q Consensus 336 ~~~~~i~~~~rt~~~~-g~ll~~~~~~-~~~~l~l~l---~~g~l~~~--~~~-g~~~~-~~~~~~~~~~dg~wH~v~~~ 406 (533)
-..|.|.|.+|+.... |+||...+.. ..-||-|.| +||+-.+. |.. +.+.. ..-.....+..++|.++.+.
T Consensus 86 frdf~~~~~i~p~s~~~gvlfaitd~~q~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a~~ 165 (1167)
T KOG3546|consen 86 FRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALS 165 (1167)
T ss_pred hccceEEEEeeccCCCCceEEEechhhhhhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhheeee
Confidence 3467899999988644 5666654332 234555554 36754433 322 22211 11112356788999999999
Q ss_pred EeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCccee--eccCCceEEcCCCCcccccc
Q psy5025 407 RNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQ--LNTDSGLYVGGVEDIEELTR 484 (533)
Q Consensus 407 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~--l~~~~~l~lGg~~~~~~~~~ 484 (533)
. .+..+.|+||-+....+......+. +.....||+|-.-..
T Consensus 166 v--------------------------------~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~~----- 208 (1167)
T KOG3546|consen 166 V--------------------------------AGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGA----- 208 (1167)
T ss_pred e--------------------------------cCceEEEEechHHhcccchhccccceeecCCcceEEeccCCC-----
Confidence 9 9999999999776444332222223 334445888754322
Q ss_pred cccccCceeeEEEEEEc
Q psy5025 485 SKYHHSFKGWISNLILS 501 (533)
Q Consensus 485 ~~~~~~f~GCi~~l~i~ 501 (533)
-...|+|.|..|++.
T Consensus 209 --~~~~f~g~~~~l~v~ 223 (1167)
T KOG3546|consen 209 --DPDKFQGVIAELKVR 223 (1167)
T ss_pred --ChHhhhhhhhheeec
Confidence 234699999999994
No 64
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=87.00 E-value=0.69 Score=42.29 Aligned_cols=101 Identities=18% Similarity=0.249 Sum_probs=52.7
Q ss_pred ceeEEEcCCcccccceeEEEEEEEe-c-CCCeeEEEcccc--CCCCCCeEEEEEECCEEEEEEEcCCCceE--------E
Q psy5025 92 SSYLRYQGLGEQYLSLLDLTIVFKA-I-EPNGILLYNGHR--ADGVGDFIALYLNDRYVDFTFDLGTGAAT--------L 159 (533)
Q Consensus 92 ~syl~~~~~~~~~~~~~~ls~~FrT-~-~~dGlLl~~~~~--~~~~~~fl~l~L~~G~l~~~~~~g~~~~~--------v 159 (533)
.+++.... ....+.|++.||. . ...|++|..... .......+.+.|.++.-........+... .
T Consensus 43 ~~~l~~~~----~~~df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~ 118 (185)
T PF06439_consen 43 GGYLYTDK----KFSDFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEP 118 (185)
T ss_dssp S--EEESS----EBSSEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-S
T ss_pred cceEEECC----ccccEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccc
Confidence 45555542 2346888888884 3 344555554401 11124556777765432210011112111 1
Q ss_pred EeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeC
Q psy5025 160 RSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILS 196 (533)
Q Consensus 160 ~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~ 196 (533)
........++||+++|+..+..+++.||+........
T Consensus 119 ~~~~~~~~~~W~~~~I~~~g~~i~v~vnG~~v~~~~d 155 (185)
T PF06439_consen 119 SVNVAIPPGEWNTVRIVVKGNRITVWVNGKPVADFTD 155 (185)
T ss_dssp SS--S--TTSEEEEEEEEETTEEEEEETTEEEEEEET
T ss_pred cccccCCCCceEEEEEEEECCEEEEEECCEEEEEEEc
Confidence 1235577899999999999999999999976554443
No 65
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=86.49 E-value=10 Score=35.89 Aligned_cols=67 Identities=13% Similarity=0.193 Sum_probs=45.4
Q ss_pred CCeEEEEEECCEEEEEEEcCC----ceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhcccc
Q psy5025 363 KDFLALGIHNGYAHVMYNLGN----GEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLH 438 (533)
Q Consensus 363 ~~~l~l~l~~g~l~~~~~~g~----~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (533)
.-.++|.+.++.+.+.+.... ...........+.-|+||++.+... +-..
T Consensus 113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~----------------------~s~~---- 166 (224)
T PF14099_consen 113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVK----------------------WSPD---- 166 (224)
T ss_dssp EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEE----------------------E-CC----
T ss_pred CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEE----------------------ECCC----
Confidence 457889999999999987655 1222223456688899999998871 1111
Q ss_pred ccCcEEEEEECCceeeeE
Q psy5025 439 RTGKQCLIRIDTDYSMQS 456 (533)
Q Consensus 439 ~~~~~~~l~vD~~~~~~~ 456 (533)
....+.|.+||+.+...
T Consensus 167 -~~G~~~vw~nG~~v~~~ 183 (224)
T PF14099_consen 167 -SDGFLEVWLNGKLVVDY 183 (224)
T ss_dssp -CTEEEEEEECCEECCEE
T ss_pred -CCEEEEEEECCEEEEEE
Confidence 45789999999876554
No 66
>KOG3546|consensus
Probab=85.99 E-value=2.7 Score=44.71 Aligned_cols=129 Identities=18% Similarity=0.251 Sum_probs=78.6
Q ss_pred ceeEEEEEEEecC-CCeeEEEccccCCCCCCeEEEEE---ECCEEEEEE--Ec-CCCceE--EEeeccccCCCcEEEEEE
Q psy5025 106 SLLDLTIVFKAIE-PNGILLYNGHRADGVGDFIALYL---NDRYVDFTF--DL-GTGAAT--LRSSNPISLGEWHKLRLT 176 (533)
Q Consensus 106 ~~~~ls~~FrT~~-~dGlLl~~~~~~~~~~~fl~l~L---~~G~l~~~~--~~-g~~~~~--v~s~~~lnDg~Wh~V~v~ 176 (533)
+.+.|.|-.|... .-|+||...+... .--|+-|.| +||+-.+.+ .. +++... ......+--++|.++.++
T Consensus 87 rdf~~~~~i~p~s~~~gvlfaitd~~q-~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a~~ 165 (1167)
T KOG3546|consen 87 RDFSLLFHIRPATEGPGVLFAITDSAQ-AMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALS 165 (1167)
T ss_pred ccceEEEEeeccCCCCceEEEechhhh-hhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhheeee
Confidence 4566777777654 5566666654421 123444444 467544433 32 222211 123456678999999999
Q ss_pred EECCEEEEEEcCccCceeeCCCCceeee--cCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEE
Q psy5025 177 RTGRHAYLQVDRFPSSQILSPGPFTQLS--LSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHE 243 (533)
Q Consensus 177 ~~~~~~~L~VD~~~~~~~~~~~~~~~l~--~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~ 243 (533)
..+..+.|.||-+.-+..........+. ....||++..-... ...|.|-|..+++...+
T Consensus 166 v~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~~~--------~~~f~g~~~~l~v~~dp 226 (1167)
T KOG3546|consen 166 VAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGAD--------PDKFQGVIAELKVRRDP 226 (1167)
T ss_pred ecCceEEEEechHHhcccchhccccceeecCCcceEEeccCCCC--------hHhhhhhhhheeecCCC
Confidence 9999999999986644433333333333 34568988665443 26799999999886543
No 67
>KOG1225|consensus
Probab=84.70 E-value=0.67 Score=49.24 Aligned_cols=40 Identities=33% Similarity=0.844 Sum_probs=23.7
Q ss_pred ceee-cCCCCCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCcc
Q psy5025 35 IKGR-SKECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQ 79 (533)
Q Consensus 35 ~~G~-~~~C~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce 79 (533)
..|| +..|.+-.|... |..++.|++. .|.|+++|.|+.|+
T Consensus 270 ~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs 310 (525)
T KOG1225|consen 270 PPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPGYSGKDCS 310 (525)
T ss_pred CCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCCccccccc
Confidence 5666 666665555554 6555665554 56666666666665
No 68
>KOG1217|consensus
Probab=84.24 E-value=0.84 Score=48.36 Aligned_cols=35 Identities=51% Similarity=1.235 Sum_probs=32.6
Q ss_pred CCCCCCCC-CCCCCeeeeCCCCceecCCCCCCCCCc
Q psy5025 44 EDKCNKTP-CEHGGKCITTPGSSYCLCPLGFTGDLC 78 (533)
Q Consensus 44 ~d~C~~~p-C~NggtC~~~~~~y~C~C~~g~~G~~C 78 (533)
.+.|...+ |.|+++|++..+.|.|.|+++|+|..|
T Consensus 271 ~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 271 VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence 78899874 999999999999999999999999998
No 69
>KOG1217|consensus
Probab=83.56 E-value=0.94 Score=47.97 Aligned_cols=47 Identities=43% Similarity=1.071 Sum_probs=39.6
Q ss_pred ceee-cCCCCC--CCCC--CCCCCCCCeeeeCCCCceecCCCCCCCCCcccc
Q psy5025 35 IKGR-SKECLE--DKCN--KTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 35 ~~G~-~~~C~~--d~C~--~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~ 81 (533)
..|| +..|.. +.|. .++|.|+++|.+...+|.|.|+.+|.|..|+..
T Consensus 157 ~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~ 208 (487)
T KOG1217|consen 157 TEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT 208 (487)
T ss_pred CCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC
Confidence 4566 666663 5898 567999999999999999999999999999875
No 70
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=82.40 E-value=1.1 Score=42.52 Aligned_cols=35 Identities=40% Similarity=0.923 Sum_probs=27.2
Q ss_pred cCCCC-CCCCCC--CCCCCCCeeeeCCCCceecCCCCCCC
Q psy5025 39 SKECL-EDKCNK--TPCEHGGKCITTPGSSYCLCPLGFTG 75 (533)
Q Consensus 39 ~~~C~-~d~C~~--~pC~NggtC~~~~~~y~C~C~~g~~G 75 (533)
..-|. .++|.. ++|. ..|.+..++|.|.|+.||+.
T Consensus 181 ~~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence 45675 577853 4565 58999999999999999964
No 71
>KOG1225|consensus
Probab=81.04 E-value=1.3 Score=47.04 Aligned_cols=41 Identities=37% Similarity=0.875 Sum_probs=34.2
Q ss_pred ceee-cCCCCCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccc
Q psy5025 35 IKGR-SKECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQT 80 (533)
Q Consensus 35 ~~G~-~~~C~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~ 80 (533)
..|| ++.|+.-.|..+ |.+.|.|++. +|+|++||+|++|+.
T Consensus 239 ~~~~~g~~c~~~~C~~~-c~~~g~c~~G----~CIC~~Gf~G~dC~e 280 (525)
T KOG1225|consen 239 PEGYFGPLCSTIYCPGG-CTGRGQCVEG----RCICPPGFTGDDCDE 280 (525)
T ss_pred CCceeCCccccccCCCC-CcccceEeCC----eEeCCCCCcCCCCCc
Confidence 6677 888886677764 7677889887 899999999999998
No 72
>KOG4260|consensus
Probab=79.58 E-value=1.3 Score=42.29 Aligned_cols=59 Identities=29% Similarity=0.655 Sum_probs=42.8
Q ss_pred ceeEEEEcCCCC--cee-ecCCCCC-CCCCCCCCCCCCeeeeCC---CCceecCCCCCCCCCcccc
Q psy5025 23 WDAWIQLNDGNK--IKG-RSKECLE-DKCNKTPCEHGGKCITTP---GSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 23 ~~~~l~vd~~~~--~~G-~~~~C~~-d~C~~~pC~NggtC~~~~---~~y~C~C~~g~~G~~Ce~~ 81 (533)
...||-||+.-. +.| |++.|.. ..=+..||.-.|.|.-.. ++-.|.|.+||+|+.|.+-
T Consensus 119 l~~WlCvdqLkvCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~C 184 (350)
T KOG4260|consen 119 LFNWLCVDQLKVCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYC 184 (350)
T ss_pred HHhHhhhhhheeccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccccc
Confidence 345666666554 444 4888873 111356899889999775 7889999999999999875
No 73
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=78.52 E-value=1.6 Score=30.34 Aligned_cols=24 Identities=29% Similarity=0.697 Sum_probs=19.3
Q ss_pred CeeeeCCCCceecCCCCCCCCCcccc
Q psy5025 56 GKCITTPGSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 56 gtC~~~~~~y~C~C~~g~~G~~Ce~~ 81 (533)
.+|.. ....|.|+.+|+|+.|+.-
T Consensus 11 ~~C~~--~~G~C~C~~~~~G~~C~~C 34 (49)
T PF00053_consen 11 QTCDP--STGQCVCKPGTTGPRCDQC 34 (49)
T ss_dssp SSEEE--TCEEESBSTTEESTTS-EE
T ss_pred CcccC--CCCEEeccccccCCcCcCC
Confidence 47766 6779999999999999974
No 74
>KOG1214|consensus
Probab=76.94 E-value=1.9 Score=47.43 Aligned_cols=31 Identities=16% Similarity=0.550 Sum_probs=28.8
Q ss_pred CCCCCCCC-CCCeeeecCCceEEecCCCCCcC
Q psy5025 261 HPCITQPC-GSGSCVPLYDGYKCVCTQSCKES 291 (533)
Q Consensus 261 ~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~ 291 (533)
+.|.++-| ..+.|+...++|.|.|.+||.|+
T Consensus 828 DeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 828 DECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred cccCccccCCCceEecCCCcceeecccCccCC
Confidence 57999999 89999999999999999999993
No 75
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=76.87 E-value=1.9 Score=27.96 Aligned_cols=20 Identities=30% Similarity=0.660 Sum_probs=16.0
Q ss_pred CeeeecCCceEEecCCCCCc
Q psy5025 271 GSCVPLYDGYKCVCTQSCKE 290 (533)
Q Consensus 271 g~C~~~~~~~~C~C~~g~~G 290 (533)
-.|++...+|.|.|++||.-
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE
T ss_pred CCCccCCCceEeECCCCCEE
Confidence 36888889999999999854
No 76
>KOG1836|consensus
Probab=76.71 E-value=2.8 Score=50.95 Aligned_cols=105 Identities=18% Similarity=0.250 Sum_probs=68.7
Q ss_pred CCeEEEEEECCEEEEEEEcCCC--ceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEE
Q psy5025 134 GDFIALYLNDRYVDFTFDLGTG--AATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYL 211 (533)
Q Consensus 134 ~~fl~l~L~~G~l~~~~~~g~~--~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~v 211 (533)
..+..+....+.....+..+-. ...+.....+-++.||.+...+....+.+..|... . .+......+...++++
T Consensus 1581 ~~~~~~~~~~~~~~~~~~~gi~~~~~s~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~-~---~~~~~~~~~~~~p~~~ 1656 (1705)
T KOG1836|consen 1581 GTELLVQENPIGVTEKFESGITDLSTSSTPIVSLLPGGCHSVTSSTDPGVVQLEDDTYT-V---GEIPPPPADTQEPIKL 1656 (1705)
T ss_pred chHHhhhhcccccchhhhhhhhhhhhcchhhhhhcCCcceeeeeecCCcccccccccee-c---ccCCCCchhccCCccc
Confidence 4555555555555555444322 22333446778999999999999998888888721 1 1222333445578999
Q ss_pred cCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025 212 GGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN 245 (533)
Q Consensus 212 GG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~ 245 (533)
||++..... ........|.||| +++++.+.+
T Consensus 1657 ~~~~~s~~~-~~~~~~~~~~~~~--~~~~~~~~~ 1687 (1705)
T KOG1836|consen 1657 GGYPSSLTT-LRIAVLKSFTGCI--FVVMGIRVD 1687 (1705)
T ss_pred CCccccccc-eeeecccccccce--EEecCCCCc
Confidence 999987653 2334457899999 888888776
No 77
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=75.31 E-value=10 Score=31.43 Aligned_cols=32 Identities=31% Similarity=0.707 Sum_probs=25.4
Q ss_pred CCCCCC-CCCCCCCeeeeCCCCceecCCCCCCCC
Q psy5025 44 EDKCNK-TPCEHGGKCITTPGSSYCLCPLGFTGD 76 (533)
Q Consensus 44 ~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~G~ 76 (533)
.|.|+. ..|...|.|.. .....|.|.+||.-+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 578874 79999999954 366789999999643
No 78
>KOG0994|consensus
Probab=68.34 E-value=16 Score=42.17 Aligned_cols=136 Identities=18% Similarity=0.311 Sum_probs=77.1
Q ss_pred CCCCCCCCCCCCeeeeC--CCCceecCCCCCCCCCcccccc---------------------------------cccccc
Q psy5025 45 DKCNKTPCEHGGKCITT--PGSSYCLCPLGFTGDLCQTRLD---------------------------------LQVPQF 89 (533)
Q Consensus 45 d~C~~~pC~NggtC~~~--~~~y~C~C~~g~~G~~Ce~~~~---------------------------------~~~~~F 89 (533)
+.|.+-.|.-||.=-+. ..+-.|.|..++.|++|++.-. ...++|
T Consensus 508 ~gc~~cdcd~GGs~d~sc~~~sGqC~CRe~~~GR~c~~~~~~yy~~~l~h~i~eAe~~~~~~~~~v~~r~~~~~~~~~sf 587 (1758)
T KOG0994|consen 508 EGCRPCDCDQGGSYDNSCDLHSGQCECREHMLGRRCEQVCPGYYSPVLDHYIYEAEDAGTGVEVNVKERKVLKSTKLPSF 587 (1758)
T ss_pred CCCcccccCCCCCCCcccccccCccccccccccccccccCCcccccccchhhhhhhhccccceeeeeeeeecccCCCccc
Confidence 44666666666432222 2567899999999999997521 014578
Q ss_pred CCceeEEEcCCcc--------cccceeEEEEEEEecCC----CeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCce
Q psy5025 90 NGSSYLRYQGLGE--------QYLSLLDLTIVFKAIEP----NGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAA 157 (533)
Q Consensus 90 ~g~syl~~~~~~~--------~~~~~~~ls~~FrT~~~----dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~ 157 (533)
.|.+|++.+-... +.+..+.+-+.+..+.+ +..|+.......+.+.--.+.+.+.++.+.+..++.-+
T Consensus 588 tG~gf~r~~e~~~l~f~~~~ip~sm~Ydv~ir~~~~~~~~wen~~itvqrp~~p~~g~c~~~~~~dd~~~~sl~p~sRyv 667 (1758)
T KOG0994|consen 588 TGKGFVRVPEGTTLEFTVPIIPPSMEYDVLIRYDPRTPKLWENAKITVQRPGQPSLGRCGMAIPKDDRIPFSLPPGSRYV 667 (1758)
T ss_pred cccceeecCCCceeeeecCCCCcccccchheeccCCCcchhhhheEEeecCCCCcccccccccccccccccccCCCceee
Confidence 8888887764311 11123344444443332 22233322221222333444556677777777665444
Q ss_pred EEEeeccccCCCcEEEEEEEECC
Q psy5025 158 TLRSSNPISLGEWHKLRLTRTGR 180 (533)
Q Consensus 158 ~v~s~~~lnDg~Wh~V~v~~~~~ 180 (533)
.+..+.-+..|.-+.++|....+
T Consensus 668 v~~~~vClE~G~~Yklri~~~~~ 690 (1758)
T KOG0994|consen 668 VAPNPVCLEAGKVYKLRIYFERK 690 (1758)
T ss_pred ecCCchhhccCcceEEEEEeccc
Confidence 44455678889999888887653
No 79
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=64.81 E-value=6 Score=27.68 Aligned_cols=22 Identities=32% Similarity=0.708 Sum_probs=17.6
Q ss_pred eeeCCCCceecCCCCCCCCCcccc
Q psy5025 58 CITTPGSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 58 C~~~~~~y~C~C~~g~~G~~Ce~~ 81 (533)
|.. ..-.|.|+.+++|+.|+.-
T Consensus 14 C~~--~~G~C~C~~~~~G~~C~~C 35 (50)
T cd00055 14 CDP--GTGQCECKPNTTGRRCDRC 35 (50)
T ss_pred ccC--CCCEEeCCCcCCCCCCCCC
Confidence 544 4567999999999999964
No 80
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=62.79 E-value=6.6 Score=24.93 Aligned_cols=13 Identities=15% Similarity=0.419 Sum_probs=9.9
Q ss_pred CceEEecCCCCCc
Q psy5025 278 DGYKCVCTQSCKE 290 (533)
Q Consensus 278 ~~~~C~C~~g~~G 290 (533)
+...|.||.||--
T Consensus 16 ~~~~C~CPeGyIl 28 (34)
T PF09064_consen 16 SPGQCFCPEGYIL 28 (34)
T ss_pred CCCceeCCCceEe
Confidence 4568999999853
No 81
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=59.42 E-value=1.9e+02 Score=32.10 Aligned_cols=60 Identities=22% Similarity=0.324 Sum_probs=39.1
Q ss_pred CCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCc
Q psy5025 363 KDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGK 442 (533)
Q Consensus 363 ~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (533)
+-||.| ..+|.-.|.-++.+.. .+......+.-++||+|++.. .+.
T Consensus 578 G~~f~v-~~~G~w~vt~d~~~~~-~l~~G~~~~~~~~WhtltL~~--------------------------------~g~ 623 (669)
T PF02057_consen 578 GYFFWV-YANGTWSVTSDLAGTT-TLASGTADIGAGKWHTLTLTI--------------------------------SGS 623 (669)
T ss_dssp EEEEEE-ETTTEEEEEEETTS-S-EEEEEE-S--TT-EEEEEEEE--------------------------------ETT
T ss_pred eEEEEE-EcCCcEEEeccCCCcE-EEeeeeecccCCeEEEEEEEE--------------------------------ECC
Confidence 345544 5588877776665432 332233458889999999999 889
Q ss_pred EEEEEECCceeeeE
Q psy5025 443 QCLIRIDTDYSMQS 456 (533)
Q Consensus 443 ~~~l~vD~~~~~~~ 456 (533)
.+.-.+|+......
T Consensus 624 ~~ta~lng~~l~~~ 637 (669)
T PF02057_consen 624 TATAMLNGTVLWTD 637 (669)
T ss_dssp EEEEEETTEEEEEE
T ss_pred EEEEEECCEEeEEe
Confidence 99999999877653
No 82
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=59.17 E-value=7 Score=31.95 Aligned_cols=37 Identities=27% Similarity=0.718 Sum_probs=28.5
Q ss_pred CCCC--CCCCCCCCeeeeCC-----CCceecCCC-------------CCCCCCcccc
Q psy5025 45 DKCN--KTPCEHGGKCITTP-----GSSYCLCPL-------------GFTGDLCQTR 81 (533)
Q Consensus 45 d~C~--~~pC~NggtC~~~~-----~~y~C~C~~-------------g~~G~~Ce~~ 81 (533)
+.|. .+-|...|.|++.. +=|.|.|.+ .|.|+.|++.
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKk 62 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKK 62 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccc
Confidence 3454 46788899999984 559999987 5778888875
No 83
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=56.15 E-value=9.7 Score=36.08 Aligned_cols=29 Identities=28% Similarity=0.671 Sum_probs=22.5
Q ss_pred CCCCCC--CCCCCeeeecCCceEEecCCCCCc
Q psy5025 261 HPCITQ--PCGSGSCVPLYDGYKCVCTQSCKE 290 (533)
Q Consensus 261 ~~C~~~--pCngg~C~~~~~~~~C~C~~g~~G 290 (533)
+.|... +| ...|.+...+|.|.|+.||..
T Consensus 188 ~~C~~~~~~c-~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 188 DLCATLSHVC-QQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred hhhcCCCCCc-cceEEcCCCCEEeECCCCccC
Confidence 456543 45 347999999999999999975
No 84
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=54.10 E-value=53 Score=30.86 Aligned_cols=59 Identities=12% Similarity=0.122 Sum_probs=41.0
Q ss_pred CCeEEEEEECCEEEEEEEcCC-C----ceEEEeeccccCCCcEEEEEEEE-----CCEEEEEEcCccCc
Q psy5025 134 GDFIALYLNDRYVDFTFDLGT-G----AATLRSSNPISLGEWHKLRLTRT-----GRHAYLQVDRFPSS 192 (533)
Q Consensus 134 ~~fl~l~L~~G~l~~~~~~g~-~----~~~v~s~~~lnDg~Wh~V~v~~~-----~~~~~L~VD~~~~~ 192 (533)
...++|.+.++++.+.+.... . .........+.-|+||+|.+... ...+.+.+|+....
T Consensus 113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~ 181 (224)
T PF14099_consen 113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVV 181 (224)
T ss_dssp EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECC
T ss_pred CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEE
Confidence 668999999999999988765 1 11222345677799999887643 35789999996543
No 85
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=53.01 E-value=98 Score=28.23 Aligned_cols=83 Identities=11% Similarity=0.134 Sum_probs=48.0
Q ss_pred eEEEEEEEeCCCCeEEEEeccCCCCCeEEE-EEE-CCEEEEEEEcCCceEEEeeCceeecC-CCeEEEEEEEeeeeeecC
Q psy5025 339 FNIDMRIKTSSSSGLILWTGSSQSKDFLAL-GIH-NGYAHVMYNLGNGEVLLENNQTKINN-GVWHRLTVHRNFMVVMHG 415 (533)
Q Consensus 339 ~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l-~l~-~g~l~~~~~~g~~~~~~~~~~~~~~d-g~wH~v~~~~~~~~~~~~ 415 (533)
++++-..|-.+....-.|.++.. ..||.. .|. +|.|++.++-.++...+. ....| ...|..++....
T Consensus 34 W~ls~~~RV~~G~~n~~yyAnG~-~r~l~~lsvn~sG~LvA~L~g~ss~~~~~---~~~~di~gyH~Y~i~~~p------ 103 (198)
T PF09264_consen 34 WSLSWESRVVSGGCNTNYYANGS-KRYLPILSVNESGSLVAELEGQSSNTLLA---TTGADIHGYHKYEIVFSP------ 103 (198)
T ss_dssp EEEEEEEEEEEES-EEEEEEESS-EEEEEEEEE-TTS-EEEEETTS-S-EEEE----CHHHHCSEEEEEEEEET------
T ss_pred cceeeeEEEecCcceeEEEcCCc-eEEEEEEEEcCCCCEEEEEecCCCcEEEe---cccccccceeEEEEEecC------
Confidence 44444444444333333333332 456644 555 788999987766665554 22223 678999998822
Q ss_pred CcccccCCcchhhhhhhhhccccccCcEEEEEECCceeee
Q psy5025 416 SSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQ 455 (533)
Q Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~ 455 (533)
.....+++|||..+.+
T Consensus 104 ------------------------~~~tASfy~DG~lI~t 119 (198)
T PF09264_consen 104 ------------------------LTNTASFYFDGTLIAT 119 (198)
T ss_dssp ------------------------TTTEEEEEETTEEEEE
T ss_pred ------------------------CCCceEEEECCEEEee
Confidence 4478999999998876
No 86
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=52.41 E-value=2e+02 Score=26.90 Aligned_cols=10 Identities=20% Similarity=0.285 Sum_probs=9.7
Q ss_pred CCeEEEEEEE
Q psy5025 398 GVWHRLTVHR 407 (533)
Q Consensus 398 g~wH~v~~~~ 407 (533)
|+||++.|..
T Consensus 154 g~~~~v~I~Y 163 (223)
T cd01951 154 GNEHTVRITY 163 (223)
T ss_pred CCEEEEEEEE
Confidence 9999999999
No 87
>KOG4260|consensus
Probab=51.45 E-value=7.7 Score=37.16 Aligned_cols=35 Identities=29% Similarity=0.703 Sum_probs=29.8
Q ss_pred CCC-CCCCC--CCCCCCCCeeeeCCCCceecCCCCCCC
Q psy5025 41 ECL-EDKCN--KTPCEHGGKCITTPGSSYCLCPLGFTG 75 (533)
Q Consensus 41 ~C~-~d~C~--~~pC~NggtC~~~~~~y~C~C~~g~~G 75 (533)
+|- +|+|. ++||....-|++..++|.|.+..||.+
T Consensus 232 gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 232 GCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred ccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 444 57775 789999999999999999999999965
No 88
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=49.92 E-value=15 Score=30.38 Aligned_cols=29 Identities=24% Similarity=0.541 Sum_probs=22.0
Q ss_pred CCCC-CCCC-CCCeeeecCCceEEecCCCCCc
Q psy5025 261 HPCI-TQPC-GSGSCVPLYDGYKCVCTQSCKE 290 (533)
Q Consensus 261 ~~C~-~~pC-ngg~C~~~~~~~~C~C~~g~~G 290 (533)
+.|. ...| ..|.|... ....|.|.+||..
T Consensus 78 d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNSN-NSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCccccCCccEeCCC-CCCceECCCCcCC
Confidence 3566 4789 88999543 5667999999964
No 89
>cd00413 Glyco_hydrolase_16 glycosyl hydrolase family 16. The O-Glycosyl hydrolases are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A glycosyl hydrolase classification system based on sequence similarity has led to the definition of more than 95 different families inlcuding glycosyl hydrolase family 16. Family 16 includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=49.44 E-value=2.1e+02 Score=26.26 Aligned_cols=30 Identities=13% Similarity=0.062 Sum_probs=26.5
Q ss_pred cCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEe
Q psy5025 396 NNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSI 457 (533)
Q Consensus 396 ~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~ 457 (533)
..+.||...+.+ ....++.+|||..+....
T Consensus 140 ~~~~~H~Y~~~W--------------------------------~~~~i~~yvDG~~~~~~~ 169 (210)
T cd00413 140 PADDFHTYRVDW--------------------------------TPGEITFYVDGVLVATIT 169 (210)
T ss_pred CccCeEEEEEEE--------------------------------eCCEEEEEECCEEEEEEC
Confidence 578999999999 889999999999887753
No 90
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=48.30 E-value=16 Score=23.86 Aligned_cols=28 Identities=25% Similarity=0.691 Sum_probs=20.5
Q ss_pred CCCCCC-CCCeeeecC-CceEEecCCCCCc
Q psy5025 263 CITQPC-GSGSCVPLY-DGYKCVCTQSCKE 290 (533)
Q Consensus 263 C~~~pC-ngg~C~~~~-~~~~C~C~~g~~G 290 (533)
|...+| -++.|.... ..+.|.|-+||..
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~ 31 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKK 31 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccc
Confidence 677789 889999887 6799999999864
No 91
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=47.71 E-value=6.6 Score=27.84 Aligned_cols=32 Identities=28% Similarity=0.647 Sum_probs=16.4
Q ss_pred CCCCCCeeeeC----CCCceecCCCCCCCCCccccc
Q psy5025 51 PCEHGGKCITT----PGSSYCLCPLGFTGDLCQTRL 82 (533)
Q Consensus 51 pC~NggtC~~~----~~~y~C~C~~g~~G~~Ce~~~ 82 (533)
+|.-.|...-. .+...|.|..-|.|++|++.+
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence 45444544422 255899999999999999864
No 92
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=47.69 E-value=13 Score=25.54 Aligned_cols=18 Identities=28% Similarity=0.641 Sum_probs=15.6
Q ss_pred CceecCCCCCCCCCcccc
Q psy5025 64 SSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 64 ~y~C~C~~g~~G~~Ce~~ 81 (533)
.-.|.|+++++|+.|+.-
T Consensus 17 ~G~C~C~~~~~G~~C~~C 34 (46)
T smart00180 17 TGQCECKPNVTGRRCDRC 34 (46)
T ss_pred CCEEECCCCCCCCCCCcC
Confidence 458999999999999964
No 93
>PF07622 DUF1583: Protein of unknown function (DUF1583); InterPro: IPR011475 Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM.
Probab=43.71 E-value=28 Score=35.44 Aligned_cols=31 Identities=23% Similarity=0.308 Sum_probs=28.4
Q ss_pred CCccccCCCceEEEEEEeCceeEEEEcCCCC
Q psy5025 4 SDNQVILDDWNTVRVFRHRWDAWIQLNDGNK 34 (533)
Q Consensus 4 s~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~ 34 (533)
....+.++.|.+|.++|.+.++.+.||++..
T Consensus 83 ~~~~l~~~~wN~v~l~~~g~~v~l~LN~~~i 113 (399)
T PF07622_consen 83 PTLPLKVNAWNRVRLQRRGDKVQLHLNGQLI 113 (399)
T ss_pred CCCCCCccccceEEEEEeCCEEEEEeCCcee
Confidence 3568899999999999999999999999887
No 94
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=39.95 E-value=38 Score=23.64 Aligned_cols=26 Identities=31% Similarity=1.008 Sum_probs=18.5
Q ss_pred CCCC-CCCCCCCCeeeeCCCCceecCCCCCC
Q psy5025 45 DKCN-KTPCEHGGKCITTPGSSYCLCPLGFT 74 (533)
Q Consensus 45 d~C~-~~pC~NggtC~~~~~~y~C~C~~g~~ 74 (533)
+.|. ...|..+..|++. +|.|+.+|.
T Consensus 20 ~~C~~~~qC~~~s~C~~g----~C~C~~g~~ 46 (52)
T PF01683_consen 20 ESCESDEQCIGGSVCVNG----RCQCPPGYV 46 (52)
T ss_pred CCCCCcCCCCCcCEEcCC----EeECCCCCE
Confidence 3354 3467778899664 899999883
No 95
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=37.27 E-value=29 Score=28.46 Aligned_cols=22 Identities=45% Similarity=1.159 Sum_probs=18.0
Q ss_pred CCC-CCCeeeecC-----CceEEecCCC
Q psy5025 266 QPC-GSGSCVPLY-----DGYKCVCTQS 287 (533)
Q Consensus 266 ~pC-ngg~C~~~~-----~~~~C~C~~g 287 (533)
+-| ++|.|+... +=|.|.|.+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T 40 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPT 40 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeecc
Confidence 679 889999985 3499999983
No 96
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=36.27 E-value=3.9e+02 Score=25.44 Aligned_cols=13 Identities=8% Similarity=0.043 Sum_probs=11.9
Q ss_pred ecCCCeEEEEEEE
Q psy5025 395 INNGVWHRLTVHR 407 (533)
Q Consensus 395 ~~dg~wH~v~~~~ 407 (533)
+.||++|++.|..
T Consensus 160 l~~g~~~~v~I~Y 172 (236)
T cd06899 160 LKSGKPMQAWIDY 172 (236)
T ss_pred ccCCCeEEEEEEE
Confidence 5799999999999
No 97
>PF07953 Toxin_R_bind_N: Clostridium neurotoxin, N-terminal receptor binding; InterPro: IPR012928 The Clostridium neurotoxin family is composed of tetanus neurotoxin and seven serotypes of botulinum neurotoxin. The structure of the botulinum neurotoxin reveals a four domain protein. The N-terminal catalytic domain (IPR000395 from INTERPRO), the central translocation domain and two receptor binding domains []. This domain is the N-terminal receptor binding domain, which is comprised of two seven-stranded beta-sheets sandwiched together to form a jelly role motif []. The role of this domain in receptor binding appears to be indirect. ; GO: 0004222 metalloendopeptidase activity, 0050827 toxin receptor binding, 0009405 pathogenesis, 0051609 inhibition of neurotransmitter uptake, 0005576 extracellular region; PDB: 3RSJ_B 3FUQ_A 3FFZ_B 1G9B_A 1S0F_A 1Z0H_B 1F31_A 1G9D_A 1S0C_A 1S0D_A ....
Probab=35.38 E-value=3.5e+02 Score=24.75 Aligned_cols=88 Identities=15% Similarity=0.136 Sum_probs=56.4
Q ss_pred ceeeEEEEEEEeCCCC---------eEEEEeccCCCCCeEEEEEECCEEEEEEEcCCc-eEEEee--Cc-eeecC--CCe
Q psy5025 336 GYEFNIDMRIKTSSSS---------GLILWTGSSQSKDFLALGIHNGYAHVMYNLGNG-EVLLEN--NQ-TKINN--GVW 400 (533)
Q Consensus 336 ~~~~~i~~~~rt~~~~---------g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~-~~~~~~--~~-~~~~d--g~w 400 (533)
...|+|+|++|..+.. .+|=.... ..=-.++++++.+...+.-.++ ...+.+ +. ..++| .+|
T Consensus 54 ~~nFSIsFWlRipk~~~~~~~~neytII~~~~n---NsGWkI~l~~n~iiwtl~D~ng~~k~i~f~y~~~~~~SdyiNkW 130 (195)
T PF07953_consen 54 YNNFSISFWLRIPKYDNNINLNNEYTIINCMKN---NSGWKISLNNNGIIWTLIDSNGNEKSIYFNYSIMDNISDYINKW 130 (195)
T ss_dssp SSEEEEEEEEEEECHHCCHHTTSEEEEEEEEET---TEEEEEEEETTEEEEEEEETTSEEEEEEEESSSTSSTTSSTTSE
T ss_pred ccceEEEEEEEccCcccccccCcceEEEEeecC---CCceEEEEeCCcEEEEEEeCCCCEEEEEEEcccccchhhhcccE
Confidence 3468999999997522 23333321 2445788999999988765444 333333 11 22333 799
Q ss_pred EEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEe
Q psy5025 401 HRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSI 457 (533)
Q Consensus 401 H~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~ 457 (533)
|.|+++-.+ - +...|+|+|....+..
T Consensus 131 ~fITITnnr------------------------------L-~~~~IyINg~Li~~~~ 156 (195)
T PF07953_consen 131 FFITITNNR------------------------------L-GNSKIYINGNLIDNES 156 (195)
T ss_dssp EEEEEEEET------------------------------T-SEEEEEETTEEEEEEE
T ss_pred EEEEEEccc------------------------------C-ccceEEECCEEEcccc
Confidence 999999832 3 4449999999887754
No 98
>PTZ00334 trans-sialidase; Provisional
Probab=31.90 E-value=68 Score=36.23 Aligned_cols=78 Identities=17% Similarity=0.149 Sum_probs=45.5
Q ss_pred eeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceE
Q psy5025 393 TKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLY 472 (533)
Q Consensus 393 ~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~ 472 (533)
....-++-|+|.|.. +++++..++|||+.+.....+.......--+.+|
T Consensus 638 stWe~~k~yqVal~L-------------------------------~~G~~gsvYVDG~~vg~~~~~l~~~~~~~IshFy 686 (780)
T PTZ00334 638 SNWEPETTHQVAIVL-------------------------------RNGKQGSAYVDGQRVGDASCELKNTDSKGISHFY 686 (780)
T ss_pred ccccCCCeEEEEEEE-------------------------------eCCCeEEEEECCEEecCcccccCCCCCcccceEE
Confidence 346678889999887 1567999999999774422221111222224699
Q ss_pred EcCCCCcccccccccccCceeeEEEEEEcCceEEe
Q psy5025 473 VGGVEDIEELTRSKYHHSFKGWISNLILSNDYQVK 507 (533)
Q Consensus 473 lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~ 507 (533)
|||-..... ....-.=-++|+.+ -.++|+
T Consensus 687 iGgdg~~~~-----~~~~~~VTV~NVlL-YNRpL~ 715 (780)
T PTZ00334 687 IGGDGGSAG-----SKEDVPVTATNVLL-YNRPLD 715 (780)
T ss_pred ECCCccccc-----cCCCCCEEEeEeEE-eCCCCC
Confidence 998543211 01112225677777 566665
No 99
>KOG1226|consensus
Probab=31.27 E-value=43 Score=37.12 Aligned_cols=27 Identities=33% Similarity=0.880 Sum_probs=21.8
Q ss_pred CCCCCCeeeeCCCCceecCCCCCCCCCcccc
Q psy5025 51 PCEHGGKCITTPGSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 51 pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~ 81 (533)
.|...|.|.=. .|.|.+||+|+.|+-+
T Consensus 556 lC~g~G~C~CG----~CvC~~GwtG~~C~C~ 582 (783)
T KOG1226|consen 556 LCGGHGRCECG----RCVCNPGWTGSACNCP 582 (783)
T ss_pred ccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence 67777878554 7999999999998865
No 100
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=29.08 E-value=5.3e+02 Score=28.75 Aligned_cols=56 Identities=29% Similarity=0.332 Sum_probs=36.3
Q ss_pred CeEEEEEECCEEEEEEEcCCCceEEEee-ccccCCCcEEEEEEEECCEEEEEEcCccCc
Q psy5025 135 DFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWHKLRLTRTGRHAYLQVDRFPSS 192 (533)
Q Consensus 135 ~fl~l~L~~G~l~~~~~~g~~~~~v~s~-~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~ 192 (533)
=|+.| ..+|.-.+.-++.. ...+.+. ..+.-++||++.|...+..+.-.+|+....
T Consensus 579 ~~f~v-~~~G~w~vt~d~~~-~~~l~~G~~~~~~~~WhtltL~~~g~~~ta~lng~~l~ 635 (669)
T PF02057_consen 579 YFFWV-YANGTWSVTSDLAG-TTTLASGTADIGAGKWHTLTLTISGSTATAMLNGTVLW 635 (669)
T ss_dssp EEEEE-ETTTEEEEEEETTS--SEEEEEE-S--TT-EEEEEEEEETTEEEEEETTEEEE
T ss_pred EEEEE-EcCCcEEEeccCCC-cEEEeeeeecccCCeEEEEEEEEECCEEEEEECCEEeE
Confidence 34444 56787777656553 3344433 457779999999999999999999996543
No 101
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=28.69 E-value=4.1e+02 Score=24.84 Aligned_cols=23 Identities=22% Similarity=0.251 Sum_probs=19.5
Q ss_pred CCcEEEEEEEE--CCEEEEEEcCcc
Q psy5025 168 GEWHKLRLTRT--GRHAYLQVDRFP 190 (533)
Q Consensus 168 g~Wh~V~v~~~--~~~~~L~VD~~~ 190 (533)
|+||++.|... .+.+.+.++...
T Consensus 154 g~~~~v~I~Y~~~~~~L~v~l~~~~ 178 (223)
T cd01951 154 GNEHTVRITYDPTTNTLTVYLDNGS 178 (223)
T ss_pred CCEEEEEEEEeCCCCEEEEEECCCC
Confidence 99999999999 478888888754
No 102
>PF05910 DUF868: Plant protein of unknown function (DUF868); InterPro: IPR008586 This family consists of several hypothetical proteins from plants. The function of this family is unknown.
Probab=27.53 E-value=1.3e+02 Score=29.49 Aligned_cols=38 Identities=11% Similarity=0.131 Sum_probs=27.4
Q ss_pred CcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcc
Q psy5025 441 GKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIE 480 (533)
Q Consensus 441 ~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~ 480 (533)
...+.|.||+..+.+..-... .+..+..|+|+|+|...
T Consensus 173 dp~l~V~VDgk~v~~VkrL~W--kFRGNqti~vdg~~V~V 210 (274)
T PF05910_consen 173 DPELWVSVDGKKVVQVKRLRW--KFRGNQTIFVDGLPVQV 210 (274)
T ss_pred CceEEEEECCEEEEEEEEeee--cccCceEEEECCeEEEE
Confidence 458999999998877543221 24567779999988764
No 103
>PF03425 CBM_11: Carbohydrate binding domain (family 11); InterPro: IPR005087 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM11 from CAZY which binds both beta-1,4-glucan and beta-1,3-1,4-mixed linked glucans.; GO: 0008810 cellulase activity, 0030245 cellulose catabolic process; PDB: 1V0A_A.
Probab=26.47 E-value=1.7e+02 Score=26.45 Aligned_cols=49 Identities=16% Similarity=0.253 Sum_probs=19.1
Q ss_pred CCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEee
Q psy5025 350 SSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNF 409 (533)
Q Consensus 350 ~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~ 409 (533)
-+|+=|++....+..-|.|.|.++. ...+.......+.. |++|+|-.+.
T Consensus 73 ~~gl~Fw~k~dgs~~~l~vqi~d~~----------~~e~~~~~~~~~~~-W~~V~IPF~~ 121 (178)
T PF03425_consen 73 YGGLSFWIKGDGSGNKLRVQIKDGG----------DYEYWEASFTDSST-WKTVEIPFSD 121 (178)
T ss_dssp --EEEEEEEE------EEEEEEEE-----------EEEEEEEEE---SS--EEEEEEGGG
T ss_pred CCcEEEEEEcCCCCcEEEEEEecCC----------cceeeEeecCCCCc-CEEEEEEHHH
Confidence 3455555543333555666665554 11111123445555 9999998843
No 104
>PTZ00334 trans-sialidase; Provisional
Probab=24.78 E-value=5.7e+02 Score=29.18 Aligned_cols=74 Identities=15% Similarity=0.138 Sum_probs=44.5
Q ss_pred ccCCCcEEEEEEE-ECCEEEEEEcCccCceeeCC-CCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcE
Q psy5025 165 ISLGEWHKLRLTR-TGRHAYLQVDRFPSSQILSP-GPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNH 242 (533)
Q Consensus 165 lnDg~Wh~V~v~~-~~~~~~L~VD~~~~~~~~~~-~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~ 242 (533)
-..++-|+|.|.. +++...++||+........+ ......++ ..|||||-....... ....=-++++.+-++
T Consensus 640 We~~k~yqVal~L~~G~~gsvYVDG~~vg~~~~~l~~~~~~~I-shFyiGgdg~~~~~~------~~~~VTV~NVlLYNR 712 (780)
T PTZ00334 640 WEPETTHQVAIVLRNGKQGSAYVDGQRVGDASCELKNTDSKGI-SHFYIGGDGGSAGSK------EDVPVTATNVLLYNR 712 (780)
T ss_pred ccCCCeEEEEEEEeCCCeEEEEECCEEecCcccccCCCCCccc-ceEEECCCccccccC------CCCCEEEeEeEEeCC
Confidence 4567889999887 56789999999654221111 11122222 579999976321110 112224788888888
Q ss_pred Ecc
Q psy5025 243 ELN 245 (533)
Q Consensus 243 ~l~ 245 (533)
+++
T Consensus 713 pL~ 715 (780)
T PTZ00334 713 PLD 715 (780)
T ss_pred CCC
Confidence 886
No 105
>smart00289 WR1 Worm-specific repeat type 1. Worm-specific repeat type 1. Cysteine-rich domain apparently unique (so far) to C. elegans. Often appears with KU domains. About 3 dozen worm proteins contain this domain.
Probab=22.99 E-value=78 Score=20.21 Aligned_cols=23 Identities=30% Similarity=1.009 Sum_probs=16.8
Q ss_pred CCC-CCCCCCCCeeeeCCCCceecCC
Q psy5025 46 KCN-KTPCEHGGKCITTPGSSYCLCP 70 (533)
Q Consensus 46 ~C~-~~pC~NggtC~~~~~~y~C~C~ 70 (533)
.|. ..+|..|-.|+.. ..+.| |+
T Consensus 15 ~C~~~~~CP~g~~C~~~-~~~~C-C~ 38 (38)
T smart00289 15 RCSPNGSCPSGYSCQNS-KQGIC-CP 38 (38)
T ss_pred ECCCCCCCCCCCEEecC-CCccc-Cc
Confidence 465 6788888899887 66666 53
No 106
>cd02178 GH16_beta_agarase Beta-agarase, member of glycosyl hydrolase family 16. Beta-agarase is a glycosyl hydrolase family 16 (GH16) member that hydrolyzes the internal beta-1,4-linkage of agarose, a hydrophilic polysaccharide found in the cell wall of Rhodophyceaea, marine red algae. Agarose is a linear chain of galactose units linked by alternating L-alpha-1,3- and D-beta-1,4-linkages that are additionally modified by a 3,6-anhydro-bridge. Agarose forms thermo-reversible gels that are widely used in the food industry or as a laboratory medium. While beta-agarases are also found in two other families derived from the sequence-based classification of glycosyl hydrolases (GH50, and GH86) the GH16 members are most abundant. This domain adopts a curved beta-sandwich conformation, with a tunnel-shaped active site cavity, referred to as a jellyroll fold.
Probab=22.66 E-value=1.9e+02 Score=27.89 Aligned_cols=29 Identities=14% Similarity=-0.041 Sum_probs=25.0
Q ss_pred CCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhcccccc-CcEEEEEECCceeeeEe
Q psy5025 397 NGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRT-GKQCLIRIDTDYSMQSI 457 (533)
Q Consensus 397 dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~vD~~~~~~~~ 457 (533)
.+.||...|.+ + ...+..+|||..+....
T Consensus 178 ~~~fHtY~veW--------------------------------~~p~~i~fyvDG~~~~~~~ 207 (258)
T cd02178 178 ADDFHVYGVYW--------------------------------KDPDTIRFYIDGVLVRTVE 207 (258)
T ss_pred ccCeEEEEEEE--------------------------------cCCCeEEEEECCEEEEEEc
Confidence 46899999999 8 89999999999876653
No 107
>PF08787 Alginate_lyase2: Alginate lyase; InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alignate. They cleave the glycosidic linkage of alignate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=22.24 E-value=6.8e+02 Score=23.77 Aligned_cols=52 Identities=15% Similarity=0.210 Sum_probs=36.9
Q ss_pred ECCEEEEEEEc---CCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEE
Q psy5025 371 HNGYAHVMYNL---GNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIR 447 (533)
Q Consensus 371 ~~g~l~~~~~~---g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 447 (533)
..|.|.+.++. .+...... .-..+.=|+|..+.|.. .+..+.|+
T Consensus 135 ~~G~l~~~~~~~~~~~~~~~~~-~~~~i~LG~~F~y~I~v--------------------------------~~~~l~V~ 181 (236)
T PF08787_consen 135 EKGSLYVYVRQSNPDGGDQEYT-IYGGIPLGEWFSYEIEV--------------------------------SGGTLTVT 181 (236)
T ss_dssp ETTEEEEEEESSTTTTSEEEEE-EEEEEETT-EEEEEEEE--------------------------------ETTEEEEE
T ss_pred cCCeEEEEEeccCCCCCcEEee-eEcceeCCCEEEEEEEE--------------------------------ECCEEEEE
Confidence 78999999982 12222222 22467778999999999 88899999
Q ss_pred ECCceeee
Q psy5025 448 IDTDYSMQ 455 (533)
Q Consensus 448 vD~~~~~~ 455 (533)
|++.....
T Consensus 182 ing~~~~~ 189 (236)
T PF08787_consen 182 INGEGKTT 189 (236)
T ss_dssp ETTEEEEE
T ss_pred EECCcceE
Confidence 99986655
No 108
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=20.74 E-value=4.2e+02 Score=20.73 Aligned_cols=47 Identities=17% Similarity=0.197 Sum_probs=31.2
Q ss_pred CCCCeEEEEEECCEEEEEEEcCCc----eEEEeeCc--eeecCCCeEEEEEEE
Q psy5025 361 QSKDFLALGIHNGYAHVMYNLGNG----EVLLENNQ--TKINNGVWHRLTVHR 407 (533)
Q Consensus 361 ~~~~~l~l~l~~g~l~~~~~~g~~----~~~~~~~~--~~~~dg~wH~v~~~~ 407 (533)
..+.|-.|.+..|.|.+....+.+ ...+...+ .-+.-..||+|...-
T Consensus 22 K~GtWg~l~Vl~G~L~f~~~~~~~~~~~~~~~~~~~~~~~i~Pq~wH~V~p~s 74 (82)
T PF09313_consen 22 KAGTWGKLRVLEGELKFYGLDEEGEEPEEEVFIPAGQPPVIEPQQWHRVEPLS 74 (82)
T ss_dssp STTEEEEEEEEESEEEEEEESSTT-SESEEEEEETTEEEEE-TT-EEEEEESS
T ss_pred CCCeEEEEEEEeeEEEEEEECCCCCceeEEEEeCCCCCceeCCCceEEEEECC
Confidence 347899999999999999876642 22222222 348899999998644
No 109
>KOG0994|consensus
Probab=20.16 E-value=1e+02 Score=35.96 Aligned_cols=38 Identities=26% Similarity=0.716 Sum_probs=29.5
Q ss_pred CCCCCCCCCCCCC--------eeeeCC--CCceecCCCCCCCCCcccc
Q psy5025 44 EDKCNKTPCEHGG--------KCITTP--GSSYCLCPLGFTGDLCQTR 81 (533)
Q Consensus 44 ~d~C~~~pC~Ngg--------tC~~~~--~~y~C~C~~g~~G~~Ce~~ 81 (533)
...|.+-||+-|- +|.... ....|+|..||.|..|+.-
T Consensus 903 g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C 950 (1758)
T KOG0994|consen 903 GIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC 950 (1758)
T ss_pred CCCCCCCCCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence 4568888897653 555443 6788999999999999976
Done!