Query psy7014
Match_columns 500
No_of_seqs 256 out of 2648
Neff 7.3
Searched_HMMs 46136
Date Sat Aug 17 00:36:32 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7014.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7014hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3514|consensus 100.0 3.6E-37 7.8E-42 331.7 24.6 344 2-491 356-729 (1591)
2 KOG1219|consensus 100.0 2.8E-35 6.2E-40 331.1 25.6 314 23-422 3645-3977(4289)
3 KOG4289|consensus 100.0 1.5E-31 3.3E-36 293.5 25.5 293 41-489 1298-1604(2531)
4 KOG3516|consensus 100.0 2.8E-31 6E-36 291.9 25.9 310 39-492 763-1078(1306)
5 KOG3514|consensus 100.0 1.6E-30 3.5E-35 280.7 24.2 314 38-496 807-1133(1591)
6 KOG3516|consensus 100.0 3.3E-27 7.2E-32 259.9 29.9 208 9-297 123-338 (1306)
7 PF00054 Laminin_G_1: Laminin 99.9 4.4E-23 9.6E-28 184.0 14.8 130 97-298 1-131 (131)
8 cd00110 LamG Laminin G domain; 99.9 3.1E-20 6.8E-25 167.9 18.0 149 67-293 3-151 (151)
9 smart00282 LamG Laminin G doma 99.8 3.8E-20 8.3E-25 165.1 17.1 134 90-295 2-135 (135)
10 KOG4289|consensus 99.8 6.6E-20 1.4E-24 202.9 18.3 226 41-360 1519-1762(2531)
11 PF02210 Laminin_G_2: Laminin 99.8 5.9E-18 1.3E-22 148.2 14.0 127 97-295 1-128 (128)
12 KOG1219|consensus 99.3 6.6E-12 1.4E-16 145.0 12.6 108 286-395 3867-3980(4289)
13 KOG3509|consensus 99.1 2.3E-10 4.9E-15 128.5 11.7 166 216-391 312-478 (964)
14 PF00054 Laminin_G_1: Laminin 98.4 4.3E-07 9.3E-12 80.9 5.9 42 454-495 1-42 (131)
15 cd00110 LamG Laminin G domain; 98.3 1.8E-06 3.9E-11 77.6 8.1 46 446-491 20-65 (151)
16 smart00210 TSPN Thrombospondin 98.3 1.7E-05 3.8E-10 74.8 14.9 130 87-293 50-181 (184)
17 smart00282 LamG Laminin G doma 98.1 8.7E-06 1.9E-10 72.3 8.1 47 448-494 3-49 (135)
18 PF00008 EGF: EGF-like domain 97.8 8E-06 1.7E-10 54.5 1.8 26 364-389 5-31 (32)
19 PF00008 EGF: EGF-like domain 97.8 1.3E-05 2.7E-10 53.5 1.6 31 320-350 1-32 (32)
20 PF13385 Laminin_G_3: Concanav 97.7 0.00067 1.5E-08 60.2 12.3 147 66-296 3-150 (157)
21 PF02210 Laminin_G_2: Laminin 97.6 6.3E-05 1.4E-09 65.2 4.3 40 454-493 1-40 (128)
22 smart00179 EGF_CA Calcium-bind 97.6 8.8E-05 1.9E-09 51.0 4.0 37 355-392 2-39 (39)
23 PF07645 EGF_CA: Calcium-bindi 97.4 0.00011 2.5E-09 52.0 2.3 34 354-387 1-34 (42)
24 cd00054 EGF_CA Calcium-binding 97.2 0.00043 9.3E-09 46.9 3.8 36 356-392 3-38 (38)
25 cd00152 PTX Pentraxins are pla 97.1 0.017 3.7E-07 55.2 14.5 76 216-297 88-165 (201)
26 smart00159 PTX Pentraxin / C-r 97.0 0.016 3.4E-07 55.7 13.9 78 214-297 86-165 (206)
27 smart00179 EGF_CA Calcium-bind 97.0 0.00077 1.7E-08 46.2 3.5 35 318-352 3-39 (39)
28 cd00053 EGF Epidermal growth f 96.9 0.0016 3.4E-08 43.3 3.9 30 363-392 6-36 (36)
29 PF02973 Sialidase: Sialidase, 96.8 0.035 7.5E-07 52.5 13.9 140 89-297 33-177 (190)
30 cd00054 EGF_CA Calcium-binding 96.8 0.0014 3.1E-08 44.2 3.4 35 318-352 3-38 (38)
31 smart00181 EGF Epidermal growt 96.7 0.0023 5.1E-08 42.9 3.8 28 364-392 7-35 (35)
32 KOG1225|consensus 96.6 0.0033 7.2E-08 67.9 5.9 75 306-394 269-343 (525)
33 KOG1214|consensus 96.5 0.016 3.4E-07 64.4 10.7 102 317-418 734-858 (1289)
34 cd00053 EGF Epidermal growth f 96.5 0.0035 7.7E-08 41.5 3.5 32 320-351 2-35 (36)
35 KOG1214|consensus 96.4 0.0045 9.8E-08 68.5 5.7 57 330-389 800-859 (1289)
36 KOG1217|consensus 96.3 0.0056 1.2E-07 65.0 6.0 78 317-394 271-355 (487)
37 smart00181 EGF Epidermal growt 96.3 0.0045 9.8E-08 41.5 3.3 31 320-351 2-34 (35)
38 KOG4260|consensus 96.0 0.0068 1.5E-07 59.3 4.0 93 322-414 149-301 (350)
39 KOG1225|consensus 96.0 0.013 2.8E-07 63.5 6.5 58 325-395 256-313 (525)
40 KOG1217|consensus 95.9 0.014 3.1E-07 61.9 6.4 76 319-394 128-208 (487)
41 PF00354 Pentaxin: Pentaxin fa 95.3 0.2 4.2E-06 47.8 11.2 78 214-297 80-159 (195)
42 PF12947 EGF_3: EGF domain; I 95.2 0.01 2.2E-07 40.7 1.4 27 363-389 6-32 (36)
43 PF07974 EGF_2: EGF-like domai 95.2 0.026 5.6E-07 37.6 3.4 26 364-391 7-32 (32)
44 PF12661 hEGF: Human growth fa 95.2 0.0066 1.4E-07 32.0 0.4 13 45-57 1-13 (13)
45 PF12661 hEGF: Human growth fa 95.2 0.0064 1.4E-07 32.1 0.3 13 379-391 1-13 (13)
46 PF07645 EGF_CA: Calcium-bindi 94.1 0.043 9.4E-07 38.7 2.5 30 318-347 3-34 (42)
47 PF07974 EGF_2: EGF-like domai 93.7 0.072 1.6E-06 35.4 2.9 27 323-351 6-32 (32)
48 KOG1226|consensus 92.3 0.26 5.7E-06 54.9 6.3 56 339-398 567-626 (783)
49 PHA03099 epidermal growth fact 92.1 0.12 2.6E-06 45.1 2.7 31 364-395 52-84 (139)
50 PF06439 DUF1080: Domain of Un 90.8 0.45 9.7E-06 44.1 5.5 37 210-247 119-155 (185)
51 PHA02887 EGF-like protein; Pro 90.8 0.19 4E-06 43.3 2.5 30 364-394 93-124 (126)
52 smart00051 DSL delta serrate l 90.8 0.33 7.1E-06 37.6 3.7 47 338-391 17-63 (63)
53 smart00560 LamGL LamG-like jel 90.8 0.71 1.5E-05 40.9 6.5 67 218-296 61-129 (133)
54 KOG3509|consensus 90.3 3.5 7.5E-05 48.0 12.8 170 218-395 654-844 (964)
55 KOG1226|consensus 89.4 0.76 1.6E-05 51.4 6.5 43 348-394 539-582 (783)
56 PF12947 EGF_3: EGF domain; I 88.7 0.25 5.4E-06 33.8 1.4 27 323-349 6-32 (36)
57 PF14670 FXa_inhibition: Coagu 88.5 0.34 7.5E-06 33.1 2.0 18 370-387 11-28 (36)
58 KOG4260|consensus 87.8 0.51 1.1E-05 46.6 3.4 48 342-393 132-183 (350)
59 cd01475 vWA_Matrilin VWA_Matri 87.5 0.53 1.1E-05 45.6 3.5 32 355-388 187-218 (224)
60 PHA03099 epidermal growth fact 86.5 0.45 9.7E-06 41.7 2.0 31 323-354 51-83 (139)
61 PF12662 cEGF: Complement Clr- 85.6 0.75 1.6E-05 28.5 2.1 11 377-387 1-11 (24)
62 PHA02887 EGF-like protein; Pro 85.4 0.65 1.4E-05 40.0 2.4 22 39-60 103-124 (126)
63 PF01414 DSL: Delta serrate li 84.7 0.3 6.5E-06 37.8 0.1 41 338-391 17-63 (63)
64 KOG1836|consensus 75.8 1.4 3E-05 54.3 1.7 80 212-301 1611-1690(1705)
65 KOG3546|consensus 74.4 11 0.00025 41.5 7.8 66 217-293 156-223 (1167)
66 PF14670 FXa_inhibition: Coagu 72.4 2.3 4.9E-05 29.1 1.4 18 330-347 11-28 (36)
67 KOG1834|consensus 70.4 75 0.0016 35.6 12.8 147 87-296 364-518 (952)
68 KOG1836|consensus 67.8 4.7 0.0001 50.0 3.6 52 341-394 760-814 (1705)
69 PF12955 DUF3844: Domain of un 65.4 3.9 8.4E-05 34.8 1.7 39 358-396 8-64 (103)
70 PF00954 S_locus_glycop: S-loc 64.7 6.9 0.00015 33.4 3.2 31 356-388 78-108 (110)
71 KOG0994|consensus 63.8 6.5 0.00014 46.1 3.5 58 335-394 882-950 (1758)
72 PF13385 Laminin_G_3: Concanav 60.7 19 0.00042 31.1 5.5 46 445-492 21-70 (157)
73 PF06247 Plasmod_Pvs28: Plasmo 58.0 5.8 0.00012 37.3 1.5 63 323-387 6-79 (197)
74 PF12946 EGF_MSP1_1: MSP1 EGF 56.3 5.7 0.00012 27.3 0.9 24 364-387 6-30 (37)
75 PF12946 EGF_MSP1_1: MSP1 EGF 56.1 6.2 0.00013 27.1 1.1 28 320-347 2-30 (37)
76 PF00053 Laminin_EGF: Laminin 52.9 9.2 0.0002 27.6 1.6 22 370-393 12-33 (49)
77 PF04863 EGF_alliinase: Alliin 51.5 3.7 8E-05 30.7 -0.6 22 40-61 32-53 (56)
78 cd00055 EGF_Lam Laminin-type e 45.1 21 0.00044 25.9 2.5 16 378-393 19-34 (50)
79 PF02973 Sialidase: Sialidase, 44.5 95 0.0021 29.5 7.4 49 445-493 32-83 (190)
80 cd06899 lectin_legume_LecRK_Ar 43.5 2.7E+02 0.0059 27.1 10.9 26 214-240 159-186 (236)
81 cd01475 vWA_Matrilin VWA_Matri 43.1 21 0.00044 34.4 2.9 34 313-348 182-218 (224)
82 smart00210 TSPN Thrombospondin 41.9 70 0.0015 29.8 6.2 45 446-490 52-97 (184)
83 PF00139 Lectin_legB: Legume l 39.8 2.4E+02 0.0051 27.3 9.8 29 211-240 160-190 (236)
84 PF14099 Polysacc_lyase: Polys 38.9 1.2E+02 0.0026 28.9 7.5 22 120-141 112-133 (224)
85 cd01951 lectin_L-type legume l 38.7 3.4E+02 0.0074 25.8 10.7 23 218-241 154-178 (223)
86 PF04863 EGF_alliinase: Alliin 32.4 21 0.00045 26.8 0.7 33 323-355 17-53 (56)
87 PF01683 EB: EB module; Inter 32.3 62 0.0013 23.4 3.3 20 364-387 27-46 (52)
88 smart00180 EGF_Lam Laminin-typ 29.1 51 0.0011 23.5 2.3 18 43-60 17-34 (46)
89 PF12955 DUF3844: Domain of un 28.9 42 0.00091 28.6 2.1 23 322-344 12-39 (103)
90 PF11250 DUF3049: Protein of u 24.5 2E+02 0.0044 21.7 4.9 38 451-488 17-55 (56)
91 PF14607 GxDLY: N-terminus of 23.8 2.4E+02 0.0051 25.7 6.1 13 216-229 91-103 (147)
92 KOG1218|consensus 23.3 1E+02 0.0023 30.7 4.3 56 338-394 162-223 (316)
93 KOG1218|consensus 22.8 1.7E+02 0.0037 29.2 5.7 53 339-396 125-180 (316)
94 cd00152 PTX Pentraxins are pla 22.1 2.5E+02 0.0054 26.4 6.4 46 444-489 29-77 (201)
95 PF07622 DUF1583: Protein of u 21.9 4.9E+02 0.011 27.6 8.7 33 213-246 85-117 (399)
No 1
>KOG3514|consensus
Probab=100.00 E-value=3.6e-37 Score=331.75 Aligned_cols=344 Identities=21% Similarity=0.367 Sum_probs=246.6
Q ss_pred cccccccCCCCCCCCccee--cc--ccccc-cCCeeeeecccc-----cCCCc-eeecCCC--------CCCC---CCCC
Q psy7014 2 YAHHLQSCPELTNPDNIKI--LG--RHRQE-ENDRIVNFQHYF-----DTNQP-INQLLSI--------IFTN---FLPP 59 (500)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~--~~--~~~~~-~~~~~~~~~~~~-----~~~~~-~c~c~~~--------~~G~---~C~~ 59 (500)
|.+|+|++||||++.|... +| .|... +..| |.|-.|. +++.. +-.-... ..|. .|+.
T Consensus 356 ~t~~~~~a~~~tmlsss~~fyvgg~~~~~~l~gsr-VsF~GClkkV~y~~d~~rl~L~~LAk~g~~~~k~~G~l~y~C~n 434 (1591)
T KOG3514|consen 356 RTEIRQYAPELTMLSSSDFFYVGGSPNTADLPGSR-VSFMGCLKKVVYKNDDTRLELSRLAKQGDSKMKTEGDLSYSCEN 434 (1591)
T ss_pred EecccccccceeEeeccceEEecCCCCccccCCCc-eeeeeeeeeeEeccCceeehhhHHhhcCCceeEeeceEEEecCC
Confidence 7899999999999999984 44 33333 3334 3577762 33321 1111111 2222 4888
Q ss_pred CCccccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEc
Q psy7014 60 DIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNL 139 (500)
Q Consensus 60 ~~~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~ 139 (500)
.......+|... .||+.+|.+. .....+|+|.|||+.++ |||||++.. .....||+|+||.||++.+.+++
T Consensus 435 ~~~~DpvtFtt~----es~l~LP~Wn-t~~~gSiSf~FRTtepn--Glil~~~g~--~~~~~d~~A~ELldghlyl~ldl 505 (1591)
T KOG3514|consen 435 VAQLDPVTFTTP----ESYLTLPRWN-TKKSGSISFDFRTTEPN--GLILFHGGP--QANATDYFAIELLDGHLYLLLDL 505 (1591)
T ss_pred CCccCceeeecc----cceeeccccc-cCCcceeEEEEeecCCC--ceEEEccCc--ccccccEEEEEEeCCeEEEEEec
Confidence 776666789876 8999999964 56789999999988777 999999752 45778999999999999999999
Q ss_pred CCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCC
Q psy7014 140 GSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKR 219 (500)
Q Consensus 140 G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~ 219 (500)
|+|... | +. ...++|||.
T Consensus 506 GSG~ik-------------l------------------------------------------------ra-s~rkv~DGe 523 (1591)
T KOG3514|consen 506 GSGVIK-------------L------------------------------------------------RA-SSRKVNDGE 523 (1591)
T ss_pred CCceEE-------------e------------------------------------------------ee-ecccccCCc
Confidence 997542 2 21 122669999
Q ss_pred ccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCC---cCCCCCccccccceeecccc
Q psy7014 220 GGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHD---LPLHSGFSGCIFDVELSAGN 296 (500)
Q Consensus 220 WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~---~~~~~gF~GCIr~v~ing~~ 296 (500)
| |+|.+.|+++.+.++||... .....||....|+++.++|+|-.+ +....|.+ +..+.||+||||++.|+|..
T Consensus 524 W-hhv~l~R~gR~gsvsVd~~~-~df~tpG~s~iL~ld~~mylG~~~--n~l~~P~~vWta~L~~GyvGCirdl~i~G~s 599 (1591)
T KOG3514|consen 524 W-HHVDLQRDGRTGSVSVDAIK-TDFSTPGDSEILDLDDPMYLGEVP--NNLVYPSEVWTAALRKGYVGCIRDLFIDGVS 599 (1591)
T ss_pred e-EEEEeeccCccceEEEeeee-cCccCCCcceeEeecCceeeccCC--CCccCcHHHHHHHHhccchheehhheeccee
Confidence 9 99999999999999999976 567778999999999999999553 33344433 45778999999999999999
Q ss_pred cccccccccccCCCCcc-ccCC---ccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEe
Q psy7014 297 VGINLYKTRAAEGRGVG-QCGT---SQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCV 372 (500)
Q Consensus 297 ~~l~~~~~~~~~~~~v~-~C~~---~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~ 372 (500)
.++..... ...+.++. .|.. ..|.++||+|+|+|...|+.
T Consensus 600 ~di~q~ae-~q~sagvkpsCs~~~~~~C~~nPC~N~g~C~egwNr----------------------------------- 643 (1591)
T KOG3514|consen 600 TDIRQEAE-AQNSAGVKPSCSLSNEKICESNPCQNGGKCSEGWNR----------------------------------- 643 (1591)
T ss_pred hhhHHHhh-hccccccCcccchhhccccCCCcccCCCCccccccc-----------------------------------
Confidence 88754321 22333343 3431 34555555555555555544
Q ss_pred eCCCCeeeeCCC-CCCCCCcCCCCCCcCccccCCCceEEecCCCccccccccccccccccccccccCCccccccceeeEE
Q psy7014 373 PLTHSYECDCPP-GRTGKFCEKDESLSDISFSGRRSYISLPSSELHLIINESLSDISFSGRRSYISLPSSELHLHEACID 451 (500)
Q Consensus 373 ~~~~g~~C~C~~-G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~~~~~e~~~~~~f~~~~~~~~~p~~~~~~~~~~i~ 451 (500)
|.|+|.. +|.|+.||.+. ..+.|+|. .|+.+-.+.. . ..+.+.|.
T Consensus 644 -----fiCDCs~T~~~G~~CerE~--t~ls~nGs-~~m~i~L~~~-------------------------~-~tq~E~v~ 689 (1591)
T KOG3514|consen 644 -----FICDCSGTGFEGRTCEREA--TALSYNGS-MSMKIVLPHT-------------------------M-HTQAEDVS 689 (1591)
T ss_pred -----cccccccCcccCcccccee--eeEEEcCe-eeEEEEeccc-------------------------c-eeecceEE
Confidence 5555543 56666666543 45789996 5655543311 1 12677999
Q ss_pred EEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcC
Q psy7014 452 LEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLML 491 (500)
Q Consensus 452 l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g 491 (500)
++|||..+-||||-.+.....|-+.|+|.+|+|++.+++.
T Consensus 690 iRF~t~r~~Gll~~Tta~~s~D~l~l~L~~g~vkl~v~ls 729 (1591)
T KOG3514|consen 690 IRFRTQRAYGLLFATTARGSADTLRLELDAGQVKLFVNLS 729 (1591)
T ss_pred EEEEecccceeEEEeccCCCCceEEEEEecceEEEEEecC
Confidence 9999999999999998887899999999999999999976
No 2
>KOG1219|consensus
Probab=100.00 E-value=2.8e-35 Score=331.10 Aligned_cols=314 Identities=21% Similarity=0.354 Sum_probs=250.9
Q ss_pred ccccccCCeeeeecccc---cCCCceeecCCCCCCCCCCCCCccccccccCCCCCCcceEEeecCCCCceEEEEEEEEee
Q psy7014 23 RHRQEENDRIVNFQHYF---DTNQPINQLLSIIFTNFLPPDIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVP 99 (500)
Q Consensus 23 ~~~~~~~~~~~~~~~~~---~~~~~~c~c~~~~~G~~C~~~~~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt 99 (500)
.|.|=+.+++-.-+.|. -+..+.|.||.|..| .|+.+. ..++.| +||.+|....+...++.+.|++||
T Consensus 3645 ~~~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~~---elS~tG-----nSYveyrlse~~n~~~kl~frLkT 3715 (4289)
T KOG1219|consen 3645 ETNQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGDF---ELSSTG-----NSYVEYRLSENQNTRMKLGFRLKT 3715 (4289)
T ss_pred ccCccccCCCcccCcccccccccceeEeccCcccc-cCCCcc---eEeecC-----ceeEEEEcccccccceEEEEEEEe
Confidence 45555666665555443 255789999999665 499874 558888 999999998776566899999988
Q ss_pred CCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhcccccccccccccccccccc
Q psy7014 100 NSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVD 179 (500)
Q Consensus 100 ~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d 179 (500)
.+.+ |++||.. ..|+..|.|.+|.+.+.|+.|+|.+
T Consensus 3716 ~~sn--gIiM~tr-------~~d~~iLkLv~G~~~l~~~cgsG~G----------------------------------- 3751 (4289)
T KOG1219|consen 3716 LQSN--GIIMYTR-------KTDLAILKLVGGSPQLLADCGSGPG----------------------------------- 3751 (4289)
T ss_pred cccC--cEEEEEc-------CCceEEEEecCCcEEEEEecCCCCC-----------------------------------
Confidence 8666 9999995 3499999999999999999999643
Q ss_pred ceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCc
Q psy7014 180 FLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPM 259 (500)
Q Consensus 180 ~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~ 259 (500)
+.+.++..+|||+| |.|.+.|+++.+.|+||+.......+|+....|+++.-
T Consensus 3752 ---------------------------ivg~q~~~VnDgqW-Hsialerrr~~irlsvDd~~~~~atvPg~~~tln~d~h 3803 (4289)
T KOG1219|consen 3752 ---------------------------IVGSQKRTVNDGQW-HSIALERRRNHIRLSVDDDTYDSATVPGMKSTLNLDTH 3803 (4289)
T ss_pred ---------------------------cccccceEeecCce-eEEEeeccCCceEEEEcccCceeeecccceeeccccce
Confidence 22344457799999 99999999999999999999899999999999999999
Q ss_pred eEEcccccccCcCCCCCcCCCCCccccccceeecccccccccccccc---cCCCCcc-ccC--CccCCCCCCCCCCEEee
Q psy7014 260 LYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNVGINLYKTRA---AEGRGVG-QCG--TSQCHNHTCSHGGACMN 333 (500)
Q Consensus 260 lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~---~~~~~v~-~C~--~~~C~~~pC~ngg~Ci~ 333 (500)
||+||.-.. .-.+......||.|||+.+.+||..+++....... ....... .|. .++|..+||+|||+|..
T Consensus 3804 iy~Ga~vrl---r~~~~tqvs~Gf~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~npCqhgG~C~~ 3880 (4289)
T KOG1219|consen 3804 IYLGALVRL---RHQRSTQVSYGFDGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCNDNPCQHGGTCIS 3880 (4289)
T ss_pred EEEeeEeee---ccCCCccccccccceeeeEEEccccccccCCCchhhhhhhhhcccccccccccccccCcccCCCEecC
Confidence 999997420 11122456789999999999999888764322111 2223333 343 38999999999999998
Q ss_pred CC-CceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCC---CCc------Ccccc
Q psy7014 334 HG-ATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDE---SLS------DISFS 403 (500)
Q Consensus 334 ~~-~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~---~~~------~~~F~ 403 (500)
.. +.|.|.|+..|.|.+|+.++.+|.++ ||.+||+|++..++|.|.|+.||+|++||.+. +.. +.|.+
T Consensus 3881 ~~~ggy~CkCpsqysG~~CEi~~epC~sn--PC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n 3958 (4289)
T KOG1219|consen 3881 QPKGGYKCKCPSQYSGNHCEIDLEPCASN--PCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCIN 3958 (4289)
T ss_pred CCCCceEEeCcccccCcccccccccccCC--CCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCCceeec
Confidence 75 78999999999999999999999996 89999999999999999999999999999883 221 23444
Q ss_pred CCCceEEecCCCccccccc
Q psy7014 404 GRRSYISLPSSELHLIINE 422 (500)
Q Consensus 404 g~~sy~~~~~~~~~~~~~e 422 (500)
-.++|.|-+.+++.+..|+
T Consensus 3959 ~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3959 IPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred cCCceEeccChhHhcccCc
Confidence 4578999999888866554
No 3
>KOG4289|consensus
Probab=100.00 E-value=1.5e-31 Score=293.50 Aligned_cols=293 Identities=23% Similarity=0.324 Sum_probs=217.6
Q ss_pred CCCceeecCCC-CCCCCCCCCCccccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCC
Q psy7014 41 TNQPINQLLSI-IFTNFLPPDIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDA 119 (500)
Q Consensus 41 ~~~~~c~c~~~-~~G~~C~~~~~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~ 119 (500)
++...|+||.| |++++|+-. +.+|.+ .||+.|..... +..+.++|+|-|. ..+|||+|+|+ .
T Consensus 1298 nggf~c~Cp~ge~e~prC~v~----trSFp~-----~sfv~frglrq-Rfh~TlslsfaT~--~~nGlL~ynGn-----e 1360 (2531)
T KOG4289|consen 1298 NGGFCCHCPYGEFEDPRCEVT----TRSFPP-----ESFVTFRGLRQ-RFHFTLSLSFATI--ERNGLLLYNGN-----E 1360 (2531)
T ss_pred CCceeccCCCcccCCCceEEE----eeccCc-----hheEEEecccc-ceEEEEEEEEEEe--eecceEEecCC-----c
Confidence 56788999998 888999974 578998 89999997643 4566677777554 55599999994 5
Q ss_pred CCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCccccc
Q psy7014 120 ITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVD 199 (500)
Q Consensus 120 ~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~ 199 (500)
..||++|+++++.|+++|.+|.-
T Consensus 1361 khDFvalevVd~qvqltfS~Ges--------------------------------------------------------- 1383 (2531)
T KOG4289|consen 1361 KHDFVALEVVDEQVQLTFSAGES--------------------------------------------------------- 1383 (2531)
T ss_pred ccceEeeeeeeeeEEEEEecccc---------------------------------------------------------
Confidence 67999999999999999999962
Q ss_pred ccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCccee-------------eeCCCccccccCCCceEEcccc
Q psy7014 200 TYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVT-------------SKSPGRLTQLNTKPMLYLGGHF 266 (500)
Q Consensus 200 ~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~-------------~~~~~~~~~L~~~~~lyIGG~~ 266 (500)
..++....+-.++||+| |+|.+++.++.++++||++.... +...+....|++..+|++||+|
T Consensus 1384 ----~t~v~p~Vp~gvsDGqW-HtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvP 1458 (2531)
T KOG4289|consen 1384 ----TTTVSPDVPGGVSDGQW-HTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVP 1458 (2531)
T ss_pred ----cceecCCCCCCcccCce-eEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCC
Confidence 12233344446789999 99999999999999999976421 1223455679999999999998
Q ss_pred cccCcCCCCCcCCCCCccccccceeecccccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCC
Q psy7014 267 SKNFSILPHDLPLHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGW 346 (500)
Q Consensus 267 ~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy 346 (500)
+.. ......|.|||+++.++++.+++..+... .+ ...
T Consensus 1459 e~f-------pv~~k~FvGCmrdLsvD~~~VDma~fian--ng-t~e--------------------------------- 1495 (2531)
T KOG4289|consen 1459 ETF-------PVIEKQFVGCMRDLSVDGRDVDMATFIAN--NG-THE--------------------------------- 1495 (2531)
T ss_pred Ccc-------hhhHhHhhhhhhhcccccccccHHHHHhh--cC-ccc---------------------------------
Confidence 421 12345799999999999999988654321 11 112
Q ss_pred CCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCCCCcCccccCCCceEEecCCCccccccccccc
Q psy7014 347 FGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDESLSDISFSGRRSYISLPSSELHLIINESLSD 426 (500)
Q Consensus 347 ~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~~~~~e~~~~ 426 (500)
.|....+.|.+. +|.|+|+|+++|++|.|.||.+|.|+.|+..... .-.|.|. |-+.+...
T Consensus 1496 ---GC~ark~fCdsg--~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~-pq~frG~-sl~sw~~~------------ 1556 (2531)
T KOG4289|consen 1496 ---GCKARKNFCDSG--QCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAH-PQHFRGH-SLVSWEGL------------ 1556 (2531)
T ss_pred ---CchhhhcccCCC--ccCCCCeeecccCcEeecCccccCCcchhhccCC-chhcccc-ceeeecCC------------
Confidence 244455667775 6999999999999999999999999999876432 2357774 65554311
Q ss_pred cccccccccccCCccccccceeeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEE
Q psy7014 427 ISFSGRRSYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVL 489 (500)
Q Consensus 427 ~~f~~~~~~~~~p~~~~~~~~~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~ 489 (500)
++-++ ....++|+|||++.+|+||-....+ ..-+.|+|.+|+|++.+.
T Consensus 1557 ------~~~vS--------vPwylsl~FRTr~ad~vl~~~~~~~-rst~~lqld~g~l~~~v~ 1604 (2531)
T KOG4289|consen 1557 ------PSQVS--------VPWYLSLMFRTRRADGVLMQAEFGG-RSTYNLQLDDGTLKYNVG 1604 (2531)
T ss_pred ------Cccee--------cceEEEEEEEeeccccEEEEEEeCC-CceEEEEEcCCEEEEEec
Confidence 11111 3468999999999999999775443 345999999999998764
No 4
>KOG3516|consensus
Probab=100.00 E-value=2.8e-31 Score=291.87 Aligned_cols=310 Identities=20% Similarity=0.285 Sum_probs=231.1
Q ss_pred ccCCCceeecCCCCCCCCCCCCCc-cccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCC
Q psy7014 39 FDTNQPINQLLSIIFTNFLPPDIE-IGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQH 117 (500)
Q Consensus 39 ~~~~~~~c~c~~~~~G~~C~~~~~-~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~ 117 (500)
+|++...|+-+....+-+|+.+.. ...++|.+. .||+.|+...+ ..+.+|+|.|||+.++ |++|.+-
T Consensus 763 gdTg~~~sea~~~lgPLrC~gDr~~wnsvSF~~~----~syL~fp~f~~-~~saDIsf~FrTt~~~--gvflen~----- 830 (1306)
T KOG3516|consen 763 GDTGRSQSEAPYVLGPLRCEGDRNFWNSVSFHTG----ASYLHFPPFHN-ELSADISFFFRTTASS--GVFLENH----- 830 (1306)
T ss_pred ccCCCcccccceeecceEeecccccccceEeecC----cceeecCcccC-cccccEEEEEEecCCc--eEeeecc-----
Confidence 566665555566677889999765 466789874 78999999876 4789999999999777 9999884
Q ss_pred CCCCCeEEEEEECCE-EEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcc
Q psy7014 118 DAITDHLAVSFIKGY-VVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHM 196 (500)
Q Consensus 118 ~~~~df~~l~l~~G~-l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~ 196 (500)
+..||+.|+|..+. |.|.++.|+|+.
T Consensus 831 -g~~dfir~eL~~~~~vtf~~dvgnGp~---------------------------------------------------- 857 (1306)
T KOG3516|consen 831 -GINDFIRLELSSPVEVTFAFDVGNGPS---------------------------------------------------- 857 (1306)
T ss_pred -CCCceEEEEEcCCCceEEEEEcCCCce----------------------------------------------------
Confidence 46799999998764 999999999653
Q ss_pred cccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCC-CccccccCCCceEEcccccccCcCCCC
Q psy7014 197 FVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSP-GRLTQLNTKPMLYLGGHFSKNFSILPH 275 (500)
Q Consensus 197 ~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~-~~~~~L~~~~~lyIGG~~~~~~~~~~~ 275 (500)
.+....+..+||++| |+|+++|+.+.+.|+||+.......+| .....|.+...+||||...
T Consensus 858 ---------~~~V~s~t~~nD~qW-H~V~~Ern~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vGgt~~-------- 919 (1306)
T KOG3516|consen 858 ---------QLTVRSPTELNDNQW-HQVRAERNSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVGGTVS-------- 919 (1306)
T ss_pred ---------eEEEcCCcccCCCce-EEEEEEeccccceEEEcCcccceecCCCCCEEEEEeccceecccccc--------
Confidence 333334457899999 999999999999999999987776665 4456788999999999632
Q ss_pred CcCCCCCccccccceeecccccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCCCCccccccc
Q psy7014 276 DLPLHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRY 355 (500)
Q Consensus 276 ~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i 355 (500)
-+.||.||||.+.|||..++|. .++....++..-..
T Consensus 920 ---~~~gF~GCIRsl~LNGv~ldLe---~ra~~~~gv~~GC~-------------------------------------- 955 (1306)
T KOG3516|consen 920 ---RQRGFLGCIRSLQLNGVMLDLE---YRAYGTAGVSPGCE-------------------------------------- 955 (1306)
T ss_pred ---CcCcceeeeeeeeecceeeeeh---hhhccCCcccCCCc--------------------------------------
Confidence 3459999999999999998883 22222223321111
Q ss_pred ccccCCCccCCCCCEEeeCCCCeeeeCCC-CCCCCCcCCCCCCcCccccCCCceEEecCCCcc-cccccccccccccccc
Q psy7014 356 NLCDSTRHNCSFGATCVPLTHSYECDCPP-GRTGKFCEKDESLSDISFSGRRSYISLPSSELH-LIINESLSDISFSGRR 433 (500)
Q Consensus 356 ~~C~~~p~pC~ngg~C~~~~~g~~C~C~~-G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~-~~~~e~~~~~~f~~~~ 433 (500)
-.|.+. ||.|||+|+..+.+|.|+|.. .|.|+.|.++.. +.| .+++++.|...+.. ..+++....-++.
T Consensus 956 GhCss~--~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~eig---~~f-e~gs~i~y~fq~~~~~a~~~~~~~~~~~--- 1026 (1306)
T KOG3516|consen 956 GHCSSY--PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEIG---VFF-ERGSSIRYNFQKPMRSAVFESSRVKQKL--- 1026 (1306)
T ss_pred cccccc--cccCCCEEEEecCceeeccccCcCCCCccccccc---eEe-cCCceEEEeccchHHHhhhhhhhhhhcc---
Confidence 234443 688888888888889999976 699999988753 334 45799988754322 1122211111111
Q ss_pred ccccCCccccccceeeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEE-CCEEEEEEEcCC
Q psy7014 434 SYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQ-GGVLELRVLMLG 492 (500)
Q Consensus 434 ~~~~~p~~~~~~~~~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~-~G~l~~~~~~g~ 492 (500)
.........|.|.|+|+.+.++|+|+++- ..||+++-|+ +|.|+++|.+|.
T Consensus 1027 -------~~~~~~~e~i~~sftTt~~ps~LLfvssF-~~~y~~V~v~~nGsLq~ry~lg~ 1078 (1306)
T KOG3516|consen 1027 -------EIEINPNEEINFSFTTTRAPSDLLFVSSF-TDDYLAVLVKDNGSLQTRYMLGF 1078 (1306)
T ss_pred -------ccccCccceEEEEEEeccCceEEEEeecc-ccceEEEEEeCCCceEEEEecCC
Confidence 11233567999999999999999999887 4899999999 799999999998
No 5
>KOG3514|consensus
Probab=99.97 E-value=1.6e-30 Score=280.66 Aligned_cols=314 Identities=20% Similarity=0.311 Sum_probs=231.7
Q ss_pred cccCCCceeecCCCCCCCCCCCCC-------ccccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEE
Q psy7014 38 YFDTNQPINQLLSIIFTNFLPPDI-------EIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAF 110 (500)
Q Consensus 38 ~~~~~~~~c~c~~~~~G~~C~~~~-------~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly 110 (500)
.|++..++-+|+++-- .+|+... -...+.|... +||+.+...+. +.+++|.|+|||++++ |||+|
T Consensus 807 vFNG~~Yld~~K~~~~-~ls~l~a~fkl~~iv~~paTf~sk----~Sy~~la~L~a-y~s~~l~Fqfkt~sp~--gll~f 878 (1591)
T KOG3514|consen 807 VFNGQDYLDKCKMGDI-QLSELSARFKLRAIVADPATFKSK----SSYVKLATLQA-YFSMHLFFQFKTTSPD--GLLLF 878 (1591)
T ss_pred EECcHHHHHHHhcCCc-chhhcchhhCceEEeeccceeeec----hhhhhhhhhhe-eeEEEEEEEEeecCCC--eEEEe
Confidence 3777777777776632 3455431 1233457664 79999988764 6889999999999888 99999
Q ss_pred ecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeeccccc
Q psy7014 111 IGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYL 190 (500)
Q Consensus 111 ~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 190 (500)
.+. ..+||++|||++|+|+++|++|+|+
T Consensus 879 n~g-----d~ndfi~velvnG~ihYtfdlg~gp----------------------------------------------- 906 (1591)
T KOG3514|consen 879 NSG-----DGNDFIAVELVNGYIHYTFDLGNGP----------------------------------------------- 906 (1591)
T ss_pred cCC-----CCCceEEEEEeCcEEEEEEEcCCCc-----------------------------------------------
Confidence 974 4679999999999999999999964
Q ss_pred CCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCc-EEEEEEcCCcceeeeCCCccccccCCCceEEccccccc
Q psy7014 191 QPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQ-QCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKN 269 (500)
Q Consensus 191 ~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~-~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~ 269 (500)
..++-+....+||++| |.|.|.|++. .-+|.||..... ....+ ...|++.+.|||||+..++
T Consensus 907 --------------~~~k~~sr~hlnDnrW-HnV~I~rd~~~~HtL~vD~s~~t-~~~~g-~~~l~l~g~LyiGGv~k~m 969 (1591)
T KOG3514|consen 907 --------------TSMKGPSRQHLNDNRW-HNVLIYRDKTNTHTLKVDNSSTT-QIIDG-AVNLDLKGKLYIGGVSKPM 969 (1591)
T ss_pred --------------ccccCcccCcCccccc-eeEEEEcCCCCceEEEecCceEE-EEecC-ccccccccceecccccccc
Confidence 3333344557789999 9999999865 468999998643 33333 6778999999999999888
Q ss_pred CcCCCCCcCCCCCccccccceeecccccccccccccccCCCCc-cccC--CccCCCCCCCCCCEEeeCCCceeecCCCCC
Q psy7014 270 FSILPHDLPLHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGV-GQCG--TSQCHNHTCSHGGACMNHGATFSCLCADGW 346 (500)
Q Consensus 270 ~~~~~~~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v-~~C~--~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy 346 (500)
...++.....+.+|.||...+-+++....+..... .....+ ..|. ...|..+
T Consensus 970 ~~~~p~~~asR~g~~g~~~s~dl~~r~p~L~~~a~--~~s~lv~~~~sgpst~c~~~----------------------- 1024 (1591)
T KOG3514|consen 970 YSFLPKLVASRSGFQGCLASLDLGGRLPDLISDAL--FESGLVEVGCSGPSTTCSED----------------------- 1024 (1591)
T ss_pred cccccceeeccCCCCCCcCccCccccchhHHHHhh--hhccceeeeccCCCcccchh-----------------------
Confidence 88888888889999999999999987665432211 111111 1222 1333333
Q ss_pred CCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCC-CCCCCCcCCCCCCcCccccCCCceEEecCCCcccccccccc
Q psy7014 347 FGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPP-GRTGKFCEKDESLSDISFSGRRSYISLPSSELHLIINESLS 425 (500)
Q Consensus 347 ~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~-G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~~~~~e~~~ 425 (500)
.|.|.|.|+..|.+|.|.|++ .|+|+.|.... ..+.|.+.++.+.|..|..
T Consensus 1025 -----------------acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~g--tTYiFgk~gglI~YtwPpN--------- 1076 (1591)
T KOG3514|consen 1025 -----------------ACANHGVCIQQWNGIACDCSMTSYSGPRCNDPG--TTYIFGKSGGLITYTWPPN--------- 1076 (1591)
T ss_pred -----------------hhhccceeeeeecceeeeccccccCCCccCCCc--eEEEECCCCceEEEecCCC---------
Confidence 466666666666666666665 57777776653 3467888888888765533
Q ss_pred ccccccccccccCCccccccceeeEEEEEeeCCCCcEEEEcCCCC-CCCeEEEEEECCEEEEEEEcCCCCCC
Q psy7014 426 DISFSGRRSYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQ-KNSMMTLSLQGGVLELRVLMLGDRPK 496 (500)
Q Consensus 426 ~~~f~~~~~~~~~p~~~~~~~~~~i~l~frT~~~~GlLl~~~~~~-~~dfi~l~l~~G~l~~~~~~g~~~~~ 496 (500)
+++...+.+|.+.|+|++++|+|+-+.+.. .+||++|+|..|+|-+.||.|.....
T Consensus 1077 ---------------dRpsTr~DrlAvGFsTtq~daVLvRVdSAsglgDYlqLhI~qG~igvvfNiGt~Dit 1133 (1591)
T KOG3514|consen 1077 ---------------DRPSTRKDRLAVGFSTTQPDAVLVRVDSASGLGDYLQLHINQGKIGVVFNIGTDDIT 1133 (1591)
T ss_pred ---------------CCCCcccceEEEEEEeccCceEEEEEeccCCCCceEEEEEeccEEEEEEeccCcccc
Confidence 345557889999999999999999997664 58999999999999999999976543
No 6
>KOG3516|consensus
Probab=99.96 E-value=3.3e-27 Score=259.93 Aligned_cols=208 Identities=18% Similarity=0.198 Sum_probs=150.4
Q ss_pred CCCCCCCCcceeccccccc-cCCeeeeecccccCCCceeecCCCCCCCC-----CCCCCccccccccCCCCCCcceEEee
Q psy7014 9 CPELTNPDNIKILGRHRQE-ENDRIVNFQHYFDTNQPINQLLSIIFTNF-----LPPDIEIGQASYSSSMSGLSSFSAYV 82 (500)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~c~c~~~~~G~~-----C~~~~~~~~~~f~g~~~~~~sy~~~~ 82 (500)
-+.-+.++|+-+-++++.+ +.--+--|-.+ .|+-.||.|..|-+ |.-.. ....|+| .|++.|+
T Consensus 123 ~~~wtf~Gn~n~~sVv~~~l~~~~~ar~vr~----~pl~wnp~grig~rVevygc~y~s--~vi~fdg-----~s~~~yr 191 (1306)
T KOG3516|consen 123 GSSWTFVGNVNADSVVYHELEPPIEARFVRI----LPLDWNPKGRIGMRVEVYGCSYKS--PVIYFDG-----SSSLLYR 191 (1306)
T ss_pred CCccccccccccceEEeccccCcccceEEee----eeeeeCCCCcceeEEEEEeccccC--ceeEECC-----ccceeee
Confidence 4556677777776655444 22111111122 45778898888865 44432 4457999 7888888
Q ss_pred cCCC--CceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhc
Q psy7014 83 IPAN--IHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLR 160 (500)
Q Consensus 83 ~~~~--~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~ 160 (500)
.... ......|+|+|||...+ |+|||.. ...+||+.|+|++|++++.+|+|+- .+.
T Consensus 192 ~~~~~m~s~~d~is~~Fkt~~sd--Gvllh~e-----g~QGd~itlql~~~kl~l~ld~G~~---------------~~~ 249 (1306)
T KOG3516|consen 192 FHRKLMSSLKDVISLKFKTMQSD--GVLLHGE-----GQQGDYITLQLIGGKLVLILDLGNS---------------KLP 249 (1306)
T ss_pred ccccccccccceeEEEEEeeccc--eeEEEcc-----cCCCCEEEEEEeCCEEEEEEecCCc---------------cCc
Confidence 5433 34577899999887666 9999995 2578999999999999999999972 111
Q ss_pred cccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCC
Q psy7014 161 SAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNM 240 (500)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~ 240 (500)
.-+++..|.+.. .++|.+| |.|+|.|.++.+.++||+.
T Consensus 250 ---------------------------------------~s~~~~sis~Gs--lLdD~hW-HsV~i~r~~~~vnftvD~~ 287 (1306)
T KOG3516|consen 250 ---------------------------------------SSRTPTSISAGS--LLDDQHW-HSVRIERQGRQVNFTVDGV 287 (1306)
T ss_pred ---------------------------------------cccCcceeeccc--ccCCCcc-eEEEEEecCcEEEEEEccc
Confidence 112445554444 3478899 9999999999999999997
Q ss_pred cceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeeccccc
Q psy7014 241 GNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNV 297 (500)
Q Consensus 241 ~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~ 297 (500)
. ......|.++.|+++..+++||+|.+..+ ......|.|||.+|.+|+..+
T Consensus 288 ~-~~fr~~Ge~~~Ldld~e~~~GGiP~~~~~-----~~~~~nF~GCienly~N~vdi 338 (1306)
T KOG3516|consen 288 V-HHFRATGEFDALDLDTEISFGGIPNDGKS-----VGFEKNFTGCLENLYYNGVDI 338 (1306)
T ss_pred e-EeecccCccceeecceEEEECCccCCCcc-----cceeeeeeeeeeeeeecCcee
Confidence 6 45777899999999999999999875543 223478999999999997554
No 7
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation. The structure of the laminin-G domain has been predicted to resemble that of pentraxin. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.90 E-value=4.4e-23 Score=184.02 Aligned_cols=130 Identities=33% Similarity=0.549 Sum_probs=103.4
Q ss_pred EeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccc
Q psy7014 97 FVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLIL 176 (500)
Q Consensus 97 Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 176 (500)
|||..++ |||||.+++ ...||++|+|.+|+|+|+|++|+|
T Consensus 1 frT~~~~--Gllly~g~~----~~~dfial~L~~G~l~~~~~~G~~---------------------------------- 40 (131)
T PF00054_consen 1 FRTSEPN--GLLLYLGSK----DGKDFIALELRDGRLEFRYNLGSG---------------------------------- 40 (131)
T ss_dssp EEESSSS--EEEEEEESS----TTSSEEEEEEETTEEEEEEESSSE----------------------------------
T ss_pred CccCCCC--ceEEECCcC----CCCCEEEEEEECCEEEEEEeCCCc----------------------------------
Confidence 7888777 999999864 344999999999999999999994
Q ss_pred cccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCcccc-cc
Q psy7014 177 GVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQ-LN 255 (500)
Q Consensus 177 ~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~-L~ 255 (500)
+..+.+..+ ++||+| |+|.+.|.++.+.|+||+...+...++..... ++
T Consensus 41 ---------------------------~~~~~~~~~--i~dg~w-h~v~~~r~~~~~~L~Vd~~~~~~~~s~~~~~~~l~ 90 (131)
T PF00054_consen 41 ---------------------------PASLRSPQK--INDGKW-HTVSVSRNGRNGSLSVDGEEVVTGESPSGATQSLD 90 (131)
T ss_dssp ---------------------------EEEEEESSE--TTSSSE-EEEEEEEETTEEEEEETTSEEEEEEECSSSSSSCE
T ss_pred ---------------------------cceecCCCc--cCCCcc-eEEEEEEcCcEEEEEECCccceeeecCCccccccc
Confidence 334444554 689999 99999999999999999988766777755555 88
Q ss_pred CCCceEEcccccccCcCCCCCcCCCCCccccccceeecccccc
Q psy7014 256 TKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNVG 298 (500)
Q Consensus 256 ~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~~ 298 (500)
...+|||||+|.. ...+.......+|.|||+++.+|++.++
T Consensus 91 ~~~~lyvGG~p~~--~~~~~~~~~~~~f~GCi~~~~in~~~ld 131 (131)
T PF00054_consen 91 VDGPLYVGGLPSS--SSRPRPLPISPGFKGCIRNLSINGKPLD 131 (131)
T ss_dssp ECSEEEESSSSTT--TGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred cccCEEEccCCch--hhcccccccCCCeeEEEEEeEECCEECc
Confidence 8999999999822 2223345567799999999999987653
No 8
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.85 E-value=3.1e-20 Score=167.90 Aligned_cols=149 Identities=32% Similarity=0.456 Sum_probs=116.0
Q ss_pred cccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEE
Q psy7014 67 SYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLV 146 (500)
Q Consensus 67 ~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~ 146 (500)
+|.| +||+.|+.+......+.|+|+|||+.++ |+|||.+.. ...+|++|+|.+|++++.++.|.+
T Consensus 3 ~F~g-----~~~i~~~~~~~~~~~~~i~~~frt~~~~--g~l~~~~~~----~~~~~~~l~l~~g~l~~~~~~g~~---- 67 (151)
T cd00110 3 SFSG-----SSYVRLPTLPAPRTRLSISFSFRTTSPN--GLLLYAGSQ----NGGDFLALELEDGRLVLRYDLGSG---- 67 (151)
T ss_pred EeCC-----CceEEecCCCCCcceeEEEEEEEeCCCC--eEEEEecCC----CCCCEEEEEEECCEEEEEEcCCcc----
Confidence 6777 7999999876546789999999998776 999999853 257999999999999999999852
Q ss_pred EeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEE
Q psy7014 147 YFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRV 226 (500)
Q Consensus 147 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v 226 (500)
...+.+.. .++||+| |+|.+
T Consensus 68 ---------------------------------------------------------~~~~~~~~--~v~dg~W-h~v~i 87 (151)
T cd00110 68 ---------------------------------------------------------SLVLSSKT--PLNDGQW-HSVSV 87 (151)
T ss_pred ---------------------------------------------------------cEEEEccC--ccCCCCE-EEEEE
Confidence 22333333 5789999 99999
Q ss_pred EEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014 227 GKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS 293 (500)
Q Consensus 227 ~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in 293 (500)
.+.++.+.|.||+........+.....++...++||||.|..... .......+|.|||+++++|
T Consensus 88 ~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~---~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 88 ERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKS---PGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred EECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhc---ccccccCCCceEeeEeEeC
Confidence 999999999999985444444433335677889999999753321 1234567999999999986
No 9
>smart00282 LamG Laminin G domain.
Probab=99.85 E-value=3.8e-20 Score=165.12 Aligned_cols=134 Identities=32% Similarity=0.469 Sum_probs=106.8
Q ss_pred EEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhcccccccccc
Q psy7014 90 CFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCC 169 (500)
Q Consensus 90 ~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~ 169 (500)
.++|+|.|||++++ |+|||.++. ...+|++|+|.+|++++.++.|++..
T Consensus 2 ~~~i~~~frt~~~~--g~l~~~~~~----~~~~~l~l~l~~g~l~~~~~~g~~~~------------------------- 50 (135)
T smart00282 2 RLSISFSFRTTSPN--GLLLYAGSK----NGGDYLALELRDGRLVLRYDLGSGPA------------------------- 50 (135)
T ss_pred ceEEEEEEEeCCCC--EEEEEeCCC----CCCCEEEEEEECCEEEEEEECCCCCE-------------------------
Confidence 46799999999777 999999742 35799999999999999999998432
Q ss_pred ccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCC
Q psy7014 170 LPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPG 249 (500)
Q Consensus 170 ~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~ 249 (500)
.+++ ....++||+| |+|.+.+.++.+.|.||+........++
T Consensus 51 ------------------------------------~~~~-~~~~~~dg~W-H~v~i~~~~~~~~l~VD~~~~~~~~~~~ 92 (135)
T smart00282 51 ------------------------------------RLTS-DPTPLNDGQW-HRVAVERNGRRVTLSVDGENPVSGESPG 92 (135)
T ss_pred ------------------------------------EEEE-CCeEeCCCCE-EEEEEEEeCCEEEEEECCCccccEECCC
Confidence 2222 2246799999 9999999999999999997655566666
Q ss_pred ccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeeccc
Q psy7014 250 RLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAG 295 (500)
Q Consensus 250 ~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~ 295 (500)
....+++...+||||+|..... .......+|.|||++|++|+.
T Consensus 93 ~~~~l~~~~~l~iGG~p~~~~~---~~~~~~~~F~GCi~~v~in~~ 135 (135)
T smart00282 93 GLTILNLDGPLYLGGLPEDLKL---PPLLVTPGFRGCIRNLKVNGK 135 (135)
T ss_pred CceEEecCCCcEEccCCchhcc---cccccCCCCeeEeeEEEECCC
Confidence 6677888899999999864321 224456799999999999973
No 10
>KOG4289|consensus
Probab=99.83 E-value=6.6e-20 Score=202.91 Aligned_cols=226 Identities=16% Similarity=0.248 Sum_probs=155.7
Q ss_pred CCCceeecCCCCCCCCCCCCCccccc-cccCCCCCCcceEEeecCCC-CceEEEEEEEEeeCCCCCceEEEEecccCCCC
Q psy7014 41 TNQPINQLLSIIFTNFLPPDIEIGQA-SYSSSMSGLSSFSAYVIPAN-IHHCFELKFRFVPNSFDQIALLAFIGQDYQHD 118 (500)
Q Consensus 41 ~~~~~c~c~~~~~G~~C~~~~~~~~~-~f~g~~~~~~sy~~~~~~~~-~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~ 118 (500)
-|.++|+||.+|.|+.|+..+ .-| -|.| +|.+++..... ..-.+.++|.|||+..+ |+||-..-
T Consensus 1519 Wg~~~C~CP~~fggk~c~~~m--~~pq~frG-----~sl~sw~~~~~~vSvPwylsl~FRTr~ad--~vl~~~~~----- 1584 (2531)
T KOG4289|consen 1519 WGGFSCECPLGFGGKGCCQGM--AHPQHFRG-----HSLVSWEGLPSQVSVPWYLSLMFRTRRAD--GVLMQAEF----- 1584 (2531)
T ss_pred cCcEeecCccccCCcchhhcc--CCchhccc-----cceeeecCCCcceecceEEEEEEEeeccc--cEEEEEEe-----
Confidence 357899999999999999976 334 5999 78887774332 44578999999999888 99987742
Q ss_pred CCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccc
Q psy7014 119 AITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFV 198 (500)
Q Consensus 119 ~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~ 198 (500)
+...-+.|+|.+|++.+ ++|....
T Consensus 1585 ~~rst~~lqld~g~l~~--~v~~s~v------------------------------------------------------ 1608 (2531)
T KOG4289|consen 1585 GGRSTYNLQLDDGTLKY--NVGDSSV------------------------------------------------------ 1608 (2531)
T ss_pred CCCceEEEEEcCCEEEE--EecCceE------------------------------------------------------
Confidence 22234889999999876 4443111
Q ss_pred cccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcC
Q psy7014 199 DTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLP 278 (500)
Q Consensus 199 ~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~ 278 (500)
. ....+++||+| |.+.++.... ..++.|..... .........|++.. ||+||.|. ..
T Consensus 1609 ---------~-L~~~~vtdg~W-h~~~i~l~~d-~~~t~d~g~~~-aea~~gl~gl~l~s-l~vGgap~---------~g 1665 (2531)
T KOG4289|consen 1609 ---------E-LPAPRVTDGHW-HHLVIELEAD-SVATLDYGIYQ-AEAKAGLSGLNLES-LYVGGAPA---------TG 1665 (2531)
T ss_pred ---------E-ccCccccCCch-hheeeeeccC-eEEEEechhhh-hhhhcCCCCceeeE-EEEccccC---------CC
Confidence 0 12336689999 9999998764 66677654322 22222245566665 99999873 33
Q ss_pred CCCCccccccceeecccccccccccccccCCCCc-c------c--cC----CccCCCCCCCCCCEEeeCC--CceeecCC
Q psy7014 279 LHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGV-G------Q--CG----TSQCHNHTCSHGGACMNHG--ATFSCLCA 343 (500)
Q Consensus 279 ~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v-~------~--C~----~~~C~~~pC~ngg~Ci~~~--~~~~C~C~ 343 (500)
...||+|||++|++.|..+..+.. .....+-.+ . . |+ .+.|.-+||.|.|+|+..+ ..|+|.|+
T Consensus 1666 ~p~gf~GCiqgV~v~g~~~l~~~k-v~~~~GCvvpn~C~~d~sC~c~~~~C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~ 1744 (2531)
T KOG4289|consen 1666 VPRGFRGCIQGVRVGGVSILVPKK-VNVEAGCVVPNPCSVDSSCPCDPYNCVDVCSLNPCENQGTCVRSPGAHGYTCECP 1744 (2531)
T ss_pred ccccchhhhhceEECCEeeccccc-cccccCcccCCccccCCcccCCCCCccchhcccccccCceeecCCCCCceeEECC
Confidence 456999999999999877654421 110000000 1 1 22 3667789999999998765 57999999
Q ss_pred CCCCCcccccccc-cccC
Q psy7014 344 DGWFGPLCASRYN-LCDS 360 (500)
Q Consensus 344 ~Gy~G~~C~~~i~-~C~~ 360 (500)
+||.|+.|+.+.+ +|.+
T Consensus 1745 ~g~~G~~Ce~~~dq~CPr 1762 (2531)
T KOG4289|consen 1745 PGYTGPYCELRADQPCPR 1762 (2531)
T ss_pred CcccCcchhhhccCCCCC
Confidence 9999999998764 4544
No 11
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation. The structure of the laminin-G domain has been predicted to resemble that of pentraxin. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.77 E-value=5.9e-18 Score=148.18 Aligned_cols=127 Identities=24% Similarity=0.384 Sum_probs=96.8
Q ss_pred EeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccc
Q psy7014 97 FVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLIL 176 (500)
Q Consensus 97 Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 176 (500)
|||+.++ |+|||.+.. ...+|++|+|.+|+|++.|++|++..
T Consensus 1 Frt~~~~--g~Ll~~~~~----~~~~~l~l~l~~g~l~~~~~~g~~~~-------------------------------- 42 (128)
T PF02210_consen 1 FRTRSPN--GLLLYIGSE----DNGDFLSLELVDGRLVVRYNLGGSEI-------------------------------- 42 (128)
T ss_dssp EEESSSS--EEEEEEEES----TTSEEEEEEEETTEEEEEEESSSSEE--------------------------------
T ss_pred CccCCCC--EeEEEEcCC----CCCEEEEEEEECCEEEEEEEccccce--------------------------------
Confidence 7888777 999999854 22689999999999999999995321
Q ss_pred cccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccc-ccc
Q psy7014 177 GVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLT-QLN 255 (500)
Q Consensus 177 ~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~-~L~ 255 (500)
..+ .....++|++| |+|.+.|.++.+.|.||+............. .++
T Consensus 43 ----------------------------~~~--~~~~~~~dg~w-h~v~i~~~~~~~~l~Vd~~~~~~~~~~~~~~~~~~ 91 (128)
T PF02210_consen 43 ----------------------------VTT--FSNSNLNDGQW-HKVSISRDGNRVTLTVDGQSVSSESLPSSSSDSLD 91 (128)
T ss_dssp ----------------------------EEE--ECSSSSTSSSE-EEEEEEEETTEEEEEETTSEEEEEESSSTTHHCBE
T ss_pred ----------------------------eee--ccCccccccce-eEEEEEEeeeeEEEEecCccceEEeccccceeccc
Confidence 112 22335689999 9999999999999999999876666655443 677
Q ss_pred CCCceEEcccccccCcCCCCCcCCCCCccccccceeeccc
Q psy7014 256 TKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAG 295 (500)
Q Consensus 256 ~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~ 295 (500)
....+||||.|....... .....+|.|||+++++||+
T Consensus 92 ~~~~l~iGg~~~~~~~~~---~~~~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 92 PDGSLYIGGLPESNQPSG---SVDTPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp SEEEEEESSTTTTCTCTT---SSTTSB-EEEEEEEEETTE
T ss_pred CCCCEEEecccCcccccc---ccCCCCcEEEcCeEEECCC
Confidence 777899999976443221 1126799999999999974
No 12
>KOG1219|consensus
Probab=99.33 E-value=6.6e-12 Score=144.97 Aligned_cols=108 Identities=26% Similarity=0.679 Sum_probs=93.0
Q ss_pred cccceeeccccccc---ccccccccCCCCccccCC--ccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccc-ccccc
Q psy7014 286 CIFDVELSAGNVGI---NLYKTRAAEGRGVGQCGT--SQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASR-YNLCD 359 (500)
Q Consensus 286 CIr~v~ing~~~~l---~~~~~~~~~~~~v~~C~~--~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~-i~~C~ 359 (500)
|-++-..+|+.+.- .-+.|.|...+....|+. .+|+++||.+||+|++..+.|.|.|+.||+|.+|+.+ +++|.
T Consensus 3867 C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs 3946 (4289)
T KOG1219|consen 3867 CNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECS 3946 (4289)
T ss_pred cccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccc
Confidence 55666667666532 124566677777888984 9999999999999999999999999999999999998 89999
Q ss_pred CCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCC
Q psy7014 360 STRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDE 395 (500)
Q Consensus 360 ~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~ 395 (500)
.+ +|++||.|++..++|.|.|.+||.|.+|+...
T Consensus 3947 ~n--~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~~ 3980 (4289)
T KOG1219|consen 3947 KN--VCGTGGQCINIPGSFHCNCTPGILGRTCCAEK 3980 (4289)
T ss_pred cc--cccCCceeeccCCceEeccChhHhcccCcccc
Confidence 85 89999999999999999999999999997763
No 13
>KOG3509|consensus
Probab=99.13 E-value=2.3e-10 Score=128.49 Aligned_cols=166 Identities=27% Similarity=0.510 Sum_probs=124.0
Q ss_pred CCCCccEEEEEEEeCcEEEEEEcC-CcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeecc
Q psy7014 216 KSKRGGYTVRVGKNGQQCWLMVDN-MGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSA 294 (500)
Q Consensus 216 ~dg~WwH~V~v~r~~~~~~L~VD~-~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing 294 (500)
-.++| |.+.+.| .-.+.+++ ..+....+......+...+.+|.||+ -+...+.+..+...||.|||+++.+++
T Consensus 312 ~~~E~-~~~~i~r---~s~~~~~g~~~~l~g~~~~~~~~i~~ee~v~lg~i--~ni~~l~~~~~~~eGf~gci~~~~~~~ 385 (964)
T KOG3509|consen 312 YIGEW-RFGIIFR---GSGLSVSGHKGVLQGNSNILVSRITNEESVFLGGI--INIETLQHNLPLPEGFAGCIRDLVMNL 385 (964)
T ss_pred cccee-eeeEeee---cccccccCcceeecccccccccceeecccccCCce--eeeccccccCCCccCccceehhhhhhc
Confidence 35699 9999988 44455555 44445555566666777888999996 445556666778889999999999999
Q ss_pred cccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeC
Q psy7014 295 GNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPL 374 (500)
Q Consensus 295 ~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~ 374 (500)
+.+...+.....+. ....|..+.|...||.+.+.|.+..-...|.|+.+|.|..|+...+.|...++-+ ..++|...
T Consensus 386 k~l~~~~~~~~~v~--~~~~c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~-y~~t~~~~ 462 (964)
T KOG3509|consen 386 KDLRVTLQRASYVA--AQGTCLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGS-YLGTCVPI 462 (964)
T ss_pred cccccccccccccc--cccccCCCccccccCCCCccccccccccceeccccccCchhhccCccccccCCcc-ccceEecc
Confidence 88876554322121 1226778999999999999999999999999999999999999988888775333 34677776
Q ss_pred CCCeeeeCCCCCCCCCc
Q psy7014 375 THSYECDCPPGRTGKFC 391 (500)
Q Consensus 375 ~~g~~C~C~~G~~G~~C 391 (500)
.....+.|-+| .|..+
T Consensus 463 ~~~~~~~c~pg-~g~~~ 478 (964)
T KOG3509|consen 463 QGKRCEYCGPG-AGAPT 478 (964)
T ss_pred CCCcceeecCC-CCCcc
Confidence 66677788888 55555
No 14
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation. The structure of the laminin-G domain has been predicted to resemble that of pentraxin. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=98.40 E-value=4.3e-07 Score=80.88 Aligned_cols=42 Identities=29% Similarity=0.559 Sum_probs=39.1
Q ss_pred EeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcCCCCC
Q psy7014 454 IRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGDRP 495 (500)
Q Consensus 454 frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g~~~~ 495 (500)
|||.+++|||||.+.....|||+|+|++|+|+++|++|+++.
T Consensus 1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~~~ 42 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSGPA 42 (131)
T ss_dssp EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSEEE
T ss_pred CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCccc
Confidence 899999999999998877899999999999999999998843
No 15
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=98.32 E-value=1.8e-06 Score=77.64 Aligned_cols=46 Identities=30% Similarity=0.490 Sum_probs=42.8
Q ss_pred ceeeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcC
Q psy7014 446 HEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLML 491 (500)
Q Consensus 446 ~~~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g 491 (500)
...+|+|+|||.+++|+|||.+.....+|++|+|++|+|.+.++.|
T Consensus 20 ~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g 65 (151)
T cd00110 20 TRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLG 65 (151)
T ss_pred ceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEEEcCC
Confidence 6789999999999999999998875689999999999999999987
No 16
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.31 E-value=1.7e-05 Score=74.80 Aligned_cols=130 Identities=12% Similarity=0.099 Sum_probs=80.9
Q ss_pred CceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEc-CCceeEEEeechhhhhhhhhcccccc
Q psy7014 87 IHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNL-GSGWYLVYFEHTYLFILSRLRSAQDT 165 (500)
Q Consensus 87 ~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~-G~g~~~~~~~~~~~~~~~~l~~~~~~ 165 (500)
....+.|.+.||+.. ...|.||-..+. ....++.|.+..++..+.+.. +..
T Consensus 50 ~~~~fsi~~~~r~~~-~~~g~L~si~~~----~~~~~l~v~l~g~~~~~~~~~~~~~----------------------- 101 (184)
T smart00210 50 LPEDFSLLTTFRQTP-KSRGVLFAIYDA----QNVRQFGLEVDGRANTLLLRYQGVD----------------------- 101 (184)
T ss_pred CCCCeEEEEEEEeCC-CCCeEEEEEEcC----CCcEEEEEEEeCCccEEEEEECCCC-----------------------
Confidence 345788899998874 445888766532 344688999887664554443 220
Q ss_pred ccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceee
Q psy7014 166 RLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTS 245 (500)
Q Consensus 166 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~ 245 (500)
|.....+-..+.+.||+| |+|.+...+..++|.||.......
T Consensus 102 -------------------------------------g~~~~~~f~~~~l~dg~W-H~lal~V~~~~v~LyvDC~~~~~~ 143 (184)
T smart00210 102 -------------------------------------GKQHTVSFRNLPLADGQW-HKLALSVSGSSATLYVDCNEIDSR 143 (184)
T ss_pred -------------------------------------CcEEEEeecCCccccCCc-eEEEEEEeCCEEEEEECCccccce
Confidence 111111112245789999 999999999999999999875544
Q ss_pred eCCCcc-ccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014 246 KSPGRL-TQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS 293 (500)
Q Consensus 246 ~~~~~~-~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in 293 (500)
..+... ..++.++ +.++|... .....|.|||++++|.
T Consensus 144 ~l~~~~~~~~~~~g-~~~~g~~~----------~~~~~f~G~lq~l~i~ 181 (184)
T smart00210 144 PLDRPGQPPIDTDG-IEVRGAQA----------ADRKPFQGDLQQLKIV 181 (184)
T ss_pred ecCCcccccccccc-eEEEeecc----------CCCCcceEEeEEEEEe
Confidence 333222 1333333 44554321 1124799999999984
No 17
>smart00282 LamG Laminin G domain.
Probab=98.13 E-value=8.7e-06 Score=72.26 Aligned_cols=47 Identities=30% Similarity=0.459 Sum_probs=42.6
Q ss_pred eeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcCCCC
Q psy7014 448 ACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGDR 494 (500)
Q Consensus 448 ~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g~~~ 494 (500)
.+|+|.|||.+++|+|||.+.....+|++|+|.+|+|.+.++.|++.
T Consensus 3 ~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~ 49 (135)
T smart00282 3 LSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSGP 49 (135)
T ss_pred eEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCCC
Confidence 47999999999999999998755689999999999999999998754
No 18
>PF00008 EGF: EGF-like domain; This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long, found in epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.84 E-value=8e-06 Score=54.46 Aligned_cols=26 Identities=54% Similarity=1.280 Sum_probs=16.7
Q ss_pred cCCCCCEEeeCC-CCeeeeCCCCCCCC
Q psy7014 364 NCSFGATCVPLT-HSYECDCPPGRTGK 389 (500)
Q Consensus 364 pC~ngg~C~~~~-~g~~C~C~~G~~G~ 389 (500)
||.|+|+|++.. .+|.|.|++||+|+
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 566666666655 66666666666665
No 19
>PF00008 EGF: EGF-like domain; This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long, found in epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.75 E-value=1.3e-05 Score=53.51 Aligned_cols=31 Identities=29% Similarity=0.978 Sum_probs=29.1
Q ss_pred CCCCCCCCCCEEeeCC-CceeecCCCCCCCcc
Q psy7014 320 CHNHTCSHGGACMNHG-ATFSCLCADGWFGPL 350 (500)
Q Consensus 320 C~~~pC~ngg~Ci~~~-~~~~C~C~~Gy~G~~ 350 (500)
|.++||+|+|+|++.. +.|.|.|+.||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 7889999999999999 999999999999963
No 20
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=97.68 E-value=0.00067 Score=60.24 Aligned_cols=147 Identities=16% Similarity=0.194 Sum_probs=82.8
Q ss_pred ccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEEC-CEEEEEEEcCCcee
Q psy7014 66 ASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIK-GYVVLTWNLGSGWY 144 (500)
Q Consensus 66 ~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~-G~l~~~~~~G~g~~ 144 (500)
..|+|. ++|+.++...-....+.|++.||+....+...++.... ...+.+.|.+.+ |++++.+..+.+.
T Consensus 3 ~~f~g~----~~~i~~~~~~~~~~~fTi~~w~~~~~~~~~~~~~~~~~-----~~~~~~~l~~~~~~~l~~~~~~~~~~- 72 (157)
T PF13385_consen 3 LYFDGS----NDYISIPNSDFPSGSFTISFWVKPDSPSSSQSFVFMDS-----SGSGGFGLFINNNGRLRFYIGNGGGG- 72 (157)
T ss_dssp EEE-ST----T-EEEEESGGGGGTEEEEEEEEEESS--SSEEEEEESS-----SSSEEEEEEEETTSEEEEEETTSEEE-
T ss_pred EEECCC----CCEEEECCcCCCCCCEEEEEEEEeCCCCCCceEEEEec-----CCCCEEEEEEECCCEEEEEEeCCCce-
Confidence 356664 78999985322256899999999887666554444311 122366777764 6666644444311
Q ss_pred EEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEE
Q psy7014 145 LVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTV 224 (500)
Q Consensus 145 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V 224 (500)
...+.+.. .+.+++| |+|
T Consensus 73 -----------------------------------------------------------~~~~~~~~--~~~~~~W-~~l 90 (157)
T PF13385_consen 73 -----------------------------------------------------------NYSFSSDS--NLPDNKW-HHL 90 (157)
T ss_dssp -----------------------------------------------------------SS-EE-BS-----TT-E-EEE
T ss_pred -----------------------------------------------------------eEEEecCc--ccCCCCE-EEE
Confidence 01122223 3456799 999
Q ss_pred EEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeecccc
Q psy7014 225 RVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGN 296 (500)
Q Consensus 225 ~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~ 296 (500)
.+..++..+.|.||+........+.. ........++||+... ....|.|-|.++++=.+.
T Consensus 91 ~~~~~~~~~~lyvnG~~~~~~~~~~~-~~~~~~~~~~iG~~~~-----------~~~~~~g~i~~~~i~~~a 150 (157)
T PF13385_consen 91 ALTYDGSTVTLYVNGELVGSSTIPSN-ISLNSNGPLFIGGSGG-----------GSSPFNGYIDDLRIYNRA 150 (157)
T ss_dssp EEEEETTEEEEEETTEEETTCTEESS-SSTTSCCEEEESS-ST-----------T--B-EEEEEEEEEESS-
T ss_pred EEEEECCeEEEEECCEEEEeEeccCC-cCCCCcceEEEeecCC-----------CCCceEEEEEEEEEECcc
Confidence 99999999999999976433322222 1234556799998631 135799999999985443
No 21
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation. The structure of the laminin-G domain has been predicted to resemble that of pentraxin. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=97.61 E-value=6.3e-05 Score=65.24 Aligned_cols=40 Identities=30% Similarity=0.566 Sum_probs=36.3
Q ss_pred EeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcCCC
Q psy7014 454 IRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGD 493 (500)
Q Consensus 454 frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g~~ 493 (500)
|||++++|+|||.+.....+|++|+|.+|+|++.+++|+.
T Consensus 1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~ 40 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGS 40 (128)
T ss_dssp EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSS
T ss_pred CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEcccc
Confidence 8999999999999877546899999999999999999944
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.59 E-value=8.8e-05 Score=50.98 Aligned_cols=37 Identities=49% Similarity=1.086 Sum_probs=29.4
Q ss_pred cccccCCCccCCCCCEEeeCCCCeeeeCCCCCC-CCCcC
Q psy7014 355 YNLCDSTRHNCSFGATCVPLTHSYECDCPPGRT-GKFCE 392 (500)
Q Consensus 355 i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~-G~~Ce 392 (500)
+++|... .||.+++.|++..++|.|.|+.||. |..|+
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASG-NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCC-CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 3556542 3788888999988899999999998 88885
No 23
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n' is a negatively charged or polar residue [DEQN], 'b' is a possibly beta-hydroxylated residue [DN], 'a' is an aromatic amino acid, 'C' is a cysteine involved in a disulphide bond, and 'x' is any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.37 E-value=0.00011 Score=52.02 Aligned_cols=34 Identities=41% Similarity=0.919 Sum_probs=29.9
Q ss_pred ccccccCCCccCCCCCEEeeCCCCeeeeCCCCCC
Q psy7014 354 RYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRT 387 (500)
Q Consensus 354 ~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~ 387 (500)
+|++|...+++|..++.|++..++|.|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4678888777898899999999999999999986
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.22 E-value=0.00043 Score=46.88 Aligned_cols=36 Identities=50% Similarity=1.102 Sum_probs=27.2
Q ss_pred ccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcC
Q psy7014 356 NLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCE 392 (500)
Q Consensus 356 ~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce 392 (500)
++|... .+|.+++.|.+..++|.|.|+.+|.|..|+
T Consensus 3 ~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCC-CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 445441 268888888888888889998888888774
No 25
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=97.06 E-value=0.017 Score=55.21 Aligned_cols=76 Identities=14% Similarity=0.055 Sum_probs=48.4
Q ss_pred CCCCccEEEEEEEe--CcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014 216 KSKRGGYTVRVGKN--GQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS 293 (500)
Q Consensus 216 ~dg~WwH~V~v~r~--~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in 293 (500)
.+++| |+|.+..+ .+.+.|.||+....... ......+...+.|.||..... +-........|.|-|.++++=
T Consensus 88 ~~g~W-~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~----~gg~~~~~~~f~G~I~~v~iw 161 (201)
T cd00152 88 SDGAW-HHICVTWESTSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDS----YGGGFDATQSFVGEISDVNMW 161 (201)
T ss_pred CCCCE-EEEEEEEECCCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccC----CCCCCCCCcceEEEEceeEEE
Confidence 78899 99999988 44678999997643332 122234455667888854211 111122345799999999886
Q ss_pred cccc
Q psy7014 294 AGNV 297 (500)
Q Consensus 294 g~~~ 297 (500)
++.+
T Consensus 162 ~~~L 165 (201)
T cd00152 162 DSVL 165 (201)
T ss_pred cccC
Confidence 5544
No 26
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family forms a discoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=97.02 E-value=0.016 Score=55.69 Aligned_cols=78 Identities=13% Similarity=0.067 Sum_probs=48.8
Q ss_pred eeCCCCccEEEEEEEeC--cEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCcccccccee
Q psy7014 214 SAKSKRGGYTVRVGKNG--QQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVE 291 (500)
Q Consensus 214 ~~~dg~WwH~V~v~r~~--~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ 291 (500)
.+.|++| |+|-+..+. +.+.|.|||... ..........+.....|.||-.... .-........|.|-|.++.
T Consensus 86 ~~~~g~W-~hvc~tw~~~~g~~~lyvnG~~~-~~~~~~~g~~i~~~G~lvlGq~qd~----~gg~f~~~~~f~G~i~~v~ 159 (206)
T smart00159 86 PESDGKW-HHICTTWESSSGIAELWVDGKPG-VRKGLAKGYTVKPGGSIILGQEQDS----YGGGFDATQSFVGEIGDLN 159 (206)
T ss_pred cccCCce-EEEEEEEECCCCcEEEEECCEEc-ccccccCCcEECCCCEEEEEecccC----CCCCCCCCcceeEEEeeeE
Confidence 4578899 999999884 467899999763 2221122233455566888864221 1111234457999999998
Q ss_pred eccccc
Q psy7014 292 LSAGNV 297 (500)
Q Consensus 292 ing~~~ 297 (500)
+=++.+
T Consensus 160 iw~~~L 165 (206)
T smart00159 160 MWDSVL 165 (206)
T ss_pred EecccC
Confidence 865544
No 27
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.01 E-value=0.00077 Score=46.17 Aligned_cols=35 Identities=31% Similarity=0.994 Sum_probs=30.8
Q ss_pred ccCCC-CCCCCCCEEeeCCCceeecCCCCCC-Ccccc
Q psy7014 318 SQCHN-HTCSHGGACMNHGATFSCLCADGWF-GPLCA 352 (500)
Q Consensus 318 ~~C~~-~pC~ngg~Ci~~~~~~~C~C~~Gy~-G~~C~ 352 (500)
++|.. .||.+++.|++..+.|.|.|+.||. |..|+
T Consensus 3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred ccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 56776 7999999999999999999999999 88774
No 28
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.87 E-value=0.0016 Score=43.32 Aligned_cols=30 Identities=57% Similarity=1.157 Sum_probs=24.4
Q ss_pred ccCCCCCEEeeCCCCeeeeCCCCCCCC-CcC
Q psy7014 363 HNCSFGATCVPLTHSYECDCPPGRTGK-FCE 392 (500)
Q Consensus 363 ~pC~ngg~C~~~~~g~~C~C~~G~~G~-~Ce 392 (500)
.+|.+++.|++..++|.|.|+.||.|. .|+
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence 368888888888888889999888887 663
No 29
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2,3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, an N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain.; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=96.82 E-value=0.035 Score=52.55 Aligned_cols=140 Identities=15% Similarity=0.079 Sum_probs=82.3
Q ss_pred eEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccc
Q psy7014 89 HCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLC 168 (500)
Q Consensus 89 ~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~ 168 (500)
....|.++|++.+....--||-.++. .....|+.|++.++++-+.++-..|.....+.
T Consensus 33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~---~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~------------------- 90 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSGIQSLFSISNS---TKGNEYFSLYVSNNKLGFELRDTKGNQNYNFS------------------- 90 (190)
T ss_dssp SSEEEEEEEEESS-SSEEEEEEEE-T---STTSEEEEEEEETTEEEEEEEETTTTCEEEEE-------------------
T ss_pred cccEEEEEEecCCCcceeEEEEecCC---CCccceEEEEEECCEEEEEEecCCCCcccccc-------------------
Confidence 46789999999766643446666543 34558999999999988888776653211110
Q ss_pred cccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEe--CcEEEEEEcCCcceeee
Q psy7014 169 CLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKN--GQQCWLMVDNMGNVTSK 246 (500)
Q Consensus 169 ~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~--~~~~~L~VD~~~~~~~~ 246 (500)
.+..+. +...++..| |+|.+.-+ ....+|.|||.......
T Consensus 91 ----------------------------------~~~~v~---~~~~~~~~~-~tva~~ad~~~~~ykly~NG~~v~~~~ 132 (190)
T PF02973_consen 91 ----------------------------------RPAKVR---GGYKNNVTF-NTVAFVADSKNKGYKLYVNGELVSTLS 132 (190)
T ss_dssp ----------------------------------ESSE-----SEETTEES--EEEEEEEETTTTEEEEEETTCEEEEEE
T ss_pred ----------------------------------cccEec---ccccCCceE-EEEEEEEecCCCeEEEEeCCeeEEEec
Confidence 111111 112233344 99999997 88999999996544443
Q ss_pred CC-CccccccCC--CceEEcccccccCcCCCCCcCCCCCccccccceeeccccc
Q psy7014 247 SP-GRLTQLNTK--PMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNV 297 (500)
Q Consensus 247 ~~-~~~~~L~~~--~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~ 297 (500)
.+ +++ .-++. ..++|||..... ...-+|.|=|++|.+=++.+
T Consensus 133 ~~~~~F-is~i~~~n~~~iG~t~R~g--------~~~y~f~G~I~~l~iYn~aL 177 (190)
T PF02973_consen 133 SKSGNF-ISDIPGLNSVQIGGTNRAG--------SNAYPFNGTIDNLKIYNRAL 177 (190)
T ss_dssp ECTSS--GGGSTT--EEEESSEEETT--------EEES--EEEEEEEEEESS--
T ss_pred cccccH-hhcCcCCceEEEcceEeCC--------CceecccceEEEEEEEcCcC
Confidence 33 222 11222 249999984322 12348999999999976554
No 30
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.80 E-value=0.0014 Score=44.21 Aligned_cols=35 Identities=31% Similarity=1.001 Sum_probs=30.6
Q ss_pred ccCCC-CCCCCCCEEeeCCCceeecCCCCCCCcccc
Q psy7014 318 SQCHN-HTCSHGGACMNHGATFSCLCADGWFGPLCA 352 (500)
Q Consensus 318 ~~C~~-~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~ 352 (500)
++|.. .+|.+++.|.+..+.|.|.|+.||.|..|+
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 56776 789999999999999999999999997773
No 31
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.70 E-value=0.0023 Score=42.88 Aligned_cols=28 Identities=61% Similarity=1.278 Sum_probs=22.6
Q ss_pred cCCCCCEEeeCCCCeeeeCCCCCCC-CCcC
Q psy7014 364 NCSFGATCVPLTHSYECDCPPGRTG-KFCE 392 (500)
Q Consensus 364 pC~ngg~C~~~~~g~~C~C~~G~~G-~~Ce 392 (500)
+|.++ .|++..++|.|.|+.||.| ..|+
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence 67777 8888888888888888888 7664
No 32
>KOG1225|consensus
Probab=96.58 E-value=0.0033 Score=67.88 Aligned_cols=75 Identities=33% Similarity=0.821 Sum_probs=58.8
Q ss_pred ccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCC
Q psy7014 306 AAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPG 385 (500)
Q Consensus 306 ~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G 385 (500)
+..++....|..-.|... |..++.|++. .|.|.+||.|+.|+..- |.. .|.++|.|++. .|.|.+|
T Consensus 269 C~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs~~~--cpa---dC~g~G~Ci~G----~C~C~~G 334 (525)
T KOG1225|consen 269 CPPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPGYSGKDCSIRR--CPA---DCSGHGKCIDG----ECLCDEG 334 (525)
T ss_pred CCCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCCcccccccccc--CCc---cCCCCCcccCC----ceEeCCC
Confidence 345556666666566655 8888888766 89999999999997544 654 69999999933 7999999
Q ss_pred CCCCCcCCC
Q psy7014 386 RTGKFCEKD 394 (500)
Q Consensus 386 ~~G~~Ce~~ 394 (500)
|+|..|++.
T Consensus 335 y~G~~C~~~ 343 (525)
T KOG1225|consen 335 YTGELCIQR 343 (525)
T ss_pred CcCCccccc
Confidence 999999986
No 33
>KOG1214|consensus
Probab=96.55 E-value=0.016 Score=64.42 Aligned_cols=102 Identities=21% Similarity=0.522 Sum_probs=69.9
Q ss_pred CccCCCC--CCCCCCEEeeCCCceeecCCCCCC----Cccccc-----ccccccCCCccCCCCCEEe--eC-CCCeeeeC
Q psy7014 317 TSQCHNH--TCSHGGACMNHGATFSCLCADGWF----GPLCAS-----RYNLCDSTRHNCSFGATCV--PL-THSYECDC 382 (500)
Q Consensus 317 ~~~C~~~--pC~ngg~Ci~~~~~~~C~C~~Gy~----G~~C~~-----~i~~C~~~p~pC~ngg~C~--~~-~~g~~C~C 382 (500)
.+.|+.. -|-...+|+...+.|+|.|..+|. +-.|.- ..++|....+.|...|.|. .. .+.|.|.|
T Consensus 734 ~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~C 813 (1289)
T KOG1214|consen 734 ENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCAC 813 (1289)
T ss_pred hhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEee
Confidence 3667643 499999999999999999998876 345643 3466766656677666554 33 47899999
Q ss_pred CCCCCCC--CcCCC-CCC------cCccccCCCceEEecCCCccc
Q psy7014 383 PPGRTGK--FCEKD-ESL------SDISFSGRRSYISLPSSELHL 418 (500)
Q Consensus 383 ~~G~~G~--~Ce~~-~~~------~~~~F~g~~sy~~~~~~~~~~ 418 (500)
-+||+|+ .|... ++. .+.+++..++|.+-+.+++.+
T Consensus 814 LPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~G 858 (1289)
T KOG1214|consen 814 LPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYG 858 (1289)
T ss_pred cCCccCCccccccccccCccccCCCceEecCCCcceeecccCccC
Confidence 9999876 34433 221 134566667777777776654
No 34
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.47 E-value=0.0035 Score=41.55 Aligned_cols=32 Identities=38% Similarity=1.162 Sum_probs=28.2
Q ss_pred CC-CCCCCCCCEEeeCCCceeecCCCCCCCc-cc
Q psy7014 320 CH-NHTCSHGGACMNHGATFSCLCADGWFGP-LC 351 (500)
Q Consensus 320 C~-~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~-~C 351 (500)
|. ..+|.+++.|++..+.|.|.|+.||.|. .|
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 45 6789999999999999999999999997 55
No 35
>KOG1214|consensus
Probab=96.42 E-value=0.0045 Score=68.49 Aligned_cols=57 Identities=37% Similarity=0.938 Sum_probs=47.0
Q ss_pred EEeeC-CCceeecCCCCCCCc--ccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCC
Q psy7014 330 ACMNH-GATFSCLCADGWFGP--LCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGK 389 (500)
Q Consensus 330 ~Ci~~-~~~~~C~C~~Gy~G~--~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~ 389 (500)
.|+.. ...|+|.|-+||.|. .|. ++++|.+. -|...++|.+.+++|.|.|.+||.|+
T Consensus 800 ~c~~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~ps--rChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 800 RCVHHGGSTYSCACLPGFSGDGHQCT-DVDECSPS--RCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred EEEecCCceEEEeecCCccCCccccc-cccccCcc--ccCCCceEecCCCcceeecccCccCC
Confidence 34444 468999999999965 443 56999875 79999999999999999999999865
No 36
>KOG1217|consensus
Probab=96.35 E-value=0.0056 Score=65.00 Aligned_cols=78 Identities=32% Similarity=0.842 Sum_probs=63.7
Q ss_pred CccCCCCC-CCCCCEEeeCCCceeecCCCCCCCccc--ccccccccC--CCccCCCCCEEee--CCCCeeeeCCCCCCCC
Q psy7014 317 TSQCHNHT-CSHGGACMNHGATFSCLCADGWFGPLC--ASRYNLCDS--TRHNCSFGATCVP--LTHSYECDCPPGRTGK 389 (500)
Q Consensus 317 ~~~C~~~p-C~ngg~Ci~~~~~~~C~C~~Gy~G~~C--~~~i~~C~~--~p~pC~ngg~C~~--~~~g~~C~C~~G~~G~ 389 (500)
.+.|...+ |.+++.|+...+.|.|.|++||.|..| ..+...|.. ...+|.++++|.. ....+.|.|..++.|.
T Consensus 271 ~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~ 350 (487)
T KOG1217|consen 271 VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR 350 (487)
T ss_pred ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC
Confidence 47788764 999999999999999999999999999 234467742 2357999999933 3468889999999999
Q ss_pred CcCCC
Q psy7014 390 FCEKD 394 (500)
Q Consensus 390 ~Ce~~ 394 (500)
.|+..
T Consensus 351 ~C~~~ 355 (487)
T KOG1217|consen 351 RCEDS 355 (487)
T ss_pred ccccC
Confidence 99976
No 37
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.33 E-value=0.0045 Score=41.47 Aligned_cols=31 Identities=35% Similarity=1.087 Sum_probs=27.6
Q ss_pred CCC-CCCCCCCEEeeCCCceeecCCCCCCC-ccc
Q psy7014 320 CHN-HTCSHGGACMNHGATFSCLCADGWFG-PLC 351 (500)
Q Consensus 320 C~~-~pC~ngg~Ci~~~~~~~C~C~~Gy~G-~~C 351 (500)
|.. .+|.++ .|++.++.|.|.|+.||.| +.|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 566 689999 9999999999999999999 666
No 38
>KOG4260|consensus
Probab=96.01 E-value=0.0068 Score=59.27 Aligned_cols=93 Identities=23% Similarity=0.475 Sum_probs=67.8
Q ss_pred CCCCCCCCEEeeC---CCceeecCCCCCCCccccc---------------------------------------------
Q psy7014 322 NHTCSHGGACMNH---GATFSCLCADGWFGPLCAS--------------------------------------------- 353 (500)
Q Consensus 322 ~~pC~ngg~Ci~~---~~~~~C~C~~Gy~G~~C~~--------------------------------------------- 353 (500)
..||...|.|.-+ .++-.|.|.+||+|+.|..
T Consensus 149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~l 228 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKL 228 (350)
T ss_pred cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhccccee
Confidence 4678888888765 3567899999999888653
Q ss_pred ------ccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCC--CcCCCCCC----cCccccCCCceEEecCC
Q psy7014 354 ------RYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGK--FCEKDESL----SDISFSGRRSYISLPSS 414 (500)
Q Consensus 354 ------~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~--~Ce~~~~~----~~~~F~g~~sy~~~~~~ 414 (500)
++++|...|.||.....|++..++|.|.+.+||.+. .|+.-..+ ...+.+-.++|..++..
T Consensus 229 de~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~ 301 (350)
T KOG4260|consen 229 DEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDGQYRCVCFS 301 (350)
T ss_pred cccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCccEEEEecc
Confidence 268888889999999999999999999999998753 34442111 12345555677766543
No 39
>KOG1225|consensus
Probab=96.00 E-value=0.013 Score=63.45 Aligned_cols=58 Identities=38% Similarity=0.937 Sum_probs=47.2
Q ss_pred CCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCC
Q psy7014 325 CSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDE 395 (500)
Q Consensus 325 C~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~ 395 (500)
|.+++.|++. .|.|++||+|..|+. ..|.. +|..++.|++. .|.|+++|+|+.|+...
T Consensus 256 c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~---~cs~~g~~~~g----~CiC~~g~~G~dCs~~~ 313 (525)
T KOG1225|consen 256 CTGRGQCVEG----RCICPPGFTGDDCDE--LVCPV---DCSGGGVCVDG----ECICNPGYSGKDCSIRR 313 (525)
T ss_pred CcccceEeCC----eEeCCCCCcCCCCCc--ccCCc---ccCCCceecCC----EeecCCCcccccccccc
Confidence 5555667765 899999999999986 45755 48888888875 89999999999998764
No 40
>KOG1217|consensus
Probab=95.89 E-value=0.014 Score=61.86 Aligned_cols=76 Identities=37% Similarity=0.864 Sum_probs=62.2
Q ss_pred cCCCCC--CCCCCEEeeC---CCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCC
Q psy7014 319 QCHNHT--CSHGGACMNH---GATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEK 393 (500)
Q Consensus 319 ~C~~~p--C~ngg~Ci~~---~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~ 393 (500)
.|...+ +...+.|... ...+.|.|..||.+..|....+.|.....+|.+++.|.+...+|.|.|+.+|.|..|+.
T Consensus 128 ~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~ 207 (487)
T KOG1217|consen 128 ECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET 207 (487)
T ss_pred eecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC
Confidence 355555 3455566654 35889999999999999988789986667899999999999889999999999999987
Q ss_pred C
Q psy7014 394 D 394 (500)
Q Consensus 394 ~ 394 (500)
.
T Consensus 208 ~ 208 (487)
T KOG1217|consen 208 T 208 (487)
T ss_pred C
Confidence 6
No 41
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP), serum amyloid P component protein (SAP), and female protein (FP). CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement membrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII). NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig). PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=95.34 E-value=0.2 Score=47.82 Aligned_cols=78 Identities=15% Similarity=0.109 Sum_probs=42.8
Q ss_pred eeCCCCccEEEEEEEeC--cEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCcccccccee
Q psy7014 214 SAKSKRGGYTVRVGKNG--QQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVE 291 (500)
Q Consensus 214 ~~~dg~WwH~V~v~r~~--~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ 291 (500)
.+.+++| |+|-+..+. ....|.+||.. ...........+.-++.+.||--.. .+.........|.|=|.++.
T Consensus 80 ~~~~~~W-hh~C~tW~s~~G~~~ly~dG~~-~~~~~~~~g~~i~~gG~~vlGQeQd----~~gG~fd~~q~F~G~i~~~~ 153 (195)
T PF00354_consen 80 PIRDGQW-HHICVTWDSSTGRWQLYVDGVR-LSSTGLATGHSIPGGGTLVLGQEQD----SYGGGFDESQAFVGEISDFN 153 (195)
T ss_dssp CS-TSS--EEEEEEEETTTTEEEEEETTEE-EEEEESSTT--B-SSEEEEESS-BS----BTTBTCSGGGB--EEEEEEE
T ss_pred ccCCCCc-EEEEEEEecCCcEEEEEECCEe-cccccccCCceECCCCEEEECcccc----ccCCCcCCccEeeEEEeceE
Confidence 4568899 999999875 67888899984 2222222234454555677774321 12222334568999999998
Q ss_pred eccccc
Q psy7014 292 LSAGNV 297 (500)
Q Consensus 292 ing~~~ 297 (500)
+=++.+
T Consensus 154 iWd~vL 159 (195)
T PF00354_consen 154 IWDRVL 159 (195)
T ss_dssp EESS--
T ss_pred EEeeeC
Confidence 855544
No 42
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=95.23 E-value=0.01 Score=40.66 Aligned_cols=27 Identities=41% Similarity=0.867 Sum_probs=20.8
Q ss_pred ccCCCCCEEeeCCCCeeeeCCCCCCCC
Q psy7014 363 HNCSFGATCVPLTHSYECDCPPGRTGK 389 (500)
Q Consensus 363 ~pC~ngg~C~~~~~g~~C~C~~G~~G~ 389 (500)
..|...+.|++..++|.|.|.+||.|.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 468889999999999999999998764
No 43
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=95.22 E-value=0.026 Score=37.58 Aligned_cols=26 Identities=42% Similarity=0.729 Sum_probs=21.8
Q ss_pred cCCCCCEEeeCCCCeeeeCCCCCCCCCc
Q psy7014 364 NCSFGATCVPLTHSYECDCPPGRTGKFC 391 (500)
Q Consensus 364 pC~ngg~C~~~~~g~~C~C~~G~~G~~C 391 (500)
.|.++|+|+.. ...|.|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 58888999866 558999999999887
No 44
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.21 E-value=0.0066 Score=32.00 Aligned_cols=13 Identities=8% Similarity=-0.463 Sum_probs=11.1
Q ss_pred eeecCCCCCCCCC
Q psy7014 45 INQLLSIIFTNFL 57 (500)
Q Consensus 45 ~c~c~~~~~G~~C 57 (500)
.|+|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999998
No 45
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.18 E-value=0.0064 Score=32.06 Aligned_cols=13 Identities=62% Similarity=1.559 Sum_probs=8.2
Q ss_pred eeeCCCCCCCCCc
Q psy7014 379 ECDCPPGRTGKFC 391 (500)
Q Consensus 379 ~C~C~~G~~G~~C 391 (500)
.|.|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 3677777777666
No 46
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=94.07 E-value=0.043 Score=38.69 Aligned_cols=30 Identities=33% Similarity=1.028 Sum_probs=25.7
Q ss_pred ccCCC--CCCCCCCEEeeCCCceeecCCCCCC
Q psy7014 318 SQCHN--HTCSHGGACMNHGATFSCLCADGWF 347 (500)
Q Consensus 318 ~~C~~--~pC~ngg~Ci~~~~~~~C~C~~Gy~ 347 (500)
+.|.. ++|..++.|++..++|.|.|++||.
T Consensus 3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp STTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 45553 4698899999999999999999997
No 47
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=93.75 E-value=0.072 Score=35.44 Aligned_cols=27 Identities=37% Similarity=0.998 Sum_probs=23.4
Q ss_pred CCCCCCCEEeeCCCceeecCCCCCCCccc
Q psy7014 323 HTCSHGGACMNHGATFSCLCADGWFGPLC 351 (500)
Q Consensus 323 ~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C 351 (500)
..|.++|+|+.. ..+|.|..||.|+.|
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 469999999876 678999999999876
No 48
>KOG1226|consensus
Probab=92.33 E-value=0.26 Score=54.95 Aligned_cols=56 Identities=34% Similarity=0.816 Sum_probs=35.1
Q ss_pred eecCCCCCCCcccccc--cccccCCC-ccCCCCCEEeeCCCCeeeeCCCC-CCCCCcCCCCCCc
Q psy7014 339 SCLCADGWFGPLCASR--YNLCDSTR-HNCSFGATCVPLTHSYECDCPPG-RTGKFCEKDESLS 398 (500)
Q Consensus 339 ~C~C~~Gy~G~~C~~~--i~~C~~~p-~pC~ngg~C~~~~~g~~C~C~~G-~~G~~Ce~~~~~~ 398 (500)
.|.|.+||+|..|+.+ .+.|.+.- .-|...|+|.=. +|.|... |+|..||+.....
T Consensus 567 ~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~cptc~ 626 (783)
T KOG1226|consen 567 RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEKCPTCP 626 (783)
T ss_pred cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhcCCCCC
Confidence 5566788888887654 35665531 234444444433 6778764 8899998875533
No 49
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=92.05 E-value=0.12 Score=45.14 Aligned_cols=31 Identities=29% Similarity=0.685 Sum_probs=25.0
Q ss_pred cCCCCCEEeeC--CCCeeeeCCCCCCCCCcCCCC
Q psy7014 364 NCSFGATCVPL--THSYECDCPPGRTGKFCEKDE 395 (500)
Q Consensus 364 pC~ngg~C~~~--~~g~~C~C~~G~~G~~Ce~~~ 395 (500)
-|.|| .|.-. ...+.|.|+.||+|.+||...
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCccccccccee
Confidence 47786 88643 478999999999999998763
No 50
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=90.83 E-value=0.45 Score=44.11 Aligned_cols=37 Identities=14% Similarity=0.083 Sum_probs=27.6
Q ss_pred CceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeC
Q psy7014 210 PNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKS 247 (500)
Q Consensus 210 ~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~ 247 (500)
........++| |+++|...+..+++.||+..+.....
T Consensus 119 ~~~~~~~~~~W-~~~~I~~~g~~i~v~vnG~~v~~~~d 155 (185)
T PF06439_consen 119 SVNVAIPPGEW-NTVRIVVKGNRITVWVNGKPVADFTD 155 (185)
T ss_dssp SS--S--TTSE-EEEEEEEETTEEEEEETTEEEEEEET
T ss_pred cccccCCCCce-EEEEEEEECCEEEEEECCEEEEEEEc
Confidence 34445677899 99999999999999999987655544
No 51
>PHA02887 EGF-like protein; Provisional
Probab=90.82 E-value=0.19 Score=43.30 Aligned_cols=30 Identities=27% Similarity=0.492 Sum_probs=22.5
Q ss_pred cCCCCCEEee--CCCCeeeeCCCCCCCCCcCCC
Q psy7014 364 NCSFGATCVP--LTHSYECDCPPGRTGKFCEKD 394 (500)
Q Consensus 364 pC~ngg~C~~--~~~g~~C~C~~G~~G~~Ce~~ 394 (500)
-|-+ |+|.- ....+.|.|+.||+|.+||..
T Consensus 93 YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 93 FCIN-GECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred EeeC-CEEEccccCCCceeECCCCcccCCCCcc
Confidence 4664 48864 346788999999999999864
No 52
>smart00051 DSL delta serrate ligand.
Probab=90.82 E-value=0.33 Score=37.60 Aligned_cols=47 Identities=21% Similarity=0.580 Sum_probs=32.5
Q ss_pred eeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCc
Q psy7014 338 FSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFC 391 (500)
Q Consensus 338 ~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~C 391 (500)
+.-.|+.+|.|..|+ ..|.+. +-+..+.+|.. .-.|.|.+||+|..|
T Consensus 17 ~rv~C~~~~yG~~C~---~~C~~~-~d~~~~~~Cd~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCN---KFCRPR-DDFFGHYTCDE---NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccC---CEeCcC-ccccCCccCCc---CCCEecCCCCcCCCC
Confidence 345688999999996 345432 23556667743 135789999999987
No 53
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=90.79 E-value=0.71 Score=40.86 Aligned_cols=67 Identities=18% Similarity=0.083 Sum_probs=43.4
Q ss_pred CCccEEEEEEEeC--cEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeeccc
Q psy7014 218 KRGGYTVRVGKNG--QQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAG 295 (500)
Q Consensus 218 g~WwH~V~v~r~~--~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~ 295 (500)
++| |+|.+..++ ..+.|.||+......... ......++.||..... .......|.|.|.+++|-++
T Consensus 61 ~~W-~hva~v~d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~~~~~-------~~~~~~~f~G~Idevriy~~ 128 (133)
T smart00560 61 GVW-VHLAGVYDGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGGRILL-------GGAGGENFSGRLDEVRVYNR 128 (133)
T ss_pred CCE-EEEEEEEECCCCeEEEEECCEEccccccC----CcccCCceEEeeeccC-------CCCCCCCceEEeeEEEEecc
Confidence 689 999999998 789999999653322111 1233457888842100 01123579999999998654
Q ss_pred c
Q psy7014 296 N 296 (500)
Q Consensus 296 ~ 296 (500)
.
T Consensus 129 a 129 (133)
T smart00560 129 A 129 (133)
T ss_pred c
Confidence 3
No 54
>KOG3509|consensus
Probab=90.35 E-value=3.5 Score=48.02 Aligned_cols=170 Identities=20% Similarity=0.239 Sum_probs=83.5
Q ss_pred CCccEEEEEEEeC-cEEEEEEcCCcceeeeCCCc-------cccccCCCceEEcccccccCcCCCCCcCCCCCccc----
Q psy7014 218 KRGGYTVRVGKNG-QQCWLMVDNMGNVTSKSPGR-------LTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSG---- 285 (500)
Q Consensus 218 g~WwH~V~v~r~~-~~~~L~VD~~~~~~~~~~~~-------~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~G---- 285 (500)
+.| |++.-.+.+ ..+.+.+++...+...+... ...++++...+.++.....-...........+++|
T Consensus 654 ~~a-~~~~~~~~~~~~~~m~~a~~~~~l~~st~~~~~~p~~~~~~~~~ga~~~~~g~~~~~~~~~~~C~c~~g~~G~~ce 732 (964)
T KOG3509|consen 654 GSA-HRVDGIRARGQHILMTVADTTTVLIKSTVSTDCTPSECSSANLEGALCYGGGKTDIIAAEVEQCQCPKGLVGTSCE 732 (964)
T ss_pred ccc-ccccceecCCceeccccccccceeeeecccCCCChHHhhhhccCcccccCCCCCchhhhhccccccCccccCcccc
Confidence 467 999988887 55667777776666655432 33444555555555432211222233445678999
Q ss_pred -cccceeecccccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceee-cCCCCCCCccccccccc----cc
Q psy7014 286 -CIFDVELSAGNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSC-LCADGWFGPLCASRYNL----CD 359 (500)
Q Consensus 286 -CIr~v~ing~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C-~C~~Gy~G~~C~~~i~~----C~ 359 (500)
|.....+....-..+.....+..+..+..|..+.=.. ..|........| .|..|+.|..=...... |.
T Consensus 733 ~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~~~~~n------t~~q~~~~~~~~~~~~~g~~~da~~g~~~D~~p~~~ 806 (964)
T KOG3509|consen 733 DCAEGYTLSTTGGLYPGLCEDCECNSHISQCEDDLGYN------TDCQNNTEGDRCELCSPGTYGDARRGTPEDCRPATA 806 (964)
T ss_pred cccccccccccCCcCcccCcccccCCCccccccccccc------ccccccCccceeeecCCCccccCccCCcccCCccch
Confidence 7776666542111111112222222223332211001 234445556666 47788765432211111 11
Q ss_pred CCCccCCCCC-EEeeC-CCCeee-eCCCCCCCCCcCCCC
Q psy7014 360 STRHNCSFGA-TCVPL-THSYEC-DCPPGRTGKFCEKDE 395 (500)
Q Consensus 360 ~~p~pC~ngg-~C~~~-~~g~~C-~C~~G~~G~~Ce~~~ 395 (500)
.. .+|.-+. .+... ..++.| .|+.+++|.+|+...
T Consensus 807 l~-~~~~~~~r~~l~~~~~~~~~~~~p~~~~g~~~~~~~ 844 (964)
T KOG3509|consen 807 LT-IQCSCNNRSPLSCDGFGPGCLLCPHNTEGTTCERVK 844 (964)
T ss_pred hh-hhhhhcccCccccccCCCCcccCCCCccccchhhhc
Confidence 11 1222111 22222 245567 489999999998863
No 55
>KOG1226|consensus
Probab=89.44 E-value=0.76 Score=51.43 Aligned_cols=43 Identities=28% Similarity=0.590 Sum_probs=25.0
Q ss_pred CcccccccccccCC-CccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCC
Q psy7014 348 GPLCASRYNLCDST-RHNCSFGATCVPLTHSYECDCPPGRTGKFCEKD 394 (500)
Q Consensus 348 G~~C~~~i~~C~~~-p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~ 394 (500)
|+.|+.+--.|... ...|...|.|.=. .|.|.+||+|..|+=.
T Consensus 539 G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~ 582 (783)
T KOG1226|consen 539 GKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCP 582 (783)
T ss_pred eeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence 44444443334332 1356666767655 6778888877776654
No 56
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.66 E-value=0.25 Score=33.77 Aligned_cols=27 Identities=30% Similarity=0.890 Sum_probs=21.0
Q ss_pred CCCCCCCEEeeCCCceeecCCCCCCCc
Q psy7014 323 HTCSHGGACMNHGATFSCLCADGWFGP 349 (500)
Q Consensus 323 ~pC~ngg~Ci~~~~~~~C~C~~Gy~G~ 349 (500)
..|...+.|++..+.|.|.|.+||.|.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 358888999999999999999999863
No 57
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=88.47 E-value=0.34 Score=33.10 Aligned_cols=18 Identities=50% Similarity=1.198 Sum_probs=14.9
Q ss_pred EEeeCCCCeeeeCCCCCC
Q psy7014 370 TCVPLTHSYECDCPPGRT 387 (500)
Q Consensus 370 ~C~~~~~g~~C~C~~G~~ 387 (500)
.|++.+++|.|.|+.||.
T Consensus 11 ~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEEEETTSEEEE-STTEE
T ss_pred CCccCCCceEeECCCCCE
Confidence 788889999999999874
No 58
>KOG4260|consensus
Probab=87.76 E-value=0.51 Score=46.57 Aligned_cols=48 Identities=33% Similarity=0.762 Sum_probs=37.3
Q ss_pred CCCCCCCcccccccccccCC-CccCCCCCEEeeC---CCCeeeeCCCCCCCCCcCC
Q psy7014 342 CADGWFGPLCASRYNLCDST-RHNCSFGATCVPL---THSYECDCPPGRTGKFCEK 393 (500)
Q Consensus 342 C~~Gy~G~~C~~~i~~C~~~-p~pC~ngg~C~~~---~~g~~C~C~~G~~G~~Ce~ 393 (500)
|++|..|+.|. .|... ..||...|.|.-. .++-.|.|.+||.|+.|..
T Consensus 132 Cp~gtyGpdCl----~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~ 183 (350)
T KOG4260|consen 132 CPDGTYGPDCL----QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY 183 (350)
T ss_pred cCCCCcCCccc----cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc
Confidence 78899999995 34222 1389999999843 5778999999999999865
No 59
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=87.52 E-value=0.53 Score=45.56 Aligned_cols=32 Identities=31% Similarity=0.730 Sum_probs=23.9
Q ss_pred cccccCCCccCCCCCEEeeCCCCeeeeCCCCCCC
Q psy7014 355 YNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTG 388 (500)
Q Consensus 355 i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G 388 (500)
.++|...+++|.. .|.+..++|.|.|+.||+.
T Consensus 187 ~~~C~~~~~~c~~--~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 187 PDLCATLSHVCQQ--VCISTPGSYLCACTEGYAL 218 (224)
T ss_pred chhhcCCCCCccc--eEEcCCCCEEeECCCCccC
Confidence 3555554456763 7999999999999999874
No 60
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=86.51 E-value=0.45 Score=41.67 Aligned_cols=31 Identities=29% Similarity=0.747 Sum_probs=25.7
Q ss_pred CCCCCCCEEee--CCCceeecCCCCCCCcccccc
Q psy7014 323 HTCSHGGACMN--HGATFSCLCADGWFGPLCASR 354 (500)
Q Consensus 323 ~pC~ngg~Ci~--~~~~~~C~C~~Gy~G~~C~~~ 354 (500)
+-|.|| +|.- +.+.+.|.|..||+|.+|+..
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccccccce
Confidence 468897 8964 457899999999999999853
No 61
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=85.63 E-value=0.75 Score=28.52 Aligned_cols=11 Identities=64% Similarity=1.615 Sum_probs=6.3
Q ss_pred CeeeeCCCCCC
Q psy7014 377 SYECDCPPGRT 387 (500)
Q Consensus 377 g~~C~C~~G~~ 387 (500)
+|.|.|++||.
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 35666666653
No 62
>PHA02887 EGF-like protein; Provisional
Probab=85.43 E-value=0.65 Score=40.03 Aligned_cols=22 Identities=5% Similarity=-0.292 Sum_probs=19.6
Q ss_pred ccCCCceeecCCCCCCCCCCCC
Q psy7014 39 FDTNQPINQLLSIIFTNFLPPD 60 (500)
Q Consensus 39 ~~~~~~~c~c~~~~~G~~C~~~ 60 (500)
.+-++|.|.|+.||+|.+|+..
T Consensus 103 ~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 103 IDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred ccCCCceeECCCCcccCCCCcc
Confidence 5677999999999999999974
No 63
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=84.72 E-value=0.3 Score=37.81 Aligned_cols=41 Identities=32% Similarity=0.891 Sum_probs=20.9
Q ss_pred eeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeee------eCCCCCCCCCc
Q psy7014 338 FSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYEC------DCPPGRTGKFC 391 (500)
Q Consensus 338 ~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C------~C~~G~~G~~C 391 (500)
++-.|...|.|+.|.. .|.+. .+..+.|.| .|.+||+|+.|
T Consensus 17 ~rv~C~~nyyG~~C~~---~C~~~----------~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSK---FCKPR----------DDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-E---E---E----------EETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccC---CcCCC----------cCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 4567889999999973 23221 012356666 48999999887
No 64
>KOG1836|consensus
Probab=75.85 E-value=1.4 Score=54.34 Aligned_cols=80 Identities=19% Similarity=0.173 Sum_probs=56.5
Q ss_pred eeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCcccccccee
Q psy7014 212 TISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVE 291 (500)
Q Consensus 212 ~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ 291 (500)
.++.=++.| |.|.+.+....+.+.+|. . + ..+......+...++++||+|+..... ....-.+|.||| ++
T Consensus 1611 ~~~~~~~~~-~~~~~~~~~~v~~~~~~~-~-~--~~~~~~~~~~~~~p~~~~~~~~s~~~~---~~~~~~~~~~~~--~~ 1680 (1705)
T KOG1836|consen 1611 IVSLLPGGC-HSVTSSTDPGVVQLEDDT-Y-T--VGEIPPPPADTQEPIKLGGYPSSLTTL---RIAVLKSFTGCI--FV 1680 (1705)
T ss_pred hhhhcCCcc-eeeeeecCCccccccccc-e-e--cccCCCCchhccCCcccCCccccccce---eeecccccccce--EE
Confidence 334457789 999999999999998888 2 1 122233456778899999998643322 233456899999 88
Q ss_pred eccccccccc
Q psy7014 292 LSAGNVGINL 301 (500)
Q Consensus 292 ing~~~~l~~ 301 (500)
+++..+++..
T Consensus 1681 ~~~~~~~~~~ 1690 (1705)
T KOG1836|consen 1681 VMGIRVDVTL 1690 (1705)
T ss_pred ecCCCCcHHH
Confidence 8887777654
No 65
>KOG3546|consensus
Probab=74.35 E-value=11 Score=41.50 Aligned_cols=66 Identities=14% Similarity=0.063 Sum_probs=45.0
Q ss_pred CCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCC--CceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014 217 SKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTK--PMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS 293 (500)
Q Consensus 217 dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~--~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in 293 (500)
.++| .++.+...+..+.|.||-.+-.+.........|.+. .-||+|-.-. .-...|.|-|.++.+.
T Consensus 156 ~~~w-~~~a~~v~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~----------~~~~~f~g~~~~l~v~ 223 (1167)
T KOG3546|consen 156 VGQW-THLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGG----------ADPDKFQGVIAELKVR 223 (1167)
T ss_pred hchh-hheeeeecCceEEEEechHHhcccchhccccceeecCCcceEEeccCC----------CChHhhhhhhhheeec
Confidence 4689 999999999999999997654443333333445554 3588875421 1224699999998885
No 66
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=72.40 E-value=2.3 Score=29.08 Aligned_cols=18 Identities=28% Similarity=0.907 Sum_probs=15.5
Q ss_pred EEeeCCCceeecCCCCCC
Q psy7014 330 ACMNHGATFSCLCADGWF 347 (500)
Q Consensus 330 ~Ci~~~~~~~C~C~~Gy~ 347 (500)
.|++..+.|+|.|+.||.
T Consensus 11 ~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEEEETTSEEEE-STTEE
T ss_pred CCccCCCceEeECCCCCE
Confidence 788889999999999996
No 67
>KOG1834|consensus
Probab=70.38 E-value=75 Score=35.59 Aligned_cols=147 Identities=14% Similarity=0.126 Sum_probs=83.3
Q ss_pred CceEEEEEEEEeeCC------CCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhc
Q psy7014 87 IHHCFELKFRFVPNS------FDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLR 160 (500)
Q Consensus 87 ~~~~~~i~~~Frt~~------~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~ 160 (500)
....+.|+|..|--. .+. --||-..++ .+-+..+.+|++..=|+.|.++-..|. ++..|
T Consensus 364 l~dhFTlSfwMkHg~~p~~~~~ek-etIlCnsdk--~emnrhHyslyvh~Crl~fllr~d~~~------------~~~fR 428 (952)
T KOG1834|consen 364 LPDHFTLSFWMKHGPGPKDEQSEK-ETILCNSDK--TEMNRHHYSLYVHGCRLEFLLRRDAGA------------TSDFR 428 (952)
T ss_pred CCCceEEEEeeecCCCCccccccc-eeEEecccc--cccccceeEEEEeccEEEEEEccCccc------------ccccc
Confidence 445677777765221 011 235555543 244567899999999999988775532 11222
Q ss_pred cccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCC
Q psy7014 161 SAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNM 240 (500)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~ 240 (500)
+| +|.| +.-.+.|..| |.-.+....-.++|.|||.
T Consensus 429 pa----------------ef~W----------------------------kl~qVCD~EW-H~Y~ln~efp~VtlyvDG~ 463 (952)
T KOG1834|consen 429 PA----------------EFHW----------------------------KLPQVCDNEW-HHYVLNVEFPDVTLYVDGK 463 (952)
T ss_pred ch----------------heec----------------------------cchhhhhhhh-heeEEeecCceEEEEEcCc
Confidence 22 1211 1124578899 9999999999999999996
Q ss_pred ccee--eeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeecccc
Q psy7014 241 GNVT--SKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGN 296 (500)
Q Consensus 241 ~~~~--~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~ 296 (500)
.-.. ..-......-.+.+.|.||.-=.... ........-|+|=+..+.+..+.
T Consensus 464 Sfep~~i~ddwplHpsk~~tqLvVGACW~g~~---~~~l~~aqfFrG~LasltlrsGk 518 (952)
T KOG1834|consen 464 SFEPPLITDDWPLHPSKIETQLVVGACWQGRQ---QKPLKLAQFFRGQLASLTLRSGK 518 (952)
T ss_pred ccCCceeccCCccCcccccceeEEeeeccCcc---ccchhHHHHhhcccceeEEeccc
Confidence 5221 11112222333566788885411110 01122345688888877775433
No 68
>KOG1836|consensus
Probab=67.84 E-value=4.7 Score=50.01 Aligned_cols=52 Identities=33% Similarity=0.717 Sum_probs=38.0
Q ss_pred cCCCCCCCcccccccccccCCCccCCCCCEEeeCC--CCeeee-CCCCCCCCCcCCC
Q psy7014 341 LCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLT--HSYECD-CPPGRTGKFCEKD 394 (500)
Q Consensus 341 ~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~--~g~~C~-C~~G~~G~~Ce~~ 394 (500)
.|..||.|..=......|.+ .||.+++.|.... ....|. |+++|+|++|+.-
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~--C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c 814 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQP--CPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEEC 814 (1705)
T ss_pred hhcCCCCCccccCCCCCCcc--CCCCCChhhcCcCcccceecCCCCCCCcccccccC
Confidence 36667766544333333665 4799999998654 678898 9999999999985
No 69
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=65.39 E-value=3.9 Score=34.82 Aligned_cols=39 Identities=31% Similarity=0.805 Sum_probs=24.4
Q ss_pred ccCCCccCCCCCEEeeCC-----CCeeeeCCC-------------CCCCCCcCCCCC
Q psy7014 358 CDSTRHNCSFGATCVPLT-----HSYECDCPP-------------GRTGKFCEKDES 396 (500)
Q Consensus 358 C~~~p~pC~ngg~C~~~~-----~g~~C~C~~-------------G~~G~~Ce~~~~ 396 (500)
|....+-|..+|.|+... .=|.|.|.+ .|.|..|++..-
T Consensus 8 C~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkDv 64 (103)
T PF12955_consen 8 CENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKDV 64 (103)
T ss_pred HHHhccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccccc
Confidence 333334566677776652 346777765 477888888743
No 70
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=64.69 E-value=6.9 Score=33.36 Aligned_cols=31 Identities=29% Similarity=0.608 Sum_probs=22.5
Q ss_pred ccccCCCccCCCCCEEeeCCCCeeeeCCCCCCC
Q psy7014 356 NLCDSTRHNCSFGATCVPLTHSYECDCPPGRTG 388 (500)
Q Consensus 356 ~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G 388 (500)
+.|... ..|+..|.|... ....|.|.+||.-
T Consensus 78 d~Cd~y-~~CG~~g~C~~~-~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVY-GFCGPNGICNSN-NSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCc-cccCCccEeCCC-CCCceECCCCcCC
Confidence 455544 479999999653 4567999999864
No 71
>KOG0994|consensus
Probab=63.83 E-value=6.5 Score=46.14 Aligned_cols=58 Identities=29% Similarity=0.577 Sum_probs=35.4
Q ss_pred CCceeec-CCCCCCCcccccccccccCCCccCCCC--------CEEeeC--CCCeeeeCCCCCCCCCcCCC
Q psy7014 335 GATFSCL-CADGWFGPLCASRYNLCDSTRHNCSFG--------ATCVPL--THSYECDCPPGRTGKFCEKD 394 (500)
Q Consensus 335 ~~~~~C~-C~~Gy~G~~C~~~i~~C~~~p~pC~ng--------g~C~~~--~~g~~C~C~~G~~G~~Ce~~ 394 (500)
...+.|+ |..||.|..---.-..|.+ .||..+ ..|... .....|.|..||+|.+|+.=
T Consensus 882 T~G~~CdrCl~GyyGdP~lg~g~~CrP--CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C 950 (1758)
T KOG0994|consen 882 TTGHSCDRCLDGYYGDPRLGSGIGCRP--CPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC 950 (1758)
T ss_pred ccccchhhhhccccCCcccCCCCCCCC--CCCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence 3456664 7777775432222234433 355433 245433 35788999999999999874
No 72
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=60.70 E-value=19 Score=31.13 Aligned_cols=46 Identities=13% Similarity=0.189 Sum_probs=28.3
Q ss_pred cceeeEEEEEeeCCCCc---EEEEcCCCCCCCeEEEEEE-CCEEEEEEEcCC
Q psy7014 445 LHEACIDLEIRPTKDKG---LLMYFGHPQKNSMMTLSLQ-GGVLELRVLMLG 492 (500)
Q Consensus 445 ~~~~~i~l~frT~~~~G---lLl~~~~~~~~dfi~l~l~-~G~l~~~~~~g~ 492 (500)
...++|++.||.....+ .+++ .....+.+.|.+. +|.+.+.+..++
T Consensus 21 ~~~fTi~~w~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~l~~~~~~~~ 70 (157)
T PF13385_consen 21 SGSFTISFWVKPDSPSSSQSFVFM--DSSGSGGFGLFINNNGRLRFYIGNGG 70 (157)
T ss_dssp GTEEEEEEEEEESS--SSEEEEEE--SSSSSEEEEEEEETTSEEEEEETTSE
T ss_pred CCCEEEEEEEEeCCCCCCceEEEE--ecCCCCEEEEEEECCCEEEEEEeCCC
Confidence 36788999999886433 4343 1112347777777 577777766553
No 73
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=57.97 E-value=5.8 Score=37.34 Aligned_cols=63 Identities=27% Similarity=0.734 Sum_probs=42.5
Q ss_pred CCCCCCCEEeeCCCceeecCCCCCC---CcccccccccccC---CCccCCCCCEEeeCC-----CCeeeeCCCCCC
Q psy7014 323 HTCSHGGACMNHGATFSCLCADGWF---GPLCASRYNLCDS---TRHNCSFGATCVPLT-----HSYECDCPPGRT 387 (500)
Q Consensus 323 ~pC~ngg~Ci~~~~~~~C~C~~Gy~---G~~C~~~i~~C~~---~p~pC~ngg~C~~~~-----~g~~C~C~~G~~ 387 (500)
..|.| |..++-.+.|.|.|..||. -..|+..+ .|.. .-.+|+.-+.|+... ..|.|.|-.||.
T Consensus 6 T~CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE~kv-~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~ 79 (197)
T PF06247_consen 6 TICKN-GYLIQMSNHFECKCNEGFVLKNENTCEEKV-ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYI 79 (197)
T ss_dssp ---BT-EEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred ccccC-CEEEEccCceEEEcCCCcEEccccccccce-ecCcccccCccccchhhhhcCCCcccceeEEEecccCce
Confidence 34555 4778888999999999997 45676554 4443 124899999998764 699999999985
No 74
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=56.31 E-value=5.7 Score=27.34 Aligned_cols=24 Identities=33% Similarity=0.660 Sum_probs=15.3
Q ss_pred cCCCCCEEeeCC-CCeeeeCCCCCC
Q psy7014 364 NCSFGATCVPLT-HSYECDCPPGRT 387 (500)
Q Consensus 364 pC~ngg~C~~~~-~g~~C~C~~G~~ 387 (500)
+|..++.|.... +.+.|.|-+||.
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEE
T ss_pred cCCCCcccEEcCCCCEEEEeeCCcc
Confidence 577777887765 778888888874
No 75
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=56.09 E-value=6.2 Score=27.14 Aligned_cols=28 Identities=21% Similarity=0.672 Sum_probs=19.9
Q ss_pred CCCCCCCCCCEEeeCC-CceeecCCCCCC
Q psy7014 320 CHNHTCSHGGACMNHG-ATFSCLCADGWF 347 (500)
Q Consensus 320 C~~~pC~ngg~Ci~~~-~~~~C~C~~Gy~ 347 (500)
|...+|..++.|.... +...|.|-.||.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 6667888999999876 789999999996
No 76
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal region to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=52.88 E-value=9.2 Score=27.56 Aligned_cols=22 Identities=45% Similarity=1.093 Sum_probs=15.9
Q ss_pred EEeeCCCCeeeeCCCCCCCCCcCC
Q psy7014 370 TCVPLTHSYECDCPPGRTGKFCEK 393 (500)
Q Consensus 370 ~C~~~~~g~~C~C~~G~~G~~Ce~ 393 (500)
.|.. ....|.|.++|+|+.|+.
T Consensus 12 ~C~~--~~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 12 TCDP--STGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp SEEE--TCEEESBSTTEESTTS-E
T ss_pred cccC--CCCEEeccccccCCcCcC
Confidence 4544 345899999999999985
No 77
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=51.53 E-value=3.7 Score=30.65 Aligned_cols=22 Identities=14% Similarity=-0.087 Sum_probs=14.2
Q ss_pred cCCCceeecCCCCCCCCCCCCC
Q psy7014 40 DTNQPINQLLSIIFTNFLPPDI 61 (500)
Q Consensus 40 ~~~~~~c~c~~~~~G~~C~~~~ 61 (500)
..|.|.|+|..-|+|+.|...+
T Consensus 32 ~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 32 ADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp ETTEE--EE-TTEESTTS-EE-
T ss_pred ccCCccccccCCcCCCCcccCC
Confidence 3456999999999999998754
No 78
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=45.09 E-value=21 Score=25.93 Aligned_cols=16 Identities=38% Similarity=1.167 Sum_probs=13.7
Q ss_pred eeeeCCCCCCCCCcCC
Q psy7014 378 YECDCPPGRTGKFCEK 393 (500)
Q Consensus 378 ~~C~C~~G~~G~~Ce~ 393 (500)
-.|.|.++++|..|+.
T Consensus 19 G~C~C~~~~~G~~C~~ 34 (50)
T cd00055 19 GQCECKPNTTGRRCDR 34 (50)
T ss_pred CEEeCCCcCCCCCCCC
Confidence 3789999999999984
No 79
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=44.47 E-value=95 Score=29.52 Aligned_cols=49 Identities=16% Similarity=0.310 Sum_probs=37.3
Q ss_pred cceeeEEEEEeeCCCCcE--EEEcCCC-CCCCeEEEEEECCEEEEEEEcCCC
Q psy7014 445 LHEACIDLEIRPTKDKGL--LMYFGHP-QKNSMMTLSLQGGVLELRVLMLGD 493 (500)
Q Consensus 445 ~~~~~i~l~frT~~~~Gl--Ll~~~~~-~~~dfi~l~l~~G~l~~~~~~g~~ 493 (500)
+...+|.++|++.+.+++ ||-++.. ..+.|+.|.+.++.+-+.+.-..+
T Consensus 32 L~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~ 83 (190)
T PF02973_consen 32 LEEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKG 83 (190)
T ss_dssp -SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTT
T ss_pred ccccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCC
Confidence 467799999999877774 6666554 357999999999998888876665
No 80
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=43.45 E-value=2.7e+02 Score=27.07 Aligned_cols=26 Identities=8% Similarity=0.059 Sum_probs=19.4
Q ss_pred eeCCCCccEEEEEEEeC--cEEEEEEcCC
Q psy7014 214 SAKSKRGGYTVRVGKNG--QQCWLMVDNM 240 (500)
Q Consensus 214 ~~~dg~WwH~V~v~r~~--~~~~L~VD~~ 240 (500)
.+.+|++ |+|.|.+++ +.+.+.|+..
T Consensus 159 ~l~~g~~-~~v~I~Y~~~~~~L~V~l~~~ 186 (236)
T cd06899 159 KLKSGKP-MQAWIDYDSSSKRLSVTLAYS 186 (236)
T ss_pred cccCCCe-EEEEEEEcCCCCEEEEEEEeC
Confidence 3578999 999999995 5666666554
No 81
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=43.07 E-value=21 Score=34.42 Aligned_cols=34 Identities=24% Similarity=0.781 Sum_probs=24.9
Q ss_pred cccC-CccCC--CCCCCCCCEEeeCCCceeecCCCCCCC
Q psy7014 313 GQCG-TSQCH--NHTCSHGGACMNHGATFSCLCADGWFG 348 (500)
Q Consensus 313 ~~C~-~~~C~--~~pC~ngg~Ci~~~~~~~C~C~~Gy~G 348 (500)
..|. .+.|. +.+|. ..|.+..+.|.|.|+.||+.
T Consensus 182 ~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 182 KICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred ccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence 3453 35665 34566 47999999999999999973
No 82
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=41.88 E-value=70 Score=29.82 Aligned_cols=45 Identities=20% Similarity=0.250 Sum_probs=34.2
Q ss_pred ceeeEEEEEeeC-CCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEc
Q psy7014 446 HEACIDLEIRPT-KDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLM 490 (500)
Q Consensus 446 ~~~~i~l~frT~-~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~ 490 (500)
..++|.+.||+. ...|.||-..+.+...++.|.|.++...+.|..
T Consensus 52 ~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~ 97 (184)
T smart00210 52 EDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRY 97 (184)
T ss_pred CCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEE
Confidence 567899999998 688888877665556799999987765555543
No 83
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=39.77 E-value=2.4e+02 Score=27.35 Aligned_cols=29 Identities=14% Similarity=0.151 Sum_probs=22.6
Q ss_pred ceeeeCCCCccEEEEEEEeC--cEEEEEEcCC
Q psy7014 211 NTISAKSKRGGYTVRVGKNG--QQCWLMVDNM 240 (500)
Q Consensus 211 ~~~~~~dg~WwH~V~v~r~~--~~~~L~VD~~ 240 (500)
....+.+|+| |+|.|.++. +.+.+.++..
T Consensus 160 ~~~~l~~g~~-~~v~I~Yd~~~~~L~V~l~~~ 190 (236)
T PF00139_consen 160 PSFSLSDGKW-HTVWIDYDASTKRLSVYLDDN 190 (236)
T ss_dssp EEHHHGTTSE-EEEEEEEETTTTEEEEEEEET
T ss_pred ccccccCCcE-EEEEEEEcCCccEEEEEEecc
Confidence 3456789999 999999998 5666666665
No 84
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=38.87 E-value=1.2e+02 Score=28.87 Aligned_cols=22 Identities=14% Similarity=0.031 Sum_probs=17.2
Q ss_pred CCCeEEEEEECCEEEEEEEcCC
Q psy7014 120 ITDHLAVSFIKGYVVLTWNLGS 141 (500)
Q Consensus 120 ~~df~~l~l~~G~l~~~~~~G~ 141 (500)
....++|.+.+|++.+.++.+.
T Consensus 112 ~~P~~~l~~~~~~l~~~~~~~~ 133 (224)
T PF14099_consen 112 GSPPFALRIKGGRLYLRVRGDE 133 (224)
T ss_dssp EEECEEEEEETTEEEEEEEEE-
T ss_pred CCCcEEEEEeCCEEEEEEEcCC
Confidence 4567899999999998877765
No 85
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=38.72 E-value=3.4e+02 Score=25.82 Aligned_cols=23 Identities=22% Similarity=0.180 Sum_probs=19.2
Q ss_pred CCccEEEEEEEe--CcEEEEEEcCCc
Q psy7014 218 KRGGYTVRVGKN--GQQCWLMVDNMG 241 (500)
Q Consensus 218 g~WwH~V~v~r~--~~~~~L~VD~~~ 241 (500)
|+| |+|+|.++ .+.+.+.++...
T Consensus 154 g~~-~~v~I~Y~~~~~~L~v~l~~~~ 178 (223)
T cd01951 154 GNE-HTVRITYDPTTNTLTVYLDNGS 178 (223)
T ss_pred CCE-EEEEEEEeCCCCEEEEEECCCC
Confidence 789 99999999 477888888764
No 86
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=32.36 E-value=21 Score=26.79 Aligned_cols=33 Identities=24% Similarity=0.523 Sum_probs=13.8
Q ss_pred CCCCCCCEEeeC----CCceeecCCCCCCCccccccc
Q psy7014 323 HTCSHGGACMNH----GATFSCLCADGWFGPLCASRY 355 (500)
Q Consensus 323 ~pC~ngg~Ci~~----~~~~~C~C~~Gy~G~~C~~~i 355 (500)
.+|..+|....+ .+...|.|-.-|.|++|+..+
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence 345555554322 234567777777777776544
No 87
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=32.31 E-value=62 Score=23.41 Aligned_cols=20 Identities=45% Similarity=1.112 Sum_probs=14.7
Q ss_pred cCCCCCEEeeCCCCeeeeCCCCCC
Q psy7014 364 NCSFGATCVPLTHSYECDCPPGRT 387 (500)
Q Consensus 364 pC~ngg~C~~~~~g~~C~C~~G~~ 387 (500)
.|..+..|++. .|.|+.||.
T Consensus 27 qC~~~s~C~~g----~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNG----RCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcCC----EeECCCCCE
Confidence 47777788553 899999863
No 88
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=29.10 E-value=51 Score=23.48 Aligned_cols=18 Identities=0% Similarity=-0.348 Sum_probs=15.6
Q ss_pred CceeecCCCCCCCCCCCC
Q psy7014 43 QPINQLLSIIFTNFLPPD 60 (500)
Q Consensus 43 ~~~c~c~~~~~G~~C~~~ 60 (500)
.-.|.|+.+++|+.|++-
T Consensus 17 ~G~C~C~~~~~G~~C~~C 34 (46)
T smart00180 17 TGQCECKPNVTGRRCDRC 34 (46)
T ss_pred CCEEECCCCCCCCCCCcC
Confidence 457999999999999964
No 89
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=28.88 E-value=42 Score=28.63 Aligned_cols=23 Identities=30% Similarity=0.846 Sum_probs=17.8
Q ss_pred CCCCCCCCEEeeCC-----CceeecCCC
Q psy7014 322 NHTCSHGGACMNHG-----ATFSCLCAD 344 (500)
Q Consensus 322 ~~pC~ngg~Ci~~~-----~~~~C~C~~ 344 (500)
.+-|..+|.|+... +=|.|.|.+
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~ 39 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKP 39 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeec
Confidence 35699999999873 448999965
No 90
>PF11250 DUF3049: Protein of unknown function (DUF3049); InterPro: IPR021410 This eukaryotic family of proteins has no known function.
Probab=24.50 E-value=2e+02 Score=21.68 Aligned_cols=38 Identities=18% Similarity=0.349 Sum_probs=27.7
Q ss_pred EEEEeeCCCCcEEEEcC-CCCCCCeEEEEEECCEEEEEE
Q psy7014 451 DLEIRPTKDKGLLMYFG-HPQKNSMMTLSLQGGVLELRV 488 (500)
Q Consensus 451 ~l~frT~~~~GlLl~~~-~~~~~dfi~l~l~~G~l~~~~ 488 (500)
.+.+|+...||=|.-.. .-...+++..+=.||+|.+.+
T Consensus 17 ~~~~r~~r~dGRLvl~~v~v~~~~~~~A~R~~GRL~L~~ 55 (56)
T PF11250_consen 17 SVLMRPHREDGRLVLEEVRVPSHEYFHAEREDGRLRLQF 55 (56)
T ss_pred cEEEEEEccCCEEEEEEEEcCCcceEEEEccCCEEEEEe
Confidence 46788888888555442 222367999888999999875
No 91
>PF14607 GxDLY: N-terminus of Esterase_SGNH_hydro-type
Probab=23.81 E-value=2.4e+02 Score=25.71 Aligned_cols=13 Identities=15% Similarity=0.143 Sum_probs=10.1
Q ss_pred CCCCccEEEEEEEe
Q psy7014 216 KSKRGGYTVRVGKN 229 (500)
Q Consensus 216 ~dg~WwH~V~v~r~ 229 (500)
+||+| +-+.+.+-
T Consensus 91 ~~G~W-~~~~~g~p 103 (147)
T PF14607_consen 91 DDGKW-RFAGVGRP 103 (147)
T ss_pred CCCCE-EEEEeccc
Confidence 38999 98887764
No 92
>KOG1218|consensus
Probab=23.27 E-value=1e+02 Score=30.72 Aligned_cols=56 Identities=30% Similarity=0.634 Sum_probs=37.7
Q ss_pred eeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeee------eCCCCCCCCCcCCC
Q psy7014 338 FSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYEC------DCPPGRTGKFCEKD 394 (500)
Q Consensus 338 ~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C------~C~~G~~G~~Ce~~ 394 (500)
-.|.|+.||.|..|......|... ..|.+++.|......-.| .|..++.|..|...
T Consensus 162 ~~c~c~~g~~g~~~~~~~~~c~~~-~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (316)
T KOG1218|consen 162 GICTCQPGFVGVFCVESCSGCSPL-TACENGAKCNRSTGSCLCYPGPSGACKGGFHGCACLRM 223 (316)
T ss_pred CceeccCCcccccccccCCCcCCC-cccCCCCeeeccccccccCCCCcccccCCccCCcCccc
Confidence 467799999999998776656654 478888899876542222 34444566666654
No 93
>KOG1218|consensus
Probab=22.81 E-value=1.7e+02 Score=29.18 Aligned_cols=53 Identities=30% Similarity=0.744 Sum_probs=31.5
Q ss_pred eecCCCCCCCccccc---ccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCCC
Q psy7014 339 SCLCADGWFGPLCAS---RYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDES 396 (500)
Q Consensus 339 ~C~C~~Gy~G~~C~~---~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~~ 396 (500)
.|.|..++.+..|.. ....|... |.+...| ....-.|.|++||.|..|+....
T Consensus 125 ~c~~~~~~~~~~C~~~~~~g~~C~~~---c~~~~~~--~~~~~~c~c~~g~~g~~~~~~~~ 180 (316)
T KOG1218|consen 125 ECRCGGGYIGEQCGEENLVGLKCQRD---CQCTGGC--DCKNGICTCQPGFVGVFCVESCS 180 (316)
T ss_pred ceecCCcCccccccccCCCCCCccCC---CCCcccc--CCCCCceeccCCcccccccccCC
Confidence 566777777777765 12334332 3111111 12344788999999999988753
No 94
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=22.09 E-value=2.5e+02 Score=26.45 Aligned_cols=46 Identities=17% Similarity=0.136 Sum_probs=26.0
Q ss_pred ccceeeEEEEEeeCC--CCcEEE-EcCCCCCCCeEEEEEECCEEEEEEE
Q psy7014 444 HLHEACIDLEIRPTK--DKGLLM-YFGHPQKNSMMTLSLQGGVLELRVL 489 (500)
Q Consensus 444 ~~~~~~i~l~frT~~--~~GlLl-~~~~~~~~dfi~l~l~~G~l~~~~~ 489 (500)
.+..+++.+.+|+.. ..+.|| |.+..+.++++...-.+|.+.|.++
T Consensus 29 ~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i~ 77 (201)
T cd00152 29 PLQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYIG 77 (201)
T ss_pred ChhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEEc
Confidence 346788888888864 444455 5544333334333223567777664
No 95
>PF07622 DUF1583: Protein of unknown function (DUF1583); InterPro: IPR011475 Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM.
Probab=21.91 E-value=4.9e+02 Score=27.60 Aligned_cols=33 Identities=15% Similarity=0.142 Sum_probs=26.9
Q ss_pred eeeCCCCccEEEEEEEeCcEEEEEEcCCcceeee
Q psy7014 213 ISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSK 246 (500)
Q Consensus 213 ~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~ 246 (500)
+.+++..| .+|++.+.+..+.|.+++.......
T Consensus 85 ~~l~~~~w-N~v~l~~~g~~v~l~LN~~~i~~~~ 117 (399)
T PF07622_consen 85 LPLKVNAW-NRVRLQRRGDKVQLHLNGQLIYERP 117 (399)
T ss_pred CCCCcccc-ceEEEEEeCCEEEEEeCCceeEecc
Confidence 34566789 9999999999999999998754443
Done!