Query psy5026
Match_columns 293
No_of_seqs 133 out of 1433
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 23:55:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5026.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5026hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3514|consensus 100.0 1.8E-33 3.8E-38 262.6 23.7 274 7-292 274-588 (1591)
2 KOG3514|consensus 100.0 6.5E-33 1.4E-37 258.8 22.3 282 4-292 457-985 (1591)
3 KOG3516|consensus 100.0 7.3E-30 1.6E-34 242.2 27.1 273 7-292 201-509 (1306)
4 KOG4289|consensus 100.0 9.2E-28 2E-32 229.3 20.8 253 7-276 1338-1663(2531)
5 KOG3516|consensus 99.9 2.3E-22 5.1E-27 191.6 25.8 270 4-292 807-1159(1306)
6 PF00054 Laminin_G_1: Laminin 99.9 2.9E-22 6.3E-27 154.1 15.9 117 174-292 1-118 (131)
7 smart00282 LamG Laminin G doma 99.9 2.6E-20 5.6E-25 144.2 16.6 123 167-292 2-125 (135)
8 PF00054 Laminin_G_1: Laminin 99.8 2.9E-20 6.3E-25 143.0 13.6 108 14-123 1-131 (131)
9 cd00110 LamG Laminin G domain; 99.8 1.7E-19 3.7E-24 142.0 17.8 136 153-292 8-143 (151)
10 smart00282 LamG Laminin G doma 99.8 8.3E-18 1.8E-22 130.1 14.3 111 7-120 2-135 (135)
11 PF02210 Laminin_G_2: Laminin 99.8 1.3E-17 2.9E-22 127.3 15.0 116 174-292 1-118 (128)
12 PF02210 Laminin_G_2: Laminin 99.7 3.5E-16 7.5E-21 119.4 12.2 104 14-120 1-128 (128)
13 cd00110 LamG Laminin G domain; 99.7 2.2E-15 4.7E-20 118.5 13.7 111 5-118 19-151 (151)
14 KOG4289|consensus 99.5 5.2E-13 1.1E-17 129.7 13.9 120 153-277 1326-1460(2531)
15 KOG1219|consensus 99.3 9.8E-12 2.1E-16 125.1 13.2 133 153-292 3693-3826(4289)
16 KOG1219|consensus 99.1 1.3E-09 2.8E-14 110.5 12.5 136 8-149 3707-3877(4289)
17 smart00210 TSPN Thrombospondin 98.7 1.1E-06 2.5E-11 71.4 14.3 86 166-253 52-142 (184)
18 smart00159 PTX Pentraxin / C-r 97.9 0.0011 2.3E-08 54.9 15.8 131 153-292 18-153 (206)
19 cd00152 PTX Pentraxins are pla 97.8 0.0024 5.2E-08 52.6 16.1 117 153-276 18-139 (201)
20 PF13385 Laminin_G_3: Concanav 97.8 0.00042 9.2E-09 53.8 11.1 117 153-275 10-131 (157)
21 smart00210 TSPN Thrombospondin 97.5 0.0031 6.7E-08 51.2 12.7 105 3-118 49-181 (184)
22 PF00354 Pentaxin: Pentaxin fa 97.5 0.004 8.7E-08 51.0 13.0 132 153-293 12-148 (195)
23 PF02973 Sialidase: Sialidase, 96.9 0.041 8.9E-07 44.5 12.9 109 167-276 34-155 (190)
24 smart00560 LamGL LamG-like jel 95.1 1 2.3E-05 34.2 12.6 42 228-273 61-104 (133)
25 PF06439 DUF1080: Domain of Un 94.3 0.07 1.5E-06 42.9 4.6 91 166-256 53-155 (185)
26 KOG1834|consensus 93.9 0.78 1.7E-05 43.6 10.8 112 166-277 366-493 (952)
27 PF02973 Sialidase: Sialidase, 93.6 1.3 2.8E-05 35.9 10.3 110 6-122 33-177 (190)
28 PF13385 Laminin_G_3: Concanav 92.8 1.8 3.9E-05 32.9 10.2 105 5-122 21-151 (157)
29 KOG3509|consensus 91.5 0.3 6.4E-06 48.9 5.0 106 18-126 266-392 (964)
30 KOG3509|consensus 91.2 0.24 5.1E-06 49.6 4.0 110 178-292 266-376 (964)
31 smart00159 PTX Pentraxin / C-r 88.3 13 0.00028 30.6 11.7 114 5-122 30-165 (206)
32 PF14099 Polysacc_lyase: Polys 86.2 11 0.00025 31.2 10.4 60 194-253 113-182 (224)
33 KOG3546|consensus 85.6 5.1 0.00011 38.3 8.4 108 166-274 87-205 (1167)
34 cd00152 PTX Pentraxins are pla 83.9 22 0.00048 29.0 11.7 114 4-122 29-165 (201)
35 PF00354 Pentaxin: Pentaxin fa 83.1 19 0.00041 29.4 10.1 113 6-122 25-159 (195)
36 PF02057 Glyco_hydro_59: Glyco 73.3 49 0.0011 32.4 10.8 83 167-251 543-634 (669)
37 smart00560 LamGL LamG-like jel 72.0 40 0.00086 25.3 10.9 64 53-122 61-130 (133)
38 cd01951 lectin_L-type legume l 71.1 46 0.001 27.5 9.3 23 228-250 154-178 (223)
39 KOG1834|consensus 64.8 1.1E+02 0.0023 30.0 10.9 117 3-120 363-517 (952)
40 PF00139 Lectin_legB: Legume l 62.7 38 0.00083 28.4 7.3 70 169-249 118-190 (236)
41 cd06899 lectin_legume_LecRK_Ar 61.6 97 0.0021 26.0 11.5 25 225-249 160-186 (236)
42 KOG1836|consensus 56.5 3.2 6.9E-05 44.7 -0.4 103 169-278 1559-1663(1705)
43 PF06439 DUF1080: Domain of Un 47.3 32 0.0007 27.2 4.2 70 4-73 51-147 (185)
44 PF07622 DUF1583: Protein of u 44.6 36 0.00077 30.8 4.2 32 221-252 83-114 (399)
45 PF09264 Sial-lect-inser: Vibr 38.8 2.1E+02 0.0046 23.2 8.6 80 167-251 33-117 (198)
46 PF09191 CD4-extracel: CD4, ex 37.6 1.4E+02 0.0031 21.6 5.5 47 167-213 16-62 (108)
47 PF11025 GP40: Glycoprotein GP 36.3 1.9E+02 0.0042 22.0 7.0 27 224-250 46-72 (165)
48 cd06903 lectin_EMP46_EMP47 EMP 34.3 2.7E+02 0.0059 23.1 10.3 24 226-249 149-174 (215)
49 cd02178 GH16_beta_agarase Beta 32.2 1.4E+02 0.003 25.5 5.9 27 227-253 178-205 (258)
50 PTZ00334 trans-sialidase; Prov 30.7 90 0.002 31.3 4.9 51 225-276 640-692 (780)
51 PF07081 DUF1349: Protein of u 30.5 2.8E+02 0.0062 22.2 10.0 76 167-249 53-136 (183)
52 PF05910 DUF868: Plant protein 29.5 1.4E+02 0.003 25.8 5.3 49 226-276 153-208 (274)
53 KOG1836|consensus 21.7 85 0.0018 34.5 3.2 69 54-125 1618-1689(1705)
No 1
>KOG3514|consensus
Probab=100.00 E-value=1.8e-33 Score=262.58 Aligned_cols=274 Identities=23% Similarity=0.445 Sum_probs=210.5
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC------------------CCCCeEEEEeccEEE
Q psy5026 7 AWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR------------------PYADITVHRTVRTLI 68 (293)
Q Consensus 7 ~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g------------------~w~~~~~~~~~~~v~ 68 (293)
+-.|+|.|||.+++|+|||.+... ||+-|.|++|-+.+.++++ +||.+.+.|+...+.
T Consensus 274 ~d~itl~FrT~q~ngllfytG~~~----dYlnlaL~dGaV~l~~~l~~g~~e~~~~p~~~rfdD~~WH~V~v~R~~~m~t 349 (1591)
T KOG3514|consen 274 KDNITLTFRTVQGNGLLFYTGDEK----DYLNLALQDGAVSLSSKLDGGDAEIIRMPNSFRFDDDSWHTVIVERSLQMMT 349 (1591)
T ss_pred ccceEEEEEEecCceeEEEccCCc----ceeeEeecCCcEEEEEecCCccceeEEccccccccCCcceEEEEEeeeEEEE
Confidence 347899999999999999999875 9999999999998876665 799999998877777
Q ss_pred ecccCC-------CCceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeeeeccCCCCCcccccc---c
Q psy5026 69 LPYTVP-------SGLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYNFNVSPGGDSLKGID---V 138 (293)
Q Consensus 69 l~~~g~-------~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~~~~~~~~~~~~~~~---~ 138 (293)
+..+|. .+.+..|..+.-+|+||.|+...++. . ..+|.||+++|.+....+.+....++. +|.. +
T Consensus 350 ~~VDg~~t~~~~~a~~~tmlsss~~fyvgg~~~~~~l~g-s--rVsF~GClkkV~y~~d~~rl~L~~LAk--~g~~~~k~ 424 (1591)
T KOG3514|consen 350 LIVDGRRTEIRQYAPELTMLSSSDFFYVGGSPNTADLPG-S--RVSFMGCLKKVVYKNDDTRLELSRLAK--QGDSKMKT 424 (1591)
T ss_pred EEEccEEecccccccceeEeeccceEEecCCCCccccCC-C--ceeeeeeeeeeEeccCceeehhhHHhh--cCCceeEe
Confidence 665553 46777888888899999999877643 2 345999999999977654433322221 1211 1
Q ss_pred c-c-ccccCCCC---cCCc----ceeEecCCccceeeEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEE
Q psy5026 139 A-L-LRPATHYQ---VSSI----PDQVYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFT 209 (293)
Q Consensus 139 ~-~-C~pC~~~~---~~~f----s~~~~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~ 209 (293)
. . --.|.+.. +.+| ||+.+|.+... +.-+|+|.|||.+++|||+|.........+|++++|-||++-+.
T Consensus 425 ~G~l~y~C~n~~~~DpvtFtt~es~l~LP~Wnt~--~~gSiSf~FRTtepnGlil~~~g~~~~~~d~~A~ELldghlyl~ 502 (1591)
T KOG3514|consen 425 EGDLSYSCENVAQLDPVTFTTPESYLTLPRWNTK--KSGSISFDFRTTEPNGLILFHGGPQANATDYFAIELLDGHLYLL 502 (1591)
T ss_pred eceEEEecCCCCccCceeeecccceeeccccccC--CcceeEEEEeecCCCceEEEccCcccccccEEEEEEeCCeEEEE
Confidence 0 0 01455554 2467 89999987443 45699999999999999999876544448999999999999999
Q ss_pred EECCCceEEEE-ecCCCCCCCeEEEEEEEeCCEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCC---C
Q psy5026 210 FDLGTGAATLR-SSNPISLGEWRKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKV---K 285 (293)
Q Consensus 210 ~~~g~~~~~~~-~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~---~ 285 (293)
+++|++...++ +..+++||+||+|.+.|.++..+++||.... ....|+...-|++++.+|+|-.|.....|+.+ .
T Consensus 503 ldlGSG~iklras~rkv~DGeWhhv~l~R~gR~gsvsVd~~~~-df~tpG~s~iL~ld~~mylG~~~n~l~~P~~vWta~ 581 (1591)
T KOG3514|consen 503 LDLGSGVIKLRASSRKVNDGEWHHVDLQRDGRTGSVSVDAIKT-DFSTPGDSEILDLDDPMYLGEVPNNLVYPSEVWTAA 581 (1591)
T ss_pred EecCCceEEeeeecccccCCceEEEEeeccCccceEEEeeeec-CccCCCcceeEeecCceeeccCCCCccCcHHHHHHH
Confidence 99999988887 5788999999999999999999999998743 44566777778999999999666554445443 2
Q ss_pred cCCCcee
Q psy5026 286 IKSSFIG 292 (293)
Q Consensus 286 ~~~~F~G 292 (293)
.++||+|
T Consensus 582 L~~GyvG 588 (1591)
T KOG3514|consen 582 LRKGYVG 588 (1591)
T ss_pred Hhccchh
Confidence 3566665
No 2
>KOG3514|consensus
Probab=100.00 E-value=6.5e-33 Score=258.82 Aligned_cols=282 Identities=21% Similarity=0.388 Sum_probs=201.0
Q ss_pred CCceeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC----------------CCCCeEEEEeccEE
Q psy5026 4 SPQAWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR----------------PYADITVHRTVRTL 67 (293)
Q Consensus 4 ~~~~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g----------------~w~~~~~~~~~~~v 67 (293)
+.+.-.|+|.|||++|||||+|.++......||++++|.||+|.+.+++| +||.+.+.|...+-
T Consensus 457 t~~~gSiSf~FRTtepnGlil~~~g~~~~~~d~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv~l~R~gR~g 536 (1591)
T KOG3514|consen 457 TKKSGSISFDFRTTEPNGLILFHGGPQANATDYFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHVDLQRDGRTG 536 (1591)
T ss_pred cCCcceeEEEEeecCCCceEEEccCcccccccEEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEEEeeccCccc
Confidence 44556899999999999999999986656789999999999999987776 57777766655533
Q ss_pred EecccC------CCCceeeeecCCceEEcCCCCCCCCCCC---CCcccCceeeEEEEEEcCeeeeeccCCCCCcc----c
Q psy5026 68 ILPYTV------PSGLFSRITFREPVFVGGRGNTSGLSDK---LPTEKGFKGCIRHLDINDHLYNFNVSPGGDSL----K 134 (293)
Q Consensus 68 ~l~~~g------~~~~~~~l~~~~~l~iGG~p~~~~~~~~---~~~~~~F~GCi~~~~~n~~~~~~~~~~~~~~~----~ 134 (293)
.+..++ .+|....|++++++|+|-.++..-.+.. ...+.+|+||||++.+||+..++......+.. .
T Consensus 537 svsVd~~~~df~tpG~s~iL~ld~~mylG~~~n~l~~P~~vWta~L~~GyvGCirdl~i~G~s~di~q~ae~q~sagvkp 616 (1591)
T KOG3514|consen 537 SVSVDAIKTDFSTPGDSEILDLDDPMYLGEVPNNLVYPSEVWTAALRKGYVGCIRDLFIDGVSTDIRQEAEAQNSAGVKP 616 (1591)
T ss_pred eEEEeeeecCccCCCcceeEeecCceeeccCCCCccCcHHHHHHHHhccchheehhheecceehhhHHHhhhccccccCc
Confidence 332222 3677788999999999965554222211 22468999999999999998877653211111 1
Q ss_pred cccc---cccc--ccCCCCc------------------------------------------------------------
Q psy5026 135 GIDV---ALLR--PATHYQV------------------------------------------------------------ 149 (293)
Q Consensus 135 ~~~~---~~C~--pC~~~~~------------------------------------------------------------ 149 (293)
+|.+ ..|. ||+|++.
T Consensus 617 sCs~~~~~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE~t~ls~nGs~~m~i~L~~~~~tq~E~v~iRF~t~r 696 (1591)
T KOG3514|consen 617 SCSLSNEKICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCEREATALSYNGSMSMKIVLPHTMHTQAEDVSIRFRTQR 696 (1591)
T ss_pred ccchhhccccCCCcccCCCCccccccccccccccCcccCccccceeeeEEEcCeeeEEEEecccceeecceEEEEEEecc
Confidence 2211 1354 8888764
Q ss_pred --------------------------------------------------------------------------------
Q psy5026 150 -------------------------------------------------------------------------------- 149 (293)
Q Consensus 150 -------------------------------------------------------------------------------- 149 (293)
T Consensus 697 ~~Gll~~Tta~~s~D~l~l~L~~g~vkl~v~ls~~~nlfag~~LnDN~WHtvrv~Rrg~~L~L~vD~~~~~~~~~~g~h~ 776 (1591)
T KOG3514|consen 697 AYGLLFATTARGSADTLRLELDAGQVKLFVNLSGPENLFAGQSLNDNEWHTVRVVRRGKSLLLYVDFWSVSIYTMNGIHV 776 (1591)
T ss_pred cceeEEEeccCCCCceEEEEEecceEEEEEecCCCcceeccccccCCcceEEEEEEcccceEEEeccccceeeeecCceE
Confidence
Q ss_pred ----------------------------------------------------------------CCc----ceeEecCCc
Q psy5026 150 ----------------------------------------------------------------SSI----PDQVYQGLG 161 (293)
Q Consensus 150 ----------------------------------------------------------------~~f----s~~~~~~~~ 161 (293)
.+| ||+.+..+
T Consensus 777 ~le~~~i~~g~e~~~~s~~~~nFiG~l~~LvFNG~~Yld~~K~~~~~ls~l~a~fkl~~iv~~paTf~sk~Sy~~la~L- 855 (1591)
T KOG3514|consen 777 RLEFHNIETGTESRAPSSVPSNFIGHLSGLVFNGQDYLDKCKMGDIQLSELSARFKLRAIVADPATFKSKSSYVKLATL- 855 (1591)
T ss_pred EEEEeeeccccccccCCCCChhhhhhhhheEECcHHHHHHHhcCCcchhhcchhhCceEEeeccceeeechhhhhhhhh-
Confidence 011 33333222
Q ss_pred cceeeEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEE--ecCCCCCCCeEEEEEEEeC
Q psy5026 162 EQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR--SSNPISLGEWRKLRLTRTG 239 (293)
Q Consensus 162 ~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~--~~~~~~dg~wh~V~i~r~~ 239 (293)
.....++|.|+|+|.+++|+|++..... .||++|+|.+|++++.|++|+++...+ +..++||++||.|.|.|.+
T Consensus 856 -~ay~s~~l~Fqfkt~sp~gll~fn~gd~---ndfi~velvnG~ihYtfdlg~gp~~~k~~sr~hlnDnrWHnV~I~rd~ 931 (1591)
T KOG3514|consen 856 -QAYFSMHLFFQFKTTSPDGLLLFNSGDG---NDFIAVELVNGYIHYTFDLGNGPTSMKGPSRQHLNDNRWHNVLIYRDK 931 (1591)
T ss_pred -heeeEEEEEEEEeecCCCeEEEecCCCC---CceEEEEEeCcEEEEEEEcCCCcccccCcccCcCccccceeEEEEcCC
Confidence 2224678889999999999999987665 699999999999999999999876655 6789999999999999975
Q ss_pred -CEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCC--ccCCCCCcCCCcee
Q psy5026 240 -RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYN--IVSPKVKIKSSFIG 292 (293)
Q Consensus 240 -~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~--~~~~~~~~~~~F~G 292 (293)
+.-.|.||..... ........+++.+.||+||+.... ..+..+..+.+|.|
T Consensus 932 ~~~HtL~vD~s~~t--~~~~g~~~l~l~g~LyiGGv~k~m~~~~p~~~asR~g~~g 985 (1591)
T KOG3514|consen 932 TNTHTLKVDNSSTT--QIIDGAVNLDLKGKLYIGGVSKPMYSFLPKLVASRSGFQG 985 (1591)
T ss_pred CCceEEEecCceEE--EEecCccccccccceecccccccccccccceeeccCCCCC
Confidence 4558999987432 222235567888999999985432 23434445666654
No 3
>KOG3516|consensus
Probab=99.97 E-value=7.3e-30 Score=242.19 Aligned_cols=273 Identities=18% Similarity=0.262 Sum_probs=204.7
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC----------------------CCCCeEEEEec
Q psy5026 7 AWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR----------------------PYADITVHRTV 64 (293)
Q Consensus 7 ~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g----------------------~w~~~~~~~~~ 64 (293)
+-.|+|.|||.++||+|||..+.. +||+.|+|++|++++.+|+| -||.+.+.+..
T Consensus 201 ~d~is~~Fkt~~sdGvllh~eg~Q---Gd~itlql~~~kl~l~ld~G~~~~~~s~~~~sis~GslLdD~hWHsV~i~r~~ 277 (1306)
T KOG3516|consen 201 KDVISLKFKTMQSDGVLLHGEGQQ---GDYITLQLIGGKLVLILDLGNSKLPSSRTPTSISAGSLLDDQHWHSVRIERQG 277 (1306)
T ss_pred cceeEEEEEeeccceeEEEcccCC---CCEEEEEEeCCEEEEEEecCCccCccccCcceeecccccCCCcceEEEEEecC
Confidence 347999999999999999998865 89999999999999999998 39999999999
Q ss_pred cEEEecccC------CCCceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCee-eeeccCCCCCcccccc
Q psy5026 65 RTLILPYTV------PSGLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHL-YNFNVSPGGDSLKGID 137 (293)
Q Consensus 65 ~~v~l~~~g------~~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~-~~~~~~~~~~~~~~~~ 137 (293)
..|++..++ ..|++..|+++..+|+||+|..... .....+|+|||++|++|+.. ++++...-.......+
T Consensus 278 ~~vnftvD~~~~~fr~~Ge~~~Ldld~e~~~GGiP~~~~~---~~~~~nF~GCienly~N~vdiidLa~~~~~~~~~~gn 354 (1306)
T KOG3516|consen 278 RQVNFTVDGVVHHFRATGEFDALDLDTEISFGGIPNDGKS---VGFEKNFTGCLENLYYNGVDIIDLAKRRKSQISAMGN 354 (1306)
T ss_pred cEEEEEEccceEeecccCccceeecceEEEECCccCCCcc---cceeeeeeeeeeeeeecCceeEeeecccccceecccc
Confidence 988876544 2588899999999999999987543 22348999999999999985 6777632111112223
Q ss_pred cccccccCCCCcC--Cc----ceeEecCCccceeeEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEE
Q psy5026 138 VALLRPATHYQVS--SI----PDQVYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFD 211 (293)
Q Consensus 138 ~~~C~pC~~~~~~--~f----s~~~~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~ 211 (293)
+. ..|..+..+ .| ||+.+|+... ...+.++|+|||...+|+|++....+. ...+.+.|++|++...+.
T Consensus 355 v~--f~C~~P~~~pvtF~~sss~~~lpg~~~--~~~l~vSF~FRtw~~~G~ll~~~~~e~--~g~v~~fl~eg~~~~~i~ 428 (1306)
T KOG3516|consen 355 VS--FSCSDPQIIPVTFGNSSSYLRLPGNPN--PDRLSVSFQFRTWNKTGLLLFSELKEG--SGEVLLFLKEGKKFLQIT 428 (1306)
T ss_pred ee--EeccCCCCCCeEecccceeEEcCCCCC--CCceeeEEEEEeccccCceeeeeeccC--CceEEEEEeCCeEEEEEe
Confidence 32 145554432 35 6889886433 368899999999999999999866554 679999999999887764
Q ss_pred C-CCceEEEEecCCCCCCCeEEEEEEEeCCEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCCCcCCCc
Q psy5026 212 L-GTGAATLRSSNPISLGEWRKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSF 290 (293)
Q Consensus 212 ~-g~~~~~~~~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F 290 (293)
. +...+.+.....+|||+||.|.+.++.+.+.+.+|+...... ......++......|+||.|+...........++|
T Consensus 429 ~~~r~~~~~~~g~~lnDG~WHsv~~~ak~n~~~~~iDd~~~~~~-~~~~p~~V~tg~tY~fgg~~~~~~~~~~~~~~~~f 507 (1306)
T KOG3516|consen 429 QIGRSKADAYAGLKLNDGAWHSVSFNAKKNRLVLMIDDGEAEIA-PDSKPLQVYTGTTYYFGGCPDKFNSWQCASPIKGF 507 (1306)
T ss_pred ccccchhhhcccccCCCCceEEEEEEeecceeEEEEcCcccccc-cCCccEEEEeCCeeEeccccccccchhhccccccc
Confidence 4 333456667788999999999999999999999999854221 11122345667889999999863222223334566
Q ss_pred ee
Q psy5026 291 IG 292 (293)
Q Consensus 291 ~G 292 (293)
.|
T Consensus 508 ~G 509 (1306)
T KOG3516|consen 508 QG 509 (1306)
T ss_pred cc
Confidence 54
No 4
>KOG4289|consensus
Probab=99.96 E-value=9.2e-28 Score=229.27 Aligned_cols=253 Identities=18% Similarity=0.343 Sum_probs=179.1
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC-----------------CCCCeEEEEeccEEEe
Q psy5026 7 AWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR-----------------PYADITVHRTVRTLIL 69 (293)
Q Consensus 7 ~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g-----------------~w~~~~~~~~~~~v~l 69 (293)
.+.++|+|.|.+.||+|||.+++ ++||++|++.++++++.|+.| +||.+.+...++...+
T Consensus 1338 h~TlslsfaT~~~nGlL~ynGne---khDFvalevVd~qvqltfS~Ges~t~v~p~Vp~gvsDGqWHtV~l~YyNK~av~ 1414 (2531)
T KOG4289|consen 1338 HFTLSLSFATIERNGLLLYNGNE---KHDFVALEVVDEQVQLTFSAGESTTTVSPDVPGGVSDGQWHTVQLEYYNKVAVV 1414 (2531)
T ss_pred EEEEEEEEEEeeecceEEecCCc---ccceEeeeeeeeeEEEEEecccccceecCCCCCCcccCceeEEEEEEeceEEEE
Confidence 45677788899999999999965 489999999999999987766 7999999888875444
Q ss_pred cccC----------C----------CCceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeeeeccCCC
Q psy5026 70 PYTV----------P----------SGLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYNFNVSPG 129 (293)
Q Consensus 70 ~~~g----------~----------~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~~~~~~~ 129 (293)
..+. . .++...|++.++|++||+|..... ....|.|||+++.++++.+||+....
T Consensus 1415 svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~fpv-----~~k~FvGCmrdLsvD~~~VDma~fia 1489 (2531)
T KOG4289|consen 1415 SVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETFPV-----IEKQFVGCMRDLSVDGRDVDMATFIA 1489 (2531)
T ss_pred EeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcchh-----hHhHhhhhhhhcccccccccHHHHHh
Confidence 2211 1 234456999999999999965332 24789999999999999999886422
Q ss_pred CC-ccccccc--cccc--ccCCCCc---------------------------C-Cc---ceeEecCCccceeeEEEEEEE
Q psy5026 130 GD-SLKGIDV--ALLR--PATHYQV---------------------------S-SI---PDQVYQGLGEQYLSLLDLTIV 173 (293)
Q Consensus 130 ~~-~~~~~~~--~~C~--pC~~~~~---------------------------~-~f---s~~~~~~~~~~~~~~~~i~~~ 173 (293)
.+ ..+||.. .-|. +|.+++. | .| |-+++..++....-.+.++++
T Consensus 1490 nngt~eGC~ark~fCdsg~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~pq~frG~sl~sw~~~~~~vSvPwylsl~ 1569 (2531)
T KOG4289|consen 1490 NNGTHEGCKARKNFCDSGQCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAHPQHFRGHSLVSWEGLPSQVSVPWYLSLM 1569 (2531)
T ss_pred hcCcccCchhhhcccCCCccCCCCeeecccCcEeecCccccCCcchhhccCCchhccccceeeecCCCcceecceEEEEE
Confidence 21 1223311 1122 5555542 1 23 566666665566668999999
Q ss_pred EEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEecCCCCCCCeEEEEEEEeCCEEEEEECCcccee
Q psy5026 174 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWRKLRLTRTGRHAYLQVDRFPSSQ 253 (293)
Q Consensus 174 frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~~ 253 (293)
|||+..+|+||.+.... ..-+.+.|.+|++.+.+.. +.+.+ +.-.++||+||++.+..+.. ..++.|...- +
T Consensus 1570 FRTr~ad~vl~~~~~~~---rst~~lqld~g~l~~~v~~--s~v~L-~~~~vtdg~Wh~~~i~l~~d-~~~t~d~g~~-~ 1641 (2531)
T KOG4289|consen 1570 FRTRRADGVLMQAEFGG---RSTYNLQLDDGTLKYNVGD--SSVEL-PAPRVTDGHWHHLVIELEAD-SVATLDYGIY-Q 1641 (2531)
T ss_pred EEeeccccEEEEEEeCC---CceEEEEEcCCEEEEEecC--ceEEc-cCccccCCchhheeeeeccC-eEEEEechhh-h
Confidence 99999999999886554 3459999999999977643 33333 34569999999999999875 4566665422 1
Q ss_pred EecCCCcccccCCCCeEEeccCC
Q psy5026 254 ILSPGPFTQLSLSLSLYLGGVPD 276 (293)
Q Consensus 254 ~~~~~~~~~l~~~~~lyvGG~p~ 276 (293)
.........|++. .||+||+|.
T Consensus 1642 aea~~gl~gl~l~-sl~vGgap~ 1663 (2531)
T KOG4289|consen 1642 AEAKAGLSGLNLE-SLYVGGAPA 1663 (2531)
T ss_pred hhhhcCCCCceee-EEEEccccC
Confidence 1222224445555 799999994
No 5
>KOG3516|consensus
Probab=99.91 E-value=2.3e-22 Score=191.59 Aligned_cols=270 Identities=17% Similarity=0.257 Sum_probs=191.3
Q ss_pred CCceeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECC-EEEEEEeCC-----------------CCCCeEEEEecc
Q psy5026 4 SPQAWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEG-YVEFSTPYR-----------------PYADITVHRTVR 65 (293)
Q Consensus 4 ~~~~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G-~l~~~~~~g-----------------~w~~~~~~~~~~ 65 (293)
...+.+|+|.|||..+.|++|++-+.+ ||+-++|..+ .++|.++.| +||.+++-+..+
T Consensus 807 ~~~saDIsf~FrTt~~~gvflen~g~~----dfir~eL~~~~~vtf~~dvgnGp~~~~V~s~t~~nD~qWH~V~~Ern~K 882 (1306)
T KOG3516|consen 807 NELSADISFFFRTTASSGVFLENHGIN----DFIRLELSSPVEVTFAFDVGNGPSQLTVRSPTELNDNQWHQVRAERNSK 882 (1306)
T ss_pred CcccccEEEEEEecCCceEeeeccCCC----ceEEEEEcCCCceEEEEEcCCCceeEEEcCCcccCCCceEEEEEEeccc
Confidence 456779999999999999999998865 9999999654 566755544 799999988887
Q ss_pred EEEecccCC--------CCceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeeeeccCCCCCcccccc
Q psy5026 66 TLILPYTVP--------SGLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYNFNVSPGGDSLKGID 137 (293)
Q Consensus 66 ~v~l~~~g~--------~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~~~~~~~~~~~~~~~ 137 (293)
.-.|..++. ......|++...+||||.... +++|.||||.+++||+.+||.... ....+..
T Consensus 883 ~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vGgt~~~---------~~gF~GCIRsl~LNGv~ldLe~ra--~~~~gv~ 951 (1306)
T KOG3516|consen 883 EASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVGGTVSR---------QRGFLGCIRSLQLNGVMLDLEYRA--YGTAGVS 951 (1306)
T ss_pred cceEEEcCcccceecCCCCCEEEEEeccceeccccccC---------cCcceeeeeeeeecceeeeehhhh--ccCCccc
Confidence 666654442 234467888899999997432 589999999999999999996422 2222322
Q ss_pred c---cccc--ccCCCCc---------------------------CCc---ceeEecCCcc-----------------cee
Q psy5026 138 V---ALLR--PATHYQV---------------------------SSI---PDQVYQGLGE-----------------QYL 165 (293)
Q Consensus 138 ~---~~C~--pC~~~~~---------------------------~~f---s~~~~~~~~~-----------------~~~ 165 (293)
. +.|+ +|.|+|. ..| ++++|+-+.. ...
T Consensus 952 ~GC~GhCss~~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~eig~~fe~gs~i~y~fq~~~~~a~~~~~~~~~~~~~~~~ 1031 (1306)
T KOG3516|consen 952 PGCEGHCSSYPCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEIGVFFERGSSIRYNFQKPMRSAVFESSRVKQKLEIEIN 1031 (1306)
T ss_pred CCCccccccccccCCCEEEEecCceeeccccCcCCCCccccccceEecCCceEEEeccchHHHhhhhhhhhhhccccccC
Confidence 2 3565 8888874 012 5677653210 011
Q ss_pred eEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEe-CCEEEEEEECCC-ceEEEEe-cCCCCCCCeEEEEEEEeCCEE
Q psy5026 166 SLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLN-DRYVDFTFDLGT-GAATLRS-SNPISLGEWRKLRLTRTGRHA 242 (293)
Q Consensus 166 ~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~-~g~v~~~~~~g~-~~~~~~~-~~~~~dg~wh~V~i~r~~~~~ 242 (293)
..-.|.|.|+|....++|+|.++-. .+|+++-+. ||.+..++.+|. .+..++. .+.+.||+.|.|.|.|..+.+
T Consensus 1032 ~~e~i~~sftTt~~ps~LLfvssF~---~~y~~V~v~~nGsLq~ry~lg~~e~~~~~~~~kn~~~gq~H~i~i~r~~~~~ 1108 (1306)
T KOG3516|consen 1032 PNEEINFSFTTTRAPSDLLFVSSFT---DDYLAVLVKDNGSLQTRYMLGFREPFEYQFKDKNIALGQPHDINITRGPRTV 1108 (1306)
T ss_pred ccceEEEEEEeccCceEEEEeeccc---cceEEEEEeCCCceEEEEecCCcCceEEecccccccCCCceEEEEecCCceE
Confidence 2457899999999999999988876 789999996 899999999998 5566664 567999999999999999999
Q ss_pred EEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCC--CcCCCcee
Q psy5026 243 YLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKV--KIKSSFIG 292 (293)
Q Consensus 243 ~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~--~~~~~F~G 292 (293)
.+.||+.+......+. ...++....+++|-+-......+++ -.+.+|.|
T Consensus 1109 ~i~vD~y~~~~y~~~~-~~~~~~~ksl~lg~v~e~~~~d~~~~k~~t~gF~G 1159 (1306)
T KOG3516|consen 1109 FLEVDGYLKVEYTFSI-DVDFQSPKSLTLGPVTETANIDHEISKYNTPGFGG 1159 (1306)
T ss_pred EEEecCccceeeeccc-ceeecccchhhccceeeccCCChhHHhhcCCCccc
Confidence 9999997664433322 2333444455555443332222222 13556665
No 6
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.90 E-value=2.9e-22 Score=154.13 Aligned_cols=117 Identities=39% Similarity=0.676 Sum_probs=98.7
Q ss_pred EEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEecCCCCCCCeEEEEEEEeCCEEEEEECCcccee
Q psy5026 174 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWRKLRLTRTGRHAYLQVDRFPSSQ 253 (293)
Q Consensus 174 frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~~ 253 (293)
|||..++|+|||.+.... .||++|+|.+|+++++++.|.+...+.+...++||+||+|++.|+...+.|.||+.....
T Consensus 1 frT~~~~Gllly~g~~~~--~dfial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~~~ 78 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDG--KDFIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGEEVVT 78 (131)
T ss_dssp EEESSSSEEEEEEESSTT--SSEEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTSEEEE
T ss_pred CccCCCCceEEECCcCCC--CCEEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCcccee
Confidence 899999999999988765 699999999999999999999988888877899999999999999999999999987645
Q ss_pred EecCCCccc-ccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 254 ILSPGPFTQ-LSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 254 ~~~~~~~~~-l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
...+..... ++....+||||+|.....+.......+|+|
T Consensus 79 ~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~G 118 (131)
T PF00054_consen 79 GESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKG 118 (131)
T ss_dssp EEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EE
T ss_pred eecCCccccccccccCEEEccCCchhhcccccccCCCeeE
Confidence 556544444 788888999999944434445567788987
No 7
>smart00282 LamG Laminin G domain.
Probab=99.86 E-value=2.6e-20 Score=144.19 Aligned_cols=123 Identities=37% Similarity=0.703 Sum_probs=101.7
Q ss_pred EEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEec-CCCCCCCeEEEEEEEeCCEEEEE
Q psy5026 167 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWRKLRLTRTGRHAYLQ 245 (293)
Q Consensus 167 ~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~~-~~~~dg~wh~V~i~r~~~~~~l~ 245 (293)
.++|+|.|||.+++|+|||+.+... .+|+.|+|.+|++.+.++.+++...++.. ..++||+||+|.+.++++.+.|.
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~~--~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~~~l~ 79 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKNG--GDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRRVTLS 79 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCCC--CCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCEEEEE
Confidence 4689999999999999999977433 78999999999999999999877777765 88999999999999999999999
Q ss_pred ECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 246 VDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 246 VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
||+........+.....++....+||||+|+.... .......+|.|
T Consensus 80 VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~-~~~~~~~~F~G 125 (135)
T smart00282 80 VDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKL-PPLLVTPGFRG 125 (135)
T ss_pred ECCCccccEECCCCceEEecCCCcEEccCCchhcc-cccccCCCCee
Confidence 99975544445555556777889999999986432 23345688987
No 8
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.85 E-value=2.9e-20 Score=142.97 Aligned_cols=108 Identities=30% Similarity=0.533 Sum_probs=83.0
Q ss_pred EEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC---------------CCCCeEEEEeccEEEecccCC----
Q psy5026 14 FKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR---------------PYADITVHRTVRTLILPYTVP---- 74 (293)
Q Consensus 14 FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g---------------~w~~~~~~~~~~~v~l~~~g~---- 74 (293)
|||.++||+|||.+..+ ..||++|+|.+|+|+++++.| +||.+.+.+....+.|..++.
T Consensus 1 frT~~~~Gllly~g~~~--~~dfial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~~~ 78 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKD--GKDFIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGEEVVT 78 (131)
T ss_dssp EEESSSSEEEEEEESST--TSSEEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTSEEEE
T ss_pred CccCCCCceEEECCcCC--CCCEEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCcccee
Confidence 89999999999998865 459999999999999998877 699998888887776654432
Q ss_pred ---CCcee-eeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeee
Q psy5026 75 ---SGLFS-RITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYN 123 (293)
Q Consensus 75 ---~~~~~-~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~ 123 (293)
+.... .++....+||||+|.............+|.|||+++.+|++.+|
T Consensus 79 ~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in~~~ld 131 (131)
T PF00054_consen 79 GESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSINGKPLD 131 (131)
T ss_dssp EEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred eecCCccccccccccCEEEccCCchhhcccccccCCCeeEEEEEeEECCEECc
Confidence 22222 37788889999999433333334556899999999999999764
No 9
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.84 E-value=1.7e-19 Score=142.03 Aligned_cols=136 Identities=34% Similarity=0.607 Sum_probs=108.5
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEecCCCCCCCeEE
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWRK 232 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~~~~~~dg~wh~ 232 (293)
+|+.|+..... ...++|++.|||..++|+||+...... .+|+.|+|.+|++.+.++.+.....+++...++||+||+
T Consensus 8 ~~i~~~~~~~~-~~~~~i~~~frt~~~~g~l~~~~~~~~--~~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~dg~Wh~ 84 (151)
T cd00110 8 SYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQNG--GDFLALELEDGRLVLRYDLGSGSLVLSSKTPLNDGQWHS 84 (151)
T ss_pred ceEEecCCCCC-cceeEEEEEEEeCCCCeEEEEecCCCC--CCEEEEEEECCEEEEEEcCCcccEEEEccCccCCCCEEE
Confidence 78899865433 568899999999999999999988643 789999999999999999986667777766899999999
Q ss_pred EEEEEeCCEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 233 LRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 233 V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
|.+.+.++.+.|.||+........+.....+.....+||||+|+.... .......+|+|
T Consensus 85 v~i~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~-~~~~~~~~F~G 143 (151)
T cd00110 85 VSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKS-PGLPVSPGFVG 143 (151)
T ss_pred EEEEECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhc-ccccccCCCce
Confidence 999999999999999974433333333224567789999999986432 23445688887
No 10
>smart00282 LamG Laminin G domain.
Probab=99.78 E-value=8.3e-18 Score=130.10 Aligned_cols=111 Identities=24% Similarity=0.521 Sum_probs=87.6
Q ss_pred eeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC----------------CCCCeEEEEeccEEEec
Q psy5026 7 AWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR----------------PYADITVHRTVRTLILP 70 (293)
Q Consensus 7 ~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g----------------~w~~~~~~~~~~~v~l~ 70 (293)
.++++|.|||.+++|+|||.++.. ..+|++|+|.+|++.+.++.| +||.+.+.+..+.+.+.
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~--~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~~~l~ 79 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKN--GGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRRVTLS 79 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCC--CCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCEEEEE
Confidence 578999999999999999998742 369999999999999876654 69999998888887776
Q ss_pred ccCC-------CCceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCe
Q psy5026 71 YTVP-------SGLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDH 120 (293)
Q Consensus 71 ~~g~-------~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~ 120 (293)
.++. ++....++....+||||+|..... .......+|.|||+++.+|+.
T Consensus 80 VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~-~~~~~~~~F~GCi~~v~in~~ 135 (135)
T smart00282 80 VDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKL-PPLLVTPGFRGCIRNLKVNGK 135 (135)
T ss_pred ECCCccccEECCCCceEEecCCCcEEccCCchhcc-cccccCCCCeeEeeEEEECCC
Confidence 5542 233356777889999999976432 223346899999999999974
No 11
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.77 E-value=1.3e-17 Score=127.33 Aligned_cols=116 Identities=34% Similarity=0.586 Sum_probs=92.9
Q ss_pred EEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCc-eEEEEecCCCCCCCeEEEEEEEeCCEEEEEECCccce
Q psy5026 174 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTG-AATLRSSNPISLGEWRKLRLTRTGRHAYLQVDRFPSS 252 (293)
Q Consensus 174 frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~-~~~~~~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~ 252 (293)
|||+.++|+|||+..... .+|+.|+|.+|++.+.++.|.. ......+..++||+||+|.+.|.++.+.|.||+....
T Consensus 1 Frt~~~~g~Ll~~~~~~~--~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~~~~ 78 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDN--GDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDGQSVS 78 (128)
T ss_dssp EEESSSSEEEEEEEESTT--SEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETTSEEE
T ss_pred CccCCCCEeEEEEcCCCC--CEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecCccce
Confidence 899999999999988764 5799999999999999999944 4566678899999999999999999999999998665
Q ss_pred eEecCCCcc-cccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 253 QILSPGPFT-QLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 253 ~~~~~~~~~-~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
......... .++....+||||.|.....+... ...+|.|
T Consensus 79 ~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~-~~~~f~G 118 (128)
T PF02210_consen 79 SESLPSSSSDSLDPDGSLYIGGLPESNQPSGSV-DTPGFVG 118 (128)
T ss_dssp EEESSSTTHHCBESEEEEEESSTTTTCTCTTSS-TTSB-EE
T ss_pred EEeccccceecccCCCCEEEecccCcccccccc-CCCCcEE
Confidence 444433332 56677789999999876433222 2688887
No 12
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.69 E-value=3.5e-16 Score=119.44 Aligned_cols=104 Identities=24% Similarity=0.486 Sum_probs=82.0
Q ss_pred EEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC----------------CCCCeEEEEeccEEEecccCCC--
Q psy5026 14 FKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR----------------PYADITVHRTVRTLILPYTVPS-- 75 (293)
Q Consensus 14 FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g----------------~w~~~~~~~~~~~v~l~~~g~~-- 75 (293)
|||.+++|+|||.++.+. .+|++|+|.+|+|++.+++| +||.+.+.+....+.+..++..
T Consensus 1 Frt~~~~g~Ll~~~~~~~--~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~~~~ 78 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDN--GDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDGQSVS 78 (128)
T ss_dssp EEESSSSEEEEEEEESTT--SEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETTSEEE
T ss_pred CccCCCCEeEEEEcCCCC--CEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecCccce
Confidence 899999999999999762 57999999999999998877 6999999999988877655431
Q ss_pred -----Cce-eeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCe
Q psy5026 76 -----GLF-SRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDH 120 (293)
Q Consensus 76 -----~~~-~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~ 120 (293)
... ..++....+||||.|.......... ..+|.|||+++++||+
T Consensus 79 ~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~-~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 79 SESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVD-TPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp EEESSSTTHHCBESEEEEEESSTTTTCTCTTSST-TSB-EEEEEEEEETTE
T ss_pred EEeccccceecccCCCCEEEecccCccccccccC-CCCcEEEcCeEEECCC
Confidence 111 1556677899999998765433222 5899999999999985
No 13
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.66 E-value=2.2e-15 Score=118.55 Aligned_cols=111 Identities=25% Similarity=0.505 Sum_probs=87.0
Q ss_pred CceeEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeC---------------CCCCCeEEEEeccEEEe
Q psy5026 5 PQAWRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPY---------------RPYADITVHRTVRTLIL 69 (293)
Q Consensus 5 ~~~~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~---------------g~w~~~~~~~~~~~v~l 69 (293)
...++|+|.|||.+++|+|||.++.. ..+|++|+|.+|++.+.++. |+||.+.+.+....+.|
T Consensus 19 ~~~~~i~~~frt~~~~g~l~~~~~~~--~~~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~dg~Wh~v~i~~~~~~~~l 96 (151)
T cd00110 19 RTRLSISFSFRTTSPNGLLLYAGSQN--GGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLNDGQWHSVSVERNGRSVTL 96 (151)
T ss_pred cceeEEEEEEEeCCCCeEEEEecCCC--CCCEEEEEEECCEEEEEEcCCcccEEEEccCccCCCCEEEEEEEECCCEEEE
Confidence 56889999999999999999999862 46999999999999987665 47999999888888877
Q ss_pred cccCCC-------CceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEc
Q psy5026 70 PYTVPS-------GLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDIN 118 (293)
Q Consensus 70 ~~~g~~-------~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n 118 (293)
..++.. .....++....+||||.|..... ...+...+|.|||+++++|
T Consensus 97 ~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~-~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 97 SVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKS-PGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred EECCccEEeeeCCCCceeecCCCCeEEcCCCCchhc-ccccccCCCceEeeEeEeC
Confidence 665531 11113567789999999875432 1223468999999999987
No 14
>KOG4289|consensus
Probab=99.48 E-value=5.2e-13 Score=129.68 Aligned_cols=120 Identities=33% Similarity=0.545 Sum_probs=96.1
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEe--cCCCCCCCe
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRS--SNPISLGEW 230 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~--~~~~~dg~w 230 (293)
|++.|.+. ...-.+.+++.|-|.+.+|+|+|.+++. .||++|++.++.+.+.|..|....++.. +..++||+|
T Consensus 1326 sfv~frgl--rqRfh~TlslsfaT~~~nGlL~ynGnek---hDFvalevVd~qvqltfS~Ges~t~v~p~Vp~gvsDGqW 1400 (2531)
T KOG4289|consen 1326 SFVTFRGL--RQRFHFTLSLSFATIERNGLLLYNGNEK---HDFVALEVVDEQVQLTFSAGESTTTVSPDVPGGVSDGQW 1400 (2531)
T ss_pred heEEEecc--ccceEEEEEEEEEEeeecceEEecCCcc---cceEeeeeeeeeEEEEEecccccceecCCCCCCcccCce
Confidence 67777653 2224678899999999999999999665 7999999999999999999976666664 457999999
Q ss_pred EEEEEEEeCCEEEEEECCccceeE-------------ecCCCcccccCCCCeEEeccCCC
Q psy5026 231 RKLRLTRTGRHAYLQVDRFPSSQI-------------LSPGPFTQLSLSLSLYLGGVPDY 277 (293)
Q Consensus 231 h~V~i~r~~~~~~l~VD~~~~~~~-------------~~~~~~~~l~~~~~lyvGG~p~~ 277 (293)
|+|.+++.++.+.++||+...... +.-++...|++.++|++||+|+.
T Consensus 1401 HtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~ 1460 (2531)
T KOG4289|consen 1401 HTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPET 1460 (2531)
T ss_pred eEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCc
Confidence 999999999999999999855211 11123345888999999999964
No 15
>KOG1219|consensus
Probab=99.35 E-value=9.8e-12 Score=125.12 Aligned_cols=133 Identities=24% Similarity=0.386 Sum_probs=107.9
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEec-CCCCCCCeE
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWR 231 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~~-~~~~dg~wh 231 (293)
||++|... ......+++.|++||..++|++||... .++..|.|.+|++.+.++.|++.-.+.+. ..++||+||
T Consensus 3693 SYveyrls-e~~n~~~kl~frLkT~~sngIiM~tr~-----~d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWH 3766 (4289)
T KOG1219|consen 3693 SYVEYRLS-ENQNTRMKLGFRLKTLQSNGIIMYTRK-----TDLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWH 3766 (4289)
T ss_pred eeEEEEcc-cccccceEEEEEEEecccCcEEEEEcC-----CceEEEEecCCcEEEEEecCCCCCcccccceEeecCcee
Confidence 78888643 322335899999999999999999984 57999999999999999999988555554 789999999
Q ss_pred EEEEEEeCCEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 232 KLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 232 ~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
.+.+.|+++.++|+||+........|+....++++..||+||.-... ..+.-.+..||.|
T Consensus 3767 sialerrr~~irlsvDd~~~~~atvPg~~~tln~d~hiy~Ga~vrlr-~~~~tqvs~Gf~G 3826 (4289)
T KOG1219|consen 3767 SIALERRRNHIRLSVDDDTYDSATVPGMKSTLNLDTHIYLGALVRLR-HQRSTQVSYGFDG 3826 (4289)
T ss_pred EEEeeccCCceEEEEcccCceeeecccceeeccccceEEEeeEeeec-cCCCccccccccc
Confidence 99999999999999999877777888888889999999999976511 1112134577776
No 16
>KOG1219|consensus
Probab=99.07 E-value=1.3e-09 Score=110.50 Aligned_cols=136 Identities=15% Similarity=0.267 Sum_probs=94.1
Q ss_pred eEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC----------------CCCCeEEEEeccEEEe--
Q psy5026 8 WRFPIQFKPESWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR----------------PYADITVHRTVRTLIL-- 69 (293)
Q Consensus 8 ~~i~~~FrT~~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g----------------~w~~~~~~~~~~~v~l-- 69 (293)
+.+.|+.||.+++|+++|... . +|..|.|.+|.+.+.++.| +||++.+-+......+
T Consensus 3707 ~kl~frLkT~~sngIiM~tr~-~----d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsialerrr~~irlsv 3781 (4289)
T KOG1219|consen 3707 MKLGFRLKTLQSNGIIMYTRK-T----DLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIALERRRNHIRLSV 3781 (4289)
T ss_pred eEEEEEEEecccCcEEEEEcC-C----ceEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEeeccCCceEEEE
Confidence 788999999999999999984 3 8999999999999988776 5777766544443333
Q ss_pred cccC-----CCCceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeeeeccCCCC--------Cccccc
Q psy5026 70 PYTV-----PSGLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYNFNVSPGG--------DSLKGI 136 (293)
Q Consensus 70 ~~~g-----~~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~~~~~~~~--------~~~~~~ 136 (293)
.+.. .++..+.++.+..+|+||.-.....+. .....+|.|||..+++||..+.+....-. ....||
T Consensus 3782 Dd~~~~~atvPg~~~tln~d~hiy~Ga~vrlr~~~~-tqvs~Gf~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC 3860 (4289)
T KOG1219|consen 3782 DDDTYDSATVPGMKSTLNLDTHIYLGALVRLRHQRS-TQVSYGFDGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGC 3860 (4289)
T ss_pred cccCceeeecccceeeccccceEEEeeEeeeccCCC-ccccccccceeeeEEEccccccccCCCchhhhhhhhhcccccc
Confidence 2221 145566788889999999864211111 12357999999999999998876654310 012343
Q ss_pred cc--cccc--ccCCCCc
Q psy5026 137 DV--ALLR--PATHYQV 149 (293)
Q Consensus 137 ~~--~~C~--pC~~~~~ 149 (293)
.. ..|. ||+|+|.
T Consensus 3861 ~l~~d~C~~npCqhgG~ 3877 (4289)
T KOG1219|consen 3861 SLLTDPCNDNPCQHGGT 3877 (4289)
T ss_pred cccccccccCcccCCCE
Confidence 22 3455 8888873
No 17
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.66 E-value=1.1e-06 Score=71.36 Aligned_cols=86 Identities=22% Similarity=0.297 Sum_probs=64.3
Q ss_pred eEEEEEEEEEeC-CCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEEC--CCc-eEEEEec-CCCCCCCeEEEEEEEeCC
Q psy5026 166 SLLDLTIVFKAI-EPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDL--GTG-AATLRSS-NPISLGEWRKLRLTRTGR 240 (293)
Q Consensus 166 ~~~~i~~~frt~-~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~--g~~-~~~~~~~-~~~~dg~wh~V~i~r~~~ 240 (293)
..+.|.+.+|+. ...|.||...+... ..++.|.+..++..+.+.. ..+ ...+..+ ..++||+||+|.+...+.
T Consensus 52 ~~fsi~~~~r~~~~~~g~L~si~~~~~--~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal~V~~~ 129 (184)
T smart00210 52 EDFSLLTTFRQTPKSRGVLFAIYDAQN--VRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLALSVSGS 129 (184)
T ss_pred CCeEEEEEEEeCCCCCeEEEEEEcCCC--cEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEEEEeCC
Confidence 578999999987 77899998876533 6689999876554444432 222 2344433 789999999999999999
Q ss_pred EEEEEECCcccee
Q psy5026 241 HAYLQVDRFPSSQ 253 (293)
Q Consensus 241 ~~~l~VD~~~~~~ 253 (293)
+++|.||......
T Consensus 130 ~v~LyvDC~~~~~ 142 (184)
T smart00210 130 SATLYVDCNEIDS 142 (184)
T ss_pred EEEEEECCccccc
Confidence 9999999986543
No 18
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=97.90 E-value=0.0011 Score=54.93 Aligned_cols=131 Identities=18% Similarity=0.130 Sum_probs=76.9
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCC--CCeEEEEeccCCCCCCCeEEEEE-eCCEEEEEEECCCceEEEEecCCCCCCC
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIE--PNGILLYNGHRADGVGDFIALYL-NDRYVDFTFDLGTGAATLRSSNPISLGE 229 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~--~~GlLl~~~~~~~~~~~~l~l~l-~~g~v~~~~~~g~~~~~~~~~~~~~dg~ 229 (293)
.|+.+........+.+++.+.+|+.. .++.||....... .+-+.+.. .++.+.+.+ +.. .+.....+.||+
T Consensus 18 ~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~--~ne~~~~~~~~~~~~l~i--~g~--~~~~~~~~~~g~ 91 (206)
T smart00159 18 SYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQ--DNELLLYKEKQGEYSLYI--GGK--KVQFPVPESDGK 91 (206)
T ss_pred CeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCCC--CCeEEEEEcCCcEEEEEE--cCe--EEEecccccCCc
Confidence 57777543222346889999999864 5566665444332 23333433 345565554 322 333445689999
Q ss_pred eEEEEEEEe--CCEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 230 WRKLRLTRT--GRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 230 wh~V~i~r~--~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
||+|.+..+ .+++.|.|||... ..........+...+.+.||-..+. +..+......|.|
T Consensus 92 W~hvc~tw~~~~g~~~lyvnG~~~-~~~~~~~g~~i~~~G~lvlGq~qd~--~gg~f~~~~~f~G 153 (206)
T smart00159 92 WHHICTTWESSSGIAELWVDGKPG-VRKGLAKGYTVKPGGSIILGQEQDS--YGGGFDATQSFVG 153 (206)
T ss_pred eEEEEEEEECCCCcEEEEECCEEc-ccccccCCcEECCCCEEEEEecccC--CCCCCCCCcceeE
Confidence 999999997 5578999999854 2111111223455667888875543 2223444556766
No 19
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=97.79 E-value=0.0024 Score=52.65 Aligned_cols=117 Identities=17% Similarity=0.069 Sum_probs=70.7
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCC--CCeEEEEeccCCCCCCCeEEEEE-eCCEEEEEEECCCceEEEEecCCCCCCC
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIE--PNGILLYNGHRADGVGDFIALYL-NDRYVDFTFDLGTGAATLRSSNPISLGE 229 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~--~~GlLl~~~~~~~~~~~~l~l~l-~~g~v~~~~~~g~~~~~~~~~~~~~dg~ 229 (293)
.|+.+..........+++.+.+|+.. ..+.||......+ .+.+.+.. ..|++.+.++ .....+ .....+|+
T Consensus 18 ~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~--~~~~~l~~~~~g~~~~~i~--~~~~~~--~~~~~~g~ 91 (201)
T cd00152 18 SYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKGQ--DNELLLYKEKDGGYSLYIG--GKEVTF--KVPESDGA 91 (201)
T ss_pred ceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCCC--CCeEEEEEcCCCeEEEEEc--CEEEEE--eccCCCCC
Confidence 57777543222346889999999764 4566664433322 33444444 3567777653 222222 23459999
Q ss_pred eEEEEEEEe--CCEEEEEECCccceeEecCCCcccccCCCCeEEeccCC
Q psy5026 230 WRKLRLTRT--GRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPD 276 (293)
Q Consensus 230 wh~V~i~r~--~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~ 276 (293)
||+|.+..+ .+.+.|.|||....... ......+...+.+.||-.++
T Consensus 92 W~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~ 139 (201)
T cd00152 92 WHHICVTWESTSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQD 139 (201)
T ss_pred EEEEEEEEECCCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeeccc
Confidence 999999997 55789999998543222 11122345556788886554
No 20
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=97.78 E-value=0.00042 Score=53.76 Aligned_cols=117 Identities=17% Similarity=0.245 Sum_probs=68.8
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCCCC--e-EEEEeccCCCCCCCeEEEEEe-CCEEEEEEECCCc-eEEEEecCCCCC
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIEPN--G-ILLYNGHRADGVGDFIALYLN-DRYVDFTFDLGTG-AATLRSSNPISL 227 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~~~--G-lLl~~~~~~~~~~~~l~l~l~-~g~v~~~~~~g~~-~~~~~~~~~~~d 227 (293)
+|+.++..... ...++|++.||..... . .++. .... .+.+.+.+. ++++.+.+..+.+ ...+.....+.+
T Consensus 10 ~~i~~~~~~~~-~~~fTi~~w~~~~~~~~~~~~~~~-~~~~---~~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 84 (157)
T PF13385_consen 10 DYISIPNSDFP-SGSFTISFWVKPDSPSSSQSFVFM-DSSG---SGGFGLFINNNGRLRFYIGNGGGGNYSFSSDSNLPD 84 (157)
T ss_dssp -EEEEESGGGG-GTEEEEEEEEEESS--SSEEEEEE-SSSS---SEEEEEEEETTSEEEEEETTSEEESS-EE-BS---T
T ss_pred CEEEECCcCCC-CCCEEEEEEEEeCCCCCCceEEEE-ecCC---CCEEEEEEECCCEEEEEEeCCCceeEEEecCcccCC
Confidence 57777653333 4688999999976432 2 3333 1111 346777776 5778877666542 235556678889
Q ss_pred CCeEEEEEEEeCCEEEEEECCccceeEecCCCcccccCCCCeEEeccC
Q psy5026 228 GEWRKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVP 275 (293)
Q Consensus 228 g~wh~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p 275 (293)
++||+|.+...++.+.|.|||........+.. ........++||+..
T Consensus 85 ~~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~-~~~~~~~~~~iG~~~ 131 (157)
T PF13385_consen 85 NKWHHLALTYDGSTVTLYVNGELVGSSTIPSN-ISLNSNGPLFIGGSG 131 (157)
T ss_dssp T-EEEEEEEEETTEEEEEETTEEETTCTEESS-SSTTSCCEEEESS-S
T ss_pred CCEEEEEEEEECCeEEEEECCEEEEeEeccCC-cCCCCcceEEEeecC
Confidence 99999999999999999999985432211111 112345688999866
No 21
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=97.52 E-value=0.0031 Score=51.21 Aligned_cols=105 Identities=16% Similarity=0.172 Sum_probs=68.2
Q ss_pred CCCceeEEEEEEEeC-CCCeEEEEeccCCCCCCCeEEEEEECCEEEEEE-------------------eCCCCCCeEEEE
Q psy5026 3 GSPQAWRFPIQFKPE-SWDGILFLTGERDDLNGDFMTLLIFEGYVEFST-------------------PYRPYADITVHR 62 (293)
Q Consensus 3 ~~~~~~~i~~~FrT~-~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~-------------------~~g~w~~~~~~~ 62 (293)
..+..+.|.+.||+. ...|.||-..+++ ...++.|.+..++..+.+ .+|+||.+.+..
T Consensus 49 ~~~~~fsi~~~~r~~~~~~g~L~si~~~~--~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal~V 126 (184)
T smart00210 49 GLPEDFSLLTTFRQTPKSRGVLFAIYDAQ--NVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLALSV 126 (184)
T ss_pred CCCCCeEEEEEEEeCCCCCeEEEEEEcCC--CcEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEEEE
Confidence 356789999999986 8888999776642 356888888766533322 245899999988
Q ss_pred eccEEEecccCCCCce--------eeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEc
Q psy5026 63 TVRTLILPYTVPSGLF--------SRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDIN 118 (293)
Q Consensus 63 ~~~~v~l~~~g~~~~~--------~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n 118 (293)
....|.|..+...-.. ..++. ..++++|.... ....|.|||+++.+-
T Consensus 127 ~~~~v~LyvDC~~~~~~~l~~~~~~~~~~-~g~~~~g~~~~--------~~~~f~G~lq~l~i~ 181 (184)
T smart00210 127 SGSSATLYVDCNEIDSRPLDRPGQPPIDT-DGIEVRGAQAA--------DRKPFQGDLQQLKIV 181 (184)
T ss_pred eCCEEEEEECCccccceecCCcccccccc-cceEEEeeccC--------CCCcceEEeEEEEEe
Confidence 8888888555431110 12232 23444444321 135899999999873
No 22
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=97.48 E-value=0.004 Score=50.99 Aligned_cols=132 Identities=16% Similarity=0.170 Sum_probs=73.6
Q ss_pred ceeEecCCccceeeEEEEEEEEEeCCC--CeEEEEeccCCCCCCCeEEEEEe-CCEEEEEEECCCceEEEEecCCCCCCC
Q psy5026 153 PDQVYQGLGEQYLSLLDLTIVFKAIEP--NGILLYNGHRADGVGDFIALYLN-DRYVDFTFDLGTGAATLRSSNPISLGE 229 (293)
Q Consensus 153 s~~~~~~~~~~~~~~~~i~~~frt~~~--~GlLl~~~~~~~~~~~~l~l~l~-~g~v~~~~~~g~~~~~~~~~~~~~dg~ 229 (293)
+|+.+........+.+++-+.+|+... .+.||.-+.+.+ .+=+.+... .+.+.+.+ +.....+ ...+.||+
T Consensus 12 ~yv~l~~~~~~pL~~fTvC~w~k~~~~~~~~tifSYat~~~--~nell~~~~~~~~~~l~i--~~~~~~~--~~~~~~~~ 85 (195)
T PF00354_consen 12 DYVRLKPSVPLPLSAFTVCFWVKTDDSSNDGTIFSYATSSQ--DNELLLFGSSSGSLRLYI--NGSSVSF--SGPIRDGQ 85 (195)
T ss_dssp BEEEEEESS-S-BSEEEEEEEEEESGSGS-EEEEEEEETTE--EEEEEEEEETTTEEEEEE--TTEEEEE--EECS-TSS
T ss_pred ceEEEecCCCCCcccEEEEEEEEeccCCCceEEEEEccCCC--CccEEEEEeCCceEEEEE--CCeEeEe--ccccCCCC
Confidence 577775432222578999999999754 778876544332 222333333 46666554 3333333 34578999
Q ss_pred eEEEEEEEe--CCEEEEEECCccceeEecCCCcccccCCCCeEEeccCCCCccCCCCCcCCCceeC
Q psy5026 230 WRKLRLTRT--GRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGK 293 (293)
Q Consensus 230 wh~V~i~r~--~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G~ 293 (293)
||++-+..+ .+.+.+.+||.... .........+...+.+.||=-.+. +..+.....+|.|.
T Consensus 86 Whh~C~tW~s~~G~~~ly~dG~~~~-~~~~~~g~~i~~gG~~vlGQeQd~--~gG~fd~~q~F~G~ 148 (195)
T PF00354_consen 86 WHHICVTWDSSTGRWQLYVDGVRLS-STGLATGHSIPGGGTLVLGQEQDS--YGGGFDESQAFVGE 148 (195)
T ss_dssp -EEEEEEEETTTTEEEEEETTEEEE-EEESSTT--B-SSEEEEESS-BSB--TTBTCSGGGB--EE
T ss_pred cEEEEEEEecCCcEEEEEECCEecc-cccccCCceECCCCEEEECccccc--cCCCcCCccEeeEE
Confidence 999999986 47889999998432 222222334555667778854442 34455556778773
No 23
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=96.90 E-value=0.041 Score=44.52 Aligned_cols=109 Identities=17% Similarity=0.249 Sum_probs=69.2
Q ss_pred EEEEEEEEEeCCCCe--EEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCce--EEEEecC-----CCCCCCeEEEEEEE
Q psy5026 167 LLDLTIVFKAIEPNG--ILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA--ATLRSSN-----PISLGEWRKLRLTR 237 (293)
Q Consensus 167 ~~~i~~~frt~~~~G--lLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~--~~~~~~~-----~~~dg~wh~V~i~r 237 (293)
..+|.++|++...++ -||.+.+...+ ..|+.+++.++++-+.++-..+. .....+. ..++-.||++.+..
T Consensus 34 ~gTI~i~Fk~~~~~~~~sLfsiSn~~~~-n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~a 112 (190)
T PF02973_consen 34 EGTIVIRFKSDSNSGIQSLFSISNSTKG-NEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFVA 112 (190)
T ss_dssp SEEEEEEEEESS-SSEEEEEEEE-TSTT-SEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEEE
T ss_pred ccEEEEEEecCCCcceeEEEEecCCCCc-cceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEEE
Confidence 458889999865544 46766665432 58999999999999988776542 2222222 24556799999999
Q ss_pred e--CCEEEEEECCccceeEecC-CCc-ccccCCCCeEEeccCC
Q psy5026 238 T--GRHAYLQVDRFPSSQILSP-GPF-TQLSLSLSLYLGGVPD 276 (293)
Q Consensus 238 ~--~~~~~l~VD~~~~~~~~~~-~~~-~~l~~~~~lyvGG~p~ 276 (293)
+ +...+|.|||........+ ..+ ..+.--..++|||.-.
T Consensus 113 d~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R 155 (190)
T PF02973_consen 113 DSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNR 155 (190)
T ss_dssp ETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEE
T ss_pred ecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEe
Confidence 7 8899999999644333222 222 1222124799999743
No 24
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=95.12 E-value=1 Score=34.17 Aligned_cols=42 Identities=24% Similarity=0.226 Sum_probs=29.5
Q ss_pred CCeEEEEEEEeC--CEEEEEECCccceeEecCCCcccccCCCCeEEec
Q psy5026 228 GEWRKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGG 273 (293)
Q Consensus 228 g~wh~V~i~r~~--~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG 273 (293)
|+||+|.+..+. ++++|.|||......... ......++.+|.
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~ 104 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGG 104 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEccccccC----CcccCCceEEee
Confidence 899999999987 799999999855322111 112345788883
No 25
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=94.32 E-value=0.07 Score=42.95 Aligned_cols=91 Identities=18% Similarity=0.237 Sum_probs=50.4
Q ss_pred eEEEEEEEEEe--CCCCeEEEEecc--CCCCCCCeEEEEEeCCEEEEEEECCCceEE--------EEecCCCCCCCeEEE
Q psy5026 166 SLLDLTIVFKA--IEPNGILLYNGH--RADGVGDFIALYLNDRYVDFTFDLGTGAAT--------LRSSNPISLGEWRKL 233 (293)
Q Consensus 166 ~~~~i~~~frt--~~~~GlLl~~~~--~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~--------~~~~~~~~dg~wh~V 233 (293)
+.+.|+++||. ....|++|.... ..........+.|.++..........+... .........|+||++
T Consensus 53 ~df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~W~~~ 132 (185)
T PF06439_consen 53 SDFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVNVAIPPGEWNTV 132 (185)
T ss_dssp SSEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS--S--TTSEEEE
T ss_pred ccEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccccccccCCCCceEEE
Confidence 46777777773 344577777661 111113456666654332211111112211 123456888999999
Q ss_pred EEEEeCCEEEEEECCccceeEec
Q psy5026 234 RLTRTGRHAYLQVDRFPSSQILS 256 (293)
Q Consensus 234 ~i~r~~~~~~l~VD~~~~~~~~~ 256 (293)
.|...++++.+.|||........
T Consensus 133 ~I~~~g~~i~v~vnG~~v~~~~d 155 (185)
T PF06439_consen 133 RIVVKGNRITVWVNGKPVADFTD 155 (185)
T ss_dssp EEEEETTEEEEEETTEEEEEEET
T ss_pred EEEEECCEEEEEECCEEEEEEEc
Confidence 99999999999999986654433
No 26
>KOG1834|consensus
Probab=93.92 E-value=0.78 Score=43.59 Aligned_cols=112 Identities=18% Similarity=0.225 Sum_probs=71.3
Q ss_pred eEEEEEEEEEeCC-------CCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCce-EEEEe------cCCCCCCCeE
Q psy5026 166 SLLDLTIVFKAIE-------PNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-ATLRS------SNPISLGEWR 231 (293)
Q Consensus 166 ~~~~i~~~frt~~-------~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~-~~~~~------~~~~~dg~wh 231 (293)
+.|.|+|..|-.. ..-.|+-..++..-...+.+|++.+=++.|-+....+. ...++ -..+||.+||
T Consensus 366 dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~EWH 445 (952)
T KOG1834|consen 366 DHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDNEWH 445 (952)
T ss_pred CceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhhhhh
Confidence 4688888887321 22356655554332367899999988999988775432 22222 2369999999
Q ss_pred EEEEEEeCCEEEEEECCcccee--EecCCCcccccCCCCeEEeccCCC
Q psy5026 232 KLRLTRTGRHAYLQVDRFPSSQ--ILSPGPFTQLSLSLSLYLGGVPDY 277 (293)
Q Consensus 232 ~V~i~r~~~~~~l~VD~~~~~~--~~~~~~~~~l~~~~~lyvGG~p~~ 277 (293)
+-.+..+...+.|.|||..-.. .........-.....|-||...-.
T Consensus 446 ~Y~ln~efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW~g 493 (952)
T KOG1834|consen 446 HYVLNVEFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACWQG 493 (952)
T ss_pred eeEEeecCceEEEEEcCcccCCceeccCCccCcccccceeEEeeeccC
Confidence 9999999999999999973211 111000111124467888887643
No 27
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=93.55 E-value=1.3 Score=35.95 Aligned_cols=110 Identities=22% Similarity=0.324 Sum_probs=61.6
Q ss_pred ceeEEEEEEEeCCCCeE--EEEeccCCCCCCCeEEEEEECCEEEEEEeCC----------------------CCCCeEEE
Q psy5026 6 QAWRFPIQFKPESWDGI--LFLTGERDDLNGDFMTLLIFEGYVEFSTPYR----------------------PYADITVH 61 (293)
Q Consensus 6 ~~~~i~~~FrT~~~~Gl--L~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g----------------------~w~~~~~~ 61 (293)
+...|.++|++..++++ ||-..+.. ....|+.+.+.++.+-+.+... .||.++..
T Consensus 33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~-~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~ 111 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSGIQSLFSISNST-KGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFV 111 (190)
T ss_dssp SSEEEEEEEEESS-SSEEEEEEEE-TS-TTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEE
T ss_pred cccEEEEEEecCCCcceeEEEEecCCC-CccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEE
Confidence 35588999999766664 66665543 4558999999999887766543 24444443
Q ss_pred Ee--ccEEEecccCC--------CCce-eeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeee
Q psy5026 62 RT--VRTLILPYTVP--------SGLF-SRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLY 122 (293)
Q Consensus 62 ~~--~~~v~l~~~g~--------~~~~-~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~ 122 (293)
.. .+.+.|-.+|. .+.+ ..+.--..++|||...... ..-+|.|-|+++.+-+..+
T Consensus 112 ad~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g~------~~y~f~G~I~~l~iYn~aL 177 (190)
T PF02973_consen 112 ADSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAGS------NAYPFNGTIDNLKIYNRAL 177 (190)
T ss_dssp EETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETTE------EES--EEEEEEEEEESS--
T ss_pred EecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCCC------ceecccceEEEEEEEcCcC
Confidence 33 23344433331 1111 1222235789999843321 2468999999999876644
No 28
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=92.84 E-value=1.8 Score=32.91 Aligned_cols=105 Identities=19% Similarity=0.325 Sum_probs=60.0
Q ss_pred CceeEEEEEEEeCCCCe---EEEEeccCCCCCCCeEEEEEE-CCEEEEEEeC----------------CCCCCeEEEEec
Q psy5026 5 PQAWRFPIQFKPESWDG---ILFLTGERDDLNGDFMTLLIF-EGYVEFSTPY----------------RPYADITVHRTV 64 (293)
Q Consensus 5 ~~~~~i~~~FrT~~~~G---lL~~~~~~~~~~~~~~~l~l~-~G~l~~~~~~----------------g~w~~~~~~~~~ 64 (293)
.+.+.|++.||.....+ .++..... .+.+.+.+. +|.+.+.+.. ++||.+.+....
T Consensus 21 ~~~fTi~~w~~~~~~~~~~~~~~~~~~~----~~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~W~~l~~~~~~ 96 (157)
T PF13385_consen 21 SGSFTISFWVKPDSPSSSQSFVFMDSSG----SGGFGLFINNNGRLRFYIGNGGGGNYSFSSDSNLPDNKWHHLALTYDG 96 (157)
T ss_dssp GTEEEEEEEEEESS--SSEEEEEESSSS----SEEEEEEEETTSEEEEEETTSEEESS-EE-BS---TT-EEEEEEEEET
T ss_pred CCCEEEEEEEEeCCCCCCceEEEEecCC----CCEEEEEEECCCEEEEEEeCCCceeEEEecCcccCCCCEEEEEEEEEC
Confidence 46788999999744333 33331111 235566665 4777775443 368888887777
Q ss_pred cEEEecccCCC---Cce---eeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeee
Q psy5026 65 RTLILPYTVPS---GLF---SRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLY 122 (293)
Q Consensus 65 ~~v~l~~~g~~---~~~---~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~ 122 (293)
..+.+-.+|.. ... ........++||+... ....|.|-|.++++=+..+
T Consensus 97 ~~~~lyvnG~~~~~~~~~~~~~~~~~~~~~iG~~~~---------~~~~~~g~i~~~~i~~~aL 151 (157)
T PF13385_consen 97 STVTLYVNGELVGSSTIPSNISLNSNGPLFIGGSGG---------GSSPFNGYIDDLRIYNRAL 151 (157)
T ss_dssp TEEEEEETTEEETTCTEESSSSTTSCCEEEESS-ST---------T--B-EEEEEEEEEESS--
T ss_pred CeEEEEECCEEEEeEeccCCcCCCCcceEEEeecCC---------CCCceEEEEEEEEEECccC
Confidence 77877666631 110 1123456889998752 1478999999999855543
No 29
>KOG3509|consensus
Probab=91.53 E-value=0.3 Score=48.88 Aligned_cols=106 Identities=26% Similarity=0.281 Sum_probs=62.7
Q ss_pred CCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeCC---------------CCCCeEEEEecc------EEEecccCCCC
Q psy5026 18 SWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPYR---------------PYADITVHRTVR------TLILPYTVPSG 76 (293)
Q Consensus 18 ~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g---------------~w~~~~~~~~~~------~v~l~~~g~~~ 76 (293)
..+.+..+... ....+|+++.+..|.+-+.+..+ +|+...+.|... .+.+..+ ..+
T Consensus 266 ~~~~~~~~~~~--~~~~~f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r~s~~~~~g~~~~l~g~-~~~ 342 (964)
T KOG3509|consen 266 HRDILGNFLFS--SFKDGFRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFRGSGLSVSGHKGVLQGN-SNI 342 (964)
T ss_pred ccccccccccc--ccccceeeeccCCCCccccccccCcchhhhhccccccceeeeeEeeecccccccCcceeeccc-ccc
Confidence 34444444443 23457888877777666655544 465555544110 1111100 123
Q ss_pred ceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeeeecc
Q psy5026 77 LFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYNFNV 126 (293)
Q Consensus 77 ~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~~~~ 126 (293)
....+.....+|.||+-+..-+....+...+|.|||+++..+.+.++...
T Consensus 343 ~~~~i~~ee~v~lg~i~ni~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~ 392 (964)
T KOG3509|consen 343 LVSRITNEESVFLGGIINIETLQHNLPLPEGFAGCIRDLVMNLKDLRVTL 392 (964)
T ss_pred cccceeecccccCCceeeeccccccCCCccCccceehhhhhhcccccccc
Confidence 34455566778999865555455556667899999999999998776554
No 30
>KOG3509|consensus
Probab=91.23 E-value=0.24 Score=49.56 Aligned_cols=110 Identities=20% Similarity=0.210 Sum_probs=70.8
Q ss_pred CCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCceEEEEecCCCCCCCeEEEEEEEeCCEEEEEECCccce-eEec
Q psy5026 178 EPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWRKLRLTRTGRHAYLQVDRFPSS-QILS 256 (293)
Q Consensus 178 ~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~~~~~~~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~-~~~~ 256 (293)
..+.++.+...... .+|+++.+.-|.+.+++.-+.....+....+.-.|+|+.+.+.| .-.+.+++.... ....
T Consensus 266 ~~~~~~~~~~~~~~--~~f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r---~s~~~~~g~~~~l~g~~ 340 (964)
T KOG3509|consen 266 HRDILGNFLFSSFK--DGFRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFR---GSGLSVSGHKGVLQGNS 340 (964)
T ss_pred cccccccccccccc--cceeeeccCCCCccccccccCcchhhhhccccccceeeeeEeee---cccccccCcceeecccc
Confidence 34556666655443 78988888777788887777666666667778889999999988 234555553222 2223
Q ss_pred CCCcccccCCCCeEEeccCCCCccCCCCCcCCCcee
Q psy5026 257 PGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 292 (293)
Q Consensus 257 ~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~F~G 292 (293)
......+...+.+|+||+-.-..++...+...||.|
T Consensus 341 ~~~~~~i~~ee~v~lg~i~ni~~l~~~~~~~eGf~g 376 (964)
T KOG3509|consen 341 NILVSRITNEESVFLGGIINIETLQHNLPLPEGFAG 376 (964)
T ss_pred cccccceeecccccCCceeeeccccccCCCccCccc
Confidence 333334566678999985444445555566677776
No 31
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=88.31 E-value=13 Score=30.56 Aligned_cols=114 Identities=14% Similarity=0.154 Sum_probs=60.2
Q ss_pred CceeEEEEEEEeCC--CCeEEE-EeccCCCCCCCeEEEEEECCEEEE-----------EEeCCCCCCeEEEEecc--EEE
Q psy5026 5 PQAWRFPIQFKPES--WDGILF-LTGERDDLNGDFMTLLIFEGYVEF-----------STPYRPYADITVHRTVR--TLI 68 (293)
Q Consensus 5 ~~~~~i~~~FrT~~--~~GlL~-~~~~~~~~~~~~~~l~l~~G~l~~-----------~~~~g~w~~~~~~~~~~--~v~ 68 (293)
-+.+.+.+.+|+.. .++.|| |..+.. ...++...-.++.+.+ .+..|+||.+.+..... .+.
T Consensus 30 l~~fTvc~W~k~~~~~~~~~ifSy~~~~~--~ne~~~~~~~~~~~~l~i~g~~~~~~~~~~~g~W~hvc~tw~~~~g~~~ 107 (206)
T smart00159 30 LQAFTVCLWFYSDLSPRGYSLFSYATKGQ--DNELLLYKEKQGEYSLYIGGKKVQFPVPESDGKWHHICTTWESSSGIAE 107 (206)
T ss_pred hhHEEEEEEEEecCCCCceEEEEEeCCCC--CCeEEEEEcCCcEEEEEEcCeEEEecccccCCceEEEEEEEECCCCcEE
Confidence 35677888888853 556666 333321 1122211112233333 23457899988765533 466
Q ss_pred ecccCCCC------ceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeee
Q psy5026 69 LPYTVPSG------LFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLY 122 (293)
Q Consensus 69 l~~~g~~~------~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~ 122 (293)
+-.+|... ....+...+.+.||.-..... ........|.|.|.++.+=++.+
T Consensus 108 lyvnG~~~~~~~~~~g~~i~~~G~lvlGq~qd~~g--g~f~~~~~f~G~i~~v~iw~~~L 165 (206)
T smart00159 108 LWVDGKPGVRKGLAKGYTVKPGGSIILGQEQDSYG--GGFDATQSFVGEIGDLNMWDSVL 165 (206)
T ss_pred EEECCEEcccccccCCcEECCCCEEEEEecccCCC--CCCCCCcceeEEEeeeEEecccC
Confidence 65555321 112234456778886432211 11223478999999998755433
No 32
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=86.21 E-value=11 Score=31.15 Aligned_cols=60 Identities=10% Similarity=0.095 Sum_probs=42.5
Q ss_pred CCeEEEEEeCCEEEEEEECCC-----ceEEEEecCCCCCCCeEEEEEEEe-----CCEEEEEECCcccee
Q psy5026 194 GDFIALYLNDRYVDFTFDLGT-----GAATLRSSNPISLGEWRKLRLTRT-----GRHAYLQVDRFPSSQ 253 (293)
Q Consensus 194 ~~~l~l~l~~g~v~~~~~~g~-----~~~~~~~~~~~~dg~wh~V~i~r~-----~~~~~l~VD~~~~~~ 253 (293)
...++|.+.++++.+.+..+. ..........+.-|+||.+.+..+ .+.+.+.+||+....
T Consensus 113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~ 182 (224)
T PF14099_consen 113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVD 182 (224)
T ss_dssp EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECCE
T ss_pred CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEEE
Confidence 457999999999999887765 123333455677799999997774 356788889986543
No 33
>KOG3546|consensus
Probab=85.63 E-value=5.1 Score=38.25 Aligned_cols=108 Identities=19% Similarity=0.253 Sum_probs=67.4
Q ss_pred eEEEEEEEEEeCC-CCeEEEEeccCCCCCCCeEEEEE---eCCE--EEEEEEC-CCce-EE-EEecCCCCCCCeEEEEEE
Q psy5026 166 SLLDLTIVFKAIE-PNGILLYNGHRADGVGDFIALYL---NDRY--VDFTFDL-GTGA-AT-LRSSNPISLGEWRKLRLT 236 (293)
Q Consensus 166 ~~~~i~~~frt~~-~~GlLl~~~~~~~~~~~~l~l~l---~~g~--v~~~~~~-g~~~-~~-~~~~~~~~dg~wh~V~i~ 236 (293)
+.|.|.+.+|+.+ .-|+||.+.+..+. .-||-|.| +||+ +.+.|.. |+.. .+ ......+-.++|.++.++
T Consensus 87 rdf~~~~~i~p~s~~~gvlfaitd~~q~-~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a~~ 165 (1167)
T KOG3546|consen 87 RDFSLLFHIRPATEGPGVLFAITDSAQA-MVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALS 165 (1167)
T ss_pred ccceEEEEeeccCCCCceEEEechhhhh-hheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhheeee
Confidence 4678888889764 56888888775431 44666665 3664 4444433 3322 11 122456677999999999
Q ss_pred EeCCEEEEEECCccceeEecCCCcccc--cCCCCeEEecc
Q psy5026 237 RTGRHAYLQVDRFPSSQILSPGPFTQL--SLSLSLYLGGV 274 (293)
Q Consensus 237 r~~~~~~l~VD~~~~~~~~~~~~~~~l--~~~~~lyvGG~ 274 (293)
..+..+.|.||=+....+-..-+.+.| ....-||+|-.
T Consensus 166 v~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~a 205 (1167)
T KOG3546|consen 166 VAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQA 205 (1167)
T ss_pred ecCceEEEEechHHhcccchhccccceeecCCcceEEecc
Confidence 999999999997755433222222333 34456888643
No 34
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=83.90 E-value=22 Score=29.03 Aligned_cols=114 Identities=13% Similarity=0.117 Sum_probs=63.5
Q ss_pred CCceeEEEEEEEeC--CCCeEEE-EeccCCCCCCCeEEEEE-ECCEEEEEE-----------eCCCCCCeEEEEec--cE
Q psy5026 4 SPQAWRFPIQFKPE--SWDGILF-LTGERDDLNGDFMTLLI-FEGYVEFST-----------PYRPYADITVHRTV--RT 66 (293)
Q Consensus 4 ~~~~~~i~~~FrT~--~~~GlL~-~~~~~~~~~~~~~~l~l-~~G~l~~~~-----------~~g~w~~~~~~~~~--~~ 66 (293)
.-+.+.+.+.+|+. ...+.+| |..... .+.+.+.. ..|++.+.+ ..|+||.+.+.... ..
T Consensus 29 ~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~---~~~~~l~~~~~g~~~~~i~~~~~~~~~~~~~g~W~hv~~t~d~~~g~ 105 (201)
T cd00152 29 PLQAFTLCLWVYTDLSTREYSLFSYATKGQ---DNELLLYKEKDGGYSLYIGGKEVTFKVPESDGAWHHICVTWESTSGI 105 (201)
T ss_pred ChhhEEEEEEEEecCCCCCeEEEEEeCCCC---CCeEEEEEcCCCeEEEEEcCEEEEEeccCCCCCEEEEEEEEECCCCc
Confidence 34567888888875 3556666 443321 23333333 335655533 45689998876653 34
Q ss_pred EEecccCCC------CceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeee
Q psy5026 67 LILPYTVPS------GLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLY 122 (293)
Q Consensus 67 v~l~~~g~~------~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~ 122 (293)
+.+-.+|.. .....+...+.+.||....... ........|.|.|.++.+=++.+
T Consensus 106 ~~lyvnG~~~~~~~~~~~~~~~~~g~l~lG~~q~~~g--g~~~~~~~f~G~I~~v~iw~~~L 165 (201)
T cd00152 106 AELWVNGKLSVRKSLKKGYTVGPGGSIILGQEQDSYG--GGFDATQSFVGEISDVNMWDSVL 165 (201)
T ss_pred EEEEECCEEeccccccCCCEECCCCeEEEeecccCCC--CCCCCCcceEEEEceeEEEcccC
Confidence 666555531 1112344556788886532211 11222468999999998755533
No 35
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=83.06 E-value=19 Score=29.37 Aligned_cols=113 Identities=14% Similarity=0.159 Sum_probs=53.0
Q ss_pred ceeEEEEEEEeCCC--CeEEE-EeccCCCCCCCeEEEEEECCEEEEEE-----------eCCCCCCeEEEEec--cEEEe
Q psy5026 6 QAWRFPIQFKPESW--DGILF-LTGERDDLNGDFMTLLIFEGYVEFST-----------PYRPYADITVHRTV--RTLIL 69 (293)
Q Consensus 6 ~~~~i~~~FrT~~~--~GlL~-~~~~~~~~~~~~~~l~l~~G~l~~~~-----------~~g~w~~~~~~~~~--~~v~l 69 (293)
+.+.+.+.+||... .+.|| |+.+.. ...++...-..+.+.+.+ .+|+||.+-+.-.. ..+.+
T Consensus 25 ~~fTvC~w~k~~~~~~~~tifSYat~~~--~nell~~~~~~~~~~l~i~~~~~~~~~~~~~~~Whh~C~tW~s~~G~~~l 102 (195)
T PF00354_consen 25 SAFTVCFWVKTDDSSNDGTIFSYATSSQ--DNELLLFGSSSGSLRLYINGSSVSFSGPIRDGQWHHICVTWDSSTGRWQL 102 (195)
T ss_dssp SEEEEEEEEEESGSGS-EEEEEEEETTE--EEEEEEEEETTTEEEEEETTEEEEEEECS-TSS-EEEEEEEETTTTEEEE
T ss_pred ccEEEEEEEEeccCCCceEEEEEccCCC--CccEEEEEeCCceEEEEECCeEeEeccccCCCCcEEEEEEEecCCcEEEE
Confidence 46888999998664 78888 443321 112222211224444332 24688887664333 35555
Q ss_pred cccCCCC------ceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeee
Q psy5026 70 PYTVPSG------LFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLY 122 (293)
Q Consensus 70 ~~~g~~~------~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~ 122 (293)
..+|... ....+...+.+.||--... ..........|.|=|.++.+=++.+
T Consensus 103 y~dG~~~~~~~~~~g~~i~~gG~~vlGQeQd~--~gG~fd~~q~F~G~i~~~~iWd~vL 159 (195)
T PF00354_consen 103 YVDGVRLSSTGLATGHSIPGGGTLVLGQEQDS--YGGGFDESQAFVGEISDFNIWDRVL 159 (195)
T ss_dssp EETTEEEEEEESSTT--B-SSEEEEESS-BSB--TTBTCSGGGB--EEEEEEEEESS--
T ss_pred EECCEecccccccCCceECCCCEEEECccccc--cCCCcCCccEeeEEEeceEEEeeeC
Confidence 4444311 1122334456666643221 1122334579999999999744433
No 36
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=73.32 E-value=49 Score=32.35 Aligned_cols=83 Identities=20% Similarity=0.247 Sum_probs=48.0
Q ss_pred EEEEEEEEEeC--CCCeEEEEeccCCC------CCCCeEEEEEeCCEEEEEEECCCceEEEE-ecCCCCCCCeEEEEEEE
Q psy5026 167 LLDLTIVFKAI--EPNGILLYNGHRAD------GVGDFIALYLNDRYVDFTFDLGTGAATLR-SSNPISLGEWRKLRLTR 237 (293)
Q Consensus 167 ~~~i~~~frt~--~~~GlLl~~~~~~~------~~~~~l~l~l~~g~v~~~~~~g~~~~~~~-~~~~~~dg~wh~V~i~r 237 (293)
.+.++..+... +.-|+.+...-... ..+-|+.| ..+|.-.+.-++.... .+. ....+..++||++.+..
T Consensus 543 NytVs~DV~ie~~~~ggv~lagRv~~~g~~~~~~~G~~f~v-~~~G~w~vt~d~~~~~-~l~~G~~~~~~~~WhtltL~~ 620 (669)
T PF02057_consen 543 NYTVSCDVYIETPDTGGVFLAGRVNKGGCDVRSARGYFFWV-YANGTWSVTSDLAGTT-TLASGTADIGAGKWHTLTLTI 620 (669)
T ss_dssp EEEEEEEEEE-STTT-EEEEEEEE---GGGGGG-EEEEEEE-ETTTEEEEEEETTS-S-EEEEEE-S--TT-EEEEEEEE
T ss_pred EEEEEEEEEeccCCcCcEEEEEeecccccccCCCCeEEEEE-EcCCcEEEeccCCCcE-EEeeeeecccCCeEEEEEEEE
Confidence 45666555544 34454443322111 11345555 5689888877776433 333 24567789999999999
Q ss_pred eCCEEEEEECCccc
Q psy5026 238 TGRHAYLQVDRFPS 251 (293)
Q Consensus 238 ~~~~~~l~VD~~~~ 251 (293)
++..+.-.+|+...
T Consensus 621 ~g~~~ta~lng~~l 634 (669)
T PF02057_consen 621 SGSTATAMLNGTVL 634 (669)
T ss_dssp ETTEEEEEETTEEE
T ss_pred ECCEEEEEECCEEe
Confidence 99999999999854
No 37
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=72.00 E-value=40 Score=25.35 Aligned_cols=64 Identities=9% Similarity=-0.032 Sum_probs=38.5
Q ss_pred CCCCCeEEEEec--cEEEecccCCC---CceeeeecCCceEEcC-CCCCCCCCCCCCcccCceeeEEEEEEcCeee
Q psy5026 53 RPYADITVHRTV--RTLILPYTVPS---GLFSRITFREPVFVGG-RGNTSGLSDKLPTEKGFKGCIRHLDINDHLY 122 (293)
Q Consensus 53 g~w~~~~~~~~~--~~v~l~~~g~~---~~~~~l~~~~~l~iGG-~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~ 122 (293)
|+||.+.+.... ..+.|-.+|.. ..........++.||. .... ......|.|+|.++++-++.+
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~~~~~~~~~~iG~~~~~~------~~~~~~f~G~Idevriy~~aL 130 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQPSPSSGNLPQGGRILLG------GAGGENFSGRLDEVRVYNRAL 130 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEccccccCCcccCCceEEeeeccCC------CCCCCCceEEeeEEEEecccc
Confidence 689999887665 56777655531 1111123346788884 2111 112368999999999866533
No 38
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=71.12 E-value=46 Score=27.53 Aligned_cols=23 Identities=17% Similarity=0.173 Sum_probs=19.7
Q ss_pred CCeEEEEEEEe--CCEEEEEECCcc
Q psy5026 228 GEWRKLRLTRT--GRHAYLQVDRFP 250 (293)
Q Consensus 228 g~wh~V~i~r~--~~~~~l~VD~~~ 250 (293)
|+||+|.|.++ .+.+.+.+++..
T Consensus 154 g~~~~v~I~Y~~~~~~L~v~l~~~~ 178 (223)
T cd01951 154 GNEHTVRITYDPTTNTLTVYLDNGS 178 (223)
T ss_pred CCEEEEEEEEeCCCCEEEEEECCCC
Confidence 99999999998 478888888763
No 39
>KOG1834|consensus
Probab=64.81 E-value=1.1e+02 Score=29.95 Aligned_cols=117 Identities=13% Similarity=0.162 Sum_probs=75.8
Q ss_pred CCCceeEEEEEEEeC-------CCCeEEEEeccCCCCCCCeEEEEEECCEEEEEEeC--C--------------------
Q psy5026 3 GSPQAWRFPIQFKPE-------SWDGILFLTGERDDLNGDFMTLLIFEGYVEFSTPY--R-------------------- 53 (293)
Q Consensus 3 ~~~~~~~i~~~FrT~-------~~~GlL~~~~~~~~~~~~~~~l~l~~G~l~~~~~~--g-------------------- 53 (293)
...+.|.|+|..|-- ...-.|+-..++.+....+.+|.+.+=+|.|.+.- |
T Consensus 363 ~l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~ 442 (952)
T KOG1834|consen 363 SLPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDN 442 (952)
T ss_pred CCCCceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhh
Confidence 456788999988831 22346677777666667888999999999885432 2
Q ss_pred CCCCeEEEEeccEEEecccCCC---------CceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCe
Q psy5026 54 PYADITVHRTVRTLILPYTVPS---------GLFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDH 120 (293)
Q Consensus 54 ~w~~~~~~~~~~~v~l~~~g~~---------~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~ 120 (293)
+||...+.-....|+|..+|.. ......-+...|-||.-....+.. .+....-|+|-+..+.+-..
T Consensus 443 EWH~Y~ln~efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW~g~~~~-~l~~aqfFrG~LasltlrsG 517 (952)
T KOG1834|consen 443 EWHHYVLNVEFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACWQGRQQK-PLKLAQFFRGQLASLTLRSG 517 (952)
T ss_pred hhheeEEeecCceEEEEEcCcccCCceeccCCccCcccccceeEEeeeccCcccc-chhHHHHhhcccceeEEecc
Confidence 6988887666667887666641 111111234578888776544321 12334679999999888543
No 40
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=62.67 E-value=38 Score=28.40 Aligned_cols=70 Identities=13% Similarity=0.216 Sum_probs=40.3
Q ss_pred EEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEE-ECCCceEEEEecCCCCCCCeEEEEEEEeC--CEEEEE
Q psy5026 169 DLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTF-DLGTGAATLRSSNPISLGEWRKLRLTRTG--RHAYLQ 245 (293)
Q Consensus 169 ~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~-~~g~~~~~~~~~~~~~dg~wh~V~i~r~~--~~~~l~ 245 (293)
.+.++|-|..... ..+.. .+++.|.+ ++...... +.+. .......+.||+||.|.|.++. +.+++.
T Consensus 118 ~vAVEFDT~~N~~----~~d~~---~nHIgI~~-n~~~s~~~~~~~~---~~~~~~~l~~g~~~~v~I~Yd~~~~~L~V~ 186 (236)
T PF00139_consen 118 SVAVEFDTYKNPE----YNDPD---DNHIGIDV-NSVVSNKTASAGY---YSSPSFSLSDGKWHTVWIDYDASTKRLSVY 186 (236)
T ss_dssp EEEEEEETSTCGG----GTTTS---SSEEEEEE-SSSSESEEEE-------EEEEHHHGTTSEEEEEEEEETTTTEEEEE
T ss_pred EEEEEEeeeeccc----ccccC---CCEEEEEC-CCCcccccccccc---cccccccccCCcEEEEEEEEcCCccEEEEE
Confidence 6777888876321 12221 56777765 33222111 1110 0223456889999999999987 577777
Q ss_pred ECCc
Q psy5026 246 VDRF 249 (293)
Q Consensus 246 VD~~ 249 (293)
++..
T Consensus 187 l~~~ 190 (236)
T PF00139_consen 187 LDDN 190 (236)
T ss_dssp EEET
T ss_pred Eecc
Confidence 6665
No 41
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=61.56 E-value=97 Score=26.00 Aligned_cols=25 Identities=4% Similarity=0.015 Sum_probs=19.6
Q ss_pred CCCCCeEEEEEEEeC--CEEEEEECCc
Q psy5026 225 ISLGEWRKLRLTRTG--RHAYLQVDRF 249 (293)
Q Consensus 225 ~~dg~wh~V~i~r~~--~~~~l~VD~~ 249 (293)
+.+|++|++.|.++. +.+.+.++..
T Consensus 160 l~~g~~~~v~I~Y~~~~~~L~V~l~~~ 186 (236)
T cd06899 160 LKSGKPMQAWIDYDSSSKRLSVTLAYS 186 (236)
T ss_pred ccCCCeEEEEEEEcCCCCEEEEEEEeC
Confidence 579999999999984 5666666644
No 42
>KOG1836|consensus
Probab=56.54 E-value=3.2 Score=44.70 Aligned_cols=103 Identities=16% Similarity=0.139 Sum_probs=64.0
Q ss_pred EEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECCCc--eEEEEecCCCCCCCeEEEEEEEeCCEEEEEE
Q psy5026 169 DLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTG--AATLRSSNPISLGEWRKLRLTRTGRHAYLQV 246 (293)
Q Consensus 169 ~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g~~--~~~~~~~~~~~dg~wh~V~i~r~~~~~~l~V 246 (293)
.+.+..+.+...|.|-...... +.+..+....+.+..++..|-. ...+.......++.||.+...+....+.+.+
T Consensus 1559 ~~~~~~~~~~~~~~l~~~~s~~---~~~~~~~~~~~~~~~~~~~gi~~~~~s~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 1635 (1705)
T KOG1836|consen 1559 ALVFSERNVSSTGGLTHHLSKL---GTELLVQENPIGVTEKFESGITDLSTSSTPIVSLLPGGCHSVTSSTDPGVVQLED 1635 (1705)
T ss_pred HhhhcccccccCCCcccccccc---chHHhhhhcccccchhhhhhhhhhhhcchhhhhhcCCcceeeeeecCCccccccc
Confidence 3444444444444443333322 4456666666666666555532 2345556678899999999999999988888
Q ss_pred CCccceeEecCCCcccccCCCCeEEeccCCCC
Q psy5026 247 DRFPSSQILSPGPFTQLSLSLSLYLGGVPDYN 278 (293)
Q Consensus 247 D~~~~~~~~~~~~~~~l~~~~~lyvGG~p~~~ 278 (293)
|.. ...+..........++++||+|.+.
T Consensus 1636 ~~~----~~~~~~~~~~~~~~p~~~~~~~~s~ 1663 (1705)
T KOG1836|consen 1636 DTY----TVGEIPPPPADTQEPIKLGGYPSSL 1663 (1705)
T ss_pred cce----ecccCCCCchhccCCcccCCccccc
Confidence 872 2222222334566899999999864
No 43
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=47.27 E-value=32 Score=27.19 Aligned_cols=70 Identities=10% Similarity=0.069 Sum_probs=39.8
Q ss_pred CCceeEEEEEEEe--CCCCeEEEEec--cCCCCCCCeEEEEEECCE-----------EE------------EEEeCCCCC
Q psy5026 4 SPQAWRFPIQFKP--ESWDGILFLTG--ERDDLNGDFMTLLIFEGY-----------VE------------FSTPYRPYA 56 (293)
Q Consensus 4 ~~~~~~i~~~FrT--~~~~GlL~~~~--~~~~~~~~~~~l~l~~G~-----------l~------------~~~~~g~w~ 56 (293)
.+..++++++||. ....|++|... ...........+.|.++. +. ..+..|+|+
T Consensus 51 ~~~df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~W~ 130 (185)
T PF06439_consen 51 KFSDFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVNVAIPPGEWN 130 (185)
T ss_dssp EBSSEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS--S--TTSEE
T ss_pred ccccEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccccccccCCCCceE
Confidence 5778899999993 34455666555 111123355577776531 11 134455788
Q ss_pred CeEEEEeccEEEecccC
Q psy5026 57 DITVHRTVRTLILPYTV 73 (293)
Q Consensus 57 ~~~~~~~~~~v~l~~~g 73 (293)
.+++......+++..+|
T Consensus 131 ~~~I~~~g~~i~v~vnG 147 (185)
T PF06439_consen 131 TVRIVVKGNRITVWVNG 147 (185)
T ss_dssp EEEEEEETTEEEEEETT
T ss_pred EEEEEEECCEEEEEECC
Confidence 88887777777765555
No 44
>PF07622 DUF1583: Protein of unknown function (DUF1583); InterPro: IPR011475 Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM.
Probab=44.61 E-value=36 Score=30.80 Aligned_cols=32 Identities=22% Similarity=0.508 Sum_probs=28.0
Q ss_pred ecCCCCCCCeEEEEEEEeCCEEEEEECCccce
Q psy5026 221 SSNPISLGEWRKLRLTRTGRHAYLQVDRFPSS 252 (293)
Q Consensus 221 ~~~~~~dg~wh~V~i~r~~~~~~l~VD~~~~~ 252 (293)
...++++..|++|.+.+.++++.|.++++...
T Consensus 83 ~~~~l~~~~wN~v~l~~~g~~v~l~LN~~~i~ 114 (399)
T PF07622_consen 83 PTLPLKVNAWNRVRLQRRGDKVQLHLNGQLIY 114 (399)
T ss_pred CCCCCCccccceEEEEEeCCEEEEEeCCceeE
Confidence 35678999999999999999999999998653
No 45
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=38.80 E-value=2.1e+02 Score=23.18 Aligned_cols=80 Identities=10% Similarity=0.083 Sum_probs=47.8
Q ss_pred EEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEE-EEe-CCEEEEEEECCCceEEEEec-CCCCCCCeEEEEEEEeC--CE
Q psy5026 167 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIAL-YLN-DRYVDFTFDLGTGAATLRSS-NPISLGEWRKLRLTRTG--RH 241 (293)
Q Consensus 167 ~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l-~l~-~g~v~~~~~~g~~~~~~~~~-~~~~dg~wh~V~i~r~~--~~ 241 (293)
...++-+.|-......-.|.++.. ..|+.+ .+. +|.+++.++-.++...+... ..+ ..+|...+.... ..
T Consensus 33 gW~ls~~~RV~~G~~n~~yyAnG~---~r~l~~lsvn~sG~LvA~L~g~ss~~~~~~~~~di--~gyH~Y~i~~~p~~~t 107 (198)
T PF09264_consen 33 GWSLSWESRVVSGGCNTNYYANGS---KRYLPILSVNESGSLVAELEGQSSNTLLATTGADI--HGYHKYEIVFSPLTNT 107 (198)
T ss_dssp -EEEEEEEEEEEES-EEEEEEESS---EEEEEEEEE-TTS-EEEEETTS-S-EEEE-CHHHH--CSEEEEEEEEETTTTE
T ss_pred CcceeeeEEEecCcceeEEEcCCc---eEEEEEEEEcCCCCEEEEEecCCCcEEEecccccc--cceeEEEEEecCCCCc
Confidence 456666666554444444443332 456544 454 67888888776666666654 222 469999999964 89
Q ss_pred EEEEECCccc
Q psy5026 242 AYLQVDRFPS 251 (293)
Q Consensus 242 ~~l~VD~~~~ 251 (293)
+++.|||...
T Consensus 108 ASfy~DG~lI 117 (198)
T PF09264_consen 108 ASFYFDGTLI 117 (198)
T ss_dssp EEEEETTEEE
T ss_pred eEEEECCEEE
Confidence 9999999854
No 46
>PF09191 CD4-extracel: CD4, extracellular; InterPro: IPR015274 This domain adopts an immunoglobulin-like beta-sandwich with seven strands in 2 beta sheets, in a Greek key topology. It is predominantly found in the extracellular portion of CD4 proteins, where it enables interaction with major histocompatibility complex class II antigens []. ; PDB: 1WIQ_B 1WIP_B 1WIO_A 3T0E_E 1CID_A.
Probab=37.60 E-value=1.4e+02 Score=21.58 Aligned_cols=47 Identities=15% Similarity=0.219 Sum_probs=29.8
Q ss_pred EEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEeCCEEEEEEECC
Q psy5026 167 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLG 213 (293)
Q Consensus 167 ~~~i~~~frt~~~~GlLl~~~~~~~~~~~~l~l~l~~g~v~~~~~~g 213 (293)
.+++-+.|......|=|.|.++.......++...++|.++.+.....
T Consensus 16 efSFPL~f~dE~L~GEL~WqaegasS~q~WitFsl~n~kvsv~~~~~ 62 (108)
T PF09191_consen 16 EFSFPLNFEDENLSGELRWQAEGASSSQSWITFSLKNKKVSVQKVTQ 62 (108)
T ss_dssp EEE-----SS-SCEEEEEEEESSSSSS--EEEEEEETTEEEEECEET
T ss_pred EEecccccCccccceEEEEEecCCCCCCCcEEEEEeCCeEEEeecCC
Confidence 45555666667778999998776544468999999999999875443
No 47
>PF11025 GP40: Glycoprotein GP40 of Cryptosporidium; InterPro: IPR021035 This entry represents proteins that are highly conserved in Cryptosporidium spp. Many members are annotated as being a 60 kDa glycoprotein.
Probab=36.35 E-value=1.9e+02 Score=22.03 Aligned_cols=27 Identities=15% Similarity=0.222 Sum_probs=23.3
Q ss_pred CCCCCCeEEEEEEEeCCEEEEEECCcc
Q psy5026 224 PISLGEWRKLRLTRTGRHAYLQVDRFP 250 (293)
Q Consensus 224 ~~~dg~wh~V~i~r~~~~~~l~VD~~~ 250 (293)
.+-.|+-..|++....+.+.+.||+..
T Consensus 46 rYISGev~~VtFeksd~TvkIkvd~ke 72 (165)
T PF11025_consen 46 RYISGEVKSVTFEKSDSTVKIKVDGKE 72 (165)
T ss_pred ceeecceEEEEEeccCCeEEEEECCeE
Confidence 466688999999999999999999874
No 48
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=34.35 E-value=2.7e+02 Score=23.08 Aligned_cols=24 Identities=25% Similarity=0.181 Sum_probs=19.6
Q ss_pred CCCCeEEEEEEEeC--CEEEEEECCc
Q psy5026 226 SLGEWRKLRLTRTG--RHAYLQVDRF 249 (293)
Q Consensus 226 ~dg~wh~V~i~r~~--~~~~l~VD~~ 249 (293)
+.+...++.|++.. ..+++.||+.
T Consensus 149 n~~~p~~iri~Y~~~~~~l~v~vd~~ 174 (215)
T cd06903 149 DSGVPSTIRLSYDALNSLFKVQVDNR 174 (215)
T ss_pred CCCCCEEEEEEEECCCCEEEEEECCC
Confidence 55667889999988 8899999985
No 49
>cd02178 GH16_beta_agarase Beta-agarase, member of glycosyl hydrolase family 16. Beta-agarase is a glycosyl hydrolase family 16 (GH16) member that hydrolyzes the internal beta-1,4-linkage of agarose, a hydrophilic polysaccharide found in the cell wall of Rhodophyceaea, marine red algae. Agarose is a linear chain of galactose units linked by alternating L-alpha-1,3- and D-beta-1,4-linkages that are additionally modified by a 3,6-anhydro-bridge. Agarose forms thermo-reversible gels that are widely used in the food industry or as a laboratory medium. While beta-agarases are also found in two other families derived from the sequence-based classification of glycosyl hydrolases (GH50, and GH86) the GH16 members are most abundant. This domain adopts a curved beta-sandwich conformation, with a tunnel-shaped active site cavity, referred to as a jellyroll fold.
Probab=32.21 E-value=1.4e+02 Score=25.45 Aligned_cols=27 Identities=4% Similarity=0.007 Sum_probs=23.2
Q ss_pred CCCeEEEEEEEe-CCEEEEEECCcccee
Q psy5026 227 LGEWRKLRLTRT-GRHAYLQVDRFPSSQ 253 (293)
Q Consensus 227 dg~wh~V~i~r~-~~~~~l~VD~~~~~~ 253 (293)
++.||+-.+..+ .+.+...|||.....
T Consensus 178 ~~~fHtY~veW~~p~~i~fyvDG~~~~~ 205 (258)
T cd02178 178 ADDFHVYGVYWKDPDTIRFYIDGVLVRT 205 (258)
T ss_pred ccCeEEEEEEEcCCCeEEEEECCEEEEE
Confidence 467999999999 999999999985533
No 50
>PTZ00334 trans-sialidase; Provisional
Probab=30.74 E-value=90 Score=31.29 Aligned_cols=51 Identities=12% Similarity=0.102 Sum_probs=32.3
Q ss_pred CCCCCeEEEEEEE-eCCEEEEEECCccceeEecCCCc-ccccCCCCeEEeccCC
Q psy5026 225 ISLGEWRKLRLTR-TGRHAYLQVDRFPSSQILSPGPF-TQLSLSLSLYLGGVPD 276 (293)
Q Consensus 225 ~~dg~wh~V~i~r-~~~~~~l~VD~~~~~~~~~~~~~-~~l~~~~~lyvGG~p~ 276 (293)
-.-++-|+|.|.. ++++....|||+..-....+... ....+ ..+||||.-.
T Consensus 640 We~~k~yqVal~L~~G~~gsvYVDG~~vg~~~~~l~~~~~~~I-shFyiGgdg~ 692 (780)
T PTZ00334 640 WEPETTHQVAIVLRNGKQGSAYVDGQRVGDASCELKNTDSKGI-SHFYIGGDGG 692 (780)
T ss_pred ccCCCeEEEEEEEeCCCeEEEEECCEEecCcccccCCCCCccc-ceEEECCCcc
Confidence 4557889999887 56799999999854222211111 11222 3799999643
No 51
>PF07081 DUF1349: Protein of unknown function (DUF1349); InterPro: IPR009784 This family consists of several hypothetical bacterial proteins but contains one sequence (P40893 from SWISSPROT) from Saccharomyces cerevisiae. Members of this family are typically around 200 residues in length. The function of this family is unknown.; PDB: 3MEP_B 3O12_A.
Probab=30.47 E-value=2.8e+02 Score=22.17 Aligned_cols=76 Identities=21% Similarity=0.295 Sum_probs=41.3
Q ss_pred EEEEEEEEEeC-CCCeEEEEeccCCCCCCCeEEEEE---eCCEEEEEEEC--CCceEEEEecCCCCCCCeEEEEEEEeCC
Q psy5026 167 LLDLTIVFKAI-EPNGILLYNGHRADGVGDFIALYL---NDRYVDFTFDL--GTGAATLRSSNPISLGEWRKLRLTRTGR 240 (293)
Q Consensus 167 ~~~i~~~frt~-~~~GlLl~~~~~~~~~~~~l~l~l---~~g~v~~~~~~--g~~~~~~~~~~~~~dg~wh~V~i~r~~~ 240 (293)
...++..++.. ++.||+++..+ ..|+...+ .+|...+..-. +-+.-.+..- -.++..-.+.+.|.++
T Consensus 53 ~~~v~~~~~~~YDQaGL~v~~~~-----~~WiK~giE~~~~g~~~l~sV~t~~~SDws~~~~--~~~~~~~~lrv~R~g~ 125 (183)
T PF07081_consen 53 EVKVSGDFKEQYDQAGLMVYQDE-----DNWIKAGIEYSNDGTPRLSSVVTNGYSDWSLSPL--PSDGQSVWLRVERRGD 125 (183)
T ss_dssp EEEEEE---STT-EEEEEEEEET-----TEEEEEEEEE-ETTCEEEEEEEEESSEEEEEEE----SBTTSEEEEEEEETT
T ss_pred EEEEEeCCccceeeEEEEEEECC-----cccEEEEEEEecCCCceEEEEeccCccccccccc--CCCCCEEEEEEEEeCC
Confidence 34444555543 56799999876 45777755 46766654211 2222122211 2345566799999999
Q ss_pred EEEE--EECCc
Q psy5026 241 HAYL--QVDRF 249 (293)
Q Consensus 241 ~~~l--~VD~~ 249 (293)
.+.+ +.||.
T Consensus 126 ~~~~~ys~DG~ 136 (183)
T PF07081_consen 126 DLWIYYSADGK 136 (183)
T ss_dssp EEEEEEESSSS
T ss_pred EEEEEEEcCCC
Confidence 8765 44665
No 52
>PF05910 DUF868: Plant protein of unknown function (DUF868); InterPro: IPR008586 This family consists of several hypothetical proteins from plants. The function of this family is unknown.
Probab=29.49 E-value=1.4e+02 Score=25.84 Aligned_cols=49 Identities=12% Similarity=0.228 Sum_probs=30.9
Q ss_pred CCCCeEEEEEEEe-------CCEEEEEECCccceeEecCCCcccccCCCCeEEeccCC
Q psy5026 226 SLGEWRKLRLTRT-------GRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPD 276 (293)
Q Consensus 226 ~dg~wh~V~i~r~-------~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyvGG~p~ 276 (293)
..|+.|.|.|.-. ...+.+.||++....... ... .+.-+..|+|.|+|-
T Consensus 153 e~G~~HeI~Iec~~~~~g~~dp~l~V~VDgk~v~~Vkr-L~W-kFRGNqti~vdg~~V 208 (274)
T PF05910_consen 153 EGGKEHEISIECGGETGGPKDPELWVSVDGKKVVQVKR-LRW-KFRGNQTIFVDGLPV 208 (274)
T ss_pred CCCcEEEEEEEEeccCCCCCCceEEEEECCEEEEEEEE-eee-cccCceEEEECCeEE
Confidence 3688999999881 237899999986543321 111 123345678888773
No 53
>KOG1836|consensus
Probab=21.74 E-value=85 Score=34.50 Aligned_cols=69 Identities=17% Similarity=0.208 Sum_probs=43.2
Q ss_pred CCCCeEEEEeccEEEecccCC-CC--ceeeeecCCceEEcCCCCCCCCCCCCCcccCceeeEEEEEEcCeeeeec
Q psy5026 54 PYADITVHRTVRTLILPYTVP-SG--LFSRITFREPVFVGGRGNTSGLSDKLPTEKGFKGCIRHLDINDHLYNFN 125 (293)
Q Consensus 54 ~w~~~~~~~~~~~v~l~~~g~-~~--~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~~F~GCi~~~~~n~~~~~~~ 125 (293)
.||.+..++.+..+.+.++.- .+ .....+...++++||.|....... .+...+|.||| +++++...++.
T Consensus 1618 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~~~~~-~~~~~~~~~~~--~~~~~~~~~~~ 1689 (1705)
T KOG1836|consen 1618 GCHSVTSSTDPGVVQLEDDTYTVGEIPPPPADTQEPIKLGGYPSSLTTLR-IAVLKSFTGCI--FVVMGIRVDVT 1689 (1705)
T ss_pred cceeeeeecCCccccccccceecccCCCCchhccCCcccCCcccccccee-eecccccccce--EEecCCCCcHH
Confidence 477777766665555443321 01 112344567999999998655432 33468999999 88888766543
Done!