Query psy9727
Match_columns 257
No_of_seqs 200 out of 562
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 18:02:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9727.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9727hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd06902 lectin_ERGIC-53_ERGL E 100.0 1.7E-46 3.7E-51 331.4 21.7 181 29-256 4-194 (225)
2 PF03388 Lectin_leg-like: Legu 100.0 2.4E-46 5.3E-51 331.2 18.4 178 27-247 2-180 (229)
3 cd06901 lectin_VIP36_VIPL VIP3 100.0 3.8E-45 8.2E-50 327.0 22.3 197 27-230 1-197 (248)
4 cd06903 lectin_EMP46_EMP47 EMP 100.0 8.9E-45 1.9E-49 318.4 20.2 174 29-257 2-184 (215)
5 KOG3839|consensus 100.0 7.5E-45 1.6E-49 329.2 16.7 218 5-229 31-248 (351)
6 KOG3838|consensus 100.0 8.2E-45 1.8E-49 334.1 15.7 202 6-254 12-215 (497)
7 cd07308 lectin_leg-like legume 100.0 1.2E-37 2.6E-42 272.9 21.4 176 29-252 2-184 (218)
8 cd01951 lectin_L-type legume l 99.9 5E-20 1.1E-24 161.5 18.7 96 44-140 12-116 (223)
9 cd06899 lectin_legume_LecRK_Ar 99.7 3.9E-17 8.4E-22 145.2 14.8 117 45-161 12-153 (236)
10 PF00139 Lectin_legB: Legume l 99.6 4.6E-14 9.9E-19 125.3 15.4 95 45-139 12-129 (236)
11 KOG3839|consensus 99.5 1.6E-13 3.5E-18 125.5 10.0 182 44-256 35-237 (351)
12 cd06901 lectin_VIP36_VIPL VIP3 99.4 2.3E-12 5E-17 115.6 11.9 87 167-253 98-190 (248)
13 KOG3838|consensus 98.7 3.3E-09 7.3E-14 99.2 1.7 57 148-204 152-212 (497)
14 cd06902 lectin_ERGIC-53_ERGL E 98.4 4.9E-07 1.1E-11 80.2 7.0 74 150-229 122-197 (225)
15 cd06903 lectin_EMP46_EMP47 EMP 98.0 1.2E-05 2.6E-10 70.9 6.5 41 161-207 130-172 (215)
16 cd07308 lectin_leg-like legume 97.5 0.00021 4.5E-09 62.6 6.6 44 172-221 141-184 (218)
17 cd06900 lectin_VcfQ VcfQ bacte 97.5 0.0012 2.5E-08 59.2 10.5 93 46-139 14-126 (255)
18 PF03388 Lectin_leg-like: Legu 95.2 0.068 1.5E-06 47.4 7.0 33 168-200 101-133 (229)
19 PF02210 Laminin_G_2: Laminin 65.3 41 0.00088 25.3 7.4 57 191-248 20-76 (128)
20 PF06439 DUF1080: Domain of Un 52.6 70 0.0015 26.2 7.2 62 42-110 8-76 (185)
21 smart00282 LamG Laminin G doma 46.0 1.4E+02 0.0029 23.1 7.6 58 190-248 27-84 (135)
22 PF00054 Laminin_G_1: Laminin 44.5 1.1E+02 0.0025 24.0 7.0 57 191-249 20-76 (131)
23 cd06899 lectin_legume_LecRK_Ar 43.5 2.3E+02 0.005 24.9 9.8 24 224-247 161-186 (236)
24 PF08139 LPAM_1: Prokaryotic m 38.6 23 0.0005 20.7 1.4 14 1-14 7-20 (25)
25 cd00110 LamG Laminin G domain; 35.9 2E+02 0.0044 22.2 7.4 56 190-247 46-101 (151)
26 PRK00022 lolB outer membrane l 35.4 1.6E+02 0.0035 25.1 7.0 23 224-246 176-198 (202)
27 PF02248 Como_SCP: Small coat 34.0 35 0.00077 29.4 2.5 24 83-111 113-136 (182)
28 PLN03161 Probable xyloglucan e 33.0 1.4E+02 0.0031 27.6 6.5 18 1-18 1-18 (291)
29 PF04294 VanW: VanW like prote 32.7 67 0.0014 26.2 3.9 31 214-244 99-130 (132)
30 PF11153 DUF2931: Protein of u 32.5 45 0.00098 29.0 3.1 11 1-11 1-11 (216)
31 PF15524 Toxin_45: Putative to 29.6 60 0.0013 24.6 2.8 29 70-98 43-71 (94)
32 PF09224 DUF1961: Domain of un 27.8 4E+02 0.0088 23.7 8.1 63 44-113 16-86 (218)
33 PF02402 Lysis_col: Lysis prot 27.5 26 0.00057 23.3 0.5 22 1-23 1-22 (46)
34 cd02182 GH16_Strep_laminarinas 26.7 1.4E+02 0.0031 26.5 5.4 60 53-113 47-117 (259)
35 PF08787 Alginate_lyase2: Algi 26.6 2E+02 0.0044 25.3 6.2 60 189-248 120-186 (236)
36 PRK09993 C-lysozyme inhibitor; 24.0 20 0.00043 30.1 -0.7 80 1-81 6-87 (153)
37 TIGR01655 yxeA_fam conserved h 20.4 72 0.0016 25.2 1.9 18 1-18 1-18 (114)
No 1
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC). ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) may act as a regulator of ERGIC-53. In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII. Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=100.00 E-value=1.7e-46 Score=331.40 Aligned_cols=181 Identities=35% Similarity=0.667 Sum_probs=156.7
Q ss_pred eeeeeeeCCccCC-CCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEE
Q psy9727 29 KRHHSMVKPYLTS-GLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAF 107 (257)
Q Consensus 29 ~~~~Sf~~P~~~~-~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAf 107 (257)
.-+|||.+||+.. +..+++|++.|+|++.+++|||||+.++++|+||++.|++.++|||||+|||+|.++ .+||||||
T Consensus 4 ~~~~s~~~P~l~~~~~~i~~W~~~G~t~~~~~~IrLTp~~~~~~G~iw~~~~~~~~~w~ie~~Fri~g~~~-~~gdG~a~ 82 (225)
T cd06902 4 EYKYSFKGPHLAQKDGTVPFWSHGGDAIASLEQVRLTPSLRSKKGSVWTKNPFSFENWEVEVTFRVTGRGR-IGADGLAI 82 (225)
T ss_pred eeeecccCcccccCCCCCCceEecccEEecCCEEEECCCCCCCEEEEeeCCCcCCCCEEEEEEEEEecCCC-CCCCEEEE
Confidence 3469999999986 468999999999999999999999999999999999999999999999999999875 67899999
Q ss_pred EEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCC
Q psy9727 108 WYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHN 187 (257)
Q Consensus 108 W~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~ 187 (257)
||++++.+.|++||.+++|+||||+|||++|.... .
T Consensus 83 W~t~~~~~~G~~~G~~~~f~Gl~I~~Dt~~n~~~~---~----------------------------------------- 118 (225)
T cd06902 83 WYTKERGEEGPVFGSSDKWNGVGIFFDSFDNDGKK---N----------------------------------------- 118 (225)
T ss_pred EEECCCCCCCCccCCCCcccEEEEEEECCCCCCCC---C-----------------------------------------
Confidence 99999988999999999999999999999886421 2
Q ss_pred ccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeC---------CcccCCCccc
Q psy9727 188 HNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVN---------DLLSKGPFIN 256 (257)
Q Consensus 188 ~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~---------~~~~~~~~~~ 256 (257)
.|+|+++++||+.+|+++.|+..+++++|.++|||+.+|+++||+|.+++|+|++|+ ..||+-.-|.
T Consensus 119 --~p~i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~rn~~~p~~~rI~Y~~~~l~V~~d~~~~~~~~~~~~Cf~~~~v~ 194 (225)
T cd06902 119 --NPAILVVGNDGTKSYDHQNDGLTQALGSCLRDFRNKPYPVRAKITYYQNVLTVSINNGFTPNKDDYELCTRVENMV 194 (225)
T ss_pred --CcEEEEEECCCCeeccccCCCcccccceEEEeccCCCCCeEEEEEEECCeEEEEEeCCcCCCCCcccEEEecCCee
Confidence 345555555555566666666667899999999999999999999999999999996 6687655544
No 2
>PF03388 Lectin_leg-like: Legume-like lectin family; InterPro: IPR005052 Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences. ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold. ; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=100.00 E-value=2.4e-46 Score=331.16 Aligned_cols=178 Identities=46% Similarity=0.871 Sum_probs=139.7
Q ss_pred ceeeeeeeeCCccCCC-CCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeE
Q psy9727 27 FLKRHHSMVKPYLTSG-LSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGM 105 (257)
Q Consensus 27 ~~~~~~Sf~~P~~~~~-~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGl 105 (257)
.++++|||.+||+.++ ..+++|++.|+|++.+++|||||+.++++|+||++.|++.++|+++++|||+|.++..+||||
T Consensus 2 ~~~~~~S~~~P~~~~~~~~i~~W~~~G~t~i~~~~IrLTp~~~~~~G~iws~~~~~~~~w~i~~~Fri~g~~~~~~g~G~ 81 (229)
T PF03388_consen 2 RFKESHSFSPPFLDNGDNEIPNWDIGGSTVITDNFIRLTPDRQSQSGSIWSRKPIPFDNWEIEFTFRISGQEKGLGGDGM 81 (229)
T ss_dssp EEEGGGEEESSSCSCTTSCBTTEEEEET-EEESSEEEEE-SSTTEEEEEEESS-BEESEEEEEEEEEEESS-SSS-S-EE
T ss_pred cccccCCcCCCcCcCCCCccCCEEECCeEEecCCEEEECCCcccCEEEEEEcCCCCccCEEEEEEEEEeccccCcCCCeE
Confidence 4678899999999876 578999999999999999999999999999999999999999999999999998666789999
Q ss_pred EEEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCC
Q psy9727 106 AFWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGE 185 (257)
Q Consensus 106 AfW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~ 185 (257)
|||||+++.+.|++||.+++|+||||+||||+|++.++.+.+|.|++++|||++.|
T Consensus 82 a~W~t~~~~~~G~~fG~~~~f~Gl~i~idt~~N~~~~~~~~~p~i~~~~NDGt~~~------------------------ 137 (229)
T PF03388_consen 82 AFWYTKDPGSDGPVFGGPDKFDGLGIFIDTYDNDEGGHKRGFPYISAMLNDGTKSY------------------------ 137 (229)
T ss_dssp EEEEESSSSSSCSBTTB-SS-EEEEEEEEES-TTCTTCTSTSSEEEEEEEESSS--------------------------
T ss_pred EEEEEcCccccccccCCCcccceEEEEEEcccCCCcccccccceEEEEecCCCccc------------------------
Confidence 99999999999999999999999999999999986443334566666666665555
Q ss_pred CCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCC
Q psy9727 186 HNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVND 247 (257)
Q Consensus 186 ~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~ 247 (257)
+++.||..+++++|.+.|||+++|+++||+|.++.|+|.+|.+
T Consensus 138 -------------------~~~~dg~~~~~~~C~~~~rn~~~p~~~ri~Y~~~~l~v~id~~ 180 (229)
T PF03388_consen 138 -------------------DHDNDGKDQSLGSCSADYRNSDVPTRIRISYSKNTLTVSIDSN 180 (229)
T ss_dssp --------------------GGGTTTTT-SEEEE---BTESSEEEEEEEEETTEEEEEEETS
T ss_pred -------------------cccccCcccccccceeccCcCCCCEEEEEEEECCeEEEEEecc
Confidence 4555555567888888888888888888888888888888855
No 3
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER). Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=100.00 E-value=3.8e-45 Score=326.95 Aligned_cols=197 Identities=49% Similarity=0.951 Sum_probs=176.5
Q ss_pred ceeeeeeeeCCccCCCCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEE
Q psy9727 27 FLKRHHSMVKPYLTSGLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMA 106 (257)
Q Consensus 27 ~~~~~~Sf~~P~~~~~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlA 106 (257)
+++|+|||.+||+..+..+++|.+.|+|++.+++|||||+.++++|++|++.|+++++|+++|+|||++.++..+|||||
T Consensus 1 ~~~~~~sL~~Py~~~~~~i~~w~~~G~a~v~~~~IrLTp~~~~~~G~~w~~~p~~~~~F~~~f~F~I~~~~~~~~GdGlA 80 (248)
T cd06901 1 YLKREHSLIKPYQGVGSSMPLWDFLGSTMVTSQYIRLTPDHQSKQGSIWNRVPCYLRDWEMHVHFKVHGSGKNLFGDGFA 80 (248)
T ss_pred CccccccccCCeecCCCCCCCEEEcceEEEcCCeEEECCCCCCCEEEEeccCCccCCCEEEEEEEEEeCCCCCCCCCEEE
Confidence 57899999999999887799999999999999999999999999999999999999999999999999987667899999
Q ss_pred EEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCC
Q psy9727 107 FWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEH 186 (257)
Q Consensus 107 fW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~ 186 (257)
|||++++++.|++||.+++|+||||+||||+|..+.+.+.+|+|++++|||++.|+.++||..+++++|...+||..++
T Consensus 81 fw~t~~~~~~G~~fG~~~~f~Gl~I~~Dt~~n~~~~~~~~~P~i~~~~NDGt~~yd~~~Dg~~~~~~~C~~~~rn~~~~- 159 (248)
T cd06901 81 IWYTKERMQPGPVFGSKDNFHGLAIFFDTYSNQNGEHEHVHPYISAMVNNGSLSYDHDRDGTHTELAGCSAPFRNKDHD- 159 (248)
T ss_pred EEEEcCCCccCcccccCCCCceEEEEEECCCCCCCcccCCCceEEEEEcCCCeeecccCCCchhhcCceeeeccCCCCC-
Confidence 9999999999999999999999999999999986555567899999999999999999999999999999999987765
Q ss_pred CccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccE
Q psy9727 187 NHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTH 230 (257)
Q Consensus 187 ~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~ 230 (257)
+.++|.|.++.+.++.+.++.+ +|..|.. .++...|..
T Consensus 160 ----t~~rI~Y~~~~l~v~vd~~~~~-~w~~Cf~-~~~v~LP~~ 197 (248)
T cd06901 160 ----TFVAIRYSKGRLTVMTDIDGKN-EWKECFD-VTGVRLPTG 197 (248)
T ss_pred ----eEEEEEEECCeEEEEEecCCCC-ceeeeEE-eCCeecCCC
Confidence 6788888888888888888776 7999944 365556653
No 4
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=100.00 E-value=8.9e-45 Score=318.40 Aligned_cols=174 Identities=23% Similarity=0.425 Sum_probs=149.2
Q ss_pred eeeeeeeCCccCC----CCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCC-cEEEEEEEEEEccCcccCCC
Q psy9727 29 KRHHSMVKPYLTS----GLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMN-NWEVQITLKIHGKGKELFGD 103 (257)
Q Consensus 29 ~~~~Sf~~P~~~~----~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~-~w~ie~~Fri~g~~~~~~gd 103 (257)
..++|| |.+.. +..+++|++.|+|++.+++|||||+ ++++|+||++.|++.+ +||+|++|||+|+++ .+||
T Consensus 2 ~~~~s~--p~l~~~~~~~~~i~~W~~~G~t~v~~~~IrLTp~-~s~~G~iWs~~pl~~~~~w~ie~~Fri~G~~~-~~gd 77 (215)
T cd06903 2 NKDLSL--PNLLKISPNGKLIPNWQTSGNPKLESGRIILTPP-GNQRGSLWLKKPLSLKDEWTIEWTFRSTGPEG-RSGG 77 (215)
T ss_pred cccccC--CchhccccCCCCCCCeEEcCcEEeeCCeEEECCC-CCceEeEeeCCcCCCCCCEEEEEEEEecccCC-cCCC
Confidence 457888 54333 2589999999999999999999999 9999999999999999 999999999999875 6899
Q ss_pred eEEEEEeecCCCCC--ccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccC
Q psy9727 104 GMAFWYVRDRMEGG--PVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSN 181 (257)
Q Consensus 104 GlAfW~t~~~~~~G--~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn 181 (257)
|||||||+++.+.+ ++||++++|+||||+|||++|. .|.|++|+|||
T Consensus 78 Gla~W~t~~~~~~~g~~~fG~~~~f~Gl~I~~Dt~~n~-------~p~i~~~~NDG------------------------ 126 (215)
T cd06903 78 GLNFWLVKDGNADVGTSSIYGPSKFDGLQLLIDNNGGS-------GGSLRGFLNDG------------------------ 126 (215)
T ss_pred EEEEEEECCCcccCCccccCCCCCCcEEEEEEECCCCC-------CceEEEEECCC------------------------
Confidence 99999999987654 8999999999999999999874 14455555555
Q ss_pred CCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeC--CeEEEEEeCCcccCCCcccC
Q psy9727 182 HNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYED--ENLTVSGVNDLLSKGPFINL 257 (257)
Q Consensus 182 ~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~--~~l~v~~d~~~~~~~~~~~~ 257 (257)
+..|+++.| ..+++++|.++|||+.+|+++||+|.+ +.|+|+||+++||+-.-|+|
T Consensus 127 -------------------t~~yd~~~d-~~~~~g~C~~~~rn~~~p~~iri~Y~~~~~~l~v~vd~~~Cf~~~~v~l 184 (215)
T cd06903 127 -------------------SKDYKNEDV-DSLAFGSCLFAYQDSGVPSTIRLSYDALNSLFKVQVDNRLCFQTDKVQL 184 (215)
T ss_pred -------------------CeeccccCC-cccccceeeEeccCCCCCEEEEEEEECCCCEEEEEECCCEEEecCCeec
Confidence 455566666 567999999999999999999999999 88999999999998776654
No 5
>KOG3839|consensus
Probab=100.00 E-value=7.5e-45 Score=329.21 Aligned_cols=218 Identities=43% Similarity=0.788 Sum_probs=200.6
Q ss_pred hHHHHHHHHHHHHhhhccCcccceeeeeeeeCCccCCCCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCc
Q psy9727 5 LFTVIALIWQVVAQEAQWNTEDFLKRHHSMVKPYLTSGLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNN 84 (257)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~Sf~~P~~~~~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~ 84 (257)
+++++.+.+.++......+...+++||||+.+||+..+...++|++.|++++++++|||||+.|++.|+||++.|+..++
T Consensus 31 ~~~ll~~~~~v~g~~~~~~g~~ylkrehSL~~Py~g~g~r~~~W~~~Gstvv~~~~irLT~d~qsk~GAv~n~~Pv~s~~ 110 (351)
T KOG3839|consen 31 LSNLLILPGFVLGSQVEGSGYEYLKREHSLSKPYQGVGGRQPNWNLSGSTVVTSNYIRLTPDEQSKSGAVWNRQPVFSRD 110 (351)
T ss_pred hhHhhhhhheeecccccCcchhheeehhhcccccccccccccCccccccEEEEeeeeeccccccccccccccCCCccccc
Confidence 45566666666666666555689999999999999999899999999999999999999999999999999999999999
Q ss_pred EEEEEEEEEEccCcccCCCeEEEEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCC
Q psy9727 85 WEVQITLKIHGKGKELFGDGMAFWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHD 164 (257)
Q Consensus 85 w~ie~~Fri~g~~~~~~gdGlAfW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~ 164 (257)
||+.+.|+|+|++++++|||||+||++++.+.||+||..++|+||+|++|||.|+.+.+.|++|.+++|+|+|+.+||++
T Consensus 111 wev~v~fkv~~~s~~lfgdG~Aiw~t~Er~q~GPvFG~~dkF~GL~vfidtY~n~~g~~~~v~P~isvmv~~gs~sYD~~ 190 (351)
T KOG3839|consen 111 WEVLVHFKVHGQSKNLFGDGMAIWYTKERAQPGPVFGSKDKFTGLAVFIDTYGNHNGPHEHVFPYISVMVNIGSLSYDHS 190 (351)
T ss_pred eeEEEEEEEecCCCcccccceEEEeecccccCCCCCCCcccceeEEEEEeccCCcCCCcccceeeEEEEeccCCcccccC
Confidence 99999999999998899999999999999999999999999999999999999998888899999999999999999999
Q ss_pred CCCcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCcc
Q psy9727 165 MDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDT 229 (257)
Q Consensus 165 ~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~ 229 (257)
.||+++++++|.+.+||.++. +.+.++|+..++++..|.++.+ +|..|.- +++-..|+
T Consensus 191 ~Dg~~tel~gc~a~~rn~~~d-----t~~~iry~~~~l~~~~dl~~~~-~~~~c~~-~n~v~lp~ 248 (351)
T KOG3839|consen 191 KDGTHTELAGCTANFRNLPHD-----TLVVIRYEKKTLSISIDLEGPN-EWIDCFS-LNNVELPL 248 (351)
T ss_pred CCCCccccccceeeeccCCCc-----ceeEEEecCCceEEEEecCCCc-eeeeeee-ecceeccc
Confidence 999999999999999999884 8899999999999999999997 9999974 55555553
No 6
>KOG3838|consensus
Probab=100.00 E-value=8.2e-45 Score=334.06 Aligned_cols=202 Identities=33% Similarity=0.629 Sum_probs=171.5
Q ss_pred HHHHHHHHHHHHhhhccC-cccceeeeeeeeCCccCCC-CCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCC
Q psy9727 6 FTVIALIWQVVAQEAQWN-TEDFLKRHHSMVKPYLTSG-LSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMN 83 (257)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~Sf~~P~~~~~-~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~ 83 (257)
+++|++|.+.+...+.-+ +.....-.|||++|++... .++|+|...|+|+...+.|||+|+.++++|++|+|..++++
T Consensus 12 ~~lLllLa~~~~~~~~~~~~~rrFEYK~SFk~P~Laq~dgtiPFW~~~GdAIas~eqvRlaPSmrsrkGavWtka~~~fe 91 (497)
T KOG3838|consen 12 CALLLLLAPHVPETGCGTPPHRRFEYKYSFKGPRLAQPDGTIPFWSHHGDAIASSEQVRLAPSMRSRKGAVWTKASVPFE 91 (497)
T ss_pred HHHHHHccCcCcccccCCCccceeeeeecccCCccccCCCCcceeeecCcccccccceeeccccccccCceeecccCCcc
Confidence 455555555555533332 3334455699999999874 68999999999999999999999999999999999999999
Q ss_pred cEEEEEEEEEEccCcccCCCeEEEEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecC
Q psy9727 84 NWEVQITLKIHGKGKELFGDGMAFWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDH 163 (257)
Q Consensus 84 ~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~ 163 (257)
+|+||+.|||+|.++ .++||||||||...+..|++||+.+.|+|++|+||+++|+..
T Consensus 92 ~weVev~~rVtGrGR-iGAdGlaiWYt~~~G~~GpVfGg~d~WnGigiffDSfdnD~q---------------------- 148 (497)
T KOG3838|consen 92 NWEVEVQFRVTGRGR-IGADGLAIWYTRGRGHVGPVFGGLDSWNGIGIFFDSFDNDGQ---------------------- 148 (497)
T ss_pred cceEEEEEEeccccc-ccCCceEEEEecCCCcccccccccccccceEEEeecccccCC----------------------
Confidence 999999999999998 789999999999999999999999999999999999998743
Q ss_pred CCCCcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEE
Q psy9727 164 DMDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVS 243 (257)
Q Consensus 164 ~~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~ 243 (257)
.+.|+|+++.+||+.+|||..||..|.+++|...|||+++|+++||+|.+++|+|+
T Consensus 149 ------------------------knnP~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~ 204 (497)
T KOG3838|consen 149 ------------------------KNNPAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVM 204 (497)
T ss_pred ------------------------cCCccEEEEecCCcccccCCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEE
Confidence 22455566666666667777777788999999999999999999999999999999
Q ss_pred EeCCcccCCCc
Q psy9727 244 GVNDLLSKGPF 254 (257)
Q Consensus 244 ~d~~~~~~~~~ 254 (257)
|+|++.-.+.|
T Consensus 205 innGmtp~d~y 215 (497)
T KOG3838|consen 205 INNGMTPSDDY 215 (497)
T ss_pred EcCCCCCCCCc
Confidence 99998776543
No 7
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins. This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex. These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein. Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=100.00 E-value=1.2e-37 Score=272.86 Aligned_cols=176 Identities=36% Similarity=0.682 Sum_probs=144.3
Q ss_pred eeeeeeeCCccCC-CCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEE
Q psy9727 29 KRHHSMVKPYLTS-GLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAF 107 (257)
Q Consensus 29 ~~~~Sf~~P~~~~-~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAf 107 (257)
+++|||.+||+.+ +..+++|.+.|+|.+.+++|||||+.++++|++|++.|+..++|+++|+|+|++.. ..+||||||
T Consensus 2 ~~~~Sl~~P~~~~~~~~~~~w~~~G~a~~~~~~i~LT~~~~~~~G~~~~~~pi~~~~F~~~f~F~i~~~~-~~~gdG~af 80 (218)
T cd07308 2 ISEHSLSPPFLDDNDGEIGNWTVGGSTVITKNYIRLTPDVPSQSGSLWSRVPIPAKDFEIEVEFSIHGGS-GLGGDGFAF 80 (218)
T ss_pred ccccccCCCcccCCCCccCCeEEcCCeEEeCCEEEeCCCCCCCEeEEEeCCCccCCCEEEEEEEEEeCCC-CCCCCEEEE
Confidence 5689999999987 46889999999999999999999999999999999999999999999999999874 357999999
Q ss_pred EEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCC
Q psy9727 108 WYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHN 187 (257)
Q Consensus 108 W~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~ 187 (257)
||++++...|++||.++.|+||||+|||++|.. +..|+|++++|||+++|+.++|+
T Consensus 81 ~~~~~~~~~g~~~G~~~~~~Glai~fdt~~n~~----~~~p~i~~~~Ndg~~~~~~~~d~-------------------- 136 (218)
T cd07308 81 WYTEEPGSDGPLFGGPDKFKGLAIFFDTYDNDG----KGFPSISVFLNDGTKSYDYETDG-------------------- 136 (218)
T ss_pred EEECCCCCCCcccccCCCCCEEEEEEEcCCCCC----CCCCeEEEEEeCCCceecccCCC--------------------
Confidence 999998899999999999999999999999862 13466777777766666554444
Q ss_pred ccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEE------eCCcccCC
Q psy9727 188 HNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSG------VNDLLSKG 252 (257)
Q Consensus 188 ~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~------d~~~~~~~ 252 (257)
..+++++|..++|+.++|+++||+|.++.|+|++ +...|++-
T Consensus 137 -----------------------~~~~~~~c~~~~~~~~~~~~~~I~y~~~~l~v~i~~~~~~~~~~c~~~ 184 (218)
T cd07308 137 -----------------------EKLELASCSLKFRNSNAPTTLRISYLNNTLKVDITYSEGNNWKECFTV 184 (218)
T ss_pred -----------------------ccccccceeEecccCCCCeEEEEEEECCEEEEEEeCCCCCCccEEEEc
Confidence 4446666777777777777777777766677777 44555543
No 8
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=99.85 E-value=5e-20 Score=161.55 Aligned_cols=96 Identities=27% Similarity=0.479 Sum_probs=84.7
Q ss_pred CCCCEEEeeeEEEe--CCEEEECCCCCCcEEEEEecCCCCC-CcEEEEEEEEEEccCcccCCCeEEEEEeecCCC----C
Q psy9727 44 SIPYWDVHGFALAS--SNYVRLTADLQSRYGAIWNTVPVYM-NNWEVQITLKIHGKGKELFGDGMAFWYVRDRME----G 116 (257)
Q Consensus 44 ~i~~W~~~G~a~v~--~~~IrLTp~~~~~~G~iWs~~p~~~-~~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~~~----~ 116 (257)
..++|.+.|+|.+. ++.|||||+.+++.|++|+++|+++ .+|+++|+|+|.+.. ..+|||||||+++++.. .
T Consensus 12 ~~~~~~~~G~A~~~~~~~~l~Lt~~~~~~~G~~~~~~~i~~~~~F~~~F~f~i~~~~-~~~gdG~aF~l~~~~~~~~~~~ 90 (223)
T cd01951 12 NQSNWQLNGSATLTTDSGVLRLTPDTGNQAGSAWYKTPIDLSKDFTTTFKFYLGTKG-TNGADGIAFVLQNDPAGALGGG 90 (223)
T ss_pred ChhhcEEcccEEecCCCCEEEECCCCCCcEEEEEECCcEeccCCEEEEEEEEEeCCC-CCCCCcEEEEEecCCCCccccC
Confidence 35789999999998 8999999999999999999999999 899999999999875 35799999999987532 2
Q ss_pred C--ccCCCCCCCcEEEEEEEccCCCC
Q psy9727 117 G--PVFGNKDFFSGLGVIIDTYSNHN 140 (257)
Q Consensus 117 G--~~~G~~~~f~GlaV~~DT~~n~~ 140 (257)
| ..+|......+|||+||||.|.+
T Consensus 91 g~~~~lG~~~~~~~~aVefDT~~N~~ 116 (223)
T cd01951 91 GGGGGLGYGGIGNSVAVEFDTYKNDD 116 (223)
T ss_pred CCCCccCccccCCeEEEEEeccccCC
Confidence 2 67888889999999999999863
No 9
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=99.74 E-value=3.9e-17 Score=145.19 Aligned_cols=117 Identities=22% Similarity=0.284 Sum_probs=91.5
Q ss_pred CCCEEEeeeEEEe-CCEEEECCCC--CCcEEEEEecCCCCC--------CcEEEEEEEEEEccCcccCCCeEEEEEeecC
Q psy9727 45 IPYWDVHGFALAS-SNYVRLTADL--QSRYGAIWNTVPVYM--------NNWEVQITLKIHGKGKELFGDGMAFWYVRDR 113 (257)
Q Consensus 45 i~~W~~~G~a~v~-~~~IrLTp~~--~~~~G~iWs~~p~~~--------~~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~ 113 (257)
..+|.+.|+|.+. ++.|+|||+. +++.|++|+++|+++ .+|+++|+|+|++.....+|||||||+++++
T Consensus 12 ~~~l~l~G~A~~~~~~~i~LT~~~~~~~~~G~v~y~~pi~l~~~~~~~~~sFst~F~F~i~~~~~~~~gdGlAF~i~~~~ 91 (236)
T cd06899 12 QSNLTLQGDATISSNGALQLTNDTSPASSVGRALYSKPVRLWDSTTGKVASFSTSFSFSITPPNPSLGGDGLAFFLAPTD 91 (236)
T ss_pred CCCEEEecceEcCCCCeEEecCCCCCCcceEEEEeCCCEEeecCCCCCceeEEEEEEEEEEcCCCCCCCCeEEEEEecCC
Confidence 5689999999998 8999999998 899999999999975 3799999999998754467999999999875
Q ss_pred C----CCCccCCCCC-------CCcEEEEEEEccCCCC---CCCCCCCCceEEEEeccceee
Q psy9727 114 M----EGGPVFGNKD-------FFSGLGVIIDTYSNHN---GEHNHNHPYLSAMVNNGSLHY 161 (257)
Q Consensus 114 ~----~~G~~~G~~~-------~f~GlaV~~DT~~n~~---~~~~~~~p~i~~~~NdGt~~y 161 (257)
. ..|+.+|... ...+|||+|||+.|.. +..+|..++++.+.+..+..|
T Consensus 92 ~~~~~~~G~~lG~~~~~~~~~~~~~~vAVEFDT~~n~~~~D~~~nHigIdvn~~~S~~~~~~ 153 (236)
T cd06899 92 SLPPASSGGYLGLFNSSNNGNSSNHIVAVEFDTFQNPEFGDPDDNHVGIDVNSLVSVKAGYW 153 (236)
T ss_pred CCCCCCCcceeeeecCCCCCCcccceEEEEeecccCcccCCCCCCeEEEEcCCcccceeecc
Confidence 4 3567777542 5678999999999864 233455555555444433333
No 10
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=99.58 E-value=4.6e-14 Score=125.25 Aligned_cols=95 Identities=24% Similarity=0.370 Sum_probs=79.0
Q ss_pred CCCEEEeeeEEE-eCCEEEECCCCC-CcEEEEEecCCCCC--------CcEEEEEEEEEEcc--CcccCCCeEEEEEeec
Q psy9727 45 IPYWDVHGFALA-SSNYVRLTADLQ-SRYGAIWNTVPVYM--------NNWEVQITLKIHGK--GKELFGDGMAFWYVRD 112 (257)
Q Consensus 45 i~~W~~~G~a~v-~~~~IrLTp~~~-~~~G~iWs~~p~~~--------~~w~ie~~Fri~g~--~~~~~gdGlAfW~t~~ 112 (257)
..+|.+.|+|.+ .++.|+||++.. .+.|++|+++|+.+ .+|+++|+|+|... ....+||||||.++++
T Consensus 12 ~~~~~l~G~A~~~~~~~l~LT~~~~~~~~G~~~y~~pi~l~d~~~~~~~sF~t~F~f~i~~~~~~~~~~~dGlAFvi~~~ 91 (236)
T PF00139_consen 12 SSNLTLNGDASISSNGSLQLTPDSTNNQAGRAWYNNPIQLWDSTTGNVASFSTSFSFSITNGPGSSNNGGDGLAFVIQPD 91 (236)
T ss_dssp GTTEEEEETEEEETTSEEESSTBETSSEEEEEEESSEEESBETTTTEBEEEEEEEEEEEEESSSSSSS-BEEEEEEEEET
T ss_pred CCceEEEeeEEeccCCeEEcCCCCCCCcEEEEEECCcEEEeCCCCcceeeeeeEEEEEEeccCCCCccCCCceEEEEecC
Confidence 489999999998 789999999998 99999999999988 68999999999643 2245789999999998
Q ss_pred CC----CCCccCCCCC-------CCcEEEEEEEccCCC
Q psy9727 113 RM----EGGPVFGNKD-------FFSGLGVIIDTYSNH 139 (257)
Q Consensus 113 ~~----~~G~~~G~~~-------~f~GlaV~~DT~~n~ 139 (257)
+. ..|..+|... ...-|||+|||+.|.
T Consensus 92 ~~~~~~~~g~~lG~~~~~~~~~~~~~~vAVEFDT~~N~ 129 (236)
T PF00139_consen 92 PNLPGGSSGGYLGLFNSSTDGNGINNSVAVEFDTYKNP 129 (236)
T ss_dssp TSSTTTSSGGGTTTSSSSSTTGGGGCEEEEEEETSTCG
T ss_pred cccccCCCCCccCccccccCCCccCcEEEEEEeeeecc
Confidence 63 2556677533 467899999999975
No 11
>KOG3839|consensus
Probab=99.48 E-value=1.6e-13 Score=125.48 Aligned_cols=182 Identities=37% Similarity=0.539 Sum_probs=129.7
Q ss_pred CCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCc-------ccCCCeEEEEEeecC---
Q psy9727 44 SIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGK-------ELFGDGMAFWYVRDR--- 113 (257)
Q Consensus 44 ~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~-------~~~gdGlAfW~t~~~--- 113 (257)
.+-.|.+.|+.....++..|+... +..+..|+.. .....|++--++.|+.... +..| |+|-..+-
T Consensus 35 l~~~~~v~g~~~~~~g~~ylkreh-SL~~Py~g~g-~r~~~W~~~Gstvv~~~~irLT~d~qsk~G---Av~n~~Pv~s~ 109 (351)
T KOG3839|consen 35 LILPGFVLGSQVEGSGYEYLKREH-SLSKPYQGVG-GRQPNWNLSGSTVVTSNYIRLTPDEQSKSG---AVWNRQPVFSR 109 (351)
T ss_pred hhhhheeecccccCcchhheeehh-hccccccccc-ccccCccccccEEEEeeeeecccccccccc---ccccCCCcccc
Confidence 455788899988877777776543 3566667666 6667898888887775421 1112 55543321
Q ss_pred -----------CCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCC
Q psy9727 114 -----------MEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNH 182 (257)
Q Consensus 114 -----------~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~ 182 (257)
.+.+.+|| +|+||+.---....+ |.. | .-...+.|+...++|.|.
T Consensus 110 ~wev~v~fkv~~~s~~lfg-----dG~Aiw~t~Er~q~G------PvF------G-------~~dkF~GL~vfidtY~n~ 165 (351)
T KOG3839|consen 110 DWEVLVHFKVHGQSKNLFG-----DGMAIWYTKERAQPG------PVF------G-------SKDKFTGLAVFIDTYGNH 165 (351)
T ss_pred ceeEEEEEEEecCCCcccc-----cceEEEeecccccCC------CCC------C-------CcccceeEEEEEeccCCc
Confidence 12334454 467777643322211 211 1 112356788999999998
Q ss_pred CCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCcccCCCccc
Q psy9727 183 NGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDLLSKGPFIN 256 (257)
Q Consensus 183 ~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~~~~~~~~~ 256 (257)
..+....+||+++|+++|.++||++.||+++++++|.+.|||.++.|...|+|.+++|++++| +..+..+||
T Consensus 166 ~g~~~~v~P~isvmv~~gs~sYD~~~Dg~~tel~gc~a~~rn~~~dt~~~iry~~~~l~~~~d--l~~~~~~~~ 237 (351)
T KOG3839|consen 166 NGPHEHVFPYISVMVNIGSLSYDHSKDGTHTELAGCTANFRNLPHDTLVVIRYEKKTLSISID--LEGPNEWID 237 (351)
T ss_pred CCCcccceeeEEEEeccCCcccccCCCCCccccccceeeeccCCCcceeEEEecCCceEEEEe--cCCCceeee
Confidence 766556699999999999999999999999999999999999999999999999999999999 556666654
No 12
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER). Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=99.40 E-value=2.3e-12 Score=115.64 Aligned_cols=87 Identities=56% Similarity=0.865 Sum_probs=75.6
Q ss_pred CcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEe-
Q psy9727 167 GTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGV- 245 (257)
Q Consensus 167 G~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d- 245 (257)
...+++|...+++.|.........|+|+++++||...||++.||..+++++|.++|||+++|+++||+|.++.|+|++|
T Consensus 98 ~~f~Gl~I~~Dt~~n~~~~~~~~~P~i~~~~NDGt~~yd~~~Dg~~~~~~~C~~~~rn~~~~t~~rI~Y~~~~l~v~vd~ 177 (248)
T cd06901 98 DNFHGLAIFFDTYSNQNGEHEHVHPYISAMVNNGSLSYDHDRDGTHTELAGCSAPFRNKDHDTFVAIRYSKGRLTVMTDI 177 (248)
T ss_pred CCCceEEEEEECCCCCCCcccCCCceEEEEEcCCCeeecccCCCchhhcCceeeeccCCCCCeEEEEEEECCeEEEEEec
Confidence 3678899999999988644335789999999999999999999998899999999999999999999999999988887
Q ss_pred -C----CcccCCC
Q psy9727 246 -N----DLLSKGP 253 (257)
Q Consensus 246 -~----~~~~~~~ 253 (257)
+ ..||+-.
T Consensus 178 ~~~~~w~~Cf~~~ 190 (248)
T cd06901 178 DGKNEWKECFDVT 190 (248)
T ss_pred CCCCceeeeEEeC
Confidence 3 3666553
No 13
>KOG3838|consensus
Probab=98.73 E-value=3.3e-09 Score=99.19 Aligned_cols=57 Identities=30% Similarity=0.501 Sum_probs=52.8
Q ss_pred CceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCCccCcE----EEEEEeCCeeEe
Q psy9727 148 PYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHNHNHPY----LSAMVNNGSLHY 204 (257)
Q Consensus 148 p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~~~~~~----i~i~~~~G~~~~ 204 (257)
|.|++++|||+..||+..||..+.|++|..+|||+++|+|.|++| +++|++||..+-
T Consensus 152 P~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~innGmtp~ 212 (497)
T KOG3838|consen 152 PAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVMINNGMTPS 212 (497)
T ss_pred ccEEEEecCCcccccCCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEEEcCCCCCC
Confidence 789999999999999999999999999999999999999999998 788888886544
No 14
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC). ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) may act as a regulator of ERGIC-53. In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII. Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=98.44 E-value=4.9e-07 Score=80.16 Aligned_cols=74 Identities=16% Similarity=0.203 Sum_probs=59.9
Q ss_pred eEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCC--CCcceeecccccccCCCC
Q psy9727 150 LSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMD--GTHTQLAGCECKLRNLNH 227 (257)
Q Consensus 150 i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~d--g~~~~~g~C~~~~rn~~~ 227 (257)
|++++|||++.|+.++|+..+++++|...|||.+.| ..++|.|..+.++++.+.+ +...+|..|... ++...
T Consensus 122 i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~rn~~~p-----~~~rI~Y~~~~l~V~~d~~~~~~~~~~~~Cf~~-~~v~L 195 (225)
T cd06902 122 ILVVGNDGTKSYDHQNDGLTQALGSCLRDFRNKPYP-----VRAKITYYQNVLTVSINNGFTPNKDDYELCTRV-ENMVL 195 (225)
T ss_pred EEEEECCCCeeccccCCCcccccceEEEeccCCCCC-----eEEEEEEECCeEEEEEeCCcCCCCCcccEEEec-CCeeC
Confidence 789999999999999999999999999999998765 6788888888898888742 333479999743 44445
Q ss_pred cc
Q psy9727 228 DT 229 (257)
Q Consensus 228 ~~ 229 (257)
|.
T Consensus 196 P~ 197 (225)
T cd06902 196 PP 197 (225)
T ss_pred CC
Confidence 53
No 15
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=98.01 E-value=1.2e-05 Score=70.87 Aligned_cols=41 Identities=7% Similarity=0.109 Sum_probs=34.6
Q ss_pred ecCCCCCcccceeeEeeeccCCCCCCCccCcEEEEEEeC--CeeEeecC
Q psy9727 161 YDHDMDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNN--GSLHYDHD 207 (257)
Q Consensus 161 y~~~~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~--G~~~~d~~ 207 (257)
|+++.| ..+++|+|...|||...| +.|+|.|.. +.+++++|
T Consensus 130 yd~~~d-~~~~~g~C~~~~rn~~~p-----~~iri~Y~~~~~~l~v~vd 172 (215)
T cd06903 130 YKNEDV-DSLAFGSCLFAYQDSGVP-----STIRLSYDALNSLFKVQVD 172 (215)
T ss_pred ccccCC-cccccceeeEeccCCCCC-----EEEEEEEECCCCEEEEEEC
Confidence 567888 899999999999998765 778888877 88888875
No 16
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins. This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex. These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein. Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=97.54 E-value=0.00021 Score=62.56 Aligned_cols=44 Identities=20% Similarity=0.165 Sum_probs=35.3
Q ss_pred eeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeeccccc
Q psy9727 172 LAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECK 221 (257)
Q Consensus 172 l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~ 221 (257)
+++|..++|+... ...++|.|.++.+.++.+....+ .|..|...
T Consensus 141 ~~~c~~~~~~~~~-----~~~~~I~y~~~~l~v~i~~~~~~-~~~~c~~~ 184 (218)
T cd07308 141 LASCSLKFRNSNA-----PTTLRISYLNNTLKVDITYSEGN-NWKECFTV 184 (218)
T ss_pred ccceeEecccCCC-----CeEEEEEEECCEEEEEEeCCCCC-CccEEEEc
Confidence 6899999998633 47788899999999999866554 88999765
No 17
>cd06900 lectin_VcfQ VcfQ bacterial pilus biogenesis protein, lectin domain. This family includes bacterial proteins homologous to the VcfQ (also known as MshQ) bacterial pilus biogenesis protein. VcfQ is encoded by the vcfQ gene of the type IV pilus gene cluster of Vibrio cholerae and is essential for type IV pilus assembly. VcfQ has a Laminin G-like domain as well as an L-type lectin domain.
Probab=97.48 E-value=0.0012 Score=59.19 Aligned_cols=93 Identities=25% Similarity=0.251 Sum_probs=67.9
Q ss_pred CCEEEeeeE-----EEeCCEEEECCCCCCcEEEEEecCCCCCCc--EEEEEEEEEEccCcccCCCeEEEEEeecC--CC-
Q psy9727 46 PYWDVHGFA-----LASSNYVRLTADLQSRYGAIWNTVPVYMNN--WEVQITLKIHGKGKELFGDGMAFWYVRDR--ME- 115 (257)
Q Consensus 46 ~~W~~~G~a-----~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~--w~ie~~Fri~g~~~~~~gdGlAfW~t~~~--~~- 115 (257)
.+|.+.... .+.++++|||+...++.|++....+++.++ -+|||+..+.+.+ ..++|||||+|.+-. .+
T Consensus 14 ~~W~v~~~~~s~~P~i~~g~LRLT~~~~nqata~~~~~~FPs~~n~v~veFd~yayg~~-g~GADGia~vLsDasv~p~~ 92 (255)
T cd06900 14 NDWVVISSGGSFTPQVVNNRLRLTDASGNQATAVTLQRLFPSAGNYVEVEFDYYAYGSG-GNGADGVALVLSDASVTPQA 92 (255)
T ss_pred CceEEEecCCccCCccccCeEEeccCccCcceeEEEeeeeccCCCeEEEEEEEEEecCC-CCCCceEEEEEeCCCcCCcC
Confidence 467764432 245899999999999999999999999864 8999999998853 357999999998642 12
Q ss_pred --CCccCCCCC------CCcE--EEEEEEccCCC
Q psy9727 116 --GGPVFGNKD------FFSG--LGVIIDTYSNH 139 (257)
Q Consensus 116 --~G~~~G~~~------~f~G--laV~~DT~~n~ 139 (257)
-|.-+|+.. -|.| |+|-||-|.|-
T Consensus 93 G~fGGsLGYa~~~~~~~GfaGGwLGiGlDEyGNF 126 (255)
T cd06900 93 GAFGGSLGYAQRNDGVPGFAGGWLGIGLDEYGNF 126 (255)
T ss_pred CCcCcccccccccCCCCccccceEEEEEeccccc
Confidence 333344322 2333 78999988774
No 18
>PF03388 Lectin_leg-like: Legume-like lectin family; InterPro: IPR005052 Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins []. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences[]. ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin []. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway [,]. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold [, , , ]. ; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=95.17 E-value=0.068 Score=47.42 Aligned_cols=33 Identities=45% Similarity=0.880 Sum_probs=26.9
Q ss_pred cccceeeEeeeccCCCCCCCccCcEEEEEEeCC
Q psy9727 168 THTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNG 200 (257)
Q Consensus 168 ~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G 200 (257)
...+++...++|.|...+.+...|+|+++++||
T Consensus 101 ~f~Gl~i~idt~~N~~~~~~~~~p~i~~~~NDG 133 (229)
T PF03388_consen 101 KFDGLGIFIDTYDNDEGGHKRGFPYISAMLNDG 133 (229)
T ss_dssp S-EEEEEEEEES-TTCTTCTSTSSEEEEEEEES
T ss_pred ccceEEEEEEcccCCCcccccccceEEEEecCC
Confidence 567899999999998766557889999999999
No 19
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=65.32 E-value=41 Score=25.28 Aligned_cols=57 Identities=14% Similarity=0.158 Sum_probs=40.4
Q ss_pred cEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCc
Q psy9727 191 PYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDL 248 (257)
Q Consensus 191 ~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~ 248 (257)
.|+.+.+.+|++.+..+..+.....-.=... -+.+...++.|++..+.+++.||+..
T Consensus 20 ~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~-~~dg~wh~v~i~~~~~~~~l~Vd~~~ 76 (128)
T PF02210_consen 20 DFLSLELVDGRLVVRYNLGGSEIVTTFSNSN-LNDGQWHKVSISRDGNRVTLTVDGQS 76 (128)
T ss_dssp EEEEEEEETTEEEEEEESSSSEEEEEECSSS-STSSSEEEEEEEEETTEEEEEETTSE
T ss_pred EEEEEEEECCEEEEEEEccccceeeeccCcc-ccccceeEEEEEEeeeeEEEEecCcc
Confidence 5888888999999888776332111111111 35667789999999999999999763
No 20
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=52.65 E-value=70 Score=26.16 Aligned_cols=62 Identities=18% Similarity=0.315 Sum_probs=38.6
Q ss_pred CCCCCCEEEeee------EEEeCCEEEECCC-CCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEEEEe
Q psy9727 42 GLSIPYWDVHGF------ALASSNYVRLTAD-LQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAFWYV 110 (257)
Q Consensus 42 ~~~i~~W~~~G~------a~v~~~~IrLTp~-~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAfW~t 110 (257)
+.....|...+. ..+.++.|+ +.. .....|.|.++. ...+|+++++||+...+ ..|+.|-..
T Consensus 8 g~~l~gW~~~~~~~~~~~~~v~dG~l~-~~~~~~~~~~~l~~~~--~~~df~l~~d~k~~~~~----~sGi~~r~~ 76 (185)
T PF06439_consen 8 GKDLDGWKIYGGGWFEGGWSVKDGVLV-SNGSSGSGGGYLYTDK--KFSDFELEVDFKITPGG----NSGIFFRAQ 76 (185)
T ss_dssp SSCGTTEEETTSSSETTTEEEETTEEE--GGGGESSS--EEESS--EBSSEEEEEEEEE-TT-----EEEEEEEES
T ss_pred CCCHHHCeeCCCCccccCcEeeCCEEE-ecccCCCCcceEEECC--ccccEEEEEEEEECCCC----CeEEEEEec
Confidence 345678887632 357788777 222 225678899885 78999999999994321 346666666
No 21
>smart00282 LamG Laminin G domain.
Probab=46.01 E-value=1.4e+02 Score=23.10 Aligned_cols=58 Identities=17% Similarity=0.183 Sum_probs=40.4
Q ss_pred CcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCc
Q psy9727 190 HPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDL 248 (257)
Q Consensus 190 ~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~ 248 (257)
..++.+.+.+|++.+..+..+..... .....--+-+.-.+++|++..+.+++++|+..
T Consensus 27 ~~~l~l~l~~g~l~~~~~~g~~~~~~-~~~~~~~~dg~WH~v~i~~~~~~~~l~VD~~~ 84 (135)
T smart00282 27 GDYLALELRDGRLVLRYDLGSGPARL-TSDPTPLNDGQWHRVAVERNGRRVTLSVDGEN 84 (135)
T ss_pred CCEEEEEEECCEEEEEEECCCCCEEE-EECCeEeCCCCEEEEEEEEeCCEEEEEECCCc
Confidence 46888899999999988875432111 11112224456688999999999999999753
No 22
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=44.48 E-value=1.1e+02 Score=23.95 Aligned_cols=57 Identities=11% Similarity=0.131 Sum_probs=44.1
Q ss_pred cEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCcc
Q psy9727 191 PYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDLL 249 (257)
Q Consensus 191 ~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~~ 249 (257)
.|+.+...+|++.+..+..+.. ..-.+... -+-+.-.++.+...+...++++|+...
T Consensus 20 dfial~L~~G~l~~~~~~G~~~-~~~~~~~~-i~dg~wh~v~~~r~~~~~~L~Vd~~~~ 76 (131)
T PF00054_consen 20 DFIALELRDGRLEFRYNLGSGP-ASLRSPQK-INDGKWHTVSVSRNGRNGSLSVDGEEV 76 (131)
T ss_dssp SEEEEEEETTEEEEEEESSSEE-EEEEESSE-TTSSSEEEEEEEEETTEEEEEETTSEE
T ss_pred CEEEEEEECCEEEEEEeCCCcc-ceecCCCc-cCCCcceEEEEEEcCcEEEEEECCccc
Confidence 7999999999999998875542 22223333 677788899999999999999998764
No 23
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=43.47 E-value=2.3e+02 Score=24.90 Aligned_cols=24 Identities=17% Similarity=0.232 Sum_probs=19.6
Q ss_pred CCCCccEEEEEEe--CCeEEEEEeCC
Q psy9727 224 NLNHDTHIAIRYE--DENLTVSGVND 247 (257)
Q Consensus 224 n~~~~~~~~i~Y~--~~~l~v~~d~~ 247 (257)
..+...+++|.|. .++|+|.++..
T Consensus 161 ~~g~~~~v~I~Y~~~~~~L~V~l~~~ 186 (236)
T cd06899 161 KSGKPMQAWIDYDSSSKRLSVTLAYS 186 (236)
T ss_pred cCCCeEEEEEEEcCCCCEEEEEEEeC
Confidence 4567889999999 56799999864
No 24
>PF08139 LPAM_1: Prokaryotic membrane lipoprotein lipid attachment site; InterPro: IPR012640 In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognises a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [,]. This lipid attachment site is found in homologues of the VirB proteins of type IV secretion systems (T4SS). Conjugal transfer across the cell envelope of Gram-negative bacteria is mediated by a supramolecular structure termed mating pair formation (Mpf) complex. Collectively, secretion pathways ancestrally related to bacterial conjugation systems are now known as T4SS. T4SS are involved in the delivery of effector molecules to eukaryotic target cells; each of these systems exports distinct DNA or protein substrates to effect a myriad of changes in host cell physiology during infection [].
Probab=38.57 E-value=23 Score=20.75 Aligned_cols=14 Identities=50% Similarity=0.683 Sum_probs=10.2
Q ss_pred ChhhhHHHHHHHHH
Q psy9727 1 MFKILFTVIALIWQ 14 (257)
Q Consensus 1 ~~~~~~~~~~~~~~ 14 (257)
|+|||+.+++++.+
T Consensus 7 mKkil~~l~a~~~L 20 (25)
T PF08139_consen 7 MKKILFPLLALFML 20 (25)
T ss_pred HHHHHHHHHHHHHH
Confidence 37888888877653
No 25
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=35.88 E-value=2e+02 Score=22.16 Aligned_cols=56 Identities=13% Similarity=0.164 Sum_probs=39.4
Q ss_pred CcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCC
Q psy9727 190 HPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVND 247 (257)
Q Consensus 190 ~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~ 247 (257)
..++.+...+|++.+..+..+.. ..-.+.. --+-+.-.++.|++..+.+++.||+.
T Consensus 46 ~~~~~l~l~~g~l~~~~~~g~~~-~~~~~~~-~v~dg~Wh~v~i~~~~~~~~l~VD~~ 101 (151)
T cd00110 46 GDFLALELEDGRLVLRYDLGSGS-LVLSSKT-PLNDGQWHSVSVERNGRSVTLSVDGE 101 (151)
T ss_pred CCEEEEEEECCEEEEEEcCCccc-EEEEccC-ccCCCCEEEEEEEECCCEEEEEECCc
Confidence 46788888899988888775322 2222222 23445668999999999999999986
No 26
>PRK00022 lolB outer membrane lipoprotein LolB; Provisional
Probab=35.39 E-value=1.6e+02 Score=25.14 Aligned_cols=23 Identities=0% Similarity=-0.039 Sum_probs=17.8
Q ss_pred CCCCccEEEEEEeCCeEEEEEeC
Q psy9727 224 NLNHDTHIAIRYEDENLTVSGVN 246 (257)
Q Consensus 224 n~~~~~~~~i~Y~~~~l~v~~d~ 246 (257)
....|.++++++-+-++++-|++
T Consensus 176 ~~~LP~ki~l~~~~~~ikl~i~~ 198 (202)
T PRK00022 176 QPALPRRLELTRGELRIKLVVDQ 198 (202)
T ss_pred CCCCCeeEEEeCCCeEEEEEEcc
Confidence 34689999999877678887764
No 27
>PF02248 Como_SCP: Small coat protein; InterPro: IPR003182 The virus capsid is composed 60 icosahedral units, each of which is composed of one copy of each of the two coat proteins. This family contains the small coat protein (SCP) [] of the comoviridae viral family.; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 1PGW_1 1PGL_1 1BMV_1 1NY7_1 2BFU_S.
Probab=33.99 E-value=35 Score=29.39 Aligned_cols=24 Identities=21% Similarity=0.635 Sum_probs=16.3
Q ss_pred CcEEEEEEEEEEccCcccCCCeEEEEEee
Q psy9727 83 NNWEVQITLKIHGKGKELFGDGMAFWYVR 111 (257)
Q Consensus 83 ~~w~ie~~Fri~g~~~~~~gdGlAfW~t~ 111 (257)
..||++|++.|-|+ -|||++|=..
T Consensus 113 ~s~el~Fs~eVvGP-----n~GFem~~s~ 136 (182)
T PF02248_consen 113 HSGELEFSREVVGP-----NDGFEMWWSD 136 (182)
T ss_dssp SCEEEEEEEEE-BT-----CCC-B-TT-T
T ss_pred CceeEEeeEEEEcC-----Cchhhhcccc
Confidence 47999999999997 4888877544
No 28
>PLN03161 Probable xyloglucan endotransglucosylase/hydrolase protein; Provisional
Probab=32.95 E-value=1.4e+02 Score=27.63 Aligned_cols=18 Identities=22% Similarity=0.178 Sum_probs=10.3
Q ss_pred ChhhhHHHHHHHHHHHHh
Q psy9727 1 MFKILFTVIALIWQVVAQ 18 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~ 18 (257)
|+.+|-+|++++++..+.
T Consensus 1 ~~~~~~~~~~~~~~~~~~ 18 (291)
T PLN03161 1 MASLKTLLVALFAALAAF 18 (291)
T ss_pred ChhHHHHHHHHHHHHHhc
Confidence 666665666655555444
No 29
>PF04294 VanW: VanW like protein; InterPro: IPR007391 Members of this family include vancomycin resistance protein W (VanW). Genes encoding members of this family have been found in vancomycin resistance gene clusters vanB [] and vanG []. The function of VanW is unknown.
Probab=32.69 E-value=67 Score=26.21 Aligned_cols=31 Identities=23% Similarity=0.166 Sum_probs=27.2
Q ss_pred eeecccccccC-CCCccEEEEEEeCCeEEEEE
Q psy9727 214 QLAGCECKLRN-LNHDTHIAIRYEDENLTVSG 244 (257)
Q Consensus 214 ~~g~C~~~~rn-~~~~~~~~i~Y~~~~l~v~~ 244 (257)
.++.-..+||| .+.|..++..+.+++|+|+|
T Consensus 99 ~~~~~Dlkf~N~~~~~i~i~~~~~~~~l~v~i 130 (132)
T PF04294_consen 99 AYGYLDLKFRNPTDYPILIQASVDGGKLTVEI 130 (132)
T ss_pred ecCCcceEEEcCCCCeEEEEEEEeCCEEEEEE
Confidence 56678889997 58999999999999999987
No 30
>PF11153 DUF2931: Protein of unknown function (DUF2931); InterPro: IPR021326 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function.
Probab=32.54 E-value=45 Score=28.98 Aligned_cols=11 Identities=45% Similarity=0.615 Sum_probs=7.4
Q ss_pred ChhhhHHHHHH
Q psy9727 1 MFKILFTVIAL 11 (257)
Q Consensus 1 ~~~~~~~~~~~ 11 (257)
|+++|+++++|
T Consensus 1 mk~i~~l~l~l 11 (216)
T PF11153_consen 1 MKKILLLLLLL 11 (216)
T ss_pred ChHHHHHHHHH
Confidence 77777776444
No 31
>PF15524 Toxin_45: Putative toxin 45
Probab=29.58 E-value=60 Score=24.57 Aligned_cols=29 Identities=17% Similarity=0.503 Sum_probs=23.4
Q ss_pred cEEEEEecCCCCCCcEEEEEEEEEEccCc
Q psy9727 70 RYGAIWNTVPVYMNNWEVQITLKIHGKGK 98 (257)
Q Consensus 70 ~~G~iWs~~p~~~~~w~ie~~Fri~g~~~ 98 (257)
+.|-.|.+-|-....++.|++...+..++
T Consensus 43 kFGNeWtkgPsrtkgq~fEWDVQLS~~G~ 71 (94)
T PF15524_consen 43 KFGNEWTKGPSRTKGQEFEWDVQLSETGR 71 (94)
T ss_pred cccceeccCCCccCCcceEEEeeeCHhHH
Confidence 67999999998888777777777776654
No 32
>PF09224 DUF1961: Domain of unknown function (DUF1961); InterPro: IPR015305 Members of this family are found in a set of hypothetical bacterial proteins. Their exact function has not, as yet, been determined. ; PDB: 1OQ1_C.
Probab=27.79 E-value=4e+02 Score=23.70 Aligned_cols=63 Identities=11% Similarity=0.242 Sum_probs=41.4
Q ss_pred CCCCEEEeeeEEEe--CCEEEECCC------CCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEEEEeecC
Q psy9727 44 SIPYWDVHGFALAS--SNYVRLTAD------LQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAFWYVRDR 113 (257)
Q Consensus 44 ~i~~W~~~G~a~v~--~~~IrLTp~------~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~ 113 (257)
.+..|.+.|.+.++ ++.++|... .+...=-+|.+.-++ ++..|+++|+...+ -|||+.+..-.
T Consensus 16 dv~~w~~eG~~~~t~~~~~m~l~~~~~~e~~~~~~hfVlW~p~~Fp-~~i~IsweF~p~~e------pGLamlfF~Aa 86 (218)
T PF09224_consen 16 DVADWVAEGPAKVTFQNGRMELSGELDPEQFGDAAHFVLWCPEVFP-DDIRISWEFTPLAE------PGLAMLFFAAA 86 (218)
T ss_dssp GGTT-EEEESEEEE-GGGSEEEE------ESSTT--EEEE-SS-B--SSEEEEEEEEEEE-------SEEEEEEEEEE
T ss_pred HhhhhhccCCeEEeecCCcEEEeeeeccccccccccEEEECcccCC-CCeEEEEEEEEcCC------CCEEEEEeecc
Confidence 58899999999886 688998732 223455689765554 46799999988753 68898886543
No 33
>PF02402 Lysis_col: Lysis protein; InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined []. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells []. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB []. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveal extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively []. Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides []. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase []. The mature ColE2 lysis protein is located in the cell envelope [].; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=27.52 E-value=26 Score=23.32 Aligned_cols=22 Identities=36% Similarity=0.378 Sum_probs=15.3
Q ss_pred ChhhhHHHHHHHHHHHHhhhccC
Q psy9727 1 MFKILFTVIALIWQVVAQEAQWN 23 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~ 23 (257)
|+||++..+.++..++ ..||-|
T Consensus 1 MkKi~~~~i~~~~~~L-~aCQaN 22 (46)
T PF02402_consen 1 MKKIIFIGIFLLTMLL-AACQAN 22 (46)
T ss_pred CcEEEEeHHHHHHHHH-HHhhhc
Confidence 8888888877777444 356655
No 34
>cd02182 GH16_Strep_laminarinase_like Streptomyces laminarinase-like, member of glycosyl hydrolase family 16. Proteins similar to Streptomyces sioyaensis beta-1,3-glucanase (laminarinase) present in Actinomycetales as well as Pezizomycotina. Laminarinases belong to glycosyl hydrolase family 16 and hydrolyze the glycosidic bond of the 1,3-beta-linked glucan, a major component of fungal and plant cell walls and the structural and storage polysaccharides (laminarin) of marine macro-algae. Members of the GH16 family have a conserved jelly roll fold with an active site channel.
Probab=26.65 E-value=1.4e+02 Score=26.48 Aligned_cols=60 Identities=18% Similarity=0.238 Sum_probs=35.2
Q ss_pred eEEEe-CCEEEECCCC----CCcEEEEEecCCCCCCc----EEEEEEEEEEccCcccCCCeE--EEEEeecC
Q psy9727 53 FALAS-SNYVRLTADL----QSRYGAIWNTVPVYMNN----WEVQITLKIHGKGKELFGDGM--AFWYVRDR 113 (257)
Q Consensus 53 ~a~v~-~~~IrLTp~~----~~~~G~iWs~~p~~~~~----w~ie~~Fri~g~~~~~~gdGl--AfW~t~~~ 113 (257)
++.+. ++.++|+... +-.+|+|-++....... +.+|..+++.... ...+.|+ |||+....
T Consensus 47 n~~v~~dG~L~I~a~~~~~~~ytSg~i~s~~~~~~~~~gg~~~~EaRik~p~~~-~~~~~G~wPAfWll~~~ 117 (259)
T cd02182 47 NVQLSGNGTLQITPLRDGSGKWTSGRIETTRTDFAAPPGGKLRVEASIRLGDVP-GSNQQGIWPAFWMLGDS 117 (259)
T ss_pred CEEEcCCCeEEEEEEecCCCCEEEEEEEECCccccccCCCcEEEEEEEECCCCc-ccCCCCcCeeeeccCCC
Confidence 34566 7777777632 24689999988765431 3444444443210 0125677 99998764
No 35
>PF08787 Alginate_lyase2: Alginate lyase; InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alginate. They cleave the glycosidic linkage of alginate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=26.58 E-value=2e+02 Score=25.34 Aligned_cols=60 Identities=18% Similarity=0.008 Sum_probs=38.5
Q ss_pred cCcEEEEEE------eCCeeEeecCCC-CCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCc
Q psy9727 189 NHPYLSAMV------NNGSLHYDHDMD-GTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDL 248 (257)
Q Consensus 189 ~~~~i~i~~------~~G~~~~d~~~d-g~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~ 248 (257)
..|.|+|.| .+|.+.+-+... ..+.....|.+.==.++-.+..+|.+..+.|+|.+++..
T Consensus 120 ~~pllkl~~~~~~~~~~G~l~~~~~~~~~~~~~~~~~~~~~i~LG~~F~y~I~v~~~~l~V~ing~~ 186 (236)
T PF08787_consen 120 NEPLLKLYYRKEPGNEKGSLYVYVRQSNPDGGDQEYTIYGGIPLGEWFSYEIEVSGGTLTVTINGEG 186 (236)
T ss_dssp EEEEEEEEEEEC--TETTEEEEEEESSTTTTSEEEEEEEEEEETT-EEEEEEEEETTEEEEEETTEE
T ss_pred CCCEEEEEEEEeeeccCCeEEEEEeccCCCCCcEEeeeEcceeCCCEEEEEEEEECCEEEEEEECCc
Confidence 458889999 789888887722 111122222222113456689999999999999998753
No 36
>PRK09993 C-lysozyme inhibitor; Provisional
Probab=23.98 E-value=20 Score=30.12 Aligned_cols=80 Identities=23% Similarity=0.223 Sum_probs=39.9
Q ss_pred ChhhhHHHHHHHHHHHHhhhccCcccceeeeeeeeCCccC--CCCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecC
Q psy9727 1 MFKILFTVIALIWQVVAQEAQWNTEDFLKRHHSMVKPYLT--SGLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTV 78 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Sf~~P~~~--~~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~ 78 (257)
|++.|++.++|.....+-.+++-+-..+..+--++..|+. .+..+|.|...|-+..-...|.+ ...+..+|.++-.+
T Consensus 6 ~~~~l~aa~lL~~s~~a~Aq~d~t~seLl~~p~yk~aw~~mvkg~~LP~WV~~gGtssP~~~V~~-~G~~YlVg~~CKPH 84 (153)
T PRK09993 6 MFKALTTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYSPAQTVTL-GDETYQVMSACKPH 84 (153)
T ss_pred HHHHHHHHHHHHHHHHHHhcCCccHHHHhcCHHHHHHHHHHHcCCCCcHHHHcCCCCCchhheee-CCceEEEecccccc
Confidence 6777777666655554333333222233333333333432 23479999987733322333433 22335678887544
Q ss_pred CCC
Q psy9727 79 PVY 81 (257)
Q Consensus 79 p~~ 81 (257)
-+.
T Consensus 85 DC~ 87 (153)
T PRK09993 85 DCG 87 (153)
T ss_pred CCC
Confidence 443
No 37
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=20.38 E-value=72 Score=25.16 Aligned_cols=18 Identities=33% Similarity=0.233 Sum_probs=14.3
Q ss_pred ChhhhHHHHHHHHHHHHh
Q psy9727 1 MFKILFTVIALIWQVVAQ 18 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~ 18 (257)
|+++|+.+++++..++..
T Consensus 1 mKK~li~li~~ivv~~~~ 18 (114)
T TIGR01655 1 MKKGLAILLALIVVITIV 18 (114)
T ss_pred CceehHHHHHHHHhHHHh
Confidence 899999998888766655
Done!