Query         psy9727
Match_columns 257
No_of_seqs    200 out of 562
Neff          6.5 
Searched_HMMs 46136
Date          Fri Aug 16 18:02:05 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9727.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9727hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd06902 lectin_ERGIC-53_ERGL E 100.0 1.7E-46 3.7E-51  331.4  21.7  181   29-256     4-194 (225)
  2 PF03388 Lectin_leg-like:  Legu 100.0 2.4E-46 5.3E-51  331.2  18.4  178   27-247     2-180 (229)
  3 cd06901 lectin_VIP36_VIPL VIP3 100.0 3.8E-45 8.2E-50  327.0  22.3  197   27-230     1-197 (248)
  4 cd06903 lectin_EMP46_EMP47 EMP 100.0 8.9E-45 1.9E-49  318.4  20.2  174   29-257     2-184 (215)
  5 KOG3839|consensus              100.0 7.5E-45 1.6E-49  329.2  16.7  218    5-229    31-248 (351)
  6 KOG3838|consensus              100.0 8.2E-45 1.8E-49  334.1  15.7  202    6-254    12-215 (497)
  7 cd07308 lectin_leg-like legume 100.0 1.2E-37 2.6E-42  272.9  21.4  176   29-252     2-184 (218)
  8 cd01951 lectin_L-type legume l  99.9   5E-20 1.1E-24  161.5  18.7   96   44-140    12-116 (223)
  9 cd06899 lectin_legume_LecRK_Ar  99.7 3.9E-17 8.4E-22  145.2  14.8  117   45-161    12-153 (236)
 10 PF00139 Lectin_legB:  Legume l  99.6 4.6E-14 9.9E-19  125.3  15.4   95   45-139    12-129 (236)
 11 KOG3839|consensus               99.5 1.6E-13 3.5E-18  125.5  10.0  182   44-256    35-237 (351)
 12 cd06901 lectin_VIP36_VIPL VIP3  99.4 2.3E-12   5E-17  115.6  11.9   87  167-253    98-190 (248)
 13 KOG3838|consensus               98.7 3.3E-09 7.3E-14   99.2   1.7   57  148-204   152-212 (497)
 14 cd06902 lectin_ERGIC-53_ERGL E  98.4 4.9E-07 1.1E-11   80.2   7.0   74  150-229   122-197 (225)
 15 cd06903 lectin_EMP46_EMP47 EMP  98.0 1.2E-05 2.6E-10   70.9   6.5   41  161-207   130-172 (215)
 16 cd07308 lectin_leg-like legume  97.5 0.00021 4.5E-09   62.6   6.6   44  172-221   141-184 (218)
 17 cd06900 lectin_VcfQ VcfQ bacte  97.5  0.0012 2.5E-08   59.2  10.5   93   46-139    14-126 (255)
 18 PF03388 Lectin_leg-like:  Legu  95.2   0.068 1.5E-06   47.4   7.0   33  168-200   101-133 (229)
 19 PF02210 Laminin_G_2:  Laminin   65.3      41 0.00088   25.3   7.4   57  191-248    20-76  (128)
 20 PF06439 DUF1080:  Domain of Un  52.6      70  0.0015   26.2   7.2   62   42-110     8-76  (185)
 21 smart00282 LamG Laminin G doma  46.0 1.4E+02  0.0029   23.1   7.6   58  190-248    27-84  (135)
 22 PF00054 Laminin_G_1:  Laminin   44.5 1.1E+02  0.0025   24.0   7.0   57  191-249    20-76  (131)
 23 cd06899 lectin_legume_LecRK_Ar  43.5 2.3E+02   0.005   24.9   9.8   24  224-247   161-186 (236)
 24 PF08139 LPAM_1:  Prokaryotic m  38.6      23  0.0005   20.7   1.4   14    1-14      7-20  (25)
 25 cd00110 LamG Laminin G domain;  35.9   2E+02  0.0044   22.2   7.4   56  190-247    46-101 (151)
 26 PRK00022 lolB outer membrane l  35.4 1.6E+02  0.0035   25.1   7.0   23  224-246   176-198 (202)
 27 PF02248 Como_SCP:  Small coat   34.0      35 0.00077   29.4   2.5   24   83-111   113-136 (182)
 28 PLN03161 Probable xyloglucan e  33.0 1.4E+02  0.0031   27.6   6.5   18    1-18      1-18  (291)
 29 PF04294 VanW:  VanW like prote  32.7      67  0.0014   26.2   3.9   31  214-244    99-130 (132)
 30 PF11153 DUF2931:  Protein of u  32.5      45 0.00098   29.0   3.1   11    1-11      1-11  (216)
 31 PF15524 Toxin_45:  Putative to  29.6      60  0.0013   24.6   2.8   29   70-98     43-71  (94)
 32 PF09224 DUF1961:  Domain of un  27.8   4E+02  0.0088   23.7   8.1   63   44-113    16-86  (218)
 33 PF02402 Lysis_col:  Lysis prot  27.5      26 0.00057   23.3   0.5   22    1-23      1-22  (46)
 34 cd02182 GH16_Strep_laminarinas  26.7 1.4E+02  0.0031   26.5   5.4   60   53-113    47-117 (259)
 35 PF08787 Alginate_lyase2:  Algi  26.6   2E+02  0.0044   25.3   6.2   60  189-248   120-186 (236)
 36 PRK09993 C-lysozyme inhibitor;  24.0      20 0.00043   30.1  -0.7   80    1-81      6-87  (153)
 37 TIGR01655 yxeA_fam conserved h  20.4      72  0.0016   25.2   1.9   18    1-18      1-18  (114)

No 1  
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC).  ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain.  ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) may act as a regulator of ERGIC-53.  In mammals, ERGIC-53 forms a complex with MCFD2 (multiple coagulation factor deficiency 2), which then recruits blood coagulation factors V and VIII.  Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=100.00  E-value=1.7e-46  Score=331.40  Aligned_cols=181  Identities=35%  Similarity=0.667  Sum_probs=156.7

Q ss_pred             eeeeeeeCCccCC-CCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEE
Q psy9727          29 KRHHSMVKPYLTS-GLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAF  107 (257)
Q Consensus        29 ~~~~Sf~~P~~~~-~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAf  107 (257)
                      .-+|||.+||+.. +..+++|++.|+|++.+++|||||+.++++|+||++.|++.++|||||+|||+|.++ .+||||||
T Consensus         4 ~~~~s~~~P~l~~~~~~i~~W~~~G~t~~~~~~IrLTp~~~~~~G~iw~~~~~~~~~w~ie~~Fri~g~~~-~~gdG~a~   82 (225)
T cd06902           4 EYKYSFKGPHLAQKDGTVPFWSHGGDAIASLEQVRLTPSLRSKKGSVWTKNPFSFENWEVEVTFRVTGRGR-IGADGLAI   82 (225)
T ss_pred             eeeecccCcccccCCCCCCceEecccEEecCCEEEECCCCCCCEEEEeeCCCcCCCCEEEEEEEEEecCCC-CCCCEEEE
Confidence            3469999999986 468999999999999999999999999999999999999999999999999999875 67899999


Q ss_pred             EEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCC
Q psy9727         108 WYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHN  187 (257)
Q Consensus       108 W~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~  187 (257)
                      ||++++.+.|++||.+++|+||||+|||++|....   .                                         
T Consensus        83 W~t~~~~~~G~~~G~~~~f~Gl~I~~Dt~~n~~~~---~-----------------------------------------  118 (225)
T cd06902          83 WYTKERGEEGPVFGSSDKWNGVGIFFDSFDNDGKK---N-----------------------------------------  118 (225)
T ss_pred             EEECCCCCCCCccCCCCcccEEEEEEECCCCCCCC---C-----------------------------------------
Confidence            99999988999999999999999999999886421   2                                         


Q ss_pred             ccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeC---------CcccCCCccc
Q psy9727         188 HNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVN---------DLLSKGPFIN  256 (257)
Q Consensus       188 ~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~---------~~~~~~~~~~  256 (257)
                        .|+|+++++||+.+|+++.|+..+++++|.++|||+.+|+++||+|.+++|+|++|+         ..||+-.-|.
T Consensus       119 --~p~i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~rn~~~p~~~rI~Y~~~~l~V~~d~~~~~~~~~~~~Cf~~~~v~  194 (225)
T cd06902         119 --NPAILVVGNDGTKSYDHQNDGLTQALGSCLRDFRNKPYPVRAKITYYQNVLTVSINNGFTPNKDDYELCTRVENMV  194 (225)
T ss_pred             --CcEEEEEECCCCeeccccCCCcccccceEEEeccCCCCCeEEEEEEECCeEEEEEeCCcCCCCCcccEEEecCCee
Confidence              345555555555566666666667899999999999999999999999999999996         6687655544


No 2  
>PF03388 Lectin_leg-like:  Legume-like lectin family;  InterPro: IPR005052  Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences.  ERGIC-53 is a 53 kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold.; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=100.00  E-value=2.4e-46  Score=331.16  Aligned_cols=178  Identities=46%  Similarity=0.871  Sum_probs=139.7

Q ss_pred             ceeeeeeeeCCccCCC-CCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeE
Q psy9727          27 FLKRHHSMVKPYLTSG-LSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGM  105 (257)
Q Consensus        27 ~~~~~~Sf~~P~~~~~-~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGl  105 (257)
                      .++++|||.+||+.++ ..+++|++.|+|++.+++|||||+.++++|+||++.|++.++|+++++|||+|.++..+||||
T Consensus         2 ~~~~~~S~~~P~~~~~~~~i~~W~~~G~t~i~~~~IrLTp~~~~~~G~iws~~~~~~~~w~i~~~Fri~g~~~~~~g~G~   81 (229)
T PF03388_consen    2 RFKESHSFSPPFLDNGDNEIPNWDIGGSTVITDNFIRLTPDRQSQSGSIWSRKPIPFDNWEIEFTFRISGQEKGLGGDGM   81 (229)
T ss_dssp             EEEGGGEEESSSCSCTTSCBTTEEEEET-EEESSEEEEE-SSTTEEEEEEESS-BEESEEEEEEEEEEESS-SSS-S-EE
T ss_pred             cccccCCcCCCcCcCCCCccCCEEECCeEEecCCEEEECCCcccCEEEEEEcCCCCccCEEEEEEEEEeccccCcCCCeE
Confidence            4678899999999876 578999999999999999999999999999999999999999999999999998666789999


Q ss_pred             EEEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCC
Q psy9727         106 AFWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGE  185 (257)
Q Consensus       106 AfW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~  185 (257)
                      |||||+++.+.|++||.+++|+||||+||||+|++.++.+.+|.|++++|||++.|                        
T Consensus        82 a~W~t~~~~~~G~~fG~~~~f~Gl~i~idt~~N~~~~~~~~~p~i~~~~NDGt~~~------------------------  137 (229)
T PF03388_consen   82 AFWYTKDPGSDGPVFGGPDKFDGLGIFIDTYDNDEGGHKRGFPYISAMLNDGTKSY------------------------  137 (229)
T ss_dssp             EEEEESSSSSSCSBTTB-SS-EEEEEEEEES-TTCTTCTSTSSEEEEEEEESSS--------------------------
T ss_pred             EEEEEcCccccccccCCCcccceEEEEEEcccCCCcccccccceEEEEecCCCccc------------------------
Confidence            99999999999999999999999999999999986443334566666666665555                        


Q ss_pred             CCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCC
Q psy9727         186 HNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVND  247 (257)
Q Consensus       186 ~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~  247 (257)
                                         +++.||..+++++|.+.|||+++|+++||+|.++.|+|.+|.+
T Consensus       138 -------------------~~~~dg~~~~~~~C~~~~rn~~~p~~~ri~Y~~~~l~v~id~~  180 (229)
T PF03388_consen  138 -------------------DHDNDGKDQSLGSCSADYRNSDVPTRIRISYSKNTLTVSIDSN  180 (229)
T ss_dssp             --------------------GGGTTTTT-SEEEE---BTESSEEEEEEEEETTEEEEEEETS
T ss_pred             -------------------cccccCcccccccceeccCcCCCCEEEEEEEECCeEEEEEecc
Confidence                               4555555567888888888888888888888888888888855


No 3  
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high-mannose-type glycoproteins between the Golgi and the endoplasmic reticulum (ER).  Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type 1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=100.00  E-value=3.8e-45  Score=326.95  Aligned_cols=197  Identities=49%  Similarity=0.951  Sum_probs=176.5

Q ss_pred             ceeeeeeeeCCccCCCCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEE
Q psy9727          27 FLKRHHSMVKPYLTSGLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMA  106 (257)
Q Consensus        27 ~~~~~~Sf~~P~~~~~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlA  106 (257)
                      +++|+|||.+||+..+..+++|.+.|+|++.+++|||||+.++++|++|++.|+++++|+++|+|||++.++..+|||||
T Consensus         1 ~~~~~~sL~~Py~~~~~~i~~w~~~G~a~v~~~~IrLTp~~~~~~G~~w~~~p~~~~~F~~~f~F~I~~~~~~~~GdGlA   80 (248)
T cd06901           1 YLKREHSLIKPYQGVGSSMPLWDFLGSTMVTSQYIRLTPDHQSKQGSIWNRVPCYLRDWEMHVHFKVHGSGKNLFGDGFA   80 (248)
T ss_pred             CccccccccCCeecCCCCCCCEEEcceEEEcCCeEEECCCCCCCEEEEeccCCccCCCEEEEEEEEEeCCCCCCCCCEEE
Confidence            57899999999999887799999999999999999999999999999999999999999999999999987667899999


Q ss_pred             EEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCC
Q psy9727         107 FWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEH  186 (257)
Q Consensus       107 fW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~  186 (257)
                      |||++++++.|++||.+++|+||||+||||+|..+.+.+.+|+|++++|||++.|+.++||..+++++|...+||..++ 
T Consensus        81 fw~t~~~~~~G~~fG~~~~f~Gl~I~~Dt~~n~~~~~~~~~P~i~~~~NDGt~~yd~~~Dg~~~~~~~C~~~~rn~~~~-  159 (248)
T cd06901          81 IWYTKERMQPGPVFGSKDNFHGLAIFFDTYSNQNGEHEHVHPYISAMVNNGSLSYDHDRDGTHTELAGCSAPFRNKDHD-  159 (248)
T ss_pred             EEEEcCCCccCcccccCCCCceEEEEEECCCCCCCcccCCCceEEEEEcCCCeeecccCCCchhhcCceeeeccCCCCC-
Confidence            9999999999999999999999999999999986555567899999999999999999999999999999999987765 


Q ss_pred             CccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccE
Q psy9727         187 NHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTH  230 (257)
Q Consensus       187 ~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~  230 (257)
                          +.++|.|.++.+.++.+.++.+ +|..|.. .++...|..
T Consensus       160 ----t~~rI~Y~~~~l~v~vd~~~~~-~w~~Cf~-~~~v~LP~~  197 (248)
T cd06901         160 ----TFVAIRYSKGRLTVMTDIDGKN-EWKECFD-VTGVRLPTG  197 (248)
T ss_pred             ----eEEEEEEECCeEEEEEecCCCC-ceeeeEE-eCCeecCCC
Confidence                6788888888888888888776 7999944 365556653


No 4  
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type 1 transmembrane proteins that cycle between the endoplasmic reticulum and the Golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins.  EMP47 is a receptor for EMP46 and is responsible for the selective transport of EMP46 through hetero-oligomerization of the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=100.00  E-value=8.9e-45  Score=318.40  Aligned_cols=174  Identities=23%  Similarity=0.425  Sum_probs=149.2

Q ss_pred             eeeeeeeCCccCC----CCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCC-cEEEEEEEEEEccCcccCCC
Q psy9727          29 KRHHSMVKPYLTS----GLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMN-NWEVQITLKIHGKGKELFGD  103 (257)
Q Consensus        29 ~~~~Sf~~P~~~~----~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~-~w~ie~~Fri~g~~~~~~gd  103 (257)
                      ..++||  |.+..    +..+++|++.|+|++.+++|||||+ ++++|+||++.|++.+ +||+|++|||+|+++ .+||
T Consensus         2 ~~~~s~--p~l~~~~~~~~~i~~W~~~G~t~v~~~~IrLTp~-~s~~G~iWs~~pl~~~~~w~ie~~Fri~G~~~-~~gd   77 (215)
T cd06903           2 NKDLSL--PNLLKISPNGKLIPNWQTSGNPKLESGRIILTPP-GNQRGSLWLKKPLSLKDEWTIEWTFRSTGPEG-RSGG   77 (215)
T ss_pred             cccccC--CchhccccCCCCCCCeEEcCcEEeeCCeEEECCC-CCceEeEeeCCcCCCCCCEEEEEEEEecccCC-cCCC
Confidence            457888  54333    2589999999999999999999999 9999999999999999 999999999999875 6899


Q ss_pred             eEEEEEeecCCCCC--ccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccC
Q psy9727         104 GMAFWYVRDRMEGG--PVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSN  181 (257)
Q Consensus       104 GlAfW~t~~~~~~G--~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn  181 (257)
                      |||||||+++.+.+  ++||++++|+||||+|||++|.       .|.|++|+|||                        
T Consensus        78 Gla~W~t~~~~~~~g~~~fG~~~~f~Gl~I~~Dt~~n~-------~p~i~~~~NDG------------------------  126 (215)
T cd06903          78 GLNFWLVKDGNADVGTSSIYGPSKFDGLQLLIDNNGGS-------GGSLRGFLNDG------------------------  126 (215)
T ss_pred             EEEEEEECCCcccCCccccCCCCCCcEEEEEEECCCCC-------CceEEEEECCC------------------------
Confidence            99999999987654  8999999999999999999874       14455555555                        


Q ss_pred             CCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeC--CeEEEEEeCCcccCCCcccC
Q psy9727         182 HNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYED--ENLTVSGVNDLLSKGPFINL  257 (257)
Q Consensus       182 ~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~--~~l~v~~d~~~~~~~~~~~~  257 (257)
                                         +..|+++.| ..+++++|.++|||+.+|+++||+|.+  +.|+|+||+++||+-.-|+|
T Consensus       127 -------------------t~~yd~~~d-~~~~~g~C~~~~rn~~~p~~iri~Y~~~~~~l~v~vd~~~Cf~~~~v~l  184 (215)
T cd06903         127 -------------------SKDYKNEDV-DSLAFGSCLFAYQDSGVPSTIRLSYDALNSLFKVQVDNRLCFQTDKVQL  184 (215)
T ss_pred             -------------------CeeccccCC-cccccceeeEeccCCCCCEEEEEEEECCCCEEEEEECCCEEEecCCeec
Confidence                               455566666 567999999999999999999999999  88999999999998776654


No 5  
>KOG3839|consensus
Probab=100.00  E-value=7.5e-45  Score=329.21  Aligned_cols=218  Identities=43%  Similarity=0.788  Sum_probs=200.6

Q ss_pred             hHHHHHHHHHHHHhhhccCcccceeeeeeeeCCccCCCCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCc
Q psy9727           5 LFTVIALIWQVVAQEAQWNTEDFLKRHHSMVKPYLTSGLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNN   84 (257)
Q Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~Sf~~P~~~~~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~   84 (257)
                      +++++.+.+.++......+...+++||||+.+||+..+...++|++.|++++++++|||||+.|++.|+||++.|+..++
T Consensus        31 ~~~ll~~~~~v~g~~~~~~g~~ylkrehSL~~Py~g~g~r~~~W~~~Gstvv~~~~irLT~d~qsk~GAv~n~~Pv~s~~  110 (351)
T KOG3839|consen   31 LSNLLILPGFVLGSQVEGSGYEYLKREHSLSKPYQGVGGRQPNWNLSGSTVVTSNYIRLTPDEQSKSGAVWNRQPVFSRD  110 (351)
T ss_pred             hhHhhhhhheeecccccCcchhheeehhhcccccccccccccCccccccEEEEeeeeeccccccccccccccCCCccccc
Confidence            45566666666666666555689999999999999999899999999999999999999999999999999999999999


Q ss_pred             EEEEEEEEEEccCcccCCCeEEEEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCC
Q psy9727          85 WEVQITLKIHGKGKELFGDGMAFWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHD  164 (257)
Q Consensus        85 w~ie~~Fri~g~~~~~~gdGlAfW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~  164 (257)
                      ||+.+.|+|+|++++++|||||+||++++.+.||+||..++|+||+|++|||.|+.+.+.|++|.+++|+|+|+.+||++
T Consensus       111 wev~v~fkv~~~s~~lfgdG~Aiw~t~Er~q~GPvFG~~dkF~GL~vfidtY~n~~g~~~~v~P~isvmv~~gs~sYD~~  190 (351)
T KOG3839|consen  111 WEVLVHFKVHGQSKNLFGDGMAIWYTKERAQPGPVFGSKDKFTGLAVFIDTYGNHNGPHEHVFPYISVMVNIGSLSYDHS  190 (351)
T ss_pred             eeEEEEEEEecCCCcccccceEEEeecccccCCCCCCCcccceeEEEEEeccCCcCCCcccceeeEEEEeccCCcccccC
Confidence            99999999999998899999999999999999999999999999999999999998888899999999999999999999


Q ss_pred             CCCcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCcc
Q psy9727         165 MDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDT  229 (257)
Q Consensus       165 ~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~  229 (257)
                      .||+++++++|.+.+||.++.     +.+.++|+..++++..|.++.+ +|..|.- +++-..|+
T Consensus       191 ~Dg~~tel~gc~a~~rn~~~d-----t~~~iry~~~~l~~~~dl~~~~-~~~~c~~-~n~v~lp~  248 (351)
T KOG3839|consen  191 KDGTHTELAGCTANFRNLPHD-----TLVVIRYEKKTLSISIDLEGPN-EWIDCFS-LNNVELPL  248 (351)
T ss_pred             CCCCccccccceeeeccCCCc-----ceeEEEecCCceEEEEecCCCc-eeeeeee-ecceeccc
Confidence            999999999999999999884     8899999999999999999997 9999974 55555553


No 6  
>KOG3838|consensus
Probab=100.00  E-value=8.2e-45  Score=334.06  Aligned_cols=202  Identities=33%  Similarity=0.629  Sum_probs=171.5

Q ss_pred             HHHHHHHHHHHHhhhccC-cccceeeeeeeeCCccCCC-CCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCC
Q psy9727           6 FTVIALIWQVVAQEAQWN-TEDFLKRHHSMVKPYLTSG-LSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMN   83 (257)
Q Consensus         6 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~Sf~~P~~~~~-~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~   83 (257)
                      +++|++|.+.+...+.-+ +.....-.|||++|++... .++|+|...|+|+...+.|||+|+.++++|++|+|..++++
T Consensus        12 ~~lLllLa~~~~~~~~~~~~~rrFEYK~SFk~P~Laq~dgtiPFW~~~GdAIas~eqvRlaPSmrsrkGavWtka~~~fe   91 (497)
T KOG3838|consen   12 CALLLLLAPHVPETGCGTPPHRRFEYKYSFKGPRLAQPDGTIPFWSHHGDAIASSEQVRLAPSMRSRKGAVWTKASVPFE   91 (497)
T ss_pred             HHHHHHccCcCcccccCCCccceeeeeecccCCccccCCCCcceeeecCcccccccceeeccccccccCceeecccCCcc
Confidence            455555555555533332 3334455699999999874 68999999999999999999999999999999999999999


Q ss_pred             cEEEEEEEEEEccCcccCCCeEEEEEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecC
Q psy9727          84 NWEVQITLKIHGKGKELFGDGMAFWYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDH  163 (257)
Q Consensus        84 ~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~  163 (257)
                      +|+||+.|||+|.++ .++||||||||...+..|++||+.+.|+|++|+||+++|+..                      
T Consensus        92 ~weVev~~rVtGrGR-iGAdGlaiWYt~~~G~~GpVfGg~d~WnGigiffDSfdnD~q----------------------  148 (497)
T KOG3838|consen   92 NWEVEVQFRVTGRGR-IGADGLAIWYTRGRGHVGPVFGGLDSWNGIGIFFDSFDNDGQ----------------------  148 (497)
T ss_pred             cceEEEEEEeccccc-ccCCceEEEEecCCCcccccccccccccceEEEeecccccCC----------------------
Confidence            999999999999998 789999999999999999999999999999999999998743                      


Q ss_pred             CCCCcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEE
Q psy9727         164 DMDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVS  243 (257)
Q Consensus       164 ~~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~  243 (257)
                                              .+.|+|+++.+||+.+|||..||..|.+++|...|||+++|+++||+|.+++|+|+
T Consensus       149 ------------------------knnP~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~  204 (497)
T KOG3838|consen  149 ------------------------KNNPAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVM  204 (497)
T ss_pred             ------------------------cCCccEEEEecCCcccccCCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEE
Confidence                                    22455566666666667777777788999999999999999999999999999999


Q ss_pred             EeCCcccCCCc
Q psy9727         244 GVNDLLSKGPF  254 (257)
Q Consensus       244 ~d~~~~~~~~~  254 (257)
                      |+|++.-.+.|
T Consensus       205 innGmtp~d~y  215 (497)
T KOG3838|consen  205 INNGMTPSDDY  215 (497)
T ss_pred             EcCCCCCCCCc
Confidence            99998776543


No 7  
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins.  This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex.  These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein.  Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=100.00  E-value=1.2e-37  Score=272.86  Aligned_cols=176  Identities=36%  Similarity=0.682  Sum_probs=144.3

Q ss_pred             eeeeeeeCCccCC-CCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEE
Q psy9727          29 KRHHSMVKPYLTS-GLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAF  107 (257)
Q Consensus        29 ~~~~Sf~~P~~~~-~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAf  107 (257)
                      +++|||.+||+.+ +..+++|.+.|+|.+.+++|||||+.++++|++|++.|+..++|+++|+|+|++.. ..+||||||
T Consensus         2 ~~~~Sl~~P~~~~~~~~~~~w~~~G~a~~~~~~i~LT~~~~~~~G~~~~~~pi~~~~F~~~f~F~i~~~~-~~~gdG~af   80 (218)
T cd07308           2 ISEHSLSPPFLDDNDGEIGNWTVGGSTVITKNYIRLTPDVPSQSGSLWSRVPIPAKDFEIEVEFSIHGGS-GLGGDGFAF   80 (218)
T ss_pred             ccccccCCCcccCCCCccCCeEEcCCeEEeCCEEEeCCCCCCCEeEEEeCCCccCCCEEEEEEEEEeCCC-CCCCCEEEE
Confidence            5689999999987 46889999999999999999999999999999999999999999999999999874 357999999


Q ss_pred             EEeecCCCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCC
Q psy9727         108 WYVRDRMEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHN  187 (257)
Q Consensus       108 W~t~~~~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~  187 (257)
                      ||++++...|++||.++.|+||||+|||++|..    +..|+|++++|||+++|+.++|+                    
T Consensus        81 ~~~~~~~~~g~~~G~~~~~~Glai~fdt~~n~~----~~~p~i~~~~Ndg~~~~~~~~d~--------------------  136 (218)
T cd07308          81 WYTEEPGSDGPLFGGPDKFKGLAIFFDTYDNDG----KGFPSISVFLNDGTKSYDYETDG--------------------  136 (218)
T ss_pred             EEECCCCCCCcccccCCCCCEEEEEEEcCCCCC----CCCCeEEEEEeCCCceecccCCC--------------------
Confidence            999998899999999999999999999999862    13466777777766666554444                    


Q ss_pred             ccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEE------eCCcccCC
Q psy9727         188 HNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSG------VNDLLSKG  252 (257)
Q Consensus       188 ~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~------d~~~~~~~  252 (257)
                                             ..+++++|..++|+.++|+++||+|.++.|+|++      +...|++-
T Consensus       137 -----------------------~~~~~~~c~~~~~~~~~~~~~~I~y~~~~l~v~i~~~~~~~~~~c~~~  184 (218)
T cd07308         137 -----------------------EKLELASCSLKFRNSNAPTTLRISYLNNTLKVDITYSEGNNWKECFTV  184 (218)
T ss_pred             -----------------------ccccccceeEecccCCCCeEEEEEEECCEEEEEEeCCCCCCccEEEEc
Confidence                                   4446666777777777777777777766677777      44555543


No 8  
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate-binding proteins that generally display no enzymatic activity toward the sugars they bind.  This family includes arcelin, concanavalin A, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type 1 transmembrane proteins, and an alpha-amylase inhibitor.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers.  Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=99.85  E-value=5e-20  Score=161.55  Aligned_cols=96  Identities=27%  Similarity=0.479  Sum_probs=84.7

Q ss_pred             CCCCEEEeeeEEEe--CCEEEECCCCCCcEEEEEecCCCCC-CcEEEEEEEEEEccCcccCCCeEEEEEeecCCC----C
Q psy9727          44 SIPYWDVHGFALAS--SNYVRLTADLQSRYGAIWNTVPVYM-NNWEVQITLKIHGKGKELFGDGMAFWYVRDRME----G  116 (257)
Q Consensus        44 ~i~~W~~~G~a~v~--~~~IrLTp~~~~~~G~iWs~~p~~~-~~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~~~----~  116 (257)
                      ..++|.+.|+|.+.  ++.|||||+.+++.|++|+++|+++ .+|+++|+|+|.+.. ..+|||||||+++++..    .
T Consensus        12 ~~~~~~~~G~A~~~~~~~~l~Lt~~~~~~~G~~~~~~~i~~~~~F~~~F~f~i~~~~-~~~gdG~aF~l~~~~~~~~~~~   90 (223)
T cd01951          12 NQSNWQLNGSATLTTDSGVLRLTPDTGNQAGSAWYKTPIDLSKDFTTTFKFYLGTKG-TNGADGIAFVLQNDPAGALGGG   90 (223)
T ss_pred             ChhhcEEcccEEecCCCCEEEECCCCCCcEEEEEECCcEeccCCEEEEEEEEEeCCC-CCCCCcEEEEEecCCCCccccC
Confidence            35789999999998  8999999999999999999999999 899999999999875 35799999999987532    2


Q ss_pred             C--ccCCCCCCCcEEEEEEEccCCCC
Q psy9727         117 G--PVFGNKDFFSGLGVIIDTYSNHN  140 (257)
Q Consensus       117 G--~~~G~~~~f~GlaV~~DT~~n~~  140 (257)
                      |  ..+|......+|||+||||.|.+
T Consensus        91 g~~~~lG~~~~~~~~aVefDT~~N~~  116 (223)
T cd01951          91 GGGGGLGYGGIGNSVAVEFDTYKNDD  116 (223)
T ss_pred             CCCCccCccccCCeEEEEEeccccCC
Confidence            2  67888889999999999999863


No 9  
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor.  Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids.  Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family.  Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins.  PHA agglutinates most mammalian red blood cell types by bindin
Probab=99.74  E-value=3.9e-17  Score=145.19  Aligned_cols=117  Identities=22%  Similarity=0.284  Sum_probs=91.5

Q ss_pred             CCCEEEeeeEEEe-CCEEEECCCC--CCcEEEEEecCCCCC--------CcEEEEEEEEEEccCcccCCCeEEEEEeecC
Q psy9727          45 IPYWDVHGFALAS-SNYVRLTADL--QSRYGAIWNTVPVYM--------NNWEVQITLKIHGKGKELFGDGMAFWYVRDR  113 (257)
Q Consensus        45 i~~W~~~G~a~v~-~~~IrLTp~~--~~~~G~iWs~~p~~~--------~~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~  113 (257)
                      ..+|.+.|+|.+. ++.|+|||+.  +++.|++|+++|+++        .+|+++|+|+|++.....+|||||||+++++
T Consensus        12 ~~~l~l~G~A~~~~~~~i~LT~~~~~~~~~G~v~y~~pi~l~~~~~~~~~sFst~F~F~i~~~~~~~~gdGlAF~i~~~~   91 (236)
T cd06899          12 QSNLTLQGDATISSNGALQLTNDTSPASSVGRALYSKPVRLWDSTTGKVASFSTSFSFSITPPNPSLGGDGLAFFLAPTD   91 (236)
T ss_pred             CCCEEEecceEcCCCCeEEecCCCCCCcceEEEEeCCCEEeecCCCCCceeEEEEEEEEEEcCCCCCCCCeEEEEEecCC
Confidence            5689999999998 8999999998  899999999999975        3799999999998754467999999999875


Q ss_pred             C----CCCccCCCCC-------CCcEEEEEEEccCCCC---CCCCCCCCceEEEEeccceee
Q psy9727         114 M----EGGPVFGNKD-------FFSGLGVIIDTYSNHN---GEHNHNHPYLSAMVNNGSLHY  161 (257)
Q Consensus       114 ~----~~G~~~G~~~-------~f~GlaV~~DT~~n~~---~~~~~~~p~i~~~~NdGt~~y  161 (257)
                      .    ..|+.+|...       ...+|||+|||+.|..   +..+|..++++.+.+..+..|
T Consensus        92 ~~~~~~~G~~lG~~~~~~~~~~~~~~vAVEFDT~~n~~~~D~~~nHigIdvn~~~S~~~~~~  153 (236)
T cd06899          92 SLPPASSGGYLGLFNSSNNGNSSNHIVAVEFDTFQNPEFGDPDDNHVGIDVNSLVSVKAGYW  153 (236)
T ss_pred             CCCCCCCcceeeeecCCCCCCcccceEEEEeecccCcccCCCCCCeEEEEcCCcccceeecc
Confidence            4    3567777542       5678999999999864   233455555555444433333


No 10 
>PF00139 Lectin_legB:  Legume lectin domain;  InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=99.58  E-value=4.6e-14  Score=125.25  Aligned_cols=95  Identities=24%  Similarity=0.370  Sum_probs=79.0

Q ss_pred             CCCEEEeeeEEE-eCCEEEECCCCC-CcEEEEEecCCCCC--------CcEEEEEEEEEEcc--CcccCCCeEEEEEeec
Q psy9727          45 IPYWDVHGFALA-SSNYVRLTADLQ-SRYGAIWNTVPVYM--------NNWEVQITLKIHGK--GKELFGDGMAFWYVRD  112 (257)
Q Consensus        45 i~~W~~~G~a~v-~~~~IrLTp~~~-~~~G~iWs~~p~~~--------~~w~ie~~Fri~g~--~~~~~gdGlAfW~t~~  112 (257)
                      ..+|.+.|+|.+ .++.|+||++.. .+.|++|+++|+.+        .+|+++|+|+|...  ....+||||||.++++
T Consensus        12 ~~~~~l~G~A~~~~~~~l~LT~~~~~~~~G~~~y~~pi~l~d~~~~~~~sF~t~F~f~i~~~~~~~~~~~dGlAFvi~~~   91 (236)
T PF00139_consen   12 SSNLTLNGDASISSNGSLQLTPDSTNNQAGRAWYNNPIQLWDSTTGNVASFSTSFSFSITNGPGSSNNGGDGLAFVIQPD   91 (236)
T ss_dssp             GTTEEEEETEEEETTSEEESSTBETSSEEEEEEESSEEESBETTTTEBEEEEEEEEEEEEESSSSSSS-BEEEEEEEEET
T ss_pred             CCceEEEeeEEeccCCeEEcCCCCCCCcEEEEEECCcEEEeCCCCcceeeeeeEEEEEEeccCCCCccCCCceEEEEecC
Confidence            489999999998 789999999998 99999999999988        68999999999643  2245789999999998


Q ss_pred             CC----CCCccCCCCC-------CCcEEEEEEEccCCC
Q psy9727         113 RM----EGGPVFGNKD-------FFSGLGVIIDTYSNH  139 (257)
Q Consensus       113 ~~----~~G~~~G~~~-------~f~GlaV~~DT~~n~  139 (257)
                      +.    ..|..+|...       ...-|||+|||+.|.
T Consensus        92 ~~~~~~~~g~~lG~~~~~~~~~~~~~~vAVEFDT~~N~  129 (236)
T PF00139_consen   92 PNLPGGSSGGYLGLFNSSTDGNGINNSVAVEFDTYKNP  129 (236)
T ss_dssp             TSSTTTSSGGGTTTSSSSSTTGGGGCEEEEEEETSTCG
T ss_pred             cccccCCCCCccCccccccCCCccCcEEEEEEeeeecc
Confidence            63    2556677533       467899999999975


No 11 
>KOG3839|consensus
Probab=99.48  E-value=1.6e-13  Score=125.48  Aligned_cols=182  Identities=37%  Similarity=0.539  Sum_probs=129.7

Q ss_pred             CCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecCCCCCCcEEEEEEEEEEccCc-------ccCCCeEEEEEeecC---
Q psy9727          44 SIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTVPVYMNNWEVQITLKIHGKGK-------ELFGDGMAFWYVRDR---  113 (257)
Q Consensus        44 ~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~-------~~~gdGlAfW~t~~~---  113 (257)
                      .+-.|.+.|+.....++..|+... +..+..|+.. .....|++--++.|+....       +..|   |+|-..+-   
T Consensus        35 l~~~~~v~g~~~~~~g~~ylkreh-SL~~Py~g~g-~r~~~W~~~Gstvv~~~~irLT~d~qsk~G---Av~n~~Pv~s~  109 (351)
T KOG3839|consen   35 LILPGFVLGSQVEGSGYEYLKREH-SLSKPYQGVG-GRQPNWNLSGSTVVTSNYIRLTPDEQSKSG---AVWNRQPVFSR  109 (351)
T ss_pred             hhhhheeecccccCcchhheeehh-hccccccccc-ccccCccccccEEEEeeeeecccccccccc---ccccCCCcccc
Confidence            455788899988877777776543 3566667666 6667898888887775421       1112   55543321   


Q ss_pred             -----------CCCCccCCCCCCCcEEEEEEEccCCCCCCCCCCCCceEEEEeccceeecCCCCCcccceeeEeeeccCC
Q psy9727         114 -----------MEGGPVFGNKDFFSGLGVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNH  182 (257)
Q Consensus       114 -----------~~~G~~~G~~~~f~GlaV~~DT~~n~~~~~~~~~p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~  182 (257)
                                 .+.+.+||     +|+||+.---....+      |..      |       .-...+.|+...++|.|.
T Consensus       110 ~wev~v~fkv~~~s~~lfg-----dG~Aiw~t~Er~q~G------PvF------G-------~~dkF~GL~vfidtY~n~  165 (351)
T KOG3839|consen  110 DWEVLVHFKVHGQSKNLFG-----DGMAIWYTKERAQPG------PVF------G-------SKDKFTGLAVFIDTYGNH  165 (351)
T ss_pred             ceeEEEEEEEecCCCcccc-----cceEEEeecccccCC------CCC------C-------CcccceeEEEEEeccCCc
Confidence                       12334454     467777643322211      211      1       112356788999999998


Q ss_pred             CCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCcccCCCccc
Q psy9727         183 NGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDLLSKGPFIN  256 (257)
Q Consensus       183 ~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~~~~~~~~~  256 (257)
                      ..+....+||+++|+++|.++||++.||+++++++|.+.|||.++.|...|+|.+++|++++|  +..+..+||
T Consensus       166 ~g~~~~v~P~isvmv~~gs~sYD~~~Dg~~tel~gc~a~~rn~~~dt~~~iry~~~~l~~~~d--l~~~~~~~~  237 (351)
T KOG3839|consen  166 NGPHEHVFPYISVMVNIGSLSYDHSKDGTHTELAGCTANFRNLPHDTLVVIRYEKKTLSISID--LEGPNEWID  237 (351)
T ss_pred             CCCcccceeeEEEEeccCCcccccCCCCCccccccceeeeccCCCcceeEEEecCCceEEEEe--cCCCceeee
Confidence            766556699999999999999999999999999999999999999999999999999999999  556666654


No 12 
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER).  Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=99.40  E-value=2.3e-12  Score=115.64  Aligned_cols=87  Identities=56%  Similarity=0.865  Sum_probs=75.6

Q ss_pred             CcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEe-
Q psy9727         167 GTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGV-  245 (257)
Q Consensus       167 G~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d-  245 (257)
                      ...+++|...+++.|.........|+|+++++||...||++.||..+++++|.++|||+++|+++||+|.++.|+|++| 
T Consensus        98 ~~f~Gl~I~~Dt~~n~~~~~~~~~P~i~~~~NDGt~~yd~~~Dg~~~~~~~C~~~~rn~~~~t~~rI~Y~~~~l~v~vd~  177 (248)
T cd06901          98 DNFHGLAIFFDTYSNQNGEHEHVHPYISAMVNNGSLSYDHDRDGTHTELAGCSAPFRNKDHDTFVAIRYSKGRLTVMTDI  177 (248)
T ss_pred             CCCceEEEEEECCCCCCCcccCCCceEEEEEcCCCeeecccCCCchhhcCceeeeccCCCCCeEEEEEEECCeEEEEEec
Confidence            3678899999999988644335789999999999999999999998899999999999999999999999999988887 


Q ss_pred             -C----CcccCCC
Q psy9727         246 -N----DLLSKGP  253 (257)
Q Consensus       246 -~----~~~~~~~  253 (257)
                       +    ..||+-.
T Consensus       178 ~~~~~w~~Cf~~~  190 (248)
T cd06901         178 DGKNEWKECFDVT  190 (248)
T ss_pred             CCCCceeeeEEeC
Confidence             3    3666553


No 13 
>KOG3838|consensus
Probab=98.73  E-value=3.3e-09  Score=99.19  Aligned_cols=57  Identities=30%  Similarity=0.501  Sum_probs=52.8

Q ss_pred             CceEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCCccCcE----EEEEEeCCeeEe
Q psy9727         148 PYLSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHNHNHPY----LSAMVNNGSLHY  204 (257)
Q Consensus       148 p~i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~~~~~~----i~i~~~~G~~~~  204 (257)
                      |.|++++|||+..||+..||..+.|++|..+|||+++|+|.|++|    +++|++||..+-
T Consensus       152 P~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~innGmtp~  212 (497)
T KOG3838|consen  152 PAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVMINNGMTPS  212 (497)
T ss_pred             ccEEEEecCCcccccCCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEEEcCCCCCC
Confidence            789999999999999999999999999999999999999999998    788888886544


No 14 
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC).  ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain.  ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) may act as a regulator of ERGIC-53.  In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII.  Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=98.44  E-value=4.9e-07  Score=80.16  Aligned_cols=74  Identities=16%  Similarity=0.203  Sum_probs=59.9

Q ss_pred             eEEEEeccceeecCCCCCcccceeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCC--CCcceeecccccccCCCC
Q psy9727         150 LSAMVNNGSLHYDHDMDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMD--GTHTQLAGCECKLRNLNH  227 (257)
Q Consensus       150 i~~~~NdGt~~y~~~~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~d--g~~~~~g~C~~~~rn~~~  227 (257)
                      |++++|||++.|+.++|+..+++++|...|||.+.|     ..++|.|..+.++++.+.+  +...+|..|... ++...
T Consensus       122 i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~rn~~~p-----~~~rI~Y~~~~l~V~~d~~~~~~~~~~~~Cf~~-~~v~L  195 (225)
T cd06902         122 ILVVGNDGTKSYDHQNDGLTQALGSCLRDFRNKPYP-----VRAKITYYQNVLTVSINNGFTPNKDDYELCTRV-ENMVL  195 (225)
T ss_pred             EEEEECCCCeeccccCCCcccccceEEEeccCCCCC-----eEEEEEEECCeEEEEEeCCcCCCCCcccEEEec-CCeeC
Confidence            789999999999999999999999999999998765     6788888888898888742  333479999743 44445


Q ss_pred             cc
Q psy9727         228 DT  229 (257)
Q Consensus       228 ~~  229 (257)
                      |.
T Consensus       196 P~  197 (225)
T cd06902         196 PP  197 (225)
T ss_pred             CC
Confidence            53


No 15 
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins.  EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=98.01  E-value=1.2e-05  Score=70.87  Aligned_cols=41  Identities=7%  Similarity=0.109  Sum_probs=34.6

Q ss_pred             ecCCCCCcccceeeEeeeccCCCCCCCccCcEEEEEEeC--CeeEeecC
Q psy9727         161 YDHDMDGTHTQLAVIIDTYSNHNGEHNHNHPYLSAMVNN--GSLHYDHD  207 (257)
Q Consensus       161 y~~~~DG~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~--G~~~~d~~  207 (257)
                      |+++.| ..+++|+|...|||...|     +.|+|.|..  +.+++++|
T Consensus       130 yd~~~d-~~~~~g~C~~~~rn~~~p-----~~iri~Y~~~~~~l~v~vd  172 (215)
T cd06903         130 YKNEDV-DSLAFGSCLFAYQDSGVP-----STIRLSYDALNSLFKVQVD  172 (215)
T ss_pred             ccccCC-cccccceeeEeccCCCCC-----EEEEEEEECCCCEEEEEEC
Confidence            567888 899999999999998765     778888877  88888875


No 16 
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins.  This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex.  These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein.  Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=97.54  E-value=0.00021  Score=62.56  Aligned_cols=44  Identities=20%  Similarity=0.165  Sum_probs=35.3

Q ss_pred             eeeEeeeccCCCCCCCccCcEEEEEEeCCeeEeecCCCCCcceeeccccc
Q psy9727         172 LAVIIDTYSNHNGEHNHNHPYLSAMVNNGSLHYDHDMDGTHTQLAGCECK  221 (257)
Q Consensus       172 l~~C~~~~rn~~~~~~~~~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~  221 (257)
                      +++|..++|+...     ...++|.|.++.+.++.+....+ .|..|...
T Consensus       141 ~~~c~~~~~~~~~-----~~~~~I~y~~~~l~v~i~~~~~~-~~~~c~~~  184 (218)
T cd07308         141 LASCSLKFRNSNA-----PTTLRISYLNNTLKVDITYSEGN-NWKECFTV  184 (218)
T ss_pred             ccceeEecccCCC-----CeEEEEEEECCEEEEEEeCCCCC-CccEEEEc
Confidence            6899999998633     47788899999999999866554 88999765


No 17 
>cd06900 lectin_VcfQ VcfQ bacterial pilus biogenesis protein, lectin domain. This family includes bacterial proteins homologous to the VcfQ (also known as MshQ) bacterial pilus biogenesis protein.  VcfQ is encoded by the vcfQ gene of the type IV pilus gene cluster of Vibrio cholerae and is essential for type IV pilus assembly.  VcfQ has a Laminin G-like domain as well as an L-type lectin domain.
Probab=97.48  E-value=0.0012  Score=59.19  Aligned_cols=93  Identities=25%  Similarity=0.251  Sum_probs=67.9

Q ss_pred             CCEEEeeeE-----EEeCCEEEECCCCCCcEEEEEecCCCCCCc--EEEEEEEEEEccCcccCCCeEEEEEeecC--CC-
Q psy9727          46 PYWDVHGFA-----LASSNYVRLTADLQSRYGAIWNTVPVYMNN--WEVQITLKIHGKGKELFGDGMAFWYVRDR--ME-  115 (257)
Q Consensus        46 ~~W~~~G~a-----~v~~~~IrLTp~~~~~~G~iWs~~p~~~~~--w~ie~~Fri~g~~~~~~gdGlAfW~t~~~--~~-  115 (257)
                      .+|.+....     .+.++++|||+...++.|++....+++.++  -+|||+..+.+.+ ..++|||||+|.+-.  .+ 
T Consensus        14 ~~W~v~~~~~s~~P~i~~g~LRLT~~~~nqata~~~~~~FPs~~n~v~veFd~yayg~~-g~GADGia~vLsDasv~p~~   92 (255)
T cd06900          14 NDWVVISSGGSFTPQVVNNRLRLTDASGNQATAVTLQRLFPSAGNYVEVEFDYYAYGSG-GNGADGVALVLSDASVTPQA   92 (255)
T ss_pred             CceEEEecCCccCCccccCeEEeccCccCcceeEEEeeeeccCCCeEEEEEEEEEecCC-CCCCceEEEEEeCCCcCCcC
Confidence            467764432     245899999999999999999999999864  8999999998853 357999999998642  12 


Q ss_pred             --CCccCCCCC------CCcE--EEEEEEccCCC
Q psy9727         116 --GGPVFGNKD------FFSG--LGVIIDTYSNH  139 (257)
Q Consensus       116 --~G~~~G~~~------~f~G--laV~~DT~~n~  139 (257)
                        -|.-+|+..      -|.|  |+|-||-|.|-
T Consensus        93 G~fGGsLGYa~~~~~~~GfaGGwLGiGlDEyGNF  126 (255)
T cd06900          93 GAFGGSLGYAQRNDGVPGFAGGWLGIGLDEYGNF  126 (255)
T ss_pred             CCcCcccccccccCCCCccccceEEEEEeccccc
Confidence              333344322      2333  78999988774


No 18 
>PF03388 Lectin_leg-like:  Legume-like lectin family;  InterPro: IPR005052  Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences.  ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold.; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=95.17  E-value=0.068  Score=47.42  Aligned_cols=33  Identities=45%  Similarity=0.880  Sum_probs=26.9

Q ss_pred             cccceeeEeeeccCCCCCCCccCcEEEEEEeCC
Q psy9727         168 THTQLAVIIDTYSNHNGEHNHNHPYLSAMVNNG  200 (257)
Q Consensus       168 ~~~~l~~C~~~~rn~~~~~~~~~~~i~i~~~~G  200 (257)
                      ...+++...++|.|...+.+...|+|+++++||
T Consensus       101 ~f~Gl~i~idt~~N~~~~~~~~~p~i~~~~NDG  133 (229)
T PF03388_consen  101 KFDGLGIFIDTYDNDEGGHKRGFPYISAMLNDG  133 (229)
T ss_dssp             S-EEEEEEEEES-TTCTTCTSTSSEEEEEEEES
T ss_pred             ccceEEEEEEcccCCCcccccccceEEEEecCC
Confidence            567899999999998766557889999999999


No 19 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation. The structure of the laminin-G domain has been predicted to resemble that of pentraxin.  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=65.32  E-value=41  Score=25.28  Aligned_cols=57  Identities=14%  Similarity=0.158  Sum_probs=40.4

Q ss_pred             cEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCc
Q psy9727         191 PYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDL  248 (257)
Q Consensus       191 ~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~  248 (257)
                      .|+.+.+.+|++.+..+..+.....-.=... -+.+...++.|++..+.+++.||+..
T Consensus        20 ~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~-~~dg~wh~v~i~~~~~~~~l~Vd~~~   76 (128)
T PF02210_consen   20 DFLSLELVDGRLVVRYNLGGSEIVTTFSNSN-LNDGQWHKVSISRDGNRVTLTVDGQS   76 (128)
T ss_dssp             EEEEEEEETTEEEEEEESSSSEEEEEECSSS-STSSSEEEEEEEEETTEEEEEETTSE
T ss_pred             EEEEEEEECCEEEEEEEccccceeeeccCcc-ccccceeEEEEEEeeeeEEEEecCcc
Confidence            5888888999999888776332111111111 35667789999999999999999763


No 20 
>PF06439 DUF1080:  Domain of Unknown Function (DUF1080);  InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=52.65  E-value=70  Score=26.16  Aligned_cols=62  Identities=18%  Similarity=0.315  Sum_probs=38.6

Q ss_pred             CCCCCCEEEeee------EEEeCCEEEECCC-CCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEEEEe
Q psy9727          42 GLSIPYWDVHGF------ALASSNYVRLTAD-LQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAFWYV  110 (257)
Q Consensus        42 ~~~i~~W~~~G~------a~v~~~~IrLTp~-~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAfW~t  110 (257)
                      +.....|...+.      ..+.++.|+ +.. .....|.|.++.  ...+|+++++||+...+    ..|+.|-..
T Consensus         8 g~~l~gW~~~~~~~~~~~~~v~dG~l~-~~~~~~~~~~~l~~~~--~~~df~l~~d~k~~~~~----~sGi~~r~~   76 (185)
T PF06439_consen    8 GKDLDGWKIYGGGWFEGGWSVKDGVLV-SNGSSGSGGGYLYTDK--KFSDFELEVDFKITPGG----NSGIFFRAQ   76 (185)
T ss_dssp             SSCGTTEEETTSSSETTTEEEETTEEE--GGGGESSS--EEESS--EBSSEEEEEEEEE-TT-----EEEEEEEES
T ss_pred             CCCHHHCeeCCCCccccCcEeeCCEEE-ecccCCCCcceEEECC--ccccEEEEEEEEECCCC----CeEEEEEec
Confidence            345678887632      357788777 222 225678899885  78999999999994321    346666666


No 21 
>smart00282 LamG Laminin G domain.
Probab=46.01  E-value=1.4e+02  Score=23.10  Aligned_cols=58  Identities=17%  Similarity=0.183  Sum_probs=40.4

Q ss_pred             CcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCc
Q psy9727         190 HPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDL  248 (257)
Q Consensus       190 ~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~  248 (257)
                      ..++.+.+.+|++.+..+..+..... .....--+-+.-.+++|++..+.+++++|+..
T Consensus        27 ~~~l~l~l~~g~l~~~~~~g~~~~~~-~~~~~~~~dg~WH~v~i~~~~~~~~l~VD~~~   84 (135)
T smart00282       27 GDYLALELRDGRLVLRYDLGSGPARL-TSDPTPLNDGQWHRVAVERNGRRVTLSVDGEN   84 (135)
T ss_pred             CCEEEEEEECCEEEEEEECCCCCEEE-EECCeEeCCCCEEEEEEEEeCCEEEEEECCCc
Confidence            46888899999999988875432111 11112224456688999999999999999753


No 22 
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which include a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation. The structure of the laminin-G domain has been predicted to resemble that of pentraxin.  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-terminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=44.48  E-value=1.1e+02  Score=23.95  Aligned_cols=57  Identities=11%  Similarity=0.131  Sum_probs=44.1

Q ss_pred             cEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCcc
Q psy9727         191 PYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDLL  249 (257)
Q Consensus       191 ~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~~  249 (257)
                      .|+.+...+|++.+..+..+.. ..-.+... -+-+.-.++.+...+...++++|+...
T Consensus        20 dfial~L~~G~l~~~~~~G~~~-~~~~~~~~-i~dg~wh~v~~~r~~~~~~L~Vd~~~~   76 (131)
T PF00054_consen   20 DFIALELRDGRLEFRYNLGSGP-ASLRSPQK-INDGKWHTVSVSRNGRNGSLSVDGEEV   76 (131)
T ss_dssp             SEEEEEEETTEEEEEEESSSEE-EEEEESSE-TTSSSEEEEEEEEETTEEEEEETTSEE
T ss_pred             CEEEEEEECCEEEEEEeCCCcc-ceecCCCc-cCCCcceEEEEEEcCcEEEEEECCccc
Confidence            7999999999999998875542 22223333 677788899999999999999998764


No 23 
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor.  Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids.  Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family.  Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins.  PHA agglutinates most mammalian red blood cell types by bindin
Probab=43.47  E-value=2.3e+02  Score=24.90  Aligned_cols=24  Identities=17%  Similarity=0.232  Sum_probs=19.6

Q ss_pred             CCCCccEEEEEEe--CCeEEEEEeCC
Q psy9727         224 NLNHDTHIAIRYE--DENLTVSGVND  247 (257)
Q Consensus       224 n~~~~~~~~i~Y~--~~~l~v~~d~~  247 (257)
                      ..+...+++|.|.  .++|+|.++..
T Consensus       161 ~~g~~~~v~I~Y~~~~~~L~V~l~~~  186 (236)
T cd06899         161 KSGKPMQAWIDYDSSSKRLSVTLAYS  186 (236)
T ss_pred             cCCCeEEEEEEEcCCCCEEEEEEEeC
Confidence            4567889999999  56799999864


No 24 
>PF08139 LPAM_1:  Prokaryotic membrane lipoprotein lipid attachment site;  InterPro: IPR012640  In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognises a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached.  This lipid attachment site is found in homologues of the VirB proteins of type IV secretion systems (T4SS). Conjugal transfer across the cell envelope of Gram-negative bacteria is mediated by a supramolecular structure termed mating pair formation (Mpf) complex. Collectively, secretion pathways ancestrally related to bacterial conjugation systems are now known as T4SS. T4SS are involved in the delivery of effector molecules to eukaryotic target cells; each of these systems exports distinct DNA or protein substrates to effect a myriad of changes in host cell physiology during infection.
Probab=38.57  E-value=23  Score=20.75  Aligned_cols=14  Identities=50%  Similarity=0.683  Sum_probs=10.2

Q ss_pred             ChhhhHHHHHHHHH
Q psy9727           1 MFKILFTVIALIWQ   14 (257)
Q Consensus         1 ~~~~~~~~~~~~~~   14 (257)
                      |+|||+.+++++.+
T Consensus         7 mKkil~~l~a~~~L   20 (25)
T PF08139_consen    7 MKKILFPLLALFML   20 (25)
T ss_pred             HHHHHHHHHHHHHH
Confidence            37888888877653


No 25 
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=35.88  E-value=2e+02  Score=22.16  Aligned_cols=56  Identities=13%  Similarity=0.164  Sum_probs=39.4

Q ss_pred             CcEEEEEEeCCeeEeecCCCCCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCC
Q psy9727         190 HPYLSAMVNNGSLHYDHDMDGTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVND  247 (257)
Q Consensus       190 ~~~i~i~~~~G~~~~d~~~dg~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~  247 (257)
                      ..++.+...+|++.+..+..+.. ..-.+.. --+-+.-.++.|++..+.+++.||+.
T Consensus        46 ~~~~~l~l~~g~l~~~~~~g~~~-~~~~~~~-~v~dg~Wh~v~i~~~~~~~~l~VD~~  101 (151)
T cd00110          46 GDFLALELEDGRLVLRYDLGSGS-LVLSSKT-PLNDGQWHSVSVERNGRSVTLSVDGE  101 (151)
T ss_pred             CCEEEEEEECCEEEEEEcCCccc-EEEEccC-ccCCCCEEEEEEEECCCEEEEEECCc
Confidence            46788888899988888775322 2222222 23445668999999999999999986


No 26 
>PRK00022 lolB outer membrane lipoprotein LolB; Provisional
Probab=35.39  E-value=1.6e+02  Score=25.14  Aligned_cols=23  Identities=0%  Similarity=-0.039  Sum_probs=17.8

Q ss_pred             CCCCccEEEEEEeCCeEEEEEeC
Q psy9727         224 NLNHDTHIAIRYEDENLTVSGVN  246 (257)
Q Consensus       224 n~~~~~~~~i~Y~~~~l~v~~d~  246 (257)
                      ....|.++++++-+-++++-|++
T Consensus       176 ~~~LP~ki~l~~~~~~ikl~i~~  198 (202)
T PRK00022        176 QPALPRRLELTRGELRIKLVVDQ  198 (202)
T ss_pred             CCCCCeeEEEeCCCeEEEEEEcc
Confidence            34689999999877678887764


No 27 
>PF02248 Como_SCP:  Small coat protein;  InterPro: IPR003182 The virus capsid is composed of 60 icosahedral units, each of which is composed of one copy of each of the two coat proteins. This family contains the small coat protein (SCP) [] of the comoviridae viral family.; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 1PGW_1 1PGL_1 1BMV_1 1NY7_1 2BFU_S.
Probab=33.99  E-value=35  Score=29.39  Aligned_cols=24  Identities=21%  Similarity=0.635  Sum_probs=16.3

Q ss_pred             CcEEEEEEEEEEccCcccCCCeEEEEEee
Q psy9727          83 NNWEVQITLKIHGKGKELFGDGMAFWYVR  111 (257)
Q Consensus        83 ~~w~ie~~Fri~g~~~~~~gdGlAfW~t~  111 (257)
                      ..||++|++.|-|+     -|||++|=..
T Consensus       113 ~s~el~Fs~eVvGP-----n~GFem~~s~  136 (182)
T PF02248_consen  113 HSGELEFSREVVGP-----NDGFEMWWSD  136 (182)
T ss_dssp             SCEEEEEEEEE-BT-----CCC-B-TT-T
T ss_pred             CceeEEeeEEEEcC-----Cchhhhcccc
Confidence            47999999999997     4888877544


No 28 
>PLN03161 Probable xyloglucan endotransglucosylase/hydrolase protein; Provisional
Probab=32.95  E-value=1.4e+02  Score=27.63  Aligned_cols=18  Identities=22%  Similarity=0.178  Sum_probs=10.3

Q ss_pred             ChhhhHHHHHHHHHHHHh
Q psy9727           1 MFKILFTVIALIWQVVAQ   18 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~   18 (257)
                      |+.+|-+|++++++..+.
T Consensus         1 ~~~~~~~~~~~~~~~~~~   18 (291)
T PLN03161          1 MASLKTLLVALFAALAAF   18 (291)
T ss_pred             ChhHHHHHHHHHHHHHhc
Confidence            666665666655555444


No 29 
>PF04294 VanW:  VanW like protein;  InterPro: IPR007391 Members of this family include vancomycin resistance protein W (VanW). Genes encoding members of this family have been found in vancomycin resistance gene clusters vanB [] and vanG []. The function of VanW is unknown.
Probab=32.69  E-value=67  Score=26.21  Aligned_cols=31  Identities=23%  Similarity=0.166  Sum_probs=27.2

Q ss_pred             eeecccccccC-CCCccEEEEEEeCCeEEEEE
Q psy9727         214 QLAGCECKLRN-LNHDTHIAIRYEDENLTVSG  244 (257)
Q Consensus       214 ~~g~C~~~~rn-~~~~~~~~i~Y~~~~l~v~~  244 (257)
                      .++.-..+||| .+.|..++..+.+++|+|+|
T Consensus        99 ~~~~~Dlkf~N~~~~~i~i~~~~~~~~l~v~i  130 (132)
T PF04294_consen   99 AYGYLDLKFRNPTDYPILIQASVDGGKLTVEI  130 (132)
T ss_pred             ecCCcceEEEcCCCCeEEEEEEEeCCEEEEEE
Confidence            56678889997 58999999999999999987


No 30 
>PF11153 DUF2931:  Protein of unknown function (DUF2931);  InterPro: IPR021326  Some members in this family of proteins are annotated as lipoproteins; however, this cannot be confirmed. Currently, there is no known function.
Probab=32.54  E-value=45  Score=28.98  Aligned_cols=11  Identities=45%  Similarity=0.615  Sum_probs=7.4

Q ss_pred             ChhhhHHHHHH
Q psy9727           1 MFKILFTVIAL   11 (257)
Q Consensus         1 ~~~~~~~~~~~   11 (257)
                      |+++|+++++|
T Consensus         1 mk~i~~l~l~l   11 (216)
T PF11153_consen    1 MKKILLLLLLL   11 (216)
T ss_pred             ChHHHHHHHHH
Confidence            77777776444


No 31 
>PF15524 Toxin_45:  Putative toxin 45
Probab=29.58  E-value=60  Score=24.57  Aligned_cols=29  Identities=17%  Similarity=0.503  Sum_probs=23.4

Q ss_pred             cEEEEEecCCCCCCcEEEEEEEEEEccCc
Q psy9727          70 RYGAIWNTVPVYMNNWEVQITLKIHGKGK   98 (257)
Q Consensus        70 ~~G~iWs~~p~~~~~w~ie~~Fri~g~~~   98 (257)
                      +.|-.|.+-|-....++.|++...+..++
T Consensus        43 kFGNeWtkgPsrtkgq~fEWDVQLS~~G~   71 (94)
T PF15524_consen   43 KFGNEWTKGPSRTKGQEFEWDVQLSETGR   71 (94)
T ss_pred             cccceeccCCCccCCcceEEEeeeCHhHH
Confidence            67999999998888777777777776654


No 32 
>PF09224 DUF1961:  Domain of unknown function (DUF1961);  InterPro: IPR015305 Members of this family are found in a set of hypothetical bacterial proteins. Their exact function has not, as yet, been determined. ; PDB: 1OQ1_C.
Probab=27.79  E-value=4e+02  Score=23.70  Aligned_cols=63  Identities=11%  Similarity=0.242  Sum_probs=41.4

Q ss_pred             CCCCEEEeeeEEEe--CCEEEECCC------CCCcEEEEEecCCCCCCcEEEEEEEEEEccCcccCCCeEEEEEeecC
Q psy9727          44 SIPYWDVHGFALAS--SNYVRLTAD------LQSRYGAIWNTVPVYMNNWEVQITLKIHGKGKELFGDGMAFWYVRDR  113 (257)
Q Consensus        44 ~i~~W~~~G~a~v~--~~~IrLTp~------~~~~~G~iWs~~p~~~~~w~ie~~Fri~g~~~~~~gdGlAfW~t~~~  113 (257)
                      .+..|.+.|.+.++  ++.++|...      .+...=-+|.+.-++ ++..|+++|+...+      -|||+.+..-.
T Consensus        16 dv~~w~~eG~~~~t~~~~~m~l~~~~~~e~~~~~~hfVlW~p~~Fp-~~i~IsweF~p~~e------pGLamlfF~Aa   86 (218)
T PF09224_consen   16 DVADWVAEGPAKVTFQNGRMELSGELDPEQFGDAAHFVLWCPEVFP-DDIRISWEFTPLAE------PGLAMLFFAAA   86 (218)
T ss_dssp             GGTT-EEEESEEEE-GGGSEEEE------ESSTT--EEEE-SS-B--SSEEEEEEEEEEE-------SEEEEEEEEEE
T ss_pred             HhhhhhccCCeEEeecCCcEEEeeeeccccccccccEEEECcccCC-CCeEEEEEEEEcCC------CCEEEEEeecc
Confidence            58899999999886  688998732      223455689765554 46799999988753      68898886543


No 33 
>PF02402 Lysis_col:  Lysis protein;  InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined []. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells []. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB []. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveals extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively [].  Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides []. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase []. The mature ColE2 lysis protein is located in the cell envelope [].; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=27.52  E-value=26  Score=23.32  Aligned_cols=22  Identities=36%  Similarity=0.378  Sum_probs=15.3

Q ss_pred             ChhhhHHHHHHHHHHHHhhhccC
Q psy9727           1 MFKILFTVIALIWQVVAQEAQWN   23 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~   23 (257)
                      |+||++..+.++..++ ..||-|
T Consensus         1 MkKi~~~~i~~~~~~L-~aCQaN   22 (46)
T PF02402_consen    1 MKKIIFIGIFLLTMLL-AACQAN   22 (46)
T ss_pred             CcEEEEeHHHHHHHHH-HHhhhc
Confidence            8888888877777444 356655


No 34 
>cd02182 GH16_Strep_laminarinase_like Streptomyces laminarinase-like, member of glycosyl hydrolase family 16. Proteins similar to Streptomyces sioyaensis beta-1,3-glucanase (laminarinase) present in Actinomycetales as well as Pezizomycotina. Laminarinases belong to glycosyl hydrolase family 16 and hydrolyze the glycosidic bond of the 1,3-beta-linked glucan, a major component of fungal and plant cell walls and the structural and storage polysaccharides (laminarin) of marine macro-algae. Members of the GH16 family have a conserved jelly roll fold with an active site channel.
Probab=26.65  E-value=1.4e+02  Score=26.48  Aligned_cols=60  Identities=18%  Similarity=0.238  Sum_probs=35.2

Q ss_pred             eEEEe-CCEEEECCCC----CCcEEEEEecCCCCCCc----EEEEEEEEEEccCcccCCCeE--EEEEeecC
Q psy9727          53 FALAS-SNYVRLTADL----QSRYGAIWNTVPVYMNN----WEVQITLKIHGKGKELFGDGM--AFWYVRDR  113 (257)
Q Consensus        53 ~a~v~-~~~IrLTp~~----~~~~G~iWs~~p~~~~~----w~ie~~Fri~g~~~~~~gdGl--AfW~t~~~  113 (257)
                      ++.+. ++.++|+...    +-.+|+|-++.......    +.+|..+++.... ...+.|+  |||+....
T Consensus        47 n~~v~~dG~L~I~a~~~~~~~ytSg~i~s~~~~~~~~~gg~~~~EaRik~p~~~-~~~~~G~wPAfWll~~~  117 (259)
T cd02182          47 NVQLSGNGTLQITPLRDGSGKWTSGRIETTRTDFAAPPGGKLRVEASIRLGDVP-GSNQQGIWPAFWMLGDS  117 (259)
T ss_pred             CEEEcCCCeEEEEEEecCCCCEEEEEEEECCccccccCCCcEEEEEEEECCCCc-ccCCCCcCeeeeccCCC
Confidence            34566 7777777632    24689999988765431    3444444443210 0125677  99998764


No 35 
>PF08787 Alginate_lyase2:  Alginate lyase;  InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alginate. They cleave the glycosidic linkage of alginate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=26.58  E-value=2e+02  Score=25.34  Aligned_cols=60  Identities=18%  Similarity=0.008  Sum_probs=38.5

Q ss_pred             cCcEEEEEE------eCCeeEeecCCC-CCcceeecccccccCCCCccEEEEEEeCCeEEEEEeCCc
Q psy9727         189 NHPYLSAMV------NNGSLHYDHDMD-GTHTQLAGCECKLRNLNHDTHIAIRYEDENLTVSGVNDL  248 (257)
Q Consensus       189 ~~~~i~i~~------~~G~~~~d~~~d-g~~~~~g~C~~~~rn~~~~~~~~i~Y~~~~l~v~~d~~~  248 (257)
                      ..|.|+|.|      .+|.+.+-+... ..+.....|.+.==.++-.+..+|.+..+.|+|.+++..
T Consensus       120 ~~pllkl~~~~~~~~~~G~l~~~~~~~~~~~~~~~~~~~~~i~LG~~F~y~I~v~~~~l~V~ing~~  186 (236)
T PF08787_consen  120 NEPLLKLYYRKEPGNEKGSLYVYVRQSNPDGGDQEYTIYGGIPLGEWFSYEIEVSGGTLTVTINGEG  186 (236)
T ss_dssp             EEEEEEEEEEEC--TETTEEEEEEESSTTTTSEEEEEEEEEEETT-EEEEEEEEETTEEEEEETTEE
T ss_pred             CCCEEEEEEEEeeeccCCeEEEEEeccCCCCCcEEeeeEcceeCCCEEEEEEEEECCEEEEEEECCc
Confidence            458889999      789888887722 111122222222113456689999999999999998753


No 36 
>PRK09993 C-lysozyme inhibitor; Provisional
Probab=23.98  E-value=20  Score=30.12  Aligned_cols=80  Identities=23%  Similarity=0.223  Sum_probs=39.9

Q ss_pred             ChhhhHHHHHHHHHHHHhhhccCcccceeeeeeeeCCccC--CCCCCCCEEEeeeEEEeCCEEEECCCCCCcEEEEEecC
Q psy9727           1 MFKILFTVIALIWQVVAQEAQWNTEDFLKRHHSMVKPYLT--SGLSIPYWDVHGFALASSNYVRLTADLQSRYGAIWNTV   78 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Sf~~P~~~--~~~~i~~W~~~G~a~v~~~~IrLTp~~~~~~G~iWs~~   78 (257)
                      |++.|++.++|.....+-.+++-+-..+..+--++..|+.  .+..+|.|...|-+..-...|.+ ...+..+|.++-.+
T Consensus         6 ~~~~l~aa~lL~~s~~a~Aq~d~t~seLl~~p~yk~aw~~mvkg~~LP~WV~~gGtssP~~~V~~-~G~~YlVg~~CKPH   84 (153)
T PRK09993          6 MFKALTTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYSPAQTVTL-GDETYQVMSACKPH   84 (153)
T ss_pred             HHHHHHHHHHHHHHHHHHhcCCccHHHHhcCHHHHHHHHHHHcCCCCcHHHHcCCCCCchhheee-CCceEEEecccccc
Confidence            6777777666655554333333222233333333333432  23479999987733322333433 22335678887544


Q ss_pred             CCC
Q psy9727          79 PVY   81 (257)
Q Consensus        79 p~~   81 (257)
                      -+.
T Consensus        85 DC~   87 (153)
T PRK09993         85 DCG   87 (153)
T ss_pred             CCC
Confidence            443


No 37 
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=20.38  E-value=72  Score=25.16  Aligned_cols=18  Identities=33%  Similarity=0.233  Sum_probs=14.3

Q ss_pred             ChhhhHHHHHHHHHHHHh
Q psy9727           1 MFKILFTVIALIWQVVAQ   18 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~   18 (257)
                      |+++|+.+++++..++..
T Consensus         1 mKK~li~li~~ivv~~~~   18 (114)
T TIGR01655         1 MKKGLAILLALIVVITIV   18 (114)
T ss_pred             CceehHHHHHHHHhHHHh
Confidence            899999998888766655


Done!