Query         040418
Match_columns 249
No_of_seqs    142 out of 930
Neff          6.7 
Searched_HMMs 46136
Date          Fri Mar 29 08:13:57 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/040418.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/040418hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd06899 lectin_legume_LecRK_Ar 100.0 5.6E-52 1.2E-56  365.2  22.5  207   31-249     1-211 (236)
  2 PF00139 Lectin_legB:  Legume l 100.0   4E-51 8.8E-56  359.3  18.9  210   31-249     2-214 (236)
  3 cd01951 lectin_L-type legume l 100.0   2E-36 4.2E-41  263.3  21.1  183   40-249    13-200 (223)
  4 cd07308 lectin_leg-like legume  99.8 1.9E-17   4E-22  143.9  20.3  146   41-224    20-173 (218)
  5 cd06901 lectin_VIP36_VIPL VIP3  99.5 4.5E-12 9.6E-17  112.7  20.1  151   40-224    19-178 (248)
  6 cd06902 lectin_ERGIC-53_ERGL E  99.4 1.3E-10 2.8E-15  102.0  20.2  151   41-223    22-175 (225)
  7 cd06903 lectin_EMP46_EMP47 EMP  99.2 3.5E-09 7.7E-14   92.3  17.6  145   41-223    21-173 (215)
  8 PF03388 Lectin_leg-like:  Legu  98.9 6.5E-08 1.4E-12   85.0  16.5  153   41-223    22-179 (229)
  9 cd06900 lectin_VcfQ VcfQ bacte  98.4 1.8E-06 3.9E-11   76.0  10.5   94   54-161    31-126 (255)
 10 KOG3838 Mannose lectin ERGIC-5  97.7  0.0027 5.9E-08   59.5  16.0  152   42-225    55-209 (497)
 11 KOG3839 Lectin VIP36, involved  97.7  0.0024 5.1E-08   58.7  15.3   94   41-161    72-165 (351)
 12 PF07172 GRP:  Glycine rich pro  80.7     1.4   3E-05   33.7   2.4   19    1-19      1-19  (95)
 13 KOG3514 Neurexin III-alpha [Si  62.1      49  0.0011   35.7   9.0  132   41-219   804-939 (1591)
 14 PF10731 Anophelin:  Thrombin i  48.1      19 0.00042   25.2   2.5   37    1-39      1-39  (65)
 15 smart00282 LamG Laminin G doma  46.9 1.3E+02  0.0028   23.0   9.7   26  196-223    57-82  (135)
 16 cd00110 LamG Laminin G domain;  44.2 1.5E+02  0.0031   22.8  11.8   26  197-224    76-101 (151)
 17 PF13619 KTSC:  KTSC domain      33.5      49  0.0011   22.6   2.8   20  205-224     6-25  (60)
 18 smart00560 LamGL LamG-like jel  32.6      64  0.0014   25.2   3.7   23  201-223    61-83  (133)
 19 smart00159 PTX Pentraxin / C-r  32.5      61  0.0013   27.6   3.8   27  197-223    86-112 (206)
 20 PF10049 DUF2283:  Protein of u  28.3      67  0.0015   21.2   2.7   16  207-222     2-17  (50)
 21 PRK10894 lipopolysaccharide tr  28.3 1.2E+02  0.0025   25.4   4.8   20   41-61     46-65  (180)
 22 cd00152 PTX Pentraxins are pla  28.2      78  0.0017   26.8   3.7   26  198-223    87-112 (201)

No 1  
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor.  Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids.  Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family.  Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins.  PHA agglutinates most mammalian red blood cell types by bindin
Probab=100.00  E-value=5.6e-52  Score=365.18  Aligned_cols=207  Identities=49%  Similarity=0.763  Sum_probs=185.1

Q ss_pred             ceEEeCCCCC--CCeEEeeceEECcCCeEEccCCC--CCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeec
Q 040418           31 PSFIYNGFRS--ANLSLDGIAQFTSNGLLKLTNET--KGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSE  106 (249)
Q Consensus        31 ~sF~f~~F~~--~~l~l~G~A~~~~~g~l~LT~~~--~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~  106 (249)
                      ++|+|++|..  ++|+++|+|.+..++.|+||++.  .+++|||||++||+||++.      +++++||+|+|+|.|.+.
T Consensus         1 ~~f~f~~f~~~~~~l~l~G~A~~~~~~~i~LT~~~~~~~~~G~v~y~~pi~l~~~~------~~~~~sFst~F~F~i~~~   74 (236)
T cd06899           1 LSFNFNGFSSDQSNLTLQGDATISSNGALQLTNDTSPASSVGRALYSKPVRLWDST------TGKVASFSTSFSFSITPP   74 (236)
T ss_pred             CceecCCCCCCCCCEEEecceEcCCCCeEEecCCCCCCcceEEEEeCCCEEeecCC------CCCceeEEEEEEEEEEcC
Confidence            4799999986  79999999999658999999998  8999999999999999987      889999999999999986


Q ss_pred             cccCccccceEEEEccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcCCCcccccccC
Q 040418          107 FHTTLSAHGIAFVIAPTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDINSLKSEISYPA  186 (249)
Q Consensus       107 ~~~~~~gdGlAFvl~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvns~~S~~t~~~  186 (249)
                       ....+||||||+|+|....+.+..|++|||.+.++.+...++.|||||||++|.+++||+.+|||||+|++.|..+..+
T Consensus        75 -~~~~~gdGlAF~i~~~~~~~~~~~G~~lG~~~~~~~~~~~~~~vAVEFDT~~n~~~~D~~~nHigIdvn~~~S~~~~~~  153 (236)
T cd06899          75 -NPSLGGDGLAFFLAPTDSLPPASSGGYLGLFNSSNNGNSSNHIVAVEFDTFQNPEFGDPDDNHVGIDVNSLVSVKAGYW  153 (236)
T ss_pred             -CCCCCCCeEEEEEecCCCCCCCCCcceeeeecCCCCCCcccceEEEEeecccCcccCCCCCCeEEEEcCCcccceeecc
Confidence             4567899999999998755557889999999887666677899999999999998889999999999999988877665


Q ss_pred             CccCCCcccccccCCcceEEEEEEeCCCcEEEEEEeeCCCCCCCCceeeEEecCCcccCccCC
Q 040418          187 GYYGDHFVNLTLISGRPMQVWVEYDGLEKRTNVTLAPINIPKPRLPLLSLSRDLSSVLNDAMY  249 (249)
Q Consensus       187 ~~~~~~~~~~~l~~G~~~~v~I~Yd~~~~~L~V~l~~~~~~~p~~p~ls~~vdLs~~l~e~vy  249 (249)
                      ..     ..++|.+|+.|+|||+||+.+++|+|+|+..+..||..|+|++++||+++|||+||
T Consensus       154 ~~-----~~~~l~~g~~~~v~I~Y~~~~~~L~V~l~~~~~~~~~~~~ls~~vdL~~~l~~~~~  211 (236)
T cd06899         154 DD-----DGGKLKSGKPMQAWIDYDSSSKRLSVTLAYSGVAKPKKPLLSYPVDLSKVLPEEVY  211 (236)
T ss_pred             cc-----ccccccCCCeEEEEEEEcCCCCEEEEEEEeCCCCCCcCCEEEEeccHHHhCCCceE
Confidence            31     24458899999999999999999999999887668999999999999999999986


No 2  
>PF00139 Lectin_legB:  Legume lectin domain;  InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=100.00  E-value=4e-51  Score=359.30  Aligned_cols=210  Identities=44%  Similarity=0.713  Sum_probs=182.1

Q ss_pred             ceEEeCCC-CCCCeEEeeceEECcCCeEEccCCCC-CeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeecc-
Q 040418           31 PSFIYNGF-RSANLSLDGIAQFTSNGLLKLTNETK-GQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEF-  107 (249)
Q Consensus        31 ~sF~f~~F-~~~~l~l~G~A~~~~~g~l~LT~~~~-~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~-  107 (249)
                      ++|+|++| +..+++++|+|.+..+|+|+||++.. +|+|||||++||+|||+.      ++.++||+|+|+|+|.... 
T Consensus         2 ~~F~~~~F~~~~~~~l~G~A~~~~~~~l~LT~~~~~~~~G~~~y~~pi~l~d~~------~~~~~sF~t~F~f~i~~~~~   75 (236)
T PF00139_consen    2 VSFSFPSFSNSSNLTLNGDASISSNGSLQLTPDSTNNQAGRAWYNNPIQLWDST------TGNVASFSTSFSFSITNGPG   75 (236)
T ss_dssp             EEEEESSBTTGTTEEEEETEEEETTSEEESSTBETSSEEEEEEESSEEESBETT------TTEBEEEEEEEEEEEEESSS
T ss_pred             ceEEcCCCCCCCceEEEeeEEeccCCeEEcCCCCCCCcEEEEEECCcEEEeCCC------CcceeeeeeEEEEEEeccCC
Confidence            58999999 46999999999985589999999987 999999999999999987      8889999999999996431 


Q ss_pred             ccCccccceEEEEccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcCCCcccccccCC
Q 040418          108 HTTLSAHGIAFVIAPTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDINSLKSEISYPAG  187 (249)
Q Consensus       108 ~~~~~gdGlAFvl~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvns~~S~~t~~~~  187 (249)
                      +...+||||||||+|....+.++.|++||+.+..+.+...+++||||||||+|.+++||+.+||||++|++.+..+.+++
T Consensus        76 ~~~~~~dGlAFvi~~~~~~~~~~~g~~lG~~~~~~~~~~~~~~vAVEFDT~~N~~~~d~~~nHIgI~~n~~~s~~~~~~~  155 (236)
T PF00139_consen   76 SSNNGGDGLAFVIQPDPNLPGGSSGGYLGLFNSSTDGNGINNSVAVEFDTYKNPEYNDPDDNHIGIDVNSVVSNKTASAG  155 (236)
T ss_dssp             SSSS-BEEEEEEEEETTSSTTTSSGGGTTTSSSSSTTGGGGCEEEEEEETSTCGGGTTTSSSEEEEEESSSSESEEEE--
T ss_pred             CCccCCCceEEEEecCcccccCCCCCccCccccccCCCccCcEEEEEEeeeecccccccCCCEEEEECCCCccccccccc
Confidence            46779999999999988767778999999998776666678999999999999988999999999999999998886654


Q ss_pred             ccCCCcccccccCCcceEEEEEEeCCCcEEEEEEeeCCCCCCCCceeeEEecCCcccCccCC
Q 040418          188 YYGDHFVNLTLISGRPMQVWVEYDGLEKRTNVTLAPINIPKPRLPLLSLSRDLSSVLNDAMY  249 (249)
Q Consensus       188 ~~~~~~~~~~l~~G~~~~v~I~Yd~~~~~L~V~l~~~~~~~p~~p~ls~~vdLs~~l~e~vy  249 (249)
                      ++  .....++.+|+.|+|||+||+.+++|+|+|+... .||..|+|++.|||+++|+++||
T Consensus       156 ~~--~~~~~~l~~g~~~~v~I~Yd~~~~~L~V~l~~~~-~~~~~~~l~~~vdL~~~l~~~v~  214 (236)
T PF00139_consen  156 YY--SSPSFSLSDGKWHTVWIDYDASTKRLSVYLDDNS-SKPSSPVLSVNVDLSAVLPEQVY  214 (236)
T ss_dssp             ----EEEEHHHGTTSEEEEEEEEETTTTEEEEEEEETT-TTSEEEEEEEE--HHHHSCSEEE
T ss_pred             cc--ccccccccCCcEEEEEEEEcCCccEEEEEEeccc-CCCcceeEEEEEchHHhcCCCcE
Confidence            33  1346789999999999999999999999999874 58999999999999999999987


No 3  
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind.  This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers.  Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=100.00  E-value=2e-36  Score=263.35  Aligned_cols=183  Identities=29%  Similarity=0.362  Sum_probs=148.6

Q ss_pred             CCCeEEeeceEECc-CCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEE
Q 040418           40 SANLSLDGIAQFTS-NGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAF  118 (249)
Q Consensus        40 ~~~l~l~G~A~~~~-~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAF  118 (249)
                      ..++.++|+|.+.. ++.|+||++..+++||+||++||++|.             +|+|+|+|+|...  ...+||||||
T Consensus        13 ~~~~~~~G~A~~~~~~~~l~Lt~~~~~~~G~~~~~~~i~~~~-------------~F~~~F~f~i~~~--~~~~gdG~aF   77 (223)
T cd01951          13 QSNWQLNGSATLTTDSGVLRLTPDTGNQAGSAWYKTPIDLSK-------------DFTTTFKFYLGTK--GTNGADGIAF   77 (223)
T ss_pred             hhhcEEcccEEecCCCCEEEECCCCCCcEEEEEECCcEeccC-------------CEEEEEEEEEeCC--CCCCCCcEEE
Confidence            37899999999964 789999999999999999999999982             8999999999975  3568999999


Q ss_pred             EEccCCCCCCCCCC--CcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcCCCcccc--cccCCccCCCcc
Q 040418          119 VIAPTRGLPGARPS--QYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDINSLKSEI--SYPAGYYGDHFV  194 (249)
Q Consensus       119 vl~p~~~~p~~s~G--g~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvns~~S~~--t~~~~~~~~~~~  194 (249)
                      +|+|....+.+..|  ++||+.       ..++.+|||||||+|.+++||+.+||||++|+..+..  ......+.   .
T Consensus        78 ~l~~~~~~~~~~~g~~~~lG~~-------~~~~~~aVefDT~~N~~~~dp~~~higi~~n~~~~~~~~~~~~~~~~---~  147 (223)
T cd01951          78 VLQNDPAGALGGGGGGGGLGYG-------GIGNSVAVEFDTYKNDDNNDPNGNHISIDVNGNGNNTALATSLGSAS---L  147 (223)
T ss_pred             EEecCCCCccccCCCCCccCcc-------ccCCeEEEEEeccccCCCCCCCCCEEEEEcCCCCCCcccccccceee---C
Confidence            99998654444444  788873       3468999999999999888999999999999987541  11111110   1


Q ss_pred             cccccCCcceEEEEEEeCCCcEEEEEEeeCCCCCCCCceeeEEecCCcccCccCC
Q 040418          195 NLTLISGRPMQVWVEYDGLEKRTNVTLAPINIPKPRLPLLSLSRDLSSVLNDAMY  249 (249)
Q Consensus       195 ~~~l~~G~~~~v~I~Yd~~~~~L~V~l~~~~~~~p~~p~ls~~vdLs~~l~e~vy  249 (249)
                      ......|+.|+|||+|++.+++|+|+|.....  |..|+++.++||+.+++++||
T Consensus       148 ~~~~~~g~~~~v~I~Y~~~~~~L~v~l~~~~~--~~~~~l~~~~~l~~~~~~~~y  200 (223)
T cd01951         148 PNGTGLGNEHTVRITYDPTTNTLTVYLDNGST--LTSLDITIPVDLIQLGPTKAY  200 (223)
T ss_pred             CCccCCCCEEEEEEEEeCCCCEEEEEECCCCc--cccccEEEeeeecccCCCcEE
Confidence            11222389999999999999999999987653  677899999999999999986


No 4  
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins.  This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex.  These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein.  Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=99.79  E-value=1.9e-17  Score=143.89  Aligned_cols=146  Identities=22%  Similarity=0.300  Sum_probs=106.4

Q ss_pred             CCeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEE
Q 040418           41 ANLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVI  120 (249)
Q Consensus        41 ~~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl  120 (249)
                      .++.+.|+|.+. ++.|+||++..++.|++||+.|+++.              +|+++|+|+|...  ...+||||||+|
T Consensus        20 ~~w~~~G~a~~~-~~~i~LT~~~~~~~G~~~~~~pi~~~--------------~F~~~f~F~i~~~--~~~~gdG~af~~   82 (218)
T cd07308          20 GNWTVGGSTVIT-KNYIRLTPDVPSQSGSLWSRVPIPAK--------------DFEIEVEFSIHGG--SGLGGDGFAFWY   82 (218)
T ss_pred             CCeEEcCCeEEe-CCEEEeCCCCCCCEeEEEeCCCccCC--------------CEEEEEEEEEeCC--CCCCCCEEEEEE
Confidence            689999999997 89999999999999999999999973              7999999999875  356899999999


Q ss_pred             ccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcC-CCcccc------cccCCccCCCc
Q 040418          121 APTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDIN-SLKSEI------SYPAGYYGDHF  193 (249)
Q Consensus       121 ~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvn-s~~S~~------t~~~~~~~~~~  193 (249)
                      +|...    ..|..+|.-+       ..+-+||||||++|.   +-..++|.+.+| +..+..      ....+     .
T Consensus        83 ~~~~~----~~g~~~G~~~-------~~~Glai~fdt~~n~---~~~~p~i~~~~Ndg~~~~~~~~d~~~~~~~-----~  143 (218)
T cd07308          83 TEEPG----SDGPLFGGPD-------KFKGLAIFFDTYDND---GKGFPSISVFLNDGTKSYDYETDGEKLELA-----S  143 (218)
T ss_pred             ECCCC----CCCcccccCC-------CCCEEEEEEEcCCCC---CCCCCeEEEEEeCCCceecccCCCcccccc-----c
Confidence            98642    2455566532       357899999999985   233456666554 222211      01110     0


Q ss_pred             ccccccC-CcceEEEEEEeCCCcEEEEEEeeC
Q 040418          194 VNLTLIS-GRPMQVWVEYDGLEKRTNVTLAPI  224 (249)
Q Consensus       194 ~~~~l~~-G~~~~v~I~Yd~~~~~L~V~l~~~  224 (249)
                      ......+ +++.+++|.|+  .+.|.|.+...
T Consensus       144 c~~~~~~~~~~~~~~I~y~--~~~l~v~i~~~  173 (218)
T cd07308         144 CSLKFRNSNAPTTLRISYL--NNTLKVDITYS  173 (218)
T ss_pred             eeEecccCCCCeEEEEEEE--CCEEEEEEeCC
Confidence            0112222 67899999999  67899999753


No 5  
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER).  Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=99.49  E-value=4.5e-12  Score=112.68  Aligned_cols=151  Identities=16%  Similarity=0.145  Sum_probs=100.2

Q ss_pred             CCCeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEE
Q 040418           40 SANLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFV  119 (249)
Q Consensus        40 ~~~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFv  119 (249)
                      ..++.+.|+|.+. ++.||||++..++.|++||+.|+++.              +|+++|+|+|.+. ....+||||||.
T Consensus        19 i~~w~~~G~a~v~-~~~IrLTp~~~~~~G~~w~~~p~~~~--------------~F~~~f~F~I~~~-~~~~~GdGlAfw   82 (248)
T cd06901          19 MPLWDFLGSTMVT-SQYIRLTPDHQSKQGSIWNRVPCYLR--------------DWEMHVHFKVHGS-GKNLFGDGFAIW   82 (248)
T ss_pred             CCCEEEcceEEEc-CCeEEECCCCCCCEEEEeccCCccCC--------------CEEEEEEEEEeCC-CCCCCCCEEEEE
Confidence            4789999999997 78999999988899999999999983              7999999999986 445689999999


Q ss_pred             EccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCC-CCCCCCeeEEEcC-CCccccc------ccCCccCC
Q 040418          120 IAPTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEF-SDINDNHVGIDIN-SLKSEIS------YPAGYYGD  191 (249)
Q Consensus       120 l~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~-~Dp~~nHVgIdvn-s~~S~~t------~~~~~~~~  191 (249)
                      ++....    ..|..+|-.+       .-.=+||-|||+.|..- -....+-|.+-+| +......      ..++.-  
T Consensus        83 ~t~~~~----~~G~~fG~~~-------~f~Gl~I~~Dt~~n~~~~~~~~~P~i~~~~NDGt~~yd~~~Dg~~~~~~~C--  149 (248)
T cd06901          83 YTKERM----QPGPVFGSKD-------NFHGLAIFFDTYSNQNGEHEHVHPYISAMVNNGSLSYDHDRDGTHTELAGC--  149 (248)
T ss_pred             EEcCCC----ccCcccccCC-------CCceEEEEEECCCCCCCcccCCCceEEEEEcCCCeeecccCCCchhhcCce--
Confidence            998642    3344445422       12458999999998631 0112233444333 3222110      001000  


Q ss_pred             Ccccccc-cCCcceEEEEEEeCCCcEEEEEEeeC
Q 040418          192 HFVNLTL-ISGRPMQVWVEYDGLEKRTNVTLAPI  224 (249)
Q Consensus       192 ~~~~~~l-~~G~~~~v~I~Yd~~~~~L~V~l~~~  224 (249)
                         .... +.+.+-.++|.|...  .|+|.++..
T Consensus       150 ---~~~~rn~~~~t~~rI~Y~~~--~l~v~vd~~  178 (248)
T cd06901         150 ---SAPFRNKDHDTFVAIRYSKG--RLTVMTDID  178 (248)
T ss_pred             ---eeeccCCCCCeEEEEEEECC--eEEEEEecC
Confidence               0111 234557899999974  577777643


No 6  
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC).  ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain.  ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) may act as a regulator of ERGIC-53.  In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII.  Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=99.37  E-value=1.3e-10  Score=101.97  Aligned_cols=151  Identities=17%  Similarity=0.188  Sum_probs=102.6

Q ss_pred             CCeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEE
Q 040418           41 ANLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVI  120 (249)
Q Consensus        41 ~~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl  120 (249)
                      ..+.+.|+|.+. ++.||||++.+++.|.+|.+.|++..              +|+.+|+|+|...  ...+||||||.+
T Consensus        22 ~~W~~~G~t~~~-~~~IrLTp~~~~~~G~iw~~~~~~~~--------------~w~ie~~Fri~g~--~~~~gdG~a~W~   84 (225)
T cd06902          22 PFWSHGGDAIAS-LEQVRLTPSLRSKKGSVWTKNPFSFE--------------NWEVEVTFRVTGR--GRIGADGLAIWY   84 (225)
T ss_pred             CceEecccEEec-CCEEEECCCCCCCEEEEeeCCCcCCC--------------CEEEEEEEEEecC--CCCCCCEEEEEE
Confidence            689999999886 88999999999999999999999832              7999999999875  345789999999


Q ss_pred             ccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcC-CCcccccccCCccCCCcc-cccc
Q 040418          121 APTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDIN-SLKSEISYPAGYYGDHFV-NLTL  198 (249)
Q Consensus       121 ~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvn-s~~S~~t~~~~~~~~~~~-~~~l  198 (249)
                      +....    ..|+.+|..+       .-.-+||.|||+.|.+  ....++|.+-.| +........-........ ....
T Consensus        85 t~~~~----~~G~~~G~~~-------~f~Gl~I~~Dt~~n~~--~~~~p~i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~  151 (225)
T cd06902          85 TKERG----EEGPVFGSSD-------KWNGVGIFFDSFDNDG--KKNNPAILVVGNDGTKSYDHQNDGLTQALGSCLRDF  151 (225)
T ss_pred             ECCCC----CCCCccCCCC-------cccEEEEEEECCCCCC--CCCCcEEEEEECCCCeeccccCCCcccccceEEEec
Confidence            97642    2455566533       2346899999998852  233456766554 322211110000000000 0122


Q ss_pred             -cCCcceEEEEEEeCCCcEEEEEEee
Q 040418          199 -ISGRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       199 -~~G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                       +...+.+++|.|..  +.|+|.++.
T Consensus       152 rn~~~p~~~rI~Y~~--~~l~V~~d~  175 (225)
T cd06902         152 RNKPYPVRAKITYYQ--NVLTVSINN  175 (225)
T ss_pred             cCCCCCeEEEEEEEC--CeEEEEEeC
Confidence             23467899999998  469998874


No 7  
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins.  EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=99.15  E-value=3.5e-09  Score=92.29  Aligned_cols=145  Identities=16%  Similarity=0.182  Sum_probs=98.8

Q ss_pred             CCeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEE
Q 040418           41 ANLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVI  120 (249)
Q Consensus        41 ~~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl  120 (249)
                      .++.+.|+|.+. ++.||||++ +++.|.+|-+.|+++.+             +|+.+|+|+|+..  ...+||||||-+
T Consensus        21 ~~W~~~G~t~v~-~~~IrLTp~-~s~~G~iWs~~pl~~~~-------------~w~ie~~Fri~G~--~~~~gdGla~W~   83 (215)
T cd06903          21 PNWQTSGNPKLE-SGRIILTPP-GNQRGSLWLKKPLSLKD-------------EWTIEWTFRSTGP--EGRSGGGLNFWL   83 (215)
T ss_pred             CCeEEcCcEEee-CCeEEECCC-CCceEeEeeCCcCCCCC-------------CEEEEEEEEeccc--CCcCCCEEEEEE
Confidence            689999999997 889999999 99999999999999742             6999999999875  336899999999


Q ss_pred             ccCCCCCCCCCC-CcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEc-CCCccccccc-----CCccCCCc
Q 040418          121 APTRGLPGARPS-QYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDI-NSLKSEISYP-----AGYYGDHF  193 (249)
Q Consensus       121 ~p~~~~p~~s~G-g~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdv-ns~~S~~t~~-----~~~~~~~~  193 (249)
                      .......   .| ..+|-.+       .-.=+||.|||+.|.   .   ..|.+-+ ++........     ++.=    
T Consensus        84 t~~~~~~---~g~~~fG~~~-------~f~Gl~I~~Dt~~n~---~---p~i~~~~NDGt~~yd~~~d~~~~~g~C----  143 (215)
T cd06903          84 VKDGNAD---VGTSSIYGPS-------KFDGLQLLIDNNGGS---G---GSLRGFLNDGSKDYKNEDVDSLAFGSC----  143 (215)
T ss_pred             ECCCccc---CCccccCCCC-------CCcEEEEEEECCCCC---C---ceEEEEECCCCeeccccCCccccccee----
Confidence            9754211   11 2222211       123489999999874   1   2333333 3332211111     1100    


Q ss_pred             ccc-cccCCcceEEEEEEeCCCcEEEEEEee
Q 040418          194 VNL-TLISGRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       194 ~~~-~l~~G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                       .. -.+.+.+.+++|.|....+.|+|.++.
T Consensus       144 -~~~~rn~~~p~~iri~Y~~~~~~l~v~vd~  173 (215)
T cd06903         144 -LFAYQDSGVPSTIRLSYDALNSLFKVQVDN  173 (215)
T ss_pred             -eEeccCCCCCEEEEEEEECCCCEEEEEECC
Confidence             01 134566899999999977889998864


No 8  
>PF03388 Lectin_leg-like:  Legume-like lectin family;  InterPro: IPR005052  Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences.  ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold.; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=98.93  E-value=6.5e-08  Score=85.04  Aligned_cols=153  Identities=21%  Similarity=0.315  Sum_probs=94.5

Q ss_pred             CCeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEE
Q 040418           41 ANLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVI  120 (249)
Q Consensus        41 ~~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl  120 (249)
                      ..+.+.|+|.+. ++.||||++.+++.|.+|.+.|++..              .|+.+|+|+|... ....+||||||-+
T Consensus        22 ~~W~~~G~t~i~-~~~IrLTp~~~~~~G~iws~~~~~~~--------------~w~i~~~Fri~g~-~~~~~g~G~a~W~   85 (229)
T PF03388_consen   22 PNWDIGGSTVIT-DNFIRLTPDRQSQSGSIWSRKPIPFD--------------NWEIEFTFRISGQ-EKGLGGDGMAFWY   85 (229)
T ss_dssp             TTEEEEET-EEE-SSEEEEE-SSTTEEEEEEESS-BEES--------------EEEEEEEEEEESS--SSS-S-EEEEEE
T ss_pred             CCEEECCeEEec-CCEEEECCCcccCEEEEEEcCCCCcc--------------CEEEEEEEEEecc-ccCcCCCeEEEEE
Confidence            579999999987 89999999999999999999999972              7999999999876 3455899999999


Q ss_pred             ccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCC-CCCCCeeEEEcC-CCcccccccCCccCCCcc--cc
Q 040418          121 APTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFS-DINDNHVGIDIN-SLKSEISYPAGYYGDHFV--NL  196 (249)
Q Consensus       121 ~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~-Dp~~nHVgIdvn-s~~S~~t~~~~~~~~~~~--~~  196 (249)
                      .....    ..|..+|..+       .-.=++|=||||.|.+-. .-....|.+.+| +........-+. .....  ..
T Consensus        86 t~~~~----~~G~~fG~~~-------~f~Gl~i~idt~~N~~~~~~~~~p~i~~~~NDGt~~~~~~~dg~-~~~~~~C~~  153 (229)
T PF03388_consen   86 TKDPG----SDGPVFGGPD-------KFDGLGIFIDTYDNDEGGHKRGFPYISAMLNDGTKSYDHDNDGK-DQSLGSCSA  153 (229)
T ss_dssp             ESSSS----SSCSBTTB-S-------S-EEEEEEEEES-TTCTTCTSTSSEEEEEEEESSS---GGGTTT-TT-SEEEE-
T ss_pred             EcCcc----ccccccCCCc-------ccceEEEEEEcccCCCcccccccceEEEEecCCCccccccccCc-cccccccee
Confidence            97542    2555556422       124589999999986311 112355655554 222111110000 00000  11


Q ss_pred             ccc-CCcceEEEEEEeCCCcEEEEEEee
Q 040418          197 TLI-SGRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       197 ~l~-~G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                      ... .+.+.+++|.|...  .|.|.++.
T Consensus       154 ~~rn~~~p~~~ri~Y~~~--~l~v~id~  179 (229)
T PF03388_consen  154 DYRNSDVPTRIRISYSKN--TLTVSIDS  179 (229)
T ss_dssp             --BTESSEEEEEEEEETT--EEEEEEET
T ss_pred             ccCcCCCCEEEEEEEECC--eEEEEEec
Confidence            222 34567899999985  67777763


No 9  
>cd06900 lectin_VcfQ VcfQ bacterial pilus biogenesis protein, lectin domain. This family includes bacterial proteins homologous to the VcfQ (also known as MshQ) bacterial pilus biogenesis protein.  VcfQ is encoded by the vcfQ gene of the type IV pilus gene cluster of Vibrio cholerae and is essential for type IV pilus assembly.  VcfQ has a Laminin G-like domain as well as an L-type lectin domain.
Probab=98.44  E-value=1.8e-06  Score=75.99  Aligned_cols=94  Identities=18%  Similarity=0.198  Sum_probs=72.2

Q ss_pred             CCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEEccCCC-CCCCCCC
Q 040418           54 NGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVIAPTRG-LPGARPS  132 (249)
Q Consensus        54 ~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl~p~~~-~p~~s~G  132 (249)
                      +|.||||++..+|+|.+.|.++++--+.            ....+|.+.....  ...+||||||||.-..- +..+..|
T Consensus        31 ~g~LRLT~~~~nqata~~~~~~FPs~~n------------~v~veFd~yayg~--~g~GADGia~vLsDasv~p~~G~fG   96 (255)
T cd06900          31 NNRLRLTDASGNQATAVTLQRLFPSAGN------------YVEVEFDYYAYGS--GGNGADGVALVLSDASVTPQAGAFG   96 (255)
T ss_pred             cCeEEeccCccCcceeEEEeeeeccCCC------------eEEEEEEEEEecC--CCCCCceEEEEEeCCCcCCcCCCcC
Confidence            7999999999999999999999985321            3678888877763  56799999999995432 2347789


Q ss_pred             CcccCCcCC-CCCCCCCcEEEEEEecCccC
Q 040418          133 QYLGLFNES-NLGNETNHVFAVELDTIENH  161 (249)
Q Consensus       133 g~LGl~n~~-~~~~~~~~~vAVEFDT~~n~  161 (249)
                      |.|||.-.. ...+.....+.|-||-|-|.
T Consensus        97 GsLGYa~~~~~~~GfaGGwLGiGlDEyGNF  126 (255)
T cd06900          97 GSLGYAQRNDGVPGFAGGWLGIGLDEYGNF  126 (255)
T ss_pred             cccccccccCCCCccccceEEEEEeccccc
Confidence            999997654 22234456899999998774


No 10 
>KOG3838 consensus Mannose lectin ERGIC-53, involved in glycoprotein traffic [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.67  E-value=0.0027  Score=59.52  Aligned_cols=152  Identities=15%  Similarity=0.212  Sum_probs=95.2

Q ss_pred             CeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEEc
Q 040418           42 NLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVIA  121 (249)
Q Consensus        42 ~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl~  121 (249)
                      -+...|||-.+ ...|||++.-.++.|.||-+..+++-              -|..+-+|+|...  ...+|||||+--.
T Consensus        55 FW~~~GdAIas-~eqvRlaPSmrsrkGavWtka~~~fe--------------~weVev~~rVtGr--GRiGAdGlaiWYt  117 (497)
T KOG3838|consen   55 FWSHHGDAIAS-SEQVRLAPSMRSRKGAVWTKASVPFE--------------NWEVEVQFRVTGR--GRIGADGLAIWYT  117 (497)
T ss_pred             eeeecCccccc-ccceeeccccccccCceeecccCCcc--------------cceEEEEEEeccc--ccccCCceEEEEe
Confidence            37788999654 77999999999999999999877753              4788889999985  6779999999877


Q ss_pred             cCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcCC-CcccccccCCccCCCcc-ccccc
Q 040418          122 PTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDINS-LKSEISYPAGYYGDHFV-NLTLI  199 (249)
Q Consensus       122 p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvns-~~S~~t~~~~~~~~~~~-~~~l~  199 (249)
                      ...    |--|.-+|=..       .=.-+++=||.+-|+  +.-++.-|.+-.|. ..+.--..=+.-..... --+..
T Consensus       118 ~~~----G~~GpVfGg~d-------~WnGigiffDSfdnD--~qknnP~Is~~lndGt~~ydh~~DGasQ~LssCqrDFR  184 (497)
T KOG3838|consen  118 RGR----GHVGPVFGGLD-------SWNGIGIFFDSFDND--GQKNNPAISVLLNDGTIPYDHPGDGASQGLSSCQRDFR  184 (497)
T ss_pred             cCC----Ccccccccccc-------cccceEEEeeccccc--CCcCCccEEEEecCCcccccCCCccHHHHHHHhhHHhc
Confidence            532    12232233211       112468999999886  33455667776653 22110000000000000 01222


Q ss_pred             C-CcceEEEEEEeCCCcEEEEEEeeCC
Q 040418          200 S-GRPMQVWVEYDGLEKRTNVTLAPIN  225 (249)
Q Consensus       200 ~-G~~~~v~I~Yd~~~~~L~V~l~~~~  225 (249)
                      + --+..++|+|-.  ++|+|-+...=
T Consensus       185 NkPyPvRarItY~~--nvLtv~innGm  209 (497)
T KOG3838|consen  185 NKPYPVRARITYYG--NVLTVMINNGM  209 (497)
T ss_pred             cCCCCceEEEEEec--cEEEEEEcCCC
Confidence            2 236789999987  58999987643


No 11 
>KOG3839 consensus Lectin VIP36, involved in the transport of glycoproteins carrying high mannose-type glycans [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.66  E-value=0.0024  Score=58.69  Aligned_cols=94  Identities=22%  Similarity=0.360  Sum_probs=76.8

Q ss_pred             CCeEEeeceEECcCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEEEE
Q 040418           41 ANLSLDGIAQFTSNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAFVI  120 (249)
Q Consensus        41 ~~l~l~G~A~~~~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAFvl  120 (249)
                      .++.+.|...+. ..-||||.+.+++.|.+|-.+||-..              .|+..+.|++... ...--|||||+.+
T Consensus        72 ~~W~~~Gstvv~-~~~irLT~d~qsk~GAv~n~~Pv~s~--------------~wev~v~fkv~~~-s~~lfgdG~Aiw~  135 (351)
T KOG3839|consen   72 PNWNLSGSTVVT-SNYIRLTPDEQSKSGAVWNRQPVFSR--------------DWEVLVHFKVHGQ-SKNLFGDGMAIWY  135 (351)
T ss_pred             cCccccccEEEE-eeeeeccccccccccccccCCCcccc--------------ceeEEEEEEEecC-CCcccccceEEEe
Confidence            578999999886 77899999999999999999999853              6999999999987 5566899999999


Q ss_pred             ccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccC
Q 040418          121 APTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENH  161 (249)
Q Consensus       121 ~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~  161 (249)
                      .-....    .|..+|-+..       -+-+||=.|||.|.
T Consensus       136 t~Er~q----~GPvFG~~dk-------F~GL~vfidtY~n~  165 (351)
T KOG3839|consen  136 TKERAQ----PGPVFGSKDK-------FTGLAVFIDTYGNH  165 (351)
T ss_pred             eccccc----CCCCCCCccc-------ceeEEEEEeccCCc
Confidence            875432    5556665432       24589999999886


No 12 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress.
Probab=80.73  E-value=1.4  Score=33.65  Aligned_cols=19  Identities=42%  Similarity=0.464  Sum_probs=11.1

Q ss_pred             CCCcchHHHHHHHHHHHHH
Q 040418            1 MAFKLPILLVSLLMIIIII   19 (249)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~   19 (249)
                      ||+|..++|..||.++||+
T Consensus         1 MaSK~~llL~l~LA~lLli   19 (95)
T PF07172_consen    1 MASKAFLLLGLLLAALLLI   19 (95)
T ss_pred             CchhHHHHHHHHHHHHHHH
Confidence            8999855555444444443


No 13 
>KOG3514 consensus Neurexin III-alpha [Signal transduction mechanisms]
Probab=62.06  E-value=49  Score=35.70  Aligned_cols=132  Identities=20%  Similarity=0.188  Sum_probs=76.9

Q ss_pred             CCeEEeeceEEC--cCCeEEccCCCCCeEEEEEecCceeccCCCCCCCCCCCeeeeEEEEEEEEEeeccccCccccceEE
Q 040418           41 ANLSLDGIAQFT--SNGLLKLTNETKGQIGHAFYPAPIPFKNNNSNSSTANGTVFSFSTTFVFSILSEFHTTLSAHGIAF  118 (249)
Q Consensus        41 ~~l~l~G~A~~~--~~g~l~LT~~~~~~~Grv~y~~Pv~l~d~~~~~~~~t~~~asFsT~F~F~I~~~~~~~~~gdGlAF  118 (249)
                      +.|.|+|...+.  .+|-++|..-....-+|++-..|+.++.+.+--+ ...-.+.|+.+|-|+....     ..|||- 
T Consensus       804 ~~LvFNG~~Yld~~K~~~~~ls~l~a~fkl~~iv~~paTf~sk~Sy~~-la~L~ay~s~~l~Fqfkt~-----sp~gll-  876 (1591)
T KOG3514|consen  804 SGLVFNGQDYLDKCKMGDIQLSELSARFKLRAIVADPATFKSKSSYVK-LATLQAYFSMHLFFQFKTT-----SPDGLL-  876 (1591)
T ss_pred             hheEECcHHHHHHHhcCCcchhhcchhhCceEEeeccceeeechhhhh-hhhhheeeEEEEEEEEeec-----CCCeEE-
Confidence            679999988774  3577888776656678888889998875431111 1122357777777777654     455542 


Q ss_pred             EEccCCCCCCCCCCCcccCCcCCCCCCCCCcEEEEEEecCccCCCCCCCCCeeEEEcC-CCcccccccCCccCCCccccc
Q 040418          119 VIAPTRGLPGARPSQYLGLFNESNLGNETNHVFAVELDTIENHEFSDINDNHVGIDIN-SLKSEISYPAGYYGDHFVNLT  197 (249)
Q Consensus       119 vl~p~~~~p~~s~Gg~LGl~n~~~~~~~~~~~vAVEFDT~~n~~~~Dp~~nHVgIdvn-s~~S~~t~~~~~~~~~~~~~~  197 (249)
                       +-+.+                     .-|.++|||.---         +=|--.|.. +..+.+-         .....
T Consensus       877 -~fn~g---------------------d~ndfi~velvnG---------~ihYtfdlg~gp~~~k~---------~sr~h  916 (1591)
T KOG3514|consen  877 -LFNSG---------------------DGNDFIAVELVNG---------YIHYTFDLGNGPTSMKG---------PSRQH  916 (1591)
T ss_pred             -EecCC---------------------CCCceEEEEEeCc---------EEEEEEEcCCCcccccC---------cccCc
Confidence             22211                     1257889986431         122233332 2111111         23567


Q ss_pred             ccCCcceEEEEEEeCC-CcEEEE
Q 040418          198 LISGRPMQVWVEYDGL-EKRTNV  219 (249)
Q Consensus       198 l~~G~~~~v~I~Yd~~-~~~L~V  219 (249)
                      |+|.++|+|.|.=|.. ++.|.|
T Consensus       917 lnDnrWHnV~I~rd~~~~HtL~v  939 (1591)
T KOG3514|consen  917 LNDNRWHNVLIYRDKTNTHTLKV  939 (1591)
T ss_pred             CccccceeEEEEcCCCCceEEEe
Confidence            8889999999988844 234443


No 14 
>PF10731 Anophelin:  Thrombin inhibitor from mosquito;  InterPro: IPR018932  Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing. 
Probab=48.07  E-value=19  Score=25.17  Aligned_cols=37  Identities=30%  Similarity=0.459  Sum_probs=22.0

Q ss_pred             CCCcchHHHHHHHHHHHHHHhhcccc--CCCCceEEeCCCC
Q 040418            1 MAFKLPILLVSLLMIIIIIITSSAAA--KDKNPSFIYNGFR   39 (249)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~sF~f~~F~   39 (249)
                      ||.|+  ++.++||+.|+-++.+|..  .....+|+=..|+
T Consensus         1 MA~Kl--~vialLC~aLva~vQ~APQYa~GeeP~YDEdd~d   39 (65)
T PF10731_consen    1 MASKL--IVIALLCVALVAIVQSAPQYAPGEEPSYDEDDDD   39 (65)
T ss_pred             Ccchh--hHHHHHHHHHHHHHhcCcccCCCCCCCcCcccCc
Confidence            66665  6777888877777766643  2333344444444


No 15 
>smart00282 LamG Laminin G domain.
Probab=46.93  E-value=1.3e+02  Score=23.00  Aligned_cols=26  Identities=27%  Similarity=0.307  Sum_probs=21.0

Q ss_pred             ccccCCcceEEEEEEeCCCcEEEEEEee
Q 040418          196 LTLISGRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       196 ~~l~~G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                      ..++||++|++.|.++..  .+.++|+.
T Consensus        57 ~~~~dg~WH~v~i~~~~~--~~~l~VD~   82 (135)
T smart00282       57 TPLNDGQWHRVAVERNGR--RVTLSVDG   82 (135)
T ss_pred             eEeCCCCEEEEEEEEeCC--EEEEEECC
Confidence            478899999999999864  67777764


No 16 
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=44.20  E-value=1.5e+02  Score=22.83  Aligned_cols=26  Identities=23%  Similarity=0.276  Sum_probs=21.6

Q ss_pred             cccCCcceEEEEEEeCCCcEEEEEEeeC
Q 040418          197 TLISGRPMQVWVEYDGLEKRTNVTLAPI  224 (249)
Q Consensus       197 ~l~~G~~~~v~I~Yd~~~~~L~V~l~~~  224 (249)
                      .++||++|++.|.++.  +.+.++++..
T Consensus        76 ~v~dg~Wh~v~i~~~~--~~~~l~VD~~  101 (151)
T cd00110          76 PLNDGQWHSVSVERNG--RSVTLSVDGE  101 (151)
T ss_pred             ccCCCCEEEEEEEECC--CEEEEEECCc
Confidence            5889999999999997  5777777653


No 17 
>PF13619 KTSC:  KTSC domain
Probab=33.45  E-value=49  Score=22.55  Aligned_cols=20  Identities=20%  Similarity=0.107  Sum_probs=16.8

Q ss_pred             EEEEEEeCCCcEEEEEEeeC
Q 040418          205 QVWVEYDGLEKRTNVTLAPI  224 (249)
Q Consensus       205 ~v~I~Yd~~~~~L~V~l~~~  224 (249)
                      ...|.||..++.|+|.+...
T Consensus         6 I~~v~Yd~~~~~L~V~F~~G   25 (60)
T PF13619_consen    6 IRSVGYDPETRTLEVEFKSG   25 (60)
T ss_pred             ccEEeECCCCCEEEEEEcCC
Confidence            34699999999999999654


No 18 
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=32.59  E-value=64  Score=25.22  Aligned_cols=23  Identities=17%  Similarity=0.163  Sum_probs=21.0

Q ss_pred             CcceEEEEEEeCCCcEEEEEEee
Q 040418          201 GRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       201 G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                      |+.+++.+.||+...+|++|++-
T Consensus        61 ~~W~hva~v~d~~~g~~~lYvnG   83 (133)
T smart00560       61 GVWVHLAGVYDGGAGKLSLYVNG   83 (133)
T ss_pred             CCEEEEEEEEECCCCeEEEEECC
Confidence            78999999999998999999973


No 19 
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family forms a discoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=32.46  E-value=61  Score=27.62  Aligned_cols=27  Identities=4%  Similarity=0.048  Sum_probs=24.5

Q ss_pred             cccCCcceEEEEEEeCCCcEEEEEEee
Q 040418          197 TLISGRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       197 ~l~~G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                      .+.+|+.|++-+.||+.+.++.+|++-
T Consensus        86 ~~~~g~W~hvc~tw~~~~g~~~lyvnG  112 (206)
T smart00159       86 PESDGKWHHICTTWESSSGIAELWVDG  112 (206)
T ss_pred             cccCCceEEEEEEEECCCCcEEEEECC
Confidence            577899999999999999999999974


No 20 
>PF10049 DUF2283:  Protein of unknown function (DUF2283);  InterPro: IPR019270  Members of this family of hypothetical proteins have no known function. 
Probab=28.30  E-value=67  Score=21.16  Aligned_cols=16  Identities=25%  Similarity=0.345  Sum_probs=14.8

Q ss_pred             EEEEeCCCcEEEEEEe
Q 040418          207 WVEYDGLEKRTNVTLA  222 (249)
Q Consensus       207 ~I~Yd~~~~~L~V~l~  222 (249)
                      ||+||..+..|-+++.
T Consensus         2 ki~YD~~~D~lyi~l~   17 (50)
T PF10049_consen    2 KIEYDPEADALYIRLS   17 (50)
T ss_pred             EeEEcCcCCEEEEEEC
Confidence            7999999999999993


No 21 
>PRK10894 lipopolysaccharide transport periplasmic protein LptA; Provisional
Probab=28.26  E-value=1.2e+02  Score=25.40  Aligned_cols=20  Identities=20%  Similarity=0.381  Sum_probs=13.2

Q ss_pred             CCeEEeeceEECcCCeEEccC
Q 040418           41 ANLSLDGIAQFTSNGLLKLTN   61 (249)
Q Consensus        41 ~~l~l~G~A~~~~~g~l~LT~   61 (249)
                      ...++.|++.+. .|..+|+.
T Consensus        46 ~~~~~tGnV~i~-QG~~~L~A   65 (180)
T PRK10894         46 NVVTFTGNVVVT-QGTIKINA   65 (180)
T ss_pred             CEEEEEeeEEEE-ECceEEEe
Confidence            446778888776 56666653


No 22 
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=28.18  E-value=78  Score=26.77  Aligned_cols=26  Identities=8%  Similarity=0.060  Sum_probs=23.7

Q ss_pred             ccCCcceEEEEEEeCCCcEEEEEEee
Q 040418          198 LISGRPMQVWVEYDGLEKRTNVTLAP  223 (249)
Q Consensus       198 l~~G~~~~v~I~Yd~~~~~L~V~l~~  223 (249)
                      ..+|+.|++-+.||+.+.++.+|++-
T Consensus        87 ~~~g~W~hv~~t~d~~~g~~~lyvnG  112 (201)
T cd00152          87 ESDGAWHHICVTWESTSGIAELWVNG  112 (201)
T ss_pred             CCCCCEEEEEEEEECCCCcEEEEECC
Confidence            47899999999999999999999974


Done!