Query         012913
Match_columns 453
No_of_seqs    323 out of 2085
Neff          8.0 
Searched_HMMs 46136
Date          Fri Mar 29 07:36:58 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/012913.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/012913hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN03150 hypothetical protein; 100.0 9.2E-82   2E-86  677.6  45.1  424   20-453    14-447 (623)
  2 PF12819 Malectin_like:  Carboh 100.0 3.8E-67 8.1E-72  526.7  32.8  319   34-367     1-347 (347)
  3 PF11721 Malectin:  Di-glucose   99.9 6.4E-23 1.4E-27  186.6   6.6  154  201-362     2-174 (174)
  4 PF11721 Malectin:  Di-glucose   99.7 2.3E-17 5.1E-22  150.1   8.7  141   31-177     2-174 (174)
  5 PLN03150 hypothetical protein;  99.4 2.3E-12 4.9E-17  139.7  14.9  152   32-184   194-364 (623)
  6 PF12819 Malectin_like:  Carboh  99.4 4.7E-12   1E-16  127.7  12.9  154  204-370     1-164 (347)
  7 KOG3593 Predicted receptor-lik  98.0 2.8E-06   6E-11   80.6   2.7  105  261-367   107-228 (355)
  8 PF08263 LRRNT_2:  Leucine rich  97.6 6.4E-05 1.4E-09   51.8   3.2   35  376-416     2-43  (43)
  9 PLN00113 leucine-rich repeat r  97.5 0.00016 3.4E-09   82.9   6.6   70  374-453    26-98  (968)
 10 KOG3593 Predicted receptor-lik  94.2   0.033 7.3E-07   53.4   2.8  115   22-142    52-173 (355)
 11 PF00560 LRR_1:  Leucine Rich R  87.3    0.23   5E-06   28.7   0.5   21  426-447     2-22  (22)
 12 PF12799 LRR_4:  Leucine Rich r  81.3    0.63 1.4E-05   32.0   0.6   27  426-453     3-29  (44)
 13 PF03422 CBM_6:  Carbohydrate b  56.4 1.1E+02  0.0025   25.2  11.9   76  273-356    32-112 (125)
 14 PF13855 LRR_8:  Leucine rich r  47.0       9 0.00019   27.7   0.8   30  424-453    25-54  (61)
 15 PF13855 LRR_8:  Leucine rich r  38.4      14 0.00031   26.6   0.7   29  425-453     2-30  (61)
 16 PRK06764 hypothetical protein;  33.3      40 0.00087   26.5   2.5   17   90-106    74-90  (105)
 17 PF03422 CBM_6:  Carbohydrate b  32.9 2.8E+02   0.006   22.8   9.2   38   86-128    30-68  (125)
 18 KOG2230 Predicted beta-mannosi  32.4 6.9E+02   0.015   27.2  12.0  108  241-354    29-147 (867)
 19 PF07172 GRP:  Glycine rich pro  27.0      41 0.00089   27.3   1.7   14    7-20      5-18  (95)
 20 KOG0472 Leucine-rich repeat pr  21.4      37  0.0008   35.1   0.5   29  424-453   505-533 (565)

No 1  
>PLN03150 hypothetical protein; Provisional
Probab=100.00  E-value=9.2e-82  Score=677.59  Aligned_cols=424  Identities=35%  Similarity=0.642  Sum_probs=341.2

Q ss_pred             hhccccCCCCccEEEecCCCCCc-cCCCCCeEeeCCCCccCCCcccccCCCCCCCcccceeeeeCCCCCCCceEEeecCC
Q 012913           20 LSLASSYPYKASYRIDCGSATST-TDPFNTTWQADDRYYTSGATSIVSEPLHFRFPHEKTLRYFPPSSGKKNCYIIPNLP   98 (453)
Q Consensus        20 ~~~~~~~~~~~~~~IdCG~~~~~-~d~~~~~w~~Dd~~~~~g~~~~~~~~~~~~~~~y~t~R~F~~~~g~~~cY~lpv~~   98 (453)
                      +.++++.+.+++|+||||++++. +|.+||+|++|..+ ..|.....+.+ ....++|+|+|+||.++|+++||+||+.+
T Consensus        14 ~~~~~~~~~~~~~~I~CGs~~~~~~d~~~~~w~~D~~~-~~~~~~~~~~~-~~~~~~~~t~R~F~~~~g~~~cY~~~~~~   91 (623)
T PLN03150         14 VLASLASPEPFTMRISCGARVNVRTAPTNTLWYKDFAY-TGGIPANATRP-SFIAPPLKTLRYFPLSDGPENCYNINRVP   91 (623)
T ss_pred             hhcccccCCCccEEEeCCCCCCcccCCCCCEEcCCccc-ccCccccccCc-ccccchhhccccCCcccccccceEeeecC
Confidence            33444455667799999999876 67899999997443 43433322222 23457899999999877999999999888


Q ss_pred             CccEEEEEEeecCCCCCCCCCCcEEEEECCEEEEecCCCCCcccCCCceEEEEEEEecCCeeEEEEeecCCCCceeeeeE
Q 012913           99 PGRYYIRTFTVYDNYDGKSHSPSFDVSVEGTLVFSWRSPWPEGLARDGAYSDLFAFVKDGELDLCFYSFATDPPVIASLE  178 (453)
Q Consensus        99 ~g~ylvRl~F~y~nyd~~~~~~~Fdv~~~~~~~l~~~~~~~~~~~~~~~~~E~i~~~~~~~l~vcf~~~~~~~pfIs~iE  178 (453)
                      +|+|+||+||+||+||+.++.|.|||++|++.|.+....+..  ....++||+++.++++.++|||+|++.++||||+||
T Consensus        92 ~g~ylVRl~F~~~~y~~~~~~~~Fdv~~~~~~~~tv~~~~~~--~~~~v~~E~i~~~~~~~l~vcf~~~~~~~pFIs~iE  169 (623)
T PLN03150         92 KGHYSVRVFFGLVAEPNFDSEPLFDVSVEGTQISSLKSGWSS--HDEQVFAEALVFLTDGSASICFHSTGHGDPAILSIE  169 (623)
T ss_pred             CCcEEEEEEeecCCcCCCCCCCceEEEECcEEEEEEecCccc--CCCcEEEEEEEEecCCcEEEEEecCCCCCCceeEEE
Confidence            999999999999999998788999999999776664322222  235688999999999999999999988999999999


Q ss_pred             eEecCCCCccccc-ccccceeeeeeeccCCCCCCCCCCCCCCCCC--CCCCcCCCCCCCCCCCCCeeEeeccccccCCCC
Q 012913          179 VQQIDPLSYDAAT-IGNNHILVNYGRLTSGSNQWGPGFSNDADDF--GRSWQSDAASRSPNAKSSIKSVTTRERITNTNQ  255 (453)
Q Consensus       179 l~~l~~~~y~~~~-~~~~~~l~~~~Rin~G~~~~~~~~~~~~D~~--dR~W~~d~~~~~~~~~~~~~~~~t~~~i~~~~~  255 (453)
                      ||+||+++|.... .+++.+|++++|+|||+......++|++|++  ||+|.+|..|..+    .+..+++...|+++.+
T Consensus       170 v~~l~~~~y~~~~~~~~~~~L~~~~R~n~G~~~~~~~~d~~~D~~~~dR~W~~d~~~~~~----~~~~~st~~~I~~~~~  245 (623)
T PLN03150        170 ILQVDDKAYNFGPSWGQGVILRTAKRLSCGAGKSKFDEDYSGDHWGGDRFWNRMQTFGSG----SDQAISTENVIKKASN  245 (623)
T ss_pred             EEEcCcccccccccccCceEEEEEEEEEecCcccccccCCCCCcccCccccCcCcccCCC----cccccccccccccccC
Confidence            9999999996432 2356679999999999875444568899999  9999999887632    2456667777876655


Q ss_pred             CCCCChHHHHHhhhccC---CceEEEEEeCCCCcEEEEEEEEeeecccCCCceEEEEEEECCEe-ecccceeeccCC-cc
Q 012913          256 PPNYYPMKLYQTAIVSS---GAIQYNLAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVNDKN-VTRVDIFNSVGS-FA  330 (453)
Q Consensus       256 ~~~~~P~~Vy~TAr~~~---~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~ing~~-~~~~di~~~~g~-~~  330 (453)
                      .++.+|+.||||||++.   .+++|.|++++++.|+|||||||++......++|+|||+|||+. +.++|+...+|. ..
T Consensus       246 ~~~~~P~~VyqTA~~~~~~~~~lty~~~v~~~~~Y~VrLhFaEi~~~~~~~~~R~F~V~ing~~~~~~~di~~~~g~~~~  325 (623)
T PLN03150        246 APNFYPESLYQSALVSTDTQPDLSYTMDVDPNRNYSVWLHFAEIDNSITAEGKRVFDVLINGDTAFKDVDIVKMSGERYT  325 (623)
T ss_pred             CCccChHHHhhhhccccCCCCceEEEeecCCCCCEEEEEEEEeccCccCCCceEEEEEEECCEEeecccChhhhcCCccc
Confidence            56779999999999983   46999999999999999999999975445668999999999996 688999877665 67


Q ss_pred             eeEEEEEEEecCCceEEEEEeecCCC-cceeEEeeeeccCCCCCCChhHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCC
Q 012913          331 AYSWHYVAKNLSSTELTVKLVPVVGA-ALISGLENYALVPNDLSTVPEQVIAMRALKESLRVPDRMGWNGDPCAPTNWDA  409 (453)
Q Consensus       331 ~~~~~~~v~~~~~g~l~i~~~~~~~s-p~lNaiEI~~~~~~~~~t~~~d~~aL~~ik~~~~~~~~~~W~gdpC~p~~~~~  409 (453)
                      +++++|.+.. .+++|+|+|+|..++ |+|||||||++.+....+.++|+.||+.+|+.+..+...+|+|+||.|.. +.
T Consensus       326 ~~~~~~~v~~-~~g~l~isl~p~~~s~pilNaiEI~~~~~~~~~t~~~~~~aL~~~k~~~~~~~~~~W~g~~C~p~~-~~  403 (623)
T PLN03150        326 ALVLNKTVAV-SGRTLTIVLQPKKGTHAIINAIEVFEIITAESKTLLEEVSALQTLKSSLGLPLRFGWNGDPCVPQQ-HP  403 (623)
T ss_pred             ceEEEeEEee-cCCeEEEEEeeCCCCcceeeeeeeeeccccccccCchHHHHHHHHHHhcCCcccCCCCCCCCCCcc-cc
Confidence            8899888764 458899999998877 99999999999998778899999999999999976533489999998876 58


Q ss_pred             ceeEEeCCCCCCCcceEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913          410 WEGITCHPNKDETAVVISQIDLGSQGLKGYISDKISLLSNLVNL  453 (453)
Q Consensus       410 W~Gi~C~~~~~~~~~~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L  453 (453)
                      |.||.|+........+|+.|+|++++|+|.+|+.|++|++|+.|
T Consensus       404 w~Gv~C~~~~~~~~~~v~~L~L~~n~L~g~ip~~i~~L~~L~~L  447 (623)
T PLN03150        404 WSGADCQFDSTKGKWFIDGLGLDNQGLRGFIPNDISKLRHLQSI  447 (623)
T ss_pred             cccceeeccCCCCceEEEEEECCCCCccccCCHHHhCCCCCCEE
Confidence            99999964221112357888888888888888888877777654


No 2  
>PF12819 Malectin_like:  Carbohydrate-binding protein of the ER;  InterPro: IPR024788 Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan []. This entry represents a malectin-like domain found in a number of plant receptor kinases.
Probab=100.00  E-value=3.8e-67  Score=526.74  Aligned_cols=319  Identities=27%  Similarity=0.476  Sum_probs=250.5

Q ss_pred             EecCCCCCc---cC-CCCCeEeeCCCCccCCCcccccCC----CCCCCcccceeeeeCCCCCCCceEEeecC--CCccEE
Q 012913           34 IDCGSATST---TD-PFNTTWQADDRYYTSGATSIVSEP----LHFRFPHEKTLRYFPPSSGKKNCYIIPNL--PPGRYY  103 (453)
Q Consensus        34 IdCG~~~~~---~d-~~~~~w~~Dd~~~~~g~~~~~~~~----~~~~~~~y~t~R~F~~~~g~~~cY~lpv~--~~g~yl  103 (453)
                      ||||++.+.   +| .+|++|++|++|+.+|.++.++..    .....++|+|+|+||  +|.|+||+||+.  +++|||
T Consensus         1 IdCG~~~~~s~y~D~~tg~~~~~D~~~~~~g~~~~i~~~~~~~~~~~~~~y~taR~F~--~g~r~cY~l~~~~~~~~~yl   78 (347)
T PF12819_consen    1 IDCGSSSNSSSYVDDSTGRTWVSDDDFIDTGKSGNISSQPDSSSSDSSPPYQTARIFP--EGSRNCYTLPVTPPGGGKYL   78 (347)
T ss_pred             CcCCCCCCCcccccCCCCcEEeCCCCcccCCCccccccccCCcCCccccccceEEEcC--CCCccEEEeeccCCCCceEE
Confidence            799999763   34 579999999998889987777321    124578999999999  477899999997  355999


Q ss_pred             EEEEeecCCCCCCC-----CCCcEEEEECCEEEEecCCCCCcccCCCceEEEEEEEec-CCeeEEEEeecCCCC-ceeee
Q 012913          104 IRTFTVYDNYDGKS-----HSPSFDVSVEGTLVFSWRSPWPEGLARDGAYSDLFAFVK-DGELDLCFYSFATDP-PVIAS  176 (453)
Q Consensus       104 vRl~F~y~nyd~~~-----~~~~Fdv~~~~~~~l~~~~~~~~~~~~~~~~~E~i~~~~-~~~l~vcf~~~~~~~-pfIs~  176 (453)
                      |||||+|||||+++     .++.|||++|++.|.+....   .....+++||+++.+. ++.|.|||+|++.|. |||||
T Consensus        79 iRl~F~~gnyd~~~fs~~~~~~~FdL~~~~n~~~tV~~~---~~~~~~~~~E~ii~v~~~~~l~vclv~~~~g~~pFIsa  155 (347)
T PF12819_consen   79 IRLHFYYGNYDGLNFSVSSSPPTFDLLLGFNFWSTVNLS---NSPSSPVVKEFIINVTWSDTLSVCLVPTGSGTFPFISA  155 (347)
T ss_pred             EEEEeccccccccccccccCCcceEEEECCceeEEEEec---CCCcceEEEEEEEEEcCCCcEEEEEEeCCCCCCCceeE
Confidence            99999999999863     25779999999986553321   1112468999877777 799999999999887 99999


Q ss_pred             eEeEecCCCCcccccccccceeeeeeeccCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCeeEeecccccc-CCCC
Q 012913          177 LEVQQIDPLSYDAATIGNNHILVNYGRLTSGSNQWGPGFSNDADDFGRSWQSDAASRSPNAKSSIKSVTTRERIT-NTNQ  255 (453)
Q Consensus       177 iEl~~l~~~~y~~~~~~~~~~l~~~~Rin~G~~~~~~~~~~~~D~~dR~W~~d~~~~~~~~~~~~~~~~t~~~i~-~~~~  255 (453)
                      ||||+||+++|+.....++.+|++++|+|||+...  .+||++|++||+|.+...      ...|..+++..+|+ ....
T Consensus       156 iEl~~lp~~ly~~~~~~~s~~L~~~~R~n~G~~~~--~iryp~D~~dR~W~~~~~------~~~~~~ist~~~i~~~~~~  227 (347)
T PF12819_consen  156 IELRPLPDSLYPDTDANSSQALETVYRLNVGGSSS--FIRYPDDTYDRIWQPYSS------SPGWSNISTTSNININSSN  227 (347)
T ss_pred             EEEEECCccceeccccCCCceeEEEEeecCCCccc--ccCCCCCcceeecccccc------CccccccccceeeecccCC
Confidence            99999999999543334677899999999998642  278999999999996421      22567777766676 3334


Q ss_pred             CCCCChHHHHHhhhcc--C---CceEEEEEeCCCCcEEEEEEEEeeecccCCCceEEEEEEECCEeecccceeec-cCC-
Q 012913          256 PPNYYPMKLYQTAIVS--S---GAIQYNLAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVNDKNVTRVDIFNS-VGS-  328 (453)
Q Consensus       256 ~~~~~P~~Vy~TAr~~--~---~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~ing~~~~~~di~~~-~g~-  328 (453)
                      .++.+|..|||||+++  .   .+++|.| ++++..|+||||||||+......++|+|||+|||+...+ |+... .+. 
T Consensus       228 ~~~~~P~~V~~TA~~~~~~s~~~nltw~~-~~~~~~y~v~lHFaEi~~~~~~~~~R~F~IyiN~~~~~~-~~~~~~~~~~  305 (347)
T PF12819_consen  228 NPYDAPSAVYQTARTPSNSSDPLNLTWSF-VDPGFSYYVRLHFAEIQSLSPNNNQREFDIYINGQTAYS-DVSPPYLGAD  305 (347)
T ss_pred             ccCcChHHHHHhhhcccccccceEEEecc-CCCCccEEEEEEEeecccccCCCCeEEEEEEECCeEccC-ccCcccccCc
Confidence            4567999999999999  1   3577877 888899999999999997544566899999999997432 44332 222 


Q ss_pred             cceeEEEEEEEecCCceEEEEEeecCCC---cceeEEeeeec
Q 012913          329 FAAYSWHYVAKNLSSTELTVKLVPVVGA---ALISGLENYAL  367 (453)
Q Consensus       329 ~~~~~~~~~v~~~~~g~l~i~~~~~~~s---p~lNaiEI~~~  367 (453)
                      ..+++++|.+....++.++|+|+|+.+|   |||||+|||+|
T Consensus       306 ~~~~~~d~~~~~~~~~~~~isL~~t~~S~lppiLNalEIy~v  347 (347)
T PF12819_consen  306 TVPYYSDYVVNVPDSGFLNISLGPTPDSTLPPILNALEIYKV  347 (347)
T ss_pred             ceEeecceEEEecCCCEEEEEEEeCCCCCcCceeEeeeeEeC
Confidence            4567888887755667899999999886   99999999986


No 3  
>PF11721 Malectin:  Di-glucose binding within endoplasmic reticulum;  InterPro: IPR021720  Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies has shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan [.  This entry represents a malectin domain, and can also be found in probable receptor-like serine/threonine-protein kinases from plants [] and in proteins described as glycoside hydrolases. ; PDB: 2KR2_A 2JWP_A 2K46_A.
Probab=99.87  E-value=6.4e-23  Score=186.59  Aligned_cols=154  Identities=25%  Similarity=0.401  Sum_probs=95.8

Q ss_pred             eeeccCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCeeEeeccccccCCCCCCCCChHHHHHhhhccCCceEEEEE
Q 012913          201 YGRLTSGSNQWGPGFSNDADDFGRSWQSDAASRSPNAKSSIKSVTTRERITNTNQPPNYYPMKLYQTAIVSSGAIQYNLA  280 (453)
Q Consensus       201 ~~Rin~G~~~~~~~~~~~~D~~dR~W~~d~~~~~~~~~~~~~~~~t~~~i~~~~~~~~~~P~~Vy~TAr~~~~~l~~~f~  280 (453)
                      ++|||||+..      + +|..++.|.+|+.|..+.+...+..-................++.+|||+|.....++|.+|
T Consensus         2 ~~~IN~Gg~~------~-~~~~g~~w~~D~~~~~g~~~y~~~~~~~~~~~~~~~~i~~t~d~~Lyqt~R~g~~~f~Y~ip   74 (174)
T PF11721_consen    2 VLRINAGGPA------Y-TDSSGIVWEADQYYTGGSWGYYVSSDNNGSTSSTNSSIPGTTDDPLYQTERYGPSSFSYDIP   74 (174)
T ss_dssp             EEEEEETSSS------E-EETTTEEE-SSSSSTTSS-----------SSTTS--TTS-HHHHHTTT-----SSSEEEEEE
T ss_pred             EEEEECCCCc------c-cCCCCCEEcCCCCCCCCCcccccccccccccccccccccCCCchhhhHhhcCCCCceEEEEe
Confidence            5799999864      3 67789999999988654431111100000001111111222467999999997457999999


Q ss_pred             eCCCCcEEEEEEEEeeeccc----CCCceEEEEEEECCEe-ecccceeeccCCcc-eeEEEE-EEEecCCceEEEEEe--
Q 012913          281 VDAKLDYLIWFHFAEIDSSV----TKAGQRVFDILVNDKN-VTRVDIFNSVGSFA-AYSWHY-VAKNLSSTELTVKLV--  351 (453)
Q Consensus       281 v~~~~~Y~V~LhFaEi~~~~----~~~~~R~FdV~ing~~-~~~~di~~~~g~~~-~~~~~~-~v~~~~~g~l~i~~~--  351 (453)
                      +.++|.|.|+|||||++...    ..+++|+|||+|||+. +++|||+.++|+.. +.++.+ .+. +.+|.|+|+|.  
T Consensus        75 ~~~~G~Y~V~L~FaE~~~~~~~~~~~~G~RvFdV~v~g~~vl~~~Di~~~~G~~~~~~~~~~~~v~-v~dg~L~i~f~~~  153 (174)
T PF11721_consen   75 VVPNGTYTVRLHFAELYFGASGGASGPGQRVFDVYVNGETVLKNFDIYAEAGGFNKAAVRRFFNVT-VTDGTLNIQFVWA  153 (174)
T ss_dssp             --S-EEEEEEEEEE-SSS--------SSSS-EEEEETTEEEEEEE-HHHHHSSSS---EEEEEEEE-EETTEEETTEEEE
T ss_pred             cCCCcEEEEEEEeccccccccccccCCCceEEEEEecceEEEeccCHHHHcCCCceEEEEEEEEEE-EeCCcEEEEEEec
Confidence            66789999999999998742    2379999999999997 79999999998844 777777 444 46788999999  


Q ss_pred             ----------ecCCCcceeEE
Q 012913          352 ----------PVVGAALISGL  362 (453)
Q Consensus       352 ----------~~~~sp~lNai  362 (453)
                                +..+.|+||||
T Consensus       154 ~~~~~~i~~~~~~~~p~IsaI  174 (174)
T PF11721_consen  154 GKGTLCIPFIGSYGNPLISAI  174 (174)
T ss_dssp             --SEEEEEEESSSSSSSEEEE
T ss_pred             CCCcEEeeccccCCCcEEeeC
Confidence                      56667999997


No 4  
>PF11721 Malectin:  Di-glucose binding within endoplasmic reticulum;  InterPro: IPR021720  Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies has shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan [.  This entry represents a malectin domain, and can also be found in probable receptor-like serine/threonine-protein kinases from plants [] and in proteins described as glycoside hydrolases. ; PDB: 2KR2_A 2JWP_A 2K46_A.
Probab=99.71  E-value=2.3e-17  Score=150.09  Aligned_cols=141  Identities=28%  Similarity=0.383  Sum_probs=91.4

Q ss_pred             cEEEecCCCCCccCCCCCeEeeCCCCccCCCcc------------cccCC--CCCCCcccceeeeeCCCCCCCceEEeec
Q 012913           31 SYRIDCGSATSTTDPFNTTWQADDRYYTSGATS------------IVSEP--LHFRFPHEKTLRYFPPSSGKKNCYIIPN   96 (453)
Q Consensus        31 ~~~IdCG~~~~~~d~~~~~w~~Dd~~~~~g~~~------------~~~~~--~~~~~~~y~t~R~F~~~~g~~~cY~lpv   96 (453)
                      .++||||++.- +|..|..|.+|. ++.+|...            .....  ....+++|+|.|+-+.    .+.|.||+
T Consensus         2 ~~~IN~Gg~~~-~~~~g~~w~~D~-~~~~g~~~y~~~~~~~~~~~~~~~~i~~t~d~~Lyqt~R~g~~----~f~Y~ip~   75 (174)
T PF11721_consen    2 VLRINAGGPAY-TDSSGIVWEADQ-YYTGGSWGYYVSSDNNGSTSSTNSSIPGTTDDPLYQTERYGPS----SFSYDIPV   75 (174)
T ss_dssp             EEEEEETSSSE-EETTTEEE-SSS-SSTTSS-----------SSTTS--TTS-HHHHHTTT-----SS----SEEEEEE-
T ss_pred             EEEEECCCCcc-cCCCCCEEcCCC-CCCCCCcccccccccccccccccccccCCCchhhhHhhcCCCC----ceEEEEec
Confidence            37899999864 678899999954 44433330            00011  1123579999999653    38999996


Q ss_pred             CCCccEEEEEEeecCCCCC-----CCCCCcEEEEECCEEEEecCCCCCcccCCC-ceEEEE-EEEecCCeeEEEEee---
Q 012913           97 LPPGRYYIRTFTVYDNYDG-----KSHSPSFDVSVEGTLVFSWRSPWPEGLARD-GAYSDL-FAFVKDGELDLCFYS---  166 (453)
Q Consensus        97 ~~~g~ylvRl~F~y~nyd~-----~~~~~~Fdv~~~~~~~l~~~~~~~~~~~~~-~~~~E~-i~~~~~~~l~vcf~~---  166 (453)
                      .++|.|.|||||....+..     ....++|||+++|..+++++|++.+++... ++++++ -+.++++.|.|+|..   
T Consensus        76 ~~~G~Y~V~L~FaE~~~~~~~~~~~~G~RvFdV~v~g~~vl~~~Di~~~~G~~~~~~~~~~~~v~v~dg~L~i~f~~~~~  155 (174)
T PF11721_consen   76 VPNGTYTVRLHFAELYFGASGGASGPGQRVFDVYVNGETVLKNFDIYAEAGGFNKAAVRRFFNVTVTDGTLNIQFVWAGK  155 (174)
T ss_dssp             -S-EEEEEEEEEE-SSS--------SSSS-EEEEETTEEEEEEE-HHHHHSSSS---EEEEEEEEEETTEEETTEEEE--
T ss_pred             CCCcEEEEEEEeccccccccccccCCCceEEEEEecceEEEeccCHHHHcCCCceEEEEEEEEEEEeCCcEEEEEEecCC
Confidence            6999999999998654443     134799999999999999999988876554 667777 456799999999995   


Q ss_pred             --------cCCCCceeeee
Q 012913          167 --------FATDPPVIASL  177 (453)
Q Consensus       167 --------~~~~~pfIs~i  177 (453)
                              ...+.|.||||
T Consensus       156 ~~~~i~~~~~~~~p~IsaI  174 (174)
T PF11721_consen  156 GTLCIPFIGSYGNPLISAI  174 (174)
T ss_dssp             SEEEEEEESSSSSSSEEEE
T ss_pred             CcEEeeccccCCCcEEeeC
Confidence                    34567888887


No 5  
>PLN03150 hypothetical protein; Provisional
Probab=99.41  E-value=2.3e-12  Score=139.67  Aligned_cols=152  Identities=14%  Similarity=0.162  Sum_probs=107.2

Q ss_pred             EEEecCCCCCc--cC----CC--CCeEeeCCCCccCC-C----cccccC----CCCCCCcccceeeeeCCCCCCCceEEe
Q 012913           32 YRIDCGSATST--TD----PF--NTTWQADDRYYTSG-A----TSIVSE----PLHFRFPHEKTLRYFPPSSGKKNCYII   94 (453)
Q Consensus        32 ~~IdCG~~~~~--~d----~~--~~~w~~Dd~~~~~g-~----~~~~~~----~~~~~~~~y~t~R~F~~~~g~~~cY~l   94 (453)
                      ++||||+.+..  .|    .-  .|.|.+|+.|.... .    ...+..    +...++..|+|+|.+... ....+|++
T Consensus       194 ~R~n~G~~~~~~~~d~~~D~~~~dR~W~~d~~~~~~~~~~~st~~~I~~~~~~~~~~P~~VyqTA~~~~~~-~~~lty~~  272 (623)
T PLN03150        194 KRLSCGAGKSKFDEDYSGDHWGGDRFWNRMQTFGSGSDQAISTENVIKKASNAPNFYPESLYQSALVSTDT-QPDLSYTM  272 (623)
T ss_pred             EEEEecCcccccccCCCCCcccCccccCcCcccCCCcccccccccccccccCCCccChHHHhhhhccccCC-CCceEEEe
Confidence            58999986422  22    22  79999976654321 1    111111    111245689999998642 23579999


Q ss_pred             ecCCCccEEEEEEeecCCCCC-CCCCCcEEEEECCEEEEecCCCCCccc-CCCceEEEEEEEecCCeeEEEEeecCCCCc
Q 012913           95 PNLPPGRYYIRTFTVYDNYDG-KSHSPSFDVSVEGTLVFSWRSPWPEGL-ARDGAYSDLFAFVKDGELDLCFYSFATDPP  172 (453)
Q Consensus        95 pv~~~g~ylvRl~F~y~nyd~-~~~~~~Fdv~~~~~~~l~~~~~~~~~~-~~~~~~~E~i~~~~~~~l~vcf~~~~~~~p  172 (453)
                      ++.++++|+||+||..-.... ....++|||++++..+++.+++....+ ...++++++.+.+.++.+.|+|+|.....|
T Consensus       273 ~v~~~~~Y~VrLhFaEi~~~~~~~~~R~F~V~ing~~~~~~~di~~~~g~~~~~~~~~~~v~~~~g~l~isl~p~~~s~p  352 (623)
T PLN03150        273 DVDPNRNYSVWLHFAEIDNSITAEGKRVFDVLINGDTAFKDVDIVKMSGERYTALVLNKTVAVSGRTLTIVLQPKKGTHA  352 (623)
T ss_pred             ecCCCCCEEEEEEEEeccCccCCCceEEEEEEECCEEeecccChhhhcCCcccceEEEeEEeecCCeEEEEEeeCCCCcc
Confidence            987889999999998653222 234689999999999999888744322 234678898887778899999999866679


Q ss_pred             eeeeeEeEecCC
Q 012913          173 VIASLEVQQIDP  184 (453)
Q Consensus       173 fIs~iEl~~l~~  184 (453)
                      +||||||..+..
T Consensus       353 ilNaiEI~~~~~  364 (623)
T PLN03150        353 IINAIEVFEIIT  364 (623)
T ss_pred             eeeeeeeeeccc
Confidence            999999998865


No 6  
>PF12819 Malectin_like:  Carbohydrate-binding protein of the ER;  InterPro: IPR024788 Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan []. This entry represents a malectin-like domain found in a number of plant receptor kinases.
Probab=99.37  E-value=4.7e-12  Score=127.72  Aligned_cols=154  Identities=25%  Similarity=0.286  Sum_probs=104.4

Q ss_pred             ccCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCeeEeeccccccCCCCCCCCChHHHHHhhhccC--CceEEEEEe
Q 012913          204 LTSGSNQWGPGFSNDADDFGRSWQSDAASRSPNAKSSIKSVTTRERITNTNQPPNYYPMKLYQTAIVSS--GAIQYNLAV  281 (453)
Q Consensus       204 in~G~~~~~~~~~~~~D~~dR~W~~d~~~~~~~~~~~~~~~~t~~~i~~~~~~~~~~P~~Vy~TAr~~~--~~l~~~f~v  281 (453)
                      ||||+.... . .|.|+..+|.|.+|..|...+.         +..|....+.....+...|+|||...  ....|+|++
T Consensus         1 IdCG~~~~~-s-~y~D~~tg~~~~~D~~~~~~g~---------~~~i~~~~~~~~~~~~~~y~taR~F~~g~r~cY~l~~   69 (347)
T PF12819_consen    1 IDCGSSSNS-S-SYVDDSTGRTWVSDDDFIDTGK---------SGNISSQPDSSSSDSSPPYQTARIFPEGSRNCYTLPV   69 (347)
T ss_pred             CcCCCCCCC-c-ccccCCCCcEEeCCCCcccCCC---------ccccccccCCcCCccccccceEEEcCCCCccEEEeec
Confidence            789986421 1 1557788999999997763111         11221000000113456899999983  458999998


Q ss_pred             C--CCCcEEEEEEEEeeecccCC----CceEEEEEEECCEeecccceeeccCCcceeEEEEEEEecCCceEEEEEeecC-
Q 012913          282 D--AKLDYLIWFHFAEIDSSVTK----AGQRVFDILVNDKNVTRVDIFNSVGSFAAYSWHYVAKNLSSTELTVKLVPVV-  354 (453)
Q Consensus       282 ~--~~~~Y~V~LhFaEi~~~~~~----~~~R~FdV~ing~~~~~~di~~~~g~~~~~~~~~~v~~~~~g~l~i~~~~~~-  354 (453)
                      .  .+++|+|||||.-.......    ...-.||++++...+..+++..  ....+++++|.+....++.|.|.|.++. 
T Consensus        70 ~~~~~~~yliRl~F~~gnyd~~~fs~~~~~~~FdL~~~~n~~~tV~~~~--~~~~~~~~E~ii~v~~~~~l~vclv~~~~  147 (347)
T PF12819_consen   70 TPPGGGKYLIRLHFYYGNYDGLNFSVSSSPPTFDLLLGFNFWSTVNLSN--SPSSPVVKEFIINVTWSDTLSVCLVPTGS  147 (347)
T ss_pred             cCCCCceEEEEEEeccccccccccccccCCcceEEEECCceeEEEEecC--CCcceEEEEEEEEEcCCCcEEEEEEeCCC
Confidence            7  35699999999976542110    1134699999988654444322  1124688998888655689999999998 


Q ss_pred             CC-cceeEEeeeeccCC
Q 012913          355 GA-ALISGLENYALVPN  370 (453)
Q Consensus       355 ~s-p~lNaiEI~~~~~~  370 (453)
                      +. |+||||||..+++.
T Consensus       148 g~~pFIsaiEl~~lp~~  164 (347)
T PF12819_consen  148 GTFPFISAIELRPLPDS  164 (347)
T ss_pred             CCCCceeEEEEEECCcc
Confidence            66 99999999999875


No 7  
>KOG3593 consensus Predicted receptor-like serine/threonine kinase [Signal transduction mechanisms]
Probab=98.02  E-value=2.8e-06  Score=80.58  Aligned_cols=105  Identities=22%  Similarity=0.286  Sum_probs=78.3

Q ss_pred             hHHHHHhhhccCCceEEEEEeCCCCcEEEEEEEEeeecccCCCceEEEEEEEC-CEe-ecccceeeccCC-ccee--EEE
Q 012913          261 PMKLYQTAIVSSGAIQYNLAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVN-DKN-VTRVDIFNSVGS-FAAY--SWH  335 (453)
Q Consensus       261 P~~Vy~TAr~~~~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~in-g~~-~~~~di~~~~g~-~~~~--~~~  335 (453)
                      ...+|||+|.....+.|..+++..|.|-+.|-|||.+.  +..+..+|||-+| +.. ++++|++...|. .+|-  +..
T Consensus       107 d~ily~ter~neetFgyd~pik~dgdyalvlkfaevyF--~~~q~kvfdvrln~sh~vVk~ldi~~~vg~rg~AhDe~i~  184 (355)
T KOG3593|consen  107 DIILYQTERYNEETFGYDVPIKEDGDYALVLKFAEVYF--KTCQHKVFDVRLNCSHCVVKALDIFDQVGDRGKAHDEIIP  184 (355)
T ss_pred             hhhhhhhcccchhhhcccccccCCCceehhhhHHHHHH--HhhhhhheeeeeccceeEEeccchhhhcCCCcccccceEE
Confidence            35799999996445778888888899999999999986  5778999999999 665 788999888873 3331  111


Q ss_pred             EEE-----------EecCCceEEEEEeecC-CCcceeEEeeeec
Q 012913          336 YVA-----------KNLSSTELTVKLVPVV-GAALISGLENYAL  367 (453)
Q Consensus       336 ~~v-----------~~~~~g~l~i~~~~~~-~sp~lNaiEI~~~  367 (453)
                      +.+           ...+.|+++|+|.+.. .+|.+||..|++.
T Consensus       185 ~~i~~gkls~~gess~~t~gkl~le~~kg~ldnpk~~a~aIl~g  228 (355)
T KOG3593|consen  185 CLIGQGKLSVCGESSISTLGKLNLEFLKGVLDNPKDCARAILVG  228 (355)
T ss_pred             EEEcCceEEEEeeeEEeecceEEEEeecccCCChhhhhHHHhhc
Confidence            111           1124478999998876 4699999988764


No 8  
>PF08263 LRRNT_2:  Leucine rich repeat N-terminal domain;  InterPro: IPR013210 Leucine-rich repeats (LRR) consist of 2-45 motifs of 20-30 amino acids in length that generally folds into an arc or horseshoe shape []. LRRs occur in proteins ranging from viruses to eukaryotes, and appear to provide a structural framework for the formation of protein-protein interactions [, ].Proteins containing LRRs include tyrosine kinase receptors, cell-adhesion molecules, virulence factors, and extracellular matrix-binding glycoproteins, and are involved in a variety of biological processes, including signal transduction, cell adhesion, DNA repair, recombination, transcription, RNA processing, disease resistance, apoptosis, and the immune response []. Sequence analyses of LRR proteins suggested the existence of several different subfamilies of LRRs. The significance of this classification is that repeats from different subfamilies never occur simultaneously and have most probably evolved independently. It is, however, now clear that all major classes of LRR have curved horseshoe structures with a parallel beta sheet on the concave side and mostly helical elements on the convex side. At least six families of LRR proteins, characterised by different lengths and consensus sequences of the repeats, have been identified. Eleven-residue segments of the LRRs (LxxLxLxxN/CxL), corresponding to the beta-strand and adjacent loop regions, are conserved in LRR proteins, whereas the remaining parts of the repeats (herein termed variable) may be very different. Despite the differences, each of the variable parts contains two half-turns at both ends and a "linear" segment (as the chain follows a linear path overall), usually formed by a helix, in the middle. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structure of some LRR proteins-ligand complexes show that the concave surface of LRR domain is ideal for interaction with alpha-helix, thus supporting earlier conclusions that the elongated and curved LRR structure provides an outstanding framework for achieving diverse protein-protein interactions []. Molecular modeling suggests that the conserved pattern LxxLxL, which is shorter than the previously proposed LxxLxLxxN/CxL is sufficient to impart the characteristic horseshoe curvature to proteins with 20- to 30-residue repeats [].  This domain is often found at the N terminus of tandem leucine rich repeats.; PDB: 3RGZ_A 3RJ0_A 3RIZ_A 3RGX_A 1OGQ_A.
Probab=97.59  E-value=6.4e-05  Score=51.79  Aligned_cols=35  Identities=43%  Similarity=0.906  Sum_probs=25.2

Q ss_pred             hhHHHHHHHHHHHcCC-C--CCCCCCC----CCCCCCCCCCceeEEeC
Q 012913          376 PEQVIAMRALKESLRV-P--DRMGWNG----DPCAPTNWDAWEGITCH  416 (453)
Q Consensus       376 ~~d~~aL~~ik~~~~~-~--~~~~W~g----dpC~p~~~~~W~Gi~C~  416 (453)
                      ++|++||++||+.+.. +  .+.+|+.    +||      .|.||+|+
T Consensus         2 ~~d~~aLl~~k~~l~~~~~~~l~~W~~~~~~~~C------~W~GV~Cd   43 (43)
T PF08263_consen    2 NQDRQALLAFKKSLNNDPSGVLSSWNPSSDSDPC------SWSGVTCD   43 (43)
T ss_dssp             HHHHHHHHHHHHCTT-SC-CCCTT--TT--S-CC------CSTTEEE-
T ss_pred             cHHHHHHHHHHHhcccccCcccccCCCcCCCCCe------eeccEEeC
Confidence            4789999999999974 3  3779963    566      89999995


No 9  
>PLN00113 leucine-rich repeat receptor-like protein kinase; Provisional
Probab=97.48  E-value=0.00016  Score=82.94  Aligned_cols=70  Identities=27%  Similarity=0.477  Sum_probs=56.6

Q ss_pred             CChhHHHHHHHHHHHcCCCC--CCCCCC-CCCCCCCCCCceeEEeCCCCCCCcceEEEEEeCCCCceeecCccccCCCCC
Q 012913          374 TVPEQVIAMRALKESLRVPD--RMGWNG-DPCAPTNWDAWEGITCHPNKDETAVVISQIDLGSQGLKGYISDKISLLSNL  450 (453)
Q Consensus       374 t~~~d~~aL~~ik~~~~~~~--~~~W~g-dpC~p~~~~~W~Gi~C~~~~~~~~~~v~~L~Ls~~~LsG~ip~~l~~L~~L  450 (453)
                      ..++|..+|+++|+.+..+.  +.+|.. +.|     |.|.||.|+.     .++|+.|+|++++++|.+++.+..|++|
T Consensus        26 ~~~~~~~~l~~~~~~~~~~~~~~~~w~~~~~~-----c~w~gv~c~~-----~~~v~~L~L~~~~i~~~~~~~~~~l~~L   95 (968)
T PLN00113         26 LHAEELELLLSFKSSINDPLKYLSNWNSSADV-----CLWQGITCNN-----SSRVVSIDLSGKNISGKISSAIFRLPYI   95 (968)
T ss_pred             CCHHHHHHHHHHHHhCCCCcccCCCCCCCCCC-----CcCcceecCC-----CCcEEEEEecCCCccccCChHHhCCCCC
Confidence            35688999999999986542  568963 445     4899999974     2579999999999999999989888888


Q ss_pred             cCC
Q 012913          451 VNL  453 (453)
Q Consensus       451 ~~L  453 (453)
                      +.|
T Consensus        96 ~~L   98 (968)
T PLN00113         96 QTI   98 (968)
T ss_pred             CEE
Confidence            865


No 10 
>KOG3593 consensus Predicted receptor-like serine/threonine kinase [Signal transduction mechanisms]
Probab=94.24  E-value=0.033  Score=53.43  Aligned_cols=115  Identities=16%  Similarity=0.148  Sum_probs=73.1

Q ss_pred             ccccCCCCccEEEecCCCCCccCCCCCeEeeCCC-Cc----cCCCccccc-CCCCCCCcccceeeeeCCCCCCCceEEee
Q 012913           22 LASSYPYKASYRIDCGSATSTTDPFNTTWQADDR-YY----TSGATSIVS-EPLHFRFPHEKTLRYFPPSSGKKNCYIIP   95 (453)
Q Consensus        22 ~~~~~~~~~~~~IdCG~~~~~~d~~~~~w~~Dd~-~~----~~g~~~~~~-~~~~~~~~~y~t~R~F~~~~g~~~cY~lp   95 (453)
                      +.|+-+...-.-||||+.. .+|..|+.|-.|-. ..    .-|..-.+- ....-+..+|+|+|+=..    .+-|..|
T Consensus        52 ~Taa~~~svI~aVncGgda-avd~ygI~f~aD~~~~VGrasd~G~~l~i~~raeeed~ily~ter~nee----tFgyd~p  126 (355)
T KOG3593|consen   52 PTAALPSSVIPAVNCGGDA-AVDNYGIRFAADPLEGVGRASDYGMVLGIGCRAEEEDIILYQTERYNEE----TFGYDVP  126 (355)
T ss_pred             ccccCchhhhheeccCChh-hhcccceEeeccccccccccCCccceeeccccCChhhhhhhhhcccchh----hhccccc
Confidence            3444444443579999975 47888999988541 10    012110110 111123568999999754    3789999


Q ss_pred             cCCCccEEEEEEeecCCCCCCCCCCcEEEEEC-CEEEEecCCCCCccc
Q 012913           96 NLPPGRYYIRTFTVYDNYDGKSHSPSFDVSVE-GTLVFSWRSPWPEGL  142 (453)
Q Consensus        96 v~~~g~ylvRl~F~y~nyd~~~~~~~Fdv~~~-~~~~l~~~~~~~~~~  142 (453)
                      +...|.|-+=+.|..--|+. .+.-.|||.++ ...+...-+++..++
T Consensus       127 ik~dgdyalvlkfaevyF~~-~q~kvfdvrln~sh~vVk~ldi~~~vg  173 (355)
T KOG3593|consen  127 IKEDGDYALVLKFAEVYFKT-CQHKVFDVRLNCSHCVVKALDIFDQVG  173 (355)
T ss_pred             ccCCCceehhhhHHHHHHHh-hhhhheeeeeccceeEEeccchhhhcC
Confidence            98889998888885332443 23468999999 777777777766554


No 11 
>PF00560 LRR_1:  Leucine Rich Repeat;  InterPro: IPR001611 Leucine-rich repeats (LRR) consist of 2-45 motifs of 20-30 amino acids in length that generally folds into an arc or horseshoe shape []. LRRs occur in proteins ranging from viruses to eukaryotes, and appear to provide a structural framework for the formation of protein-protein interactions [, ].Proteins containing LRRs include tyrosine kinase receptors, cell-adhesion molecules, virulence factors, and extracellular matrix-binding glycoproteins, and are involved in a variety of biological processes, including signal transduction, cell adhesion, DNA repair, recombination, transcription, RNA processing, disease resistance, apoptosis, and the immune response []. Sequence analyses of LRR proteins suggested the existence of several different subfamilies of LRRs. The significance of this classification is that repeats from different subfamilies never occur simultaneously and have most probably evolved independently. It is, however, now clear that all major classes of LRR have curved horseshoe structures with a parallel beta sheet on the concave side and mostly helical elements on the convex side. At least six families of LRR proteins, characterised by different lengths and consensus sequences of the repeats, have been identified. Eleven-residue segments of the LRRs (LxxLxLxxN/CxL), corresponding to the beta-strand and adjacent loop regions, are conserved in LRR proteins, whereas the remaining parts of the repeats (herein termed variable) may be very different. Despite the differences, each of the variable parts contains two half-turns at both ends and a "linear" segment (as the chain follows a linear path overall), usually formed by a helix, in the middle. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structure of some LRR proteins-ligand complexes show that the concave surface of LRR domain is ideal for interaction with alpha-helix, thus supporting earlier conclusions that the elongated and curved LRR structure provides an outstanding framework for achieving diverse protein-protein interactions []. Molecular modeling suggests that the conserved pattern LxxLxL, which is shorter than the previously proposed LxxLxLxxN/CxL is sufficient to impart the characteristic horseshoe curvature to proteins with 20- to 30-residue repeats []. ; GO: 0005515 protein binding; PDB: 4ECO_B 2A0Z_A 3ULU_A 1ZIW_A 3ULV_A 1DCE_C 1LTX_A 3J0A_B 3A79_B 4FCG_A ....
Probab=87.28  E-value=0.23  Score=28.65  Aligned_cols=21  Identities=29%  Similarity=0.415  Sum_probs=17.4

Q ss_pred             EEEEEeCCCCceeecCccccCC
Q 012913          426 ISQIDLGSQGLKGYISDKISLL  447 (453)
Q Consensus       426 v~~L~Ls~~~LsG~ip~~l~~L  447 (453)
                      +..|+|++++|+ .||+.|++|
T Consensus         2 L~~Ldls~n~l~-~ip~~~~~l   22 (22)
T PF00560_consen    2 LEYLDLSGNNLT-SIPSSFSNL   22 (22)
T ss_dssp             ESEEEETSSEES-EEGTTTTT-
T ss_pred             ccEEECCCCcCE-eCChhhcCC
Confidence            457999999999 999988765


No 12 
>PF12799 LRR_4:  Leucine Rich repeats (2 copies); PDB: 2OMT_A 1XEU_A 2OMX_A 2OMU_A 2UZY_A 2WQU_D 1D0B_A 2WQW_A 1OTO_A 2WQV_B ....
Probab=81.31  E-value=0.63  Score=31.97  Aligned_cols=27  Identities=26%  Similarity=0.468  Sum_probs=21.5

Q ss_pred             EEEEEeCCCCceeecCccccCCCCCcCC
Q 012913          426 ISQIDLGSQGLKGYISDKISLLSNLVNL  453 (453)
Q Consensus       426 v~~L~Ls~~~LsG~ip~~l~~L~~L~~L  453 (453)
                      +..|+|++++++ .||+.|++|+.|+.|
T Consensus         3 L~~L~l~~N~i~-~l~~~l~~l~~L~~L   29 (44)
T PF12799_consen    3 LEELDLSNNQIT-DLPPELSNLPNLETL   29 (44)
T ss_dssp             -SEEEETSSS-S-SHGGHGTTCTTSSEE
T ss_pred             ceEEEccCCCCc-ccCchHhCCCCCCEE
Confidence            457899999999 688889999999865


No 13 
>PF03422 CBM_6:  Carbohydrate binding module (family 6);  InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=56.39  E-value=1.1e+02  Score=25.19  Aligned_cols=76  Identities=20%  Similarity=0.292  Sum_probs=46.5

Q ss_pred             CceEEE-EEeCCCCcEEEEEEEEeeecccCCCceEEEEEEECC---EeecccceeeccCCccee-EEEEEEEecCCceEE
Q 012913          273 GAIQYN-LAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVND---KNVTRVDIFNSVGSFAAY-SWHYVAKNLSSTELT  347 (453)
Q Consensus       273 ~~l~~~-f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~ing---~~~~~~di~~~~g~~~~~-~~~~~v~~~~~g~l~  347 (453)
                      ..+.|. .++...+.|.+.+..+--.      +.+.++|++||   ..+..+++-. .|+...+ .....+. ...|.-.
T Consensus        32 ~~~~~~~Vd~~~~g~y~~~~~~a~~~------~~~~~~l~id~~~g~~~~~~~~~~-tg~w~~~~~~~~~v~-l~~G~h~  103 (125)
T PF03422_consen   32 DWIEYNNVDVPEAGTYTLTIRYANGG------GGGTIELRIDGPDGTLIGTVSLPP-TGGWDTWQTVSVSVK-LPAGKHT  103 (125)
T ss_dssp             TEEEEEEEEESSSEEEEEEEEEEESS------SSEEEEEEETTTTSEEEEEEEEE--ESSTTEEEEEEEEEE-EESEEEE
T ss_pred             CEEEEEEEeeCCCceEEEEEEEECCC------CCcEEEEEECCCCCcEEEEEEEcC-CCCccccEEEEEEEe-eCCCeeE
Confidence            358888 8887789999998887532      22889999999   5555566533 3442222 1112232 2346677


Q ss_pred             EEEeecCCC
Q 012913          348 VKLVPVVGA  356 (453)
Q Consensus       348 i~~~~~~~s  356 (453)
                      |.|....+.
T Consensus       104 i~l~~~~~~  112 (125)
T PF03422_consen  104 IYLVFNGGD  112 (125)
T ss_dssp             EEEEESSSS
T ss_pred             EEEEEECCC
Confidence            777766543


No 14 
>PF13855 LRR_8:  Leucine rich repeat; PDB: 2O6S_A 3A79_B 3RFS_A 3G39_A 3VQ2_A 3VQ1_B 2Z64_A 2Z66_C 3FXI_A 2Z63_A ....
Probab=47.02  E-value=9  Score=27.69  Aligned_cols=30  Identities=30%  Similarity=0.355  Sum_probs=24.4

Q ss_pred             ceEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913          424 VVISQIDLGSQGLKGYISDKISLLSNLVNL  453 (453)
Q Consensus       424 ~~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L  453 (453)
                      ..+..|+|+++.++.--+..|..|++|+.|
T Consensus        25 ~~L~~L~l~~N~l~~i~~~~f~~l~~L~~L   54 (61)
T PF13855_consen   25 PNLETLDLSNNNLTSIPPDAFSNLPNLRYL   54 (61)
T ss_dssp             TTESEEEETSSSESEEETTTTTTSTTESEE
T ss_pred             CCCCEeEccCCccCccCHHHHcCCCCCCEE
Confidence            468889999999986666678999988864


No 15 
>PF13855 LRR_8:  Leucine rich repeat; PDB: 2O6S_A 3A79_B 3RFS_A 3G39_A 3VQ2_A 3VQ1_B 2Z64_A 2Z66_C 3FXI_A 2Z63_A ....
Probab=38.44  E-value=14  Score=26.60  Aligned_cols=29  Identities=31%  Similarity=0.387  Sum_probs=22.5

Q ss_pred             eEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913          425 VISQIDLGSQGLKGYISDKISLLSNLVNL  453 (453)
Q Consensus       425 ~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L  453 (453)
                      .+..|+|+++.|+.--+..|..|+.|+.|
T Consensus         2 ~L~~L~l~~n~l~~i~~~~f~~l~~L~~L   30 (61)
T PF13855_consen    2 NLESLDLSNNKLTEIPPDSFSNLPNLETL   30 (61)
T ss_dssp             TESEEEETSSTESEECTTTTTTGTTESEE
T ss_pred             cCcEEECCCCCCCccCHHHHcCCCCCCEe
Confidence            36789999999995444578889888764


No 16 
>PRK06764 hypothetical protein; Provisional
Probab=33.34  E-value=40  Score=26.53  Aligned_cols=17  Identities=47%  Similarity=0.741  Sum_probs=15.5

Q ss_pred             ceEEeecCCCccEEEEE
Q 012913           90 NCYIIPNLPPGRYYIRT  106 (453)
Q Consensus        90 ~cY~lpv~~~g~ylvRl  106 (453)
                      +.|++.-.++|+|.||.
T Consensus        74 nkyti~f~kpg~yvirv   90 (105)
T PRK06764         74 NKYTIRFSKPGKYVIRV   90 (105)
T ss_pred             eeeEEEecCCccEEEEE
Confidence            68999988999999997


No 17 
>PF03422 CBM_6:  Carbohydrate binding module (family 6);  InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=32.86  E-value=2.8e+02  Score=22.78  Aligned_cols=38  Identities=16%  Similarity=0.303  Sum_probs=28.4

Q ss_pred             CCCCceEE-eecCCCccEEEEEEeecCCCCCCCCCCcEEEEECC
Q 012913           86 SGKKNCYI-IPNLPPGRYYIRTFTVYDNYDGKSHSPSFDVSVEG  128 (453)
Q Consensus        86 ~g~~~cY~-lpv~~~g~ylvRl~F~y~nyd~~~~~~~Fdv~~~~  128 (453)
                      .|...+|+ +.+...|.|.|++.+.-+.  +   ...++|++++
T Consensus        30 ~G~~~~~~~Vd~~~~g~y~~~~~~a~~~--~---~~~~~l~id~   68 (125)
T PF03422_consen   30 NGDWIEYNNVDVPEAGTYTLTIRYANGG--G---GGTIELRIDG   68 (125)
T ss_dssp             TTTEEEEEEEEESSSEEEEEEEEEEESS--S---SEEEEEEETT
T ss_pred             CCCEEEEEEEeeCCCceEEEEEEEECCC--C---CcEEEEEECC
Confidence            46667898 8887899999986665332  1   2689999998


No 18 
>KOG2230 consensus Predicted beta-mannosidase [Carbohydrate transport and metabolism]
Probab=32.38  E-value=6.9e+02  Score=27.21  Aligned_cols=108  Identities=15%  Similarity=0.288  Sum_probs=60.1

Q ss_pred             eeEeeccccccCCCCCCCCChHHHHHhhhcc-----CCceEEEEEeCCCCcEEEEEEEEeeecccC-----CCceEEEEE
Q 012913          241 IKSVTTRERITNTNQPPNYYPMKLYQTAIVS-----SGAIQYNLAVDAKLDYLIWFHFAEIDSSVT-----KAGQRVFDI  310 (453)
Q Consensus       241 ~~~~~t~~~i~~~~~~~~~~P~~Vy~TAr~~-----~~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~-----~~~~R~FdV  310 (453)
                      |.--+.+.+++.+..+|-..=+++|+.....     ...+.|.|-...+.+|.=.+..-|+..-..     ..-.-+=.|
T Consensus        29 w~~~~~n~t~~~~g~vpg~i~s~l~~~gii~~~~~~~n~ln~kwia~d~wtysr~frl~dl~~~~~~~l~ie~vdtia~v  108 (867)
T KOG2230|consen   29 WEFSSSNKTVNGTGTVPGDIYSDLYASGIIDNPLFGENHLNLKWIAEDDWTYSRKFRLIDLDDTVGAFLEIESVDTIATV  108 (867)
T ss_pred             eEEecCCCceecCCCCCchHhHHHHhcccccCccccccccceeEEeccCccceeeeEEEEccccccceEEEeecceeEEE
Confidence            4333445566665554444456778887665     234778876666777765555555543111     111345678


Q ss_pred             EECCEe-ecccceeeccCCcceeEEEEEEEecCCceEEEEEeecC
Q 012913          311 LVNDKN-VTRVDIFNSVGSFAAYSWHYVAKNLSSTELTVKLVPVV  354 (453)
Q Consensus       311 ~ing~~-~~~~di~~~~g~~~~~~~~~~v~~~~~g~l~i~~~~~~  354 (453)
                      |+||+. +..-+      .+.++..+++....+...+.+.|..+.
T Consensus       109 ~~n~~~v~~s~n------~f~~y~~~vt~ii~~~n~i~~~f~ssv  147 (867)
T KOG2230|consen  109 YVNGQKVLHSRN------QFLPYHVNVTDIIAGENDITIKFKSSV  147 (867)
T ss_pred             EEccEEEeeccc------cceeEEEeEEEEecCCcceEEEeehhH
Confidence            999986 33211      244555555554444456777777654


No 19 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=27.00  E-value=41  Score=27.26  Aligned_cols=14  Identities=29%  Similarity=0.249  Sum_probs=5.2

Q ss_pred             hhHHHHHHHHHhhh
Q 012913            7 SSFFFLSLLLVLPL   20 (453)
Q Consensus         7 ~~~~~~~~~~~~~~   20 (453)
                      +.+|+.++|+++++
T Consensus         5 ~~llL~l~LA~lLl   18 (95)
T PF07172_consen    5 AFLLLGLLLAALLL   18 (95)
T ss_pred             HHHHHHHHHHHHHH
Confidence            33333333333333


No 20 
>KOG0472 consensus Leucine-rich repeat protein [Function unknown]
Probab=21.44  E-value=37  Score=35.08  Aligned_cols=29  Identities=24%  Similarity=0.383  Sum_probs=24.4

Q ss_pred             ceEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913          424 VVISQIDLGSQGLKGYISDKISLLSNLVNL  453 (453)
Q Consensus       424 ~~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L  453 (453)
                      ++++.|||.+|.|. .||+.+++|++|++|
T Consensus       505 ~nL~tLDL~nNdlq-~IPp~LgnmtnL~hL  533 (565)
T KOG0472|consen  505 RNLTTLDLQNNDLQ-QIPPILGNMTNLRHL  533 (565)
T ss_pred             hhcceeccCCCchh-hCChhhccccceeEE
Confidence            35778999998886 789999999999875


Done!