Query 012913
Match_columns 453
No_of_seqs 323 out of 2085
Neff 8.0
Searched_HMMs 46136
Date Fri Mar 29 07:36:58 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/012913.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/012913hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03150 hypothetical protein; 100.0 9.2E-82 2E-86 677.6 45.1 424 20-453 14-447 (623)
2 PF12819 Malectin_like: Carboh 100.0 3.8E-67 8.1E-72 526.7 32.8 319 34-367 1-347 (347)
3 PF11721 Malectin: Di-glucose 99.9 6.4E-23 1.4E-27 186.6 6.6 154 201-362 2-174 (174)
4 PF11721 Malectin: Di-glucose 99.7 2.3E-17 5.1E-22 150.1 8.7 141 31-177 2-174 (174)
5 PLN03150 hypothetical protein; 99.4 2.3E-12 4.9E-17 139.7 14.9 152 32-184 194-364 (623)
6 PF12819 Malectin_like: Carboh 99.4 4.7E-12 1E-16 127.7 12.9 154 204-370 1-164 (347)
7 KOG3593 Predicted receptor-lik 98.0 2.8E-06 6E-11 80.6 2.7 105 261-367 107-228 (355)
8 PF08263 LRRNT_2: Leucine rich 97.6 6.4E-05 1.4E-09 51.8 3.2 35 376-416 2-43 (43)
9 PLN00113 leucine-rich repeat r 97.5 0.00016 3.4E-09 82.9 6.6 70 374-453 26-98 (968)
10 KOG3593 Predicted receptor-lik 94.2 0.033 7.3E-07 53.4 2.8 115 22-142 52-173 (355)
11 PF00560 LRR_1: Leucine Rich R 87.3 0.23 5E-06 28.7 0.5 21 426-447 2-22 (22)
12 PF12799 LRR_4: Leucine Rich r 81.3 0.63 1.4E-05 32.0 0.6 27 426-453 3-29 (44)
13 PF03422 CBM_6: Carbohydrate b 56.4 1.1E+02 0.0025 25.2 11.9 76 273-356 32-112 (125)
14 PF13855 LRR_8: Leucine rich r 47.0 9 0.00019 27.7 0.8 30 424-453 25-54 (61)
15 PF13855 LRR_8: Leucine rich r 38.4 14 0.00031 26.6 0.7 29 425-453 2-30 (61)
16 PRK06764 hypothetical protein; 33.3 40 0.00087 26.5 2.5 17 90-106 74-90 (105)
17 PF03422 CBM_6: Carbohydrate b 32.9 2.8E+02 0.006 22.8 9.2 38 86-128 30-68 (125)
18 KOG2230 Predicted beta-mannosi 32.4 6.9E+02 0.015 27.2 12.0 108 241-354 29-147 (867)
19 PF07172 GRP: Glycine rich pro 27.0 41 0.00089 27.3 1.7 14 7-20 5-18 (95)
20 KOG0472 Leucine-rich repeat pr 21.4 37 0.0008 35.1 0.5 29 424-453 505-533 (565)
No 1
>PLN03150 hypothetical protein; Provisional
Probab=100.00 E-value=9.2e-82 Score=677.59 Aligned_cols=424 Identities=35% Similarity=0.642 Sum_probs=341.2
Q ss_pred hhccccCCCCccEEEecCCCCCc-cCCCCCeEeeCCCCccCCCcccccCCCCCCCcccceeeeeCCCCCCCceEEeecCC
Q 012913 20 LSLASSYPYKASYRIDCGSATST-TDPFNTTWQADDRYYTSGATSIVSEPLHFRFPHEKTLRYFPPSSGKKNCYIIPNLP 98 (453)
Q Consensus 20 ~~~~~~~~~~~~~~IdCG~~~~~-~d~~~~~w~~Dd~~~~~g~~~~~~~~~~~~~~~y~t~R~F~~~~g~~~cY~lpv~~ 98 (453)
+.++++.+.+++|+||||++++. +|.+||+|++|..+ ..|.....+.+ ....++|+|+|+||.++|+++||+||+.+
T Consensus 14 ~~~~~~~~~~~~~~I~CGs~~~~~~d~~~~~w~~D~~~-~~~~~~~~~~~-~~~~~~~~t~R~F~~~~g~~~cY~~~~~~ 91 (623)
T PLN03150 14 VLASLASPEPFTMRISCGARVNVRTAPTNTLWYKDFAY-TGGIPANATRP-SFIAPPLKTLRYFPLSDGPENCYNINRVP 91 (623)
T ss_pred hhcccccCCCccEEEeCCCCCCcccCCCCCEEcCCccc-ccCccccccCc-ccccchhhccccCCcccccccceEeeecC
Confidence 33444455667799999999876 67899999997443 43433322222 23457899999999877999999999888
Q ss_pred CccEEEEEEeecCCCCCCCCCCcEEEEECCEEEEecCCCCCcccCCCceEEEEEEEecCCeeEEEEeecCCCCceeeeeE
Q 012913 99 PGRYYIRTFTVYDNYDGKSHSPSFDVSVEGTLVFSWRSPWPEGLARDGAYSDLFAFVKDGELDLCFYSFATDPPVIASLE 178 (453)
Q Consensus 99 ~g~ylvRl~F~y~nyd~~~~~~~Fdv~~~~~~~l~~~~~~~~~~~~~~~~~E~i~~~~~~~l~vcf~~~~~~~pfIs~iE 178 (453)
+|+|+||+||+||+||+.++.|.|||++|++.|.+....+.. ....++||+++.++++.++|||+|++.++||||+||
T Consensus 92 ~g~ylVRl~F~~~~y~~~~~~~~Fdv~~~~~~~~tv~~~~~~--~~~~v~~E~i~~~~~~~l~vcf~~~~~~~pFIs~iE 169 (623)
T PLN03150 92 KGHYSVRVFFGLVAEPNFDSEPLFDVSVEGTQISSLKSGWSS--HDEQVFAEALVFLTDGSASICFHSTGHGDPAILSIE 169 (623)
T ss_pred CCcEEEEEEeecCCcCCCCCCCceEEEECcEEEEEEecCccc--CCCcEEEEEEEEecCCcEEEEEecCCCCCCceeEEE
Confidence 999999999999999998788999999999776664322222 235688999999999999999999988999999999
Q ss_pred eEecCCCCccccc-ccccceeeeeeeccCCCCCCCCCCCCCCCCC--CCCCcCCCCCCCCCCCCCeeEeeccccccCCCC
Q 012913 179 VQQIDPLSYDAAT-IGNNHILVNYGRLTSGSNQWGPGFSNDADDF--GRSWQSDAASRSPNAKSSIKSVTTRERITNTNQ 255 (453)
Q Consensus 179 l~~l~~~~y~~~~-~~~~~~l~~~~Rin~G~~~~~~~~~~~~D~~--dR~W~~d~~~~~~~~~~~~~~~~t~~~i~~~~~ 255 (453)
||+||+++|.... .+++.+|++++|+|||+......++|++|++ ||+|.+|..|..+ .+..+++...|+++.+
T Consensus 170 v~~l~~~~y~~~~~~~~~~~L~~~~R~n~G~~~~~~~~d~~~D~~~~dR~W~~d~~~~~~----~~~~~st~~~I~~~~~ 245 (623)
T PLN03150 170 ILQVDDKAYNFGPSWGQGVILRTAKRLSCGAGKSKFDEDYSGDHWGGDRFWNRMQTFGSG----SDQAISTENVIKKASN 245 (623)
T ss_pred EEEcCcccccccccccCceEEEEEEEEEecCcccccccCCCCCcccCccccCcCcccCCC----cccccccccccccccC
Confidence 9999999996432 2356679999999999875444568899999 9999999887632 2456667777876655
Q ss_pred CCCCChHHHHHhhhccC---CceEEEEEeCCCCcEEEEEEEEeeecccCCCceEEEEEEECCEe-ecccceeeccCC-cc
Q 012913 256 PPNYYPMKLYQTAIVSS---GAIQYNLAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVNDKN-VTRVDIFNSVGS-FA 330 (453)
Q Consensus 256 ~~~~~P~~Vy~TAr~~~---~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~ing~~-~~~~di~~~~g~-~~ 330 (453)
.++.+|+.||||||++. .+++|.|++++++.|+|||||||++......++|+|||+|||+. +.++|+...+|. ..
T Consensus 246 ~~~~~P~~VyqTA~~~~~~~~~lty~~~v~~~~~Y~VrLhFaEi~~~~~~~~~R~F~V~ing~~~~~~~di~~~~g~~~~ 325 (623)
T PLN03150 246 APNFYPESLYQSALVSTDTQPDLSYTMDVDPNRNYSVWLHFAEIDNSITAEGKRVFDVLINGDTAFKDVDIVKMSGERYT 325 (623)
T ss_pred CCccChHHHhhhhccccCCCCceEEEeecCCCCCEEEEEEEEeccCccCCCceEEEEEEECCEEeecccChhhhcCCccc
Confidence 56779999999999983 46999999999999999999999975445668999999999996 688999877665 67
Q ss_pred eeEEEEEEEecCCceEEEEEeecCCC-cceeEEeeeeccCCCCCCChhHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCC
Q 012913 331 AYSWHYVAKNLSSTELTVKLVPVVGA-ALISGLENYALVPNDLSTVPEQVIAMRALKESLRVPDRMGWNGDPCAPTNWDA 409 (453)
Q Consensus 331 ~~~~~~~v~~~~~g~l~i~~~~~~~s-p~lNaiEI~~~~~~~~~t~~~d~~aL~~ik~~~~~~~~~~W~gdpC~p~~~~~ 409 (453)
+++++|.+.. .+++|+|+|+|..++ |+|||||||++.+....+.++|+.||+.+|+.+..+...+|+|+||.|.. +.
T Consensus 326 ~~~~~~~v~~-~~g~l~isl~p~~~s~pilNaiEI~~~~~~~~~t~~~~~~aL~~~k~~~~~~~~~~W~g~~C~p~~-~~ 403 (623)
T PLN03150 326 ALVLNKTVAV-SGRTLTIVLQPKKGTHAIINAIEVFEIITAESKTLLEEVSALQTLKSSLGLPLRFGWNGDPCVPQQ-HP 403 (623)
T ss_pred ceEEEeEEee-cCCeEEEEEeeCCCCcceeeeeeeeeccccccccCchHHHHHHHHHHhcCCcccCCCCCCCCCCcc-cc
Confidence 8899888764 458899999998877 99999999999998778899999999999999976533489999998876 58
Q ss_pred ceeEEeCCCCCCCcceEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913 410 WEGITCHPNKDETAVVISQIDLGSQGLKGYISDKISLLSNLVNL 453 (453)
Q Consensus 410 W~Gi~C~~~~~~~~~~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L 453 (453)
|.||.|+........+|+.|+|++++|+|.+|+.|++|++|+.|
T Consensus 404 w~Gv~C~~~~~~~~~~v~~L~L~~n~L~g~ip~~i~~L~~L~~L 447 (623)
T PLN03150 404 WSGADCQFDSTKGKWFIDGLGLDNQGLRGFIPNDISKLRHLQSI 447 (623)
T ss_pred cccceeeccCCCCceEEEEEECCCCCccccCCHHHhCCCCCCEE
Confidence 99999964221112357888888888888888888877777654
No 2
>PF12819 Malectin_like: Carbohydrate-binding protein of the ER; InterPro: IPR024788 Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan []. This entry represents a malectin-like domain found in a number of plant receptor kinases.
Probab=100.00 E-value=3.8e-67 Score=526.74 Aligned_cols=319 Identities=27% Similarity=0.476 Sum_probs=250.5
Q ss_pred EecCCCCCc---cC-CCCCeEeeCCCCccCCCcccccCC----CCCCCcccceeeeeCCCCCCCceEEeecC--CCccEE
Q 012913 34 IDCGSATST---TD-PFNTTWQADDRYYTSGATSIVSEP----LHFRFPHEKTLRYFPPSSGKKNCYIIPNL--PPGRYY 103 (453)
Q Consensus 34 IdCG~~~~~---~d-~~~~~w~~Dd~~~~~g~~~~~~~~----~~~~~~~y~t~R~F~~~~g~~~cY~lpv~--~~g~yl 103 (453)
||||++.+. +| .+|++|++|++|+.+|.++.++.. .....++|+|+|+|| +|.|+||+||+. +++|||
T Consensus 1 IdCG~~~~~s~y~D~~tg~~~~~D~~~~~~g~~~~i~~~~~~~~~~~~~~y~taR~F~--~g~r~cY~l~~~~~~~~~yl 78 (347)
T PF12819_consen 1 IDCGSSSNSSSYVDDSTGRTWVSDDDFIDTGKSGNISSQPDSSSSDSSPPYQTARIFP--EGSRNCYTLPVTPPGGGKYL 78 (347)
T ss_pred CcCCCCCCCcccccCCCCcEEeCCCCcccCCCccccccccCCcCCccccccceEEEcC--CCCccEEEeeccCCCCceEE
Confidence 799999763 34 579999999998889987777321 124578999999999 477899999997 355999
Q ss_pred EEEEeecCCCCCCC-----CCCcEEEEECCEEEEecCCCCCcccCCCceEEEEEEEec-CCeeEEEEeecCCCC-ceeee
Q 012913 104 IRTFTVYDNYDGKS-----HSPSFDVSVEGTLVFSWRSPWPEGLARDGAYSDLFAFVK-DGELDLCFYSFATDP-PVIAS 176 (453)
Q Consensus 104 vRl~F~y~nyd~~~-----~~~~Fdv~~~~~~~l~~~~~~~~~~~~~~~~~E~i~~~~-~~~l~vcf~~~~~~~-pfIs~ 176 (453)
|||||+|||||+++ .++.|||++|++.|.+.... .....+++||+++.+. ++.|.|||+|++.|. |||||
T Consensus 79 iRl~F~~gnyd~~~fs~~~~~~~FdL~~~~n~~~tV~~~---~~~~~~~~~E~ii~v~~~~~l~vclv~~~~g~~pFIsa 155 (347)
T PF12819_consen 79 IRLHFYYGNYDGLNFSVSSSPPTFDLLLGFNFWSTVNLS---NSPSSPVVKEFIINVTWSDTLSVCLVPTGSGTFPFISA 155 (347)
T ss_pred EEEEeccccccccccccccCCcceEEEECCceeEEEEec---CCCcceEEEEEEEEEcCCCcEEEEEEeCCCCCCCceeE
Confidence 99999999999863 25779999999986553321 1112468999877777 799999999999887 99999
Q ss_pred eEeEecCCCCcccccccccceeeeeeeccCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCeeEeecccccc-CCCC
Q 012913 177 LEVQQIDPLSYDAATIGNNHILVNYGRLTSGSNQWGPGFSNDADDFGRSWQSDAASRSPNAKSSIKSVTTRERIT-NTNQ 255 (453)
Q Consensus 177 iEl~~l~~~~y~~~~~~~~~~l~~~~Rin~G~~~~~~~~~~~~D~~dR~W~~d~~~~~~~~~~~~~~~~t~~~i~-~~~~ 255 (453)
||||+||+++|+.....++.+|++++|+|||+... .+||++|++||+|.+... ...|..+++..+|+ ....
T Consensus 156 iEl~~lp~~ly~~~~~~~s~~L~~~~R~n~G~~~~--~iryp~D~~dR~W~~~~~------~~~~~~ist~~~i~~~~~~ 227 (347)
T PF12819_consen 156 IELRPLPDSLYPDTDANSSQALETVYRLNVGGSSS--FIRYPDDTYDRIWQPYSS------SPGWSNISTTSNININSSN 227 (347)
T ss_pred EEEEECCccceeccccCCCceeEEEEeecCCCccc--ccCCCCCcceeecccccc------CccccccccceeeecccCC
Confidence 99999999999543334677899999999998642 278999999999996421 22567777766676 3334
Q ss_pred CCCCChHHHHHhhhcc--C---CceEEEEEeCCCCcEEEEEEEEeeecccCCCceEEEEEEECCEeecccceeec-cCC-
Q 012913 256 PPNYYPMKLYQTAIVS--S---GAIQYNLAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVNDKNVTRVDIFNS-VGS- 328 (453)
Q Consensus 256 ~~~~~P~~Vy~TAr~~--~---~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~ing~~~~~~di~~~-~g~- 328 (453)
.++.+|..|||||+++ . .+++|.| ++++..|+||||||||+......++|+|||+|||+...+ |+... .+.
T Consensus 228 ~~~~~P~~V~~TA~~~~~~s~~~nltw~~-~~~~~~y~v~lHFaEi~~~~~~~~~R~F~IyiN~~~~~~-~~~~~~~~~~ 305 (347)
T PF12819_consen 228 NPYDAPSAVYQTARTPSNSSDPLNLTWSF-VDPGFSYYVRLHFAEIQSLSPNNNQREFDIYINGQTAYS-DVSPPYLGAD 305 (347)
T ss_pred ccCcChHHHHHhhhcccccccceEEEecc-CCCCccEEEEEEEeecccccCCCCeEEEEEEECCeEccC-ccCcccccCc
Confidence 4567999999999999 1 3577877 888899999999999997544566899999999997432 44332 222
Q ss_pred cceeEEEEEEEecCCceEEEEEeecCCC---cceeEEeeeec
Q 012913 329 FAAYSWHYVAKNLSSTELTVKLVPVVGA---ALISGLENYAL 367 (453)
Q Consensus 329 ~~~~~~~~~v~~~~~g~l~i~~~~~~~s---p~lNaiEI~~~ 367 (453)
..+++++|.+....++.++|+|+|+.+| |||||+|||+|
T Consensus 306 ~~~~~~d~~~~~~~~~~~~isL~~t~~S~lppiLNalEIy~v 347 (347)
T PF12819_consen 306 TVPYYSDYVVNVPDSGFLNISLGPTPDSTLPPILNALEIYKV 347 (347)
T ss_pred ceEeecceEEEecCCCEEEEEEEeCCCCCcCceeEeeeeEeC
Confidence 4567888887755667899999999886 99999999986
No 3
>PF11721 Malectin: Di-glucose binding within endoplasmic reticulum; InterPro: IPR021720 Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies has shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan [. This entry represents a malectin domain, and can also be found in probable receptor-like serine/threonine-protein kinases from plants [] and in proteins described as glycoside hydrolases. ; PDB: 2KR2_A 2JWP_A 2K46_A.
Probab=99.87 E-value=6.4e-23 Score=186.59 Aligned_cols=154 Identities=25% Similarity=0.401 Sum_probs=95.8
Q ss_pred eeeccCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCeeEeeccccccCCCCCCCCChHHHHHhhhccCCceEEEEE
Q 012913 201 YGRLTSGSNQWGPGFSNDADDFGRSWQSDAASRSPNAKSSIKSVTTRERITNTNQPPNYYPMKLYQTAIVSSGAIQYNLA 280 (453)
Q Consensus 201 ~~Rin~G~~~~~~~~~~~~D~~dR~W~~d~~~~~~~~~~~~~~~~t~~~i~~~~~~~~~~P~~Vy~TAr~~~~~l~~~f~ 280 (453)
++|||||+.. + +|..++.|.+|+.|..+.+...+..-................++.+|||+|.....++|.+|
T Consensus 2 ~~~IN~Gg~~------~-~~~~g~~w~~D~~~~~g~~~y~~~~~~~~~~~~~~~~i~~t~d~~Lyqt~R~g~~~f~Y~ip 74 (174)
T PF11721_consen 2 VLRINAGGPA------Y-TDSSGIVWEADQYYTGGSWGYYVSSDNNGSTSSTNSSIPGTTDDPLYQTERYGPSSFSYDIP 74 (174)
T ss_dssp EEEEEETSSS------E-EETTTEEE-SSSSSTTSS-----------SSTTS--TTS-HHHHHTTT-----SSSEEEEEE
T ss_pred EEEEECCCCc------c-cCCCCCEEcCCCCCCCCCcccccccccccccccccccccCCCchhhhHhhcCCCCceEEEEe
Confidence 5799999864 3 67789999999988654431111100000001111111222467999999997457999999
Q ss_pred eCCCCcEEEEEEEEeeeccc----CCCceEEEEEEECCEe-ecccceeeccCCcc-eeEEEE-EEEecCCceEEEEEe--
Q 012913 281 VDAKLDYLIWFHFAEIDSSV----TKAGQRVFDILVNDKN-VTRVDIFNSVGSFA-AYSWHY-VAKNLSSTELTVKLV-- 351 (453)
Q Consensus 281 v~~~~~Y~V~LhFaEi~~~~----~~~~~R~FdV~ing~~-~~~~di~~~~g~~~-~~~~~~-~v~~~~~g~l~i~~~-- 351 (453)
+.++|.|.|+|||||++... ..+++|+|||+|||+. +++|||+.++|+.. +.++.+ .+. +.+|.|+|+|.
T Consensus 75 ~~~~G~Y~V~L~FaE~~~~~~~~~~~~G~RvFdV~v~g~~vl~~~Di~~~~G~~~~~~~~~~~~v~-v~dg~L~i~f~~~ 153 (174)
T PF11721_consen 75 VVPNGTYTVRLHFAELYFGASGGASGPGQRVFDVYVNGETVLKNFDIYAEAGGFNKAAVRRFFNVT-VTDGTLNIQFVWA 153 (174)
T ss_dssp --S-EEEEEEEEEE-SSS--------SSSS-EEEEETTEEEEEEE-HHHHHSSSS---EEEEEEEE-EETTEEETTEEEE
T ss_pred cCCCcEEEEEEEeccccccccccccCCCceEEEEEecceEEEeccCHHHHcCCCceEEEEEEEEEE-EeCCcEEEEEEec
Confidence 66789999999999998742 2379999999999997 79999999998844 777777 444 46788999999
Q ss_pred ----------ecCCCcceeEE
Q 012913 352 ----------PVVGAALISGL 362 (453)
Q Consensus 352 ----------~~~~sp~lNai 362 (453)
+..+.|+||||
T Consensus 154 ~~~~~~i~~~~~~~~p~IsaI 174 (174)
T PF11721_consen 154 GKGTLCIPFIGSYGNPLISAI 174 (174)
T ss_dssp --SEEEEEEESSSSSSSEEEE
T ss_pred CCCcEEeeccccCCCcEEeeC
Confidence 56667999997
No 4
>PF11721 Malectin: Di-glucose binding within endoplasmic reticulum; InterPro: IPR021720 Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies has shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan [. This entry represents a malectin domain, and can also be found in probable receptor-like serine/threonine-protein kinases from plants [] and in proteins described as glycoside hydrolases. ; PDB: 2KR2_A 2JWP_A 2K46_A.
Probab=99.71 E-value=2.3e-17 Score=150.09 Aligned_cols=141 Identities=28% Similarity=0.383 Sum_probs=91.4
Q ss_pred cEEEecCCCCCccCCCCCeEeeCCCCccCCCcc------------cccCC--CCCCCcccceeeeeCCCCCCCceEEeec
Q 012913 31 SYRIDCGSATSTTDPFNTTWQADDRYYTSGATS------------IVSEP--LHFRFPHEKTLRYFPPSSGKKNCYIIPN 96 (453)
Q Consensus 31 ~~~IdCG~~~~~~d~~~~~w~~Dd~~~~~g~~~------------~~~~~--~~~~~~~y~t~R~F~~~~g~~~cY~lpv 96 (453)
.++||||++.- +|..|..|.+|. ++.+|... ..... ....+++|+|.|+-+. .+.|.||+
T Consensus 2 ~~~IN~Gg~~~-~~~~g~~w~~D~-~~~~g~~~y~~~~~~~~~~~~~~~~i~~t~d~~Lyqt~R~g~~----~f~Y~ip~ 75 (174)
T PF11721_consen 2 VLRINAGGPAY-TDSSGIVWEADQ-YYTGGSWGYYVSSDNNGSTSSTNSSIPGTTDDPLYQTERYGPS----SFSYDIPV 75 (174)
T ss_dssp EEEEEETSSSE-EETTTEEE-SSS-SSTTSS-----------SSTTS--TTS-HHHHHTTT-----SS----SEEEEEE-
T ss_pred EEEEECCCCcc-cCCCCCEEcCCC-CCCCCCcccccccccccccccccccccCCCchhhhHhhcCCCC----ceEEEEec
Confidence 37899999864 678899999954 44433330 00011 1123579999999653 38999996
Q ss_pred CCCccEEEEEEeecCCCCC-----CCCCCcEEEEECCEEEEecCCCCCcccCCC-ceEEEE-EEEecCCeeEEEEee---
Q 012913 97 LPPGRYYIRTFTVYDNYDG-----KSHSPSFDVSVEGTLVFSWRSPWPEGLARD-GAYSDL-FAFVKDGELDLCFYS--- 166 (453)
Q Consensus 97 ~~~g~ylvRl~F~y~nyd~-----~~~~~~Fdv~~~~~~~l~~~~~~~~~~~~~-~~~~E~-i~~~~~~~l~vcf~~--- 166 (453)
.++|.|.|||||....+.. ....++|||+++|..+++++|++.+++... ++++++ -+.++++.|.|+|..
T Consensus 76 ~~~G~Y~V~L~FaE~~~~~~~~~~~~G~RvFdV~v~g~~vl~~~Di~~~~G~~~~~~~~~~~~v~v~dg~L~i~f~~~~~ 155 (174)
T PF11721_consen 76 VPNGTYTVRLHFAELYFGASGGASGPGQRVFDVYVNGETVLKNFDIYAEAGGFNKAAVRRFFNVTVTDGTLNIQFVWAGK 155 (174)
T ss_dssp -S-EEEEEEEEEE-SSS--------SSSS-EEEEETTEEEEEEE-HHHHHSSSS---EEEEEEEEEETTEEETTEEEE--
T ss_pred CCCcEEEEEEEeccccccccccccCCCceEEEEEecceEEEeccCHHHHcCCCceEEEEEEEEEEEeCCcEEEEEEecCC
Confidence 6999999999998654443 134799999999999999999988876554 667777 456799999999995
Q ss_pred --------cCCCCceeeee
Q 012913 167 --------FATDPPVIASL 177 (453)
Q Consensus 167 --------~~~~~pfIs~i 177 (453)
...+.|.||||
T Consensus 156 ~~~~i~~~~~~~~p~IsaI 174 (174)
T PF11721_consen 156 GTLCIPFIGSYGNPLISAI 174 (174)
T ss_dssp SEEEEEEESSSSSSSEEEE
T ss_pred CcEEeeccccCCCcEEeeC
Confidence 34567888887
No 5
>PLN03150 hypothetical protein; Provisional
Probab=99.41 E-value=2.3e-12 Score=139.67 Aligned_cols=152 Identities=14% Similarity=0.162 Sum_probs=107.2
Q ss_pred EEEecCCCCCc--cC----CC--CCeEeeCCCCccCC-C----cccccC----CCCCCCcccceeeeeCCCCCCCceEEe
Q 012913 32 YRIDCGSATST--TD----PF--NTTWQADDRYYTSG-A----TSIVSE----PLHFRFPHEKTLRYFPPSSGKKNCYII 94 (453)
Q Consensus 32 ~~IdCG~~~~~--~d----~~--~~~w~~Dd~~~~~g-~----~~~~~~----~~~~~~~~y~t~R~F~~~~g~~~cY~l 94 (453)
++||||+.+.. .| .- .|.|.+|+.|.... . ...+.. +...++..|+|+|.+... ....+|++
T Consensus 194 ~R~n~G~~~~~~~~d~~~D~~~~dR~W~~d~~~~~~~~~~~st~~~I~~~~~~~~~~P~~VyqTA~~~~~~-~~~lty~~ 272 (623)
T PLN03150 194 KRLSCGAGKSKFDEDYSGDHWGGDRFWNRMQTFGSGSDQAISTENVIKKASNAPNFYPESLYQSALVSTDT-QPDLSYTM 272 (623)
T ss_pred EEEEecCcccccccCCCCCcccCccccCcCcccCCCcccccccccccccccCCCccChHHHhhhhccccCC-CCceEEEe
Confidence 58999986422 22 22 79999976654321 1 111111 111245689999998642 23579999
Q ss_pred ecCCCccEEEEEEeecCCCCC-CCCCCcEEEEECCEEEEecCCCCCccc-CCCceEEEEEEEecCCeeEEEEeecCCCCc
Q 012913 95 PNLPPGRYYIRTFTVYDNYDG-KSHSPSFDVSVEGTLVFSWRSPWPEGL-ARDGAYSDLFAFVKDGELDLCFYSFATDPP 172 (453)
Q Consensus 95 pv~~~g~ylvRl~F~y~nyd~-~~~~~~Fdv~~~~~~~l~~~~~~~~~~-~~~~~~~E~i~~~~~~~l~vcf~~~~~~~p 172 (453)
++.++++|+||+||..-.... ....++|||++++..+++.+++....+ ...++++++.+.+.++.+.|+|+|.....|
T Consensus 273 ~v~~~~~Y~VrLhFaEi~~~~~~~~~R~F~V~ing~~~~~~~di~~~~g~~~~~~~~~~~v~~~~g~l~isl~p~~~s~p 352 (623)
T PLN03150 273 DVDPNRNYSVWLHFAEIDNSITAEGKRVFDVLINGDTAFKDVDIVKMSGERYTALVLNKTVAVSGRTLTIVLQPKKGTHA 352 (623)
T ss_pred ecCCCCCEEEEEEEEeccCccCCCceEEEEEEECCEEeecccChhhhcCCcccceEEEeEEeecCCeEEEEEeeCCCCcc
Confidence 987889999999998653222 234689999999999999888744322 234678898887778899999999866679
Q ss_pred eeeeeEeEecCC
Q 012913 173 VIASLEVQQIDP 184 (453)
Q Consensus 173 fIs~iEl~~l~~ 184 (453)
+||||||..+..
T Consensus 353 ilNaiEI~~~~~ 364 (623)
T PLN03150 353 IINAIEVFEIIT 364 (623)
T ss_pred eeeeeeeeeccc
Confidence 999999998865
No 6
>PF12819 Malectin_like: Carbohydrate-binding protein of the ER; InterPro: IPR024788 Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan []. This entry represents a malectin-like domain found in a number of plant receptor kinases.
Probab=99.37 E-value=4.7e-12 Score=127.72 Aligned_cols=154 Identities=25% Similarity=0.286 Sum_probs=104.4
Q ss_pred ccCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCeeEeeccccccCCCCCCCCChHHHHHhhhccC--CceEEEEEe
Q 012913 204 LTSGSNQWGPGFSNDADDFGRSWQSDAASRSPNAKSSIKSVTTRERITNTNQPPNYYPMKLYQTAIVSS--GAIQYNLAV 281 (453)
Q Consensus 204 in~G~~~~~~~~~~~~D~~dR~W~~d~~~~~~~~~~~~~~~~t~~~i~~~~~~~~~~P~~Vy~TAr~~~--~~l~~~f~v 281 (453)
||||+.... . .|.|+..+|.|.+|..|...+. +..|....+.....+...|+|||... ....|+|++
T Consensus 1 IdCG~~~~~-s-~y~D~~tg~~~~~D~~~~~~g~---------~~~i~~~~~~~~~~~~~~y~taR~F~~g~r~cY~l~~ 69 (347)
T PF12819_consen 1 IDCGSSSNS-S-SYVDDSTGRTWVSDDDFIDTGK---------SGNISSQPDSSSSDSSPPYQTARIFPEGSRNCYTLPV 69 (347)
T ss_pred CcCCCCCCC-c-ccccCCCCcEEeCCCCcccCCC---------ccccccccCCcCCccccccceEEEcCCCCccEEEeec
Confidence 789986421 1 1557788999999997763111 11221000000113456899999983 458999998
Q ss_pred C--CCCcEEEEEEEEeeecccCC----CceEEEEEEECCEeecccceeeccCCcceeEEEEEEEecCCceEEEEEeecC-
Q 012913 282 D--AKLDYLIWFHFAEIDSSVTK----AGQRVFDILVNDKNVTRVDIFNSVGSFAAYSWHYVAKNLSSTELTVKLVPVV- 354 (453)
Q Consensus 282 ~--~~~~Y~V~LhFaEi~~~~~~----~~~R~FdV~ing~~~~~~di~~~~g~~~~~~~~~~v~~~~~g~l~i~~~~~~- 354 (453)
. .+++|+|||||.-....... ...-.||++++...+..+++.. ....+++++|.+....++.|.|.|.++.
T Consensus 70 ~~~~~~~yliRl~F~~gnyd~~~fs~~~~~~~FdL~~~~n~~~tV~~~~--~~~~~~~~E~ii~v~~~~~l~vclv~~~~ 147 (347)
T PF12819_consen 70 TPPGGGKYLIRLHFYYGNYDGLNFSVSSSPPTFDLLLGFNFWSTVNLSN--SPSSPVVKEFIINVTWSDTLSVCLVPTGS 147 (347)
T ss_pred cCCCCceEEEEEEeccccccccccccccCCcceEEEECCceeEEEEecC--CCcceEEEEEEEEEcCCCcEEEEEEeCCC
Confidence 7 35699999999976542110 1134699999988654444322 1124688998888655689999999998
Q ss_pred CC-cceeEEeeeeccCC
Q 012913 355 GA-ALISGLENYALVPN 370 (453)
Q Consensus 355 ~s-p~lNaiEI~~~~~~ 370 (453)
+. |+||||||..+++.
T Consensus 148 g~~pFIsaiEl~~lp~~ 164 (347)
T PF12819_consen 148 GTFPFISAIELRPLPDS 164 (347)
T ss_pred CCCCceeEEEEEECCcc
Confidence 66 99999999999875
No 7
>KOG3593 consensus Predicted receptor-like serine/threonine kinase [Signal transduction mechanisms]
Probab=98.02 E-value=2.8e-06 Score=80.58 Aligned_cols=105 Identities=22% Similarity=0.286 Sum_probs=78.3
Q ss_pred hHHHHHhhhccCCceEEEEEeCCCCcEEEEEEEEeeecccCCCceEEEEEEEC-CEe-ecccceeeccCC-ccee--EEE
Q 012913 261 PMKLYQTAIVSSGAIQYNLAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVN-DKN-VTRVDIFNSVGS-FAAY--SWH 335 (453)
Q Consensus 261 P~~Vy~TAr~~~~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~in-g~~-~~~~di~~~~g~-~~~~--~~~ 335 (453)
...+|||+|.....+.|..+++..|.|-+.|-|||.+. +..+..+|||-+| +.. ++++|++...|. .+|- +..
T Consensus 107 d~ily~ter~neetFgyd~pik~dgdyalvlkfaevyF--~~~q~kvfdvrln~sh~vVk~ldi~~~vg~rg~AhDe~i~ 184 (355)
T KOG3593|consen 107 DIILYQTERYNEETFGYDVPIKEDGDYALVLKFAEVYF--KTCQHKVFDVRLNCSHCVVKALDIFDQVGDRGKAHDEIIP 184 (355)
T ss_pred hhhhhhhcccchhhhcccccccCCCceehhhhHHHHHH--HhhhhhheeeeeccceeEEeccchhhhcCCCcccccceEE
Confidence 35799999996445778888888899999999999986 5778999999999 665 788999888873 3331 111
Q ss_pred EEE-----------EecCCceEEEEEeecC-CCcceeEEeeeec
Q 012913 336 YVA-----------KNLSSTELTVKLVPVV-GAALISGLENYAL 367 (453)
Q Consensus 336 ~~v-----------~~~~~g~l~i~~~~~~-~sp~lNaiEI~~~ 367 (453)
+.+ ...+.|+++|+|.+.. .+|.+||..|++.
T Consensus 185 ~~i~~gkls~~gess~~t~gkl~le~~kg~ldnpk~~a~aIl~g 228 (355)
T KOG3593|consen 185 CLIGQGKLSVCGESSISTLGKLNLEFLKGVLDNPKDCARAILVG 228 (355)
T ss_pred EEEcCceEEEEeeeEEeecceEEEEeecccCCChhhhhHHHhhc
Confidence 111 1124478999998876 4699999988764
No 8
>PF08263 LRRNT_2: Leucine rich repeat N-terminal domain; InterPro: IPR013210 Leucine-rich repeats (LRR) consist of 2-45 motifs of 20-30 amino acids in length that generally folds into an arc or horseshoe shape []. LRRs occur in proteins ranging from viruses to eukaryotes, and appear to provide a structural framework for the formation of protein-protein interactions [, ].Proteins containing LRRs include tyrosine kinase receptors, cell-adhesion molecules, virulence factors, and extracellular matrix-binding glycoproteins, and are involved in a variety of biological processes, including signal transduction, cell adhesion, DNA repair, recombination, transcription, RNA processing, disease resistance, apoptosis, and the immune response []. Sequence analyses of LRR proteins suggested the existence of several different subfamilies of LRRs. The significance of this classification is that repeats from different subfamilies never occur simultaneously and have most probably evolved independently. It is, however, now clear that all major classes of LRR have curved horseshoe structures with a parallel beta sheet on the concave side and mostly helical elements on the convex side. At least six families of LRR proteins, characterised by different lengths and consensus sequences of the repeats, have been identified. Eleven-residue segments of the LRRs (LxxLxLxxN/CxL), corresponding to the beta-strand and adjacent loop regions, are conserved in LRR proteins, whereas the remaining parts of the repeats (herein termed variable) may be very different. Despite the differences, each of the variable parts contains two half-turns at both ends and a "linear" segment (as the chain follows a linear path overall), usually formed by a helix, in the middle. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structure of some LRR proteins-ligand complexes show that the concave surface of LRR domain is ideal for interaction with alpha-helix, thus supporting earlier conclusions that the elongated and curved LRR structure provides an outstanding framework for achieving diverse protein-protein interactions []. Molecular modeling suggests that the conserved pattern LxxLxL, which is shorter than the previously proposed LxxLxLxxN/CxL is sufficient to impart the characteristic horseshoe curvature to proteins with 20- to 30-residue repeats []. This domain is often found at the N terminus of tandem leucine rich repeats.; PDB: 3RGZ_A 3RJ0_A 3RIZ_A 3RGX_A 1OGQ_A.
Probab=97.59 E-value=6.4e-05 Score=51.79 Aligned_cols=35 Identities=43% Similarity=0.906 Sum_probs=25.2
Q ss_pred hhHHHHHHHHHHHcCC-C--CCCCCCC----CCCCCCCCCCceeEEeC
Q 012913 376 PEQVIAMRALKESLRV-P--DRMGWNG----DPCAPTNWDAWEGITCH 416 (453)
Q Consensus 376 ~~d~~aL~~ik~~~~~-~--~~~~W~g----dpC~p~~~~~W~Gi~C~ 416 (453)
++|++||++||+.+.. + .+.+|+. +|| .|.||+|+
T Consensus 2 ~~d~~aLl~~k~~l~~~~~~~l~~W~~~~~~~~C------~W~GV~Cd 43 (43)
T PF08263_consen 2 NQDRQALLAFKKSLNNDPSGVLSSWNPSSDSDPC------SWSGVTCD 43 (43)
T ss_dssp HHHHHHHHHHHHCTT-SC-CCCTT--TT--S-CC------CSTTEEE-
T ss_pred cHHHHHHHHHHHhcccccCcccccCCCcCCCCCe------eeccEEeC
Confidence 4789999999999974 3 3779963 566 89999995
No 9
>PLN00113 leucine-rich repeat receptor-like protein kinase; Provisional
Probab=97.48 E-value=0.00016 Score=82.94 Aligned_cols=70 Identities=27% Similarity=0.477 Sum_probs=56.6
Q ss_pred CChhHHHHHHHHHHHcCCCC--CCCCCC-CCCCCCCCCCceeEEeCCCCCCCcceEEEEEeCCCCceeecCccccCCCCC
Q 012913 374 TVPEQVIAMRALKESLRVPD--RMGWNG-DPCAPTNWDAWEGITCHPNKDETAVVISQIDLGSQGLKGYISDKISLLSNL 450 (453)
Q Consensus 374 t~~~d~~aL~~ik~~~~~~~--~~~W~g-dpC~p~~~~~W~Gi~C~~~~~~~~~~v~~L~Ls~~~LsG~ip~~l~~L~~L 450 (453)
..++|..+|+++|+.+..+. +.+|.. +.| |.|.||.|+. .++|+.|+|++++++|.+++.+..|++|
T Consensus 26 ~~~~~~~~l~~~~~~~~~~~~~~~~w~~~~~~-----c~w~gv~c~~-----~~~v~~L~L~~~~i~~~~~~~~~~l~~L 95 (968)
T PLN00113 26 LHAEELELLLSFKSSINDPLKYLSNWNSSADV-----CLWQGITCNN-----SSRVVSIDLSGKNISGKISSAIFRLPYI 95 (968)
T ss_pred CCHHHHHHHHHHHHhCCCCcccCCCCCCCCCC-----CcCcceecCC-----CCcEEEEEecCCCccccCChHHhCCCCC
Confidence 35688999999999986542 568963 445 4899999974 2579999999999999999989888888
Q ss_pred cCC
Q 012913 451 VNL 453 (453)
Q Consensus 451 ~~L 453 (453)
+.|
T Consensus 96 ~~L 98 (968)
T PLN00113 96 QTI 98 (968)
T ss_pred CEE
Confidence 865
No 10
>KOG3593 consensus Predicted receptor-like serine/threonine kinase [Signal transduction mechanisms]
Probab=94.24 E-value=0.033 Score=53.43 Aligned_cols=115 Identities=16% Similarity=0.148 Sum_probs=73.1
Q ss_pred ccccCCCCccEEEecCCCCCccCCCCCeEeeCCC-Cc----cCCCccccc-CCCCCCCcccceeeeeCCCCCCCceEEee
Q 012913 22 LASSYPYKASYRIDCGSATSTTDPFNTTWQADDR-YY----TSGATSIVS-EPLHFRFPHEKTLRYFPPSSGKKNCYIIP 95 (453)
Q Consensus 22 ~~~~~~~~~~~~IdCG~~~~~~d~~~~~w~~Dd~-~~----~~g~~~~~~-~~~~~~~~~y~t~R~F~~~~g~~~cY~lp 95 (453)
+.|+-+...-.-||||+.. .+|..|+.|-.|-. .. .-|..-.+- ....-+..+|+|+|+=.. .+-|..|
T Consensus 52 ~Taa~~~svI~aVncGgda-avd~ygI~f~aD~~~~VGrasd~G~~l~i~~raeeed~ily~ter~nee----tFgyd~p 126 (355)
T KOG3593|consen 52 PTAALPSSVIPAVNCGGDA-AVDNYGIRFAADPLEGVGRASDYGMVLGIGCRAEEEDIILYQTERYNEE----TFGYDVP 126 (355)
T ss_pred ccccCchhhhheeccCChh-hhcccceEeeccccccccccCCccceeeccccCChhhhhhhhhcccchh----hhccccc
Confidence 3444444443579999975 47888999988541 10 012110110 111123568999999754 3789999
Q ss_pred cCCCccEEEEEEeecCCCCCCCCCCcEEEEEC-CEEEEecCCCCCccc
Q 012913 96 NLPPGRYYIRTFTVYDNYDGKSHSPSFDVSVE-GTLVFSWRSPWPEGL 142 (453)
Q Consensus 96 v~~~g~ylvRl~F~y~nyd~~~~~~~Fdv~~~-~~~~l~~~~~~~~~~ 142 (453)
+...|.|-+=+.|..--|+. .+.-.|||.++ ...+...-+++..++
T Consensus 127 ik~dgdyalvlkfaevyF~~-~q~kvfdvrln~sh~vVk~ldi~~~vg 173 (355)
T KOG3593|consen 127 IKEDGDYALVLKFAEVYFKT-CQHKVFDVRLNCSHCVVKALDIFDQVG 173 (355)
T ss_pred ccCCCceehhhhHHHHHHHh-hhhhheeeeeccceeEEeccchhhhcC
Confidence 98889998888885332443 23468999999 777777777766554
No 11
>PF00560 LRR_1: Leucine Rich Repeat; InterPro: IPR001611 Leucine-rich repeats (LRR) consist of 2-45 motifs of 20-30 amino acids in length that generally folds into an arc or horseshoe shape []. LRRs occur in proteins ranging from viruses to eukaryotes, and appear to provide a structural framework for the formation of protein-protein interactions [, ].Proteins containing LRRs include tyrosine kinase receptors, cell-adhesion molecules, virulence factors, and extracellular matrix-binding glycoproteins, and are involved in a variety of biological processes, including signal transduction, cell adhesion, DNA repair, recombination, transcription, RNA processing, disease resistance, apoptosis, and the immune response []. Sequence analyses of LRR proteins suggested the existence of several different subfamilies of LRRs. The significance of this classification is that repeats from different subfamilies never occur simultaneously and have most probably evolved independently. It is, however, now clear that all major classes of LRR have curved horseshoe structures with a parallel beta sheet on the concave side and mostly helical elements on the convex side. At least six families of LRR proteins, characterised by different lengths and consensus sequences of the repeats, have been identified. Eleven-residue segments of the LRRs (LxxLxLxxN/CxL), corresponding to the beta-strand and adjacent loop regions, are conserved in LRR proteins, whereas the remaining parts of the repeats (herein termed variable) may be very different. Despite the differences, each of the variable parts contains two half-turns at both ends and a "linear" segment (as the chain follows a linear path overall), usually formed by a helix, in the middle. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structure of some LRR proteins-ligand complexes show that the concave surface of LRR domain is ideal for interaction with alpha-helix, thus supporting earlier conclusions that the elongated and curved LRR structure provides an outstanding framework for achieving diverse protein-protein interactions []. Molecular modeling suggests that the conserved pattern LxxLxL, which is shorter than the previously proposed LxxLxLxxN/CxL is sufficient to impart the characteristic horseshoe curvature to proteins with 20- to 30-residue repeats []. ; GO: 0005515 protein binding; PDB: 4ECO_B 2A0Z_A 3ULU_A 1ZIW_A 3ULV_A 1DCE_C 1LTX_A 3J0A_B 3A79_B 4FCG_A ....
Probab=87.28 E-value=0.23 Score=28.65 Aligned_cols=21 Identities=29% Similarity=0.415 Sum_probs=17.4
Q ss_pred EEEEEeCCCCceeecCccccCC
Q 012913 426 ISQIDLGSQGLKGYISDKISLL 447 (453)
Q Consensus 426 v~~L~Ls~~~LsG~ip~~l~~L 447 (453)
+..|+|++++|+ .||+.|++|
T Consensus 2 L~~Ldls~n~l~-~ip~~~~~l 22 (22)
T PF00560_consen 2 LEYLDLSGNNLT-SIPSSFSNL 22 (22)
T ss_dssp ESEEEETSSEES-EEGTTTTT-
T ss_pred ccEEECCCCcCE-eCChhhcCC
Confidence 457999999999 999988765
No 12
>PF12799 LRR_4: Leucine Rich repeats (2 copies); PDB: 2OMT_A 1XEU_A 2OMX_A 2OMU_A 2UZY_A 2WQU_D 1D0B_A 2WQW_A 1OTO_A 2WQV_B ....
Probab=81.31 E-value=0.63 Score=31.97 Aligned_cols=27 Identities=26% Similarity=0.468 Sum_probs=21.5
Q ss_pred EEEEEeCCCCceeecCccccCCCCCcCC
Q 012913 426 ISQIDLGSQGLKGYISDKISLLSNLVNL 453 (453)
Q Consensus 426 v~~L~Ls~~~LsG~ip~~l~~L~~L~~L 453 (453)
+..|+|++++++ .||+.|++|+.|+.|
T Consensus 3 L~~L~l~~N~i~-~l~~~l~~l~~L~~L 29 (44)
T PF12799_consen 3 LEELDLSNNQIT-DLPPELSNLPNLETL 29 (44)
T ss_dssp -SEEEETSSS-S-SHGGHGTTCTTSSEE
T ss_pred ceEEEccCCCCc-ccCchHhCCCCCCEE
Confidence 457899999999 688889999999865
No 13
>PF03422 CBM_6: Carbohydrate binding module (family 6); InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=56.39 E-value=1.1e+02 Score=25.19 Aligned_cols=76 Identities=20% Similarity=0.292 Sum_probs=46.5
Q ss_pred CceEEE-EEeCCCCcEEEEEEEEeeecccCCCceEEEEEEECC---EeecccceeeccCCccee-EEEEEEEecCCceEE
Q 012913 273 GAIQYN-LAVDAKLDYLIWFHFAEIDSSVTKAGQRVFDILVND---KNVTRVDIFNSVGSFAAY-SWHYVAKNLSSTELT 347 (453)
Q Consensus 273 ~~l~~~-f~v~~~~~Y~V~LhFaEi~~~~~~~~~R~FdV~ing---~~~~~~di~~~~g~~~~~-~~~~~v~~~~~g~l~ 347 (453)
..+.|. .++...+.|.+.+..+--. +.+.++|++|| ..+..+++-. .|+...+ .....+. ...|.-.
T Consensus 32 ~~~~~~~Vd~~~~g~y~~~~~~a~~~------~~~~~~l~id~~~g~~~~~~~~~~-tg~w~~~~~~~~~v~-l~~G~h~ 103 (125)
T PF03422_consen 32 DWIEYNNVDVPEAGTYTLTIRYANGG------GGGTIELRIDGPDGTLIGTVSLPP-TGGWDTWQTVSVSVK-LPAGKHT 103 (125)
T ss_dssp TEEEEEEEEESSSEEEEEEEEEEESS------SSEEEEEEETTTTSEEEEEEEEE--ESSTTEEEEEEEEEE-EESEEEE
T ss_pred CEEEEEEEeeCCCceEEEEEEEECCC------CCcEEEEEECCCCCcEEEEEEEcC-CCCccccEEEEEEEe-eCCCeeE
Confidence 358888 8887789999998887532 22889999999 5555566533 3442222 1112232 2346677
Q ss_pred EEEeecCCC
Q 012913 348 VKLVPVVGA 356 (453)
Q Consensus 348 i~~~~~~~s 356 (453)
|.|....+.
T Consensus 104 i~l~~~~~~ 112 (125)
T PF03422_consen 104 IYLVFNGGD 112 (125)
T ss_dssp EEEEESSSS
T ss_pred EEEEEECCC
Confidence 777766543
No 14
>PF13855 LRR_8: Leucine rich repeat; PDB: 2O6S_A 3A79_B 3RFS_A 3G39_A 3VQ2_A 3VQ1_B 2Z64_A 2Z66_C 3FXI_A 2Z63_A ....
Probab=47.02 E-value=9 Score=27.69 Aligned_cols=30 Identities=30% Similarity=0.355 Sum_probs=24.4
Q ss_pred ceEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913 424 VVISQIDLGSQGLKGYISDKISLLSNLVNL 453 (453)
Q Consensus 424 ~~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L 453 (453)
..+..|+|+++.++.--+..|..|++|+.|
T Consensus 25 ~~L~~L~l~~N~l~~i~~~~f~~l~~L~~L 54 (61)
T PF13855_consen 25 PNLETLDLSNNNLTSIPPDAFSNLPNLRYL 54 (61)
T ss_dssp TTESEEEETSSSESEEETTTTTTSTTESEE
T ss_pred CCCCEeEccCCccCccCHHHHcCCCCCCEE
Confidence 468889999999986666678999988864
No 15
>PF13855 LRR_8: Leucine rich repeat; PDB: 2O6S_A 3A79_B 3RFS_A 3G39_A 3VQ2_A 3VQ1_B 2Z64_A 2Z66_C 3FXI_A 2Z63_A ....
Probab=38.44 E-value=14 Score=26.60 Aligned_cols=29 Identities=31% Similarity=0.387 Sum_probs=22.5
Q ss_pred eEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913 425 VISQIDLGSQGLKGYISDKISLLSNLVNL 453 (453)
Q Consensus 425 ~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L 453 (453)
.+..|+|+++.|+.--+..|..|+.|+.|
T Consensus 2 ~L~~L~l~~n~l~~i~~~~f~~l~~L~~L 30 (61)
T PF13855_consen 2 NLESLDLSNNKLTEIPPDSFSNLPNLETL 30 (61)
T ss_dssp TESEEEETSSTESEECTTTTTTGTTESEE
T ss_pred cCcEEECCCCCCCccCHHHHcCCCCCCEe
Confidence 36789999999995444578889888764
No 16
>PRK06764 hypothetical protein; Provisional
Probab=33.34 E-value=40 Score=26.53 Aligned_cols=17 Identities=47% Similarity=0.741 Sum_probs=15.5
Q ss_pred ceEEeecCCCccEEEEE
Q 012913 90 NCYIIPNLPPGRYYIRT 106 (453)
Q Consensus 90 ~cY~lpv~~~g~ylvRl 106 (453)
+.|++.-.++|+|.||.
T Consensus 74 nkyti~f~kpg~yvirv 90 (105)
T PRK06764 74 NKYTIRFSKPGKYVIRV 90 (105)
T ss_pred eeeEEEecCCccEEEEE
Confidence 68999988999999997
No 17
>PF03422 CBM_6: Carbohydrate binding module (family 6); InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=32.86 E-value=2.8e+02 Score=22.78 Aligned_cols=38 Identities=16% Similarity=0.303 Sum_probs=28.4
Q ss_pred CCCCceEE-eecCCCccEEEEEEeecCCCCCCCCCCcEEEEECC
Q 012913 86 SGKKNCYI-IPNLPPGRYYIRTFTVYDNYDGKSHSPSFDVSVEG 128 (453)
Q Consensus 86 ~g~~~cY~-lpv~~~g~ylvRl~F~y~nyd~~~~~~~Fdv~~~~ 128 (453)
.|...+|+ +.+...|.|.|++.+.-+. + ...++|++++
T Consensus 30 ~G~~~~~~~Vd~~~~g~y~~~~~~a~~~--~---~~~~~l~id~ 68 (125)
T PF03422_consen 30 NGDWIEYNNVDVPEAGTYTLTIRYANGG--G---GGTIELRIDG 68 (125)
T ss_dssp TTTEEEEEEEEESSSEEEEEEEEEEESS--S---SEEEEEEETT
T ss_pred CCCEEEEEEEeeCCCceEEEEEEEECCC--C---CcEEEEEECC
Confidence 46667898 8887899999986665332 1 2689999998
No 18
>KOG2230 consensus Predicted beta-mannosidase [Carbohydrate transport and metabolism]
Probab=32.38 E-value=6.9e+02 Score=27.21 Aligned_cols=108 Identities=15% Similarity=0.288 Sum_probs=60.1
Q ss_pred eeEeeccccccCCCCCCCCChHHHHHhhhcc-----CCceEEEEEeCCCCcEEEEEEEEeeecccC-----CCceEEEEE
Q 012913 241 IKSVTTRERITNTNQPPNYYPMKLYQTAIVS-----SGAIQYNLAVDAKLDYLIWFHFAEIDSSVT-----KAGQRVFDI 310 (453)
Q Consensus 241 ~~~~~t~~~i~~~~~~~~~~P~~Vy~TAr~~-----~~~l~~~f~v~~~~~Y~V~LhFaEi~~~~~-----~~~~R~FdV 310 (453)
|.--+.+.+++.+..+|-..=+++|+..... ...+.|.|-...+.+|.=.+..-|+..-.. ..-.-+=.|
T Consensus 29 w~~~~~n~t~~~~g~vpg~i~s~l~~~gii~~~~~~~n~ln~kwia~d~wtysr~frl~dl~~~~~~~l~ie~vdtia~v 108 (867)
T KOG2230|consen 29 WEFSSSNKTVNGTGTVPGDIYSDLYASGIIDNPLFGENHLNLKWIAEDDWTYSRKFRLIDLDDTVGAFLEIESVDTIATV 108 (867)
T ss_pred eEEecCCCceecCCCCCchHhHHHHhcccccCccccccccceeEEeccCccceeeeEEEEccccccceEEEeecceeEEE
Confidence 4333445566665554444456778887665 234778876666777765555555543111 111345678
Q ss_pred EECCEe-ecccceeeccCCcceeEEEEEEEecCCceEEEEEeecC
Q 012913 311 LVNDKN-VTRVDIFNSVGSFAAYSWHYVAKNLSSTELTVKLVPVV 354 (453)
Q Consensus 311 ~ing~~-~~~~di~~~~g~~~~~~~~~~v~~~~~g~l~i~~~~~~ 354 (453)
|+||+. +..-+ .+.++..+++....+...+.+.|..+.
T Consensus 109 ~~n~~~v~~s~n------~f~~y~~~vt~ii~~~n~i~~~f~ssv 147 (867)
T KOG2230|consen 109 YVNGQKVLHSRN------QFLPYHVNVTDIIAGENDITIKFKSSV 147 (867)
T ss_pred EEccEEEeeccc------cceeEEEeEEEEecCCcceEEEeehhH
Confidence 999986 33211 244555555554444456777777654
No 19
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=27.00 E-value=41 Score=27.26 Aligned_cols=14 Identities=29% Similarity=0.249 Sum_probs=5.2
Q ss_pred hhHHHHHHHHHhhh
Q 012913 7 SSFFFLSLLLVLPL 20 (453)
Q Consensus 7 ~~~~~~~~~~~~~~ 20 (453)
+.+|+.++|+++++
T Consensus 5 ~~llL~l~LA~lLl 18 (95)
T PF07172_consen 5 AFLLLGLLLAALLL 18 (95)
T ss_pred HHHHHHHHHHHHHH
Confidence 33333333333333
No 20
>KOG0472 consensus Leucine-rich repeat protein [Function unknown]
Probab=21.44 E-value=37 Score=35.08 Aligned_cols=29 Identities=24% Similarity=0.383 Sum_probs=24.4
Q ss_pred ceEEEEEeCCCCceeecCccccCCCCCcCC
Q 012913 424 VVISQIDLGSQGLKGYISDKISLLSNLVNL 453 (453)
Q Consensus 424 ~~v~~L~Ls~~~LsG~ip~~l~~L~~L~~L 453 (453)
++++.|||.+|.|. .||+.+++|++|++|
T Consensus 505 ~nL~tLDL~nNdlq-~IPp~LgnmtnL~hL 533 (565)
T KOG0472|consen 505 RNLTTLDLQNNDLQ-QIPPILGNMTNLRHL 533 (565)
T ss_pred hhcceeccCCCchh-hCChhhccccceeEE
Confidence 35778999998886 789999999999875
Done!