Citrus Sinensis ID: 021996


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300----
MSTLFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSRISWPLSQISAGFFVLLSIWMTINHG
ccHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccEEEEEcccEEEEEccccEEEEEEEccccEEEEEEccccccccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEccEEEEEEcccccccccccEEEcccccccccccccEEEEEEEEEEccccccHHHHccccEEccccccccEEEEccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHcc
cHHHHHHHHHHHHHHccccccccccccHcccccccEEEcccccccccccccccccEEEEccccEEEEcccccEEEEEEEccccEEEEEcccccccccccccccccccccccccccccccccccEEEEEEccccccccccccccccccccccccccccccccccccEccccccccccccccccccccccccEEEEEccccccccccccEcccccccccccccccEEEEEEEccccccHHHHHcccccEEcccccccEEEEEEccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHcc
MSTLFLATFLFIfsasnplttadiHINKCRSycgnitidypfaiqqgcghpgfrDLLFCVNDFLMFHISSGSYRVLEIDYAYQsltlhdahmstcdnmvlggkgngfaveqwrapyfnptadnVFMLIGcsaksplfqgfpgqhlpcrnvsgmgcedyyrcpswslvgrkrtapmygsgppecCAVAFESIKMINLtklecegyasayslaplkidgpsewsygIRVKYSVQGGDQFCRaceatggtcgfgtdgvrqlcmcgsvnstsncdsvgssssriswplsqiSAGFFVLLSIWMTINHG
MSTLFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSRISWPLSQISAGFFVLLSIWMTINHG
MSTLFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSRISWPLSQISAGFFVLLSIWMTINHG
***LFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGS*****************************************
*STLFLATFLFIFSASNPLTTA*******R*YCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSRISWPLSQISAGFFVLLSIWMTINHG
MSTLFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNS*************ISWPLSQISAGFFVLLSIWMTINHG
*STLFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSRISWPLSQISAGFFVLLSIWMTINHG
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHooo
SSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiii
SSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSTLFLATFLFIFSASNPLTTADIHINKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSRISWPLSQISAGFFVLLSIWMTINHG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query304 2.2.26 [Sep-21-2011]
Q9LMN8 741 Wall-associated receptor no no 0.332 0.136 0.333 9e-05
>sp|Q9LMN8|WAK3_ARATH Wall-associated receptor kinase 3 OS=Arabidopsis thaliana GN=WAK3 PE=2 SV=2 Back     alignment and function desciption
 Score = 48.1 bits (113), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 10/111 (9%)

Query: 29  CRSYCGNITIDYPFAIQQGCGHPGFRDL-LFCV--NDFLMFHISSGSYRVLEIDYAYQSL 85
           C+  CGN+TI+YPF I  GC +PG  +  L CV     L+F    G  +V  I ++    
Sbjct: 31  CKLKCGNVTIEYPFGISTGCYYPGDDNFNLTCVVEEKLLLF----GIIQVTNISHSGHVS 86

Query: 86  TLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPL 136
            L +   S C       + NG A+       F+ +++N F L+GC+A S L
Sbjct: 87  VLFE-RFSECYEQ--KNETNGTALGYQLGSSFSLSSNNKFTLVGCNALSLL 134




Serine/threonine-protein kinase that may function as a signaling receptor of extracellular matrix component. Binding to pectin may have significance in the control of cell expansion, morphogenesis and development.
Arabidopsis thaliana (taxid: 3702)
EC: 2EC: .EC: 7EC: .EC: 1EC: 1EC: .EC: -

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query304
224133586300 predicted protein [Populus trichocarpa] 0.914 0.926 0.760 1e-125
118481976300 unknown [Populus trichocarpa] 0.914 0.926 0.760 1e-125
356513619319 PREDICTED: uncharacterized protein LOC10 0.888 0.846 0.737 1e-120
449487756303 PREDICTED: uncharacterized protein LOC10 0.888 0.891 0.725 1e-119
359491235306 PREDICTED: uncharacterized protein LOC10 0.986 0.980 0.672 1e-115
297792333303 hypothetical protein ARALYDRAFT_495088 [ 0.904 0.907 0.660 1e-108
449469703292 PREDICTED: uncharacterized protein LOC10 0.756 0.787 0.743 1e-106
9759031335 unnamed protein product [Arabidopsis tha 0.911 0.826 0.635 1e-106
55740683303 hypothetical protein At5g50290 [Arabidop 0.904 0.907 0.648 1e-106
79533330303 uncharacterized protein [Arabidopsis tha 0.904 0.907 0.648 1e-106
>gi|224133586|ref|XP_002321611.1| predicted protein [Populus trichocarpa] gi|222868607|gb|EEF05738.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 213/280 (76%), Positives = 249/280 (88%), Gaps = 2/280 (0%)

Query: 1   MSTLFLATFLFIFSASNPLTTADIHINK-CRSYCGNITIDYPFAIQQGCGHPGFRDLLFC 59
           MSTLFL     + S       +++H+N  CRSYCGNITIDYPFA+Q GCGHPGFRDLLFC
Sbjct: 1   MSTLFLIILSILSSLVPLSFASNVHVNNPCRSYCGNITIDYPFALQYGCGHPGFRDLLFC 60

Query: 60  VNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNP 119
           +ND LMFHISSGSYRVLEIDYAYQS+T+H+ H+STCD +VLGGKGNGFAVEQWR+PYFNP
Sbjct: 61  MNDVLMFHISSGSYRVLEIDYAYQSVTIHEPHLSTCDTLVLGGKGNGFAVEQWRSPYFNP 120

Query: 120 TADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSG 179
           TADNVFMLIGCSA+S LFQGFPG+HLPCRNVSGMGCE+YY CP+WSL GR +   M+GSG
Sbjct: 121 TADNVFMLIGCSAQSSLFQGFPGKHLPCRNVSGMGCEEYYGCPAWSLAGRGQMGSMFGSG 180

Query: 180 PPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCR 239
           PPECCAVAFE+I+ INL+KL+CEGY+SAYSLAPL++DGPSEWS+GIRVKYSVQ G++FCR
Sbjct: 181 PPECCAVAFEAIRAINLSKLDCEGYSSAYSLAPLRVDGPSEWSFGIRVKYSVQ-GNEFCR 239

Query: 240 ACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSR 279
           ACEATGGTCG+G++G+RQLCMCG +NSTSNCDSV S++ R
Sbjct: 240 ACEATGGTCGYGSNGIRQLCMCGDMNSTSNCDSVTSATDR 279




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|118481976|gb|ABK92919.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356513619|ref|XP_003525509.1| PREDICTED: uncharacterized protein LOC100818252 [Glycine max] Back     alignment and taxonomy information
>gi|449487756|ref|XP_004157785.1| PREDICTED: uncharacterized protein LOC101230571 [Cucumis sativus] Back     alignment and taxonomy information
>gi|359491235|ref|XP_002282273.2| PREDICTED: uncharacterized protein LOC100250137 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297792333|ref|XP_002864051.1| hypothetical protein ARALYDRAFT_495088 [Arabidopsis lyrata subsp. lyrata] gi|297309886|gb|EFH40310.1| hypothetical protein ARALYDRAFT_495088 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449469703|ref|XP_004152558.1| PREDICTED: uncharacterized protein LOC101203279 [Cucumis sativus] Back     alignment and taxonomy information
>gi|9759031|dbj|BAB09400.1| unnamed protein product [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|55740683|gb|AAV63934.1| hypothetical protein At5g50290 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|79533330|ref|NP_199840.2| uncharacterized protein [Arabidopsis thaliana] gi|52354535|gb|AAU44588.1| hypothetical protein AT5G50290 [Arabidopsis thaliana] gi|332008539|gb|AED95922.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query304
TAIR|locus:2157742303 AT5G50290 "AT5G50290" [Arabido 0.832 0.834 0.692 9.5e-105
TAIR|locus:2012863305 AT1G10380 "AT1G10380" [Arabido 0.865 0.862 0.326 4.2e-31
TAIR|locus:505006118329 AT1G11915 "AT1G11915" [Arabido 0.832 0.768 0.296 1.1e-21
TAIR|locus:2014897 741 WAK3 "wall associated kinase 3 0.411 0.168 0.311 3.5e-05
TAIR|locus:2204465 1296 AT1G66880 [Arabidopsis thalian 0.230 0.054 0.363 0.00012
TAIR|locus:2204470332 AT1G66940 "AT1G66940" [Arabido 0.312 0.286 0.314 0.00013
TAIR|locus:2157742 AT5G50290 "AT5G50290" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1037 (370.1 bits), Expect = 9.5e-105, P = 9.5e-105
 Identities = 180/260 (69%), Positives = 214/260 (82%)

Query:    26 INKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSL 85
             ++ CRSYCGNIT+DYPF I+ GCGHPG+RDLLFC+ND LMFHISSGSYRVL+IDYAYQS+
Sbjct:    19 VDACRSYCGNITVDYPFGIRNGCGHPGYRDLLFCMNDVLMFHISSGSYRVLDIDYAYQSI 78

Query:    86 TLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHL 145
             TLHD HMS C+ +VLGGKGNGF  E WR PYFNPT+DNVFMLIGCS KSP+FQGFP + +
Sbjct:    79 TLHDPHMSNCETIVLGGKGNGFEAEDWRTPYFNPTSDNVFMLIGCSPKSPIFQGFPEKKV 138

Query:   146 PCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYA 205
             PCRN+SGM CE+Y  CP+W +VG ++     GSGPP CC V FES+K INL+KLECEGY+
Sbjct:   139 PCRNISGMSCEEYMSCPAWDMVGYRQPGIHSGSGPPMCCGVGFESVKAINLSKLECEGYS 198

Query:   206 SAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFG-TDG--VRQLCMCG 262
             SAY+LAPLK+ GPS+W+YGIRVKY +QG D FCRAC AT GTCG+   DG  +R +CMC 
Sbjct:   199 SAYNLAPLKLRGPSDWAYGIRVKYELQGSDAFCRACVATSGTCGYEPADGGGLRHVCMCD 258

Query:   263 SVNSTSNCDSV----GSSSS 278
             + NST+NCDSV    G+SSS
Sbjct:   259 NHNSTTNCDSVISPTGASSS 278




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2012863 AT1G10380 "AT1G10380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006118 AT1G11915 "AT1G11915" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2014897 WAK3 "wall associated kinase 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2204465 AT1G66880 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2204470 AT1G66940 "AT1G66940" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query304
pfam13947106 pfam13947, GUB_WAK_bind, Wall-associated receptor 7e-21
>gnl|CDD|222467 pfam13947, GUB_WAK_bind, Wall-associated receptor kinase galacturonan-binding Back     alignment and domain information
 Score = 85.0 bits (211), Expect = 7e-21
 Identities = 35/107 (32%), Positives = 50/107 (46%), Gaps = 7/107 (6%)

Query: 29  CRSYCGNITIDYPFAIQQGCGH-PGFRDLLFCVNDF--LMFHISSGSYRVLEIDYAYQSL 85
           C   CGN++I YPF I  GC   PGF   L C N        + +G+Y VL I  A  ++
Sbjct: 4   CPDRCGNVSIPYPFGIGPGCARDPGFE--LTCNNTTSPPRLLLGNGNYEVLSISLANGTV 61

Query: 86  TLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSA 132
            + D   S C N    GK           P+F  ++ N F+++GC+ 
Sbjct: 62  RVLDPISSNCYN--SSGKRTDNGSLSLGGPFFFSSSRNKFVVVGCNT 106


This cysteine-rich GUB_WAK_bind domain is the extracellular part of this serine/threonine kinase that binds to the cell-wall pectins. Length = 106

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 304
PF13947106 GUB_WAK_bind: Wall-associated receptor kinase gala 99.91
PF1438094 WAK_assoc: Wall-associated receptor kinase C-termi 99.31
PF08488103 WAK: Wall-associated kinase; InterPro: IPR013695 T 93.18
PHA03099139 epidermal growth factor-like protein (EGF-like pro 80.97
>PF13947 GUB_WAK_bind: Wall-associated receptor kinase galacturonan-binding Back     alignment and domain information
Probab=99.91  E-value=4.5e-24  Score=168.69  Aligned_cols=102  Identities=35%  Similarity=0.770  Sum_probs=86.4

Q ss_pred             cCcccCCcCCccccCCCCCCCCCCc-cCCCCceEeecC--ceeeeeCCccEEEEEEEcCCCEEEEEeCCCCCccCCCCCC
Q 021996           26 INKCRSYCGNITIDYPFAIQQGCGH-PGFRDLLFCVND--FLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGG  102 (304)
Q Consensus        26 ~~~C~~~CG~i~I~YPFg~~~~Cg~-p~F~~~L~C~~~--~~~l~~~~~~~~V~~I~~~~~~i~v~~~~~~~C~~~~~~~  102 (304)
                      +++|+.+|||++|+||||++++|++ |+|+  |+|+++  .++|.+..++|+|++|+|++++|+|.+++...|+......
T Consensus         1 ~~~C~~~CGnv~IpYPFgi~~~C~~~~~F~--L~C~~~~~~~~l~l~~~~~~V~~I~~~~~~i~v~~~~~~~~~~~~~~~   78 (106)
T PF13947_consen    1 KPGCPSSCGNVSIPYPFGIGPGCGRDPGFE--LTCNNNTSPPKLLLSSGNYEVLSISYENGTIRVSDPISSNCYSSSSSN   78 (106)
T ss_pred             CCCCCCccCCEeecCCCccCCCCCCCCCcE--EECCCCCCCceeEecCCcEEEEEEecCCCEEEEEeccccceecCCCCc
Confidence            4789999999999999999999999 9999  999987  4677778899999999999999999999888887654432


Q ss_pred             C-CCceeecccCCcccccCCCceEEEEcCCC
Q 021996          103 K-GNGFAVEQWRAPYFNPTADNVFMLIGCSA  132 (304)
Q Consensus       103 ~-~~~~~l~~~~~pf~~~~~~N~~~~~gC~~  132 (304)
                      . ...+++   ..||+.++..|.++++||++
T Consensus        79 ~~~~~~~~---~~~~~~s~~~N~~~~~GC~t  106 (106)
T PF13947_consen   79 SSNSNLSL---NGPFFFSSSSNKFTVVGCNT  106 (106)
T ss_pred             ccccEEee---cCCceEccCCcEEEEECCCC
Confidence            2 223444   45788888999999999985



>PF14380 WAK_assoc: Wall-associated receptor kinase C-terminal Back     alignment and domain information
>PF08488 WAK: Wall-associated kinase; InterPro: IPR013695 This domain is found together with the eukaryotic protein kinase domain IPR000719 from INTERPRO in plant wall-associated kinases (WAKs) and related proteins Back     alignment and domain information
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query304
d1nqlb_48 Epidermal growth factor, EGF {Human (Homo sapiens) 87.1
d1k36a_46 Epiregulin, EGF-domain {Human (Homo sapiens) [TaxI 85.73
d1tpga141 Plasminogen activator (tissue-type), t-PA {Human ( 85.65
d3egfa_53 Epidermal growth factor, EGF {Mouse (Mus musculus) 84.59
>d1nqlb_ g.3.11.1 (B:) Epidermal growth factor, EGF {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Small proteins
fold: Knottins (small inhibitors, toxins, lectins)
superfamily: EGF/Laminin
family: EGF-type module
domain: Epidermal growth factor, EGF
species: Human (Homo sapiens) [TaxId: 9606]
Probab=87.10  E-value=0.16  Score=31.45  Aligned_cols=32  Identities=19%  Similarity=0.347  Sum_probs=25.2

Q ss_pred             cccCCceeccCCCCCceeEeeCCCCCCCCCCCC
Q 021996          241 CEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSV  273 (304)
Q Consensus       241 C~~s~g~Cg~~~~~~g~~C~C~~~~~~~~C~~~  273 (304)
                      |.. +|+|.+.....+|.|.|..+.....|.-.
T Consensus        12 C~n-gG~C~~~~~~~~~~C~C~~gy~G~~Ce~~   43 (48)
T d1nqlb_          12 CLH-DGVCMYIEALDKYACNCVVGYIGERCQYR   43 (48)
T ss_dssp             ECT-TCEEEEETTTTEEEEEECTTEESTTSCEE
T ss_pred             CCC-CcEEeeeCCCCCeEEeCCCCcccCCCCcC
Confidence            542 79998876777999999998777788743



>d1k36a_ g.3.11.1 (A:) Epiregulin, EGF-domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1tpga1 g.3.11.1 (A:51-91) Plasminogen activator (tissue-type), t-PA {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d3egfa_ g.3.11.1 (A:) Epidermal growth factor, EGF {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure