Citrus Sinensis ID: 021996
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 304 | ||||||
| 224133586 | 300 | predicted protein [Populus trichocarpa] | 0.914 | 0.926 | 0.760 | 1e-125 | |
| 118481976 | 300 | unknown [Populus trichocarpa] | 0.914 | 0.926 | 0.760 | 1e-125 | |
| 356513619 | 319 | PREDICTED: uncharacterized protein LOC10 | 0.888 | 0.846 | 0.737 | 1e-120 | |
| 449487756 | 303 | PREDICTED: uncharacterized protein LOC10 | 0.888 | 0.891 | 0.725 | 1e-119 | |
| 359491235 | 306 | PREDICTED: uncharacterized protein LOC10 | 0.986 | 0.980 | 0.672 | 1e-115 | |
| 297792333 | 303 | hypothetical protein ARALYDRAFT_495088 [ | 0.904 | 0.907 | 0.660 | 1e-108 | |
| 449469703 | 292 | PREDICTED: uncharacterized protein LOC10 | 0.756 | 0.787 | 0.743 | 1e-106 | |
| 9759031 | 335 | unnamed protein product [Arabidopsis tha | 0.911 | 0.826 | 0.635 | 1e-106 | |
| 55740683 | 303 | hypothetical protein At5g50290 [Arabidop | 0.904 | 0.907 | 0.648 | 1e-106 | |
| 79533330 | 303 | uncharacterized protein [Arabidopsis tha | 0.904 | 0.907 | 0.648 | 1e-106 |
| >gi|224133586|ref|XP_002321611.1| predicted protein [Populus trichocarpa] gi|222868607|gb|EEF05738.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 213/280 (76%), Positives = 249/280 (88%), Gaps = 2/280 (0%)
Query: 1 MSTLFLATFLFIFSASNPLTTADIHINK-CRSYCGNITIDYPFAIQQGCGHPGFRDLLFC 59
MSTLFL + S +++H+N CRSYCGNITIDYPFA+Q GCGHPGFRDLLFC
Sbjct: 1 MSTLFLIILSILSSLVPLSFASNVHVNNPCRSYCGNITIDYPFALQYGCGHPGFRDLLFC 60
Query: 60 VNDFLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNP 119
+ND LMFHISSGSYRVLEIDYAYQS+T+H+ H+STCD +VLGGKGNGFAVEQWR+PYFNP
Sbjct: 61 MNDVLMFHISSGSYRVLEIDYAYQSVTIHEPHLSTCDTLVLGGKGNGFAVEQWRSPYFNP 120
Query: 120 TADNVFMLIGCSAKSPLFQGFPGQHLPCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSG 179
TADNVFMLIGCSA+S LFQGFPG+HLPCRNVSGMGCE+YY CP+WSL GR + M+GSG
Sbjct: 121 TADNVFMLIGCSAQSSLFQGFPGKHLPCRNVSGMGCEEYYGCPAWSLAGRGQMGSMFGSG 180
Query: 180 PPECCAVAFESIKMINLTKLECEGYASAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCR 239
PPECCAVAFE+I+ INL+KL+CEGY+SAYSLAPL++DGPSEWS+GIRVKYSVQ G++FCR
Sbjct: 181 PPECCAVAFEAIRAINLSKLDCEGYSSAYSLAPLRVDGPSEWSFGIRVKYSVQ-GNEFCR 239
Query: 240 ACEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSVGSSSSR 279
ACEATGGTCG+G++G+RQLCMCG +NSTSNCDSV S++ R
Sbjct: 240 ACEATGGTCGYGSNGIRQLCMCGDMNSTSNCDSVTSATDR 279
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|118481976|gb|ABK92919.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356513619|ref|XP_003525509.1| PREDICTED: uncharacterized protein LOC100818252 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449487756|ref|XP_004157785.1| PREDICTED: uncharacterized protein LOC101230571 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|359491235|ref|XP_002282273.2| PREDICTED: uncharacterized protein LOC100250137 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|297792333|ref|XP_002864051.1| hypothetical protein ARALYDRAFT_495088 [Arabidopsis lyrata subsp. lyrata] gi|297309886|gb|EFH40310.1| hypothetical protein ARALYDRAFT_495088 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|449469703|ref|XP_004152558.1| PREDICTED: uncharacterized protein LOC101203279 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|9759031|dbj|BAB09400.1| unnamed protein product [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|55740683|gb|AAV63934.1| hypothetical protein At5g50290 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|79533330|ref|NP_199840.2| uncharacterized protein [Arabidopsis thaliana] gi|52354535|gb|AAU44588.1| hypothetical protein AT5G50290 [Arabidopsis thaliana] gi|332008539|gb|AED95922.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 304 | ||||||
| TAIR|locus:2157742 | 303 | AT5G50290 "AT5G50290" [Arabido | 0.832 | 0.834 | 0.692 | 9.5e-105 | |
| TAIR|locus:2012863 | 305 | AT1G10380 "AT1G10380" [Arabido | 0.865 | 0.862 | 0.326 | 4.2e-31 | |
| TAIR|locus:505006118 | 329 | AT1G11915 "AT1G11915" [Arabido | 0.832 | 0.768 | 0.296 | 1.1e-21 | |
| TAIR|locus:2014897 | 741 | WAK3 "wall associated kinase 3 | 0.411 | 0.168 | 0.311 | 3.5e-05 | |
| TAIR|locus:2204465 | 1296 | AT1G66880 [Arabidopsis thalian | 0.230 | 0.054 | 0.363 | 0.00012 | |
| TAIR|locus:2204470 | 332 | AT1G66940 "AT1G66940" [Arabido | 0.312 | 0.286 | 0.314 | 0.00013 |
| TAIR|locus:2157742 AT5G50290 "AT5G50290" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1037 (370.1 bits), Expect = 9.5e-105, P = 9.5e-105
Identities = 180/260 (69%), Positives = 214/260 (82%)
Query: 26 INKCRSYCGNITIDYPFAIQQGCGHPGFRDLLFCVNDFLMFHISSGSYRVLEIDYAYQSL 85
++ CRSYCGNIT+DYPF I+ GCGHPG+RDLLFC+ND LMFHISSGSYRVL+IDYAYQS+
Sbjct: 19 VDACRSYCGNITVDYPFGIRNGCGHPGYRDLLFCMNDVLMFHISSGSYRVLDIDYAYQSI 78
Query: 86 TLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSAKSPLFQGFPGQHL 145
TLHD HMS C+ +VLGGKGNGF E WR PYFNPT+DNVFMLIGCS KSP+FQGFP + +
Sbjct: 79 TLHDPHMSNCETIVLGGKGNGFEAEDWRTPYFNPTSDNVFMLIGCSPKSPIFQGFPEKKV 138
Query: 146 PCRNVSGMGCEDYYRCPSWSLVGRKRTAPMYGSGPPECCAVAFESIKMINLTKLECEGYA 205
PCRN+SGM CE+Y CP+W +VG ++ GSGPP CC V FES+K INL+KLECEGY+
Sbjct: 139 PCRNISGMSCEEYMSCPAWDMVGYRQPGIHSGSGPPMCCGVGFESVKAINLSKLECEGYS 198
Query: 206 SAYSLAPLKIDGPSEWSYGIRVKYSVQGGDQFCRACEATGGTCGFG-TDG--VRQLCMCG 262
SAY+LAPLK+ GPS+W+YGIRVKY +QG D FCRAC AT GTCG+ DG +R +CMC
Sbjct: 199 SAYNLAPLKLRGPSDWAYGIRVKYELQGSDAFCRACVATSGTCGYEPADGGGLRHVCMCD 258
Query: 263 SVNSTSNCDSV----GSSSS 278
+ NST+NCDSV G+SSS
Sbjct: 259 NHNSTTNCDSVISPTGASSS 278
|
|
| TAIR|locus:2012863 AT1G10380 "AT1G10380" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:505006118 AT1G11915 "AT1G11915" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2014897 WAK3 "wall associated kinase 3" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2204465 AT1G66880 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2204470 AT1G66940 "AT1G66940" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 304 | |||
| pfam13947 | 106 | pfam13947, GUB_WAK_bind, Wall-associated receptor | 7e-21 |
| >gnl|CDD|222467 pfam13947, GUB_WAK_bind, Wall-associated receptor kinase galacturonan-binding | Back alignment and domain information |
|---|
Score = 85.0 bits (211), Expect = 7e-21
Identities = 35/107 (32%), Positives = 50/107 (46%), Gaps = 7/107 (6%)
Query: 29 CRSYCGNITIDYPFAIQQGCGH-PGFRDLLFCVNDF--LMFHISSGSYRVLEIDYAYQSL 85
C CGN++I YPF I GC PGF L C N + +G+Y VL I A ++
Sbjct: 4 CPDRCGNVSIPYPFGIGPGCARDPGFE--LTCNNTTSPPRLLLGNGNYEVLSISLANGTV 61
Query: 86 TLHDAHMSTCDNMVLGGKGNGFAVEQWRAPYFNPTADNVFMLIGCSA 132
+ D S C N GK P+F ++ N F+++GC+
Sbjct: 62 RVLDPISSNCYN--SSGKRTDNGSLSLGGPFFFSSSRNKFVVVGCNT 106
|
This cysteine-rich GUB_WAK_bind domain is the extracellular part of this serine/threonine kinase that binds to the cell-wall pectins. Length = 106 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 304 | |||
| PF13947 | 106 | GUB_WAK_bind: Wall-associated receptor kinase gala | 99.91 | |
| PF14380 | 94 | WAK_assoc: Wall-associated receptor kinase C-termi | 99.31 | |
| PF08488 | 103 | WAK: Wall-associated kinase; InterPro: IPR013695 T | 93.18 | |
| PHA03099 | 139 | epidermal growth factor-like protein (EGF-like pro | 80.97 |
| >PF13947 GUB_WAK_bind: Wall-associated receptor kinase galacturonan-binding | Back alignment and domain information |
|---|
Probab=99.91 E-value=4.5e-24 Score=168.69 Aligned_cols=102 Identities=35% Similarity=0.770 Sum_probs=86.4
Q ss_pred cCcccCCcCCccccCCCCCCCCCCc-cCCCCceEeecC--ceeeeeCCccEEEEEEEcCCCEEEEEeCCCCCccCCCCCC
Q 021996 26 INKCRSYCGNITIDYPFAIQQGCGH-PGFRDLLFCVND--FLMFHISSGSYRVLEIDYAYQSLTLHDAHMSTCDNMVLGG 102 (304)
Q Consensus 26 ~~~C~~~CG~i~I~YPFg~~~~Cg~-p~F~~~L~C~~~--~~~l~~~~~~~~V~~I~~~~~~i~v~~~~~~~C~~~~~~~ 102 (304)
+++|+.+|||++|+||||++++|++ |+|+ |+|+++ .++|.+..++|+|++|+|++++|+|.+++...|+......
T Consensus 1 ~~~C~~~CGnv~IpYPFgi~~~C~~~~~F~--L~C~~~~~~~~l~l~~~~~~V~~I~~~~~~i~v~~~~~~~~~~~~~~~ 78 (106)
T PF13947_consen 1 KPGCPSSCGNVSIPYPFGIGPGCGRDPGFE--LTCNNNTSPPKLLLSSGNYEVLSISYENGTIRVSDPISSNCYSSSSSN 78 (106)
T ss_pred CCCCCCccCCEeecCCCccCCCCCCCCCcE--EECCCCCCCceeEecCCcEEEEEEecCCCEEEEEeccccceecCCCCc
Confidence 4789999999999999999999999 9999 999987 4677778899999999999999999999888887654432
Q ss_pred C-CCceeecccCCcccccCCCceEEEEcCCC
Q 021996 103 K-GNGFAVEQWRAPYFNPTADNVFMLIGCSA 132 (304)
Q Consensus 103 ~-~~~~~l~~~~~pf~~~~~~N~~~~~gC~~ 132 (304)
. ...+++ ..||+.++..|.++++||++
T Consensus 79 ~~~~~~~~---~~~~~~s~~~N~~~~~GC~t 106 (106)
T PF13947_consen 79 SSNSNLSL---NGPFFFSSSSNKFTVVGCNT 106 (106)
T ss_pred ccccEEee---cCCceEccCCcEEEEECCCC
Confidence 2 223444 45788888999999999985
|
|
| >PF14380 WAK_assoc: Wall-associated receptor kinase C-terminal | Back alignment and domain information |
|---|
| >PF08488 WAK: Wall-associated kinase; InterPro: IPR013695 This domain is found together with the eukaryotic protein kinase domain IPR000719 from INTERPRO in plant wall-associated kinases (WAKs) and related proteins | Back alignment and domain information |
|---|
| >PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 304 | |||
| d1nqlb_ | 48 | Epidermal growth factor, EGF {Human (Homo sapiens) | 87.1 | |
| d1k36a_ | 46 | Epiregulin, EGF-domain {Human (Homo sapiens) [TaxI | 85.73 | |
| d1tpga1 | 41 | Plasminogen activator (tissue-type), t-PA {Human ( | 85.65 | |
| d3egfa_ | 53 | Epidermal growth factor, EGF {Mouse (Mus musculus) | 84.59 |
| >d1nqlb_ g.3.11.1 (B:) Epidermal growth factor, EGF {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: Small proteins fold: Knottins (small inhibitors, toxins, lectins) superfamily: EGF/Laminin family: EGF-type module domain: Epidermal growth factor, EGF species: Human (Homo sapiens) [TaxId: 9606]
Probab=87.10 E-value=0.16 Score=31.45 Aligned_cols=32 Identities=19% Similarity=0.347 Sum_probs=25.2
Q ss_pred cccCCceeccCCCCCceeEeeCCCCCCCCCCCC
Q 021996 241 CEATGGTCGFGTDGVRQLCMCGSVNSTSNCDSV 273 (304)
Q Consensus 241 C~~s~g~Cg~~~~~~g~~C~C~~~~~~~~C~~~ 273 (304)
|.. +|+|.+.....+|.|.|..+.....|.-.
T Consensus 12 C~n-gG~C~~~~~~~~~~C~C~~gy~G~~Ce~~ 43 (48)
T d1nqlb_ 12 CLH-DGVCMYIEALDKYACNCVVGYIGERCQYR 43 (48)
T ss_dssp ECT-TCEEEEETTTTEEEEEECTTEESTTSCEE
T ss_pred CCC-CcEEeeeCCCCCeEEeCCCCcccCCCCcC
Confidence 542 79998876777999999998777788743
|
| >d1k36a_ g.3.11.1 (A:) Epiregulin, EGF-domain {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1tpga1 g.3.11.1 (A:51-91) Plasminogen activator (tissue-type), t-PA {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d3egfa_ g.3.11.1 (A:) Epidermal growth factor, EGF {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|