Citrus Sinensis ID: 035606


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230--
MWKGSRKFQRSGPWNGLQFSASPLRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYVTPL
cccccEEEEEEcccccEEEEEccccccccEEEEEEEcccEEEEEEEEccccEEEEEEEEccccEEEEEEEEcccccEEEEEEccccccccccccccccccccccccccccccccccccccccccccccEEcccccccccccEEEcccccccccccEEEEccccHHHHHHHHHcccccccccccccccccccEEEEccccEEEEEcccccEEEEEEEEccccccccEEEEEEc
cccccEEEEEccccccEEEcccccccccEEEEEEEccccEEEEEEEEEcccEEEEEEEcccccEEEEEEEcccccccEEEEEccccccccHHccccccEEccccccccccccccccccHHHcccccccEEccEccccccccEEEEcccccccccEEEEEccccHHHHHHHHHcccccEEEEccEcccccccEEEEcccHHHHHHHHcccccEEEEEEHHHHHccccEEEEEc
mwkgsrkfqrsgpwnglqfsasplrpnpifnfslvsnedelcytfdmrdkAAFSRIVMNQTLYLLQRFIWNKATQswelysylprdlcdtyalcgacgvciindlpvchclngfkpksrgyvdwsqgcvrdkslnysrqdgfikftamklpdatpsrvsksmnlnecrekclensscmaytnsdirgggsgcamwfgelidmrdfpdagqDLYIRMSASEIGTRRLVYVTPL
mwkgsrkfqrsgpwnglqfsasplrpNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFtamklpdatpsrvsksmnlNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMsaseigtrrlvyvtpl
MWKGSRKFQRSGPWNGLQFSASPLRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYVTPL
***********************LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMK*****************CREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYV***
MWKGSRKFQRSGPWNGLQFSASPLRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYVTPL
************PWNGLQFSASPLRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYVTPL
*WKGSRKFQRSGPWNGLQFSASPLRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYVTPL
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MWKGSRKFQRSGPWNGLQFSASPLRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSRGYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGTRRLVYVTPL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query232 2.2.26 [Sep-21-2011]
O81832 783 G-type lectin S-receptor- yes no 0.948 0.280 0.455 3e-53
O81905 850 Receptor-like serine/thre no no 0.965 0.263 0.406 2e-51
Q39086 843 Receptor-like serine/thre no no 0.922 0.253 0.417 3e-49
Q9S972 847 Receptor-like serine/thre no no 0.931 0.255 0.404 8e-47
P0DH86 853 G-type lectin S-receptor- no no 0.909 0.247 0.403 1e-44
P0DH87 546 Putative inactive G-type no no 0.909 0.386 0.403 3e-44
P07761436 S-locus-specific glycopro N/A no 0.935 0.497 0.385 1e-43
O81833 815 G-type lectin S-receptor- no no 0.900 0.256 0.438 3e-43
P22551444 S-locus-specific glycopro N/A no 0.939 0.490 0.388 1e-42
P22552444 S-locus-specific glycopro N/A no 0.922 0.481 0.390 5e-42
>sp|O81832|Y4729_ARATH G-type lectin S-receptor-like serine/threonine-protein kinase At4g27290 OS=Arabidopsis thaliana GN=At4g27290 PE=3 SV=4 Back     alignment and function desciption
 Score =  208 bits (529), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 103/226 (45%), Positives = 147/226 (65%), Gaps = 6/226 (2%)

Query: 3   KGSRKFQRSGPWNGLQFSASP-LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQT 61
           K S    R+GPWNGL+F+  P L+PNPI+ +  V  E+E+ YT+ + + +  +R+ +N  
Sbjct: 206 KNSVVVFRTGPWNGLRFTGMPNLKPNPIYRYEYVFTEEEVYYTYKLENPSVLTRMQLNPN 265

Query: 62  LYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKS-RG 120
              LQR+ W    QSW  Y     D CD Y LCG+ G C IN+ P C CL GF  K+ + 
Sbjct: 266 -GALQRYTWVDNLQSWNFYLSAMMDSCDQYTLCGSYGSCNINESPACRCLKGFVAKTPQA 324

Query: 121 YV--DWSQGCVRDKSLNYSR-QDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSC 177
           +V  DWS+GCVR   L+  + +DGF+K + +KLPD   S   K+M+LNEC++ CL N +C
Sbjct: 325 WVAGDWSEGCVRRVKLDCGKGEDGFLKISKLKLPDTRTSWYDKNMDLNECKKVCLRNCTC 384

Query: 178 MAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGT 223
            AY+  DIR GG GC +WFG+LID+R++ + GQDLY+R+++SEI T
Sbjct: 385 SAYSPFDIRDGGKGCILWFGDLIDIREYNENGQDLYVRLASSEIET 430





Arabidopsis thaliana (taxid: 3702)
EC: 2EC: .EC: 7EC: .EC: 1EC: 1EC: .EC: 1
>sp|O81905|SD18_ARATH Receptor-like serine/threonine-protein kinase SD1-8 OS=Arabidopsis thaliana GN=SD18 PE=1 SV=1 Back     alignment and function description
>sp|Q39086|SD17_ARATH Receptor-like serine/threonine-protein kinase SD1-7 OS=Arabidopsis thaliana GN=SD17 PE=1 SV=1 Back     alignment and function description
>sp|Q9S972|SD16_ARATH Receptor-like serine/threonine-protein kinase SD1-6 OS=Arabidopsis thaliana GN=SD16 PE=1 SV=2 Back     alignment and function description
>sp|P0DH86|SRK_ARATH G-type lectin S-receptor-like serine/threonine-protein kinase SRK OS=Arabidopsis thaliana GN=SRK PE=2 SV=1 Back     alignment and function description
>sp|P0DH87|PSRK_ARATH Putative inactive G-type lectin S-receptor-like serine/threonine-protein kinase SRK OS=Arabidopsis thaliana GN=PSEUDOSRKA PE=5 SV=1 Back     alignment and function description
>sp|P07761|SLSG6_BRAOL S-locus-specific glycoprotein S6 OS=Brassica oleracea GN=SLSG PE=2 SV=2 Back     alignment and function description
>sp|O81833|SD11_ARATH G-type lectin S-receptor-like serine/threonine-protein kinase SD1-1 OS=Arabidopsis thaliana GN=SD11 PE=1 SV=1 Back     alignment and function description
>sp|P22551|SLSG0_BRAOA S-locus-specific glycoprotein OS=Brassica oleracea var. alboglabra GN=SLSG PE=2 SV=1 Back     alignment and function description
>sp|P22552|SLSG1_BRAOA S-locus-specific glycoprotein BS29-1 OS=Brassica oleracea var. alboglabra GN=SLSG PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query232
224113157 786 predicted protein [Populus trichocarpa] 0.948 0.279 0.571 1e-73
224113153 785 predicted protein [Populus trichocarpa] 0.948 0.280 0.571 3e-73
224126243 836 predicted protein [Populus trichocarpa] 0.943 0.261 0.573 2e-71
224151394 439 predicted protein [Populus trichocarpa] 0.991 0.523 0.550 5e-70
224113149 820 predicted protein [Populus trichocarpa] 0.943 0.267 0.562 5e-70
224115136 843 predicted protein [Populus trichocarpa] 0.965 0.265 0.567 1e-69
224124250 820 predicted protein [Populus trichocarpa] 0.943 0.267 0.553 5e-69
356546914 788 PREDICTED: G-type lectin S-receptor-like 0.952 0.280 0.528 8e-68
356545297 824 PREDICTED: G-type lectin S-receptor-like 0.965 0.271 0.534 3e-66
356514895 821 PREDICTED: G-type lectin S-receptor-like 0.965 0.272 0.521 8e-66
>gi|224113157|ref|XP_002332644.1| predicted protein [Populus trichocarpa] gi|222832839|gb|EEE71316.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  281 bits (720), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 128/224 (57%), Positives = 172/224 (76%), Gaps = 4/224 (1%)

Query: 1   MWKGSRKFQRSGPWNGLQFSASP-LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMN 59
           +W+GS+++ RSGPW G+ F+ +P L  NP+F  + VS+EDE+  ++++++ +AFSRIV+N
Sbjct: 210 IWRGSQQYFRSGPWTGIAFTGAPELVQNPVFKLNFVSSEDEVYLSYNLKNISAFSRIVVN 269

Query: 60  QTLYLLQRFIWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKSR 119
           QT    + + WN+ATQ+W LY+ +PRD CD YA CGA G CIINDLP+C CL  FKPKS 
Sbjct: 270 QTTNYREAYTWNEATQTWVLYASVPRDSCDNYASCGANGNCIINDLPICRCLKKFKPKSP 329

Query: 120 ---GYVDWSQGCVRDKSLNYSRQDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSS 176
                +DWS GCVR+K LN  + DGF+K+  +K PDAT S ++KSMNLNECR KCL+N S
Sbjct: 330 EKWNLMDWSDGCVRNKPLNCQKGDGFVKYLGLKWPDATHSWLNKSMNLNECRAKCLQNCS 389

Query: 177 CMAYTNSDIRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASE 220
           CMAY+NSD+RGGGSGC +W+G+LID+R FP  GQ+LYIRM+ SE
Sbjct: 390 CMAYSNSDVRGGGSGCIIWYGDLIDIRQFPAGGQELYIRMNPSE 433




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224113153|ref|XP_002332643.1| predicted protein [Populus trichocarpa] gi|222832838|gb|EEE71315.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224126243|ref|XP_002319790.1| predicted protein [Populus trichocarpa] gi|222858166|gb|EEE95713.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224151394|ref|XP_002337099.1| predicted protein [Populus trichocarpa] gi|222838019|gb|EEE76384.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224113149|ref|XP_002332642.1| predicted protein [Populus trichocarpa] gi|222832837|gb|EEE71314.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224115136|ref|XP_002316951.1| predicted protein [Populus trichocarpa] gi|222860016|gb|EEE97563.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224124250|ref|XP_002319283.1| predicted protein [Populus trichocarpa] gi|222857659|gb|EEE95206.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356546914|ref|XP_003541865.1| PREDICTED: G-type lectin S-receptor-like serine/threonine-protein kinase At4g27290-like [Glycine max] Back     alignment and taxonomy information
>gi|356545297|ref|XP_003541080.1| PREDICTED: G-type lectin S-receptor-like serine/threonine-protein kinase At4g27290-like [Glycine max] Back     alignment and taxonomy information
>gi|356514895|ref|XP_003526137.1| PREDICTED: G-type lectin S-receptor-like serine/threonine-protein kinase At4g27290-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query232
TAIR|locus:2131684 783 AT4G27290 [Arabidopsis thalian 0.918 0.272 0.461 1.5e-53
TAIR|locus:2141181 850 RK3 "receptor kinase 3" [Arabi 0.965 0.263 0.406 2.6e-50
TAIR|locus:2018546 843 RK1 "receptor kinase 1" [Arabi 0.922 0.253 0.417 5e-48
TAIR|locus:2018506 847 RK2 "receptor kinase 2" [Arabi 0.931 0.255 0.404 7.9e-46
TAIR|locus:2131694 815 AT4G27300 [Arabidopsis thalian 0.900 0.256 0.438 1.7e-43
TAIR|locus:2088619439 AT3G12000 [Arabidopsis thalian 0.806 0.425 0.369 2.5e-33
TAIR|locus:2059103 828 AT2G19130 [Arabidopsis thalian 0.926 0.259 0.331 4.3e-29
TAIR|locus:2200151 830 SD1-13 "S-domain-1 13" [Arabid 0.956 0.267 0.344 9.1e-29
TAIR|locus:2141176 849 B120 [Arabidopsis thaliana (ta 0.918 0.250 0.330 6.4e-27
TAIR|locus:2200888 804 AT1G61500 [Arabidopsis thalian 0.909 0.262 0.337 1e-24
TAIR|locus:2131684 AT4G27290 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 554 (200.1 bits), Expect = 1.5e-53, P = 1.5e-53
 Identities = 101/219 (46%), Positives = 145/219 (66%)

Query:    10 RSGPWNGLQFSASP-LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRF 68
             R+GPWNGL+F+  P L+PNPI+ +  V  E+E+ YT+ + + +  +R+ +N     LQR+
Sbjct:   213 RTGPWNGLRFTGMPNLKPNPIYRYEYVFTEEEVYYTYKLENPSVLTRMQLNPN-GALQRY 271

Query:    69 IWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKS-RGYV--DWS 125
              W    QSW  Y     D CD Y LCG+ G C IN+ P C CL GF  K+ + +V  DWS
Sbjct:   272 TWVDNLQSWNFYLSAMMDSCDQYTLCGSYGSCNINESPACRCLKGFVAKTPQAWVAGDWS 331

Query:   126 QGCVRDKSLNYSR-QDGFIKFTAMKLPDATPSRVSKSMNLNECREKCLENSSCMAYTNSD 184
             +GCVR   L+  + +DGF+K + +KLPD   S   K+M+LNEC++ CL N +C AY+  D
Sbjct:   332 EGCVRRVKLDCGKGEDGFLKISKLKLPDTRTSWYDKNMDLNECKKVCLRNCTCSAYSPFD 391

Query:   185 IRGGGSGCAMWFGELIDMRDFPDAGQDLYIRMSASEIGT 223
             IR GG GC +WFG+LID+R++ + GQDLY+R+++SEI T
Sbjct:   392 IRDGGKGCILWFGDLIDIREYNENGQDLYVRLASSEIET 430




GO:0004672 "protein kinase activity" evidence=IEA;ISS
GO:0004674 "protein serine/threonine kinase activity" evidence=IEA
GO:0004713 "protein tyrosine kinase activity" evidence=IEA
GO:0005524 "ATP binding" evidence=IEA
GO:0005886 "plasma membrane" evidence=ISM
GO:0006468 "protein phosphorylation" evidence=IEA
GO:0016772 "transferase activity, transferring phosphorus-containing groups" evidence=IEA
GO:0030246 "carbohydrate binding" evidence=IEA
GO:0048544 "recognition of pollen" evidence=IEA
GO:0048765 "root hair cell differentiation" evidence=RCA
TAIR|locus:2141181 RK3 "receptor kinase 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2018546 RK1 "receptor kinase 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2018506 RK2 "receptor kinase 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2131694 AT4G27300 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2088619 AT3G12000 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2059103 AT2G19130 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2200151 SD1-13 "S-domain-1 13" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2141176 B120 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2200888 AT1G61500 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query232
pfam00954110 pfam00954, S_locus_glycop, S-locus glycoprotein fa 2e-35
pfam0827666 pfam08276, PAN_2, PAN-like domain 7e-23
cd0109884 cd01098, PAN_AP_plant, Plant PAN/APPLE-like domain 3e-19
smart0047378 smart00473, PAN_AP, divergent subfamily of APPLE d 1e-11
>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family Back     alignment and domain information
 Score =  121 bits (305), Expect = 2e-35
 Identities = 41/110 (37%), Positives = 65/110 (59%), Gaps = 2/110 (1%)

Query: 10  RSGPWNGLQFSASP-LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRF 68
           RSGPWNG++FS  P ++    + ++   N +E+ YT+ M + + +SR+ ++    L +RF
Sbjct: 2   RSGPWNGIRFSGIPEMQKLSYYVYNFTENNEEVYYTYRMTNNSIYSRLTLSSEGSL-ERF 60

Query: 69  IWNKATQSWELYSYLPRDLCDTYALCGACGVCIINDLPVCHCLNGFKPKS 118
            W   +Q W L+   P+D CD Y  CG  G C +N  P C+C+ GF PK+
Sbjct: 61  TWIPNSQDWNLFWSAPKDQCDVYGRCGPYGYCDVNTSPKCNCIKGFVPKN 110


In Brassicaceae, self-incompatible plants have a self/non-self recognition system. This is sporophytically controlled by multiple alleles at a single locus (S). S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Length = 110

>gnl|CDD|219774 pfam08276, PAN_2, PAN-like domain Back     alignment and domain information
>gnl|CDD|238531 cd01098, PAN_AP_plant, Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins Back     alignment and domain information
>gnl|CDD|214680 smart00473, PAN_AP, divergent subfamily of APPLE domains Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 232
PF00954110 S_locus_glycop: S-locus glycoprotein family; Inter 100.0
PF0827666 PAN_2: PAN-like domain; InterPro: IPR013227 PAN do 99.74
cd0109884 PAN_AP_plant Plant PAN/APPLE-like domain; present 99.63
cd0012980 PAN_APPLE PAN/APPLE-like domain; present in N-term 99.54
smart0047378 PAN_AP divergent subfamily of APPLE domains. Apple 98.75
cd0110073 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like d 97.72
PF0002479 PAN_1: PAN domain This Prosite entry concerns appl 94.8
PF1429551 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A. 94.15
smart0022379 APPLE APPLE domain. Four-fold repeat in plasma kal 93.88
PF0827771 PAN_3: PAN-like domain; InterPro: IPR006583 PAN do 92.76
smart0060594 CW CW domain. 91.47
PF00954110 S_locus_glycop: S-locus glycoprotein family; Inter 90.14
PF0764542 EGF_CA: Calcium-binding EGF domain; InterPro: IPR0 85.92
cd0005336 EGF Epidermal growth factor domain, found in epide 84.72
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation Back     alignment and domain information
Probab=100.00  E-value=8.7e-33  Score=207.45  Aligned_cols=109  Identities=46%  Similarity=1.036  Sum_probs=104.2

Q ss_pred             EeccccCCceeeecc-CCCCCceEEEEEEeCCeEEEEEEecCCCcEEEEEEeecCceeEEEEEecCCCCEEEEEeecCCC
Q 035606            9 QRSGPWNGLQFSASP-LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFIWNKATQSWELYSYLPRDL   87 (232)
Q Consensus         9 w~sG~W~g~~f~~~p-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rl~ld~~G~l~~~~~w~~~~~~w~~~~~~p~~~   87 (232)
                      ||||+|||..|+|+| |.....+.+.|+.+++|+||+|.+.+.+.+.|++||++|++ +++.|.+..+.|...|++|.++
T Consensus         1 wrsG~WnG~~f~g~p~~~~~~~~~~~fv~~~~e~~~t~~~~~~s~~~r~~ld~~G~l-~~~~w~~~~~~W~~~~~~p~d~   79 (110)
T PF00954_consen    1 WRSGPWNGQRFSGIPEMSSNSLYNYSFVSNNEEVYYTYSLSNSSVLSRLVLDSDGQL-QRYIWNESTQSWSVFWSAPKDQ   79 (110)
T ss_pred             CCccccCCeEECCcccccccceeEEEEEECCCeEEEEEecCCCceEEEEEEeeeeEE-EEEEEecCCCcEEEEEEecccC
Confidence            899999999999999 88888999999999999999999988889999999999999 9999999999999999999999


Q ss_pred             CCccCCCCCCcccccCCCceecCCCCceeCC
Q 035606           88 CDTYALCGACGVCIINDLPVCHCLNGFKPKS  118 (232)
Q Consensus        88 C~~~~~CG~~g~C~~~~~~~C~C~~Gf~~~~  118 (232)
                      |++|+.||+||+|+.+..+.|+||+||+|++
T Consensus        80 Cd~y~~CG~~g~C~~~~~~~C~Cl~GF~P~n  110 (110)
T PF00954_consen   80 CDVYGFCGPNGICNSNNSPKCSCLPGFEPKN  110 (110)
T ss_pred             CCCccccCCccEeCCCCCCceECCCCcCCCc
Confidence            9999999999999887788999999999863



This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen

>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions [] Back     alignment and domain information
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins Back     alignment and domain information
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins Back     alignment and domain information
>smart00473 PAN_AP divergent subfamily of APPLE domains Back     alignment and domain information
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins Back     alignment and domain information
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions [] Back     alignment and domain information
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A Back     alignment and domain information
>smart00223 APPLE APPLE domain Back     alignment and domain information
>PF08277 PAN_3: PAN-like domain; InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions [] Back     alignment and domain information
>smart00605 CW CW domain Back     alignment and domain information
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation Back     alignment and domain information
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins Back     alignment and domain information
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query232
1dlp_A236 Lectin scafet precursor; two-domain lectin, beta p 96.48
3m7h_A276 Putidacin L1; monocot mannose-binding lectin, bact 96.23
2ll3_A82 Micronemal protein 4; cell adhesion; NMR {Toxoplas 94.23
4a5v_A161 MIC4, micronemal protein 4; adhesion; NMR {Toxopla 83.49
4a5v_A161 MIC4, micronemal protein 4; adhesion; NMR {Toxopla 81.02
>1dlp_A Lectin scafet precursor; two-domain lectin, beta prism II fold, native, sugar binding protein; 3.30A {Hyacinthoides hispanica} SCOP: b.78.1.1 b.78.1.1 Back     alignment and structure
Probab=96.48  E-value=0.00081  Score=55.44  Aligned_cols=64  Identities=9%  Similarity=-0.023  Sum_probs=40.9

Q ss_pred             ccCCeeEEeccccCCceeeecc-CCCCCceEEEEEEeCCeEEEEEEecCCCcEEEEEEeecCceeEEEE
Q 035606            2 WKGSRKFQRSGPWNGLQFSASP-LRPNPIFNFSLVSNEDELCYTFDMRDKAAFSRIVMNQTLYLLQRFI   69 (232)
Q Consensus         2 ~~~~~~yw~sG~W~g~~f~~~p-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rl~ld~~G~l~~~~~   69 (232)
                      ++++.+||++|+|++. +...+ |.. ....+.+..++++++.+|..... ...|++|++||++ ++|.
T Consensus       165 ~~~~~~~W~s~~~~~~-~~~~~~L~~-~Gnl~ly~~~~~~vw~s~~~~~~-~~~rl~Ld~dG~l-~ly~  229 (236)
T 1dlp_A          165 FDRDDRVWSTNTAGKG-TGCRAVLQP-NGRMDVLTNQNIAVWTSGNSRSA-GRYVFVLQPDRNL-AIYG  229 (236)
T ss_dssp             EBTTBCCSCCCCCSSC-SSCEEEEET-TTEEEEEETTTEEEEECCCCCSS-SCCEEEECSSSCE-EEEC
T ss_pred             EcCCEeEEeCCCcCCC-CceEEEECC-CCcEEEEeCCCcEEEEeCCCCCC-CCEEEEEcCCCcE-EEeC
Confidence            5567799999999764 55666 532 22222232234456666654433 3579999999999 9884



>3m7h_A Putidacin L1; monocot mannose-binding lectin, bacteriocin, LLPA, pseudomon bacterial toxin, siras, antimicrobial protein; 2.20A {Pseudomonas SP} PDB: 3m7j_A* Back     alignment and structure
>2ll3_A Micronemal protein 4; cell adhesion; NMR {Toxoplasma gondii} PDB: 2ll4_M* Back     alignment and structure
>4a5v_A MIC4, micronemal protein 4; adhesion; NMR {Toxoplasma gondii} Back     alignment and structure
>4a5v_A MIC4, micronemal protein 4; adhesion; NMR {Toxoplasma gondii} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query232
d1emoa239 Fibrillin-1 {Human (Homo sapiens) [TaxId: 9606]} 92.1
d2vj3a142 Neurogenic locus notch homolog protein 1, Notch1 { 89.96
d2qj2a191 Hepatocyte growth factor {Human (Homo sapiens) [Ta 86.73
d1xkba139 Factor X, N-terminal module {Human (Homo sapiens) 86.64
d2vj3a239 Neurogenic locus notch homolog protein 1, Notch1 { 83.46
d1xd5a_112 Gastrodianin (antifungal protein) {Gastrodia elata 82.48
d1lmja144 Fibrillin-1 {Human (Homo sapiens) [TaxId: 9606]} 82.18
d1i0ua241 Low density lipoprotein (LDL) receptor, different 80.88
>d1emoa2 g.3.11.1 (A:2167-2205) Fibrillin-1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Small proteins
fold: Knottins (small inhibitors, toxins, lectins)
superfamily: EGF/Laminin
family: EGF-type module
domain: Fibrillin-1
species: Human (Homo sapiens) [TaxId: 9606]
Probab=92.10  E-value=0.037  Score=30.13  Aligned_cols=31  Identities=35%  Similarity=0.717  Sum_probs=24.2

Q ss_pred             CCCCccCCCCCCcccccC-CCceecCCCCceeC
Q 035606           86 DLCDTYALCGACGVCIIN-DLPVCHCLNGFKPK  117 (232)
Q Consensus        86 ~~C~~~~~CG~~g~C~~~-~~~~C~C~~Gf~~~  117 (232)
                      |.|.+-+.|| +|-|+.- ..-.|.|-.||+|-
T Consensus         2 dECsignpCG-nGTCtNviGgFec~C~~GFepg   33 (39)
T d1emoa2           2 DECSVGNPCG-NGTCKNVIGGFECTCEEGFEPG   33 (39)
T ss_dssp             CGGGTTCSSS-SSCEEECSSSEEECCSSSSCCC
T ss_pred             CccccCCccC-CceeeecccceEEeecCCcCCC
Confidence            6788999998 4778654 34579999999874



>d2vj3a1 g.3.11.1 (A:411-452) Neurogenic locus notch homolog protein 1, Notch1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2qj2a1 g.10.1.1 (A:36-126) Hepatocyte growth factor {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1xkba1 g.3.11.1 (A:48-86) Factor X, N-terminal module {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2vj3a2 g.3.11.1 (A:453-491) Neurogenic locus notch homolog protein 1, Notch1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1xd5a_ b.78.1.1 (A:) Gastrodianin (antifungal protein) {Gastrodia elata [TaxId: 91201]} Back     information, alignment and structure
>d1lmja1 g.3.11.1 (A:3-46) Fibrillin-1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1i0ua2 g.3.11.1 (A:42-82) Low density lipoprotein (LDL) receptor, different EGF domains {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure