Citrus Sinensis ID: 014574


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420--
MKSPKNSGDETRVDIQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCASVGGSDSGKGNAGEAADHDQCRDSTSSSSDSDSSSDINIKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
cccccccccccccccccccccccccccccccHHHccccHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccHHcHHHHHHHHHHHHHHHHHHccccEEccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEccccccccccccHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccEEEEccccEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHccccEEEEEEEEHHHHHHHHHHHHHHHHHHHHccEEEEc
cccccccccccEEEccccHcHccccccccccHccccccHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccHcHEEEHHHHHHHHHHHHHHHHHHHHHcHHHEEEEccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccEEEEcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccEEEEEEEEcHHHHHHHHHHHHHHHHHHHccEEEEc
mkspknsgdetrVDIQFNLQqenvplllqpsyarSKSLLFDELRNFRISLKWCaldhsscigkFISYFAFIFLAIIVPLISSlcikvpsstqdcaqdgpvsfnklvqfpesglaLVGFFILSRFFSKYGLRHLLFldglqedslfVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTvrvsfpggggdpwiNSIVMFVLVLASWVYRtgrrsgsdagfiFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGthadksffssgeLVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATcasvggsdsgkgnageaadhdqcrdstssssdsdsssdinikvlssqeppsFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
mkspknsgdetrVDIQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRsgsdagfifreHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCASVGGSDSGKGNAGEAADHDQCRDStssssdsdsssDINIKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
MKSPKNSGDETRVDIQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCASVGGSDSGKGNAGEAADHDQCRdstssssdsdsssdINIKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
**************IQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCAS************************************************FQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVL*
****************************************DELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQ*CAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCA****************************************VLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
**********TRVDIQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCAS***********************************INIKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
**********TRVDIQFNLQ*ENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCAS*************************************IKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKSPKNSGDETRVDIQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTGRRSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCASVGGSDSGKGNAGEAADHDQCRDSTSSSSDSDSSSDINIKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAFEFSLVLWILSKVVVLS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query422
224115706440 predicted protein [Populus trichocarpa] 0.912 0.875 0.651 1e-149
255574165436 conserved hypothetical protein [Ricinus 0.947 0.917 0.639 1e-147
224121452409 predicted protein [Populus trichocarpa] 0.893 0.921 0.664 1e-145
356554454455 PREDICTED: uncharacterized protein LOC10 0.959 0.890 0.593 1e-138
356565842454 PREDICTED: uncharacterized protein LOC10 0.928 0.863 0.596 1e-134
225429856460 PREDICTED: uncharacterized protein LOC10 0.931 0.854 0.627 1e-134
357436689482 hypothetical protein MTR_1g009230 [Medic 0.957 0.838 0.52 1e-130
356518296437 PREDICTED: uncharacterized protein LOC10 0.921 0.890 0.564 1e-129
296081799435 unnamed protein product [Vitis vinifera] 0.893 0.866 0.606 1e-126
297821303414 hypothetical protein ARALYDRAFT_480957 [ 0.890 0.908 0.556 1e-114
>gi|224115706|ref|XP_002332122.1| predicted protein [Populus trichocarpa] gi|222874942|gb|EEF12073.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  535 bits (1377), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 288/442 (65%), Positives = 329/442 (74%), Gaps = 57/442 (12%)

Query: 11  TRVDIQFNLQQENVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAF 70
           T+ + Q +  Q NVPLLLQPSYARSKSLLFDELRNFR+SL+WCALDHSSCIGK +SYF F
Sbjct: 26  TKENDQSDQLQANVPLLLQPSYARSKSLLFDELRNFRMSLRWCALDHSSCIGKTVSYFVF 85

Query: 71  IFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGL 130
           IFLAIIVP++SSL I+VPSS      D P+S+NKLVQ PES LA + FF L RFF +YGL
Sbjct: 86  IFLAIIVPIVSSLSIRVPSSA---PADDPISYNKLVQVPESALAFIAFFTLFRFFKRYGL 142

Query: 131 RHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFP-- 188
           R LLFLDGLQ+DSLFVR+GY+RELDKAFRYLACILLPSFFVELAHKII+FSTV++  P  
Sbjct: 143 RQLLFLDGLQDDSLFVRQGYSRELDKAFRYLACILLPSFFVELAHKIIFFSTVKIWLPYN 202

Query: 189 -GGGGDPWINSIVMFVLVLASWVYRTG---------------------------RRSGSD 220
               G P +NSI MFVLVLASWVYRTG                              GS 
Sbjct: 203 ISPHGIP-LNSI-MFVLVLASWVYRTGVFLLVCVLFRLTCELQILRFEGLHKLFEGCGSG 260

Query: 221 AGFIFREHLRIRKQLSATSHRYRFFIIASLVVISISQMGALLLILGTHADKSFFSSGELV 280
           AG IFREH+RI+KQLS TSHRYRFFII+ LV I++SQ GALLL+LG +  KSFF+SG+LV
Sbjct: 261 AGVIFREHVRIKKQLSLTSHRYRFFIISCLVTITVSQFGALLLVLGFNTQKSFFNSGDLV 320

Query: 281 VCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCASVGGSDSGKGNAGEAADHD 340
           +CSAV+LSGFFLC+LGAARITHRAQGI S+ATRWHM  T A     D GK ++       
Sbjct: 321 ICSAVQLSGFFLCLLGAARITHRAQGIVSIATRWHMNVTSA-FARVDQGKNHS------- 372

Query: 341 QCRDSTSSSSDSDSSSDINIKVLSSQEPPSFQIRQALVAYLQHNNGGITLFGFALDRGLL 400
                         SSD  I + SSQ+P +FQ RQALVAYLQHNNGGITLFGFALDRGLL
Sbjct: 373 -------------DSSDFFIAI-SSQDPCTFQTRQALVAYLQHNNGGITLFGFALDRGLL 418

Query: 401 HTIFAFEFSLVLWILSKVVVLS 422
           HT+FAFEFSL +WI+SKVVVLS
Sbjct: 419 HTLFAFEFSLGMWIMSKVVVLS 440




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255574165|ref|XP_002527998.1| conserved hypothetical protein [Ricinus communis] gi|223532624|gb|EEF34410.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224121452|ref|XP_002330831.1| predicted protein [Populus trichocarpa] gi|222872633|gb|EEF09764.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356554454|ref|XP_003545561.1| PREDICTED: uncharacterized protein LOC100783743 [Glycine max] Back     alignment and taxonomy information
>gi|356565842|ref|XP_003551145.1| PREDICTED: uncharacterized protein LOC100775945 [Glycine max] Back     alignment and taxonomy information
>gi|225429856|ref|XP_002280927.1| PREDICTED: uncharacterized protein LOC100262807 [Vitis vinifera] Back     alignment and taxonomy information
>gi|357436689|ref|XP_003588620.1| hypothetical protein MTR_1g009230 [Medicago truncatula] gi|355477668|gb|AES58871.1| hypothetical protein MTR_1g009230 [Medicago truncatula] Back     alignment and taxonomy information
>gi|356518296|ref|XP_003527815.1| PREDICTED: uncharacterized protein LOC100804399 [Glycine max] Back     alignment and taxonomy information
>gi|296081799|emb|CBI20804.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|297821303|ref|XP_002878534.1| hypothetical protein ARALYDRAFT_480957 [Arabidopsis lyrata subsp. lyrata] gi|297324373|gb|EFH54793.1| hypothetical protein ARALYDRAFT_480957 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query422
TAIR|locus:2047032414 AT2G21080 "AT2G21080" [Arabido 0.428 0.437 0.572 3.9e-108
TAIR|locus:2092424452 AT3G20300 [Arabidopsis thalian 0.483 0.451 0.389 8.8e-59
TAIR|locus:2008046453 AT1G50630 [Arabidopsis thalian 0.483 0.450 0.363 8.8e-57
TAIR|locus:2141481453 AT4G03820 "AT4G03820" [Arabido 0.464 0.432 0.339 1.6e-44
TAIR|locus:2132045437 MRB1 "AT4G22270" [Arabidopsis 0.459 0.443 0.351 9.6e-44
TAIR|locus:2008828456 AT1G67570 "AT1G67570" [Arabido 0.466 0.432 0.347 9.3e-36
TAIR|locus:2047032 AT2G21080 "AT2G21080" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 484 (175.4 bits), Expect = 3.9e-108, Sum P(3) = 3.9e-108
 Identities = 110/192 (57%), Positives = 124/192 (64%)

Query:    23 NVPLLLQPSYARSKSLLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISS 82
             N PL  QPS    KSL  D+LRNFR+ LKWCALDHSS  GK +SY  F+   ++VPLIS 
Sbjct:    18 NTPL--QPS----KSLPSDDLRNFRLLLKWCALDHSSSCGKAVSYMMFVVFTLLVPLISC 71

Query:    83 LCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQED 142
             L IK P +      D   SFN LVQFPESGLA++GF  L  FF  Y L  LLFLD    D
Sbjct:    72 LFIKTPRNRPSAVMDAN-SFNVLVQFPESGLAVIGFLTLICFFRIYSLTKLLFLD----D 126

Query:   143 SLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMF 202
             S  VR GY+RELDKA RYLA IL+PSF VEL HK I+F +  VSFP         + VMF
Sbjct:   127 STLVRLGYSRELDKALRYLAYILVPSFLVELVHKSIFFYSAEVSFPFIKSSCAALNFVMF 186

Query:   203 VLVLASWVYRTG 214
              LVL SWVYRTG
Sbjct:   187 FLVLFSWVYRTG 198


GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0005773 "vacuole" evidence=IDA
TAIR|locus:2092424 AT3G20300 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2008046 AT1G50630 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2141481 AT4G03820 "AT4G03820" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2132045 MRB1 "AT4G22270" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2008828 AT1G67570 "AT1G67570" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query422
pfam12056398 pfam12056, DUF3537, Protein of unknown function (D 1e-122
>gnl|CDD|221398 pfam12056, DUF3537, Protein of unknown function (DUF3537) Back     alignment and domain information
 Score =  359 bits (923), Expect = e-122
 Identities = 163/400 (40%), Positives = 219/400 (54%), Gaps = 39/400 (9%)

Query: 41  DELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPV 100
           DELR+FR  L+W   DHSS  G  +S+  F  LA+ VP IS   +       DC  D   
Sbjct: 4   DELRSFRSLLRWVCFDHSSPCGAALSWSVFFLLAVAVPAISHFLLPCR----DCDFDHRR 59

Query: 101 SFNKLVQFPESGLALVGFFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRY 160
            F+ LVQ   S  A V F  LSR+F KYGLR  LFLD L +DS  VR GYT E+ ++FR 
Sbjct: 60  PFDVLVQLSLSAFAAVSFLCLSRWFRKYGLRKFLFLDKLSDDSDRVRLGYTAEIKRSFRL 119

Query: 161 LACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWINSIVMFVLVLASWVYRTG------ 214
           LA  +LP F +E  +KI ++++     P  G      S+   +  LASW+YRT       
Sbjct: 120 LAYFVLPCFLLEAVYKIWWYASGGSQIPFIGNPVLSLSVACTLE-LASWMYRTTIFLLVC 178

Query: 215 ------------RRSG--------SDAGFIFREHLRIRKQLSATSHRYRFFIIASLVVIS 254
                       R           SD   + +EHLRIR+QLS  SHR+R FI+ SL++++
Sbjct: 179 VLFRLICHLQILRLDDFAKLFEGESDVESVLKEHLRIRRQLSIISHRFRKFILLSLLLVT 238

Query: 255 ISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRW 314
            SQ  ALLL      + +  + GEL +CS V +SG F+C+  AA+ITH+AQ +TS+A++W
Sbjct: 239 ASQFAALLLTTAASGEVNILNGGELALCSVVLVSGLFICLHSAAKITHKAQAVTSLASKW 298

Query: 315 HMTATCASVGGSD--SGKGNAGEAADHDQCRDSTSSSSDSDSSSDI-----NIKVLSSQE 367
           H  ATC+S   +   +       AA +   R  +SSSS SD S D      N K++ S  
Sbjct: 299 HACATCSSFDTTVDETPTSVLLPAASNLVLRPVSSSSSSSDESGDEEDDLDNTKIMPSYA 358

Query: 368 PP-SFQIRQALVAYLQHNNGGITLFGFALDRGLLHTIFAF 406
              SFQ RQALV YL++N  GIT++GF +DR  LHTIF  
Sbjct: 359 RTISFQKRQALVTYLENNRAGITVYGFTVDRTWLHTIFMI 398


This family of transmembrane proteins are functionally uncharacterized. This protein is found in eukaryotes. Proteins in this family are typically between 427 to 453 amino acids in length. Length = 398

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 422
PF12056398 DUF3537: Protein of unknown function (DUF3537); In 100.0
PF08395372 7tm_7: 7tm Chemosensory receptor; InterPro: IPR013 90.76
>PF12056 DUF3537: Protein of unknown function (DUF3537); InterPro: IPR021924 This family of transmembrane proteins are functionally uncharacterised Back     alignment and domain information
Probab=100.00  E-value=2.1e-153  Score=1147.18  Aligned_cols=363  Identities=50%  Similarity=0.804  Sum_probs=329.9

Q ss_pred             chHHHHHhHHhhhheeeccCCCcchhHHHHHHHHHHHHHHhhhhhhcccCCCCCccccCCCCcccccceecchhhHHHHH
Q 014574           38 LLFDELRNFRISLKWCALDHSSCIGKFISYFAFIFLAIIVPLISSLCIKVPSSTQDCAQDGPVSFNKLVQFPESGLALVG  117 (422)
Q Consensus        38 ~~~deLr~fr~~L~w~~lDqSs~~~~~~Sw~~F~~lav~vP~~~~~~~~c~~c~~~c~~~~~~~fe~~Vq~s~s~lAavS  117 (422)
                      |++||||+||++|+|||+||||++++++||++|+++|++||++++++++||    +||++|++|||++||+|||++|+||
T Consensus         1 h~~deL~~fr~~L~w~~~dqSs~~~~~~Sw~~F~ll~v~vP~~~~~~~~c~----~cd~~~~~~f~~~Vq~s~s~lAavs   76 (398)
T PF12056_consen    1 HARDELRSFRSFLRWCGLDQSSPLSAALSWSVFLLLAVAVPAASHFLLSCP----SCDKYHIRPFEKLVQLSQSALAAVS   76 (398)
T ss_pred             CchhHHHHHHHHHHHhhccCCchHHHHHHHHHHHHHHHHHHHHHHHhcCCC----CchhcCCCchhHHHHHHHHHHHHHH
Confidence            689999999999999999999999999999999999999999999999999    5669999999999999999999999


Q ss_pred             HHHhhhhhhhcccceecccccccchhHHHHhhHHHHHHHHHHHHHHHHhhHHHHHHhhheeEEEeccccCCCCCCCCchh
Q 014574          118 FFILSRFFSKYGLRHLLFLDGLQEDSLFVRRGYTRELDKAFRYLACILLPSFFVELAHKIIYFSTVRVSFPGGGGDPWIN  197 (422)
Q Consensus       118 f~cLS~~~rkyGLRrfLflD~l~~~s~~vr~gY~~~l~~sfrlLa~~vLPcf~~e~a~ki~wy~~v~~~~P~~~~~~~~~  197 (422)
                      |+|||||+||||||||||+||++|||++||+||++||++|||+|++|+||||++|+|||||||++|++++||++|++||+
T Consensus        77 f~cLS~~~rkyGlRr~LflD~l~~~s~~vr~gY~~~l~~sfrlLa~~vLPcf~~e~~~ki~~~~~~~~~~P~~~~~~~~~  156 (398)
T PF12056_consen   77 FLCLSHFLRKYGLRRFLFLDKLRDDSEEVRRGYTRQLNRSFRLLACWVLPCFLVEAAYKIWWYSSVAVQIPYIGHPSWWN  156 (398)
T ss_pred             HHHHHHHhHhcCceeeeeecccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHheEEeeccccCCccCCccHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHhhhhhhhcc--------------------------ccCCCHhHHHHHHHHHHHHHhhhhhhhHHHHHHHHH
Q 014574          198 SIVMFVLVLASWVYRTGR--------------------------RSGSDAGFIFREHLRIRKQLSATSHRYRFFIIASLV  251 (422)
Q Consensus       198 s~v~~~l~l~SW~YrT~i--------------------------~~~sDv~~il~EH~RIR~qL~~ISHRfR~Fil~~ll  251 (422)
                      + ++|+++++||+|||+|                          |+++||++||+||+|||+||+|||||||+|||+||+
T Consensus       157 s-va~~l~l~SW~YrT~ifl~~CvLFrLiC~LQiLr~~~~~klfe~~sdv~~il~EH~RiR~qL~~ISHRfR~Fil~~l~  235 (398)
T PF12056_consen  157 S-VACLLELASWLYRTTIFLLVCVLFRLICNLQILRFEGYAKLFERDSDVGVILEEHLRIRRQLSKISHRFRIFILLSLL  235 (398)
T ss_pred             H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhheehHhHHHHHccCCCHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence            9 9999999999999999                          999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHhhcccceeecccCceeehhhhHhhhHHHhhhhhhhhHhhhhhhhhHhhhhhheeeecccCCCC-CCC
Q 014574          252 VISISQMGALLLILGTHADKSFFSSGELVVCSAVELSGFFLCVLGAARITHRAQGITSVATRWHMTATCASVGGSD-SGK  330 (422)
Q Consensus       252 ~VTaSQf~aLl~tt~~~~~inf~~~GdLavcS~vqvsGl~icL~~AaKITHRAQ~I~siAskWHa~aT~~s~~~~~-~~~  330 (422)
                      +||||||++||+||++++++||+|+|||||||+|||+|++||||||||||||||+|||+||||||+|||+|+|.++ ++|
T Consensus       236 ~VTaSQf~sLl~~t~~~~~inf~~~GdlavcS~vqv~Gl~iCL~~AaKITHrAQ~I~siAskWHa~aTc~s~d~~~~~~p  315 (398)
T PF12056_consen  236 LVTASQFVSLLQTTRSKGDINFFNAGDLAVCSVVQVSGLFICLHGAAKITHRAQAITSIASKWHACATCSSFDSDQGETP  315 (398)
T ss_pred             HHHHHHHHHHHHHhccccceeEeeccceeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheecccccccccCC
Confidence            9999999999999999999999999999999999999999999999999999999999999999999999986322 344


Q ss_pred             CCC-CCccCCCC-CCCCCCCCCCC-----CCCcccccccCCCC-CCchHHhHHHHHHHhhcCCCceEEEEEEeehhhHHH
Q 014574          331 GNA-GEAADHDQ-CRDSTSSSSDS-----DSSSDINIKVLSSQ-EPPSFQIRQALVAYLQHNNGGITLFGFALDRGLLHT  402 (422)
Q Consensus       331 ~~~-~~~~~~~~-~~~~~~s~sd~-----ed~~~~~~~~~~~~-~~~SfqkRQALVtYLq~N~~GITvfGf~lDR~~LhT  402 (422)
                      ++. .+.+.++. ....++|+||+     ||+.++ ++..+.+ +|+||||||||||||||||||||||||+||||||||
T Consensus       316 ~~~~~~~~~~~~~~~~~~~~~sd~es~d~~~~~~~-~~~~~~~~~~~SfqKRQALVtYLq~N~aGITvfGf~lDR~~LhT  394 (398)
T PF12056_consen  316 TSDSLSAAANPFPSAGINSSSSDDESGDEEDDLDN-TKQLPSYADMISFQKRQALVTYLQNNRAGITVFGFVLDRGLLHT  394 (398)
T ss_pred             CCCccccccccccCCCCCCCccchhhcccCCCCCc-cccccccchhHHHHHHHHHHHHHhcCCCCeEEEEEEeehHHHHH
Confidence            432 11111111 11122222222     333333 4444444 599999999999999999999999999999999999


Q ss_pred             HHHH
Q 014574          403 IFAF  406 (422)
Q Consensus       403 IF~~  406 (422)
                      |||+
T Consensus       395 IF~i  398 (398)
T PF12056_consen  395 IFMI  398 (398)
T ss_pred             HhhC
Confidence            9985



This protein is found in eukaryotes. Proteins in this family are typically between 427 to 453 amino acids in length.

>PF08395 7tm_7: 7tm Chemosensory receptor; InterPro: IPR013604 This family includes a number of gustatory and odorant receptors mainly from insect species such as Anopheles gambiae (African malaria mosquito) and Drosophila melanogaster (Fruit fly) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00