Citrus Sinensis ID: 047235


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270------
MKTLPLRPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA
ccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHcccHHHHHHHHHHHccccccHHHHHHHHHHHHHHccccccccEEEEEEccccccccccEEEccccHHHcccccccccccccccHHHHHHHHccccccccccccccEEEEEEcccccccccccccccccccccccccccccccccccccccccc
ccccccccccccccccccccccccHcccccccccHHHHHccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHccccHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHcccccEEEEEcccccccccEEEEEEHHEHcccccEEEEEEccccccHHHHHHHccccEEEccccccccEEEEEcccccccccccHcccccHHHcccccccccccccccccccccc
mktlplrpstisfssspprshhipklhpqlqsprfsvlrssiqpqpqappikresdtsrteykpgvlddlFLSSFRNKLVQEvgldsekpgydGLIELVNHLMmkgksssdaRDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLmghctvnsvdlpdgtscqsgvFVERCKyleeskcvgvcintcklptqtffkdymgvpllmepnfsdyscqfkfgilpplpkddttlkepcldicptssrrkevamnsnveqcpka
mktlplrpstisfssspprshHIPKLHPQLQSPRFSVLRssiqpqpqappikresdtsrteyKPGVLDDLFLSSFRNKLVQevgldsekpgyDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDIcptssrrkevamnsnveqcpka
MKTLplrpstisfssspprshhipKLHPQLQSPRFSVLRSSIqpqpqappIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA
****************************************************************GVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMM**********AAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDIC*********************
*************************************************************YKPGVLDDLFLSSFRNKLVQ****DSEKPGYDGLIELVNH***************QIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVD****TSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICP***************QCP**
**************************HPQLQSPRFSV***********************EYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMK********DAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSR****************
************FS*SP**SHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPI******SRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPT*******************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKTLPLRPSTISFSSSPPRSHHIPKLHPQLQSPRFSVLRSSIQPQPQAPPIKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query276
449436415267 PREDICTED: uncharacterized protein LOC10 0.902 0.932 0.652 2e-98
449507826267 PREDICTED: uncharacterized LOC101220175 0.902 0.932 0.648 8e-98
225437593261 PREDICTED: uncharacterized protein LOC10 0.913 0.965 0.627 2e-95
357442215260 hypothetical protein MTR_1g086840 [Medic 0.782 0.830 0.718 3e-91
363807932264 uncharacterized protein LOC100788939 [Gl 0.775 0.810 0.728 8e-91
255548419282 conserved hypothetical protein [Ricinus 0.931 0.911 0.660 5e-90
224068582200 predicted protein [Populus trichocarpa] 0.724 1.0 0.727 2e-84
351720764265 uncharacterized protein LOC100499918 [Gl 0.775 0.807 0.691 3e-83
242089393277 hypothetical protein SORBIDRAFT_09g00257 0.775 0.772 0.649 3e-79
297814129263 hypothetical protein ARALYDRAFT_912030 [ 0.746 0.783 0.693 3e-77
>gi|449436415|ref|XP_004135988.1| PREDICTED: uncharacterized protein LOC101220175 [Cucumis sativus] Back     alignment and taxonomy information
 Score =  364 bits (935), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 171/262 (65%), Positives = 219/262 (83%), Gaps = 13/262 (4%)

Query: 26  LHPQLQSP---RFSVLRSSIQPQP--------QAPPIKRESDTSRTEYKPGVLDDLFLSS 74
           LHP   SP   RF+  +S ++ +P        Q+  IK +  +S+ EYKPG+LDD FL+ 
Sbjct: 8   LHPPTISPAPLRFTSSKSHLRNRPFILSCSALQSGSIK-DGASSKAEYKPGILDDFFLNV 66

Query: 75  FRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYK 134
           FR+K+VQEVG DSEKPGYDGLIE+ + L M GK++S+  +A+ +RIL++LFPPL+LKLY+
Sbjct: 67  FRSKMVQEVGWDSEKPGYDGLIEVASRLTMTGKTNSETIEAS-VRILIALFPPLLLKLYR 125

Query: 135 ILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESK 194
           IL+SP+AGGK+AA+MVARVTALTCQWLMG CTVNS++LPDG+SCQSGVFVE+CKYLEESK
Sbjct: 126 ILVSPIAGGKVAAIMVARVTALTCQWLMGTCTVNSIELPDGSSCQSGVFVEKCKYLEESK 185

Query: 195 CVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
           C+G+CINTCKLPTQ+FFKD MG+PLLMEPNF+DYSCQFKFG+LPPLP++D+ LKEPCL+I
Sbjct: 186 CIGICINTCKLPTQSFFKDQMGIPLLMEPNFTDYSCQFKFGVLPPLPEEDSILKEPCLEI 245

Query: 255 CPTSSRRKEVAMNSNVEQCPKA 276
           CP ++RR+EV+   +  QCPKA
Sbjct: 246 CPNATRRREVSGKISAAQCPKA 267




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449507826|ref|XP_004163139.1| PREDICTED: uncharacterized LOC101220175 [Cucumis sativus] Back     alignment and taxonomy information
>gi|225437593|ref|XP_002271003.1| PREDICTED: uncharacterized protein LOC100253777 [Vitis vinifera] gi|297743993|emb|CBI36963.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357442215|ref|XP_003591385.1| hypothetical protein MTR_1g086840 [Medicago truncatula] gi|355480433|gb|AES61636.1| hypothetical protein MTR_1g086840 [Medicago truncatula] Back     alignment and taxonomy information
>gi|363807932|ref|NP_001242708.1| uncharacterized protein LOC100788939 [Glycine max] gi|255647168|gb|ACU24052.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|255548419|ref|XP_002515266.1| conserved hypothetical protein [Ricinus communis] gi|223545746|gb|EEF47250.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224068582|ref|XP_002302776.1| predicted protein [Populus trichocarpa] gi|222844502|gb|EEE82049.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|351720764|ref|NP_001237955.1| uncharacterized protein LOC100499918 [Glycine max] gi|255627671|gb|ACU14180.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|242089393|ref|XP_002440529.1| hypothetical protein SORBIDRAFT_09g002570 [Sorghum bicolor] gi|241945814|gb|EES18959.1| hypothetical protein SORBIDRAFT_09g002570 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|297814129|ref|XP_002874948.1| hypothetical protein ARALYDRAFT_912030 [Arabidopsis lyrata subsp. lyrata] gi|297320785|gb|EFH51207.1| hypothetical protein ARALYDRAFT_912030 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query276
TAIR|locus:504955232258 AT4G01995 "AT4G01995" [Arabido 0.789 0.844 0.659 3.8e-78
TAIR|locus:2010856250 AT1G64680 "AT1G64680" [Arabido 0.706 0.78 0.347 6.2e-30
TAIR|locus:505006097264 D27 "AT1G03055" [Arabidopsis t 0.721 0.753 0.303 6.6e-26
TAIR|locus:504955232 AT4G01995 "AT4G01995" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 786 (281.7 bits), Expect = 3.8e-78, P = 3.8e-78
 Identities = 149/226 (65%), Positives = 179/226 (79%)

Query:    51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSS 110
             +K +    + EYKPG LDD F+ SFRNKLV+EVG DSEKPGY GLIELV  L++KG++ S
Sbjct:    41 VKSDEGAPKLEYKPGPLDDFFMQSFRNKLVEEVGSDSEKPGYVGLIELVKLLLLKGRTRS 100

Query:   111 DARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV 170
             +  DAA +RIL SLFPPL+L+LYK+LI+P+A GK+AA+MVARVT LTCQWLMG   VN +
Sbjct:   101 ETSDAA-VRILKSLFPPLILELYKLLIAPIAQGKLAALMVARVTVLTCQWLMGPSKVNII 159

Query:   171 DLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSC 230
             DLP+G S  SGVFVE+C+YLEESKCVGVCINTCKLPTQTFFKDYMGVPL+MEPNF DYSC
Sbjct:   160 DLPNGESWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFKDYMGVPLVMEPNFKDYSC 219

Query:   231 QFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
             QFKFG+ PP  +DD  + EPC + C  + RRK   + S   +CP A
Sbjct:   220 QFKFGVAPP--EDDGNVNEPCFETCSIAGRRK---LKSG--ECPLA 258




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
TAIR|locus:2010856 AT1G64680 "AT1G64680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006097 D27 "AT1G03055" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.II.884.1
hypothetical protein (200 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query276
pfam1322585 pfam13225, DUF4033, Domain of unknown function (DU 2e-47
>gnl|CDD|221990 pfam13225, DUF4033, Domain of unknown function (DUF4033) Back     alignment and domain information
 Score =  152 bits (387), Expect = 2e-47
 Identities = 50/85 (58%), Positives = 63/85 (74%)

Query: 160 WLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPL 219
           WL+G C VN V+   G   +SGV +++C+YLEES C G+C+NTCKLPTQTFFK+  G+PL
Sbjct: 1   WLVGPCEVNEVENGLGRGEKSGVLIKKCRYLEESGCAGMCVNTCKLPTQTFFKEEFGLPL 60

Query: 220 LMEPNFSDYSCQFKFGILPPLPKDD 244
            MEPNF DYSCQ  FG  PP  ++D
Sbjct: 61  TMEPNFEDYSCQMIFGQTPPPIEED 85


This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is approximately 80 amino acids in length. Length = 85

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 276
PF1322586 DUF4033: Domain of unknown function (DUF4033) 100.0
>PF13225 DUF4033: Domain of unknown function (DUF4033) Back     alignment and domain information
Probab=100.00  E-value=6.4e-55  Score=343.15  Aligned_cols=85  Identities=56%  Similarity=1.171  Sum_probs=82.7

Q ss_pred             ccccceeeeeee-CCCCceeccceEeeccccccccCcccccccccCCchhhhhhhhcCCceeeccCcccceeeeeeCCCC
Q 047235          160 WLMGHCTVNSVD-LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP  238 (276)
Q Consensus       160 WLVGP~eV~~ve-v~dG~~~~sgV~IeKCRyLEeSgC~GmCvN~CKiPTQ~FF~e~~GlPLtM~PNFED~SCqm~FG~~P  238 (276)
                      ||||||+|+++| ..+|++++|||+||||||||||||+|||+|+||+|||+||+|+|||||||+||||||||||+||++|
T Consensus         1 WLvGp~~v~~~e~~~~~~~~~sgV~i~kCRyLEes~C~g~C~N~CK~PtQ~Ff~~~~Glpl~M~PNfed~SC~~~FG~~P   80 (86)
T PF13225_consen    1 WLVGPCEVNEVEENGNGRGQKSGVHIEKCRYLEESGCAGMCVNMCKIPTQTFFKEEFGLPLTMEPNFEDFSCQMIFGQTP   80 (86)
T ss_pred             CcccccEeeEeeccCCCccccceEEEEEeEEeecCCceeeeehhcccchHHHHHhccCCceEecCCCcCcEEEEEcCCCC
Confidence            999999999999 3488889999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCCCC
Q 047235          239 PLPKDD  244 (276)
Q Consensus       239 Pp~eeD  244 (276)
                      ||+|||
T Consensus        81 pp~eeD   86 (86)
T PF13225_consen   81 PPIEED   86 (86)
T ss_pred             CCCCCC
Confidence            999998




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00