Citrus Sinensis ID: 029249


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190------
MGNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSSSVSDEDKQPDHQKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKTKRLHQESVHENLQTVQEKLIDSGAETQRDKENTNQVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
cccccHHHHHHHEEEcccccEEEEEEcccHHHHHHcccccEEEEEEcccccccccccccccccccccEEEEEEcccccccccccEEEEEEHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHccccccccccccccHHHHHHcccccccccEEEEcccEEEcccccEEEEEEccccHHHHHHHHHHHHHccc
cccHHHHHHHEEEEEcccccEEEEEEEccHHHHHHHccccEEEEEEcccccccccccccccccccEEEEEEEEcccccEEEEEEEEEEEcHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccHccccccccccEEEEHHHcccccccEEEEEccccccccHHEEEEEccccHHHHHHHHHHHHHHHcc
MGNCQAIDAAALVIqhpngrierlywpvtasevmrmnpghyvsliiplpsssvsdedkqpdhqktvrftrvkllrpsdtlslghaYRLVTSQEVMKVLRAKKSAKTKRLHQESVHENLQTVQEKLIDSgaetqrdkentNQVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
MGNCQAIDAAAlviqhpngrieRLYWPVTASEVMRMNPGHYVSLIIPLPSssvsdedkqpdhqktvrftrvkllrpsdtlslghayrlvtsQEVMKVLRAKKSaktkrlhqesvhenlQTVQEKLIDsgaetqrdkentnqVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
MGNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYvsliiplpsssvsDEDKQPDHQKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKTKRLHQESVHENLQTVQEKLIDSGAETQRDKENTNQVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
****QAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPL******************RFTRVKLLRPSDTLSLGHAYRLVTSQEVMKV*******************************************QVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIF*
***C*AIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIP*********************TRVKLLRPSDTLSLGHAYRLVTSQEV*************************************************************ILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
MGNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSS***********QKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAK************VHENLQTVQEKLIDSGAETQRDKENTNQVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
*GNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSS************KTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKTKR*******************************NQVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSSSVSDEDKQPDHQKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKTKRLHQESVHENLQTVQEKLIDSGAETQRDKENTNQVSIYLSTFKRIAKGILLTQSEFETCTGFLSFISIYPFSHTYISMHLMLVRLKIFR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query196
449478544169 PREDICTED: uncharacterized LOC101209663 0.683 0.792 0.690 3e-46
224099705180 predicted protein [Populus trichocarpa] 0.698 0.761 0.689 3e-46
449435384108 PREDICTED: uncharacterized protein LOC10 0.551 1.0 0.787 4e-44
147866500222 hypothetical protein VITISV_025958 [Viti 0.698 0.617 0.680 6e-44
225424766175 PREDICTED: uncharacterized protein LOC10 0.698 0.782 0.680 7e-44
224111302158 predicted protein [Populus trichocarpa] 0.683 0.848 0.690 2e-43
255581188187 conserved hypothetical protein [Ricinus 0.770 0.807 0.573 1e-42
356497998172 PREDICTED: uncharacterized protein LOC10 0.693 0.790 0.624 4e-41
388506026175 unknown [Lotus japonicus] 0.683 0.765 0.619 4e-39
18406580173 uncharacterized protein [Arabidopsis tha 0.709 0.803 0.575 2e-38
>gi|449478544|ref|XP_004155347.1| PREDICTED: uncharacterized LOC101209663 [Cucumis sativus] Back     alignment and taxonomy information
 Score =  190 bits (482), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 112/142 (78%), Gaps = 8/142 (5%)

Query: 1   MGNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSSSVSDEDKQP 60
           MGNCQA+DAAALVIQHP+GRIERLYWPV ASEVMR NPGHYVSLIIPLP S   D++++P
Sbjct: 1   MGNCQAVDAAALVIQHPSGRIERLYWPVVASEVMRTNPGHYVSLIIPLPQS--EDDNREP 58

Query: 61  DHQKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKTKRLHQESVHENLQT 120
              KTVRFTRVKLLRP+DTL+LGHAYRLVT+QEVMKVLRAKK AK+K+   ES  E  QT
Sbjct: 59  ---KTVRFTRVKLLRPNDTLALGHAYRLVTTQEVMKVLRAKKYAKSKKPLPES-EEKPQT 114

Query: 121 VQEKLIDSGAETQRDKENTNQV 142
           V E+   S  + + D E   Q 
Sbjct: 115 VMEER--SAGDEEEDTEKNQQA 134




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224099705|ref|XP_002311585.1| predicted protein [Populus trichocarpa] gi|222851405|gb|EEE88952.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449435384|ref|XP_004135475.1| PREDICTED: uncharacterized protein LOC101209663, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|147866500|emb|CAN79847.1| hypothetical protein VITISV_025958 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225424766|ref|XP_002268639.1| PREDICTED: uncharacterized protein LOC100250100 [Vitis vinifera] gi|296086500|emb|CBI32089.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224111302|ref|XP_002315808.1| predicted protein [Populus trichocarpa] gi|222864848|gb|EEF01979.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255581188|ref|XP_002531407.1| conserved hypothetical protein [Ricinus communis] gi|223529000|gb|EEF30991.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356497998|ref|XP_003517842.1| PREDICTED: uncharacterized protein LOC100777177 [Glycine max] Back     alignment and taxonomy information
>gi|388506026|gb|AFK41079.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|18406580|ref|NP_564752.1| uncharacterized protein [Arabidopsis thaliana] gi|4249380|gb|AAD14477.1| ESTs gb|Z37637, gb|AA042498 and gb|AA042269 come from this gene [Arabidopsis thaliana] gi|26450481|dbj|BAC42354.1| unknown protein [Arabidopsis thaliana] gi|28827334|gb|AAO50511.1| unknown protein [Arabidopsis thaliana] gi|332195529|gb|AEE33650.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query196
TAIR|locus:2202690173 AT1G60010 "AT1G60010" [Arabido 0.709 0.803 0.520 3.2e-33
TAIR|locus:2194616166 AT1G10530 "AT1G10530" [Arabido 0.744 0.879 0.460 3.5e-27
TAIR|locus:2170116159 AT5G50090 "AT5G50090" [Arabido 0.734 0.905 0.448 1.8e-25
TAIR|locus:2170693161 AT5G62900 "AT5G62900" [Arabido 0.627 0.763 0.471 4.8e-23
TAIR|locus:2158651182 AT5G67620 "AT5G67620" [Arabido 0.668 0.719 0.368 2.5e-19
TAIR|locus:2202690 AT1G60010 "AT1G60010" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 362 (132.5 bits), Expect = 3.2e-33, P = 3.2e-33
 Identities = 76/146 (52%), Positives = 97/146 (66%)

Query:     1 MGNCQAIDAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYXXXXX-----XXXXXXXXD 55
             MGNCQA+DAAALV+QHP+G+I+R Y PV+ SE+MRM PGHY                   
Sbjct:     1 MGNCQAVDAAALVLQHPDGKIDRYYGPVSVSEIMRMYPGHYVSLIIPLPEKNIPATTTTT 60

Query:    56 EDKQPDHQKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKTKRLHQESVH 115
             +DK    +K VRFTRVKLLRP++ L LGHAYRL+TSQEVMKVLRAKK AKTK+   E+  
Sbjct:    61 DDKS--ERKVVRFTRVKLLRPTENLVLGHAYRLITSQEVMKVLRAKKYAKTKKHQSETSK 118

Query:   116 ENLQTVQEKLIDSGAETQRDKENTNQ 141
             E  +   EK ID  ++  ++ E  ++
Sbjct:   119 EKKKPSSEKKIDEESDKNQNLETKDE 144




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0006833 "water transport" evidence=RCA
GO:0009651 "response to salt stress" evidence=RCA
GO:0009750 "response to fructose stimulus" evidence=RCA
TAIR|locus:2194616 AT1G10530 "AT1G10530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2170116 AT5G50090 "AT5G50090" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2170693 AT5G62900 "AT5G62900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2158651 AT5G67620 "AT5G67620" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00081370
hypothetical protein (180 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query196
pfam14009181 pfam14009, DUF4228, Domain of unknown function (DU 7e-22
>gnl|CDD|222492 pfam14009, DUF4228, Domain of unknown function (DUF4228) Back     alignment and domain information
 Score = 87.4 bits (217), Expect = 7e-22
 Identities = 34/111 (30%), Positives = 48/111 (43%), Gaps = 23/111 (20%)

Query: 1   MGNCQAI------DAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSSSVS 54
           MGNC +        AA + + HP+G++     PVTA+E+M   PGH+V            
Sbjct: 1   MGNCLSCCLAPSGAAATVKVVHPDGKVREYSRPVTAAELMLEYPGHFVCD---------- 50

Query: 55  DEDKQPDHQKTVRFTRVKLLRPSDTLSLGHAYRLVTSQEVMKVLRAKKSAK 105
                 D     R  R+  L P D L  G  Y L+ ++ +  VL AK  A 
Sbjct: 51  -----SDSLYIGR--RIPALPPDDELERGQLYFLLPAERLQSVLAAKDMAS 94


This domain is found in plants. The function is not known. Length = 181

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 196
PF14009181 DUF4228: Domain of unknown function (DUF4228) 99.97
>PF14009 DUF4228: Domain of unknown function (DUF4228) Back     alignment and domain information
Probab=99.97  E-value=1.7e-31  Score=208.35  Aligned_cols=89  Identities=39%  Similarity=0.589  Sum_probs=75.8

Q ss_pred             CCccccc------cccEEEEEcCCCeEEeeecCccHhHHhhcCCCcEEEeeecCCCCCCCCCCCCCCCCccccceeeeec
Q 029249            1 MGNCQAI------DAAALVIQHPNGRIERLYWPVTASEVMRMNPGHYVSLIIPLPSSSVSDEDKQPDHQKTVRFTRVKLL   74 (196)
Q Consensus         1 MGNCqA~------daa~VVIqHP~GrVEr~y~PVSAaEVM~~nPGHyVal~i~~p~s~~~~~~~~~~~~g~vr~tRIklL   74 (196)
                      ||||.++      +..+|+|+|+||+|++|++||+|+|||.+|||||||    .++..             ...+++++|
T Consensus         1 MGn~~~~~~~~~~~~~~vkvv~~~G~v~~~~~pv~a~evm~~~P~h~v~----~~~~~-------------~~~~~~~~l   63 (181)
T PF14009_consen    1 MGNCVSCCLASSSSAATVKVVHPDGKVEEFKRPVTAAEVMLENPGHFVC----DSDSF-------------RFGRRIKPL   63 (181)
T ss_pred             CCCcccccccccCCCceEEEEcCCCcEEEeCCCcCHHHHHHHCCCCEEe----ccccc-------------cCCCcccCC
Confidence            9999998      778899999999999999999999999999999998    22111             114799999


Q ss_pred             CCCCccCCCCEEEEEehhHHHHHHHHhhHHHH
Q 029249           75 RPSDTLSLGHAYRLVTSQEVMKVLRAKKSAKT  106 (196)
Q Consensus        75 ~PDdtL~lGqvYrLIP~qeV~k~l~akk~~k~  106 (196)
                      +|||+|++|++|||+|.+.+.+.+.+...++.
T Consensus        64 ~~d~~L~~G~~Y~llP~~~~~~~~~~~~~~~~   95 (181)
T PF14009_consen   64 PPDEELQPGQIYFLLPMSRLQSVLSASDMASL   95 (181)
T ss_pred             CccCeecCCCEEEEEEccccCcccccchhccc
Confidence            99999999999999999998887766554444




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query196
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 39.5 bits (91), Expect = 4e-04
 Identities = 31/202 (15%), Positives = 64/202 (31%), Gaps = 39/202 (19%)

Query: 17  PNGRIERLYWPVTASEVMR-MNPGHYVSLI--------IPLPS----SSVSDEDKQPDHQ 63
           P   +  +++ V  S+VM  +N  H  SL+        I +PS      V  E++   H+
Sbjct: 388 PTILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALHR 447

Query: 64  KTVRFTRVKLLRPSDTLSLGH---------AYRLVTSQEVMKVLRAKKSAKTKRLHQESV 114
             V    +     SD L   +          + L        +   ++    + +     
Sbjct: 448 SIVDHYNIPKTFDSDDLIPPYLDQYFYSHIGHHL------KNIEHPERMTLFRMVFL--- 498

Query: 115 HENLQTVQEKLIDSGAETQRDKENTNQVSIYLSTFKRIAKGILLTQSEFETCTG-FLSFI 173
             + + +++K+              N     L   K     I     ++E      L F+
Sbjct: 499 --DFRFLEQKIRHDSTAWNASGSILNT----LQQLKFYKPYICDNDPKYERLVNAILDFL 552

Query: 174 SIYPFSHTYISMHLMLVRLKIF 195
                 +   S +  L+R+ + 
Sbjct: 553 PKIE-ENLICSKYTDLLRIALM 573


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00