Citrus Sinensis ID: 029169


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------20
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
ccccccccccccccEEcccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccEEEEcccccc
cccccccccccccEEEccccccccccccccccccccccccccccccEHccccccccccccccccccccEEccccHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccEEcHHEHccc
mlqthhllslnfpftvshhpqklnflqkptislsafprrrpliepyclaqaqepttnitapttseegpvelpqsifattdepsslQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKAlgsssidakgppspvqaLLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAkrakelkfrssgakkslkdeALDNLKALgsssidakgppSPVQALLGGLTAGVIAIILYKFTTTIEAALnrqtisdnfsvcssiflsdi
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVflfralrrrakrakelkfrSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
*****HLLSLNFPFTVSHHPQKLNFLQKPTISLSAF************************************************LQVATSVLLTGAISVFLFRALRR********************************************QALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFL***
*********LNFPFTVSHHPQKLN*****************LIEPYCLAQAQEPTTN********EGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRA*******************************************ALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
*L*THHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNIT******EGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS***************KALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCSSIFLSDI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query198
449457015282 PREDICTED: uncharacterized protein LOC10 0.939 0.659 0.616 8e-52
225461564277 PREDICTED: uncharacterized protein LOC10 0.681 0.487 0.753 1e-50
118488906238 unknown [Populus trichocarpa x Populus d 0.934 0.777 0.601 1e-50
224116710211 predicted protein [Populus trichocarpa] 0.585 0.549 0.810 2e-47
356544064277 PREDICTED: uncharacterized protein LOC10 0.848 0.606 0.59 3e-46
255564541285 conserved hypothetical protein [Ricinus 0.944 0.656 0.595 1e-43
297834372261 hypothetical protein ARALYDRAFT_478942 [ 0.898 0.681 0.53 2e-42
21617902266 unknown [Arabidopsis thaliana] 0.898 0.669 0.54 2e-42
18400636266 uncharacterized protein [Arabidopsis tha 0.898 0.669 0.54 5e-42
388496044230 unknown [Medicago truncatula] 0.616 0.530 0.617 1e-36
>gi|449457015|ref|XP_004146244.1| PREDICTED: uncharacterized protein LOC101221005 [Cucumis sativus] gi|449495512|ref|XP_004159863.1| PREDICTED: uncharacterized LOC101221005 [Cucumis sativus] Back     alignment and taxonomy information
 Score =  208 bits (530), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 122/198 (61%), Positives = 143/198 (72%), Gaps = 12/198 (6%)

Query: 1   MLQTHHLLSLNFP-FTVS--HHPQKLNFLQKPTISLSAFPR------RRPLIEPYCLAQA 51
           M  T +LLS NFP FT+S   +  KL FL  PT +LS+  R        PL    C    
Sbjct: 1   MWHTQNLLSSNFPLFTLSPPTYNHKL-FLSPPT-TLSSLHRPITFHSVSPLTNHRCFCLP 58

Query: 52  QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
           Q  T    A    + GPVELP +IFATTD+PSSLQVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 59  QF-TDLADATFLDDNGPVELPPTIFATTDDPSSLQVATSVLLTGAISVFLFRSLRRRAKR 117

Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
            KELKFRS G KKSLK+EA+D+LKA+ +  I++K  PSP+QA LG + AGVIA+ILYKFT
Sbjct: 118 VKELKFRSGGVKKSLKEEAMDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFT 177

Query: 172 TTIEAALNRQTISDNFSV 189
           TTIEA+LNRQT+SDNFSV
Sbjct: 178 TTIEASLNRQTVSDNFSV 195




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225461564|ref|XP_002282834.1| PREDICTED: uncharacterized protein LOC100267434 [Vitis vinifera] gi|302142945|emb|CBI20240.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|118488906|gb|ABK96262.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|224116710|ref|XP_002317372.1| predicted protein [Populus trichocarpa] gi|222860437|gb|EEE97984.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356544064|ref|XP_003540475.1| PREDICTED: uncharacterized protein LOC100799393 [Glycine max] Back     alignment and taxonomy information
>gi|255564541|ref|XP_002523266.1| conserved hypothetical protein [Ricinus communis] gi|223537479|gb|EEF39105.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|297834372|ref|XP_002885068.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp. lyrata] gi|297330908|gb|EFH61327.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|21617902|gb|AAM66952.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|18400636|ref|NP_566500.1| uncharacterized protein [Arabidopsis thaliana] gi|87116616|gb|ABD19672.1| At3g15110 [Arabidopsis thaliana] gi|332642098|gb|AEE75619.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388496044|gb|AFK36088.1| unknown [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query198
TAIR|locus:2083676266 AT3G15110 "AT3G15110" [Arabido 0.898 0.669 0.465 1.8e-32
TAIR|locus:2083676 AT3G15110 "AT3G15110" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 355 (130.0 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 93/200 (46%), Positives = 120/200 (60%)

Query:     1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
             +LQ+H  L  + P+ +   P +L     P  SLS+F R RP I    L+  +E   ++  
Sbjct:     3 VLQSHQCL-FSLPYRL--RPTRLI---SPIHSLSSFTRIRPGI--IRLSAVKE-IADVAE 53

Query:    61 PTTSEEGPVELP----------QSIFATTDEPSSLQVATSVLLTGAISVXXXXXXXXXXX 110
                 E+GP+ELP           SIFAT+D+P+ LQ+ATSVLLTGAI+V           
Sbjct:    54 --VEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query:   111 XXXXXXXXSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
                     S+GAKKSLK+EA+DNLKAL S+ I+     PS  QA LG + AGVIA+ILYK
Sbjct:   112 RAKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query:   170 FTTTIEAALNRQTISDNFSV 189
             FT T+E+ LNRQTISDNFSV
Sbjct:   172 FTVTVESGLNRQTISDNFSV 191


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.129   0.360    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      198       179   0.00089  109 3  11 23  0.45    32
                                                     31  0.39    35


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  550 (59 KB)
  Total size of DFA:  122 KB (2079 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  17.37u 0.07s 17.44t   Elapsed:  00:00:01
  Total cpu time:  17.37u 0.07s 17.44t   Elapsed:  00:00:01
  Start:  Fri May 10 08:46:54 2013   End:  Fri May 10 08:46:55 2013


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 198
PF1128282 DUF3082: Protein of unknown function (DUF3082); In 99.38
COG1963153 Uncharacterized protein conserved in bacteria [Fun 86.89
>PF11282 DUF3082: Protein of unknown function (DUF3082); InterPro: IPR021434 This family of proteins has no known function Back     alignment and domain information
Probab=99.38  E-value=2.7e-13  Score=101.94  Aligned_cols=49  Identities=33%  Similarity=0.470  Sum_probs=47.0

Q ss_pred             CCChHHHHHHHHHHHHHHHHHHHHhhhhhhhhcccccC-CCcceeeeeec
Q 029169          147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTIS-DNFSVCSSIFL  195 (198)
Q Consensus       147 ~pSp~QaflGai~AgvIA~iLYkFtTtVeASf~rQtlp-DnfSvRqIti~  195 (198)
                      +|||+|+|+|+++||+||++||+|+++|+++|++|+++ |||.+|||++.
T Consensus         1 ~~~Pl~~l~Ga~~ag~la~~ly~lt~~i~~~fa~~p~~s~~~~a~~Ia~~   50 (82)
T PF11282_consen    1 KPTPLRCLSGALIAGGLAYGLYFLTTSIAASFASKPIHSSNYIAQNIASA   50 (82)
T ss_pred             CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCCcccHHHHHHHHH
Confidence            68999999999999999999999999999999999999 99999999863



>COG1963 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query198
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 44.1 bits (103), Expect = 2e-05
 Identities = 39/228 (17%), Positives = 63/228 (27%), Gaps = 44/228 (19%)

Query: 2   LQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAP 61
             ++ +      F +      L     P   L    +    I+P       + ++NI   
Sbjct: 171 CLSYKVQC-KMDFKI--FWLNLKNCNSPETVLEMLQKLLYQIDPN-WTSRSDHSSNI--K 224

Query: 62  TTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLF--RAL---RRRA-----KR 111
                   EL + +  +    + L V  +V    A + F    + L   R +        
Sbjct: 225 LRIHSIQAEL-RRLLKSKPYENCLLVLLNVQNAKAWNAFNLSCKILLTTRFKQVTDFLSA 283

Query: 112 AKELKFRSSGAKKSL-KDEALDNL-KALGSSSIDAKGPPSPVQALLGG-LTAGVIA---- 164
           A            +L  DE    L K L     D      P + L        +IA    
Sbjct: 284 ATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQD-----LPREVLTTNPRRLSIIAESIR 338

Query: 165 -----------IILYKFTTTIEAALNRQT---ISDNFSVCSSIFLSDI 198
                      +   K TT IE++LN          F    S+F    
Sbjct: 339 DGLATWDNWKHVNCDKLTTIIESSLNVLEPAEYRKMFDRL-SVFPPSA 385


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00