Citrus sinensis ID: 026685


Local Sequence Feature Prediction

Prediction (Method) and Result
Residue Number Marker
Protein Sequence
Secondary Structure (PSIPRED)
Secondary Structure Prediction (SSPRO)
Coil and Loop (DISEMBL)
Flexible Loop (DISEMBL)
Low Complexity Region (SEG)
Disordered Region (IsUnstruct)
Disordered Region (DISOPRED)
Disordered Region (DISEMBL)
Disordered Region (DISPRO)
Transmembrane Helix (TMHMM)
Transmembrane Helix (HMMTOP)
Transmembrane Helix (MEMSAT)
TM Helix, Signal Peptide (MEMSAT_SVM)
TM Helix, Signal Peptide (Phobius)
Signal Peptide (SignalP HMM Mode)
Signal Peptide (SignalP NN Mode)
Coiled Coils (COILS)
Positional Conservation
Result rows follow in the same order as the labels above, one row per label.
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-----
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASNNNNNKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRRF
ccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccEEEEEEEEEccccEEEEEEEEccEEEEEEHHHHHcccccccccHHcccc
cccccccccccccEEEcccccccccccccccccccccccccccccEEEccccccccccccccccccccEEccccHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccEEEEEccccccccEEEEEccccEEEccccccEEEEccccccHHHcccc
mlqthhllslnfpftvshhpqklnflqkptislsafprrrpliepyclaqaqepttnitapttseegpvelpqsifattdepsslQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKAlgsssidakgppspvqaLLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFsvcasnnnnnkdyckwvvlpcnicfwhQLSWFVsifwpacpeffhrrf
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAkrakelkfrssgakkslkdeALDNLKALgsssidakgppSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASNNNNNKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRRF
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVflfralrrrakrakelkfrSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASnnnnnKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRRF
*****HLLSLNFPFTVSHHPQKLNFLQKPTISLSAF************************************************LQVATSVLLTGAISVFLFRALRR********************************************QALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNF************************************************
**********NFPFTVSHHPQK*******************LIEPYCLAQAQEPT*************VELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRA*******************************************ALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASNNNNNKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRR*
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASNNNNNKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRRF
***THHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEP***********EGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS***************KALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASNNNNNKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRRF
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVCASNNNNNKDYCKWVVLPCNICFWHQLSWFVSIFWPACPEFFHRRF
no confident homologs detected
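The per-residue tracks above encode one state per character, so contiguous runs of a state correspond to predicted segments. The following is a minimal sketch of how such runs could be listed; the function name `state_segments` and the short toy track are illustrative assumptions, not part of the report, and the state letters of each predictor should be checked against that tool's own legend.

```python
from itertools import groupby


def state_segments(track, state):
    """Return 1-based inclusive (start, end) runs of `state` in a per-residue track."""
    segments = []
    position = 1
    for symbol, run in groupby(track):
        length = len(list(run))
        if symbol == state:
            segments.append((position, position + length - 1))
        position += length
    return segments


# Toy track (hypothetical, much shorter than the 235-residue rows above).
toy_track = "ooooHHHHHiiiiHHHooo"
helices = state_segments(toy_track, "H")
print(helices)
```

Applied to one of the transmembrane rows, the same call would list the residue ranges that predictor marks as helix.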

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST

GI         Length  Definition                                Q cover  H cover  Identity  E-value
Query      235
225461564  277     PREDICTED: uncharacterized protein LOC10  0.574    0.487    0.753     9e-51
449457015  282     PREDICTED: uncharacterized protein LOC10  0.787    0.656    0.613     1e-50
118488906  238     unknown [Populus trichocarpa x Populus d  0.787    0.777    0.611     1e-49
224116710  211     predicted protein [Populus trichocarpa]   0.493    0.549    0.810     3e-47
356544064  277     PREDICTED: uncharacterized protein LOC10  0.714    0.606    0.59      1e-45
388496044  230     unknown [Medicago truncatula]             0.693    0.708    0.522     4e-45
255564541  285     conserved hypothetical protein [Ricinus   0.795    0.656    0.595     7e-43
297834372  261     hypothetical protein ARALYDRAFT_478942 [  0.757    0.681    0.53      2e-42
21617902   266     unknown [Arabidopsis thaliana]            0.757    0.669    0.54      3e-42
18400636   266     uncharacterized protein [Arabidopsis tha  0.757    0.669    0.54      5e-42
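The report does not say how Q cover and H cover are derived. One reading that reproduces the first hit's values is the number of gap-free alignment columns divided by the sequence length; the sketch below checks that arithmetic using the 142 alignment columns and 7 gaps reported for that hit, the query length of 235, and the hit length of 277. The function name `coverage` is hypothetical, and the formula is an assumption inferred from the numbers rather than a documented definition.

```python
def coverage(aligned_columns, gap_columns, sequence_length):
    """Fraction of a sequence covered by gap-free alignment columns (assumed definition)."""
    return (aligned_columns - gap_columns) / sequence_length


# First hit: 142 alignment columns, 7 of them gapped.
q_cover = coverage(142, 7, 235)  # query length 235
h_cover = coverage(142, 7, 277)  # hit length 277
print(f"{q_cover:.3f} {h_cover:.3f}")
```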
>gi|225461564|ref|XP_002282834.1| PREDICTED: uncharacterized protein LOC100267434 [Vitis vinifera] gi|302142945|emb|CBI20240.3| unnamed protein product [Vitis vinifera]
 Score =  206 bits (523), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 107/142 (75%), Positives = 122/142 (85%), Gaps = 7/142 (4%)

Query: 51  AQEPTTNITAPTTSEEGPVELP---QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRR 107
           AQ P T  TAP   EEGP+ELP    SIFAT D+P+ LQVATSVLLTGAISVFLFR++RR
Sbjct: 53  AQVPDTTTTAP---EEGPIELPPSSSSIFATNDDPTPLQVATSVLLTGAISVFLFRSIRR 109

Query: 108 RAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIIL 167
           R KRAKEL+FRSSG KK+LK+EALD+LKA+GS S+ A  PPSPVQALLGG+TAGVIA+IL
Sbjct: 110 RVKRAKELRFRSSGVKKTLKEEALDSLKAMGSGSVKA-APPSPVQALLGGITAGVIALIL 168

Query: 168 YKFTTTIEAALNRQTISDNFSV 189
           YKFT TIEA+LNRQT+SDNFSV
Sbjct: 169 YKFTITIEASLNRQTVSDNFSV 190
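The "Identities" figure counts alignment columns in which query and subject carry the same residue. As a small illustration, the sketch below counts identical columns in the last alignment block only (query 168-189 against subject 169-190); the helper name `count_identities` is an assumption, and the full 107/142 figure would need all three blocks concatenated.

```python
def count_identities(query_row, subject_row):
    """Count columns where both aligned rows hold the same residue (gaps never match)."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query_row, subject_row))


# Last block of the Vitis vinifera alignment, copied from the rows above.
query_row = "YKFTTTIEAALNRQTISDNFSV"
subject_row = "YKFTITIEASLNRQTVSDNFSV"
matches = count_identities(query_row, subject_row)
print(matches, len(query_row))
```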




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449457015|ref|XP_004146244.1| PREDICTED: uncharacterized protein LOC101221005 [Cucumis sativus] gi|449495512|ref|XP_004159863.1| PREDICTED: uncharacterized LOC101221005 [Cucumis sativus]
>gi|118488906|gb|ABK96262.1| unknown [Populus trichocarpa x Populus deltoides]
>gi|224116710|ref|XP_002317372.1| predicted protein [Populus trichocarpa] gi|222860437|gb|EEE97984.1| predicted protein [Populus trichocarpa]
>gi|356544064|ref|XP_003540475.1| PREDICTED: uncharacterized protein LOC100799393 [Glycine max]
>gi|388496044|gb|AFK36088.1| unknown [Medicago truncatula]
>gi|255564541|ref|XP_002523266.1| conserved hypothetical protein [Ricinus communis] gi|223537479|gb|EEF39105.1| conserved hypothetical protein [Ricinus communis]
>gi|297834372|ref|XP_002885068.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp. lyrata] gi|297330908|gb|EFH61327.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp. lyrata]
>gi|21617902|gb|AAM66952.1| unknown [Arabidopsis thaliana]
>gi|18400636|ref|NP_566500.1| uncharacterized protein [Arabidopsis thaliana] gi|87116616|gb|ABD19672.1| At3g15110 [Arabidopsis thaliana] gi|332642098|gb|AEE75619.1| uncharacterized protein [Arabidopsis thaliana]

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

ID                  Length  Definition                      Q cover  H cover  Identity  E-value
Query               235
TAIR|locus:2083676  266     AT3G15110 "AT3G15110" [Arabido  0.757    0.669    0.465     1.8e-32
TAIR|locus:2083676 AT3G15110 "AT3G15110" [Arabidopsis thaliana (taxid:3702)]
 Score = 355 (130.0 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 93/200 (46%), Positives = 120/200 (60%)

Query:     1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
             +LQ+H  L  + P+ +   P +L     P  SLS+F R RP I    L+  +E   ++  
Sbjct:     3 VLQSHQCL-FSLPYRL--RPTRLI---SPIHSLSSFTRIRPGI--IRLSAVKE-IADVAE 53

Query:    61 PTTSEEGPVELP----------QSIFATTDEPSSLQVATSVLLTGAISVXXXXXXXXXXX 110
                 E+GP+ELP           SIFAT+D+P+ LQ+ATSVLLTGAI+V           
Sbjct:    54 --VEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query:   111 XXXXXXXXSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
                     S+GAKKSLK+EA+DNLKAL S+ I+     PS  QA LG + AGVIA+ILYK
Sbjct:   112 RAKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query:   170 FTTTIEAALNRQTISDNFSV 189
             FT T+E+ LNRQTISDNFSV
Sbjct:   172 FTVTVESGLNRQTISDNFSV 191


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.133   0.412    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      235       211   0.00081  112 3  11 22  0.48    32
                                                     31  0.48    35
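The raw score of 355 and the 130.0 bits reported for the AT3G15110 hit are related by the standard Karlin-Altschul normalization, S' = (lambda * S - ln K) / ln 2, and the reported P follows from the expectation as P = 1 - exp(-E). The sketch below checks both, assuming the gapped parameters listed for Q=9,R=2 (lambda 0.244, K 0.0300) are the ones applied to this hit; the function names are illustrative.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits using Karlin-Altschul parameters."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_value(expect):
    """Probability of at least one chance hit given the expected number of hits."""
    return -math.expm1(-expect)  # numerically stable 1 - exp(-E) for tiny E


bits = bit_score(355, 0.244, 0.0300)  # assumed gapped lambda and K from the table above
p = p_value(1.8e-32)
print(f"{bits:.1f} bits, P = {p:.1e}")
```

For expectations this small, P and E are numerically indistinguishable, which is why the report shows the same value for both.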


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  603 (64 KB)
  Total size of DFA:  176 KB (2102 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  20.08u 0.18s 20.26t   Elapsed:  00:00:01
  Total cpu time:  20.08u 0.18s 20.26t   Elapsed:  00:00:01
  Start:  Sat May 11 01:49:31 2013   End:  Sat May 11 01:49:32 2013


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
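Each GO line above pairs a term with an evidence code: ND (no biological data available), ISM (inferred from sequence model), and IDA (inferred from direct assay). A minimal sketch of splitting these lines into fields and keeping only the experimentally supported term follows; the regular expression and the name `parse_go_line` are assumptions fitted to the four lines shown, not a general GO annotation parser.

```python
import re

# Pattern fitted to lines such as: GO:0009507 "chloroplast" evidence=ISM
GO_LINE = re.compile(r'^(GO:\d{7}) "([^"]*)" evidence=(\w+)$')


def parse_go_line(line):
    """Split one annotation line into id, name, and evidence code."""
    match = GO_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized GO line: {line!r}")
    go_id, name, evidence = match.groups()
    return {"id": go_id, "name": name, "evidence": evidence}


lines = [
    'GO:0003674 "molecular_function" evidence=ND',
    'GO:0008150 "biological_process" evidence=ND',
    'GO:0009507 "chloroplast" evidence=ISM',
    'GO:0009535 "chloroplast thylakoid membrane" evidence=IDA',
]
terms = [parse_go_line(line) for line in lines]
experimental = [term["id"] for term in terms if term["evidence"] == "IDA"]
print(experimental)
```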

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

No confident hit for EC number transfer from SWISS-PROT detected by BLAST

EC Number Prediction by EzyPred Server

Failed to connect to the EzyPred server

EC Number Prediction by EFICAz Software

No EC number assigned; the protein is probably not an enzyme.


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Failed to connect to the STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

Conserved Domains Detected by HHsearch

ID       Length  Definition                                          Probability
Query    235
PF11282  82      DUF3082: Protein of unknown function (DUF3082); In  99.77
COG1963  153     Uncharacterized protein conserved in bacteria [Fun  83.32
>PF11282 DUF3082: Protein of unknown function (DUF3082); InterPro: IPR021434 This family of proteins has no known function
Probab=99.77  E-value=5e-20  Score=141.02  Aligned_cols=63  Identities=27%  Similarity=0.365  Sum_probs=60.9

Q ss_pred             CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccC-CCceeeeeeec-----cCceeeeeeccccc
Q 026685          147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTIS-DNFSVCASNNN-----NNKDYCKWVVLPCN  209 (235)
Q Consensus       147 ppSP~QallGav~AGvIA~iLYkFTT~IeaSf~rQ~lp-DnysaRnItIt-----~GL~YLatfV~~an  209 (235)
                      +|||+|||+||++||+||+++|+||++|+++|++||++ |||+++||+++     +|+|||+||+|+.|
T Consensus         1 ~~~Pl~~l~Ga~~ag~la~~ly~lt~~i~~~fa~~p~~s~~~~a~~Ia~~vRTlv~Gl~~LaTfiF~~~   69 (82)
T PF11282_consen    1 KPTPLRCLSGALIAGGLAYGLYFLTTSIAASFASKPIHSSNYIAQNIASAVRTLVVGLCYLATFIFGFV   69 (82)
T ss_pred             CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            48999999999999999999999999999999999999 99999999998     79999999999877
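The statistics line that precedes this alignment is a sequence of `key=value` fields, so it can be read into a dictionary for filtering or tabulation. This is a minimal sketch for that one line; the helper name `parse_hit_stats` is hypothetical, and values are kept as strings because fields such as Identities carry a percent sign.

```python
def parse_hit_stats(line):
    """Split a whitespace-separated key=value line into a dict of strings."""
    stats = {}
    for field in line.split():
        key, _, value = field.partition("=")
        stats[key] = value
    return stats


line = ("Probab=99.77  E-value=5e-20  Score=141.02  Aligned_cols=63  "
        "Identities=27%  Similarity=0.365  Sum_probs=60.9")
stats = parse_hit_stats(line)
probability = float(stats["Probab"])
identity_percent = float(stats["Identities"].rstrip("%"))
print(probability, stats["Aligned_cols"], identity_percent)
```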



>COG1963 Uncharacterized protein conserved in bacteria [Function unknown]

Homologous Structure Templates

Structure Templates Detected by BLAST

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

No hit with e-value below 0.005

Structure Templates Detected by HHsearch

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch

No hit with probability above 80.00