Citrus Sinensis ID: 032374


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140--
MSLNALVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
cccccccccccccccccccccccccccccccccccccccccEEEEEEEccccccHHHHHccccccccccccccccccEEEEEEEEcccEEEEEEEEEEccHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
ccHHHHHccccccccccccEEEEcccccccccccccccccccEEEEEEEcccccHHHHHHccccccccccccccccccEEEEEEEccccEEEccEEEccHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHcHHHHHHHc
mslnalvrlplsssrthedvlvkhspfssrttqkpngrerQRRVLVVQAkgkrglqarqfqrppppslpkieddgnprFVIFIRMANVYLwyplsiitggTTAKIMVAAKDNFlgkyiykdTLARNLAAVIYRVSSVLLQLI
mslnalvrlplsssrthedvlvkhspfssrttqkpngrerqrrVLVVqakgkrglqarqfqrppppslpkieddgnPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
MSLNALVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
****************************************************************************PRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQ**
****A***L********************************************************************RFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
MSLNALVRLPLSSSRTHED*********************QRRVLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
****ALVRLPLSSSRTHEDVLVKHSPFSS**********RQRRVLVVQAKGKRGLQAR***RPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSLNALVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRVSSVLLQLI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query142
449450401231 PREDICTED: uncharacterized protein LOC10 0.922 0.567 0.766 3e-52
449450403228 PREDICTED: uncharacterized protein LOC10 0.922 0.574 0.766 4e-52
359473697229 PREDICTED: uncharacterized protein LOC10 0.901 0.558 0.761 6e-52
255547908233 oligopeptidase, putative [Ricinus commun 0.929 0.566 0.748 7e-52
297838515 985 F12A21.16 [Arabidopsis lyrata subsp. lyr 0.992 0.143 0.705 2e-50
30697554230 uncharacterized protein [Arabidopsis tha 0.929 0.573 0.731 3e-50
334183724229 uncharacterized protein [Arabidopsis tha 0.929 0.576 0.731 3e-50
30697556231 uncharacterized protein [Arabidopsis tha 0.929 0.571 0.731 3e-50
351723561239 uncharacterized protein LOC100306628 [Gl 0.936 0.556 0.719 6e-50
224107689237 predicted protein [Populus trichocarpa] 0.922 0.552 0.725 5e-49
>gi|449450401|ref|XP_004142951.1| PREDICTED: uncharacterized protein LOC101203734 isoform 1 [Cucumis sativus] gi|449494502|ref|XP_004159563.1| PREDICTED: uncharacterized protein LOC101226204 isoform 1 [Cucumis sativus] Back     alignment and taxonomy information
 Score =  209 bits (532), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 102/133 (76%), Positives = 116/133 (87%), Gaps = 2/133 (1%)

Query: 1   MSLNALVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQF 60
           MSLNALVRLPLS+S+  ED +V+HS FS+RT  KP+     RR L+V+AKGK+G+Q+R  
Sbjct: 5   MSLNALVRLPLSNSKLLEDGVVRHSLFSARTMHKPH--HTYRRPLLVEAKGKKGMQSRLS 62

Query: 61  QRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYK 120
           QRP PP LPKIEDDGNP+FVIFIRMANVYLWYPLS+I GGTTAKIMVAAKDNF+GKYIYK
Sbjct: 63  QRPGPPPLPKIEDDGNPKFVIFIRMANVYLWYPLSLINGGTTAKIMVAAKDNFVGKYIYK 122

Query: 121 DTLARNLAAVIYR 133
           DTLARNLAAVIYR
Sbjct: 123 DTLARNLAAVIYR 135




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449450403|ref|XP_004142952.1| PREDICTED: uncharacterized protein LOC101203734 isoform 2 [Cucumis sativus] gi|449494506|ref|XP_004159564.1| PREDICTED: uncharacterized protein LOC101226204 isoform 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|359473697|ref|XP_002272215.2| PREDICTED: uncharacterized protein LOC100256260 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255547908|ref|XP_002515011.1| oligopeptidase, putative [Ricinus communis] gi|223546062|gb|EEF47565.1| oligopeptidase, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|297838515|ref|XP_002887139.1| F12A21.16 [Arabidopsis lyrata subsp. lyrata] gi|297332980|gb|EFH63398.1| F12A21.16 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|30697554|ref|NP_850972.1| uncharacterized protein [Arabidopsis thaliana] gi|15215606|gb|AAK91348.1| At1g67700/F12A21_30 [Arabidopsis thaliana] gi|21435985|gb|AAM51570.1| At1g67700/F12A21_30 [Arabidopsis thaliana] gi|332196562|gb|AEE34683.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|334183724|ref|NP_001185346.1| uncharacterized protein [Arabidopsis thaliana] gi|332196564|gb|AEE34685.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|30697556|ref|NP_564903.3| uncharacterized protein [Arabidopsis thaliana] gi|21554073|gb|AAM63154.1| unknown [Arabidopsis thaliana] gi|222423377|dbj|BAH19661.1| AT1G67700 [Arabidopsis thaliana] gi|332196563|gb|AEE34684.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|351723561|ref|NP_001237795.1| uncharacterized protein LOC100306628 [Glycine max] gi|255629119|gb|ACU14904.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|224107689|ref|XP_002314565.1| predicted protein [Populus trichocarpa] gi|118487840|gb|ABK95743.1| unknown [Populus trichocarpa] gi|222863605|gb|EEF00736.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query142
TAIR|locus:2008615231 AT1G67700 "AT1G67700" [Arabido 0.929 0.571 0.731 3.3e-47
TAIR|locus:2008615 AT1G67700 "AT1G67700" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 494 (179.0 bits), Expect = 3.3e-47, P = 3.3e-47
 Identities = 98/134 (73%), Positives = 113/134 (84%)

Query:     1 MSLNALVRLPLSSSRTHEDV-LVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQ 59
             MSLNAL RLPL ++   E+V L +HS FSSRT  +    +++R V VV+AKGK+G+ ARQ
Sbjct:     5 MSLNALTRLPLKNTGRFEEVGLARHSLFSSRTACRETAVQQRRMVFVVEAKGKKGMAARQ 64

Query:    60 FQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIY 119
             +QR PPP +PKIEDDGNPRFVIFIRMANVYLWYPLSII GGTTAKIMVAAKDN LGKYIY
Sbjct:    65 YQRTPPP-MPKIEDDGNPRFVIFIRMANVYLWYPLSIIAGGTTAKIMVAAKDNLLGKYIY 123

Query:   120 KDTLARNLAAVIYR 133
             KDT+ARN+AAVIYR
Sbjct:   124 KDTIARNIAAVIYR 137


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.322   0.137   0.393    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      142       142   0.00091  102 3  11 22  0.40    31
                                                     30  0.47    33


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  529 (56 KB)
  Total size of DFA:  115 KB (2077 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  15.59u 0.10s 15.69t   Elapsed:  00:00:00
  Total cpu time:  15.59u 0.10s 15.69t   Elapsed:  00:00:00
  Start:  Sat May 11 02:58:12 2013   End:  Sat May 11 02:58:12 2013


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0009507 "chloroplast" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0006364 "rRNA processing" evidence=RCA
GO:0009902 "chloroplast relocation" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0015979 "photosynthesis" evidence=RCA
GO:0015995 "chlorophyll biosynthetic process" evidence=RCA
GO:0016117 "carotenoid biosynthetic process" evidence=RCA
GO:0019288 "isopentenyl diphosphate biosynthetic process, mevalonate-independent pathway" evidence=RCA
GO:0019684 "photosynthesis, light reaction" evidence=RCA
GO:0034660 "ncRNA metabolic process" evidence=RCA
GO:0042793 "transcription from plastid promoter" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00035114001
SubName- Full=Chromosome chr1 scaffold_75, whole genome shotgun sequence; (226 aa)
(Vitis vinifera)
Predicted Functional Partners:
GSVIVG00020753001
SubName- Full=Chromosome chr14 scaffold_21, whole genome shotgun sequence; (448 aa)
       0.458
GSVIVG00033192001
SubName- Full=Chromosome chr5 scaffold_64, whole genome shotgun sequence; (410 aa)
       0.455
GSVIVG00002992001
SubName- Full=Chromosome chr10 scaffold_138, whole genome shotgun sequence; (289 aa)
       0.455
GSVIVG00010970001
RecName- Full=Aminomethyltransferase; EC=2.1.2.10;; The glycine cleavage system catalyzes the d [...] (408 aa)
       0.454
GSVIVG00026943001
SubName- Full=Chromosome chr15 scaffold_40, whole genome shotgun sequence; (265 aa)
       0.451
GSVIVG00036212001
SubName- Full=Chromosome chr3 scaffold_8, whole genome shotgun sequence; (397 aa)
       0.448
GSVIVG00034383001
SubName- Full=Putative uncharacterized protein (Chromosome chr9 scaffold_7, whole genome shotgu [...] (144 aa)
       0.441
GSVIVG00011932001
SubName- Full=Chromosome undetermined scaffold_2569, whole genome shotgun sequence; (393 aa)
       0.441
GSVIVG00037228001
SubName- Full=Chromosome chr16 scaffold_86, whole genome shotgun sequence; (221 aa)
       0.440
GSVIVG00015908001
SubName- Full=Chromosome chr17 scaffold_12, whole genome shotgun sequence; (324 aa)
       0.437

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00