Citrus Sinensis ID: 034805


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--
MAYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQCK
ccccccccEEEEccccccccccHHHHHHHHHHHHHHHHcccccccccHHHHccccHHHHHHHHHHcccccEEcccccEEEEc
cccccccEEEEEcccccccEccHHHHHHHHHHHHHHcccccccccccccHHHcccHHHHHHHHHHHcEEEEEcccccEEEEc
maysrtetyvllepgveekfVTEEELKARLKYWLENWagqvgkgglppdlakFATIDEAVAFLITNVCElelqgdvgsiqck
maysrtetyvllepgveekfVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCElelqgdvgsiqck
MAYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQCK
*******TYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDV******
****RTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQC*
MAYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQCK
****RTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQCK
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQCK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query82
359488522151 PREDICTED: uncharacterized protein LOC10 0.914 0.496 0.696 5e-24
388494090151 unknown [Lotus japonicus] 0.914 0.496 0.670 2e-21
356573538152 PREDICTED: uncharacterized protein LOC10 0.914 0.493 0.658 2e-21
270342082152 hypothetical protein [Phaseolus vulgaris 0.914 0.493 0.658 5e-21
326494344151 predicted protein [Hordeum vulgare subsp 0.865 0.470 0.6 8e-20
297805830157 hypothetical protein ARALYDRAFT_916402 [ 0.926 0.484 0.575 2e-19
21553859156 unknown [Arabidopsis thaliana] 0.902 0.474 0.589 4e-19
18421842156 chlororespiratory reduction 7 [Arabidops 0.902 0.474 0.589 6e-19
118489214162 unknown [Populus trichocarpa x Populus d 0.902 0.456 0.615 7e-19
357136583154 PREDICTED: uncharacterized protein LOC10 0.878 0.467 0.592 1e-18
>gi|359488522|ref|XP_002280011.2| PREDICTED: uncharacterized protein LOC100249838 [Vitis vinifera] gi|296082204|emb|CBI21209.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  115 bits (287), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 55/79 (69%), Positives = 66/79 (83%), Gaps = 4/79 (5%)

Query: 2   AYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVA 61
           AYS+TETYVL+EPG +E+FV++EELKARLK WLENW G+     LPPDLAKF TID+AV 
Sbjct: 70  AYSQTETYVLMEPGKDEEFVSQEELKARLKGWLENWPGK----ALPPDLAKFQTIDDAVM 125

Query: 62  FLITNVCELELQGDVGSIQ 80
           +L+  VCELE+ GDVGSIQ
Sbjct: 126 YLVKAVCELEIDGDVGSIQ 144




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388494090|gb|AFK35111.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|356573538|ref|XP_003554915.1| PREDICTED: uncharacterized protein LOC100801516 [Glycine max] Back     alignment and taxonomy information
>gi|270342082|gb|ACZ74666.1| hypothetical protein [Phaseolus vulgaris] Back     alignment and taxonomy information
>gi|326494344|dbj|BAJ90441.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326520968|dbj|BAJ92847.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|297805830|ref|XP_002870799.1| hypothetical protein ARALYDRAFT_916402 [Arabidopsis lyrata subsp. lyrata] gi|297316635|gb|EFH47058.1| hypothetical protein ARALYDRAFT_916402 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|21553859|gb|AAM62952.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|18421842|ref|NP_568563.1| chlororespiratory reduction 7 [Arabidopsis thaliana] gi|9758849|dbj|BAB09375.1| unnamed protein product [Arabidopsis thaliana] gi|17529164|gb|AAL38808.1| unknown protein [Arabidopsis thaliana] gi|21281014|gb|AAM45076.1| unknown protein [Arabidopsis thaliana] gi|332007025|gb|AED94408.1| chlororespiratory reduction 7 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|118489214|gb|ABK96413.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|357136583|ref|XP_003569883.1| PREDICTED: uncharacterized protein LOC100831735 [Brachypodium distachyon] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query82
TAIR|locus:2157250156 CRR7 "AT5G39210" [Arabidopsis 0.902 0.474 0.589 4.4e-20
TAIR|locus:2157250 CRR7 "AT5G39210" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 238 (88.8 bits), Expect = 4.4e-20, P = 4.4e-20
 Identities = 46/78 (58%), Positives = 60/78 (76%)

Query:     3 YSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAF 62
             +S ++TYVLLE G +E+FVTE+ELKA+L+ WLENW        LPPDLA+F  +DEAV F
Sbjct:    76 HSNSDTYVLLEAGQDEQFVTEDELKAKLRGWLENWP----VNSLPPDLARFDDLDEAVDF 131

Query:    63 LITNVCELELQGDVGSIQ 80
             L+  VCELE+ G+VGS+Q
Sbjct:   132 LVKAVCELEIDGEVGSVQ 149


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.136   0.406    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       82        82   0.00091  102 3  11 22  0.47    29
                                                     29  0.43    30


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  575 (61 KB)
  Total size of DFA:  113 KB (2074 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  9.73u 0.09s 9.82t   Elapsed:  00:00:00
  Total cpu time:  9.73u 0.09s 9.82t   Elapsed:  00:00:00
  Start:  Fri May 10 01:25:35 2013   End:  Fri May 10 01:25:35 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0009507 "chloroplast" evidence=IDA
GO:0010275 "NAD(P)H dehydrogenase complex assembly" evidence=IGI;IMP
GO:0010598 "NAD(P)H dehydrogenase complex (plastoquinone)" evidence=IGI
GO:0016020 "membrane" evidence=IDA
GO:0009570 "chloroplast stroma" evidence=IDA
GO:0000023 "maltose metabolic process" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0016556 "mRNA modification" evidence=RCA
GO:0019252 "starch biosynthetic process" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query82
pfam1209583 pfam12095, DUF3571, Protein of unknown function (D 5e-17
>gnl|CDD|192936 pfam12095, DUF3571, Protein of unknown function (DUF3571) Back     alignment and domain information
 Score = 68.1 bits (167), Expect = 5e-17
 Identities = 30/74 (40%), Positives = 40/74 (54%), Gaps = 7/74 (9%)

Query: 7  ETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITN 66
          + YV+LEPG  E+F+T  EL A LK WL           LP DLAK  ++D     L+  
Sbjct: 10 DHYVVLEPGQPEQFLTPAELLAWLKEWLTR------LDSLPADLAKLPSLDAQAQRLLDT 63

Query: 67 VCELELQGDVGSIQ 80
           CELE+ G   ++Q
Sbjct: 64 ACELEI-GPGLTLQ 76


This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 85 to 97 amino acids in length. Length = 83

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 82
PF1209583 DUF3571: Protein of unknown function (DUF3571); In 100.0
>PF12095 DUF3571: Protein of unknown function (DUF3571); InterPro: IPR021954 This family of proteins is functionally uncharacterised Back     alignment and domain information
Probab=100.00  E-value=2e-39  Score=214.13  Aligned_cols=72  Identities=47%  Similarity=0.898  Sum_probs=56.1

Q ss_pred             CccccCceEEecCCCCccccCHHHHHHHHHHHHHhhhccCCCCCCChhhhccCCHHHHHHHHHHhcccccccCCcceeee
Q 034805            2 AYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQC   81 (82)
Q Consensus         2 my~q~D~yVvLEp~~~EqflT~~Ell~~Lk~~L~~~~~~~~~~~LP~dL~k~~s~~~qa~~Lldt~CELEi~pg~~~lQW   81 (82)
                      || ++|||||||||+||||||++||++|||+||++.      ++||+||+||+|+++||+||++|+|||||+|| +||||
T Consensus         6 m~-~~d~yVvLEp~~~Eqflt~~Ell~~Lk~~L~~~------~~LP~dL~~~~s~~~qa~~Lldt~CeLeigpg-~~lQW   77 (83)
T PF12095_consen    6 MY-QEDHYVVLEPGQPEQFLTPEELLEKLKEWLQNQ------DDLPPDLAKFSSVEEQAQYLLDTACELEIGPG-GYLQW   77 (83)
T ss_dssp             S------EEEEESSS-SEEE-HHHHHHHHHHHHHHT------TTS-HHHHH---HHHHHHHHHHH---EEEETT-EEEEE
T ss_pred             hh-ccCCEEEecCCCCcccCCHHHHHHHHHHHHHcC------CCCCHHHHhCCCHHHHHHHHHHhceeeecCCC-CEEEE
Confidence            89 999999999999999999999999999999994      48999999999999999999999999999999 69999



This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 85 to 97 amino acids in length. ; PDB: 2KRX_A.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query82
2krx_A94 ASL3597 protein; structural genomics, PSI-2, prote 7e-23
>2krx_A ASL3597 protein; structural genomics, PSI-2, protein structure initiative, no structural genomics consortium, NESG, unknown function; NMR {Nostoc SP} Length = 94 Back     alignment and structure
 Score = 82.9 bits (205), Expect = 7e-23
 Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 5/74 (6%)

Query: 7  ETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITN 66
          + +V+LE    E+F+T  EL  +LK  LE  +       LP +L K  ++      LI  
Sbjct: 10 DNFVVLETNQPEQFLTTIELLEKLKGELEKISFSD----LPLELQKLDSLPAQAQHLIDT 65

Query: 67 VCELELQGDVGSIQ 80
           CEL++ G    +Q
Sbjct: 66 SCELDV-GAGKYLQ 78


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query82
2krx_A94 ASL3597 protein; structural genomics, PSI-2, prote 100.0
>2krx_A ASL3597 protein; structural genomics, PSI-2, protein structure initiative, no structural genomics consortium, NESG, unknown function; NMR {Nostoc SP} Back     alignment and structure
Probab=100.00  E-value=3.1e-40  Score=221.24  Aligned_cols=75  Identities=31%  Similarity=0.491  Sum_probs=72.5

Q ss_pred             CCccccCceEEecCCCCccccCHHHHHHHHHHHHHhhhccCCCCCCChhhhccCCHHHHHHHHHHhcccccccCCcceee
Q 034805            1 MAYSRTETYVLLEPGVEEKFVTEEELKARLKYWLENWAGQVGKGGLPPDLAKFATIDEAVAFLITNVCELELQGDVGSIQ   80 (82)
Q Consensus         1 ~my~q~D~yVvLEp~~~EqflT~~Ell~~Lk~~L~~~~~~~~~~~LP~dL~k~~s~~~qa~~Lldt~CELEi~pg~~~lQ   80 (82)
                      .|| |+|||||||||+||||||++||++|||+||+++|+    ++||+||+||+|+++||+||++|+|||||+||+ |||
T Consensus         5 lmy-q~D~yVvLEp~~~EqfLT~~Ell~~Lk~~L~~~~~----~~LP~dL~~~~s~~~qaq~Lldt~CELei~pG~-~lQ   78 (94)
T 2krx_A            5 LMY-QQDNFVVLETNQPEQFLTTIELLEKLKGELEKISF----SDLPLELQKLDSLPAQAQHLIDTSCELDVGAGK-YLQ   78 (94)
T ss_dssp             CSC-CCCCEEEEESSSCSEEECHHHHHHHHHHHHHHSCT----TTSCHHHHHCCCHHHHHHHHHHHCCCEEEETTE-EEE
T ss_pred             hhc-ccCCEEEecCCCCcccCCHHHHHHHHHHHHHhCcc----ccCCHHHHhCCCHHHHHHHHHHheeeeeeCCCC-EEE
Confidence            389 99999999999999999999999999999999995    499999999999999999999999999999995 999


Q ss_pred             e
Q 034805           81 C   81 (82)
Q Consensus        81 W   81 (82)
                      |
T Consensus        79 W   79 (94)
T 2krx_A           79 W   79 (94)
T ss_dssp             E
T ss_pred             E
Confidence            9




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00