Citrus Sinensis ID: 030500


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170------
MALMASSNGYCYYNYKKKYNKRYNCSTVPRITLSQSQEPRDQTSRREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFHQA
ccccccccccccEEEEEccccccccccccEEEEccccccccccccHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccEEEccccccccccccccccccccccccccHHHHHHHHHHHHHHccccccccEEEEEEccEEEEEEEccEEccccccccEEEEEEccc
cccccccccEcccccccccccccccccHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccEccccccccccHHHHHHHHHHHHHHccccccccEEEEEcccEEEEEEccHHHHHHHHHHHHHHHHHccc
malmassngycyynykkkynkryncstvpritlsqsqeprdqtsrrEIILRSSEVAVIGAIFnlsgkkpeylgVQKNQQalalcpatkncistaenisdlthyappwnynrgrkkpvsREVAMEELLQVIKstkpdrftprlvekkddyvhveyespvlgvgLAHHTFVQLFFHQA
MALMASSNGYCYYNYKKKYNKRYNCSTVpritlsqsqeprdqtsrreiilrsseVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENisdlthyappwnynrgrKKPVSREVAMEELLQvikstkpdrftprlvekkddyVHVEYESPVLGVGLAHHTFVQLFFHQA
MALMASSngycyynykkkynkrynCSTVPRITLSQSQEPRDQTSRREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFHQA
*******NGYCYYNYKKKYNKRYNCSTVPRIT***************IILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRG********VAMEELLQVIKS***DRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFH**
***********YYNYKKKYNKRYNCSTVPRI****************IILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFHQ*
MALMASSNGYCYYNYKKKYNKRYNCSTVPRITL***********RREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFHQA
******SNGYCYYNYKKKYNKRYNCSTVPRITLSQSQEPRDQTSRREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFHQA
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MALMASSNGYCYYNYKKKYNKRYNCSTVPRITLSQSQEPRDQTSRREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGVGLAHHTFVQLFFHQA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query176
255541316223 conserved hypothetical protein [Ricinus 0.909 0.717 0.7 4e-61
449454636222 PREDICTED: uncharacterized protein LOC10 0.892 0.707 0.652 9e-56
224063866219 predicted protein [Populus trichocarpa] 0.914 0.735 0.630 4e-54
449498986263 PREDICTED: uncharacterized protein LOC10 0.761 0.509 0.725 5e-54
356524043219 PREDICTED: uncharacterized protein LOC10 0.892 0.716 0.615 5e-52
357469573220 hypothetical protein MTR_4g023550 [Medic 0.903 0.722 0.618 7e-52
297817400216 hypothetical protein ARALYDRAFT_486553 [ 0.897 0.731 0.642 8e-52
15232964214 uncharacterized protein [Arabidopsis tha 0.880 0.724 0.650 5e-51
296089151 321 unnamed protein product [Vitis vinifera] 0.897 0.492 0.640 1e-50
225453911218 PREDICTED: uncharacterized protein LOC10 0.897 0.724 0.640 4e-50
>gi|255541316|ref|XP_002511722.1| conserved hypothetical protein [Ricinus communis] gi|223548902|gb|EEF50391.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  239 bits (609), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 119/170 (70%), Positives = 140/170 (82%), Gaps = 10/170 (5%)

Query: 1   MALMA--SSNGYCYYNYKKKYNKRYNCSTVPRITLSQSQ-------EPRDQTSRREIILR 51
           M  MA  +++ YC Y YK + +K +N    PR+ LSQ Q       +P DQ +RREIILR
Sbjct: 1   MVSMAAPTTSAYCGYYYKTR-SKNFNGLIRPRVILSQQQNDNDNNNKPSDQIARREIILR 59

Query: 52  SSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNR 111
           SSE+AV+GAIFNLSGKKPEYLGVQKNQQALALCPATKNCIST+ENISDLTHY PPWNYN 
Sbjct: 60  SSEIAVVGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTSENISDLTHYTPPWNYNG 119

Query: 112 GRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGV 161
            RKKPV++EVAM+ELLQVI+STKPD+FTPR+ EK DDYVHVEY+SP+LG+
Sbjct: 120 NRKKPVNKEVAMKELLQVIQSTKPDKFTPRVAEKSDDYVHVEYQSPILGL 169




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449454636|ref|XP_004145060.1| PREDICTED: uncharacterized protein LOC101221074 [Cucumis sativus] gi|449472979|ref|XP_004153749.1| PREDICTED: uncharacterized protein LOC101211056 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224063866|ref|XP_002301293.1| predicted protein [Populus trichocarpa] gi|222843019|gb|EEE80566.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449498986|ref|XP_004160689.1| PREDICTED: uncharacterized protein LOC101227119 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356524043|ref|XP_003530642.1| PREDICTED: uncharacterized protein LOC100806732 [Glycine max] Back     alignment and taxonomy information
>gi|357469573|ref|XP_003605071.1| hypothetical protein MTR_4g023550 [Medicago truncatula] gi|355506126|gb|AES87268.1| hypothetical protein MTR_4g023550 [Medicago truncatula] Back     alignment and taxonomy information
>gi|297817400|ref|XP_002876583.1| hypothetical protein ARALYDRAFT_486553 [Arabidopsis lyrata subsp. lyrata] gi|297322421|gb|EFH52842.1| hypothetical protein ARALYDRAFT_486553 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15232964|ref|NP_191640.1| uncharacterized protein [Arabidopsis thaliana] gi|7329691|emb|CAB82685.1| putative protein [Arabidopsis thaliana] gi|110741004|dbj|BAE98596.1| hypothetical protein [Arabidopsis thaliana] gi|117168213|gb|ABK32189.1| At3g60810 [Arabidopsis thaliana] gi|332646589|gb|AEE80110.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|296089151|emb|CBI38854.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225453911|ref|XP_002279226.1| PREDICTED: uncharacterized protein LOC100245470 [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query176
TAIR|locus:2101826214 AT3G60810 "AT3G60810" [Arabido 0.715 0.588 0.75 1.8e-48
TAIR|locus:2101826 AT3G60810 "AT3G60810" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
 Identities = 96/128 (75%), Positives = 112/128 (87%)

Query:    34 SQSQEPRDQTSRREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCIST 93
             S  + P+    RR+IILRSSE+A+IGAIF LSGKKP+YLGVQKN++ LALCPAT NCIST
Sbjct:    35 SYDKHPK-LIGRRDIILRSSELAMIGAIFQLSGKKPDYLGVQKNER-LALCPATNNCIST 92

Query:    94 AENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVE 153
             +ENISD  HYAPPWNYN GRK PV+R+VAM+ELL VIKS KPD+FTPR+VEKKDDYVHVE
Sbjct:    93 SENISDRVHYAPPWNYNGGRKTPVNRQVAMKELLNVIKSVKPDKFTPRIVEKKDDYVHVE 152

Query:   154 YESPVLGV 161
             YESP+LG+
Sbjct:   153 YESPILGL 160


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.131   0.378    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      176       159   0.00091  106 3  11 22  0.37    32
                                                     30  0.43    34


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  584 (62 KB)
  Total size of DFA:  138 KB (2086 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  14.97u 0.13s 15.10t   Elapsed:  00:00:01
  Total cpu time:  14.98u 0.13s 15.11t   Elapsed:  00:00:01
  Start:  Sat May 11 03:28:37 2013   End:  Sat May 11 03:28:38 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pg.C_LG_II1341
hypothetical protein (219 aa)
(Populus trichocarpa)
Predicted Functional Partners:
eugene3.00170453
hypothetical protein (457 aa)
       0.407

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query176
pfam07386118 pfam07386, DUF1499, Protein of unknown function (D 1e-08
COG4446141 COG4446, COG4446, Uncharacterized protein conserve 1e-07
>gnl|CDD|219394 pfam07386, DUF1499, Protein of unknown function (DUF1499) Back     alignment and domain information
 Score = 50.3 bits (121), Expect = 1e-08
 Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 10/81 (12%)

Query: 81  LALCPATKNCISTAENIS-DLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFT 139
           L+ CP + NC+S+A   S D      P           S E A+  LL V ++       
Sbjct: 3   LSTCPDSPNCVSSAAAASRDAYPDIAPLTL------DGSPEEALAALLAVAEALGRTIVV 56

Query: 140 PRLVEKKDDYVHVEYESPVLG 160
               E  D Y+     SP+ G
Sbjct: 57  T---EPTDGYIEATATSPLFG 74


This family consists of several hypothetical bacterial and plant proteins of around 125 residues in length. The function of this family is unknown. Length = 118

>gnl|CDD|226854 COG4446, COG4446, Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 176
COG4446141 Uncharacterized protein conserved in bacteria [Fun 99.97
PF07386118 DUF1499: Protein of unknown function (DUF1499); In 99.87
>COG4446 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
Probab=99.97  E-value=1.7e-32  Score=215.63  Aligned_cols=94  Identities=32%  Similarity=0.542  Sum_probs=89.4

Q ss_pred             HhcCCCCCCcccccCcccccCCCCCCCeeeeccccCCCCcccCCeeccCCCCCCCCHHHHHHHHHHHHHhcCCCCCCcEE
Q 030500           63 NLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRL  142 (176)
Q Consensus        63 ~~sg~~P~~LGV~dG~~~La~CP~sPNCVSSqa~~~D~~h~I~Pw~y~g~~~~~~s~e~A~~~L~~vl~~~~~~~f~~~I  142 (176)
                      .++|+ |.+|||.+|+  |+|||++|||||||+  .|..|.|+|+.|.++      ++.|.++|++++.++||    ++|
T Consensus        11 ~f~~s-~~~lGV~sgr--lapCpn~PNCVssQ~--adt~h~iaPl~f~~~------~~~a~e~l~~il~~lP~----t~i   75 (141)
T COG4446          11 AFSGS-PCNLGVDSGR--LAPCPNSPNCVSSQD--ADTKHAIAPLNFILD------PGVAIEQLERILLSLPG----TVI   75 (141)
T ss_pred             hhccC-ccccCcccCc--ccCCCCCCCeeeccc--ccchhcccccccccC------HHHHHHHHHHHHhhCCC----ceE
Confidence            57777 9999999998  999999999999998  799999999999986      88999999999999998    999


Q ss_pred             EEecCCEEEEEEEcCCCCcccceEEEeee
Q 030500          143 VEKKDDYVHVEYESPVLGVGLAHHTFVQL  171 (176)
Q Consensus       143 Ve~~~~YL~aef~S~ifGFVDDVEF~~d~  171 (176)
                      |+++++||||||+|++||||||||||+++
T Consensus        76 ve~~~nYl~ae~~Srlf~FVDDlEfyl~~  104 (141)
T COG4446          76 VEKNDNYLRAECTSRLFGFVDDLEFYLPQ  104 (141)
T ss_pred             eecCchHHHHHHHHHHhhcccceEEecCC
Confidence            99999999999999999999999999975



>PF07386 DUF1499: Protein of unknown function (DUF1499); InterPro: IPR010865 This family consists of several hypothetical bacterial and plant proteins of around 125 residues in length Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query176
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 7e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 41.4 bits (96), Expect = 7e-05
 Identities = 25/152 (16%), Positives = 50/152 (32%), Gaps = 38/152 (25%)

Query: 10  YCYYN----YKKKYNKRYNCSTV---PRITLSQS------QEP-RDQTSRR--EIILRSS 53
           Y Y +    ++  +   ++C  V   P+  LS+               + R    +L   
Sbjct: 16  YQYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQ 75

Query: 54  EVAVIGAIFNLSGKKPEY------LGVQKNQQALALCPATKNCISTAENISDLTHYAPPW 107
           E  V    F     +  Y      +  ++ Q ++     T+  I   + +         +
Sbjct: 76  EEMV--QKFVEEVLRINYKFLMSPIKTEQRQPSMM----TRMYIEQRDRL---------Y 120

Query: 108 NYNRG-RKKPVSREVAMEELLQVIKSTKPDRF 138
           N N+   K  VSR     +L Q +   +P + 
Sbjct: 121 NDNQVFAKYNVSRLQPYLKLRQALLELRPAKN 152


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00