Citrus Sinensis ID: 034565


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-
MAIDVDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLPMQKT
ccEEEccccEEEcEEccEEEEEEcEEcccccEEEEEHHHHHHHHHcccEEEEEEEEEcccEEEEEEEEccccccccccccccEEEcccccc
ccEEEcccccEcEEEccEEEEEEEEccccccEEEEcHHHHHHHHHcccEEEEEEEEEEcccEEEEEEEccccccHHHccHHcEEEcccccc
MAIDVDRTIQAVTRvkpyictmplrlddgwNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRriyfsdrlyseeelppefklylpmqkt
maidvdrtiqavtrvkpyICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVEtlrvqvhancRLRRIYFsdrlyseeelppefklylpmqkt
MAIDVDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLPMQKT
****VDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSE****************
**IDVDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLPM***
MAIDVDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLPMQKT
MAIDVDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLPM***
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooo
iiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAIDVDRTIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLPMQKT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query91 2.2.26 [Sep-21-2011]
Q9VKV8199 UPF0468 protein CG5343 OS yes no 0.890 0.407 0.839 3e-39
Q6PBJ2192 UPF0468 protein C16orf80 yes no 0.879 0.416 0.837 2e-38
Q6GL74193 UPF0468 protein C16orf80 yes no 0.879 0.414 0.837 2e-38
Q6GPY6193 UPF0468 protein C16orf80 N/A no 0.879 0.414 0.837 2e-38
Q8BTU1193 UPF0468 protein C16orf80 yes no 0.879 0.414 0.837 2e-38
Q5ZHP3193 UPF0468 protein C16orf80 yes no 0.879 0.414 0.837 2e-38
Q6B857193 UPF0468 protein C16orf80 yes no 0.879 0.414 0.837 2e-38
Q9Y6A4193 UPF0468 protein C16orf80 yes no 0.879 0.414 0.837 4e-38
Q61JK7203 UPF0468 protein C16orf80 N/A no 0.879 0.394 0.75 7e-36
Q86D25203 UPF0468 protein C16orf80 yes no 0.879 0.394 0.75 8e-36
>sp|Q9VKV8|U0468_DROME UPF0468 protein CG5343 OS=Drosophila melanogaster GN=CG5343 PE=1 SV=1 Back     alignment and function desciption
 Score =  160 bits (404), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 68/81 (83%), Positives = 79/81 (97%)

Query: 10  QAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFS 69
           Q+ TRVKP+ICTMP+RLD+GWNQIQ NL+DFTRRAYGTNYVETLRVQ+HANCR+RR+YFS
Sbjct: 109 QSTTRVKPFICTMPMRLDEGWNQIQFNLSDFTRRAYGTNYVETLRVQIHANCRIRRVYFS 168

Query: 70  DRLYSEEELPPEFKLYLPMQK 90
           DRLYSE+ELPPEFKL+LP+QK
Sbjct: 169 DRLYSEDELPPEFKLFLPIQK 189





Drosophila melanogaster (taxid: 7227)
>sp|Q6PBJ2|CP080_DANRE UPF0468 protein C16orf80 homolog OS=Danio rerio GN=zgc:73380 PE=2 SV=1 Back     alignment and function description
>sp|Q6GL74|CP080_XENTR UPF0468 protein C16orf80 homolog OS=Xenopus tropicalis PE=2 SV=1 Back     alignment and function description
>sp|Q6GPY6|CP080_XENLA UPF0468 protein C16orf80 homolog OS=Xenopus laevis PE=2 SV=1 Back     alignment and function description
>sp|Q8BTU1|CP080_MOUSE UPF0468 protein C16orf80 homolog OS=Mus musculus GN=Gtl3 PE=2 SV=1 Back     alignment and function description
>sp|Q5ZHP3|CP080_CHICK UPF0468 protein C16orf80 homolog OS=Gallus gallus GN=RCJMB04_34o2 PE=2 SV=1 Back     alignment and function description
>sp|Q6B857|CP080_BOVIN UPF0468 protein C16orf80 homolog OS=Bos taurus PE=2 SV=1 Back     alignment and function description
>sp|Q9Y6A4|CP080_HUMAN UPF0468 protein C16orf80 OS=Homo sapiens GN=C16orf80 PE=1 SV=1 Back     alignment and function description
>sp|Q61JK7|CP080_CAEBR UPF0468 protein C16orf80 homolog OS=Caenorhabditis briggsae GN=CBG09753 PE=3 SV=1 Back     alignment and function description
>sp|Q86D25|CP080_CAEEL UPF0468 protein C16orf80 homolog OS=Caenorhabditis elegans GN=C54C6.6 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query91
255578235186 orf protein, putative [Ricinus communis] 0.901 0.440 0.939 5e-41
388517243190 unknown [Lotus japonicus] 0.923 0.442 0.916 5e-41
225464011190 PREDICTED: UPF0468 protein CG5343 [Vitis 0.912 0.436 0.939 1e-40
357515357 218 hypothetical protein MTR_8g040630 [Medic 0.901 0.376 0.926 2e-40
356530569190 PREDICTED: UPF0468 protein CG5343-like [ 0.923 0.442 0.904 3e-40
217074284189 unknown [Medicago truncatula] gi|3884951 0.912 0.439 0.915 3e-40
116791609190 unknown [Picea sitchensis] 0.901 0.431 0.926 3e-40
297744360 224 unnamed protein product [Vitis vinifera] 0.879 0.357 0.95 6e-40
356548806190 PREDICTED: LOW QUALITY PROTEIN: UPF0468 0.923 0.442 0.892 6e-40
147781661 199 hypothetical protein VITISV_021801 [Viti 0.890 0.407 0.938 9e-40
>gi|255578235|ref|XP_002529985.1| orf protein, putative [Ricinus communis] gi|223530508|gb|EEF32390.1| orf protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  171 bits (433), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 77/82 (93%), Positives = 81/82 (98%)

Query: 10  QAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFS 69
           QA+TRVKPYICTMPL++D+GWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFS
Sbjct: 105 QAITRVKPYICTMPLKMDEGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFS 164

Query: 70  DRLYSEEELPPEFKLYLPMQKT 91
           DRLYSEEELPPEFKLYLPMQK 
Sbjct: 165 DRLYSEEELPPEFKLYLPMQKA 186




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388517243|gb|AFK46683.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|225464011|ref|XP_002265223.1| PREDICTED: UPF0468 protein CG5343 [Vitis vinifera] gi|296087820|emb|CBI35076.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357515357|ref|XP_003627967.1| hypothetical protein MTR_8g040630 [Medicago truncatula] gi|355521989|gb|AET02443.1| hypothetical protein MTR_8g040630 [Medicago truncatula] Back     alignment and taxonomy information
>gi|356530569|ref|XP_003533853.1| PREDICTED: UPF0468 protein CG5343-like [Glycine max] Back     alignment and taxonomy information
>gi|217074284|gb|ACJ85502.1| unknown [Medicago truncatula] gi|388495166|gb|AFK35649.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|116791609|gb|ABK26040.1| unknown [Picea sitchensis] Back     alignment and taxonomy information
>gi|297744360|emb|CBI37330.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356548806|ref|XP_003542790.1| PREDICTED: LOW QUALITY PROTEIN: UPF0468 protein CG5343-like [Glycine max] Back     alignment and taxonomy information
>gi|147781661|emb|CAN62739.1| hypothetical protein VITISV_021801 [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query91
TAIR|locus:2082184190 BUG22 "AT3G12300" [Arabidopsis 0.890 0.426 0.901 7e-38
FB|FBgn0032248199 CG5343 [Drosophila melanogaste 0.890 0.407 0.839 1.3e-36
UNIPROTKB|Q5ZHP3193 RCJMB04_34o2 "UPF0468 protein 0.879 0.414 0.837 5.7e-36
UNIPROTKB|Q6B857193 Q6B857 "UPF0468 protein C16orf 0.879 0.414 0.837 5.7e-36
UNIPROTKB|E2R3J4193 C16orf80 "Uncharacterized prot 0.879 0.414 0.837 5.7e-36
UNIPROTKB|F2Z513193 C16orf80 "Uncharacterized prot 0.879 0.414 0.837 5.7e-36
MGI|MGI:107428193 Gtl3 "gene trap locus 3" [Mus 0.879 0.414 0.837 5.7e-36
ZFIN|ZDB-GENE-040426-1784192 zgc:73380 "zgc:73380" [Danio r 0.879 0.416 0.837 5.7e-36
UNIPROTKB|Q9Y6A4193 C16orf80 "UPF0468 protein C16o 0.879 0.414 0.837 9.2e-36
FB|FBgn0032291293 CG17118 [Drosophila melanogast 0.824 0.255 0.613 4.8e-23
TAIR|locus:2082184 BUG22 "AT3G12300" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 406 (148.0 bits), Expect = 7.0e-38, P = 7.0e-38
 Identities = 73/81 (90%), Positives = 79/81 (97%)

Query:    10 QAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFS 69
             QAVTRVKPYICTMPL++D+GWNQIQLNLAD TRRAYGTNY ETLRVQ+HANCRLRRIYF+
Sbjct:   109 QAVTRVKPYICTMPLKMDEGWNQIQLNLADLTRRAYGTNYAETLRVQIHANCRLRRIYFA 168

Query:    70 DRLYSEEELPPEFKLYLPMQK 90
             DRLYSEEELPPEFKLYLP+QK
Sbjct:   169 DRLYSEEELPPEFKLYLPVQK 189




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
FB|FBgn0032248 CG5343 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|Q5ZHP3 RCJMB04_34o2 "UPF0468 protein C16orf80 homolog" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|Q6B857 Q6B857 "UPF0468 protein C16orf80 homolog" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2R3J4 C16orf80 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F2Z513 C16orf80 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
MGI|MGI:107428 Gtl3 "gene trap locus 3" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040426-1784 zgc:73380 "zgc:73380" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|Q9Y6A4 C16orf80 "UPF0468 protein C16orf80" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
FB|FBgn0032291 CG17118 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q86D25CP080_CAEELNo assigned EC number0.750.87910.3940yesno
Q6GL74CP080_XENTRNo assigned EC number0.83750.87910.4145yesno
Q8BTU1CP080_MOUSENo assigned EC number0.83750.87910.4145yesno
Q6B857CP080_BOVINNo assigned EC number0.83750.87910.4145yesno
Q9Y6A4CP080_HUMANNo assigned EC number0.83750.87910.4145yesno
Q6PBJ2CP080_DANRENo assigned EC number0.83750.87910.4166yesno
Q9VKV8U0468_DROMENo assigned EC number0.83950.89010.4070yesno
Q5ZHP3CP080_CHICKNo assigned EC number0.83750.87910.4145yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query91
pfam05018190 pfam05018, DUF667, Protein of unknown function (DU 6e-49
>gnl|CDD|191163 pfam05018, DUF667, Protein of unknown function (DUF667) Back     alignment and domain information
 Score =  152 bits (386), Expect = 6e-49
 Identities = 63/82 (76%), Positives = 74/82 (90%)

Query: 8   TIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIY 67
             Q+VTRVKP+ICTMPLRLD GWNQIQ NL+DFTRRAYGTNY+ETLRVQ+HANCR+RR+Y
Sbjct: 107 NYQSVTRVKPFICTMPLRLDPGWNQIQFNLSDFTRRAYGTNYIETLRVQIHANCRIRRVY 166

Query: 68  FSDRLYSEEELPPEFKLYLPMQ 89
           F DRLY+E+ELPPE +LY P +
Sbjct: 167 FCDRLYTEDELPPELRLYCPKK 188


This family of proteins are highly conserved in eukaryotes. Some proteins in the family are annotated as transcription factors. However, there is currently no support for this in the literature. Length = 190

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 91
KOG3213238 consensus Transcription factor IIB [Transcription] 100.0
PF05018190 DUF667: Protein of unknown function (DUF667); Inte 100.0
>KOG3213 consensus Transcription factor IIB [Transcription] Back     alignment and domain information
Probab=100.00  E-value=4.1e-38  Score=242.35  Aligned_cols=83  Identities=72%  Similarity=1.206  Sum_probs=81.1

Q ss_pred             ceeeeeeeeceeEeeccccCccchhhhhcHHHHHHHHhCcceeEEEEEEEecceeeEEeeeccccCCCcCCcccceeecc
Q 034565            8 TIQAVTRVKPYICTMPLRLDDGWNQIQLNLADFTRRAYGTNYVETLRVQVHANCRLRRIYFSDRLYSEEELPPEFKLYLP   87 (91)
Q Consensus         8 ~fq~~trVkp~~c~mPL~L~~GWN~i~~nL~d~t~~aygT~yvet~rv~I~ANCRlRRIYFsDrlYs~~ELP~e~kl~~p   87 (91)
                      |||++++|+|++|+|||.|++||||||+||+|||+++||++|.||++||||||||||||||+|++|+++|+|.+||++.+
T Consensus       107 n~~ke~~~kp~~~~mPl~m~~~W~~iqlnL~dft~~~~~~~y~etl~iql~AncriRriyf~~kl~~~~e~~~~frlm~r  186 (238)
T KOG3213|consen  107 NFQKETSVKPFICTMPLVMDAGWNQIQLNLADFTRRAYGTNYGETLSIQLHANCRIRRIYFADKLYSEAELPLEFRLMLR  186 (238)
T ss_pred             ccchhhcccceEEecceEecCcceeEEeeHHHHHHHHhccceeeEEEEEEecceEEEEEEeccccCChhhCCCcceEccc
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999988


Q ss_pred             ccC
Q 034565           88 MQK   90 (91)
Q Consensus        88 ~~~   90 (91)
                      .++
T Consensus       187 f~~  189 (238)
T KOG3213|consen  187 FQV  189 (238)
T ss_pred             ccC
Confidence            764



>PF05018 DUF667: Protein of unknown function (DUF667); InterPro: IPR007714 This family of proteins are highly conserved in eukaryotes Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00