Citrus Sinensis ID: 023394
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 283 | ||||||
| 224081445 | 283 | predicted protein [Populus trichocarpa] | 0.939 | 0.939 | 0.481 | 2e-66 | |
| 255544666 | 277 | conserved hypothetical protein [Ricinus | 0.939 | 0.960 | 0.5 | 1e-63 | |
| 359494537 | 241 | PREDICTED: uncharacterized protein LOC10 | 0.756 | 0.887 | 0.575 | 4e-63 | |
| 296082728 | 215 | unnamed protein product [Vitis vinifera] | 0.749 | 0.986 | 0.575 | 2e-62 | |
| 147856564 | 240 | hypothetical protein VITISV_022419 [Viti | 0.752 | 0.887 | 0.570 | 6e-61 | |
| 224094228 | 296 | predicted protein [Populus trichocarpa] | 0.968 | 0.925 | 0.445 | 2e-55 | |
| 357451727 | 266 | CCT [Medicago truncatula] gi|355485188|g | 0.784 | 0.834 | 0.502 | 3e-46 | |
| 449464656 | 245 | PREDICTED: uncharacterized protein LOC10 | 0.738 | 0.853 | 0.451 | 2e-39 | |
| 297829822 | 251 | hypothetical protein ARALYDRAFT_478650 [ | 0.625 | 0.705 | 0.497 | 3e-37 | |
| 18399780 | 251 | protein activator of spomin::LUC2 [Arabi | 0.784 | 0.884 | 0.409 | 6e-36 |
| >gi|224081445|ref|XP_002306413.1| predicted protein [Populus trichocarpa] gi|222855862|gb|EEE93409.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/297 (48%), Positives = 187/297 (62%), Gaps = 31/297 (10%)
Query: 1 MASIPQFYSDDSFSSAELSQFPYPISAAMAAAGESTNYGECPGNSVSSCGGSVSSGNNGP 60
MASIP FYSD FS + S+ IS+ MA G ++SS G
Sbjct: 1 MASIPHFYSDYQFSPDKFSE----ISSVMAQDHRYARTGSFSSTNMSS----------GA 46
Query: 61 IWGTEENFPIVSF---EN-----ILAPESDVVSSSPLIPFTAEQLGIYDSMVPTLAEYNG 112
++G +++ +F EN I ES+++S P E+ GI + +VPTL +Y
Sbjct: 47 MFGDHQDYSFPTFYDHENGCALDIFQGESEIMSPIPATKSLPERFGISNMVVPTLMDYKM 106
Query: 113 MAMCGTNEIQNYGARFQLPSAACEFGDDCCAFMHDLKPMH-PATGENWGIQNNQMASMED 171
+ CG +IQ++G FQL S CE+G+DCC + P PA ENW ++ N++ + E+
Sbjct: 107 GSNCGIAKIQSFGGGFQL-SDVCEYGEDCCGILPKFTPATCPAAEENWRLEYNRITAKEN 165
Query: 172 ANVIKVGRYSVEERKDRILRYLKKKNQRNFNKTIKYACRKTLADRRVRVRGRFARNNELC 231
N++KVGRY+VEERKDRILRYLKK+NQRNFNKTIKYACRKTLADRRVRVRGRFA+NNE+
Sbjct: 166 TNMVKVGRYTVEERKDRILRYLKKRNQRNFNKTIKYACRKTLADRRVRVRGRFAKNNEIF 225
Query: 232 EEDQTVMKNDDN---SHKGKNPYCVDDDVQMKQD----EEWLQEAMASLMYLPYIAG 281
EE+ V KNDDN H K+ YC D VQ+K D E+WLQEA+ASLM++PYIA
Sbjct: 226 EEETEVKKNDDNIPHHHHEKDTYCTSDAVQIKNDDDDEEQWLQEALASLMFVPYIAA 282
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255544666|ref|XP_002513394.1| conserved hypothetical protein [Ricinus communis] gi|223547302|gb|EEF48797.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|359494537|ref|XP_002263147.2| PREDICTED: uncharacterized protein LOC100264678 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|296082728|emb|CBI21733.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|147856564|emb|CAN82490.1| hypothetical protein VITISV_022419 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224094228|ref|XP_002310099.1| predicted protein [Populus trichocarpa] gi|222853002|gb|EEE90549.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|357451727|ref|XP_003596140.1| CCT [Medicago truncatula] gi|355485188|gb|AES66391.1| CCT [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449464656|ref|XP_004150045.1| PREDICTED: uncharacterized protein LOC101221095 [Cucumis sativus] gi|449502519|ref|XP_004161664.1| PREDICTED: uncharacterized protein LOC101230503 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|297829822|ref|XP_002882793.1| hypothetical protein ARALYDRAFT_478650 [Arabidopsis lyrata subsp. lyrata] gi|297328633|gb|EFH59052.1| hypothetical protein ARALYDRAFT_478650 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|18399780|ref|NP_566437.1| protein activator of spomin::LUC2 [Arabidopsis thaliana] gi|6997164|gb|AAF34828.1| hypothetical protein [Arabidopsis thaliana] gi|9279773|dbj|BAB01418.1| unnamed protein product [Arabidopsis thaliana] gi|21553722|gb|AAM62815.1| unknown [Arabidopsis thaliana] gi|59894822|gb|AAX11222.1| activator of sporamin LUC 2 [Arabidopsis thaliana] gi|332641736|gb|AEE75257.1| protein activator of spomin::LUC2 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 283 | ||||||
| TAIR|locus:2090374 | 251 | ASML2 "activator of spomin::LU | 0.621 | 0.701 | 0.445 | 2.4e-35 | |
| TAIR|locus:2051099 | 410 | AT2G33350 "AT2G33350" [Arabido | 0.388 | 0.268 | 0.4 | 1.9e-14 | |
| TAIR|locus:2177911 | 307 | AT5G41380 "AT5G41380" [Arabido | 0.279 | 0.257 | 0.487 | 2.3e-14 | |
| TAIR|locus:2018344 | 386 | AT1G04500 "AT1G04500" [Arabido | 0.325 | 0.238 | 0.4 | 7.4e-14 | |
| TAIR|locus:2195518 | 293 | AT1G63820 [Arabidopsis thalian | 0.257 | 0.249 | 0.474 | 2.8e-11 | |
| TAIR|locus:2168083 | 241 | AT5G59990 "AT5G59990" [Arabido | 0.226 | 0.265 | 0.461 | 9.4e-09 | |
| TAIR|locus:2132902 | 261 | AT4G27900 "AT4G27900" [Arabido | 0.116 | 0.126 | 0.575 | 0.00042 | |
| TAIR|locus:2178441 | 264 | AT5G53420 "AT5G53420" [Arabido | 0.116 | 0.125 | 0.575 | 0.00044 |
| TAIR|locus:2090374 ASML2 "activator of spomin::LUC2" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 348 (127.6 bits), Expect = 2.4e-35, Sum P(2) = 2.4e-35
Identities = 90/202 (44%), Positives = 116/202 (57%)
Query: 85 SSSPLIPFTAEQLGIYDSMVPTLAEYNGMAMCGTNEIQNYGARFQLPSAACEF--GD--D 140
SS+P F + GI D VP EYN NY + + +A F GD +
Sbjct: 50 SSNPESIFVPQVFGISDVSVP---EYN-----------NYYQKMSVNNATHYFHGGDHQE 95
Query: 141 CCAFMHDLKPM-HPATGE-NWGIQNNQMASMEDANVIKVGRYSVEERKDRILRYLKKKNQ 198
F ++KP+ PATGE +WG N++ + N KVGRYSVEERKDRI+RYLKKKNQ
Sbjct: 96 YYGFSPEIKPLFRPATGEQSWG--NSEGGIQAEPNT-KVGRYSVEERKDRIMRYLKKKNQ 152
Query: 199 RNFNKTIKYACRKTLADXXXXXXXXXXXNNELCEEDQTVMKNDDN-SHKGKNPYCVDDD- 256
RNFNKTIKY CRKTLAD NN+ CEE + KN +N S K ++ + DD
Sbjct: 153 RNFNKTIKYVCRKTLADRRVRVRGRFARNNDTCEEQPHMSKNHNNHSEKEEDMFSGSDDY 212
Query: 257 -VQMKQDEEWLQEAMASLMYLP 277
+QM+ D+ WL EAM++L+ P
Sbjct: 213 LIQMEHDDGWLHEAMSNLIPFP 234
|
|
| TAIR|locus:2051099 AT2G33350 "AT2G33350" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2177911 AT5G41380 "AT5G41380" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2018344 AT1G04500 "AT1G04500" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2195518 AT1G63820 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2168083 AT5G59990 "AT5G59990" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2132902 AT4G27900 "AT4G27900" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2178441 AT5G53420 "AT5G53420" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 283 | |||
| pfam06203 | 45 | pfam06203, CCT, CCT motif | 1e-20 |
| >gnl|CDD|203407 pfam06203, CCT, CCT motif | Back alignment and domain information |
|---|
Score = 82.3 bits (204), Expect = 1e-20
Identities = 21/45 (46%), Positives = 32/45 (71%)
Query: 185 RKDRILRYLKKKNQRNFNKTIKYACRKTLADRRVRVRGRFARNNE 229
R+ +LRY +K+ R F+K I+YA RK +A+ R RV+GRF + +E
Sbjct: 1 REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVKGRFVKQSE 45
|
This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 283 | |||
| PF06203 | 45 | CCT: CCT motif; InterPro: IPR010402 The CCT (CONST | 99.79 | |
| KOG1601 | 340 | consensus GATA-4/5/6 transcription factors [Transc | 96.33 | |
| PF09425 | 27 | CCT_2: Divergent CCT motif; InterPro: IPR018467 Th | 95.57 |
| >PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction | Back alignment and domain information |
|---|
Probab=99.79 E-value=4.4e-20 Score=131.27 Aligned_cols=45 Identities=47% Similarity=0.938 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHhhhcCCCCccccchhhhhhhhcCccCceeecCCc
Q 023394 185 RKDRILRYLKKKNQRNFNKTIKYACRKTLADRRVRVRGRFARNNE 229 (283)
Q Consensus 185 R~~~l~RYreKRk~R~f~KkIRY~~RK~~Ad~RpRVKGRFvk~~e 229 (283)
|+++|+||++||++|+|+|+|+|+|||.+||.|||||||||+.+|
T Consensus 1 R~~~l~Ry~~Kr~~R~f~kkirY~~Rk~~A~~R~RvkGRFvk~~e 45 (45)
T PF06203_consen 1 REEKLQRYREKRKRRNFEKKIRYESRKAVADKRPRVKGRFVKKSE 45 (45)
T ss_pred CHHHHHHHHHHHHhhcccccCCcchHHHHHhhCCccCCcccCCCC
Confidence 689999999999999999999999999999999999999999874
|
The CCT domain is found in association with other domains, such as the B-box zinc finger, the GATA-type zinc finger, the ZIM motif or the response regulatory domain. The CCT domain contains a putative nuclear localisation signal within the second half of the CCT motif and has been shown to be involved in nuclear localization and probably also has a role in protein-protein interaction [].; GO: 0005515 protein binding |
| >KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] | Back alignment and domain information |
|---|
| >PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1 | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 283 | |||
| 3ogl_Q | 21 | JAZ1 incomplete degron peptide; leucine-rich repea | 94.95 | |
| 3ogk_Q | 22 | JAZ1 incomplete degron peptide; leucine rich repea | 93.43 |
| >3ogk_Q JAZ1 incomplete degron peptide; leucine rich repeat, ubiquitin ligase, SCF, protein binding; HET: OGK; 2.80A {Arabidopsis thaliana} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00