Citrus Sinensis ID: 023225
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 285 | ||||||
| 224059600 | 382 | predicted protein [Populus trichocarpa] | 0.957 | 0.714 | 0.664 | 3e-87 | |
| 255553448 | 388 | conserved hypothetical protein [Ricinus | 0.954 | 0.701 | 0.641 | 3e-86 | |
| 225450009 | 380 | PREDICTED: uncharacterized protein LOC10 | 0.954 | 0.715 | 0.677 | 2e-85 | |
| 356542804 | 372 | PREDICTED: uncharacterized protein LOC10 | 0.652 | 0.5 | 0.811 | 6e-82 | |
| 356531649 | 376 | PREDICTED: uncharacterized protein LOC10 | 0.652 | 0.494 | 0.811 | 5e-81 | |
| 225435391 | 384 | PREDICTED: uncharacterized protein LOC10 | 0.684 | 0.507 | 0.753 | 6e-81 | |
| 413945208 | 387 | hypothetical protein ZEAMMB73_229307 [Ze | 0.649 | 0.478 | 0.767 | 3e-80 | |
| 226529786 | 387 | uncharacterized protein LOC100276525 [Ze | 0.649 | 0.478 | 0.767 | 3e-80 | |
| 242087801 | 413 | hypothetical protein SORBIDRAFT_09g01918 | 0.649 | 0.447 | 0.772 | 5e-80 | |
| 125552186 | 297 | hypothetical protein OsI_19813 [Oryza sa | 0.663 | 0.636 | 0.772 | 6e-80 |
| >gi|224059600|ref|XP_002299927.1| predicted protein [Populus trichocarpa] gi|222847185|gb|EEE84732.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 327 bits (838), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 188/283 (66%), Positives = 213/283 (75%), Gaps = 10/283 (3%)
Query: 1 MATASFSNLPTRAFS---PSYHNCHQSFVIKNITRQIP----NPHLINFSLTCTSSTPNK 53
MA A+FS TRAFS P+YH+ + S I+ +P P S T T+S +
Sbjct: 1 MAFATFS---TRAFSLQSPNYHHQYHSNNFATISLCVPIPTSKPSATPLSWTTTTSRFKR 57
Query: 54 NKTCRPTVIFASGGGKGGGGSWGSGGGGGGGDDDAGFHNCTEAIFALAKAGRTLGDLPKD 113
+T GG GGGG G GGGG DAG N +EAI ALA+ GR+L LPKD
Sbjct: 58 FRTTIALSSGGGDGGVGGGGISGGGGGGNDDGGDAGSRNKSEAILALAEVGRSLESLPKD 117
Query: 114 LAGAIEAGRVPAAIVKRYLELEKSPVFRWLLNFGGFRERLLADDLFLAKVAMECGVGVFT 173
LA AIEAGRVP +IV RY ELEKS VFRWLL FGGF+ERLLADDLFL KVA+ECGVG+FT
Sbjct: 118 LAAAIEAGRVPGSIVSRYFELEKSAVFRWLLQFGGFKERLLADDLFLTKVAIECGVGIFT 177
Query: 174 KTAAELERRRENFSKELDFVFADVVMAIIADFMLVWLPAPTISLRPPLAMSTGPIAKFFF 233
KTAAELERRRENF+KELDFVFADVVMAIIADFMLVWLPAPT+SLRPPLA+S GP++KFF+
Sbjct: 178 KTAAELERRRENFTKELDFVFADVVMAIIADFMLVWLPAPTVSLRPPLALSAGPVSKFFY 237
Query: 234 NCPDNAFQVAFTGTSYSLIQRIGAIVRNGAKLFAVGTSASLVG 276
+CPDNAFQVA GTSYS +QRIGAI+RNGAKLFAVGT ASLVG
Sbjct: 238 SCPDNAFQVALAGTSYSFLQRIGAILRNGAKLFAVGTGASLVG 280
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255553448|ref|XP_002517765.1| conserved hypothetical protein [Ricinus communis] gi|223543037|gb|EEF44572.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|225450009|ref|XP_002274472.1| PREDICTED: uncharacterized protein LOC100255131 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356542804|ref|XP_003539855.1| PREDICTED: uncharacterized protein LOC100818590 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356531649|ref|XP_003534389.1| PREDICTED: uncharacterized protein LOC100808020 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|225435391|ref|XP_002285363.1| PREDICTED: uncharacterized protein LOC100265633 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|413945208|gb|AFW77857.1| hypothetical protein ZEAMMB73_229307 [Zea mays] | Back alignment and taxonomy information |
|---|
| >gi|226529786|ref|NP_001143763.1| uncharacterized protein LOC100276525 [Zea mays] gi|195626500|gb|ACG35080.1| hypothetical protein [Zea mays] | Back alignment and taxonomy information |
|---|
| >gi|242087801|ref|XP_002439733.1| hypothetical protein SORBIDRAFT_09g019180 [Sorghum bicolor] gi|241945018|gb|EES18163.1| hypothetical protein SORBIDRAFT_09g019180 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
| >gi|125552186|gb|EAY97895.1| hypothetical protein OsI_19813 [Oryza sativa Indica Group] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 285 | ||||||
| TAIR|locus:505006615 | 386 | AT5G12470 "AT5G12470" [Arabido | 0.649 | 0.479 | 0.729 | 4.3e-70 | |
| TAIR|locus:2063136 | 735 | AT2G40400 [Arabidopsis thalian | 0.638 | 0.247 | 0.376 | 2.1e-26 | |
| TAIR|locus:2078446 | 745 | AT3G56140 "AT3G56140" [Arabido | 0.638 | 0.244 | 0.366 | 3.5e-26 | |
| TAIR|locus:2158155 | 433 | RER1 "AT5G22790" [Arabidopsis | 0.585 | 0.385 | 0.302 | 2.4e-13 | |
| TAIR|locus:2065649 | 432 | LCD1 "AT2G37860" [Arabidopsis | 0.582 | 0.384 | 0.279 | 1.7e-10 | |
| TAIR|locus:2077838 | 337 | AT3G08640 "AT3G08640" [Arabido | 0.424 | 0.359 | 0.353 | 3e-09 | |
| TAIR|locus:2077828 | 339 | AT3G08630 "AT3G08630" [Arabido | 0.477 | 0.401 | 0.344 | 3e-09 | |
| TAIR|locus:2176826 | 521 | AT5G24690 "AT5G24690" [Arabido | 0.578 | 0.316 | 0.24 | 4e-05 |
| TAIR|locus:505006615 AT5G12470 "AT5G12470" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 710 (255.0 bits), Expect = 4.3e-70, P = 4.3e-70
Identities = 135/185 (72%), Positives = 160/185 (86%)
Query: 92 NCTEAIFALAKAGRTLGDLPKDLAGAIEAGRVPAAIVKRYLELEKSPVFRWLLNFGGFRE 151
N EA+ L ++G L LPKDLA AIEAGR+P +++ R+LEL+KS V RWL+ FGGFRE
Sbjct: 101 NRNEAMLLLKESGIELESLPKDLAAAIEAGRIPGSVITRFLELQKSAVMRWLMQFGGFRE 160
Query: 152 RLLADDLFLAKVAMECGVGVFTKTAAELERRRENFSKELDFVFADVVMAIIADFMLVWLP 211
RLLADDLF+AK+AMECGVG+FTKTAAE ERRRENF EL+ VFADV MAIIADFMLV+LP
Sbjct: 161 RLLADDLFMAKLAMECGVGIFTKTAAEYERRRENFFNELEVVFADVAMAIIADFMLVYLP 220
Query: 212 APTISLRPPLAMSTGPIAKFFFNCPDNAFQVAFTGTSYSLIQRIGAIVRNGAKLFAVGTS 271
APT+SLRPPLA++ G I+KFF NCPDNAFQVA +GTSY+L+QR+GAI RNGAKLFAVGT+
Sbjct: 221 APTVSLRPPLALTAGGISKFFHNCPDNAFQVALSGTSYTLLQRLGAITRNGAKLFAVGTT 280
Query: 272 ASLVG 276
+SLVG
Sbjct: 281 SSLVG 285
|
|
| TAIR|locus:2063136 AT2G40400 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2078446 AT3G56140 "AT3G56140" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2158155 RER1 "AT5G22790" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2065649 LCD1 "AT2G37860" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2077838 AT3G08640 "AT3G08640" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2077828 AT3G08630 "AT3G08630" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2176826 AT5G24690 "AT5G24690" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| estExt_fgenesh4_pg.C_LG_I1877 | hypothetical protein (382 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 285 | |||
| pfam11891 | 179 | pfam11891, DUF3411, Domain of unknown function (DU | 2e-53 | |
| PLN03138 | 796 | PLN03138, PLN03138, Protein TOC75; Provisional | 1e-04 |
| >gnl|CDD|221294 pfam11891, DUF3411, Domain of unknown function (DUF3411) | Back alignment and domain information |
|---|
Score = 171 bits (436), Expect = 2e-53
Identities = 59/126 (46%), Positives = 74/126 (58%)
Query: 151 ERLLADDLFLAKVAMECGVGVFTKTAAELERRRENFSKELDFVFADVVMAIIADFMLVWL 210
ERLLAD FL K+AME + + T AE+E+R ENF E D V AD+++ + +F LV+L
Sbjct: 1 ERLLADPSFLFKLAMEEVIDICAATVAEVEKRGENFWAEFDLVAADLLVGSVVNFALVYL 60
Query: 211 PAPTISLRPPLAMSTGPIAKFFFNCPDNAFQVAFTGTSYSLIQRIGAIVRNGAKLFAVGT 270
APT S A G + KF + P N F+ G SYSL QR G + GAKL AVG
Sbjct: 61 LAPTRSFGSTAAEMAGGLRKFLGSLPSNVFEKGLPGRSYSLQQRFGTVFYKGAKLAAVGF 120
Query: 271 SASLVG 276
A LVG
Sbjct: 121 IAGLVG 126
|
This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 168 to 186 amino acids in length. This domain has a conserved RYQ sequence motif. Length = 179 |
| >gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 285 | |||
| PF11891 | 180 | DUF3411: Domain of unknown function (DUF3411); Int | 100.0 | |
| COG4907 | 595 | Predicted membrane protein [Function unknown] | 88.71 |
| >PF11891 DUF3411: Domain of unknown function (DUF3411); InterPro: IPR021825 This presumed domain is functionally uncharacterised | Back alignment and domain information |
|---|
Probab=100.00 E-value=6e-56 Score=389.89 Aligned_cols=135 Identities=53% Similarity=0.720 Sum_probs=130.9
Q ss_pred hhhhcchhhHHHHHHHhhhhhhhhHHHHHhhcccchhhHHHHHHHHHHHHHHHHHHhhhhcccccccCCCcccccc-hhh
Q 023225 151 ERLLADDLFLAKVAMECGVGVFTKTAAELERRRENFSKELDFVFADVVMAIIADFMLVWLPAPTISLRPPLAMSTG-PIA 229 (285)
Q Consensus 151 ~RlLADP~FlfKl~~E~~I~v~~~~~aE~~~Rge~F~~ElDfV~sdlv~g~i~nfaLV~LLAPt~s~g~~~~~sag-~l~ 229 (285)
|||||||+|||||++||+||++|+++|||++|||+||+|||||+||+++++|+||+||||||||++++++++...| .++
T Consensus 1 ~RllADP~Fl~Kl~~E~~i~i~~~~~~e~~~R~e~f~~E~d~v~~d~v~~~i~n~~lv~llAPt~s~~~~~~~~~~~~~~ 80 (180)
T PF11891_consen 1 ERLLADPSFLFKLAIEEVIGIGCATAAEYAKRGERFWNELDFVFSDVVVGSIVNFALVWLLAPTRSFGSPAASSPGGGLQ 80 (180)
T ss_pred CcccccchHHHHHHHHHHHHHHHHHHHHHHHcccchHHHHHHHHHHHHHHHHHHHHHHHhccchHhhCcccccccchHHH
Confidence 7999999999999999999999999999999999999999999999999999999999999999999998875544 899
Q ss_pred hHhhcCCchhhhcccCCCCCchhhhHHHHhhccceehhhhhhhhhhHHHhhhcccC
Q 023225 230 KFFFNCPDNAFQVAFTGTSYSLIQRIGAIVRNGAKLFAVGTSASLVGWCRHNECFD 285 (285)
Q Consensus 230 k~~~~lP~n~Fq~~~pg~~fsl~qR~~a~v~KGa~l~~VG~~aGlvG~glsN~Lm~ 285 (285)
|++++||+|+||+++||++||++||++||+|||++|++||++||++||++||+||+
T Consensus 81 ~~~~~~P~n~Fq~~~~g~~fsl~qR~~~~~~kg~~l~~VG~~ag~vg~~lsn~L~~ 136 (180)
T PF11891_consen 81 KFLGSLPNNAFQKGYPGRSFSLAQRIGAFVYKGAKLAAVGFIAGLVGTGLSNALIA 136 (180)
T ss_pred HHHHhChHHHhccCCCCCcccHHHHHHHHHHcchHhhhhHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999973
|
This domain is found in eukaryotes. This domain is typically between 168 to 186 amino acids in length. This domain has a conserved RYQ sequence motif. |
| >COG4907 Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 285 | |||
| 2pk2_A | 358 | Cyclin-T1, protein TAT; TAR, twinning, transcripti | 91.17 |
| >2pk2_A Cyclin-T1, protein TAT; TAR, twinning, transcription regulation P- TEFB, cell cycle; 2.67A {Homo sapiens} SCOP: a.74.1.1 a.74.1.1 PDB: 2w2h_C | Back alignment and structure |
|---|
Probab=91.17 E-value=0.039 Score=51.65 Aligned_cols=8 Identities=0% Similarity=-0.093 Sum_probs=3.2
Q ss_pred ccCCCCCc
Q 023225 31 TRQIPNPH 38 (285)
Q Consensus 31 ~~~~~~~~ 38 (285)
.+.+..++
T Consensus 246 ~~~~~~L~ 253 (358)
T 2pk2_A 246 EKTPNRLK 253 (358)
T ss_dssp TTCTTHHH
T ss_pred HhCccccc
Confidence 34444333
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00