Citrus Sinensis ID: 023115
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 287 | ||||||
| 255574198 | 271 | conserved hypothetical protein [Ricinus | 0.926 | 0.981 | 0.578 | 9e-80 | |
| 224096129 | 244 | predicted protein [Populus trichocarpa] | 0.815 | 0.959 | 0.602 | 1e-79 | |
| 688423 | 287 | tumor-related protein [Nicotiana glauca | 0.916 | 0.916 | 0.549 | 1e-78 | |
| 356511199 | 285 | PREDICTED: transcription factor TGA4-lik | 0.937 | 0.943 | 0.525 | 2e-77 | |
| 449480891 | 255 | PREDICTED: transcription factor TGA6-lik | 0.850 | 0.956 | 0.549 | 6e-76 | |
| 225438349 | 255 | PREDICTED: transcription factor HBP-1b(c | 0.846 | 0.952 | 0.591 | 3e-74 | |
| 357520133 | 272 | Transcription factor HBP-1b(c1) [Medicag | 0.850 | 0.897 | 0.506 | 2e-70 | |
| 356565430 | 277 | PREDICTED: LOW QUALITY PROTEIN: transcri | 0.818 | 0.848 | 0.503 | 5e-63 | |
| 294610363 | 260 | hypothetical protein [Triticum aestivum] | 0.808 | 0.892 | 0.430 | 1e-40 | |
| 359493600 | 246 | PREDICTED: transcription factor TGA4-lik | 0.804 | 0.939 | 0.367 | 3e-40 |
| >gi|255574198|ref|XP_002528014.1| conserved hypothetical protein [Ricinus communis] gi|223532583|gb|EEF34370.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 302 bits (774), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 162/280 (57%), Positives = 204/280 (72%), Gaps = 14/280 (5%)
Query: 1 MSVLPSSIVTFLSSAAAIRNGDNNNSDGESFQKFFESWLVEQNQHLQALISASKQQENNS 60
MS LPSS S A++R+ +S SF FFE WLVEQ Q L LISASKQ+ NN
Sbjct: 1 MSDLPSS------SPASLRS---TSSSHNSFHDFFEWWLVEQKQELDELISASKQKLNNK 51
Query: 61 SSSN--FEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMA 118
++++ + + V+EHYEQYY AKS+W K DVL M +PSW S LED+FLWIGGWRPSMA
Sbjct: 52 NNNHDSLQPLINRVLEHYEQYYSAKSKWAKHDVLAMFSPSWTSPLEDAFLWIGGWRPSMA 111
Query: 119 FHLLYSKSGLQLEGKLHDLIRGLSSGDLGDLSPTQLTRVDGLQRVIIKEEKDLSEKLAKY 178
FHLLYSKSGLQLE +LHD+IRGL +GDLGDLSP QL +VD QR II+EEK+++E +AK+
Sbjct: 112 FHLLYSKSGLQLEAQLHDVIRGLCTGDLGDLSPNQLVQVDEFQRRIIREEKNITENMAKH 171
Query: 179 QETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKLEEIFHKADDLR 238
Q TVAD+S+VEL+H +++ T D S + +++ES TK LE++ KADDLR
Sbjct: 172 QATVADTSMVELTHAISK---TRRDEGSSIGNEVQERVESTLKTKLEGLEKVLQKADDLR 228
Query: 239 LKTFKAVIDILTPIQAVHFLIAAAELHLRLHDWGKQRDAR 278
L+T K +IDILTP Q VHFLIAAAELHLRLH+ GK+ D +
Sbjct: 229 LRTLKGIIDILTPDQTVHFLIAAAELHLRLHELGKKMDTK 268
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224096129|ref|XP_002310543.1| predicted protein [Populus trichocarpa] gi|222853446|gb|EEE90993.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|688423|dbj|BAA05470.1| tumor-related protein [Nicotiana glauca x Nicotiana langsdorffii] | Back alignment and taxonomy information |
|---|
| >gi|356511199|ref|XP_003524316.1| PREDICTED: transcription factor TGA4-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449480891|ref|XP_004156023.1| PREDICTED: transcription factor TGA6-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|225438349|ref|XP_002273632.1| PREDICTED: transcription factor HBP-1b(c1) [Vitis vinifera] gi|296082614|emb|CBI21619.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|357520133|ref|XP_003630355.1| Transcription factor HBP-1b(c1) [Medicago truncatula] gi|355524377|gb|AET04831.1| Transcription factor HBP-1b(c1) [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|356565430|ref|XP_003550943.1| PREDICTED: LOW QUALITY PROTEIN: transcription factor TGA2-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|294610363|dbj|BAJ05339.1| hypothetical protein [Triticum aestivum] | Back alignment and taxonomy information |
|---|
| >gi|359493600|ref|XP_003634633.1| PREDICTED: transcription factor TGA4-like [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 287 | ||||||
| TAIR|locus:2086400 | 244 | AT3G14880 "AT3G14880" [Arabido | 0.540 | 0.635 | 0.347 | 6.4e-37 | |
| TAIR|locus:2124157 | 282 | AT4G18690 "AT4G18690" [Arabido | 0.832 | 0.847 | 0.284 | 1.8e-25 | |
| TAIR|locus:2152390 | 291 | DOG1 "AT5G45830" [Arabidopsis | 0.592 | 0.584 | 0.340 | 2.9e-23 | |
| TAIR|locus:2124117 | 232 | AT4G18650 "AT4G18650" [Arabido | 0.400 | 0.495 | 0.320 | 2.3e-18 | |
| TAIR|locus:2016620 | 225 | ZW2 [Arabidopsis thaliana (tax | 0.334 | 0.426 | 0.33 | 6.7e-18 | |
| TAIR|locus:2124152 | 235 | AT4G18680 "AT4G18680" [Arabido | 0.668 | 0.817 | 0.274 | 1.6e-15 | |
| TAIR|locus:2024412 | 230 | RAS1 "AT1G09950" [Arabidopsis | 0.452 | 0.565 | 0.287 | 4.4e-14 | |
| TAIR|locus:2124132 | 152 | AT4G18660 "AT4G18660" [Arabido | 0.449 | 0.848 | 0.302 | 3.3e-12 | |
| TAIR|locus:2030551 | 384 | TGA3 "AT1G22070" [Arabidopsis | 0.317 | 0.236 | 0.308 | 4.8e-09 | |
| TAIR|locus:2184078 | 364 | TGA4 "AT5G10030" [Arabidopsis | 0.358 | 0.282 | 0.311 | 1.2e-08 |
| TAIR|locus:2086400 AT3G14880 "AT3G14880" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 273 (101.2 bits), Expect = 6.4e-37, Sum P(2) = 6.4e-37
Identities = 57/164 (34%), Positives = 90/164 (54%)
Query: 30 SFQKFFESWLVEQNQHLQALIXXXXXXXXXXXXXXXXXXVRV--VVEHYEQYYRAKSRWV 87
SF+KF +SW+ + HL L V V+EH+ +Y+RAK
Sbjct: 9 SFKKFQQSWIEQLRNHLNHLRSAQNHHRNSATGDEERLREAVDRVMEHFREYHRAKWAAT 68
Query: 88 KLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLYSKSGLQLEGKLHDLIRGLSSGDLG 147
DV+ ++ W S+LE S W+GGWRP+ FHL+Y++S + E ++ D++RG +GDL
Sbjct: 69 DKDVIEVMASPWASALERSLQWVGGWRPTTLFHLVYTESSILFESRIVDILRGFRTGDLS 128
Query: 148 DLSPTQLT-------RVDGLQRVIIKEEKDLSEKLAKYQETVAD 184
DLSP+Q V LQ +KEE ++E+L+++Q+ +D
Sbjct: 129 DLSPSQFRVPFVKGRTVSELQCETVKEENAITEELSEWQDDASD 172
|
|
| TAIR|locus:2124157 AT4G18690 "AT4G18690" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2152390 DOG1 "AT5G45830" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2124117 AT4G18650 "AT4G18650" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2016620 ZW2 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2124152 AT4G18680 "AT4G18680" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2024412 RAS1 "AT1G09950" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2124132 AT4G18660 "AT4G18660" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2030551 TGA3 "AT1G22070" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2184078 TGA4 "AT5G10030" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| gw1.VII.1642.1 | hypothetical protein (244 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 287 | |||
| pfam14144 | 79 | pfam14144, DOG1, Seed dormancy control | 3e-24 |
| >gnl|CDD|206313 pfam14144, DOG1, Seed dormancy control | Back alignment and domain information |
|---|
Score = 93.0 bits (232), Expect = 3e-24
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 44 QHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSL 103
+ L L +A +S V V+ HY++Y+R KS K DV +L+ W++
Sbjct: 1 RLLAELRAALNAH---ASDDELRSLVDKVLAHYDEYFRLKSAAAKADVFHLLSGMWKTPA 57
Query: 104 EDSFLWIGGWRPSMAFHLLYSK 125
E FLWIGG+RPS LLYS
Sbjct: 58 ERCFLWIGGFRPSELLKLLYSL 79
|
This family of plant proteins appears to be a highly specific controller seed dormancy. Length = 79 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 287 | |||
| PF14144 | 80 | DOG1: Seed dormancy control | 100.0 | |
| PF13801 | 125 | Metal_resist: Heavy-metal resistance; PDB: 3EPV_C | 96.85 | |
| COG3678 | 160 | CpxP P pilus assembly/Cpx signaling pathway, perip | 96.13 | |
| PRK12750 | 170 | cpxP periplasmic repressor CpxP; Reviewed | 95.8 | |
| PRK10363 | 166 | cpxP periplasmic repressor CpxP; Reviewed | 94.29 | |
| PRK10455 | 161 | periplasmic protein; Reviewed | 93.95 | |
| PRK12751 | 162 | cpxP periplasmic stress adaptor protein CpxP; Revi | 93.86 | |
| PF07813 | 100 | LTXXQ: LTXXQ motif family protein; InterPro: IPR01 | 90.93 |
| >PF14144 DOG1: Seed dormancy control | Back alignment and domain information |
|---|
Probab=100.00 E-value=4.2e-35 Score=226.97 Aligned_cols=79 Identities=48% Similarity=0.907 Sum_probs=75.1
Q ss_pred HHHHHHHHHhhhhcCCCCCchHHHHHHHHHHhHHHHHHHHhhhhhcccccccCCCCCChhHHHHHHhcCCCchHHHHHHH
Q 023115 44 QHLQALISASKQQENNSSSSNFEEQVRVVVEHYEQYYRAKSRWVKLDVLGMLTPSWRSSLEDSFLWIGGWRPSMAFHLLY 123 (287)
Q Consensus 44 ~~l~eLr~Al~~~~~~~sd~eL~~LV~~~l~HY~~y~~~Ks~aA~~DV~~llsp~W~tplEr~fLWiGG~RPS~~l~Lly 123 (287)
|++.|||+|++++. .+|.+|+.||+++++||++||++|+.+|++|||++|+|+|+||+||||+||||||||++|||||
T Consensus 1 ~~l~eLr~al~~~~--~~~~~L~~lV~~~~~Hy~~y~~~K~~aa~~DV~~~~s~~W~sp~Er~flWiGG~RPS~~~~ll~ 78 (80)
T PF14144_consen 1 RQLNELRAALQSHA--DSDDELRSLVDKVMSHYDEYYRAKSAAAKADVFHLLSPPWKSPLERCFLWIGGWRPSELFKLLY 78 (80)
T ss_pred CcHHHHHHHHHhcC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHhCCCCCCHHHHHHHHHhcCCHHHHHHHHh
Confidence 57899999998665 4699999999999999999999999999999999999999999999999999999999999998
Q ss_pred h
Q 023115 124 S 124 (287)
Q Consensus 124 s 124 (287)
+
T Consensus 79 s 79 (80)
T PF14144_consen 79 S 79 (80)
T ss_pred c
Confidence 6
|
|
| >PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H | Back alignment and domain information |
|---|
| >COG3678 CpxP P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein [Intracellular trafficking and secretion / Cell motility and secretio / Signal transduction mechanisms / Inorganic ion transport and metabolism] | Back alignment and domain information |
|---|
| >PRK12750 cpxP periplasmic repressor CpxP; Reviewed | Back alignment and domain information |
|---|
| >PRK10363 cpxP periplasmic repressor CpxP; Reviewed | Back alignment and domain information |
|---|
| >PRK10455 periplasmic protein; Reviewed | Back alignment and domain information |
|---|
| >PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed | Back alignment and domain information |
|---|
| >PF07813 LTXXQ: LTXXQ motif family protein; InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT) | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 287 | |||
| 2y39_A | 118 | Nickel and cobalt resistance protein CNRR; metal b | 96.97 | |
| 3itf_A | 145 | Periplasmic adaptor protein CPXP; CPXR, CPXA, cpxr | 95.15 | |
| 3o39_A | 108 | Periplasmic protein related to spheroblast format; | 92.9 | |
| 3oeo_A | 138 | Spheroplast protein Y; LTXXQ, extracytoplasmic str | 91.35 | |
| 3lay_A | 175 | Zinc resistance-associated protein; salmonella typ | 89.82 |
| >2y39_A Nickel and cobalt resistance protein CNRR; metal binding protein; 1.41A {Cupriavidus metallidurans} PDB: 2y3b_A 2y3d_A 2y3g_A* 2y3h_A 3epv_A* | Back alignment and structure |
|---|
Probab=96.97 E-value=0.0089 Score=48.65 Aligned_cols=89 Identities=13% Similarity=0.120 Sum_probs=67.6
Q ss_pred CCcHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhhhccCCCCCCcchhhhhhHHHHHHHHHhHHHH
Q 023115 148 DLSPTQLTRVDGLQRVIIKEEKDLSEKLAKYQETVADSSLVELSHVVTELMSTNDDRQDSDRRLMDDQIESNFVTKEAKL 227 (287)
Q Consensus 148 ~LS~~Ql~~I~~Lq~~t~~~E~aLs~~ma~lQ~slad~~~~~~a~~~~~~~g~~~~~~~~~~~~~~~~m~~a~~~kl~~L 227 (287)
+||++|-..|+.++..-.....+|..+|......+++.+..+ .+ ++ -++..|+...-.
T Consensus 20 ~Lt~~Q~~~leaie~~fa~~r~~le~emRaan~~La~ai~~~--------~~--~~----------p~V~aaid~~h~-- 77 (118)
T 2y39_A 20 PLDANEREILELKEDAFAQRRREIETRLRAANGKLADAIAKN--------PA--WS----------PEVEAATQEVER-- 77 (118)
T ss_dssp CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--------CS--CC----------HHHHHHHHHHHH--
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--------cc--cC----------HHHHHHHHHHHH--
Confidence 399999999999999999999999999999999999887421 11 22 245455433322
Q ss_pred HHHHHHhhHHHHHH---HHHHHhhcCHHHHHHHHHHHHH
Q 023115 228 EEIFHKADDLRLKT---FKAVIDILTPIQAVHFLIAAAE 263 (287)
Q Consensus 228 ~~~l~qAD~LR~~T---L~~l~~ILTp~QAA~fL~A~~e 263 (287)
..-.|+..| +-+|..||||-|+..|=-.+.+
T Consensus 78 -----~mG~LQkeTi~HvfeMR~VLtPeQ~~~fd~~vv~ 111 (118)
T 2y39_A 78 -----AAGDLQRATLVHVFEMRAGLKPEHRPAYDRVLID 111 (118)
T ss_dssp -----HHHHHHHHHHHHHHHHHHHSCGGGHHHHHHHHHH
T ss_pred -----HHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 444588999 6667899999999999766554
|
| >3itf_A Periplasmic adaptor protein CPXP; CPXR, CPXA, cpxrap, CPX-pathway, envelope stress, transduction; HET: MSE; 1.45A {Escherichia coli str} PDB: 3qzc_A | Back alignment and structure |
|---|
| >3o39_A Periplasmic protein related to spheroblast format; alpha-helical, structural genomics, montreal-kingston bacter structural genomics initiative; HET: MSE; 2.60A {Escherichia coli} | Back alignment and structure |
|---|
| >3oeo_A Spheroplast protein Y; LTXXQ, extracytoplasmic stress response-related, signaling P; 2.70A {Escherichia coli} | Back alignment and structure |
|---|
| >3lay_A Zinc resistance-associated protein; salmonella typhimurium L structural genomics, center for structural genomics of INFE diseases; 2.70A {Salmonella enterica subsp} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00