Citrus Sinensis ID: 026455
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 238 | ||||||
| 147789492 | 219 | hypothetical protein VITISV_034276 [Viti | 0.890 | 0.968 | 0.583 | 5e-66 | |
| 225440213 | 219 | PREDICTED: uncharacterized protein LOC10 | 0.890 | 0.968 | 0.583 | 6e-66 | |
| 224137606 | 255 | predicted protein [Populus trichocarpa] | 0.987 | 0.921 | 0.554 | 4e-59 | |
| 224086753 | 260 | predicted protein [Populus trichocarpa] | 0.987 | 0.903 | 0.558 | 2e-56 | |
| 255560030 | 250 | conserved hypothetical protein [Ricinus | 0.907 | 0.864 | 0.5 | 7e-54 | |
| 356571872 | 232 | PREDICTED: uncharacterized protein LOC10 | 0.928 | 0.952 | 0.518 | 1e-48 | |
| 356503344 | 229 | PREDICTED: uncharacterized protein LOC10 | 0.806 | 0.838 | 0.524 | 1e-43 | |
| 449437978 | 240 | PREDICTED: uncharacterized protein LOC10 | 0.941 | 0.933 | 0.446 | 5e-42 | |
| 15230149 | 245 | VQ motif-containing protein [Arabidopsis | 0.928 | 0.902 | 0.458 | 2e-40 | |
| 297817002 | 242 | VQ motif-containing protein [Arabidopsis | 0.957 | 0.942 | 0.456 | 3e-40 |
| >gi|147789492|emb|CAN76136.1| hypothetical protein VITISV_034276 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 256 bits (654), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 143/245 (58%), Positives = 167/245 (68%), Gaps = 33/245 (13%)
Query: 1 MTSSENLPSVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFSNNNNNNNNSNDITYSV 60
M SS+NL SVD W FR FADSW+SEA+ARDTETLTKALQK+ SN+ +N
Sbjct: 1 MASSDNLASVDPWVFRHTFADSWISEAFARDTETLTKALQKSISNHPDN----------- 49
Query: 61 TETLACDFTPPFV----APET---PTVSNVSGSDPETLPKRQRTAAVGGPNSGKITKRKA 113
+ACD PF+ PET P+VS VSGSD ET PKR+ A+ G GKITKRK+
Sbjct: 50 ---VACDTISPFLNDLLKPETAQTPSVSGVSGSDHETAPKRR--TAIPGAAGGKITKRKS 104
Query: 114 RASKKTQTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSL 173
RASK++QTTFITADPANFRQMVQQVTGVRFG++Q+ + P+LKPEP+RPG RL G L
Sbjct: 105 RASKRSQTTFITADPANFRQMVQQVTGVRFGNSQLPVNPILKPEPKRPGNRLLGC----L 160
Query: 174 PTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTLE 233
PTLDTS FLLDHH Q P G FQ +V DG +TGLEF+ +FPTLE
Sbjct: 161 PTLDTSEFLLDHHHQQVMGPATAT-----QGAMTFQ-PVVADGGATGLEFETFPSFPTLE 214
Query: 234 SWKVM 238
SWKVM
Sbjct: 215 SWKVM 219
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225440213|ref|XP_002283659.1| PREDICTED: uncharacterized protein LOC100255095 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224137606|ref|XP_002322599.1| predicted protein [Populus trichocarpa] gi|222867229|gb|EEF04360.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224086753|ref|XP_002307950.1| predicted protein [Populus trichocarpa] gi|222853926|gb|EEE91473.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|255560030|ref|XP_002521033.1| conserved hypothetical protein [Ricinus communis] gi|223539736|gb|EEF41317.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|356571872|ref|XP_003554095.1| PREDICTED: uncharacterized protein LOC100790912 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356503344|ref|XP_003520470.1| PREDICTED: uncharacterized protein LOC100782392 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449437978|ref|XP_004136767.1| PREDICTED: uncharacterized protein LOC101211486 [Cucumis sativus] gi|449525975|ref|XP_004169991.1| PREDICTED: uncharacterized LOC101211486 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|15230149|ref|NP_191247.1| VQ motif-containing protein [Arabidopsis thaliana] gi|9663007|emb|CAC00751.1| putative protein [Arabidopsis thaliana] gi|15028355|gb|AAK76654.1| unknown protein [Arabidopsis thaliana] gi|24030447|gb|AAN41377.1| unknown protein [Arabidopsis thaliana] gi|332646060|gb|AEE79581.1| VQ motif-containing protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297817002|ref|XP_002876384.1| VQ motif-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322222|gb|EFH52643.1| VQ motif-containing protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 238 | ||||||
| TAIR|locus:2103590 | 245 | AT3G56880 "AT3G56880" [Arabido | 0.953 | 0.926 | 0.453 | 2.8e-41 | |
| TAIR|locus:2063265 | 238 | CAMBP25 "AT2G41010" [Arabidops | 0.924 | 0.924 | 0.414 | 7.4e-34 | |
| TAIR|locus:2129700 | 193 | AT4G15120 "AT4G15120" [Arabido | 0.260 | 0.321 | 0.405 | 5.8e-05 | |
| TAIR|locus:2090389 | 192 | AT3G22160 "AT3G22160" [Arabido | 0.205 | 0.255 | 0.446 | 7.8e-05 |
| TAIR|locus:2103590 AT3G56880 "AT3G56880" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 438 (159.2 bits), Expect = 2.8e-41, P = 2.8e-41
Identities = 116/256 (45%), Positives = 147/256 (57%)
Query: 1 MTSSENLPSVDAWSFRAAF-ADSWL-SEAYARDTETLTKALQKTFXXXXXXXXXXXDITY 58
M SSE L SVD WSFR F DSWL S++++ D++ L KAL ++ +
Sbjct: 1 MASSEGLASVDPWSFRQNFNIDSWLLSDSFSHDSDILAKALHRSISTSTESSPLSPSSFF 60
Query: 59 SVTETLACDFTPPFVAPETPTVSNVS-GSDPET-------LPKRQRTAAVGGPNSGKITK 110
+ T A DF+PP T+SNVS GSDPE L KR+R V G GK TK
Sbjct: 61 D-SSTAAVDFSPP------QTLSNVSFGSDPEIPAASALGLGKRKRGPGVSG---GKQTK 110
Query: 111 RKARAS-KKTQTTFITADPANFRQMVQQVTGVRF-GSAQVAMAPVLKPEPQR-----PXX 163
R++R S KK+QTTFITAD ANFRQMVQQVTG +F GS+ AP++KPEP R P
Sbjct: 111 RRSRVSNKKSQTTFITADAANFRQMVQQVTGAKFLGSSNSIFAPIVKPEPHRLASRLPPS 170
Query: 164 XXXXXXXXSLPTLDTSAFLLDHHQQ-LGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLE 222
++PTLDTS+FL +HHQ+ + GA GS T+ V G S+ +E
Sbjct: 171 CGNLDRSSAVPTLDTSSFLSNHHQENIITDLGAPTGSFHHQSSAGTTTANV-GGGSSAVE 229
Query: 223 FDALSNFPTLESWKVM 238
D+ +FPTLESWKVM
Sbjct: 230 LDSYPSFPTLESWKVM 245
|
|
| TAIR|locus:2063265 CAMBP25 "AT2G41010" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2129700 AT4G15120 "AT4G15120" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2090389 AT3G22160 "AT3G22160" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00023331001 | SubName- Full=Chromosome chr8 scaffold_29, whole genome shotgun sequence; (219 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 238 | |||
| pfam05678 | 31 | pfam05678, VQ, VQ motif | 2e-07 |
| >gnl|CDD|114404 pfam05678, VQ, VQ motif | Back alignment and domain information |
|---|
Score = 45.5 bits (109), Expect = 2e-07
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 117 KKTQTTFITADPANFRQMVQQVTGVRFGSA 146
+ T I DPANFR +VQ++TG A
Sbjct: 1 YRAPPTVIHTDPANFRALVQRLTGAPAAPA 30
|
This short motif is found in a variety of plant proteins. These proteins vary greatly in length and are mostly composed of low complexity regions. They all conserve a short motif FXhVQChTG, where X is any amino acid and h is a hydrophobic amino acid. The function of this motif is uncertain, however one protein in this family has been found to bind the SigA sigma factor. It would seem plausible that this motif is needed for this activity and that this whole family might be involved in modulating plastid sigma factors (Bateman A pers. obs.). Length = 31 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 238 | |||
| PF05678 | 31 | VQ: VQ motif; InterPro: IPR008889 This short motif | 99.01 |
| >PF05678 VQ: VQ motif; InterPro: IPR008889 This short motif is found in a variety of plant proteins | Back alignment and domain information |
|---|
Probab=99.01 E-value=2.9e-10 Score=74.03 Aligned_cols=29 Identities=38% Similarity=0.646 Sum_probs=26.5
Q ss_pred CCCCceEeecCchHHHHHHHHHhcCCCCC
Q 026455 117 KKTQTTFITADPANFRQMVQQVTGVRFGS 145 (238)
Q Consensus 117 rR~ptT~i~ad~anFRaMVQq~TG~p~~~ 145 (238)
++.+++||.+|++|||+||||+||.+..+
T Consensus 1 ~~~~p~vi~~d~~~Fr~lVQ~LTG~~~~~ 29 (31)
T PF05678_consen 1 YRSPPTVIHTDPSNFRALVQRLTGAPSAP 29 (31)
T ss_pred CCCCCEEEEeCHHHHHHHHHHhHCcCCCC
Confidence 46899999999999999999999999754
|
These proteins vary greatly in length and are mostly composed of low complexity regions. They all conserve a short motif FXhVQChTG, where X is any amino acid and h is a hydrophobic amino acid. The function of this motif is uncertain, however one protein in this family has been found to bind the SigA sigma factor Q9LDH1 from SWISSPROT. It would seem plausible that this motif is needed for this activity and that this whole family might be involved in modulating plastid sigma factors. |
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00