Citrus Sinensis ID: 026455


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------24
MTSSENLPSVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFSNNNNNNNNSNDITYSVTETLACDFTPPFVAPETPTVSNVSGSDPETLPKRQRTAAVGGPNSGKITKRKARASKKTQTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSLPTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTLESWKVM
cccccccccccccccccccHHHHHHHHHHccHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccHHHHHHcccccccccccccccccccccccccEEEEcccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccc
ccccccccccccHHcccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHcccccccccccccccccccccccccEEEEccHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccc
mtssenlpsvdawSFRAAFADSWLSEAYARDTETLTKALQKTFSnnnnnnnnsndiTYSVTEtlacdftppfvapetptvsnvsgsdpetlpkrqrtaavggpnsgkitKRKARASKktqttfitadpaNFRQMVQQVTGVRFGSaqvamapvlkpepqrpggrlhgigggslptldtSAFLLDHHQqlgqapgagvgsvlgsghfdfqtslvcdgastglefdalsnfptleswkvm
mtssenlpsvdAWSFRAAFADSWLSEAYARDTETLTKALQKTfsnnnnnnnnsNDITYSVTETLACDFTPPFVAPETPtvsnvsgsdpetlpkrqrtaavggpnsgkitkrkaraskktqttfitadpanfRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSLPTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEfdalsnfptleswkvm
MTSSENLPSVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFsnnnnnnnnsnDITYSVTETLACDFTPPFVAPETPTVSNVSGSDPETLPKRQRTAAVGGPNSGKITKRKARASKKTQTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPggrlhgigggSLPTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTLESWKVM
**********DAWSFRAAFADSWLSEAYAR**************************TYSVTETLACDFTPPFV*************************************************FITADPANFRQMVQQVTGVRFGSAQVAM*****************IGGGSLPTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTL******
*********VDAWSFRAAFADSWLSEAYARDTETLTKALQ**********************************************************************************FITADPANFRQMVQQVTGVRF**************************GGSLPTLDTSAFLLD***************************************DALSNFPTLESWKVM
********SVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFSNNNNNNNNSNDITYSVTETLACDFTPPFVAPET******************************************QTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSLPTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTLESWKVM
********SVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFS***************************************************************************QTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSLPTLDTSAFLLDHHQ************************************DALSNFPTLESWK**
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MTSSENLPSVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFSNNNNNNNNSNDITYSVTETLACDFTPPFVAPETPTVSNVSGSDPETLPKRQRTAAVGGPNSGKITKRKARASKKTQTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSLPTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTLESWKVM
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query238
147789492219 hypothetical protein VITISV_034276 [Viti 0.890 0.968 0.583 5e-66
225440213219 PREDICTED: uncharacterized protein LOC10 0.890 0.968 0.583 6e-66
224137606255 predicted protein [Populus trichocarpa] 0.987 0.921 0.554 4e-59
224086753260 predicted protein [Populus trichocarpa] 0.987 0.903 0.558 2e-56
255560030250 conserved hypothetical protein [Ricinus 0.907 0.864 0.5 7e-54
356571872232 PREDICTED: uncharacterized protein LOC10 0.928 0.952 0.518 1e-48
356503344229 PREDICTED: uncharacterized protein LOC10 0.806 0.838 0.524 1e-43
449437978240 PREDICTED: uncharacterized protein LOC10 0.941 0.933 0.446 5e-42
15230149245 VQ motif-containing protein [Arabidopsis 0.928 0.902 0.458 2e-40
297817002242 VQ motif-containing protein [Arabidopsis 0.957 0.942 0.456 3e-40
>gi|147789492|emb|CAN76136.1| hypothetical protein VITISV_034276 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  256 bits (654), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 143/245 (58%), Positives = 167/245 (68%), Gaps = 33/245 (13%)

Query: 1   MTSSENLPSVDAWSFRAAFADSWLSEAYARDTETLTKALQKTFSNNNNNNNNSNDITYSV 60
           M SS+NL SVD W FR  FADSW+SEA+ARDTETLTKALQK+ SN+ +N           
Sbjct: 1   MASSDNLASVDPWVFRHTFADSWISEAFARDTETLTKALQKSISNHPDN----------- 49

Query: 61  TETLACDFTPPFV----APET---PTVSNVSGSDPETLPKRQRTAAVGGPNSGKITKRKA 113
              +ACD   PF+     PET   P+VS VSGSD ET PKR+   A+ G   GKITKRK+
Sbjct: 50  ---VACDTISPFLNDLLKPETAQTPSVSGVSGSDHETAPKRR--TAIPGAAGGKITKRKS 104

Query: 114 RASKKTQTTFITADPANFRQMVQQVTGVRFGSAQVAMAPVLKPEPQRPGGRLHGIGGGSL 173
           RASK++QTTFITADPANFRQMVQQVTGVRFG++Q+ + P+LKPEP+RPG RL G     L
Sbjct: 105 RASKRSQTTFITADPANFRQMVQQVTGVRFGNSQLPVNPILKPEPKRPGNRLLGC----L 160

Query: 174 PTLDTSAFLLDHHQQLGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLEFDALSNFPTLE 233
           PTLDTS FLLDHH Q    P          G   FQ  +V DG +TGLEF+   +FPTLE
Sbjct: 161 PTLDTSEFLLDHHHQQVMGPATAT-----QGAMTFQ-PVVADGGATGLEFETFPSFPTLE 214

Query: 234 SWKVM 238
           SWKVM
Sbjct: 215 SWKVM 219




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225440213|ref|XP_002283659.1| PREDICTED: uncharacterized protein LOC100255095 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224137606|ref|XP_002322599.1| predicted protein [Populus trichocarpa] gi|222867229|gb|EEF04360.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224086753|ref|XP_002307950.1| predicted protein [Populus trichocarpa] gi|222853926|gb|EEE91473.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255560030|ref|XP_002521033.1| conserved hypothetical protein [Ricinus communis] gi|223539736|gb|EEF41317.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356571872|ref|XP_003554095.1| PREDICTED: uncharacterized protein LOC100790912 [Glycine max] Back     alignment and taxonomy information
>gi|356503344|ref|XP_003520470.1| PREDICTED: uncharacterized protein LOC100782392 [Glycine max] Back     alignment and taxonomy information
>gi|449437978|ref|XP_004136767.1| PREDICTED: uncharacterized protein LOC101211486 [Cucumis sativus] gi|449525975|ref|XP_004169991.1| PREDICTED: uncharacterized LOC101211486 [Cucumis sativus] Back     alignment and taxonomy information
>gi|15230149|ref|NP_191247.1| VQ motif-containing protein [Arabidopsis thaliana] gi|9663007|emb|CAC00751.1| putative protein [Arabidopsis thaliana] gi|15028355|gb|AAK76654.1| unknown protein [Arabidopsis thaliana] gi|24030447|gb|AAN41377.1| unknown protein [Arabidopsis thaliana] gi|332646060|gb|AEE79581.1| VQ motif-containing protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297817002|ref|XP_002876384.1| VQ motif-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322222|gb|EFH52643.1| VQ motif-containing protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query238
TAIR|locus:2103590245 AT3G56880 "AT3G56880" [Arabido 0.953 0.926 0.453 2.8e-41
TAIR|locus:2063265238 CAMBP25 "AT2G41010" [Arabidops 0.924 0.924 0.414 7.4e-34
TAIR|locus:2129700193 AT4G15120 "AT4G15120" [Arabido 0.260 0.321 0.405 5.8e-05
TAIR|locus:2090389192 AT3G22160 "AT3G22160" [Arabido 0.205 0.255 0.446 7.8e-05
TAIR|locus:2103590 AT3G56880 "AT3G56880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 438 (159.2 bits), Expect = 2.8e-41, P = 2.8e-41
 Identities = 116/256 (45%), Positives = 147/256 (57%)

Query:     1 MTSSENLPSVDAWSFRAAF-ADSWL-SEAYARDTETLTKALQKTFXXXXXXXXXXXDITY 58
             M SSE L SVD WSFR  F  DSWL S++++ D++ L KAL ++               +
Sbjct:     1 MASSEGLASVDPWSFRQNFNIDSWLLSDSFSHDSDILAKALHRSISTSTESSPLSPSSFF 60

Query:    59 SVTETLACDFTPPFVAPETPTVSNVS-GSDPET-------LPKRQRTAAVGGPNSGKITK 110
               + T A DF+PP       T+SNVS GSDPE        L KR+R   V G   GK TK
Sbjct:    61 D-SSTAAVDFSPP------QTLSNVSFGSDPEIPAASALGLGKRKRGPGVSG---GKQTK 110

Query:   111 RKARAS-KKTQTTFITADPANFRQMVQQVTGVRF-GSAQVAMAPVLKPEPQR-----PXX 163
             R++R S KK+QTTFITAD ANFRQMVQQVTG +F GS+    AP++KPEP R     P  
Sbjct:   111 RRSRVSNKKSQTTFITADAANFRQMVQQVTGAKFLGSSNSIFAPIVKPEPHRLASRLPPS 170

Query:   164 XXXXXXXXSLPTLDTSAFLLDHHQQ-LGQAPGAGVGSVLGSGHFDFQTSLVCDGASTGLE 222
                     ++PTLDTS+FL +HHQ+ +    GA  GS          T+ V  G S+ +E
Sbjct:   171 CGNLDRSSAVPTLDTSSFLSNHHQENIITDLGAPTGSFHHQSSAGTTTANV-GGGSSAVE 229

Query:   223 FDALSNFPTLESWKVM 238
              D+  +FPTLESWKVM
Sbjct:   230 LDSYPSFPTLESWKVM 245




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2063265 CAMBP25 "AT2G41010" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2129700 AT4G15120 "AT4G15120" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2090389 AT3G22160 "AT3G22160" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00023331001
SubName- Full=Chromosome chr8 scaffold_29, whole genome shotgun sequence; (219 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query238
pfam0567831 pfam05678, VQ, VQ motif 2e-07
>gnl|CDD|114404 pfam05678, VQ, VQ motif Back     alignment and domain information
 Score = 45.5 bits (109), Expect = 2e-07
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 117 KKTQTTFITADPANFRQMVQQVTGVRFGSA 146
            +   T I  DPANFR +VQ++TG     A
Sbjct: 1   YRAPPTVIHTDPANFRALVQRLTGAPAAPA 30


This short motif is found in a variety of plant proteins. These proteins vary greatly in length and are mostly composed of low complexity regions. They all conserve a short motif FXhVQChTG, where X is any amino acid and h is a hydrophobic amino acid. The function of this motif is uncertain, however one protein in this family has been found to bind the SigA sigma factor. It would seem plausible that this motif is needed for this activity and that this whole family might be involved in modulating plastid sigma factors (Bateman A pers. obs.). Length = 31

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 238
PF0567831 VQ: VQ motif; InterPro: IPR008889 This short motif 99.01
>PF05678 VQ: VQ motif; InterPro: IPR008889 This short motif is found in a variety of plant proteins Back     alignment and domain information
Probab=99.01  E-value=2.9e-10  Score=74.03  Aligned_cols=29  Identities=38%  Similarity=0.646  Sum_probs=26.5

Q ss_pred             CCCCceEeecCchHHHHHHHHHhcCCCCC
Q 026455          117 KKTQTTFITADPANFRQMVQQVTGVRFGS  145 (238)
Q Consensus       117 rR~ptT~i~ad~anFRaMVQq~TG~p~~~  145 (238)
                      ++.+++||.+|++|||+||||+||.+..+
T Consensus         1 ~~~~p~vi~~d~~~Fr~lVQ~LTG~~~~~   29 (31)
T PF05678_consen    1 YRSPPTVIHTDPSNFRALVQRLTGAPSAP   29 (31)
T ss_pred             CCCCCEEEEeCHHHHHHHHHHhHCcCCCC
Confidence            46899999999999999999999999754



These proteins vary greatly in length and are mostly composed of low complexity regions. They all conserve a short motif FXhVQChTG, where X is any amino acid and h is a hydrophobic amino acid. The function of this motif is uncertain, however one protein in this family has been found to bind the SigA sigma factor Q9LDH1 from SWISSPROT. It would seem plausible that this motif is needed for this activity and that this whole family might be involved in modulating plastid sigma factors.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00