Citrus Sinensis ID: 030409


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------18
MGIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRDA
cccHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccc
cccHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccc
mgiglwfnKRIVEPLLQILrrgaepnqlAFSTALGitlglfpiCGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPflrfgealtggphfplnsDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRDA
mgiglwfnKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSaalsskkelhsyseiklkvrda
MGIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRDA
**IGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSS**********************
*GIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKIL****************************
MGIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRDA
*GIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFS***********************
oooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooHHHHHHHHHHHHHHHHHHHiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRDA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query178
225459985185 PREDICTED: uncharacterized protein LOC10 0.994 0.956 0.757 1e-72
388509988184 unknown [Lotus japonicus] 0.994 0.961 0.747 3e-71
388520271179 unknown [Lotus japonicus] 0.994 0.988 0.747 3e-71
351721344176 uncharacterized protein LOC100305584 [Gl 0.966 0.977 0.773 5e-71
388497588183 unknown [Medicago truncatula] 0.994 0.967 0.723 1e-69
449441776184 PREDICTED: uncharacterized protein LOC10 0.966 0.934 0.767 2e-69
255561476183 conserved hypothetical protein [Ricinus 0.994 0.967 0.723 3e-69
224074029183 predicted protein [Populus trichocarpa] 0.994 0.967 0.728 3e-68
224124400175 predicted protein [Populus trichocarpa] 0.966 0.982 0.720 4e-67
18413303183 uncharacterized protein [Arabidopsis tha 0.971 0.945 0.693 3e-63
>gi|225459985|ref|XP_002269541.1| PREDICTED: uncharacterized protein LOC100248177 [Vitis vinifera] gi|297734771|emb|CBI17005.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  277 bits (709), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 134/177 (75%), Positives = 151/177 (85%)

Query: 1   MGIGLWFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRS 60
           +G+G WFNK+IV+PLLQILRRGAEP QLAFS ALGITLG+FPICG  VFLCGMAIA L S
Sbjct: 8   IGVGPWFNKKIVDPLLQILRRGAEPKQLAFSAALGITLGVFPICGVTVFLCGMAIALLGS 67

Query: 61  RCHAPSVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLS 120
            CH+PSVMLANFIATPIELSLVVPFLRFGE ++GGPHFPL SDA+K+VLTG AS+EVLLS
Sbjct: 68  LCHSPSVMLANFIATPIELSLVVPFLRFGEVISGGPHFPLTSDAMKKVLTGHASREVLLS 127

Query: 121 IAHALLGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRD 177
           I HALLGWLVA PFIL  LY++ LP  K+LVRKFS    S KK +H + E+KLKVRD
Sbjct: 128 IVHALLGWLVAAPFILVILYIIFLPCCKLLVRKFSPVPSSPKKPVHPHMEVKLKVRD 184




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388509988|gb|AFK43060.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|388520271|gb|AFK48197.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|351721344|ref|NP_001237462.1| uncharacterized protein LOC100305584 [Glycine max] gi|255625983|gb|ACU13336.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|388497588|gb|AFK36860.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|449441776|ref|XP_004138658.1| PREDICTED: uncharacterized protein LOC101220662 [Cucumis sativus] gi|449525081|ref|XP_004169548.1| PREDICTED: uncharacterized protein LOC101224137 [Cucumis sativus] Back     alignment and taxonomy information
>gi|255561476|ref|XP_002521748.1| conserved hypothetical protein [Ricinus communis] gi|223538961|gb|EEF40558.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224074029|ref|XP_002304222.1| predicted protein [Populus trichocarpa] gi|222841654|gb|EEE79201.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224124400|ref|XP_002319322.1| predicted protein [Populus trichocarpa] gi|222857698|gb|EEE95245.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|18413303|ref|NP_567353.1| uncharacterized protein [Arabidopsis thaliana] gi|4538978|emb|CAB39766.1| hypothetical protein [Arabidopsis thaliana] gi|7267710|emb|CAB78137.1| hypothetical protein [Arabidopsis thaliana] gi|21555511|gb|AAM63876.1| unknown [Arabidopsis thaliana] gi|332657447|gb|AEE82847.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query178
TAIR|locus:2124690183 AT4G10140 "AT4G10140" [Arabido 0.971 0.945 0.693 3.9e-60
TAIR|locus:2007018175 AT1G33490 "AT1G33490" [Arabido 0.887 0.902 0.727 6.4e-60
TAIR|locus:2124690 AT4G10140 "AT4G10140" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 616 (221.9 bits), Expect = 3.9e-60, P = 3.9e-60
 Identities = 120/173 (69%), Positives = 135/173 (78%)

Query:     6 WFNKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAP 65
             WFNK+I +PLLQILRRG EP QLAFS ALGITLG+FPICG  V LCG+A A L S CHAP
Sbjct:    11 WFNKKITDPLLQILRRGTEPKQLAFSAALGITLGIFPICGVPVLLCGVAFALLGSSCHAP 70

Query:    66 SVMLANFIATPIELSLVVPFLRFGEALTGGPHFPLNSDALKRVLTGKASQEVLLSIAHAL 125
             +VMLAN IATP+EL+LVVPFLR GE +TGGPHFPL SDALK+V TG+ASQEV LSI +AL
Sbjct:    71 TVMLANIIATPVELALVVPFLRLGEKVTGGPHFPLTSDALKKVFTGQASQEVFLSIGNAL 130

Query:   126 LGWLVATPFILTALYLVLLPVFKILVRKFSSAALSSKKELHSYSEIKLKVRDA 178
             LGWLVATPF    LY+VLLP F ILVRKF   A + K      +E+  K RDA
Sbjct:   131 LGWLVATPFFFITLYVVLLPCFTILVRKFGGVASTQKLPTSMETELNPKPRDA 183




GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM
GO:0005794 "Golgi apparatus" evidence=IDA
GO:0015824 "proline transport" evidence=RCA
TAIR|locus:2007018 AT1G33490 "AT1G33490" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00032005001
SubName- Full=Chromosome chr18 scaffold_61, whole genome shotgun sequence; (185 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 178
PF09835154 DUF2062: Uncharacterized protein conserved in bact 99.95
TIGR03546154 conserved hypothetical protein TIGR03546. Members 99.92
COG3216184 Uncharacterized protein conserved in bacteria [Fun 99.89
PF09835154 DUF2062: Uncharacterized protein conserved in bact 95.62
PF01970 419 TctA: Tripartite tricarboxylate transporter TctA f 85.03
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function Back     alignment and domain information
Probab=99.95  E-value=4.3e-27  Score=185.67  Aligned_cols=145  Identities=22%  Similarity=0.374  Sum_probs=118.8

Q ss_pred             HHhhHHHHHHHHhcCCCHHHHHHHHHHHhhhhccccchHHHHHHHHHHHHHHhccCHHHHHHHHHhhhhhhHHHHHHH-H
Q 030409            8 NKRIVEPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGMAIAALRSRCHAPSVMLANFIATPIELSLVVPF-L   86 (178)
Q Consensus         8 ~~~i~~~ll~iLr~~~sP~~iA~s~AlGvfiGi~P~~G~~t~l~~~la~~l~~RlN~~aa~~~~~i~~Pl~~~li~pf-~   86 (178)
                      .|.+++.++++++++.||+++|.|+|+|+|+|++|++|+|+++|+++|+++  |+|+++++++++++||+|+++++++ +
T Consensus         2 ~R~~~~~~~~~l~~~~~p~~iA~g~AiG~fig~~P~~g~~~~l~~~la~~~--r~N~~aa~~~~~i~nPlt~~~i~~~~y   79 (154)
T PF09835_consen    2 KRRFRYRLLRLLRWGGSPHSIALGFAIGVFIGFLPIFGLQTVLAIALALLF--RLNKPAAILGTWISNPLTIPPIYPLSY   79 (154)
T ss_pred             ccHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHH--HccHHHHHHHHHHHhHHHHHHHHHHHH
Confidence            457888899999999999999999999999999999999999999999988  9999999999999999999999885 8


Q ss_pred             HHHHHHhcCCCCCCCHH---HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 030409           87 RFGEALTGGPHFPLNSD---ALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVFKILVRKF  154 (178)
Q Consensus        87 ~~G~~l~g~~~~~l~~~---~l~~~l~~~~~~~~~~~~~~Al~gwlv~ap~~~~~~Y~l~~p~~k~l~~k~  154 (178)
                      ++|++++|.+..++...   +...+-..+..+.+.......++|+++.+++.+++.|++.+++.++.++|-
T Consensus        80 ~vG~~ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~i~~~v~~~i~Y~l~~~~~~~~r~~r  150 (154)
T PF09835_consen   80 RVGSFLLGGPPEEFAALFEFDWSLMHWSDLLESLWEFGLPFLLGSLILGIVLGIISYFLVYFLVRKYRKRR  150 (154)
T ss_pred             HHHHHHhCCChhhhhhhcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999886111100   000000011123445556677899999999999999999999999997764



It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.

>TIGR03546 conserved hypothetical protein TIGR03546 Back     alignment and domain information
>COG3216 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function Back     alignment and domain information
>PF01970 TctA: Tripartite tricarboxylate transporter TctA family; InterPro: IPR002823 Members of this prokaryotic family have no known function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query178
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 6e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 38.7 bits (89), Expect = 6e-04
 Identities = 37/204 (18%), Positives = 59/204 (28%), Gaps = 77/204 (37%)

Query: 7   FNKRIV---EPLLQILRRGAEPNQLAFSTALGITLGLFPICGAAVFLCGM--------AI 55
           F K  V   +P L+ LR            AL   L L P     V + G+        A+
Sbjct: 126 FAKYNVSRLQPYLK-LR-----------QAL---LELRP--AKNVLIDGVLGSGKTWVAL 168

Query: 56  AALRS-----------------RCHAPSV---MLANFIATPIELSLVVPFLRFGEALTGG 95
               S                  C++P     ML   +       +   +    +  +  
Sbjct: 169 DVCLSYKVQCKMDFKIFWLNLKNCNSPETVLEMLQKLL-----YQIDPNWTSRSDHSSNI 223

Query: 96  PHFPLNS--DALKRVLTGKASQEVLLSIAHALLGWLVATPFILTALYLVLLPVF----KI 149
               ++S    L+R+L  K  +  LL + +      V       A        F    KI
Sbjct: 224 KL-RIHSIQAELRRLLKSKPYENCLLVLLN------VQNAKAWNA--------FNLSCKI 268

Query: 150 LV--RKFSSA-ALSSKKELHSYSE 170
           L+  R       LS+    H   +
Sbjct: 269 LLTTRFKQVTDFLSAATTTHISLD 292


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00