Citrus Sinensis ID: 030352


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------18
MSIEQPQPSQNLNQLMLQQSSGNLSFSSSNLSKDDEEMSRSALTTFRAKEEEIEKKKLEVREKVQAHLGRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCHRKENTRKPLKPLMKRTRRKYS
ccccccccccHHHHHHHHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHccEEEEHHHHHHHHHHHHHHccccccccccccccccccccccccHHHHHHHHHHcc
cccccccccHHHHHHHHHcccccEccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHccccccHcHHHcHcHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccHHHHHHHHHcc
msieqpqpsqnLNQLMLQQssgnlsfsssnlskddeEMSRSALTTFRAKEEEIEKKKLEVREKVQAHLGRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNkelkplghtcqkkvkfngiflCLQWTIHVYWFFFSlgqlwhpmktydnvchrkentrkplkplmkrtrrkys
msieqpqpsqNLNQLMLQQSSGNLSFSSsnlskddeEMSRSALTtfrakeeeiekkklevrekvqahlgrveeetkrLATIREElealadpmrkeVAVVRKKIDsvnkelkplghtcqkkVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNvchrkentrkplkplmkrtrrkys
MSIEqpqpsqnlnqlmlqqssgnlsfsssnlsKDDEEMSRSALTTFRAkeeeiekkklevrekVQAHLGRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCHRKENTRKPLKPLMKRTRRKYS
**********************************************************************************************EVAVVRKKIDSVNKELKPLGHTCQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCH*********************
****************************************************************************RLATIREELEALADPMRKEVAVVRKKIDSVNKELKP****CQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCHRK*******************
*********QNLNQLMLQQS**********************LTTFRAKEEEIEKKKLEVREKVQAHLGRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCHRKENTRKPLKPL*********
*******************************************TTFRAKEEEIEKKKLEVREKVQAHLGRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCHRKENTRKPLKPLMKRTRR***
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSIEQPQPSQNLNQLMLQQSSGNLSFSSSNLSKDDEEMSRSALTTFRAKEEEIEKKKLEVREKVQAHLGRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVKFNGIFLCLQWTIHVYWFFFSLGQLWHPMKTYDNVCHRKENTRKPLKPLMKRTRRKYS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query179
351723393164 uncharacterized protein LOC100499923 [Gl 0.631 0.689 0.824 4e-44
255637447164 unknown [Glycine max] 0.631 0.689 0.815 1e-43
351724745167 uncharacterized protein LOC100527015 [Gl 0.631 0.676 0.794 4e-42
255595013149 conserved hypothetical protein [Ricinus 0.592 0.711 0.841 8e-42
388511961166 unknown [Lotus japonicus] 0.648 0.698 0.771 1e-41
449440207167 PREDICTED: uncharacterized protein LOC10 0.569 0.610 0.854 1e-40
79324461192 uncharacterized protein [Arabidopsis tha 0.592 0.552 0.724 1e-37
297823455191 hypothetical protein ARALYDRAFT_482617 [ 0.586 0.549 0.722 4e-37
18404123195 uncharacterized protein [Arabidopsis tha 0.586 0.538 0.722 4e-37
21592509195 unknown [Arabidopsis thaliana] 0.586 0.538 0.722 5e-37
>gi|351723393|ref|NP_001238045.1| uncharacterized protein LOC100499923 [Glycine max] gi|255627713|gb|ACU14201.1| unknown [Glycine max] Back     alignment and taxonomy information
 Score =  182 bits (463), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 94/114 (82%), Positives = 106/114 (92%), Gaps = 1/114 (0%)

Query: 9   SQNLNQLMLQQSSGNLSFSSSNLSKDDEEMSRSALTTFRAKEEEIEKKKLEVREKVQAHL 68
           S+N N  ++ Q+SG+LSFSS +LSK+DEEMSRSAL+TFRAKEEEIE+KK+EVREKVQ  L
Sbjct: 3   SENQNNQLVVQNSGSLSFSS-HLSKEDEEMSRSALSTFRAKEEEIERKKMEVREKVQLQL 61

Query: 69  GRVEEETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVK 122
           GRVEEETKRLATIREELEALADPMRKEVA+VRK+IDSVNKELKPLGHTCQKK K
Sbjct: 62  GRVEEETKRLATIREELEALADPMRKEVALVRKRIDSVNKELKPLGHTCQKKEK 115




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255637447|gb|ACU19051.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|351724745|ref|NP_001237324.1| uncharacterized protein LOC100527015 [Glycine max] gi|255631368|gb|ACU16051.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|255595013|ref|XP_002536212.1| conserved hypothetical protein [Ricinus communis] gi|223520454|gb|EEF26171.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|388511961|gb|AFK44042.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|449440207|ref|XP_004137876.1| PREDICTED: uncharacterized protein LOC101210169 isoform 1 [Cucumis sativus] gi|449440209|ref|XP_004137877.1| PREDICTED: uncharacterized protein LOC101210169 isoform 2 [Cucumis sativus] gi|449516930|ref|XP_004165499.1| PREDICTED: uncharacterized protein LOC101227916 [Cucumis sativus] Back     alignment and taxonomy information
>gi|79324461|ref|NP_001031494.1| uncharacterized protein [Arabidopsis thaliana] gi|330254157|gb|AEC09251.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297823455|ref|XP_002879610.1| hypothetical protein ARALYDRAFT_482617 [Arabidopsis lyrata subsp. lyrata] gi|297325449|gb|EFH55869.1| hypothetical protein ARALYDRAFT_482617 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18404123|ref|NP_565846.1| uncharacterized protein [Arabidopsis thaliana] gi|20197926|gb|AAD24625.2| expressed protein [Arabidopsis thaliana] gi|22135968|gb|AAM91566.1| unknown protein [Arabidopsis thaliana] gi|23198268|gb|AAN15661.1| unknown protein [Arabidopsis thaliana] gi|330254156|gb|AEC09250.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21592509|gb|AAM64459.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query179
TAIR|locus:2100083178 AT3G09980 "AT3G09980" [Arabido 0.491 0.494 0.727 1.2e-26
TAIR|locus:2044883195 AT2G36410 [Arabidopsis thalian 0.491 0.451 0.670 3.6e-25
TAIR|locus:2085231180 AT3G52920 [Arabidopsis thalian 0.491 0.488 0.670 6.8e-24
TAIR|locus:2144618173 AT5G03660 "AT5G03660" [Arabido 0.491 0.508 0.602 9e-22
TAIR|locus:2041985174 AT2G27740 "AT2G27740" [Arabido 0.480 0.494 0.511 5.1e-19
TAIR|locus:2085206164 AT3G52900 "AT3G52900" [Arabido 0.497 0.542 0.438 6.2e-14
TAIR|locus:1009023152155 AT2G36355 "AT2G36355" [Arabido 0.318 0.367 0.526 5.8e-11
TAIR|locus:2100083 AT3G09980 "AT3G09980" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 300 (110.7 bits), Expect = 1.2e-26, P = 1.2e-26
 Identities = 64/88 (72%), Positives = 68/88 (77%)

Query:    33 KDDEEMSRSALTTFRAXXXXXXXXXXXXXXXVQAHLGRVEEETKRLATIREELEALADPM 92
             K+DEEMSR+AL+ FRA               VQA LGRVEEETKRLA IREELE LADPM
Sbjct:    40 KEDEEMSRTALSAFRAKEEEIEKKKMEIRERVQAQLGRVEEETKRLALIREELEGLADPM 99

Query:    93 RKEVAVVRKKIDSVNKELKPLGHTCQKK 120
             RKEVA+VRKKIDSVNKELKPLGHT QKK
Sbjct:   100 RKEVAMVRKKIDSVNKELKPLGHTVQKK 127




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0009733 "response to auxin stimulus" evidence=RCA
GO:0005886 "plasma membrane" evidence=IDA
TAIR|locus:2044883 AT2G36410 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2085231 AT3G52920 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2144618 AT5G03660 "AT5G03660" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2041985 AT2G27740 "AT2G27740" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2085206 AT3G52900 "AT3G52900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:1009023152 AT2G36355 "AT2G36355" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query179
pfam04949160 pfam04949, Transcrip_act, Transcriptional activato 1e-46
TIGR02680 1353 TIGR02680, TIGR02680, TIGR02680 family protein 1e-04
>gnl|CDD|191136 pfam04949, Transcrip_act, Transcriptional activator Back     alignment and domain information
 Score =  150 bits (379), Expect = 1e-46
 Identities = 75/110 (68%), Positives = 94/110 (85%), Gaps = 1/110 (0%)

Query: 14  QLMLQQSSGNLSFS-SSNLSKDDEEMSRSALTTFRAKEEEIEKKKLEVREKVQAHLGRVE 72
           Q +  + SG++SF  S  + + DEE+SRSAL+ F+AKEEEIE++K+EVRE+VQA LGRVE
Sbjct: 3   QRLRVKHSGSISFGGSPMMDEKDEELSRSALSLFKAKEEEIERRKMEVRERVQAQLGRVE 62

Query: 73  EETKRLATIREELEALADPMRKEVAVVRKKIDSVNKELKPLGHTCQKKVK 122
           EETKRLA IREELE +ADPMRKEV+++RK+IDSVN+ELKPLG TCQKK K
Sbjct: 63  EETKRLAEIREELEGMADPMRKEVSMIRKRIDSVNRELKPLGQTCQKKEK 112


This family of proteins may act as a transcriptional activator. It plays a role in stress response in plants. Length = 160

>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 179
PF04949159 Transcrip_act: Transcriptional activator; InterPro 100.0
TIGR01837118 PHA_granule_1 poly(hydroxyalkanoate) granule-assoc 95.62
PF05597132 Phasin: Poly(hydroxyalcanoate) granule associated 95.37
PF10805106 DUF2730: Protein of unknown function (DUF2730); In 90.18
PF1024188 KxDL: Uncharacterized conserved protein; InterPro: 81.01
TIGR00763 775 lon ATP-dependent protease La. This protein is ind 80.49
PF1429361 YWFCY: YWFCY protein 80.48
>PF04949 Transcrip_act: Transcriptional activator; InterPro: IPR007033 Golgins are a family of coiled-coil proteins associated with the Golgi apparatus necessary for tethering events in membrane fusion and as structural supports for Golgi cisternae [] Back     alignment and domain information
Probab=100.00  E-value=4.5e-52  Score=334.95  Aligned_cols=108  Identities=74%  Similarity=1.029  Sum_probs=104.1

Q ss_pred             hcccCccccccCCCC-CchhHHhhHHHHhhhhhhHHHHHHHHHHHHHHHHHHhchhHHHHhhHHHHHHHHHhccchhHHH
Q 030352           17 LQQSSGNLSFSSSNL-SKDDEEMSRSALTTFRAKEEEIEKKKLEVREKVQAHLGRVEEETKRLATIREELEALADPMRKE   95 (179)
Q Consensus        17 ~~~~sg~ls~~~s~~-~~~eEEmsRsALS~FrAKEEEIERKKmEVReKVqaqLGRvEEETKRLA~IREELE~laDPmRKE   95 (179)
                      +++++|+++++|+++ ++++|+|||||+|+|+||||||||||||||||||+|||||||||||||+||+|||+|+||||||
T Consensus         6 ~~~~~~~~~~~g~~~~~~~~e~~s~sals~f~AkEeeIErkKmeVrekVq~~LgrveEetkrLa~ireeLE~l~dP~RkE   85 (159)
T PF04949_consen    6 RVKNSGSISFNGSSMMDDEDEEMSRSALSAFRAKEEEIERKKMEVREKVQAQLGRVEEETKRLAEIREELEVLADPMRKE   85 (159)
T ss_pred             hcccCCCCCCCCCcccchhHHHhhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhhccchHHH
Confidence            478999999988777 7889999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHhhhhhcccccchhhhhhhhhhc
Q 030352           96 VAVVRKKIDSVNKELKPLGHTCQKKVKFN  124 (179)
Q Consensus        96 Va~VRKKID~vNReLKPLGqsCQKKEke~  124 (179)
                      |++||||||+|||||||||++|||||||-
T Consensus        86 v~~vRkkID~vNreLkpl~~~cqKKEkEy  114 (159)
T PF04949_consen   86 VEMVRKKIDSVNRELKPLGQSCQKKEKEY  114 (159)
T ss_pred             HHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Confidence            99999999999999999999999999984



This entry represents proteins annotated as RAB6-interacting golgins.

>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein Back     alignment and domain information
>PF05597 Phasin: Poly(hydroxyalcanoate) granule associated protein (phasin); InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material Back     alignment and domain information
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins Back     alignment and domain information
>PF10241 KxDL: Uncharacterized conserved protein; InterPro: IPR019371 This entry represents a conserved region of 80 residues which defines a family of short proteins Back     alignment and domain information
>TIGR00763 lon ATP-dependent protease La Back     alignment and domain information
>PF14293 YWFCY: YWFCY protein Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query179
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 43.3 bits (101), Expect = 2e-05
 Identities = 29/193 (15%), Positives = 59/193 (30%), Gaps = 61/193 (31%)

Query: 32  SKDDEEMSRSALTTFRAKEEEIEKKKLEVREKVQAHLG----RVEEETKRLATIREELEA 87
           SKD    +     T  +K+EE+ +K +E  E ++ +       ++ E ++ + +      
Sbjct: 57  SKDAVSGTLRLFWTLLSKQEEMVQKFVE--EVLRINYKFLMSPIKTEQRQPSMMTRMYIE 114

Query: 88  LADPMRKEVAVVRK----------KIDSVNKELKP-----------LGHTC-------QK 119
             D +  +  V  K          K+     EL+P            G T          
Sbjct: 115 QRDRLYNDNQVFAKYNVSRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSY 174

Query: 120 KV--KFNGIFLCLQWTIHVYWFFFSLGQ-------------LWHPM-KTYDNVCHRKENT 163
           KV  K +           ++W   +L               L + +   + +      N 
Sbjct: 175 KVQCKMDF---------KIFW--LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNI 223

Query: 164 RKPLKPLMKRTRR 176
           +  +  +    RR
Sbjct: 224 KLRIHSIQAELRR 236


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00