Citrus Sinensis ID: 018368


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------
MSIFFCWVAYLLLANSTFCASQMDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHHPQQHVHQLHHVSEAAYSREMHTGDALPVSPGASEAAKPRRYKRAKEPKVLSPNKKTAKSPRKVKRENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQSTASKSDWKGQVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK
ccHHHHHHHHHHHHccHHHHHHHcccccccccccHHHHHHHHHcHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccccHHHHHHcccccEEEEc
ccHHHHHHHHHHHHHccccHHHcccccccccccccccccccccccHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHccccccccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccEEEEEEEEccccccccEEcccccccccEEcccccccccccccEEEcccccccccccccEEccccccHHHHHHHHHHHHHccccccccccHHHHHHHccccEEEEEc
MSIFFCWVAYLLLANSTFcasqmddgghrengrhkADQYKAAQgqwlmhhqpSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILERDNAIASLQYREnslggnmsscppgcqisrgvkhmhhpqqhvhqLHHVseaaysremhtgdalpvspgaseaakprrykrakepkvlspnkktaksprkvkrenedlnkvvfgkpsewksvqdldggdddvnkqstasksdwkgqvlglnqvtfdestmpppacsctgvlrqcykwgnggwqsaccttslsmyplpavpnkrharvggrkmsGSAFNKLLTRLAAeghdlshpvdlkdhwakhgtnryitik
MSIFFCWVAYLLLANSTFCASQMDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHHPQQHVHQLHHVSEAAYSREMHTGDALpvspgaseaakprrykrakepkvlspnkktaksprkvkrenedlnkvvfgkpsewksvqdldggddDVNKQstasksdwkgqVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDlshpvdlkdhwakhgtnryitik
MSIFFCWVAYLLLANSTFCASQMDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSLGGNMSSCPPGCQISRGvkhmhhpqqhvhqlhhvSEAAYSREMHTGDALPVSPGASEAAKPRRYKRAKEPKVLSPNKKTAKSPRKVKRENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQSTASKSDWKGQVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK
**IFFCWVAYLLLANSTFCA*************************WL****************************************AFLQRDTAIAERNNAILERDNAIASL***************************************************************************************************************************************GQVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVP***************AFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYI***
*SIFFCWVAYLLLANSTF******************************************************************************************************************************************************************************************************************************************GLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPA***************GSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK
MSIFFCWVAYLLLANSTFCASQMDD*****************QGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHHPQQHVHQLHHVSEAAYSREMHTGDALPV*************************************ENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQSTASKSDWKGQVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK
*SIFFCWVAYLLLANSTFC*************RHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHHPQQHVHQLHHVSEAAYSREMHTGDALPVSPG*************************************************************************WKGQVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSIFFCWVAYLLLANSTFCASQMDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHHPQQHVHQLHHVSEAAYSREMHTGDALPVSPGASEAAKPRRYKRAKEPKVLSPNKKTAKSPRKVKRENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQSTASKSDWKGQVLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query357 2.2.26 [Sep-21-2011]
Q8L999342 Protein BASIC PENTACYSTEI yes no 0.904 0.944 0.689 1e-127
Q5VSA8331 Barley B recombinant-like yes no 0.893 0.963 0.536 1e-90
Q8S8C6296 Protein BASIC PENTACYSTEI no no 0.795 0.959 0.461 2e-76
F4JUI3283 Protein BASIC PENTACYSTEI no no 0.775 0.978 0.469 7e-73
O82286226 Protein BASIC PENTACYSTEI no no 0.322 0.508 0.591 2e-37
Q8GUC3350 Protein Barley B recombin N/A no 0.364 0.371 0.519 2e-35
Q9C9X6269 Protein BASIC PENTACYSTEI no no 0.291 0.386 0.586 3e-35
P0DH89341 Barley B recombinant-like no no 0.448 0.469 0.408 1e-34
P0DH88341 Barley B recombinant-like no no 0.448 0.469 0.408 1e-34
Q9LDE2279 Protein BASIC PENTACYSTEI no no 0.296 0.379 0.556 1e-33
>sp|Q8L999|BPC6_ARATH Protein BASIC PENTACYSTEINE6 OS=Arabidopsis thaliana GN=BPC6 PE=1 SV=1 Back     alignment and function desciption
 Score =  454 bits (1167), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 244/354 (68%), Positives = 281/354 (79%), Gaps = 31/354 (8%)

Query: 23  MDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAI 82
           MDDGGHRENGRHKA    A QGQWLM HQPSMKQ+M+I+AERDAA+QERNLA SEKKAA+
Sbjct: 1   MDDGGHRENGRHKA----AVQGQWLMQHQPSMKQVMSIIAERDAAIQERNLAISEKKAAV 56

Query: 83  AERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSL----GGNMSSCPPGCQISRGVK 138
           AERDMAFLQRDTAIAERNNAI+ERD+A+ +LQYRENS+      NMS+CPPGCQISRGVK
Sbjct: 57  AERDMAFLQRDTAIAERNNAIMERDSALTALQYRENSMVTAPAANMSACPPGCQISRGVK 116

Query: 139 HMHHPQQHVHQ----LHHVSEAAY-SREMHTGDALPVSPGAS---EAAKPRRYKRAKEPK 190
           H+HHP  H H     +  ++E AY +REM   D LP SP A    E+AKP+R KR   PK
Sbjct: 117 HLHHPHMHHHHQQHHIPQLTENAYETREMEPNDGLPTSPPAGSTLESAKPKRGKRV-NPK 175

Query: 191 VLS---PNKKTAKSPRKVKRENED-LNKVVFGKPSEWKSVQDLDGGDDDVNKQS-TASKS 245
             +    NK+  K+ RKVK+E+ED LNK++F K +        D  D+D +K     SKS
Sbjct: 176 ATTQTAANKRGPKNQRKVKKESEDDLNKIMFVKTTH-------DYTDEDSSKHILIGSKS 228

Query: 246 DWKGQ-VLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVP 304
           DWK Q ++GLNQV +DE+TMPPP CSCTGVLRQCYKWGNGGWQS+CCTT+LSMYPLPA+P
Sbjct: 229 DWKSQEMVGLNQVVYDETTMPPPVCSCTGVLRQCYKWGNGGWQSSCCTTTLSMYPLPALP 288

Query: 305 NKRHARVGGRKMSGSAFNKLLTRLAAEG-HDLSHPVDLKDHWAKHGTNRYITIK 357
           NKRHARVGGRKMSGSAFNKLL+RLAAEG HDLS+PVDLKDHWAKHGTNRYITIK
Sbjct: 289 NKRHARVGGRKMSGSAFNKLLSRLAAEGHHDLSNPVDLKDHWAKHGTNRYITIK 342




Transcriptional regulator that specifically binds to GA-rich elements (GAGA-repeats) present in regulatory sequences of genes involved in developmental processes.
Arabidopsis thaliana (taxid: 3702)
>sp|Q5VSA8|BBRD_ORYSJ Barley B recombinant-like protein D OS=Oryza sativa subsp. japonica GN=Os06g0130600 PE=2 SV=1 Back     alignment and function description
>sp|Q8S8C6|BPC4_ARATH Protein BASIC PENTACYSTEINE4 OS=Arabidopsis thaliana GN=BPC4 PE=1 SV=1 Back     alignment and function description
>sp|F4JUI3|BPC5_ARATH Protein BASIC PENTACYSTEINE5 OS=Arabidopsis thaliana GN=BPC5 PE=1 SV=1 Back     alignment and function description
>sp|O82286|BPC7_ARATH Protein BASIC PENTACYSTEINE7 OS=Arabidopsis thaliana GN=BPC7 PE=1 SV=1 Back     alignment and function description
>sp|Q8GUC3|BBR_HORVU Protein Barley B recombinant OS=Hordeum vulgare GN=BBR PE=1 SV=1 Back     alignment and function description
>sp|Q9C9X6|BPC3_ARATH Protein BASIC PENTACYSTEINE3 OS=Arabidopsis thaliana GN=BPC3 PE=1 SV=1 Back     alignment and function description
>sp|P0DH89|BBRB_ORYSJ Barley B recombinant-like protein B OS=Oryza sativa subsp. japonica GN=Os10g0115200 PE=2 SV=1 Back     alignment and function description
>sp|P0DH88|BBRA_ORYSJ Barley B recombinant-like protein A OS=Oryza sativa subsp. japonica GN=Os10g0114500 PE=2 SV=1 Back     alignment and function description
>sp|Q9LDE2|BPC2_ARATH Protein BASIC PENTACYSTEINE2 OS=Arabidopsis thaliana GN=BPC2 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query357
255538854339 conserved hypothetical protein [Ricinus 0.938 0.988 0.805 1e-159
363807974338 uncharacterized protein LOC100783502 [Gl 0.938 0.991 0.807 1e-159
225458065337 PREDICTED: protein BASIC PENTACYSTEINE6 0.938 0.994 0.824 1e-154
363808282336 uncharacterized protein LOC100809369 [Gl 0.932 0.991 0.766 1e-149
357438223340 GAGA-binding transcriptional activator [ 0.927 0.973 0.752 1e-144
224082960335 predicted protein [Populus trichocarpa] 0.932 0.994 0.768 1e-142
449469901338 PREDICTED: protein BASIC PENTACYSTEINE6- 0.935 0.988 0.758 1e-141
224066007336 predicted protein [Populus trichocarpa] 0.932 0.991 0.775 1e-138
118723368318 GAGA-binding transcriptional activator [ 0.862 0.968 0.754 1e-134
319718191323 GAGA-binding transcriptional activator [ 0.901 0.996 0.723 1e-131
>gi|255538854|ref|XP_002510492.1| conserved hypothetical protein [Ricinus communis] gi|223551193|gb|EEF52679.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 273/339 (80%), Positives = 300/339 (88%), Gaps = 4/339 (1%)

Query: 23  MDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAI 82
           MDDGGHRENGRHKADQYK AQGQWLM  QPSMKQIM IMAERDAA+ ERN+A SEKKAAI
Sbjct: 1   MDDGGHRENGRHKADQYKTAQGQWLMQAQPSMKQIMAIMAERDAAIHERNMAISEKKAAI 60

Query: 83  AERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSL-GGNM-SSCPPGCQISRGVKHM 140
           AERDMAFLQRD+AIAERNNAI+ERDNAIA+LQYRENS+  GNM SSCPPGCQISRGVKHM
Sbjct: 61  AERDMAFLQRDSAIAERNNAIMERDNAIATLQYRENSIPSGNMPSSCPPGCQISRGVKHM 120

Query: 141 HHPQQHVHQLHHVSEAAYS-REMHTGDALPVSPGASEAAKPRRYKRAKEPKVLSPNKKTA 199
           HHPQQH H + H SEA+YS REM T D LP+SP  SEAAKPRR KR+K+ K+   NKKT+
Sbjct: 121 HHPQQHAHHMPHSSEASYSTREMQTSDTLPMSPVGSEAAKPRRVKRSKDAKMAPSNKKTS 180

Query: 200 KSPRKVKRENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQ-STASKSDWKGQVLGLNQVT 258
           KSPRK+KRE+EDLNKV+FGK  EWK+ QD+ GG DD+NKQ   ASKSDWKG  LGLNQ+ 
Sbjct: 181 KSPRKIKRESEDLNKVMFGKSHEWKNGQDMGGGADDLNKQLVVASKSDWKGHDLGLNQIA 240

Query: 259 FDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSG 318
           FDESTMP P CSCTGV RQCYKWGNGGWQS+CCTT+LSM+PLPAVPNKRHARVGGRKMSG
Sbjct: 241 FDESTMPAPVCSCTGVFRQCYKWGNGGWQSSCCTTTLSMHPLPAVPNKRHARVGGRKMSG 300

Query: 319 SAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK 357
           SAF+KLL+RLAAEG+DLS PVDLK+HWAKHGTNRYITIK
Sbjct: 301 SAFSKLLSRLAAEGYDLSSPVDLKEHWAKHGTNRYITIK 339




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|363807974|ref|NP_001242458.1| uncharacterized protein LOC100783502 [Glycine max] gi|255635590|gb|ACU18145.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|225458065|ref|XP_002278539.1| PREDICTED: protein BASIC PENTACYSTEINE6 [Vitis vinifera] gi|183604863|gb|ACC64529.1| GAGA-binding transcriptional activator BBR/BPC6-like [Vitis vinifera] gi|183604865|gb|ACC64530.1| GAGA-binding transcriptional activator BBR/BPC6-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|363808282|ref|NP_001242497.1| uncharacterized protein LOC100809369 [Glycine max] gi|255635231|gb|ACU17970.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357438223|ref|XP_003589387.1| GAGA-binding transcriptional activator [Medicago truncatula] gi|355478435|gb|AES59638.1| GAGA-binding transcriptional activator [Medicago truncatula] Back     alignment and taxonomy information
>gi|224082960|ref|XP_002306908.1| predicted protein [Populus trichocarpa] gi|222856357|gb|EEE93904.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449469901|ref|XP_004152657.1| PREDICTED: protein BASIC PENTACYSTEINE6-like [Cucumis sativus] gi|449517054|ref|XP_004165561.1| PREDICTED: protein BASIC PENTACYSTEINE6-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224066007|ref|XP_002301996.1| predicted protein [Populus trichocarpa] gi|222843722|gb|EEE81269.1| predicted protein [Populus trichocarpa] gi|319718087|gb|ADV59345.1| GAGA-binding transcriptional activator [Populus trichocarpa] gi|319718089|gb|ADV59346.1| GAGA-binding transcriptional activator [Populus trichocarpa] Back     alignment and taxonomy information
>gi|118723368|gb|ABL10371.1| GAGA-binding transcriptional activator [Medicago truncatula] gi|118723370|gb|ABL10372.1| GAGA-binding transcriptional activator [Medicago truncatula] Back     alignment and taxonomy information
>gi|319718191|gb|ADV59397.1| GAGA-binding transcriptional activator [Solanum tuberosum] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query357
TAIR|locus:2162291342 BPC6 "basic pentacysteine 6" [ 0.904 0.944 0.666 1.5e-115
TAIR|locus:2053086296 BPC4 "basic pentacysteine 4" [ 0.347 0.418 0.695 4.1e-73
UNIPROTKB|Q8GUC3350 BBR "Protein Barley B recombin 0.462 0.471 0.452 1.8e-41
TAIR|locus:2823945279 BPC2 "basic pentacysteine 2" [ 0.383 0.491 0.474 1e-36
TAIR|locus:2045842283 BPC1 "basic pentacysteine1" [A 0.375 0.473 0.477 9.5e-34
TAIR|locus:2162291 BPC6 "basic pentacysteine 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1139 (406.0 bits), Expect = 1.5e-115, P = 1.5e-115
 Identities = 236/354 (66%), Positives = 270/354 (76%)

Query:    23 MDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAI 82
             MDDGGHRENGRHKA    A QGQWLM HQPSMKQ+M+I+AERDAA+QERNLA SEKKAA+
Sbjct:     1 MDDGGHRENGRHKA----AVQGQWLMQHQPSMKQVMSIIAERDAAIQERNLAISEKKAAV 56

Query:    83 AERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSL----GGNMSSCPPGCQISRGXX 138
             AERDMAFLQRDTAIAERNNAI+ERD+A+ +LQYRENS+      NMS+CPPGCQISRG  
Sbjct:    57 AERDMAFLQRDTAIAERNNAIMERDSALTALQYRENSMVTAPAANMSACPPGCQISRGVK 116

Query:   139 XXXXXXXXXXXXXXX----SEAAY-SREMHTGDALPVSPGAS---EAAKPRRYKRAKEPK 190
                                +E AY +REM   D LP SP A    E+AKP+R KR   PK
Sbjct:   117 HLHHPHMHHHHQQHHIPQLTENAYETREMEPNDGLPTSPPAGSTLESAKPKRGKRVN-PK 175

Query:   191 VLSP---NKKTAKSPRKVKRENED-LNKVVFGKPSEWKSVQDLDGGDDDVNKQST-ASKS 245
               +    NK+  K+ RKVK+E+ED LNK++F K +        D  D+D +K     SKS
Sbjct:   176 ATTQTAANKRGPKNQRKVKKESEDDLNKIMFVKTTH-------DYTDEDSSKHILIGSKS 228

Query:   246 DWKGQ-VLGLNQVTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVP 304
             DWK Q ++GLNQV +DE+TMPPP CSCTGVLRQCYKWGNGGWQS+CCTT+LSMYPLPA+P
Sbjct:   229 DWKSQEMVGLNQVVYDETTMPPPVCSCTGVLRQCYKWGNGGWQSSCCTTTLSMYPLPALP 288

Query:   305 NKRHARVGGRKMSGSAFNKLLTRLAAEGH-DLSHPVDLKDHWAKHGTNRYITIK 357
             NKRHARVGGRKMSGSAFNKLL+RLAAEGH DLS+PVDLKDHWAKHGTNRYITIK
Sbjct:   289 NKRHARVGGRKMSGSAFNKLLSRLAAEGHHDLSNPVDLKDHWAKHGTNRYITIK 342




GO:0005634 "nucleus" evidence=ISM;IDA
GO:0003677 "DNA binding" evidence=IPI
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IPI
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IPI
GO:0005515 "protein binding" evidence=IPI
GO:0005730 "nucleolus" evidence=IDA
GO:0042803 "protein homodimerization activity" evidence=IDA
GO:0009723 "response to ethylene stimulus" evidence=IGI
TAIR|locus:2053086 BPC4 "basic pentacysteine 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q8GUC3 BBR "Protein Barley B recombinant" [Hordeum vulgare (taxid:4513)] Back     alignment and assigned GO terms
TAIR|locus:2823945 BPC2 "basic pentacysteine 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2045842 BPC1 "basic pentacysteine1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q5VSA8BBRD_ORYSJNo assigned EC number0.53600.89350.9637yesno
Q8L999BPC6_ARATHNo assigned EC number0.68920.90470.9444yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pm.C_LG_V0621
hypothetical protein (335 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query357
pfam06217301 pfam06217, GAGA_bind, GAGA binding protein-like fa 1e-132
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family Back     alignment and domain information
 Score =  380 bits (977), Expect = e-132
 Identities = 168/341 (49%), Positives = 199/341 (58%), Gaps = 46/341 (13%)

Query: 23  MDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAI 82
           MDD GH  N R+    ++ AQ      +     Q+M+ +AERDA  +ERNLA SEKKAA 
Sbjct: 1   MDDDGHL-NIRNWGYYFEPAQAPSFKSNLG--LQLMSSIAERDAKPRERNLALSEKKAAY 57

Query: 83  AERDMAFLQRDTAIAERNNAILERDNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHH 142
           AERDMAFLQRD  IAERNNA++ERDN  A+LQ  ENSL       PPGC I  G K +HH
Sbjct: 58  AERDMAFLQRDAWIAERNNALMERDNKFAALQPVENSL-------PPGCGILPGTKGIHH 110

Query: 143 PQQHVHQLHHVSEAAYSREMHTGDALPVSPGASEAAKPR------RYKRAKEPKVLSPNK 196
            Q   H          +REMH  +  P+S    EA + +        K  K PK   P K
Sbjct: 111 LQML-HHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169

Query: 197 KTAKSPRKVKRENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQSTASKSDWKGQVLGLNQ 256
           K + S R VK    D                               SK DWK Q L +N 
Sbjct: 170 KGSVSNRSVKMPGIDPR-----------------------------SKPDWKSQDLVING 200

Query: 257 VTFDESTMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKM 316
           V+ D S MP P CSCTG  RQCYKWGNGGWQSACCTT++SMYPLP    +R AR+ GRKM
Sbjct: 201 VSMDISGMPVPVCSCTGAPRQCYKWGNGGWQSACCTTTISMYPLPMSTKRRGARIAGRKM 260

Query: 317 SGSAFNKLLTRLAAEGHDLSHPVDLKDHWAKHGTNRYITIK 357
           SG AF KLL RLAAEG+DLS+P+DLKDHWAKHGTN+++TI+
Sbjct: 261 SGGAFKKLLERLAAEGYDLSNPIDLKDHWAKHGTNKFVTIR 301


This family includes gbp a protein from Soybean that binds to GAGA element dinucleotide repeat DNA. It seems likely that the this domain mediates DNA binding. This putative domain contains several conserved cysteines and a histidine suggesting this may be a zinc-binding DNA interaction domain. Length = 301

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 357
PF06217301 GAGA_bind: GAGA binding protein-like family; Inter 100.0
PF06217301 GAGA_bind: GAGA binding protein-like family; Inter 99.86
COG4372499 Uncharacterized protein conserved in bacteria with 96.71
COG4372499 Uncharacterized protein conserved in bacteria with 95.17
PF04094828 DUF390: Protein of unknown function (DUF390); Inte 95.11
PF04094828 DUF390: Protein of unknown function (DUF390); Inte 94.45
TIGR03319514 YmdA_YtgF conserved hypothetical protein YmdA/YtgF 85.53
PRK12704520 phosphodiesterase; Provisional 85.23
PRK12704 520 phosphodiesterase; Provisional 85.15
TIGR03319 514 YmdA_YtgF conserved hypothetical protein YmdA/YtgF 82.95
>PF06217 GAGA_bind: GAGA binding protein-like family; InterPro: IPR010409 This family includes gbp a protein from Soybean that binds to GAGA element dinucleotide repeat DNA [] Back     alignment and domain information
Probab=100.00  E-value=1.4e-119  Score=860.26  Aligned_cols=300  Identities=60%  Similarity=0.942  Sum_probs=253.6

Q ss_pred             ccCCCcccCCccccccccccccccccCCCCcHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 018368           23 MDDGGHRENGRHKADQYKAAQGQWLMHHQPSMKQIMTIMAERDAALQERNLATSEKKAAIAERDMAFLQRDTAIAERNNA  102 (357)
Q Consensus        23 mdd~g~~engr~k~d~yk~~~~~w~m~~~~~~~~~~a~~~ERd~AI~ERd~AlaERd~Ai~qRD~AiaqRD~AI~ERD~A  102 (357)
                      |||+|++|  ||+. ||+-.  .++..|  ..++||++|.|||+||.||+.|++|+++|+++||+|++|||+||+|||+|
T Consensus         1 MDddg~~~--R~wg-~ye~~--~s~K~h--LGLqlMs~maERDaai~er~~al~ekk~a~aeRDma~~qRD~aiaern~a   73 (301)
T PF06217_consen    1 MDDDGSLE--RNWG-YYEPQ--NSMKGH--LGLQLMSIMAERDAAIRERNLALSEKKAAFAERDMAYLQRDAAIAERNNA   73 (301)
T ss_pred             CCCccccc--cCcc-ccccc--cccccc--ccceeeccccccCcccccccchhccccccccchhhhhhhhhhhhccchhh
Confidence            99999999  9999 99932  444443  36789999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHhhhhhccCCCCCCCCCCCCccccCccCCCCCcccccccccchhhhh-hccccCCCCCCCCCCCcCcCCcc
Q 018368          103 ILERDNAIASLQYRENSLGGNMSSCPPGCQISRGVKHMHHPQQHVHQLHHVSEAAY-SREMHTGDALPVSPGASEAAKPR  181 (357)
Q Consensus       103 I~eRD~aiaaLq~reNsmng~~~~cpp~~g~krg~K~~hH~~~h~~~~~hisea~~-~~~~~~~~asPis~~~~e~~k~K  181 (357)
                      |+|||++|++|+|++++++   ..||++||+.++.+++||++++  +++++.+..| ++|+++.++.||+.++++.++++
T Consensus        74 i~ERd~a~a~l~~~~~~~n---~~~p~~~~~~~~tk~~~~~q~~--~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~  148 (301)
T PF06217_consen   74 IMERDNAIAALQPRNNNGN---HHCPPGCGILPGTKHIHHMQQH--HHPQLQPPPYSTREMHNIDAPPISTAPPESKKSK  148 (301)
T ss_pred             cccchhhhhhccccCCCcc---ccCCCCCcccCCCccCCCcccc--cccccCCCCCCccccccccccccccCCcccccCC
Confidence            9999999999999999877   5689999999999999987663  2367788888 79999999999988766666665


Q ss_pred             cccccCCCCCCCCCCcCCCCcccccccccccccccccCCCcccccccCCCCCCcccccccccccccCCcccccceeeecC
Q 018368          182 RYKRAKEPKVLSPNKKTAKSPRKVKRENEDLNKVVFGKPSEWKSVQDLDGGDDDVNKQSTASKSDWKGQVLGLNQVTFDE  261 (357)
Q Consensus       182 R~Kr~KesK~~~~~kK~kK~~ek~kk~geDlNr~l~~~k~EWk~~QDl~~G~n~v~~~~~r~~~~~~~~~~~~n~~~~d~  261 (357)
                      +.|+.+..+..+.++ .    ++.++.++++++.          ++ .. +.     + ++.+.+||++||+||+|+||+
T Consensus       149 ~~k~~~~~~~~k~~k-~----~k~kk~~~~~~~~----------~~-~~-~~-----~-~~~k~~~k~~dl~in~i~~D~  205 (301)
T PF06217_consen  149 PPKKRKRPKSPKSPK-P----KKPKKKPSVPNRN----------VN-PP-GS-----D-QRSKPEKKNQDLVINGINFDE  205 (301)
T ss_pred             CCCCCCCCCCCCCCC-C----CCCCCCCCCchhh----------cc-cc-cc-----c-cccccccccccccccceeecc
Confidence            555444333222111 1    1233333333221          00 00 11     0 456899999999999999999


Q ss_pred             CCCCCCcccccCCCccceeccCCcccccccccccccccCCCCCCCCCccccccccchhHHHHHHHHHhhcccCCCCCccc
Q 018368          262 STMPPPACSCTGVLRQCYKWGNGGWQSACCTTSLSMYPLPAVPNKRHARVGGRKMSGSAFNKLLTRLAAEGHDLSHPVDL  341 (357)
Q Consensus       262 s~~p~p~csctg~~~~cy~wg~ggwqs~cctt~~s~yplp~~~~~r~~r~~grkms~~af~kll~~l~~eg~~~s~p~dl  341 (357)
                      |+||+|||||||+|||||||||||||||||||+||||||||+||||||||||||||+|||+|||+|||+||||||+||||
T Consensus       206 s~mP~PvCSCTG~pqqCYrWG~GGWQSaCCTT~iSmYPLPm~~nkRgaRi~GRKMS~gAF~KlL~rLAaEGydls~PiDL  285 (301)
T PF06217_consen  206 STMPIPVCSCTGVPQQCYRWGNGGWQSACCTTTISMYPLPMSPNKRGARIGGRKMSGGAFKKLLERLAAEGYDLSNPIDL  285 (301)
T ss_pred             CCCCCcccccCCCCccccccCCCccccccccccccccccCCCCCCCccccccccccHHHHHHHHHHHHhcccccCCccch
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhhhhhcCCcceeecC
Q 018368          342 KDHWAKHGTNRYITIK  357 (357)
Q Consensus       342 k~~wa~hgtn~~~ti~  357 (357)
                      |+||||||||||||||
T Consensus       286 K~hWAKHGTNryvTIk  301 (301)
T PF06217_consen  286 KDHWAKHGTNRYVTIK  301 (301)
T ss_pred             HHHHHhcCCcceEecC
Confidence            9999999999999998



It seems likely that the region which defines this family mediates DNA binding. This putative domain contains several conserved cysteines and a histidine suggesting this may be a zinc-binding DNA interaction domain.

>PF06217 GAGA_bind: GAGA binding protein-like family; InterPro: IPR010409 This family includes gbp a protein from Soybean that binds to GAGA element dinucleotide repeat DNA [] Back     alignment and domain information
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown] Back     alignment and domain information
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown] Back     alignment and domain information
>PF04094 DUF390: Protein of unknown function (DUF390); InterPro: IPR007228 This domain is found in a family of long proteins that are currently found only in rice Back     alignment and domain information
>PF04094 DUF390: Protein of unknown function (DUF390); InterPro: IPR007228 This domain is found in a family of long proteins that are currently found only in rice Back     alignment and domain information
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF Back     alignment and domain information
>PRK12704 phosphodiesterase; Provisional Back     alignment and domain information
>PRK12704 phosphodiesterase; Provisional Back     alignment and domain information
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query357
3nmd_A72 CGMP dependent protein kinase; leucine zipper, coi 90.11
3nmd_A72 CGMP dependent protein kinase; leucine zipper, coi 87.13
>3nmd_A CGMP dependent protein kinase; leucine zipper, coiled-coil, structural genomics, berkeley S genomics center, BSGC, dimerization; HET: MSE; 2.27A {Homo sapiens} Back     alignment and structure
Probab=90.11  E-value=0.73  Score=36.32  Aligned_cols=38  Identities=18%  Similarity=0.275  Sum_probs=18.9

Q ss_pred             HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 018368           68 LQERNLATSEKKAAIAERDMAFLQRDTAIAERNNAILE  105 (357)
Q Consensus        68 I~ERd~AlaERd~Ai~qRD~AiaqRD~AI~ERD~AI~e  105 (357)
                      |.|-..++.+|++.|.+||.-|.+.+..|.|+|..|.+
T Consensus        21 i~eLq~~L~~K~eELr~kd~~I~eLEk~L~ekd~eI~~   58 (72)
T 3nmd_A           21 LRDLQYALQEKIEELRQRDALIDELELELDQKDELIQM   58 (72)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34444444444444555555555555555555555554



>3nmd_A CGMP dependent protein kinase; leucine zipper, coiled-coil, structural genomics, berkeley S genomics center, BSGC, dimerization; HET: MSE; 2.27A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00