Citrus Sinensis ID: 021481


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310--
MYSAIHSLPLDGGHPDFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCVAESQDTGSSTTSSTRMVAQDPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLENIKMPSISELAAALESKNASTIPARIGDCSVESCLTSTSSPVSPMGLGSQAAAMKKRPRPLFGNGESLPLEGNMRQEVEWVMPHIG
ccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHcccccccHHHHHHHHccccccHHHHHHHHHHHHHccccccccccccccHHHHHHHccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccc
ccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccHHHHHHcccccEEEEEHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccHHHccccccccccccccccccccccccccHHHHccHcccccccccccccccccccccccccccccccccccccHHHccccccccccccccEccccccccccccccccc
mysaihslpldgghpdfqggpldgtnlpgdaclvlttdpkprlrwTAELHDRFVDAVtqlggpdkatpktimrtmgvkGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCVaesqdtgssttsstrmvaqdpndgyqVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVSndcqgmvplenikmpSISELAAALEsknastiparigdcsvescltstsspvspmglgsqaaamkkrprplfgngeslplegnmrQEVEWVMPHIG
MYSAIHSLPLDGGHPDFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDavtqlggpdkatpktIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCVaesqdtgssttsstrmvaqdpnDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLENIKMPSISELAAALESKNASTIPARIGDCSVESCLTSTSSPVSPMGLGSQAAAMKKRPRPLFgngeslplegnmrqevewvmphig
MYSAIHSLPLDGGHPDFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCVAESQDtgssttsstRMVAQDPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVaagleaareelselaIKVSNDCQGMVPLENIKMPSISELAAALESKNASTIPARIGDCSVESCLTSTSSPVSPMGLGSQAAAMKKRPRPLFGNGESLPLEGNMRQEVEWVMPHIG
*************************NLPGDACLVLTTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACK********DYVALEVSCV*****************************ALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLENIK***********************************************************************************
********************************************WTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRL***************************************************************************QLRIEAQGKYLQSI**************************************************************************************************************************WVMP***
MYSAIHSLPLDGGHPDFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCV*****************AQDPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLENIKMPSISELAAALESKNASTIPARIGDCSVESC*********************KRPRPLFGNGESLPLEGNMRQEVEWVMPHIG
****************************************PRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQA*******************************************GYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALN************************************************************************************************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYSAIHSLPLDGGHPDFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCVAESQDTGSSTTSSTRMVAQDPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLENIKMPSISELAAALESKNASTIPARIGDCSVESCLTSTSSPVSPMGLGSQAAAMKKRPRPLFGNGESLPLEGNMRQEVEWVMPHIG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query312 2.2.26 [Sep-21-2011]
Q9SAK5358 Myb family transcription no no 0.474 0.413 0.650 1e-53
Q8GUN5413 Protein PHR1-LIKE 1 OS=Ar no no 0.451 0.341 0.461 1e-31
Q700D9255 Putative Myb family trans no no 0.176 0.215 0.654 7e-14
Q93WJ9403 Transcription repressor K no no 0.192 0.148 0.583 2e-13
Q941I2322 Probable transcription fa no no 0.221 0.214 0.521 2e-13
Q9C616388 Probable transcription fa no no 0.221 0.177 0.521 3e-13
Q0J235532 Probable transcription fa no no 0.221 0.129 0.521 3e-13
Q9FJV5276 Probable transcription fa no no 0.179 0.202 0.589 1e-12
Q9ZWJ9 664 Two-component response re no no 0.173 0.081 0.581 2e-10
Q9FGT7 635 Two-component response re no no 0.182 0.089 0.568 2e-10
>sp|Q9SAK5|APL_ARATH Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=2 SV=2 Back     alignment and function desciption
 Score =  210 bits (534), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 106/163 (65%), Positives = 124/163 (76%), Gaps = 15/163 (9%)

Query: 29  GDACLVLTTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSH 88
           GD+ LVLTTDPKPRLRWT ELH+RFVDAV QLGGPDKATPKTIMR MGVKGLTLYHLKSH
Sbjct: 23  GDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKSH 82

Query: 89  LQKYRLGKQACKETTENSKDDYVALEVSCVAESQDTGSSTTSSTRMVAQDPNDGYQVTEA 148
           LQK+RLGKQ  KE  ++S  +         A + D   +  SS+ M++++ N+       
Sbjct: 83  LQKFRLGKQPHKEYGDHSTKE------GSRASAMDIQRNVASSSGMMSRNMNE------- 129

Query: 149 LRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKAL 191
             +QMEVQRRLHEQLEVQR LQLRIEAQGKY+QSILE+AC+ L
Sbjct: 130 --MQMEVQRRLHEQLEVQRHLQLRIEAQGKYMQSILERACQTL 170




Transcription factor required for phloem identity. Has a dual role both in promoting phloem differentiation and in repressing xylem differentiation during vascular development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8GUN5|PHL1_ARATH Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1 Back     alignment and function description
>sp|Q700D9|MYBF_ARATH Putative Myb family transcription factor At1g14600 OS=Arabidopsis thaliana GN=At1g14600 PE=2 SV=2 Back     alignment and function description
>sp|Q93WJ9|KAN1_ARATH Transcription repressor KAN1 OS=Arabidopsis thaliana GN=KAN1 PE=1 SV=1 Back     alignment and function description
>sp|Q941I2|KAN3_ARATH Probable transcription factor KAN3 OS=Arabidopsis thaliana GN=KAN3 PE=2 SV=1 Back     alignment and function description
>sp|Q9C616|KAN2_ARATH Probable transcription factor KAN2 OS=Arabidopsis thaliana GN=KAN2 PE=2 SV=1 Back     alignment and function description
>sp|Q0J235|ROLL9_ORYSJ Probable transcription factor RL9 OS=Oryza sativa subsp. japonica GN=RL9 PE=2 SV=2 Back     alignment and function description
>sp|Q9FJV5|KAN4_ARATH Probable transcription factor KAN4 OS=Arabidopsis thaliana GN=KAN4 PE=1 SV=1 Back     alignment and function description
>sp|Q9ZWJ9|ARR2_ARATH Two-component response regulator ARR2 OS=Arabidopsis thaliana GN=ARR2 PE=1 SV=1 Back     alignment and function description
>sp|Q9FGT7|ARR18_ARATH Two-component response regulator ARR18 OS=Arabidopsis thaliana GN=ARR18 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query312
255554128303 transcription factor, putative [Ricinus 0.971 1.0 0.846 1e-149
118486035309 unknown [Populus trichocarpa] 0.980 0.990 0.826 1e-147
327412613307 putative MYB transcription factor [Rosa 0.971 0.986 0.821 1e-140
224060385285 predicted protein [Populus trichocarpa] 0.913 1.0 0.833 1e-139
302398985307 MYBR domain class transcription factor [ 0.983 1.0 0.801 1e-139
359477885307 PREDICTED: uncharacterized protein LOC10 0.977 0.993 0.812 1e-137
224140815280 predicted protein [Populus trichocarpa] 0.897 1.0 0.838 1e-134
449487536315 PREDICTED: uncharacterized protein LOC10 0.974 0.965 0.753 1e-124
449432414315 PREDICTED: LOW QUALITY PROTEIN: uncharac 0.974 0.965 0.75 1e-124
18404044295 myb family transcription factor [Arabido 0.907 0.959 0.696 1e-111
>gi|255554128|ref|XP_002518104.1| transcription factor, putative [Ricinus communis] gi|223542700|gb|EEF44237.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 264/312 (84%), Positives = 289/312 (92%), Gaps = 9/312 (2%)

Query: 1   MYSAIHSLPLDGGHPDFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDAVTQL 60
           MYSAIHSLPLDG H DFQG  LDGTNLPGDACLVLTTDPKPRLRWTAELH+RFVDAVTQL
Sbjct: 1   MYSAIHSLPLDG-HGDFQGS-LDGTNLPGDACLVLTTDPKPRLRWTAELHERFVDAVTQL 58

Query: 61  GGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCVAE 120
           GGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLG+Q+CKE+ ENSKD       + VAE
Sbjct: 59  GGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGRQSCKESNENSKD-------ASVAE 111

Query: 121 SQDTGSSTTSSTRMVAQDPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYL 180
           SQDTGSST++S+RM+AQD NDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYL
Sbjct: 112 SQDTGSSTSTSSRMIAQDVNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYL 171

Query: 181 QSILEKACKALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLENIKMPSISELAAAL 240
           QSILEKACKALNDQA V+AGLEAAREELSELAIKVSN+CQG+VP +N+KMPS+SELA AL
Sbjct: 172 QSILEKACKALNDQAAVSAGLEAAREELSELAIKVSNECQGIVPADNMKMPSLSELAVAL 231

Query: 241 ESKNASTIPARIGDCSVESCLTSTSSPVSPMGLGSQAAAMKKRPRPLFGNGESLPLEGNM 300
           ESK+ S +PARIGDCSVESCLTST SPVSPMG+GS  A++KKRPRP+FGNG+SLPLEG+M
Sbjct: 232 ESKSTSNLPARIGDCSVESCLTSTGSPVSPMGVGSHTASIKKRPRPIFGNGDSLPLEGSM 291

Query: 301 RQEVEWVMPHIG 312
           RQEVEW+M +IG
Sbjct: 292 RQEVEWMMGNIG 303




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|118486035|gb|ABK94861.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|327412613|emb|CCA29095.1| putative MYB transcription factor [Rosa rugosa] gi|327412625|emb|CCA29101.1| putative MYB transcription factor [Rosa rugosa] Back     alignment and taxonomy information
>gi|224060385|ref|XP_002300173.1| predicted protein [Populus trichocarpa] gi|222847431|gb|EEE84978.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|302398985|gb|ADL36787.1| MYBR domain class transcription factor [Malus x domestica] Back     alignment and taxonomy information
>gi|359477885|ref|XP_002268698.2| PREDICTED: uncharacterized protein LOC100250267 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224140815|ref|XP_002323774.1| predicted protein [Populus trichocarpa] gi|222866776|gb|EEF03907.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449487536|ref|XP_004157675.1| PREDICTED: uncharacterized protein LOC101223852 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449432414|ref|XP_004133994.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101213244 [Cucumis sativus] Back     alignment and taxonomy information
>gi|18404044|ref|NP_566744.1| myb family transcription factor [Arabidopsis thaliana] gi|15215654|gb|AAK91372.1| AT3g24120/MUJ8_3 [Arabidopsis thaliana] gi|20334892|gb|AAM16202.1| AT3g24120/MUJ8_3 [Arabidopsis thaliana] gi|21594046|gb|AAM65964.1| transfactor, putative [Arabidopsis thaliana] gi|332643338|gb|AEE76859.1| myb family transcription factor [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query312
TAIR|locus:2093726298 AT3G24120 [Arabidopsis thalian 0.858 0.899 0.654 1.8e-87
TAIR|locus:2119425295 UNE16 "unfertilized embryo sac 0.894 0.945 0.618 1.3e-86
TAIR|locus:2095933394 MYR2 [Arabidopsis thaliana (ta 0.727 0.576 0.464 6.9e-47
TAIR|locus:2206415358 APL "ALTERED PHLOEM DEVELOPMEN 0.474 0.413 0.638 1.1e-46
TAIR|locus:2172394402 MYR1 "myb-related protein 1" [ 0.746 0.579 0.458 7.1e-45
TAIR|locus:2087740235 AT3G12730 [Arabidopsis thalian 0.467 0.621 0.556 1.8e-39
TAIR|locus:2026739337 AT1G69580 [Arabidopsis thalian 0.733 0.679 0.430 2.9e-39
TAIR|locus:2157146264 AT5G45580 [Arabidopsis thalian 0.5 0.590 0.543 1.6e-38
TAIR|locus:2089260449 AT3G13040 [Arabidopsis thalian 0.435 0.302 0.490 2.1e-32
TAIR|locus:2148720413 PHL1 "AT5G29000" [Arabidopsis 0.451 0.341 0.448 1.6e-29
TAIR|locus:2093726 AT3G24120 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
 Identities = 193/295 (65%), Positives = 212/295 (71%)

Query:     1 MYSAIHSLPLDGGHP--DFQGGPLDGTNLPGDACLVLTTDPKPRLRWTAELHDRFVDAVT 58
             MYSAI SLPLDGGH   D+ G PLDGTNLPGDACLVLTTDPKPRLRWT ELH+RFVDAVT
Sbjct:     1 MYSAIRSLPLDGGHVGGDYHG-PLDGTNLPGDACLVLTTDPKPRLRWTTELHERFVDAVT 59

Query:    59 QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDDYVALEVSCV 118
             QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQK+RLG+QA KE+TENSKD       SCV
Sbjct:    60 QLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGRQAGKESTENSKD------ASCV 113

Query:   119 AESQDXXXXXXXXXRMVAQDPNDGYQVTEALRVQMEVQRRLHEQLE---VQRRLQLRIEA 175
              ESQD         RM  Q+ N+GYQVTEALR QMEVQRRLH+QLE   VQRRLQLRIEA
Sbjct:   114 GESQDTGSSSTSSMRMAQQEQNEGYQVTEALRAQMEVQRRLHDQLEYGQVQRRLQLRIEA 173

Query:   176 QGKYLQSILEKACKALNDQAIVXXXXXXXXXXXXXXXIKVSNDCQGM-VPL----ENIKM 230
             QGKYLQSILEKACKA ++QA                 IKVSN  QG  VP     + + M
Sbjct:   174 QGKYLQSILEKACKAFDEQAATFAGLEAAREELSELAIKVSNSSQGTSVPYFDATKMMMM 233

Query:   231 PSISELAAALESKNASTIPARIGDCSVESCLTSTSSPVSPMGLGSQAAAMKKRPR 285
             PS+SELA A+++KN  T      +CSVES LTS +      G    AA+MKKR R
Sbjct:   234 PSLSELAVAIDNKNNITT-----NCSVESSLTSITH-----GSSISAASMKKRQR 278




GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
GO:0000394 "RNA splicing, via endonucleolytic cleavage and ligation" evidence=RCA
GO:0009086 "methionine biosynthetic process" evidence=RCA
GO:0010048 "vernalization response" evidence=RCA
GO:0048573 "photoperiodism, flowering" evidence=RCA
TAIR|locus:2119425 UNE16 "unfertilized embryo sac 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2095933 MYR2 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2206415 APL "ALTERED PHLOEM DEVELOPMENT" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2172394 MYR1 "myb-related protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2087740 AT3G12730 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026739 AT1G69580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2157146 AT5G45580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089260 AT3G13040 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2148720 PHL1 "AT5G29000" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query312
pfam1437951 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, 4e-26
TIGR0155757 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domai 2e-24
PLN03162526 PLN03162, PLN03162, golden-2 like transcription fa 2e-09
pfam0024947 pfam00249, Myb_DNA-binding, Myb-like DNA-binding d 5e-08
>gnl|CDD|206547 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
 Score = 97.3 bits (243), Expect = 4e-26
 Identities = 42/51 (82%), Positives = 46/51 (90%)

Query: 142 GYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALN 192
           G Q+TEALR+QMEVQRRLHEQLEVQR LQ+RIEAQGKYLQ ILEKA K L+
Sbjct: 1   GMQITEALRLQMEVQRRLHEQLEVQRHLQIRIEAQGKYLQEILEKAQKTLS 51


This family is found towards the C-terminus of Myb-CC type transcription factors, and carries a highly conserved LHEQLE sequence motif. Length = 51

>gnl|CDD|130620 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
>gnl|CDD|178707 PLN03162, PLN03162, golden-2 like transcription factor; Provisional Back     alignment and domain information
>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 312
PF1437951 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot 99.93
PLN03162526 golden-2 like transcription factor; Provisional 99.89
TIGR0155757 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF c 99.86
PF0024948 Myb_DNA-binding: Myb-like DNA-binding domain; Inte 97.42
smart0042668 TEA TEA domain. 90.48
PF1437951 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot 87.29
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
Probab=99.93  E-value=2.5e-26  Score=168.64  Aligned_cols=51  Identities=88%  Similarity=1.125  Sum_probs=48.9

Q ss_pred             CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh
Q 021481          142 GYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALN  192 (312)
Q Consensus       142 ~~qi~EALr~QmEVQrrLhEQLEVQR~LQlRIEAQGKYLQsiLekaqe~La  192 (312)
                      +++|+||||+||||||||||||||||+||+|||||||||++|||||+++++
T Consensus         1 g~~i~EALr~QmEvQrrLhEQLEvQr~Lqlrieaqgkyl~~ilek~~~~~s   51 (51)
T PF14379_consen    1 GMQITEALRMQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQKALS   51 (51)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC
Confidence            578999999999999999999999999999999999999999999999864



>PLN03162 golden-2 like transcription factor; Provisional Back     alignment and domain information
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins Back     alignment and domain information
>smart00426 TEA TEA domain Back     alignment and domain information
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query312
1irz_A64 Solution Structure Of Arr10-B Belonging To The Garp 1e-08
>pdb|1IRZ|A Chain A, Solution Structure Of Arr10-B Belonging To The Garp Family Of Plant Myb-Related Dna Binding Motifs Of The Arabidopsis Response Regulators Length = 64 Back     alignment and structure

Iteration: 1

Score = 57.0 bits (136), Expect = 1e-08, Method: Composition-based stats. Identities = 28/60 (46%), Positives = 39/60 (65%), Gaps = 1/60 (1%) Query: 36 TTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLG 95 T KPR+ WT ELH++F+ AV L G ++A PK I+ M V LT ++ SHLQK+R+ Sbjct: 1 TAQKKPRVLWTHELHNKFLAAVDHL-GVERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query312
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 1e-26
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Length = 64 Back     alignment and structure
 Score = 98.5 bits (245), Expect = 1e-26
 Identities = 28/62 (45%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 36 TTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLG 95
          T   KPR+ WT ELH++F+ AV  LG  ++A PK I+  M V  LT  ++ SHLQK+R+ 
Sbjct: 1  TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 96 KQ 97
           +
Sbjct: 60 LK 61


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query312
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 99.96
2yum_A75 ZZZ3 protein, zinc finger ZZ-type-containing prote 90.34
2yus_A79 SWI/SNF-related matrix-associated actin- dependent 87.93
2cu7_A72 KIAA1915 protein; nuclear protein, SANT domain, DN 87.26
1x41_A60 Transcriptional adaptor 2-like, isoform B; transcr 87.14
2xag_B482 REST corepressor 1; amine oxidase, chromatin regul 86.22
2iw5_B235 Protein corest, REST corepressor 1; oxidoreductase 83.38
2hzd_A82 Transcriptional enhancer factor TEF-1; DNA-binding 83.07
2cqq_A72 RSGI RUH-037, DNAJ homolog subfamily C member 1; m 83.07
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Back     alignment and structure
Probab=99.96  E-value=5.4e-30  Score=194.30  Aligned_cols=62  Identities=45%  Similarity=0.763  Sum_probs=58.9

Q ss_pred             cCCCCCCcccCHHHHHHHHHHHHhhCCCCCCChhHHHhhhCCCCCCHHHHHHhhhhhhccccc
Q 021481           36 TTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQA   98 (312)
Q Consensus        36 s~~~K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK~IL~lM~v~gLT~~hVkSHLQKYRl~~~~   98 (312)
                      ++++|+|++||+|||++||+||++|| .++||||.||++|+|+|||++||||||||||+..++
T Consensus         1 ~~~~k~r~~WT~elH~~Fv~Av~~LG-~~~AtPk~Il~~M~v~gLT~~~VkSHLQKYR~~l~r   62 (64)
T 1irz_A            1 TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVALKK   62 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHC-TTTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHHS
T ss_pred             CCCCCCCCcCCHHHHHHHHHHHHHhC-CCCCCcHHHHHHcCCCCCCHHHHHHHHHHHHHHHHc
Confidence            46799999999999999999999999 799999999999999999999999999999998764



>2yum_A ZZZ3 protein, zinc finger ZZ-type-containing protein 3; transcription, structural genomics, NPPSFA; NMR {Homo sapiens} Back     alignment and structure
>2yus_A SWI/SNF-related matrix-associated actin- dependent regulator of chromatin subfamily...; SWI/SNF complex 155 kDa subunit, BRG1-associated factor 155; NMR {Homo sapiens} Back     alignment and structure
>2cu7_A KIAA1915 protein; nuclear protein, SANT domain, DNA binding, regulation of transcription, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure
>1x41_A Transcriptional adaptor 2-like, isoform B; transcriptional adaptor protein2, transcriptional activation, MYB domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.1 Back     alignment and structure
>2xag_B REST corepressor 1; amine oxidase, chromatin regulator, histone inhibitor binding, methylation, nucleosome core, oxidoreductase; HET: FAD TCF; 3.10A {Homo sapiens} PDB: 2xaf_B* 2xah_B* 2xaj_B* 2xaq_B* 2xas_B* Back     alignment and structure
>2iw5_B Protein corest, REST corepressor 1; oxidoreductase-transcription regulator complex, oxidoreductase/repressor complex, histone demethylase, FAD; HET: FAD; 2.57A {Homo sapiens} SCOP: a.4.1.3 PDB: 2uxn_B* 2uxx_B* 2y48_B* 2v1d_B* 2x0l_B* Back     alignment and structure
>2hzd_A Transcriptional enhancer factor TEF-1; DNA-binding, helix-turn-helix, gene regulation; NMR {Homo sapiens} Back     alignment and structure
>2cqq_A RSGI RUH-037, DNAJ homolog subfamily C member 1; membrane protein, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 312
d1irza_64 a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis th 2e-24
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 64 Back     information, alignment and structure

class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 91.9 bits (228), Expect = 2e-24
 Identities = 28/59 (47%), Positives = 39/59 (66%), Gaps = 1/59 (1%)

Query: 36 TTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRL 94
          T   KPR+ WT ELH++F+ AV  LG  ++A PK I+  M V  LT  ++ SHLQK+R+
Sbjct: 1  TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRV 58


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query312
d1irza_64 Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId 99.93
d2cu7a165 MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 960 91.4
d1x41a147 Transcriptional adaptor 2-like, TADA2L, isoform b 90.14
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.93  E-value=4.3e-27  Score=176.42  Aligned_cols=62  Identities=45%  Similarity=0.755  Sum_probs=58.3

Q ss_pred             cCCCCCCcccCHHHHHHHHHHHHhhCCCCCCChhHHHhhhCCCCCCHHHHHHhhhhhhccccc
Q 021481           36 TTDPKPRLRWTAELHDRFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKYRLGKQA   98 (312)
Q Consensus        36 s~~~K~RlrWT~ELH~rFV~AV~qLGG~~kAtPK~IL~lM~v~gLT~~hVkSHLQKYRl~~~~   98 (312)
                      |.++|+|++||+|||++||+||++||+ +.||||.|+++|+|+|||+.||+|||||||+..++
T Consensus         1 s~~kk~R~~WT~elH~~Fv~Av~~lG~-~~atpk~I~~~m~v~~lT~~qV~SHlQKYrl~l~k   62 (64)
T d1irza_           1 TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVALKK   62 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHCT-TTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHHS
T ss_pred             CCCCCCCCCCCHHHHHHHHHHHHHhCc-cccchHHHHHHcCCCCCCHHHHHHHHHHHHHHHHh
Confidence            467999999999999999999999985 89999999999999999999999999999998654



>d2cu7a1 a.4.1.3 (A:8-72) MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x41a1 a.4.1.1 (A:8-54) Transcriptional adaptor 2-like, TADA2L, isoform b {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure