Citrus Sinensis ID: 020267


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------33
MYQPNSVPSSSLVRSNSLVHGQHLDCGSSQMDPMNGGNSLNNNPSLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKVDKKETGDMLSSLDGSSGMQITEALKLQMEVQKRLHEQLERQLQLRIEAQGKYLKKIIEEQQRLSGVLTEAPGSGGSAPASGDNCQEPDKKTDPATPAPTSESPLQDKAAKEHVPAKSLSLDESFSSQNEPLTPDSGCNVSSPSQSPKGERSMKKQRVDMETAYAKPEMVLTHPILESSISSSYQQPQNVFDPSSRASVGKEDRPEVSGNDL
cccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHcccccccHHHHHHHHccccccHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
cccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccHHHHHHcccccEEEEEHHHHHHHHHHHccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccc
myqpnsvpssslvrsnslvhgqhldcgssqmdpmnggnslnnnpslaskqRLRWTHELHERFVDAVaqlggpdratpkgvlrVMGVQGLTIYHVKSHLQKYRLakylpdsssdgkkvdkketgdmlssldgssgMQITEALKLQMEVQKRLHEQLERQLQLRIEAQGKYLKKIIEEQQRLSgvlteapgsggsapasgdncqepdkktdpatpaptsesplqdkaakehvpakslsldesfssqnepltpdsgcnvsspsqspkgersmkkQRVDMETayakpemvlthpilessisssyqqpqnvfdpssrasvgkedrpevsgndl
myqpnsvpssslVRSNSLVHGQHLDCGSSQMDPMNGGNSLNNNPSLASKQRLRWTHELHERFVDAvaqlggpdratpkGVLRVMGVQGLTIYHVKSHLQKYRLakylpdsssdgkkvdkketgdmlssldgssGMQITEALKLQMEVQKRLHEQLERQLQLRIEAQGKYLKKIIEEQQRLSGVLTEAPGSGGSAPASGDNCQEPDKKTDPATPAPtsesplqdkaAKEHVPAKSLSLDESFSSQnepltpdsgcnvsspsqspkgersmkkqRVDMETAYAKPEMVLTHPILESSISSSYQQPQNvfdpssrasvgkedrpevsgndl
MYQpnsvpssslvrsnslVHGQHLDCGSSQMDpmnggnslnnnpslASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPdsssdgkkvdkkETgdmlssldgssgmQITEALKLQMEVQKrlheqlerqlqlrieAQGKYLKKIIEEQQRLSGVLTEapgsggsapasgDNCQEPDKKTDPATPAPTSESPLQDKAAKEHVPAKSLSLDESFSSQNEPLTPDSGCNVSSPSQSPKGERSMKKQRVDMETAYAKPEMVLTHPILESSISSSYQQPQNVFDPSSRASVGKEDRPEVSGNDL
***************************************************LRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYL***************************************************LQLRIEAQGKYLKKII**********************************************************************************************************************************************************
****************************************************RWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYR*************************************************************E********************************************************************************************************************************************************************
**************SNSLVHGQHLDCGSSQMDPMNGGNSLNNNPSLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLP*************TGDMLSSLDGSSGMQITEALKLQMEVQKRLHEQLERQLQLRIEAQGKYLKKIIEEQQRLSGVLT****************************************************************************************VDMETAYAKPEMVLTHPILESSIS*******************************
**************************************************RLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYL**************************GMQITEALKLQMEVQKRLHEQLERQLQLRIEAQGKYLKKIIEEQQRLS********************************************************************************************************VL*****************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYQPNSVPSSSLVRSNSLVHGQHLDCGSSQMDPMNGGNSLNNNPSLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKVDKKETGDMLSSLDGSSGMQIxxxxxxxxxxxxxxxxxxxxxLQLRIEAQGKYLKKIIEEQQRLSGVLTEAPGSGGSAPASGDNCQEPDKKTDPATPAPTSESPLQDKAAKEHVPAKSLSLDESFSSQNEPLTPDSGCNVSSPSQSPKGERSMKKQRVDMETAYAKPEMVLTHPILESSISSSYQQPQNVFDPSSRASVGKEDRPEVSGNDL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query328 2.2.26 [Sep-21-2011]
Q8GUN5413 Protein PHR1-LIKE 1 OS=Ar no no 0.448 0.355 0.568 1e-39
Q9SAK5358 Myb family transcription no no 0.399 0.365 0.584 3e-37
Q9C616388 Probable transcription fa no no 0.310 0.262 0.418 9e-15
Q93WJ9403 Transcription repressor K no no 0.195 0.158 0.606 2e-14
Q941I2322 Probable transcription fa no no 0.280 0.285 0.45 1e-13
Q9FJV5276 Probable transcription fa no no 0.213 0.253 0.542 3e-13
Q0J235532 Probable transcription fa no no 0.222 0.137 0.512 3e-13
Q700D9255 Putative Myb family trans no no 0.158 0.203 0.692 5e-13
Q9FGT7 635 Two-component response re no no 0.204 0.105 0.480 1e-10
O49397552 Two-component response re no no 0.192 0.114 0.5 3e-10
>sp|Q8GUN5|PHL1_ARATH Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1 Back     alignment and function desciption
 Score =  164 bits (414), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 87/153 (56%), Positives = 116/153 (75%), Gaps = 6/153 (3%)

Query: 32  DPMNGGNSLNNNPSLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTI 91
           D ++G NS  ++    SKQR+RWT ELHE FV+AV QLGG +RATPK VL+++   GLTI
Sbjct: 216 DQLSGRNS--SSSVATSKQRMRWTPELHEAFVEAVNQLGGSERATPKAVLKLLNNPGLTI 273

Query: 92  YHVKSHLQKYRLAKYLPDSSS-DGKKVDKKETG-DMLSSLDGSSGMQITEALKLQMEVQK 149
           YHVKSHLQKYR A+Y P++S   G+  +KK T  + + SLD  + ++IT+AL+LQMEVQK
Sbjct: 274 YHVKSHLQKYRTARYKPETSEVTGEPQEKKMTSIEDIKSLDMKTSVEITQALRLQMEVQK 333

Query: 150 RLHEQLE--RQLQLRIEAQGKYLKKIIEEQQRL 180
           RLHEQLE  R LQL+IE QG+YL+ + E+QQ++
Sbjct: 334 RLHEQLEIQRSLQLQIEKQGRYLQMMFEKQQKI 366




Transcription factor acting as central integrator of phosphate starvation responses.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SAK5|APL_ARATH Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=2 SV=2 Back     alignment and function description
>sp|Q9C616|KAN2_ARATH Probable transcription factor KAN2 OS=Arabidopsis thaliana GN=KAN2 PE=2 SV=1 Back     alignment and function description
>sp|Q93WJ9|KAN1_ARATH Transcription repressor KAN1 OS=Arabidopsis thaliana GN=KAN1 PE=1 SV=1 Back     alignment and function description
>sp|Q941I2|KAN3_ARATH Probable transcription factor KAN3 OS=Arabidopsis thaliana GN=KAN3 PE=2 SV=1 Back     alignment and function description
>sp|Q9FJV5|KAN4_ARATH Probable transcription factor KAN4 OS=Arabidopsis thaliana GN=KAN4 PE=1 SV=1 Back     alignment and function description
>sp|Q0J235|ROLL9_ORYSJ Probable transcription factor RL9 OS=Oryza sativa subsp. japonica GN=RL9 PE=2 SV=2 Back     alignment and function description
>sp|Q700D9|MYBF_ARATH Putative Myb family transcription factor At1g14600 OS=Arabidopsis thaliana GN=At1g14600 PE=2 SV=2 Back     alignment and function description
>sp|Q9FGT7|ARR18_ARATH Two-component response regulator ARR18 OS=Arabidopsis thaliana GN=ARR18 PE=2 SV=2 Back     alignment and function description
>sp|O49397|ARR10_ARATH Two-component response regulator ARR10 OS=Arabidopsis thaliana GN=ARR10 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query328
359496884336 PREDICTED: uncharacterized protein LOC10 0.978 0.955 0.818 1e-151
255568528336 transcription factor, putative [Ricinus 0.993 0.970 0.812 1e-149
224124782314 predicted protein [Populus trichocarpa] 0.926 0.968 0.805 1e-138
147840891306 hypothetical protein VITISV_035498 [Viti 0.887 0.950 0.83 1e-137
255638496329 unknown [Glycine max] 0.908 0.905 0.787 1e-132
356555797329 PREDICTED: uncharacterized protein LOC10 0.908 0.905 0.787 1e-132
224089499360 predicted protein [Populus trichocarpa] 0.993 0.905 0.737 1e-132
388498370321 unknown [Lotus japonicus] 0.945 0.965 0.725 1e-128
357448191330 Two-component response regulator EHD1 [M 0.917 0.912 0.775 1e-123
217073354330 unknown [Medicago truncatula] 0.917 0.912 0.768 1e-122
>gi|359496884|ref|XP_002272588.2| PREDICTED: uncharacterized protein LOC100248144 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  538 bits (1387), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 270/330 (81%), Positives = 290/330 (87%), Gaps = 9/330 (2%)

Query: 1   MYQPNSVPSSSLVRSNSLVHGQHLDCGSSQMDPMNGGNSLNNNPSLASKQRLRWTHELHE 60
           MYQP +VPS SLV +NSLVHGQH DCG++ MDP+NGGNSLNNNPSLASKQRLRWTHELHE
Sbjct: 1   MYQPKAVPSPSLVHNNSLVHGQHSDCGANTMDPINGGNSLNNNPSLASKQRLRWTHELHE 60

Query: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKVDKK 120
           RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKK DKK
Sbjct: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120

Query: 121 ETGDMLSSLDGSSGMQITEALKLQMEVQKRLHEQLE--RQLQLRIEAQGKYLKKIIEEQQ 178
           E+GDMLSSLDGSSGMQITEALKLQMEVQKRLHEQLE  RQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 121 ESGDMLSSLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 180

Query: 179 RLSGVLTEAPGSGGSAPASGDNCQEPDKKTDPATPAPTSESPLQDKAAKEHVPAKSLSLD 238
           RLSGV+TE PGSG S P SGDNC E D KTDPATPAPTSE PL DKAAKE  PAKSLS+D
Sbjct: 181 RLSGVITEVPGSGVSVPVSGDNCLESD-KTDPATPAPTSEGPLLDKAAKETAPAKSLSID 239

Query: 239 ESFSSQNEPLTPDSGCNVSSPSQSPKGERSMKKQRVDMETAYAKPEMVLTHPILESSISS 298
           ESFSS +EPLTPDSGC+V+SP +SPKGERS+KKQRV +  AYAK EMVLTH ILESS+SS
Sbjct: 240 ESFSSHHEPLTPDSGCHVNSPDESPKGERSVKKQRVSIGAAYAKQEMVLTHQILESSLSS 299

Query: 299 SYQQPQNV------FDPSSRASVGKEDRPE 322
           S+ QP +V      FDP +  S+  ED+ E
Sbjct: 300 SFHQPHSVFLNRDQFDPQAGISISNEDQLE 329




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255568528|ref|XP_002525238.1| transcription factor, putative [Ricinus communis] gi|223535535|gb|EEF37204.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224124782|ref|XP_002329947.1| predicted protein [Populus trichocarpa] gi|222871969|gb|EEF09100.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147840891|emb|CAN66505.1| hypothetical protein VITISV_035498 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255638496|gb|ACU19557.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356555797|ref|XP_003546216.1| PREDICTED: uncharacterized protein LOC100786925 isoform 1 [Glycine max] gi|356555799|ref|XP_003546217.1| PREDICTED: uncharacterized protein LOC100786925 isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|224089499|ref|XP_002308734.1| predicted protein [Populus trichocarpa] gi|222854710|gb|EEE92257.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388498370|gb|AFK37251.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|357448191|ref|XP_003594371.1| Two-component response regulator EHD1 [Medicago truncatula] gi|355483419|gb|AES64622.1| Two-component response regulator EHD1 [Medicago truncatula] Back     alignment and taxonomy information
>gi|217073354|gb|ACJ85036.1| unknown [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query328
TAIR|locus:2045761286 AT2G01060 [Arabidopsis thalian 0.820 0.940 0.574 5.3e-72
TAIR|locus:2139865409 PHR1 "phosphate starvation res 0.503 0.403 0.433 1.5e-26
TAIR|locus:2100880442 AT3G04450 [Arabidopsis thalian 0.567 0.420 0.359 6.7e-26
TAIR|locus:2089260449 AT3G13040 [Arabidopsis thalian 0.539 0.394 0.372 2.3e-25
TAIR|locus:2026739337 AT1G69580 [Arabidopsis thalian 0.176 0.172 0.637 6.3e-25
TAIR|locus:2148720413 PHL1 "AT5G29000" [Arabidopsis 0.405 0.322 0.452 1.6e-24
TAIR|locus:2119425295 UNE16 "unfertilized embryo sac 0.390 0.433 0.430 4.8e-23
TAIR|locus:2206415358 APL "ALTERED PHLOEM DEVELOPMEN 0.408 0.374 0.438 6.1e-23
TAIR|locus:2093726298 AT3G24120 [Arabidopsis thalian 0.390 0.429 0.430 1.6e-22
TAIR|locus:2157146264 AT5G45580 [Arabidopsis thalian 0.173 0.215 0.719 1.8e-22
TAIR|locus:2045761 AT2G01060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 728 (261.3 bits), Expect = 5.3e-72, P = 5.3e-72
 Identities = 165/287 (57%), Positives = 187/287 (65%)

Query:    47 ASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKY 106
             ASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKY
Sbjct:    13 ASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKY 72

Query:   107 LPXXXXXXXXXXXXETXXXXXXXXXXXXXQITEALKLQMEVQKXXXXXXXXXXXXXXX-- 164
             LP            E+             QITEALKLQMEVQK                 
Sbjct:    73 LPDSSSEGKKTDKKESGDMLSGLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIE 132

Query:   165 AQGKYLKKIIEEQQRLSGVLTEXXXXXXXXXXXXDNCQEPDKKTDPATPAPTSESPLQDK 224
             AQGKYLKKIIEEQQRLSGVL E                     +DPATPAPTSESPLQDK
Sbjct:   133 AQGKYLKKIIEEQQRLSGVLGEPSAPVTGD-------------SDPATPAPTSESPLQDK 179

Query:   225 AAKEHVPAKSLSLDESFSSQNEPLTPDSGCNVSSPSQSPKGERSMKKQRVDMETAYAKPE 284
             + K+  P KSLS+DES SS  EPLTPDSGCN+ SP +S   ER  KK R+    A   P+
Sbjct:   180 SGKDCGPDKSLSVDESLSSYREPLTPDSGCNIGSPDESTGEERLSKKPRLVRGAAGYTPD 239

Query:   285 MVLTHPILESSISSSYQQPQNV--FDPSSRASVGKEDR-PEVSGNDL 328
             +V+ HPILES +++SY Q  +V  FD  S + +G E++  +VSG++L
Sbjct:   240 IVVGHPILESGLNTSYHQSDHVLAFDQPSTSLLGAEEQLDKVSGDNL 286




GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
GO:0006499 "N-terminal protein myristoylation" evidence=RCA
GO:0006635 "fatty acid beta-oxidation" evidence=RCA
GO:0006891 "intra-Golgi vesicle-mediated transport" evidence=RCA
GO:0016558 "protein import into peroxisome matrix" evidence=RCA
TAIR|locus:2139865 PHR1 "phosphate starvation response 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2100880 AT3G04450 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089260 AT3G13040 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026739 AT1G69580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2148720 PHL1 "AT5G29000" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2119425 UNE16 "unfertilized embryo sac 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2206415 APL "ALTERED PHLOEM DEVELOPMENT" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093726 AT3G24120 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2157146 AT5G45580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pg.C_1370033
hypothetical protein (314 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query328
TIGR0155757 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domai 2e-24
pfam1437951 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, 1e-20
PLN03162526 PLN03162, PLN03162, golden-2 like transcription fa 5e-09
pfam0024947 pfam00249, Myb_DNA-binding, Myb-like DNA-binding d 1e-08
>gnl|CDD|130620 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
 Score = 93.6 bits (233), Expect = 2e-24
 Identities = 34/57 (59%), Positives = 42/57 (73%)

Query: 49  KQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAK 105
           K R+ WT +LH+RF+ AV +LGGPD ATPK +L +M V GLT   V SHLQKYRL +
Sbjct: 1   KPRVVWTEDLHDRFLQAVQKLGGPDWATPKRILELMVVDGLTRDQVASHLQKYRLKQ 57


This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif. Length = 57

>gnl|CDD|206547 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
>gnl|CDD|178707 PLN03162, PLN03162, golden-2 like transcription factor; Provisional Back     alignment and domain information
>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 328
PLN03162526 golden-2 like transcription factor; Provisional 99.89
PF1437951 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot 99.87
TIGR0155757 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF c 99.85
PF0024948 Myb_DNA-binding: Myb-like DNA-binding domain; Inte 97.42
smart0042668 TEA TEA domain. 89.73
>PLN03162 golden-2 like transcription factor; Provisional Back     alignment and domain information
Probab=99.89  E-value=6.3e-24  Score=207.80  Aligned_cols=63  Identities=48%  Similarity=0.699  Sum_probs=59.2

Q ss_pred             CCCCCCcccCCHHHHHHHHHHHHHhCCCCCCCccchhhhcCCCCccHHHHHHHhhhhhhhccCC
Q 020267           45 SLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLP  108 (328)
Q Consensus        45 ~~~~KpRlrWT~ELH~rFV~AV~qLGG~dkAtPK~IL~lM~V~GLT~~hVKSHLQKYRl~k~~p  108 (328)
                      ...+|+||+||+|||+|||+||++|| .++||||+||++|+|+|||++||||||||||+.+++.
T Consensus       231 ~g~KKpRLrWTpELH~rFVeAV~qLG-~dKATPK~ILelMnV~GLTRenVKSHLQKYRl~rk~l  293 (526)
T PLN03162        231 PGKKKAKVDWTPELHRRFVHAVEQLG-VEKAFPSRILELMGVQCLTRHNIASHLQKYRSHRRHL  293 (526)
T ss_pred             CCCCCCcccCCHHHHHHHHHHHHHhC-cCccchHHHHHHcCCCCcCHHHHHHHHHHHHHhcccc
Confidence            34789999999999999999999999 7999999999999999999999999999999987653



>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins Back     alignment and domain information
>smart00426 TEA TEA domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query328
1irz_A64 Solution Structure Of Arr10-B Belonging To The Garp 1e-08
>pdb|1IRZ|A Chain A, Solution Structure Of Arr10-B Belonging To The Garp Family Of Plant Myb-Related Dna Binding Motifs Of The Arabidopsis Response Regulators Length = 64 Back     alignment and structure

Iteration: 1

Score = 57.0 bits (136), Expect = 1e-08, Method: Composition-based stats. Identities = 30/56 (53%), Positives = 39/56 (69%), Gaps = 1/56 (1%) Query: 49 KQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLA 104 K R+ WTHELH +F+ AV LG +RA PK +L +M V LT +V SHLQK+R+A Sbjct: 5 KPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query328
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 7e-25
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-06
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Length = 64 Back     alignment and structure
 Score = 94.2 bits (234), Expect = 7e-25
 Identities = 30/61 (49%), Positives = 40/61 (65%), Gaps = 1/61 (1%)

Query: 45  SLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLA 104
           +   K R+ WTHELH +F+ AV  LG  +RA PK +L +M V  LT  +V SHLQK+R+A
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 105 K 105
            
Sbjct: 60  L 60


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query328
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 99.95
2yus_A79 SWI/SNF-related matrix-associated actin- dependent 94.52
2yum_A75 ZZZ3 protein, zinc finger ZZ-type-containing prote 92.7
2cu7_A72 KIAA1915 protein; nuclear protein, SANT domain, DN 92.38
1x41_A60 Transcriptional adaptor 2-like, isoform B; transcr 91.19
2iw5_B235 Protein corest, REST corepressor 1; oxidoreductase 90.46
2xag_B482 REST corepressor 1; amine oxidase, chromatin regul 89.0
1ity_A69 TRF1; helix-turn-helix, telomeres, DNA binding, MY 87.99
2hzd_A82 Transcriptional enhancer factor TEF-1; DNA-binding 86.38
2elk_A58 SPCC24B10.08C protein; hypothetical protein, struc 85.73
2cqq_A72 RSGI RUH-037, DNAJ homolog subfamily C member 1; m 84.51
2ba2_A85 D12_ORF131, hypothetical UPF0134 protein MPN010; D 82.49
3sjm_A64 Telomeric repeat-binding factor 2; human telomeric 82.14
2cqr_A73 RSGI RUH-043, DNAJ homolog subfamily C member 1; m 81.33
2d9a_A60 B-MYB, MYB-related protein B; DNA binding, structu 80.8
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Back     alignment and structure
Probab=99.95  E-value=3.7e-29  Score=190.54  Aligned_cols=61  Identities=49%  Similarity=0.750  Sum_probs=57.5

Q ss_pred             CCCCCcccCCHHHHHHHHHHHHHhCCCCCCCccchhhhcCCCCccHHHHHHHhhhhhhhccC
Q 020267           46 LASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYL  107 (328)
Q Consensus        46 ~~~KpRlrWT~ELH~rFV~AV~qLGG~dkAtPK~IL~lM~V~GLT~~hVKSHLQKYRl~k~~  107 (328)
                      ..+|+||+||+|||++||+||++|| .++||||.||++|+|+|||++||+|||||||+...+
T Consensus         2 ~~~k~r~~WT~elH~~Fv~Av~~LG-~~~AtPk~Il~~M~v~gLT~~~VkSHLQKYR~~l~r   62 (64)
T 1irz_A            2 AQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVALKK   62 (64)
T ss_dssp             CCCCSSCSSCHHHHHHHHHHHHHHC-TTTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHHS
T ss_pred             CCCCCCCcCCHHHHHHHHHHHHHhC-CCCCCcHHHHHHcCCCCCCHHHHHHHHHHHHHHHHc
Confidence            4689999999999999999999999 799999999999999999999999999999997543



>2yus_A SWI/SNF-related matrix-associated actin- dependent regulator of chromatin subfamily...; SWI/SNF complex 155 kDa subunit, BRG1-associated factor 155; NMR {Homo sapiens} Back     alignment and structure
>2yum_A ZZZ3 protein, zinc finger ZZ-type-containing protein 3; transcription, structural genomics, NPPSFA; NMR {Homo sapiens} Back     alignment and structure
>2cu7_A KIAA1915 protein; nuclear protein, SANT domain, DNA binding, regulation of transcription, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure
>1x41_A Transcriptional adaptor 2-like, isoform B; transcriptional adaptor protein2, transcriptional activation, MYB domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.1 Back     alignment and structure
>2iw5_B Protein corest, REST corepressor 1; oxidoreductase-transcription regulator complex, oxidoreductase/repressor complex, histone demethylase, FAD; HET: FAD; 2.57A {Homo sapiens} SCOP: a.4.1.3 PDB: 2uxn_B* 2uxx_B* 2y48_B* 2v1d_B* 2x0l_B* Back     alignment and structure
>2xag_B REST corepressor 1; amine oxidase, chromatin regulator, histone inhibitor binding, methylation, nucleosome core, oxidoreductase; HET: FAD TCF; 3.10A {Homo sapiens} PDB: 2xaf_B* 2xah_B* 2xaj_B* 2xaq_B* 2xas_B* Back     alignment and structure
>1ity_A TRF1; helix-turn-helix, telomeres, DNA binding, MYB domain, riken structural genomics/proteomics initiative, RSGI, structural genomics; NMR {Homo sapiens} SCOP: a.4.1.4 PDB: 1iv6_A Back     alignment and structure
>2hzd_A Transcriptional enhancer factor TEF-1; DNA-binding, helix-turn-helix, gene regulation; NMR {Homo sapiens} Back     alignment and structure
>2elk_A SPCC24B10.08C protein; hypothetical protein, structural genomics, NPPSFA; NMR {Schizosaccharomyces pombe} Back     alignment and structure
>2cqq_A RSGI RUH-037, DNAJ homolog subfamily C member 1; membrane protein, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure
>2ba2_A D12_ORF131, hypothetical UPF0134 protein MPN010; DUF16, hypothetical protein, coiled-coil, stutter, structural genomics, PSI; 1.80A {Mycoplasma pneumoniae} SCOP: h.1.30.1 Back     alignment and structure
>3sjm_A Telomeric repeat-binding factor 2; human telomeric repeat binding protein 2, telomere, telomeri homeodomain proteins amino acid sequence; HET: DNA; 1.35A {Homo sapiens} PDB: 1xg1_A 1vfc_A 1vf9_A 1w0u_A Back     alignment and structure
>2cqr_A RSGI RUH-043, DNAJ homolog subfamily C member 1; membrane protein, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure
>2d9a_A B-MYB, MYB-related protein B; DNA binding, structural genomics, unknown function, NPPSFA; NMR {Mus musculus} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 328
d1irza_64 a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis th 2e-23
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 64 Back     information, alignment and structure

class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 89.6 bits (222), Expect = 2e-23
 Identities = 30/61 (49%), Positives = 40/61 (65%), Gaps = 1/61 (1%)

Query: 45  SLASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLA 104
           +   K R+ WTHELH +F+ AV  LG  +RA PK +L +M V  LT  +V SHLQK+R+A
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 105 K 105
            
Sbjct: 60  L 60


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query328
d1irza_64 Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId 99.93
d2cu7a165 MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 960 94.32
d1x41a147 Transcriptional adaptor 2-like, TADA2L, isoform b 91.64
d1xc5a168 Nuclear receptor corepressor 2 {Human (Homo sapien 88.81
d2iw5b165 REST corepressor 1, CoREST {Human (Homo sapiens) [ 85.14
d1guua_50 c-Myb, DNA-binding domain {Mouse (Mus musculus) [T 82.17
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.93  E-value=1e-26  Score=174.93  Aligned_cols=61  Identities=49%  Similarity=0.742  Sum_probs=57.2

Q ss_pred             CCCCCcccCCHHHHHHHHHHHHHhCCCCCCCccchhhhcCCCCccHHHHHHHhhhhhhhccC
Q 020267           46 LASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYL  107 (328)
Q Consensus        46 ~~~KpRlrWT~ELH~rFV~AV~qLGG~dkAtPK~IL~lM~V~GLT~~hVKSHLQKYRl~k~~  107 (328)
                      ..+|+|++||+|||++||+||++||+ ++||||.|+++|+|+|||+.||+|||||||+...+
T Consensus         2 ~~kk~R~~WT~elH~~Fv~Av~~lG~-~~atpk~I~~~m~v~~lT~~qV~SHlQKYrl~l~k   62 (64)
T d1irza_           2 AQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVALKK   62 (64)
T ss_dssp             CCCCSSCSSCHHHHHHHHHHHHHHCT-TTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHHS
T ss_pred             CCCCCCCCCCHHHHHHHHHHHHHhCc-cccchHHHHHHcCCCCCCHHHHHHHHHHHHHHHHh
Confidence            46899999999999999999999985 89999999999999999999999999999997643



>d2cu7a1 a.4.1.3 (A:8-72) MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x41a1 a.4.1.1 (A:8-54) Transcriptional adaptor 2-like, TADA2L, isoform b {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1xc5a1 a.4.1.3 (A:413-480) Nuclear receptor corepressor 2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2iw5b1 a.4.1.3 (B:376-440) REST corepressor 1, CoREST {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1guua_ a.4.1.3 (A:) c-Myb, DNA-binding domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure