Citrus Sinensis ID: 015840


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------40
MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
ccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHcccccccHHHHHHHHccccccHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHccHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
ccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccHHHHHHcccccEEEEEHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHccHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHcccHcccHHHHHccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccEEEccccccc
myhhhqnqgksmhsssrmpipterhlflqggsgpgdsglvlstdakprlkwtpdLHERFIEAVNQlggadkatpKTVMKlmgipgltlYHLKSHLQKYRlsknlhgqanignnkivtvpgermpeanathmnnlsigpqpnkslhISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKelqgfcpqqpqanqptdcsmdscltscegsqkdqeihnggvrlrpyhgtptlepkeiveepmlQQTELKWRKdlkeskflssigkdrgpgelsigsgsfpagrfkasnedehfqdqtnkkpegaklenenllpeyrlpcfstkldlnahdhendvasgckqfdlngfswn
myhhhqnqgksmhsssRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNqlggadkatpKTVMKLMGIPGLTLYHLKSHLQKYRLSknlhgqanigNNKIVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKdqeihnggvrlrpyhgtptlepkeiveepmLQQTELKWRKDLKESKFlssigkdrgpgelsiGSGSFPAGRFKASNEDEHfqdqtnkkpegaklenenllpeYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
***********************************************RLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIVTV*******************************IQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGF******************************************************************************************************************************LPEYRLPCFSTKLDLNAH*****VA*GCKQFD*******
*************************************************KWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKY********************************************************************************************************************************************************************************************************************************************************************************TKLDLN**D********C*QFDLNGFSW*
*****************MPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQ***********DSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKAS**********NKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
**********************************************PRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNL**************************************SLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLG******************************************************************************************KEIVEEPMLQQTELKWRKDLKESKFLSSIG*************************************************EYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSW*
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query399 2.2.26 [Sep-21-2011]
Q9SAK5358 Myb family transcription no no 0.383 0.427 0.588 4e-44
Q8GUN5413 Protein PHR1-LIKE 1 OS=Ar no no 0.340 0.329 0.486 3e-32
Q700D9255 Putative Myb family trans no no 0.140 0.219 0.607 4e-14
Q93WJ9403 Transcription repressor K no no 0.140 0.138 0.571 7e-13
Q9C616388 Probable transcription fa no no 0.140 0.144 0.571 1e-12
Q0J235532 Probable transcription fa no no 0.142 0.107 0.561 2e-12
Q9FJV5276 Probable transcription fa no no 0.160 0.231 0.538 3e-12
Q9ZWJ9664 Two-component response re no no 0.135 0.081 0.545 5e-12
Q941I2322 Probable transcription fa no no 0.140 0.173 0.553 7e-12
Q940D0 690 Two-component response re no no 0.135 0.078 0.545 1e-11
>sp|Q9SAK5|APL_ARATH Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=2 SV=2 Back     alignment and function desciption
 Score =  179 bits (453), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 96/163 (58%), Positives = 122/163 (74%), Gaps = 10/163 (6%)

Query: 35  GDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSH 94
           GDSGLVL+TD KPRL+WT +LHERF++AV QLGG DKATPKT+M++MG+ GLTLYHLKSH
Sbjct: 23  GDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKSH 82

Query: 95  LQKYRLSKNLHGQANIGNNKIVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQI 154
           LQK+RL K  H +   G++   T  G R   A+A  +           S +++E   MQ+
Sbjct: 83  LQKFRLGKQPHKE--YGDHS--TKEGSR---ASAMDIQRNVASSSGMMSRNMNE---MQM 132

Query: 155 EVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNL 197
           EVQRRLHEQLEVQRHLQLRIEAQGKY+Q++LE+A +TL  +N+
Sbjct: 133 EVQRRLHEQLEVQRHLQLRIEAQGKYMQSILERACQTLAGENM 175




Transcription factor required for phloem identity. Has a dual role both in promoting phloem differentiation and in repressing xylem differentiation during vascular development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8GUN5|PHL1_ARATH Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1 Back     alignment and function description
>sp|Q700D9|MYBF_ARATH Putative Myb family transcription factor At1g14600 OS=Arabidopsis thaliana GN=At1g14600 PE=2 SV=2 Back     alignment and function description
>sp|Q93WJ9|KAN1_ARATH Transcription repressor KAN1 OS=Arabidopsis thaliana GN=KAN1 PE=1 SV=1 Back     alignment and function description
>sp|Q9C616|KAN2_ARATH Probable transcription factor KAN2 OS=Arabidopsis thaliana GN=KAN2 PE=2 SV=1 Back     alignment and function description
>sp|Q0J235|ROLL9_ORYSJ Probable transcription factor RL9 OS=Oryza sativa subsp. japonica GN=RL9 PE=2 SV=2 Back     alignment and function description
>sp|Q9FJV5|KAN4_ARATH Probable transcription factor KAN4 OS=Arabidopsis thaliana GN=KAN4 PE=1 SV=1 Back     alignment and function description
>sp|Q9ZWJ9|ARR2_ARATH Two-component response regulator ARR2 OS=Arabidopsis thaliana GN=ARR2 PE=1 SV=1 Back     alignment and function description
>sp|Q941I2|KAN3_ARATH Probable transcription factor KAN3 OS=Arabidopsis thaliana GN=KAN3 PE=2 SV=1 Back     alignment and function description
>sp|Q940D0|ARR1_ARATH Two-component response regulator ARR1 OS=Arabidopsis thaliana GN=ARR1 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query399
224125896427 predicted protein [Populus trichocarpa] 0.997 0.932 0.75 1e-175
255568946419 transcription factor, putative [Ricinus 0.997 0.949 0.719 1e-166
224144770421 predicted protein [Populus trichocarpa] 0.997 0.945 0.718 1e-165
225441650418 PREDICTED: uncharacterized protein LOC10 0.987 0.942 0.704 1e-158
225441648412 PREDICTED: uncharacterized protein LOC10 0.972 0.941 0.690 1e-152
356543598420 PREDICTED: uncharacterized protein LOC10 0.989 0.940 0.639 1e-142
356547128400 PREDICTED: uncharacterized protein LOC10 0.924 0.922 0.645 1e-137
356542754405 PREDICTED: uncharacterized protein LOC10 0.977 0.962 0.623 1e-131
356542756399 PREDICTED: uncharacterized protein LOC10 0.962 0.962 0.611 1e-127
186509740394 myb family transcription factor [Arabido 0.952 0.964 0.605 1e-126
>gi|224125896|ref|XP_002319702.1| predicted protein [Populus trichocarpa] gi|222858078|gb|EEE95625.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 321/428 (75%), Positives = 348/428 (81%), Gaps = 30/428 (7%)

Query: 1   MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60
           MYHHHQ+QGKS+HSSSRM IP ERHLFLQGG+GPGDSGLVLSTDAKPRLKWTPDLHERFI
Sbjct: 1   MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 61  EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKI--VTV 118
           EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIG++KI  V V
Sbjct: 61  EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGSSKIGTVAV 120

Query: 119 PGERMPEANATHMN--NLSIGPQPNK-----SLHISETIQMQIEVQRRLHEQLEVQRHLQ 171
            G+RMPEANATH+N  NLSIG QPNK     SLH SE +QMQIEVQRRLHEQLEVQRHLQ
Sbjct: 121 VGDRMPEANATHININNLSIGSQPNKILKSRSLHFSEALQMQIEVQRRLHEQLEVQRHLQ 180

Query: 172 LRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKEL 231
           LRIEAQGKYLQAVLEKAQETLGRQNLGT GLEAAKVQLSELVSKVSTQCLNSTFS+L +L
Sbjct: 181 LRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSTFSELNDL 240

Query: 232 QGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPM 291
           QG CPQQ    QP DCSMDSCLTSCEGSQK+QEIHN G+ LRP +    LEPKEI EE  
Sbjct: 241 QGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIHNIGMGLRPCNSNALLEPKEIAEEHA 300

Query: 292 LQQTELKWRKDLKESK-FLSSIG---------KDRGPGELSIGSG----------SFPAG 331
           LQQTELKW + L+++K FL+SIG          +R   +LSIG G          SF  G
Sbjct: 301 LQQTELKWGEYLRDNKMFLTSIGHETERRTFSAERSCSDLSIGVGLQGEKGNINSSFAEG 360

Query: 332 RFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQF 391
           RFK  +ED+ FQDQTNK+ E  K E+E + P YRL  F+TKLDLN+HD E D AS CKQ 
Sbjct: 361 RFKGMSEDDSFQDQTNKRAESVKYEDEKMSPGYRLSYFTTKLDLNSHD-EIDAASSCKQL 419

Query: 392 DLNGFSWN 399
           DLNGFSWN
Sbjct: 420 DLNGFSWN 427




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255568946|ref|XP_002525443.1| transcription factor, putative [Ricinus communis] gi|223535256|gb|EEF36933.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224144770|ref|XP_002325408.1| predicted protein [Populus trichocarpa] gi|222862283|gb|EEE99789.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225441650|ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248614 isoform 1 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225441648|ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248614 isoform 2 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356543598|ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810396 [Glycine max] Back     alignment and taxonomy information
>gi|356547128|ref|XP_003541969.1| PREDICTED: uncharacterized protein LOC100799064 [Glycine max] Back     alignment and taxonomy information
>gi|356542754|ref|XP_003539830.1| PREDICTED: uncharacterized protein LOC100805237 isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|356542756|ref|XP_003539831.1| PREDICTED: uncharacterized protein LOC100805237 isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|186509740|ref|NP_001118567.1| myb family transcription factor [Arabidopsis thaliana] gi|6223653|gb|AAF05867.1|AC011698_18 transfactor-like [Arabidopsis thaliana] gi|332640508|gb|AEE74029.1| myb family transcription factor [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query399
TAIR|locus:2095933394 MYR2 [Arabidopsis thaliana (ta 0.957 0.969 0.618 6e-119
TAIR|locus:2172394402 MYR1 "myb-related protein 1" [ 0.967 0.960 0.592 1.2e-115
TAIR|locus:2026739337 AT1G69580 [Arabidopsis thalian 0.573 0.679 0.489 2.4e-51
TAIR|locus:2119425295 UNE16 "unfertilized embryo sac 0.606 0.820 0.478 1.7e-50
TAIR|locus:2093726298 AT3G24120 [Arabidopsis thalian 0.566 0.758 0.477 4.1e-49
TAIR|locus:2206415358 APL "ALTERED PHLOEM DEVELOPMEN 0.403 0.449 0.573 3.9e-44
TAIR|locus:2157146264 AT5G45580 [Arabidopsis thalian 0.406 0.613 0.486 1.3e-34
TAIR|locus:2045761286 AT2G01060 [Arabidopsis thalian 0.365 0.510 0.503 7.4e-34
TAIR|locus:2087740235 AT3G12730 [Arabidopsis thalian 0.370 0.629 0.503 7.4e-34
TAIR|locus:2139865409 PHR1 "phosphate starvation res 0.456 0.444 0.405 4.1e-33
TAIR|locus:2095933 MYR2 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1171 (417.3 bits), Expect = 6.0e-119, P = 6.0e-119
 Identities = 254/411 (61%), Positives = 301/411 (73%)

Query:     1 MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60
             MY+ +Q+QGK++ SSSRM I +ERH FL+G S PGDSGL+LSTDAKPRLKWTPDLHERFI
Sbjct:     1 MYYQNQHQGKNILSSSRMHITSERHPFLRGNS-PGDSGLILSTDAKPRLKWTPDLHERFI 59

Query:    61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKI--VTV 118
             EAVNQLGGADKATPKT+MK+MGIPGLTLYHLKSHLQKYRLSKNL+GQAN   NKI  +T+
Sbjct:    60 EAVNQLGGADKATPKTIMKVMGIPGLTLYHLKSHLQKYRLSKNLNGQANNSFNKIGIMTM 119

Query:   119 PGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 178
               E+ P+A+     NLSIGPQPNK+  I E +QMQIEVQRRLHEQLEVQRHLQLRIEAQG
Sbjct:   120 MEEKTPDADEIQSENLSIGPQPNKNSPIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 179

Query:   179 KYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQ 238
             KYLQ+VLEKAQETLGRQNLG AG+EAAKVQLSELVSKVS +  NS+F + KELQ  C QQ
Sbjct:   180 KYLQSVLEKAQETLGRQNLGAAGIEAAKVQLSELVSKVSAEYPNSSFLEPKELQNLCSQQ 239

Query:   239 PQANQPTDCSMDSCLTSCEGSQKDQE-IHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTEL 297
              Q N P DCS++SCLTS EG+QK+ + + N  + LR Y G  T E KEI+EEP+ Q+ EL
Sbjct:   240 MQTNYPPDCSLESCLTSSEGTQKNSKMLENNRLGLRTYIGDSTSEQKEIMEEPLFQRMEL 299

Query:   298 KWRKDLKESKFLSSIGKD---------RGPGELSIGSGSFPAGRFKASNEDEHFQDQTNK 348
              W + L+ + +LS++  +         R PG LSIG G          ++ +H   Q N 
Sbjct:   300 TWTEGLRGNPYLSTMVSEAEQRISYSERSPGRLSIGVGLH-------GHKSQH--QQGNN 350

Query:   349 KPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN 399
               E  KLE  N     +    +T+LDLN H  EN   +  KQFDLNGFSWN
Sbjct:   351 --EDHKLETRNR----KGMDSTTELDLNTHV-ENYCTTRTKQFDLNGFSWN 394




GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
TAIR|locus:2172394 MYR1 "myb-related protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026739 AT1G69580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2119425 UNE16 "unfertilized embryo sac 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093726 AT3G24120 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2206415 APL "ALTERED PHLOEM DEVELOPMENT" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2157146 AT5G45580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2045761 AT2G01060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2087740 AT3G12730 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2139865 PHR1 "phosphate starvation response 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.4.240.691
3rd Layer3.4.24.57LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00130592
hypothetical protein (427 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query399
pfam1437951 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, 3e-22
TIGR0155757 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domai 7e-21
PLN03162526 PLN03162, PLN03162, golden-2 like transcription fa 2e-10
pfam0024947 pfam00249, Myb_DNA-binding, Myb-like DNA-binding d 7e-08
>gnl|CDD|206547 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
 Score = 88.5 bits (220), Expect = 3e-22
 Identities = 37/51 (72%), Positives = 46/51 (90%)

Query: 143 SLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLG 193
            + I+E +++Q+EVQRRLHEQLEVQRHLQ+RIEAQGKYLQ +LEKAQ+TL 
Sbjct: 1   GMQITEALRLQMEVQRRLHEQLEVQRHLQIRIEAQGKYLQEILEKAQKTLS 51


This family is found towards the C-terminus of Myb-CC type transcription factors, and carries a highly conserved LHEQLE sequence motif. Length = 51

>gnl|CDD|130620 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
>gnl|CDD|178707 PLN03162, PLN03162, golden-2 like transcription factor; Provisional Back     alignment and domain information
>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 399
PF1437951 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot 99.92
PLN03162526 golden-2 like transcription factor; Provisional 99.87
TIGR0155757 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF c 99.84
PF0024948 Myb_DNA-binding: Myb-like DNA-binding domain; Inte 97.18
smart0042668 TEA TEA domain. 89.3
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
Probab=99.92  E-value=1.3e-25  Score=169.68  Aligned_cols=51  Identities=75%  Similarity=1.086  Sum_probs=49.0

Q ss_pred             CCChHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHHHHHHHHhc
Q 015840          143 SLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLG  193 (399)
Q Consensus       143 ~~qItEALqmQmEVQRRLHEQLEVQRhLQLRIEAQGKYLQsILEKAqe~La  193 (399)
                      +++|+|||++||||||||||||||||+||+|||||||||++|||||+++++
T Consensus         1 g~~i~EALr~QmEvQrrLhEQLEvQr~Lqlrieaqgkyl~~ilek~~~~~s   51 (51)
T PF14379_consen    1 GMQITEALRMQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQKALS   51 (51)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC
Confidence            578999999999999999999999999999999999999999999999874



>PLN03162 golden-2 like transcription factor; Provisional Back     alignment and domain information
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins Back     alignment and domain information
>smart00426 TEA TEA domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query399
1irz_A64 Solution Structure Of Arr10-B Belonging To The Garp 4e-08
>pdb|1IRZ|A Chain A, Solution Structure Of Arr10-B Belonging To The Garp Family Of Plant Myb-Related Dna Binding Motifs Of The Arabidopsis Response Regulators Length = 64 Back     alignment and structure

Iteration: 1

Score = 55.8 bits (133), Expect = 4e-08, Method: Composition-based stats. Identities = 25/60 (41%), Positives = 41/60 (68%), Gaps = 1/60 (1%) Query: 42 STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLS 101 + KPR+ WT +LH +F+ AV+ LG ++A PK ++ LM + LT ++ SHLQK+R++ Sbjct: 1 TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query399
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 2e-24
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-04
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Length = 64 Back     alignment and structure
 Score = 94.6 bits (235), Expect = 2e-24
 Identities = 25/61 (40%), Positives = 41/61 (67%), Gaps = 1/61 (1%)

Query: 42  STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLS 101
           +   KPR+ WT +LH +F+ AV+ LG  ++A PK ++ LM +  LT  ++ SHLQK+R++
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 102 K 102
            
Sbjct: 60  L 60


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query399
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 99.95
2yus_A79 SWI/SNF-related matrix-associated actin- dependent 89.83
2yum_A75 ZZZ3 protein, zinc finger ZZ-type-containing prote 87.11
2cu7_A72 KIAA1915 protein; nuclear protein, SANT domain, DN 86.53
2hzd_A82 Transcriptional enhancer factor TEF-1; DNA-binding 84.21
1x41_A60 Transcriptional adaptor 2-like, isoform B; transcr 81.3
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Back     alignment and structure
Probab=99.95  E-value=9.9e-29  Score=192.77  Aligned_cols=62  Identities=40%  Similarity=0.785  Sum_probs=58.7

Q ss_pred             cCCCCCCccCCHHHHHHHHHHHHHhCCCCCCCchHHhhhcCCCCccHHHHHHhhhhhhhhccc
Q 015840           42 STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNL  104 (399)
Q Consensus        42 s~~~KpRlrWT~ELH~rFV~AV~qLGG~dkAtPK~Il~lM~v~gLT~~hVkSHLQKYRl~~~~  104 (399)
                      ++.+|||++||+|||++||+||++|| .++||||.||++|+|+|||++||||||||||+..+.
T Consensus         1 ~~~~k~r~~WT~elH~~Fv~Av~~LG-~~~AtPk~Il~~M~v~gLT~~~VkSHLQKYR~~l~r   62 (64)
T 1irz_A            1 TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVALKK   62 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHC-TTTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHHS
T ss_pred             CCCCCCCCcCCHHHHHHHHHHHHHhC-CCCCCcHHHHHHcCCCCCCHHHHHHHHHHHHHHHHc
Confidence            46789999999999999999999999 799999999999999999999999999999998653



>2yus_A SWI/SNF-related matrix-associated actin- dependent regulator of chromatin subfamily...; SWI/SNF complex 155 kDa subunit, BRG1-associated factor 155; NMR {Homo sapiens} Back     alignment and structure
>2yum_A ZZZ3 protein, zinc finger ZZ-type-containing protein 3; transcription, structural genomics, NPPSFA; NMR {Homo sapiens} Back     alignment and structure
>2cu7_A KIAA1915 protein; nuclear protein, SANT domain, DNA binding, regulation of transcription, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure
>2hzd_A Transcriptional enhancer factor TEF-1; DNA-binding, helix-turn-helix, gene regulation; NMR {Homo sapiens} Back     alignment and structure
>1x41_A Transcriptional adaptor 2-like, isoform B; transcriptional adaptor protein2, transcriptional activation, MYB domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 399
d1irza_64 a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis th 1e-22
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 64 Back     information, alignment and structure

class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 88.4 bits (219), Expect = 1e-22
 Identities = 25/61 (40%), Positives = 41/61 (67%), Gaps = 1/61 (1%)

Query: 42  STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLS 101
           +   KPR+ WT +LH +F+ AV+ LG  ++A PK ++ LM +  LT  ++ SHLQK+R++
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 102 K 102
            
Sbjct: 60  L 60


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query399
d1irza_64 Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId 99.92
d2cu7a165 MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 960 90.12
d1x41a147 Transcriptional adaptor 2-like, TADA2L, isoform b 84.1
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.92  E-value=3.3e-26  Score=176.50  Aligned_cols=61  Identities=41%  Similarity=0.791  Sum_probs=57.6

Q ss_pred             cCCCCCCccCCHHHHHHHHHHHHHhCCCCCCCchHHhhhcCCCCccHHHHHHhhhhhhhhcc
Q 015840           42 STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKN  103 (399)
Q Consensus        42 s~~~KpRlrWT~ELH~rFV~AV~qLGG~dkAtPK~Il~lM~v~gLT~~hVkSHLQKYRl~~~  103 (399)
                      |..+|||++||+|||++||+||++||+ ++||||.|+++|+|+|||+.||+|||||||+...
T Consensus         1 s~~kk~R~~WT~elH~~Fv~Av~~lG~-~~atpk~I~~~m~v~~lT~~qV~SHlQKYrl~l~   61 (64)
T d1irza_           1 TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVALK   61 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHCT-TTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHH
T ss_pred             CCCCCCCCCCCHHHHHHHHHHHHHhCc-cccchHHHHHHcCCCCCCHHHHHHHHHHHHHHHH
Confidence            457899999999999999999999985 8999999999999999999999999999999864



>d2cu7a1 a.4.1.3 (A:8-72) MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x41a1 a.4.1.1 (A:8-54) Transcriptional adaptor 2-like, TADA2L, isoform b {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure