Citrus sinensis ID: 014887


Local Sequence Feature Prediction

Prediction (Method) and Result (the result lines below follow the order of this list)
Residue Number Marker
Protein Sequence
Secondary Structure (PSIPRED)
Secondary Structure Prediction (SSPRO)
Coil and Loop (DISEMBL)
Flexible Loop (DISEMBL)
Low Complexity Region (SEG)
Disordered Region (IsUnstruct)
Disordered Region (DISOPRED)
Disordered Region (DISEMBL)
Disordered Region (DISPRO)
Transmembrane Helix (TMHMM)
Transmembrane Helix (HMMTOP)
Transmembrane Helix (MEMSAT)
TM Helix, Signal Peptide (MEMSAT_SVM)
TM Helix, Signal Peptide (Phobius)
Signal Peptide (SignalP HMM Mode)
Signal Peptide (SignalP NN Mode)
Coiled Coils (COILS)
Positional Conservation
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410------
MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
ccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHccccccccHHHHHHHccccccHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHccHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccccccccccccccccccccc
ccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccHHHHHHcccccEEEEEHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHccHccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccccccccHHHcccHHccHHHHHccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccEEEccccccc
myhhhqnqgksmhsssrmpipterhlflqggsgpgdsglvlstdakprlkwtpdLHERFIEAVNQlggadkatpKTVMKlmgipgltlYHLKSHLQKYRlsknlhgqanignnkiahtgiggmkfkssgvgpvtvpgermpeanathmnnlsigpqpnkslhISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKelqgfcpqqpqanqptdcsmdscltscegsqkdqeihnggvrlrpyhgtptlepkeiveepmlQQTELKWRKdlkeskflssigkdrgpgelsigsgsfpagrfkasnedehfqdqtnkkpegaklenenllpeyrlpcfstkldlnahdhendvasgckqfdlngfswn
myhhhqnqgksmhsssRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNqlggadkatpKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKdqeihnggvrlrpyhgtptlepkeiveepmLQQTELKWRKDLKESKFlssigkdrgpgelsiGSGSFPAGRFKASNEDEHfqdqtnkkpegaklenenllpeYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
***********************************************RLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGIGGMKF*****************************************IQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGF******************************************************************************************************************************LPEYRLPCFSTKLDLNAH*****VA*GCKQFD*******
*************************************************KWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQK***********************************************ATH***************************************************************************************************************************************************************************************************************************************************D****************FDLNGFSW*
*****************MPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQ***********DSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKAS**********NKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
**********************************************PRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNL*******************************************************SLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLG*****************************************************************************************PKEIVEEPMLQQTELKWRKDLKESKFLSSIG*************************************************EYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSW*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN
no confident homologs detected
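
The per-residue tracks above encode one prediction state per sequence position (for example H for helix in the secondary-structure lines). A minimal Python sketch of how such a track could be turned into 1-based residue ranges is given below. The function name segments and the toy input are illustrative assumptions, not part of the report or of any predictor's output format.

```python
from itertools import groupby


def segments(track, symbols):
    """Return 1-based inclusive (start, end) runs of positions whose
    character is one of `symbols` in a per-residue prediction track."""
    runs = []
    pos = 1  # residue numbering in the report starts at 1
    for is_hit, group in groupby(track, key=lambda ch: ch in symbols):
        length = len(list(group))
        if is_hit:
            runs.append((pos, pos + length - 1))
        pos += length
    return runs


# Toy track (hypothetical, not taken from the report)
print(segments("ccHHHHccEEEcc", "H"))  # [(3, 6)]
```

Passing a full track line from the report as the first argument would list the predicted segments for that method; the same call with "h" would apply to the transmembrane-helix lines.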

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

ID      Length  Definition                 RBH(Q2H)  RBH(H2Q)  Q cover  H cover  Identity  E-value
Query   416     2.2.26 [Sep-21-2011]
Q9SAK5  358     Myb family transcription   no        no        0.367    0.427    0.522     2e-42
Q8GUN5  413     Protein PHR1-LIKE 1 OS=Ar  no        no        0.326    0.329    0.447     2e-30
Q700D9  255     Putative Myb family trans  no        no        0.134    0.219    0.607     5e-14
Q9C616  388     Probable transcription fa  no        no        0.182    0.195    0.467     5e-13
Q93WJ9  403     Transcription repressor K  no        no        0.134    0.138    0.571     7e-13
Q0J235  532     Probable transcription fa  no        no        0.137    0.107    0.561     2e-12
Q9FJV5  276     Probable transcription fa  no        no        0.153    0.231    0.538     4e-12
Q9ZWJ9  664     Two-component response re  no        no        0.156    0.097    0.5       5e-12
Q941I2  322     Probable transcription fa  no        no        0.134    0.173    0.553     7e-12
Q940D0  690     Two-component response re  no        no        0.129    0.078    0.545     1e-11
>sp|Q9SAK5|APL_ARATH Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=2 SV=2
 Score =  173 bits (439), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/180 (52%), Positives = 118/180 (65%), Gaps = 27/180 (15%)

Query: 35  GDSGLVLSTDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSH 94
           GDSGLVL+TD KPRL+WT +LHERF++AV QLGG DKATPKT+M++MG+ GLTLYHLKSH
Sbjct: 23  GDSGLVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKSH 82

Query: 95  LQKYRLSKNLHGQANIGNNKIAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIG 154
           LQK+RL K  H +         H+   G +  +  +         M   N   MN     
Sbjct: 83  LQKFRLGKQPHKEYG------DHSTKEGSRASAMDIQRNVASSSGMMSRN---MN----- 128

Query: 155 PQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNL 214
                        +MQ+EVQRRLHEQLEVQRHLQLRIEAQGKY+Q++LE+A +TL  +N+
Sbjct: 129 -------------EMQMEVQRRLHEQLEVQRHLQLRIEAQGKYMQSILERACQTLAGENM 175




Transcription factor required for phloem identity. Has a dual role both in promoting phloem differentiation and in repressing xylem differentiation during vascular development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8GUN5|PHL1_ARATH Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1
>sp|Q700D9|MYBF_ARATH Putative Myb family transcription factor At1g14600 OS=Arabidopsis thaliana GN=At1g14600 PE=2 SV=2
>sp|Q9C616|KAN2_ARATH Probable transcription factor KAN2 OS=Arabidopsis thaliana GN=KAN2 PE=2 SV=1
>sp|Q93WJ9|KAN1_ARATH Transcription repressor KAN1 OS=Arabidopsis thaliana GN=KAN1 PE=1 SV=1
>sp|Q0J235|ROLL9_ORYSJ Probable transcription factor RL9 OS=Oryza sativa subsp. japonica GN=RL9 PE=2 SV=2
>sp|Q9FJV5|KAN4_ARATH Probable transcription factor KAN4 OS=Arabidopsis thaliana GN=KAN4 PE=1 SV=1
>sp|Q9ZWJ9|ARR2_ARATH Two-component response regulator ARR2 OS=Arabidopsis thaliana GN=ARR2 PE=1 SV=1
>sp|Q941I2|KAN3_ARATH Probable transcription factor KAN3 OS=Arabidopsis thaliana GN=KAN3 PE=2 SV=1
>sp|Q940D0|ARR1_ARATH Two-component response regulator ARR1 OS=Arabidopsis thaliana GN=ARR1 PE=1 SV=2
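
The Q cover, H cover and Identity columns of the hit table appear to follow from the BLAST alignment statistics: coverage as the ungapped aligned columns divided by the query or hit length, and identity as identical positions divided by the alignment length, each truncated to three decimals. This relation is an inference from the numbers shown in this report, not a documented definition, and the function names below are hypothetical. The sketch checks it against the Q9SAK5 alignment (Identities = 94/180, Gaps = 27/180, query 416 aa, hit 358 aa).

```python
def trunc3(numerator, denominator):
    """Truncate numerator/denominator to three decimals (no rounding)."""
    return (numerator * 1000 // denominator) / 1000


def table_columns(identities, aln_len, gaps, query_len, hit_len):
    """Assumed relation between alignment statistics and table columns."""
    ungapped = aln_len - gaps  # alignment columns that are not gaps
    return {
        "q_cover": trunc3(ungapped, query_len),
        "h_cover": trunc3(ungapped, hit_len),
        "identity": trunc3(identities, aln_len),
    }


# Q9SAK5 (APL): values read from the alignment header above
apl = table_columns(identities=94, aln_len=180, gaps=27,
                    query_len=416, hit_len=358)
print(apl)  # {'q_cover': 0.367, 'h_cover': 0.427, 'identity': 0.522}
```

The same assumed relation also reproduces the first row of the non-redundant database table (322/443 identities, 43 gaps, hit length 427), which suggests the values are truncated rather than rounded.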

Close Homologs in the Non-Redundant Database Detected by BLAST

GI         Length  Definition                                Q cover  H cover  Identity  E-value
Query      416
224125896  427     predicted protein [Populus trichocarpa]   0.961    0.936    0.726     1e-174
255568946  419     transcription factor, putative [Ricinus   0.961    0.954    0.696     1e-165
224144770  421     predicted protein [Populus trichocarpa]   0.961    0.950    0.695     1e-164
225441650  418     PREDICTED: uncharacterized protein LOC10  0.947    0.942    0.677     1e-156
225441648  412     PREDICTED: uncharacterized protein LOC10  0.932    0.941    0.663     1e-150
356543598  420     PREDICTED: uncharacterized protein LOC10  0.951    0.942    0.616     1e-140
356547128  400     PREDICTED: uncharacterized protein LOC10  0.891    0.927    0.619     1e-135
356542754  405     PREDICTED: uncharacterized protein LOC10  0.947    0.972    0.601     1e-130
356542756  399     PREDICTED: uncharacterized protein LOC10  0.932    0.972    0.590     1e-126
186509740  394     myb family transcription factor [Arabido  0.918    0.969    0.586     1e-125
>gi|224125896|ref|XP_002319702.1| predicted protein [Populus trichocarpa] gi|222858078|gb|EEE95625.1| predicted protein [Populus trichocarpa]
 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 322/443 (72%), Positives = 349/443 (78%), Gaps = 43/443 (9%)

Query: 1   MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60
           MYHHHQ+QGKS+HSSSRM IP ERHLFLQGG+GPGDSGLVLSTDAKPRLKWTPDLHERFI
Sbjct: 1   MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 61  EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGI 120
           EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIG++KI     
Sbjct: 61  EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGSSKI----- 115

Query: 121 GGMKFKSSGVGPVTVPGERMPEANATHMN--NLSIGPQPNK-----SLHISETIQMQIEV 173
                     G V V G+RMPEANATH+N  NLSIG QPNK     SLH SE +QMQIEV
Sbjct: 116 ----------GTVAVVGDRMPEANATHININNLSIGSQPNKILKSRSLHFSEALQMQIEV 165

Query: 174 QRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKV 233
           QRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGT GLEAAKVQLSELVSKV
Sbjct: 166 QRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKV 225

Query: 234 STQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYH 293
           STQCLNSTFS+L +LQG CPQQ    QP DCSMDSCLTSCEGSQK+QEIHN G+ LRP +
Sbjct: 226 STQCLNSTFSELNDLQGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIHNIGMGLRPCN 285

Query: 294 GTPTLEPKEIVEEPMLQQTELKWRKDLKESK-FLSSIG---------KDRGPGELSIGSG 343
               LEPKEI EE  LQQTELKW + L+++K FL+SIG          +R   +LSIG G
Sbjct: 286 SNALLEPKEIAEEHALQQTELKWGEYLRDNKMFLTSIGHETERRTFSAERSCSDLSIGVG 345

Query: 344 ----------SFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLN 393
                     SF  GRFK  +ED+ FQDQTNK+ E  K E+E + P YRL  F+TKLDLN
Sbjct: 346 LQGEKGNINSSFAEGRFKGMSEDDSFQDQTNKRAESVKYEDEKMSPGYRLSYFTTKLDLN 405

Query: 394 AHDHENDVASGCKQFDLNGFSWN 416
           +HD E D AS CKQ DLNGFSWN
Sbjct: 406 SHD-EIDAASSCKQLDLNGFSWN 427




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255568946|ref|XP_002525443.1| transcription factor, putative [Ricinus communis] gi|223535256|gb|EEF36933.1| transcription factor, putative [Ricinus communis]
>gi|224144770|ref|XP_002325408.1| predicted protein [Populus trichocarpa] gi|222862283|gb|EEE99789.1| predicted protein [Populus trichocarpa]
>gi|225441650|ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248614 isoform 1 [Vitis vinifera]
>gi|225441648|ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248614 isoform 2 [Vitis vinifera]
>gi|356543598|ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810396 [Glycine max]
>gi|356547128|ref|XP_003541969.1| PREDICTED: uncharacterized protein LOC100799064 [Glycine max]
>gi|356542754|ref|XP_003539830.1| PREDICTED: uncharacterized protein LOC100805237 isoform 1 [Glycine max]
>gi|356542756|ref|XP_003539831.1| PREDICTED: uncharacterized protein LOC100805237 isoform 2 [Glycine max]
>gi|186509740|ref|NP_001118567.1| myb family transcription factor [Arabidopsis thaliana] gi|6223653|gb|AAF05867.1|AC011698_18 transfactor-like [Arabidopsis thaliana] gi|332640508|gb|AEE74029.1| myb family transcription factor [Arabidopsis thaliana]

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology Terms Detected by BLAST

ID                  Length  Definition                      Q cover  H cover  Identity  E-value
Query               416
TAIR|locus:2095933  394     MYR2 [Arabidopsis thaliana (ta  0.923    0.974    0.598     1.6e-116
TAIR|locus:2172394  402     MYR1 "myb-related protein 1" [  0.932    0.965    0.574     1.1e-112
TAIR|locus:2026739  337     AT1G69580 [Arabidopsis thalian  0.562    0.694    0.478     8.3e-51
TAIR|locus:2093726  298     AT3G24120 [Arabidopsis thalian  0.550    0.768    0.439     1.7e-41
TAIR|locus:2119425  295     UNE16 "unfertilized embryo sac  0.591    0.833    0.417     4.6e-41
TAIR|locus:2157146  264     AT5G45580 [Arabidopsis thalian  0.415    0.655    0.444     2.2e-34
TAIR|locus:2206415  358     APL "ALTERED PHLOEM DEVELOPMEN  0.425    0.494    0.402     5.6e-29
TAIR|locus:2170199  375     AT5G06800 [Arabidopsis thalian  0.137    0.152    0.649     1.2e-27
TAIR|locus:2038957  397     AT2G20400 "AT2G20400" [Arabido  0.139    0.146    0.637     1e-26
TAIR|locus:2087740  235     AT3G12730 [Arabidopsis thalian  0.382    0.676    0.380     7.6e-25
TAIR|locus:2095933 MYR2 [Arabidopsis thaliana (taxid:3702)]
 Score = 1148 (409.2 bits), Expect = 1.6e-116, P = 1.6e-116
 Identities = 255/426 (59%), Positives = 302/426 (70%)

Query:     1 MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60
             MY+ +Q+QGK++ SSSRM I +ERH FL+G S PGDSGL+LSTDAKPRLKWTPDLHERFI
Sbjct:     1 MYYQNQHQGKNILSSSRMHITSERHPFLRGNS-PGDSGLILSTDAKPRLKWTPDLHERFI 59

Query:    61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIAHTGI 120
             EAVNQLGGADKATPKT+MK+MGIPGLTLYHLKSHLQKYRLSKNL+GQAN   NKI     
Sbjct:    60 EAVNQLGGADKATPKTIMKVMGIPGLTLYHLKSHLQKYRLSKNLNGQANNSFNKI----- 114

Query:   121 GGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQ 180
                       G +T+  E+ P+A+     NLSIGPQPNK+  I E +QMQIEVQRRLHEQ
Sbjct:   115 ----------GIMTMMEEKTPDADEIQSENLSIGPQPNKNSPIGEALQMQIEVQRRLHEQ 164

Query:   181 LEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNS 240
             LEVQRHLQLRIEAQGKYLQ+VLEKAQETLGRQNLG AG+EAAKVQLSELVSKVS +  NS
Sbjct:   165 LEVQRHLQLRIEAQGKYLQSVLEKAQETLGRQNLGAAGIEAAKVQLSELVSKVSAEYPNS 224

Query:   241 TFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEGSQKDQE-IHNGGVRLRPYHGTPTLE 299
             +F + KELQ  C QQ Q N P DCS++SCLTS EG+QK+ + + N  + LR Y G  T E
Sbjct:   225 SFLEPKELQNLCSQQMQTNYPPDCSLESCLTSSEGTQKNSKMLENNRLGLRTYIGDSTSE 284

Query:   300 PKEIVEEPMLQQTELKWRKDLKESKFLSSIGKD---------RGPGELSIGSGSFPAGRF 350
              KEI+EEP+ Q+ EL W + L+ + +LS++  +         R PG LSIG G       
Sbjct:   285 QKEIMEEPLFQRMELTWTEGLRGNPYLSTMVSEAEQRISYSERSPGRLSIGVGLH----- 339

Query:   351 KASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDL 410
                ++ +H   Q N   E  KLE  N     +    +T+LDLN H  EN   +  KQFDL
Sbjct:   340 --GHKSQH--QQGNN--EDHKLETRNR----KGMDSTTELDLNTHV-ENYCTTRTKQFDL 388

Query:   411 NGFSWN 416
             NGFSWN
Sbjct:   389 NGFSWN 394




GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
TAIR|locus:2172394 MYR1 "myb-related protein 1" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2026739 AT1G69580 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2093726 AT3G24120 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2119425 UNE16 "unfertilized embryo sac 16" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2157146 AT5G45580 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2206415 APL "ALTERED PHLOEM DEVELOPMENT" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2170199 AT5G06800 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2038957 AT2G20400 "AT2G20400" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2087740 AT3G12730 [Arabidopsis thaliana (taxid:3702)]

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

No confident hit for EC number transfer was detected in SWISS-PROT by BLAST.

EC Number Prediction by EzyPred Server

Failed to connect to the EzyPred server.

EC Number Prediction by EFICAz Software

Prediction Level  EC number  Confidence of Prediction
3rd Layer         3.4.24     0.691
3rd Layer         3.4.24.57  LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Your Input:
eugene3.00130592
hypothetical protein (427 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

ID         Length  Definition                                          E-value
Query      416
pfam14379  51      pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor,  1e-22
TIGR01557  57      TIGR01557, myb_SHAQKYF, myb-like DNA-binding domai  5e-21
PLN03162   526     PLN03162, PLN03162, golden-2 like transcription fa  2e-10
pfam00249  47      pfam00249, Myb_DNA-binding, Myb-like DNA-binding d  6e-08
>gnl|CDD|206547 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, LHEQLE motif
 Score = 90.0 bits (224), Expect = 1e-22
 Identities = 37/51 (72%), Positives = 46/51 (90%)

Query: 160 SLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLG 210
            + I+E +++Q+EVQRRLHEQLEVQRHLQ+RIEAQGKYLQ +LEKAQ+TL 
Sbjct: 1   GMQITEALRLQMEVQRRLHEQLEVQRHLQIRIEAQGKYLQEILEKAQKTLS 51


This family is found towards the C-terminus of Myb-CC type transcription factors, and carries a highly conserved LHEQLE sequence motif. Length = 51

>gnl|CDD|130620 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domain, SHAQKYF class
>gnl|CDD|178707 PLN03162, PLN03162, golden-2 like transcription factor; Provisional
>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain

Conserved Domains Detected by HHsearch

ID          Length  Definition                                          Probability
Query       416
PF14379     51      Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot  99.92
PLN03162    526     golden-2 like transcription factor; Provisional     99.88
TIGR01557   57      myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF c  99.84
PF00249     48      Myb_DNA-binding: Myb-like DNA-binding domain; Inte  97.14
smart00426  68      TEA TEA domain.                                     90.11
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif
Probab=99.92  E-value=3.7e-25  Score=167.96  Aligned_cols=51  Identities=75%  Similarity=1.086  Sum_probs=48.9

Q ss_pred             CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhc
Q 014887          160 SLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLG  210 (416)
Q Consensus       160 ~~qI~EALqmQmEVQrrLHEQLEVQRhLQLRIEAQGKYLQsiLEkAQe~La  210 (416)
                      +++|+|||++||||||||||||||||+||+|||||||||++|||+|+++++
T Consensus         1 g~~i~EALr~QmEvQrrLhEQLEvQr~Lqlrieaqgkyl~~ilek~~~~~s   51 (51)
T PF14379_consen    1 GMQITEALRMQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQKALS   51 (51)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC
Confidence            478999999999999999999999999999999999999999999999864



>PLN03162 golden-2 like transcription factor; Provisional
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins
>smart00426 TEA TEA domain

Homologous Structure Templates

Structure Templates Detected by BLAST

ID      Length  Definition                                           E-value
Query   416
1irz_A  64      Solution Structure Of Arr10-B Belonging To The Garp  3e-08
>pdb|1IRZ|A Chain A, Solution Structure Of Arr10-B Belonging To The Garp Family Of Plant Myb-Related Dna Binding Motifs Of The Arabidopsis Response Regulators Length = 64

Iteration: 1

 Score = 56.2 bits (134), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 25/60 (41%), Positives = 41/60 (68%), Gaps = 1/60 (1%)

Query: 42  STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLS 101
           +   KPR+ WT +LH +F+ AV+ LG  ++A PK ++ LM +  LT  ++ SHLQK+R++
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Structure Templates Detected by RPS-BLAST

ID      Length  Definition                                          E-value
Query   416
1irz_A  64      ARR10-B; helix-turn-helix, DNA binding protein; NM  1e-24
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Length = 64
 Score = 95.0 bits (236), Expect = 1e-24
 Identities = 25/61 (40%), Positives = 41/61 (67%), Gaps = 1/61 (1%)

Query: 42  STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLS 101
           +   KPR+ WT +LH +F+ AV+ LG  ++A PK ++ LM +  LT  ++ SHLQK+R++
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 102 K 102
            
Sbjct: 60  L 60


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query416
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 99.95
2yus_A79 SWI/SNF-related matrix-associated actin- dependent 88.78
2yum_A75 ZZZ3 protein, zinc finger ZZ-type-containing prote 86.68
2cu7_A72 KIAA1915 protein; nuclear protein, SANT domain, DN 85.85
2hzd_A82 Transcriptional enhancer factor TEF-1; DNA-binding 84.05
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Back     alignment and structure
Probab=99.95  E-value=1e-28  Score=193.50  Aligned_cols=61  Identities=41%  Similarity=0.810  Sum_probs=57.8

Q ss_pred             cCCCCCCccCCHHHHHHHHHHHHHhCCCCCCChHHHHhhhCCCCCCHHHHHHHhhhhhhccc
Q 014887           42 STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKN  103 (416)
Q Consensus        42 s~~~KpRlrWT~eLH~rFV~AV~~LGG~~kAtPK~IL~lM~v~gLT~~hVkSHLQKYRl~k~  103 (416)
                      ++.+|||++||+|||++||+||++|| .++||||.||++|+|+|||++||||||||||+...
T Consensus         1 ~~~~k~r~~WT~elH~~Fv~Av~~LG-~~~AtPk~Il~~M~v~gLT~~~VkSHLQKYR~~l~   61 (64)
T 1irz_A            1 TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVALK   61 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHC-TTTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHH
T ss_pred             CCCCCCCCcCCHHHHHHHHHHHHHhC-CCCCCcHHHHHHcCCCCCCHHHHHHHHHHHHHHHH
Confidence            46789999999999999999999998 79999999999999999999999999999999753



>2yus_A SWI/SNF-related matrix-associated actin- dependent regulator of chromatin subfamily...; SWI/SNF complex 155 kDa subunit, BRG1-associated factor 155; NMR {Homo sapiens}
>2yum_A ZZZ3 protein, zinc finger ZZ-type-containing protein 3; transcription, structural genomics, NPPSFA; NMR {Homo sapiens}
>2cu7_A KIAA1915 protein; nuclear protein, SANT domain, DNA binding, regulation of transcription, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3
>2hzd_A Transcriptional enhancer factor TEF-1; DNA-binding, helix-turn-helix, gene regulation; NMR {Homo sapiens}

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

ID       Length  Definition                                          E-value
Query    416
d1irza_  64      a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis th  9e-23
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 64

class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 88.8 bits (220), Expect = 9e-23
 Identities = 25/61 (40%), Positives = 41/61 (67%), Gaps = 1/61 (1%)

Query: 42  STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLS 101
           +   KPR+ WT +LH +F+ AV+ LG  ++A PK ++ LM +  LT  ++ SHLQK+R++
Sbjct: 1   TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 102 K 102
            
Sbjct: 60  L 60


Homologous Domains Detected by HHsearch

ID       Length  Definition                                          Probability
Query    416
d1irza_  64      Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId  99.92
d2cu7a1  65      MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 960  88.28
d1x41a1  47      Transcriptional adaptor 2-like, TADA2L, isoform b   82.13
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]}
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.92  E-value=3.9e-26  Score=176.84  Aligned_cols=61  Identities=41%  Similarity=0.791  Sum_probs=57.2

Q ss_pred             cCCCCCCccCCHHHHHHHHHHHHHhCCCCCCChHHHHhhhCCCCCCHHHHHHHhhhhhhccc
Q 014887           42 STDAKPRLKWTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKN  103 (416)
Q Consensus        42 s~~~KpRlrWT~eLH~rFV~AV~~LGG~~kAtPK~IL~lM~v~gLT~~hVkSHLQKYRl~k~  103 (416)
                      |..+|||++||+|||++||+||++||+ ++||||.|+++|+|+|||+.||+|||||||+...
T Consensus         1 s~~kk~R~~WT~elH~~Fv~Av~~lG~-~~atpk~I~~~m~v~~lT~~qV~SHlQKYrl~l~   61 (64)
T d1irza_           1 TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVALK   61 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHCT-TTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHH
T ss_pred             CCCCCCCCCCCHHHHHHHHHHHHHhCc-cccchHHHHHHcCCCCCCHHHHHHHHHHHHHHHH
Confidence            467899999999999999999999975 8999999999999999999999999999999753



>d2cu7a1 a.4.1.3 (A:8-72) MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 9606]}
>d1x41a1 a.4.1.1 (A:8-54) Transcriptional adaptor 2-like, TADA2L, isoform b {Human (Homo sapiens) [TaxId: 9606]}