Citrus Sinensis ID: 022117


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300--
MIQREEIISRTSYHYHQNHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGREMGDSPKDASYLIESSANDNSLPNFPASDMNDYEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGTKMPRSYILDSLGFYTSQSVEVAEVQCPEELLPNLHPQRADCSTESCLTSHESSGGLTMEGSPVGGKNSILSLDTTTGSLIWGEVKMGAQEISLARENPSGISGYGI
ccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHccccccccHHHHHHHccccccHHHHHHHHHHHHHcccccccccccccccHHHHHcccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHcccccccccccccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccc
ccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccHHHHHHcccccEEEEEHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHccccccccccccccccccccccccccccHccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEHHHccccccEEEccccccccccccc
MIQREEIISRTSYHyhqnhgggiepcvvmtsdpkprlrwtADLHDRFVDAvtqlggpskatpKAIMRTMNVKGLTLFHLKSHLQKYRlgkqsgremgdspkdASYLIEssandnslpnfpasdmndyEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDqilggavvdadsekckglgtkmprsyildslgfytsqsvevaevqcpeellpnlhpqradcstescltshessggltmegspvggknsilsldtttgslIWGEVKMGAQEISLarenpsgisgygi
MIQREEIISRTSYHYHQNHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVtqlggpskatpkAIMRTMNVKGLTLFHLKSHLQKYRlgkqsgremgdSPKDASYLIESSandnslpnfpASDMNDYEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGTKMPRSYILDSLGFYTSQSVEVAEVQCPEELLPNLHPQRADCSTESCLTSHESSGGLTMEGSPVGGKNSILSLDTTTGSLIWGEVKMGAQeislarenpsgisgygi
MIQREEIISRTSYHYHQNHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGREMGDSPKDASYLIESSANDNSLPNFPASDMNDYEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGTKMPRSYILDSLGFYTSQSVEVAEVQCPEELLPNLHPQRADCSTESCLTSHESSGGLTMEGSPVGGKNSILSLDTTTGSLIWGEVKMGAQEISLARENPSGISGYGI
**********TSYHYHQNHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGP**ATPKAIMRTMNVKGLTLFHLKSHLQKY*******************************************************KLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGTKMPRSYILDSLGFYTSQSVEVAEVQCPEELL*************************************ILSLDTTTGSLIWGEVKMG*******************
*************************************RWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKY****************************************************************KHLQIRQNAQQ**************************************************************************************************************************************G**GY**
MIQREEIISRTSYHYHQNHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGK***********DASYLIESSANDNSLPNFPASDMNDYEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGTKMPRSYILDSLGFYTSQSVEVAEVQCPEELLPNLHPQ**********************GSPVGGKNSILSLDTTTGSLIWGEVKMGAQEISLARENPSGISGYGI
**********************************PRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQS*********************************DYEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQ**********************************************************************************************T*SLIWGEVKMGAQEISLARENPSGISGYG*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MIQREEIISRTSYHYHQNHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQSGREMGDSPKDASYLIESSANDNSLPNFPASDMNDYEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGTKMPRSYILDSLGFYTSQSVEVAEVQCPEELLPNLHPQRADCSTESCLTSHESSGGLTMEGSPVGGKNSILSLDTTTGSLIWGEVKMGAQEISLARENPSGISGYGI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query302 2.2.26 [Sep-21-2011]
Q9SAK5358 Myb family transcription no no 0.473 0.399 0.569 4e-39
Q8GUN5413 Protein PHR1-LIKE 1 OS=Ar no no 0.493 0.360 0.396 4e-25
Q700D9255 Putative Myb family trans no no 0.211 0.250 0.582 7e-15
Q9C616388 Probable transcription fa no no 0.334 0.260 0.420 8e-15
Q93WJ9403 Transcription repressor K no no 0.215 0.161 0.538 2e-13
Q9FJV5276 Probable transcription fa no no 0.261 0.286 0.487 3e-13
Q0J235532 Probable transcription fa no no 0.225 0.127 0.536 7e-13
Q941I2322 Probable transcription fa no no 0.215 0.201 0.507 1e-11
Q940D0 690 Two-component response re no no 0.360 0.157 0.385 3e-10
Q9ZWJ9 664 Two-component response re no no 0.178 0.081 0.563 8e-10
>sp|Q9SAK5|APL_ARATH Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL PE=2 SV=2 Back     alignment and function desciption
 Score =  162 bits (409), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 86/151 (56%), Positives = 109/151 (72%), Gaps = 8/151 (5%)

Query: 27  VVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKY 86
           +V+T+DPKPRLRWT +LH+RFVDAV QLGGP KATPK IMR M VKGLTL+HLKSHLQK+
Sbjct: 27  LVLTTDPKPRLRWTVELHERFVDAVAQLGGPDKATPKTIMRVMGVKGLTLYHLKSHLQKF 86

Query: 87  RLGKQSGREMGD-SPKDASYLIESSANDNSLPNFPASDMNDYEVKEALRVQMEVQSKLHL 145
           RLGKQ  +E GD S K+ S    +SA D       +S M    + E   +QMEVQ +LH 
Sbjct: 87  RLGKQPHKEYGDHSTKEGS---RASAMDIQRNVASSSGMMSRNMNE---MQMEVQRRLHE 140

Query: 146 QVEAEKHLQIRQNAQQRYL-AMLERACKILT 175
           Q+E ++HLQ+R  AQ +Y+ ++LERAC+ L 
Sbjct: 141 QLEVQRHLQLRIEAQGKYMQSILERACQTLA 171




Transcription factor required for phloem identity. Has a dual role both in promoting phloem differentiation and in repressing xylem differentiation during vascular development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8GUN5|PHL1_ARATH Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1 Back     alignment and function description
>sp|Q700D9|MYBF_ARATH Putative Myb family transcription factor At1g14600 OS=Arabidopsis thaliana GN=At1g14600 PE=2 SV=2 Back     alignment and function description
>sp|Q9C616|KAN2_ARATH Probable transcription factor KAN2 OS=Arabidopsis thaliana GN=KAN2 PE=2 SV=1 Back     alignment and function description
>sp|Q93WJ9|KAN1_ARATH Transcription repressor KAN1 OS=Arabidopsis thaliana GN=KAN1 PE=1 SV=1 Back     alignment and function description
>sp|Q9FJV5|KAN4_ARATH Probable transcription factor KAN4 OS=Arabidopsis thaliana GN=KAN4 PE=1 SV=1 Back     alignment and function description
>sp|Q0J235|ROLL9_ORYSJ Probable transcription factor RL9 OS=Oryza sativa subsp. japonica GN=RL9 PE=2 SV=2 Back     alignment and function description
>sp|Q941I2|KAN3_ARATH Probable transcription factor KAN3 OS=Arabidopsis thaliana GN=KAN3 PE=2 SV=1 Back     alignment and function description
>sp|Q940D0|ARR1_ARATH Two-component response regulator ARR1 OS=Arabidopsis thaliana GN=ARR1 PE=1 SV=2 Back     alignment and function description
>sp|Q9ZWJ9|ARR2_ARATH Two-component response regulator ARR2 OS=Arabidopsis thaliana GN=ARR2 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query302
359806178313 uncharacterized protein LOC100786397 [Gl 0.943 0.910 0.670 1e-108
363806688314 uncharacterized protein LOC100782369 [Gl 0.943 0.907 0.651 1e-107
357448183313 Myb family transcription factor [Medicag 0.937 0.904 0.666 1e-106
388503044313 unknown [Medicago truncatula] 0.937 0.904 0.659 1e-105
147840890350 hypothetical protein VITISV_035497 [Viti 0.937 0.808 0.598 1e-104
224124778298 predicted protein [Populus trichocarpa] 0.903 0.916 0.689 1e-103
255568526313 transcription factor, putative [Ricinus 0.923 0.891 0.660 1e-101
357448185299 Myb family transcription factor [Medicag 0.900 0.909 0.649 1e-101
449434424 572 PREDICTED: uncharacterized protein LOC10 0.907 0.479 0.658 1e-100
388514523292 unknown [Lotus japonicus] 0.890 0.921 0.627 7e-98
>gi|359806178|ref|NP_001241456.1| uncharacterized protein LOC100786397 [Glycine max] gi|255635309|gb|ACU18008.1| unknown [Glycine max] Back     alignment and taxonomy information
 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 193/288 (67%), Positives = 231/288 (80%), Gaps = 3/288 (1%)

Query: 18  NHGGGIEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLF 77
           +H    +PC+V+T+DPKPRLRWT DLH+RFVDAVTQLGG SKATPKAIMRTMNVKGLTLF
Sbjct: 26  SHAHKGDPCLVLTADPKPRLRWTQDLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLF 85

Query: 78  HLKSHLQKYRLGKQSGREMGDSPKDASYLIESSANDNSLPNFPASDMND-YEVKEALRVQ 136
           HLKSHLQKYRLGKQSG+++G+  KD SYL+ES   DNS P  P SD N+ YE+KEALR Q
Sbjct: 86  HLKSHLQKYRLGKQSGKDVGEGCKDGSYLLESPGADNSSPKLPTSDTNEGYEIKEALRAQ 145

Query: 137 MEVQSKLHLQVEAEKHLQIRQNAQQRYLAMLERACKILTDQILGGAVVDADSEKCKGLGT 196
           MEVQSKLHLQVEAEKHLQIRQ+A++RY+AMLERACK+L DQ +G  V+D DS+K +G+G+
Sbjct: 146 MEVQSKLHLQVEAEKHLQIRQDAERRYMAMLERACKMLADQFIGATVIDTDSQKFQGIGS 205

Query: 197 KMPRSYILDSLGFYTSQSVEVAEVQCPEELLP-NLHPQRADCSTESCLTSHESSGGLTME 255
           K PR  ++D LGFY+  S EVA V  PEE +P +L PQRADCSTESCLTSHESSGGL +E
Sbjct: 206 KAPRGTLVDPLGFYSMPSTEVAGVNVPEEEIPLSLPPQRADCSTESCLTSHESSGGLALE 265

Query: 256 GSPVGGKNSILSLDTTTGSLIWGEVKMGAQEISLAREN-PSGISGYGI 302
           GSP  GK  +L +D+    LIW E KM  Q I++A+ N P GI+ YG+
Sbjct: 266 GSPGEGKRRMLGMDSMAAPLIWSEAKMRTQAINVAQGNLPQGITRYGM 313




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|363806688|ref|NP_001242009.1| uncharacterized protein LOC100782369 [Glycine max] gi|255636439|gb|ACU18558.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357448183|ref|XP_003594367.1| Myb family transcription factor [Medicago truncatula] gi|124360874|gb|ABN08846.1| Homeodomain-related [Medicago truncatula] gi|355483415|gb|AES64618.1| Myb family transcription factor [Medicago truncatula] Back     alignment and taxonomy information
>gi|388503044|gb|AFK39588.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|147840890|emb|CAN66504.1| hypothetical protein VITISV_035497 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224124778|ref|XP_002329946.1| predicted protein [Populus trichocarpa] gi|222871968|gb|EEF09099.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255568526|ref|XP_002525237.1| transcription factor, putative [Ricinus communis] gi|223535534|gb|EEF37203.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|357448185|ref|XP_003594368.1| Myb family transcription factor [Medicago truncatula] gi|355483416|gb|AES64619.1| Myb family transcription factor [Medicago truncatula] Back     alignment and taxonomy information
>gi|449434424|ref|XP_004134996.1| PREDICTED: uncharacterized protein LOC101215173 [Cucumis sativus] Back     alignment and taxonomy information
>gi|388514523|gb|AFK45323.1| unknown [Lotus japonicus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query302
TAIR|locus:2119425295 UNE16 "unfertilized embryo sac 0.718 0.735 0.487 6.9e-47
TAIR|locus:2093726298 AT3G24120 [Arabidopsis thalian 0.741 0.751 0.468 3e-46
TAIR|locus:2206415358 APL "ALTERED PHLOEM DEVELOPMEN 0.509 0.430 0.543 1.6e-38
TAIR|locus:2095933394 MYR2 [Arabidopsis thaliana (ta 0.701 0.538 0.410 3e-37
TAIR|locus:2172394402 MYR1 "myb-related protein 1" [ 0.768 0.577 0.390 2.7e-36
TAIR|locus:2026739337 AT1G69580 [Arabidopsis thalian 0.731 0.655 0.382 2e-33
TAIR|locus:2157146264 AT5G45580 [Arabidopsis thalian 0.486 0.556 0.468 4.7e-32
TAIR|locus:2087740235 AT3G12730 [Arabidopsis thalian 0.456 0.587 0.496 3e-30
TAIR|locus:2045761286 AT2G01060 [Arabidopsis thalian 0.490 0.517 0.449 1.5e-28
TAIR|locus:2148720413 PHL1 "AT5G29000" [Arabidopsis 0.506 0.370 0.4 1.4e-25
TAIR|locus:2119425 UNE16 "unfertilized embryo sac 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 114/234 (48%), Positives = 156/234 (66%)

Query:    23 IEPCVVMTSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSH 82
             I+ C+V+T+DPKPRLRWT++LH+RFVDAVTQLGGP KATPK IMRTM VKGLTL+HLKSH
Sbjct:    26 IDACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSH 85

Query:    83 LQKYRLGKQSGREMGDSPKDASYLIES----SANDNSLPNFPASDMND-YEVKEALRVQM 137
             LQK+RLG+QS +E  D+ KD S + ES    S++ +SL    A + N+ Y+V EALR QM
Sbjct:    86 LQKFRLGRQSCKESIDNSKDVSCVAESQDTGSSSTSSL-RLAAQEQNESYQVTEALRAQM 144

Query:   138 EVQSKLHLQVE---AEKHLQIRQNAQQRYL-AMLERACKILTDQILGGAVVDADSEKCKG 193
             EVQ +LH Q+E    ++ LQ+R  AQ +YL ++LE+ACK + +Q +  A ++A  E+   
Sbjct:   145 EVQRRLHEQLEYTQVQRRLQLRIEAQGKYLQSILEKACKAIEEQAVAFAGLEAAREELSE 204

Query:   194 LGTKMPRSYILDSLGFYTSQSVEVAEVQCPE--ELLPNLHPQRADCSTESCLTS 245
             L  K   S      G  T+ + +  ++  P   EL   +   + +CS ES LTS
Sbjct:   205 LAIKA--SITNGCQG--TTSTFDTTKMMIPSLSELAVAIE-HKNNCSAESSLTS 253




GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0009567 "double fertilization forming a zygote and endosperm" evidence=IMP
TAIR|locus:2093726 AT3G24120 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2206415 APL "ALTERED PHLOEM DEVELOPMENT" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2095933 MYR2 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2172394 MYR1 "myb-related protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026739 AT1G69580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2157146 AT5G45580 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2087740 AT3G12730 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2045761 AT2G01060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2148720 PHL1 "AT5G29000" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00011845001
SubName- Full=Chromosome undetermined scaffold_342, whole genome shotgun sequence; (308 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query302
TIGR0155757 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domai 2e-23
pfam1437951 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, 8e-16
PLN03162526 PLN03162, PLN03162, golden-2 like transcription fa 6e-08
pfam0024947 pfam00249, Myb_DNA-binding, Myb-like DNA-binding d 7e-06
>gnl|CDD|130620 TIGR01557, myb_SHAQKYF, myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
 Score = 90.5 bits (225), Expect = 2e-23
 Identities = 35/57 (61%), Positives = 41/57 (71%)

Query: 34 KPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGK 90
          KPR+ WT DLHDRF+ AV +LGGP  ATPK I+  M V GLT   + SHLQKYRL +
Sbjct: 1  KPRVVWTEDLHDRFLQAVQKLGGPDWATPKRILELMVVDGLTRDQVASHLQKYRLKQ 57


This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif. Length = 57

>gnl|CDD|206547 pfam14379, Myb_CC_LHEQLE, MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
>gnl|CDD|178707 PLN03162, PLN03162, golden-2 like transcription factor; Provisional Back     alignment and domain information
>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 302
PF1437951 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot 99.91
PLN03162526 golden-2 like transcription factor; Provisional 99.89
TIGR0155757 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF c 99.86
PF0024948 Myb_DNA-binding: Myb-like DNA-binding domain; Inte 97.28
smart0042668 TEA TEA domain. 91.93
PF1437951 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE mot 84.08
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information
Probab=99.91  E-value=8.9e-25  Score=159.66  Aligned_cols=49  Identities=53%  Similarity=0.805  Sum_probs=46.9

Q ss_pred             hhHHHHHHHHHHHHHhhHHHHHHHHHHhHhHHHHHHHHH-HHHHHHHHhh
Q 022117          127 YEVKEALRVQMEVQSKLHLQVEAEKHLQIRQNAQQRYLA-MLERACKILT  175 (302)
Q Consensus       127 ~~i~eaLr~QmEvQrrLhEQLEVQRhLQLRIEAQGKYLq-iLEKAqe~La  175 (302)
                      ++|+||||+||||||||||||||||+||+|||||||||+ |||||+++++
T Consensus         2 ~~i~EALr~QmEvQrrLhEQLEvQr~Lqlrieaqgkyl~~ilek~~~~~s   51 (51)
T PF14379_consen    2 MQITEALRMQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEKAQKALS   51 (51)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC
Confidence            579999999999999999999999999999999999998 9999999874



>PLN03162 golden-2 like transcription factor; Provisional Back     alignment and domain information
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins Back     alignment and domain information
>smart00426 TEA TEA domain Back     alignment and domain information
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query302
1irz_A64 Solution Structure Of Arr10-B Belonging To The Garp 6e-09
>pdb|1IRZ|A Chain A, Solution Structure Of Arr10-B Belonging To The Garp Family Of Plant Myb-Related Dna Binding Motifs Of The Arabidopsis Response Regulators Length = 64 Back     alignment and structure

Iteration: 1

Score = 58.2 bits (139), Expect = 6e-09, Method: Composition-based stats. Identities = 28/60 (46%), Positives = 40/60 (66%), Gaps = 1/60 (1%) Query: 30 TSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLG 89 T+ KPR+ WT +LH++F+ AV LG +A PK I+ MNV LT ++ SHLQK+R+ Sbjct: 1 TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query302
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 4e-25
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-06
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Length = 64 Back     alignment and structure
 Score = 94.6 bits (235), Expect = 4e-25
 Identities = 28/62 (45%), Positives = 41/62 (66%), Gaps = 1/62 (1%)

Query: 30 TSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLG 89
          T+  KPR+ WT +LH++F+ AV  LG   +A PK I+  MNV  LT  ++ SHLQK+R+ 
Sbjct: 1  TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVA 59

Query: 90 KQ 91
           +
Sbjct: 60 LK 61


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query302
1irz_A64 ARR10-B; helix-turn-helix, DNA binding protein; NM 99.96
2yum_A75 ZZZ3 protein, zinc finger ZZ-type-containing prote 89.5
2yus_A79 SWI/SNF-related matrix-associated actin- dependent 88.87
2xag_B482 REST corepressor 1; amine oxidase, chromatin regul 88.37
2cu7_A72 KIAA1915 protein; nuclear protein, SANT domain, DN 87.98
1x41_A60 Transcriptional adaptor 2-like, isoform B; transcr 85.59
2hzd_A82 Transcriptional enhancer factor TEF-1; DNA-binding 85.59
2iw5_B235 Protein corest, REST corepressor 1; oxidoreductase 83.97
2cqq_A72 RSGI RUH-037, DNAJ homolog subfamily C member 1; m 80.1
>1irz_A ARR10-B; helix-turn-helix, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: a.4.1.11 Back     alignment and structure
Probab=99.96  E-value=5.9e-30  Score=193.33  Aligned_cols=61  Identities=46%  Similarity=0.771  Sum_probs=58.1

Q ss_pred             ecCCCCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcccCCCCHHHHHHhhhhhhcccc
Q 022117           30 TSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQ   91 (302)
Q Consensus        30 ~~~~K~Rl~WT~eLH~rFv~AV~qLGG~~kAtPK~Il~~M~v~gLT~~hVkSHLQKYRl~~~   91 (302)
                      ++++|+|++||+|||++||+||++|| .|+||||.||++|+|+|||++||||||||||+...
T Consensus         1 ~~~~k~r~~WT~elH~~Fv~Av~~LG-~~~AtPk~Il~~M~v~gLT~~~VkSHLQKYR~~l~   61 (64)
T 1irz_A            1 TAQKKPRVLWTHELHNKFLAAVDHLG-VERAVPKKILDLMNVDKLTRENVASHLQKFRVALK   61 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHC-TTTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHH
T ss_pred             CCCCCCCCcCCHHHHHHHHHHHHHhC-CCCCCcHHHHHHcCCCCCCHHHHHHHHHHHHHHHH
Confidence            47899999999999999999999999 79999999999999999999999999999999754



>2yum_A ZZZ3 protein, zinc finger ZZ-type-containing protein 3; transcription, structural genomics, NPPSFA; NMR {Homo sapiens} Back     alignment and structure
>2yus_A SWI/SNF-related matrix-associated actin- dependent regulator of chromatin subfamily...; SWI/SNF complex 155 kDa subunit, BRG1-associated factor 155; NMR {Homo sapiens} Back     alignment and structure
>2xag_B REST corepressor 1; amine oxidase, chromatin regulator, histone inhibitor binding, methylation, nucleosome core, oxidoreductase; HET: FAD TCF; 3.10A {Homo sapiens} PDB: 2xaf_B* 2xah_B* 2xaj_B* 2xaq_B* 2xas_B* Back     alignment and structure
>2cu7_A KIAA1915 protein; nuclear protein, SANT domain, DNA binding, regulation of transcription, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure
>1x41_A Transcriptional adaptor 2-like, isoform B; transcriptional adaptor protein2, transcriptional activation, MYB domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.1 Back     alignment and structure
>2hzd_A Transcriptional enhancer factor TEF-1; DNA-binding, helix-turn-helix, gene regulation; NMR {Homo sapiens} Back     alignment and structure
>2iw5_B Protein corest, REST corepressor 1; oxidoreductase-transcription regulator complex, oxidoreductase/repressor complex, histone demethylase, FAD; HET: FAD; 2.57A {Homo sapiens} SCOP: a.4.1.3 PDB: 2uxn_B* 2uxx_B* 2y48_B* 2v1d_B* 2x0l_B* Back     alignment and structure
>2cqq_A RSGI RUH-037, DNAJ homolog subfamily C member 1; membrane protein, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 302
d1irza_64 a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis th 4e-23
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 64 Back     information, alignment and structure

class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 88.1 bits (218), Expect = 4e-23
 Identities = 28/59 (47%), Positives = 40/59 (67%), Gaps = 1/59 (1%)

Query: 30 TSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRL 88
          T+  KPR+ WT +LH++F+ AV  LG   +A PK I+  MNV  LT  ++ SHLQK+R+
Sbjct: 1  TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRV 58


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query302
d1irza_64 Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId 99.93
d2cu7a165 MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 960 90.15
d1x41a147 Transcriptional adaptor 2-like, TADA2L, isoform b 88.86
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.93  E-value=4.3e-27  Score=175.73  Aligned_cols=61  Identities=46%  Similarity=0.763  Sum_probs=57.6

Q ss_pred             ecCCCCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcccCCCCHHHHHHhhhhhhcccc
Q 022117           30 TSDPKPRLRWTADLHDRFVDAVTQLGGPSKATPKAIMRTMNVKGLTLFHLKSHLQKYRLGKQ   91 (302)
Q Consensus        30 ~~~~K~Rl~WT~eLH~rFv~AV~qLGG~~kAtPK~Il~~M~v~gLT~~hVkSHLQKYRl~~~   91 (302)
                      +.++|+|++||+|||++||+||++||+ ++||||.|+++|+|+|||+.||+|||||||+...
T Consensus         1 s~~kk~R~~WT~elH~~Fv~Av~~lG~-~~atpk~I~~~m~v~~lT~~qV~SHlQKYrl~l~   61 (64)
T d1irza_           1 TAQKKPRVLWTHELHNKFLAAVDHLGV-ERAVPKKILDLMNVDKLTRENVASHLQKFRVALK   61 (64)
T ss_dssp             CCCCCSSCSSCHHHHHHHHHHHHHHCT-TTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHH
T ss_pred             CCCCCCCCCCCHHHHHHHHHHHHHhCc-cccchHHHHHHcCCCCCCHHHHHHHHHHHHHHHH
Confidence            468999999999999999999999985 8999999999999999999999999999999754



>d2cu7a1 a.4.1.3 (A:8-72) MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x41a1 a.4.1.1 (A:8-54) Transcriptional adaptor 2-like, TADA2L, isoform b {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure