Citrus sinensis ID: 019391


Local Sequence Feature Prediction

Prediction (Method), listed in the same order as the result rows below
Residue Number Marker
Protein Sequence
Secondary Structure (PSIPRED)
Secondary Structure Prediction (SSPRO)
Coil and Loop (DISEMBL)
Flexible Loop (DISEMBL)
Low Complexity Region (SEG)
Disordered Region (IsUnstruct)
Disordered Region (DISOPRED)
Disordered Region (DISEMBL)
Disordered Region (DISPRO)
Transmembrane Helix (TMHMM)
Transmembrane Helix (HMMTOP)
Transmembrane Helix (MEMSAT)
TM Helix, Signal Peptide (MEMSAT_SVM)
TM Helix, Signal Peptide (Phobius)
Signal Peptide (SignalP HMM Mode)
Signal Peptide (SignalP NN Mode)
Coiled Coils (COILS)
Positional Conservation
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-
MWDLNDSLKDIEESEACSSQKTSIDGDDEKGKQVGSVSNSSSSAVVIENGSDEEIGASERRLLSRSGGSKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRANWVGVKFCQSEPIVEAGKSVIEAPQPLKKSRRGPRSRSSQYRGVTFYRRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQDELKASGKSLTMSWCRPQSRFEFGQFSF
ccccccccccccHHHcccccccccccccccccccccccccccccEEcccccccccccHHHHHccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEcccccEEEEEEcccEEEEEcccccHHHHHHHHHHHHHHHcccccccccccccccHHHHHHccccHHHHHHHHcccccccccccccccccEEccccccccccccccccEEEEEcccccHHHHHHHHHHHHHHHccccccccccccccHHHHHHccccccccccccccccccccccc
ccccccccccccccccccccccccccccccccEEcccccccccEEEEEcccccccccccccccccccccEEEEEEcccccccccccccccEEEEcccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccEEccEEEEcccccEEEEEEccccEEEEccccHHHHHHHHHHHHHHHHcccccEEcccHHHHHHHHHHHHHccHHHHHHHHHHccccccccccEEccEEccccccEEEEEcEEccccEEEEEccccHHHHHHHHHHHHHHHcccccEEcccccHcHHHHccccccccHHHHccccccccccccc
mwdlndslkdieeseacssqktsidgddekgkqvgsvsnsSSSAVviengsdeeigASERRLlsrsggskiFGVCLYQeameesepeppvtrqffpvdfqeqqatsseagagglafpranwvgvkfcqsepiveagksvieapqplkksrrgprsrssqyrgvtfyrrtgrweshiwdsgkqvylgGFDTAHAAARAYDRAAIKFrgaeadinfsIEDYEDDLKQMSNLTKEEFVHVLRRqstgfprgsskyrgvtlhkcGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAvkcngkdavtnfdpslYQDELKASgksltmswcrpqsrfefgqfsf
mwdlndslkdieeseacssqktsidgddekgkqvgsvsnssssavviengsdeeigaserrllsrsggsKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRANWVGVKFCQSEPIVeagksvieapqplkksrrgprsrssqyrgvtfyrrtgrweshiwDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLrrqstgfprgsskyrgvtlhkcgrwEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKcngkdavtnfdpsLYQDELKASGKsltmswcrpqsrfefgqfsf
MWDLNDSLKDIEESEACSSQKTSIDGDDEKGKQvgsvsnssssavvIENGSDEEIGASErrllsrsggsKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRANWVGVKFCQSEPIVEAGKSVIEAPQPLKKsrrgprsrssQYRGVTFYRRTGRWESHIWDSGKQVYLGGFDTahaaaraydraaIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQDELKASGKSLTMSWCRPQSRFEFGQFSF
*********************************************************************KIFGVCLYQ*********************************GGLAFPRANWVGVKFCQSEPIVE*************************YRGVTFYRRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQ***********MSWC*************
MWDLNDSL***********************************************************************************************************************************************************TFYRRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGV*********ARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLY*****************************
MWDLNDSLKDI************************************ENGSDEEIGASERRLLSRSGGSKIFGVCLYQEA*********VTRQFFPVDFQEQQATSSEAGAGGLAFPRANWVGVKFCQSEPIVEAGKSVIEAP****************YRGVTFYRRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQDELKASGKSLTMSWCRPQSRFEFGQFSF
******************************************************************GGSKIFGVCLYQEAM**SEPEPPVTRQFFPVDF*********************************************************SSQYRGVTFYRRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQDELKA***********************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MWDLNDSLKDIEESEACSSQKTSIDGDDEKGKQVGSVSNSSSSAVVIENGSDEEIGASERRLLSRSGGSKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRANWVGVKFCQSEPIVEAGKSVIEAPQPLKKSRRGPRSRSSQYRGVTFYRRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQDELKASGKSLTMSWCRPQSRFEFGQFSF
no confident homologs detected
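
The per-residue tracks above are easier to compare once runs of a given state are converted into residue ranges. The following is a minimal Python sketch, assuming the usual PSIPRED-style alphabet (H for helix, E for strand, c for coil). The function name `state_runs` and the toy track are illustrative and are not part of the report.

```python
from itertools import groupby


def state_runs(track, state):
    """Return 1-based inclusive (start, end) ranges where `track` equals `state`."""
    runs = []
    pos = 1  # residue number of the first symbol in the current run
    for symbol, group in groupby(track):
        length = len(list(group))
        if symbol == state:
            runs.append((pos, pos + length - 1))
        pos += length
    return runs


# Hypothetical 15-residue track, not taken from the report.
toy_track = "ccccHHHHccEEEcc"
print(state_runs(toy_track, "H"))
print(state_runs(toy_track, "E"))
```

The same function can be pointed at the transmembrane tracks, where `h` appears to mark predicted helix residues and `i`/`o` the inside/outside segments.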

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

ID      Length  Definition                 RBH(Q2H)  RBH(H2Q)  Q cover  H cover  Identity  E-value
Query   341     2.2.26 [Sep-21-2011]
P47927  432     Floral homeotic protein A  yes       no        0.809    0.638    0.659     6e-97
Q9FH95  352     AP2-like ethylene-respons  no        no        0.739    0.715    0.603     1e-79
Q9SK03  449     Ethylene-responsive trans  no        no        0.765    0.581    0.528     5e-60
Q9LVG2  485     AP2-like ethylene-respons  no        no        0.598    0.420    0.544     1e-51
Q38914  555     AP2-like ethylene-respons  no        no        0.510    0.313    0.5       5e-45
Q6PQQ3  558     AP2-like ethylene-respons  no        no        0.504    0.308    0.494     7e-45
Q1PFE1  415     AP2-like ethylene-respons  no        no        0.460    0.378    0.517     4e-44
Q5YGP7  568     AP2-like ethylene-respons  no        no        0.492    0.295    0.508     6e-44
Q8LSN2  579     AP2-like ethylene-respons  N/A       no        0.639    0.376    0.426     1e-43
Q8L3U3  579     AP2-like ethylene-respons  N/A       no        0.639    0.376    0.421     2e-43
>sp|P47927|AP2_ARATH Floral homeotic protein APETALA 2 OS=Arabidopsis thaliana GN=AP2 PE=1 SV=1
 Score =  354 bits (908), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 219/332 (65%), Positives = 242/332 (72%), Gaps = 56/332 (16%)

Query: 1   MWDLND----SLKDIEESEACSSQKTSIDGDDEKGKQVGSVSNSSSSAVVIENGSDEEIG 56
           MWDLND    + ++ E  E C S  +         K+VGS SNSSSSAVVIE+GSD++  
Sbjct: 1   MWDLNDAPHQTQREEESEEFCYSSPS---------KRVGSFSNSSSSAVVIEDGSDDD-- 49

Query: 57  ASERRLLSRSGGSKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLA- 115
                                 E        P VT QFFP           ++  GG+A 
Sbjct: 50  ----------------------ELNRVRPNNPLVTHQFFP---------EMDSNGGGVAS 78

Query: 116 -FPRANWVGVKFCQSEPIV--EAGKS------VIEAPQPLKKSRRGPRSRSSQYRGVTFY 166
            FPRA+W GVKFCQS+      AGK+      V+E  QPLKKSRRGPRSRSSQYRGVTFY
Sbjct: 79  GFPRAHWFGVKFCQSDLATGSSAGKATNVAAAVVEPAQPLKKSRRGPRSRSSQYRGVTFY 138

Query: 167 RRTGRWESHIWDSGKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQM 226
           RRTGRWESHIWD GKQVYLGGFDTAHAAARAYDRAAIKFRG EADINF+I+DY+DDLKQM
Sbjct: 139 RRTGRWESHIWDCGKQVYLGGFDTAHAAARAYDRAAIKFRGVEADINFNIDDYDDDLKQM 198

Query: 227 SNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVE 286
           +NLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVE
Sbjct: 199 TNLTKEEFVHVLRRQSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVE 258

Query: 287 AARAYDRAAVKCNGKDAVTNFDPSLYQDELKA 318
           AARAYD+AA+KCNGKDAVTNFDPS+Y +EL A
Sbjct: 259 AARAYDKAAIKCNGKDAVTNFDPSIYDEELNA 290




Probable transcriptional activator that promotes early floral meristem identity. Is required subsequently for the transition of an inflorescence meristem into a floral meristem. Plays a central role in the specification of floral identity, particularly for the normal development of sepals and petals in the wild-type flower. Acts as A class cadastral protein by repressing the C class floral homeotic gene AGAMOUS in association with other repressors like LEUNIG and SEUSS. It is also required during seed development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9FH95|TOE3_ARATH AP2-like ethylene-responsive transcription factor TOE3 OS=Arabidopsis thaliana GN=TOE3 PE=2 SV=1
>sp|Q9SK03|RAP27_ARATH Ethylene-responsive transcription factor RAP2-7 OS=Arabidopsis thaliana GN=RAP2-7 PE=2 SV=2
>sp|Q9LVG2|TOE2_ARATH AP2-like ethylene-responsive transcription factor TOE2 OS=Arabidopsis thaliana GN=TOE2 PE=1 SV=1
>sp|Q38914|ANT_ARATH AP2-like ethylene-responsive transcription factor ANT OS=Arabidopsis thaliana GN=ANT PE=1 SV=2
>sp|Q6PQQ3|AIL5_ARATH AP2-like ethylene-responsive transcription factor AIL5 OS=Arabidopsis thaliana GN=AIL5 PE=2 SV=2
>sp|Q1PFE1|AIL1_ARATH AP2-like ethylene-responsive transcription factor AIL1 OS=Arabidopsis thaliana GN=AIL1 PE=2 SV=1
>sp|Q5YGP7|PLET2_ARATH AP2-like ethylene-responsive transcription factor PLT2 OS=Arabidopsis thaliana GN=PLT2 PE=2 SV=1
>sp|Q8LSN2|BBM2_BRANA AP2-like ethylene-responsive transcription factor BBM2 OS=Brassica napus GN=BBM2 PE=2 SV=1
>sp|Q8L3U3|BBM1_BRANA AP2-like ethylene-responsive transcription factor BBM1 OS=Brassica napus GN=BBM1 PE=2 SV=1

Close Homologs in the Non-Redundant Database Detected by BLAST

GI         Length  Definition                                Q cover  H cover  Identity  E-value
Query      341
195979199  512     transcription factor APETALA2 [Citrus tr  0.947    0.630    0.987     0.0
359480728  511     PREDICTED: floral homeotic protein APETA  0.912    0.608    0.756     1e-116
317106692  493     JHL03K20.2 [Jatropha curcas]              0.920    0.636    0.779     1e-114
147791287  520     hypothetical protein VITISV_042267 [Viti  0.856    0.561    0.699     1e-108
224081723  513     AP2 domain-containing transcription fact  0.982    0.653    0.7       1e-108
342360009  517     APETALA2 [Betula platyphylla]             0.912    0.601    0.756     1e-107
296082382  490     unnamed protein product [Vitis vinifera]  0.868    0.604    0.737     1e-107
357482811  511     Transcription factor APETALA2 [Medicago   0.894    0.596    0.728     1e-106
224096732  534     AP2 domain-containing transcription fact  0.961    0.614    0.673     1e-106
356563604  531     PREDICTED: floral homeotic protein APETA  0.923    0.593    0.722     1e-105
>gi|195979199|gb|ACG63707.1| transcription factor APETALA2 [Citrus trifoliata]
 Score =  662 bits (1709), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/323 (98%), Positives = 321/323 (99%)

Query: 1   MWDLNDSLKDIEESEACSSQKTSIDGDDEKGKQVGSVSNSSSSAVVIENGSDEEIGASER 60
           MWDLNDSLKDI+ESEACSSQKTSIDGDDEKGKQVGSVSNSSSSAVVIENGSDEEIGASER
Sbjct: 1   MWDLNDSLKDIKESEACSSQKTSIDGDDEKGKQVGSVSNSSSSAVVIENGSDEEIGASER 60

Query: 61  RLLSRSGGSKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRAN 120
           RLLSRSGGSKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRAN
Sbjct: 61  RLLSRSGGSKIFGVCLYQEAMEESEPEPPVTRQFFPVDFQEQQATSSEAGAGGLAFPRAN 120

Query: 121 WVGVKFCQSEPIVEAGKSVIEAPQPLKKSRRGPRSRSSQYRGVTFYRRTGRWESHIWDSG 180
           WVGVKFCQSEPIVEAGKSVIEAPQPLKKSRRGPRSRSSQYRGVTFYRRTGRWESHIWDSG
Sbjct: 121 WVGVKFCQSEPIVEAGKSVIEAPQPLKKSRRGPRSRSSQYRGVTFYRRTGRWESHIWDSG 180

Query: 181 KQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRR 240
           KQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRR
Sbjct: 181 KQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRR 240

Query: 241 QSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNG 300
           QSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNG
Sbjct: 241 QSTGFPRGSSKYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNG 300

Query: 301 KDAVTNFDPSLYQDELKASGKSL 323
           KDAVTNFDPSLYQDELKASG  +
Sbjct: 301 KDAVTNFDPSLYQDELKASGHGV 323




Source: Citrus trifoliata

Species: Citrus trifoliata

Genus: Citrus

Family: Rutaceae

Order: Sapindales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359480728|ref|XP_002283045.2| PREDICTED: floral homeotic protein APETALA 2 [Vitis vinifera] gi|226377504|gb|ACO52508.1| transcription factor APETALA2 [Vitis vinifera]
>gi|317106692|dbj|BAJ53193.1| JHL03K20.2 [Jatropha curcas]
>gi|147791287|emb|CAN65605.1| hypothetical protein VITISV_042267 [Vitis vinifera]
>gi|224081723|ref|XP_002306481.1| AP2 domain-containing transcription factor [Populus trichocarpa] gi|222855930|gb|EEE93477.1| AP2 domain-containing transcription factor [Populus trichocarpa]
>gi|342360009|gb|AEL29576.1| APETALA2 [Betula platyphylla]
>gi|296082382|emb|CBI21387.3| unnamed protein product [Vitis vinifera]
>gi|357482811|ref|XP_003611692.1| Transcription factor APETALA2 [Medicago truncatula] gi|355513027|gb|AES94650.1| Transcription factor APETALA2 [Medicago truncatula]
>gi|224096732|ref|XP_002310715.1| AP2 domain-containing transcription factor [Populus trichocarpa] gi|222853618|gb|EEE91165.1| AP2 domain-containing transcription factor [Populus trichocarpa]
>gi|356563604|ref|XP_003550051.1| PREDICTED: floral homeotic protein APETALA 2-like [Glycine max]

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

ID                  Length  Definition                      Q cover  H cover  Identity  E-value
Query               341
TAIR|locus:2005493  432     AP2 "APETALA 2" [Arabidopsis t  0.668    0.527    0.686     1.3e-88
TAIR|locus:2155598  352     TOE3 "target of early activati  0.645    0.625    0.641     3.2e-77
TAIR|locus:2173009  558     AIL5 "AINTEGUMENTA-like 5" [Ar  0.530    0.324    0.458     1.9e-37
TAIR|locus:2137559  555     ANT "AINTEGUMENTA" [Arabidopsi  0.563    0.345    0.411     2.4e-37
TAIR|locus:2008216  568     PLT2 "PLETHORA 2" [Arabidopsis  0.548    0.329    0.425     6.3e-37
TAIR|locus:2030265  415     AT1G72570 [Arabidopsis thalian  0.577    0.474    0.401     6.3e-37
TAIR|locus:2091891  574     PLT1 "PLETHORA 1" [Arabidopsis  0.542    0.322    0.422     1.2e-36
TAIR|locus:2155680  498     AIL7 "AINTEGUMENTA-like 7" [Ar  0.457    0.313    0.473     4.4e-36
TAIR|locus:2017829  313     WRI4 "WRINKLED 4" [Arabidopsis  0.478    0.520    0.454     7.2e-36
TAIR|locus:2178915  584     BBM "BABY BOOM" [Arabidopsis t  0.536    0.313    0.408     1.7e-35
TAIR|locus:2005493 AP2 "APETALA 2" [Arabidopsis thaliana (taxid:3702)]
 Score = 855 (306.0 bits), Expect = 1.3e-88, Sum P(2) = 1.3e-88
 Identities = 171/249 (68%), Positives = 186/249 (74%)

Query:    88 PPVTRQFFPVDFQEQQATSSEAGAGGLA--FPRANWVGVKFCQSEPIV--EAGKS----- 138
             P VT QFFP           ++  GG+A  FPRA+W GVKFCQS+      AGK+     
Sbjct:    59 PLVTHQFFP---------EMDSNGGGVASGFPRAHWFGVKFCQSDLATGSSAGKATNVAA 109

Query:   139 -VIEAPQPLKKXXXXXXXXXXQYRGVTFYRRTGRWESHIWDSGKQVYLGGFDTXXXXXXX 197
              V+E  QPLKK          QYRGVTFYRRTGRWESHIWD GKQVYLGGFDT       
Sbjct:   110 AVVEPAQPLKKSRRGPRSRSSQYRGVTFYRRTGRWESHIWDCGKQVYLGGFDTAHAAARA 169

Query:   198 XXXXXIKFRGAEADINFSIEDYEDDLKQMSNLTKEEFVHVLRRQSTGFPRGSSKYRGVTL 257
                  IKFRG EADINF+I+DY+DDLKQM+NLTKEEFVHVLRRQSTGFPRGSSKYRGVTL
Sbjct:   170 YDRAAIKFRGVEADINFNIDDYDDDLKQMTNLTKEEFVHVLRRQSTGFPRGSSKYRGVTL 229

Query:   258 HKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPSLYQDELK 317
             HKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYD+AA+KCNGKDAVTNFDPS+Y +EL 
Sbjct:   230 HKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDKAAIKCNGKDAVTNFDPSIYDEELN 289

Query:   318 A--SGKSLT 324
             A  SG   T
Sbjct:   290 AESSGNPTT 298


GO:0003677 "DNA binding" evidence=IEA
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS;TAS
GO:0005634 "nucleus" evidence=ISM;ISS;TAS
GO:0010093 "specification of floral organ identity" evidence=IMP
GO:0048316 "seed development" evidence=IMP
GO:0010073 "meristem maintenance" evidence=IMP
GO:0009888 "tissue development" evidence=RCA
GO:0030154 "cell differentiation" evidence=RCA;TAS
GO:0048481 "ovule development" evidence=IGI
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
GO:0009908 "flower development" evidence=TAS
GO:0019953 "sexual reproduction" evidence=TAS
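
The GO assignments above follow a regular `ID "name" evidence=CODE;CODE` layout, so they can be turned into records with one regular expression. This is a minimal Python sketch that assumes the layout holds for every line. The pattern and the function name `parse_go_line` are illustrative and are not part of the report.

```python
import re

# GO ID, quoted term name, then semicolon-separated evidence codes.
GO_LINE = re.compile(r'^(GO:\d{7}) "([^"]+)" evidence=(\S+)$')


def parse_go_line(line):
    """Parse one GO line into a dict, or return None if it does not match."""
    match = GO_LINE.match(line.strip())
    if match is None:
        return None
    go_id, name, evidence = match.groups()
    return {"id": go_id, "name": name, "evidence": evidence.split(";")}


print(parse_go_line('GO:0005634 "nucleus" evidence=ISM;ISS;TAS'))
```

Splitting the evidence field makes it straightforward to separate experimentally supported terms (for example IMP or IGI) from electronically inferred ones (IEA).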
TAIR|locus:2155598 TOE3 "target of early activation tagged (EAT) 3" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2173009 AIL5 "AINTEGUMENTA-like 5" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2137559 ANT "AINTEGUMENTA" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2008216 PLT2 "PLETHORA 2" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2030265 AT1G72570 [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2091891 PLT1 "PLETHORA 1" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2155680 AIL7 "AINTEGUMENTA-like 7" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2017829 WRI4 "WRINKLED 4" [Arabidopsis thaliana (taxid:3702)]
TAIR|locus:2178915 BBM "BABY BOOM" [Arabidopsis thaliana (taxid:3702)]

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

ID      Name       Annotated EC number    Identity  Query coverage  Hit coverage  RBH(Q2H)  RBH(H2Q)
P47927  AP2_ARATH  No assigned EC number  0.6596    0.8093          0.6388        yes       no

EC Number Prediction by EzyPred Server

Failed to connect to the EzyPred server

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Failed to connect to the STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

ID          Length  Definition                                          E-value
Query       341
smart00380  64      smart00380, AP2, DNA-binding domain in plant prote  2e-26
smart00380  64      smart00380, AP2, DNA-binding domain in plant prote  2e-23
cd00018     61      cd00018, AP2, DNA-binding domain found in transcri  3e-23
cd00018     61      cd00018, AP2, DNA-binding domain found in transcri  2e-22
pfam00847   53      pfam00847, AP2, AP2 domain                          4e-09
pfam00847   53      pfam00847, AP2, AP2 domain                          2e-07
>gnl|CDD|197689 smart00380, AP2, DNA-binding domain in plant proteins such as APETALA2 and EREBPs
 Score = 99.3 bits (248), Expect = 2e-26
 Identities = 33/63 (52%), Positives = 37/63 (58%)

Query: 251 KYRGVTLHKCGRWEARMGQFLGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPS 310
           KYRGV     G+W A +      K V+LG FDT  EAARAYDRAA K  G+ A  NF  S
Sbjct: 1   KYRGVRQRPWGKWVAEIRDPSKGKRVWLGTFDTAEEAARAYDRAAFKFRGRSARLNFPNS 60

Query: 311 LYQ 313
           LY 
Sbjct: 61  LYD 63


Length = 64

>gnl|CDD|197689 smart00380, AP2, DNA-binding domain in plant proteins such as APETALA2 and EREBPs
>gnl|CDD|237985 cd00018, AP2, DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein)
>gnl|CDD|237985 cd00018, AP2, DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein)
>gnl|CDD|216148 pfam00847, AP2, AP2 domain
>gnl|CDD|216148 pfam00847, AP2, AP2 domain

Conserved Domains Detected by HHsearch

ID          Length  Definition                                          Probability
Query       341
cd00018     61      AP2 DNA-binding domain found in transcription regu  99.72
smart00380  64      AP2 DNA-binding domain in plant proteins such as A  99.71
smart00380  64      AP2 DNA-binding domain in plant proteins such as A  99.71
cd00018     61      AP2 DNA-binding domain found in transcription regu  99.7
PHA00280    121     putative NHN endonuclease                           99.46
PHA00280    121     putative NHN endonuclease                           99.36
PF00847     56      AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-  99.11
PF00847     56      AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-  98.9
cd00652     174     TBP_TLF TATA box binding protein (TBP): Present in  84.92
cd04518     174     TBP_archaea archaeal TATA box binding protein (TBP  81.4
>cd00018 AP2 DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein)
Probab=99.72  E-value=7.9e-18  Score=126.42  Aligned_cols=59  Identities=53%  Similarity=0.885  Sum_probs=55.3

Q ss_pred             CCeeEeEEecCCCeEEEEeecC--CeeEEeCCCCCHHHHHHHHHHHHHHhcCCccccccccc
Q 019391          158 SQYRGVTFYRRTGRWESHIWDS--GKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIE  217 (341)
Q Consensus       158 S~YRGV~~~r~~gKW~A~I~~~--gKri~LGtF~TaEeAArAYD~Aai~l~G~~A~~NFp~s  217 (341)
                      |+|+||++++ +|||+|+|+++  ++++|||+|+|+||||+|||.|+++++|..+.+|||.+
T Consensus         1 s~~~GV~~~~-~gkw~A~I~~~~~gk~~~lG~f~t~eeAa~Ayd~a~~~~~g~~a~~Nf~~~   61 (61)
T cd00018           1 SKYRGVRQRP-WGKWVAEIRDPSGGRRIWLGTFDTAEEAARAYDRAALKLRGSSAVLNFPDS   61 (61)
T ss_pred             CCccCEEECC-CCcEEEEEEeCCCCceEccCCCCCHHHHHHHHHHHHHHhcCCccccCCCCC
Confidence            6899999765 59999999999  99999999999999999999999999999999999853



In EREBPs the domain specifically binds to the 11 bp GCC box of the ethylene response element (ERE), a promoter element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development, contain two copies.
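
The one-copy versus two-copy distinction can be checked against the domain hits reported for this query. The Python sketch below merges overlapping hit ranges and counts the distinct regions. The spans are the query ranges shown in the AP2 alignments of this report (158-217, 159-217, 251-313, 251-310). The function names and the labels are illustrative, and the classification is only the rule of thumb from the paragraph above, not a validated classifier.

```python
def merge_spans(spans):
    """Merge overlapping 1-based inclusive (start, end) spans."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def classify_by_ap2_copies(spans):
    """Label a protein by its number of distinct AP2 regions."""
    copies = len(merge_spans(spans))
    if copies == 1:
        return "single AP2 domain (EREBP-like)"
    if copies == 2:
        return "two AP2 domains (APETALA2-like)"
    return f"{copies} AP2 domains (not covered by the rule of thumb)"


ap2_hits = [(158, 217), (159, 217), (251, 313), (251, 310)]
print(merge_spans(ap2_hits))
print(classify_by_ap2_copies(ap2_hits))
```

For this query the hits collapse into two regions, in line with the APETALA2-like top hits from BLAST.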

>smart00380 AP2 DNA-binding domain in plant proteins such as APETALA2 and EREBPs
>smart00380 AP2 DNA-binding domain in plant proteins such as APETALA2 and EREBPs
>cd00018 AP2 DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein)
>PHA00280 putative NHN endonuclease
>PHA00280 putative NHN endonuclease
>PF00847 AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-related genes transcriptional activator binds to the GCC-box pathogenesis-related promoter element and activates the plant's defence genes
>PF00847 AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-related genes transcriptional activator binds to the GCC-box pathogenesis-related promoter element and activates the plant's defence genes
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription

Homologous Structure Templates

Structure Templates Detected by BLAST

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

ID      Length  Definition                                          E-value
Query   341
1gcc_A  63      Ethylene responsive element binding factor 1; tran  3e-15
1gcc_A  63      Ethylene responsive element binding factor 1; tran  1e-10
1vt4_I  1221    APAF-1 related killer DARK; drosophila apoptosome,  1e-04
>1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A Length = 63
 Score = 68.5 bits (168), Expect = 3e-15
 Identities = 28/63 (44%), Positives = 38/63 (60%), Gaps = 6/63 (9%)

Query: 159 QYRGVTFYRRT-GRWESHIWDSGKQ---VYLGGFDTAHAAARAYDRAAIKFRGAEADINF 214
            YRGV   +R  G++ + I D  K    V+LG F+TA  AA AYDRAA + RG+ A +NF
Sbjct: 2   HYRGVR--QRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNF 59

Query: 215 SIE 217
            + 
Sbjct: 60  PLR 62


>1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A Length = 63
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221

Structure Templates Detected by HHsearch

ID      Length  Definition                                          Probability
Query   341
1gcc_A  63      Ethylene responsive element binding factor 1; tran  99.79
1gcc_A  63      Ethylene responsive element binding factor 1; tran  99.79
3igm_B  77      PF14_0633 protein; AP2 domain, specific transcript  94.82
1u3e_M  174     HNH homing endonuclease; HNH catalytic motif, heli  84.36
3igm_B  77      PF14_0633 protein; AP2 domain, specific transcript  82.46
3eik_A  218     Tata-box-binding protein; DNA-binding, initiation   80.07
>1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A
Probab=99.79  E-value=6.1e-20  Score=139.00  Aligned_cols=58  Identities=48%  Similarity=0.823  Sum_probs=54.0

Q ss_pred             CeeEeEEecCCCeEEEEeecC---CeeEEeCCCCCHHHHHHHHHHHHHHhcCCccccccccc
Q 019391          159 QYRGVTFYRRTGRWESHIWDS---GKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFSIE  217 (341)
Q Consensus       159 ~YRGV~~~r~~gKW~A~I~~~---gKri~LGtF~TaEeAArAYD~Aai~l~G~~A~~NFp~s  217 (341)
                      +||||++ +++|||+|+|+++   ++++|||+|+|+||||+|||.|+++++|..+.+|||.+
T Consensus         2 ~yrGV~~-r~~gkw~A~I~~~~~~g~r~~LGtf~T~eeAA~AyD~Aa~~~~G~~a~~NFp~~   62 (63)
T 1gcc_A            2 HYRGVRQ-RPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR   62 (63)
T ss_dssp             CCTTEEE-ETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred             CcccEee-CCCCcEEEEEccccCCCeEEEeeeCCCHHHHHHHHHHHHHHhcCcccccCCCCc
Confidence            6999997 5799999999984   78999999999999999999999999999999999954



>1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A
>3igm_B PF14_0633 protein; AP2 domain, specific transcription FA protein-DNA complex, transcription-DNA complex; 2.20A {Plasmodium falciparum}
>1u3e_M HNH homing endonuclease; HNH catalytic motif, helix-turn-helix DNA binding domain, PR complex, DNA binding protein-DNA complex; 2.92A {Bacillus phage SPO1} SCOP: d.4.1.3 d.285.1.1
>3igm_B PF14_0633 protein; AP2 domain, specific transcription FA protein-DNA complex, transcription-DNA complex; 2.20A {Plasmodium falciparum}
>3eik_A Tata-box-binding protein; DNA-binding, initiation factor, nucleus, transcription; 1.90A {Encephalitozoon cuniculi} PDB: 3oci_A 3oc3_C

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

ID       Length  Definition                                          E-value
Query    341
d1gcca_  63      d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cr  7e-21
d1gcca_  63      d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cr  2e-19
>d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 63

class: Alpha and beta proteins (a+b)
fold: DNA-binding domain
superfamily: DNA-binding domain
family: GCC-box binding domain
domain: GCC-box binding domain
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 82.8 bits (205), Expect = 7e-21
 Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 4/62 (6%)

Query: 159 QYRGVTFYRRTGRWESHIWDS---GKQVYLGGFDTAHAAARAYDRAAIKFRGAEADINFS 215
            YRGV   R  G++ + I D    G +V+LG F+TA  AA AYDRAA + RG+ A +NF 
Sbjct: 2   HYRGVR-QRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFP 60

Query: 216 IE 217
           + 
Sbjct: 61  LR 62


>d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 63

Homologous Domains Detected by HHsearch

ID       Length  Definition                                          Probability
Query    341
d1gcca_  63      GCC-box binding domain {Mouse-ear cress (Arabidops  99.82
d1gcca_  63      GCC-box binding domain {Mouse-ear cress (Arabidops  99.8
>d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]}
class: Alpha and beta proteins (a+b)
fold: DNA-binding domain
superfamily: DNA-binding domain
family: GCC-box binding domain
domain: GCC-box binding domain
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.82  E-value=4.8e-21  Score=143.46  Aligned_cols=60  Identities=38%  Similarity=0.587  Sum_probs=55.2

Q ss_pred             cccCeeEeecceEEEEecee-cCceeEEeccCCCHHHHHHHHHHHHHHhcCCCCCCCCCCc
Q 019391          251 KYRGVTLHKCGRWEARMGQF-LGKKYVYLGLFDTEVEAARAYDRAAVKCNGKDAVTNFDPS  310 (341)
Q Consensus       251 ~yrGV~~~k~GkW~ArI~~~-~~~k~~~LG~FdTeeEAArAYDkAaikl~G~~A~tNFp~s  310 (341)
                      +|+||++++.|||+|+|+.. ..+++++||+|+|+||||+|||+|+++|+|..+.+|||+.
T Consensus         2 ~yrGVr~r~~gkw~A~Ir~~~~~~~r~~LGtf~t~eeAArAYD~aa~~~~G~~a~~NFP~~   62 (63)
T d1gcca_           2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR   62 (63)
T ss_dssp             CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred             CcceEeECCCCCEEEEEecCCCCCcEeccccccCHHHHHHHHHHHHHHhcCCCcccCCCcc
Confidence            59999998899999999753 4568899999999999999999999999999999999975



>d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]}