Citrus Sinensis ID: 031272


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160--
MGYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKATILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTVGDKDKKIEELTLELERQEQLCAAYREKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLKSFLQ
ccccccccEEEEEEcEEEEcccccEEEEEEEEEcccccccccccccccccccccccccccEEEEEEEEEccEEEEEEEEcccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHccc
ccHHHcccEEEEEEEEEEEccccEEEEEEEEEccccccccccccccccccccccccccccEEEEEEEcccccEEEEEEHHHcccccccccccccccccHcHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHcc
MGYARRMGFVVRIMQrrrsrtdgttlarrlgcnkqgfspnskgtngpekkprpsaregcKATILVKMEKSGKWVVTRFIkdhnhplvvtangystvgdkdkKIEELTLELERQEQLCAAYREKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLKSFLQ
mgyarrmgfvvrimqrrrsrtdgttlarrlgcnkqgfspnskgtngpekkprpsaregcKATILVKMEKSGKWVVTrfikdhnhplvvtangystvgdkdKKIEELTLELERQEQLCAAYREKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLKSFLQ
MGYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKATILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTVGDKDKKIeeltlelerqeqlCAAYREKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLKSFLQ
*******GFVVRI*********************************************CKATILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTVGDKDKKIEELTLELERQEQLCAAYREKLFNFMNNVE*******SKIQVIVDNIR************
MGYARRMGFVVRIMQRRRSR*DGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKATILVKMEKSGKWVVTRFIKDHNHPLVVTANG*****************************EKLFNFMNNVEEQTEELSSKIQVIVDNIRKVES********
MGYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSP******************GCKATILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTVGDKDKKIEELTLELERQEQLCAAYREKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLKSFLQ
MGYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQ*******************AREGCKATILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTVGDKDKKIEELTLELERQEQLCAAYREKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLKSFL*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKATILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTxxxxxxxxxxxxxxxxxxxxxxxxxxxxLFNFxxxxxxxxxxxxxxxxxxxxxIRKVESEMLKSFLQ
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query162 2.2.26 [Sep-21-2011]
Q9SZL8 788 Protein FAR1-RELATED SEQU no no 0.530 0.109 0.522 4e-15
Q3E7I5 788 Protein FAR1-RELATED SEQU no no 0.438 0.090 0.447 9e-12
Q9M8J3 764 Protein FAR1-RELATED SEQU no no 0.432 0.091 0.452 9e-12
Q6NQJ7 732 Protein FAR1-RELATED SEQU no no 0.537 0.118 0.351 5e-09
Q3EBQ3 807 Protein FAR1-RELATED SEQU no no 0.623 0.125 0.311 6e-09
Q9SWG3 827 Protein FAR-RED IMPAIRED no no 0.518 0.101 0.329 2e-08
Q9LIE5 839 Protein FAR-RED ELONGATED no no 0.524 0.101 0.319 2e-07
Q5UBY2 687 Protein FAR1-RELATED SEQU no no 0.518 0.122 0.312 6e-07
Q9SY66 680 Protein FAR1-RELATED SEQU no no 0.580 0.138 0.327 4e-05
Q9SZL7545 Protein FAR1-RELATED SEQU no no 0.462 0.137 0.330 0.0001
>sp|Q9SZL8|FRS5_ARATH Protein FAR1-RELATED SEQUENCE 5 OS=Arabidopsis thaliana GN=FRS5 PE=2 SV=1 Back     alignment and function desciption
 Score = 80.5 bits (197), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 46/88 (52%), Positives = 59/88 (67%), Gaps = 2/88 (2%)

Query: 2   GYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGF-SPNSKGTNGPE-KKPRPSAREGC 59
            YARR+GF  R+   RRSR DG  + R+  C K+GF + N K T   E K+PR   R GC
Sbjct: 91  SYARRIGFSTRVSSSRRSRRDGAIIQRQFVCAKEGFRNMNEKRTKDREIKRPRTITRVGC 150

Query: 60  KATILVKMEKSGKWVVTRFIKDHNHPLV 87
           KA++ VKM+ SGKW+V+ F+KDHNH LV
Sbjct: 151 KASLSVKMQDSGKWLVSGFVKDHNHELV 178




Putative transcription activator involved in regulating light control of development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q3E7I5|FRS12_ARATH Protein FAR1-RELATED SEQUENCE 12 OS=Arabidopsis thaliana GN=FRS12 PE=2 SV=1 Back     alignment and function description
>sp|Q9M8J3|FRS7_ARATH Protein FAR1-RELATED SEQUENCE 7 OS=Arabidopsis thaliana GN=FRS7 PE=2 SV=1 Back     alignment and function description
>sp|Q6NQJ7|FRS4_ARATH Protein FAR1-RELATED SEQUENCE 4 OS=Arabidopsis thaliana GN=FRS4 PE=2 SV=2 Back     alignment and function description
>sp|Q3EBQ3|FRS2_ARATH Protein FAR1-RELATED SEQUENCE 2 OS=Arabidopsis thaliana GN=FRS2 PE=2 SV=1 Back     alignment and function description
>sp|Q9SWG3|FAR1_ARATH Protein FAR-RED IMPAIRED RESPONSE 1 OS=Arabidopsis thaliana GN=FAR1 PE=1 SV=1 Back     alignment and function description
>sp|Q9LIE5|FHY3_ARATH Protein FAR-RED ELONGATED HYPOCOTYL 3 OS=Arabidopsis thaliana GN=FHY3 PE=1 SV=1 Back     alignment and function description
>sp|Q5UBY2|FRS1_ARATH Protein FAR1-RELATED SEQUENCE 1 OS=Arabidopsis thaliana GN=FRS1 PE=1 SV=1 Back     alignment and function description
>sp|Q9SY66|FRS11_ARATH Protein FAR1-RELATED SEQUENCE 11 OS=Arabidopsis thaliana GN=FRS11 PE=2 SV=1 Back     alignment and function description
>sp|Q9SZL7|FRS9_ARATH Protein FAR1-RELATED SEQUENCE 9 OS=Arabidopsis thaliana GN=FRS9 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query162
356567708219 PREDICTED: protein FAR1-RELATED SEQUENCE 0.962 0.712 0.801 5e-71
255572674212 conserved hypothetical protein [Ricinus 0.975 0.745 0.816 3e-70
124360619 657 FAR1; Polynucleotidyl transferase, Ribon 0.969 0.238 0.789 4e-70
449502056209 PREDICTED: protein FAR1-RELATED SEQUENCE 0.975 0.755 0.797 9e-70
449462447209 PREDICTED: protein FAR1-RELATED SEQUENCE 0.975 0.755 0.797 1e-69
224069685195 predicted protein [Populus trichocarpa] 0.950 0.789 0.794 1e-67
388506702180 unknown [Lotus japonicus] 0.944 0.85 0.777 2e-67
224129362167 predicted protein [Populus trichocarpa] 0.907 0.880 0.756 2e-61
147864082240 hypothetical protein VITISV_030389 [Viti 0.944 0.637 0.597 8e-46
359495410 471 PREDICTED: uncharacterized protein LOC10 0.975 0.335 0.584 9e-46
>gi|356567708|ref|XP_003552059.1| PREDICTED: protein FAR1-RELATED SEQUENCE 7-like [Glycine max] Back     alignment and taxonomy information
 Score =  271 bits (694), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 125/156 (80%), Positives = 144/156 (92%)

Query: 3   YARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKAT 62
           YARR+GFVVRIMQRRRS  DG TLARRLGCNKQGFSPN+KG  GPEKKPRPSAREGCKAT
Sbjct: 53  YARRVGFVVRIMQRRRSGIDGRTLARRLGCNKQGFSPNNKGILGPEKKPRPSAREGCKAT 112

Query: 63  ILVKMEKSGKWVVTRFIKDHNHPLVVTANGYSTVGDKDKKIEELTLELERQEQLCAAYRE 122
           ILVK EKSGKW+VTRF+KDHNHPL+ TANG+ST GDKDKKI ELT+ELERQ+QLCAAYRE
Sbjct: 113 ILVKFEKSGKWIVTRFVKDHNHPLIATANGFSTAGDKDKKIVELTMELERQDQLCAAYRE 172

Query: 123 KLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEMLK 158
           KL +F++NVEE+TEELS+KIQ+I++N+R+ ES+M K
Sbjct: 173 KLLSFISNVEEETEELSTKIQLIIENVRRAESDMKK 208




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255572674|ref|XP_002527270.1| conserved hypothetical protein [Ricinus communis] gi|223533363|gb|EEF35114.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|124360619|gb|ABD33394.2| FAR1; Polynucleotidyl transferase, Ribonuclease H fold [Medicago truncatula] Back     alignment and taxonomy information
>gi|449502056|ref|XP_004161531.1| PREDICTED: protein FAR1-RELATED SEQUENCE 5-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449462447|ref|XP_004148952.1| PREDICTED: protein FAR1-RELATED SEQUENCE 5-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224069685|ref|XP_002303021.1| predicted protein [Populus trichocarpa] gi|222844747|gb|EEE82294.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388506702|gb|AFK41417.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|224129362|ref|XP_002320568.1| predicted protein [Populus trichocarpa] gi|222861341|gb|EEE98883.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147864082|emb|CAN81119.1| hypothetical protein VITISV_030389 [Vitis vinifera] Back     alignment and taxonomy information
>gi|359495410|ref|XP_002270370.2| PREDICTED: uncharacterized protein LOC100248330 [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query162
TAIR|locus:2135738183 AT4G12850 "AT4G12850" [Arabido 0.925 0.819 0.541 1.2e-40
TAIR|locus:2041041206 AT2G43280 [Arabidopsis thalian 0.932 0.733 0.473 1.2e-35
TAIR|locus:2079651217 AT3G07500 "AT3G07500" [Arabido 0.932 0.695 0.408 1.4e-25
TAIR|locus:2097543251 AT3G59470 [Arabidopsis thalian 0.919 0.593 0.380 6.8e-24
TAIR|locus:2121060 788 FRS5 "FAR1-related sequence 5" 0.524 0.107 0.528 1.3e-22
TAIR|locus:2129665 827 FAR1 "FAR-RED IMPAIRED RESPONS 0.518 0.101 0.341 2.4e-10
TAIR|locus:2082420 764 FRS7 "FAR1-related sequence 7" 0.432 0.091 0.452 4.6e-10
TAIR|locus:2144930 788 FRS12 "FAR1-related sequence 1 0.432 0.088 0.452 4.8e-10
TAIR|locus:2062606 807 FRS2 "FAR1-related sequence 2" 0.580 0.116 0.333 4.6e-09
TAIR|locus:2090394 839 FHY3 "far-red elongated hypoco 0.524 0.101 0.329 6.5e-09
TAIR|locus:2135738 AT4G12850 "AT4G12850" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 432 (157.1 bits), Expect = 1.2e-40, P = 1.2e-40
 Identities = 84/155 (54%), Positives = 113/155 (72%)

Query:     3 YARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKAT 62
             Y++R+GFVVR+MQRRRS  DG TLARRLGCNKQGF PN++ ++        S+REGCKAT
Sbjct:    29 YSKRLGFVVRMMQRRRSGIDGRTLARRLGCNKQGFGPNNQRSSSSSS----SSREGCKAT 84

Query:    63 ILVKMEKSGKWVVTRFIKDHNHPL-VVTANGYSTVGDKDKKIXXXXXXXXXXXXXCAAYR 121
             ILVKMEKSGKWVVTRFIK+HNH L  + ++ Y +  DK++KI             C  YR
Sbjct:    85 ILVKMEKSGKWVVTRFIKEHNHSLQFIGSSSYDSFADKERKIKELTEEIECQDRLCDVYR 144

Query:   122 EKLFNFMNNVEEQTEELSSKIQVIVDNIRKVESEM 156
             ++L +F++NVE  TEELS K++ IV+N++K+E ++
Sbjct:   145 DRLVSFIDNVEHYTEELSLKVRDIVENVKKLECQI 179




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0009639 "response to red or far red light" evidence=ISS
TAIR|locus:2041041 AT2G43280 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079651 AT3G07500 "AT3G07500" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2097543 AT3G59470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2121060 FRS5 "FAR1-related sequence 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2129665 FAR1 "FAR-RED IMPAIRED RESPONSE 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2082420 FRS7 "FAR1-related sequence 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2144930 FRS12 "FAR1-related sequence 12" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062606 FRS2 "FAR1-related sequence 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2090394 FHY3 "far-red elongated hypocotyls 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query162
pfam0310190 pfam03101, FAR1, FAR1 DNA-binding domain 5e-34
PLN03097 846 PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTY 2e-10
>gnl|CDD|217369 pfam03101, FAR1, FAR1 DNA-binding domain Back     alignment and domain information
 Score =  114 bits (288), Expect = 5e-34
 Identities = 43/85 (50%), Positives = 58/85 (68%), Gaps = 1/85 (1%)

Query: 3  YARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKAT 62
          YARR+GF VR  + RRSR DG  + RR  C+K+GF    K +   +++PR S R GCKA 
Sbjct: 6  YARRVGFSVRKSKSRRSRRDGEIIRRRFVCSKEGFRKERKKSV-RKRRPRASTRTGCKAM 64

Query: 63 ILVKMEKSGKWVVTRFIKDHNHPLV 87
          ++VK+   GKWVVT+F+ +HNHPL 
Sbjct: 65 MVVKLRSDGKWVVTKFVLEHNHPLA 89


This domain contains a WRKY like fold and is therefore most likely a zinc binding DNA-binding domain. Length = 90

>gnl|CDD|178645 PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 162
PF0310191 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 99.95
PLN03097 846 FHY3 Protein FAR-RED ELONGATED HYPOCOTYL 3; Provis 99.95
PF08731111 AFT: Transcription factor AFT; InterPro: IPR014842 98.11
PF0310660 WRKY: WRKY DNA -binding domain; InterPro: IPR00365 91.32
PF0450062 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR00 90.32
PF0310867 DBD_Tnp_Mut: MuDR family transposase; InterPro: IP 80.26
>PF03101 FAR1: FAR1 DNA-binding domain; InterPro: IPR004330 Phytochrome A is the primary photoreceptor for mediating various far-red light-induced responses in higher plants Back     alignment and domain information
Probab=99.95  E-value=1.4e-27  Score=167.18  Aligned_cols=87  Identities=41%  Similarity=0.718  Sum_probs=78.9

Q ss_pred             ccccccCeEEEEeceeecCCCCceEEEEEEecCCCCCCCCCCCCCCCCCCCCccccCCceEEEEEEeeCCcEEEEEEeec
Q 031272            2 GYARRMGFVVRIMQRRRSRTDGTTLARRLGCNKQGFSPNSKGTNGPEKKPRPSAREGCKATILVKMEKSGKWVVTRFIKD   81 (162)
Q Consensus         2 ~YA~~~GF~iR~~~~~rs~~~g~i~~~~~vCsreG~~~~~~~~~~~~~~~r~~tR~gC~A~i~v~~~~~gkW~V~~f~~e   81 (162)
                      +||+.+||+||+.++++++.+|.+++..|+|+++|+...+.......+++++++||||+|+|.|++..+|+|.|+.|+.|
T Consensus         5 ~yA~~~GF~vr~~~s~~~~~~~~~~~~~~~C~r~G~~~~~~~~~~~~~r~~~s~ktgC~a~i~v~~~~~~~w~v~~~~~~   84 (91)
T PF03101_consen    5 SYARRHGFSVRKSSSRKSKKNGEIKRVTFVCSRGGKYKSKKKNEEKRRRNRPSKKTGCKARINVKRRKDGKWRVTSFVLE   84 (91)
T ss_pred             HhcCcCCeEEEEeeeEeCCCCceEEEEEEEECCcccccccccccccccccccccccCCCEEEEEEEccCCEEEEEECcCC
Confidence            59999999999999999889999999999999999997654333456778999999999999999987899999999999


Q ss_pred             CCcCccc
Q 031272           82 HNHPLVV   88 (162)
Q Consensus        82 HNH~L~~   88 (162)
                      |||||+|
T Consensus        85 HNH~L~P   91 (91)
T PF03101_consen   85 HNHPLCP   91 (91)
T ss_pred             cCCCCCC
Confidence            9999987



It has been found that the proteins governing this response, which include FAR-RED ELONGATED HYPOCOTYL3 (FHY3) and FAR-RED-IMPAIRED RESPONSE1 (FAR1), are a pair of homologous proteins sharing significant sequence homology to mutator-like transposases. These proteins appear to be novel transcription factors, which are essential for activating the expression of FHY1 and FHL (for FHY1-like) and related genes, whose products are required for light-induced phytochrome A nuclear accumulation and subsequent light responses in plants. The FRS (FAR1 Related Sequences) family of proteins share a similar domain structure to mutator-like transposases, including an N-terminal C2H2 zinc finger domain, a central putative core transposase domain, and a C-terminal SWIM motif (named after SWI2/SNF and MuDR transposases). It seems plausible that the FRS family represent transcription factors derived from mutator-like transposases [, ]. This entry represents a domain found in FAR1 and FRS proteins. It contains a WRKY like fold and is therefore most likely a zinc binding DNA-binding domain.

>PLN03097 FHY3 Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional Back     alignment and domain information
>PF08731 AFT: Transcription factor AFT; InterPro: IPR014842 AFT (activator of iron transcription) is an iron regulated transcriptional activator that regulates the expression of genes involved in iron homeostasis Back     alignment and domain information
>PF03106 WRKY: WRKY DNA -binding domain; InterPro: IPR003657 The WRKY domain is a 60 amino acid region that is defined by the conserved amino acid sequence WRKYGQK at its N-terminal end, together with a novel zinc-finger- like motif Back     alignment and domain information
>PF04500 FLYWCH: FLYWCH zinc finger domain; InterPro: IPR007588 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>PF03108 DBD_Tnp_Mut: MuDR family transposase; InterPro: IPR004332 The plant MuDR transposase domain is present in plant proteins that are presumed to be the transposases for Mutator transposable elements [, ] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query162
2ayd_A76 WRKY transcription factor 1; beta strands, zinc fi 92.65
1wj2_A78 Probable WRKY transcription factor 4; DNA-binding 92.13
2rpr_A87 Flywch-type zinc finger-containing protein 1; flyw 81.31
>2ayd_A WRKY transcription factor 1; beta strands, zinc finger; 1.60A {Arabidopsis thaliana} Back     alignment and structure
Probab=92.65  E-value=0.19  Score=33.47  Aligned_cols=32  Identities=22%  Similarity=0.423  Sum_probs=27.1

Q ss_pred             cCCceEEEEEEe-eCCcEEEEEEeecCCcCccc
Q 031272           57 EGCKATILVKME-KSGKWVVTRFIKDHNHPLVV   88 (162)
Q Consensus        57 ~gC~A~i~v~~~-~~gkW~V~~f~~eHNH~L~~   88 (162)
                      .||+|+=.|.+. +++.-+++-..-+|||++.+
T Consensus        43 ~gC~a~K~Ver~~~d~~~~~~tY~G~H~H~~p~   75 (76)
T 2ayd_A           43 PGCPVKKHVERSSHDTKLLITTYEGKHDHDMPP   75 (76)
T ss_dssp             TTCCCEEEEEECSSSTTEEEEEEESCCSSCCCC
T ss_pred             CCCCceeeEEEECCCCCEEEEEEccCcCCCCCC
Confidence            499999999876 46788899999999999843



>1wj2_A Probable WRKY transcription factor 4; DNA-binding domain, zinc-binding, structural genomics; NMR {Arabidopsis thaliana} SCOP: g.79.1.1 PDB: 2lex_A* Back     alignment and structure
>2rpr_A Flywch-type zinc finger-containing protein 1; flywch domain, alternative splicing, DNA-binding, metal- binding, nucleus, metal binding protein; NMR {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query162
d1wj2a_71 WRKY DNA-binding protein 4 {Thale cress (Arabidops 92.8
>d1wj2a_ g.79.1.1 (A:) WRKY DNA-binding protein 4 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: Small proteins
fold: WRKY DNA-binding domain
superfamily: WRKY DNA-binding domain
family: WRKY DNA-binding domain
domain: WRKY DNA-binding protein 4
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=92.80  E-value=0.087  Score=33.15  Aligned_cols=32  Identities=34%  Similarity=0.438  Sum_probs=26.5

Q ss_pred             ccCCceEEEEEEe-eCCcEEEEEEeecCCcCcc
Q 031272           56 REGCKATILVKME-KSGKWVVTRFIKDHNHPLV   87 (162)
Q Consensus        56 R~gC~A~i~v~~~-~~gkW~V~~f~~eHNH~L~   87 (162)
                      ..||+|+=.|-.. +++.-+++-..-+||||+.
T Consensus        38 ~~~C~a~K~Vqr~~~d~~~~~vtY~G~H~h~~P   70 (71)
T d1wj2a_          38 TPGCGVRKHVERAATDPKAVVTTYEGKHNHDLP   70 (71)
T ss_dssp             CSSCEEEEEEEEETTTTSEEEEEEESCCSSCCC
T ss_pred             ccCCCCcceEEEEcCCCCEEEEEEeeEeCCCCC
Confidence            3599999999876 4677888888999999873