Citrus Sinensis ID: 011169


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490--
MNGGDMSAPIKDPAIKLFGAKIPVPDTRIPDVCKQITKEEAEDNHEYNSEEPDKSSSLNNVKEGARNFVQANGIQANAKTKADQSESNTDQEKAFKRPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVLNLRDQKRSVEICTVTCVENKEEPSSSGSPVTASSIRGNELPESVMQKEQVMMTGSCNELNMQQSMHCYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLTGSNGCLSPSSSTSNNSCCSGNGSPTLGKHSRDPNLTDEGQSEKCVLVPKTLRIDDPDEASRSPIWATLGFKPDQKDLKSGGAIFKDFETKKGGKADVLDGAEILESNPAALSRSHTFQEST
ccccccccccccccccccccccccccccccccHHcccHHHHHccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccc
ccccccccccccHHHHHccccccccccccccccHcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccccccccEEEEEcccccccccHHHHHHHHHHccccccccccccccccccccccccccccEcccccccccccccccccccccccccccccccccccccccEEEccccccccccHHHHccccHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEccEEEEccHHHHHHcHHHHHcccccccccccccccEEcccccccccccccccccHHHHccHHHHHccccccccc
mnggdmsapikdpaiklfgakipvpdtripdVCKQITKEeaednheynseepdkssslnnvKEGARNFVQANgiqanaktkadqsesntdqekafkrpdkiipcprcnsmdtkfcyfnnynvnqprhfckncqrywtaggtmrnvpvgagrrknkHFASQYRQILvssdgvpvtgletpnsanqqlmlcgdsattfrpstgngtvlkfgpetplcesMENVLNLRDQKRSVEICTVTCvenkeepsssgspvtassirgnelpesvmQKEQVMMTGscnelnmqqsmhcypvspwgvpwnpnnvasmeaaqnpsdrvcvpngsnpgpvqwcpttvlavpgfcapniplqfvpasywgmgvwaagagnipltgsngclspssstsnnsccsgngsptlgkhsrdpnltdegqsekcvlvpktlriddpdeasrspiwatlgfkpdqkdlksggaifkdfetkkggkadvldgaeilesnpaalsrshtfqest
mnggdmsapiKDPAIKLFgakipvpdtriPDVCKQITKeeaednheynseepdkssslNNVKEGARNFVQANGIqanaktkadqsesntdqekafkrpdkiipcprCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVlnlrdqkrsVEICTVTcvenkeepsssgspvtassirgneLPESVMQKEQVMMTGSCNELNMQQSMHCYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLTGSNGCLSPSSSTSNNSCCSGNGSPTLGKHSrdpnltdegqsekcvlvpktlriddpdeasrspiwatlgfkpdqkdlKSGGAIFKDFETKKGGKADVLDGAEIlesnpaalsrshtfqest
MNGGDMSAPIKDPAIKLFGAKIPVPDTRIPDVCKQITKEEAEDNHEYNSEEPDKSSSLNNVKEGARNFVQANGIQANAKTKADQSESNTDQEKAFKRPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVLNLRDQKRSVEICTVTCVENKEEPsssgspvtassIRGNELPESVMQKEQVMMTGSCNELNMQQSMHCYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLtgsngclspssstsnnsccsgngsptLGKHSRDPNLTDEGQSEKCVLVPKTLRIDDPDEASRSPIWATLGFKPDQKDLKSGGAIFKDFETKKGGKADVLDGAEILESNPAALSRSHTFQEST
**************IKLFGAKIPVPDTRIPDVC******************************************************************KIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVT***********LMLCGD****F*****NGTVLKFGPETPLCESMENVLNLRDQKRSVEICTVTCV**********************************************SMHCYPVSPWGVPWNPN***********************GPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLTG*******************************************VLV****************IWATLGF************IF*************************************
*************AIKLFGAKIPVP*********************************************************************************CNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTM*****************************************************************************************************************************************************VSPWGVPWNPNNVASMEA******************VQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPL**********************************************LVPKTLRIDDPDEASRSPIWATLGFKPD*******************************ESNPAALSR*HTF****
MNGGDMSAPIKDPAIKLFGAKIPVPDTRIPDVCKQITKEE******************NNVKEGARNFVQANGIQA******************FKRPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVLNLRDQKRSVEICTVTCVE****************IRGNELPESVMQKEQVMMTGSCNELNMQQSMHCYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLTGSNGC****************GSPTLGKHSRDPNLTDEGQSEKCVLVPKTLRIDDPDEASRSPIWATLGFKPDQKDLKSGGAIFKDFETKKGGKADVLDGAEILESNPAA***********
*********IKDPAIKLF*AKIPVP*********************************************************************FKRPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVG**********************************************T*****GNGTVLKFGPETPLCESMENVLNLRDQ**************************************************CNELNMQQSMHCYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLTGSN*************************************SEKCVLVPKTLRIDDPDEASRSPIWATLGFKPD************************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MNGGDMSAPIKDPAIKLFGAKIPVPDTRIPDVCKQITKEEAEDNHEYNSEEPDKSSSLNNVKEGARNFVQANGIQANAKTKADQSESNTDQEKAFKRPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVLNLRDQKRSVEICTVTCVENKEEPSSSGSPVTASSIRGNELPESVMQKEQVMMTGSCNELNMQQSMHCYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWGMGVWAAGAGNIPLTGSNGCLSPSSSTSNNSCCSGNGSPTLGKHSRDPNLTDEGQSEKCVLVPKTLRIDDPDEASRSPIWATLGFKPDQKDLKSGGAIFKDFETKKGGKADVLDGAEILESNPAALSRSHTFQEST
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query492 2.2.26 [Sep-21-2011]
Q8LFV3448 Cyclic dof factor 3 OS=Ar yes no 0.857 0.941 0.369 4e-62
Q93ZL5457 Cyclic dof factor 2 OS=Ar no no 0.792 0.853 0.338 2e-57
Q8W1E3298 Cyclic dof factor 1 OS=Ar no no 0.252 0.416 0.476 1e-35
Q9LQX4366 Dof zinc finger protein D no no 0.268 0.360 0.482 2e-30
Q9SEZ3399 Cyclic dof factor 5 OS=Ar no no 0.054 0.067 0.675 1e-29
P68350175 Dof zinc finger protein D no no 0.126 0.354 0.887 4e-29
O22967170 Cyclic dof factor 4 OS=Ar no no 0.152 0.441 0.763 3e-28
Q9M2U1323 Dof zinc finger protein D no no 0.140 0.213 0.594 3e-21
Q8LDR0307 Dof zinc finger protein D no no 0.132 0.211 0.656 5e-21
Q8LAP8342 Dof zinc finger protein D no no 0.132 0.190 0.615 8e-21
>sp|Q8LFV3|CDF3_ARATH Cyclic dof factor 3 OS=Arabidopsis thaliana GN=CDF3 PE=1 SV=2 Back     alignment and function desciption
 Score =  239 bits (610), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 188/509 (36%), Positives = 247/509 (48%), Gaps = 87/509 (17%)

Query: 6   MSAPIKDPAIKLFGAKIPVPDTRIPDVCKQITKEEAEDNHEYNSEEPDKSS------SLN 59
           M    +DPAIKLFG KIP P   + +    +  +E +D    + + P+K +      + N
Sbjct: 1   MMMETRDPAIKLFGMKIPFPS--VFESAVTVEDDEEDDWSGGDDKSPEKVTPELSDKNNN 58

Query: 60  NVKEGARNFVQANGIQANAKTKADQSESNTDQE---------KAFKRPDKIIPCPRCNSM 110
           N  + + N  +   +     T  DQ ES+   E         K  K+P KI+PCPRC SM
Sbjct: 59  NCNDNSFNNSKPETLDKEEATSTDQIESSDTPEDNQQTTPDGKTLKKPTKILPCPRCKSM 118

Query: 111 DTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDG 170
           +TKFCY+NNYN+NQPRHFCK CQRYWTAGGTMRNVPVGAGRRKNK  +S YR I + S+ 
Sbjct: 119 ETKFCYYNNYNINQPRHFCKACQRYWTAGGTMRNVPVGAGRRKNKSSSSHYRHITI-SEA 177

Query: 171 VPVTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVLN-LRDQKR 229
           +    L+    AN +++  G  A     +     V+K   +  +     N  + L DQ+ 
Sbjct: 178 LEAARLDPGLQANTRVLSFGLEAQQQHVAAPMTPVMKLQEDQKVSNGARNRFHGLADQR- 236

Query: 230 SVEICTVTCVENKEEPSSSGSPVTASSIRGNELPESVMQKEQVMMTGSCNELNMQQSMH- 288
                 V  VEN ++  SSGS VT S+   + + ES  Q   V+     N  N   + + 
Sbjct: 237 -----LVARVENGDD-CSSGSSVTTSN--NHSVDESRAQSGSVVEAQMNNNNNNNMNGYA 288

Query: 289 CYPVSPWGVPWNPNNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPL 348
           C P  PW   WNP   A       P     +P                            
Sbjct: 289 CIPGVPWPYTWNP---AMPPPGFYPPPGYPMP---------------------------- 317

Query: 349 QFVPASYWGMGVWAAGAGNIPLTGSNGCLSPSSSTSNNSCCSGNGSPTLGKHSRDP---- 404
            F P  YW           IP+   +   SP S       CS   SPTLGKH RD     
Sbjct: 318 -FYP--YW----------TIPMLPPHQSSSPISQK-----CSNTNSPTLGKHPRDEGSSK 359

Query: 405 --NLTDEGQSEKCVLVPKTLRIDDPDEASRSPIWATLGFKPDQKDLKSGGAIFKDFETK- 461
             N T+  Q   CVLVPKTLRIDDP+EA++S IW TLG K ++   K+GG +FK F+ K 
Sbjct: 360 KDNETERKQKAGCVLVPKTLRIDDPNEAAKSSIWTTLGIK-NEAMCKAGG-MFKGFDHKT 417

Query: 462 KGGKADVLDGAEILESNPAALSRSHTFQE 490
           K    D  + + +L +NPAALSRSH F E
Sbjct: 418 KMYNNDKAENSPVLSANPAALSRSHNFHE 446




Transcription factor that binds specifically to a 5'-AA[AG]G-3' consensus core sequence.
Arabidopsis thaliana (taxid: 3702)
>sp|Q93ZL5|CDF2_ARATH Cyclic dof factor 2 OS=Arabidopsis thaliana GN=CDF2 PE=1 SV=2 Back     alignment and function description
>sp|Q8W1E3|CDF1_ARATH Cyclic dof factor 1 OS=Arabidopsis thaliana GN=CDF1 PE=1 SV=2 Back     alignment and function description
>sp|Q9LQX4|DOF13_ARATH Dof zinc finger protein DOF1.3 OS=Arabidopsis thaliana GN=DOF1.3 PE=2 SV=1 Back     alignment and function description
>sp|Q9SEZ3|CDF5_ARATH Cyclic dof factor 5 OS=Arabidopsis thaliana GN=CDF5 PE=2 SV=1 Back     alignment and function description
>sp|P68350|DOF15_ARATH Dof zinc finger protein DOF1.5 OS=Arabidopsis thaliana GN=DOF1.5 PE=1 SV=1 Back     alignment and function description
>sp|O22967|CDF4_ARATH Cyclic dof factor 4 OS=Arabidopsis thaliana GN=CDF4 PE=2 SV=1 Back     alignment and function description
>sp|Q9M2U1|DOF36_ARATH Dof zinc finger protein DOF3.6 OS=Arabidopsis thaliana GN=DOF3.6 PE=1 SV=2 Back     alignment and function description
>sp|Q8LDR0|DOF54_ARATH Dof zinc finger protein DOF5.4 OS=Arabidopsis thaliana GN=DOF5.4 PE=2 SV=2 Back     alignment and function description
>sp|Q8LAP8|DOF46_ARATH Dof zinc finger protein DOF4.6 OS=Arabidopsis thaliana GN=DOF4.6 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query492
255569414506 conserved hypothetical protein [Ricinus 0.975 0.948 0.649 1e-164
224123914494 predicted protein [Populus trichocarpa] 0.981 0.977 0.616 1e-163
225439183494 PREDICTED: dof zinc finger protein DOF5. 0.983 0.979 0.613 1e-160
224144860493 f-box family protein [Populus trichocarp 0.979 0.977 0.608 1e-156
296085888465 unnamed protein product [Vitis vinifera] 0.928 0.982 0.592 1e-149
356571789483 PREDICTED: dof zinc finger protein DOF5. 0.949 0.966 0.542 1e-139
255635928483 unknown [Glycine max] 0.949 0.966 0.540 1e-138
449438006503 PREDICTED: dof zinc finger protein DOF5. 0.934 0.914 0.526 1e-129
356506208482 PREDICTED: LOW QUALITY PROTEIN: dof zinc 0.947 0.966 0.519 1e-126
449533128477 PREDICTED: dof zinc finger protein DOF5. 0.892 0.920 0.530 1e-122
>gi|255569414|ref|XP_002525674.1| conserved hypothetical protein [Ricinus communis] gi|223534974|gb|EEF36657.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  583 bits (1504), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 320/493 (64%), Positives = 365/493 (74%), Gaps = 13/493 (2%)

Query: 11  KDPAIKLFGAKIPVPDTRIP------DVCKQITKEEAEDNHEYNSEEPDKSSSLNNVKEG 64
           KDPA KLFG KIPVPDT+IP      + C    K E +   E N+ E DK S+    KE 
Sbjct: 16  KDPAFKLFGRKIPVPDTQIPAKPEFKNSCNDSIKGETDIYDEDNTGEQDKLSATGQTKEE 75

Query: 65  ARNFVQANGIQANAKTKADQSESN-TDQEKAFKRPDKIIPCPRCNSMDTKFCYFNNYNVN 123
            +   Q N  +A +  K DQ E+N TDQEK FK+PDKI+PCPRCNS+DTKFCYFNNYNVN
Sbjct: 76  RQTPKQVNE-EAQSGPKDDQVETNSTDQEKVFKKPDKILPCPRCNSLDTKFCYFNNYNVN 134

Query: 124 QPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVPVTGLETPNSAN 183
           QPRHFCKNCQRYWTAGGTMRNVP+GAGRRKNKH ASQYRQILVSSDGVP+T +E   S N
Sbjct: 135 QPRHFCKNCQRYWTAGGTMRNVPIGAGRRKNKHLASQYRQILVSSDGVPITRIENSESTN 194

Query: 184 QQLMLCGDSATTFRPS-TGNGTVLKFGPETPLCESMENVLNLRDQKRSVEICTVTCVENK 242
            QL+   +SATT RPS TGN  VLKFG E PLCESME VLNL DQKR  E+ +  C +N 
Sbjct: 195 HQLISSVESATTLRPSTTGNEVVLKFGSEAPLCESMETVLNLGDQKRYAEMSSANCRDNG 254

Query: 243 EEPSSSGSPVTASSIRGNELPESVMQKEQVMMTGSCNELNMQQSMHCYPVSPWGVPWNP- 301
           EEPSS GS +T+SS RGNELPE+V QKE V M GS NELN+  S+HCYPV PW  PWNP 
Sbjct: 255 EEPSSCGSSMTSSSCRGNELPETVTQKELVGMPGSTNELNVPHSLHCYPVPPWVFPWNPN 314

Query: 302 -NNVASMEAAQNPSDRVCVPNGSNPGPVQWCPTTVLAVPGFCAPNIPLQFVPASYWG-MG 359
            NN  S+  AQ+ + + C+PN +N   VQWCPT +LAVP FC P+IPLQ VPASY G M 
Sbjct: 315 WNNATSITTAQHSTGQPCMPNSNNSNQVQWCPTPILAVPSFCPPSIPLQLVPASYLGCMP 374

Query: 360 VWAAGAGNIPLTGSNGCLSPSSSTSNNSCCSGNGSPTLGKHSRDPNLTDEGQSEKCVLVP 419
           VW AGAGNI + GSNGC S  SS+++ SCCSGNGSP LGKHSRD NL  E Q E CVLVP
Sbjct: 375 VWGAGAGNISVNGSNGCSS-PSSSTSTSCCSGNGSPKLGKHSRDANLMGEEQVENCVLVP 433

Query: 420 KTLRIDDPDEASRSPIWATLGFKPDQKDLKSGGAIFKDFETKKGGKADVLDGAEILESNP 479
           KTLRIDDP+EAS+SP+WATLG K DQK+  S G IFK  ETK+ GK  V + A ILE+NP
Sbjct: 434 KTLRIDDPNEASKSPLWATLGLKLDQKNPASKGTIFKTSETKEEGKGHVPESAHILEANP 493

Query: 480 AALSRSHTFQEST 492
           AALSRS TFQES+
Sbjct: 494 AALSRSTTFQESS 506




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224123914|ref|XP_002319195.1| predicted protein [Populus trichocarpa] gi|222857571|gb|EEE95118.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225439183|ref|XP_002275127.1| PREDICTED: dof zinc finger protein DOF5.2-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|224144860|ref|XP_002325441.1| f-box family protein [Populus trichocarpa] gi|222862316|gb|EEE99822.1| f-box family protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|296085888|emb|CBI31212.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356571789|ref|XP_003554055.1| PREDICTED: dof zinc finger protein DOF5.2-like [Glycine max] Back     alignment and taxonomy information
>gi|255635928|gb|ACU18311.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|449438006|ref|XP_004136781.1| PREDICTED: dof zinc finger protein DOF5.2-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356506208|ref|XP_003521879.1| PREDICTED: LOW QUALITY PROTEIN: dof zinc finger protein DOF3.3-like [Glycine max] Back     alignment and taxonomy information
>gi|449533128|ref|XP_004173529.1| PREDICTED: dof zinc finger protein DOF5.2-like, partial [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query492
TAIR|locus:2079152448 CDF3 "cycling DOF factor 3" [A 0.581 0.638 0.366 6.1e-66
TAIR|locus:2164860457 CDF2 "cycling DOF factor 2" [A 0.347 0.374 0.465 1.4e-60
TAIR|locus:2154079298 CDF1 "AT5G62430" [Arabidopsis 0.184 0.305 0.649 1.4e-53
TAIR|locus:2007151399 AT1G69570 [Arabidopsis thalian 0.327 0.403 0.426 4.4e-49
TAIR|locus:2030006175 AT1G29160 [Arabidopsis thalian 0.182 0.514 0.621 1.3e-29
TAIR|locus:2055506170 AT2G34140 [Arabidopsis thalian 0.152 0.441 0.763 7.1e-29
TAIR|locus:2079492324 DAG1 "dof affecting germinatio 0.270 0.410 0.408 1.3e-23
TAIR|locus:2144030257 TMO6 "TARGET OF MONOPTEROS 6" 0.182 0.350 0.478 3.2e-23
TAIR|locus:2032153325 AT1G28310 [Arabidopsis thalian 0.203 0.307 0.46 6.2e-23
TAIR|locus:2053175340 AT2G28810 [Arabidopsis thalian 0.266 0.385 0.426 8e-23
TAIR|locus:2079152 CDF3 "cycling DOF factor 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 453 (164.5 bits), Expect = 6.1e-66, Sum P(2) = 6.1e-66
 Identities = 114/311 (36%), Positives = 152/311 (48%)

Query:     6 MSAPIKDPAIKLFGAKIPVPDTRIPDVCKQITKEEAEDNHEYNSEE---PDKSS-SLNNV 61
             M    +DPAIKLFG KIP P      V  +  +E+     +  S E   P+ S  + NN 
Sbjct:     1 MMMETRDPAIKLFGMKIPFPSVFESAVTVEDDEEDDWSGGDDKSPEKVTPELSDKNNNNC 60

Query:    62 KEGARNFVQANGIQANAKTKADQSESNTDQE---------KAFKRPDKIIPCPRCNSMDT 112
              + + N  +   +     T  DQ ES+   E         K  K+P KI+PCPRC SM+T
Sbjct:    61 NDNSFNNSKPETLDKEEATSTDQIESSDTPEDNQQTTPDGKTLKKPTKILPCPRCKSMET 120

Query:   113 KFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFASQYRQILVSSDGVP 172
             KFCY+NNYN+NQPRHFCK CQRYWTAGGTMRNVPVGAGRRKNK  +S YR I +S + + 
Sbjct:   121 KFCYYNNYNINQPRHFCKACQRYWTAGGTMRNVPVGAGRRKNKSSSSHYRHITIS-EALE 179

Query:   173 VTGLETPNSANQQLMLCGDSATTFRPSTGNGTVLKFGPETPLCESMENVLN-LRDQKRSV 231
                L+    AN +++  G  A     +     V+K   +  +     N  + L DQ+   
Sbjct:   180 AARLDPGLQANTRVLSFGLEAQQQHVAAPMTPVMKLQEDQKVSNGARNRFHGLADQR--- 236

Query:   232 EICTVTCVENKEEPXXXXXXXXXXXIRGNELPESVMQKEQVMMTGSCNELNMQQSMH-CY 290
                 V  VEN ++               + + ES  Q   V+     N  N   + + C 
Sbjct:   237 ---LVARVENGDDCSSGSSVTTS---NNHSVDESRAQSGSVVEAQMNNNNNNNMNGYACI 290

Query:   291 PVSPWGVPWNP 301
             P  PW   WNP
Sbjct:   291 PGVPWPYTWNP 301


GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;TAS
GO:0008270 "zinc ion binding" evidence=IEA
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0005515 "protein binding" evidence=IPI
GO:0048510 "regulation of timing of transition from vegetative to reproductive phase" evidence=IMP
TAIR|locus:2164860 CDF2 "cycling DOF factor 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2154079 CDF1 "AT5G62430" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2007151 AT1G69570 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2030006 AT1G29160 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2055506 AT2G34140 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079492 DAG1 "dof affecting germination 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2144030 TMO6 "TARGET OF MONOPTEROS 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2032153 AT1G28310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2053175 AT2G28810 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q8LFV3CDF3_ARATHNo assigned EC number0.36930.85770.9419yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query492
pfam0270163 pfam02701, zf-Dof, Dof domain, zinc finger 3e-39
>gnl|CDD|111583 pfam02701, zf-Dof, Dof domain, zinc finger Back     alignment and domain information
 Score =  136 bits (343), Expect = 3e-39
 Identities = 47/63 (74%), Positives = 57/63 (90%)

Query: 97  RPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKH 156
           +PDK + CPRC+SM+TKFCY+NNYN+NQPR+FCKNC+RYWTAGG +RNVPVG GRRKNK 
Sbjct: 1   KPDKALKCPRCDSMNTKFCYYNNYNLNQPRYFCKNCRRYWTAGGALRNVPVGGGRRKNKR 60

Query: 157 FAS 159
            +S
Sbjct: 61  SSS 63


The Dof domain is a zinc finger DNA-binding domain, that shows resemblance to the Cys2 zinc finger. Length = 63

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 492
PF0270163 zf-Dof: Dof domain, zinc finger; InterPro: IPR0038 100.0
TIGR02159146 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subu 87.51
PF1276046 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; Int 86.39
COG3677129 Transposase and inactivated derivatives [DNA repli 84.2
PF0381136 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR0 84.15
smart0044040 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding mo 80.34
>PF02701 zf-Dof: Dof domain, zinc finger; InterPro: IPR003851 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
Probab=100.00  E-value=4.6e-36  Score=239.03  Aligned_cols=63  Identities=67%  Similarity=1.417  Sum_probs=60.9

Q ss_pred             CCCCccCCCCCCCCCCceeeecccCCCCCccccccchhhhccCCcccccccCcccccCCCCcc
Q 011169           97 RPDKIIPCPRCNSMDTKFCYFNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKNKHFAS  159 (492)
Q Consensus        97 ~p~~~~~CPrc~s~~tkfcyynny~~~qpR~~Ck~C~RywT~GG~lRnvPvGgG~RKnk~s~s  159 (492)
                      +|++.++||||+|++|||||||||++.||||||++|+||||+||+|||||||||+||+|+++|
T Consensus         1 ~~~~~~~CPRC~S~nTKFcYyNNy~~~QPR~~Ck~C~rywT~GG~lRnVPvggg~Rk~k~~~s   63 (63)
T PF02701_consen    1 KPEQPLPCPRCDSTNTKFCYYNNYNLSQPRYFCKSCRRYWTHGGTLRNVPVGGGCRKNKRSSS   63 (63)
T ss_pred             CCccCCCCCCcCCCCCEEEeecCCCCCCcchhhHHHHHHHHhcceecCCccCCCcccCCcCCC
Confidence            588999999999999999999999999999999999999999999999999999999999764



Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry consists of proteins containing a Dof domain, which is a zinc finger DNA-binding domain that shows resemblance to the Cys2 zinc finger, although it has a longer putative loop where an extra Cys residue is conserved []. AOBP, a DNA-binding protein in pumpkin (Cucurbita maxima), contains a 52 amino acid Dof domain, which is highly conserved in several DNA-binding proteins of higher plants. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent

>TIGR02159 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subunit Back     alignment and domain information
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc Back     alignment and domain information
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair] Back     alignment and domain information
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases Back     alignment and domain information
>smart00440 ZnF_C2C2 C2C2 Zinc finger Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query492
d1twfi272 RBP9 subunit of RNA polymerase II {Baker's yeast ( 89.4
d1tfia_50 Transcriptional factor SII, C-terminal domain {Hum 82.35
>d1twfi2 g.41.3.1 (I:50-121) RBP9 subunit of RNA polymerase II {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
class: Small proteins
fold: Rubredoxin-like
superfamily: Zinc beta-ribbon
family: Transcriptional factor domain
domain: RBP9 subunit of RNA polymerase II
species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=89.40  E-value=0.2  Score=38.86  Aligned_cols=51  Identities=18%  Similarity=0.444  Sum_probs=37.3

Q ss_pred             chhccccCCCCCccCCCCCCCCCCceeeecccCCCCCc---cccccchhhhccCCc
Q 011169           89 TDQEKAFKRPDKIIPCPRCNSMDTKFCYFNNYNVNQPR---HFCKNCQRYWTAGGT  141 (492)
Q Consensus        89 ~~~~~~~~~p~~~~~CPrc~s~~tkfcyynny~~~qpR---~~Ck~C~RywT~GG~  141 (492)
                      ..++.+|++-  ...||+|...+.-|-+.+-....-|-   |.|.+|.-.|+..++
T Consensus        13 ~~~DPtlp~t--~~~CpkCg~~~a~~~q~QtRsaDE~mT~Fy~C~~C~h~Wr~~~~   66 (72)
T d1twfi2          13 IGSDPTLPRS--DRECPKCHSRENVFFQSQQRRKDTSMVLFFVCLSCSHIFTSDQK   66 (72)
T ss_dssp             GGGCTTSCCC--CCCCTTTCCCCEEEEECSSCCTTCCCCEEEEETTTCCEEECCTT
T ss_pred             cccCCCCCcc--CCCCCCCCCCeEEEEEeecCccCCCceEEEEcCCCCCCcccccc
Confidence            4556677654  35999999877777666665544443   899999999998655



>d1tfia_ g.41.3.1 (A:) Transcriptional factor SII, C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure