Citrus Sinensis ID: 004618


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-
MLLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRKTCTNAPMSQIGAKDGKLRSSVSSLRCNQVLDLEKEPICDRPNGDEKPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEVQATQNYPRHVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESASAKLKMELKATDHEKNSEENELHDSNKTKNRKQVDRSSCGSNTPSSSEVETDALEKQEKGKEESKEIDPSLPATDPGNRRSRSSSNITDSWKEVSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSLFDLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGKLKARRTGFKPYKRCSVEAKENRILNTGNQAEEKCPKRIRVEGATT
ccHHHHHHHHHHHHHHHccccccEEEEEEccccHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccHHccccccccccccccccccccccccccccccccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccccEEEEccccc
ccHHHHHHHHHHHHHHHHHccHHHHHHHHccHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccEEEEEEccccccccccccccccccccccccccccccccHHHccccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHccccHHHHHHHHHHHHcHHHHHHHHHHHHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccHHHHHHHcccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccccccccccccHHHHcHHHHHHHHHHHHHHccccccccccHcccccccccccccccccccccccEEEEccccccccccccHHHcccccccccccccccEEEEEcccccccccccccccccEEEEEEEccccccccccccccccEEEEEccccc
MLLFSIFDLIGDLSRLFFLCFTWFraehigtkkAVQIRSHAQKFFSKLEKEALskgvpigqaididippprpkrkprnpyprktctnapmsqigakdgklrsSVSSLrcnqvldlekepicdrpngdekptytiesqkdncsEVFILHQeahcssvssvnknsmptpvglrdscnlrefvpslkevvnqdetreSYVTVELKAnekfgkpdaklalqdngsskplnlenacpsheksvhgekrddvadalptaevqatqnyprhvnvhildgslgtgtqspsdmpmqesifhpigevhgnpnlftnpaasattenesnvpksthqsfptfhppftqfrhdqddyrsFLHISSAFSSLIVSTLLQNPAAHAAASFaatfwpytnmetsadsptcpqggflsrqmssppsMAAIAAATVAAATAWWAAhgllplcapfhapficppacatavssmdtaqvpaaktdkndntppnpalkdhqpdleYSEALQAQNSASKSLTVLTSDSEESASAKLKMELKAtdheknseenelhdsnktknrkqvdrsscgsntpssseveTDALEKqekgkeeskeidpslpatdpgnrrsrsssnitdswkEVSEVGRLAFQALfsrevlpqsfspphdlkdkmqqdnvedkqngnkkdgdkslfdlnsktwgscfghqevekstvsgvennggeglltiglghgklkarrtgfkpykrcsveakenrilntgnqaeekcpkrirvegatt
MLLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKLEKEalskgvpigqaididippprpkrkprnpyprktctnapmsqigakdgklrsSVSSLRCnqvldlekepicdrpngDEKPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPslkevvnqdetreSYVTVelkanekfgkpdaKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEvqatqnyprhVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPAaktdkndntppnPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESASAKLKMELkatdheknseenelhdsnktknrkqvdrsscgsntpsssevetdalekqekgkeeskeidpslpatdpgnrrsrsssnitdswkevSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQDNVEdkqngnkkdgdkSLFDLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTiglghgklkarrtgfkpykrcsveakenrilntgnqaeekcpkrirvegatt
MLLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIppprpkrkprnpyprkTCTNAPMSQIGAKDGKLRSSVSSLRCNQVLDLEKEPICDRPNGDEKPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEVQATQNYPRHVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPaahaaasfaaTFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMaaiaaatvaaatawwaaHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESASAKLKMELKATDHEKNSEENELHDSNKTKNRKQVDRSSCGSNTPSSSEVETDALekqekgkeeskeIDPSLPATDPGNRRSRSSSNITDSWKEVSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQDNVEdkqngnkkdgdkSLFDLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGKLKARRTGFKPYKRCSVEAKENRILNTGNQAEEKCPKRIRVEGATT
*LLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKL********************************************************************************************CSEVFILHQEAHC********************CNLREFVPSL*****************************************************************************YPRHVNVHILD**************************************************************FTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTN***************************AIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAV**********************************************************************************************************************************************************EVGRLAFQALFS****************************************LNSKTWGSCFGHQEV*******V*NNGGEGLLTIGLGHGKLKARRTGFKPYKRCSV*****************************
*LLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKF***************************PK********************************************************************************************************************************************************************************************PRHVNVH***********************************************************************************************************************************************IAAATVAAATAWWAAHGLLPLCAPFHAPFICP******************************************************************************************************************************************************************VGRLAFQALFSREVL*****************************************WG***********************************ARRTGFKPYKRCSVE*********************RVE****
MLLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRKTCTNAPMSQIG***********SLRCNQVLDLEKEPICDRPNGDEKPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLEN***************DVADALPTAEVQATQNYPRHVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAA****************SFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNME********PQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESASAKLKMELKA*****************************************************************************TDSWKEVSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQ***********KDGDKSLFDLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGKLKARRTGFKPYKRCSVEAKENRILNTGNQAEEKCPKRIRVEGATT
MLLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRK******************************************************************************************************E**PSLKEVVN**ETRESYVTVELKAN********************************************ALPTAEVQATQNYPRHVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAAS***************SFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNME*******************SPPSMAAIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAVSSMDT************************************************************************************************************************************************WKEVSEVGRLAFQALFSREVLPQSFSPP************************KSLFDL*SKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGKLKARRTGFKPYKRCSVEAKEN*************PK*********
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLLFSIFDLIGDLSRLFFLCFTWFRAEHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRKTCTNAPMSQIGAKDGKLRSSVSSLRCNQVLDLEKEPICDRPNGDEKPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEVQATQNYPRHVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESAxxxxxxxxxxxxxxxxxxxxxLHDSNKTKNRKQVDRSSCGSNTPSSSEVETDALEKQEKGKEESKEIDPSLPATDPGNRRSRSSSNITDSWKEVSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSLFDLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGKLKARRTGFKPYKRCSVEAKENRILNTGNQAEEKCPKRIRVEGATT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query741 2.2.26 [Sep-21-2011]
Q6R0H1645 Protein LHY OS=Arabidopsi yes no 0.793 0.911 0.432 3e-94
P92973608 Protein CCA1 OS=Arabidops no no 0.719 0.876 0.369 5e-67
Q54IF9423 Myb-like protein G OS=Dic yes no 0.068 0.120 0.451 1e-06
Q6R0H0287 Transcription factor ASG4 no no 0.060 0.156 0.454 4e-06
>sp|Q6R0H1|LHY_ARATH Protein LHY OS=Arabidopsis thaliana GN=LHY PE=1 SV=2 Back     alignment and function desciption
 Score =  347 bits (889), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 322/744 (43%), Positives = 405/744 (54%), Gaps = 156/744 (20%)

Query: 13  LSRLFFLCFTWFRAE-HIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPR 71
           L  L      W R E HIGTK AVQIRSHAQKFF+KLEKEA  KG+P+ QA+DI+IPPPR
Sbjct: 36  LEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFFTKLEKEAEVKGIPVCQALDIEIPPPR 95

Query: 72  PKRKPRNPYPRKTCTNAPMS-QIG-AKDGKLRSSVSSLRCNQV-LDLEKEPICDRPNGDE 128
           PKRKP  PYPRK   N   S Q+  AKD KL SS SS + NQ  LDLEK P        E
Sbjct: 96  PKRKPNTPYPRKPGNNGTSSSQVSSAKDAKLVSSASSSQLNQAFLDLEKMPFS------E 149

Query: 129 KPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVN 188
           K +   E+Q +NCS             VS+VNK  +PT                 K+V  
Sbjct: 150 KTSTGKENQDENCS------------GVSTVNKYPLPT-----------------KQVSG 180

Query: 189 QDETRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVAD 248
             ET                             SK   ++NA     + V  + +D   D
Sbjct: 181 DIET-----------------------------SKTSTVDNAV----QDVPKKNKD--KD 205

Query: 249 ALPTAEVQATQNYPRHVNVHILDGSLGTGTQS-PSDMPMQESIFHPIGE-VHGNPNLFTN 306
                 V + QNYP H +  I++G++    Q+ PS M  Q+ +FHP+ E  HG+ NL   
Sbjct: 206 GNDGTTVHSMQNYPWHFHADIVNGNIAKCPQNHPSGMVSQDFMFHPMREETHGHANLQAT 265

Query: 307 PAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNP 366
            A++ TT        ++HQ+FP  H         QDDYRSFL ISS FS+LI+STLLQNP
Sbjct: 266 TASATTT--------ASHQAFPACH--------SQDDYRSFLQISSTFSNLIMSTLLQNP 309

Query: 367 AAHAAASFAATFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAA 426
           AAHAAA+FAA+ WPY ++  S DS         +   SSPPS+ AIAAATVAAATAWWA+
Sbjct: 310 AAHAAATFAASVWPYASVGNSGDS--------STPMSSSPPSITAIAAATVAAATAWWAS 361

Query: 427 HGLLPLCAPFHAPFICPPACATAVSSMDTAQVPA-AKTDKNDNTPPNPALKDHQPDLEYS 485
           HGLLP+CAP  AP  C P    AV +      PA  + D  +NT         QP  + +
Sbjct: 362 HGLLPVCAP--APITCVPFSTVAVPT------PAMTEMDTVENT---------QPFEKQN 404

Query: 486 EALQAQNSASKSLTVLTSDSEESASAKLKMELKATDH--EKNSEENELHDSNKTKNRKQV 543
            ALQ QN ASKS    + DS+E+   KL  + K  D   E+      +HDSN  + +  V
Sbjct: 405 TALQDQNLASKSPASSSDDSDETGVTKLNADSKTNDDKIEEVVVTAAVHDSNTAQKKNLV 464

Query: 544 DRSSCGSNTPSSSEVETDALEKQEKGKEESKEIDPSLP-ATDPGNRRSRSSSN------I 596
           DRSSCGSNTPS S+ ETDAL+K EK KE+ KE D + P   +  NR+ +   N       
Sbjct: 465 DRSSCGSNTPSGSDAETDALDKMEKDKEDVKETDENQPDVIELNNRKIKMRDNNSNNNAT 524

Query: 597 TDSWKEVSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSL-F 655
           TDSWKEVSE GR+AFQALF+RE LPQSFSPP            +  +N N+K  D S+  
Sbjct: 525 TDSWKEVSEEGRIAFQALFARERLPQSFSPP------------QVAENVNRKQSDTSMPL 572

Query: 656 DLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGK-LKARRTGFKPYKRCSVEA 714
             N K+  SC   Q               EG++ IG+G  K LK R+TGFKPYKRCS+E 
Sbjct: 573 APNFKSQDSCAADQ---------------EGVVMIGVGTCKSLKTRQTGFKPYKRCSMEV 617

Query: 715 KENRILNTGNQAEEKCPKRIRVEG 738
           KE+++ N  NQ++EK  KR+R+EG
Sbjct: 618 KESQVGNINNQSDEKVCKRLRLEG 641




Transcription factor involved in the circadian clock. Binds to the promoter region of APRR1/TOC1 and TCP21/CHE to repress their transcription. Represses both CCA1 and itself.
Arabidopsis thaliana (taxid: 3702)
>sp|P92973|CCA1_ARATH Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1 Back     alignment and function description
>sp|Q54IF9|MYBG_DICDI Myb-like protein G OS=Dictyostelium discoideum GN=mybG PE=3 SV=1 Back     alignment and function description
>sp|Q6R0H0|ASG4_ARATH Transcription factor ASG4 OS=Arabidopsis thaliana GN=ASG4 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query741
255548073768 conserved hypothetical protein [Ricinus 0.975 0.941 0.671 0.0
51980218768 late elongated hypocotyl [Castanea sativ 0.971 0.937 0.662 0.0
225453650771 PREDICTED: protein LHY-like [Vitis vinif 0.968 0.931 0.645 0.0
147856747 857 hypothetical protein VITISV_012722 [Viti 0.968 0.837 0.645 0.0
219687743768 transcription factor LHY [Populus nigra] 0.977 0.942 0.648 0.0
219687745764 transcription factor LHY [Populus nigra] 0.973 0.943 0.631 0.0
118488480764 unknown [Populus trichocarpa] 0.973 0.943 0.634 0.0
224128077750 predicted protein [Populus trichocarpa] 0.954 0.942 0.624 0.0
224064384710 predicted protein [Populus trichocarpa] 0.935 0.976 0.641 0.0
356520209750 PREDICTED: protein LHY [Glycine max] 0.946 0.934 0.605 0.0
>gi|255548073|ref|XP_002515093.1| conserved hypothetical protein [Ricinus communis] gi|223545573|gb|EEF47077.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  808 bits (2087), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 491/731 (67%), Positives = 559/731 (76%), Gaps = 8/731 (1%)

Query: 13  LSRLFFLCFTWFRAE-HIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPR 71
           L  L      W R E HIGTK AVQIRSHAQKFFSKLEKEA++KGVPIGQA+DIDIPPPR
Sbjct: 37  LEALKLYGRAWQRIEEHIGTKTAVQIRSHAQKFFSKLEKEAIAKGVPIGQALDIDIPPPR 96

Query: 72  PKRKPRNPYPRKTCTNAPMSQIGAKDGKLRSSVSSLRCNQVLDLEKEPICDRPNGDEKPT 131
           PKRKP NPYPRKT      SQ+ AKDGKL S VS  RC QVLDLEKEP+ +R NG EK T
Sbjct: 97  PKRKPSNPYPRKTGAGPTPSQVAAKDGKLPSLVSFPRCTQVLDLEKEPLPERLNGHEKQT 156

Query: 132 YTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDE 191
              E+Q DNCSEVF L QEAHCSSVSS NKNS+ T   L++SC+ REFVPSLK+VVNQD 
Sbjct: 157 DATENQGDNCSEVFTLLQEAHCSSVSSANKNSVVTAEALKNSCSFREFVPSLKKVVNQDA 216

Query: 192 TRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALP 251
           T ESYVT+EL+ N+K  KPDAK  +QDNGSSK    E +C  HEK    +K D+   ALP
Sbjct: 217 TNESYVTIELEGNQKLDKPDAKQTVQDNGSSKASKSE-SCLFHEKFDQAKKSDEFNSALP 275

Query: 252 TAEVQATQNYPRHVNVHILDGSLGTGTQSP-SDMPMQESIFHPIGEVHGNPNLFTNPAAS 310
           T E++  Q YPRHV VH+L+GSLGT  Q+P SD+  QE IF P GEVHG+PNL+++PAAS
Sbjct: 276 TDEMETMQGYPRHVPVHVLEGSLGTCMQTPTSDVSFQEPIFCPTGEVHGHPNLYSHPAAS 335

Query: 311 ATTENESNVPKS-THQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAH 369
           ATTE+++  P+S THQSFP+F PPF   +H+QDDYRSFLHISS FSSLIVSTLLQNPAAH
Sbjct: 336 ATTEHQNTAPRSSTHQSFPSF-PPFNSTQHNQDDYRSFLHISSTFSSLIVSTLLQNPAAH 394

Query: 370 AAASFAATFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGL 429
           +AASFAATFWPY N+E+SADSP CPQGGF SRQM+S PSMAAIAAATVAAATAWWAAHGL
Sbjct: 395 SAASFAATFWPYANLESSADSPICPQGGFPSRQMNSAPSMAAIAAATVAAATAWWAAHGL 454

Query: 430 LPLCAPFHAPFICPPACATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQ 489
           LPLCAP HA F CPPA  TAV+S    QVPAAKT++   T  NP L++ Q D+E+S+ LQ
Sbjct: 455 LPLCAPLHAAFTCPPASGTAVASTGAGQVPAAKTERK-LTVENPLLQNQQFDVEHSKVLQ 513

Query: 490 AQNSASKSLTVLTSDSEESASAKLKMELKATDHEKNSEENELHDSNKTKNRKQVDRSSCG 549
           AQNSASKSL +  SDSEES   K     KATDHE  +   E+ D +K K RK  DRSSCG
Sbjct: 514 AQNSASKSLEMSLSDSEESGGPKKNTGSKATDHEMATPAPEVQDPSKAKARKPADRSSCG 573

Query: 550 SNTPSSSEVETDALEKQEKGKEESKEIDPSLPATDPGNRRSRSSSNITDSWKEVSEVGRL 609
           SNT SSSEVETDALEK EKG EE KE D +   T+   RRSRS+S+I+DSWKEVSE GRL
Sbjct: 574 SNTSSSSEVETDALEKLEKGNEELKETDTNPEPTESSCRRSRSNSSISDSWKEVSEEGRL 633

Query: 610 AFQALFSREVLPQSFSPPHDLKDKM-QQDNV-EDKQNGNKKDGDKSLFDLNSKTWGSCFG 667
           AFQALFSREVLPQSFSPPH LK++  Q+D + EDKQN  +K+ +  L  LN    G C  
Sbjct: 634 AFQALFSREVLPQSFSPPHVLKNEARQKDEIEEDKQNTVEKNENALLLSLNGNISGFCTS 693

Query: 668 HQEVEKSTVSGVENNGGEGLLTIGLGHGKLKARRTGFKPYKRCSVEAKENRILNTGNQAE 727
           HQE EK  +   ENNG +GLLT GLGHGKLKARRTGFKPYKRCSVEAKENR+L  G+Q E
Sbjct: 694 HQEAEKIEMPRCENNGEDGLLTFGLGHGKLKARRTGFKPYKRCSVEAKENRMLTAGSQGE 753

Query: 728 EKCPKRIRVEG 738
           EK PKRIRVEG
Sbjct: 754 EKGPKRIRVEG 764




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|51980218|gb|AAU20773.1| late elongated hypocotyl [Castanea sativa] Back     alignment and taxonomy information
>gi|225453650|ref|XP_002267720.1| PREDICTED: protein LHY-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|147856747|emb|CAN81352.1| hypothetical protein VITISV_012722 [Vitis vinifera] Back     alignment and taxonomy information
>gi|219687743|dbj|BAH09382.1| transcription factor LHY [Populus nigra] gi|219687747|dbj|BAH09384.1| PnLHY1 [Populus nigra] Back     alignment and taxonomy information
>gi|219687745|dbj|BAH09383.1| transcription factor LHY [Populus nigra] gi|219687749|dbj|BAH09385.1| PnLHY2 [Populus nigra] Back     alignment and taxonomy information
>gi|118488480|gb|ABK96054.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224128077|ref|XP_002320238.1| predicted protein [Populus trichocarpa] gi|222861011|gb|EEE98553.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224064384|ref|XP_002301449.1| predicted protein [Populus trichocarpa] gi|222843175|gb|EEE80722.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356520209|ref|XP_003528756.1| PREDICTED: protein LHY [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query741
TAIR|locus:2200970645 LHY "LATE ELONGATED HYPOCOTYL" 0.469 0.539 0.351 4.3e-80
TAIR|locus:2044345608 CCA1 "circadian clock associat 0.398 0.485 0.322 8.6e-44
TAIR|locus:504955668336 AT3G10113 [Arabidopsis thalian 0.052 0.116 0.666 2.9e-05
TAIR|locus:2014114372 EPR1 "EARLY-PHYTOCHROME-RESPON 0.052 0.104 0.666 3.6e-05
TAIR|locus:2169135287 RVE2 "REVEILLE 2" [Arabidopsis 0.039 0.101 0.666 4.1e-05
TAIR|locus:2200970 LHY "LATE ELONGATED HYPOCOTYL" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 470 (170.5 bits), Expect = 4.3e-80, Sum P(3) = 4.3e-80
 Identities = 139/395 (35%), Positives = 193/395 (48%)

Query:   242 KRDDVADALPTAEVQATQNYPRHVNVHILDGSLGTGTQS-PSDMPMQESIFHPIGE-VHG 299
             K++   D      V + QNYP H +  I++G++    Q+ PS M  Q+ +FHP+ E  HG
Sbjct:   199 KKNKDKDGNDGTTVHSMQNYPWHFHADIVNGNIAKCPQNHPSGMVSQDFMFHPMREETHG 258

Query:   300 NPNLFTNPAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIV 359
             + NL    A++ TT        ++HQ+FP  H         QDDYRSFL ISS FS+LI+
Sbjct:   259 HANLQATTASATTT--------ASHQAFPACH--------SQDDYRSFLQISSTFSNLIM 302

Query:   360 STLLQNPXXXXXXXXXXTFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMXXXXXXXXXX 419
             STLLQNP          + WPY ++  S DS T P         SSPPS+          
Sbjct:   303 STLLQNPAAHAAATFAASVWPYASVGNSGDSST-PMS-------SSPPSITAIAAATVAA 354

Query:   420 XXXXXXXHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPA-AKTDKNDNTPP----NPA 474
                    HGLLP+CAP  AP  C P    AV +      PA  + D  +NT P    N A
Sbjct:   355 ATAWWASHGLLPVCAP--APITCVPFSTVAVPT------PAMTEMDTVENTQPFEKQNTA 406

Query:   475 LKDHQPDLEYSEALQAQNSASKSLTVLTSDSEESASAKLKMELKATDHEKNS-EENELHD 533
             L+D Q     S A  + +S    +T L +DS+ +     ++ + A  H+ N+ ++  L D
Sbjct:   407 LQD-QNLASKSPASSSDDSDETGVTKLNADSKTNDDKIEEVVVTAAVHDSNTAQKKNLVD 465

Query:   534 SNKTKNRKQVDRSSCGSNTPSSSEVETDALXXXXXXXXXXXXIDPSLPATDPGNRRSRSS 593
              +   +      S   + T +  ++E D              I+  L       R + S+
Sbjct:   466 RSSCGSNTP---SGSDAETDALDKMEKDKEDVKETDENQPDVIE--LNNRKIKMRDNNSN 520

Query:   594 SNIT-DSWKEVSEVGRLAFQALFSREVLPQSFSPP 627
             +N T DSWKEVSE GR+AFQALF+RE LPQSFSPP
Sbjct:   521 NNATTDSWKEVSEEGRIAFQALFARERLPQSFSPP 555


GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS;IDA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=RCA;TAS
GO:0009651 "response to salt stress" evidence=IEP;RCA
GO:0009723 "response to ethylene stimulus" evidence=IEP;RCA
GO:0009733 "response to auxin stimulus" evidence=IEP;RCA
GO:0009737 "response to abscisic acid stimulus" evidence=IEP;RCA
GO:0009739 "response to gibberellin stimulus" evidence=IEP;RCA
GO:0009751 "response to salicylic acid stimulus" evidence=IEP;RCA
GO:0009753 "response to jasmonic acid stimulus" evidence=IEP;RCA
GO:0046686 "response to cadmium ion" evidence=IEP
GO:0048574 "long-day photoperiodism, flowering" evidence=IGI;RCA
GO:0042752 "regulation of circadian rhythm" evidence=RCA;IMP
GO:0043433 "negative regulation of sequence-specific DNA binding transcription factor activity" evidence=IMP
GO:0044212 "transcription regulatory region DNA binding" evidence=IDA
GO:0007623 "circadian rhythm" evidence=IGI;RCA
GO:0009409 "response to cold" evidence=IGI
GO:0042754 "negative regulation of circadian rhythm" evidence=IMP
GO:0009639 "response to red or far red light" evidence=RCA
TAIR|locus:2044345 CCA1 "circadian clock associated 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955668 AT3G10113 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2014114 EPR1 "EARLY-PHYTOCHROME-RESPONSIVE1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2169135 RVE2 "REVEILLE 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 741
TIGR0155757 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF c 99.46
KOG0724335 consensus Zuotin and related molecular chaperones 98.92
PF0024948 Myb_DNA-binding: Myb-like DNA-binding domain; Inte 98.77
cd0016745 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding do 98.22
smart0071749 SANT SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-bindi 98.05
PF1392160 Myb_DNA-bind_6: Myb-like DNA-binding domain; PDB: 97.23
KOG0457438 consensus Histone acetyltransferase complex SAGA/A 96.57
COG5259531 RSC8 RSC chromatin remodeling complex subunit RSC8 96.15
PLN03162526 golden-2 like transcription factor; Provisional 95.53
KOG1279506 consensus Chromatin remodeling factor subunit and 94.36
PLN03212249 Transcription repressor MYB5; Provisional 93.26
COG5114432 Histone acetyltransferase complex SAGA/ADA, subuni 93.25
PLN03091459 hypothetical protein; Provisional 91.65
KOG4329445 consensus DNA-binding protein [General function pr 90.09
KOG0724335 consensus Zuotin and related molecular chaperones 89.28
smart0042668 TEA TEA domain. 87.19
PLN03212249 Transcription repressor MYB5; Provisional 86.31
PLN03091459 hypothetical protein; Provisional 85.19
KOG3841455 consensus TEF-1 and related transcription factor, 82.87
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class Back     alignment and domain information
Probab=99.46  E-value=2.6e-14  Score=117.09  Aligned_cols=47  Identities=23%  Similarity=0.101  Sum_probs=43.0

Q ss_pred             cchhhhHHHHHHHHHHhcCc-CC---cee-Ee-eccc-chHHHHHHHHHHHHHH
Q 004618            2 LLFSIFDLIGDLSRLFFLCF-TW---FRA-EH-IGTK-KAVQIRSHAQKFFSKL   48 (741)
Q Consensus         2 ~l~s~~eh~~FLeaL~lyGR-dW---kkI-e~-VgTR-T~~QIRSHAQKYF~Kl   48 (741)
                      +.|+.|||.+||+||+.||+ +|   ++| ++ +.|+ |..||+|||||||+|+
T Consensus         4 ~~WT~eeh~~Fl~ai~~~G~g~~a~pk~I~~~~~~~~lT~~qV~SH~QKy~~k~   57 (57)
T TIGR01557         4 VVWTEDLHDRFLQAVQKLGGPDWATPKRILELMVVDGLTRDQVASHLQKYRLKQ   57 (57)
T ss_pred             CCCCHHHHHHHHHHHHHhCCCcccchHHHHHHcCCCCCCHHHHHHHHHHHHccC
Confidence            57999999999999999999 99   999 65 4588 9999999999999873



This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by Pfam model pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif.

>KOG0724 consensus Zuotin and related molecular chaperones (DnaJ superfamily), contains DNA-binding domains [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins Back     alignment and domain information
>cd00167 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains Back     alignment and domain information
>smart00717 SANT SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains Back     alignment and domain information
>PF13921 Myb_DNA-bind_6: Myb-like DNA-binding domain; PDB: 1A5J_A 1MBH_A 1GV5_A 1H89_C 1IDY_A 1MBK_A 1IDZ_A 1H88_C 1GVD_A 1MBG_A Back     alignment and domain information
>KOG0457 consensus Histone acetyltransferase complex SAGA/ADA, subunit ADA2 [Chromatin structure and dynamics] Back     alignment and domain information
>COG5259 RSC8 RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription] Back     alignment and domain information
>PLN03162 golden-2 like transcription factor; Provisional Back     alignment and domain information
>KOG1279 consensus Chromatin remodeling factor subunit and related transcription factors [Chromatin structure and dynamics] Back     alignment and domain information
>PLN03212 Transcription repressor MYB5; Provisional Back     alignment and domain information
>COG5114 Histone acetyltransferase complex SAGA/ADA, subunit ADA2 [Chromatin structure and dynamics] Back     alignment and domain information
>PLN03091 hypothetical protein; Provisional Back     alignment and domain information
>KOG4329 consensus DNA-binding protein [General function prediction only] Back     alignment and domain information
>KOG0724 consensus Zuotin and related molecular chaperones (DnaJ superfamily), contains DNA-binding domains [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>smart00426 TEA TEA domain Back     alignment and domain information
>PLN03212 Transcription repressor MYB5; Provisional Back     alignment and domain information
>PLN03091 hypothetical protein; Provisional Back     alignment and domain information
>KOG3841 consensus TEF-1 and related transcription factor, TEAD family [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query741
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-09
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-05
2cu7_A72 KIAA1915 protein; nuclear protein, SANT domain, DN 6e-09
2yum_A75 ZZZ3 protein, zinc finger ZZ-type-containing prote 9e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 59.1 bits (142), Expect = 5e-09
 Identities = 72/504 (14%), Positives = 135/504 (26%), Gaps = 143/504 (28%)

Query: 104 VSSLRCNQVLDLEK--------EPICDRPNGDEKPTY----TIESQKDNCSEVFILHQEA 151
           V +  C  V D+ K        + I    +     T     T+ S+++   + F+     
Sbjct: 30  VDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSG-TLRLFWTLLSKQEEMVQKFV----- 83

Query: 152 HCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKANEKFGK-- 209
               V  +N   + +P  ++         PS+                    N+ F K  
Sbjct: 84  --EEVLRINYKFLMSP--IKTEQR----QPSMMT-----RMYIEQRDRLYNDNQVFAKYN 130

Query: 210 ---PDAKLALQDNGSSKPLNLENACPSHEKSVHGE----KRDDVADALPTAEVQATQNYP 262
                  L L+         L    P+    + G     K     D   + +VQ   ++ 
Sbjct: 131 VSRLQPYLKLRQ-------ALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDF- 182

Query: 263 RHVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKS 322
               +  L  +L       + + M + + + I     +PN  +        ++ SN+   
Sbjct: 183 ---KIFWL--NLKNCNSPETVLEMLQKLLYQI-----DPNWTSRS------DHSSNIKLR 226

Query: 323 THQSFPTFHPPFTQFRHDQ-----DDYRSFLHISSAFS----------------SLIVST 361
            H              ++       + ++     +AF+                 L  +T
Sbjct: 227 IHSIQAELRRLLKSKPYENCLLVLLNVQNAKAW-NAFNLSCKILLTTRFKQVTDFLSAAT 285

Query: 362 LLQNPAAHAAASFAAT-----FWPYTNMETSADSPTCPQGGFLSRQMS--SPPSMAAIAA 414
                  H + +            Y +               L R++   +P  ++ IA 
Sbjct: 286 TTHISLDHHSMTLTPDEVKSLLLKYLDCRPQD----------LPREVLTTNPRRLSIIAE 335

Query: 415 ATVAAATAW--WAAHGLLPLCAPFHAPFICPPACATAVSSMDTAQVPAAKTDK------- 465
           +       W  W       L               T + S      PA            
Sbjct: 336 SIRDGLATWDNWKHVNCDKL--------------TTIIESSLNVLEPAEYRKMFDRLSVF 381

Query: 466 NDNTP-PNPALKDHQPDLEYSEALQ-AQNSASKSLTVLTSDSEESASA--KLKMELKATD 521
             +   P   L     D+  S+ +         SL  +    +ES  +   + +ELK   
Sbjct: 382 PPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYSL--VEKQPKESTISIPSIYLELKV-- 437

Query: 522 HEKNSEENELHDSNKTKNRKQVDR 545
             K   E  LH       R  VD 
Sbjct: 438 --KLENEYALH-------RSIVDH 452


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>2cu7_A KIAA1915 protein; nuclear protein, SANT domain, DNA binding, regulation of transcription, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.4.1.3 Length = 72 Back     alignment and structure
>2yum_A ZZZ3 protein, zinc finger ZZ-type-containing protein 3; transcription, structural genomics, NPPSFA; NMR {Homo sapiens} Length = 75 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query741
d1irza_64 Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId 99.07
d2cu7a165 MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 960 98.56
d1x41a147 Transcriptional adaptor 2-like, TADA2L, isoform b 98.15
d1xc5a168 Nuclear receptor corepressor 2 {Human (Homo sapien 97.9
d2iw5b165 REST corepressor 1, CoREST {Human (Homo sapiens) [ 97.89
d2crga157 Metastasis associated protein MTA3 {Mouse (Mus mus 97.74
d2cjja163 Radialis {Garden snapdragon (Antirrhinum majus) [T 97.43
d1guua_50 c-Myb, DNA-binding domain {Mouse (Mus musculus) [T 96.72
d2cqra160 DnaJ homolog subfamily C member 1 {Human (Homo sap 96.68
d1gv2a247 c-Myb, DNA-binding domain {Mouse (Mus musculus) [T 95.98
d1gvda_52 c-Myb, DNA-binding domain {Mouse (Mus musculus) [T 95.42
d2cqqa159 DnaJ homolog subfamily C member 1 {Human (Homo sap 94.17
d1w0ua_55 Telomeric repeat binding factor 2, TRF2 {Human (Ho 94.16
d1ofcx152 SANT domain of the nucleosome remodeling ATPase IS 92.93
d1w0ta_52 DNA-binding domain of human telomeric protein, hTR 92.47
d1igna186 DNA-binding domain of rap1 {Baker's yeast (Sacchar 91.68
d2ckxa183 Telomere binding protein TBP1 {Tobacco (Nicotiana 84.44
d1ug2a_95 2610100b20rik gene product {Mouse (Mus musculus) [ 82.28
>d1irza_ a.4.1.11 (A:) Arr10-B {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: GARP response regulators
domain: Arr10-B
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.07  E-value=2.3e-11  Score=90.97  Aligned_cols=50  Identities=20%  Similarity=0.107  Sum_probs=44.7

Q ss_pred             CCHHHHHHHHHHHHHHHCCCCC---CEE-EE--ECCCCHHHHHHHHHHHHHHHHHH
Q ss_conf             6300667999999987429687---605-84--01354679997799999999995
Q 004618            2 LLFSIFDLIGDLSRLFFLCFTW---FRA-EH--IGTKKAVQIRSHAQKFFSKLEKE   51 (741)
Q Consensus         2 ~l~s~~eh~~FLeaL~lyGRdW---kkI-e~--VgTRT~~QIRSHAQKYF~Kl~r~   51 (741)
                      +.|+.++|.+|++||..||++|   ++| ++  |.++|..||+||+||||+++.+.
T Consensus         8 ~~WT~elH~~Fv~Av~~lG~~~atpk~I~~~m~v~~lT~~qV~SHlQKYrl~l~k~   63 (64)
T d1irza_           8 VLWTHELHNKFLAAVDHLGVERAVPKKILDLMNVDKLTRENVASHLQKFRVALKKV   63 (64)
T ss_dssp             CSSCHHHHHHHHHHHHHHCTTTCCHHHHHHHHCCTTCCHHHHHHHHHHHHHHHHSC
T ss_pred             CCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHC
T ss_conf             98999999999999999585435367899870888887999998999999999804



>d2cu7a1 a.4.1.3 (A:8-72) MYSM1 (KIAA1915) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x41a1 a.4.1.1 (A:8-54) Transcriptional adaptor 2-like, TADA2L, isoform b {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1xc5a1 a.4.1.3 (A:413-480) Nuclear receptor corepressor 2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2iw5b1 a.4.1.3 (B:376-440) REST corepressor 1, CoREST {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2crga1 a.4.1.3 (A:8-64) Metastasis associated protein MTA3 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2cjja1 a.4.1.3 (A:8-70) Radialis {Garden snapdragon (Antirrhinum majus) [TaxId: 4151]} Back     information, alignment and structure
>d1guua_ a.4.1.3 (A:) c-Myb, DNA-binding domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2cqra1 a.4.1.3 (A:7-66) DnaJ homolog subfamily C member 1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1gv2a2 a.4.1.3 (A:144-190) c-Myb, DNA-binding domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1gvda_ a.4.1.3 (A:) c-Myb, DNA-binding domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2cqqa1 a.4.1.3 (A:8-66) DnaJ homolog subfamily C member 1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1w0ua_ a.4.1.4 (A:) Telomeric repeat binding factor 2, TRF2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1ofcx1 a.4.1.3 (X:799-850) SANT domain of the nucleosome remodeling ATPase ISWI {Fruit fly (Drosophila melanogaster) [TaxId: 7227]} Back     information, alignment and structure
>d1w0ta_ a.4.1.4 (A:) DNA-binding domain of human telomeric protein, hTRF1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1igna1 a.4.1.6 (A:360-445) DNA-binding domain of rap1 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d2ckxa1 a.4.1.3 (A:578-660) Telomere binding protein TBP1 {Tobacco (Nicotiana tabacum) [TaxId: 4097]} Back     information, alignment and structure
>d1ug2a_ a.4.1.3 (A:) 2610100b20rik gene product {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure