Citrus Sinensis ID: 004499


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------75
MGASSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKTIPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGSDVKSERTFCRSMQSLLTTEKKPESSSQSKNTNNSVSYSISQSSLVEEDAKHFLNSSEVCGAVSSKGFSSTCPSTCSEQLDRSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQSKTTSS
cccccHHHHHHHHHHHHccccccEEEEEEEEccccccEEEEEcccccccccccccHHccccHHHHHHccccccccccHHHHHccccEEEcccccEEEEEEEcccEEEEEccccccccccccccHHHHHHHHcccccEEEEEEEccccEEEEcccccccccHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccHHHccccccccccccccccccccccccccHHHHHHHHHHHccccccccccccccccccHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccEEEEEEcccccEEEEEEcccccHHHHHHHHHHHcccEEEEEEEEEEccEEEEEEEEEEccccccHHHHHHHHHHHHcccccccc
cccccHHHHHHHHHHHHHHccccEEEEEEEEcccccEEEEEccccccccccHHccccHHHHHHHHHHHccccccccccHHHHHHcHHccccccccccEEEEcccEEEEEcccccccccccHHcHHHHHHHHHcccEEEEEEEcccccEEEEcccccccccHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHcccccccccccccccccEEccccccccccEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccHHHHccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccHHHHHHcHHHccccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccEccccccccccccccccccccccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHccccccEEccccccccEEEEEEcccccEccEEEEcccccccEEEHEEHHHHcHHHHHHHHHHHcccEEEEEEEEEccccEEEEEEEEccccccccHHHHHHHHHHHHccccccc
mgassttfdLHGILKSLCFNTAWKYAVFWKLKHrtrmvltwedgyydncgqqdslenkcsseslenfhggryshdplGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDqlvtnscssfefsdgwqsqFSAGIRTIAVVAVVPhgvvqlgsldevteDMKVVTHIRDVFAALNDISVGHVSSTIQSSVkntlslpdlptktipnrwhnldevvnrggpdvqlpmfpyvekhndgsyafsgmqpkigdgvvnrnegillssaggvgsakilhpksnvinldyqnqmgIHFISDGmsrvessgwkdlgviseqngtpfsinSVIDSINLCSVALQAEKFvadrtylasnpleavlgeqvklectdscqngmlhipeisdikFEKDLEKLQNqtelnhldpsgmslKFSAVSELHEAlgpaflrkdiyndrepentvdgetvgmpeltssshlmfdsgseNLLDAVVASVCnsgsdvkseRTFCRSMQSLlttekkpesssqskntnnsvSYSISQSSLVEEDAKHFLNSsevcgavsskgfsstcpstcseqldrssepaknnkkrartgengrprprdRQLIQDRIKELRElvpngskcsIDSLLERTIKHMLFLQSITKHADKLSKCAEskmhqkgngihgsnyeqgSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKgvteahgdkTWICFVVEGQDNRIMHRMDVLWSLVQLLQSKTTSS
mgassttfdlhgiLKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQssvkntlslpdlpTKTIPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPaflrkdiyndrePENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCnsgsdvksertFCRSMQSLlttekkpesssqskntnnsvSYSISQSSLVEEDAKHFLNSSEVCGAVSSKGFSSTCPSTcseqldrssepaknnkkrartgengrprprdrqlIQDRIKElrelvpngskcSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVteahgdktwICFVVEGQDNRIMHRMDVLWSLVQllqskttss
MGASSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAvvavvphgvvQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKTIPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGSDVKSERTFCRSMQSLLTTEKKPESSSQSKNTNNsvsysisqssLVEEDAKHFLNSSEVCGAVSSKGFSSTCPSTCSEQLDRSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQSKTTSS
******TFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQ**************NFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKTIPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFE**************************V*ELHEALGPAFLRKDIY*********************************LLDAVVASVC**************************************************************************************************************************LVPNGSKCSIDSLLERTIKHMLFLQSITKHA**************************SSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLL*******
********DLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDN**************************DPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALND*************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************RIKELRELVPNGSKCSIDSLLERTIKHMLFLQS*************************************************ENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLV***Q******
********DLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKTIPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVC**********TFCRSMQSL*****************************VEEDAKHFLNSSEVCGA******************************************RDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQSKTTSS
***SSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDIS*********************************************************************GDGVV*RNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISD*MS**ESSGW***************INSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIP*********************H**PSGMSLKFSAVSELHEALGPAFLR********************************SGSENLLDAVVASVC**************************************************************CGAV*****S*********************************RPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQS*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGASSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKTIPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGSDVKSERTFCRSMQSLLTTEKKPESSSQSKNTNNSVSYSISQSSLVEEDAKHFLNSSEVCGAVSSKGFSSTCPSTCSEQLDRSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQSKTTSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query748 2.2.26 [Sep-21-2011]
P0C7P8734 Transcription factor EMB1 no no 0.943 0.961 0.466 1e-161
Q58G01720 Transcription factor bHLH no no 0.927 0.963 0.458 1e-160
Q9XIN0650 Transcription factor LHW no no 0.371 0.427 0.448 4e-57
Q7XJU0527 Transcription factor bHLH no no 0.228 0.324 0.502 3e-47
Q9ZPY8566 Transcription factor ABA- no no 0.195 0.257 0.273 1e-06
Q9FN69637 Transcription factor GLAB no no 0.237 0.279 0.223 0.0001
Q9ZVX2571 Transcription factor ABOR no no 0.255 0.334 0.260 0.0002
Q9FT81518 Transcription factor TT8 no no 0.208 0.301 0.247 0.0003
Q8S3D5310 Transcription factor bHLH no no 0.159 0.383 0.325 0.0003
P13526610 Anthocyanin regulatory Lc N/A no 0.189 0.232 0.244 0.0003
>sp|P0C7P8|LHWL1_ARATH Transcription factor EMB1444 OS=Arabidopsis thaliana GN=EMB1444 PE=2 SV=1 Back     alignment and function desciption
 Score =  568 bits (1463), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 356/763 (46%), Positives = 464/763 (60%), Gaps = 57/763 (7%)

Query: 8   FDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENF 67
           + L  IL+S+C NT W YAVFWKL H + MVLT ED Y  N       E     ESL   
Sbjct: 3   YTLQQILRSICSNTDWNYAVFWKLNHHSPMVLTLEDVYCVN------HERGLMPESL--- 53

Query: 68  HGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGW 127
           HGGR++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+HQWIFS+ L  +S S+ +  +GW
Sbjct: 54  HGGRHAHDPLGLAVAKMSYHVHSLGEGIVGQVAISGQHQWIFSEYL-NDSHSTLQVHNGW 112

Query: 128 QSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQ 187
           +SQ SAGI+TI +VAV   GVVQLGSL +V ED  +VTHIR +F AL D    H S+ +Q
Sbjct: 113 ESQISAGIKTILIVAVGSCGVVQLGSLCKVEEDPALVTHIRHLFLALTDPLADHASNLMQ 172

Query: 188 SSVKNTLSLPDLPTKTI----PNRWHNLDEVVNRGG---------------PDVQLPMFP 228
             + +    P +P+K +    P+     D+ ++  G               P    P + 
Sbjct: 173 CDINSPSDRPKIPSKCLHEASPDFSGEFDKAMDMEGLNIVSQNTSNRSNDLPYNFTPTYF 232

Query: 229 YVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMG 288
           ++E+          +QP +       N+ +  +S   VG          V++  ++NQ+ 
Sbjct: 233 HMERTAQVIGGLEAVQPSM----FGSNDCV--TSGFSVG----------VVDTKHKNQVD 276

Query: 289 IHFISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLAS 348
           I  +S  +   E+ G++    + + N   +S N V +S    ++A+++++  A  +Y   
Sbjct: 277 ISDMSKVIYDEETGGYRYSREL-DPNFQHYSRNHVRNSGGTSALAMESDRLKAGSSYPQ- 334

Query: 349 NPLEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQ-TELNHLDPSGMSLKF 407
             L++ +   +K +   S +N +    E     F KD E  Q + +E + LD    SL  
Sbjct: 335 --LDSTVLTALKTDKDYSRRNEVFQPSESQGSIFVKDTEHRQEEKSESSQLDALTASLCS 392

Query: 408 SAVSELHEALGPAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVA 467
            + SEL EALGPAF  K   +  E         +      S SHL F+S SENLLDAVVA
Sbjct: 393 FSGSELLEALGPAF-SKTSTDYGELAKFESAAAIRRTNDMSHSHLTFESSSENLLDAVVA 451

Query: 468 SVCNSGSDVKSERTFCRSMQSLLTTEKKPESSSQSKNTNNSVSYS---ISQSSLVEEDAK 524
           S+ N   +V+ E +  RS QSLLTT +  ++     N  N VS     ISQ  L +   +
Sbjct: 452 SMSNGDGNVRREISSSRSTQSLLTTAEMAQAEPFGHNKQNIVSTVDSVISQPPLADGLIQ 511

Query: 525 HFLNSSEVCGAVSSKGFSSTCPSTCSEQLDRSSEPAKNNKKRARTGENGRPRPRDRQLIQ 584
              N S +CGA SS GFSSTC S+ S+Q   S E  K NKKRA+ GE+ RPRPRDRQLIQ
Sbjct: 512 Q--NPSNICGAFSSIGFSSTCLSSSSDQFPTSLEIPKKNKKRAKPGESSRPRPRDRQLIQ 569

Query: 585 DRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHG- 643
           DRIKELRELVPNGSKCSIDSLLE TIKHMLFLQS+++HADKL+K A SKM  K  G  G 
Sbjct: 570 DRIKELRELVPNGSKCSIDSLLECTIKHMLFLQSVSQHADKLTKSASSKMQHKDTGTLGI 629

Query: 644 SNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTIL 703
           S+ EQGSSWAVE+G HL+VCSI+VENL+K G ML+EMLCEECSHFLEIA  IRSL L IL
Sbjct: 630 SSTEQGSSWAVEIGGHLQVCSIMVENLDKEGVMLIEMLCEECSHFLEIANVIRSLELIIL 689

Query: 704 KGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQSKTT 746
           +G TE  G+KTWICFVVEGQ+N++MHRMD+LWSLVQ+ Q K T
Sbjct: 690 RGTTEKQGEKTWICFVVEGQNNKVMHRMDILWSLVQIFQPKAT 732




Transcription factor that may regulate root development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q58G01|LHWL3_ARATH Transcription factor bHLH155 OS=Arabidopsis thaliana GN=BHLH155 PE=2 SV=1 Back     alignment and function description
>sp|Q9XIN0|LHW_ARATH Transcription factor LHW OS=Arabidopsis thaliana GN=LHW PE=1 SV=1 Back     alignment and function description
>sp|Q7XJU0|LHWL2_ARATH Transcription factor bHLH157 OS=Arabidopsis thaliana GN=BHLH157 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZPY8|AIB_ARATH Transcription factor ABA-INDUCIBLE bHLH-TYPE OS=Arabidopsis thaliana GN=AIB PE=2 SV=2 Back     alignment and function description
>sp|Q9FN69|GL3_ARATH Transcription factor GLABRA 3 OS=Arabidopsis thaliana GN=GL3 PE=1 SV=1 Back     alignment and function description
>sp|Q9ZVX2|AMS_ARATH Transcription factor ABORTED MICROSPORES OS=Arabidopsis thaliana GN=AMS PE=1 SV=2 Back     alignment and function description
>sp|Q9FT81|TT8_ARATH Transcription factor TT8 OS=Arabidopsis thaliana GN=TT8 PE=1 SV=2 Back     alignment and function description
>sp|Q8S3D5|BH069_ARATH Transcription factor bHLH69 OS=Arabidopsis thaliana GN=BHLH69 PE=2 SV=2 Back     alignment and function description
>sp|P13526|ARLC_MAIZE Anthocyanin regulatory Lc protein OS=Zea mays GN=LC PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query748
255583220749 basic helix-loop-helix-containing protei 0.982 0.981 0.629 0.0
359480260749 PREDICTED: uncharacterized basic helix-l 0.983 0.982 0.622 0.0
297744122774 unnamed protein product [Vitis vinifera] 0.983 0.950 0.622 0.0
449502085691 PREDICTED: transcription factor EMB1444- 0.915 0.991 0.547 0.0
224062159703 predicted protein [Populus trichocarpa] 0.918 0.977 0.542 0.0
449458502677 PREDICTED: transcription factor EMB1444- 0.897 0.991 0.535 0.0
356570632756 PREDICTED: uncharacterized basic helix-l 0.975 0.965 0.512 0.0
356503600756 PREDICTED: uncharacterized basic helix-l 0.974 0.964 0.502 0.0
356536821697 PREDICTED: uncharacterized basic helix-l 0.913 0.979 0.491 1e-177
312282411699 unnamed protein product [Thellungiella h 0.907 0.971 0.460 1e-159
>gi|255583220|ref|XP_002532375.1| basic helix-loop-helix-containing protein, putative [Ricinus communis] gi|223527931|gb|EEF30018.1| basic helix-loop-helix-containing protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  891 bits (2303), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/750 (62%), Positives = 570/750 (76%), Gaps = 15/750 (2%)

Query: 9   DLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFH 68
           DLH  L+SLCFNT WKYAVFWKLKHRTRMVLTWED YY+NC Q D LENKC  E+ EN  
Sbjct: 4   DLHNTLRSLCFNTDWKYAVFWKLKHRTRMVLTWEDAYYNNCEQHDLLENKCFGETFENLC 63

Query: 69  GGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQ 128
           GGRYS+DP+GLAVAKMSYHVYSLGEGIVGQVAVTGKH+WI +D+ VTNS SSFEFSDGWQ
Sbjct: 64  GGRYSNDPVGLAVAKMSYHVYSLGEGIVGQVAVTGKHRWIVADKHVTNSISSFEFSDGWQ 123

Query: 129 SQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQS 188
           SQFSAGIRTI VVAVVPHGVVQLGSL++V EDMK+V HI+DVF++L D SV  +S  +Q 
Sbjct: 124 SQFSAGIRTIIVVAVVPHGVVQLGSLNKVAEDMKLVNHIKDVFSSLQDSSVEQISIPLQY 183

Query: 189 SVKNTLSLPDLPTKT-------IPNRWHNLDEVVNRGGPDVQLPMFPYVEKHNDGSYAFS 241
           S+K +L LPD+PT++       IP+   NLD+  ++ GP  Q  MFPY++K +D SY +S
Sbjct: 184 SMKTSLYLPDVPTQSLDSESVVIPDNLCNLDKAADK-GPYNQSTMFPYLQKQSDDSYFYS 242

Query: 242 --GMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRV 299
             G+  K    +VN+  G  LS    + S K+L P+SN+  L+  NQ+GI+ + D     
Sbjct: 243 LPGIHQKTAVELVNKYGGGGLSLPVNISSVKLLQPRSNISYLEQHNQVGINLVVDHTCGG 302

Query: 300 ESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQV 359
           ++S WKD G  SE N TP   NSV D+INLC V L  +KF AD      + L++ + ++ 
Sbjct: 303 KTSVWKDPGRGSELNVTPHLDNSVKDNINLCDVILPDQKFGADPANFPMDLLDSTVCDRH 362

Query: 360 KLECTDSCQNGMLHIPEISDIKFEKDLE-KLQNQTELNHLDPSGMSLKFSAVSELHEALG 418
           K +  D   NG L +PE S I  +K LE KL+ Q   +HL+ S   LKFSA  ELHEALG
Sbjct: 363 KSDEID-ILNGALDMPESSSIDLKKHLEKKLEYQAGSSHLESSSTFLKFSAGCELHEALG 421

Query: 419 PAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGS-DVK 477
           PAF +  +Y D E   T   + + +PE  S+S + FD+GSENLLDAVV +VC SGS DVK
Sbjct: 422 PAFSKGCLYFDCEEGKTESADIIEVPEGISTSQMTFDTGSENLLDAVVGNVCYSGSTDVK 481

Query: 478 SERTFCRSMQSLLTTEKKPESSSQSKNTNNSVSYSISQSSLVEEDAKHFLNSSEVCGAVS 537
            E++ C+S QSLLTTEK PE S Q+K+  +S  YSI++ S+V+ D  +  +S+ V GA S
Sbjct: 482 REKSVCKSAQSLLTTEKMPEPSFQAKHITHSAGYSINRQSVVQNDTHNCSSSTGVRGATS 541

Query: 538 SKGFSSTCPSTCSEQLDRSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNG 597
           S G+SS CPSTCSEQLDR SEPA+ NKKRAR GEN RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 542 SNGYSSNCPSTCSEQLDRRSEPAEKNKKRARPGENCRPRPRDRQLIQDRIKELRELVPNG 601

Query: 598 SKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEMG 657
           +KCSIDSLLERTIKHMLFL+SITKHADKL+KCAESKM+QKG     SNYE+GSSWAVE+G
Sbjct: 602 AKCSIDSLLERTIKHMLFLESITKHADKLNKCAESKMYQKGTDT--SNYEKGSSWAVEVG 659

Query: 658 SHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWIC 717
            HLKV SI+VE+LNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKG+TE HG+KTWIC
Sbjct: 660 GHLKVSSIIVESLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGITEVHGEKTWIC 719

Query: 718 FVVEGQDNRIMHRMDVLWSLVQLLQSKTTS 747
           F+VEGQ+N++MHRMD+LWSLVQ+LQ KT++
Sbjct: 720 FMVEGQNNKVMHRMDILWSLVQILQPKTSN 749




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359480260|ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-helix protein At1g06150-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|297744122|emb|CBI37092.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449502085|ref|XP_004161538.1| PREDICTED: transcription factor EMB1444-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224062159|ref|XP_002300784.1| predicted protein [Populus trichocarpa] gi|222842510|gb|EEE80057.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449458502|ref|XP_004146986.1| PREDICTED: transcription factor EMB1444-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356570632|ref|XP_003553489.1| PREDICTED: uncharacterized basic helix-loop-helix protein At1g06150-like [Glycine max] Back     alignment and taxonomy information
>gi|356503600|ref|XP_003520595.1| PREDICTED: uncharacterized basic helix-loop-helix protein At1g06150-like [Glycine max] Back     alignment and taxonomy information
>gi|356536821|ref|XP_003536932.1| PREDICTED: uncharacterized basic helix-loop-helix protein At1g06150-like [Glycine max] Back     alignment and taxonomy information
>gi|312282411|dbj|BAJ34071.1| unnamed protein product [Thellungiella halophila] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query748
TAIR|locus:2042551720 CPUORF7 "conserved peptide ups 0.881 0.915 0.447 1.7e-163
UNIPROTKB|Q5N9E6904 P0505D12.13 "BHLH transcriptio 0.339 0.280 0.480 1.6e-80
TAIR|locus:2039580650 LHW "AT2G27230" [Arabidopsis t 0.371 0.427 0.442 9.7e-72
UNIPROTKB|Q2QXE3881 LOC_Os12g06330 "Expressed prot 0.378 0.321 0.429 5.4e-71
TAIR|locus:1005716759527 AT1G64625 [Arabidopsis thalian 0.315 0.447 0.416 5.6e-60
TAIR|locus:2087070362 AT3G15240 "AT3G15240" [Arabido 0.271 0.560 0.324 4.9e-17
TAIR|locus:2154719377 AT5G53900 "AT5G53900" [Arabido 0.291 0.578 0.307 1.9e-16
TAIR|locus:2202735386 AT1G60060 "AT1G60060" [Arabido 0.212 0.411 0.296 4.8e-12
TAIR|locus:401071396879 AT5G24352 "AT5G24352" [Arabido 0.081 0.772 0.405 5.5e-07
TAIR|locus:504954829637 GL3 "AT5G41315" [Arabidopsis t 0.085 0.100 0.283 1.5e-06
TAIR|locus:2042551 CPUORF7 "conserved peptide upstream open reading frame 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1306 (464.8 bits), Expect = 1.7e-163, Sum P(2) = 1.7e-163
 Identities = 308/688 (44%), Positives = 413/688 (60%)

Query:    66 NFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSS-FEFS 124
             N HG   +HDPLGLAVAKMSYHVYSLGEGIVGQVAV+G+HQW+F +    N+C+S FEF 
Sbjct:    48 NMHG---AHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGEHQWVFPENY--NNCNSAFEFH 102

Query:   125 DGWQSQFSAGIRTIAXXXXXXXXXXQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSS 184
             + W+SQ SAGI+TI           QLGSL +V ED+  V HIR +F AL D    H ++
Sbjct:   103 NVWESQISAGIKTILVVAVGPCGVVQLGSLCKVNEDVNFVNHIRHLFLALRDPLADHAAN 162

Query:   185 TIQSSVKNTLSLPDLPTKTI-PNRWHNLDEVVNRGGPDVQLP--MFPYVEKHNDGS-Y-- 238
               Q ++ N+L LP +P++ +    + +    V++   DV+    +  Y  + +D   Y  
Sbjct:   163 LRQCNMNNSLCLPKMPSEGLHAEAFPDCSGEVDKA-MDVEESNILTQYKTRRSDSMPYNT 221

Query:   239 AFSGMQPKIGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSR 298
               S +  +    VV   E +  S+ G         P  +++   ++NQ+G + I D    
Sbjct:   222 PSSCLVMEKAAQVVGGREVVQGSTCGSYSGVTFGFPV-DLVGAKHENQVGTNIIRDAPHV 280

Query:   299 VESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQ 358
               +SG KD   + + N   +  N V++  +  ++A++AE+ +  ++Y     L++     
Sbjct:   281 GMTSGCKDSRDL-DPNLHLYMKNHVLNDTSTSALAIEAERLITSQSYPR---LDSTFQAT 336

Query:   359 VKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQT-ELNHLDPSGMSLKFSAVSELHEAL 417
              + +   S  N +  + E    K+ K+ E++  +  E +  D    S    A SEL EAL
Sbjct:   337 SRTDKESSYHNEVFQLSENQGNKYIKETERMLGRNCESSQFDALISSGYTFAGSELLEAL 396

Query:   418 GPAFLRKDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGSDVK 477
             G AF + +     E   +  G T+   +  S S L FD G ENLLDAVVA+VC    + +
Sbjct:   397 GSAFKQTNT-GQEELLKSEHGSTMRPTDDMSHSQLTFDPGPENLLDAVVANVCQRDGNAR 455

Query:   478 SERTFCRSMQSLLTTEKKPESSSQSK-NTNNXXXXXXXXXXLVEEDAKHFLNSSEVCGAV 536
              +    RS+QSLLT  +  E S Q K N  N          + E D +   NSS++CGA 
Sbjct:   456 DDMMSSRSVQSLLTNMELAEPSGQKKHNIVNPINSAMNQPPMAEVDTQQ--NSSDICGAF 513

Query:   537 SSKGFSSTCPSTCSEQLDRSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPN 596
             SS GFSST PS+ S+Q   S +  K NKKRA+ GE+ RPRPRDRQLIQDRIKELRELVPN
Sbjct:   514 SSIGFSSTYPSSSSDQFQTSLDIPKKNKKRAKPGESSRPRPRDRQLIQDRIKELRELVPN 573

Query:   597 GSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESKMHQKGNGIHGSNYEQGSSWAVEM 656
             GSKCSIDSLLERTIKHMLFLQ++TKHA+KLSK A  KM QK  G+      QGSS AVE+
Sbjct:   574 GSKCSIDSLLERTIKHMLFLQNVTKHAEKLSKSANEKMQQKETGM------QGSSCAVEV 627

Query:   657 GSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWI 716
             G HL+V SI+VENLNK G +L+EMLCEEC HFLEIA  IRSL L IL+G TE  G+KTWI
Sbjct:   628 GGHLQVSSIIVENLNKQGMVLIEMLCEECGHFLEIANVIRSLDLVILRGFTETQGEKTWI 687

Query:   717 CFVVEGQDNRIMHRMDVLWSLVQLLQSK 744
             CFV E Q++++M RMD+LWSLVQ+ Q K
Sbjct:   688 CFVTESQNSKVMQRMDILWSLVQIFQPK 715


GO:0005634 "nucleus" evidence=ISM
GO:0005739 "mitochondrion" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
UNIPROTKB|Q5N9E6 P0505D12.13 "BHLH transcription factor-like protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2039580 LHW "AT2G27230" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q2QXE3 LOC_Os12g06330 "Expressed protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:1005716759 AT1G64625 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2087070 AT3G15240 "AT3G15240" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2154719 AT5G53900 "AT5G53900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2202735 AT1G60060 "AT1G60060" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4010713968 AT5G24352 "AT5G24352" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504954829 GL3 "AT5G41315" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00007504001
SubName- Full=Chromosome chr7 scaffold_192, whole genome shotgun sequence; (749 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query748
pfam14215171 pfam14215, bHLH-MYC_N, bHLH-MYC and R2R3-MYB trans 7e-55
smart0035353 smart00353, HLH, helix loop helix domain 0.002
>gnl|CDD|222598 pfam14215, bHLH-MYC_N, bHLH-MYC and R2R3-MYB transcription factors N-terminal Back     alignment and domain information
 Score =  185 bits (473), Expect = 7e-55
 Identities = 52/177 (29%), Positives = 78/177 (44%), Gaps = 21/177 (11%)

Query: 10  LHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCG--------QQDSLENKCSS 61
           L   L++L  +  W YA+FW+L      VL W DGYY+           +      K   
Sbjct: 1   LQQRLQALVESEQWSYAIFWQLSPDQSGVLGWGDGYYNGEIKTRKTVAEELGLQRKKVLR 60

Query: 62  ESLENFHGGR--YSHDPLGLAVAKM-----SYHVYSLGEGIVGQVAVTGKHQWIFSDQLV 114
           E      G R   +  P  L   +          +  GEG+ G+   +G H W+      
Sbjct: 61  ELYSLLSGSRDAAALSPEDLTDTEWFYLVSMTFSFPPGEGLPGKAFASGSHIWLSGA--- 117

Query: 115 TNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVF 171
            N   S   S  W +Q  AGI+TI  +  VP GVV+LGS +++ ED+ +V H++ +F
Sbjct: 118 -NELDSSNCSRSWLAQS-AGIQTIVCIP-VPGGVVELGSTEKIPEDLNLVQHVKSLF 171


This is the N-terminal region of a family of MYB and MYC transcription factors. The DNA-binding HLH domain is further downstream, pfam00010. Members of the MYB and MYC family regulate the biosynthesis of phenylpropanoids in several plant species (DOI:10.1007/s11295-009-0232-y). Length = 171

>gnl|CDD|197674 smart00353, HLH, helix loop helix domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 748
PF14215163 bHLH-MYC_N: bHLH-MYC and R2R3-MYB transcription fa 100.0
smart0035353 HLH helix loop helix domain. 97.91
cd0008360 HLH Helix-loop-helix domain, found in specific DNA 97.78
PF0001055 HLH: Helix-loop-helix DNA-binding domain only nucl 97.67
KOG1318411 consensus Helix loop helix transcription factor EB 96.84
KOG1319229 consensus bHLHZip transcription factor BIGMAX [Tra 96.72
KOG4304250 consensus Transcriptional repressors of the hairy/ 94.49
cd0492776 ACT_ACR-like_2 Second ACT domain, of a novel type 92.34
cd0490073 ACT_UUR-like_1 ACT domain family, ACT_UUR-like_1, 92.17
KOG2483232 consensus Upstream transcription factor 2/L-myc-2 91.8
cd0492574 ACT_ACR_2 ACT domain-containing protein which is c 90.77
TIGR01817534 nifA Nif-specific regulatory protein. This model r 89.33
PF13185148 GAF_2: GAF domain; PDB: 2QYB_A 3KSG_B 3KSF_C 3KSI_ 88.42
PRK11061748 fused phosphoenolpyruvate-protein phosphotransfera 88.17
cd0492672 ACT_ACR_4 C-terminal ACT domain, of a novel type o 87.47
cd0489970 ACT_ACR-UUR-like_2 C-terminal ACT domains of the b 87.31
KOG2588 953 consensus Predicted DNA-binding protein [Transcrip 84.7
KOG4029228 consensus Transcription factor HAND2/Transcription 83.6
KOG3561 803 consensus Aryl-hydrocarbon receptor nuclear transl 83.43
cd0489572 ACT_ACR_1 ACT domain-containing protein which is c 83.16
PF01590154 GAF: GAF domain; InterPro: IPR003018 This domain i 82.88
>PF14215 bHLH-MYC_N: bHLH-MYC and R2R3-MYB transcription factors N-terminal Back     alignment and domain information
Probab=100.00  E-value=1.4e-42  Score=338.72  Aligned_cols=152  Identities=38%  Similarity=0.710  Sum_probs=124.9

Q ss_pred             HHHHHHhhccCCCcEEEEEeeecCCCCeEEEecCCccCCCCCcchhh---hhcchhhhhcccCCC--CCCCc------cc
Q 004499           10 LHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLE---NKCSSESLENFHGGR--YSHDP------LG   78 (748)
Q Consensus        10 Lqq~LrsLc~~~~WsYAIFWqls~~~~~vL~WgDGyc~g~~~~~~~e---~~~~~k~l~~L~gg~--~~~d~------~~   78 (748)
                      |||+||+||++.+|+||||||++++++ +|+||||||++++++++..   ....+++++.++.+.  ++..+      +.
T Consensus         1 Lq~~Lr~lv~~~~W~YaVFWk~~~~~~-~L~W~DG~~~g~~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~v~~~e~   79 (163)
T PF14215_consen    1 LQQRLRSLVENSQWTYAVFWKLSPDNS-VLVWGDGYCNGPKETRKNGEEEQEQRSKVLRELHSSFSSYALSPEEVTDTEW   79 (163)
T ss_pred             ChHHHHHHhCCCCCcEEEEeEEcCCCC-eeeEcceeecCCcccccchhhccchhhhHHHHHhhhccccccccchhHHHHH
Confidence            799999999999999999999999996 9999999999988765432   122455666654321  22233      33


Q ss_pred             ceeeeEEEEEecCCCCeeeeEeeCCCeEeeeCCCCccCcCCCCCcchhhhcccccCceeEEEEEecCCcEEEeccccccc
Q 004499           79 LAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVT  158 (748)
Q Consensus        79 l~v~~MS~~sF~~GeGlpGrAaaSG~hvWI~~~~~~~~~~~~~e~~r~~~~QfSAGIQTIVcIPV~~~GVLELGSTe~V~  158 (748)
                      +++.+|+| +|  |+|+|||||++|+|+||++++....    ..|.|++++|+ +|||||||||| ++||||||||++|+
T Consensus        80 f~~~s~~~-sf--g~G~~G~a~~sg~~~Wi~~~~~~~~----~~~~r~~~aq~-~~~~Tiv~IPv-~~GVvELGSt~~I~  150 (163)
T PF14215_consen   80 FYLVSMSY-SF--GEGIPGRAAASGQHIWISGANELDS----SYCERAWLAQF-AGIQTIVCIPV-PNGVVELGSTEKIP  150 (163)
T ss_pred             HhhceeeE-Ee--cCCccEEEeecCccEEEeCCCcccc----ccchhhhhhcc-cccceEEEEEe-cCCEEEeeeeeeec
Confidence            45667743 44  9999999999999999999987543    44889998777 99999999998 99999999999999


Q ss_pred             CCHHHHHHHHHHH
Q 004499          159 EDMKVVTHIRDVF  171 (748)
Q Consensus       159 Ed~~lV~~VKslF  171 (748)
                      ||++||++||++|
T Consensus       151 Ed~~~v~~vk~~F  163 (163)
T PF14215_consen  151 EDSNLVQRVKSLF  163 (163)
T ss_pred             cCHHHHHHHHhhC
Confidence            9999999999998



>smart00353 HLH helix loop helix domain Back     alignment and domain information
>cd00083 HLH Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
>PF00010 HLH: Helix-loop-helix DNA-binding domain only nuclear translocator protein (Arnt) Back     alignment and domain information
>KOG1318 consensus Helix loop helix transcription factor EB [Transcription] Back     alignment and domain information
>KOG1319 consensus bHLHZip transcription factor BIGMAX [Transcription] Back     alignment and domain information
>KOG4304 consensus Transcriptional repressors of the hairy/E(spl) family (contains HLH) [Transcription] Back     alignment and domain information
>cd04927 ACT_ACR-like_2 Second ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>cd04900 ACT_UUR-like_1 ACT domain family, ACT_UUR-like_1, includes the first of two C-terminal ACT domains of the bacterial signal-transducing uridylyltransferase /uridylyl-removing (UUR) enzyme, GlnD and related domains Back     alignment and domain information
>KOG2483 consensus Upstream transcription factor 2/L-myc-2 protein [Transcription] Back     alignment and domain information
>cd04925 ACT_ACR_2 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>TIGR01817 nifA Nif-specific regulatory protein Back     alignment and domain information
>PF13185 GAF_2: GAF domain; PDB: 2QYB_A 3KSG_B 3KSF_C 3KSI_A 3KSH_A 3MMH_A 3RFB_B 1F5M_A 3KO6_B 3HCY_A Back     alignment and domain information
>PRK11061 fused phosphoenolpyruvate-protein phosphotransferase PtsP/GAF domain; Provisional Back     alignment and domain information
>cd04926 ACT_ACR_4 C-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>cd04899 ACT_ACR-UUR-like_2 C-terminal ACT domains of the bacterial signal-transducing uridylyltransferase /uridylyl-removing (UUR) enzyme, GlnD and related domains Back     alignment and domain information
>KOG2588 consensus Predicted DNA-binding protein [Transcription] Back     alignment and domain information
>KOG4029 consensus Transcription factor HAND2/Transcription factor TAL1/TAL2/LYL1 [Transcription] Back     alignment and domain information
>KOG3561 consensus Aryl-hydrocarbon receptor nuclear translocator [Transcription] Back     alignment and domain information
>cd04895 ACT_ACR_1 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>PF01590 GAF: GAF domain; InterPro: IPR003018 This domain is present in phytochromes and cGMP-specific phosphodiesterases Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query748
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 3e-04
>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Length = 83 Back     alignment and structure
 Score = 39.3 bits (92), Expect = 3e-04
 Identities = 11/41 (26%), Positives = 22/41 (53%), Gaps = 2/41 (4%)

Query: 580 RQLIQDRIKELRELVP--NGSKCSIDSLLERTIKHMLFLQS 618
           R  I+D    LR+ VP   G K S   +L++  +++ +++ 
Sbjct: 14  RDHIKDSFHSLRDSVPSLQGEKASRAQILDKATEYIQYMRR 54


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 748
d1uklc_61 a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId 2e-04
d1an4a_65 a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapien 7e-04
d1mdya_68 a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus muscul 7e-04
d1a0aa_63 a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Sa 0.001
d1nkpa_88 a.38.1.1 (A:) Myc proto-oncogene protein {Human (H 0.002
d1nkpb_83 a.38.1.1 (B:) Max protein {Human (Homo sapiens) [T 0.004
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Length = 61 Back     information, alignment and structure

class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-2
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 37.8 bits (88), Expect = 2e-04
 Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 580 RQLIQDRIKELRELVPNG-SKCSIDSLLERTIKHMLFLQS 618
           R  I D+I EL++LV    +K     +L + I ++ +LQ 
Sbjct: 1   RSSINDKIIELKDLVMGTDAKMHKSGVLRKAIDYIKYLQQ 40


>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Length = 65 Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 68 Back     information, alignment and structure
>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 63 Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Length = 88 Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Length = 83 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query748
d1uklc_61 SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} 97.88
d1a0aa_63 Pho4 B/HLH domain {Baker's yeast (Saccharomyces ce 97.56
d1mdya_68 Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10 97.53
d1am9a_80 SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} 97.52
d1nkpb_83 Max protein {Human (Homo sapiens) [TaxId: 9606]} 97.49
d1nlwa_79 Mad protein {Human (Homo sapiens) [TaxId: 9606]} 97.43
d1nkpa_88 Myc proto-oncogene protein {Human (Homo sapiens) [ 97.32
d1an4a_65 Usf B/HLH domain {Human (Homo sapiens) [TaxId: 960 96.75
d1f5ma_176 Hypothetical protein ykl069wp {Baker's yeast (Sacc 96.47
d1mc0a1187 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF 94.52
d1vhma_159 Hypothetical protein YebR {Escherichia coli [TaxId 94.28
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-2
species: Human (Homo sapiens) [TaxId: 9606]
Probab=97.88  E-value=6.3e-06  Score=54.29  Aligned_cols=48  Identities=29%  Similarity=0.428  Sum_probs=41.8

Q ss_pred             HHHHHHHHHHHHHHCCCC-CCCCHHHHHHHHHHHHHHHHHH-HHHHHHHH
Q ss_conf             578999999998506899-6330666899999899999886-45676653
Q 004499          580 RQLIQDRIKELRELVPNG-SKCSIDSLLERTIKHMLFLQSI-TKHADKLS  627 (748)
Q Consensus       580 r~~i~~r~~~lr~~vp~~-~k~~i~~~l~~~i~~~~~l~~~-~~~~dkl~  627 (748)
                      |..|+|++.+||.+||+. .|.|..++|.+||.||.+||.+ ..+..+..
T Consensus         1 R~~in~~f~~L~~lvP~~~~K~~Ka~iL~~Ai~YI~~Lq~~~~~L~~e~~   50 (61)
T d1uklc_           1 RSSINDKIIELKDLVMGTDAKMHKSGVLRKAIDYIKYLQQVNHKLRQENM   50 (61)
T ss_dssp             CCCHHHHHHHHHHHHSCSSCCCCTTTHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             94788999999966899987666999999999999999999999999999



>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1f5ma_ d.110.2.1 (A:) Hypothetical protein ykl069wp {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1mc0a1 d.110.2.1 (A:215-401) 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF A and GAF B domains {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1vhma_ d.110.2.1 (A:) Hypothetical protein YebR {Escherichia coli [TaxId: 562]} Back     information, alignment and structure