Citrus Sinensis ID: 002607


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------840-------850-------860-------870-------880-------890-------900-
MKLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQGSMTLHGSESSEEREVVSSNNSLESSTSVVSSINESKCKLMNSSEIYPETYNDVLSSPNSLDSSFAPFADGTISSSNSNSDAGDSSNVPTLNSFNGSNSFVELLQMVGSTMLHGNYNHRNGHMSSDENSKDEHSQFQTLESNTQRVKVKDIDDPKVLSRVSSIPPSSFHPCLTQDLSVEVESYEMRREETRSSGISDVTDKIALMPEFASQTTDATKLIVAGPEAPRHGNKQSRNSMQANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNLPKISSGSIDAIESHNALYNRENTQLKSSVSDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEIAVNNCEPIIEEPATPEPERVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKLNKVPGKADADHK
cHHHHHHHHHHHHEEEccHHHHHHHHHcccccccccccccccccccccccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHcccccccccccccccHHHHHHccHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHccccccccccccccccccccccHHHHHccccccccccccHHHHHHHHHHHHccEEEEEEccccccccccccHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccccccHHHHHccccccccccccccccccccccEEEEEcccccccccccccccccccccEEEEEEccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEccccccccccccccccEEcEEEEcccccccccEEccHHHcccccEEEEEcccHHHHHccccHHHHHHHHHcccEEEEccccccccccccccccccccccccccccccccccc
cccHHHHHHHHEEEHHHHHHHHHHHHHHccccccccccccccccEEEEEcccEEEEcccccccccccccccccccccccEEcccccccccEEccccccccccccEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEHHcccccccHcccccccccEEEEccccccccccccccccccccccccccccccHcccccccccccccccccccccEEEcccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccEEccccccccccccccccccHHHcccccccccccccHHHHHcccccccccccEcccccccccccccccccccccccccHHcccccccccccccccHHHHHHHHHHccccccccccccccccHHHHHHccHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHcccccEEEEccccHHHccccHHHHHHHHcHHHHHHccHHHHHHHHHHHHHcccEEEcccccccccccccHHHHHHHHHHHHHHHcccccHHcccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccccccccHHHHHHHHcccccccccEEccHHHHHHHHHHHHHHcccHccccHcHHHEEEccccccccccccccccccEccEEEEEcccccHHHcccccccccccccEEEEEEcccccccccccccccccccccccccccccccccccHHHccccEEEEEEEEEccHccccccccccEEEEEEEEEEccccccccccccHHHHHccccEEEEEcccccHHcccccHHHHHHHHHccEEEEEccccccccccccHHHEccccccccccccccHHHcc
MKLEERQIENLLLHSFDCFSSAFMSlaanfplnskqkpchgeeitsvieepaeyvldpedtiewkekmshqpvcdqgsmtlhgsesseerevvssnnslesSTSVVSSINESkcklmnsseiypetyndvlsspnsldssfapfadgtisssnsnsdagdssnvptlnsfngsNSFVELLQMVGstmlhgnynhrnghmssdenskdehsqfqtlesntqrvkvkdiddpkvlsrvssippssfhpcltqdlsveVESYEMRREetrssgisdvtdkialmpefasqttdATKLivagpeaprhgnkqsrnSMQANKNSIAQHEselfgdsrfameppahaqkndlnlpkissgsidAIESHNAlynrentqlkssvsdqnkydhsfskelngiddatskskstrvskekqndfdwdSLRRQVEanggkkerpehtkdsldWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDhgsvdlewlrdvppdkakypVLESIQKYLWprlckldqRTLYELHYQMITFGKVfctkskpncnacpmrgecrHFASAFassrlalpgpeekAIVSAnenrtntqnpamminqlplplthatdlpvgkleiavnncepiieepatpepervqvsendiedtfcedpeeiptIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAgaasipapklknvsrlrtehqvyelpdshpllrgmekrepddpgryLLAIwtpgetansiqppesrcssqehgkmcdektcfscnsvresefqivrgtilipcrtamrgsfplngtyfqvnevfadhdsslkpinvprewlwnlprrtvyfgtsipsifkglttegiqhcfwrgyvcvrgfdqksraprplmarlhfpasklnkvpgkadadhk
MKLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITsvieepaeyvlDPEDTIEWKEKMSHQPVCDQGSMTLHGSESSEEREVvssnnslesstsvVSSINESKCKLMNSSEIYPETYNDVLSSPNSLDSSFAPFADGTISSSNSNSDAGDSSNVPTLNSFNGSNSFVELLQMVGSTMLHGNYNHRNGHMSSDENSKDEHSQFQtlesntqrvkvkdiddPKVLSrvssippssfhpcltqdlSVEVESYEMRreetrssgisdvtdKIALMPEFASQTTDATKLIVAGPEAPRHGNKQSRNSMQANKNSIAQHESELFGDSRFAMEPPAHaqkndlnlpkISSGSIDAIESHNALYNRENTQLKSSVSDQNKYDHsfskelngiddatskskstrvskekqndfdwdslrrqveanggkkerpehtkdsldweavrCADVNKIAntikergmnnmLAGRIKDFLNRLVRDHGsvdlewlrdvppdkakypVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEIAVNNCEPiieepatpepervqvSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGaasipapklknvsrlrTEHQvyelpdshpllrgmEKREPDDPGRYLLAIWTPgetansiqppESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMArlhfpasklnkvpgkadadhk
MKLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQGSMTLHGsesseerevvssnnslesstsvvssinesKCKLMNSSEIYPETYNDVLSSPNSLDSSFAPFADGTIsssnsnsdagdssnVPTLNSFNGSNSFVELLQMVGSTMLHGNYNHRNGHMSSDENSKDEHSQFQTLESNTQRVKVKDIDDPKVLSRVSSIPPSSFHPCLTQDLSVEVESYEMRREETRSSGISDVTDKIALMPEFASQTTDATKLIVAGPEAPRHGNKQSRNSMQANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNLPKISSGSIDAIESHNALYNRENTQLKSSVSDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHfasafassRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEIAVNNCepiieepatpepeRVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKLNKVPGKADADHK
*******IENLLLHSFDCFSSAFMSLAANFPLN*****CH*EEITSVIEEPAEYVLD**********************************************************************************************************************FVELLQMVGSTML***********************************************************************************************************************************************************************************************************************************************************LDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFA******************************INQLPLPLTHATDLPVGKLEIAVNNCEPII******************************TIKLNMKEFTQTLQNYM*************ALVALTAG*******************************************RYLLAIWT*************************EKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFD********************************
******Q*ENLLLHSFDCFSSAFMSLAA**************************VL************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************DSL*******************SLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRH****************************************************************IIEEPATP****VQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYM*************ALVALTAGAASI*APKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGE***********************KTCFSCNSV*E*EFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARL*******************
MKLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQ*********************************NESKCKLMNSSEIYPETYNDVLSSPNSLDSSFAPFADGTIS*************VPTLNSFNGSNSFVELLQMVGSTMLHGNYNHRNGH*******************NTQRVKVKDIDDPKVLSRVSSIPPSSFHPCLTQDLSVEVE*************ISDVTDKIALMPEFASQTTDATKLIVAGPEAPRHGNKQSRNSMQANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNLPKISSGSIDAIESHNALYNRENT***********YDHSFSKELNGIDDA****************FDWDSLRRQV***************SLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEIAVNNCEPIIEEPATPEPERVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETA******************CDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKLN***********
*KLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQGSMTLHGS*****************************************************************************************NSFVELLQMVGSTMLHGNYNHRNGHMSSDEN*****************************************PC***DLSVEVESY*M********************************************************************************************************************************************************FDWDSLRRQVEAN********HTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALP***********************INQLPLPLTHA**********AVNNCEPIIEEPATPEPERVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGET*******************C**KTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASK*************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQGSMTLHGSESSEEREVVSSNNSLESSTSVVSSINESKCKLMNSSEIYPETYNDVLSSPNSLDSSFAPFADGTISSSNSNSDAGDSSNVPTLNSFNGSNSFVELLQMVGSTMLHGNYNHRNGHMSSDENSKDEHSQFQTLESNTQRVKVKDIDDPKVLSRVSSIPPSSFHPCLTQDLSVEVESYEMRREETRSSGISDVTDKIALMPEFASQTTDATKLIVAGPEAPRHGNKQSRNSMQANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNLPKISSGSIDAIESHNALYNRENTQLKSSVSDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEIAVNNCEPIIEEPATPEPERVQVSENDIEDTFCEDPEEIPTIKLNxxxxxxxxxxxxxxxxxxxxxDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKLNKVPGKADADHK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query901 2.2.26 [Sep-21-2011]
Q9SJQ61393 Protein ROS1 OS=Arabidops yes no 0.536 0.346 0.608 0.0
Q8LK561987 Transcriptional activator no no 0.944 0.428 0.426 0.0
Q9SR661332 DEMETER-like protein 2 OS no no 0.511 0.346 0.506 1e-146
O494981044 DEMETER-like protein 3 OS no no 0.406 0.350 0.469 1e-100
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2 Back     alignment and function desciption
 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/554 (60%), Positives = 392/554 (70%), Gaps = 71/554 (12%)

Query: 398  TSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTI 457
            T K K  +V KE++  FDWD LRR+ +A  G +E+   T D++DW+A+R ADV ++A TI
Sbjct: 841  TLKEKGKKVLKEEKKAFDWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETI 900

Query: 458  KERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAK------------------- 498
            K RGMN+ LA RI+ FL+RLV DHGS+DLEWLRDVPPDKAK                   
Sbjct: 901  KSRGMNHKLAERIQGFLDRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRL 960

Query: 499  --------------------------YPVLESIQ--------------KYLWPRLCKLDQ 518
                                       P+ ES+Q              KYLWPRLCKLDQ
Sbjct: 961  LTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQ 1020

Query: 519  RTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSAN 578
            +TLYELHYQMITFGKVFCTKSKPNCNACPM+GECRHFASAFAS+RLALP  E+       
Sbjct: 1021 KTLYELHYQMITFGKVFCTKSKPNCNACPMKGECRHFASAFASARLALPSTEKGM----- 1075

Query: 579  ENRTNTQNPAMMINQLPLPLTHATDLPV---GKLEIAVNNCEPIIEEPATPEPERVQVSE 635
               T  +NP  +   LP P        V    +    V  CEPIIEEPA+PEPE  +VS 
Sbjct: 1076 --GTPDKNPLPL--HLPEPFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSI 1131

Query: 636  NDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAP 695
             DIE+ F EDPEEIPTI+LNM  FT  L+  M+ N ELQ+G+MS ALVALTA  AS+P P
Sbjct: 1132 ADIEEAFFEDPEEIPTIRLNMDAFTSNLKKIMEHNKELQDGNMSSALVALTAETASLPMP 1191

Query: 696  KLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCS 755
            KLKN+S+LRTEH+VYELPD HPLL  +EKREPDDP  YLLAIWTPGETA+SIQP  S C 
Sbjct: 1192 KLKNISQLRTEHRVYELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCI 1251

Query: 756  SQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHD 815
             Q +G +CDE+TCFSCNS++E+  QIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADH 
Sbjct: 1252 FQANGMLCDEETCFSCNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHA 1311

Query: 816  SSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAP 875
            SSL PINVPRE +W LPRRTVYFGTS+P+IFKGL+TE IQ CFW+GYVCVRGFD+K+R P
Sbjct: 1312 SSLNPINVPRELIWELPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGP 1371

Query: 876  RPLMARLHFPASKL 889
            +PL+ARLHFPASKL
Sbjct: 1372 KPLIARLHFPASKL 1385




Prevents DNA hypermethylation, specifically in the promoter of otherwise silenced loci. May be involved in DNA repair through its nicking activity on methylated DNA. Catalyzes the release of 5-methylcytosine (5-meC) from DNA by a glycosylase/lyase mechanism.
Arabidopsis thaliana (taxid: 3702)
EC: 3EC: .EC: 2EC: .EC: 2EC: .EC: -
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1 SV=2 Back     alignment and function description
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2 Back     alignment and function description
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query901
330370553 1758 repressor of gene silencing 1 [Gossypium 0.968 0.496 0.546 0.0
225440992 1942 PREDICTED: transcriptional activator DEM 0.963 0.446 0.527 0.0
225449724 2198 PREDICTED: transcriptional activator DEM 0.940 0.385 0.487 0.0
138996985 1796 repressor of silencing 1 [Nicotiana taba 0.948 0.476 0.480 0.0
356503779 1764 PREDICTED: protein ROS1-like [Glycine ma 0.897 0.458 0.473 0.0
296090400 1621 unnamed protein product [Vitis vinifera] 0.539 0.299 0.678 0.0
356546049 1881 PREDICTED: protein ROS1-like [Glycine ma 0.916 0.439 0.468 0.0
449440467 1849 PREDICTED: uncharacterized protein LOC10 0.566 0.275 0.641 0.0
449501064 1768 PREDICTED: uncharacterized protein LOC10 0.566 0.288 0.639 0.0
449465553 1679 PREDICTED: transcriptional activator DEM 0.923 0.495 0.456 0.0
>gi|330370553|gb|AEC12446.1| repressor of gene silencing 1 [Gossypium hirsutum] Back     alignment and taxonomy information
 Score =  975 bits (2521), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 544/996 (54%), Positives = 650/996 (65%), Gaps = 123/996 (12%)

Query: 20   SSAFMSLAANFPLNSKQK-PCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQGS 78
            SSAFMSLAA FP+ SK K   + +E TS++     YVL+PE++I+W  K + QPV DQ S
Sbjct: 768  SSAFMSLAARFPIKSKSKDKLYHQEGTSLVNGEEFYVLEPEESIKWDAKTAIQPVGDQSS 827

Query: 79   MTLHGSESSEEREVVSSNNSLESSTSVVSSINESKCKLMNSSEIYPETY----------- 127
            MT+ G + SEE+EV +S     SST+ VSSINE KC L+NSS     TY           
Sbjct: 828  MTVDGYQDSEEKEVANSEELSGSSTATVSSINEPKCNLLNSSGSGLSTYCDSTANRLNME 887

Query: 128  ---------------NDVLSSPNSL-------DSSFAPFADGTISSSNSNSDAGDSSNVP 165
                           NDVLSS NS+       D S    A+ T S S  NS+  D +  P
Sbjct: 888  TIRGKTDCFKGDEETNDVLSSQNSVVSSENSGDFSLVQTAERTGSCSEGNSEGADHTKRP 947

Query: 166  TLNSFNGSNSFVELLQMVGSTMLHGNYNHRNGHMSSDENSKDEHSQFQTLESNTQRVKVK 225
              N  NGS SFV+LLQMVGS  LH   +H+N  MS +E  K ++        N QR    
Sbjct: 948  IFNILNGSTSFVQLLQMVGSARLHEVQSHQN--MSPNEKLKCQNKPI----PNHQRENCD 1001

Query: 226  DIDDPKVLSRVSSIPPSSFHPCLTQDLSV-EVESYEMRREETRSSGISDVTDKIA----- 279
            + D PK  +R   +P +++HP LT +  V E+  +E  +EETR S  S   D+       
Sbjct: 1002 NSDGPKSFTREDLMPSANYHPYLTLNSEVREIGHFETLKEETRVSEASKTIDESMIKRLS 1061

Query: 280  -LMPEFASQTTDATKLI--VAGPEAPRHGNKQSRNSMQANKNSIAQHESELFGDSRFAME 336
             L  E AS+T D       V   +     N QS       + +++     L  D+   +E
Sbjct: 1062 PLTQESASRTMDQNDKTRSVQVAQQSSFENFQSSTYTIPVEMTVSHCPKGLLQDTINLVE 1121

Query: 337  PPAHAQKNDL----NLPKISSGSIDAIESHNALYNRENTQLKSSVSDQNKYDHSFSKELN 392
             PA AQ  ++    ++ K S  ++D  ES  A  N+ N Q K   S+   +D S +KELN
Sbjct: 1122 SPAEAQNKEMLRHVSMSKHSEETLDITESSTAFDNQRNPQQKMQESNLYTHDSSSNKELN 1181

Query: 393  GIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNK 452
             +     KS+  +V KEK++DFDWDSLR+Q E NG K+E+ E T DSLDWEAVRCA+V++
Sbjct: 1182 SMV-GELKSEGRKVKKEKKDDFDWDSLRKQTEVNGRKREKTERTMDSLDWEAVRCAEVHE 1240

Query: 453  IANTIKERGMNNMLAGRI---------------------------KDFL----------- 474
            IA TIKERGMNN+LA RI                           K++L           
Sbjct: 1241 IAETIKERGMNNVLAQRIKDFLNRLVRDHGSIDLEWLRDVPPDKAKEYLLSIRGLGLKSV 1300

Query: 475  --NRLVRDHG------------SVDLEWLRDVP-PDKAK------YPVLESIQKYLWPRL 513
               RL+  H             +V L W+   P P+  +      YP+LESIQKYLWPRL
Sbjct: 1301 ECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWPRL 1360

Query: 514  CKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKA 573
            CKLDQRTLYELHYQMITFGKVFCTK KPNCNACPMRGECRHFASAFAS+RLALPGPEEK+
Sbjct: 1361 CKLDQRTLYELHYQMITFGKVFCTKGKPNCNACPMRGECRHFASAFASARLALPGPEEKS 1420

Query: 574  IVSANENRTNTQNPAMMINQLPLPLTHATDL--------PVGKLEIA--VNNCEPIIEEP 623
            IVSA EN T+ +NPA++I+QL LPL  + +L            L+ A  VN C+PIIEEP
Sbjct: 1421 IVSATENGTSDRNPAVIIDQLALPLPQSNELLDRNYQSEANQHLQAASTVNKCDPIIEEP 1480

Query: 624  ATPEPERVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALV 683
            A+PEPE  QV+ENDIED F EDP+EIPTIKLNM+EFTQTLQNYMQ N+ELQEGDMSKALV
Sbjct: 1481 ASPEPECTQVAENDIEDMFSEDPDEIPTIKLNMEEFTQTLQNYMQNNIELQEGDMSKALV 1540

Query: 684  ALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGET 743
            ALTA AASIP P+LKNV+RLRTEHQVYELPDSHPLL  ++KREPDDP +YLLAIWTPGET
Sbjct: 1541 ALTAEAASIPTPRLKNVNRLRTEHQVYELPDSHPLLNELDKREPDDPCKYLLAIWTPGET 1600

Query: 744  ANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGT 803
            ANSIQ PE RC+SQEHGK+CD++TCFSCNS++E+E QIVRGT+LIPCRTAMRGSFPLNGT
Sbjct: 1601 ANSIQQPERRCNSQEHGKLCDDETCFSCNSIQEAESQIVRGTLLIPCRTAMRGSFPLNGT 1660

Query: 804  YFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYV 863
            YFQVNEVFADHDSSL PI VPREWLWNLPRR VYFGTSIPSIFKGLTTEGIQHCFWRGYV
Sbjct: 1661 YFQVNEVFADHDSSLNPIAVPREWLWNLPRRMVYFGTSIPSIFKGLTTEGIQHCFWRGYV 1720

Query: 864  CVRGFDQKSRAPRPLMARLHFPASKLNKVPGKADAD 899
            CVRGFDQKSRAPRPLMARLHFP S+L K  GK   +
Sbjct: 1721 CVRGFDQKSRAPRPLMARLHFPVSRLAKAKGKGAGE 1756




Source: Gossypium hirsutum

Species: Gossypium hirsutum

Genus: Gossypium

Family: Malvaceae

Order: Malvales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225440992|ref|XP_002277401.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|225449724|ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|138996985|dbj|BAF52855.1| repressor of silencing 1 [Nicotiana tabacum] Back     alignment and taxonomy information
>gi|356503779|ref|XP_003520681.1| PREDICTED: protein ROS1-like [Glycine max] Back     alignment and taxonomy information
>gi|296090400|emb|CBI40219.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356546049|ref|XP_003541444.1| PREDICTED: protein ROS1-like [Glycine max] Back     alignment and taxonomy information
>gi|449440467|ref|XP_004138006.1| PREDICTED: uncharacterized protein LOC101222935 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449501064|ref|XP_004161268.1| PREDICTED: uncharacterized protein LOC101230418 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449465553|ref|XP_004150492.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query901
TAIR|locus:20449231393 DML1 "demeter-like 1" [Arabido 0.427 0.276 0.662 3.4e-178
TAIR|locus:21844321987 DME "DEMETER" [Arabidopsis tha 0.519 0.235 0.587 1.6e-156
TAIR|locus:21001381332 DML2 "demeter-like 2" [Arabido 0.430 0.291 0.509 1.9e-140
TAIR|locus:21243011044 DML3 "demeter-like protein 3" 0.558 0.481 0.395 1e-90
TAIR|locus:451510334278 AT4G04957 "AT4G04957" [Arabido 0.051 0.589 0.695 1.3e-11
TAIR|locus:2044923 DML1 "demeter-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1357 (482.7 bits), Expect = 3.4e-178, Sum P(3) = 3.4e-178
 Identities = 259/391 (66%), Positives = 298/391 (76%)

Query:   499 YPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHXXXX 558
             YP+LESIQKYLWPRLCKLDQ+TLYELHYQMITFGKVFCTKSKPNCNACPM+GECRH    
Sbjct:  1001 YPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACPMKGECRHFASA 1060

Query:   559 XXXXRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEIAVNNCXX 618
                 RLALP  E K + + ++N      P     +    +   ++ P  K    V  C  
Sbjct:  1061 FASARLALPSTE-KGMGTPDKNPLPLHLPEPFQREQGSEVVQHSE-PAKK----VTCCEP 1114

Query:   619 XXXXXXXXXXXRVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDM 678
                          +VS  DIE+ F EDPEEIPTI+LNM  FT  L+  M+ N ELQ+G+M
Sbjct:  1115 IIEEPASPEPETAEVSIADIEEAFFEDPEEIPTIRLNMDAFTSNLKKIMEHNKELQDGNM 1174

Query:   679 SKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIW 738
             S ALVALTA  AS+P PKLKN+S+LRTEH+VYELPD HPLL  +EKREPDDP  YLLAIW
Sbjct:  1175 SSALVALTAETASLPMPKLKNISQLRTEHRVYELPDEHPLLAQLEKREPDDPCSYLLAIW 1234

Query:   739 TPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSF 798
             TPGETA+SIQP  S C  Q +G +CDE+TCFSCNS++E+  QIVRGTILIPCRTAMRGSF
Sbjct:  1235 TPGETADSIQPSVSTCIFQANGMLCDEETCFSCNSIKETRSQIVRGTILIPCRTAMRGSF 1294

Query:   799 PLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCF 858
             PLNGTYFQVNEVFADH SSL PINVPRE +W LPRRTVYFGTS+P+IFKGL+TE IQ CF
Sbjct:  1295 PLNGTYFQVNEVFADHASSLNPINVPRELIWELPRRTVYFGTSVPTIFKGLSTEKIQACF 1354

Query:   859 WRGYVCVRGFDQKSRAPRPLMARLHFPASKL 889
             W+GYVCVRGFD+K+R P+PL+ARLHFPASKL
Sbjct:  1355 WKGYVCVRGFDRKTRGPKPLIARLHFPASKL 1385


GO:0005634 "nucleus" evidence=ISM;IDA
GO:0006281 "DNA repair" evidence=IEA;IMP
GO:0006284 "base-excision repair" evidence=IEA
GO:0003906 "DNA-(apurinic or apyrimidinic site) lyase activity" evidence=ISS;IDA
GO:0006342 "chromatin silencing" evidence=IMP
GO:0019104 "DNA N-glycosylase activity" evidence=ISS;IDA;TAS
GO:0005515 "protein binding" evidence=IPI
GO:0006306 "DNA methylation" evidence=RCA;IDA
GO:0031936 "negative regulation of chromatin silencing" evidence=IMP
GO:0080111 "DNA demethylation" evidence=IMP
TAIR|locus:2184432 DME "DEMETER" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2100138 DML2 "demeter-like 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124301 DML3 "demeter-like protein 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4515103342 AT4G04957 "AT4G04957" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.2.2LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00034990001
SubName- Full=Chromosome chr13 scaffold_74, whole genome shotgun sequence; (975 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query901
COG0177211 COG0177, Nth, Predicted EndoIII-related endonuclea 8e-06
smart0052521 smart00525, FES, iron-sulpphur binding domain in D 0.003
>gnl|CDD|223255 COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Back     alignment and domain information
 Score = 47.6 bits (114), Expect = 8e-06
 Identities = 31/152 (20%), Positives = 46/152 (30%), Gaps = 46/152 (30%)

Query: 441 DWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSV---DLEWLRDVP---- 493
             E +  AD  ++   IK  G+    A  IK+    L+   G       E L  +P    
Sbjct: 60  TPEDLLNADEEELEELIKSIGLYRNKAKNIKELARILLEKFGGEVPDTREELLSLPGVGR 119

Query: 494 --------------------------------PDKAKYPVLESIQKYLWPRLCKLDQRTL 521
                                           P K    V E++ K +   L        
Sbjct: 120 KTANVVLSFAFGIPAIAVDTHVHRVSNRLGLVPGKTPEEVEEALMKLIPKEL-------W 172

Query: 522 YELHYQMITFGKVFCTKSKPNCNACPMRGECR 553
            +LH+ +I  G+  C   KP C  CP+   C 
Sbjct: 173 TDLHHWLILHGRYICKARKPRCEECPLADLCP 204


Length = 211

>gnl|CDD|197771 smart00525, FES, iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 901
PF15628103 RRM_DME: RRM in Demeter 100.0
COG2231215 Uncharacterized protein related to Endonuclease II 99.97
PRK10702211 endonuclease III; Provisional 99.95
COG0177211 Nth Predicted EndoIII-related endonuclease [DNA re 99.95
PRK10880350 adenine DNA glycosylase; Provisional 99.92
TIGR01084275 mutY A/G-specific adenine glycosylase. This equiva 99.91
PRK13913218 3-methyladenine DNA glycosylase; Provisional 99.89
TIGR01083191 nth endonuclease III. This equivalog model identif 99.86
PRK13910289 DNA glycosylase MutY; Provisional 99.85
KOG1921286 consensus Endonuclease III [Replication, recombina 99.83
COG1194342 MutY A/G-specific DNA glycosylase [DNA replication 99.74
TIGR03252177 uncharacterized HhH-GPD family protein. This model 99.56
smart00478149 ENDO3c endonuclease III. includes endonuclease III 99.51
PRK01229208 N-glycosylase/DNA lyase; Provisional 99.5
cd00056158 ENDO3c endonuclease III; includes endonuclease III 99.38
KOG2457555 consensus A/G-specific adenine DNA glycosylase [Re 99.33
PF1562932 Perm-CXXC: Permuted single zf-CXXC unit 99.18
PF00730108 HhH-GPD: HhH-GPD superfamily base excision DNA rep 98.89
TIGR00588310 ogg 8-oxoguanine DNA-glycosylase (ogg). All protei 98.44
PRK10308283 3-methyl-adenine DNA glycosylase II; Provisional 97.18
COG0122285 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine 96.89
PF1057617 EndIII_4Fe-2S: Iron-sulfur binding domain of endon 96.45
KOG2875323 consensus 8-oxoguanine DNA glycosylase [Replicatio 96.43
smart0052526 FES FES domain. iron-sulpphur binding domain in DN 95.55
COG1059210 Thermostable 8-oxoguanine DNA glycosylase [DNA rep 92.47
>PF15628 RRM_DME: RRM in Demeter Back     alignment and domain information
Probab=100.00  E-value=2.5e-70  Score=497.02  Aligned_cols=103  Identities=82%  Similarity=1.488  Sum_probs=102.1

Q ss_pred             eeEeeeccccccCCccCCccceeeeeeeeccCCCCCCcccchhhhccCCceeEEecCChhhhcccCCHHHHHHhhhccce
Q 002607          784 GTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYV  863 (901)
Q Consensus       784 gTiLiPCRTAmrG~FPLNGTYFQvNEVFaDh~SS~~Pi~vpr~~iw~L~rr~vyfGtSv~sIfkglt~~~Iq~cF~~G~v  863 (901)
                      |||||||||||||+|||||||||||||||||+||+|||+|||+|||+|+||+|||||||+|||||||++|||+|||+|||
T Consensus         1 gtiLiPCrtAmrg~FPLnGTYFQ~nEVFaD~~Ss~~Pi~vpr~~iw~L~rr~vYfGtsv~SIfrGls~~eIq~cF~~G~V   80 (103)
T PF15628_consen    1 GTILIPCRTAMRGSFPLNGTYFQVNEVFADHESSRNPIDVPRELIWNLPRRIVYFGTSVSSIFRGLSREEIQQCFWKGFV   80 (103)
T ss_pred             CceEeeehhhhCCccccCceEEEEEEEEeccccccCCeeccHHHhhcCCceEEEecCcHHHHhcccCHHHHHHHHhcCcE
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EeecccccccCCcccccccccCc
Q 002607          864 CVRGFDQKSRAPRPLMARLHFPA  886 (901)
Q Consensus       864 CVR~fdr~tr~PrpL~~rlH~~~  886 (901)
                      |||||||+||+||||++|||+++
T Consensus        81 CVR~FDr~Tr~PrpL~~rLH~~~  103 (103)
T PF15628_consen   81 CVRGFDRKTRAPRPLCARLHFPA  103 (103)
T ss_pred             EEeecccccCCCcchhhhccCCC
Confidence            99999999999999999999985



>COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair] Back     alignment and domain information
>PRK10702 endonuclease III; Provisional Back     alignment and domain information
>COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Back     alignment and domain information
>PRK10880 adenine DNA glycosylase; Provisional Back     alignment and domain information
>TIGR01084 mutY A/G-specific adenine glycosylase Back     alignment and domain information
>PRK13913 3-methyladenine DNA glycosylase; Provisional Back     alignment and domain information
>TIGR01083 nth endonuclease III Back     alignment and domain information
>PRK13910 DNA glycosylase MutY; Provisional Back     alignment and domain information
>KOG1921 consensus Endonuclease III [Replication, recombination and repair] Back     alignment and domain information
>COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information
>TIGR03252 uncharacterized HhH-GPD family protein Back     alignment and domain information
>smart00478 ENDO3c endonuclease III Back     alignment and domain information
>PRK01229 N-glycosylase/DNA lyase; Provisional Back     alignment and domain information
>cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Back     alignment and domain information
>KOG2457 consensus A/G-specific adenine DNA glycosylase [Replication, recombination and repair] Back     alignment and domain information
>PF15629 Perm-CXXC: Permuted single zf-CXXC unit Back     alignment and domain information
>PF00730 HhH-GPD: HhH-GPD superfamily base excision DNA repair protein This entry corresponds to Endonuclease III This entry corresponds to Alkylbase DNA glycosidase; InterPro: IPR003265 Endonuclease III (4 Back     alignment and domain information
>TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg) Back     alignment and domain information
>PRK10308 3-methyl-adenine DNA glycosylase II; Provisional Back     alignment and domain information
>COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information
>PF10576 EndIII_4Fe-2S: Iron-sulfur binding domain of endonuclease III; InterPro: IPR003651 Endonuclease III (4 Back     alignment and domain information
>KOG2875 consensus 8-oxoguanine DNA glycosylase [Replication, recombination and repair] Back     alignment and domain information
>smart00525 FES FES domain Back     alignment and domain information
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query901
1pu6_A218 3-methyladenine DNA glycosylase; helix-hairpin-hel 9e-07
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-06
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-05
>1pu6_A 3-methyladenine DNA glycosylase; helix-hairpin-helix, base excision repair, hydrolase; HET: KCX; 1.64A {Helicobacter pylori} SCOP: a.96.1.5 PDB: 1pu7_A* 1pu8_A* Length = 218 Back     alignment and structure
 Score = 49.7 bits (119), Expect = 9e-07
 Identities = 22/161 (13%), Positives = 46/161 (28%), Gaps = 51/161 (31%)

Query: 429 KKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSV---- 484
           K        D ++ + +   + +K+A  ++  G  N  A R+ D    +++D  S     
Sbjct: 56  KNAFILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDLSGNILKDFQSFENFK 115

Query: 485 ---DLEWLRDVP---------------------------------------PDKAKYPVL 502
                EWL D                                          D+ ++   
Sbjct: 116 QEVTREWLLDQKGIGKESADAILCYACAKEVMVVDKYSYLFLKKLGIEIEDYDELQHFFE 175

Query: 503 ESIQKYLWPRLCKLDQ-----RTLYELHYQMITFGKVFCTK 538
           + +Q+ L   L   +      +     H +++ F K     
Sbjct: 176 KGVQENLNSALALYENTISLAQLYARFHGKIVEFSKQKLEL 216


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 901
d1keaa_217 a.96.1.2 (A:) Thymine-DNA glycosylase {Archaeon Me 5e-07
d1orna_214 a.96.1.1 (A:) Endonuclease III {Escherichia coli [ 5e-06
d1kg2a_224 a.96.1.2 (A:) Catalytic domain of MutY {Escherichi 6e-06
d1rrqa1221 a.96.1.2 (A:9-229) Catalytic domain of MutY {Bacil 9e-06
d2abka_211 a.96.1.1 (A:) Endonuclease III {Escherichia coli [ 2e-05
d1pu6a_217 a.96.1.5 (A:) 3-Methyladenine DNA glycosylase III 3e-04
>d1keaa_ a.96.1.2 (A:) Thymine-DNA glycosylase {Archaeon Methanobacterium thermoformicicum [TaxId: 145262]} Length = 217 Back     information, alignment and structure

class: All alpha proteins
fold: DNA-glycosylase
superfamily: DNA-glycosylase
family: Mismatch glycosylase
domain: Thymine-DNA glycosylase
species: Archaeon Methanobacterium thermoformicicum [TaxId: 145262]
 Score = 49.2 bits (116), Expect = 5e-07
 Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 1/60 (1%)

Query: 497 AKYPVLESIQKYLWPRLCKL-DQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHF 555
             Y  L    K LW     L       + +  ++ F  + C   KP C  C M   C ++
Sbjct: 155 GSYENLNYNHKALWELAETLVPGGKCRDFNLGLMDFSAIICAPRKPKCEKCGMSKLCSYY 214


>d1orna_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Length = 214 Back     information, alignment and structure
>d1kg2a_ a.96.1.2 (A:) Catalytic domain of MutY {Escherichia coli [TaxId: 562]} Length = 224 Back     information, alignment and structure
>d1rrqa1 a.96.1.2 (A:9-229) Catalytic domain of MutY {Bacillus stearothermophilus [TaxId: 1422]} Length = 221 Back     information, alignment and structure
>d2abka_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Length = 211 Back     information, alignment and structure
>d1pu6a_ a.96.1.5 (A:) 3-Methyladenine DNA glycosylase III (MagIII) {Helicobacter pylori [TaxId: 210]} Length = 217 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query901
d2abka_211 Endonuclease III {Escherichia coli [TaxId: 562]} 99.96
d1keaa_217 Thymine-DNA glycosylase {Archaeon Methanobacterium 99.96
d1kg2a_224 Catalytic domain of MutY {Escherichia coli [TaxId: 99.96
d1rrqa1221 Catalytic domain of MutY {Bacillus stearothermophi 99.95
d1orna_214 Endonuclease III {Escherichia coli [TaxId: 562]} 99.95
d1pu6a_217 3-Methyladenine DNA glycosylase III (MagIII) {Heli 99.88
d1ngna_144 Mismatch-specific thymine glycosylase domain of th 99.46
d2noha1190 8-oxoguanine glycosylase {Human (Homo sapiens) [Ta 98.77
d1mpga1183 3-Methyladenine DNA glycosylase II (gene alkA or a 98.05
>d2abka_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA-glycosylase
superfamily: DNA-glycosylase
family: Endonuclease III
domain: Endonuclease III
species: Escherichia coli [TaxId: 562]
Probab=99.96  E-value=6.3e-31  Score=222.75  Aligned_cols=147  Identities=19%  Similarity=0.147  Sum_probs=129.5

Q ss_pred             CCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHC
Q ss_conf             23344841455679689999999608998898876999999999719999999997404962999999999999999970
Q 002607          402 KSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDH  481 (901)
Q Consensus       402 k~~~~~~iL~QNTsWeNVeKAl~n~Lkk~er~~~~~gllDpEaLA~Ad~EELeeLIRpaGFYr~KAkRIKalAk~Ivee~  481 (901)
                      -..=++.||.|+|.|++|.+++.+ |.+        -.++|++++.|+.++|+++|+++|||++||++|++++++|+++|
T Consensus        30 ~~vli~~iLsqqT~~~~v~~~~~~-l~~--------~~~t~~~la~a~~~~l~~~i~~~G~y~~Ka~~l~~~a~~i~~~~  100 (211)
T d2abka_          30 FELLIAVLLSAQATDVSVNKATAK-LYP--------VANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH  100 (211)
T ss_dssp             HHHHHHHHHTTTSCHHHHHHHHHH-HTT--------TCCSHHHHHHHHHHHHHHHHTTSTTHHHHHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHCCCHHHHHHHHHHH-HHH--------HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             999999999562639999999998-887--------41472788765576767777775441157889999988899871


Q ss_pred             CCC------CCCCCCCCCCCCCC-----------CCCHHHHHHHHHH--------------H-HCCCCHHHHHHHHHHHH
Q ss_conf             995------65325789934569-----------8423879999965--------------3-11599998999999999
Q 002607          482 GSV------DLEWLRDVPPDKAK-----------YPVLESIQKYLWP--------------R-LCKLDQRTLYELHYQMI  529 (901)
Q Consensus       482 Ggi------DLE~LrGIG~ETAd-----------YPVdt~VqR~Le~--------------~-L~~ld~e~l~EfHaLLI  529 (901)
                      ||.      +|+.|+|||++||+           ++||++|+|.+..              . ...+|.+.+.+||++||
T Consensus       101 ~g~~p~~~~~L~~LpGVG~~TA~~Il~~a~~~p~~~vD~nv~Rv~~R~g~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~  180 (211)
T d2abka_         101 NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQFAPGKNVEQVEEKLLKVVPAEFKVDCHHWLI  180 (211)
T ss_dssp             TTSCCSCHHHHHHSTTCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCSSCCSSHHHHHHHHHHHSCGGGTTTHHHHHH
T ss_pred             CCCHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHH
T ss_conf             48604779999999860517899999998568032256799999997066754204678888860486034899999999


Q ss_pred             HHHHHHCCCCCCCCCCCCCCCCCHHHHH
Q ss_conf             9816331258988888997200234430
Q 002607          530 TFGKVFCTKSKPNCNACPMRGECRHFAS  557 (901)
Q Consensus       530 efGK~iCtarKPkC~aCPLrd~C~yy~s  557 (901)
                      +||+.+|++++|+|+.|||++.|+||..
T Consensus       181 ~~G~~vC~~~~P~C~~Cpl~~~C~~~~k  208 (211)
T d2abka_         181 LHGRYTCIARKPRCGSCIIEDLCEYKEK  208 (211)
T ss_dssp             HHHHHTSCSSSCCGGGCTTGGGCCCTTC
T ss_pred             HHHHHHCCCCCCCCCCCCCCCCCCCCCC
T ss_conf             9855552599989999979055898676



>d1keaa_ a.96.1.2 (A:) Thymine-DNA glycosylase {Archaeon Methanobacterium thermoformicicum [TaxId: 145262]} Back     information, alignment and structure
>d1kg2a_ a.96.1.2 (A:) Catalytic domain of MutY {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1rrqa1 a.96.1.2 (A:9-229) Catalytic domain of MutY {Bacillus stearothermophilus [TaxId: 1422]} Back     information, alignment and structure
>d1orna_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1pu6a_ a.96.1.5 (A:) 3-Methyladenine DNA glycosylase III (MagIII) {Helicobacter pylori [TaxId: 210]} Back     information, alignment and structure
>d1ngna_ a.96.1.2 (A:) Mismatch-specific thymine glycosylase domain of the methyl-GpG binding protein mbd4 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2noha1 a.96.1.3 (A:136-325) 8-oxoguanine glycosylase {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1mpga1 a.96.1.3 (A:100-282) 3-Methyladenine DNA glycosylase II (gene alkA or aidA) {Escherichia coli [TaxId: 562]} Back     information, alignment and structure