Citrus Sinensis ID: 000414


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------840-------850-------860-------870-------880-------890-------900-------910-------920-------930-------940-------950-------960-------970-------980-------990------1000------1010------1020------1030------1040------1050------1060------1070------1080------1090------1100------1110------1120------1130------1140------1150------1160------1170------1180------1190------1200------1210------1220------1230------1240------1250------1260------1270------1280------1290------1300------1310------1320------1330------1340------1350------1360------1370------1380------1390------1400------1410------1420------1430------1440------1450------1460------1470------1480------1490------1500------1510------1520------1530------1540-
MKNHRPRVLVEGMPKKEPKPKKAPQQPKTPKPVTPNRAHETTRKRKYVRKNSVSGNVVMDESVSKRRKCGRGNRQSNSDAIILLPPPPIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDECVDAREQRENNKTRSNRRSRRRRRLNILFMRNLPVGQGQYPRRKRRLRRATRRCNLTVLTAIPLCSQLPKLPSKQAFPGEKATPICSQPPKLPRKQAAAIEKNEAEVIYEPEASKDMLLTDNLEKNVAGRTIEPSVVVDLELQGNKSEGSPFKYEDFPLASYLTGVPRKKRKHPLELENFGLMDQAIDSQKNLQEPGLGLQEGFQEPRIGESPHFQEQKIDSLAPFPGQSSGKTLPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQSNSSLEFFPRQAQAINSQAEFLKEQVMYRLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQNHITQSSVTQFWPTGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFRTGMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDES
ccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccHHHHcccccccccccHHHHccccccccEEEHHHHHHHHcccccccHHHHHHHHHHcccccccccccccccccHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccHHHHHHHHcccccEEEccccccccccccccccccccccccccEEEEEEccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccHHHHHHHHHHcHHHHHHHHHHHHHHHHHHccccHHHHHHHcHHcccccccccccccccccccccccccccccccccccccccccccccHHHHHHcccHHHHHHHHHcccccccccccccccccccccccccccEEEEcccccccccccEEEcccccccccccccccccHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccEEEEEccccccccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHcccccccccccccccHHHHHcccHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHccccHHHHccccHHHHHHHHHHcccccHHHHHHHHHHccccccEEEccHHHHHHHHHccccccccHHHHHHHHHHHcccHHHHHHHHccHHHcccHHHHHHHHHHHHHHHcccccccccccccccccccccHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccHHHHHHccHHHHHHcccccccccccEEccccccccccccccccccccccEEEEEcccccccccccccccccccccEEEEEEccccccccccccccccccccccccccccccccccHHcccccEEEEEEEEEcccccccccccccccEEEEEEEcccccccccccccHHHHHccccEEEEEcccHHHHHccccHHHHHHHHHccEEEEEcccccccccccccccccccccccccccccccccc
ccccccEEEEEcccccccccccccccccccccccHHHHHHHHHHcHHEEEccccccEEEccccccccccccccccccccEEEEccccccHHHHHHHcccHHcccccccEEEEEccHccHHHHHHHHHccccccccEEEcccccHHHHHHHccccccHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHccHHHHHHcHHHccccccccccccccccccccccccccccHHHHHHHHHcccEEEEcccccHHHHHHccHHHHcccccEcccEEEEEEEccccccccccccccccHHHHHccccccccccccEEEccccccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccHHHcccccHHHcccHcccccccEcccccccccccccccEEEcEEcccccccccccccccccEEccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccEEEEcccccccccccEEEcccccccccccccccccHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccEEEcEEEEEEcccccccccHHHHHHHHHccccccccccccccccccccEEEcccccccccccccccccccccccccccccccEEEccccccccccccHHHHcccccccEEEccccccccccccccccccEEEcccccccccccccccccccEEEEEEEccccccccccccEcccccccHHHccccccccccccHcccccccccccHHHccccccEEEcccccccccccccccccccccccccccccHHHccccccccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHccEHEccccHHHHHHHHcccHHHHHHHcccccHHHHHHccHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHccccHHHHHcccHHHHHHHHHccccccHHHHHHHHHHHcccccccccccHHHHHHHcccccccccHHHHHHHHHHHccHHHHHHHHHcHHHccccHHHHHHHHHHHHHHccHEcccccccccccccHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccccccccHHHHHHcccccccHHHcccEcccHHHHHHHHHHHHHHccccHHHcccEEEEcccHccccccccccccHcccccEEEEcccccHHHcccccccccccccEEEEEEccccccccccccccccccccccccccccccccccHHHccccEEEEEEEEEccHccccccccccEEEEEEEEEEccccccccccccHHHHHccccEEEEEcccHHHHHccccHHHHHHHHHccEEEEEcccccccccccHHHHccccccHcccccccccccc
mknhrprvlvegmpkkepkpkkapqqpktpkpvtpnrahettrkrkyvRKNSvsgnvvmdesvskrrkcgrgnrqsnsdaiillppppiktCRQALNFdlenqsrddKAIIIAipecgvqalDECAFTKATEHGVQVVDECVDAREqrennktrsnrRSRRRRRLNILFmrnlpvgqgqyprrKRRLRRATRRCNLTVLTaiplcsqlpklpskqafpgekatpicsqppklprKQAAAIEKNeaeviyepeaskdmlltdnleknvagrtiepSVVVDLElqgnksegspfkyedfplasyltgvprkkrkhplelenfglmDQAIDSqknlqepglglqegfqeprigesphfqeqkidslapfpgqssgktlpfpghfisglpqlnigvlgllpelsigavepfldqsnssleffpRQAQAINSQAEFLKEQVMYRLDQLRKqingpssqlqEQNRVIQrkswnpkglwftaksrgsqKIIQVErnglptqkmpprysrglqvDMEQQLMQSTTADIIRKFKdltirdggsqlplqcnkadgpsnvhnavvpyqvgpssehgalvpHQIKEKrkklkpevvldpgtlrMWNLImniddgttkdqtsnEDMEKWWQKEREVFEGRIQSFTARMHLILgdrrfkpwkgsvvDSVVGVYLTQNvsdnlsssAYMSLAarfplkstndhtpsdenlrttaslepigsnstsngavydsegnmyfvtepepdrccelkdrdDAFDSRIQRKALqengdikvmtdavpsqafdtssvqsldrtqlfptgnskadvasstktsnaESFITQFshtgnlkknsvnqlfptvnskadvaspsqnhitqssvtqfwptgnstadvaspsktcikessiaasteipqlentallqdkvdgilfcdecldgytkptridngnqastsgrndlksdfrsisssdfndpfeisvlpsnresfrtgmpqahdatttskksprgkgkskeyksdmkndrtkkttpkknsdntvQQDWDLLRRIYStgeershdkmdsvdweavrCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHgsidlewlrnvppdkvKEYLLDIEGLGLKSVECVRLLSLQhiafpvdvnvGRIAVrlgwvplqplpgdlhihllkeypvmdkiqmylwprlcyldqKTLYELHYQMITFGKifckkrspncgacplrgecKHFASAVASarfalpgpsekgivtsefgngigqspplvvnpipvirieadpvsesgyqinncepiieeprspgpqcsespeseiddddfsigdieeiptlrLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCahlrtehhvyvlpdahellrgfdrrdpddpspyllaiwspgespnsvgspqyrcnskdsklcnseicyscnsvcernagivrgtilipcrtanrgrfplngtyfqvnevfadhetshhpinvprhtIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFvcvrgfdrryrcprplvnrlhcspskkeagskrsrdes
mknhrprvlvegmpkkepkpkkapqqpktpkpvtpnrahettrkrkyvrknsvsgnvvmdesvskrrkcgrgnrqsnsdaiillppppIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDecvdareqrennktrsnrrsrrrrrlnilfmrnlpvgqgqyprrkrrlrratrrcnltvltaiplcsqlpkLPSKQAFPGEKatpicsqppklPRKQAAAIEKNEAEViyepeaskdmlltDNLEKNVAGRTIEPSVVVDLELqgnksegspfkyedfpLASYLTGVPRKKRKHPLELENFGLMDQAIDSQKNLQEPGLGLQEGFQEPRIGESPHFQEQKIDSLAPFPGQSSGKTLPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQSNSSLEFFPRQAQAINSQAEFLKEQVMYRLDQLRKQIngpssqlqeqNRVIqrkswnpkglwftaksrgsqkiiqvernglptqkmpprYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQikekrkklkpevvldpgtlrmwNLIMniddgttkdqtsNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLrttaslepigsnstsngavydSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRkalqengdikvmTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQfshtgnlkkNSVNQLFPTVNSKADVASPSQNHITQSSVTQFWPTGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILFCDECLDGytkptridngnqastsgrndlKSDFRSisssdfndpfEISVLPSNRESFRtgmpqahdatttskksprgkgkskeyksdmkndrtkkttpkknsdntvqqdwDLLRRIYStgeershdkmdsvdWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGsidlewlrnvppDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESpeseiddddfsiGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIwspgespnsvgSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCprplvnrlhcspskkeagskrsrdes
MKNHRPRVLVEGMpkkepkpkkapqqpktpkpVTPNRAHETTRKRKYVRKNSVSGNVVMDESVSKRRKCGRGNRQSNSDAIILLPPPPIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDECVDAREQrennktrsnrrsrrrrrlnilfmrnlPVGQGQYPrrkrrlrratrrCNLTVLTAIPLCSQLPKLPSKQAFPGEKATPICSQPPKLPRKQAAAIEKNEAEVIYEPEASKDMLLTDNLEKNVAGRTIEPSVVVDLELQGNKSEGSPFKYEDFPLASYLTGVPRKKRKHPLELENFGLMDQAIDSQKNLQEPGLGLQEGFQEPRIGESPHFQEQKIDSLAPFPGQSSGKTLPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQSNSSLEFFPRQAQAINSQAEFLKEQVMYRLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQNHITQSSVTQFWPTGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFRTGMPQAHDAtttskksprgkgkskEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCsespeseiddddfsigdieeiPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDES
********************************************************************************IILLPPPPIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDECV**********************LNILFMRNLPVGQGQYPR*KRRLRRATRRCNLTVLTAIPLCSQL**********************************************************VAGRTIEPSVVVDLE***********KYEDFPLASYLTGV********************************************************************LPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQ****LEFFPR***AINSQAEFLKEQVMYRLD***********************SWNPKGLWFTA******************************************ADIIRKFKDLTIRD********************AVVPY*************************EVVLDPGTLRMWNLIMNIDD*************KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNL****************************************************MYFV*********************************************************************************************************************************************************PQLENTALLQDKVDGILFCDECLDGYTK**************************************************************************************************WDLLRRIY*************VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGP*EKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNC**********************************I*TLR********NFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGF**********YLLAIW*********************KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH*****************
****RPR************************************************************************************CRQALNFD********************************EHGVQV********************************************************CNLTVLTAIPL*************************************KNEAEVIYEPEASK****************I***V*******************DFPLASYLT**********LELENFG***********************************************************FISGLPQLNIGVLGLLPELSIG***********************N****************************************************************************************STTADIIRKFKDLTIRD******************HNAVVPYQVGPSSEHGALVPHQIK*********VVLDPGTLRMWNLIMNI***************KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAAR**********************************************************************************************************************************************************************************************************************************************************************************************************************************QDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF************************************************************PIIEEPRSPGP****SPESEIDDDDFSIGDIEEIPTLRLQEREFKENFP*****************ALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGE************************CYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRL******************
MKNHRPRVLVEGM*****************************RKRKYVRKNSVSGNVVMDE**************SNSDAIILLPPPPIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDECVDAR**************RRRRRLNILFMRNLPVGQ**********RRATRRCNLTVLTAIPLCSQLPKLPSKQAFPGEKATPICSQPP*********IEKNEAEVIYEPEASKDMLLTDNLEKNVAGRTIEPSVVVDLELQGNKSEGSPFKYEDFPLASYLTGVPRKKRKHPLELENFGLMDQAIDSQKNLQEPGLGLQEGFQEPRIGESPHFQEQKIDSLAPFPGQSSGKTLPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQSNSSLEFFPRQAQAINSQAEFLKEQVMYRLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT*********EKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPSQAFDTSSVQSLDRTQLFPT***************AESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQNHITQSSVTQFWPTGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFR**********************************************TVQQDWDLLRRIYSTG*********SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEE****************DDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGE**************KDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS***************
****RPRVLVEG*PKK************TPKPVTPNRAHETTRKRKYVRKNSVS*N********************NSDAIILLPPPPIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDECVDAREQR***********RRRRRLNILFMRNLPVGQGQYPRRKRRLRRATRRCNLTVLTAIPLCSQLPKLPSKQAFPGEKATPICSQPPKLPRKQAAAIEKNEAEVIYEPEASKDMLLTDNLEKNVAGRTIEPSVVVDLELQGNKSEGSPFKYEDFPLASYLTGVPRKKRKHPLELENFGLMDQAI*************************************P******GKTLPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQSN**L*FF*RQ********************QLRKQ****SSQ****NRVIQRKSWN**************KIIQVERNGLPTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLP***N******NVHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNID********SNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL***************TASL******************************CCE*K*RDDAFDSRIQRKALQENGDIKVMTD**PS*AFDTSSVQSLDRTQLFPT*NSKADVASS***SNAESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQN*IT*S*VTQF*********************************NT*************************************************************************************************************KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP***********************VNP*PVIRI*******SGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGE*******************C**EICYSCNSVC*RNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS***************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKNHRPRVLVEGMPKKEPKPKKAPQQPKTPKPVTPNRAHETTRKRKYVRKNSVSGNVVMDESVSKRRKCGRGNRQSNSDAIILLPPPPIKTCRQALNFDLENQSRDDKAIIIAIPECGVQALDECAFTKATEHGVQVVDECVDAREQRENNKTRSNRRSRRRRRLNILFMRNLPVGQGQYPRRKRRLRRATRRCNLTVLTAIPLCSQLPKLPSKQAFPGEKATPICSQPPKLPRKQAAAIEKNEAEVIYEPEASKDMLLTDNLEKNVAGRTIEPSVVVDLELQGNKSEGSPFKYEDFPLASYLTGVPRKKRKHPLELENFGLMDQAIDSQKNLQEPGLGLQEGFQEPRIGESPHFQEQKIDSLAPFPGQSSGKTLPFPGHFISGLPQLNIGVLGLLPELSIGAVEPFLDQSNSSLEFFPRQAQAINSQAEFLKEQVMYRLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQNHITQSSVTQFWPTGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFRTGMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDES
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query1541 2.2.26 [Sep-21-2011]
Q8LK561987 Transcriptional activator no no 0.340 0.263 0.574 1e-179
Q9SJQ61393 Protein ROS1 OS=Arabidops no no 0.334 0.370 0.579 1e-177
Q9SR661332 DEMETER-like protein 2 OS no no 0.439 0.509 0.450 1e-164
O494981044 DEMETER-like protein 3 OS no no 0.340 0.501 0.498 1e-143
Q9WYK0213 Endonuclease III OS=Therm no no 0.090 0.657 0.299 1e-06
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1 SV=2 Back     alignment and function desciption
 Score =  629 bits (1621), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)

Query: 1004 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 1060
            WD LR+     E   ER+ + MDS+D+EA+R A  SEI++AIKERG  N++A RIK+FL 
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509

Query: 1061 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 1120
            R+V+ HG IDLEWLR  PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569

Query: 1121 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 1180
            VR+GWVPLQPLP  L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629

Query: 1181 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 1240
             K  PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++         PP+ +  I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689

Query: 1241 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 1299
             + +E    S +     NCEPIIEEP SPG +C+E  ES+I+D  ++  D +EIPT++L 
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748

Query: 1300 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 1356
              +F       ME N + LQ+   S ALVAL     S+PT KLK  + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807

Query: 1357 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 1415
            +H LL G D+R+PDDPSPYLLAIW+PGE+ NS   P+ +C  K S K+C  E C  CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867

Query: 1416 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1475
             E N+  VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S  PI+VPR  I +L R 
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927

Query: 1476 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1529
             V FG+S T++F  L   QIQ  FW+GFVCVRGF+++ R PRPL+ RLH   SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981




Transcriptional activator involved in gene imprinting. Catalyzes the release of 5-methylcytosine (5-meC) from DNA by a glycosylase/lyase mechanism. Allows the expression of the maternal copy of the imprinted MEA gene before fertilization, possibly by antagonizing or suppressing DNA methylation on target promoter. Probably acts by nicking the MEA promoter. Required for stable reproducible patterns of floral and vegetative development.
Arabidopsis thaliana (taxid: 3702)
EC: 3EC: .EC: 2EC: .EC: 2EC: .EC: -
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2 Back     alignment and function description
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2 Back     alignment and function description
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2 Back     alignment and function description
>sp|Q9WYK0|END3_THEMA Endonuclease III OS=Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099) GN=nth PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1541
356537037 1903 PREDICTED: uncharacterized protein LOC10 0.605 0.490 0.442 0.0
449456184 1810 PREDICTED: uncharacterized protein LOC10 0.604 0.514 0.422 0.0
4495305361176 PREDICTED: transcriptional activator DEM 0.598 0.784 0.428 0.0
356546049 1881 PREDICTED: protein ROS1-like [Glycine ma 0.347 0.284 0.592 0.0
225449724 2198 PREDICTED: transcriptional activator DEM 0.340 0.238 0.604 0.0
330370553 1758 repressor of gene silencing 1 [Gossypium 0.347 0.304 0.579 0.0
449512809 1736 PREDICTED: transcriptional activator DEM 0.351 0.311 0.588 0.0
449465553 1679 PREDICTED: transcriptional activator DEM 0.351 0.322 0.588 0.0
2960904001621 unnamed protein product [Vitis vinifera] 0.340 0.323 0.604 0.0
138996985 1796 repressor of silencing 1 [Nicotiana taba 0.352 0.302 0.566 0.0
>gi|356537037|ref|XP_003537037.1| PREDICTED: uncharacterized protein LOC100806794 [Glycine max] Back     alignment and taxonomy information
 Score =  748 bits (1931), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/1070 (44%), Positives = 611/1070 (57%), Gaps = 137/1070 (12%)

Query: 535  QLPLQCNKADGPSNVHNAVVPYQVGPSSEHG--ALVPHQIKEKRKKLKPEVVLDPGTLRM 592
            QL L     D   +  NA+VPY+   S  HG   +VP  IK  ++ L+P+V LD  T R+
Sbjct: 850  QLNLNAEARDLAFHEENALVPYKQKNSLIHGDGVIVPFHIK--KQHLRPKVNLDDETDRV 907

Query: 593  WNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVV 652
            W L++ +D  +     ++ED  KWW++ER VF GR  SF ARMHL+ GDRRF  WKGSVV
Sbjct: 908  WKLLL-LDINSHGIDGTDEDKAKWWEEERNVFRGRADSFIARMHLVQGDRRFSRWKGSVV 966

Query: 653  DSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTN---DHTPSDENLRTTAS----LEPIG 705
            DSVVGV+LTQNV+D+LSSSA+MSLAARFP  S++    H   D  L         +EP  
Sbjct: 967  DSVVGVFLTQNVTDHLSSSAFMSLAARFPKNSSSMCKRHHKEDTRLVVNEPQVHIVEPEE 1026

Query: 706  SNSTS----NGAVYDS---------EGNMYFVTEPE----PDRCCELKDRDDAFDSRIQR 748
            S        N +VYD           G    V   E    P     L D  ++  S + +
Sbjct: 1027 STEWDVKLLNQSVYDQTSTIDMAEHSGEKEAVNSNESCGTPSSVISLTDESNSRLSELPQ 1086

Query: 749  KALQE-----------------------NGDIKVMTDAVPSQ--------AFDTSSVQSL 777
            K ++E                       NGD K + D V SQ        + D S+ Q+ 
Sbjct: 1087 KNIKEHCSPTRSGILSATIEEGEEKSCYNGDRKELNDIVSSQGSVFSSQISGDFSNDQNP 1146

Query: 778  DRTQLFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQ 837
            ++       NS+ +V SST   N     T FS    L+  S  + +   + K++    S 
Sbjct: 1147 EKIGSCSDSNSEVEVLSSTAKYNHFGSNTSFSKL--LEMVSSTKFYEDNSQKSE----SI 1200

Query: 838  NHITQSSVTQFWP--TGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILF 895
             +     V  F P  T  ST+D+    +  +  SS+  +    Q+  T            
Sbjct: 1201 ENSGMLEVNGFDPFKTEASTSDLKKKDENGMNRSSLQTTEPAGQVAITH----------- 1249

Query: 896  CDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSIS--SSDFNDPFEISVLPSNRESFRT 953
              + +     P    N  Q S          F +IS  + D       S L   + + R 
Sbjct: 1250 -SQSIASQVHPREQSNHQQQS----------FFNISGQTQDLMQKERGSGLGEQKNATRN 1298

Query: 954  GMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLR---RI 1010
            G  +   A    K   +GK K  ++                        +WD LR   + 
Sbjct: 1299 GTNEISSAPIKLKTKEQGKEKKDDF------------------------NWDSLRIDAQA 1334

Query: 1011 YSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 1070
             +   E++ + MDS+DW+AVRCAD SEIA+ IKERG  N +A RIK FLNRLVE H SID
Sbjct: 1335 KAGKREKTENTMDSLDWDAVRCADVSEIAETIKERGMNNRLAERIKNFLNRLVEEHESID 1394

Query: 1071 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 1130
            LEWLR+VPPDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQP
Sbjct: 1395 LEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1454

Query: 1131 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 1190
            LP  L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K  PNC AC
Sbjct: 1455 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKSKPNCNAC 1514

Query: 1191 PLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN----PIPVIRIEAD 1246
            P+R EC+HFASA ASARFALPGP +K IV++   + I Q+P  +++    P P    + D
Sbjct: 1515 PMRAECRHFASAFASARFALPGPEQKSIVSTTGNSVINQNPSEIISQLHLPPPENTAQED 1574

Query: 1247 PVS--------ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRL 1298
             +         ES ++IN C+PIIEEPR+P P+C +  ES+ D +D    D  EIPT+ L
Sbjct: 1575 EIQLTEVSRQLESKFEINICQPIIEEPRTPEPECLQ--ESQTDIEDAFYEDSSEIPTINL 1632

Query: 1299 QEREFKENFPNFMEMNKVML--QDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 1356
               EF  N  N+M+ N  +   + S ALVAL  +AAS+P  KLK    LRTEH VY LPD
Sbjct: 1633 NIEEFTLNLQNYMQENMELQGGEMSKALVALNPQAASIPMPKLKNVGRLRTEHCVYELPD 1692

Query: 1357 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD--SKLCNSEICYSCNS 1414
             H LL+G+D R+PDDP  YLLAIW+PGE+ NS+  P+  C+S++   +LCN + C+SCNS
Sbjct: 1693 THPLLQGWDTREPDDPGKYLLAIWTPGETANSIQPPESNCSSQEECGQLCNEKECFSCNS 1752

Query: 1415 VCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRR 1474
              E N+ IVRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR  I NL R
Sbjct: 1753 FREANSQIVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLNPISVPRSWIWNLNR 1812

Query: 1475 EIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1524
              V FG+S TT+F  L  ++ Q+ FWRG++CVRGFDR  R PRPL+ RLH
Sbjct: 1813 RTVYFGTSVTTIFKGLTTQETQQCFWRGYICVRGFDREARAPRPLMARLH 1862




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449456184|ref|XP_004145830.1| PREDICTED: uncharacterized protein LOC101216331 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449530536|ref|XP_004172250.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356546049|ref|XP_003541444.1| PREDICTED: protein ROS1-like [Glycine max] Back     alignment and taxonomy information
>gi|225449724|ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|330370553|gb|AEC12446.1| repressor of gene silencing 1 [Gossypium hirsutum] Back     alignment and taxonomy information
>gi|449512809|ref|XP_004164145.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449465553|ref|XP_004150492.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|296090400|emb|CBI40219.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|138996985|dbj|BAF52855.1| repressor of silencing 1 [Nicotiana tabacum] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1541
TAIR|locus:21844321987 DME "DEMETER" [Arabidopsis tha 0.356 0.276 0.546 1.6e-206
TAIR|locus:20449231393 DML1 "demeter-like 1" [Arabido 0.485 0.536 0.447 2.8e-193
TAIR|locus:21001381332 DML2 "demeter-like 2" [Arabido 0.419 0.484 0.461 4.2e-184
TAIR|locus:21243011044 DML3 "demeter-like protein 3" 0.362 0.534 0.473 3.3e-158
TAIR|locus:2100382293 AT3G47830 [Arabidopsis thalian 0.076 0.402 0.389 3.8e-12
TIGR_CMR|CHY_1121210 CHY_1121 "endonuclease III" [C 0.069 0.509 0.281 7.1e-05
TAIR|locus:2184432 DME "DEMETER" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1525 (541.9 bits), Expect = 1.6e-206, Sum P(4) = 1.6e-206
 Identities = 307/562 (54%), Positives = 376/562 (66%)

Query:   977 EYKSDMKND-RTKKTTPKKNSDNTVQQDWDLLRRIYSTGE---ERSHDKMDSVDWEAVRC 1032
             EYK       R  K T       T Q  WD LR+     E   ER+ + MDS+D+EA+R 
Sbjct:  1424 EYKETNATILREMKGTLADGKKPTSQ--WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRR 1481

Query:  1033 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1092
             A  SEI++AIKERG  N++A RIK+FL R+V+ HG IDLEWLR  PPDK K+YLL I GL
Sbjct:  1482 ASISEISEAIKERGMNNMLAVRIKDFLERIVKDHGGIDLEWLRESPPDKAKDYLLSIRGL 1541

Query:  1093 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1152
             GLKSVECVRLL+L ++AFPVD NVGRIAVR+GWVPLQPLP  L +HLL+ YPV++ IQ +
Sbjct:  1542 GLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGWVPLQPLPESLQLHLLELYPVLESIQKF 1601

Query:  1153 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPG 1212
             LWPRLC LDQ+TLYELHYQ+ITFGK+FC K  PNC ACP+RGEC+HFASA ASAR ALP 
Sbjct:  1602 LWPRLCKLDQRTLYELHYQLITFGKVFCTKSRPNCNACPMRGECRHFASAYASARLALPA 1661

Query:  1213 PSEKGIVTSEFGNGIGQSPPLVVNPIPV-IRIEADPVSESGYQINNCEPIIEEPRSPGPQ 1271
             P E+ + ++         PP+ +  I + + +E    S +     NCEPIIEEP SPG +
Sbjct:  1662 PEERSLTSATIPVPPESYPPVAIPMIELPLPLEKSLASGAPSNRENCEPIIEEPASPGQE 1721

Query:  1272 CXXXXXXXXXXXXXXXXXXXXXPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALT 1328
             C                     PT++L   +F       ME N + LQ+   S ALVAL 
Sbjct:  1722 CTEITESDIEDAYYNEDPDEI-PTIKLNIEQFGMTLREHMERN-MELQEGDMSKALVALH 1779

Query:  1329 AEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNS 1388
                 S+PT KLK  + LRTEH VY LPD+H LL G D+R+PDDPSPYLLAIW+PGE+ NS
Sbjct:  1780 PTTTSIPTPKLKNISRLRTEHQVYELPDSHRLLDGMDKREPDDPSPYLLAIWTPGETANS 1839

Query:  1389 VGSPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQ 1447
                P+ +C  K S K+C  E C  CNS+ E N+  VRGT+LIPCRTA RG FPLNGTYFQ
Sbjct:  1840 AQPPEQKCGGKASGKMCFDETCSECNSLREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQ 1899

Query:  1448 VNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVR 1507
             VNE+FADHE+S  PI+VPR  I +L R  V FG+S T++F  L   QIQ  FW+GFVCVR
Sbjct:  1900 VNELFADHESSLKPIDVPRDWIWDLPRRTVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVR 1959

Query:  1508 GFDRRYRCPRPLVNRLHCSPSK 1529
             GF+++ R PRPL+ RLH   SK
Sbjct:  1960 GFEQKTRAPRPLMARLHFPASK 1981


GO:0003824 "catalytic activity" evidence=IEA
GO:0004519 "endonuclease activity" evidence=IEA
GO:0005634 "nucleus" evidence=ISM;ISS
GO:0006281 "DNA repair" evidence=IEA
GO:0006284 "base-excision repair" evidence=IEA
GO:0051539 "4 iron, 4 sulfur cluster binding" evidence=IEA
GO:0009793 "embryo development ending in seed dormancy" evidence=IMP
GO:0019104 "DNA N-glycosylase activity" evidence=ISS;IDA
GO:0043078 "polar nucleus" evidence=IDA
GO:0006349 "regulation of gene expression by genetic imprinting" evidence=IMP
GO:0003906 "DNA-(apurinic or apyrimidinic site) lyase activity" evidence=IDA
GO:0006306 "DNA methylation" evidence=RCA;IDA
TAIR|locus:2044923 DML1 "demeter-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2100138 DML2 "demeter-like 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124301 DML3 "demeter-like protein 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2100382 AT3G47830 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TIGR_CMR|CHY_1121 CHY_1121 "endonuclease III" [Carboxydothermus hydrogenoformans Z-2901 (taxid:246194)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.2.2LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1541
COG0177211 COG0177, Nth, Predicted EndoIII-related endonuclea 2e-22
smart00478149 smart00478, ENDO3c, endonuclease III 8e-14
cd00056158 cd00056, ENDO3c, endonuclease III; includes endonu 3e-13
pfam00730144 pfam00730, HhH-GPD, HhH-GPD superfamily base excis 1e-12
COG2231215 COG2231, COG2231, Uncharacterized protein related 1e-10
TIGR01083191 TIGR01083, nth, endonuclease III 4e-06
COG0122285 COG0122, AlkA, 3-methyladenine DNA glycosylase/8-o 0.002
TIGR00588310 TIGR00588, ogg, 8-oxoguanine DNA-glycosylase (ogg) 0.003
smart0052521 smart00525, FES, iron-sulpphur binding domain in D 0.003
>gnl|CDD|223255 COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Back     alignment and domain information
 Score = 96.9 bits (242), Expect = 2e-22
 Identities = 49/172 (28%), Positives = 73/172 (42%), Gaps = 27/172 (15%)

Query: 1026 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 1085
              E +  ADE E+ + IK  G     A  IKE    L+E  G           PD  +E 
Sbjct: 60   TPEDLLNADEEELEELIKSIGLYRNKAKNIKELARILLEKFGG--------EVPDTREE- 110

Query: 1086 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1145
            LL + G+G K+   V   +    A  VD +V R++ RLG VP              + P 
Sbjct: 111  LLSLPGVGRKTANVVLSFAFGIPAIAVDTHVHRVSNRLGLVP-------------GKTP- 156

Query: 1146 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 1197
             ++++  L   +    ++   +LH+ +I  G+  CK R P C  CPL   C 
Sbjct: 157  -EEVEEALMKLI---PKELWTDLHHWLILHGRYICKARKPRCEECPLADLCP 204


Length = 211

>gnl|CDD|214684 smart00478, ENDO3c, endonuclease III Back     alignment and domain information
>gnl|CDD|238013 cd00056, ENDO3c, endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Back     alignment and domain information
>gnl|CDD|216088 pfam00730, HhH-GPD, HhH-GPD superfamily base excision DNA repair protein Back     alignment and domain information
>gnl|CDD|225140 COG2231, COG2231, Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|130155 TIGR01083, nth, endonuclease III Back     alignment and domain information
>gnl|CDD|223200 COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|211589 TIGR00588, ogg, 8-oxoguanine DNA-glycosylase (ogg) Back     alignment and domain information
>gnl|CDD|197771 smart00525, FES, iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 1541
PF15628103 RRM_DME: RRM in Demeter 100.0
COG0177211 Nth Predicted EndoIII-related endonuclease [DNA re 100.0
PRK10702211 endonuclease III; Provisional 100.0
TIGR01084275 mutY A/G-specific adenine glycosylase. This equiva 100.0
PRK10880350 adenine DNA glycosylase; Provisional 100.0
TIGR01083191 nth endonuclease III. This equivalog model identif 99.98
PRK13910289 DNA glycosylase MutY; Provisional 99.97
COG2231215 Uncharacterized protein related to Endonuclease II 99.97
PRK13913218 3-methyladenine DNA glycosylase; Provisional 99.97
KOG1921286 consensus Endonuclease III [Replication, recombina 99.96
COG1194342 MutY A/G-specific DNA glycosylase [DNA replication 99.95
smart00478149 ENDO3c endonuclease III. includes endonuclease III 99.92
cd00056158 ENDO3c endonuclease III; includes endonuclease III 99.91
PRK01229208 N-glycosylase/DNA lyase; Provisional 99.86
KOG2457555 consensus A/G-specific adenine DNA glycosylase [Re 99.81
TIGR00588310 ogg 8-oxoguanine DNA-glycosylase (ogg). All protei 99.8
COG0122285 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine 99.74
TIGR03252177 uncharacterized HhH-GPD family protein. This model 99.71
PRK10308283 3-methyl-adenine DNA glycosylase II; Provisional 99.65
PF00730108 HhH-GPD: HhH-GPD superfamily base excision DNA rep 99.54
KOG2875323 consensus 8-oxoguanine DNA glycosylase [Replicatio 99.54
KOG1918254 consensus 3-methyladenine DNA glycosidase [Replica 99.47
COG1059210 Thermostable 8-oxoguanine DNA glycosylase [DNA rep 98.79
PF1562932 Perm-CXXC: Permuted single zf-CXXC unit 98.76
PF0063330 HHH: Helix-hairpin-helix motif; InterPro: IPR00044 96.8
PF1057617 EndIII_4Fe-2S: Iron-sulfur binding domain of endon 95.61
smart0052526 FES FES domain. iron-sulpphur binding domain in DN 94.59
PF09674232 DUF2400: Protein of unknown function (DUF2400); In 94.26
TIGR02757229 conserved hypothetical protein TIGR02757. Members 92.54
PF1282664 HHH_2: Helix-hairpin-helix motif; PDB: 1X2I_B 1DGS 85.42
smart0027826 HhH1 Helix-hairpin-helix DNA-binding motif class 1 84.8
PRK10353187 3-methyl-adenine DNA glycosylase I; Provisional 84.31
PF03352179 Adenine_glyco: Methyladenine glycosylase; InterPro 83.97
>PF15628 RRM_DME: RRM in Demeter Back     alignment and domain information
Probab=100.00  E-value=3.7e-67  Score=497.54  Aligned_cols=103  Identities=67%  Similarity=1.197  Sum_probs=102.1

Q ss_pred             eeEEeecccccCCCCCCCccceeeeeeeeccCCCCCCcccchhhhhcccceEEEecCChhhhhcccCHHHHHHHHhccce
Q 000414         1425 GTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFV 1504 (1541)
Q Consensus      1425 GTiLIPCRTAMRGsFPLNGTYFQvNEVFADh~SS~nPI~VpR~~IwnL~Rr~VYFGTSV~SIfkGlt~eeIq~cFwkG~V 1504 (1541)
                      |||||||||||||+|||||||||||||||||+||+|||+|||+|||+|+||+|||||||+|||||||++|||+|||+|||
T Consensus         1 gtiLiPCrtAmrg~FPLnGTYFQ~nEVFaD~~Ss~~Pi~vpr~~iw~L~rr~vYfGtsv~SIfrGls~~eIq~cF~~G~V   80 (103)
T PF15628_consen    1 GTILIPCRTAMRGSFPLNGTYFQVNEVFADHESSRNPIDVPRELIWNLPRRIVYFGTSVSSIFRGLSREEIQQCFWKGFV   80 (103)
T ss_pred             CceEeeehhhhCCccccCceEEEEEEEEeccccccCCeeccHHHhhcCCceEEEecCcHHHHhcccCHHHHHHHHhcCcE
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EeeccccccCCCcccccccccCC
Q 000414         1505 CVRGFDRRYRCPRPLVNRLHCSP 1527 (1541)
Q Consensus      1505 CVRgFDr~TR~PrPL~~rlH~~~ 1527 (1541)
                      |||||||+||+||||++|||+++
T Consensus        81 CVR~FDr~Tr~PrpL~~rLH~~~  103 (103)
T PF15628_consen   81 CVRGFDRKTRAPRPLCARLHFPA  103 (103)
T ss_pred             EEeecccccCCCcchhhhccCCC
Confidence            99999999999999999999985



>COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Back     alignment and domain information
>PRK10702 endonuclease III; Provisional Back     alignment and domain information
>TIGR01084 mutY A/G-specific adenine glycosylase Back     alignment and domain information
>PRK10880 adenine DNA glycosylase; Provisional Back     alignment and domain information
>TIGR01083 nth endonuclease III Back     alignment and domain information
>PRK13910 DNA glycosylase MutY; Provisional Back     alignment and domain information
>COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair] Back     alignment and domain information
>PRK13913 3-methyladenine DNA glycosylase; Provisional Back     alignment and domain information
>KOG1921 consensus Endonuclease III [Replication, recombination and repair] Back     alignment and domain information
>COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information
>smart00478 ENDO3c endonuclease III Back     alignment and domain information
>cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Back     alignment and domain information
>PRK01229 N-glycosylase/DNA lyase; Provisional Back     alignment and domain information
>KOG2457 consensus A/G-specific adenine DNA glycosylase [Replication, recombination and repair] Back     alignment and domain information
>TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg) Back     alignment and domain information
>COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information
>TIGR03252 uncharacterized HhH-GPD family protein Back     alignment and domain information
>PRK10308 3-methyl-adenine DNA glycosylase II; Provisional Back     alignment and domain information
>PF00730 HhH-GPD: HhH-GPD superfamily base excision DNA repair protein This entry corresponds to Endonuclease III This entry corresponds to Alkylbase DNA glycosidase; InterPro: IPR003265 Endonuclease III (4 Back     alignment and domain information
>KOG2875 consensus 8-oxoguanine DNA glycosylase [Replication, recombination and repair] Back     alignment and domain information
>KOG1918 consensus 3-methyladenine DNA glycosidase [Replication, recombination and repair] Back     alignment and domain information
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Back     alignment and domain information
>PF15629 Perm-CXXC: Permuted single zf-CXXC unit Back     alignment and domain information
>PF00633 HHH: Helix-hairpin-helix motif; InterPro: IPR000445 The HhH motif is an around 20 amino acids domain present in prokaryotic and eukaryotic non-sequence-specific DNA binding proteins [, , ] Back     alignment and domain information
>PF10576 EndIII_4Fe-2S: Iron-sulfur binding domain of endonuclease III; InterPro: IPR003651 Endonuclease III (4 Back     alignment and domain information
>smart00525 FES FES domain Back     alignment and domain information
>PF09674 DUF2400: Protein of unknown function (DUF2400); InterPro: IPR014127 Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides Back     alignment and domain information
>TIGR02757 conserved hypothetical protein TIGR02757 Back     alignment and domain information
>PF12826 HHH_2: Helix-hairpin-helix motif; PDB: 1X2I_B 1DGS_A 1V9P_B Back     alignment and domain information
>smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1 Back     alignment and domain information
>PRK10353 3-methyl-adenine DNA glycosylase I; Provisional Back     alignment and domain information
>PF03352 Adenine_glyco: Methyladenine glycosylase; InterPro: IPR005019 This family of methyladenine glycosylases includes DNA-3-methyladenine glycosylase I (3 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1541
1orn_A226 Structure Of A Trapped Endonuclease Iii-Dna Covalen 3e-04
1p59_A226 Structure Of A Non-Covalent Endonuclease Iii-Dna Co 4e-04
>pdb|1ORN|A Chain A, Structure Of A Trapped Endonuclease Iii-Dna Covalent Intermediate: Estranged-Guanine Complex Length = 226 Back     alignment and structure

Iteration: 1

Score = 45.4 bits (106), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 19/121 (15%) Query: 1077 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 1136 VP D+ + L+ + G+G K+ V ++ A VD +V R++ RLG+ ++ Sbjct: 108 VPRDR--DELMKLPGVGRKTANVVVSVAFGVPAIAVDTHVERVSKRLGFCRWDDSVLEVE 165 Query: 1137 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 1196 L+K P ++ H++MI FG+ CK +SP C +CPL C Sbjct: 166 KTLMKIIP-----------------KEEWSITHHRMIFFGRYHCKAQSPQCPSCPLLHLC 208 Query: 1197 K 1197 + Sbjct: 209 R 209
>pdb|1P59|A Chain A, Structure Of A Non-Covalent Endonuclease Iii-Dna Complex Length = 226 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1541
1pu6_A218 3-methyladenine DNA glycosylase; helix-hairpin-hel 2e-23
1orn_A226 Endonuclease III; DNA repair, DNA glycosylase, [4F 2e-11
3fhg_A207 Mjogg, N-glycosylase/DNA lyase, DNA-(apurinic; hel 6e-11
3i0w_A290 8-oxoguanine-DNA-glycosylase; OGG, cacogg, DNA, 8- 6e-10
2abk_A211 Endonuclease III; DNA-repair, DNA glycosylase; 1.8 6e-10
2xhi_A360 N-glycosylase/DNA lyase; lyase-DNA complex, lyase/ 2e-09
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-09
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 7e-08
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-07
3n0u_A219 Probable N-glycosylase/DNA lyase; structural genom 1e-05
2jhn_A295 ALKA, 3-methyladenine DNA-glycosylase; DNA repair, 1e-04
>1pu6_A 3-methyladenine DNA glycosylase; helix-hairpin-helix, base excision repair, hydrolase; HET: KCX; 1.64A {Helicobacter pylori} SCOP: a.96.1.5 PDB: 1pu7_A* 1pu8_A* Length = 218 Back     alignment and structure
 Score = 99.4 bits (248), Expect = 2e-23
 Identities = 30/189 (15%), Positives = 69/189 (36%), Gaps = 16/189 (8%)

Query: 996  SDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRI 1055
                V +  + L+  +    +      D ++ + +   + S++A+ ++  G  N  A R+
Sbjct: 44   KFEAVLKSLENLKNAFILEND------DEINLKKIAYIEFSKLAECVRPSGFYNQKAKRL 97

Query: 1056 KEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 1115
             +    +++   S       N   +  +E+LLD +G+G +S + +   +       VD  
Sbjct: 98   IDLSGNILKDFQS-----FENFKQEVTREWLLDQKGIGKESADAILCYACAKEVMVVDKY 152

Query: 1116 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY-ELHYQMIT 1174
                  +LG         +L     K     + +   L      +    LY   H +++ 
Sbjct: 153  SYLFLKKLGIEI--EDYDELQHFFEKGVQ--ENLNSALALYENTISLAQLYARFHGKIVE 208

Query: 1175 FGKIFCKKR 1183
            F K   + +
Sbjct: 209  FSKQKLELK 217


>1orn_A Endonuclease III; DNA repair, DNA glycosylase, [4Fe-4S] cluster, iron-sulfur cluster, hydrolase/DNA complex; HET: PED; 1.70A {Geobacillus stearothermophilus} SCOP: a.96.1.1 PDB: 1orp_A* 1p59_A* Length = 226 Back     alignment and structure
>3fhg_A Mjogg, N-glycosylase/DNA lyase, DNA-(apurinic; helix-hairpin-helix, 8-oxoguanine, 8-OXOG, DNA damage, DNA repair, glycosidase, hydrolase; 1.90A {Sulfolobus solfataricus} Length = 207 Back     alignment and structure
>3i0w_A 8-oxoguanine-DNA-glycosylase; OGG, cacogg, DNA, 8-OXOG, 8OXOG, glycosylase, cytosine, hydrolase,lyase/DNA complex; HET: 8OG; 1.73A {Clostridium acetobutylicum} PDB: 3i0x_A* 3f10_A* 3f0z_A Length = 290 Back     alignment and structure
>2abk_A Endonuclease III; DNA-repair, DNA glycosylase; 1.85A {Escherichia coli} SCOP: a.96.1.1 Length = 211 Back     alignment and structure
>2xhi_A N-glycosylase/DNA lyase; lyase-DNA complex, lyase/DNA complex, separation-OF-function helix-hairpin-helix, DNA repair; HET: 8OG; 1.55A {Homo sapiens} PDB: 1ko9_A 1lwy_A* 1hu0_A* 1lwv_A* 1lww_A* 2noe_A* 2noh_A* 2nol_A* 1n3c_A* 1fn7_A* 2noz_A* 1yqk_A 1yqr_A* 1yql_A* 1yqm_A* 2noi_A 1ebm_A* 1m3q_A* 1m3h_A* 1n39_A* ... Length = 360 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>3n0u_A Probable N-glycosylase/DNA lyase; structural genomics, ISFI, DNA repair, 8-oxoguanine, base EX repair, PSI-2, protein structure initiative; 1.50A {Thermotoga maritima} Length = 219 Back     alignment and structure
>2jhn_A ALKA, 3-methyladenine DNA-glycosylase; DNA repair, N1-methyladenine, N3-methylcytosine, hyperthermophiles, hydrolase; HET: MBO MES; 1.8A {Archaeoglobus fulgidus} PDB: 2jhj_A Length = 295 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 1541
d1pu6a_217 a.96.1.5 (A:) 3-Methyladenine DNA glycosylase III 4e-20
d1kg2a_224 a.96.1.2 (A:) Catalytic domain of MutY {Escherichi 2e-10
d1orna_214 a.96.1.1 (A:) Endonuclease III {Escherichia coli [ 3e-10
d2abka_211 a.96.1.1 (A:) Endonuclease III {Escherichia coli [ 1e-09
d1keaa_217 a.96.1.2 (A:) Thymine-DNA glycosylase {Archaeon Me 6e-09
d1rrqa1221 a.96.1.2 (A:9-229) Catalytic domain of MutY {Bacil 1e-08
d2noha1190 a.96.1.3 (A:136-325) 8-oxoguanine glycosylase {Hum 4e-07
>d1pu6a_ a.96.1.5 (A:) 3-Methyladenine DNA glycosylase III (MagIII) {Helicobacter pylori [TaxId: 210]} Length = 217 Back     information, alignment and structure

class: All alpha proteins
fold: DNA-glycosylase
superfamily: DNA-glycosylase
family: 3-Methyladenine DNA glycosylase III (MagIII)
domain: 3-Methyladenine DNA glycosylase III (MagIII)
species: Helicobacter pylori [TaxId: 210]
 Score = 88.3 bits (218), Expect = 4e-20
 Identities = 29/189 (15%), Positives = 69/189 (36%), Gaps = 16/189 (8%)

Query: 996  SDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRI 1055
                V +  + L+  +    +      D ++ + +   + S++A+ ++  G  N  A R+
Sbjct: 43   KFEAVLKSLENLKNAFILEND------DEINLKKIAYIEFSKLAECVRPSGFYNQKAKRL 96

Query: 1056 KEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 1115
             +    +++       +   N   +  +E+LLD +G+G +S + +   +       VD  
Sbjct: 97   IDLSGNILKDF-----QSFENFKQEVTREWLLDQKGIGKESADAILCYACAKEVMVVDKY 151

Query: 1116 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY-ELHYQMIT 1174
                  +LG         +L     K     + +   L      +    LY   H +++ 
Sbjct: 152  SYLFLKKLGIEI--EDYDELQHFFEKGVQ--ENLNSALALYENTISLAQLYARFHGKIVE 207

Query: 1175 FGKIFCKKR 1183
            F K   + +
Sbjct: 208  FSKQKLELK 216


>d1kg2a_ a.96.1.2 (A:) Catalytic domain of MutY {Escherichia coli [TaxId: 562]} Length = 224 Back     information, alignment and structure
>d1orna_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Length = 214 Back     information, alignment and structure
>d2abka_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Length = 211 Back     information, alignment and structure
>d1keaa_ a.96.1.2 (A:) Thymine-DNA glycosylase {Archaeon Methanobacterium thermoformicicum [TaxId: 145262]} Length = 217 Back     information, alignment and structure
>d1rrqa1 a.96.1.2 (A:9-229) Catalytic domain of MutY {Bacillus stearothermophilus [TaxId: 1422]} Length = 221 Back     information, alignment and structure
>d2noha1 a.96.1.3 (A:136-325) 8-oxoguanine glycosylase {Human (Homo sapiens) [TaxId: 9606]} Length = 190 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query1541
d2abka_211 Endonuclease III {Escherichia coli [TaxId: 562]} 100.0
d1orna_214 Endonuclease III {Escherichia coli [TaxId: 562]} 100.0
d1keaa_217 Thymine-DNA glycosylase {Archaeon Methanobacterium 100.0
d1kg2a_224 Catalytic domain of MutY {Escherichia coli [TaxId: 100.0
d1rrqa1221 Catalytic domain of MutY {Bacillus stearothermophi 100.0
d1pu6a_217 3-Methyladenine DNA glycosylase III (MagIII) {Heli 100.0
d2noha1190 8-oxoguanine glycosylase {Human (Homo sapiens) [Ta 99.87
d1mpga1183 3-Methyladenine DNA glycosylase II (gene alkA or a 99.82
d1ngna_144 Mismatch-specific thymine glycosylase domain of th 99.73
d1nkua_187 3-Methyladenine DNA glycosylase I (Tag) {Escherich 83.01
>d2abka_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA-glycosylase
superfamily: DNA-glycosylase
family: Endonuclease III
domain: Endonuclease III
species: Escherichia coli [TaxId: 562]
Probab=100.00  E-value=1.6e-40  Score=294.30  Aligned_cols=182  Identities=21%  Similarity=0.208  Sum_probs=168.5

Q ss_pred             CCCCCCCCCEECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHC
Q ss_conf             76665111110001269999999997230026888998889998199999999999609948999999999999999809
Q 000414          988 KKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG 1067 (1541)
Q Consensus       988 K~kk~~VgAILsQQTsWesVrKaiaRLreRg~~~adfPTpEALAkAs~EELeelIR~lGf~nrKAKrIkeLAr~IvedyG 1067 (1541)
                      .+....+++||+|||.|+++.+++.+|.      ..+++|++++.++.++|+++|+++||+++||++|+++++.|.++||
T Consensus        28 ~p~~vli~~iLsqqT~~~~v~~~~~~l~------~~~~t~~~la~a~~~~l~~~i~~~G~y~~Ka~~l~~~a~~i~~~~~  101 (211)
T d2abka_          28 SPFELLIAVLLSAQATDVSVNKATAKLY------PVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQHN  101 (211)
T ss_dssp             SHHHHHHHHHHTTTSCHHHHHHHHHHHT------TTCCSHHHHHHHHHHHHHHHHTTSTTHHHHHHHHHHHHHHHHHHTT
T ss_pred             CHHHHHHHHHHHCCCHHHHHHHHHHHHH------HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf             8799999999956263999999999888------7414727887655767677777754411578899999888998714


Q ss_pred             CCCHHHHHCCCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEECCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCHH
Q ss_conf             98923430699258999976187776888999999933981221204898999661888999995899999994289625
Q 000414         1068 SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 1147 (1541)
Q Consensus      1068 gIDLE~LrkIPdDeAREeLLSLKGIGpKTADcILLFALgRpaFPVDTHV~RIL~RLGLVp~kpspEEIEk~LeeLlPile 1147 (1541)
                      |       .+|.+  +++|++|||||+|||++||+|+|++++|+||+||.||+.|+||++. .++.+++..+.+++|   
T Consensus       102 g-------~~p~~--~~~L~~LpGVG~~TA~~Il~~a~~~p~~~vD~nv~Rv~~R~g~~~~-~~~~~~~~~~~~~~p---  168 (211)
T d2abka_         102 G-------EVPED--RAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQFAPG-KNVEQVEEKLLKVVP---  168 (211)
T ss_dssp             T-------SCCSC--HHHHHHSTTCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCSSCC-SSHHHHHHHHHHHSC---
T ss_pred             C-------CHHHH--HHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCC-CHHHHHHHHHHHCCC---
T ss_conf             8-------60477--9999999860517899999998568032256799999997066754-204678888860486---


Q ss_pred             HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHH
Q ss_conf             6765320000048946399999999999487433778888889981011557732
Q 000414         1148 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 1202 (1541)
Q Consensus      1148 ~IqK~LW~RLc~lD~etl~ELH~lLIeFGKeICtarKPnCdaCPLRs~CkyYasa 1202 (1541)
                         ...|.           ++|.+||+||+.+|++++|+|+.|||++.|+||.+.
T Consensus       169 ---~~~~~-----------~~~~~l~~~G~~vC~~~~P~C~~Cpl~~~C~~~~k~  209 (211)
T d2abka_         169 ---AEFKV-----------DCHHWLILHGRYTCIARKPRCGSCIIEDLCEYKEKV  209 (211)
T ss_dssp             ---GGGTT-----------THHHHHHHHHHHTSCSSSCCGGGCTTGGGCCCTTCS
T ss_pred             ---CCCHH-----------HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
T ss_conf             ---03489-----------999999998555525999899999790558986767



>d1orna_ a.96.1.1 (A:) Endonuclease III {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1keaa_ a.96.1.2 (A:) Thymine-DNA glycosylase {Archaeon Methanobacterium thermoformicicum [TaxId: 145262]} Back     information, alignment and structure
>d1kg2a_ a.96.1.2 (A:) Catalytic domain of MutY {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1rrqa1 a.96.1.2 (A:9-229) Catalytic domain of MutY {Bacillus stearothermophilus [TaxId: 1422]} Back     information, alignment and structure
>d1pu6a_ a.96.1.5 (A:) 3-Methyladenine DNA glycosylase III (MagIII) {Helicobacter pylori [TaxId: 210]} Back     information, alignment and structure
>d2noha1 a.96.1.3 (A:136-325) 8-oxoguanine glycosylase {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1mpga1 a.96.1.3 (A:100-282) 3-Methyladenine DNA glycosylase II (gene alkA or aidA) {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1ngna_ a.96.1.2 (A:) Mismatch-specific thymine glycosylase domain of the methyl-GpG binding protein mbd4 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1nkua_ a.96.1.4 (A:) 3-Methyladenine DNA glycosylase I (Tag) {Escherichia coli [TaxId: 562]} Back     information, alignment and structure