Citrus Sinensis ID: 003229


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------
MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT
ccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccHHHHHHHHHccccccccccccccccccccccccccccccHHHHHHHHHHHHHccccHHHHHHHHHHcccccccccccccHHHHHHHHHccccccEEEcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
ccccccHHHHEcccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEccccccccccccEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccEEccccccccccccHHHHHHHHHHHHHHHHcccccccccHHHccccccccccccccccccHHHHHHHHHcccccccccccccccccccccEEcccccccHHHHHHHcccHHccccccccHHHHHHHcccccHHHccHHHHHHHHHHHHcccHHHHHHHHcccccccccccccHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHEEEHHHHHHHHHccHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
mepltaaqdvsivpdhkidkfeeygyagnnvkqddrslesktgtdnalssSSEAIEVasdnkidsenetpstgdvshsssginsindvakqddlqresasddmsvapdtaltspklpepevvsgtenasplegsdsildanlpesaseitgenpidvepssfsnptdlgndgskfsrifsdsssissshapiePLAAVIsvssdttvepqilpkgdtetvaspstiknveqsekpllsgedssssmevhdlnkngssgtsvspsifpfsneketcdlnesnsssftespptgsssspagipapSVVSAALQvlpgkvlvpAVVDQVQGQALSALQVLKVIEAdvkpgdlcirREYARWLVSASStltrstmskvypamyienvtdlafdditpedpdfssiqgLAEAGLISsklshrdllneepgpifflpesplsrqDLVSWKMALEKRQLPEANKKILYQLSgfididkinpdawPALLADLTAGEQGIIALAFGctrlfqpdkpvtNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRwerqgikvvvdkdlreeSDAAVMWVNagkqfsvdqTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT
mepltaaqdvsivpdhkIDKFEEYGYAgnnvkqddrslesktgtdnalsssseAIEVAsdnkidsenetpstgdvshsssginSINDVAKQDDLQRESASDDMSVAPDTaltspklpepevVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSsdttvepqilpkgdtetvaspstiknveqsekPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWlvsasstltrstmskvYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESfekelsmerekidvVEKMAEEARQELerlraerevdkIALMKERAAIESEMEILSKLRREVEEQLeslmsnkveisyeKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKAlegardrwerqgikvvvdkdlrEESDAAVMWVNagkqfsvdqtvSRAQSLVDKLKAMandvsgkskEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLtegakrvagdcregvekltqrfkt
MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGskfsrifsdsssisssHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNsssftespptgsssspagipapsvvsaaLQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIeaesaaenavsehsaLVAEVEKEINESFEKELSMEREKIDVVEKMaeearqelerlraereVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT
*******************KFEEYGY**************************************************************************************************************************************************************************************************************************************************************************************VVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITP*******IQGLAEAGLI**************GPIFFLPE**LSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIG*********************************************************************************************************************************************************************************WERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKA*********************************************************
*******************************************************************************************************************************************************************************************************************L***************************************************************************************************VLPGKVLVPAVVDQVQGQALSALQVLKV****VKPGDLCIRREYARWLVSASSTLTR**************VTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH*****EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAA****************************************************************************************************************************************************************************************************************************************************************************************************************************L*Q****
********DVSIVPDHKIDKFEEYGYAGNNVKQ***********************************************GINSINDVAKQD**************PDTALTSPKLP*************LEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFS*********APIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIK*********************************VSPSIFPFSNEKET*************************IPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIE*************ALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA***********************RWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT
******************DKFEEYGY************************************************************************************************************************************************************************************************************************************************************************************************GKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISS************GPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEIxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxKVExxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxLSMAxxxxxxxxxxxxxxxxxxxxxxxxxxxxGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query837
255545258 976 conserved hypothetical protein [Ricinus 0.970 0.831 0.585 0.0
225464485 985 PREDICTED: uncharacterized protein LOC10 0.704 0.598 0.717 0.0
302143846 1040 unnamed protein product [Vitis vinifera] 0.704 0.567 0.717 0.0
224134861592 predicted protein [Populus trichocarpa] 0.664 0.939 0.719 0.0
356529194 975 PREDICTED: uncharacterized protein LOC10 0.988 0.848 0.552 0.0
356561542 1002 PREDICTED: uncharacterized protein LOC10 0.990 0.827 0.552 0.0
224122346793 predicted protein [Populus trichocarpa] 0.667 0.704 0.708 0.0
449446025722 PREDICTED: uncharacterized protein LOC10 0.643 0.746 0.732 0.0
222424656805 AT5G23890 [Arabidopsis thaliana] 0.848 0.881 0.557 0.0
15237846 946 uncharacterized protein [Arabidopsis tha 0.955 0.845 0.511 0.0
>gi|255545258|ref|XP_002513690.1| conserved hypothetical protein [Ricinus communis] gi|223547598|gb|EEF49093.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  886 bits (2289), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 508/868 (58%), Positives = 626/868 (72%), Gaps = 56/868 (6%)

Query: 1   MEPLTAAQDVSIVPDHKIDKFEEYGYA--GNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
           MEPLT  Q+VS+V D + DK E+   A    N+K++  SLE KT TD  L SS +  E  
Sbjct: 134 MEPLTVQQEVSLVSDDEEDKIEKNTSAESSANLKEEYISLEHKTNTDVDLPSSPQIEETH 193

Query: 59  SDNKIDSENETPSTGD-VSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLP 117
           ++NK+  + +   + D  ++  S  +++++   Q+DLQ +SA D      +T   S  LP
Sbjct: 194 NENKLSGDTDQLLSADNGNYIISSNDTVDNAPVQEDLQYDSAFDSKLGVLETTPNSTNLP 253

Query: 118 EPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSR 177
           E ++                +D NL      + GE    +   +         + +  S 
Sbjct: 254 ESKIAK--------------IDKNL------VNGEPAYSLNIINTITEHTEAKENTIPSS 293

Query: 178 IFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLL 237
             S S  + SS   +  ++  I+++SDT  E   L K   ++ AS  T + +  S   + 
Sbjct: 294 DSSISPVLKSSEPVV--VSTSITLTSDTVSEVGNLFKDGMDSEASVPTKEELNTSTNQV- 350

Query: 238 SGEDSSSSMEVHDLNKNGSSGTSVSPS-IFPFSNEKETC---DLNESNSSSFTESPPTGS 293
           S + +SSS+E++ L ++GSSG +      +PF+N+++     D+N S +SS  ESPP   
Sbjct: 351 STDRNSSSLEMNYLTESGSSGVTSVSEWAYPFANKQDIVANDDMNLSKTSS--ESPPFSG 408

Query: 294 SSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRR 353
           S S AG+PAPS V  +LQV PGK+LVPAVVDQ  GQAL+ALQVLKVIEADV+P DLC RR
Sbjct: 409 SFSSAGVPAPSAVPESLQVSPGKILVPAVVDQTHGQALAALQVLKVIEADVQPSDLCTRR 468

Query: 354 EYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSK 413
           EYARWLV+ASS L+RST+SKVYPAMYIEN T+ AFDDITP+DPDFSSIQGLAEAGLISS+
Sbjct: 469 EYARWLVAASSALSRSTLSKVYPAMYIENATEPAFDDITPDDPDFSSIQGLAEAGLISSR 528

Query: 414 LSHRDLLN--EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDK 471
           LS+ DLL+  E+ GP+ F PESPLSRQDLVSWKMALEKRQLPEAN+KILYQLSGF D+DK
Sbjct: 529 LSNHDLLSPVEDQGPLNFSPESPLSRQDLVSWKMALEKRQLPEANRKILYQLSGFRDVDK 588

Query: 472 INPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQ 531
           I+PDAWPAL+ADL+AG+QGII+LAFGCTRLFQP+KPVT AQAAVALAIGEASD VNEEL 
Sbjct: 589 IHPDAWPALIADLSAGDQGIISLAFGCTRLFQPNKPVTKAQAAVALAIGEASDIVNEELA 648

Query: 532 RIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRA 591
           RIEAES AENAVS H+ALVA+VE++IN SFEKEL MEREKI+ VEKMAEEAR ELERLRA
Sbjct: 649 RIEAESMAENAVSAHNALVAQVEQDINASFEKELLMEREKINAVEKMAEEARLELERLRA 708

Query: 592 EREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAEN 651
           ERE D  ALMKERA+IE+EME+LS+L+ EVEEQL++L+S+KVEISYEKERIN L+KEAEN
Sbjct: 709 EREADNFALMKERASIEAEMEVLSRLKGEVEEQLQTLLSSKVEISYEKERINKLQKEAEN 768

Query: 652 ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDL 711
           E QEI+RLQYELEVERKALS+ARAWAEDEAKRARE AK +E ARDRWERQGIKVVVD DL
Sbjct: 769 EKQEISRLQYELEVERKALSIARAWAEDEAKRAREHAKVIEEARDRWERQGIKVVVDNDL 828

Query: 712 REESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFIS 771
           REE+ A   WV   +QFSV+ TVSRA+ LV +LK +A++  GKSKE+INTII KIL+ IS
Sbjct: 829 REETSAGGTWVATARQFSVEGTVSRAEKLVGELKLLADNARGKSKEVINTIIQKILVIIS 888

Query: 772 NLKKWASKASMRAAELKDATILKAKGSVQE----------------------LQQSTAEF 809
            LK+W S+A  +A ELKDA +LKAK SV+E                      LQQSTAEF
Sbjct: 889 RLKEWISEARTQAGELKDAAVLKAKESVEELQKNTSEFSSTIKERARGSIYGLQQSTAEF 948

Query: 810 RSNLTEGAKRVAGDCREGVEKLTQRFKT 837
              + EGAKRVAGDCREGVE+LTQRFK+
Sbjct: 949 SFAMKEGAKRVAGDCREGVERLTQRFKS 976




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225464485|ref|XP_002271744.1| PREDICTED: uncharacterized protein LOC100264485 [Vitis vinifera] Back     alignment and taxonomy information
>gi|302143846|emb|CBI22707.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224134861|ref|XP_002321923.1| predicted protein [Populus trichocarpa] gi|222868919|gb|EEF06050.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356529194|ref|XP_003533181.1| PREDICTED: uncharacterized protein LOC100780360 [Glycine max] Back     alignment and taxonomy information
>gi|356561542|ref|XP_003549040.1| PREDICTED: uncharacterized protein LOC100810148 [Glycine max] Back     alignment and taxonomy information
>gi|224122346|ref|XP_002318812.1| predicted protein [Populus trichocarpa] gi|222859485|gb|EEE97032.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449446025|ref|XP_004140772.1| PREDICTED: uncharacterized protein LOC101215442 [Cucumis sativus] gi|449518413|ref|XP_004166236.1| PREDICTED: uncharacterized LOC101215442 [Cucumis sativus] Back     alignment and taxonomy information
>gi|222424656|dbj|BAH20282.1| AT5G23890 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|15237846|ref|NP_197777.1| uncharacterized protein [Arabidopsis thaliana] gi|10176856|dbj|BAB10062.1| unnamed protein product [Arabidopsis thaliana] gi|332005846|gb|AED93229.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query837
TAIR|locus:2172853946 AT5G23890 [Arabidopsis thalian 0.959 0.848 0.477 7.8e-181
TAIR|locus:2156687761 AT5G52410 [Arabidopsis thalian 0.623 0.685 0.569 5.2e-149
TAIR|locus:2102375558 AT3G25680 "AT3G25680" [Arabido 0.457 0.686 0.343 2.9e-56
POMBASE|SPAC29E6.10c 1085 SPAC29E6.10c "conserved fungal 0.317 0.245 0.214 3.8e-06
MGI|MGI:2684999 1091 Tmf1 "TATA element modulatory 0.260 0.199 0.252 5.3e-06
CGD|CAL0003874 1404 PGA55 [Candida albicans (taxid 0.261 0.155 0.230 1.1e-05
UNIPROTKB|Q59SG9 1404 PGA55 "Flocculin-like protein" 0.261 0.155 0.230 1.1e-05
DICTYBASE|DDB_G0280959 1157 DG1113 "C2 domain-containing p 0.304 0.220 0.208 1.4e-05
UNIPROTKB|G4N0Y1709 MGG_09571 "Uncharacterized pro 0.454 0.535 0.223 1.6e-05
MGI|MGI:109172934 Dspp "dentin sialophosphoprote 0.305 0.274 0.213 1.8e-05
TAIR|locus:2172853 AT5G23890 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1755 (622.8 bits), Expect = 7.8e-181, P = 7.8e-181
 Identities = 409/856 (47%), Positives = 549/856 (64%)

Query:     1 MEPLTAAQDVSIVPDHKI--DKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
             M  LT+ Q+  I    +I  D+ +      +N+K +D+S+ES     N ++  S+  E +
Sbjct:   125 MHSLTSQQESMIQSSDEISSDEIKVANSEESNLKDEDKSIES-----NDVAQKSD--EGS 177

Query:    59 SDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPE 118
              ++K+  +  +   G ++  +    SI     + DL   + +D     P+TA  S K   
Sbjct:   178 GEDKLLGKETSSFDGVMTDEADATESIPQNTPEADLMVNAETD-----PETA-ESEK--- 228

Query:   119 PEVVSGTENASPLEGSDS--ILDANLPESASEITGENPIDVEPSSFSN--PT---DLGND 171
               ++S  E+ S L+ S    +LDA   ES++ +  EN    +P S  N  PT   DL N 
Sbjct:   229 --IIS--ESKSLLDSSTEPILLDA---ESSNLVGVENTNSEDPESLLNTEPTNVSDLENH 281

Query:   172 GXXXXXXXXXXXXXXXXHAPIEPLAAVISVSS--DTTVEPQILPKGDTETV-ASPSTIKN 228
                              +A    +  +  VSS  D+T +PQI+P  DTET  A+   +  
Sbjct:   282 VNSQKEDSLSSLSGIDAYAASGTVTELPEVSSQLDSTSKPQIVPLNDTETAFATAEELSE 341

Query:   229 VEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLN-ESNXXXXXX 287
             V  + +   + + SS S    D++      +S SP   P S +    +LN  S       
Sbjct:   342 VNGTPEYFETSDWSSIS----DIDTTKELESSKSP--VPESTDGSKDELNIYSQDELDDN 395

Query:   288 XXXXXXXXXXXXXXXXXXXXXXLQVL--PGKVLVPAVVDQVQGQALSALQVLKVIEADVK 345
                                   + V+  PGK+LVP   DQ+Q QA +ALQVLKVIE D +
Sbjct:   396 RMLLEIPSGGSAFSSAGIPAPFMSVIVNPGKILVPVAADQIQCQAFAALQVLKVIETDTQ 455

Query:   346 PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLA 405
             P DLC RREYARWL+SASS L+R+T SKVYPAMYIENVT+LAFDDITPEDPDFSSIQGLA
Sbjct:   456 PSDLCTRREYARWLISASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFSSIQGLA 515

Query:   406 EAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSG 465
             EAGLI+SKLS+RDLL++  G   F PES LSRQDL+SWKMALEKRQLPEA+KK+LY+LSG
Sbjct:   516 EAGLIASKLSNRDLLDDVEGTFLFSPESLLSRQDLISWKMALEKRQLPEADKKMLYKLSG 575

Query:   466 FIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDA 525
             FIDIDKINPDAWP+++ADL+ GEQGI ALAFGCTRLFQP KPVT  QAA+AL+ GEASD 
Sbjct:   576 FIDIDKINPDAWPSIIADLSTGEQGIAALAFGCTRLFQPHKPVTKGQAAIALSSGEASDI 635

Query:   526 VNEELQRIXXXXXXXXXXXXXXXLVAEVEKEINESFEKELSMEREKIDVVEKMXXXXXXX 585
             V+EEL RI               LVAEVEK++N SFEKELSMEREKI+ VEKM       
Sbjct:   636 VSEELARIEAESMAEKAVSAHNALVAEVEKDVNASFEKELSMEREKIEAVEKMAELAKVE 695

Query:   586 XXXXXXXXXVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINML 645
                       + +AL+KERAA+ESEME+LS+LRR+ EE+LE LMSNK EI++EKER+  L
Sbjct:   696 LEQLREKREEENLALVKERAAVESEMEVLSRLRRDAEEKLEDLMSNKAEITFEKERVFNL 755

Query:   646 RKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKV 705
             RKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAK+AREQ +ALE AR RWE  G++V
Sbjct:   756 RKEAEEESQRISKLQYELEVERKALSMARSWAEEEAKKAREQGRALEEARKRWETNGLRV 815

Query:   706 VVDKDLRE----ESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINT 761
             VVDKDL+E    E++ +++ +N  ++ SV++T  RA++L+DKLK MA  VSGKS+E+I T
Sbjct:   816 VVDKDLQETSSRETEQSIV-LNEMERSSVEETERRAKTLMDKLKEMAGTVSGKSREVIFT 874

Query:   762 IIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVA 821
             ++ KI L+I+ LK++A     RA E++DA I++AKG+  +++Q T +    +++  K++A
Sbjct:   875 VMEKIRLWITVLKEYAVNLGKRAGEMRDAAIVRAKGAAADVEQGTVQ----VSDKVKKMA 930

Query:   822 GDCREGVEKLTQRFKT 837
              +CR+GV K++QRFKT
Sbjct:   931 EECRDGVGKISQRFKT 946




GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0005739 "mitochondrion" evidence=IDA
GO:0009536 "plastid" evidence=IDA
TAIR|locus:2156687 AT5G52410 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2102375 AT3G25680 "AT3G25680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
POMBASE|SPAC29E6.10c SPAC29E6.10c "conserved fungal protein" [Schizosaccharomyces pombe (taxid:4896)] Back     alignment and assigned GO terms
MGI|MGI:2684999 Tmf1 "TATA element modulatory factor 1" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
CGD|CAL0003874 PGA55 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q59SG9 PGA55 "Flocculin-like protein" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0280959 DG1113 "C2 domain-containing protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|G4N0Y1 MGG_09571 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms
MGI|MGI:109172 Dspp "dentin sialophosphoprotein" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query837
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 2e-09
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 7e-09
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 2e-08
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 2e-08
TIGR021691164 TIGR02169, SMC_prok_A, chromosome segregation prot 2e-07
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 5e-07
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 6e-07
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 1e-06
PTZ00121 2084 PTZ00121, PTZ00121, MAEBL; Provisional 1e-06
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 2e-06
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 2e-06
COG0419908 COG0419, SbcC, ATPase involved in DNA repair [DNA 2e-06
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 3e-06
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 3e-06
COG0419 908 COG0419, SbcC, ATPase involved in DNA repair [DNA 6e-06
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 8e-06
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 8e-06
COG0419 908 COG0419, SbcC, ATPase involved in DNA repair [DNA 1e-05
PRK03918880 PRK03918, PRK03918, chromosome segregation protein 1e-05
PRK03918880 PRK03918, PRK03918, chromosome segregation protein 1e-05
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 2e-05
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 2e-05
PRK02224 880 PRK02224, PRK02224, chromosome segregation protein 2e-05
TIGR01843423 TIGR01843, type_I_hlyD, type I secretion membrane 3e-05
pfam13868349 pfam13868, Trichoplein, Tumour suppressor, Mitosta 6e-05
COG1579239 COG1579, COG1579, Zn-ribbon protein, possibly nucl 6e-05
COG4942420 COG4942, COG4942, Membrane-bound metallopeptidase 8e-05
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 1e-04
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 1e-04
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 2e-04
COG0419 908 COG0419, SbcC, ATPase involved in DNA repair [DNA 2e-04
pfam13868349 pfam13868, Trichoplein, Tumour suppressor, Mitosta 2e-04
COG2433652 COG2433, COG2433, Uncharacterized conserved protei 2e-04
PRK12704 520 PRK12704, PRK12704, phosphodiesterase; Provisional 2e-04
pfam05262489 pfam05262, Borrelia_P83, Borrelia P83/100 protein 3e-04
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 4e-04
pfam12128 1198 pfam12128, DUF3584, Protein of unknown function (D 5e-04
PTZ00121 2084 PTZ00121, PTZ00121, MAEBL; Provisional 6e-04
COG4372 499 COG4372, COG4372, Uncharacterized protein conserve 7e-04
COG3096 1480 COG3096, MukB, Uncharacterized protein involved in 7e-04
COG0419 908 COG0419, SbcC, ATPase involved in DNA repair [DNA 8e-04
pfam02463 1162 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain 8e-04
TIGR02680 1353 TIGR02680, TIGR02680, TIGR02680 family protein 0.001
PRK00409782 PRK00409, PRK00409, recombination and DNA strand e 0.001
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 0.002
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 0.002
PRK03918 880 PRK03918, PRK03918, chromosome segregation protein 0.002
COG1579239 COG1579, COG1579, Zn-ribbon protein, possibly nucl 0.002
COG4372 499 COG4372, COG4372, Uncharacterized protein conserve 0.002
PTZ00491850 PTZ00491, PTZ00491, major vault protein; Provision 0.002
PRK03918 880 PRK03918, PRK03918, chromosome segregation protein 0.003
pfam13868349 pfam13868, Trichoplein, Tumour suppressor, Mitosta 0.003
TIGR01069771 TIGR01069, mutS2, MutS2 family protein 0.003
PTZ00121 2084 PTZ00121, PTZ00121, MAEBL; Provisional 0.004
PRK02224 880 PRK02224, PRK02224, chromosome segregation protein 0.004
COG5185622 COG5185, HEC1, Protein involved in chromosome segr 0.004
TIGR03319 514 TIGR03319, RNase_Y, ribonuclease Y 0.004
TIGR04211198 TIGR04211, SH3_and_anchor, SH3 domain protein 0.004
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
 Score = 61.2 bits (149), Expect = 2e-09
 Identities = 49/189 (25%), Positives = 81/189 (42%), Gaps = 17/189 (8%)

Query: 523 SDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEA 582
             A+ ++L R+EAE             +A++ KE+ E  E E+    E+++  E+   EA
Sbjct: 728 ISALRKDLARLEAEVEQ------LEERIAQLSKELTE-LEAEIEELEERLEEAEEELAEA 780

Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
             E+E L A+ E  K  L   R A++     L+ L  E     E L S +  I+  + R+
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRL 840

Query: 643 NMLRKEAENENQEIARLQY----------ELEVERKALSMARAWAEDEAKRAREQAKALE 692
             L ++ E  +++I  L            ELE E +AL   RA  E+     R + + L 
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELS 900

Query: 693 GARDRWERQ 701
                 E +
Sbjct: 901 EELRELESK 909


SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]. Length = 1179

>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein, HlyD family Back     alignment and domain information
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin Back     alignment and domain information
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only] Back     alignment and domain information
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin Back     alignment and domain information
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional Back     alignment and domain information
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584) Back     alignment and domain information
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional Back     alignment and domain information
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown] Back     alignment and domain information
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain Back     alignment and domain information
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein Back     alignment and domain information
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only] Back     alignment and domain information
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown] Back     alignment and domain information
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional Back     alignment and domain information
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin Back     alignment and domain information
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein Back     alignment and domain information
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional Back     alignment and domain information
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts with SMC proteins [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y Back     alignment and domain information
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 837
PF0039545 SLH: S-layer homology domain; InterPro: IPR001119 98.95
PF0039545 SLH: S-layer homology domain; InterPro: IPR001119 97.73
PF08317325 Spc7: Spc7 kinetochore protein; InterPro: IPR01325 96.26
KOG0018 1141 consensus Structural maintenance of chromosome pro 95.81
KOG1029 1118 consensus Endocytic adaptor protein intersectin [S 95.75
smart00787312 Spc7 Spc7 kinetochore protein. This domain is foun 94.93
KOG1029 1118 consensus Endocytic adaptor protein intersectin [S 94.67
PRK00106 535 hypothetical protein; Provisional 94.24
PF12128 1201 DUF3584: Protein of unknown function (DUF3584); In 94.2
TIGR03185 650 DNA_S_dndD DNA sulfur modification protein DndD. T 94.08
PF09726697 Macoilin: Transmembrane protein; InterPro: IPR0191 93.97
COG4942420 Membrane-bound metallopeptidase [Cell division and 93.82
PF09726697 Macoilin: Transmembrane protein; InterPro: IPR0191 93.4
PF12072201 DUF3552: Domain of unknown function (DUF3552); Int 93.33
PRK00106 535 hypothetical protein; Provisional 92.82
PRK12704 520 phosphodiesterase; Provisional 92.72
KOG09941758 consensus Extracellular matrix glycoprotein Lamini 92.42
KOG0161 1930 consensus Myosin class II heavy chain [Cytoskeleto 91.97
COG1196 1163 Smc Chromosome segregation ATPases [Cell division 91.0
TIGR03319 514 YmdA_YtgF conserved hypothetical protein YmdA/YtgF 90.82
PRK12704 520 phosphodiesterase; Provisional 89.26
TIGR03319 514 YmdA_YtgF conserved hypothetical protein YmdA/YtgF 88.9
KOG0161 1930 consensus Myosin class II heavy chain [Cytoskeleto 88.76
PF10186302 Atg14: UV radiation resistance protein and autopha 88.46
KOG4302 660 consensus Microtubule-associated protein essential 88.39
PTZ00121 2084 MAEBL; Provisional 87.65
PTZ00121 2084 MAEBL; Provisional 87.64
TIGR02169 1164 SMC_prok_A chromosome segregation protein SMC, pri 87.09
TIGR01069771 mutS2 MutS2 family protein. Function of MutS2 is u 86.75
PRK11637428 AmiB activator; Provisional 85.82
PLN031881320 kinesin-12 family protein; Provisional 85.63
PF05667594 DUF812: Protein of unknown function (DUF812); Inte 85.41
KOG05791187 consensus Ste20-like serine/threonine protein kina 85.38
PF00769246 ERM: Ezrin/radixin/moesin family; InterPro: IPR011 85.22
TIGR02168 1179 SMC_prok_B chromosome segregation protein SMC, com 84.97
KOG0995581 consensus Centromere-associated protein HEC1 [Cell 84.48
KOG3850455 consensus Predicted membrane protein [Function unk 83.65
KOG0977 546 consensus Nuclear envelope protein lamin, intermed 83.57
TIGR00606 1311 rad50 rad50. This family is based on the phylogeno 83.15
PF05701522 WEMBL: Weak chloroplast movement under blue light; 82.34
KOG2685421 consensus Cystoskeletal protein Tektin [Cytoskelet 82.34
PF00038312 Filament: Intermediate filament protein; InterPro: 82.26
PF10186302 Atg14: UV radiation resistance protein and autopha 81.69
KOG0933 1174 consensus Structural maintenance of chromosome pro 80.14
COG1579239 Zn-ribbon protein, possibly nucleic acid-binding [ 80.11
>PF00395 SLH: S-layer homology domain; InterPro: IPR001119 S-layers are paracrystalline mono-layered assemblies of (glyco)proteins which coat the surface of bacteria [, ] Back     alignment and domain information
Probab=98.95  E-value=7.2e-10  Score=86.33  Aligned_cols=44  Identities=27%  Similarity=0.421  Sum_probs=31.5

Q ss_pred             CccCCCCCCCCh-HHHHHHHHcCCcccccccccCCCCCCCCcccCCCCcCcHHHHH
Q 003229          387 AFDDITPEDPDF-SSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLV  441 (837)
Q Consensus       387 AF~DV~~ddPyf-~yIQAAAEAGIIsGkLSG~~~~s~dDG~~tFrPDsPITRQELA  441 (837)
                      .|+||+..+|+| .+|+.+++.|||.|+.           +++|+|+++|||+|||
T Consensus         1 ~F~Dv~~~~~~~a~~i~~~~~~gi~~G~~-----------~~~f~P~~~iTR~e~A   45 (45)
T PF00395_consen    1 PFKDVPSISWAYAEAIQWLYQLGIISGYS-----------DGTFNPNDPITRAEAA   45 (45)
T ss_dssp             -BTTB-TTSSSTTHHHHHHHHTTSS---T-----------TS---TTSB-BHHHHH
T ss_pred             CCCCCCCCcHHHHHHHHHHHHcCCcccCC-----------CCeECCCCCcCHHHhC
Confidence            599999999955 9999999999999973           3379999999999986



Several S-layer proteins and some other cell wall proteins contain one or more copies of a domain of about 50-60 residues, which has been called SLH (for S-layer homology). Although it was originally proposed that SLH domains bind to peptidoglycan, it is now evident that pyruvylated secondary cell wall polymers (SCWPs), which are either teichoic acids, teichuronic acids, lipoteichoic acids or lipoglycans, serve as the anchoring structures for SLH motifs in the Gram-positive cell wall [, ]. However, the study of S-layer protein SbpA of Bacillus sphaericus revealed that SLH motifs are not sufficient for specific binding to SCWPs. Thus, the molecular basis explaining SLH affinity and specificity of interaction with cell wall polymers are not completely elucidated [].; PDB: 3PYW_A.

>PF00395 SLH: S-layer homology domain; InterPro: IPR001119 S-layers are paracrystalline mono-layered assemblies of (glyco)proteins which coat the surface of bacteria [, ] Back     alignment and domain information
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [] Back     alignment and domain information
>KOG0018 consensus Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
>smart00787 Spc7 Spc7 kinetochore protein Back     alignment and domain information
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
>PRK00106 hypothetical protein; Provisional Back     alignment and domain information
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins Back     alignment and domain information
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD Back     alignment and domain information
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Back     alignment and domain information
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information
>PF12072 DUF3552: Domain of unknown function (DUF3552); InterPro: IPR022711 This presumed domain is functionally uncharacterised Back     alignment and domain information
>PRK00106 hypothetical protein; Provisional Back     alignment and domain information
>PRK12704 phosphodiesterase; Provisional Back     alignment and domain information
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures] Back     alignment and domain information
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] Back     alignment and domain information
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF Back     alignment and domain information
>PRK12704 phosphodiesterase; Provisional Back     alignment and domain information
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF Back     alignment and domain information
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] Back     alignment and domain information
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking Back     alignment and domain information
>KOG4302 consensus Microtubule-associated protein essential for anaphase spindle elongation [Cell cycle control, cell division, chromosome partitioning; Cytoskeleton] Back     alignment and domain information
>PTZ00121 MAEBL; Provisional Back     alignment and domain information
>PTZ00121 MAEBL; Provisional Back     alignment and domain information
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>TIGR01069 mutS2 MutS2 family protein Back     alignment and domain information
>PRK11637 AmiB activator; Provisional Back     alignment and domain information
>PLN03188 kinesin-12 family protein; Provisional Back     alignment and domain information
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function Back     alignment and domain information
>KOG0579 consensus Ste20-like serine/threonine protein kinase [Signal transduction mechanisms] Back     alignment and domain information
>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin [] Back     alignment and domain information
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>KOG3850 consensus Predicted membrane protein [Function unknown] Back     alignment and domain information
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure] Back     alignment and domain information
>TIGR00606 rad50 rad50 Back     alignment and domain information
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function Back     alignment and domain information
>KOG2685 consensus Cystoskeletal protein Tektin [Cytoskeleton] Back     alignment and domain information
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope Back     alignment and domain information
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking Back     alignment and domain information
>KOG0933 consensus Structural maintenance of chromosome protein 2 (chromosome condensation complex Condensin, subunit E) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query837
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 3e-12
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 1e-10
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 1e-08
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 2e-08
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 6e-08
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-11
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-09
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-04
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 5e-10
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 2e-07
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 3e-07
2zuo_A861 MVP, major vault protein; repeat domains, protein- 5e-09
2zuo_A861 MVP, major vault protein; repeat domains, protein- 7e-08
2zuo_A861 MVP, major vault protein; repeat domains, protein- 1e-07
2zuo_A861 MVP, major vault protein; repeat domains, protein- 2e-06
2zuo_A861 MVP, major vault protein; repeat domains, protein- 2e-04
3na7_A256 HP0958; flagellar biogenesis, flagellum export, C4 6e-06
3na7_A256 HP0958; flagellar biogenesis, flagellum export, C4 2e-04
3lvg_D190 LCB, clathrin light chain B; SELF assembly, coated 3e-05
3lvg_D190 LCB, clathrin light chain B; SELF assembly, coated 4e-04
3lvg_D190 LCB, clathrin light chain B; SELF assembly, coated 4e-04
3lvg_D190 LCB, clathrin light chain B; SELF assembly, coated 9e-04
2qag_B427 Septin-6, protein NEDD5; cell cycle, cell division 8e-05
3vkg_A 3245 Dynein heavy chain, cytoplasmic; AAA+ protein, mol 1e-04
3vkg_A 3245 Dynein heavy chain, cytoplasmic; AAA+ protein, mol 1e-04
3vkg_A 3245 Dynein heavy chain, cytoplasmic; AAA+ protein, mol 5e-04
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 1e-04
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 4e-04
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 8e-04
1f5n_A592 Interferon-induced guanylate-binding protein 1; GB 3e-04
2fxo_A129 Myosin heavy chain, cardiac muscle beta isoform; c 4e-04
2ycu_A995 Non muscle myosin 2C, alpha-actinin; motor protein 5e-04
2i1j_A575 Moesin; FERM, coiled-coil, C-ermad, ERM, radixin, 6e-04
3pyw_A203 S-layer protein SAP; SLH-domains, polysaccharide b 8e-04
1cii_A 602 Colicin IA; bacteriocin, ION channel formation, tr 8e-04
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
 Score = 69.5 bits (170), Expect = 3e-12
 Identities = 39/199 (19%), Positives = 90/199 (45%), Gaps = 8/199 (4%)

Query: 521  EASDAVNEELQRIEAE-SAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMA 579
            +    + ++  ++  E    E  VS+ +  +AE E++     + +   E    ++  ++ 
Sbjct: 990  DDILIMEDQNNKLTKERKLLEERVSDLTTNLAEEEEKAKNLTKLKNKHESMISELEVRLK 1049

Query: 580  EE--ARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
            +E  +RQELE+++ + E +   L ++ A +++++  L     + EE+L++ ++    +  
Sbjct: 1050 KEEKSRQELEKIKRKLEGESSDLHEQIAELQAQIAELKAQLAKKEEELQAALA---RLED 1106

Query: 638  EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDR 697
            E  + N   K+       I+ LQ +LE E+ A + A     D ++        LE   D 
Sbjct: 1107 ETSQKNNALKKIRELESHISDLQEDLESEKAARNKAEKQKRDLSEELEALKTELEDTLDT 1166

Query: 698  WERQGIKVVVDKDLREESD 716
               Q  + +   D +++ D
Sbjct: 1167 TATQ--QELRGSDYKDDDD 1183


>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861 Back     alignment and structure
>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861 Back     alignment and structure
>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861 Back     alignment and structure
>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861 Back     alignment and structure
>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861 Back     alignment and structure
>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Length = 256 Back     alignment and structure
>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Length = 256 Back     alignment and structure
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190 Back     alignment and structure
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190 Back     alignment and structure
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190 Back     alignment and structure
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190 Back     alignment and structure
>2qag_B Septin-6, protein NEDD5; cell cycle, cell division, GTP-binding, nucleotide-binding, phosphorylation, acetylation, alternative splicing, coiled coil; HET: GDP GTP; 4.00A {Homo sapiens} Length = 427 Back     alignment and structure
>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245 Back     alignment and structure
>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245 Back     alignment and structure
>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245 Back     alignment and structure
>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284 Back     alignment and structure
>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284 Back     alignment and structure
>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284 Back     alignment and structure
>1f5n_A Interferon-induced guanylate-binding protein 1; GBP, GTP hydrolysis, GDP, GMP, dynamin related, large GTPase family. GMPPNP, GPPNHP.; HET: GNP; 1.70A {Homo sapiens} SCOP: a.114.1.1 c.37.1.8 PDB: 1dg3_A* 2b8w_A* 2b92_A* 2bc9_A* 2d4h_A* Length = 592 Back     alignment and structure
>2fxo_A Myosin heavy chain, cardiac muscle beta isoform; coiled coil (dimeric, parallel), familial hypertrophic cardiomyopathy, FHC-associated mutant E924K; 2.50A {Homo sapiens} SCOP: h.1.26.1 PDB: 2fxm_A Length = 129 Back     alignment and structure
>2ycu_A Non muscle myosin 2C, alpha-actinin; motor protein; HET: AOV; 2.25A {Homo sapiens} PDB: 1br1_A* 1br4_A* 1br2_A* Length = 995 Back     alignment and structure
>2i1j_A Moesin; FERM, coiled-coil, C-ermad, ERM, radixin, ezrin, MER actin binding, masking, regulation, SELF-inhibition, cell A membrane protein; 2.10A {Spodoptera frugiperda} PDB: 2i1k_A 1e5w_A Length = 575 Back     alignment and structure
>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-2, protein structure initiative; 1.80A {Bacillus anthracis} Length = 203 Back     alignment and structure
>1cii_A Colicin IA; bacteriocin, ION channel formation, transmembrane protein; 3.00A {Escherichia coli} SCOP: f.1.1.1 h.4.3.1 Length = 602 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query837
3pyw_A203 S-layer protein SAP; SLH-domains, polysaccharide b 99.96
4aq1_A 892 SBSB protein; structural protein, S-layer protein, 99.87
3pyw_A203 S-layer protein SAP; SLH-domains, polysaccharide b 99.74
4aq1_A 892 SBSB protein; structural protein, S-layer protein, 99.38
3oja_B597 Anopheles plasmodium-responsive leucine-rich REPE 90.75
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 90.37
3oja_B597 Anopheles plasmodium-responsive leucine-rich REPE 87.89
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 87.61
2no2_A107 HIP-I, huntingtin-interacting protein 1; clathrin 87.03
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 86.75
3na7_A256 HP0958; flagellar biogenesis, flagellum export, C4 86.45
3vkg_A 3245 Dynein heavy chain, cytoplasmic; AAA+ protein, mol 84.83
3vkg_A 3245 Dynein heavy chain, cytoplasmic; AAA+ protein, mol 84.43
2fxo_A129 Myosin heavy chain, cardiac muscle beta isoform; c 80.52
>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-biology, protein structure initiative; 1.80A {Bacillus anthracis} Back     alignment and structure
Probab=99.96  E-value=9.4e-29  Score=240.01  Aligned_cols=169  Identities=20%  Similarity=0.181  Sum_probs=132.3

Q ss_pred             cccccC-CCceeeccccchhhHHHHHhcccccccC-CccCCCCCccHHHHHHHHHHhcccccccccccccccccccCCCC
Q 003229          308 AALQVL-PGKVLVPAVVDQVQGQALSALQVLKVIE-ADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTD  385 (837)
Q Consensus       308 aaf~v~-p~rv~VPvavD~~qIeALAaLGILkgiE-gtF~Pn~pITRAEFArwLVRAls~L~R~p~~kI~Pam~is~st~  385 (837)
                      .+|.+. .++    -+.  .+|+.|..+|+++|++ ++|+|+++|||+|||.||+++++.          +   ......
T Consensus         3 ~~F~Dv~~~~----wa~--~~I~~l~~~gii~G~~dg~F~P~~~iTR~e~a~~l~r~~~~----------~---~~~~~~   63 (203)
T 3pyw_A            3 KTFPDVPADH----WGI--DSINYLVEKGAVKGNDKGMFEPGKELTRAEAATMMAQILNL----------P---IDKDAK   63 (203)
T ss_dssp             CCBTTBCTTS----TTH--HHHHHHHHTTSCCCCTTSCCCTTCBCBHHHHHHHHHHHTTC----------C---CCTTCC
T ss_pred             CCCcCCCCCc----HHH--HHHHHHHHCcCEecCCCCeECCCCCchHHHHHHHHHHHhCC----------C---CCcCCC
Confidence            357444 555    233  4899999999999997 699999999999999999999871          1   112345


Q ss_pred             CCccCCCCCCCChHHHHHHHHcCCcccccccccCCCCCCCCcccCCCCcCcHHHHHHHHHH-HhhcCCchhhhhhhhccC
Q 003229          386 LAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA-LEKRQLPEANKKILYQLS  464 (837)
Q Consensus       386 ~AF~DV~~ddPyf~yIQAAAEAGIIsGkLSG~~~~s~dDG~~tFrPDsPITRQELAvwkVA-Le~~~LP~ad~k~L~~~s  464 (837)
                      ..|+||+ +||++.+|+.|+++|||.|+           +++ |+|+++|||+||+.|+++ +++..   ..  ......
T Consensus        64 ~~F~Dv~-~~w~~~~i~~a~~~Gii~g~-----------~~~-F~P~~~iTR~e~a~~l~r~~~~~~---~~--~~~~~~  125 (203)
T 3pyw_A           64 PSFADSQ-GQWYTPFIAAVEKAGVIKGT-----------GNG-FEPNGKIDRVSMASLLVEAYKLDT---KV--NGTPAT  125 (203)
T ss_dssp             CSCGGGC-SSTTHHHHHHHHHTTSSCCC-----------SSC-CCTTSBCBHHHHHHHHHHHHTGGG---GS--SSCCCC
T ss_pred             CCCCCCC-cHHHHHHHHHHHHCcCeecC-----------CCC-CCCCCcccHHHHHHHHHHHhcccc---cc--ccccCC
Confidence            6899995 99999999999999999997           234 999999999999999875 43321   10  011245


Q ss_pred             CcccccCcCcchHHHHHHHHhcCcccceecccCCCccccCCCCCcHHHHHHHHHhc
Q 003229          465 GFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIG  520 (837)
Q Consensus       465 gF~DadkIspwA~~AVaadL~aGd~GII~~vfG~tg~FqPnkPVTRAEAAAaL~sG  520 (837)
                      .|.|...| +|+.++|......   ||+.   |..+.|+|++++||+|||++|++-
T Consensus       126 ~f~D~~~i-~wa~~~v~~~~~~---Gi~~---g~~~~f~P~~~iTRaE~A~~l~r~  174 (203)
T 3pyw_A          126 KFKDLETL-NWGKEKANILVEL---GISV---GTGDQWEPKKTVTKAEAAQFIAKT  174 (203)
T ss_dssp             CCHHHHTC-SSSHHHHHHHHHT---TCSC---CCSSCCCTTSBCBHHHHHHHHHHH
T ss_pred             CCCCCCcc-HHHHHHHHHHHHC---cCEe---CCCCCcCCCCCCcHHHHHHHHHHH
Confidence            79999999 8999999877755   4653   444589999999999999999964



>4aq1_A SBSB protein; structural protein, S-layer protein, nanobody; 2.42A {Geobacillus stearothermophilus} PDB: 4aq1_C Back     alignment and structure
>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-biology, protein structure initiative; 1.80A {Bacillus anthracis} Back     alignment and structure
>4aq1_A SBSB protein; structural protein, S-layer protein, nanobody; 2.42A {Geobacillus stearothermophilus} PDB: 4aq1_C Back     alignment and structure
>3oja_B Anopheles plasmodium-responsive leucine-rich REPE 1; coiled-coil, helix-loop-helix, leucine-rich repeat, protein; HET: NAG MAN; 2.70A {Anopheles gambiae} Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Back     alignment and structure
>3oja_B Anopheles plasmodium-responsive leucine-rich REPE 1; coiled-coil, helix-loop-helix, leucine-rich repeat, protein; HET: NAG MAN; 2.70A {Anopheles gambiae} Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Back     alignment and structure
>2no2_A HIP-I, huntingtin-interacting protein 1; clathrin light chain binding, HIP1 coiled-coil domain, endocytosis, clathrin SELF-assembly, cell adhesion; 2.80A {Homo sapiens} Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Back     alignment and structure
>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Back     alignment and structure
>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* Back     alignment and structure
>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* Back     alignment and structure
>2fxo_A Myosin heavy chain, cardiac muscle beta isoform; coiled coil (dimeric, parallel), familial hypertrophic cardiomyopathy, FHC-associated mutant E924K; 2.50A {Homo sapiens} SCOP: h.1.26.1 PDB: 2fxm_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 837
d2es4d1280 a.137.15.1 (D:53-332) Lipase chaperone LifO (LipB) 2e-06
d2es4d1280 a.137.15.1 (D:53-332) Lipase chaperone LifO (LipB) 0.002
d1f5na1300 a.114.1.1 (A:284-583) Interferon-induced guanylate 4e-04
d1f5na1300 a.114.1.1 (A:284-583) Interferon-induced guanylate 0.002
>d1f5na1 a.114.1.1 (A:284-583) Interferon-induced guanylate-binding protein 1 (GBP1), C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Length = 300 Back     information, alignment and structure
>d1f5na1 a.114.1.1 (A:284-583) Interferon-induced guanylate-binding protein 1 (GBP1), C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Length = 300 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00