Citrus Sinensis ID: 003229

Local Sequence Feature Prediction

Prediction and (Method)	Result

Residue Number Marker

Protein Sequence

Secondary Structure (PSIPRED)

Secondary Structure Prediction (SSPRO)

Coil and Loop (DISEMBL)

Flexible Loop (DISEMBL)

Low Complexity Region (SEG)

Disordered region (IsUnstruct)

Disordered Region (DISOPRED)

Disordered Region (DISEMBL)

Disordered Region (DISPRO)

Transmembrane Helix (TMHMM)

Transmembrane Helix (HMMTOP)

Transmembrane Helix (MEMSAT)

TM Helix, Signal Peptide (MEMSAT_SVM)

TM Helix, Signal Peptide (Phobius)

Signal Peptide (SignalP HMM Mode)

Signal Peptide (SignalP NN Mode)

Coiled Coils (COILS)

Positional Conservation

--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------

MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT

ccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccHHHHHHHHHccccccccccccccccccccccccccccccHHHHHHHHHHHHHccccHHHHHHHHHHcccccccccccccHHHHHHHHHccccccEEEcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc

ccccccHHHHEcccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEccccccccccccEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccEEccccccccccccHHHHHHHHHHHHHHHHcccccccccHHHccccccccccccccccccHHHHHHHHHcccccccccccccccccccccEEcccccccHHHHHHHcccHHccccccccHHHHHHHcccccHHHccHHHHHHHHHHHHcccHHHHHHHHcccccccccccccHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHEEEHHHHHHHHHccHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc

mepltaaqdvsivpdhkidkfeeygyagnnvkqddrslesktgtdnalssSSEAIEVasdnkidsenetpstgdvshsssginsindvakqddlqresasddmsvapdtaltspklpepevvsgtenasplegsdsildanlpesaseitgenpidvepssfsnptdlgndgskfsrifsdsssissshapiePLAAVIsvssdttvepqilpkgdtetvaspstiknveqsekpllsgedssssmevhdlnkngssgtsvspsifpfsneketcdlnesnsssftespptgsssspagipapSVVSAALQvlpgkvlvpAVVDQVQGQALSALQVLKVIEAdvkpgdlcirREYARWLVSASStltrstmskvypamyienvtdlafdditpedpdfssiqgLAEAGLISsklshrdllneepgpifflpesplsrqDLVSWKMALEKRQLPEANKKILYQLSgfididkinpdawPALLADLTAGEQGIIALAFGctrlfqpdkpvtNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRwerqgikvvvdkdlreeSDAAVMWVNagkqfsvdqTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT

mepltaaqdvsivpdhkIDKFEEYGYAgnnvkqddrslesktgtdnalsssseAIEVAsdnkidsenetpstgdvshsssginSINDVAKQDDLQRESASDDMSVAPDTaltspklpepevVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSsdttvepqilpkgdtetvaspstiknveqsekPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWlvsasstltrstmskvYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESfekelsmerekidvVEKMAEEARQELerlraerevdkIALMKERAAIESEMEILSKLRREVEEQLeslmsnkveisyeKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKAlegardrwerqgikvvvdkdlrEESDAAVMWVNagkqfsvdqtvSRAQSLVDKLKAMandvsgkskEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLtegakrvagdcregvekltqrfkt

MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGskfsrifsdsssisssHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNsssftespptgsssspagipapsvvsaaLQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIeaesaaenavsehsaLVAEVEKEINESFEKELSMEREKIDVVEKMaeearqelerlraereVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT

*******************KFEEYGY**************************************************************************************************************************************************************************************************************************************************************************************VVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITP*******IQGLAEAGLI**************GPIFFLPE**LSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIG*********************************************************************************************************************************************************************************WERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKA*********************************************************

*******************************************************************************************************************************************************************************************************************L***************************************************************************************************VLPGKVLVPAVVDQVQGQALSALQVLKV****VKPGDLCIRREYARWLVSASSTLTR**************VTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH*****EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAA****************************************************************************************************************************************************************************************************************************************************************************************************************************L*Q****

********DVSIVPDHKIDKFEEYGYAGNNVKQ***********************************************GINSINDVAKQD**************PDTALTSPKLP*************LEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFS*********APIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIK*********************************VSPSIFPFSNEKET*************************IPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIE*************ALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA***********************RWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT

******************DKFEEYGY************************************************************************************************************************************************************************************************************************************************************************************************GKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISS************GPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT

ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEIxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxKVExxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxLSMAxxxxxxxxxxxxxxxxxxxxxxxxxxxxGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT

no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

Original result of BLAST against SWISS-PROT Database

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST

Original result of BLAST against Nonredundant Database

GI	Alignment Graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		837
255545258		976	conserved hypothetical protein [Ricinus	0.970	0.831	0.585	0.0
225464485		985	PREDICTED: uncharacterized protein LOC10	0.704	0.598	0.717	0.0
302143846		1040	unnamed protein product [Vitis vinifera]	0.704	0.567	0.717	0.0
224134861		592	predicted protein [Populus trichocarpa]	0.664	0.939	0.719	0.0
356529194		975	PREDICTED: uncharacterized protein LOC10	0.988	0.848	0.552	0.0
356561542		1002	PREDICTED: uncharacterized protein LOC10	0.990	0.827	0.552	0.0
224122346		793	predicted protein [Populus trichocarpa]	0.667	0.704	0.708	0.0
449446025		722	PREDICTED: uncharacterized protein LOC10	0.643	0.746	0.732	0.0
222424656		805	AT5G23890 [Arabidopsis thaliana]	0.848	0.881	0.557	0.0
15237846		946	uncharacterized protein [Arabidopsis tha	0.955	0.845	0.511	0.0

>gi\|255545258\|ref\|XP_002513690.1\| conserved hypothetical protein [Ricinus communis] gi\|223547598\|gb\|EEF49093.1\| conserved hypothetical protein [Ricinus communis]	Back alignment and taxonomy information

 Score =  886 bits (2289), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 508/868 (58%), Positives = 626/868 (72%), Gaps = 56/868 (6%)

Query: 1   MEPLTAAQDVSIVPDHKIDKFEEYGYA--GNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
           MEPLT  Q+VS+V D + DK E+   A    N+K++  SLE KT TD  L SS +  E  
Sbjct: 134 MEPLTVQQEVSLVSDDEEDKIEKNTSAESSANLKEEYISLEHKTNTDVDLPSSPQIEETH 193

Query: 59  SDNKIDSENETPSTGD-VSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLP 117
           ++NK+  + +   + D  ++  S  +++++   Q+DLQ +SA D      +T   S  LP
Sbjct: 194 NENKLSGDTDQLLSADNGNYIISSNDTVDNAPVQEDLQYDSAFDSKLGVLETTPNSTNLP 253

Query: 118 EPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSR 177
           E ++                +D NL      + GE    +   +         + +  S 
Sbjct: 254 ESKIAK--------------IDKNL------VNGEPAYSLNIINTITEHTEAKENTIPSS 293

Query: 178 IFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLL 237
             S S  + SS   +  ++  I+++SDT  E   L K   ++ AS  T + +  S   + 
Sbjct: 294 DSSISPVLKSSEPVV--VSTSITLTSDTVSEVGNLFKDGMDSEASVPTKEELNTSTNQV- 350

Query: 238 SGEDSSSSMEVHDLNKNGSSGTSVSPS-IFPFSNEKETC---DLNESNSSSFTESPPTGS 293
           S + +SSS+E++ L ++GSSG +      +PF+N+++     D+N S +SS  ESPP   
Sbjct: 351 STDRNSSSLEMNYLTESGSSGVTSVSEWAYPFANKQDIVANDDMNLSKTSS--ESPPFSG 408

Query: 294 SSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRR 353
           S S AG+PAPS V  +LQV PGK+LVPAVVDQ  GQAL+ALQVLKVIEADV+P DLC RR
Sbjct: 409 SFSSAGVPAPSAVPESLQVSPGKILVPAVVDQTHGQALAALQVLKVIEADVQPSDLCTRR 468

Query: 354 EYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSK 413
           EYARWLV+ASS L+RST+SKVYPAMYIEN T+ AFDDITP+DPDFSSIQGLAEAGLISS+
Sbjct: 469 EYARWLVAASSALSRSTLSKVYPAMYIENATEPAFDDITPDDPDFSSIQGLAEAGLISSR 528

Query: 414 LSHRDLLN--EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDK 471
           LS+ DLL+  E+ GP+ F PESPLSRQDLVSWKMALEKRQLPEAN+KILYQLSGF D+DK
Sbjct: 529 LSNHDLLSPVEDQGPLNFSPESPLSRQDLVSWKMALEKRQLPEANRKILYQLSGFRDVDK 588

Query: 472 INPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQ 531
           I+PDAWPAL+ADL+AG+QGII+LAFGCTRLFQP+KPVT AQAAVALAIGEASD VNEEL 
Sbjct: 589 IHPDAWPALIADLSAGDQGIISLAFGCTRLFQPNKPVTKAQAAVALAIGEASDIVNEELA 648

Query: 532 RIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRA 591
           RIEAES AENAVS H+ALVA+VE++IN SFEKEL MEREKI+ VEKMAEEAR ELERLRA
Sbjct: 649 RIEAESMAENAVSAHNALVAQVEQDINASFEKELLMEREKINAVEKMAEEARLELERLRA 708

Query: 592 EREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAEN 651
           ERE D  ALMKERA+IE+EME+LS+L+ EVEEQL++L+S+KVEISYEKERIN L+KEAEN
Sbjct: 709 EREADNFALMKERASIEAEMEVLSRLKGEVEEQLQTLLSSKVEISYEKERINKLQKEAEN 768

Query: 652 ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDL 711
           E QEI+RLQYELEVERKALS+ARAWAEDEAKRARE AK +E ARDRWERQGIKVVVD DL
Sbjct: 769 EKQEISRLQYELEVERKALSIARAWAEDEAKRAREHAKVIEEARDRWERQGIKVVVDNDL 828

Query: 712 REESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFIS 771
           REE+ A   WV   +QFSV+ TVSRA+ LV +LK +A++  GKSKE+INTII KIL+ IS
Sbjct: 829 REETSAGGTWVATARQFSVEGTVSRAEKLVGELKLLADNARGKSKEVINTIIQKILVIIS 888

Query: 772 NLKKWASKASMRAAELKDATILKAKGSVQE----------------------LQQSTAEF 809
            LK+W S+A  +A ELKDA +LKAK SV+E                      LQQSTAEF
Sbjct: 889 RLKEWISEARTQAGELKDAAVLKAKESVEELQKNTSEFSSTIKERARGSIYGLQQSTAEF 948

Query: 810 RSNLTEGAKRVAGDCREGVEKLTQRFKT 837
              + EGAKRVAGDCREGVE+LTQRFK+
Sbjct: 949 SFAMKEGAKRVAGDCREGVERLTQRFKS 976

Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi\|225464485\|ref\|XP_002271744.1\| PREDICTED: uncharacterized protein LOC100264485 [Vitis vinifera]	Back alignment and taxonomy information

>gi\|302143846\|emb\|CBI22707.3\| unnamed protein product [Vitis vinifera]	Back alignment and taxonomy information

>gi\|224134861\|ref\|XP_002321923.1\| predicted protein [Populus trichocarpa] gi\|222868919\|gb\|EEF06050.1\| predicted protein [Populus trichocarpa]	Back alignment and taxonomy information

>gi\|356529194\|ref\|XP_003533181.1\| PREDICTED: uncharacterized protein LOC100780360 [Glycine max]	Back alignment and taxonomy information

>gi\|356561542\|ref\|XP_003549040.1\| PREDICTED: uncharacterized protein LOC100810148 [Glycine max]	Back alignment and taxonomy information

>gi\|224122346\|ref\|XP_002318812.1\| predicted protein [Populus trichocarpa] gi\|222859485\|gb\|EEE97032.1\| predicted protein [Populus trichocarpa]	Back alignment and taxonomy information

>gi\|449446025\|ref\|XP_004140772.1\| PREDICTED: uncharacterized protein LOC101215442 [Cucumis sativus] gi\|449518413\|ref\|XP_004166236.1\| PREDICTED: uncharacterized LOC101215442 [Cucumis sativus]	Back alignment and taxonomy information

>gi\|222424656\|dbj\|BAH20282.1\| AT5G23890 [Arabidopsis thaliana]	Back alignment and taxonomy information

>gi\|15237846\|ref\|NP_197777.1\| uncharacterized protein [Arabidopsis thaliana] gi\|10176856\|dbj\|BAB10062.1\| unnamed protein product [Arabidopsis thaliana] gi\|332005846\|gb\|AED93229.1\| uncharacterized protein [Arabidopsis thaliana]	Back alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

Original result of BLAST against Gene Ontology (AMIGO)

ID	Alignment graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		837
TAIR\|locus:2172853		946	AT5G23890 [Arabidopsis thalian	0.959	0.848	0.477	7.8e-181
TAIR\|locus:2156687		761	AT5G52410 [Arabidopsis thalian	0.623	0.685	0.569	5.2e-149
TAIR\|locus:2102375		558	AT3G25680 "AT3G25680" [Arabido	0.457	0.686	0.343	2.9e-56
POMBASE\|SPAC29E6.10c		1085	SPAC29E6.10c "conserved fungal	0.317	0.245	0.214	3.8e-06
MGI\|MGI:2684999		1091	Tmf1 "TATA element modulatory	0.260	0.199	0.252	5.3e-06
CGD\|CAL0003874		1404	PGA55 [Candida albicans (taxid	0.261	0.155	0.230	1.1e-05
UNIPROTKB\|Q59SG9		1404	PGA55 "Flocculin-like protein"	0.261	0.155	0.230	1.1e-05
DICTYBASE\|DDB_G0280959		1157	DG1113 "C2 domain-containing p	0.304	0.220	0.208	1.4e-05
UNIPROTKB\|G4N0Y1		709	MGG_09571 "Uncharacterized pro	0.454	0.535	0.223	1.6e-05
MGI\|MGI:109172		934	Dspp "dentin sialophosphoprote	0.305	0.274	0.213	1.8e-05

TAIR\|locus:2172853 AT5G23890 [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

 Score = 1755 (622.8 bits), Expect = 7.8e-181, P = 7.8e-181
 Identities = 409/856 (47%), Positives = 549/856 (64%)

Query:     1 MEPLTAAQDVSIVPDHKI--DKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
             M  LT+ Q+  I    +I  D+ +      +N+K +D+S+ES     N ++  S+  E +
Sbjct:   125 MHSLTSQQESMIQSSDEISSDEIKVANSEESNLKDEDKSIES-----NDVAQKSD--EGS 177

Query:    59 SDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPE 118
              ++K+  +  +   G ++  +    SI     + DL   + +D     P+TA  S K   
Sbjct:   178 GEDKLLGKETSSFDGVMTDEADATESIPQNTPEADLMVNAETD-----PETA-ESEK--- 228

Query:   119 PEVVSGTENASPLEGSDS--ILDANLPESASEITGENPIDVEPSSFSN--PT---DLGND 171
               ++S  E+ S L+ S    +LDA   ES++ +  EN    +P S  N  PT   DL N 
Sbjct:   229 --IIS--ESKSLLDSSTEPILLDA---ESSNLVGVENTNSEDPESLLNTEPTNVSDLENH 281

Query:   172 GXXXXXXXXXXXXXXXXHAPIEPLAAVISVSS--DTTVEPQILPKGDTETV-ASPSTIKN 228
                              +A    +  +  VSS  D+T +PQI+P  DTET  A+   +  
Sbjct:   282 VNSQKEDSLSSLSGIDAYAASGTVTELPEVSSQLDSTSKPQIVPLNDTETAFATAEELSE 341

Query:   229 VEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLN-ESNXXXXXX 287
             V  + +   + + SS S    D++      +S SP   P S +    +LN  S       
Sbjct:   342 VNGTPEYFETSDWSSIS----DIDTTKELESSKSP--VPESTDGSKDELNIYSQDELDDN 395

Query:   288 XXXXXXXXXXXXXXXXXXXXXXLQVL--PGKVLVPAVVDQVQGQALSALQVLKVIEADVK 345
                                   + V+  PGK+LVP   DQ+Q QA +ALQVLKVIE D +
Sbjct:   396 RMLLEIPSGGSAFSSAGIPAPFMSVIVNPGKILVPVAADQIQCQAFAALQVLKVIETDTQ 455

Query:   346 PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLA 405
             P DLC RREYARWL+SASS L+R+T SKVYPAMYIENVT+LAFDDITPEDPDFSSIQGLA
Sbjct:   456 PSDLCTRREYARWLISASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFSSIQGLA 515

Query:   406 EAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSG 465
             EAGLI+SKLS+RDLL++  G   F PES LSRQDL+SWKMALEKRQLPEA+KK+LY+LSG
Sbjct:   516 EAGLIASKLSNRDLLDDVEGTFLFSPESLLSRQDLISWKMALEKRQLPEADKKMLYKLSG 575

Query:   466 FIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDA 525
             FIDIDKINPDAWP+++ADL+ GEQGI ALAFGCTRLFQP KPVT  QAA+AL+ GEASD 
Sbjct:   576 FIDIDKINPDAWPSIIADLSTGEQGIAALAFGCTRLFQPHKPVTKGQAAIALSSGEASDI 635

Query:   526 VNEELQRIXXXXXXXXXXXXXXXLVAEVEKEINESFEKELSMEREKIDVVEKMXXXXXXX 585
             V+EEL RI               LVAEVEK++N SFEKELSMEREKI+ VEKM       
Sbjct:   636 VSEELARIEAESMAEKAVSAHNALVAEVEKDVNASFEKELSMEREKIEAVEKMAELAKVE 695

Query:   586 XXXXXXXXXVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINML 645
                       + +AL+KERAA+ESEME+LS+LRR+ EE+LE LMSNK EI++EKER+  L
Sbjct:   696 LEQLREKREEENLALVKERAAVESEMEVLSRLRRDAEEKLEDLMSNKAEITFEKERVFNL 755

Query:   646 RKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKV 705
             RKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAK+AREQ +ALE AR RWE  G++V
Sbjct:   756 RKEAEEESQRISKLQYELEVERKALSMARSWAEEEAKKAREQGRALEEARKRWETNGLRV 815

Query:   706 VVDKDLRE----ESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINT 761
             VVDKDL+E    E++ +++ +N  ++ SV++T  RA++L+DKLK MA  VSGKS+E+I T
Sbjct:   816 VVDKDLQETSSRETEQSIV-LNEMERSSVEETERRAKTLMDKLKEMAGTVSGKSREVIFT 874

Query:   762 IIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVA 821
             ++ KI L+I+ LK++A     RA E++DA I++AKG+  +++Q T +    +++  K++A
Sbjct:   875 VMEKIRLWITVLKEYAVNLGKRAGEMRDAAIVRAKGAAADVEQGTVQ----VSDKVKKMA 930

Query:   822 GDCREGVEKLTQRFKT 837
              +CR+GV K++QRFKT
Sbjct:   931 EECRDGVGKISQRFKT 946

GO:0009507 "chloroplast" evidence=ISM;IDA

GO:0009941 "chloroplast envelope" evidence=IDA

GO:0009535 "chloroplast thylakoid membrane" evidence=IDA

GO:0005739 "mitochondrion" evidence=IDA

GO:0009536 "plastid" evidence=IDA

TAIR\|locus:2156687 AT5G52410 [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2102375 AT3G25680 "AT3G25680" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

POMBASE\|SPAC29E6.10c SPAC29E6.10c "conserved fungal protein" [Schizosaccharomyces pombe (taxid:4896)]	Back alignment and assigned GO terms

MGI\|MGI:2684999 Tmf1 "TATA element modulatory factor 1" [Mus musculus (taxid:10090)]	Back alignment and assigned GO terms

CGD\|CAL0003874 PGA55 [Candida albicans (taxid:5476)]	Back alignment and assigned GO terms

UNIPROTKB\|Q59SG9 PGA55 "Flocculin-like protein" [Candida albicans SC5314 (taxid:237561)]	Back alignment and assigned GO terms

DICTYBASE\|DDB_G0280959 DG1113 "C2 domain-containing protein" [Dictyostelium discoideum (taxid:44689)]	Back alignment and assigned GO terms

UNIPROTKB\|G4N0Y1 MGG_09571 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)]	Back alignment and assigned GO terms

MGI\|MGI:109172 Dspp "dentin sialophosphoprotein" [Mus musculus (taxid:10090)]	Back alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

Original result of BLAST against SWISS-PROT

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server

Original result from Ezypred Server

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Original result from the STRING server

Fail to connect to STRING server

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

Original result of RPS-BLAST against CDD database part I

ID	Alignment Graph	Length	Definition	E-value
Query		837
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	2e-09
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	7e-09
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	2e-08
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	2e-08
TIGR02169		1164	TIGR02169, SMC_prok_A, chromosome segregation prot	2e-07
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	5e-07
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	6e-07
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	1e-06
PTZ00121		2084	PTZ00121, PTZ00121, MAEBL; Provisional	1e-06
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	2e-06
TIGR02169		1164	TIGR02169, SMC_prok_A, chromosome segregation prot	2e-06
COG0419		908	COG0419, SbcC, ATPase involved in DNA repair [DNA	2e-06
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	3e-06
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	3e-06
COG0419		908	COG0419, SbcC, ATPase involved in DNA repair [DNA	6e-06
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	8e-06
TIGR02169		1164	TIGR02169, SMC_prok_A, chromosome segregation prot	8e-06
COG0419		908	COG0419, SbcC, ATPase involved in DNA repair [DNA	1e-05
PRK03918		880	PRK03918, PRK03918, chromosome segregation protein	1e-05
PRK03918		880	PRK03918, PRK03918, chromosome segregation protein	1e-05
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	2e-05
COG1196		1163	COG1196, Smc, Chromosome segregation ATPases [Cell	2e-05
PRK02224		880	PRK02224, PRK02224, chromosome segregation protein	2e-05
TIGR01843		423	TIGR01843, type_I_hlyD, type I secretion membrane	3e-05
pfam13868		349	pfam13868, Trichoplein, Tumour suppressor, Mitosta	6e-05
COG1579		239	COG1579, COG1579, Zn-ribbon protein, possibly nucl	6e-05
COG4942		420	COG4942, COG4942, Membrane-bound metallopeptidase	8e-05
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	1e-04
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	1e-04
TIGR02169		1164	TIGR02169, SMC_prok_A, chromosome segregation prot	2e-04
COG0419		908	COG0419, SbcC, ATPase involved in DNA repair [DNA	2e-04
pfam13868		349	pfam13868, Trichoplein, Tumour suppressor, Mitosta	2e-04
COG2433		652	COG2433, COG2433, Uncharacterized conserved protei	2e-04
PRK12704		520	PRK12704, PRK12704, phosphodiesterase; Provisional	2e-04
pfam05262		489	pfam05262, Borrelia_P83, Borrelia P83/100 protein	3e-04
TIGR02169		1164	TIGR02169, SMC_prok_A, chromosome segregation prot	4e-04
pfam12128		1198	pfam12128, DUF3584, Protein of unknown function (D	5e-04
PTZ00121		2084	PTZ00121, PTZ00121, MAEBL; Provisional	6e-04
COG4372		499	COG4372, COG4372, Uncharacterized protein conserve	7e-04
COG3096		1480	COG3096, MukB, Uncharacterized protein involved in	7e-04
COG0419		908	COG0419, SbcC, ATPase involved in DNA repair [DNA	8e-04
pfam02463		1162	pfam02463, SMC_N, RecF/RecN/SMC N terminal domain	8e-04
TIGR02680		1353	TIGR02680, TIGR02680, TIGR02680 family protein	0.001
PRK00409		782	PRK00409, PRK00409, recombination and DNA strand e	0.001
TIGR02168		1179	TIGR02168, SMC_prok_B, chromosome segregation prot	0.002
TIGR02169		1164	TIGR02169, SMC_prok_A, chromosome segregation prot	0.002
PRK03918		880	PRK03918, PRK03918, chromosome segregation protein	0.002
COG1579		239	COG1579, COG1579, Zn-ribbon protein, possibly nucl	0.002
COG4372		499	COG4372, COG4372, Uncharacterized protein conserve	0.002
PTZ00491		850	PTZ00491, PTZ00491, major vault protein; Provision	0.002
PRK03918		880	PRK03918, PRK03918, chromosome segregation protein	0.003
pfam13868		349	pfam13868, Trichoplein, Tumour suppressor, Mitosta	0.003
TIGR01069		771	TIGR01069, mutS2, MutS2 family protein	0.003
PTZ00121		2084	PTZ00121, PTZ00121, MAEBL; Provisional	0.004
PRK02224		880	PRK02224, PRK02224, chromosome segregation protein	0.004
COG5185		622	COG5185, HEC1, Protein involved in chromosome segr	0.004
TIGR03319		514	TIGR03319, RNase_Y, ribonuclease Y	0.004
TIGR04211		198	TIGR04211, SH3_and_anchor, SH3 domain protein	0.004

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

 Score = 61.2 bits (149), Expect = 2e-09
 Identities = 49/189 (25%), Positives = 81/189 (42%), Gaps = 17/189 (8%)

Query: 523 SDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEA 582
             A+ ++L R+EAE             +A++ KE+ E  E E+    E+++  E+   EA
Sbjct: 728 ISALRKDLARLEAEVEQ------LEERIAQLSKELTE-LEAEIEELEERLEEAEEELAEA 780

Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
             E+E L A+ E  K  L   R A++     L+ L  E     E L S +  I+  + R+
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRL 840

Query: 643 NMLRKEAENENQEIARLQY----------ELEVERKALSMARAWAEDEAKRAREQAKALE 692
             L ++ E  +++I  L            ELE E +AL   RA  E+     R + + L 
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELS 900

Query: 693 GARDRWERQ 701
                 E +
Sbjct: 901 EELRELESK 909

SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]. Length = 1179

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|173412 PTZ00121, PTZ00121, MAEBL; Provisional	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>gnl\|CDD\|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	Back alignment and domain information

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>gnl\|CDD\|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	Back alignment and domain information

>gnl\|CDD\|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional	Back alignment and domain information

>gnl\|CDD\|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional	Back alignment and domain information

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>gnl\|CDD\|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional	Back alignment and domain information

>gnl\|CDD\|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein, HlyD family	Back alignment and domain information

>gnl\|CDD\|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin	Back alignment and domain information

>gnl\|CDD\|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]	Back alignment and domain information

>gnl\|CDD\|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>gnl\|CDD\|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>gnl\|CDD\|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	Back alignment and domain information

>gnl\|CDD\|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin	Back alignment and domain information

>gnl\|CDD\|225288 COG2433, COG2433, Uncharacterized conserved protein [Function unknown]	Back alignment and domain information

>gnl\|CDD\|237177 PRK12704, PRK12704, phosphodiesterase; Provisional	Back alignment and domain information

>gnl\|CDD\|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein	Back alignment and domain information

>gnl\|CDD\|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>gnl\|CDD\|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584)	Back alignment and domain information

>gnl\|CDD\|173412 PTZ00121, PTZ00121, MAEBL; Provisional	Back alignment and domain information

>gnl\|CDD\|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	Back alignment and domain information

>gnl\|CDD\|225638 COG3096, MukB, Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	Back alignment and domain information

>gnl\|CDD\|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain	Back alignment and domain information

>gnl\|CDD\|233973 TIGR02680, TIGR02680, TIGR02680 family protein	Back alignment and domain information

>gnl\|CDD\|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	Back alignment and domain information

>gnl\|CDD\|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>gnl\|CDD\|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>gnl\|CDD\|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional	Back alignment and domain information

>gnl\|CDD\|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]	Back alignment and domain information

>gnl\|CDD\|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	Back alignment and domain information

>gnl\|CDD\|240439 PTZ00491, PTZ00491, major vault protein; Provisional	Back alignment and domain information

>gnl\|CDD\|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional	Back alignment and domain information

>gnl\|CDD\|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin	Back alignment and domain information

>gnl\|CDD\|130141 TIGR01069, mutS2, MutS2 family protein	Back alignment and domain information

>gnl\|CDD\|173412 PTZ00121, PTZ00121, MAEBL; Provisional	Back alignment and domain information

>gnl\|CDD\|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional	Back alignment and domain information

>gnl\|CDD\|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts with SMC proteins [Cell division and chromosome partitioning]	Back alignment and domain information

>gnl\|CDD\|188306 TIGR03319, RNase_Y, ribonuclease Y	Back alignment and domain information

>gnl\|CDD\|234501 TIGR04211, SH3_and_anchor, SH3 domain protein	Back alignment and domain information

Conserved Domains Detected by HHsearch

Original result of HHsearch against CDD database

ID	Alignment Graph	Length	Definition	Probability
Query		837
PF00395		45	SLH: S-layer homology domain; InterPro: IPR001119	98.95
PF00395		45	SLH: S-layer homology domain; InterPro: IPR001119	97.73
PF08317		325	Spc7: Spc7 kinetochore protein; InterPro: IPR01325	96.26
KOG0018		1141	consensus Structural maintenance of chromosome pro	95.81
KOG1029		1118	consensus Endocytic adaptor protein intersectin [S	95.75
smart00787		312	Spc7 Spc7 kinetochore protein. This domain is foun	94.93
KOG1029		1118	consensus Endocytic adaptor protein intersectin [S	94.67
PRK00106		535	hypothetical protein; Provisional	94.24
PF12128		1201	DUF3584: Protein of unknown function (DUF3584); In	94.2
TIGR03185		650	DNA_S_dndD DNA sulfur modification protein DndD. T	94.08
PF09726		697	Macoilin: Transmembrane protein; InterPro: IPR0191	93.97
COG4942		420	Membrane-bound metallopeptidase [Cell division and	93.82
PF09726		697	Macoilin: Transmembrane protein; InterPro: IPR0191	93.4
PF12072		201	DUF3552: Domain of unknown function (DUF3552); Int	93.33
PRK00106		535	hypothetical protein; Provisional	92.82
PRK12704		520	phosphodiesterase; Provisional	92.72
KOG0994		1758	consensus Extracellular matrix glycoprotein Lamini	92.42
KOG0161		1930	consensus Myosin class II heavy chain [Cytoskeleto	91.97
COG1196		1163	Smc Chromosome segregation ATPases [Cell division	91.0
TIGR03319		514	YmdA_YtgF conserved hypothetical protein YmdA/YtgF	90.82
PRK12704		520	phosphodiesterase; Provisional	89.26
TIGR03319		514	YmdA_YtgF conserved hypothetical protein YmdA/YtgF	88.9
KOG0161		1930	consensus Myosin class II heavy chain [Cytoskeleto	88.76
PF10186		302	Atg14: UV radiation resistance protein and autopha	88.46
KOG4302		660	consensus Microtubule-associated protein essential	88.39
PTZ00121		2084	MAEBL; Provisional	87.65
PTZ00121		2084	MAEBL; Provisional	87.64
TIGR02169		1164	SMC_prok_A chromosome segregation protein SMC, pri	87.09
TIGR01069		771	mutS2 MutS2 family protein. Function of MutS2 is u	86.75
PRK11637		428	AmiB activator; Provisional	85.82
PLN03188		1320	kinesin-12 family protein; Provisional	85.63
PF05667		594	DUF812: Protein of unknown function (DUF812); Inte	85.41
KOG0579		1187	consensus Ste20-like serine/threonine protein kina	85.38
PF00769		246	ERM: Ezrin/radixin/moesin family; InterPro: IPR011	85.22
TIGR02168		1179	SMC_prok_B chromosome segregation protein SMC, com	84.97
KOG0995		581	consensus Centromere-associated protein HEC1 [Cell	84.48
KOG3850		455	consensus Predicted membrane protein [Function unk	83.65
KOG0977		546	consensus Nuclear envelope protein lamin, intermed	83.57
TIGR00606		1311	rad50 rad50. This family is based on the phylogeno	83.15
PF05701		522	WEMBL: Weak chloroplast movement under blue light;	82.34
KOG2685		421	consensus Cystoskeletal protein Tektin [Cytoskelet	82.34
PF00038		312	Filament: Intermediate filament protein; InterPro:	82.26
PF10186		302	Atg14: UV radiation resistance protein and autopha	81.69
KOG0933		1174	consensus Structural maintenance of chromosome pro	80.14
COG1579		239	Zn-ribbon protein, possibly nucleic acid-binding [	80.11

>PF00395 SLH: S-layer homology domain; InterPro: IPR001119 S-layers are paracrystalline mono-layered assemblies of (glyco)proteins which coat the surface of bacteria [, ]	Back alignment and domain information

Probab=98.95  E-value=7.2e-10  Score=86.33  Aligned_cols=44  Identities=27%  Similarity=0.421  Sum_probs=31.5

Q ss_pred             CccCCCCCCCCh-HHHHHHHHcCCcccccccccCCCCCCCCcccCCCCcCcHHHHH
Q 003229          387 AFDDITPEDPDF-SSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLV  441 (837)
Q Consensus       387 AF~DV~~ddPyf-~yIQAAAEAGIIsGkLSG~~~~s~dDG~~tFrPDsPITRQELA  441 (837)
                      .|+||+..+|+| .+|+.+++.|||.|+.           +++|+|+++|||+|||
T Consensus         1 ~F~Dv~~~~~~~a~~i~~~~~~gi~~G~~-----------~~~f~P~~~iTR~e~A   45 (45)
T PF00395_consen    1 PFKDVPSISWAYAEAIQWLYQLGIISGYS-----------DGTFNPNDPITRAEAA   45 (45)
T ss_dssp             -BTTB-TTSSSTTHHHHHHHHTTSS---T-----------TS---TTSB-BHHHHH
T ss_pred             CCCCCCCCcHHHHHHHHHHHHcCCcccCC-----------CCeECCCCCcCHHHhC
Confidence            599999999955 9999999999999973           3379999999999986

Several S-layer proteins and some other cell wall proteins contain one or more copies of a domain of about 50-60 residues, which has been called SLH (for S-layer homology). Although it was originally proposed that SLH domains bind to peptidoglycan, it is now evident that pyruvylated secondary cell wall polymers (SCWPs), which are either teichoic acids, teichuronic acids, lipoteichoic acids or lipoglycans, serve as the anchoring structures for SLH motifs in the Gram-positive cell wall [, ]. However, the study of S-layer protein SbpA of Bacillus sphaericus revealed that SLH motifs are not sufficient for specific binding to SCWPs. Thus, the molecular basis explaining SLH affinity and specificity of interaction with cell wall polymers are not completely elucidated [].; PDB: 3PYW_A.

>PF00395 SLH: S-layer homology domain; InterPro: IPR001119 S-layers are paracrystalline mono-layered assemblies of (glyco)proteins which coat the surface of bacteria [, ]	Back alignment and domain information

>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association []	Back alignment and domain information

>KOG0018 consensus Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) [Cell cycle control, cell division, chromosome partitioning]	Back alignment and domain information

>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]	Back alignment and domain information

>smart00787 Spc7 Spc7 kinetochore protein	Back alignment and domain information

>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]	Back alignment and domain information

>PRK00106 hypothetical protein; Provisional	Back alignment and domain information

>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins	Back alignment and domain information

>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD	Back alignment and domain information

>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes	Back alignment and domain information

>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]	Back alignment and domain information

>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes	Back alignment and domain information

>PF12072 DUF3552: Domain of unknown function (DUF3552); InterPro: IPR022711 This presumed domain is functionally uncharacterised	Back alignment and domain information

>PRK00106 hypothetical protein; Provisional	Back alignment and domain information

>PRK12704 phosphodiesterase; Provisional	Back alignment and domain information

>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures]	Back alignment and domain information

>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]	Back alignment and domain information

>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]	Back alignment and domain information

>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF	Back alignment and domain information

>PRK12704 phosphodiesterase; Provisional	Back alignment and domain information

>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF	Back alignment and domain information

>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]	Back alignment and domain information

>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking	Back alignment and domain information

>KOG4302 consensus Microtubule-associated protein essential for anaphase spindle elongation [Cell cycle control, cell division, chromosome partitioning; Cytoskeleton]	Back alignment and domain information

>PTZ00121 MAEBL; Provisional	Back alignment and domain information

>PTZ00121 MAEBL; Provisional	Back alignment and domain information

>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type	Back alignment and domain information

>TIGR01069 mutS2 MutS2 family protein	Back alignment and domain information

>PRK11637 AmiB activator; Provisional	Back alignment and domain information

>PLN03188 kinesin-12 family protein; Provisional	Back alignment and domain information

>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function	Back alignment and domain information

>KOG0579 consensus Ste20-like serine/threonine protein kinase [Signal transduction mechanisms]	Back alignment and domain information

>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []	Back alignment and domain information

>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type	Back alignment and domain information

>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning]	Back alignment and domain information

>KOG3850 consensus Predicted membrane protein [Function unknown]	Back alignment and domain information

>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]	Back alignment and domain information

>TIGR00606 rad50 rad50	Back alignment and domain information

>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function	Back alignment and domain information

>KOG2685 consensus Cystoskeletal protein Tektin [Cytoskeleton]	Back alignment and domain information

>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope	Back alignment and domain information

>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking	Back alignment and domain information

>KOG0933 consensus Structural maintenance of chromosome protein 2 (chromosome condensation complex Condensin, subunit E) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]	Back alignment and domain information

>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]	Back alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST

Original result of BLAST against Protein Data Bank

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

Original result of RPS-BLAST against PDB70 database

ID	Alignment Graph	Length	Definition	E-value
Query		837
1i84_S		1184	Smooth muscle myosin heavy chain; muscle protein,	3e-12
1i84_S		1184	Smooth muscle myosin heavy chain; muscle protein,	1e-10
1i84_S		1184	Smooth muscle myosin heavy chain; muscle protein,	1e-08
1i84_S		1184	Smooth muscle myosin heavy chain; muscle protein,	2e-08
1i84_S		1184	Smooth muscle myosin heavy chain; muscle protein,	6e-08
1vt4_I		1221	APAF-1 related killer DARK; drosophila apoptosome,	1e-11
1vt4_I		1221	APAF-1 related killer DARK; drosophila apoptosome,	1e-09
1vt4_I		1221	APAF-1 related killer DARK; drosophila apoptosome,	3e-04
2dfs_A		1080	Myosin-5A; myosin-V, inhibited state, cryoelectron	5e-10
2dfs_A		1080	Myosin-5A; myosin-V, inhibited state, cryoelectron	2e-07
2dfs_A		1080	Myosin-5A; myosin-V, inhibited state, cryoelectron	3e-07
2zuo_A		861	MVP, major vault protein; repeat domains, protein-	5e-09
2zuo_A		861	MVP, major vault protein; repeat domains, protein-	7e-08
2zuo_A		861	MVP, major vault protein; repeat domains, protein-	1e-07
2zuo_A		861	MVP, major vault protein; repeat domains, protein-	2e-06
2zuo_A		861	MVP, major vault protein; repeat domains, protein-	2e-04
3na7_A		256	HP0958; flagellar biogenesis, flagellum export, C4	6e-06
3na7_A		256	HP0958; flagellar biogenesis, flagellum export, C4	2e-04
3lvg_D		190	LCB, clathrin light chain B; SELF assembly, coated	3e-05
3lvg_D		190	LCB, clathrin light chain B; SELF assembly, coated	4e-04
3lvg_D		190	LCB, clathrin light chain B; SELF assembly, coated	4e-04
3lvg_D		190	LCB, clathrin light chain B; SELF assembly, coated	9e-04
2qag_B		427	Septin-6, protein NEDD5; cell cycle, cell division	8e-05
3vkg_A		3245	Dynein heavy chain, cytoplasmic; AAA+ protein, mol	1e-04
3vkg_A		3245	Dynein heavy chain, cytoplasmic; AAA+ protein, mol	1e-04
3vkg_A		3245	Dynein heavy chain, cytoplasmic; AAA+ protein, mol	5e-04
1c1g_A		284	Tropomyosin; contractIle protein; 7.00A {Sus scrof	1e-04
1c1g_A		284	Tropomyosin; contractIle protein; 7.00A {Sus scrof	4e-04
1c1g_A		284	Tropomyosin; contractIle protein; 7.00A {Sus scrof	8e-04
1f5n_A		592	Interferon-induced guanylate-binding protein 1; GB	3e-04
2fxo_A		129	Myosin heavy chain, cardiac muscle beta isoform; c	4e-04
2ycu_A		995	Non muscle myosin 2C, alpha-actinin; motor protein	5e-04
2i1j_A		575	Moesin; FERM, coiled-coil, C-ermad, ERM, radixin,	6e-04
3pyw_A		203	S-layer protein SAP; SLH-domains, polysaccharide b	8e-04
1cii_A		602	Colicin IA; bacteriocin, ION channel formation, tr	8e-04

>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184	Back alignment and structure

 Score = 69.5 bits (170), Expect = 3e-12
 Identities = 39/199 (19%), Positives = 90/199 (45%), Gaps = 8/199 (4%)

Query: 521  EASDAVNEELQRIEAE-SAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMA 579
            +    + ++  ++  E    E  VS+ +  +AE E++     + +   E    ++  ++ 
Sbjct: 990  DDILIMEDQNNKLTKERKLLEERVSDLTTNLAEEEEKAKNLTKLKNKHESMISELEVRLK 1049

Query: 580  EE--ARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
            +E  +RQELE+++ + E +   L ++ A +++++  L     + EE+L++ ++    +  
Sbjct: 1050 KEEKSRQELEKIKRKLEGESSDLHEQIAELQAQIAELKAQLAKKEEELQAALA---RLED 1106

Query: 638  EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDR 697
            E  + N   K+       I+ LQ +LE E+ A + A     D ++        LE   D 
Sbjct: 1107 ETSQKNNALKKIRELESHISDLQEDLESEKAARNKAEKQKRDLSEELEALKTELEDTLDT 1166

Query: 698  WERQGIKVVVDKDLREESD 716
               Q  + +   D +++ D
Sbjct: 1167 TATQ--QELRGSDYKDDDD 1183

PyMOL of 1i84

>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184	Back alignment and structure

>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184	Back alignment and structure

>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184	Back alignment and structure

>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184	Back alignment and structure

>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221	Back alignment and structure

>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221	Back alignment and structure

>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221	Back alignment and structure

>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080	Back alignment and structure

>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080	Back alignment and structure

>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080	Back alignment and structure

>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861	Back alignment and structure

>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861	Back alignment and structure

>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861	Back alignment and structure

>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861	Back alignment and structure

>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex, cytoplasm, ribonucleoprotein, structural protein; 3.50A {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A Length = 861	Back alignment and structure

>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Length = 256	Back alignment and structure

>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Length = 256	Back alignment and structure

>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190	Back alignment and structure

>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190	Back alignment and structure

>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190	Back alignment and structure

>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190	Back alignment and structure

>2qag_B Septin-6, protein NEDD5; cell cycle, cell division, GTP-binding, nucleotide-binding, phosphorylation, acetylation, alternative splicing, coiled coil; HET: GDP GTP; 4.00A {Homo sapiens} Length = 427	Back alignment and structure

>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245	Back alignment and structure

>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245	Back alignment and structure

>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245	Back alignment and structure

>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284	Back alignment and structure

>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284	Back alignment and structure

>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284	Back alignment and structure

>1f5n_A Interferon-induced guanylate-binding protein 1; GBP, GTP hydrolysis, GDP, GMP, dynamin related, large GTPase family. GMPPNP, GPPNHP.; HET: GNP; 1.70A {Homo sapiens} SCOP: a.114.1.1 c.37.1.8 PDB: 1dg3_A* 2b8w_A* 2b92_A* 2bc9_A* 2d4h_A* Length = 592	Back alignment and structure

>2fxo_A Myosin heavy chain, cardiac muscle beta isoform; coiled coil (dimeric, parallel), familial hypertrophic cardiomyopathy, FHC-associated mutant E924K; 2.50A {Homo sapiens} SCOP: h.1.26.1 PDB: 2fxm_A Length = 129	Back alignment and structure

>2ycu_A Non muscle myosin 2C, alpha-actinin; motor protein; HET: AOV; 2.25A {Homo sapiens} PDB: 1br1_A* 1br4_A* 1br2_A* Length = 995	Back alignment and structure

>2i1j_A Moesin; FERM, coiled-coil, C-ermad, ERM, radixin, ezrin, MER actin binding, masking, regulation, SELF-inhibition, cell A membrane protein; 2.10A {Spodoptera frugiperda} PDB: 2i1k_A 1e5w_A Length = 575	Back alignment and structure

>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-2, protein structure initiative; 1.80A {Bacillus anthracis} Length = 203	Back alignment and structure

>1cii_A Colicin IA; bacteriocin, ION channel formation, transmembrane protein; 3.00A {Escherichia coli} SCOP: f.1.1.1 h.4.3.1 Length = 602	Back alignment and structure

Structure Templates Detected by HHsearch

Original result of HHsearch against PDB70 database

ID	Alignment Graph	Length	Definition	Probability
Query		837
3pyw_A		203	S-layer protein SAP; SLH-domains, polysaccharide b	99.96
4aq1_A		892	SBSB protein; structural protein, S-layer protein,	99.87
3pyw_A		203	S-layer protein SAP; SLH-domains, polysaccharide b	99.74
4aq1_A		892	SBSB protein; structural protein, S-layer protein,	99.38
3oja_B		597	Anopheles plasmodium-responsive leucine-rich REPE	90.75
2dfs_A		1080	Myosin-5A; myosin-V, inhibited state, cryoelectron	90.37
3oja_B		597	Anopheles plasmodium-responsive leucine-rich REPE	87.89
1i84_S		1184	Smooth muscle myosin heavy chain; muscle protein,	87.61
2no2_A		107	HIP-I, huntingtin-interacting protein 1; clathrin	87.03
2dfs_A		1080	Myosin-5A; myosin-V, inhibited state, cryoelectron	86.75
3na7_A		256	HP0958; flagellar biogenesis, flagellum export, C4	86.45
3vkg_A		3245	Dynein heavy chain, cytoplasmic; AAA+ protein, mol	84.83
3vkg_A		3245	Dynein heavy chain, cytoplasmic; AAA+ protein, mol	84.43
2fxo_A		129	Myosin heavy chain, cardiac muscle beta isoform; c	80.52

>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-biology, protein structure initiative; 1.80A {Bacillus anthracis}	Back alignment and structure

Probab=99.96  E-value=9.4e-29  Score=240.01  Aligned_cols=169  Identities=20%  Similarity=0.181  Sum_probs=132.3

Q ss_pred             cccccC-CCceeeccccchhhHHHHHhcccccccC-CccCCCCCccHHHHHHHHHHhcccccccccccccccccccCCCC
Q 003229          308 AALQVL-PGKVLVPAVVDQVQGQALSALQVLKVIE-ADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTD  385 (837)
Q Consensus       308 aaf~v~-p~rv~VPvavD~~qIeALAaLGILkgiE-gtF~Pn~pITRAEFArwLVRAls~L~R~p~~kI~Pam~is~st~  385 (837)
                      .+|.+. .++    -+.  .+|+.|..+|+++|++ ++|+|+++|||+|||.||+++++.          +   ......
T Consensus         3 ~~F~Dv~~~~----wa~--~~I~~l~~~gii~G~~dg~F~P~~~iTR~e~a~~l~r~~~~----------~---~~~~~~   63 (203)
T 3pyw_A            3 KTFPDVPADH----WGI--DSINYLVEKGAVKGNDKGMFEPGKELTRAEAATMMAQILNL----------P---IDKDAK   63 (203)
T ss_dssp             CCBTTBCTTS----TTH--HHHHHHHHTTSCCCCTTSCCCTTCBCBHHHHHHHHHHHTTC----------C---CCTTCC
T ss_pred             CCCcCCCCCc----HHH--HHHHHHHHCcCEecCCCCeECCCCCchHHHHHHHHHHHhCC----------C---CCcCCC
Confidence            357444 555    233  4899999999999997 699999999999999999999871          1   112345


Q ss_pred             CCccCCCCCCCChHHHHHHHHcCCcccccccccCCCCCCCCcccCCCCcCcHHHHHHHHHH-HhhcCCchhhhhhhhccC
Q 003229          386 LAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA-LEKRQLPEANKKILYQLS  464 (837)
Q Consensus       386 ~AF~DV~~ddPyf~yIQAAAEAGIIsGkLSG~~~~s~dDG~~tFrPDsPITRQELAvwkVA-Le~~~LP~ad~k~L~~~s  464 (837)
                      ..|+||+ +||++.+|+.|+++|||.|+           +++ |+|+++|||+||+.|+++ +++..   ..  ......
T Consensus        64 ~~F~Dv~-~~w~~~~i~~a~~~Gii~g~-----------~~~-F~P~~~iTR~e~a~~l~r~~~~~~---~~--~~~~~~  125 (203)
T 3pyw_A           64 PSFADSQ-GQWYTPFIAAVEKAGVIKGT-----------GNG-FEPNGKIDRVSMASLLVEAYKLDT---KV--NGTPAT  125 (203)
T ss_dssp             CSCGGGC-SSTTHHHHHHHHHTTSSCCC-----------SSC-CCTTSBCBHHHHHHHHHHHHTGGG---GS--SSCCCC
T ss_pred             CCCCCCC-cHHHHHHHHHHHHCcCeecC-----------CCC-CCCCCcccHHHHHHHHHHHhcccc---cc--ccccCC
Confidence            6899995 99999999999999999997           234 999999999999999875 43321   10  011245


Q ss_pred             CcccccCcCcchHHHHHHHHhcCcccceecccCCCccccCCCCCcHHHHHHHHHhc
Q 003229          465 GFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIG  520 (837)
Q Consensus       465 gF~DadkIspwA~~AVaadL~aGd~GII~~vfG~tg~FqPnkPVTRAEAAAaL~sG  520 (837)
                      .|.|...| +|+.++|......   ||+.   |..+.|+|++++||+|||++|++-
T Consensus       126 ~f~D~~~i-~wa~~~v~~~~~~---Gi~~---g~~~~f~P~~~iTRaE~A~~l~r~  174 (203)
T 3pyw_A          126 KFKDLETL-NWGKEKANILVEL---GISV---GTGDQWEPKKTVTKAEAAQFIAKT  174 (203)
T ss_dssp             CCHHHHTC-SSSHHHHHHHHHT---TCSC---CCSSCCCTTSBCBHHHHHHHHHHH
T ss_pred             CCCCCCcc-HHHHHHHHHHHHC---cCEe---CCCCCcCCCCCCcHHHHHHHHHHH
Confidence            79999999 8999999877755   4653   444589999999999999999964

PyMOL of 3pyw

>4aq1_A SBSB protein; structural protein, S-layer protein, nanobody; 2.42A {Geobacillus stearothermophilus} PDB: 4aq1_C	Back alignment and structure

>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-biology, protein structure initiative; 1.80A {Bacillus anthracis}	Back alignment and structure

>4aq1_A SBSB protein; structural protein, S-layer protein, nanobody; 2.42A {Geobacillus stearothermophilus} PDB: 4aq1_C	Back alignment and structure

>3oja_B Anopheles plasmodium-responsive leucine-rich REPE 1; coiled-coil, helix-loop-helix, leucine-rich repeat, protein; HET: NAG MAN; 2.70A {Anopheles gambiae}	Back alignment and structure

>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus}	Back alignment and structure

>3oja_B Anopheles plasmodium-responsive leucine-rich REPE 1; coiled-coil, helix-loop-helix, leucine-rich repeat, protein; HET: NAG MAN; 2.70A {Anopheles gambiae}	Back alignment and structure

>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A	Back alignment and structure

>2no2_A HIP-I, huntingtin-interacting protein 1; clathrin light chain binding, HIP1 coiled-coil domain, endocytosis, clathrin SELF-assembly, cell adhesion; 2.80A {Homo sapiens}	Back alignment and structure

>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus}	Back alignment and structure

>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori}	Back alignment and structure

>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A*	Back alignment and structure

>3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A*	Back alignment and structure

>2fxo_A Myosin heavy chain, cardiac muscle beta isoform; coiled coil (dimeric, parallel), familial hypertrophic cardiomyopathy, FHC-associated mutant E924K; 2.50A {Homo sapiens} SCOP: h.1.26.1 PDB: 2fxm_A	Back alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

Original result of RPS-BLAST against SCOP70(version1.75) database

ID	Alignment Graph	Length	Definition	E-value
Query		837
d2es4d1		280	a.137.15.1 (D:53-332) Lipase chaperone LifO (LipB)	2e-06
d2es4d1		280	a.137.15.1 (D:53-332) Lipase chaperone LifO (LipB)	0.002
d1f5na1		300	a.114.1.1 (A:284-583) Interferon-induced guanylate	4e-04
d1f5na1		300	a.114.1.1 (A:284-583) Interferon-induced guanylate	0.002

>d1f5na1 a.114.1.1 (A:284-583) Interferon-induced guanylate-binding protein 1 (GBP1), C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Length = 300	Back information, alignment and structure

>d1f5na1 a.114.1.1 (A:284-583) Interferon-induced guanylate-binding protein 1 (GBP1), C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Length = 300	Back information, alignment and structure

Homologous Domains Detected by HHsearch

Original result of HHsearch against SCOP70(version1.75) database

No hit with probability above 80.00