Citrus Sinensis ID: 000199


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------840-------850-------860-------870-------880-------890-------900-------910-------920-------930-------940-------950-------960-------970-------980-------990------1000------1010------1020------1030------1040------1050------1060------1070------1080------1090------1100------1110------1120------1130------1140------1150------1160------1170------1180------1190------1200------1210------1220------1230------1240------1250------1260------1270------1280------1290------1300------1310------1320------1330------1340------1350------1360------1370------1380------1390------1400------1410------1420------1430------1440------1450------1460------1470------1480------1490------1500------1510------1520------1530------1540------1550------1560------1570------1580------1590------1600------1610------1620------1630------1640------1650------1660------1670------1680------1690------1700------1710------1720------1730------1740------1750------1760------1770------1780------1790------1800------1810------1820------1830------1840------1850------1860------1870--
MSRVARWKVEKTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSKTGFREFEQQRELRERGLQSGNDQISPDQSCGGINNQLDKVNARVRFKDKSKELPSLEEEAGLNEDYTESAVGFDGSSNTSESLYAEKHDASSTHEIDSLKSIVSGDLAGLNQSLQQEKGDPSDQRFSAQGTNDWVPGWGLDYSADNDLAIAHEENNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASGEQLAEEVSALKSECSHLKSDLERLIKLRSCHPFTSWEAVKQGTEITMSTPNLVTCDRSKVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEHSSVNPADSVGGKIFELLRELDESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMRLDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIRLMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLESNLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKESLENGNLRRETSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQNLESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEKDRAEMSFSKSESDIVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQQLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSVHDELHGERSLSDELKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESSRLASEGNTSKETLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDQQKSELVQLKLLVLDLESEKSRASEESSSVTSLQSELSEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYKSRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHEKRESEGLPTGMQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRKKSEAAHLKKNEELGVKILELEAELQSLISDKREKTKAYDLAKAELECSLMSLECCKEEKEKLEVSLHECNEEKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKESCISNELTGRNVQKTTNADTKSHGRMSADDTGNGPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEHPEQDVLQSSCLNGSSSLALVNQENTKSNDTKDLAIINDQFRVQSLKSSMDLLNEELERMKNENSLSRGDHNFDPKFSSLQRELMD
cccccccccccccccEEEEEccccccccccccccEEEEEEEcccccccccccccccccccccccccHHHHHHccccccccccccccEEEEEEcccccccccHHHHccccccccccccccccccccccccccHHHHHHHHHcccccccHHHHHHHHHHHcccccccccccccccccccccHHHHHHHHHccccccccccHHHHcccccHHHHHHcccccccccccccHHHHcccccHHccHHHHHHHccccccccccccccccccccHHHccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHcccHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHccccHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHcc
cccccccccccccEEEEEEEEEEEEEcccccccEEEEEEEEcccccEEEEEcEEEEccccccccccEEEEEEEEcccccccccHEEEEEEEEccccccccEEEEEEEHHHHHHcccccEEEEEcccccccEEEEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccEEccccccccccHHHHcccccccccccccccccccccccccccccccccHHHHHHHHHccccccccccccccccccccHHHHHHccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHcccccHHHHHHHHHHHHHHcccccccHHHHHHHHHHHccccHHHHHHHHHHcHHHHHHHHHHcHHHHHHHHHHHccHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHc
msrvarwkveKTKVKVVFRLQFHATHIQQTGWDKLFISfipadsgkaigkttkanvrngtckwadpiYETTRLLQDIKTKQYDEKLYKFVVAMgssrssilgeatinladyadaskpstvllplhggdsgtILHVTVQLLTSKTGFREFEQQRELRERGlqsgndqispdqscgginnqLDKVNARVrfkdkskelpsleeeaglnedytesavgfdgssntseslyaekhdasstheidslksivsGDLAGLNQslqqekgdpsdqrfsaqgtndwvpgwgldysadndLAIAHEENNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASGEQLAEEVSALKSECSHLKSDLERLIKLrschpftsweavkqgteitmstpnlvtcdrskvneigeiSLNKYEQYLsgtgletelyqpdldmlhcvgipalashehssvnpadsvggKIFELLRELDESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMRLDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKqafvdspqptcqEYQDMVqnrklnpeeshanrhlacqDQYIgvkkqhlggdiLIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIRLMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLesnlqdvtcenrHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKeslengnlrRETSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNksadqnleskdLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEkdraemsfsksesDIVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQQLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSVHDelhgerslsdelKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESSrlasegntskETLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDQQKSELVQLKLLVLDLEseksraseesssVTSLQSELSEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYKSRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHEkreseglptgmQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRKKSEAAHLKKNEELGVKILELEAELQSLISDKREKTKAYDLAKAELECSLMSLECCKEEKEKLEVSLHECNEEKSRLYSDLSLMKKLLysstfpcrkegndglhkescisneltgrnvqkttnadtkshgrmsaddtgngptgdvdeylehenmtngidaqnlclglsdegsysctlmkehpeqdvlqssclngssslalvnqentksndtkdlaiINDQFRVQSLKSSMDLLNEELERMknenslsrgdhnfdpkfSSLQRELMD
msrvarwkvektkvKVVFRLQFHathiqqtgwDKLFISFIPAdsgkaigkttkanvrngtckwadpiyETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSKTGFREFEQQRELRERGLqsgndqispdqscgginNQLDKVNARVrfkdkskelpsleeeaglnedyteSAVGFDGSSNTSESLYAEKHDASSTHEIDSLKSIVSGDLAGLNQSLQQEKGDPSDQRFSAQGTNDWVPGWGLDYSADNDLAIAHEENNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASGEQLAEEVSALKSECSHLKSDLERLIKLRSCHPFtsweavkqgteitmstpnlvtcdrskVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEhssvnpadsvGGKIFELLRELDESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMRLDVDEQVLRfseerrdleslnkelERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIRLMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLESNLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLekeslengnlrrETSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQNLESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALidekdraemsfsksesdiVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVsseveennaqrhtdlfsdLDYLEVELQQLSSKNRDLAQEILALqvvteefdrskQTISELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSvhdelhgerslsdelKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESsrlasegntsketLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDQQKSELVQLKLLVLDLeseksraseesssvtsLQSELSEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYKSRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHekreseglptgmQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRKKSEAAHLKKNEELGVKILELEAELQSlisdkrektKAYDLAKAELECSLMSLECCKEEKEKLEVSLHECNEEKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKEscisneltgrnvqkttnadtkshgrmsaddtgngPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEHPEQDVLQSSCLNGSSSLALVNQENtksndtkdlaiiNDQFRVQSLKSSMDLLNEELERMKnenslsrgdhnfdpkfsslqrelmd
MSRVARWKVEKTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSKTgfrefeqqrelrerglQSGNDQISPDQSCGGINNQLDKVNARVRFKDKSKELPSLEEEAGLNEDYTESAVGFDGSSNTSESLYAEKHDASSTHEIDSLKSIVSGDLAGLNQSLQQEKGDPSDQRFSAQGTNDWVPGWGLDYSADNDLAIAHEENNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASGEQLAEEVSALKSECSHLKSDLERLIKLRSCHPFTSWEAVKQGTEITMSTPNLVTCDRSKVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEHSSVNPADSVGGKIFELLRELDESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMRLDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIRLMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLESNLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKESLENGNLRRETSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQNLESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEKDRAEMSFSKSESDIVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQQLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSVHDELHGERSLSDELKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESSRLASEGNTSKETLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDqqkselvqlkllvldleseksraseesssvtslqselseMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYKSRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHEKRESEGLPTGMQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRKKSEAAHLKKNEELGVKILELEAELQSLISDKREKTKAYDLAKAELECSLMSlecckeekekleVSLHECNEEKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKESCISNELTGRNVQKTTNADTKSHGRMSADDTGNGPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEHPEQDVLQSSCLNGSSSLALVNQENTKSNDTKDLAIINDQFRVQSLKSSMDLLNEELERMKNENSLSRGDHNFDPKFSSLQRELMD
*****RWKVEKTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSKTGF********************************************************************************************************************************NDWVPGWGLDYSADNDLAIAHEENNRLRGCLEM*******************************************************HLKSDLERLIKLRSCHPFTSWEAVKQGTEITMSTPNLVTCDRSKVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALAS**************KIFELL*******************MECYYEALIQELE*****MLGELQNL***HSSCLYTVSSAKAEIEAMRLDV******************************ALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFV****************************HLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEI**********************LLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLESNLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAE****************************LETVKIDFDELASVNKNLQRTINNL**************FSELCL*************LTSVIMRLEVLQRNACQKIRQL************************IVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKL***************DLFSDLDYLEVELQQL*****DLAQEILALQVVTEE******************************************************************************************************************************************************************************************************************************************L***LNEKHCQLLDLDQQKSELVQLKLLVLD*************************HELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHE***************************************L*********LMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCN********QILKTEEFRNLSIHLK*****************************LRIAFIKEQCETKVQELKHH**********MLWKLQ***********************GVKILELEAELQ*L*********AYDLAKAELECSLMSLECCKEE***LEVSLHEC*****RLYSDLSLMKKLLYSSTFPC************************************************************NGIDAQNLCLGLSDEGSYSCTL**************************************II*************************************************
***********T***VVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTV**************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************RE***
********VEKTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSKTGFREFEQQRELRERGLQSGNDQISPDQSCGGINNQLDKVNARVRFKDKSKELPSLEEEAGLNEDYTESAVGFDGSSNTSE**************IDSLKSIVSGDLAGLNQS**************AQGTNDWVPGWGLDYSADNDLAIAHEENNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASGEQLAEEVSALKSECSHLKSDLERLIKLRSCHPFTSWEAVKQGTEITMSTPNLVTCDRSKVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALAS********ADSVGGKIFELLRELDESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMRLDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIRLMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLESNLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKESLENGNLRRETSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQNLESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEKDRAEMSFSKSESDIVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQQLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSVHDELHGERSLSDELKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQE*********************LRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDQQKSELVQLKLLVLDLE********************SEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYKSRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHEKRESEGLPTGMQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRKKSEAAHLKKNEELGVKILELEAELQSLISDKREKTKAYDLAKAELECSLMSLECCKEEKEKLEVSLHECNEEKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKESCISNELTGRNVQK******************NGPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEHPEQDVLQSSCLNGSSSLALVNQENTKSNDTKDLAIINDQFRVQSLKSSMDLLNEELERMKNENSLSRGDHNFDPKFSSLQRELMD
**********KTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSK***************************************************************************************************************************************************LAI*H**NNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASGEQLAEEVSALKSECSHLKSDLERLIKLRSCHPFTSWEAVKQGTEITMSTPNLVTCDRSKVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEHSSVNPADSVGGKIFELLRELDESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMRLDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIRLMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLESNLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKESLENGNLRRETSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQNLESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEKDRAEMSFSKSESDIVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQQLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSVHDELHGERSLSDELKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESSRLASEGNTSKETLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDQQKSELVQLKLLVLDLESEKSRASEESSSVTSLQSELSEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYKSRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQHAQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHEKRESEGLPTGMQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRKKSEAAHLKKNEELGVKILELEAELQSLISDKREKTKAYDLAKAELECSLMSLECCKEEKEKLEVSLHECNEEKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKESCISNELTGRNVQKTTNADTKSHGRMSADDTG*GPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEHPEQDVLQSSCLNGSSSLALVNQENTKSNDTKDLAIINDQFRVQSLKSSMDLLNEELERMKNENSL******FDPKFSSLQR****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSRVARWKVEKTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLTSKTGFREFEQQRELRERGLQSGNDQISPDQSCGGINNQLDKVNARVRFKDKSKELPSLEEEAGLNEDYTESAVGFDGSSNTSESLYAEKHDASSTHEIDSLKSIVSGDLAGLNQSLQQEKGDPSDQRFSAQGTNDWVPGWGLDYSADNDLAIAHEENNRLRGCxxxxxxxxxxxxxxxxxxxxxADEIGMEAQNFAQKLAAEIASGEQLAEExxxxxxxxxxxxxxxxxxxxxRSCHPFTSWEAVKQGTEITMSTPNLVTCDRSKVNEIGEISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEHSSVNPADSVGGKxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHSSCLYTVSSAKAEIEAMRLDxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxYSIAVNQLQKDLELLSSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVKKQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTxxxxxxxxxxxxxxxxxxxxxLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQxxxxxxxxxxxxxxxxxxxxxLSEWESLMMDFKSFQSKYEAIAAExxxxxxxxxxxxxxxxxxxxxTSLLQKELETVKIDxxxxxxxxxxxxxxxxxxxxxxxxxxxxYGESFSELCLHNKSADQNLESKDLTSVIMRLEVLxxxxxxxxxxxxxxxxxxxxxxxxxxxxFSKSESDIVLVKQKFEHDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxSVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSxxxxxxxxxxxxxxxxxxxxxLQSVHDELHGERSLSDELKSRAIDISSQxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxSSRLASEGNTSKETxxxxxxxxxxxxxxxxxxxxxVTDLTSQLNEKHCQLLDxxxxxxxxxxxxxxxxxxxxxxxxxxxxSSSVTSLQSELSEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKNVDVETVLNSCLAREAQCNEExxxxxxxxxxxxxxxxxxxxxNRVLFHENNSLIAQSEEYKSRAETMADNYGEHxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxVISxxxxxxxxxxxxxxxxxxxxxxxxxxxxTEEFRNLSIHLKELKDKADAECLKLHEKRESEGLPTGMQESLRIAFIKEQCETKVQELKHHLSISxxxxxxxxxxxxxxxxxxxxxKKSEAAHLKKxxxxxxxxxxxxxxxxxxxxxxxxxxxxYDLAKAELECSLMSxxxxxxxxxxxxxxxxxxxxxKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKESCISNELTGRNVQKTTNADTKSHGRMSADDTGNGPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEHPEQDVLQSSCLNGSSSLALVNQENTKSNDTKDLAIINDQFRxxxxxxxxxxxxxxxxxxxxxxxxxxxxHNFDPKFSSLQRELMD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1872
255545606 1998 ATP binding protein, putative [Ricinus c 0.951 0.891 0.546 0.0
15219189 1999 uncharacterized protein [Arabidopsis tha 0.947 0.887 0.452 0.0
92806911970 F2E2.13 [Arabidopsis thaliana] 0.932 0.885 0.448 0.0
297850656 2000 hypothetical protein ARALYDRAFT_889705 [ 0.946 0.886 0.448 0.0
2240631851566 predicted protein [Populus trichocarpa] 0.758 0.906 0.523 0.0
3021419051038 unnamed protein product [Vitis vinifera] 0.502 0.905 0.602 0.0
356510227 1986 PREDICTED: uncharacterized protein LOC10 0.725 0.683 0.478 0.0
2240847481011 predicted protein [Populus trichocarpa] 0.487 0.903 0.478 0.0
2181843851899 hypothetical protein OsI_33230 [Oryza sa 0.591 0.582 0.387 0.0
2226126961899 hypothetical protein OsJ_31239 [Oryza sa 0.591 0.582 0.386 0.0
>gi|255545606|ref|XP_002513863.1| ATP binding protein, putative [Ricinus communis] gi|223546949|gb|EEF48446.1| ATP binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score = 1841 bits (4768), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1062/1944 (54%), Positives = 1369/1944 (70%), Gaps = 163/1944 (8%)

Query: 1    MSRVARWKVEKTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGT 60
            MSR+ +WK+EKTKVKVVFRLQFHATHI   GWDKLFISFIPADSGK   KTTKA+VRNGT
Sbjct: 1    MSRITKWKLEKTKVKVVFRLQFHATHIPLVGWDKLFISFIPADSGKVTSKTTKASVRNGT 60

Query: 61   CKWADPIYETTRLLQDIKTKQYDEKLYKFVVAMGSSRSSILGEATINLADYADASKPSTV 120
            CKWADPIYETTRLLQDIKTKQYDEKLYK V+AMGSSRSSILGEATINLA YADA KP  +
Sbjct: 61   CKWADPIYETTRLLQDIKTKQYDEKLYKLVIAMGSSRSSILGEATINLAHYADALKPFVI 120

Query: 121  LLPLHGGDSGTILHVTVQLLTSKTGFREFEQQRELRERGLQSGNDQISPDQSCG------ 174
             LPLHG DSGTILHVTVQLLTSKTGFREFEQQRELRERGLQ+  DQ SPD+S G      
Sbjct: 121  ALPLHGCDSGTILHVTVQLLTSKTGFREFEQQRELRERGLQT--DQHSPDESSGRKVSSS 178

Query: 175  --GINNQLDK---VNARVRFKDKSKELPSLEEEAGLNEDYTESAVGFDGSSNTSESLYAE 229
               I  Q+DK    + RV+F++KSK+L SLEEE    ++Y +S VGFDGSSNTSESLYAE
Sbjct: 179  VETITEQIDKDHKAHTRVKFREKSKDLSSLEEEVVPTDEYADSGVGFDGSSNTSESLYAE 238

Query: 230  KHDASSTHEIDSLKSIVSGDLAGLN--QSLQQEKGDPSDQRFSAQGTNDWVPGWGLDYSA 287
            KH+ SSTHEIDSL+S VSGDLAG++  QS Q EKGDP D RFS QGTNDWV GW  DYS 
Sbjct: 239  KHETSSTHEIDSLRSTVSGDLAGISPSQSPQLEKGDPPDNRFSVQGTNDWVQGWSSDYSV 298

Query: 288  DNDLAIAHEENNRLRGCLEMAESSIHELKLEVSSLQSHADEIGMEAQNFAQKLAAEIASG 347
            DNDLA A+EEN+RLRG LE AESSIHELK+EVSSLQ+HADEIG EAQ FA++LAAEIASG
Sbjct: 299  DNDLAAAYEENSRLRGSLEAAESSIHELKMEVSSLQNHADEIGHEAQKFAKELAAEIASG 358

Query: 348  EQLAEEV-------SALKSECSHLK--------------------------------SDL 368
            E L  EV       S LK +   LK                                 D 
Sbjct: 359  EDLVNEVSVLKSECSKLKDDLEQLKISKLCPSFIDREAFGAEQDQISLRWLKGLLAMEDK 418

Query: 369  ERLIKLRSCHPFTS----------------WEAVKQGTEITMSTPNLVTCDRSKVNEIGE 412
             R ++ ++C  +                   + +KQG+ +T+S+PNL+  + + + EI E
Sbjct: 419  IRELQNKACLGYNERDLRIFQSDIEALLDVLQNLKQGSGLTVSSPNLILSEGASLKEIRE 478

Query: 413  ISLNKYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEHSSVNPADSVGGKIFELLREL 472
            +S  K  Q+ +GTG + +LYQP+  MLHC+ IP L SHE  +V+  +++  KIFELLREL
Sbjct: 479  MSPYKNGQFATGTGFDVDLYQPE-GMLHCLNIPNLISHESDTVDTTNAMKNKIFELLREL 537

Query: 473  DESKAERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEI 532
            D+SKAERESLAKKMDQMECYYEAL+QELEENQR +L ELQNLRNEHS+CLY +SS KA++
Sbjct: 538  DDSKAERESLAKKMDQMECYYEALVQELEENQRQLLQELQNLRNEHSTCLYAISSTKADM 597

Query: 533  EAMRLDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELL 592
            E+M   ++EQ+LR + ++ D+ES NKELERRA++AEAALKRARLNYSIAV+QLQKDLELL
Sbjct: 598  ESMHQGLNEQILRLAGDKHDMESFNKELERRALTAEAALKRARLNYSIAVDQLQKDLELL 657

Query: 593  SSQVLSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLACQDQYIGVK 652
            S QVLSMY++NENLI+QAFVDS  P          +R  +  E +A + L  ++Q  G++
Sbjct: 658  SFQVLSMYESNENLIRQAFVDSSPPN---------SRGCDSGE-YAVKLLQFENQSAGIR 707

Query: 653  KQHLGGDILIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIR 712
            KQ LGGDI +++LKRSL+LQEGLYRKVEEE CE+  VNIYLDV SK LQ+TL+ A  +++
Sbjct: 708  KQQLGGDIHLDELKRSLHLQEGLYRKVEEEVCEMHFVNIYLDVLSKALQETLVGACEDVQ 767

Query: 713  LMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLES 772
             +KE+++EL+ QLEL   S +LL+Q+LQ AM+++H LNEYKA+CIAKCNDMAL+NQ L +
Sbjct: 768  HLKEKVNELTQQLELLGNSKQLLIQKLQIAMDEVHSLNEYKAACIAKCNDMALENQTLGA 827

Query: 773  NLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKESLENGNLRRE 832
            +LQ+++ EN  L QK++EW+S++++++ ++ K EA AAE  EL  LLEK++LE G L+ E
Sbjct: 828  DLQNMSHENHLLMQKIAEWKSMVIEYRGYEEKLEAYAAENGELTCLLEKKTLEIGILQNE 887

Query: 833  TSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQN 892
               LQ EL+T+KI+F ELAS N+NLQ  +N+LQNK+ ++L SY +S  E+ L ++S+ Q+
Sbjct: 888  NISLQDELKTIKIEFAELASGNENLQNFVNSLQNKLQNLLLSYDKSIIEIHLVSESSSQD 947

Query: 893  LESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEKDRAEMSFSKSESDIVLVKQKFE 952
            L++KDL  ++M+LE LQ NAC KI QL++EKK L+ EKD A++S + +ESD   +K KFE
Sbjct: 948  LQNKDLPGLLMQLEELQHNACNKILQLVEEKKYLMHEKDVAQLSITAAESDTASMKWKFE 1007

Query: 953  HDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQ 1012
            H++RNM++K   SN LLQKLQL  EA A++L VSSE+EE  AQ+  +LFS +D LEVELQ
Sbjct: 1008 HEIRNMVEKLDKSNVLLQKLQLDVEAFANRLGVSSELEEKYAQQQNELFSGIDQLEVELQ 1067

Query: 1013 QLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDS 1072
            +L+SKNRDLA EI+AL+           T +ELT+EN+AL V LQDK+EES KL+ E+ S
Sbjct: 1068 ELTSKNRDLANEIIALET---------GTAAELTKENQALTVYLQDKNEESSKLSSELKS 1118

Query: 1073 FKQSFQSLHDE-LLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQKTAVLTK 1131
             K+S QSL+DE + +  S  D ++      ++QL++      + D  KS L      L  
Sbjct: 1119 LKESLQSLYDENMALIASSHDKMEK-----SAQLAS------EVDGLKSSL----QSLRD 1163

Query: 1132 ENQDLMVSLQNKSEEAAKLAVELDSVRNSLQSVHDELHGERSLSDELKSRAIDISSQLNE 1191
            ENQ LMV+ Q+K+ EAAKL +EL+S++ +LQSV+DE      +S +       ++S+LN 
Sbjct: 1164 ENQALMVASQDKAAEAAKLELELNSLKGNLQSVNDENQALMVISRDKTEECAKLASELNN 1223

Query: 1192 KQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESSRLASEGNTSKETLQSLRDE 1251
             ++ L                L  + +AL++   +  +ES++ A E N  +E+LQSL ++
Sbjct: 1224 LKESL--------------QSLHDDKKALVL---DKKDESAQFAGELNCLRESLQSLHNQ 1266

Query: 1252 LQSERSLRDELKNVVTDLTSQLNEKHCQLLDLDQQKSELVQLKLLVLDL----ESEKSRA 1307
            L  ERSLR+ L++ VTD  S+LNEK  Q+L L++  S+L    L V  L    E     A
Sbjct: 1267 LHGERSLREGLESKVTDQISKLNEKEYQVLRLNKSVSDLESENLRVCSLLSHYEDSLKIA 1326

Query: 1308 SEESSSVTSLQSELSEMHELLLAVDVRLIFTRTQYEAWVEELVQQVYSTDRLLTELHTKN 1367
             EE SS+  L+ EL +M ELL+A DV LIFT+TQYE    ELV Q+ ++D  L EL  K+
Sbjct: 1327 REECSSIPDLKIELCKMDELLIATDVSLIFTKTQYENKAAELVLQLRASDTYLDELQKKH 1386

Query: 1368 VDVETVLNSCLAREAQCNEENARLLTSLDTLRSELDSAIAENRVLFHENNSLIAQSEEYK 1427
            ++VET LN CLA EA+  EENA+LL SL+++RSEL+++IAENR+L   N    A+ EEYK
Sbjct: 1387 IEVETTLNRCLANEAEYTEENAKLLASLNSMRSELEASIAENRLLVEANRVTTAELEEYK 1446

Query: 1428 SRAETMADNYGEHKSQLALEVERMKQLLVGSEEEIDDLMMSREELEIKVVVLKAKLAEQH 1487
              A  +  N  E + Q +L VER+K LLV SEEEID+L++S+EELE+KV+VLKAKL E+ 
Sbjct: 1447 DWARDVRLN-CEDQRQHSLVVERLKHLLVSSEEEIDNLVLSKEELEVKVLVLKAKLDEEQ 1505

Query: 1488 AQVISSEGYIDEQKMLQNQCNELRRKLSEQILKTEEFRNLSIHLKELKDKADAECLKLHE 1547
            AQ+ + E Y+DE  +L+ Q NEL ++L++QILKTEEFRNLSIHLKELKDKA+AEC+   E
Sbjct: 1506 AQITTMERYLDELMILKKQYNELSQRLADQILKTEEFRNLSIHLKELKDKAEAECVHARE 1565

Query: 1548 KRESEGLPTGMQESLRIAFIKEQCETKVQELKHHLSISKKHSEEMLWKLQDAIDEIENRK 1607
            K+++E  P  MQESLRIAFIKEQ ET++QELK  LSISKKHSEEMLWKLQDAIDE +N K
Sbjct: 1566 KKDTEA-PVAMQESLRIAFIKEQYETRLQELKQQLSISKKHSEEMLWKLQDAIDENDNMK 1624

Query: 1608 KSEAAHLKKNEELGVKILELEAELQSLISDKREKTKAYDLAKAELECSLMSLECCKEEKE 1667
            KSEA HLKKNEELGVKILELEAELQ+++SDKRE+  AYDL KAE+ECSL+SLECCKEEK+
Sbjct: 1625 KSEACHLKKNEELGVKILELEAELQAVLSDKRERMNAYDLMKAEMECSLISLECCKEEKQ 1684

Query: 1668 KLEVSLHECNEEKSRLYSDLSLMKKLLYSSTFPCRKEGNDGLHKESCISNELTGRNVQKT 1727
            KLE SL ECNEEKS+L  +++ MK+LL +S                      + RN+++ 
Sbjct: 1685 KLEASLQECNEEKSKLAVEIAQMKELLENSK---------------------SARNIKEK 1723

Query: 1728 TNADTKSHGRMSADDTGNGPTGDVDEYLEHENMTNGIDAQNLCLGLSDEGSYSCTLMKEH 1787
             N ++     + +D              +++ +   +    + L  + +G  S  L   H
Sbjct: 1724 GNCESCRVDSIFSDICD-----------KNQKILKFLPPCTVILN-TLKGFVSKYLFALH 1771

Query: 1788 PEQDVLQSSCLNGSSSLALVNQENTKSNDTKDLAIINDQFRVQSLKSSMDLLNEELERMK 1847
              QD L SS +NG  S  L+N E    +D K LA+IND FR ++LKSSMD LN ELERMK
Sbjct: 1772 G-QDALLSSGVNGVQSSMLLNDERFLHSDMKQLALINDHFRAENLKSSMDHLNNELERMK 1830

Query: 1848 NENSLSRGDHNFDPKFSSLQRELM 1871
            NENSL + DH FD KF +LQ E M
Sbjct: 1831 NENSLLQNDHYFDKKFPALQSEFM 1854




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|15219189|ref|NP_173625.1| uncharacterized protein [Arabidopsis thaliana] gi|332192069|gb|AEE30190.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|9280691|gb|AAF86560.1|AC069252_19 F2E2.13 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297850656|ref|XP_002893209.1| hypothetical protein ARALYDRAFT_889705 [Arabidopsis lyrata subsp. lyrata] gi|297339051|gb|EFH69468.1| hypothetical protein ARALYDRAFT_889705 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|224063185|ref|XP_002301031.1| predicted protein [Populus trichocarpa] gi|222842757|gb|EEE80304.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|302141905|emb|CBI19108.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356510227|ref|XP_003523841.1| PREDICTED: uncharacterized protein LOC100814687 [Glycine max] Back     alignment and taxonomy information
>gi|224084748|ref|XP_002307399.1| predicted protein [Populus trichocarpa] gi|222856848|gb|EEE94395.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|218184385|gb|EEC66812.1| hypothetical protein OsI_33230 [Oryza sativa Indica Group] Back     alignment and taxonomy information
>gi|222612696|gb|EEE50828.1| hypothetical protein OsJ_31239 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1872
TAIR|locus:20305411999 AT1G22060 [Arabidopsis thalian 0.510 0.477 0.399 2.2e-189
TAIR|locus:2145141853 AT5G52280 "AT5G52280" [Arabido 0.439 0.963 0.227 4.5e-44
TAIR|locus:20384461029 AT1G63300 "AT1G63300" [Arabido 0.523 0.952 0.221 3.5e-37
TAIR|locus:2162996983 AT5G41140 "AT5G41140" [Arabido 0.490 0.933 0.200 7.4e-35
UNIPROTKB|E1BRE51411 EEA1 "Uncharacterized protein" 0.550 0.729 0.207 8.6e-26
UNIPROTKB|F1NMQ0 2150 GOLGA4 "Uncharacterized protei 0.496 0.432 0.218 4.7e-34
UNIPROTKB|F1NI68 2142 GOLGA4 "Uncharacterized protei 0.496 0.433 0.218 1.6e-33
DICTYBASE|DDB_G02905031492 DDB_G0290503 [Dictyostelium di 0.582 0.731 0.207 1.9e-33
UNIPROTKB|Q3V6T2 1871 CCDC88A "Girdin" [Homo sapiens 0.487 0.487 0.204 3.1e-33
ZFIN|ZDB-GENE-060929-860 2690 cenpe "centromere protein E" [ 0.692 0.482 0.215 6.7e-33
TAIR|locus:2030541 AT1G22060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1641 (582.7 bits), Expect = 2.2e-189, Sum P(2) = 2.2e-189
 Identities = 403/1009 (39%), Positives = 606/1009 (60%)

Query:   305 LEMAESSIHELKLEVSSLQ---SHADEIGMEAQNFAQKLAAEIASGEQLAEE-VSALKSE 360
             + + +S   +LK E+  L+   SH      +  N    L      G  + E+ +  ++++
Sbjct:   349 VSVLKSECSKLKEEMERLRNVKSHVLFNSKDQDNVPHSLQLRWLQGLLVVEDNIREIQNK 408

Query:   361 CSHLKSDLERLIKLRSCHPFTSWEAVKQG--TEITMSTPNLVTCDRSKV--NEIGEISLN 416
               +   D +  + L     F S   V Q   T+I     +  T    K+   +  E  L+
Sbjct:   409 VCYGYHDRDLRLFLSD---FESLLGVLQDFKTQIEQPISHFSTVPSEKIIMTDSKERGLS 465

Query:   417 KYEQYLSGTGLETELYQPDLDMLHCVGIPALASHEHSSVNPADSVGGKIFELLRELDESK 476
             K + ++SG+ ++T++YQP+LD L  +G+P L S E +S +   ++  KI EL+R LDESK
Sbjct:   466 KAKHFVSGSEVDTDIYQPELDPLQYLGMPDLTSREPNSADSVSAMRDKILELVRGLDESK 525

Query:   477 AERESLAKKMDQMECYYEALIQELEENQRHMLGELQNLRNEHSSCLYTVSSAKAEIEAMR 536
             AER+SL KKMDQMECYYE+L+QELEE QR +L ELQ+LR EHS+CLY++S AKAE+E +R
Sbjct:   526 AERDSLTKKMDQMECYYESLVQELEETQRQLLVELQSLRTEHSTCLYSISGAKAEMETLR 585

Query:   537 LDVDEQVLRFSEERRDLESLNKELERRAVSAEAALKRARLNYSIAVNQLQKDLELLSSQV 596
              D++EQ LRFSEE++ L+S N+EL++RA++AEAALKRARLNYSIAVN LQKDLELLSSQV
Sbjct:   586 HDMNEQTLRFSEEKKTLDSFNEELDKRAMAAEAALKRARLNYSIAVNHLQKDLELLSSQV 645

Query:   597 LSMYQTNENLIKQAFVDSPQPTCQEYQDMVQNRKLNPEESHANRHLAC---QDQYIGVKK 653
             +SM++TNENLIKQAF   P+P  Q + + +Q+   +  E    R +     Q++  G+K+
Sbjct:   646 VSMFETNENLIKQAF---PEPP-QSFHECIQSTDDSISEKQDTRDVKLIQFQNEKKGMKE 701

Query:   654 QHLGGDI-LIEDLKRSLYLQEGLYRKVEEEACELLSVNIYLDVFSKTLQKTLLEASGEIR 712
             + L GDI L+ED+KRSL++QE LY+KVEEE  E+ S N+YL+VFS  L++T LEAS +IR
Sbjct:   702 RPLKGDIILLEDMKRSLHVQESLYQKVEEELYEMHSRNLYLEVFSNILRETFLEASVDIR 761

Query:   713 LMKERMDELSHQLELSTESNELLMQRLQTAMNDIHFLNEYKASCIAKCNDMALQNQLLES 772
             +MK ++DEL  QLELSTE+ E+L QRL   ++++  L E K +CIAK N +ALQNQ LE+
Sbjct:   762 IMKAKIDELGWQLELSTEAKEILKQRLDITLDEVCSLKEEKTTCIAKWNAVALQNQSLEA 821

Query:   773 NLQDVTCENRHLTQKLSEWESLMMDFKSFQSKYEAIAAEKTELANLLEKESLENGNLRRE 832
             NLQ++T EN  L QK+ E ES++++ KS+++ YE    EK ELA L+EKE++E  + R  
Sbjct:   822 NLQNITHENLILLQKIDELESVVLESKSWKTNYETCICEKKELAELMEKEAVEKAHYRTR 881

Query:   833 TSLLQKELETVKIDFDELASVNKNLQRTINNLQNKMHDMLSSYGESFSELCLHNKSADQN 892
              + +Q E + V+  FD+LA+ N NLQ+ +++L +K+ + L  Y E    L    +  D +
Sbjct:   882 LATVQAEFDAVRGKFDDLATANGNLQQNLSSLTDKLINTLGCYNEKLVSLPQW-EGVDLD 940

Query:   893 LESKDLTSVIMRLEVLQRNACQKIRQLMQEKKALIDEKDRAEMSFSKSESDIVLVKQKFE 952
              ES DLT    +L+      C+K   L+ E   L+ EK   E     +ESD++ +KQ  E
Sbjct:   941 FESHDLTE---QLDKFLCKICEKCFVLISENNGLMKEKSMTESYLRAAESDVMELKQMHE 997

Query:   953 HDLRNMIDKQSVSNALLQKLQLRFEAVADKLKVSSEVEENNAQRHTDLFSDLDYLEVELQ 1012
             +D++ M+ K   S ALL++LQL  E+V DK+KV +E E+N   RH DL S LD+ E E+ 
Sbjct:   998 NDVQCMVTKLEASTALLRRLQLETESVMDKMKVITEDEQNYESRHLDLLSRLDHFENEMH 1057

Query:  1013 QLSSKNRDLAQEILALQVVTEEFDRSKQTISELTEENRALMVALQDKSEESVKLALEVDS 1072
              L SKN  L QEI  L  V  E  R+K  + EL EE + ++V+LQDKS+E++ L  E+++
Sbjct:  1058 LLVSKNEGLGQEISELSSVAVEHGRTKLLVEELAEEKKRVLVSLQDKSQETLGLVRELEN 1117

Query:  1073 FKQSFQSLHDELLVERSLRDDLKSAVSDITSQLSAKHSQLLDFDQQKSELIQ-KTAV--- 1128
              K++F   H EL +ER+LR +L+  + D+TS++ AK S+L+ FD+Q SEL++ K  V   
Sbjct:  1118 LKKTFD--H-ELRLERNLRQELEIKMQDLTSEVIAKSSKLMSFDEQSSELVRLKQMVSDL 1174

Query:  1129 ----------LTKENQDLMVSLQNKSEEAAKLAVEL-----DSVRNSLQSVHDELHGERS 1173
                       LT+    L  SL   S   + L  ++      SV   +Q V      E +
Sbjct:  1175 ELEKATHTHRLTRYETSLR-SLTRDSSYISDLESQILEMMEISVAADIQIVFTRTEWE-T 1232

Query:  1174 LSDELKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMVSLQEYAEESSR 1233
              +D+L     ++ +  N+ +   +     N+  I+ +A+L S    L +  +       R
Sbjct:  1233 YADKLHKDHFEVLTAFNDSRN--VGAQHMNAN-IKLLADLDSLKSELKIE-RNLRNNLDR 1288

Query:  1234 LASEGNTSKETLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLD 1282
                E  +  +    L +    ++S  + L+ +V +L S   EK  Q L+
Sbjct:  1289 RVEELTSELDEKHLLLENFDLQKSQVELLEKMVAELES---EKSFQRLE 1334


GO:0005737 "cytoplasm" evidence=ISM
GO:0005773 "vacuole" evidence=IDA
GO:0010075 "regulation of meristem growth" evidence=RCA
TAIR|locus:2145141 AT5G52280 "AT5G52280" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2038446 AT1G63300 "AT1G63300" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2162996 AT5G41140 "AT5G41140" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|E1BRE5 EEA1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|F1NMQ0 GOLGA4 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|F1NI68 GOLGA4 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0290503 DDB_G0290503 [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|Q3V6T2 CCDC88A "Girdin" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-060929-860 cenpe "centromere protein E" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1872
pfam10358142 pfam10358, NT-C2, N-terminal C2 in EEIG1 and EHBP1 2e-24
TIGR021681179 TIGR02168, SMC_prok_B, chromosome segregation prot 7e-16
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 4e-14
TIGR021681179 TIGR02168, SMC_prok_B, chromosome segregation prot 5e-14
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 2e-12
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 4e-11
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 4e-10
COG0419908 COG0419, SbcC, ATPase involved in DNA repair [DNA 3e-08
TIGR021681179 TIGR02168, SMC_prok_B, chromosome segregation prot 1e-07
TIGR021681179 TIGR02168, SMC_prok_B, chromosome segregation prot 2e-07
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 2e-07
TIGR021691164 TIGR02169, SMC_prok_A, chromosome segregation prot 2e-07
TIGR006061311 TIGR00606, rad50, rad50 3e-07
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 6e-07
TIGR006181042 TIGR00618, sbcc, exonuclease SbcC 1e-06
COG0419908 COG0419, SbcC, ATPase involved in DNA repair [DNA 2e-06
pfam121281198 pfam12128, DUF3584, Protein of unknown function (D 4e-06
pfam05483787 pfam05483, SCP-1, Synaptonemal complex protein 1 ( 4e-06
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 5e-06
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 6e-06
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 8e-06
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 9e-06
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 1e-05
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 2e-05
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 2e-05
TIGR021691164 TIGR02169, SMC_prok_A, chromosome segregation prot 2e-05
pfam01576859 pfam01576, Myosin_tail_1, Myosin tail 2e-05
COG11961163 COG1196, Smc, Chromosome segregation ATPases [Cell 3e-05
pfam01576859 pfam01576, Myosin_tail_1, Myosin tail 4e-05
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 7e-05
pfam024631162 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain 8e-05
pfam12128 1198 pfam12128, DUF3584, Protein of unknown function (D 2e-04
PRK02224880 PRK02224, PRK02224, chromosome segregation protein 2e-04
PRK02224880 PRK02224, PRK02224, chromosome segregation protein 3e-04
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 4e-04
PHA02562562 PHA02562, 46, endonuclease subunit; Provisional 4e-04
pfam10186307 pfam10186, Atg14, UV radiation resistance protein 4e-04
PLN02939 977 PLN02939, PLN02939, transferase, transferring glyc 6e-04
COG3206458 COG3206, GumC, Uncharacterized protein involved in 7e-04
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 8e-04
COG0419 908 COG0419, SbcC, ATPase involved in DNA repair [DNA 0.001
PRK02224880 PRK02224, PRK02224, chromosome segregation protein 0.001
PRK02224880 PRK02224, PRK02224, chromosome segregation protein 0.001
TIGR04211198 TIGR04211, SH3_and_anchor, SH3 domain protein 0.001
TIGR02168 1179 TIGR02168, SMC_prok_B, chromosome segregation prot 0.002
COG0419 908 COG0419, SbcC, ATPase involved in DNA repair [DNA 0.002
TIGR021691164 TIGR02169, SMC_prok_A, chromosome segregation prot 0.002
pfam12128 1198 pfam12128, DUF3584, Protein of unknown function (D 0.002
pfam09726680 pfam09726, Macoilin, Transmembrane protein 0.002
>gnl|CDD|220715 pfam10358, NT-C2, N-terminal C2 in EEIG1 and EHBP1 proteins Back     alignment and domain information
 Score =  100 bits (250), Expect = 2e-24
 Identities = 41/131 (31%), Positives = 60/131 (45%), Gaps = 3/131 (2%)

Query: 13  KVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTR 72
           KVK  F L  H          +LF+ +   D     G T+KA V NG   W +       
Sbjct: 4   KVKFQFDLSIHELQNVPLVNGELFVKWKRGDKKGNSGTTSKALVNNGRAVWNEEFSIPCT 63

Query: 73  LLQDIKTKQYDEKLYKFVVAM--GSSRSSILGEATINLADYADASKPSTV-LLPLHGGDS 129
           L  D K  +++ KL  FVV       +  +LG+A I+LA+YA++ +P+T  LL       
Sbjct: 64  LFVDKKGGKFEPKLLTFVVYKVTKKGKKKVLGKADIDLAEYANSKEPTTRRLLLKKCSKK 123

Query: 130 GTILHVTVQLL 140
              L +T+ LL
Sbjct: 124 NATLSITISLL 134


This version of the C2 domain was initally identified in the vertebrate estrogen early-induced gene 1 (EEIG1), and its Drosophila ortholog required for uptake of dsRNA via the endocytotic machinery to induce RNAi silencing. It is also in C.elegans ortholog Sym-3 (SYnthetic lethal with Mec-3) and the mammalian protein EHBP1 (EH domain Binding Protein-1) that regulates endocytotic recycling and two plant proteins, RPG that regulates Rhizobium-directed polar growth and PMI1 (Plastid Movement Impaired 1) that is essential for intracellular movement of chloroplasts in response to blue light. Length = 142

>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|129694 TIGR00606, rad50, rad50 Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584) Back     alignment and domain information
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1) Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain Back     alignment and domain information
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584) Back     alignment and domain information
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional Back     alignment and domain information
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and autophagy-related subunit 14 Back     alignment and domain information
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups Back     alignment and domain information
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional Back     alignment and domain information
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein Back     alignment and domain information
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584) Back     alignment and domain information
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 1872
PF10358143 NT-C2: N-terminal C2 in EEIG1 and EHBP1 proteins; 99.75
KOG4674 1822 consensus Uncharacterized conserved coiled-coil pr 99.67
KOG01611930 consensus Myosin class II heavy chain [Cytoskeleto 99.67
KOG01611930 consensus Myosin class II heavy chain [Cytoskeleto 99.52
TIGR006061311 rad50 rad50. This family is based on the phylogeno 99.49
KOG46741822 consensus Uncharacterized conserved coiled-coil pr 99.38
TIGR006061311 rad50 rad50. This family is based on the phylogeno 99.35
COG11961163 Smc Chromosome segregation ATPases [Cell division 99.23
TIGR021681179 SMC_prok_B chromosome segregation protein SMC, com 99.12
PRK02224880 chromosome segregation protein; Provisional 99.05
PRK02224880 chromosome segregation protein; Provisional 98.83
PRK03918880 chromosome segregation protein; Provisional 98.8
TIGR021681179 SMC_prok_B chromosome segregation protein SMC, com 98.8
TIGR021691164 SMC_prok_A chromosome segregation protein SMC, pri 98.77
TIGR021691164 SMC_prok_A chromosome segregation protein SMC, pri 98.71
PRK03918880 chromosome segregation protein; Provisional 98.34
PF01576859 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Mu 98.2
PF10174775 Cast: RIM-binding protein of the cytomatrix active 98.15
PF01576859 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Mu 98.15
COG11961163 Smc Chromosome segregation ATPases [Cell division 98.13
PF121281201 DUF3584: Protein of unknown function (DUF3584); In 98.08
PF10174775 Cast: RIM-binding protein of the cytomatrix active 98.06
PF121281201 DUF3584: Protein of unknown function (DUF3584); In 97.99
KOG09961293 consensus Structural maintenance of chromosome pro 97.78
PF05701522 WEMBL: Weak chloroplast movement under blue light; 97.77
PF00038312 Filament: Intermediate filament protein; InterPro: 97.7
KOG0978698 consensus E3 ubiquitin ligase involved in syntaxin 97.67
PF05483786 SCP-1: Synaptonemal complex protein 1 (SCP-1); Int 97.63
PF00038312 Filament: Intermediate filament protein; InterPro: 97.54
PF07888546 CALCOCO1: Calcium binding and coiled-coil domain ( 97.4
PF05701522 WEMBL: Weak chloroplast movement under blue light; 97.3
KOG0978698 consensus E3 ubiquitin ligase involved in syntaxin 97.3
PF05622713 HOOK: HOOK protein; InterPro: IPR008636 This famil 97.22
PRK01156895 chromosome segregation protein; Provisional 97.22
PF07888546 CALCOCO1: Calcium binding and coiled-coil domain ( 97.22
PF05622713 HOOK: HOOK protein; InterPro: IPR008636 This famil 97.18
KOG0977546 consensus Nuclear envelope protein lamin, intermed 97.15
KOG4673961 consensus Transcription factor TMF, TATA element m 97.05
PF05557722 MAD: Mitotic checkpoint protein; InterPro: IPR0086 96.94
KOG0976 1265 consensus Rho/Rac1-interacting serine/threonine ki 96.73
KOG0977546 consensus Nuclear envelope protein lamin, intermed 96.64
COG0419908 SbcC ATPase involved in DNA repair [DNA replicatio 96.59
PF05483786 SCP-1: Synaptonemal complex protein 1 (SCP-1); Int 96.53
KOG4643 1195 consensus Uncharacterized coiled-coil protein [Fun 96.37
PHA02562562 46 endonuclease subunit; Provisional 96.35
KOG09961293 consensus Structural maintenance of chromosome pro 96.16
COG0419908 SbcC ATPase involved in DNA repair [DNA replicatio 96.15
PF09730 717 BicD: Microtubule-associated protein Bicaudal-D; I 95.93
PRK04778569 septation ring formation regulator EzrA; Provision 95.86
PF05557722 MAD: Mitotic checkpoint protein; InterPro: IPR0086 95.85
PF15070617 GOLGA2L5: Putative golgin subfamily A member 2-lik 95.52
KOG0999772 consensus Microtubule-associated protein Bicaudal- 95.5
KOG06121317 consensus Rho-associated, coiled-coil containing p 95.26
KOG0971 1243 consensus Microtubule-associated protein dynactin 94.99
KOG0995581 consensus Centromere-associated protein HEC1 [Cell 94.94
PRK01156895 chromosome segregation protein; Provisional 94.62
PF09730717 BicD: Microtubule-associated protein Bicaudal-D; I 94.61
PRK04778569 septation ring formation regulator EzrA; Provision 94.58
PF06160560 EzrA: Septation ring formation regulator, EzrA ; I 94.58
KOG4643 1195 consensus Uncharacterized coiled-coil protein [Fun 94.54
KOG0976 1265 consensus Rho/Rac1-interacting serine/threonine ki 94.32
PHA02562562 46 endonuclease subunit; Provisional 94.28
PRK048631486 mukB cell division protein MukB; Provisional 93.59
PRK11637428 AmiB activator; Provisional 93.56
PF14662193 CCDC155: Coiled-coil region of CCDC155 93.54
KOG0946970 consensus ER-Golgi vesicle-tethering protein p115 92.84
PRK04863 1486 mukB cell division protein MukB; Provisional 92.64
KOG4673961 consensus Transcription factor TMF, TATA element m 91.13
PRK11637428 AmiB activator; Provisional 91.13
PF135141111 AAA_27: AAA domain 91.04
KOG06121317 consensus Rho-associated, coiled-coil containing p 90.43
PF10473140 CENP-F_leu_zip: Leucine-rich repeats of kinetochor 90.42
KOG0980980 consensus Actin-binding protein SLA2/Huntingtin-in 89.67
PF09726697 Macoilin: Transmembrane protein; InterPro: IPR0191 89.36
KOG4593716 consensus Mitotic checkpoint protein MAD1 [Cell cy 89.16
PF09789319 DUF2353: Uncharacterized coiled-coil protein (DUF2 88.88
KOG0963629 consensus Transcription factor/CCAAT displacement 88.68
PF09726697 Macoilin: Transmembrane protein; InterPro: IPR0191 88.29
KOG09941758 consensus Extracellular matrix glycoprotein Lamini 88.18
PF08317325 Spc7: Spc7 kinetochore protein; InterPro: IPR01325 87.66
PF00261237 Tropomyosin: Tropomyosin; InterPro: IPR000533 Trop 87.5
PF08317325 Spc7: Spc7 kinetochore protein; InterPro: IPR01325 87.3
PF04849306 HAP1_N: HAP1 N-terminal conserved region; InterPro 87.08
TIGR03185650 DNA_S_dndD DNA sulfur modification protein DndD. T 86.98
cd08389124 C2A_Synaptotagmin-14_16 C2A domain first repeat pr 86.89
PF09789319 DUF2353: Uncharacterized coiled-coil protein (DUF2 86.83
PF14662193 CCDC155: Coiled-coil region of CCDC155 85.78
KOG09711243 consensus Microtubule-associated protein dynactin 85.58
smart00787312 Spc7 Spc7 kinetochore protein. This domain is foun 85.52
TIGR01005754 eps_transp_fam exopolysaccharide transport protein 85.49
KOG0995581 consensus Centromere-associated protein HEC1 [Cell 85.17
PF09787511 Golgin_A5: Golgin subfamily A member 5; InterPro: 84.91
PRK09039343 hypothetical protein; Validated 84.66
PF05667594 DUF812: Protein of unknown function (DUF812); Inte 84.17
KOG0946970 consensus ER-Golgi vesicle-tethering protein p115 84.06
PF10473140 CENP-F_leu_zip: Leucine-rich repeats of kinetochor 83.33
KOG02501074 consensus DNA repair protein RAD18 (SMC family pro 82.97
PF15070617 GOLGA2L5: Putative golgin subfamily A member 2-lik 82.7
KOG09941758 consensus Extracellular matrix glycoprotein Lamini 82.49
TIGR03007498 pepcterm_ChnLen polysaccharide chain length determ 82.48
PF13851201 GAS: Growth-arrest specific micro-tubule binding 82.41
PF12718143 Tropomyosin_1: Tropomyosin like; InterPro: IPR0005 80.81
KOG4593716 consensus Mitotic checkpoint protein MAD1 [Cell cy 80.76
>PF10358 NT-C2: N-terminal C2 in EEIG1 and EHBP1 proteins; InterPro: IPR019448 This entry represents the N-terminal 150 residues of a family of conserved proteins which are induced by oestrogen [] Back     alignment and domain information
Probab=99.75  E-value=1.3e-18  Score=173.92  Aligned_cols=135  Identities=32%  Similarity=0.500  Sum_probs=125.2

Q ss_pred             ccceeEEEEEeeeecccccccccceEEEEeecCCCccccccccccccCCcccCCcccchhhhhccccchhchhhhhhhhh
Q 000199           11 KTKVKVVFRLQFHATHIQQTGWDKLFISFIPADSGKAIGKTTKANVRNGTCKWADPIYETTRLLQDIKTKQYDEKLYKFV   90 (1872)
Q Consensus        11 K~k~K~vFklqFhAtqVp~~gwd~L~vSiVp~D~GK~taKt~Ka~Vr~G~C~W~dpiyEtvkl~qD~kt~~~~ekiYkfV   90 (1872)
                      |.+.|+.|.|.+|..+..+..|..++|++--++.+++.+.|.++.|.+|.|.|++++..++++..|.+++.|++|.|+|+
T Consensus         2 ~~~~kf~~~l~i~~l~~~p~~~~~v~v~wkr~~~~~~~~~t~~~~~~~~~v~w~e~~~~~~tl~~~~k~~~~~~K~~~~~   81 (143)
T PF10358_consen    2 KKAVKFQFDLTIHELENLPSSNGKVFVKWKRGDKSKGSGTTSRANVKNGKVQWNEEFSFPCTLYRDKKSKEFQPKELKFS   81 (143)
T ss_pred             CceeeEEEEEEEEEeECcCCCCCEEEEEEEECCCCccceeeeeeeccccEEEEeeEEEEEEEEEEcCCCCcEeeEEEEEE
Confidence            67899999999888876667999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhcC--CCcccccceeecchhhccccc-CCeeeeecCCCC-CCCCeEEEEeeeecCCCC
Q 000199           91 VAMG--SSRSSILGEATINLADYADAS-KPSTVLLPLHGG-DSGTILHVTVQLLTSKTG  145 (1872)
Q Consensus        91 Vs~G--SsksgiLGEasinlAdY~~a~-kp~~VSLPLk~c-nsGtvLHVtIQ~l~~~~~  145 (1872)
                      |-.+  +.+...||.++||||+|++.. +|.++.+||+.| ..+|+|+|+|++..-+++
T Consensus        82 v~~~~~~~~k~~lG~~~inLaey~~~~~~~~~~~~~l~~~~~~~a~L~isi~~~~~~~~  140 (143)
T PF10358_consen   82 VFEVDGSGKKKVLGKVSINLAEYANEDEEPITVRLLLKKCKKSNATLSISISLSELRED  140 (143)
T ss_pred             EEEecCCCccceEEEEEEEHHHhhCcCCCcEEEEEeCccCCCCCcEEEEEEEEEECccC
Confidence            9775  666689999999999999996 999999999999 999999999999986543



Proteins in this entry are usually annotated as Fam102A, Fam102B, or Eeig1 (early oestrogen-responsive gene product 1).

>KOG4674 consensus Uncharacterized conserved coiled-coil protein [Function unknown] Back     alignment and domain information
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] Back     alignment and domain information
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] Back     alignment and domain information
>TIGR00606 rad50 rad50 Back     alignment and domain information
>KOG4674 consensus Uncharacterized conserved coiled-coil protein [Function unknown] Back     alignment and domain information
>TIGR00606 rad50 rad50 Back     alignment and domain information
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>PRK02224 chromosome segregation protein; Provisional Back     alignment and domain information
>PRK02224 chromosome segregation protein; Provisional Back     alignment and domain information
>PRK03918 chromosome segregation protein; Provisional Back     alignment and domain information
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type Back     alignment and domain information
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>PRK03918 chromosome segregation protein; Provisional Back     alignment and domain information
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril Back     alignment and domain information
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion [] Back     alignment and domain information
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril Back     alignment and domain information
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins Back     alignment and domain information
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion [] Back     alignment and domain information
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins Back     alignment and domain information
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function Back     alignment and domain information
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope Back     alignment and domain information
>KOG0978 consensus E3 ubiquitin ligase involved in syntaxin degradation [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PF05483 SCP-1: Synaptonemal complex protein 1 (SCP-1); InterPro: IPR008827 Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex Back     alignment and domain information
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope Back     alignment and domain information
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT) Back     alignment and domain information
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function Back     alignment and domain information
>KOG0978 consensus E3 ubiquitin ligase involved in syntaxin degradation [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms Back     alignment and domain information
>PRK01156 chromosome segregation protein; Provisional Back     alignment and domain information
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT) Back     alignment and domain information
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms Back     alignment and domain information
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure] Back     alignment and domain information
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription] Back     alignment and domain information
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins Back     alignment and domain information
>KOG0976 consensus Rho/Rac1-interacting serine/threonine kinase Citron [Signal transduction mechanisms] Back     alignment and domain information
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure] Back     alignment and domain information
>COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>PF05483 SCP-1: Synaptonemal complex protein 1 (SCP-1); InterPro: IPR008827 Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex Back     alignment and domain information
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown] Back     alignment and domain information
>PHA02562 46 endonuclease subunit; Provisional Back     alignment and domain information
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair] Back     alignment and domain information
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) [] Back     alignment and domain information
>PRK04778 septation ring formation regulator EzrA; Provisional Back     alignment and domain information
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins Back     alignment and domain information
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5 Back     alignment and domain information
>KOG0999 consensus Microtubule-associated protein Bicaudal-D [Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms] Back     alignment and domain information
>KOG0971 consensus Microtubule-associated protein dynactin DCTN1/Glued [Cell cycle control, cell division, chromosome partitioning; Cytoskeleton] Back     alignment and domain information
>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>PRK01156 chromosome segregation protein; Provisional Back     alignment and domain information
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) [] Back     alignment and domain information
>PRK04778 septation ring formation regulator EzrA; Provisional Back     alignment and domain information
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site Back     alignment and domain information
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown] Back     alignment and domain information
>KOG0976 consensus Rho/Rac1-interacting serine/threonine kinase Citron [Signal transduction mechanisms] Back     alignment and domain information
>PHA02562 46 endonuclease subunit; Provisional Back     alignment and domain information
>PRK04863 mukB cell division protein MukB; Provisional Back     alignment and domain information
>PRK11637 AmiB activator; Provisional Back     alignment and domain information
>PF14662 CCDC155: Coiled-coil region of CCDC155 Back     alignment and domain information
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
>PRK04863 mukB cell division protein MukB; Provisional Back     alignment and domain information
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription] Back     alignment and domain information
>PRK11637 AmiB activator; Provisional Back     alignment and domain information
>PF13514 AAA_27: AAA domain Back     alignment and domain information
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms] Back     alignment and domain information
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ] Back     alignment and domain information
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton] Back     alignment and domain information
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information
>KOG4593 consensus Mitotic checkpoint protein MAD1 [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function Back     alignment and domain information
>KOG0963 consensus Transcription factor/CCAAT displacement protein CDP1 [Transcription] Back     alignment and domain information
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures] Back     alignment and domain information
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [] Back     alignment and domain information
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells Back     alignment and domain information
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [] Back     alignment and domain information
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues Back     alignment and domain information
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD Back     alignment and domain information
>cd08389 C2A_Synaptotagmin-14_16 C2A domain first repeat present in Synaptotagmins 14 and 16 Back     alignment and domain information
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function Back     alignment and domain information
>PF14662 CCDC155: Coiled-coil region of CCDC155 Back     alignment and domain information
>KOG0971 consensus Microtubule-associated protein dynactin DCTN1/Glued [Cell cycle control, cell division, chromosome partitioning; Cytoskeleton] Back     alignment and domain information
>smart00787 Spc7 Spc7 kinetochore protein Back     alignment and domain information
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family Back     alignment and domain information
>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>PF09787 Golgin_A5: Golgin subfamily A member 5; InterPro: IPR019177 This entry represents a family of proteins involved in maintaining Golgi structure Back     alignment and domain information
>PRK09039 hypothetical protein; Validated Back     alignment and domain information
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function Back     alignment and domain information
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ] Back     alignment and domain information
>KOG0250 consensus DNA repair protein RAD18 (SMC family protein) [Replication, recombination and repair] Back     alignment and domain information
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5 Back     alignment and domain information
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures] Back     alignment and domain information
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily Back     alignment and domain information
>PF13851 GAS: Growth-arrest specific micro-tubule binding Back     alignment and domain information
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells Back     alignment and domain information
>KOG4593 consensus Mitotic checkpoint protein MAD1 [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1872
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 1e-20
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 8e-19
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 6e-14
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 1e-13
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 4e-12
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 1e-11
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 2e-10
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 6e-10
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 2e-06
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 2e-04
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-19
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 8e-19
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-18
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-17
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-15
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 1e-08
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 2e-07
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 2e-07
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 4e-06
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 6e-06
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 8e-06
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 4e-05
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 5e-04
1sjj_A863 Actinin; 3-helix bundle, calponin homology domain, 4e-06
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 7e-04
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
 Score = 98.4 bits (245), Expect = 1e-20
 Identities = 58/322 (18%), Positives = 129/322 (40%), Gaps = 11/322 (3%)

Query: 994  AQRHTDLFSDLDYLEVELQQLSSKNRDLAQEILALQV-----------VTEEFDRSKQTI 1042
             +            E EL++L  K+  L +E   LQ              E   R     
Sbjct: 863  DEELQRTKERQQKAEAELKELEQKHTQLCEEKNLLQEKLQAETELYAEAEEMRVRLAAKK 922

Query: 1043 SELTEENRALMVALQDKSEESVKLALEVDSFKQSFQSLHDELLVERSLRDDLKSAVSDIT 1102
             EL E    +   ++++ E S +L  E    +Q    L ++L  E + R  L+       
Sbjct: 923  QELEEILHEMEARIEEEEERSQQLQAEKKKMQQQMLDLEEQLEEEEAARQKLQLEKVTAD 982

Query: 1103 SQLSAKHSQLLDFDQQKSELIQKTAVLTKENQDLMVSLQNKSEEAAKLAVELDSVRNSLQ 1162
             ++      +L  + Q ++L ++  +L +   DL  +L  + E+A  L    +   + + 
Sbjct: 983  GKIKKMEDDILIMEDQNNKLTKERKLLEERVSDLTTNLAEEEEKAKNLTKLKNKHESMIS 1042

Query: 1163 SVHDELHGERSLSDELKSRAIDISSQLNEKQQQLIDFDQQNSEMIQKIAELTSENQALMV 1222
             +   L  E     EL+     +  + ++  +Q+ +   Q +E+  ++A+   E QA + 
Sbjct: 1043 ELEVRLKKEEKSRQELEKIKRKLEGESSDLHEQIAELQAQIAELKAQLAKKEEELQAALA 1102

Query: 1223 SLQEYAEESSRLASEGNTSKETLQSLRDELQSERSLRDELKNVVTDLTSQLNEKHCQLLD 1282
             L++   + +    +    +  +  L+++L+SE++ R++ +    DL+ +L     +L D
Sbjct: 1103 RLEDETSQKNNALKKIRELESHISDLQEDLESEKAARNKAEKQKRDLSEELEALKTELED 1162

Query: 1283 LDQQKSELVQLKLLVLDLESEK 1304
                 +   +L+      + +K
Sbjct: 1163 TLDTTATQQELRGSDYKDDDDK 1184


>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>1sjj_A Actinin; 3-helix bundle, calponin homology domain, calmodulin-like domain, actin binding protein, contractIle protein; 20.00A {Gallus gallus} SCOP: i.15.1.1 Length = 863 Back     alignment and structure
>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Length = 284 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query1872
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 97.64
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 97.38
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 97.25
3ol1_A119 Vimentin; structural genomics, PSI-2, protein stru 88.73
3na7_A256 HP0958; flagellar biogenesis, flagellum export, C4 87.59
3swk_A86 Vimentin; cytoskeleton, intermediate filament, alp 85.57
3na7_A256 HP0958; flagellar biogenesis, flagellum export, C4 82.3
3ol1_A119 Vimentin; structural genomics, PSI-2, protein stru 81.72
>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Back     alignment and structure
Probab=97.64  E-value=0.071  Score=55.01  Aligned_cols=10  Identities=10%  Similarity=0.218  Sum_probs=3.8

Q ss_pred             HHHHHHHHhh
Q 000199         1006 YLEVELQQLS 1015 (1872)
Q Consensus      1006 ~~e~elq~l~ 1015 (1872)
                      .|..++..+.
T Consensus        10 ~l~~~~~~~~   19 (284)
T 1c1g_A           10 MLKLDKENAL   19 (284)
T ss_dssp             HHHHHHHHHH
T ss_pred             HHHHhHHHHH
Confidence            3333333333



>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Back     alignment and structure
>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Back     alignment and structure
>3swk_A Vimentin; cytoskeleton, intermediate filament, alpha-helix, structural; 1.70A {Homo sapiens} Back     alignment and structure
>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query1872
d1rlwa_126 Domain from cytosolic phospholipase A2 {Human (Hom 81.0
>d1rlwa_ b.7.1.1 (A:) Domain from cytosolic phospholipase A2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All beta proteins
fold: C2 domain-like
superfamily: C2 domain (Calcium/lipid-binding domain, CaLB)
family: PLC-like (P variant)
domain: Domain from cytosolic phospholipase A2
species: Human (Homo sapiens) [TaxId: 9606]
Probab=81.00  E-value=0.54  Score=43.20  Aligned_cols=46  Identities=13%  Similarity=0.211  Sum_probs=36.4

Q ss_pred             CCCcccccceeecchhhcccccCCeeeeecCCCCCCCCeEEEEeeeec
Q 000199           94 GSSRSSILGEATINLADYADASKPSTVLLPLHGGDSGTILHVTVQLLT  141 (1872)
Q Consensus        94 GSsksgiLGEasinlAdY~~a~kp~~VSLPLk~cnsGtvLHVtIQ~l~  141 (1872)
                      +..+..+||++.|+|+++.. ..+..+.+||+.... .-+|++++.++
T Consensus        80 d~~~d~~lG~~~i~L~~l~~-~~~~~~~~~L~~~~~-g~i~~~l~~~~  125 (126)
T d1rlwa_          80 NYVMDETLGTATFTVSSMKV-GEKKEVPFIFNQVTE-MVLEMSLEVAS  125 (126)
T ss_dssp             CSSCCEEEEEEEEEGGGSCT-TCEEEEEEEETTTEE-EEEEEEEECCC
T ss_pred             CCCCCCeEEEEEEEHHHccC-CCeEEEEEEccCCCe-EEEEEEEEEEe
Confidence            34457899999999999754 678899999988754 45599988765