Citrus Sinensis ID: 039238


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------
MKRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQIDASKENSQLIVLKEILKQREAEIKDLRHQLEYRVNVWSVSADDPSNPRKRINQNHSSSKEAGGRLHDSAKFNDGENSTRIDDRSEKSVGATISREITVEPLEKQGISQDEGFKNGGGIQGKGINEGEGKRKNIGNSTEQIVDDQATVLKPRDDSSEVKGDLQHTITGEWESEKPENSQDEKQEVQTTSKGGMKLEVLDSRTRLKGKHGHVNRTKGRRWRAMAKNRVSENNGASSMSSRRFYRLDQDELRSRSEGQVSTEGLEKREGKETGENPQELSKAMDAKLLKVNPEDAKLMEKRDENADPNQQLQQQNAPDRECLRIKILMIQHLTFQLMQKNGSPMKINNQKKKPTASIKS
cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccHHHHHHHHHHccccccHHHcccccccccccccHHccccHHHHHHHHcccccccccccccccccccccHHHHHHHccccccccccHHccccccccccHHHHHHHHHHHcccccHHHHHHHHHHHccccHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHcccccHHccccccccccccc
ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccEEccccccHHHccccccccccccccccccEccccccccccccHcHcccccHHHHcccccccccccccccccccEEccccHHHccccccccccccccEEEEcccccccccccHHccEEccccccccHcccccccEEEEcccccEEEEEEcccccccccccccccccccEEEEEEEccEEEccccccccccEEEcccccccccccccEEccccccccccccccccccccccccHHHHHccccHHHHHHHHHcccccccHHHHHcccccHHHHHHHHHHHHHHHHHHHHHcccccEEccccccccccccc
MKRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQIDASKENSQLIVLKEILKQREAEIKDLRHQLEYRVNVwsvsaddpsnprkrinqnhssskeaggrlhdsakfndgenstriddrseksvgaTISREItveplekqgisqdegfkngggiqgkginegegkrknignstEQIVDDqatvlkprddssevkgdlqhtitgewesekpensqdekQEVQttskggmklevldsrtrlkgkhghvnrtkgrRWRAMAKNRvsenngassmssrrfyrldqdelrsrsegqvsteglekregketgenpQELSKAMDAKLLKVNPEDAKLMekrdenadpnqqlqqqnapdreCLRIKILMIQHLTFQLMqkngspmkinnqkkkptasiks
mkrkneemkvklyslrtqkmeLDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQReqidaskensqLIVLKEILKQREAEIKDLRHQLEYRVNVwsvsaddpsnprkrinqnhssskeaggrlhdsakfndgenstriddrseksvgatisreitveplekqgisqdegfkngggiqgkginegegkrknignsteqivddqatvlkprddssevkgdLQHTitgewesekpensqdekqevqttskggmklevldsrtrlkgkhghvnrtkgrrwramaknrvsenngassmssrrfyrldqdelrsrsegqvsteglekregketgenpqelskamdakLLKVNPEDAKLMEKrdenadpnqqlqqqnapdrECLRIKILMIQHLTFQLMqkngspmkinnqkkkptasiks
MKRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQIDASKENSQLIVLKEILKQREAEIKDLRHQLEYRVNVWSVSADDPSNPRKRINQNHSSSKEAGGRLHDSAKFNDGENSTRIDDRSEKSVGATISREITVEPLEKQGISQDegfkngggiqgkginegegkRKNIGNSTEQIVDDQATVLKPRDDSSEVKGDLQHTITGEWESEKPENSQDEKQEVQTTSKGGMKLEVLDSRTRLKGKHGHVNRTKGRRWRAMAKNRVSENNGASSMSSRRFYRLDQDELRSRSEGQVSTEGLEKREGKETGENPQELSKAMDAKLLKVNPEDAKLMEKRDENADPNQQLQQQNAPDRECLRIKILMIQHLTFQLMQKNGSPMKINNQKKKPTASIKS
*********************************************************************LIVLKEILKQREAEIKDLRHQLEYRVNVWSV*************************************************************************************************************************************************************************************************************************************************************************************ECLRIKILMIQHLTFQLM**********************
*********************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************
********KVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQIDASKENSQLIVLKEILKQREAEIKDLRHQLEYRVNVWSVSADDPSNPRKRINQNHSSSKEAGGRLHDSAKFNDGENSTRIDDRSEKSVGATISREITVEPLEKQGISQDEGFKNGGGIQGKGINEGEGKRKNIGNSTEQIVDDQATVLK************QHTITGE********************KGGMKLEVLDSRTRLKGKHGHVNRTKGRRWRAMAKN************SRRFYRLDQ******************************LSKAMDAKLLKVNPEDAKLMEKRDENADPNQQLQQQNAPDRECLRIKILMIQHLTFQLMQKNGSPMKI*************
*****EEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQIDASKENSQLIVLKEILKQREAEIKDLRHQLEY***************************************************************************************************************Q*T*L********VK*D********************KQEVQTTSKGGMKLEVLDS***************GRRWRAMAKNRVSENN***************************************************AKL*K**P*DAKLMEKRDENADPNQQLQQQNAPDRECLRIKILMIQHLTFQLMQKNGSP****************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKRKNEEMKVKLYSLRTQxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxSQxxxxxxxxxxxxxxxxxxxxxLEYRVNVWSVSADDPSNPRKRINQNHSSSKEAGGRLHDSAKFNDGENSTRIDDRSEKSVGATISREITVEPLEKQGISQDEGFKNGGGIQGKGINEGEGKRKNIGNSTEQIVDDQATVLKPRDDSSEVKGDLQHTITGEWESEKPENSQDEKQEVQTTSKGGMKLEVLDSRTRLKGKHGHVNRTKGRRWRAMAKNRVSENNGASSMSSRRFYRLDQDELRSRSEGQVSTEGLEKREGKETGENPQELSKAMDAKLLKVNPEDAKLMEKRDENADPNQQLQQQNAPDRECLRIKILMIQHLTFQLMQKNGSPMKINNQKKKPTASIKS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query417
224124620 531 predicted protein [Populus trichocarpa] 0.899 0.706 0.392 5e-46
255553881 602 micronuclear linker histone polyprotein, 0.817 0.566 0.375 1e-38
224127110507 predicted protein [Populus trichocarpa] 0.798 0.656 0.368 3e-37
225447256517 PREDICTED: uncharacterized protein LOC10 0.671 0.541 0.376 2e-34
297739273482 unnamed protein product [Vitis vinifera] 0.671 0.580 0.376 2e-34
356544908515 PREDICTED: uncharacterized protein LOC10 0.729 0.590 0.346 2e-27
356515206499 PREDICTED: uncharacterized protein LOC10 0.671 0.561 0.315 5e-23
388503622473 unknown [Medicago truncatula] 0.609 0.536 0.288 1e-16
358345127473 hypothetical protein MTR_050s0015 [Medic 0.609 0.536 0.288 1e-16
218200563415 hypothetical protein OsI_28026 [Oryza sa 0.211 0.212 0.366 0.0004
>gi|224124620|ref|XP_002319377.1| predicted protein [Populus trichocarpa] gi|222857753|gb|EEE95300.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 178/453 (39%), Positives = 238/453 (52%), Gaps = 78/453 (17%)

Query: 1   MKRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQRE- 59
           MKRK EEMK K+YSLR QKMELDRRLLEMQSTI S+KDEQK +ESALEEKQ+EIKM RE 
Sbjct: 1   MKRKTEEMKAKIYSLRIQKMELDRRLLEMQSTIGSMKDEQKIMESALEEKQSEIKMLRET 60

Query: 60  QIDASKENSQLIVLKEILKQREAEIKDLRHQLEYRVNVWSVSADDPSNPR----KRINQN 115
              A KE  Q+ VL+E LKQ+EAEI+DL+H+LEY   +WSVS DDPS+P       +N  
Sbjct: 61  NKGAEKETLQMEVLRESLKQKEAEIEDLKHRLEYPAKIWSVSTDDPSSPSVNRTATLNMI 120

Query: 116 HSSSKEAGGR-----LHDSAKFNDGENSTRIDDRSEKS-----VGATISREI-------- 157
             +  EAG       L +SA  ++G NST+ +  +  S     VG T + E         
Sbjct: 121 SENKIEAGKSEEEVLLQESANDSNGLNSTKGNGGNTTSINQEQVGDTATVENASESKVAI 180

Query: 158 ---TVEPLEKQGISQDEGFKNGGGI-------------QGKGINEGEGKRKNIGNSTEQI 201
              T EP E+Q +   EG +NG  I             Q KG + G G+  N  N+TE  
Sbjct: 181 PDRTEEPREEQ-LQMLEGSRNGRAIGITNDQVNKSESSQEKGTS-GSGEENNASNATETN 238

Query: 202 VDDQATVLKPRDDSSE--VKGDLQHTIT--GEWESEKPENSQDEKQEVQTTSKGGMKLEV 257
           V     V K  D  +E   K   +H +T  G+ E E       E +  Q T +GG+KL++
Sbjct: 239 VSRIGRVSKITDADNEEKSKDGEEHKVTRDGKLELENVL----EAEGHQETFRGGVKLKM 294

Query: 258 LD-SRTRLKGKHGHVNRTKGRRWRAMAKNRV-----SENNGASSMSSRRFYRLDQDELRS 311
           +D SR   + K+ H +R +G+R     +NR+      ENNGA  M SR+    DQ  L  
Sbjct: 295 MDNSRNTRREKYRHASRVRGKRGEMGTRNRLLEIRNHENNGAEKMRSRKSPTDDQGRLID 354

Query: 312 RSEGQVSTEGLEKREGKETGENPQELSKAMDAKLLKV----NPEDAKLMEKRDENADPNQ 367
           R EG+ S +G             +E+ KA+D+   K     N ED+K ++ +      N 
Sbjct: 355 REEGRASNDG-----------KTEEIRKAVDSSDGKTMEHQNHEDSKDLQNKLGKDGTNH 403

Query: 368 QLQQQNAPDRECLRIKILMIQHLTFQLMQKNGS 400
           Q+    + D E L  + L I H + +L   NGS
Sbjct: 404 QM----SEDHETL--ERLRIAHDSKELT--NGS 428




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255553881|ref|XP_002517981.1| micronuclear linker histone polyprotein, putative [Ricinus communis] gi|223542963|gb|EEF44499.1| micronuclear linker histone polyprotein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224127110|ref|XP_002329400.1| predicted protein [Populus trichocarpa] gi|222870450|gb|EEF07581.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225447256|ref|XP_002272957.1| PREDICTED: uncharacterized protein LOC100246486 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297739273|emb|CBI28924.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356544908|ref|XP_003540889.1| PREDICTED: uncharacterized protein LOC100791306 [Glycine max] Back     alignment and taxonomy information
>gi|356515206|ref|XP_003526292.1| PREDICTED: uncharacterized protein LOC100814540 [Glycine max] Back     alignment and taxonomy information
>gi|388503622|gb|AFK39877.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|358345127|ref|XP_003636634.1| hypothetical protein MTR_050s0015 [Medicago truncatula] gi|355502569|gb|AES83772.1| hypothetical protein MTR_050s0015 [Medicago truncatula] Back     alignment and taxonomy information
>gi|218200563|gb|EEC82990.1| hypothetical protein OsI_28026 [Oryza sativa Indica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query417
SGD|S000001803 1875 MLP1 "Myosin-like protein asso 0.877 0.195 0.189 1.2e-07
DICTYBASE|DDB_G0290503 1492 DDB_G0290503 [Dictyostelium di 0.913 0.255 0.211 7e-07
DICTYBASE|DDB_G0271058 1528 vilC "villin-like protein C" [ 0.868 0.236 0.207 8.9e-06
UNIPROTKB|E2R2W8 1835 PDE4DIP "Uncharacterized prote 0.177 0.040 0.328 2.2e-05
UNIPROTKB|E1BRE5 1411 EEA1 "Uncharacterized protein" 0.964 0.284 0.191 2.2e-05
FB|FBgn0265045 9270 Strn-Mlck "Stretchin-Mlck" [Dr 0.812 0.036 0.214 2.4e-05
UNIPROTKB|Q3V6T2 1871 CCDC88A "Girdin" [Homo sapiens 0.944 0.210 0.204 3.1e-05
UNIPROTKB|I3LK17 1741 CCDC88A "Uncharacterized prote 0.956 0.229 0.205 3.6e-05
UNIPROTKB|F1SQK4 1819 CCDC88A "Uncharacterized prote 0.956 0.219 0.205 3.8e-05
WB|WBGene00019064 1974 myo-5 [Caenorhabditis elegans 0.925 0.195 0.202 4.2e-05
SGD|S000001803 MLP1 "Myosin-like protein associated with the nuclear envelope" [Saccharomyces cerevisiae (taxid:4932)] Back     alignment and assigned GO terms
 Score = 158 (60.7 bits), Expect = 1.2e-07, P = 1.2e-07
 Identities = 74/390 (18%), Positives = 183/390 (46%)

Query:     1 MKRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQ 60
             ++  N  ++ +L +   +K EL   L +++  +  ++ E  AL+ +++EK+ E+K+ +E+
Sbjct:  1219 LRESNITLRNELENNNNKKKELQSELDKLKQNVAPIESELTALKYSMQEKEQELKLAKEE 1278

Query:    61 IDASKENSQLIVLK-EIL-----KQREAEIKDLRHQLEYRVNVWSVSADDPSNPRKRINQ 114
             +   K+ SQ I+ K E L     ++ E+EI++L+ +LE +    + + +  +  R++  +
Sbjct:  1279 VHRWKKRSQDILEKHEQLSSSDYEKLESEIENLKEELENKERQGAEAEEKFNRLRRQAQE 1338

Query:   115 NHSSSKEAGGRLHDSAK-FNDG----ENS-TRIDDRSEKSVGATISR-EITVEPLEKQGI 167
                +SK +   L +      D     ENS +  + R E+   A +++    +E + K   
Sbjct:  1339 RLKTSKLSQDSLTEQVNSLRDAKNVLENSLSEANARIEELQNAKVAQGNNQLEAIRKL-- 1396

Query:   168 SQDXXXXXXXXXXXXXXXXXXXXRKNIGNSTEQIVDDQATVLKPRDDSSEVKG---DLQH 224
              Q+                       I    E+I   +  + K R    +++    + Q+
Sbjct:  1397 -QEDAEKASRELQAKLEESTTSYESTINGLNEEITTLKEEIEKQRQIQQQLQATSANEQN 1455

Query:   225 TITGEWESEKPENSQDEKQEVQTTSKGGMKLEVLDSRTRLKGKHGHVNRTK-GRRWRAMA 283
              ++   ES K    +D+ + ++  ++  +  ++L+++ RL  +  ++N  +  ++W +  
Sbjct:  1456 DLSNIVESMKKSFEEDKIKFIKEKTQE-VNEKILEAQERLN-QPSNINMEEIKKKWESEH 1513

Query:   284 KNRVSENNGASSMSSRRFYRLDQDELRSRSEGQVSTEGLEKREGKETGENPQELSKAMDA 343
             +  VS+    +  + ++  RL  +E  ++   +   E LEK   ++  E  + + ++ + 
Sbjct:  1514 EQEVSQKIREAEEALKKRIRLPTEEKINKIIERKKEE-LEKEFEEKVEERIKSMEQSGEI 1572

Query:   344 KLLKVNPEDAKLMEKRDE-NADPNQQLQQQ 372
              ++     +AK+ EK+ E   + N++LQ++
Sbjct:  1573 DVVLRKQLEAKVQEKQKELENEYNKKLQEE 1602




GO:0043021 "ribonucleoprotein complex binding" evidence=IGI;IPI
GO:0051028 "mRNA transport" evidence=IEA
GO:0006281 "DNA repair" evidence=IEA
GO:0005634 "nucleus" evidence=IEA
GO:0005816 "spindle pole body" evidence=IDA
GO:0090204 "protein localization to nuclear pore" evidence=IMP
GO:0090203 "transcriptional activation by promoter-terminator looping" evidence=IMP
GO:1901925 "negative regulation of protein import into nucleus during spindle assembly checkpoint" evidence=IGI
GO:0006606 "protein import into nucleus" evidence=IEA;IGI
GO:0005643 "nuclear pore" evidence=IEA
GO:0034398 "telomere tethering at nuclear periphery" evidence=IGI
GO:0016973 "poly(A)+ mRNA export from nucleus" evidence=IMP
GO:0030529 "ribonucleoprotein complex" evidence=IPI
GO:0006974 "response to DNA damage stimulus" evidence=IEA
GO:0006810 "transport" evidence=IEA
GO:0071048 "nuclear retention of unspliced pre-mRNA at the site of transcription" evidence=IGI;IMP
GO:0016021 "integral to membrane" evidence=ISM
GO:0005654 "nucleoplasm" evidence=IDA
GO:0005635 "nuclear envelope" evidence=IDA
DICTYBASE|DDB_G0290503 DDB_G0290503 [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0271058 vilC "villin-like protein C" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|E2R2W8 PDE4DIP "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|E1BRE5 EEA1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
FB|FBgn0265045 Strn-Mlck "Stretchin-Mlck" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|Q3V6T2 CCDC88A "Girdin" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|I3LK17 CCDC88A "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|F1SQK4 CCDC88A "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
WB|WBGene00019064 myo-5 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query417
TIGR02169 1164 TIGR02169, SMC_prok_A, chromosome segregation prot 2e-04
COG2433652 COG2433, COG2433, Uncharacterized conserved protei 0.001
COG1196 1163 COG1196, Smc, Chromosome segregation ATPases [Cell 0.003
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
 Score = 43.1 bits (102), Expect = 2e-04
 Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 5/96 (5%)

Query: 2   KRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQI 61
           KR+  E+K +L  L+ +   L   L ++ + I  ++ +   LE   E+K  EIK Q  ++
Sbjct: 398 KREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKL 457

Query: 62  -----DASKENSQLIVLKEILKQREAEIKDLRHQLE 92
                D SK   +L  LKE   + E E+  L+ +L 
Sbjct: 458 EQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA 493


SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]. Length = 1164

>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 417
PF09726697 Macoilin: Transmembrane protein; InterPro: IPR0191 95.22
PRK11637428 AmiB activator; Provisional 90.6
PRK11637428 AmiB activator; Provisional 89.02
PF00038312 Filament: Intermediate filament protein; InterPro: 88.57
PF12761195 End3: Actin cytoskeleton-regulatory complex protei 87.96
PF11559151 ADIP: Afadin- and alpha -actinin-Binding; InterPro 87.67
PF07106169 TBPIP: Tat binding protein 1(TBP-1)-interacting pr 86.05
PF10146230 zf-C4H2: Zinc finger-containing protein ; InterPro 85.85
PF07888546 CALCOCO1: Calcium binding and coiled-coil domain ( 85.59
PF12718143 Tropomyosin_1: Tropomyosin like; InterPro: IPR0005 84.63
COG4026290 Uncharacterized protein containing TOPRIM domain, 84.5
PRK09039343 hypothetical protein; Validated 84.44
PF13851201 GAS: Growth-arrest specific micro-tubule binding 84.15
KOG0977546 consensus Nuclear envelope protein lamin, intermed 83.55
PF00038312 Filament: Intermediate filament protein; InterPro: 82.77
PF08614194 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR 81.73
PF10473140 CENP-F_leu_zip: Leucine-rich repeats of kinetochor 81.41
PRK10884206 SH3 domain-containing protein; Provisional 81.15
PF07106169 TBPIP: Tat binding protein 1(TBP-1)-interacting pr 81.1
TIGR021691164 SMC_prok_A chromosome segregation protein SMC, pri 80.92
COG1196 1163 Smc Chromosome segregation ATPases [Cell division 80.39
COG2433652 Uncharacterized conserved protein [Function unknow 80.34
PF05667594 DUF812: Protein of unknown function (DUF812); Inte 80.32
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information
Probab=95.22  E-value=0.087  Score=56.87  Aligned_cols=93  Identities=16%  Similarity=0.328  Sum_probs=67.3

Q ss_pred             cchhhhhhhhhhhhhhhhhHHHHHHhhhhhhhHhhHHHHHHHHHHHHHHHHHHHHHHHHhh-------------------
Q 039238            2 KRKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQID-------------------   62 (417)
Q Consensus         2 kRK~EEmkAK~~sLr~QKmEL~rkl~emqSTi~sLKdeQK~lEsAl~EKq~eI~~m~E~~~-------------------   62 (417)
                      |.+..+|+..+..||.+-.....++..|++-+..|....|+-+...+..-+.|.-||++..                   
T Consensus       544 r~r~~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~~e~~~~~e~L~~aL~amqdk~~~LE~sLsaEtriKldLfsa  623 (697)
T PF09726_consen  544 RQRRRQLESELKKLRRELKQKEEQIRELESELQELRKYEKESEKDTEVLMSALSAMQDKNQHLENSLSAETRIKLDLFSA  623 (697)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence            3455678888888888888888888888888877776534444444445556666776665                   


Q ss_pred             hccchhHHHHHHHHHHHhHHHHHHHHHhhhcc
Q 039238           63 ASKENSQLIVLKEILKQREAEIKDLRHQLEYR   94 (417)
Q Consensus        63 a~k~~~Qm~aL~E~LkqKEAEiEdlK~rle~P   94 (417)
                      .+---.|++.+--.|.++|.||+|||.++..=
T Consensus       624 Lg~akrq~ei~~~~~~~~d~ei~~lk~ki~~~  655 (697)
T PF09726_consen  624 LGDAKRQLEIAQGQLRKKDKEIEELKAKIAQL  655 (697)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            22226789999999999999999999998543



; GO: 0016021 integral to membrane

>PRK11637 AmiB activator; Provisional Back     alignment and domain information
>PRK11637 AmiB activator; Provisional Back     alignment and domain information
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope Back     alignment and domain information
>PF12761 End3: Actin cytoskeleton-regulatory complex protein END3 Back     alignment and domain information
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch Back     alignment and domain information
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences Back     alignment and domain information
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT) Back     alignment and domain information
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells Back     alignment and domain information
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only] Back     alignment and domain information
>PRK09039 hypothetical protein; Validated Back     alignment and domain information
>PF13851 GAS: Growth-arrest specific micro-tubule binding Back     alignment and domain information
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure] Back     alignment and domain information
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope Back     alignment and domain information
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells Back     alignment and domain information
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ] Back     alignment and domain information
>PRK10884 SH3 domain-containing protein; Provisional Back     alignment and domain information
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences Back     alignment and domain information
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type Back     alignment and domain information
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Back     alignment and domain information
>COG2433 Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query417
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 1e-05
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 2e-04
3lvg_D190 LCB, clathrin light chain B; SELF assembly, coated 3e-04
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
 Score = 46.6 bits (111), Expect = 1e-05
 Identities = 18/90 (20%), Positives = 34/90 (37%), Gaps = 2/90 (2%)

Query: 3    RKNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQID 62
            R  E  K     L  + M+L R++ E      SL ++   LE     +  +++   E++ 
Sbjct: 914  RSVERYKKLHIGLENKIMQLQRKIDEQNKEYKSLLEKMNNLEITYSTETEKLRSDVERL- 972

Query: 63   ASKENSQLIVLKEILKQREAEIKDLRHQLE 92
                  +       +   + EI  LR +L 
Sbjct: 973  -RMSEEEAKNATNRVLSLQEEIAKLRKELH 1001


>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query417
3na7_A256 HP0958; flagellar biogenesis, flagellum export, C4 92.15
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 91.7
1c1g_A284 Tropomyosin; contractIle protein; 7.00A {Sus scrof 89.35
3mq7_A121 Bone marrow stromal antigen 2; HIV, antiviral prot 86.71
3o0z_A168 RHO-associated protein kinase 1; coiled-coil, tran 86.28
1l8d_A112 DNA double-strand break repair RAD50 ATPase; zinc 85.78
3tnu_B129 Keratin, type II cytoskeletal 5; coiled-coil, stru 83.89
3tnu_A131 Keratin, type I cytoskeletal 14; coiled-coil, stru 83.41
3tnu_A131 Keratin, type I cytoskeletal 14; coiled-coil, stru 80.54
>3na7_A HP0958; flagellar biogenesis, flagellum export, C4 Zn-ribbon, coiled post-transcriptional, gene regulation, chaperone; HET: EPE; 2.20A {Helicobacter pylori} Back     alignment and structure
Probab=92.15  E-value=1.3  Score=40.16  Aligned_cols=64  Identities=16%  Similarity=0.178  Sum_probs=48.6

Q ss_pred             hhhhhhhhhhhhhhhhhHHHHHHhhhhhhhHhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccch
Q 039238            4 KNEEMKVKLYSLRTQKMELDRRLLEMQSTIDSLKDEQKALESALEEKQNEIKMQREQIDASKEN   67 (417)
Q Consensus         4 K~EEmkAK~~sLr~QKmEL~rkl~emqSTi~sLKdeQK~lEsAl~EKq~eI~~m~E~~~a~k~~   67 (417)
                      ...++-+.+..|..+...|.+++...+..+..|+.+.+.+|..+++....|...+++....+.+
T Consensus        26 ~~~~lp~el~~le~~~~~l~~~~~~~~~~l~d~~~~~~~~e~~i~~~~~ri~~~~~~l~~v~~~   89 (256)
T 3na7_A           26 LIREKRKDLDKALNDKEAKNKAILNLEEEKLALKLQVSKNEQTLQDTNAKIASIQKKMSEIKSE   89 (256)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCSSS
T ss_pred             HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCH
Confidence            3445667777777777888888888888888888888888888888888888777776644443



>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Back     alignment and structure
>1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A Back     alignment and structure
>3mq7_A Bone marrow stromal antigen 2; HIV, antiviral protein; 2.28A {Homo sapiens} PDB: 3mqc_A 3mqb_A 3mkx_A 3nwh_A 2xg7_A* 2x7a_A Back     alignment and structure
>3o0z_A RHO-associated protein kinase 1; coiled-coil, transferase; HET: MSE; 2.33A {Homo sapiens} Back     alignment and structure
>1l8d_A DNA double-strand break repair RAD50 ATPase; zinc finger, DNA repair, recombination, HOOK motif, replication; HET: DNA CIT; 2.20A {Pyrococcus furiosus} SCOP: h.4.12.1 Back     alignment and structure
>3tnu_B Keratin, type II cytoskeletal 5; coiled-coil, structural support, cytosolic protein; 3.00A {Homo sapiens} Back     alignment and structure
>3tnu_A Keratin, type I cytoskeletal 14; coiled-coil, structural support, cytosolic protein; 3.00A {Homo sapiens} Back     alignment and structure
>3tnu_A Keratin, type I cytoskeletal 14; coiled-coil, structural support, cytosolic protein; 3.00A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00