Citrus Sinensis ID: 017472


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-
MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQAAVALTSGRMAKAISNELSRLEAERSSRQAEMAEIRSQLFDSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVEDEARKSQARAKVLEEVTRRWRWGNHA
cccccccccccccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccHHHHHHHHHHHccccccHHHHHHHHHHccccccccccHHHHHHHHHHHHHcccHHHHHHccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
ccEEcccccccccccHHHHHHHHHHHHHHHHccccccEEEcccccccccccccccccccccHHHHHHHHHccccccccccccccccccccccccEEcccccccHHHHHHHHccHHccccccccHHHHHHHcccccHHHccHHHHHHHHHHHHcccHHHHHHHHcccccccccccccHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccc
mqiieedvnadelctRREYARWLVRINSLlernpklrivpckslsgtvvaafndvdvedpdiESIQALAeagvipsqllgkhygsdgskgqggiyffperfisRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFgqskrfqpnkpstkAQAAVALTSGRMAKAISNELSRLEAERSSRQAEMAEIRSQlfdsgdiqrWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVEDEARKSQARAKVLEEVTRRWRWGNHA
mqiieedvnadelctrreyARWLVRInsllernpklrivpckslsgtVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQIsrtkmsymdVREINSEASLGLFMDMLAGEKSIARRVFGQskrfqpnkpstkAQAAVALTSGRMAKAISNELSRLEAERSSRQAEMAEIrsqlfdsgdiqrWWDKKFSeerargfevekLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLeglldtksrseaEKEALRILRSWVEDEARKSQARAKVleevtrrwrwgnha
MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQAAVALTSGRMAKAISNELSRLEAERSSRQAEMAEIRSQLFDSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVEDEARKSQARAKVLEEVTRRWRWGNHA
********NADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIAR*********************************************************LFDSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLK***************TYVAEKCTLQDTL*************************ILRSWV****************VTRRWR*****
*QI**EDVNADELCTRREYARWLVRINSLLE***************TVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLG*********GQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQAAVALTSGRMAKA***********************************************************************************************************************************************************************************
MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSK*************AVALTSGRMAKAISNE**************MAEIRSQLFDSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVED********AKVLEEVTRRWRWGNHA
MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIP****************GGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQAAVALTSGRMAKAISNELSRLEAERSSRQAEMAEIRSQLFDSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVEDEARKSQARAKVLEEVTRRWRW****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQAAVALTSGRMxxxxxxxxxxxxxxxxxxxxxxxxxxxxLFDSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLxxxxxxxxxxxxxxxxxxxxxYVAEKCTLQDTxxxxxxxxxxxxxxxxxxxxxKEALRILRSWVEDEARKSQARAKVLEEVTRRWRWGNHA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query371
224111752 566 predicted protein [Populus trichocarpa] 0.997 0.653 0.690 1e-142
255566819 564 conserved hypothetical protein [Ricinus 0.997 0.656 0.663 1e-136
297737877 576 unnamed protein product [Vitis vinifera] 1.0 0.644 0.653 1e-133
359472711 547 PREDICTED: uncharacterized protein LOC10 1.0 0.678 0.653 1e-132
449435256 595 PREDICTED: uncharacterized protein LOC10 0.997 0.621 0.628 1e-130
147860148 836 hypothetical protein VITISV_020007 [Viti 1.0 0.443 0.638 1e-130
449493510 1097 PREDICTED: LOW QUALITY PROTEIN: uncharac 0.994 0.336 0.625 1e-129
356502489 536 PREDICTED: uncharacterized protein LOC10 0.991 0.686 0.631 1e-125
356498348 535 PREDICTED: uncharacterized protein LOC10 0.991 0.687 0.626 1e-123
297814772 558 hypothetical protein ARALYDRAFT_904734 [ 0.986 0.655 0.592 1e-121
>gi|224111752|ref|XP_002315964.1| predicted protein [Populus trichocarpa] gi|222865004|gb|EEF02135.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  510 bits (1313), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 256/371 (69%), Positives = 307/371 (82%), Gaps = 1/371 (0%)

Query: 1   MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDP 60
           ++IIE+DV ADELCTRREYARWL+R+NS+LERN K RIVP  SLSG+V+AAF+D+ VEDP
Sbjct: 197 LKIIEDDVAADELCTRREYARWLLRLNSMLERNQKHRIVPSISLSGSVIAAFDDLGVEDP 256

Query: 61  DIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKP 120
           D ESIQALAE+G+IPS+L G +  +D S G+    F+PERFISR DLINWKAQ++Y   P
Sbjct: 257 DFESIQALAESGIIPSKLSGTNSCADSSDGRS-FCFYPERFISRQDLINWKAQLEYGFLP 315

Query: 121 DIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQA 180
            I EQ+S+TK+ YMDV+EI+S+A+  L  DMLAG+KSI R+VFGQS+RFQPNKP TKAQA
Sbjct: 316 GITEQMSKTKVYYMDVKEISSDATPELLTDMLAGDKSIIRKVFGQSRRFQPNKPLTKAQA 375

Query: 181 AVALTSGRMAKAISNELSRLEAERSSRQAEMAEIRSQLFDSGDIQRWWDKKFSEERARGF 240
           AVAL SGRM++A+ NE+ RLEAE+S RQA + EIR++  + GDI+R+WD+K +EE+ RGF
Sbjct: 376 AVALISGRMSEAVYNEILRLEAEKSLRQATVKEIRNEFLERGDIKRFWDEKMNEEKIRGF 435

Query: 241 EVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERA 300
           EVEKLYIAA  DLEEE IVQ K Y E LKEKAAMDCQRQLLL+LK+EVDEMS RL SER+
Sbjct: 436 EVEKLYIAALHDLEEEKIVQVKTYEEYLKEKAAMDCQRQLLLHLKEEVDEMSERLASERS 495

Query: 301 TYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVEDEARKSQARAKVLEE 360
            Y AE+C LQ+ LS LQ K E +LDTKS  EAE EALRILRSWVEDEARKSQARA+VLEE
Sbjct: 496 VYAAEQCNLQELLSKLQFKQEVMLDTKSILEAEIEALRILRSWVEDEARKSQARARVLEE 555

Query: 361 VTRRWRWGNHA 371
           V RRW+W N A
Sbjct: 556 VGRRWKWDNQA 566




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255566819|ref|XP_002524393.1| conserved hypothetical protein [Ricinus communis] gi|223536354|gb|EEF38004.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|297737877|emb|CBI27078.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|359472711|ref|XP_003631189.1| PREDICTED: uncharacterized protein LOC100259365 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449435256|ref|XP_004135411.1| PREDICTED: uncharacterized protein LOC101214855 [Cucumis sativus] Back     alignment and taxonomy information
>gi|147860148|emb|CAN78724.1| hypothetical protein VITISV_020007 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449493510|ref|XP_004159324.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101228994 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356502489|ref|XP_003520051.1| PREDICTED: uncharacterized protein LOC100813930 [Glycine max] Back     alignment and taxonomy information
>gi|356498348|ref|XP_003518015.1| PREDICTED: uncharacterized protein LOC100803220 [Glycine max] Back     alignment and taxonomy information
>gi|297814772|ref|XP_002875269.1| hypothetical protein ARALYDRAFT_904734 [Arabidopsis lyrata subsp. lyrata] gi|297321107|gb|EFH51528.1| hypothetical protein ARALYDRAFT_904734 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query371
TAIR|locus:2102375558 AT3G25680 "AT3G25680" [Arabido 0.989 0.657 0.579 1.4e-110
TAIR|locus:2172853 946 AT5G23890 [Arabidopsis thalian 0.967 0.379 0.345 2.5e-54
TAIR|locus:2156687 761 AT5G52410 [Arabidopsis thalian 0.978 0.477 0.323 1.8e-48
UNIPROTKB|Q0VF96 1302 CGNL1 "Cingulin-like protein 1 0.625 0.178 0.256 1.5e-08
MGI|MGI:1915428 1298 Cgnl1 "cingulin-like 1" [Mus m 0.450 0.128 0.284 1.7e-06
UNIPROTKB|G5E5V6 1293 LOC517509 "Uncharacterized pro 0.633 0.181 0.233 2.3e-06
UNIPROTKB|Q27991 1976 MYH10 "Myosin-10" [Bos taurus 0.549 0.103 0.286 3.7e-05
ZFIN|ZDB-GENE-991123-5 1938 vmhc "ventricular myosin heavy 0.630 0.120 0.226 6.6e-05
UNIPROTKB|E2QV06 1295 CGNL1 "Uncharacterized protein 0.466 0.133 0.255 8e-05
UNIPROTKB|P35580 1976 MYH10 "Myosin-10" [Homo sapien 0.549 0.103 0.282 0.00017
TAIR|locus:2102375 AT3G25680 "AT3G25680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1092 (389.5 bits), Expect = 1.4e-110, P = 1.4e-110
 Identities = 215/371 (57%), Positives = 282/371 (76%)

Query:     1 MQIIEEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDP 60
             ++I E+D+ ADELCT+REYARWLVR NSLLERNP   IVP  +L+G+ + AF+D++  DP
Sbjct:   192 LKIYEDDIVADELCTKREYARWLVRSNSLLERNPMHMIVPAVALAGSSIPAFDDINTSDP 251

Query:    61 DIESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLINWKAQVDYEVKP 120
             D E IQALAEAG+  S+L G+    D     G   F PE F+SR DL+NWKAQ++    P
Sbjct:   252 DFEYIQALAEAGITSSKLSGE----DSRNDLGNSNFNPESFVSRLDLVNWKAQLECGFHP 307

Query:   121 DIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVFGQSKRFQPNKPSTKAQA 180
             +I+E+ISRTK+ Y+D + IN + +LG F+D L G+KS  R VFG+ KRFQPN+P TKAQA
Sbjct:   308 EIMEEISRTKVDYIDTKNINPDMALGFFLDFLMGDKSTIRNVFGRIKRFQPNRPVTKAQA 367

Query:   181 AVALTSGRMAKAISNELSRLEAERSSRQAEMAEIRSQLFDSGDIQRWWDKKFSEERARGF 240
             AVALTSG+M KAI+ ELSRLEAE  S++AE  EIRS+L + G+I+++WD+K   ER+RGF
Sbjct:   368 AVALTSGKMVKAITAELSRLEAESLSQKAETEEIRSELLEKGEIRQFWDEKIQAERSRGF 427

Query:   241 EVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQLLLNLKDEVDEMSGRLESERA 300
             E+E+LY++   ++EEE   QEK  AE LKEKAA+DCQ+QLL +L +E+DEMS RL S+++
Sbjct:   428 EMEELYLSRVNEVEEEKTTQEKWSAERLKEKAAIDCQKQLLNSLTEEIDEMSQRLISDKS 487

Query:   301 TYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRILRSWVEDEARKSQARAKVLEE 360
              Y+ E   LQ+ LSDLQ+KLE L+D +S  EAE EALRILRSW+EDE + SQARAKVLEE
Sbjct:   488 VYLTEHSKLQEMLSDLQSKLESLIDKRSILEAEVEALRILRSWIEDEGKASQARAKVLEE 547

Query:   361 VTRRWRWGNHA 371
               RRW+W ++A
Sbjct:   548 AGRRWKWNDNA 558




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0009536 "plastid" evidence=IDA
TAIR|locus:2172853 AT5G23890 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2156687 AT5G52410 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q0VF96 CGNL1 "Cingulin-like protein 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1915428 Cgnl1 "cingulin-like 1" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|G5E5V6 LOC517509 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q27991 MYH10 "Myosin-10" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-991123-5 vmhc "ventricular myosin heavy chain" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|E2QV06 CGNL1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|P35580 MYH10 "Myosin-10" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.X.389.1
hypothetical protein (554 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 371
PF0039545 SLH: S-layer homology domain; InterPro: IPR001119 99.23
PF0039545 SLH: S-layer homology domain; InterPro: IPR001119 98.37
PF08317325 Spc7: Spc7 kinetochore protein; InterPro: IPR01325 94.31
smart00787312 Spc7 Spc7 kinetochore protein. This domain is foun 91.79
PF09726697 Macoilin: Transmembrane protein; InterPro: IPR0191 90.46
PRK00106 535 hypothetical protein; Provisional 89.83
KOG1029 1118 consensus Endocytic adaptor protein intersectin [S 86.17
PRK00106 535 hypothetical protein; Provisional 86.04
TIGR01069 771 mutS2 MutS2 family protein. Function of MutS2 is u 85.13
KOG0161 1930 consensus Myosin class II heavy chain [Cytoskeleto 83.2
KOG0161 1930 consensus Myosin class II heavy chain [Cytoskeleto 83.17
COG4942 420 Membrane-bound metallopeptidase [Cell division and 82.87
PF09726 697 Macoilin: Transmembrane protein; InterPro: IPR0191 80.17
>PF00395 SLH: S-layer homology domain; InterPro: IPR001119 S-layers are paracrystalline mono-layered assemblies of (glyco)proteins which coat the surface of bacteria [, ] Back     alignment and domain information
Probab=99.23  E-value=8.8e-12  Score=89.12  Aligned_cols=44  Identities=30%  Similarity=0.307  Sum_probs=32.0

Q ss_pred             CCcCCCCCCCCh-HHHHHHHHcCCccCCcCCCCCCCCCCCCCCCccCCCCCCCcHHHHH
Q 017472           51 AFNDVDVEDPDI-ESIQALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDLI  108 (371)
Q Consensus        51 aF~DV~~~hp~~-~~IqalaeaGIIsG~lsg~~~~~~~~d~~~~~~F~Pd~pITRqEma  108 (371)
                      .|+||+..+|+| .+|+.|++.|||.|+              ++++|+|+++|||+|||
T Consensus         1 ~F~Dv~~~~~~~a~~i~~~~~~gi~~G~--------------~~~~f~P~~~iTR~e~A   45 (45)
T PF00395_consen    1 PFKDVPSISWAYAEAIQWLYQLGIISGY--------------SDGTFNPNDPITRAEAA   45 (45)
T ss_dssp             -BTTB-TTSSSTTHHHHHHHHTTSS-----------------TTS---TTSB-BHHHHH
T ss_pred             CCCCCCCCcHHHHHHHHHHHHcCCcccC--------------CCCeECCCCCcCHHHhC
Confidence            599999999966 999999999999987              35799999999999986



Several S-layer proteins and some other cell wall proteins contain one or more copies of a domain of about 50-60 residues, which has been called SLH (for S-layer homology). Although it was originally proposed that SLH domains bind to peptidoglycan, it is now evident that pyruvylated secondary cell wall polymers (SCWPs), which are either teichoic acids, teichuronic acids, lipoteichoic acids or lipoglycans, serve as the anchoring structures for SLH motifs in the Gram-positive cell wall [, ]. However, the study of S-layer protein SbpA of Bacillus sphaericus revealed that SLH motifs are not sufficient for specific binding to SCWPs. Thus, the molecular basis explaining SLH affinity and specificity of interaction with cell wall polymers are not completely elucidated [].; PDB: 3PYW_A.

>PF00395 SLH: S-layer homology domain; InterPro: IPR001119 S-layers are paracrystalline mono-layered assemblies of (glyco)proteins which coat the surface of bacteria [, ] Back     alignment and domain information
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [] Back     alignment and domain information
>smart00787 Spc7 Spc7 kinetochore protein Back     alignment and domain information
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information
>PRK00106 hypothetical protein; Provisional Back     alignment and domain information
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
>PRK00106 hypothetical protein; Provisional Back     alignment and domain information
>TIGR01069 mutS2 MutS2 family protein Back     alignment and domain information
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] Back     alignment and domain information
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] Back     alignment and domain information
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Back     alignment and domain information
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query371
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-08
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-04
1i84_S 1184 Smooth muscle myosin heavy chain; muscle protein, 4e-06
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 6e-06
1i84_S 1184 Smooth muscle myosin heavy chain; muscle protein, 2e-04
1i84_S1184 Smooth muscle myosin heavy chain; muscle protein, 7e-04
2dfs_A1080 Myosin-5A; myosin-V, inhibited state, cryoelectron 4e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 54.5 bits (130), Expect = 4e-08
 Identities = 49/401 (12%), Positives = 114/401 (28%), Gaps = 109/401 (27%)

Query: 1   MQIIEEDVNADELC-----------TRREYARWLVRINSLLERNPKLRIVPCKSLSGT-V 48
           + + E+    +  C           ++ E    ++     +     LR+          +
Sbjct: 22  LSVFEDAFVDNFDCKDVQDMPKSILSKEEIDH-IIMSKDAVSGT--LRLFWTLLSKQEEM 78

Query: 49  VAAFNDVDVEDPDIESI-QALAEAGVIPSQLLGKHYGSDGSKGQGGIYFFPERFISRYDL 107
           V  F + +V   + + +   +      PS ++ + Y     +       F +  +SR   
Sbjct: 79  VQKFVE-EVLRINYKFLMSPIKTEQRQPS-MMTRMYIEQRDRLYNDNQVFAKYNVSR--- 133

Query: 108 INWKAQVDYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDML-AGEKSIARRVFGQS 166
                            ++ R  +      E+    ++ +   +L +G+  +A  V   S
Sbjct: 134 ------------LQPYLKL-RQAL-----LELRPAKNVLID-GVLGSGKTWVALDVC-LS 173

Query: 167 KRFQPNKPSTKAQAAVA-LTSGRMAKA------ISNELSRLEAERSSRQAEMAEIRSQLF 219
            + Q           +  L              +   L +++   +SR    + I+ ++ 
Sbjct: 174 YKVQCKMDF-----KIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIH 228

Query: 220 DSGDIQRWWDKKFSEERARGFEVEKLYIAARCDLEEELIVQEKNYAEDLKEKAAMDCQRQ 279
                         +   R     K Y    C L   L++     A+      A +   +
Sbjct: 229 SI------------QAELRRLLKSKPY--ENC-L---LVLLNVQNAKAWN---AFNLSCK 267

Query: 280 LLLNLKDEVDEMSGRLESERATYVAEKCTLQDTLSDLQTKLEGLLDTKSRSEAEKEALRI 339
           +LL  + +                     + D LS   T     LD  S +    E   +
Sbjct: 268 ILLTTRFK--------------------QVTDFLSAATTT-HISLDHHSMTLTPDEVKSL 306

Query: 340 LRSWVEDEARK--SQAR----------AKVLEEVTRRW-RW 367
           L  +++   +    +            A+ + +    W  W
Sbjct: 307 LLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNW 347


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 Back     alignment and structure
>2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query371
3pyw_A203 S-layer protein SAP; SLH-domains, polysaccharide b 99.95
4aq1_A 892 SBSB protein; structural protein, S-layer protein, 99.84
3pyw_A203 S-layer protein SAP; SLH-domains, polysaccharide b 99.83
4aq1_A 892 SBSB protein; structural protein, S-layer protein, 99.58
>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-biology, protein structure initiative; 1.80A {Bacillus anthracis} Back     alignment and structure
Probab=99.95  E-value=4.8e-28  Score=220.52  Aligned_cols=142  Identities=21%  Similarity=0.252  Sum_probs=114.1

Q ss_pred             CCCCCCCCCCcHHHHHHHHHHHhcccccCCcccccccccCCCCCCCCCcCCCCCCCChHHHHHHHHcCCccCCcCCCCCC
Q 017472            5 EEDVNADELCTRREYARWLVRINSLLERNPKLRIVPCKSLSGTVVAAFNDVDVEDPDIESIQALAEAGVIPSQLLGKHYG   84 (371)
Q Consensus         5 e~~F~Pn~~ITRaEFA~~Lvra~~~l~r~~~~~i~Pa~~~~~~~~~aF~DV~~~hp~~~~IqalaeaGIIsG~lsg~~~~   84 (371)
                      +++|+|+++|||+||+.||+++++. .        +    .....+.|+||+ +||++++|+.+++.|||+|+       
T Consensus        32 dg~F~P~~~iTR~e~a~~l~r~~~~-~--------~----~~~~~~~F~Dv~-~~w~~~~i~~a~~~Gii~g~-------   90 (203)
T 3pyw_A           32 KGMFEPGKELTRAEAATMMAQILNL-P--------I----DKDAKPSFADSQ-GQWYTPFIAAVEKAGVIKGT-------   90 (203)
T ss_dssp             TSCCCTTCBCBHHHHHHHHHHHTTC-C--------C----CTTCCCSCGGGC-SSTTHHHHHHHHHTTSSCCC-------
T ss_pred             CCeECCCCCchHHHHHHHHHHHhCC-C--------C----CcCCCCCCCCCC-cHHHHHHHHHHHHCcCeecC-------
Confidence            4689999999999999999999851 1        1    123457899996 89999999999999999986       


Q ss_pred             CCCCCCCCCccCCCCCCCcHHHHHHHHHhh-hccCCChhhhhhhccCCCccccccccHHHHHHHHHhHhhhhcccccccc
Q 017472           85 SDGSKGQGGIYFFPERFISRYDLINWKAQV-DYEVKPDIVEQISRTKMSYMDVREINSEASLGLFMDMLAGEKSIARRVF  163 (371)
Q Consensus        85 ~~~~d~~~~~~F~Pd~pITRqEma~~kv~L-~~~~lp~~~~~~~~~~~~F~Dv~~I~~~a~~av~a~l~aG~~~II~~~f  163 (371)
                             +++ |+|+.+|||+||+.|++.+ ....  ...   ......|.|++.| +|+..+|....   ..||+.   
T Consensus        91 -------~~~-F~P~~~iTR~e~a~~l~r~~~~~~--~~~---~~~~~~f~D~~~i-~wa~~~v~~~~---~~Gi~~---  150 (203)
T 3pyw_A           91 -------GNG-FEPNGKIDRVSMASLLVEAYKLDT--KVN---GTPATKFKDLETL-NWGKEKANILV---ELGISV---  150 (203)
T ss_dssp             -------SSC-CCTTSBCBHHHHHHHHHHHHTGGG--GSS---SCCCCCCHHHHTC-SSSHHHHHHHH---HTTCSC---
T ss_pred             -------CCC-CCCCCcccHHHHHHHHHHHhcccc--ccc---cccCCCCCCCCcc-HHHHHHHHHHH---HCcCEe---
Confidence                   347 9999999999999987733 3221  000   0124689999999 78999987764   467877   


Q ss_pred             CCCccccCCCCCcHHHHHHHHHhc
Q 017472          164 GQSKRFQPNKPSTKAQAAVALTSG  187 (371)
Q Consensus       164 G~~~~F~P~kpVTRAEAAa~L~~g  187 (371)
                      |++++|+|++++||+|+|++|++.
T Consensus       151 g~~~~f~P~~~iTRaE~A~~l~r~  174 (203)
T 3pyw_A          151 GTGDQWEPKKTVTKAEAAQFIAKT  174 (203)
T ss_dssp             CCSSCCCTTSBCBHHHHHHHHHHH
T ss_pred             CCCCCcCCCCCCcHHHHHHHHHHH
Confidence            877899999999999999999965



>4aq1_A SBSB protein; structural protein, S-layer protein, nanobody; 2.42A {Geobacillus stearothermophilus} PDB: 4aq1_C Back     alignment and structure
>3pyw_A S-layer protein SAP; SLH-domains, polysaccharide binding, GST-SLH, cell WALL, STR genomics, PSI-biology, protein structure initiative; 1.80A {Bacillus anthracis} Back     alignment and structure
>4aq1_A SBSB protein; structural protein, S-layer protein, nanobody; 2.42A {Geobacillus stearothermophilus} PDB: 4aq1_C Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00