Citrus Sinensis ID: 041618


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------13
MKSMLSRLRNAVRKVKVLLNLSLHRWRIASSMIGASGSKRRLSFNDRPGLRACTDDNYADESSTDSASAGPRSHGGESLHRTISYPSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRANSFEYMSP
ccHHHHHHHHHHHHHHEEEEEccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccEEEEccccccHHHHHHHHHHHHHHHHHHHHHHHHcEEEEEEEccccccccc
cccHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccEEccccccHHHHHHHHHHHHHHHHHHHHHHHHHHEEHHHHcccccccccc
MKSMLSRLRNAVRKVKVLLNLSLHRWRIASSmigasgskrrlsfndrpglractddnyadesstdsasagprshggeslhrtisypseddiDKRAEMFIANFHRQLQIERQISLELRYCransfeymsp
mksmlsrlRNAVRKVKVLLNLSLHRWRIAssmigasgskrrlsfndrpglractddnyadesstdsasagprshggeslhrtisypSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRANSFEYMSP
MKSMLSRLRNAVRKVKVLLNLSLHRWRIASSMIGASGSKRRLSFNDRPGLRACTDDNYADESSTDSASAGPRSHGGESLHRTISYPSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRANSFEYMSP
*********NAVRKVKVLLNLSLHRWRIASSMI*************************************************************AEMFIANFHRQLQIERQISLELRYCRANSF*****
*******LRNAVRKVKVLLNLSLHRWRIASSM************NDRPGLRACT***************************************RAEMFIANFHRQLQIERQISLELRYCRANS******
MKSMLSRLRNAVRKVKVLLNLSLHRWRIASSMIGASGSKRRLSFNDRPGLRACTDDN**********************HRTISYPSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRANSFEYMSP
**SMLSRLRNAVRKVKVLLNLSLHRWRIASSMIGAS**********************************************ISYPSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRANSF*****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooo
oooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKSMLSRLRNAVRKVKVLLNLSLHRWRIASSMIGASGSKRRLSFNDRPGLRACTDDNYADESSTDSASAGPRSHGGESLHRTISYPSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRANSFEYMSP
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query129
225446076128 PREDICTED: uncharacterized protein LOC10 0.868 0.875 0.601 7e-33
356562451134 PREDICTED: uncharacterized protein LOC10 0.930 0.895 0.578 3e-31
359485093128 PREDICTED: uncharacterized protein LOC10 0.852 0.859 0.595 8e-31
224143756125 predicted protein [Populus trichocarpa] 0.906 0.936 0.612 1e-29
356547332132 PREDICTED: uncharacterized protein LOC10 0.860 0.840 0.595 1e-28
449457151120 PREDICTED: uncharacterized protein LOC10 0.891 0.958 0.532 2e-27
255646825132 unknown [Glycine max] 0.844 0.825 0.569 5e-27
357479721132 hypothetical protein MTR_4g128470 [Medic 0.922 0.901 0.545 1e-26
388514623132 unknown [Medicago truncatula] 0.922 0.901 0.545 1e-26
116830679131 unknown [Arabidopsis thaliana] 0.906 0.893 0.503 2e-22
>gi|225446076|ref|XP_002272519.1| PREDICTED: uncharacterized protein LOC100264996 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  145 bits (365), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 74/123 (60%), Positives = 88/123 (71%), Gaps = 11/123 (8%)

Query: 3   SMLSRLRNAVRKVKVLLNLSLHRWRIASSMIGASGSKRRLSFNDRPGLRACTDDNYADES 62
           S+LSRLR AV+KV  LLN  +HRW +AS M+  S   RR SFNDRPGLR CT+D      
Sbjct: 11  SLLSRLRRAVKKVGFLLNFGIHRWHVAS-MLRTSSETRRFSFNDRPGLRGCTED------ 63

Query: 63  STDSASAGPRSHGGESLHRTISYPSEDDIDKRAEMFIANFHRQLQIERQISLELRYCRAN 122
            TDS      S G   + RTIS P E+D+DKRAE+FIANF+RQLQ+ERQ+SLELRYC+ N
Sbjct: 64  -TDSEDQPGSSRG---IQRTISGPCEEDVDKRAEVFIANFYRQLQMERQVSLELRYCKGN 119

Query: 123 SFE 125
           SFE
Sbjct: 120 SFE 122




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356562451|ref|XP_003549485.1| PREDICTED: uncharacterized protein LOC100798434 [Glycine max] Back     alignment and taxonomy information
>gi|359485093|ref|XP_003633214.1| PREDICTED: uncharacterized protein LOC100259802 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224143756|ref|XP_002325063.1| predicted protein [Populus trichocarpa] gi|222866497|gb|EEF03628.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356547332|ref|XP_003542068.1| PREDICTED: uncharacterized protein LOC100814899 [Glycine max] Back     alignment and taxonomy information
>gi|449457151|ref|XP_004146312.1| PREDICTED: uncharacterized protein LOC101218317 [Cucumis sativus] gi|449510648|ref|XP_004163722.1| PREDICTED: uncharacterized protein LOC101224640 [Cucumis sativus] Back     alignment and taxonomy information
>gi|255646825|gb|ACU23884.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357479721|ref|XP_003610146.1| hypothetical protein MTR_4g128470 [Medicago truncatula] gi|355511201|gb|AES92343.1| hypothetical protein MTR_4g128470 [Medicago truncatula] Back     alignment and taxonomy information
>gi|388514623|gb|AFK45373.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|116830679|gb|ABK28297.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query129
TAIR|locus:2174537130 AT5G57510 "AT5G57510" [Arabido 0.899 0.892 0.529 5.9e-25
TAIR|locus:2159128188 AT5G13880 "AT5G13880" [Arabido 0.217 0.148 0.571 0.00038
TAIR|locus:2174537 AT5G57510 "AT5G57510" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 284 (105.0 bits), Expect = 5.9e-25, P = 5.9e-25
 Identities = 71/134 (52%), Positives = 91/134 (67%)

Query:     3 SMLSRLRNAVRKVKVLLNLSLHR-WRIASSMIGASGSKR--RLSFNDRPGLRACTDDNYA 59
             S+L+RLR AV KVK +L+  ++  W +   M+G+S S    RLSFNDRPGL A   +N  
Sbjct:     8 SLLNRLRQAVNKVKFVLSFKINSLWGLVP-MLGSSSSSSSLRLSFNDRPGLTAAFTENEP 66

Query:    60 DESSTDSASAGPRSHGGESLHRTISYP--SEDDIDKRAEMFIANFHRQLQIERQISLELR 117
             D  ST S      S G  +L+RT+SY   S++DID +AEMFIANF+RQL+IERQISLEL+
Sbjct:    67 D--STGS------SRG--ALYRTVSYDQSSDEDIDNKAEMFIANFYRQLKIERQISLELK 116

Query:   118 YCRAN--SFEYMSP 129
             YC+ N  SF Y SP
Sbjct:   117 YCQGNNQSFNYRSP 130




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0071456 "cellular response to hypoxia" evidence=IEP
GO:0010200 "response to chitin" evidence=RCA
TAIR|locus:2159128 AT5G13880 "AT5G13880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XVIII000815
hypothetical protein (125 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query129
pfam0555338 pfam05553, DUF761, Cotton fibre expressed protein 1e-07
>gnl|CDD|218633 pfam05553, DUF761, Cotton fibre expressed protein Back     alignment and domain information
 Score = 44.5 bits (106), Expect = 1e-07
 Identities = 16/29 (55%), Positives = 24/29 (82%)

Query: 87  SEDDIDKRAEMFIANFHRQLQIERQISLE 115
           S+D++D+RAE FIA F  QL+++RQ SL+
Sbjct: 1   SDDEVDRRAEEFIARFREQLRLQRQESLK 29


This family consists of several plant proteins of unknown function. Three of the sequences (from Gossypium hirsutum) in this family are described as cotton fibre expressed proteins. The remaining sequences, found in Arabidopsis thaliana, are uncharacterized. Length = 38

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 129
PF0555338 DUF761: Cotton fibre expressed protein; InterPro: 99.59
>PF05553 DUF761: Cotton fibre expressed protein; InterPro: IPR008480 This family consists of several plant proteins of unknown function Back     alignment and domain information
Probab=99.59  E-value=1.7e-15  Score=94.62  Aligned_cols=34  Identities=53%  Similarity=0.870  Sum_probs=32.0

Q ss_pred             CchhHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 041618           87 SEDDIDKRAEMFIANFHRQLQIERQISLELRYCRA  121 (129)
Q Consensus        87 ~dd~ID~rAE~FI~kF~~QmrLQRq~Sl~lRY~~~  121 (129)
                      ++++||++||+||++||+||||||++|++ ||+++
T Consensus         1 ~~~evd~rAe~FI~~f~~qlrlqr~~S~~-ry~em   34 (38)
T PF05553_consen    1 SDDEVDRRAEEFIAKFREQLRLQRQESLQ-RYQEM   34 (38)
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH
Confidence            47899999999999999999999999996 99986



Three of the sequences from Gossypium hirsutum (Upland cotton) in this family are described as G. hirsutum fibre expressed proteins []. The remaining sequences, found in Arabidopsis thaliana, are uncharacterised.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00