Citrus Sinensis ID: 028978


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-
MGFFSKEEKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSLSLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHEEEEcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHcEEccHHHHHHHHHHHHHHHHHcc
cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHccccHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHcc
MGFFSKEEKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLpsallsaslspsslslstlsshfndydfryslvdiplISIIRSAVIICVYglcdgprrsrgpylgiTTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
mgffskeeKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSLSLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
MGFFSKEEKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADtllpsallsaslspsslslstlssHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
**********KIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSAS*******LSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINC*
***********IILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSLSLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVF***************MALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYD*FLINC*
********KSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSLSLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
*******EKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSLSLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
iiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooo
iiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooo
iiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGFFSKEEKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSLSLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYKIDIEAVSLLISVYDIFLINCM
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query201
356539961205 PREDICTED: uncharacterized protein LOC10 0.975 0.956 0.736 5e-70
357461197216 hypothetical protein MTR_3g070400 [Medic 0.935 0.870 0.738 1e-69
449463028207 PREDICTED: uncharacterized protein LOC10 0.925 0.898 0.761 5e-68
224101137206 predicted protein [Populus trichocarpa] 0.925 0.902 0.796 3e-64
255587900253 hydrolase, putative [Ricinus communis] g 0.925 0.735 0.751 3e-62
225448408201 PREDICTED: uncharacterized protein LOC10 0.905 0.905 0.748 4e-62
359486412197 PREDICTED: uncharacterized protein LOC10 0.900 0.918 0.741 3e-61
30688139209 Alpha/beta hydrolase related protein [Ar 0.930 0.894 0.651 3e-60
356551462200 PREDICTED: uncharacterized protein LOC10 0.925 0.93 0.659 3e-60
297823919208 hypothetical protein ARALYDRAFT_483049 [ 0.930 0.899 0.631 6e-59
>gi|356539961|ref|XP_003538461.1| PREDICTED: uncharacterized protein LOC100795655 [Glycine max] Back     alignment and taxonomy information
 Score =  269 bits (687), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/197 (73%), Positives = 167/197 (84%), Gaps = 1/197 (0%)

Query: 1   MGFFSKEEKSKIILRAFKTLFFLITMLVSLLLFSAPVLLAIADTLLPSALLSASLSPSSL 60
           MGF SKEEKSK ILR  KTLFFLITM++SLLLFSAPVLL IAD L+PSALLS+    SS 
Sbjct: 1   MGFSSKEEKSKRILRGLKTLFFLITMVISLLLFSAPVLLVIADALVPSALLSSLSQ-SSF 59

Query: 61  SLSTLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLS 120
           S+ TL+SHF++YDFRYSL+DIPL+SIIRS +I CVY LCDGPR S GPYLGITT+CSVLS
Sbjct: 60  SMETLASHFHNYDFRYSLIDIPLVSIIRSFIIFCVYSLCDGPRLSHGPYLGITTMCSVLS 119

Query: 121 LIFVSLKASYVFSVADIDRGVYVRAMEMALFICSLALAVGHIVVAYRTSCRERKKLLVYK 180
           L+FVS KA Y+FSV+ IDR  YVRA E+ALF+CS ALAVGH+VVAYRTSCRER+KLLVYK
Sbjct: 120 LMFVSFKAVYIFSVSGIDRSGYVRATEIALFVCSCALAVGHVVVAYRTSCRERRKLLVYK 179

Query: 181 IDIEAVSLLISVYDIFL 197
           IDIEA+S   + Y  +L
Sbjct: 180 IDIEAISACKNGYPRYL 196




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|357461197|ref|XP_003600880.1| hypothetical protein MTR_3g070400 [Medicago truncatula] gi|355489928|gb|AES71131.1| hypothetical protein MTR_3g070400 [Medicago truncatula] Back     alignment and taxonomy information
>gi|449463028|ref|XP_004149236.1| PREDICTED: uncharacterized protein LOC101213138 isoform 1 [Cucumis sativus] gi|449463030|ref|XP_004149237.1| PREDICTED: uncharacterized protein LOC101213138 isoform 2 [Cucumis sativus] gi|449520685|ref|XP_004167364.1| PREDICTED: uncharacterized LOC101213138 isoform 1 [Cucumis sativus] gi|449520687|ref|XP_004167365.1| PREDICTED: uncharacterized LOC101213138 isoform 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224101137|ref|XP_002312157.1| predicted protein [Populus trichocarpa] gi|222851977|gb|EEE89524.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255587900|ref|XP_002534435.1| hydrolase, putative [Ricinus communis] gi|223525303|gb|EEF27950.1| hydrolase, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|225448408|ref|XP_002273406.1| PREDICTED: uncharacterized protein LOC100257064 isoform 1 [Vitis vinifera] gi|297736619|emb|CBI25490.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|359486412|ref|XP_003633440.1| PREDICTED: uncharacterized protein LOC100257064 isoform 2 [Vitis vinifera] Back     alignment and taxonomy information
>gi|30688139|ref|NP_850325.1| Alpha/beta hydrolase related protein [Arabidopsis thaliana] gi|26449647|dbj|BAC41948.1| unknown protein [Arabidopsis thaliana] gi|30017255|gb|AAP12861.1| At2g40095 [Arabidopsis thaliana] gi|330254683|gb|AEC09777.1| Alpha/beta hydrolase related protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|356551462|ref|XP_003544094.1| PREDICTED: uncharacterized protein LOC100806343 [Glycine max] Back     alignment and taxonomy information
>gi|297823919|ref|XP_002879842.1| hypothetical protein ARALYDRAFT_483049 [Arabidopsis lyrata subsp. lyrata] gi|297325681|gb|EFH56101.1| hypothetical protein ARALYDRAFT_483049 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query201
TAIR|locus:504956035209 AT2G40095 "AT2G40095" [Arabido 0.930 0.894 0.587 6.5e-51
TAIR|locus:2082043215 SUE4 "AT3G55880" [Arabidopsis 0.950 0.888 0.544 1.5e-49
TAIR|locus:2034220 647 AT1G80280 [Arabidopsis thalian 0.920 0.285 0.389 6.1e-29
TAIR|locus:2037828 648 AT1G15490 [Arabidopsis thalian 0.870 0.270 0.376 1.2e-26
TAIR|locus:2011476 633 AT1G52750 [Arabidopsis thalian 0.835 0.265 0.361 2.1e-20
TAIR|locus:504956035 AT2G40095 "AT2G40095" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 118/201 (58%), Positives = 136/201 (67%)

Query:     1 MGFFSKEEKSKI------ILRAFKTLFFLITMLVSLLLFSAPVLLAIADXXXXXXXXXXX 54
             MGF SKEE SK       + R  KT+F L TM  S LL SAP+ LA+AD           
Sbjct:     1 MGF-SKEEISKTRKLWSSLWRGIKTIFVLFTMFFSFLLVSAPIFLAVADALLPSALLHRF 59

Query:    55 XXXXXXXXXXXXXHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITT 114
                          H  +YDFR+SL+DIPLISIIRSAVI+CVYGLCDGP+ SRGPYL IT 
Sbjct:    60 SSPATLSS-----HLTNYDFRHSLIDIPLISIIRSAVILCVYGLCDGPKLSRGPYLTITM 114

Query:   115 ICSVLSLIFVSLKASYVFSVADIDRGV--YVRAMEMALFICSLALAVGHIVVAYRTSCRE 172
             ICS+ SLI+VSLKA++VF    I  G   Y RA E+ALF+CS  LA+GHIVVAYRTSCRE
Sbjct:   115 ICSISSLIYVSLKAAFVFGEPAIGDGGGNYFRAAEVALFLCSSVLAIGHIVVAYRTSCRE 174

Query:   173 RKKLLVYKIDIEAVSLLISVY 193
             RKKLLV+KIDIEAVS   +VY
Sbjct:   175 RKKLLVFKIDIEAVSACKNVY 195




GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2082043 SUE4 "AT3G55880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2034220 AT1G80280 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2037828 AT1G15490 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2011476 AT1G52750 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 201
PF10457171 MENTAL: Cholesterol-capturing domain; InterPro: IP 99.49
>PF10457 MENTAL: Cholesterol-capturing domain; InterPro: IPR019498 The following proteins share a conserved region called the MENTAL (MLN64 N-terminal) domain, composed of four transmembrane helices with three short intervening loops [, , ]: Animal MLN64 (metastatic lymph node 64), a late endosomal membrane protein containing a carboxyl-terminal cholesterol binding START domain (IPR002913 from INTERPRO) Back     alignment and domain information
Probab=99.49  E-value=1.2e-14  Score=121.59  Aligned_cols=105  Identities=33%  Similarity=0.444  Sum_probs=88.8

Q ss_pred             hhhhhhcccCcccccccchhHHHHHHHHHHHHHHhhcCCCCCCCCchHHHHHHHhHHHHHHHHhhhceecccccccchhh
Q 028978           63 STLSSHFNDYDFRYSLVDIPLISIIRSAVIICVYGLCDGPRRSRGPYLGITTICSVLSLIFVSLKASYVFSVADIDRGVY  142 (201)
Q Consensus        63 ~~~~~~~~~Y~FrsSLvDIPlvSi~RS~~IlCvY~~Cdgp~Ls~gpYLgit~~cs~~S~~~vsvKA~~Vf~~~~~~~~~~  142 (201)
                      ..+..|..+|+||+||.||++++++|.++++++|++|   ++.|+++.++||++|.   .|+.+|+. +|. +...++. 
T Consensus        35 ~~l~~ei~~Y~~~~SLFDivllA~~Rf~vLil~Ya~~---rl~hw~~iaitT~~S~---afli~Kv~-~~~-~~~s~~~-  105 (171)
T PF10457_consen   35 SALQNEINHYDFKTSLFDIVLLAIFRFLVLILFYALL---RLRHWWPIAITTLVSC---AFLIVKVF-FFD-STSSQNA-  105 (171)
T ss_pred             HHHHHHHhheehhhhHHHHHHHHHHHHHHHHHHHHHh---ccCcceEeEeHHhhhh---HHhhheee-EEe-cccCCCC-
Confidence            6799999999999999999999999999999999999   5669999999987654   58889997 664 2222221 


Q ss_pred             HhhhhHHHHHHHHHHHHhHH-HHHhhhhhhhhhhhccc
Q 028978          143 VRAMEMALFICSLALAVGHI-VVAYRTSCRERKKLLVY  179 (201)
Q Consensus       143 ~~~~~~~LflsS~vfAl~Hi-vvAYRtSCraRRKLlv~  179 (201)
                         .+-+|+++|+++|-++. ++.||--.|||++.-.+
T Consensus       106 ---~~y~L~I~SfvlaW~E~WfldfrVlPqE~~~~~~~  140 (171)
T PF10457_consen  106 ---FEYLLIITSFVLAWIETWFLDFRVLPQEREAERRY  140 (171)
T ss_pred             ---ceEEehHHHHHHHHHHHHHHhheecchhHHHHHHH
Confidence               23589999999999999 99999999999998655



It is probably involved in intracellular cholesterol transport. Mammalian MENTHO (MLN64 N-terminal domain homologue), a late endosomal protein containing only the MENTAL domain. It is probably involved in cellular cholesterol homoeostasis. The ~170-amino acid MENTAL domain mediates MLN64 and MENTHO homo- and hetero- interactions, targets both proteins to late endosomes and binds cholesterol. The MENTAL domain might serve to maintain cholesterol at the membrane of late endosomes prior to its shuttle to cytoplasmic acceptor(s) through the START domain.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00