Citrus Sinensis ID: 034218


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-
MVRKAPDTADREKGRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM
ccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHHc
cccccccHHHcccccEEEEccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHHHHc
mvrkapdtadrekgrytlirdpenfqfgiydkplpcfgcgvgwfSFLLGFVFPLMWYYgtflyfgnhcrkdprERAGLAASAIAAMACSVVMLVIVVFRLM
mvrkapdtadrekgrytlirdpenFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM
MVRKAPDTADREKGRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERaglaasaiaamaCSVVMLVIVVFRLM
***************YTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFR**
***************YTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM
***********EKGRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM
*************GRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiii
oooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHiii
ooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooo
ooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVRKAPDTADREKGRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query101 2.2.26 [Sep-21-2011]
Q8L9S1158 60S ribosomal protein L18 no no 0.633 0.405 0.408 2e-05
>sp|Q8L9S1|R18A1_ARATH 60S ribosomal protein L18a-1 OS=Arabidopsis thaliana GN=RPL18AA PE=2 SV=2 Back     alignment and function desciption
 Score = 47.8 bits (112), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 7/71 (9%)

Query: 31  DKPLPCFGCGVGWFSFLLGFVF-PLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACS 89
           +  LPC G G+GWF F++GF+F  + WY G F+        D RE+AG  A +IA    S
Sbjct: 87  EHELPCCGLGMGWFLFIMGFLFGGIPWYLGAFIVLVTSV--DHREKAGYVACSIA----S 140

Query: 90  VVMLVIVVFRL 100
           VV L+ V+  +
Sbjct: 141 VVYLIAVMLGM 151





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query101
356565888144 PREDICTED: uncharacterized protein LOC10 0.970 0.680 0.683 1e-34
388497666134 unknown [Lotus japonicus] 0.930 0.701 0.712 7e-34
356558389138 PREDICTED: uncharacterized protein LOC10 0.910 0.666 0.706 1e-33
255546063142 conserved hypothetical protein [Ricinus 0.940 0.669 0.789 2e-33
356530415138 PREDICTED: uncharacterized protein LOC10 0.910 0.666 0.695 3e-33
224091200141 predicted protein [Populus trichocarpa] 0.930 0.666 0.810 4e-33
388496450134 unknown [Lotus japonicus] 0.930 0.701 0.702 5e-33
388506132147 unknown [Medicago truncatula] 0.970 0.666 0.663 6e-33
224142908146 predicted protein [Populus trichocarpa] 0.960 0.664 0.765 9e-33
297793223138 hypothetical protein ARALYDRAFT_495799 [ 0.920 0.673 0.698 2e-32
>gi|356565888|ref|XP_003551168.1| PREDICTED: uncharacterized protein LOC100795083 [Glycine max] Back     alignment and taxonomy information
 Score =  150 bits (378), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 67/98 (68%), Positives = 79/98 (80%)

Query: 4   KAPDTADREKGRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLY 63
           ++ DT  R KG+Y LIRDPE+FQ G+YDKPLPCFGCG+GWFSFL GF+ P MW+Y T LY
Sbjct: 42  QSTDTTGRGKGKYALIRDPEDFQMGVYDKPLPCFGCGIGWFSFLFGFLCPPMWFYATILY 101

Query: 64  FGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM 101
           FGNH RKDPRERAGL ASAIAA+ CSV  L+I V  ++
Sbjct: 102 FGNHYRKDPRERAGLGASAIAALVCSVTSLIIGVILVL 139




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388497666|gb|AFK36899.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|356558389|ref|XP_003547489.1| PREDICTED: uncharacterized protein LOC100814053 [Glycine max] Back     alignment and taxonomy information
>gi|255546063|ref|XP_002514091.1| conserved hypothetical protein [Ricinus communis] gi|223546547|gb|EEF48045.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356530415|ref|XP_003533777.1| PREDICTED: uncharacterized protein LOC100791750 [Glycine max] Back     alignment and taxonomy information
>gi|224091200|ref|XP_002309204.1| predicted protein [Populus trichocarpa] gi|222855180|gb|EEE92727.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388496450|gb|AFK36291.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|388506132|gb|AFK41132.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|224142908|ref|XP_002324774.1| predicted protein [Populus trichocarpa] gi|222866208|gb|EEF03339.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297793223|ref|XP_002864496.1| hypothetical protein ARALYDRAFT_495799 [Arabidopsis lyrata subsp. lyrata] gi|297310331|gb|EFH40755.1| hypothetical protein ARALYDRAFT_495799 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query101
TAIR|locus:2120840133 AT4G26060 [Arabidopsis thalian 0.851 0.646 0.627 7.1e-29
TAIR|locus:2164620137 AT5G57060 [Arabidopsis thalian 0.920 0.678 0.580 3.1e-28
TAIR|locus:1006230715103 AT1G54217 [Arabidopsis thalian 0.900 0.883 0.478 1.3e-20
TAIR|locus:2024847153 AT1G53560 [Arabidopsis thalian 0.603 0.398 0.371 1.6e-06
TAIR|locus:505006347132 AT3G14595 [Arabidopsis thalian 0.366 0.280 0.5 5.5e-06
TAIR|locus:2020317154 AT1G17080 [Arabidopsis thalian 0.445 0.292 0.428 1.9e-05
TAIR|locus:2198269 312 RPL18AA "60S ribosomal protein 0.287 0.092 0.533 9.9e-05
TAIR|locus:2120840 AT4G26060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 321 (118.1 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 54/86 (62%), Positives = 65/86 (75%)

Query:    13 KGRYTLIRDPENFQFGIYDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDP 72
             KGRYTL++D +  + G+YDKPLPCFGCG+GWFSFLLGFVFP +WYY T LYFGN+ RKDP
Sbjct:    45 KGRYTLVKDVDEVENGVYDKPLPCFGCGIGWFSFLLGFVFPFLWYYATLLYFGNYYRKDP 104

Query:    73 RERXXXXXXXXXXXXCSVVMLVIVVF 98
             RER             S+++LVIVVF
Sbjct:   105 RERAGLAASAITAMGFSLLLLVIVVF 130




GO:0003735 "structural constituent of ribosome" evidence=IEA
GO:0005840 "ribosome" evidence=IEA
GO:0006412 "translation" evidence=IEA
TAIR|locus:2164620 AT5G57060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:1006230715 AT1G54217 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2024847 AT1G53560 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006347 AT3G14595 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2020317 AT1G17080 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2198269 RPL18AA "60S ribosomal protein L18A-1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
grail3.0112001901
hypothetical protein (142 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 101
PF1025194 PEN-2: Presenilin enhancer-2 subunit of gamma secr 93.31
PF07787248 DUF1625: Protein of unknown function (DUF1625); In 81.27
>PF10251 PEN-2: Presenilin enhancer-2 subunit of gamma secretase; InterPro: IPR019379 This entry is a short, 101 peptide protein, which is the smallest subunit of the gamma-secretase aspartyl protease complex Back     alignment and domain information
Probab=93.31  E-value=0.37  Score=33.99  Aligned_cols=49  Identities=31%  Similarity=0.604  Sum_probs=36.8

Q ss_pred             HHHHHHHH-hHHHHHHHHHHhcccccCC------cccchhHHHHHHHHHHHHHHHHH
Q 034218           45 SFLLGFVF-PLMWYYGTFLYFGNHCRKD------PRERAGLAASAIAAMACSVVMLV   94 (101)
Q Consensus        45 lFllGFf~-~ipWY~gafl~~c~~~r~D------~REkpGl~AcaIAa~v~tia~ii   94 (101)
                      .|+.||++ |..|.+-++-++= ...++      ++-|.=.+.|+|.+++.+++++.
T Consensus        13 yf~~GFa~LP~lW~vN~~wF~~-~af~~p~~~~~~~Ir~YVi~SaiG~~vw~v~l~~   68 (94)
T PF10251_consen   13 YFLGGFAFLPFLWLVNVVWFFR-EAFSKPPYDEQPQIRKYVIRSAIGFLVWTVVLIS   68 (94)
T ss_pred             HHHHHHHHhHHHHHHHHHHHhH-HHhcCCCcchhHHHHHHHHHHHHHHHHHHHHHHH
Confidence            58899988 7999999876643 22233      36677788999999999887765



It catalyses the intra-membrane cleavage of a subset of type I transmembrane proteins. The other active constituents of the complex are presenilin (PS) nicastrin and anterior pharynx defective-1 (APH-1) protein. Presenilin enhancer-2 (PEN-2) adopts a hairpin orientation in the membrane with its N- and C-terminal domains facing the luminal/extracellular space. The C-terminal domain maintains PS stability within the complex [].

>PF07787 DUF1625: Protein of unknown function (DUF1625); InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00