Citrus Sinensis ID: 034219


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-
MRHVATDLTAPSGAGLVQWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEKSAEAIDPPAALEPKIVVIMAGDDHPTRIARPSASTS
ccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccEEEEEccccccccccccccccc
ccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccEEEEEcccccccEEEccccccc
mrhvatdltapsgaglvqwnsplpymfgGLLLVFGVIAVALMFLACchdkssitgpssgdqkdeksaeaidppaalepkIVVIMagddhptriarpsasts
mrhvatdltapsgaglvqWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEKSAEaidppaalepKIVVIMAGDDHPtriarpsasts
MRHVATDLTAPSGAGLVQWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEKSAEAIDPPAALEPKIVVIMAGDDHPTRIARPSASTS
************GAGLVQWNSPLPYMFGGLLLVFGVIAVALMFLACCH*******************************IVVIM*****************
******************WNSPLPYMFGGLLLVFGVIAVALMFLAC********************************KIVVIMAGDDHPTRIARPSA***
MRHVATDLTAPSGAGLVQWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSS****************AIDPPAALEPKIVVIMAGDDHPTR*********
***************LVQWNSPLPYMFGGLLLVFGVIAVALMFLACCHDK***********************AALEPKIVVIMAGDDHPTRIARP*****
ooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MRHVATDLTAPSGAGLVQWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEKSAEAIDPPAALEPKIVVIMAGDDHPTRIARPSASTS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query101 2.2.26 [Sep-21-2011]
Q8S8A0156 Protein GLUTAMINE DUMPER yes no 0.801 0.519 0.452 2e-13
O81775158 Protein GLUTAMINE DUMPER no no 0.782 0.5 0.451 3e-12
Q9FHH5148 Protein GLUTAMINE DUMPER no no 0.980 0.668 0.373 6e-11
Q9SW07129 Protein GLUTAMINE DUMPER no no 0.821 0.643 0.348 1e-07
Q3E965131 Protein GLUTAMINE DUMPER no no 0.772 0.595 0.407 4e-06
Q3E8L097 Protein GLUTAMINE DUMPER no no 0.851 0.886 0.301 7e-05
>sp|Q8S8A0|GDU4_ARATH Protein GLUTAMINE DUMPER 4 OS=Arabidopsis thaliana GN=GDU4 PE=2 SV=1 Back     alignment and function desciption
 Score = 74.3 bits (181), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 55/84 (65%), Gaps = 3/84 (3%)

Query: 19  WNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEKSAEAID---PPAA 75
           W+SP+PY+FGGL  + G+IA AL+ LAC + + S +G  SG++ DE+           AA
Sbjct: 33  WHSPVPYLFGGLAAMLGLIAFALLILACSYWRLSTSGDDSGERVDEEKESRSGVKAASAA 92

Query: 76  LEPKIVVIMAGDDHPTRIARPSAS 99
            E K++VIMAGDD P  +A P+A+
Sbjct: 93  CEEKVLVIMAGDDLPRFLATPAAN 116




Probable subunit of an amino acid transporter involved in the regulation of the amino acid metabolism. Stimulates amino acid export by activating nonselective amino acid facilitators.
Arabidopsis thaliana (taxid: 3702)
>sp|O81775|GDU1_ARATH Protein GLUTAMINE DUMPER 1 OS=Arabidopsis thaliana GN=GDU1 PE=1 SV=1 Back     alignment and function description
>sp|Q9FHH5|GDU3_ARATH Protein GLUTAMINE DUMPER 3 OS=Arabidopsis thaliana GN=GDU3 PE=2 SV=1 Back     alignment and function description
>sp|Q9SW07|GDU2_ARATH Protein GLUTAMINE DUMPER 2 OS=Arabidopsis thaliana GN=GDU2 PE=2 SV=1 Back     alignment and function description
>sp|Q3E965|GDU5_ARATH Protein GLUTAMINE DUMPER 5 OS=Arabidopsis thaliana GN=GDU5 PE=2 SV=2 Back     alignment and function description
>sp|Q3E8L0|GDU7_ARATH Protein GLUTAMINE DUMPER 7 OS=Arabidopsis thaliana GN=GDU7 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query101
25554191688 conserved hypothetical protein [Ricinus 0.772 0.886 0.552 6e-19
224127552106 predicted protein [Populus trichocarpa] 0.900 0.858 0.516 3e-16
255579958115 conserved hypothetical protein [Ricinus 0.920 0.808 0.505 2e-15
38851241798 unknown [Medicago truncatula] 0.851 0.877 0.477 2e-14
224127548109 predicted protein [Populus trichocarpa] 0.782 0.724 0.468 2e-13
388496890113 unknown [Medicago truncatula] 0.871 0.778 0.426 2e-13
255618736118 conserved hypothetical protein [Ricinus 0.881 0.754 0.437 6e-13
224077576106 predicted protein [Populus trichocarpa] 0.900 0.858 0.457 1e-12
255579956124 conserved hypothetical protein [Ricinus 0.920 0.75 0.479 1e-12
298204421119 unnamed protein product [Vitis vinifera] 0.801 0.680 0.458 2e-12
>gi|255541916|ref|XP_002512022.1| conserved hypothetical protein [Ricinus communis] gi|223549202|gb|EEF50691.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score = 97.8 bits (242), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 59/85 (69%), Gaps = 7/85 (8%)

Query: 16  LVQWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEKSAEAIDPPAA 75
           L QWN+PLPY+FGGL+L+ G ++VAL+ LAC H KSS        +K+EKS    D   A
Sbjct: 4   LAQWNTPLPYVFGGLVLIIGFMSVALIILACSHYKSS-------GEKEEKSRRTTDAVVA 56

Query: 76  LEPKIVVIMAGDDHPTRIARPSAST 100
           +E KI VIMAGD HPT +A+PSA T
Sbjct: 57  METKIAVIMAGDRHPTHLAKPSALT 81




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224127552|ref|XP_002329306.1| predicted protein [Populus trichocarpa] gi|222870760|gb|EEF07891.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255579958|ref|XP_002530814.1| conserved hypothetical protein [Ricinus communis] gi|223529635|gb|EEF31582.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|388512417|gb|AFK44270.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|224127548|ref|XP_002329305.1| predicted protein [Populus trichocarpa] gi|222870759|gb|EEF07890.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388496890|gb|AFK36511.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|255618736|ref|XP_002539968.1| conserved hypothetical protein [Ricinus communis] gi|223500659|gb|EEF22415.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224077576|ref|XP_002305310.1| predicted protein [Populus trichocarpa] gi|222848274|gb|EEE85821.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255579956|ref|XP_002530813.1| conserved hypothetical protein [Ricinus communis] gi|223529634|gb|EEF31581.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|298204421|emb|CBI16901.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query101
TAIR|locus:505006270156 GDU4 "AT2G24762" [Arabidopsis 0.801 0.519 0.464 1.1e-14
TAIR|locus:2124894158 GDU1 "glutamine dumper 1" [Ara 0.782 0.5 0.451 5.6e-13
TAIR|locus:1005716497111 GDU6 "AT3G30725" [Arabidopsis 0.772 0.702 0.45 3.9e-12
TAIR|locus:2149448131 GDU5 "AT5G24920" [Arabidopsis 0.772 0.595 0.407 8.2e-12
TAIR|locus:504954872148 GDU3 "AT5G57685" [Arabidopsis 0.970 0.662 0.386 2.2e-11
TAIR|locus:2117517129 GDU2 "AT4G25760" [Arabidopsis 0.782 0.612 0.365 5.9e-09
TAIR|locus:216663597 GDU7 "AT5G38770" [Arabidopsis 0.861 0.896 0.326 3.4e-06
TAIR|locus:505006270 GDU4 "AT2G24762" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 187 (70.9 bits), Expect = 1.1e-14, P = 1.1e-14
 Identities = 39/84 (46%), Positives = 57/84 (67%)

Query:    19 WNSPLPYMFGGLLLVFGVIAVALMFLACCHDKSSITGPSSGDQKDEK--SAEAIDPP-AA 75
             W+SP+PY+FGGL  + G+IA AL+ LAC + + S +G  SG++ DE+  S   +    AA
Sbjct:    33 WHSPVPYLFGGLAAMLGLIAFALLILACSYWRLSTSGDDSGERVDEEKESRSGVKAASAA 92

Query:    76 LEPKIVVIMAGDDHPTRIARPSAS 99
              E K++VIMAGDD P  +A P+A+
Sbjct:    93 CEEKVLVIMAGDDLPRFLATPAAN 116




GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0080143 "regulation of amino acid export" evidence=IMP
TAIR|locus:2124894 GDU1 "glutamine dumper 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:1005716497 GDU6 "AT3G30725" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2149448 GDU5 "AT5G24920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504954872 GDU3 "AT5G57685" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2117517 GDU2 "AT4G25760" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2166635 GDU7 "AT5G38770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_scaffold_123000020
hypothetical protein (106 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 101
PF01102122 Glycophorin_A: Glycophorin A; InterPro: IPR001195 92.14
PF0243938 Adeno_E3_CR2: Adenovirus E3 region protein CR2; In 90.17
PF0720498 Orthoreo_P10: Orthoreovirus membrane fusion protei 86.62
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane Back     alignment and domain information
Probab=92.14  E-value=0.14  Score=36.98  Aligned_cols=33  Identities=18%  Similarity=0.280  Sum_probs=23.3

Q ss_pred             cccCChhHHHHHHHHHHHHHHHHHHHHHhhhccc
Q 034219           18 QWNSPLPYMFGGLLLVFGVIAVALMFLACCHDKS   51 (101)
Q Consensus        18 ~W~SPvPYLFgGLa~MlgLIAvALliLaCSy~k~   51 (101)
                      .+..|+ -....|++|.|+|++.|||+-|-+|+.
T Consensus        60 ~fs~~~-i~~Ii~gv~aGvIg~Illi~y~irR~~   92 (122)
T PF01102_consen   60 RFSEPA-IIGIIFGVMAGVIGIILLISYCIRRLR   92 (122)
T ss_dssp             SSS-TC-HHHHHHHHHHHHHHHHHHHHHHHHHHS
T ss_pred             Cccccc-eeehhHHHHHHHHHHHHHHHHHHHHHh
Confidence            444454 244568899999999999988887554



Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.

>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host [] Back     alignment and domain information
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query101
2knc_A54 Integrin alpha-IIB; transmembrane signaling, prote 87.66
>2knc_A Integrin alpha-IIB; transmembrane signaling, protein structure, cell A cleavage on PAIR of basic residues, disease mutation, disul bond, glycoprotein; NMR {Homo sapiens} Back     alignment and structure
Probab=87.66  E-value=1.7  Score=26.65  Aligned_cols=33  Identities=12%  Similarity=0.175  Sum_probs=21.0

Q ss_pred             CChhHHHHHHHHHHHHHHHHHHHH---HhhhcccCC
Q 034219           21 SPLPYMFGGLLLVFGVIAVALMFL---ACCHDKSSI   53 (101)
Q Consensus        21 SPvPYLFgGLa~MlgLIAvALliL---aCSy~k~~s   53 (101)
                      +++|+-..-++++.||+-++|+++   -|-+.|...
T Consensus         8 ~~vp~wiIi~svl~GLllL~li~~~LwK~GFFkR~~   43 (54)
T 2knc_A            8 RAIPIWWVLVGVLGGLLLLTILVLAMWKVGFFKRNR   43 (54)
T ss_dssp             TTCCHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTC
T ss_pred             CCcchHHHHHHHHHHHHHHHHHHHHHHHcCcccCCC
Confidence            356666666677777777776665   566666543




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00