Citrus Sinensis ID: 034029


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-----
MKPAAANTTAPIDLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKRYSNSSSGNGRDQENQPAVLPKVLDPEPKVVVIMAGDDKPRYLATQATISSKPCCCEQV
ccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccEEEEEEccccccccccccccccccccccccc
ccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccEEEEEcccccccEEcccccccccccccccc
mkpaaanttapidlwhspipylFTSLALVLVLIAVALVLLLCSYHkrysnsssgngrdqenqpavlpkvldpepkvVVIMagddkprylatqatisskpccceqv
mkpaaanttapidlwhSPIPYLFTSLALVLVLIAVALVLLLCSYHKRysnsssgngrdqenqpavlpkvldpepKVVVIMAGDDKPRYLATqatisskpccceqv
MKPAAANTTAPIDLWHSPIPYLFTSlalvlvliavalvlllCSYHKRYSNSSSGNGRDQENQPAVLPKVLDPEPKVVVIMAGDDKPRYLATQATISSKPCCCEQV
*********APIDLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKR***************************KVVVIMAGDDKPRYLATQATI**********
***************HSPIPYLFTSLALVLVLIAVALVLLLC********************************KVVVIMAGDDKPRYLATQATISSKPCCC***
********TAPIDLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKRY*************QPAVLPKVLDPEPKVVVIMAGDDKPRYLATQATISSKPCCCEQV
*************LWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKR**********************LDPEPKVVVIMAGDDKPRYLATQATISSKPC*****
oooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKPAAANTTAPIDLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKRYSNSSSGNGRDQENQPAVLPKVLDPEPKVVVIMAGDDKPRYLATQATISSKPCCCEQV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query105 2.2.26 [Sep-21-2011]
Q8S8A0156 Protein GLUTAMINE DUMPER yes no 0.895 0.602 0.47 4e-15
O81775158 Protein GLUTAMINE DUMPER no no 0.8 0.531 0.489 2e-14
Q9SW07129 Protein GLUTAMINE DUMPER no no 0.923 0.751 0.442 2e-13
Q9FHH5148 Protein GLUTAMINE DUMPER no no 0.838 0.594 0.436 4e-11
Q3E965131 Protein GLUTAMINE DUMPER no no 0.704 0.564 0.454 1e-06
Q3EAV6111 Protein GLUTAMINE DUMPER no no 0.771 0.729 0.435 0.0001
Q3E8L097 Protein GLUTAMINE DUMPER no no 0.714 0.773 0.357 0.0002
>sp|Q8S8A0|GDU4_ARATH Protein GLUTAMINE DUMPER 4 OS=Arabidopsis thaliana GN=GDU4 PE=2 SV=1 Back     alignment and function desciption
 Score = 80.1 bits (196), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 57/100 (57%), Gaps = 6/100 (6%)

Query: 6   ANTTAPIDLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHK-RYSNSSSGNGRDQENQPA 64
            N   PI  WHSP+PYLF  LA +L LIA AL++L CSY +   S   SG   D+E +  
Sbjct: 24  GNHRPPISPWHSPVPYLFGGLAAMLGLIAFALLILACSYWRLSTSGDDSGERVDEEKESR 83

Query: 65  VLPKVLDP--EPKVVVIMAGDDKPRYLATQATISSKPCCC 102
              K      E KV+VIMAGDD PR+LAT A   +  C C
Sbjct: 84  SGVKAASAACEEKVLVIMAGDDLPRFLATPA---ANKCMC 120




Probable subunit of an amino acid transporter involved in the regulation of the amino acid metabolism. Stimulates amino acid export by activating nonselective amino acid facilitators.
Arabidopsis thaliana (taxid: 3702)
>sp|O81775|GDU1_ARATH Protein GLUTAMINE DUMPER 1 OS=Arabidopsis thaliana GN=GDU1 PE=1 SV=1 Back     alignment and function description
>sp|Q9SW07|GDU2_ARATH Protein GLUTAMINE DUMPER 2 OS=Arabidopsis thaliana GN=GDU2 PE=2 SV=1 Back     alignment and function description
>sp|Q9FHH5|GDU3_ARATH Protein GLUTAMINE DUMPER 3 OS=Arabidopsis thaliana GN=GDU3 PE=2 SV=1 Back     alignment and function description
>sp|Q3E965|GDU5_ARATH Protein GLUTAMINE DUMPER 5 OS=Arabidopsis thaliana GN=GDU5 PE=2 SV=2 Back     alignment and function description
>sp|Q3EAV6|GDU6_ARATH Protein GLUTAMINE DUMPER 6 OS=Arabidopsis thaliana GN=GDU6 PE=2 SV=1 Back     alignment and function description
>sp|Q3E8L0|GDU7_ARATH Protein GLUTAMINE DUMPER 7 OS=Arabidopsis thaliana GN=GDU7 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query105
22412754088 predicted protein [Populus trichocarpa] 0.780 0.931 0.516 6e-15
255579958115 conserved hypothetical protein [Ricinus 0.857 0.782 0.489 7e-15
356567160123 PREDICTED: uncharacterized protein LOC10 0.961 0.821 0.450 8e-15
449438981105 PREDICTED: protein GLUTAMINE DUMPER 4-li 0.857 0.857 0.514 1e-14
449531155105 PREDICTED: protein GLUTAMINE DUMPER 4-li 0.857 0.857 0.514 2e-14
297821831160 hypothetical protein ARALYDRAFT_481335 [ 0.885 0.581 0.457 2e-14
255579956124 conserved hypothetical protein [Ricinus 0.723 0.612 0.519 4e-14
297802864157 hypothetical protein ARALYDRAFT_913294 [ 0.8 0.535 0.5 1e-13
18400628156 glutamine dumper 4 [Arabidopsis thaliana 0.895 0.602 0.47 2e-13
388505148135 unknown [Medicago truncatula] 0.847 0.659 0.483 2e-13
>gi|224127540|ref|XP_002329303.1| predicted protein [Populus trichocarpa] gi|222870757|gb|EEF07888.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 64/93 (68%), Gaps = 11/93 (11%)

Query: 1  MKPAAANTTAPIDLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKRYSNSSSGNGRDQE 60
          M+P++ ++T+ I++W SP PYLF SL L+L +IAVAL  L CSY   Y NSS     DQE
Sbjct: 1  MRPSS-DSTSTINVWRSPTPYLFGSLGLLLAIIAVALTSLACSY---YRNSSG----DQE 52

Query: 61 NQPAVLPK---VLDPEPKVVVIMAGDDKPRYLA 90
           +PA +     VL  EP++VV+MAG+DKP YLA
Sbjct: 53 EKPAAMLTSMPVLHAEPEIVVVMAGEDKPTYLA 85




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255579958|ref|XP_002530814.1| conserved hypothetical protein [Ricinus communis] gi|223529635|gb|EEF31582.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356567160|ref|XP_003551789.1| PREDICTED: uncharacterized protein LOC100781671 [Glycine max] Back     alignment and taxonomy information
>gi|449438981|ref|XP_004137266.1| PREDICTED: protein GLUTAMINE DUMPER 4-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449531155|ref|XP_004172553.1| PREDICTED: protein GLUTAMINE DUMPER 4-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|297821831|ref|XP_002878798.1| hypothetical protein ARALYDRAFT_481335 [Arabidopsis lyrata subsp. lyrata] gi|297324637|gb|EFH55057.1| hypothetical protein ARALYDRAFT_481335 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|255579956|ref|XP_002530813.1| conserved hypothetical protein [Ricinus communis] gi|223529634|gb|EEF31581.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|297802864|ref|XP_002869316.1| hypothetical protein ARALYDRAFT_913294 [Arabidopsis lyrata subsp. lyrata] gi|297315152|gb|EFH45575.1| hypothetical protein ARALYDRAFT_913294 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18400628|ref|NP_565577.1| glutamine dumper 4 [Arabidopsis thaliana] gi|75160399|sp|Q8S8A0.1|GDU4_ARATH RecName: Full=Protein GLUTAMINE DUMPER 4 gi|20198236|gb|AAM15477.1| Expressed protein [Arabidopsis thaliana] gi|88900418|gb|ABD57521.1| At2g24762 [Arabidopsis thaliana] gi|330252530|gb|AEC07624.1| glutamine dumper 4 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388505148|gb|AFK40640.1| unknown [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query105
TAIR|locus:505006270156 GDU4 "AT2G24762" [Arabidopsis 0.885 0.596 0.393 9.4e-11
TAIR|locus:2117517129 GDU2 "AT4G25760" [Arabidopsis 0.923 0.751 0.355 8.4e-10
TAIR|locus:2124894158 GDU1 "glutamine dumper 1" [Ara 0.838 0.556 0.385 1.1e-09
TAIR|locus:1005716497111 GDU6 "AT3G30725" [Arabidopsis 0.914 0.864 0.317 7.6e-09
TAIR|locus:504954872148 GDU3 "AT5G57685" [Arabidopsis 0.838 0.594 0.349 6.8e-08
TAIR|locus:2149448131 GDU5 "AT5G24920" [Arabidopsis 0.771 0.618 0.313 1e-06
TAIR|locus:216663597 GDU7 "AT5G38770" [Arabidopsis 0.476 0.515 0.425 3.9e-05
TAIR|locus:505006270 GDU4 "AT2G24762" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 150 (57.9 bits), Expect = 9.4e-11, P = 9.4e-11
 Identities = 39/99 (39%), Positives = 47/99 (47%)

Query:     7 NTTAPIDLWHSPIPYLFTSXXXXXXXXXXXXXXXXCSYHK-RYSNSSSGNGRDQENQPAV 65
             N   PI  WHSP+PYLF                  CSY +   S   SG   D+E +   
Sbjct:    25 NHRPPISPWHSPVPYLFGGLAAMLGLIAFALLILACSYWRLSTSGDDSGERVDEEKESRS 84

Query:    66 LPKVLDP--EPKVVVIMAGDDKPRYLATQATISSKPCCC 102
               K      E KV+VIMAGDD PR+LAT A  ++K C C
Sbjct:    85 GVKAASAACEEKVLVIMAGDDLPRFLATPA--ANK-CMC 120




GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0080143 "regulation of amino acid export" evidence=IMP
TAIR|locus:2117517 GDU2 "AT4G25760" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124894 GDU1 "glutamine dumper 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:1005716497 GDU6 "AT3G30725" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504954872 GDU3 "AT5G57685" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2149448 GDU5 "AT5G24920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2166635 GDU7 "AT5G38770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.01230017
hypothetical protein (88 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 105
PF01102122 Glycophorin_A: Glycophorin A; InterPro: IPR001195 89.33
PF0243938 Adeno_E3_CR2: Adenovirus E3 region protein CR2; In 87.72
PF0720498 Orthoreo_P10: Orthoreovirus membrane fusion protei 85.97
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane Back     alignment and domain information
Probab=89.33  E-value=0.47  Score=34.55  Aligned_cols=36  Identities=14%  Similarity=0.250  Sum_probs=24.1

Q ss_pred             ccccCchHHHHHHHHHHHHHHHHHHHHHHHhhcccCC
Q 034029           13 DLWHSPIPYLFTSLALVLVLIAVALVLLLCSYHKRYS   49 (105)
Q Consensus        13 ~~W~SPvPYLFgGLA~MlgLIAvALliLaCSy~K~~s   49 (105)
                      ..+..|+= ....|++|.|+|++.|||+-|-+|+...
T Consensus        59 h~fs~~~i-~~Ii~gv~aGvIg~Illi~y~irR~~Kk   94 (122)
T PF01102_consen   59 HRFSEPAI-IGIIFGVMAGVIGIILLISYCIRRLRKK   94 (122)
T ss_dssp             SSSS-TCH-HHHHHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred             cCccccce-eehhHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            34555542 4456889999999999998888765533



Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.

>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host [] Back     alignment and domain information
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query105
2knc_A54 Integrin alpha-IIB; transmembrane signaling, prote 90.91
>2knc_A Integrin alpha-IIB; transmembrane signaling, protein structure, cell A cleavage on PAIR of basic residues, disease mutation, disul bond, glycoprotein; NMR {Homo sapiens} Back     alignment and structure
Probab=90.91  E-value=0.71  Score=28.66  Aligned_cols=34  Identities=18%  Similarity=0.365  Sum_probs=21.9

Q ss_pred             CchHHHHHHHHHHHHHHHHHHHHH---HHhhcccCCC
Q 034029           17 SPIPYLFTSLALVLVLIAVALVLL---LCSYHKRYSN   50 (105)
Q Consensus        17 SPvPYLFgGLA~MlgLIAvALliL---aCSy~K~~s~   50 (105)
                      +++|.-..-++++.||+-++|+++   -|-+.|+...
T Consensus         8 ~~vp~wiIi~svl~GLllL~li~~~LwK~GFFkR~~~   44 (54)
T 2knc_A            8 RAIPIWWVLVGVLGGLLLLTILVLAMWKVGFFKRNRP   44 (54)
T ss_dssp             TTCCHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTCC
T ss_pred             CCcchHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCC
Confidence            346666666677777777776655   5777776554




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00