Citrus Sinensis ID: 035876


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------
MVSPGIASNSTRDGGVSGGGLGHWNSPLPYLFGGLGLMMGLITVALMILACSYRKSSSNSATDHADEDKSAGHDKQVEMQTEMEPKIVVIMAGDDNPSYLAKPVSCTFYHQQTEQVL
ccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccccc
ccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccEEEEEcccccccEEEccEEcccccccccccc
mvspgiasnstrdggvsggglghwnsplpylfgglGLMMGLITVALMILACSYrksssnsatdhadedksaghdKQVEMQTEMEPKIVVIMagddnpsylakpvsctfyhqqteqvl
mvspgiasnstrdggvsggGLGHWNSPLPYLFGGLGLMMGLITVALMILACSYRKSSSNSATDHADEDKSAGHDKQVEMQTEMEPKIVVIMAGDDNPSYLAKPVSCTFYHQQTEQVL
MVSPGIASNSTRDGGVSGGGLGHWNSPLPYlfgglglmmglITVALMILACSYRKSSSNSATDHADEDKSAGHDKQVEMQTEMEPKIVVIMAGDDNPSYLAKPVSCTFYHQQTEQVL
******************GGLGHWNSPLPYLFGGLGLMMGLITVALMILACSY*********************************IVVIMAGDDNPSYLAKPVSCTFYH*******
**********************HWNSPLPYLFGGLGLMMGLITVALMILAC*********************************PKIVVIMAGDDNPSYLAKPVS************
**************GVSGGGLGHWNSPLPYLFGGLGLMMGLITVALMILACSY*********************KQVEMQTEMEPKIVVIMAGDDNPSYLAKPVSCTFYHQQTEQVL
*********************GHWNSPLPYLFGGLGLMMGLITVALMILACSYRK*************************TEMEPKIVVIMAGDDNPSYLAKPVSCTFY********
oooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVSPGIASNSTRDGGVSGGGLGHWNSPLPYLFGGLGLMMGLITVALMILACSYRKSSSNSATDHADEDKSAGHDKQVEMQTEMEPKIVVIMAGDDNPSYLAKPVSCTFYHQQTEQVL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query117 2.2.26 [Sep-21-2011]
Q9SW07129 Protein GLUTAMINE DUMPER yes no 0.786 0.713 0.469 1e-16
O81775158 Protein GLUTAMINE DUMPER no no 0.700 0.518 0.516 4e-15
Q8S8A0156 Protein GLUTAMINE DUMPER no no 0.743 0.557 0.445 6e-15
Q9FHH5148 Protein GLUTAMINE DUMPER no no 0.769 0.608 0.428 4e-13
Q3E965131 Protein GLUTAMINE DUMPER no no 0.658 0.587 0.475 7e-10
Q3E8L097 Protein GLUTAMINE DUMPER no no 0.735 0.886 0.404 7e-09
Q3EAV6111 Protein GLUTAMINE DUMPER no no 0.666 0.702 0.487 3e-07
>sp|Q9SW07|GDU2_ARATH Protein GLUTAMINE DUMPER 2 OS=Arabidopsis thaliana GN=GDU2 PE=2 SV=1 Back     alignment and function desciption
 Score = 85.1 bits (209), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 58/98 (59%), Gaps = 6/98 (6%)

Query: 24  WNSPLPYLFGGLGLMMGLITVALMILACSYRKSSSNSATD-HADEDKSAGHDKQVEMQTE 82
           W+SP+PYLFGGL  M+ LI VAL+ILACSY + S ++  D  A +D    +D      TE
Sbjct: 28  WHSPVPYLFGGLAAMLALICVALLILACSYWRLSGSAERDLEAGDDAKPDNDTNKTKHTE 87

Query: 83  MEPKIVVIMAGDDNPSYLAKPV-----SCTFYHQQTEQ 115
           M  K +VIMAGD  P+YLA P      SCT      E+
Sbjct: 88  MPEKFLVIMAGDVRPTYLATPATRSEQSCTCGDHNEEE 125




Probable subunit of an amino acid transporter involved in the regulation of the amino acid metabolism. Stimulates amino acid export by activating nonselective amino acid facilitators.
Arabidopsis thaliana (taxid: 3702)
>sp|O81775|GDU1_ARATH Protein GLUTAMINE DUMPER 1 OS=Arabidopsis thaliana GN=GDU1 PE=1 SV=1 Back     alignment and function description
>sp|Q8S8A0|GDU4_ARATH Protein GLUTAMINE DUMPER 4 OS=Arabidopsis thaliana GN=GDU4 PE=2 SV=1 Back     alignment and function description
>sp|Q9FHH5|GDU3_ARATH Protein GLUTAMINE DUMPER 3 OS=Arabidopsis thaliana GN=GDU3 PE=2 SV=1 Back     alignment and function description
>sp|Q3E965|GDU5_ARATH Protein GLUTAMINE DUMPER 5 OS=Arabidopsis thaliana GN=GDU5 PE=2 SV=2 Back     alignment and function description
>sp|Q3E8L0|GDU7_ARATH Protein GLUTAMINE DUMPER 7 OS=Arabidopsis thaliana GN=GDU7 PE=2 SV=1 Back     alignment and function description
>sp|Q3EAV6|GDU6_ARATH Protein GLUTAMINE DUMPER 6 OS=Arabidopsis thaliana GN=GDU6 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query117
224127552106 predicted protein [Populus trichocarpa] 0.794 0.877 0.663 1e-27
224127548109 predicted protein [Populus trichocarpa] 0.837 0.899 0.63 8e-27
224077576106 predicted protein [Populus trichocarpa] 0.880 0.971 0.616 1e-25
255618736118 conserved hypothetical protein [Ricinus 0.837 0.830 0.588 5e-24
255579958115 conserved hypothetical protein [Ricinus 0.769 0.782 0.552 6e-22
224127544106 predicted protein [Populus trichocarpa] 0.880 0.971 0.570 3e-21
255579956124 conserved hypothetical protein [Ricinus 0.683 0.645 0.602 3e-21
255616790125 conserved hypothetical protein [Ricinus 0.837 0.784 0.504 5e-21
356567164106 PREDICTED: uncharacterized protein LOC10 0.811 0.896 0.5 2e-19
356567160123 PREDICTED: uncharacterized protein LOC10 0.769 0.731 0.467 3e-19
>gi|224127552|ref|XP_002329306.1| predicted protein [Populus trichocarpa] gi|222870760|gb|EEF07891.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  127 bits (318), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 65/98 (66%), Positives = 72/98 (73%), Gaps = 5/98 (5%)

Query: 19  GGLGHWNSPLPYLFGGLGLMMGLITVALMILACSYRKSSSNSATDHADEDKSAGHDKQVE 78
            G    NSP+PYLF  +GLM GL+ VALMILACSYRKSSS SATD   ++KSA    QVE
Sbjct: 14  AGYQDRNSPIPYLFSSIGLMFGLVAVALMILACSYRKSSSTSATDPEVQEKSAN---QVE 70

Query: 79  MQTEMEPKIVVIMAGDDNPSYLAKPVSCTFYHQQTEQV 116
           M+ EMEPKIVVIMAGDD P+YLA PVSC    Q  EQV
Sbjct: 71  MKAEMEPKIVVIMAGDDRPTYLANPVSCNC--QSDEQV 106




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224127548|ref|XP_002329305.1| predicted protein [Populus trichocarpa] gi|222870759|gb|EEF07890.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224077576|ref|XP_002305310.1| predicted protein [Populus trichocarpa] gi|222848274|gb|EEE85821.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255618736|ref|XP_002539968.1| conserved hypothetical protein [Ricinus communis] gi|223500659|gb|EEF22415.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|255579958|ref|XP_002530814.1| conserved hypothetical protein [Ricinus communis] gi|223529635|gb|EEF31582.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224127544|ref|XP_002329304.1| predicted protein [Populus trichocarpa] gi|222870758|gb|EEF07889.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255579956|ref|XP_002530813.1| conserved hypothetical protein [Ricinus communis] gi|223529634|gb|EEF31581.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|255616790|ref|XP_002539784.1| conserved hypothetical protein [Ricinus communis] gi|223502421|gb|EEF22599.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356567164|ref|XP_003551791.1| PREDICTED: uncharacterized protein LOC100782742 [Glycine max] Back     alignment and taxonomy information
>gi|356567160|ref|XP_003551789.1| PREDICTED: uncharacterized protein LOC100781671 [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query117
TAIR|locus:1005716497111 GDU6 "AT3G30725" [Arabidopsis 0.683 0.720 0.439 5e-12
TAIR|locus:2117517129 GDU2 "AT4G25760" [Arabidopsis 0.700 0.635 0.421 5e-12
TAIR|locus:2124894158 GDU1 "glutamine dumper 1" [Ara 0.700 0.518 0.426 2e-10
TAIR|locus:505006270156 GDU4 "AT2G24762" [Arabidopsis 0.743 0.557 0.380 5.2e-10
TAIR|locus:2149448131 GDU5 "AT5G24920" [Arabidopsis 0.658 0.587 0.378 2.9e-09
TAIR|locus:504954872148 GDU3 "AT5G57685" [Arabidopsis 0.786 0.621 0.34 3.3e-08
TAIR|locus:216663597 GDU7 "AT5G38770" [Arabidopsis 0.470 0.567 0.467 2.3e-07
TAIR|locus:1005716497 GDU6 "AT3G30725" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 162 (62.1 bits), Expect = 5.0e-12, P = 5.0e-12
 Identities = 36/82 (43%), Positives = 49/82 (59%)

Query:    24 WNSPLPYXXXXXXXXXXXITVALMILACSYRKSSSNSATDHADEDKSAGHDKQVEMQT-E 82
             W SP+PY           I +AL+ L C+++K SS+S  +H DE+   G DK  +  T E
Sbjct:    10 WKSPVPYLFGGLFLLVLLIALALLSLVCTHQKPSSSSNNNHMDEEDDVG-DKDAKPITRE 68

Query:    83 MEPKIVVIMAGDDNPSYLAKPV 104
               PKIVVI+AGD+ P+ LA PV
Sbjct:    69 YLPKIVVILAGDNKPTCLAVPV 90




GO:0003674 "molecular_function" evidence=ND
GO:0080143 "regulation of amino acid export" evidence=IMP
TAIR|locus:2117517 GDU2 "AT4G25760" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124894 GDU1 "glutamine dumper 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006270 GDU4 "AT2G24762" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2149448 GDU5 "AT5G24920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504954872 GDU3 "AT5G57685" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2166635 GDU7 "AT5G38770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 117
PF01102122 Glycophorin_A: Glycophorin A; InterPro: IPR001195 87.3
PF0243938 Adeno_E3_CR2: Adenovirus E3 region protein CR2; In 86.66
PF0720498 Orthoreo_P10: Orthoreovirus membrane fusion protei 85.35
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane Back     alignment and domain information
Probab=87.30  E-value=0.62  Score=34.69  Aligned_cols=35  Identities=26%  Similarity=0.404  Sum_probs=23.7

Q ss_pred             CCCCChhHHHHHHHHHHHHHHHHHHHHHhhhcccCC
Q 035876           23 HWNSPLPYLFGGLGLMMGLITVALMILACSYRKSSS   58 (117)
Q Consensus        23 ~W~SPvPYLFgGLA~MLgLIAvALLILaCSy~K~ss   58 (117)
                      .+..|+= ...-|++|.|+|++.|||+-|-||+...
T Consensus        60 ~fs~~~i-~~Ii~gv~aGvIg~Illi~y~irR~~Kk   94 (122)
T PF01102_consen   60 RFSEPAI-IGIIFGVMAGVIGIILLISYCIRRLRKK   94 (122)
T ss_dssp             SSS-TCH-HHHHHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred             Cccccce-eehhHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            4445542 4456889999999999998888765543



Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.

>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host [] Back     alignment and domain information
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query117
2knc_A54 Integrin alpha-IIB; transmembrane signaling, prote 91.48
>2knc_A Integrin alpha-IIB; transmembrane signaling, protein structure, cell A cleavage on PAIR of basic residues, disease mutation, disul bond, glycoprotein; NMR {Homo sapiens} Back     alignment and structure
Probab=91.48  E-value=0.53  Score=29.93  Aligned_cols=34  Identities=21%  Similarity=0.483  Sum_probs=23.7

Q ss_pred             CChhHHHHHHHHHHHHHHHHHHHHH---hhhcccCCC
Q 035876           26 SPLPYLFGGLGLMMGLITVALMILA---CSYRKSSSN   59 (117)
Q Consensus        26 SPvPYLFgGLA~MLgLIAvALLILa---CSy~K~ss~   59 (117)
                      +++|.-..-++++.||+-++|++++   |-+.|+...
T Consensus         8 ~~vp~wiIi~svl~GLllL~li~~~LwK~GFFkR~~~   44 (54)
T 2knc_A            8 RAIPIWWVLVGVLGGLLLLTILVLAMWKVGFFKRNRP   44 (54)
T ss_dssp             TTCCHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTCC
T ss_pred             CCcchHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCC
Confidence            4566666667777788777777666   777776654




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00