Citrus Sinensis ID: 044736


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120------
MQLRRIKKAVKKLNMILSVLKLSRWHLASICGTTFSKNRRSLSFSDRVGLQYSTTVDQDSSMGKSVRSPQRTKGCAYDHYGDDDNDNIDRNSEVFITNFRRRLLYERQVSLELLYRRVNSFDNNYA
ccHHHHHHHHHHHHHHHHHHcccHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEcccccccccc
cHHHHHHHHHHHHHHHHHHHHHcHcHHHHHHccccccccEEEEEccccccccccccccccccccccccccEEcccccccccccccHcHHHHHHHHHHHHHHHHHHHHHHHEEHHHccccccccccc
MQLRRIKKAVKKLNMILSVLKLSRWHLASIcgttfsknrrslsfsdrvglqysttvdqdssmgksvrspqrtkgcaydhygdddndnidrnSEVFITNFRRRLLYERQVSLELLYRRVNSFDNNYA
mqlrrikkaVKKLNMILSVLKLSRWHLASICGttfsknrrslsfsdRVGLQysttvdqdssmgksvrspqrtKGCAYDhygdddndnidrNSEVFITNFRRRLLYERQVSlellyrrvnsfdnnya
MQLRRIKKAVKKLNMILSVLKLSRWHLASICGTTFSKNRRSLSFSDRVGLQYSTTVDQDSSMGKSVRSPQRTKGCAYDHYGdddndnidrnSEVFITNFRRRLLYERQVSLELLYRRVNSFDNNYA
*******KAVKKLNMILSVLKLSRWHLASICGTTFSKNR**L*F*******************************AYDHYG*****NIDRNSEVFITNFRRRLLYERQVSLELLYRRVNS******
******K*AVKKLNMILSVLKLSRWHLA************************************************************DRNSEVFITNFRRRLLYERQVSLELLYRRVN*******
MQLRRIKKAVKKLNMILSVLKLSRWHLASICGTTFSKNRRSLSFSDRVGLQYSTT****************TKGCAYDHYGDDDNDNIDRNSEVFITNFRRRLLYERQVSLELLYRRVNSFDNNYA
*QLRRIKKAVKKLNMILSVLKLSRWHLASICGTTFSKNRRSLSFSDRVGLQYST********************C*YDHYGDDDNDNIDRNSEVFITNFRRRLLYERQVSLELLYRRVNSF*****
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MQLRRIKKAVKKLNMILSVLKLSRWHLASICGTTFSKNRRSLSFSDRVGLQYSTTVDQDSSMGKSVRSPQRTKGCAYDHYGDDDNDNIDRNSEVFITNFRRRLLYERQVSLELLYRRVNSFDNNYA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query126
356562451134 PREDICTED: uncharacterized protein LOC10 0.880 0.828 0.420 8e-14
255646825132 unknown [Glycine max] 0.833 0.795 0.432 2e-13
255548568126 conserved hypothetical protein [Ricinus 0.880 0.880 0.429 4e-13
356547332132 PREDICTED: uncharacterized protein LOC10 0.833 0.795 0.423 1e-12
224121176128 predicted protein [Populus trichocarpa] 0.896 0.882 0.443 1e-11
225446076128 PREDICTED: uncharacterized protein LOC10 0.880 0.867 0.421 2e-11
449457151120 PREDICTED: uncharacterized protein LOC10 0.880 0.925 0.422 2e-11
15242150130 uncharacterized protein [Arabidopsis tha 0.936 0.907 0.398 3e-11
116830679131 unknown [Arabidopsis thaliana] 0.936 0.900 0.398 3e-11
224143756125 predicted protein [Populus trichocarpa] 0.880 0.888 0.408 8e-11
>gi|356562451|ref|XP_003549485.1| PREDICTED: uncharacterized protein LOC100798434 [Glycine max] Back     alignment and taxonomy information
 Score = 81.6 bits (200), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 53/126 (42%), Positives = 74/126 (58%), Gaps = 15/126 (11%)

Query: 3   LRRIKKAVKKLNMILSVLKLSR-WHLASIC-GTTFSKNRRSLSFSDRVGLQYSTT----V 56
           L R++ AVKK+ ++LS   LS  WH A+I  G + SK  R +SF+DR GL   T      
Sbjct: 14  LSRLRVAVKKMKLLLSATVLSHAWHAATILRGVSMSK--RQISFNDRPGLMMCTASSDET 71

Query: 57  DQDSSMGKSVRSPQRTKGCAYDHYGDDDNDNIDRNSEVFITNFRRRLLYERQVSLELLYR 116
           D +  +  +  S QRT  C  D       D+ID+ +E+FI+NFRR+L  ERQ+SL+L Y 
Sbjct: 72  DSEGLVSPAHHSLQRTISCPSD-------DDIDKRAEMFISNFRRQLNMERQISLQLRYC 124

Query: 117 RVNSFD 122
             NSF+
Sbjct: 125 SQNSFE 130




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255646825|gb|ACU23884.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|255548568|ref|XP_002515340.1| conserved hypothetical protein [Ricinus communis] gi|223545284|gb|EEF46789.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356547332|ref|XP_003542068.1| PREDICTED: uncharacterized protein LOC100814899 [Glycine max] Back     alignment and taxonomy information
>gi|224121176|ref|XP_002330762.1| predicted protein [Populus trichocarpa] gi|224121184|ref|XP_002330764.1| predicted protein [Populus trichocarpa] gi|222872564|gb|EEF09695.1| predicted protein [Populus trichocarpa] gi|222872566|gb|EEF09697.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225446076|ref|XP_002272519.1| PREDICTED: uncharacterized protein LOC100264996 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449457151|ref|XP_004146312.1| PREDICTED: uncharacterized protein LOC101218317 [Cucumis sativus] gi|449510648|ref|XP_004163722.1| PREDICTED: uncharacterized protein LOC101224640 [Cucumis sativus] Back     alignment and taxonomy information
>gi|15242150|ref|NP_200559.1| uncharacterized protein [Arabidopsis thaliana] gi|9758312|dbj|BAB08786.1| unnamed protein product [Arabidopsis thaliana] gi|91805711|gb|ABE65584.1| hypothetical protein At5g57510 [Arabidopsis thaliana] gi|332009526|gb|AED96909.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|116830679|gb|ABK28297.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224143756|ref|XP_002325063.1| predicted protein [Populus trichocarpa] gi|222866497|gb|EEF03628.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query126
TAIR|locus:2174537130 AT5G57510 "AT5G57510" [Arabido 0.936 0.907 0.365 1.9e-12
TAIR|locus:2174537 AT5G57510 "AT5G57510" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 166 (63.5 bits), Expect = 1.9e-12, P = 1.9e-12
 Identities = 45/123 (36%), Positives = 66/123 (53%)

Query:     3 LRRIKKAVKKLNMILSVLKLSRWHLASICGTTFSKNRRSLSFSDRVGLQYSTTVDQDSSM 62
             L R+++AV K+  +LS    S W L  + G++ S +   LSF+DR GL  + T ++  S 
Sbjct:    10 LNRLRQAVNKVKFVLSFKINSLWGLVPMLGSSSSSSSLRLSFNDRPGLTAAFTENEPDST 69

Query:    63 GKSVRSPQRTKGCAYDHYGXXXXXXXXXXSEVFITNFRRRLLYERQVSLELLYRRVNSFD 122
             G S  +  RT   +YD             +E+FI NF R+L  ERQ+SLEL Y + N+  
Sbjct:    70 GSSRGALYRT--VSYDQ---SSDEDIDNKAEMFIANFYRQLKIERQISLELKYCQGNNQS 124

Query:   123 NNY 125
              NY
Sbjct:   125 FNY 127


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.323   0.133   0.381    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      126       116   0.00091  102 3  11 22  0.41    30
                                                     29  0.45    32


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  574 (61 KB)
  Total size of DFA:  116 KB (2076 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  10.48u 0.11s 10.59t   Elapsed:  00:00:01
  Total cpu time:  10.48u 0.11s 10.59t   Elapsed:  00:00:01
  Start:  Fri May 10 07:38:55 2013   End:  Fri May 10 07:38:56 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0071456 "cellular response to hypoxia" evidence=IEP
GO:0010200 "response to chitin" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.01470022
annotation not avaliable (128 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 126
PF0555338 DUF761: Cotton fibre expressed protein; InterPro: 99.35
>PF05553 DUF761: Cotton fibre expressed protein; InterPro: IPR008480 This family consists of several plant proteins of unknown function Back     alignment and domain information
Probab=99.35  E-value=4.5e-13  Score=83.45  Aligned_cols=34  Identities=41%  Similarity=0.712  Sum_probs=30.8

Q ss_pred             CCchhhhhHHHHHHHHHHHHhHhhhhhhhheehhc
Q 044736           84 DNDNIDRNSEVFITNFRRRLLYERQVSLELLYRRV  118 (126)
Q Consensus        84 ~~~dId~~Ae~FI~nFr~Ql~lqRq~Slel~Y~~~  118 (126)
                      ++++||++||+||++||+||+||||.|+ .+|+++
T Consensus         1 ~~~evd~rAe~FI~~f~~qlrlqr~~S~-~ry~em   34 (38)
T PF05553_consen    1 SDDEVDRRAEEFIAKFREQLRLQRQESL-QRYQEM   34 (38)
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH
Confidence            3679999999999999999999999999 688765



Three of the sequences from Gossypium hirsutum (Upland cotton) in this family are described as G. hirsutum fibre expressed proteins []. The remaining sequences, found in Arabidopsis thaliana, are uncharacterised.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00