Citrus Sinensis ID: 030628


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170----
MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
cHHHHHHHHccccccccEEEHHHHcccccccccccccccccccHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHcccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHcccccHHHHccHHHHHHHHHHHHHHHHHHHcc
cHHHHHHHccccEEEEEEEcccEEEcccccccccccccccccHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
mmeqlskrkkgvtfgsfkvskeikfadkqpifpwgprfaksspqdIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLksfepavsptyteegdddgRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYaggfipaflfdnqelivtgSSAVLLFIMASYYR
mmeqlskrkkgvtfgsfkvskeikfadkqpifpWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKsfepavsptyteegdddgRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
************TFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAV*************ALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYY*
*****************KVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFE********************GKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
**********GVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
***QLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHi
oooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHi
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHi
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWIAIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query174
224144561 285 predicted protein [Populus trichocarpa] 1.0 0.610 0.752 1e-74
224123536253 predicted protein [Populus trichocarpa] 1.0 0.687 0.758 5e-74
225453038 285 PREDICTED: uncharacterized protein LOC10 1.0 0.610 0.752 2e-73
358248094 282 uncharacterized protein LOC100814672 [Gl 1.0 0.617 0.758 5e-73
449465818 285 PREDICTED: uncharacterized protein LOC10 1.0 0.610 0.735 6e-72
356505987 282 PREDICTED: uncharacterized protein LOC10 1.0 0.617 0.747 2e-71
217075548 285 unknown [Medicago truncatula] 1.0 0.610 0.729 2e-68
297849170 294 heat shock protein binding protein [Arab 1.0 0.591 0.701 2e-65
18390922 294 Chloroplast J-like domain 1-containing p 1.0 0.591 0.695 3e-65
21593057 294 unknown [Arabidopsis thaliana] 1.0 0.591 0.695 4e-65
>gi|224144561|ref|XP_002325333.1| predicted protein [Populus trichocarpa] gi|222862208|gb|EEE99714.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  283 bits (725), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 131/174 (75%), Positives = 156/174 (89%)

Query: 1   MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWI 60
           MM QLS RKKGVT+GSFKV+K+IK+ADKQP+ PWGPRFAKSS  D+RINLAISAAFTAWI
Sbjct: 112 MMAQLSNRKKGVTYGSFKVAKDIKYADKQPVVPWGPRFAKSSENDMRINLAISAAFTAWI 171

Query: 61  AIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLAL 120
            IKR AEYKPLQFLAFAFVYR FEKLK+FEP VSPTY+E+G+D+GR LR+GKR+LR LAL
Sbjct: 172 LIKRNAEYKPLQFLAFAFVYRIFEKLKTFEPPVSPTYSEDGEDEGRTLRLGKRILRALAL 231

Query: 121 VFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR 174
           VFG I  +SLAYTGILNLIE AG +IPAFL++NQEL++T +++ +L+I+ASYYR
Sbjct: 232 VFGSITFASLAYTGILNLIEMAGSYIPAFLYNNQELLITTATSAILYILASYYR 285




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224123536|ref|XP_002319104.1| predicted protein [Populus trichocarpa] gi|222857480|gb|EEE95027.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225453038|ref|XP_002266453.1| PREDICTED: uncharacterized protein LOC100243269 [Vitis vinifera] gi|302143637|emb|CBI22390.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|358248094|ref|NP_001240067.1| uncharacterized protein LOC100814672 [Glycine max] gi|255638094|gb|ACU19361.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|449465818|ref|XP_004150624.1| PREDICTED: uncharacterized protein LOC101213790 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356505987|ref|XP_003521770.1| PREDICTED: uncharacterized protein LOC100805399 [Glycine max] Back     alignment and taxonomy information
>gi|217075548|gb|ACJ86134.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|297849170|ref|XP_002892466.1| heat shock protein binding protein [Arabidopsis lyrata subsp. lyrata] gi|297338308|gb|EFH68725.1| heat shock protein binding protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18390922|ref|NP_563823.1| Chloroplast J-like domain 1-containing protein [Arabidopsis thaliana] gi|13877839|gb|AAK43997.1|AF370182_1 unknown protein [Arabidopsis thaliana] gi|16323504|gb|AAL15246.1| unknown protein [Arabidopsis thaliana] gi|332190204|gb|AEE28325.1| Chloroplast J-like domain 1-containing protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21593057|gb|AAM65006.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query174
TAIR|locus:2025660294 CJD1 "Chloroplast J-like domai 1.0 0.591 0.695 5.5e-61
TAIR|locus:2025660 CJD1 "Chloroplast J-like domain 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 624 (224.7 bits), Expect = 5.5e-61, P = 5.5e-61
 Identities = 121/174 (69%), Positives = 141/174 (81%)

Query:     1 MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWI 60
             M  QL  RKKGVTFGSFKVSK+IK+ADKQPI PWGPRF++SS  D+ INLAIS  F+AWI
Sbjct:   121 MYAQLMNRKKGVTFGSFKVSKDIKYADKQPIIPWGPRFSRSSKNDMLINLAISVVFSAWI 180

Query:    61 AIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLAL 120
             AIKR  EYKPLQF++F FVYR FEKLKSFE   SP Y EEG++ GR LRMGKRLLR L+L
Sbjct:   181 AIKRNVEYKPLQFMSFVFVYRIFEKLKSFEAPSSPIYNEEGEESGRGLRMGKRLLRSLSL 240

Query:   121 VFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR 174
             VFG I ++SLAYTG LN IEY G  IP  L++NQELIVT SSA +L+++AS+YR
Sbjct:   241 VFGSILLASLAYTGFLNGIEYMGYSIPMVLYNNQELIVTASSAFMLYVIASFYR 294


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.326   0.140   0.415    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      174       174   0.00075  109 3  11 22  0.47    31
                                                     31  0.49    34


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  560 (60 KB)
  Total size of DFA:  133 KB (2084 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  15.30u 0.18s 15.48t   Elapsed:  00:00:00
  Total cpu time:  15.30u 0.18s 15.48t   Elapsed:  00:00:00
  Start:  Sat May 11 08:14:51 2013   End:  Sat May 11 08:14:51 2013


GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0006631 "fatty acid metabolic process" evidence=IMP
GO:0031969 "chloroplast membrane" evidence=IDA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 174
PF11833194 DUF3353: Protein of unknown function (DUF3353); In 99.39
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised Back     alignment and domain information
Probab=99.39  E-value=1.4e-12  Score=107.85  Aligned_cols=140  Identities=27%  Similarity=0.422  Sum_probs=102.0

Q ss_pred             ChhhhhhhhcCccccceeeehhhccccC-----------CCCccCCCC----CCCCCcchhhHhHHHHHHHHHHHHhhcc
Q 030628            1 MMEQLSKRKKGVTFGSFKVSKEIKFADK-----------QPIFPWGPR----FAKSSPQDIRINLAISAAFTAWIAIKRY   65 (174)
Q Consensus         1 MmaQLs~RkkG~t~Gs~~VsKdikYADk-----------qp~~PW~PR----~~~s~~kDi~IN~aIsa~~~~wi~~~~~   65 (174)
                      ||++|..|+||    .++|+++|||||+           ++..||.+|    +...+.+|+.+..++-+++.+|......
T Consensus        38 LM~rL~~Rq~G----ki~v~~~ir~ad~~~~~~~~~~~~~~~p~wl~~~~~~~~~P~~~~l~~~~~~f~~L~~~~~~~~~  113 (194)
T PF11833_consen   38 LMERLRQRQKG----KIKVPERIRYADREEPKPPNPKPSNPSPPWLQRLLPSFDTPSSQDLLIRAAAFGALGLWSLLFPA  113 (194)
T ss_pred             HHHHHHHHHcC----CCCccHHHHHhhhccccccCCCCCCccchHHHhcccceeCCCcchHHHHHHHHHHHHHHHHHHcC
Confidence            69999999999    6899999999999           777899999    5888899999999998999999998877


Q ss_pred             cccchHHHHHHHHHHHHHHHhhhcCCCCCCCCCCCCCCchhhhhhhHHHHHHHHHHHHHHHHHhhhHHhHH-HHHHHhCc
Q 030628           66 AEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGIL-NLIEYAGG  144 (174)
Q Consensus        66 a~~kPl~f~af~~v~r~f~KL~~~ePp~~~~~~e~ge~~gr~~r~~KR~lRsl~LvfG~~~v~sL~yT~~l-n~ie~~g~  144 (174)
                      ++...+|.+ ++. ...+..|++                     -++|+.|++++.+|++++ .+.....+ +.+..  .
T Consensus       114 ~~~~~l~La-l~~-~~~iyfl~~---------------------K~~~~~rA~~~~~~~L~~-G~~lGs~l~~~l~~--~  167 (194)
T PF11833_consen  114 ASGPGLQLA-LGL-GACIYFLNR---------------------KERKLGRAFLWTLGGLVV-GLILGSLLASWLPV--D  167 (194)
T ss_pred             CCCcchHHH-HHH-HHHHHHHHH---------------------hcchHHHHHHHHHHHHHH-HHHHHHHHHhhccc--c
Confidence            777777733 322 223333322                     178999999999999988 44444444 33322  1


Q ss_pred             ccchheeccceeeeehhhhHHHHhhhhc
Q 030628          145 FIPAFLFDNQELIVTGSSAVLLFIMASY  172 (174)
Q Consensus       145 ~iP~~~y~~Qel~vt~~~~v~L~~~~s~  172 (174)
                      .+|  .-.+=|.++|..+-++|....+|
T Consensus       168 ~~p--~~~s~~~~~sl~~~i~lwl~s~f  193 (194)
T PF11833_consen  168 IVP--GPWSPEQLVSLFTYILLWLVSLF  193 (194)
T ss_pred             cCC--CCCCHHHHHHHHHHHHHHHHHhc
Confidence            223  34455777777777777766555



This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00