Citrus Sinensis ID: 030383


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------18
MAITCSSSSLICFSSDFASSSLNLFYSKPNIIIRTHLLFCPRNQKPLNSSVSDKRRNRVSFFSGFSTKRVESFKEELFQAIKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLVQ
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHcccccccccccccccEEEEEEEccHHHccccccccccccccEEEEEEccccEEEEEEEcccccHHHHHHcc
ccEEcccccHEEEcccccccccccccccccEcEccEEEccccccccccccccccccccEEEcccccHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHccccccccHHHccccEEEEEEcHHHHHcccccccccccccEEEEEEccccEEEEEEcccHHHHHHHHHcc
maitcsssslicfssdfassslnlfyskpniiirthllfcprnqkplnssvsdkrrnrvsffsgfSTKRVESFKEELFQAIkpldrgaeaspedQARVDQIARKLEAVNDIkeplksnllngkWELLYTTSQSLlqtkrpkflrpngkiyqaiNIDTLRaqnietwpFFNQVIIFLVQ
maitcsssslicFSSDFASSSLNLFYSKPNIIIRTHLLFCPRNQkplnssvsdkrrnrvsffsgfstkrveSFKEELFQAIKpldrgaeaspedqARVDQIARKLEavndikeplksnllngkwELLYTTSQSllqtkrpkflrpngKIYQAINIDTLRAQNIETWPFFNQVIIFLVQ
MAITCsssslicfssdfassslnlfYSKPNIIIRTHLLFCPRNQKPLNSSVSDKRRNRVSFFSGFSTKRVESFKEELFQAIKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKSNLLNGKWEllyttsqsllqtKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLVQ
********SLICFSSDFASSSLNLFYSKPNIIIRTHLLFCPR*******************F********************************************AVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLV*
**************************************************************************EELFQAIKPLDRGAEA**EDQARVDQIARKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLV*
MAITCSSSSLICFSSDFASSSLNLFYSKPNIIIRTHLLFCPRNQKPLNSSVSDKRRNRVSFFSGFSTKRVESFKEELFQAIKPLDR*********ARVDQIARKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLVQ
****CSSSSLICFSSDFASSSLNLFYSKPNIIIRTHLLFCPRNQ****SS*SDKRRNRVSFFSGFSTKRVESFKEELFQAIKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLVQ
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
SSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAITCSSSSLICFSSDFASSSLNLFYSKPNIIIRTHLLFCPRNQKPLNSSVSDKRRNRVSFFSGFSTKRVESFKEELFQAIKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLVQ
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query178 2.2.26 [Sep-21-2011]
Q9LU85242 Probable plastid-lipid-as yes no 0.674 0.495 0.569 3e-35
Q6DBN2234 Probable plastid-lipid-as no no 0.668 0.508 0.559 1e-32
Q6K439 319 Probable plastid-lipid-as no no 0.550 0.307 0.320 0.0004
O81439 318 Probable plastid-lipid-as no no 0.544 0.305 0.32 0.0004
>sp|Q9LU85|PAP4_ARATH Probable plastid-lipid-associated protein 4, chloroplastic OS=Arabidopsis thaliana GN=PAP4 PE=1 SV=1 Back     alignment and function desciption
 Score =  147 bits (371), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 74/130 (56%), Positives = 97/130 (74%), Gaps = 10/130 (7%)

Query: 52  SDKRRNRV----SFFSGFSTK-----RVESFKEELFQAIKPLDRGAEASPEDQARVDQIA 102
            D+ R RV    SF   F T+     + +  K+EL +AI+PL+RGA ASP+DQ R+DQ+A
Sbjct: 44  GDRERLRVQAIFSFPPAFLTRNGRAEKQKQLKQELLEAIEPLERGATASPDDQLRIDQLA 103

Query: 103 RKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRPKFLRPNGKIYQAINIDTLRAQN 162
           RK+EAVN  KEPLKS+L+NGKWEL+YTTS S+LQ K+P+FLR     YQ+IN+DTL+ QN
Sbjct: 104 RKVEAVNPTKEPLKSDLVNGKWELIYTTSASILQAKKPRFLRSITN-YQSINVDTLKVQN 162

Query: 163 IETWPFFNQV 172
           +ETWPF+N V
Sbjct: 163 METWPFYNSV 172





Arabidopsis thaliana (taxid: 3702)
>sp|Q6DBN2|PAP5_ARATH Probable plastid-lipid-associated protein 5, chloroplastic OS=Arabidopsis thaliana GN=PAP5 PE=2 SV=1 Back     alignment and function description
>sp|Q6K439|PAP2_ORYSJ Probable plastid-lipid-associated protein 2, chloroplastic OS=Oryza sativa subsp. japonica GN=PAP2 PE=2 SV=1 Back     alignment and function description
>sp|O81439|PAP1_ARATH Probable plastid-lipid-associated protein 1, chloroplastic OS=Arabidopsis thaliana GN=PAP1 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query178
356525936240 PREDICTED: probable plastid-lipid-associ 0.825 0.612 0.662 3e-50
356557505245 PREDICTED: probable plastid-lipid-associ 0.707 0.514 0.765 1e-49
298204587 320 unnamed protein product [Vitis vinifera] 0.842 0.468 0.690 3e-49
225448063256 PREDICTED: probable plastid-lipid-associ 0.842 0.585 0.690 4e-49
357445617248 hypothetical protein MTR_2g007640 [Medic 0.966 0.693 0.576 6e-48
217073420248 unknown [Medicago truncatula] gi|3885181 0.966 0.693 0.571 3e-47
224070527225 predicted protein [Populus trichocarpa] 0.685 0.542 0.741 3e-45
89355891174 plastid fibrillin 3 [Coffea canephora] 0.606 0.620 0.768 1e-43
255575408262 structural molecule, putative [Ricinus c 0.960 0.652 0.543 1e-41
449449595243 PREDICTED: probable plastid-lipid-associ 0.696 0.510 0.679 2e-41
>gi|356525936|ref|XP_003531577.1| PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic-like isoform 1 [Glycine max] Back     alignment and taxonomy information
 Score =  203 bits (516), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 104/157 (66%), Positives = 120/157 (76%), Gaps = 10/157 (6%)

Query: 23  NLFYSKPNIIIRTHLLFCPRNQKPLNSSVSDKRRNRVSFFSGFST--KRVESFKEELFQA 80
           +LF S P  +  THL+        L  + S K  N VSFF GF T  + V+S K +L++A
Sbjct: 27  SLFSSSPQKLNTTHLI--------LQVADSGKWSNMVSFFPGFITEGRDVQSLKVDLYEA 78

Query: 81  IKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKSNLLNGKWELLYTTSQSLLQTKRP 140
           I PLDRGAEA+PEDQ RVDQIARKLEAVN +KEPLKS+LLNGKWEL YTTSQS+LQT+RP
Sbjct: 79  IAPLDRGAEATPEDQQRVDQIARKLEAVNPVKEPLKSDLLNGKWELFYTTSQSILQTQRP 138

Query: 141 KFLRPNGKIYQAINIDTLRAQNIETWPFFNQVIIFLV 177
           K LRPNGKIYQAIN+DTLRAQNIETWPF+NQ    LV
Sbjct: 139 KLLRPNGKIYQAINVDTLRAQNIETWPFYNQATANLV 175




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356557505|ref|XP_003547056.1| PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic-like [Glycine max] Back     alignment and taxonomy information
>gi|298204587|emb|CBI23862.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225448063|ref|XP_002273405.1| PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic [Vitis vinifera] Back     alignment and taxonomy information
>gi|357445617|ref|XP_003593086.1| hypothetical protein MTR_2g007640 [Medicago truncatula] gi|124360438|gb|ABN08448.1| PAP fibrillin [Medicago truncatula] gi|355482134|gb|AES63337.1| hypothetical protein MTR_2g007640 [Medicago truncatula] Back     alignment and taxonomy information
>gi|217073420|gb|ACJ85069.1| unknown [Medicago truncatula] gi|388518147|gb|AFK47135.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|224070527|ref|XP_002303164.1| predicted protein [Populus trichocarpa] gi|222840596|gb|EEE78143.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|89355891|gb|ABD72269.1| plastid fibrillin 3 [Coffea canephora] Back     alignment and taxonomy information
>gi|255575408|ref|XP_002528606.1| structural molecule, putative [Ricinus communis] gi|223531951|gb|EEF33764.1| structural molecule, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|449449595|ref|XP_004142550.1| PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic-like [Cucumis sativus] gi|449479703|ref|XP_004155682.1| PREDICTED: probable plastid-lipid-associated protein 4, chloroplastic-like [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query178
TAIR|locus:2092150242 AT3G26070 [Arabidopsis thalian 0.668 0.491 0.527 7.1e-29
TAIR|locus:2092165234 AT3G26080 [Arabidopsis thalian 0.707 0.538 0.492 2.8e-27
TAIR|locus:2092150 AT3G26070 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 321 (118.1 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 68/129 (52%), Positives = 87/129 (67%)

Query:    53 DKRRNRV----SFFSGFSTK--RVES---FKEELFQAIKPLDRGAEASPEDQARVDQIAR 103
             D+ R RV    SF   F T+  R E     K+EL +AI+PL+RGA ASP+DQ R+DQ+AR
Sbjct:    45 DRERLRVQAIFSFPPAFLTRNGRAEKQKQLKQELLEAIEPLERGATASPDDQLRIDQLAR 104

Query:   104 KLEAVNDIKEPLKSNLLNGKWEXXXXXXXXXXXXKRPKFLRPNGKIYQAINIDTLRAQNI 163
             K+EAVN  KEPLKS+L+NGKWE            K+P+FLR     YQ+IN+DTL+ QN+
Sbjct:   105 KVEAVNPTKEPLKSDLVNGKWELIYTTSASILQAKKPRFLRSITN-YQSINVDTLKVQNM 163

Query:   164 ETWPFFNQV 172
             ETWPF+N V
Sbjct:   164 ETWPFYNSV 172




GO:0005198 "structural molecule activity" evidence=IEA;ISS
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IEA;IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0010287 "plastoglobule" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
TAIR|locus:2092165 AT3G26080 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query178
pfam04755196 pfam04755, PAP_fibrillin, PAP_fibrillin 4e-22
>gnl|CDD|218245 pfam04755, PAP_fibrillin, PAP_fibrillin Back     alignment and domain information
 Score = 87.9 bits (218), Expect = 4e-22
 Identities = 41/99 (41%), Positives = 57/99 (57%), Gaps = 5/99 (5%)

Query: 71  ESFKEELFQAIKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKS-NLLNGKWELLYT 129
           E  K +L  AI   +RG  AS +D+A ++    +LEA+N    P +S +LLNGKW LLYT
Sbjct: 1   ERLKRKLLDAIYGTNRGLRASSDDRAEIESAVTQLEALNPTPAPTESLDLLNGKWRLLYT 60

Query: 130 TSQ---SLLQTKRPKFLRPNGKIYQAINIDTLRAQNIET 165
           TS+    LL   R   L+  G+IYQ I+++ L   N  T
Sbjct: 61  TSKELLPLLARGRLPLLKV-GQIYQTIDVNNLTVYNSVT 98


This family identifies a conserved region found in a number of plastid lipid-associated proteins (PAPs), and in a number of putative fibrillin proteins. Length = 196

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 178
PF04755198 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 99.94
TIGR0211680 toxin_Txe_YoeB toxin-antitoxin system, toxin compo 81.58
>PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle Back     alignment and domain information
Probab=99.94  E-value=1.6e-26  Score=187.17  Aligned_cols=99  Identities=36%  Similarity=0.614  Sum_probs=90.8

Q ss_pred             HHHHHHHHHHhcccCCCCCCCHHHHHHHHHHHHHHHhcCCCCCCCcC-CCCCceEEEEEecccccccc---cCCCCcccc
Q 030383           71 ESFKEELFQAIKPLDRGAEASPEDQARVDQIARKLEAVNDIKEPLKS-NLLNGKWELLYTTSQSLLQT---KRPKFLRPN  146 (178)
Q Consensus        71 ~~lK~~LL~ai~gt~RG~~as~~~r~~IeelI~qLEalNPt~~P~~s-~lL~G~W~LvYTTs~~il~~---~rp~~l~~v  146 (178)
                      +++|++||+++++++||+.+++++|++|+++|++||++||+++|+++ ++|+|+|+|+|||+.++.+.   +++++ ..+
T Consensus         1 ~~~K~~Ll~~~~~~~rG~~~~~~~~~~i~~~v~~LE~~np~~~p~~s~~~L~G~W~Lvytt~~~~~~~l~~~~~~~-~~~   79 (198)
T PF04755_consen    1 QDLKQELLQAVAGTNRGLRASPEDREEIEELVEELEALNPTPDPADSLPLLDGRWELVYTTSPEIRSLLQRGRLPG-VRV   79 (198)
T ss_pred             ChHHHHHHHHHhccCCCccCCHHHHHHHHHHHHHHHHhCCCCCCcCCchhcCcEEEEEeecCCCcccccccccccc-ccc
Confidence            47899999999999999999999999999999999999999999998 99999999999999999954   45544 469


Q ss_pred             cceEEEEECCCCeeEEEEecccch
Q 030383          147 GKIYQAINIDTLRAQNIETWPFFN  170 (178)
Q Consensus       147 G~I~Q~ID~~~~~v~N~v~fpf~n  170 (178)
                      |+|||+||.+++++.|+|+|..|+
T Consensus        80 ~~v~Q~id~~~~~~~N~v~~~~~~  103 (198)
T PF04755_consen   80 GRVFQTIDADNGRVENVVELSGFP  103 (198)
T ss_pred             cceEEEEECCCceEEEEEEEeccC
Confidence            999999999999999999976655



The coat may contain receptors for attachment to the thylakoid membrane as well as regulatory proteins that may function in the transfer of lipids to and from the thylakoid membranes.). This entry also represents a number of putative fibrillin proteins.; GO: 0005198 structural molecule activity, 0009507 chloroplast

>TIGR02116 toxin_Txe_YoeB toxin-antitoxin system, toxin component, Txe/YoeB family Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00