Citrus Sinensis ID: 015769
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 400 | ||||||
| 224053000 | 399 | predicted protein [Populus trichocarpa] | 0.99 | 0.992 | 0.744 | 1e-170 | |
| 255574416 | 409 | structural molecule, putative [Ricinus c | 0.89 | 0.870 | 0.775 | 1e-166 | |
| 225435622 | 396 | PREDICTED: probable plastid-lipid-associ | 0.97 | 0.979 | 0.699 | 1e-155 | |
| 449448776 | 436 | PREDICTED: probable plastid-lipid-associ | 0.887 | 0.814 | 0.754 | 1e-154 | |
| 356576885 | 377 | PREDICTED: probable plastid-lipid-associ | 0.935 | 0.992 | 0.670 | 1e-150 | |
| 297847472 | 409 | plastid-lipid associated protein pap [Ar | 0.982 | 0.960 | 0.650 | 1e-148 | |
| 30694898 | 409 | putative plastid-lipid-associated protei | 0.977 | 0.955 | 0.629 | 1e-148 | |
| 255646338 | 377 | unknown [Glycine max] | 0.857 | 0.909 | 0.728 | 1e-146 | |
| 388509688 | 380 | unknown [Lotus japonicus] | 0.932 | 0.981 | 0.656 | 1e-140 | |
| 357445071 | 388 | hypothetical protein MTR_1g116320 [Medic | 0.955 | 0.984 | 0.643 | 1e-140 |
| >gi|224053000|ref|XP_002297657.1| predicted protein [Populus trichocarpa] gi|222844915|gb|EEE82462.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 603 bits (1554), Expect = e-170, Method: Compositional matrix adjust.
Identities = 300/403 (74%), Positives = 341/403 (84%), Gaps = 7/403 (1%)
Query: 1 MALKFHIT-NVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTD--QQQLA 57
MALKFH +++ + F+ S KL S +T+ V QSSL D QQQ++
Sbjct: 1 MALKFHAAIHLSLQLSPSPTLFAPSFC--KPQKLLKSSVKKTH--VCQSSLVDEQQQQIS 56
Query: 58 FTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMF 117
F +ENQLI+AL+GIQGRG+SAS +QLN+V AVKVLEGL+GV +PTGS+LIEGRWQLMF
Sbjct: 57 FNEQENQLINALVGIQGRGKSASPQQLNEVGHAVKVLEGLEGVSEPTGSNLIEGRWQLMF 116
Query: 118 TTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGK 177
TTRPGTASPIQRTFVGV+ FSVFQE+ LRTNDPRVSNIVKFS AIGELKVEAAA+I++GK
Sbjct: 117 TTRPGTASPIQRTFVGVDFFSVFQEVYLRTNDPRVSNIVKFSNAIGELKVEAAATIENGK 176
Query: 178 RILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 237
RILFQFD+AAFSF FLPFK PYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV
Sbjct: 177 RILFQFDRAAFSFNFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 236
Query: 238 LQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSW 297
LQKKTEPRQ LLSAI TGT V +AINEFI NQ+ A++E EL++GEWQM+WSSQMETDSW
Sbjct: 237 LQKKTEPRQRLLSAIWTGTGVLEAINEFIKLNQNVAKDEMELIDGEWQMIWSSQMETDSW 296
Query: 298 IENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGF 357
IENAG GLMGKQIV KNGQ+KF VDILLG +FSMTGT+ KSS NTY+V MDDAAII G F
Sbjct: 297 IENAGRGLMGKQIVTKNGQLKFVVDILLGVRFSMTGTFVKSSPNTYDVKMDDAAIIGGMF 356
Query: 358 GYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK 400
G P++METKINL+LLYSDDK+RISRGY NI+FVH RTDG+ QK
Sbjct: 357 GLPVEMETKINLELLYSDDKIRISRGYKNIVFVHARTDGTRQK 399
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255574416|ref|XP_002528121.1| structural molecule, putative [Ricinus communis] gi|223532460|gb|EEF34251.1| structural molecule, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|225435622|ref|XP_002283329.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Vitis vinifera] gi|297746405|emb|CBI16461.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449448776|ref|XP_004142141.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Cucumis sativus] gi|449503596|ref|XP_004162081.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356576885|ref|XP_003556560.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297847472|ref|XP_002891617.1| plastid-lipid associated protein pap [Arabidopsis lyrata subsp. lyrata] gi|297337459|gb|EFH67876.1| plastid-lipid associated protein pap [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|30694898|ref|NP_175522.2| putative plastid-lipid-associated protein 12 [Arabidopsis thaliana] gi|75154826|sp|Q8LAP6.1|PAP12_ARATH RecName: Full=Probable plastid-lipid-associated protein 12, chloroplastic; AltName: Full=Fibrillin-12; Flags: Precursor gi|21593276|gb|AAM65225.1| unknown [Arabidopsis thaliana] gi|26452116|dbj|BAC43147.1| unknown protein [Arabidopsis thaliana] gi|28950975|gb|AAO63411.1| At1g51115 [Arabidopsis thaliana] gi|332194502|gb|AEE32623.1| putative plastid-lipid-associated protein 12 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|255646338|gb|ACU23652.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|388509688|gb|AFK42910.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|357445071|ref|XP_003592813.1| hypothetical protein MTR_1g116320 [Medicago truncatula] gi|355481861|gb|AES63064.1| hypothetical protein MTR_1g116320 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 400 | ||||||
| TAIR|locus:2026052 | 409 | AT1G51110 [Arabidopsis thalian | 0.895 | 0.875 | 0.675 | 4e-129 | |
| TAIR|locus:2132006 | 310 | AT4G22240 [Arabidopsis thalian | 0.402 | 0.519 | 0.235 | 1.9e-09 | |
| UNIPROTKB|P80471 | 326 | P80471 "Light-induced protein, | 0.107 | 0.131 | 0.466 | 1.7e-07 | |
| UNIPROTKB|O99019 | 326 | O99019 "Light-induced protein, | 0.357 | 0.438 | 0.259 | 6e-07 | |
| TAIR|locus:2136627 | 318 | FIB "fibrillin" [Arabidopsis t | 0.1 | 0.125 | 0.476 | 4.1e-06 | |
| TAIR|locus:2178133 | 273 | AT5G09820 [Arabidopsis thalian | 0.462 | 0.677 | 0.248 | 2.6e-05 | |
| TAIR|locus:2092150 | 242 | AT3G26070 [Arabidopsis thalian | 0.247 | 0.409 | 0.256 | 0.00014 |
| TAIR|locus:2026052 AT1G51110 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1267 (451.1 bits), Expect = 4.0e-129, P = 4.0e-129
Identities = 244/361 (67%), Positives = 298/361 (82%)
Query: 35 NCSQNRTNGLVAQSSLTDQ-QQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKV 93
NC R + + S++TDQ QQ +F E +LIDALIGIQGRG+SAS +QLNDVE AVKV
Sbjct: 48 NCRTLRIS-CSSSSTVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKV 106
Query: 94 LEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLR-TNDPRV 152
LEGL+G+ +PT S LIEGRW+LMFTTRPGTASPIQRTF GV+ F+VFQ++ L+ TNDPRV
Sbjct: 107 LEGLEGIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRV 166
Query: 153 SNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKG 212
SNIVKFS+ IGELKVEA ASIKDGKR+LF+FD+AAF KFLPFK PYPVPFRLLGDEAKG
Sbjct: 167 SNIVKFSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKG 226
Query: 213 WLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQST 272
WLDTTYLSPSGNLRISRGNKGTTFVLQK+T PRQ LL+ IS V +AI+EF++SN ++
Sbjct: 227 WLDTTYLSPSGNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNS 286
Query: 273 AXXXXXXXXXXWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMT 332
A WQM+WSSQM TDSWIENA NGLMG+QI++K+G++KFEV+I+ F+FSM
Sbjct: 287 AEDNYELLEGSWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMK 346
Query: 333 GTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHL 392
G + KS ++TY++ MDDAAII G FGYP+ + I L++LY+D+KMRISRG++NI+FVH+
Sbjct: 347 GKFIKSESSTYDLKMDDAAIIGGAFGYPVDITNNIELKILYTDEKMRISRGFDNIIFVHI 406
Query: 393 R 393
R
Sbjct: 407 R 407
|
|
| TAIR|locus:2132006 AT4G22240 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|P80471 P80471 "Light-induced protein, chloroplastic" [Solanum tuberosum (taxid:4113)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|O99019 O99019 "Light-induced protein, chloroplastic" [Solanum demissum (taxid:50514)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2136627 FIB "fibrillin" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2178133 AT5G09820 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2092150 AT3G26070 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 400 | |||
| pfam04755 | 196 | pfam04755, PAP_fibrillin, PAP_fibrillin | 2e-48 |
| >gnl|CDD|218245 pfam04755, PAP_fibrillin, PAP_fibrillin | Back alignment and domain information |
|---|
Score = 162 bits (412), Expect = 2e-48
Identities = 67/195 (34%), Positives = 95/195 (48%), Gaps = 25/195 (12%)
Query: 64 QLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGS-SLIEGRWQLMFTTRPG 122
+L+DA+ G RG AS+ ++E AV LE L P PT S L+ G+W+L++TT
Sbjct: 6 KLLDAIYGTN-RGLRASSDDRAEIESAVTQLEALNPTPAPTESLDLLNGKWRLLYTTSKE 64
Query: 123 TASPIQRTFVG-VETFSVFQEISLRTNDPRVSNIVKFSE--AIGELKVEAAASIKDGKRI 179
+ R + ++ ++Q I N+ V N V FS A G V A I+ KR+
Sbjct: 65 LLPLLARGRLPLLKVGQIYQTI--DVNNLTVYNSVTFSGPLAEGSFSVRAKFEIRSPKRV 122
Query: 180 LFQFDKAAFSFKFLPFKF----------------PYPVPFRLLGDEAKGWLDTTYLSPSG 223
+F++ L P+PF L GD AKGWL+TTYL
Sbjct: 123 QIRFERGVLGTPQLLKGSLTPLQDTASNIRGISSQLPLPFPLSGDRAKGWLETTYLDE-- 180
Query: 224 NLRISRGNKGTTFVL 238
+LRISRG+ G+ FVL
Sbjct: 181 DLRISRGDGGSLFVL 195
|
This family identifies a conserved region found in a number of plastid lipid-associated proteins (PAPs), and in a number of putative fibrillin proteins. Length = 196 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 400 | |||
| PF04755 | 198 | PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 | 100.0 | |
| PF04755 | 198 | PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 | 99.97 |
| >PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.9e-37 Score=287.11 Aligned_cols=176 Identities=40% Similarity=0.681 Sum_probs=152.3
Q ss_pred hHHHHHHHHHHhcccCCCCCCChhhHHHHHHHHHHHHhcCCCCCCCCC-CCcceEEEEEEecCCCCCCccccc-ccCcee
Q 015769 59 TGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGS-SLIEGRWQLMFTTRPGTASPIQRT-FVGVET 136 (400)
Q Consensus 59 ~~~K~~LL~al~~t~~rG~~as~~~r~~I~~lI~~LEalnpt~~P~~s-~lL~G~W~LvyTt~~~t~sp~~r~-~~Gv~~ 136 (400)
.++|++||++++++ +||..++++++++|+++|++||++||+++|+++ ++|+|+|+|+|||++++.+.+++. .+++..
T Consensus 1 ~~~K~~Ll~~~~~~-~rG~~~~~~~~~~i~~~v~~LE~~np~~~p~~s~~~L~G~W~Lvytt~~~~~~~l~~~~~~~~~~ 79 (198)
T PF04755_consen 1 QDLKQELLQAVAGT-NRGLRASPEDREEIEELVEELEALNPTPDPADSLPLLDGRWELVYTTSPEIRSLLQRGRLPGVRV 79 (198)
T ss_pred ChHHHHHHHHHhcc-CCCccCCHHHHHHHHHHHHHHHHhCCCCCCcCCchhcCcEEEEEeecCCCccccccccccccccc
Confidence 36899999999999 899999999999999999999999999999998 999999999999999988776653 456777
Q ss_pred eeeeEEEEEeCCCCcEEEEEEEec---cceeEEEEEEEEEecCCEEEEEEEeeeeeeccc-----------------ccc
Q 015769 137 FSVFQEISLRTNDPRVSNIVKFSE---AIGELKVEAAASIKDGKRILFQFDKAAFSFKFL-----------------PFK 196 (400)
Q Consensus 137 ~~v~Q~i~ld~~~~~v~N~V~f~~---~~g~l~V~a~~~~~~~~rv~v~F~~~~l~~~~~-----------------~f~ 196 (400)
+++||.|+ .+++++.|+|+|.. ..|.+.+.+.+++.++.|+.++|+++.+.++.. ...
T Consensus 80 ~~v~Q~id--~~~~~~~N~v~~~~~~~~~~~~~v~a~~~~~~~~rv~v~f~~~~l~~~~~l~~~l~~~~~~~~~v~~~~~ 157 (198)
T PF04755_consen 80 GRVFQTID--ADNGRVENVVELSGFPLLEGSVSVRASLEVRSPRRVEVTFERASLKPPSLLKGVLGPLKDALNNVPRGIS 157 (198)
T ss_pred cceEEEEE--CCCceEEEEEEEeccCceEEEEEEEEEEEEccccEEEEEEEeeEEcccceeeccchhhhhhhhhcccccc
Confidence 89999984 56799999999765 357899999999999999999999998865211 112
Q ss_pred cCCCccccccCCCcceEEEEEEecCCCceEEEeCCCCcEEEEe
Q 015769 197 FPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQ 239 (400)
Q Consensus 197 ~p~P~~f~l~g~~~~Gwld~tYLD~~~dlRIsRG~kG~~FVl~ 239 (400)
..+|+++++.+..++|||||||||+ ||||+||++|++|||.
T Consensus 158 ~~~~~~~~~~~~~~~g~l~~tYLDe--dlRI~Rg~~G~~fVl~ 198 (198)
T PF04755_consen 158 DELPVPLPLPGGSPKGWLDTTYLDE--DLRISRGNKGSLFVLK 198 (198)
T ss_pred cccccccccCCCCCceEEEEEEECC--CeEEEEcCCCCEEEeC
Confidence 2355666666678999999999999 9999999999999984
|
The coat may contain receptors for attachment to the thylakoid membrane as well as regulatory proteins that may function in the transfer of lipids to and from the thylakoid membranes.). This entry also represents a number of putative fibrillin proteins.; GO: 0005198 structural molecule activity, 0009507 chloroplast |
| >PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 400 | |||
| 4gzv_A | 142 | Hypothetical protein; AN eight-stranded beta barre | 84.59 |
| >4gzv_A Hypothetical protein; AN eight-stranded beta barrel, lipocalin family, structural joint center for structural genomics, JCSG; 1.95A {Bacteroides ovatus} | Back alignment and structure |
|---|
Probab=84.59 E-value=8 Score=33.39 Aligned_cols=71 Identities=15% Similarity=0.209 Sum_probs=46.6
Q ss_pred cccccceeEEEE-EecccCchhhhhhhcCCCCCeeEEccCCceEEEEEeccC-eEEEEEEEEEEeCCcEEEEEEee
Q 015769 276 ERELLEGEWQML-WSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLG-FKFSMTGTYAKSSTNTYNVTMDD 349 (400)
Q Consensus 276 ~~~lL~G~W~L~-yts~~e~~~~l~~~~~gl~~~Q~Id~~~~~~N~v~~~~~-~~~~~~a~~~~~s~~rv~V~F~~ 349 (400)
.+.-|.|.|+|. |.++......--+. ...+=||.+++++.|++-.... ......++|+..|+..+.=..++
T Consensus 8 ~~~~L~GvWQ~c~y~~~~~~~~~~l~~---s~~~KIlS~DgtF~N~~m~~~~~aiIt~yGtY~l~SD~~Y~E~i~k 80 (142)
T 4gzv_A 8 KAADLKGIWQLCHYVSESPDVPGALKP---SNTFKVLSDDGRIVNFTIIPGADAIITGYGTYKQLTDDSYKESIEK 80 (142)
T ss_dssp GSCCCCEEEEEEEEEESSTTSCCEEEE---EEEEEEECTTSEEEEEECCTTSCCEEEEEEEEEEEETTEEEEEEEE
T ss_pred ccccceEEeEEeeEeecCCCCCceEee---cCceEEECCCCcEEEEEEeCCCCcEEEEeEEEEecCCCeeeeeecc
Confidence 456799999987 55554332211011 1137889899999999865433 33577999999998876544443
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00