Citrus Sinensis ID: 016802
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 382 | ||||||
| 224053000 | 399 | predicted protein [Populus trichocarpa] | 0.989 | 0.947 | 0.714 | 1e-160 | |
| 255574416 | 409 | structural molecule, putative [Ricinus c | 0.884 | 0.826 | 0.733 | 1e-154 | |
| 225435622 | 396 | PREDICTED: probable plastid-lipid-associ | 0.968 | 0.934 | 0.664 | 1e-144 | |
| 449448776 | 436 | PREDICTED: probable plastid-lipid-associ | 0.882 | 0.772 | 0.718 | 1e-143 | |
| 356576885 | 377 | PREDICTED: probable plastid-lipid-associ | 0.931 | 0.944 | 0.637 | 1e-139 | |
| 297847472 | 409 | plastid-lipid associated protein pap [Ar | 0.981 | 0.916 | 0.620 | 1e-137 | |
| 30694898 | 409 | putative plastid-lipid-associated protei | 0.976 | 0.911 | 0.598 | 1e-137 | |
| 255646338 | 377 | unknown [Glycine max] | 0.934 | 0.946 | 0.633 | 1e-135 | |
| 357445071 | 388 | hypothetical protein MTR_1g116320 [Medic | 0.952 | 0.938 | 0.620 | 1e-131 | |
| 388509688 | 380 | unknown [Lotus japonicus] | 0.929 | 0.934 | 0.629 | 1e-130 |
| >gi|224053000|ref|XP_002297657.1| predicted protein [Populus trichocarpa] gi|222844915|gb|EEE82462.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 288/403 (71%), Positives = 326/403 (80%), Gaps = 25/403 (6%)
Query: 1 MALKFHIT-NVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTD--QQQLA 57
MALKFH +++ + F+ S KL S +T+ V QSSL D QQQ++
Sbjct: 1 MALKFHAAIHLSLQLSPSPTLFAPSFC--KPQKLLKSSVKKTH--VCQSSLVDEQQQQIS 56
Query: 58 FTGEENQLIDALIGIQGRGRSASARQLN------------------TGSSLIEGRWQLMF 99
F +ENQLI+AL+GIQGRG+SAS +QLN TGS+LIEGRWQLMF
Sbjct: 57 FNEQENQLINALVGIQGRGKSASPQQLNEVGHAVKVLEGLEGVSEPTGSNLIEGRWQLMF 116
Query: 100 TTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGK 159
TTRPGTASPIQRTFVGV+ FSVFQE+ LRTNDPRVSNIVKFS AIGELKVEAAA+I++GK
Sbjct: 117 TTRPGTASPIQRTFVGVDFFSVFQEVYLRTNDPRVSNIVKFSNAIGELKVEAAATIENGK 176
Query: 160 RILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 219
RILFQFD+AAFSF FLPFK PYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV
Sbjct: 177 RILFQFDRAAFSFNFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 236
Query: 220 LQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSW 279
LQKKTEPRQ LLSAI TGT V +AINEFI NQ+ A++E EL++GEWQM+WSSQMETDSW
Sbjct: 237 LQKKTEPRQRLLSAIWTGTGVLEAINEFIKLNQNVAKDEMELIDGEWQMIWSSQMETDSW 296
Query: 280 IENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGF 339
IENAG GLMGKQIV KNGQ+KF VDILLG +FSMTGT+ KSS NTY+V MDDAAII G F
Sbjct: 297 IENAGRGLMGKQIVTKNGQLKFVVDILLGVRFSMTGTFVKSSPNTYDVKMDDAAIIGGMF 356
Query: 340 GYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK 382
G P++METKINL+LLYSDDK+RISRGY NI+FVH RTDG+ QK
Sbjct: 357 GLPVEMETKINLELLYSDDKIRISRGYKNIVFVHARTDGTRQK 399
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255574416|ref|XP_002528121.1| structural molecule, putative [Ricinus communis] gi|223532460|gb|EEF34251.1| structural molecule, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|225435622|ref|XP_002283329.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Vitis vinifera] gi|297746405|emb|CBI16461.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449448776|ref|XP_004142141.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Cucumis sativus] gi|449503596|ref|XP_004162081.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356576885|ref|XP_003556560.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297847472|ref|XP_002891617.1| plastid-lipid associated protein pap [Arabidopsis lyrata subsp. lyrata] gi|297337459|gb|EFH67876.1| plastid-lipid associated protein pap [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|30694898|ref|NP_175522.2| putative plastid-lipid-associated protein 12 [Arabidopsis thaliana] gi|75154826|sp|Q8LAP6.1|PAP12_ARATH RecName: Full=Probable plastid-lipid-associated protein 12, chloroplastic; AltName: Full=Fibrillin-12; Flags: Precursor gi|21593276|gb|AAM65225.1| unknown [Arabidopsis thaliana] gi|26452116|dbj|BAC43147.1| unknown protein [Arabidopsis thaliana] gi|28950975|gb|AAO63411.1| At1g51115 [Arabidopsis thaliana] gi|332194502|gb|AEE32623.1| putative plastid-lipid-associated protein 12 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|255646338|gb|ACU23652.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357445071|ref|XP_003592813.1| hypothetical protein MTR_1g116320 [Medicago truncatula] gi|355481861|gb|AES63064.1| hypothetical protein MTR_1g116320 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|388509688|gb|AFK42910.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 382 | ||||||
| TAIR|locus:2026052 | 409 | AT1G51110 [Arabidopsis thalian | 0.759 | 0.709 | 0.694 | 4.6e-120 | |
| TAIR|locus:2132006 | 310 | AT4G22240 [Arabidopsis thalian | 0.112 | 0.138 | 0.488 | 5.3e-05 |
| TAIR|locus:2026052 AT1G51110 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1078 (384.5 bits), Expect = 4.6e-120, Sum P(2) = 4.6e-120
Identities = 202/291 (69%), Positives = 245/291 (84%)
Query: 86 TGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLR-TNDPRVSNIVKFSEAI 144
T S LIEGRW+LMFTTRPGTASPIQRTF GV+ F+VFQ++ L+ TNDPRVSNIVKFS+ I
Sbjct: 117 TDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRVSNIVKFSDFI 176
Query: 145 GELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPS 204
GELKVEA ASIKDGKR+LF+FD+AAF KFLPFK PYPVPFRLLGDEAKGWLDTTYLSPS
Sbjct: 177 GELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPS 236
Query: 205 GNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAXXXXXXXXX 264
GNLRISRGNKGTTFVLQK+T PRQ LL+ IS V +AI+EF++SN ++A
Sbjct: 237 GNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNSAEDNYELLEG 296
Query: 265 XWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNT 324
WQM+WSSQM TDSWIENA NGLMG+QI++K+G++KFEV+I+ F+FSM G + KS ++T
Sbjct: 297 SWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMKGKFIKSESST 356
Query: 325 YNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLR 375
Y++ MDDAAII G FGYP+ + I L++LY+D+KMRISRG++NI+FVH+R
Sbjct: 357 YDLKMDDAAIIGGAFGYPVDITNNIELKILYTDEKMRISRGFDNIIFVHIR 407
|
|
| TAIR|locus:2132006 AT4G22240 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| gw1.I.6685.1 | hypothetical protein (348 aa) | ||||||||||
(Populus trichocarpa) | |||||||||||
| estExt_Genewise1_v1.C_LG_X6026 | • | • | 0.526 | ||||||||
| gw1.III.1875.1 | • | • | 0.441 | ||||||||
| estExt_fgenesh4_pg.C_1520028 | • | • | 0.428 | ||||||||
| estExt_Genewise1_v1.C_LG_III1082 | • | 0.408 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 382 | |||
| pfam04755 | 196 | pfam04755, PAP_fibrillin, PAP_fibrillin | 2e-37 |
| >gnl|CDD|218245 pfam04755, PAP_fibrillin, PAP_fibrillin | Back alignment and domain information |
|---|
Score = 133 bits (337), Expect = 2e-37
Identities = 57/195 (29%), Positives = 83/195 (42%), Gaps = 43/195 (22%)
Query: 64 QLIDALIGIQGRGRSASARQLNTGSSLIE-------------------GRWQLMFTTRPG 104
+L+DA+ G RG AS+ S + G+W+L++TT
Sbjct: 6 KLLDAIYGTN-RGLRASSDDRAEIESAVTQLEALNPTPAPTESLDLLNGKWRLLYTTSKE 64
Query: 105 TASPIQRTFVG-VETFSVFQEISLRTNDPRVSNIVKFSE--AIGELKVEAAASIKDGKRI 161
+ R + ++ ++Q I N+ V N V FS A G V A I+ KR+
Sbjct: 65 LLPLLARGRLPLLKVGQIYQTI--DVNNLTVYNSVTFSGPLAEGSFSVRAKFEIRSPKRV 122
Query: 162 LFQFDKAAFSFKFLPFKF----------------PYPVPFRLLGDEAKGWLDTTYLSPSG 205
+F++ L P+PF L GD AKGWL+TTYL
Sbjct: 123 QIRFERGVLGTPQLLKGSLTPLQDTASNIRGISSQLPLPFPLSGDRAKGWLETTYLDE-- 180
Query: 206 NLRISRGNKGTTFVL 220
+LRISRG+ G+ FVL
Sbjct: 181 DLRISRGDGGSLFVL 195
|
This family identifies a conserved region found in a number of plastid lipid-associated proteins (PAPs), and in a number of putative fibrillin proteins. Length = 196 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 382 | |||
| PF04755 | 198 | PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 | 100.0 | |
| PF04755 | 198 | PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 | 99.97 |
| >PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle | Back alignment and domain information |
|---|
Probab=100.00 E-value=8.1e-33 Score=254.02 Aligned_cols=158 Identities=40% Similarity=0.677 Sum_probs=133.8
Q ss_pred hHHHHHHHHHhcccCCCccccChhhhc------------------cCC-CcccceEEEEEecCCCCCCcccc-cccCcee
Q 016802 59 TGEENQLIDALIGIQGRGRSASARQLN------------------TGS-SLIEGRWQLMFTTRPGTASPIQR-TFVGVET 118 (382)
Q Consensus 59 ~~~K~~LL~ai~g~~~RG~~as~e~~~------------------~~~-~lL~G~W~LiYTS~~~~~sp~~~-~~~gl~~ 118 (382)
+++|++||++++++ +||+.+++++++ .++ ++|+|+|+|+|||+++..++++. ..+++..
T Consensus 1 ~~~K~~Ll~~~~~~-~rG~~~~~~~~~~i~~~v~~LE~~np~~~p~~s~~~L~G~W~Lvytt~~~~~~~l~~~~~~~~~~ 79 (198)
T PF04755_consen 1 QDLKQELLQAVAGT-NRGLRASPEDREEIEELVEELEALNPTPDPADSLPLLDGRWELVYTTSPEIRSLLQRGRLPGVRV 79 (198)
T ss_pred ChHHHHHHHHHhcc-CCCccCCHHHHHHHHHHHHHHHHhCCCCCCcCCchhcCcEEEEEeecCCCccccccccccccccc
Confidence 47999999999999 999999999986 334 89999999999999998877664 4567888
Q ss_pred eeeEEEEEeeCCCCcEEEEEEeec---cceEEEEEEEEEEeCCcEEEEEEEeeEeeeccc-----------------ccc
Q 016802 119 FSVFQEISLRTNDPRVSNIVKFSE---AIGELKVEAAASIKDGKRILFQFDKAAFSFKFL-----------------PFK 178 (382)
Q Consensus 119 ~~i~Q~I~id~~~~~v~NvV~f~~---~~g~l~V~A~~~v~~~~rv~V~F~~~~v~~~~~-----------------~~~ 178 (382)
++|||.| |.+++++.|+|+|.. ..|.+.+.|.+++.+++|+.|+|+++.+.++.. ...
T Consensus 80 ~~v~Q~i--d~~~~~~~N~v~~~~~~~~~~~~~v~a~~~~~~~~rv~v~f~~~~l~~~~~l~~~l~~~~~~~~~v~~~~~ 157 (198)
T PF04755_consen 80 GRVFQTI--DADNGRVENVVELSGFPLLEGSVSVRASLEVRSPRRVEVTFERASLKPPSLLKGVLGPLKDALNNVPRGIS 157 (198)
T ss_pred cceEEEE--ECCCceEEEEEEEeccCceEEEEEEEEEEEEccccEEEEEEEeeEEcccceeeccchhhhhhhhhcccccc
Confidence 9999999 889999999999776 367899999999999999999999999865210 112
Q ss_pred cCCCccccccCCCcceEEEEEEecCCCceEEEeCCCCcEEEEe
Q 016802 179 FPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQ 221 (382)
Q Consensus 179 ~P~p~~f~l~~~~~~Gwld~TYLDe~~dlRIsRG~kG~vFVl~ 221 (382)
..+|+++++.+..++|||||||||| ||||+||++|++|||.
T Consensus 158 ~~~~~~~~~~~~~~~g~l~~tYLDe--dlRI~Rg~~G~~fVl~ 198 (198)
T PF04755_consen 158 DELPVPLPLPGGSPKGWLDTTYLDE--DLRISRGNKGSLFVLK 198 (198)
T ss_pred cccccccccCCCCCceEEEEEEECC--CeEEEEcCCCCEEEeC
Confidence 2345556666678999999999999 9999999999999984
|
The coat may contain receptors for attachment to the thylakoid membrane as well as regulatory proteins that may function in the transfer of lipids to and from the thylakoid membranes.). This entry also represents a number of putative fibrillin proteins.; GO: 0005198 structural molecule activity, 0009507 chloroplast |
| >PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 382 | |||
| 4gzv_A | 142 | Hypothetical protein; AN eight-stranded beta barre | 83.47 |
| >4gzv_A Hypothetical protein; AN eight-stranded beta barrel, lipocalin family, structural joint center for structural genomics, JCSG; 1.95A {Bacteroides ovatus} | Back alignment and structure |
|---|
Probab=83.47 E-value=12 Score=32.42 Aligned_cols=72 Identities=15% Similarity=0.178 Sum_probs=46.5
Q ss_pred ccccccceeEEEE-EecccCcchhhhhhcCCCCCeeEEccCCceEEEEEecCCe-EEEEEEEEEEeCCcEEEEEEee
Q 016802 257 EERELLEGEWQML-WSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGF-KFSMTGTYAKSSTNTYNVTMDD 331 (382)
Q Consensus 257 ~~~~lL~G~W~L~-ySS~~e~~~~l~~~~~gl~~~Q~Id~~~~~~N~v~~~~~~-~~~v~a~~e~~s~~rv~V~F~~ 331 (382)
-.+.-|.|.|+|. |.++....+.--+. ...+-||.++|++.|++-..... -....++|+..|+..+.=..++
T Consensus 7 ~~~~~L~GvWQ~c~y~~~~~~~~~~l~~---s~~~KIlS~DgtF~N~~m~~~~~aiIt~yGtY~l~SD~~Y~E~i~k 80 (142)
T 4gzv_A 7 FKAADLKGIWQLCHYVSESPDVPGALKP---SNTFKVLSDDGRIVNFTIIPGADAIITGYGTYKQLTDDSYKESIEK 80 (142)
T ss_dssp GGSCCCCEEEEEEEEEESSTTSCCEEEE---EEEEEEECTTSEEEEEECCTTSCCEEEEEEEEEEEETTEEEEEEEE
T ss_pred cccccceEEeEEeeEeecCCCCCceEee---cCceEEECCCCcEEEEEEeCCCCcEEEEeEEEEecCCCeeeeeecc
Confidence 3456799999997 55554322111011 11378899999999998544333 3467889999998876554444
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00