Citrus Sinensis ID: 015769


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400
MALKFHITNVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTDQQQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK
cccccccccccccccccccccccccccccccccccccccccccEEEcccccHHHHccHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHcccccccccccccccEEEEEEEcccccccccccccccccccEEEEEEEEEcccccEEEEEEcccccEEEEEEEEEEEEcccEEEEEEEEEEEEEcccccccccccccccccccccEEEEEEEEcccccEEEEEcccccEEEEEccccHHHHHHHHHcccHHHHHHHHHHHHccccccHHHHHHccccEEEEEEEccccccHHHHcccccccEEEEcccccEEEEEEEcccEEEEEEEEEEEccccEEEEEEEEcEEEEcccccccccccEEEEEEEEEEccEEEEEcccccEEEEEEccccccc
ccEEEEEEcccccccccccccccccccccccccccccEEEccEEEEccccccccEEcccHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHccccccccccHHccccEEEEEEccccccccccccHcccccccEEEEEEEEccccEEEEEEEccccccEEEEEEEEEEccccEEEEEEEEEEEccccccccccccccccccccccccEEEEEEEcccccEEEEccccccEEEEEccccHHHHHHHHHHccccHHHHHHHHHHHccccccccHHHHcccEEEEEcccccccHHHHHHHccccccEEEEccccEEEEEEccccEEEEEEEEEEEccccEEEEEcccEEEEEccccccEcccccEEEEEEEEcccEEEEEccccEEEEEEEccccccc
malkfhitnvnfhfcacsssfsrstlafsssklsncsqnrtnglvaqssltdQQQLAFTGEENQLIDALIGIqgrgrsasarqLNDVECAVKVLEglqgvpdptgsslieGRWQlmfttrpgtaspiqrtfvgvETFSVFQEIslrtndprvsniVKFSEAIGELKVEAAASIKDGKRILFQFDKaafsfkflpfkfpypvpfrllgdeakgwldttylspsgnlrisrgnkgttfvlqkkteprQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDswienagnglmGKQIVKKNGQMKFEVDILLgfkfsmtgtyaksstntynvtmddaaiicggfgypikmETKINLQllysddkmrisrGYNNILFVHLrtdgsnqk
MALKFHITNVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTDQQQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEglqgvpdptgsSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQeislrtndprvsnIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTylspsgnlrisrgnkgttfvlqkkteprqTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFvhlrtdgsnqk
MALKFHITNVNFHFcacsssfsrstlafsssklsNCSQNRTNGLVAQSSLTDQQQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAeeerellegeWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK
**LKFHITNVNFHFCACSSSF***********************************AFTGEENQLIDALIGIQGRGR*ASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVL*****************************************LEGEWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLR*******
*****************************************************************IDALIGIQ************DVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPG***PIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLL*DEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQT*********Q*EQAI********************EWQMLWSS*M*******NAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVH**T******
MALKFHITNVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTDQQQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSN*********LLEGEWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK
*ALKFHITNVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTDQQQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTD*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MALKFHITNVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTDQQQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query400 2.2.26 [Sep-21-2011]
Q8LAP6409 Probable plastid-lipid-as yes no 0.977 0.955 0.629 1e-150
Q94KU6319 Plastid lipid-associated N/A no 0.435 0.545 0.269 3e-06
O49629310 Probable plastid-lipid-as no no 0.435 0.561 0.260 1e-05
P80471326 Light-induced protein, ch N/A no 0.44 0.539 0.264 2e-05
O99019326 Light-induced protein, ch N/A no 0.44 0.539 0.259 6e-05
Q9LU85242 Probable plastid-lipid-as no no 0.387 0.640 0.243 0.0001
Q6K439319 Probable plastid-lipid-as no no 0.422 0.529 0.271 0.0003
>sp|Q8LAP6|PAP12_ARATH Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thaliana GN=PAP12 PE=2 SV=1 Back     alignment and function desciption
 Score =  530 bits (1365), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 257/408 (62%), Positives = 322/408 (78%), Gaps = 17/408 (4%)

Query: 1   MALKFHITNVNFHFCACSSSFSRSTLAFSSSKLS-------------NCSQNRTNGLVAQ 47
           +A++F+   ++   C C    S  +L+  S + +             NC   R +   + 
Sbjct: 2   VAVRFYAVEMSLP-CLCPCPSSPISLSLCSPRFNLLNTTSRRLGLSRNCRTLRIS-CSSS 59

Query: 48  SSLTDQ-QQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGS 106
           S++TDQ QQ +F   E +LIDALIGIQGRG+SAS +QLNDVE AVKVLEGL+G+ +PT S
Sbjct: 60  STVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKVLEGLEGIQNPTDS 119

Query: 107 SLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLR-TNDPRVSNIVKFSEAIGEL 165
            LIEGRW+LMFTTRPGTASPIQRTF GV+ F+VFQ++ L+ TNDPRVSNIVKFS+ IGEL
Sbjct: 120 DLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRVSNIVKFSDFIGEL 179

Query: 166 KVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNL 225
           KVEA ASIKDGKR+LF+FD+AAF  KFLPFK PYPVPFRLLGDEAKGWLDTTYLSPSGNL
Sbjct: 180 KVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNL 239

Query: 226 RISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQ 285
           RISRGNKGTTFVLQK+T PRQ LL+ IS    V +AI+EF++SN ++AE+  ELLEG WQ
Sbjct: 240 RISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNSAEDNYELLEGSWQ 299

Query: 286 MLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNV 345
           M+WSSQM TDSWIENA NGLMG+QI++K+G++KFEV+I+  F+FSM G + KS ++TY++
Sbjct: 300 MIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMKGKFIKSESSTYDL 359

Query: 346 TMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLR 393
            MDDAAII G FGYP+ +   I L++LY+D+KMRISRG++NI+FVH+R
Sbjct: 360 KMDDAAIIGGAFGYPVDITNNIELKILYTDEKMRISRGFDNIIFVHIR 407





Arabidopsis thaliana (taxid: 3702)
>sp|Q94KU6|PAP2_BRACM Plastid lipid-associated protein 2, chloroplastic OS=Brassica campestris GN=PAP2 PE=1 SV=1 Back     alignment and function description
>sp|O49629|PAP2_ARATH Probable plastid-lipid-associated protein 2, chloroplastic OS=Arabidopsis thaliana GN=PAP2 PE=1 SV=1 Back     alignment and function description
>sp|P80471|LIPC_SOLTU Light-induced protein, chloroplastic OS=Solanum tuberosum PE=1 SV=2 Back     alignment and function description
>sp|O99019|LIPC_SOLDE Light-induced protein, chloroplastic OS=Solanum demissum PE=1 SV=1 Back     alignment and function description
>sp|Q9LU85|PAP4_ARATH Probable plastid-lipid-associated protein 4, chloroplastic OS=Arabidopsis thaliana GN=PAP4 PE=1 SV=1 Back     alignment and function description
>sp|Q6K439|PAP2_ORYSJ Probable plastid-lipid-associated protein 2, chloroplastic OS=Oryza sativa subsp. japonica GN=PAP2 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query400
224053000399 predicted protein [Populus trichocarpa] 0.99 0.992 0.744 1e-170
255574416409 structural molecule, putative [Ricinus c 0.89 0.870 0.775 1e-166
225435622396 PREDICTED: probable plastid-lipid-associ 0.97 0.979 0.699 1e-155
449448776436 PREDICTED: probable plastid-lipid-associ 0.887 0.814 0.754 1e-154
356576885377 PREDICTED: probable plastid-lipid-associ 0.935 0.992 0.670 1e-150
297847472409 plastid-lipid associated protein pap [Ar 0.982 0.960 0.650 1e-148
30694898409 putative plastid-lipid-associated protei 0.977 0.955 0.629 1e-148
255646338377 unknown [Glycine max] 0.857 0.909 0.728 1e-146
388509688380 unknown [Lotus japonicus] 0.932 0.981 0.656 1e-140
357445071388 hypothetical protein MTR_1g116320 [Medic 0.955 0.984 0.643 1e-140
>gi|224053000|ref|XP_002297657.1| predicted protein [Populus trichocarpa] gi|222844915|gb|EEE82462.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  603 bits (1554), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 300/403 (74%), Positives = 341/403 (84%), Gaps = 7/403 (1%)

Query: 1   MALKFHIT-NVNFHFCACSSSFSRSTLAFSSSKLSNCSQNRTNGLVAQSSLTD--QQQLA 57
           MALKFH   +++       + F+ S       KL   S  +T+  V QSSL D  QQQ++
Sbjct: 1   MALKFHAAIHLSLQLSPSPTLFAPSFC--KPQKLLKSSVKKTH--VCQSSLVDEQQQQIS 56

Query: 58  FTGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGSSLIEGRWQLMF 117
           F  +ENQLI+AL+GIQGRG+SAS +QLN+V  AVKVLEGL+GV +PTGS+LIEGRWQLMF
Sbjct: 57  FNEQENQLINALVGIQGRGKSASPQQLNEVGHAVKVLEGLEGVSEPTGSNLIEGRWQLMF 116

Query: 118 TTRPGTASPIQRTFVGVETFSVFQEISLRTNDPRVSNIVKFSEAIGELKVEAAASIKDGK 177
           TTRPGTASPIQRTFVGV+ FSVFQE+ LRTNDPRVSNIVKFS AIGELKVEAAA+I++GK
Sbjct: 117 TTRPGTASPIQRTFVGVDFFSVFQEVYLRTNDPRVSNIVKFSNAIGELKVEAAATIENGK 176

Query: 178 RILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 237
           RILFQFD+AAFSF FLPFK PYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV
Sbjct: 177 RILFQFDRAAFSFNFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 236

Query: 238 LQKKTEPRQTLLSAISTGTQVEQAINEFISSNQSTAEEERELLEGEWQMLWSSQMETDSW 297
           LQKKTEPRQ LLSAI TGT V +AINEFI  NQ+ A++E EL++GEWQM+WSSQMETDSW
Sbjct: 237 LQKKTEPRQRLLSAIWTGTGVLEAINEFIKLNQNVAKDEMELIDGEWQMIWSSQMETDSW 296

Query: 298 IENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMTGTYAKSSTNTYNVTMDDAAIICGGF 357
           IENAG GLMGKQIV KNGQ+KF VDILLG +FSMTGT+ KSS NTY+V MDDAAII G F
Sbjct: 297 IENAGRGLMGKQIVTKNGQLKFVVDILLGVRFSMTGTFVKSSPNTYDVKMDDAAIIGGMF 356

Query: 358 GYPIKMETKINLQLLYSDDKMRISRGYNNILFVHLRTDGSNQK 400
           G P++METKINL+LLYSDDK+RISRGY NI+FVH RTDG+ QK
Sbjct: 357 GLPVEMETKINLELLYSDDKIRISRGYKNIVFVHARTDGTRQK 399




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255574416|ref|XP_002528121.1| structural molecule, putative [Ricinus communis] gi|223532460|gb|EEF34251.1| structural molecule, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|225435622|ref|XP_002283329.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Vitis vinifera] gi|297746405|emb|CBI16461.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449448776|ref|XP_004142141.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Cucumis sativus] gi|449503596|ref|XP_004162081.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356576885|ref|XP_003556560.1| PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic-like [Glycine max] Back     alignment and taxonomy information
>gi|297847472|ref|XP_002891617.1| plastid-lipid associated protein pap [Arabidopsis lyrata subsp. lyrata] gi|297337459|gb|EFH67876.1| plastid-lipid associated protein pap [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|30694898|ref|NP_175522.2| putative plastid-lipid-associated protein 12 [Arabidopsis thaliana] gi|75154826|sp|Q8LAP6.1|PAP12_ARATH RecName: Full=Probable plastid-lipid-associated protein 12, chloroplastic; AltName: Full=Fibrillin-12; Flags: Precursor gi|21593276|gb|AAM65225.1| unknown [Arabidopsis thaliana] gi|26452116|dbj|BAC43147.1| unknown protein [Arabidopsis thaliana] gi|28950975|gb|AAO63411.1| At1g51115 [Arabidopsis thaliana] gi|332194502|gb|AEE32623.1| putative plastid-lipid-associated protein 12 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|255646338|gb|ACU23652.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|388509688|gb|AFK42910.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|357445071|ref|XP_003592813.1| hypothetical protein MTR_1g116320 [Medicago truncatula] gi|355481861|gb|AES63064.1| hypothetical protein MTR_1g116320 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query400
TAIR|locus:2026052409 AT1G51110 [Arabidopsis thalian 0.895 0.875 0.675 4e-129
TAIR|locus:2132006310 AT4G22240 [Arabidopsis thalian 0.402 0.519 0.235 1.9e-09
UNIPROTKB|P80471326 P80471 "Light-induced protein, 0.107 0.131 0.466 1.7e-07
UNIPROTKB|O99019326 O99019 "Light-induced protein, 0.357 0.438 0.259 6e-07
TAIR|locus:2136627318 FIB "fibrillin" [Arabidopsis t 0.1 0.125 0.476 4.1e-06
TAIR|locus:2178133273 AT5G09820 [Arabidopsis thalian 0.462 0.677 0.248 2.6e-05
TAIR|locus:2092150242 AT3G26070 [Arabidopsis thalian 0.247 0.409 0.256 0.00014
TAIR|locus:2026052 AT1G51110 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1267 (451.1 bits), Expect = 4.0e-129, P = 4.0e-129
 Identities = 244/361 (67%), Positives = 298/361 (82%)

Query:    35 NCSQNRTNGLVAQSSLTDQ-QQLAFTGEENQLIDALIGIQGRGRSASARQLNDVECAVKV 93
             NC   R +   + S++TDQ QQ +F   E +LIDALIGIQGRG+SAS +QLNDVE AVKV
Sbjct:    48 NCRTLRIS-CSSSSTVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKV 106

Query:    94 LEGLQGVPDPTGSSLIEGRWQLMFTTRPGTASPIQRTFVGVETFSVFQEISLR-TNDPRV 152
             LEGL+G+ +PT S LIEGRW+LMFTTRPGTASPIQRTF GV+ F+VFQ++ L+ TNDPRV
Sbjct:   107 LEGLEGIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRV 166

Query:   153 SNIVKFSEAIGELKVEAAASIKDGKRILFQFDKAAFSFKFLPFKFPYPVPFRLLGDEAKG 212
             SNIVKFS+ IGELKVEA ASIKDGKR+LF+FD+AAF  KFLPFK PYPVPFRLLGDEAKG
Sbjct:   167 SNIVKFSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKG 226

Query:   213 WLDTTYLSPSGNLRISRGNKGTTFVLQKKTEPRQTLLSAISTGTQVEQAINEFISSNQST 272
             WLDTTYLSPSGNLRISRGNKGTTFVLQK+T PRQ LL+ IS    V +AI+EF++SN ++
Sbjct:   227 WLDTTYLSPSGNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNS 286

Query:   273 AXXXXXXXXXXWQMLWSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLGFKFSMT 332
             A          WQM+WSSQM TDSWIENA NGLMG+QI++K+G++KFEV+I+  F+FSM 
Sbjct:   287 AEDNYELLEGSWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMK 346

Query:   333 GTYAKSSTNTYNVTMDDAAIICGGFGYPIKMETKINLQLLYSDDKMRISRGYNNILFVHL 392
             G + KS ++TY++ MDDAAII G FGYP+ +   I L++LY+D+KMRISRG++NI+FVH+
Sbjct:   347 GKFIKSESSTYDLKMDDAAIIGGAFGYPVDITNNIELKILYTDEKMRISRGFDNIIFVHI 406

Query:   393 R 393
             R
Sbjct:   407 R 407




GO:0005198 "structural molecule activity" evidence=IEA;ISS
GO:0009507 "chloroplast" evidence=ISM;IEA;IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0016020 "membrane" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
TAIR|locus:2132006 AT4G22240 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|P80471 P80471 "Light-induced protein, chloroplastic" [Solanum tuberosum (taxid:4113)] Back     alignment and assigned GO terms
UNIPROTKB|O99019 O99019 "Light-induced protein, chloroplastic" [Solanum demissum (taxid:50514)] Back     alignment and assigned GO terms
TAIR|locus:2136627 FIB "fibrillin" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2178133 AT5G09820 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2092150 AT3G26070 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q8LAP6PAP12_ARATHNo assigned EC number0.62990.97750.9559yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query400
pfam04755196 pfam04755, PAP_fibrillin, PAP_fibrillin 2e-48
>gnl|CDD|218245 pfam04755, PAP_fibrillin, PAP_fibrillin Back     alignment and domain information
 Score =  162 bits (412), Expect = 2e-48
 Identities = 67/195 (34%), Positives = 95/195 (48%), Gaps = 25/195 (12%)

Query: 64  QLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGS-SLIEGRWQLMFTTRPG 122
           +L+DA+ G   RG  AS+    ++E AV  LE L   P PT S  L+ G+W+L++TT   
Sbjct: 6   KLLDAIYGTN-RGLRASSDDRAEIESAVTQLEALNPTPAPTESLDLLNGKWRLLYTTSKE 64

Query: 123 TASPIQRTFVG-VETFSVFQEISLRTNDPRVSNIVKFSE--AIGELKVEAAASIKDGKRI 179
               + R  +  ++   ++Q I    N+  V N V FS   A G   V A   I+  KR+
Sbjct: 65  LLPLLARGRLPLLKVGQIYQTI--DVNNLTVYNSVTFSGPLAEGSFSVRAKFEIRSPKRV 122

Query: 180 LFQFDKAAFSFKFLPFKF----------------PYPVPFRLLGDEAKGWLDTTYLSPSG 223
             +F++       L                      P+PF L GD AKGWL+TTYL    
Sbjct: 123 QIRFERGVLGTPQLLKGSLTPLQDTASNIRGISSQLPLPFPLSGDRAKGWLETTYLDE-- 180

Query: 224 NLRISRGNKGTTFVL 238
           +LRISRG+ G+ FVL
Sbjct: 181 DLRISRGDGGSLFVL 195


This family identifies a conserved region found in a number of plastid lipid-associated proteins (PAPs), and in a number of putative fibrillin proteins. Length = 196

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 400
PF04755198 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 100.0
PF04755198 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 99.97
>PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle Back     alignment and domain information
Probab=100.00  E-value=1.9e-37  Score=287.11  Aligned_cols=176  Identities=40%  Similarity=0.681  Sum_probs=152.3

Q ss_pred             hHHHHHHHHHHhcccCCCCCCChhhHHHHHHHHHHHHhcCCCCCCCCC-CCcceEEEEEEecCCCCCCccccc-ccCcee
Q 015769           59 TGEENQLIDALIGIQGRGRSASARQLNDVECAVKVLEGLQGVPDPTGS-SLIEGRWQLMFTTRPGTASPIQRT-FVGVET  136 (400)
Q Consensus        59 ~~~K~~LL~al~~t~~rG~~as~~~r~~I~~lI~~LEalnpt~~P~~s-~lL~G~W~LvyTt~~~t~sp~~r~-~~Gv~~  136 (400)
                      .++|++||++++++ +||..++++++++|+++|++||++||+++|+++ ++|+|+|+|+|||++++.+.+++. .+++..
T Consensus         1 ~~~K~~Ll~~~~~~-~rG~~~~~~~~~~i~~~v~~LE~~np~~~p~~s~~~L~G~W~Lvytt~~~~~~~l~~~~~~~~~~   79 (198)
T PF04755_consen    1 QDLKQELLQAVAGT-NRGLRASPEDREEIEELVEELEALNPTPDPADSLPLLDGRWELVYTTSPEIRSLLQRGRLPGVRV   79 (198)
T ss_pred             ChHHHHHHHHHhcc-CCCccCCHHHHHHHHHHHHHHHHhCCCCCCcCCchhcCcEEEEEeecCCCccccccccccccccc
Confidence            36899999999999 899999999999999999999999999999998 999999999999999988776653 456777


Q ss_pred             eeeeEEEEEeCCCCcEEEEEEEec---cceeEEEEEEEEEecCCEEEEEEEeeeeeeccc-----------------ccc
Q 015769          137 FSVFQEISLRTNDPRVSNIVKFSE---AIGELKVEAAASIKDGKRILFQFDKAAFSFKFL-----------------PFK  196 (400)
Q Consensus       137 ~~v~Q~i~ld~~~~~v~N~V~f~~---~~g~l~V~a~~~~~~~~rv~v~F~~~~l~~~~~-----------------~f~  196 (400)
                      +++||.|+  .+++++.|+|+|..   ..|.+.+.+.+++.++.|+.++|+++.+.++..                 ...
T Consensus        80 ~~v~Q~id--~~~~~~~N~v~~~~~~~~~~~~~v~a~~~~~~~~rv~v~f~~~~l~~~~~l~~~l~~~~~~~~~v~~~~~  157 (198)
T PF04755_consen   80 GRVFQTID--ADNGRVENVVELSGFPLLEGSVSVRASLEVRSPRRVEVTFERASLKPPSLLKGVLGPLKDALNNVPRGIS  157 (198)
T ss_pred             cceEEEEE--CCCceEEEEEEEeccCceEEEEEEEEEEEEccccEEEEEEEeeEEcccceeeccchhhhhhhhhcccccc
Confidence            89999984  56799999999765   357899999999999999999999998865211                 112


Q ss_pred             cCCCccccccCCCcceEEEEEEecCCCceEEEeCCCCcEEEEe
Q 015769          197 FPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQ  239 (400)
Q Consensus       197 ~p~P~~f~l~g~~~~Gwld~tYLD~~~dlRIsRG~kG~~FVl~  239 (400)
                      ..+|+++++.+..++|||||||||+  ||||+||++|++|||.
T Consensus       158 ~~~~~~~~~~~~~~~g~l~~tYLDe--dlRI~Rg~~G~~fVl~  198 (198)
T PF04755_consen  158 DELPVPLPLPGGSPKGWLDTTYLDE--DLRISRGNKGSLFVLK  198 (198)
T ss_pred             cccccccccCCCCCceEEEEEEECC--CeEEEEcCCCCEEEeC
Confidence            2355666666678999999999999  9999999999999984



The coat may contain receptors for attachment to the thylakoid membrane as well as regulatory proteins that may function in the transfer of lipids to and from the thylakoid membranes.). This entry also represents a number of putative fibrillin proteins.; GO: 0005198 structural molecule activity, 0009507 chloroplast

>PF04755 PAP_fibrillin: PAP_fibrillin; InterPro: IPR006843 This family identifies a conserved domain found in a number of plastid lipid-associated proteins (PAPs) that are thought to form together with other plastoglobulins a coat on the surface of the lipoprotein particle Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query400
4gzv_A142 Hypothetical protein; AN eight-stranded beta barre 84.59
>4gzv_A Hypothetical protein; AN eight-stranded beta barrel, lipocalin family, structural joint center for structural genomics, JCSG; 1.95A {Bacteroides ovatus} Back     alignment and structure
Probab=84.59  E-value=8  Score=33.39  Aligned_cols=71  Identities=15%  Similarity=0.209  Sum_probs=46.6

Q ss_pred             cccccceeEEEE-EecccCchhhhhhhcCCCCCeeEEccCCceEEEEEeccC-eEEEEEEEEEEeCCcEEEEEEee
Q 015769          276 ERELLEGEWQML-WSSQMETDSWIENAGNGLMGKQIVKKNGQMKFEVDILLG-FKFSMTGTYAKSSTNTYNVTMDD  349 (400)
Q Consensus       276 ~~~lL~G~W~L~-yts~~e~~~~l~~~~~gl~~~Q~Id~~~~~~N~v~~~~~-~~~~~~a~~~~~s~~rv~V~F~~  349 (400)
                      .+.-|.|.|+|. |.++......--+.   ...+=||.+++++.|++-.... ......++|+..|+..+.=..++
T Consensus         8 ~~~~L~GvWQ~c~y~~~~~~~~~~l~~---s~~~KIlS~DgtF~N~~m~~~~~aiIt~yGtY~l~SD~~Y~E~i~k   80 (142)
T 4gzv_A            8 KAADLKGIWQLCHYVSESPDVPGALKP---SNTFKVLSDDGRIVNFTIIPGADAIITGYGTYKQLTDDSYKESIEK   80 (142)
T ss_dssp             GSCCCCEEEEEEEEEESSTTSCCEEEE---EEEEEEECTTSEEEEEECCTTSCCEEEEEEEEEEEETTEEEEEEEE
T ss_pred             ccccceEEeEEeeEeecCCCCCceEee---cCceEEECCCCcEEEEEEeCCCCcEEEEeEEEEecCCCeeeeeecc
Confidence            456799999987 55554332211011   1137889899999999865433 33577999999998876544443




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00