Citrus Sinensis ID: 032435


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-
MASITACLPSPLLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIVAKASGESSESSTSLTVFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
ccccccccccccccccccccccccccccccccccHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHc
ccEEEEEccccccccccccccccccccccccccccccccEEEEEEEcccccccccccHHHEEHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHEHHHHHHHEEEEEEEcccHHHHHHHHccc
masitaclpspllvqgrqklslfitlpklplsplnekQNCLAIVAKasgessesstsLTVFKSVQNvwdnssedrlglIGLGFAGIVALWASVNLitaidklpiipNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
masitaclpspllvqGRQKLSLFITLPKLPLSPLNEKQNCLAIVAKAsgessesstslTVFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
MASITACLPSPLLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIvakasgessesstsltvfksVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
*****ACLPSPLLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIVA*************TVFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEK******
**************************************************************SVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKI****
MASITACLPSPLLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIVAK************TVFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
*ASITACLPSPLLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIVAKAS******STSLTVFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MASITACLPSPLLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIVAKASGESSESSTSLTVFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCEWEKKIKESF
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query141 2.2.26 [Sep-21-2011]
Q8LCA1174 Thylakoid membrane phosph no no 0.404 0.327 0.431 3e-09
>sp|Q8LCA1|TMP14_ARATH Thylakoid membrane phosphoprotein 14 kDa, chloroplastic OS=Arabidopsis thaliana GN=TMP14 PE=1 SV=2 Back     alignment and function desciption
 Score = 60.8 bits (146), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 40/58 (68%), Gaps = 1/58 (1%)

Query: 60  VFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFS 117
           + K+ Q  W+   +D+  +  L FAG+VALW S  +I+AID+LP++P  LEL+GI ++
Sbjct: 86  IVKTAQEAWEKV-DDKYAIGSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYT 142





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query141
224070909146 predicted protein [Populus trichocarpa] 0.815 0.787 0.630 4e-33
18403896156 uncharacterized protein [Arabidopsis tha 0.829 0.75 0.614 1e-30
21554231156 unknown [Arabidopsis thaliana] 0.829 0.75 0.606 1e-29
224054368146 predicted protein [Populus trichocarpa] 0.815 0.787 0.655 1e-29
118488080146 unknown [Populus trichocarpa] 0.822 0.794 0.627 9e-29
79319729155 uncharacterized protein [Arabidopsis tha 0.822 0.748 0.606 9e-29
388497330178 unknown [Lotus japonicus] 0.808 0.640 0.666 7e-27
297852972158 hypothetical protein ARALYDRAFT_892226 [ 0.822 0.734 0.576 1e-25
255549936146 Thylakoid membrane phosphoprotein 14 kDa 0.815 0.787 0.621 2e-24
388496712146 unknown [Medicago truncatula] 0.808 0.780 0.644 4e-23
>gi|224070909|ref|XP_002303292.1| predicted protein [Populus trichocarpa] gi|222840724|gb|EEE78271.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  145 bits (367), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 75/119 (63%), Positives = 96/119 (80%), Gaps = 4/119 (3%)

Query: 1   MASITACLPSP-LLVQGRQKLSLFITLPKLPLSPLNEKQNCLAIVAKASGESSESSTSLT 59
           MASITA LPSP LLV G++ LS   TL  LPLSP+ ++QNC+++V +A+GESSESS  L 
Sbjct: 1   MASITASLPSPPLLVHGKRTLS--STLQTLPLSPIKDRQNCVSVVVRATGESSESSAPLG 58

Query: 60  VFKSVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFST 118
           + KSV+N+WD+S EDRL L+GLGFA +VA+W S  LI AIDKLP++P+ LELIGILFS+
Sbjct: 59  IVKSVKNIWDDS-EDRLALVGLGFAALVAIWTSAKLILAIDKLPVVPSVLELIGILFSS 116




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|18403896|ref|NP_564603.1| uncharacterized protein [Arabidopsis thaliana] gi|6850346|gb|AAF29409.1|AC022354_8 unknown protein [Arabidopsis thaliana] gi|15294156|gb|AAK95255.1|AF410269_1 At1g52220/F9I5_10 [Arabidopsis thaliana] gi|20453285|gb|AAM19881.1| At1g52220/F9I5_10 [Arabidopsis thaliana] gi|332194648|gb|AEE32769.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21554231|gb|AAM63306.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224054368|ref|XP_002298225.1| predicted protein [Populus trichocarpa] gi|222845483|gb|EEE83030.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|118488080|gb|ABK95860.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|79319729|ref|NP_001031173.1| uncharacterized protein [Arabidopsis thaliana] gi|332194649|gb|AEE32770.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388497330|gb|AFK36731.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|297852972|ref|XP_002894367.1| hypothetical protein ARALYDRAFT_892226 [Arabidopsis lyrata subsp. lyrata] gi|297340209|gb|EFH70626.1| hypothetical protein ARALYDRAFT_892226 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|255549936|ref|XP_002516019.1| Thylakoid membrane phosphoprotein 14 kDa, chloroplast precursor, putative [Ricinus communis] gi|223544924|gb|EEF46439.1| Thylakoid membrane phosphoprotein 14 kDa, chloroplast precursor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|388496712|gb|AFK36422.1| unknown [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query141
TAIR|locus:2037435156 AT1G52220 "AT1G52220" [Arabido 0.829 0.75 0.519 4.2e-24
TAIR|locus:2044335174 PSI-P "photosystem I P subunit 0.347 0.281 0.48 2.9e-09
TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 276 (102.2 bits), Expect = 4.2e-24, P = 4.2e-24
 Identities = 66/127 (51%), Positives = 79/127 (62%)

Query:     1 MASITACLPSPLLVQGR-------QKLSLFITLPKLPLSPLNEKQN--CLAIXXXXXXXX 51
             MASI+A LPSPLL+  R       QKL   +T     LSPL+  +N   +++        
Sbjct:     1 MASISATLPSPLLLTQRKSNLTSIQKLPFSLTRGTNDLSPLSLTRNPSSISLMVKASGES 60

Query:    52 XXXXXXXXXXXXVQNVWDNSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALEL 111
                         +QNVWD S EDRLGLIGLGFAGIVALWAS+NLITAIDKLP+I +  EL
Sbjct:    61 SDSSTDLDVVSTIQNVWDKS-EDRLGLIGLGFAGIVALWASLNLITAIDKLPVISSGFEL 119

Query:   112 IGILFST 118
             +GILFST
Sbjct:   120 VGILFST 126




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0006364 "rRNA processing" evidence=RCA
GO:0009657 "plastid organization" evidence=RCA
GO:0010103 "stomatal complex morphogenesis" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0019684 "photosynthesis, light reaction" evidence=RCA
GO:0035304 "regulation of protein dephosphorylation" evidence=RCA
GO:0043085 "positive regulation of catalytic activity" evidence=RCA
TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query141
PLN02777167 PLN02777, PLN02777, photosystem I P subunit (PSI-P 1e-14
pfam1415990 pfam14159, DUF4308, Domain of unknown function (DU 4e-12
>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P) Back     alignment and domain information
 Score = 66.6 bits (162), Expect = 1e-14
 Identities = 40/108 (37%), Positives = 60/108 (55%), Gaps = 15/108 (13%)

Query: 24  ITLPKLPLSPLNEKQN-------CLAI----VAKASGES---SESSTSLTVFKSVQNVWD 69
           ++LP LP  P+            C  I    V  A+GE+    E++    + K+VQ  WD
Sbjct: 29  VSLPTLPPPPVQSHNRPAKATAYCRKIARNVVTMATGEAPAEVETTELPEIVKTVQEAWD 88

Query: 70  NSSEDRLGLIGLGFAGIVALWASVNLITAIDKLPIIPNALELIGILFS 117
              ED+  +  L FAG+VALW S  +I+AID+LP++P  LEL+GI ++
Sbjct: 89  KV-EDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYT 135


Length = 167

>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 141
PLN02777167 photosystem I P subunit (PSI-P) 100.0
PF1415990 CAAD: CAAD domains of cyanobacterial aminoacyl-tRN 99.96
PF1415990 CAAD: CAAD domains of cyanobacterial aminoacyl-tRN 88.47
>PLN02777 photosystem I P subunit (PSI-P) Back     alignment and domain information
Probab=100.00  E-value=4.2e-43  Score=277.32  Aligned_cols=119  Identities=34%  Similarity=0.647  Sum_probs=110.6

Q ss_pred             cccccCCCCCCCCch-----------hhhhhHhHHHHhcCCCCCCcc--h-hhHHHHHHHHhhhhccchhHHHHHHHHHH
Q 032435           21 SLFITLPKLPLSPLN-----------EKQNCLAIVAKASGESSESST--S-LTVFKSVQNVWDNSSEDRLGLIGLGFAGI   86 (141)
Q Consensus        21 ~~~~~~~~~p~~~~~-----------~~~~~~~v~~~as~e~~~~~~--~-~e~~~~lq~~Wd~~~e~k~~~~~l~~aai   86 (141)
                      +++.++|.||.|++.           ||+..++|++||++|++++..  + +|++|++||+||+ +|||+++++++++++
T Consensus        26 ~~~~~lp~lppp~~~~~~~~~~~~~~c~~~~r~vv~~a~ge~s~~~~~~~~~ei~k~~~e~Wd~-~EdK~av~~l~~aai  104 (167)
T PLN02777         26 PQCVSLPTLPPPPVQSHNRPAKATAYCRKIARNVVTMATGEAPAEVETTELPEIVKTVQEAWDK-VEDKYAVSSLAFAGV  104 (167)
T ss_pred             CccccCCCCCCCCcccCCCcchhHHHHHHhHHHHHHHhccCCCcccccccHHHHHHHHHHHHhh-hcchhHHHHHHHHHH
Confidence            478889999877763           889999999999999887532  2 5999999999999 999999999999999


Q ss_pred             HHHHHHHHHHHHhhccCCchhhHHHHHHHHHhhhhhhhhhcccchh-HHHHhhhc
Q 032435           87 VALWASVNLITAIDKLPIIPNALELIGILFSTVSVIEIIIWHGCCE-WEKKIKES  140 (141)
Q Consensus        87 val~~~~~vl~AId~iPLlp~llELVGigYt~WFvyRyLl~~~~R~-l~~kik~~  140 (141)
                      |++|++.+||+|||+|||+|++||||||||++||+||||+|++||| |++||++.
T Consensus       105 Val~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~~ki~~l  159 (167)
T PLN02777        105 VALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALIEKIKDT  159 (167)
T ss_pred             HHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHHHHHHHH
Confidence            9999999999999999999999999999999999999999999999 99999975



>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase Back     alignment and domain information
>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00