Citrus Sinensis ID: 036354


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------
HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQVGKKENSSTGWLSMCH
ccccccEEEEEEEEEccccHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccc
cccEEEEEEEEEEEEcccHHHHHHHHHHHHHHHHHHHHHHccccccHEEHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccc
HEIATFFAVSFeakysampafKFFVIANAIVSIYGFLVLflplddlaigCCFDMLFTMLLTSSISAALTIAQVgkkensstgwlsmch
HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQVgkkensstgwlsmch
HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQVGKKENSSTGWLSMCH
**IATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQV***************
**IATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQVGKKENSSTGWLSMCH
HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQV***************
**IATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQVGKKENSSTGW*SMC*
iiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiii
oooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooo
SSSSSSSSSSSSSSSSSSiiHHHHHHHHHHHHHHHHHHHHooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLAIGCCFDMLFTMLLTSSISAALTIAQVGKKENSSTGWLSMCH
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query88 2.2.26 [Sep-21-2011]
B9N5U6171 CASP-like protein POPTRDR yes no 0.988 0.508 0.629 2e-25
B9N2D0162 CASP-like protein POPTRDR yes no 0.988 0.537 0.629 1e-24
D7M2M8164 CASP-like protein ARALYDR N/A no 0.988 0.530 0.617 4e-24
B9RW00163 CASP-like protein RCOM_11 N/A no 0.988 0.533 0.606 4e-24
Q9ZT81164 CASP-like protein At4g035 yes no 0.988 0.530 0.595 2e-23
D7KCH2164 CASP-like protein ARALYDR N/A no 0.988 0.530 0.550 3e-21
Q9LR57164 CASP-like protein At1g037 no no 0.988 0.530 0.550 3e-21
A7PJ32161 CASP-like protein VIT_12s yes no 0.965 0.527 0.593 8e-20
B9RZ92164 CASP-like protein RCOM_09 N/A no 0.988 0.530 0.528 3e-17
B9N3F4163 CASP-like protein POPTRDR yes no 0.988 0.533 0.483 2e-14
>sp|B9N5U6|CSPL4_POPTR CASP-like protein POPTRDRAFT_270504 OS=Populus trichocarpa GN=POPTRDRAFT_270504 PE=3 SV=2 Back     alignment and function desciption
 Score =  114 bits (284), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 56/89 (62%), Positives = 69/89 (77%), Gaps = 2/89 (2%)

Query: 1   HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDD--LAIGCCFDMLFTM 58
           HE  +FFA+S+EAKYS  PAFK+FVIANAIV++YGFL LF+P +     +    D++FTM
Sbjct: 31  HEKGSFFALSYEAKYSDTPAFKYFVIANAIVTVYGFLALFIPSESPLWRLVLALDLVFTM 90

Query: 59  LLTSSISAALTIAQVGKKENSSTGWLSMC 87
           LL SSISAAL +AQVGKK NSS GWL +C
Sbjct: 91  LLISSISAALAVAQVGKKGNSSAGWLPVC 119





Populus trichocarpa (taxid: 3694)
>sp|B9N2D0|CSPLK_POPTR CASP-like protein POPTRDRAFT_670582 OS=Populus trichocarpa GN=POPTRDRAFT_670582 PE=2 SV=1 Back     alignment and function description
>sp|D7M2M8|CSPLF_ARALL CASP-like protein ARALYDRAFT_327471 OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_327471 PE=3 SV=1 Back     alignment and function description
>sp|B9RW00|CSPL6_RICCO CASP-like protein RCOM_1174750 OS=Ricinus communis GN=RCOM_1174750 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZT81|CSPLK_ARATH CASP-like protein At4g03540 OS=Arabidopsis thaliana GN=At4g03540 PE=2 SV=1 Back     alignment and function description
>sp|D7KCH2|CSPLE_ARALL CASP-like protein ARALYDRAFT_470341 OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_470341 PE=3 SV=1 Back     alignment and function description
>sp|Q9LR57|CSPL1_ARATH CASP-like protein At1g03700 OS=Arabidopsis thaliana GN=At1g03700 PE=2 SV=1 Back     alignment and function description
>sp|A7PJ32|CSPL2_VITVI CASP-like protein VIT_12s0028g03760 OS=Vitis vinifera GN=VIT_12s0028g03760 PE=2 SV=1 Back     alignment and function description
>sp|B9RZ92|CSPL2_RICCO CASP-like protein RCOM_0936380 OS=Ricinus communis GN=RCOM_0936380 PE=3 SV=1 Back     alignment and function description
>sp|B9N3F4|CSPL9_POPTR CASP-like protein POPTRDRAFT_810994 OS=Populus trichocarpa GN=POPTRDRAFT_810994 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query88
224121696159 predicted protein [Populus trichocarpa] 0.988 0.547 0.629 9e-24
341958683171 RecName: Full=CASP-like protein POPTRDRA 0.988 0.508 0.629 1e-23
449438494164 PREDICTED: CASP-like protein RCOM_117475 0.988 0.530 0.606 2e-23
224127057162 predicted protein [Populus trichocarpa] 0.988 0.537 0.629 4e-23
449479160164 PREDICTED: CASP-like protein RCOM_117475 0.988 0.530 0.595 4e-23
297813957164 integral membrane family protein [Arabid 0.988 0.530 0.617 2e-22
255553757163 conserved hypothetical protein [Ricinus 0.988 0.533 0.606 2e-22
356542938163 PREDICTED: CASP-like protein RCOM_117475 0.988 0.533 0.595 4e-22
15236329164 uncharacterized protein [Arabidopsis tha 0.988 0.530 0.595 9e-22
356517606163 PREDICTED: CASP-like protein RCOM_117475 0.988 0.533 0.584 1e-21
>gi|224121696|ref|XP_002330630.1| predicted protein [Populus trichocarpa] gi|222872234|gb|EEF09365.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  114 bits (284), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 56/89 (62%), Positives = 69/89 (77%), Gaps = 2/89 (2%)

Query: 1   HEIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDD--LAIGCCFDMLFTM 58
           HE  +FFA+S+EAKYS  PAFK+FVIANAIV++YGFL LF+P +     +    D++FTM
Sbjct: 31  HEKGSFFALSYEAKYSDTPAFKYFVIANAIVTVYGFLALFIPSESPLWRLVLALDLVFTM 90

Query: 59  LLTSSISAALTIAQVGKKENSSTGWLSMC 87
           LL SSISAAL +AQVGKK NSS GWL +C
Sbjct: 91  LLISSISAALAVAQVGKKGNSSAGWLPVC 119




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|341958683|sp|B9N5U6.2|CSPL4_POPTR RecName: Full=CASP-like protein POPTRDRAFT_270504 Back     alignment and taxonomy information
>gi|449438494|ref|XP_004137023.1| PREDICTED: CASP-like protein RCOM_1174750-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224127057|ref|XP_002329375.1| predicted protein [Populus trichocarpa] gi|288559183|sp|B9N2D0.1|CSPLK_POPTR RecName: Full=CASP-like protein POPTRDRAFT_670582 gi|222870425|gb|EEF07556.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449479160|ref|XP_004155522.1| PREDICTED: CASP-like protein RCOM_1174750-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|297813957|ref|XP_002874862.1| integral membrane family protein [Arabidopsis lyrata subsp. lyrata] gi|341958563|sp|D7M2M8.1|CSPLF_ARALL RecName: Full=CASP-like protein ARALYDRAFT_327471 gi|297320699|gb|EFH51121.1| integral membrane family protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|255553757|ref|XP_002517919.1| conserved hypothetical protein [Ricinus communis] gi|288559178|sp|B9RW00.1|CSPL6_RICCO RecName: Full=CASP-like protein RCOM_1174750 gi|223542901|gb|EEF44437.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356542938|ref|XP_003539921.1| PREDICTED: CASP-like protein RCOM_1174750-like [Glycine max] Back     alignment and taxonomy information
>gi|15236329|ref|NP_192263.1| uncharacterized protein [Arabidopsis thaliana] gi|75216592|sp|Q9ZT81.1|CSPLK_ARATH RecName: Full=CASP-like protein At4g03540 gi|4206208|gb|AAD11596.1| hypothetical protein [Arabidopsis thaliana] gi|4263041|gb|AAD15310.1| hypothetical protein [Arabidopsis thaliana] gi|7270677|emb|CAB77839.1| hypothetical protein [Arabidopsis thaliana] gi|332656935|gb|AEE82335.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|356517606|ref|XP_003527478.1| PREDICTED: CASP-like protein RCOM_1174750-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query88
TAIR|locus:2128776164 AT4G03540 "AT4G03540" [Arabido 0.977 0.524 0.602 1e-22
TAIR|locus:2020773164 AT1G03700 "AT1G03700" [Arabido 0.977 0.524 0.556 4.9e-21
UNIPROTKB|A2Y2B7162 BLE3 "CASP-like protein BLE3" 0.977 0.530 0.363 6.4e-12
UNIPROTKB|Q84UT5162 BLE3 "CASP-like protein BLE3" 0.977 0.530 0.363 6.4e-12
TAIR|locus:2150986187 CASP5 "Casparian strip membran 0.965 0.454 0.35 2.6e-08
TAIR|locus:2117308170 AT4G25040 "AT4G25040" [Arabido 0.977 0.505 0.358 6.8e-08
TAIR|locus:2160747202 CASP4 "AT5G06200" [Arabidopsis 0.954 0.415 0.363 2.2e-07
TAIR|locus:2053514206 CASP1 "AT2G36100" [Arabidopsis 0.943 0.402 0.37 3.2e-07
TAIR|locus:2163386197 AT5G44550 [Arabidopsis thalian 0.886 0.395 0.344 3.4e-07
TAIR|locus:2080742204 CASP2 "AT3G11550" [Arabidopsis 0.954 0.411 0.343 1.6e-06
TAIR|locus:2128776 AT4G03540 "AT4G03540" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 263 (97.6 bits), Expect = 1.0e-22, P = 1.0e-22
 Identities = 53/88 (60%), Positives = 67/88 (76%)

Query:     2 EIATFFAVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLPLDDLA--IGCCFDMLFTML 59
             E A+F A+S EAKY+ M AFK+FVIANA+VS+Y FLVLFLP + L        D++ TML
Sbjct:    33 ERASFLAISLEAKYTDMAAFKYFVIANAVVSVYSFLVLFLPKESLLWKFVVVLDLVMTML 92

Query:    60 LTSSISAALTIAQVGKKENSSTGWLSMC 87
             LTSS+SAAL +AQVGKK N++ GWL +C
Sbjct:    93 LTSSLSAALAVAQVGKKGNANAGWLPIC 120




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2020773 AT1G03700 "AT1G03700" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|A2Y2B7 BLE3 "CASP-like protein BLE3" [Oryza sativa Indica Group (taxid:39946)] Back     alignment and assigned GO terms
UNIPROTKB|Q84UT5 BLE3 "CASP-like protein BLE3" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2150986 CASP5 "Casparian strip membrane domain protein 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2117308 AT4G25040 "AT4G25040" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2160747 CASP4 "AT5G06200" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2053514 CASP1 "AT2G36100" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2163386 AT5G44550 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2080742 CASP2 "AT3G11550" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query88
TIGR01569154 TIGR01569, A_tha_TIGR01569, plant integral membran 1e-13
pfam04535150 pfam04535, DUF588, Domain of unknown function (DUF 5e-13
>gnl|CDD|233471 TIGR01569, A_tha_TIGR01569, plant integral membrane protein TIGR01569 Back     alignment and domain information
 Score = 61.3 bits (149), Expect = 1e-13
 Identities = 35/98 (35%), Positives = 48/98 (48%), Gaps = 11/98 (11%)

Query: 1   HEIATFF--AVSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFLP---------LDDLAIG 49
            E    F   ++F+AK+S +PAF +FV+ANAI   Y  L L +             L   
Sbjct: 22  RETKVVFVQLITFKAKFSDLPAFVYFVVANAIACGYSLLSLVVSIFGLLKRRVFFKLIAL 81

Query: 50  CCFDMLFTMLLTSSISAALTIAQVGKKENSSTGWLSMC 87
              D++   LL+S  SAA  +A VGK  N   GWL +C
Sbjct: 82  FFLDLVMLALLSSGTSAAAAVAYVGKLGNKEAGWLKIC 119


This model describes a region of ~160 residues found exclusively in plant proteins, generally as the near complete length of the protein. At least 24 different members are found in Arabidopsis thaliana. Members have four predicted transmembrane regions, the last of which is preceded by an invariant CXXXXX[FY]C motif. The family is not functionally characterized. Length = 154

>gnl|CDD|218134 pfam04535, DUF588, Domain of unknown function (DUF588) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 88
TIGR01569154 A_tha_TIGR01569 plant integral membrane protein TI 99.97
PF04535149 DUF588: Domain of unknown function (DUF588); Inter 99.94
>TIGR01569 A_tha_TIGR01569 plant integral membrane protein TIGR01569 Back     alignment and domain information
Probab=99.97  E-value=7.3e-31  Score=183.86  Aligned_cols=88  Identities=40%  Similarity=0.677  Sum_probs=82.1

Q ss_pred             CcceeeEE--EEEEEEeccCCchHHHHHHHHHHHHHHHHHHhc------cCC---chhhhhhhHHHHHHHHHHHHHHHHH
Q 036354            1 HEIATFFA--VSFEAKYSAMPAFKFFVIANAIVSIYGFLVLFL------PLD---DLAIGCCFDMLFTMLLTSSISAALT   69 (88)
Q Consensus         1 ~~t~~~~~--i~~~a~ysd~~af~~fV~anai~~~Ysll~l~~------r~~---~~~~l~~~D~v~~~ll~sa~sAA~~   69 (88)
                      |||.++++  +++++||+|+|+|+|||++|+|+++|+++|+++      |++   .+|++|++||+|+|+++||++||++
T Consensus        22 ~qt~~~~~~~~~~~a~f~d~~af~y~v~anai~~~Ysll~l~~~~~~~~~~~~~~~~~~~f~~D~v~~~Ll~sa~sAA~a  101 (154)
T TIGR01569        22 RETKVVFVQLITFKAKFSDLPAFVYFVVANAIACGYSLLSLVVSIFGLLKRRVFFKLIALFFLDLVMLALLSSGTSAAAA  101 (154)
T ss_pred             cceeeeecccceeeeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence            68999988  999999999999999999999999999999987      222   3579999999999999999999999


Q ss_pred             HHHHHhcCcccCCccCcCC
Q 036354           70 IAQVGKKENSSTGWLSMCH   88 (88)
Q Consensus        70 i~yl~~~Gn~~~~W~~iC~   88 (88)
                      ++|++|+||+|++|+|+|+
T Consensus       102 v~~l~~~G~~~~~W~~iC~  120 (154)
T TIGR01569       102 VAYVGKLGNKEAGWLKICG  120 (154)
T ss_pred             HHHHHHccccccchhhHHH
Confidence            9999999999999999996



This model describes a region of ~160 residues found exclusively in plant proteins, generally as the near complete length of the protein. At least 24 different members are found in Arabidopsis thaliana. Members have four predicted transmembrane regions, the last of which is preceded by an invariant CXXXXX[FY]C motif. The family is not functionally characterized.

>PF04535 DUF588: Domain of unknown function (DUF588); InterPro: IPR006702 This family of plant proteins contains a domain that may have a catalytic activity Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00