Citrus Sinensis ID: 028106


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210---
MGSGSTHDHVAYAPLPTENVIVLPVYYQPDLRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDCYPE
ccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHEEEEEEcccEEEEEEEEEEEEEEccccccEEEEEEEEEEEEEccccEEEEEcccEEEEEEccEEEEEEEEcccEEEcccEEEEEEEEEEEEEEEcccHHHHHHHHHccEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEccccEEEccccccc
cccccccccccccccccccEEEcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccEEEEEEEEEEEEEEccccccEEEEEEEEEEEEEccccEEEEEccEEEEEEEcccEEEEEEcccccccccccEEEEEEEEEEcHHHcccHHHHHHHHHcccEcEEEEEEEcEEEEEEEEEEEEEEEEEEEEEEEccccEEEEcccccc
mgsgsthdhvayaplptenvivlpvyyqpdlrrwrrrrnlsrCLCTAAAIASLLAVLVFIfypsdpylQLARIHLNhirvnsspqptldlSFSLVVKvhnrdffslnydsldvsigyrgrelgsvrshggrvrargssyvnaslklnglEVIHDVIYLIEDLIkgvipfdtvTMVKGELGVLFFEIPLKAKVSCEVYvntsnqtivrqdcype
mgsgsthdhvayaplptenvivLPVYYQPDLRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRelgsvrshggrvrargssyvnaslklnGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYvntsnqtivrqdcype
MGSGSTHDHVAYAPLPTENVIVLPVYYQPDlrrwrrrrnlsrclctaaaiasllavlVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDCYPE
*********VAYAPLPTENVIVLPVYYQPDLRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQ*****
**************************************NLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDC***
********HVAYAPLPTENVIVLPVYYQPDLRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDCYPE
*******DHVAYAPLPTENVIVLPVYYQPDLRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDC***
oooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGSGSTHDHVAYAPLPTENVIVLPVYYQPDLRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDCYPE
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query213
255546523 336 conserved hypothetical protein [Ricinus 0.812 0.514 0.706 3e-66
30682408215 late embryogenesis abundant hydroxyproli 0.962 0.953 0.570 6e-59
297790732224 hypothetical protein ARALYDRAFT_497058 [ 0.962 0.915 0.584 1e-58
110743779213 hypothetical protein [Arabidopsis thalia 0.920 0.920 0.583 2e-58
224119776210 predicted protein [Populus trichocarpa] 0.948 0.961 0.590 1e-57
356531257257 PREDICTED: uncharacterized protein LOC10 0.910 0.754 0.574 5e-57
297738839218 unnamed protein product [Vitis vinifera] 0.957 0.935 0.553 1e-56
225445268212 PREDICTED: uncharacterized protein LOC10 0.957 0.962 0.553 1e-56
449461975215 PREDICTED: uncharacterized protein LOC10 0.971 0.962 0.548 4e-53
356522728194 PREDICTED: uncharacterized protein LOC10 0.807 0.886 0.594 4e-51
>gi|255546523|ref|XP_002514321.1| conserved hypothetical protein [Ricinus communis] gi|223546777|gb|EEF48275.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  257 bits (656), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 123/174 (70%), Positives = 145/174 (83%), Gaps = 1/174 (0%)

Query: 40  LSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVH 99
           L RCL +A  I   L+  V+  YPSDP LQLARI LNHI VNSSP+ TLDLSFSL+++V 
Sbjct: 63  LRRCLLSAITIL-FLSAAVYFLYPSDPSLQLARIQLNHIHVNSSPKFTLDLSFSLLIRVK 121

Query: 100 NRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLI 159
           NRDFFSL+YD+LDVS+GYRG+ELG V S GG VR RGSSY++A+L LNGLE++HDV YLI
Sbjct: 122 NRDFFSLDYDTLDVSVGYRGKELGVVSSKGGNVRPRGSSYISAALDLNGLEIVHDVFYLI 181

Query: 160 EDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDCYPE 213
           EDL +GVIPFDT TMV GELG  FF+IP+KAKVSCEVYVNT NQTIVR+DC+P+
Sbjct: 182 EDLARGVIPFDTDTMVNGELGFFFFKIPIKAKVSCEVYVNTVNQTIVREDCHPQ 235




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|30682408|ref|NP_193063.2| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana] gi|62867635|gb|AAY17421.1| At4g13270 [Arabidopsis thaliana] gi|66841356|gb|AAY57315.1| At4g13270 [Arabidopsis thaliana] gi|332657857|gb|AEE83257.1| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297790732|ref|XP_002863251.1| hypothetical protein ARALYDRAFT_497058 [Arabidopsis lyrata subsp. lyrata] gi|297309085|gb|EFH39510.1| hypothetical protein ARALYDRAFT_497058 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|110743779|dbj|BAE99725.1| hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224119776|ref|XP_002331158.1| predicted protein [Populus trichocarpa] gi|222873241|gb|EEF10372.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356531257|ref|XP_003534194.1| PREDICTED: uncharacterized protein LOC100793858 [Glycine max] Back     alignment and taxonomy information
>gi|297738839|emb|CBI28084.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225445268|ref|XP_002284574.1| PREDICTED: uncharacterized protein LOC100254347 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449461975|ref|XP_004148717.1| PREDICTED: uncharacterized protein LOC101219269 [Cucumis sativus] gi|449501291|ref|XP_004161330.1| PREDICTED: uncharacterized protein LOC101225993 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356522728|ref|XP_003529998.1| PREDICTED: uncharacterized protein LOC100775891, partial [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query213
TAIR|locus:2142040215 AT4G13270 "AT4G13270" [Arabido 0.976 0.967 0.523 2.7e-52
TAIR|locus:2018154227 AT1G52330 "AT1G52330" [Arabido 0.629 0.590 0.477 6e-32
TAIR|locus:2080320235 AT3G54200 "AT3G54200" [Arabido 0.704 0.638 0.245 1.9e-12
TAIR|locus:2100177186 AT3G44380 "AT3G44380" [Arabido 0.676 0.774 0.244 8.7e-08
TAIR|locus:504955821189 AT3G05975 "AT3G05975" [Arabido 0.708 0.798 0.220 1.9e-05
TAIR|locus:2142040 AT4G13270 "AT4G13270" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 113/216 (52%), Positives = 145/216 (67%)

Query:     1 MGSGSTHDH-VAYAPLPT----ENVIVLPVYYQPDXXXXXXXXXXXXXXXXXXXXXXXXX 55
             M S    D+ + Y PLP+    ++VI+L  Y +                           
Sbjct:     1 MASSKHEDYGIPYTPLPSSQPSQSVILLTPYRR---HRRPSLLRNLRCSLLFTAVILLLS 57

Query:    56 XXVFIFYPSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSI 115
               V++ YPSDP + ++RI+LNHI V  S +  LDLSFSL +KV NRDFFSL+YDSL VSI
Sbjct:    58 AAVYLLYPSDPDITVSRINLNHISVVDSHKIALDLSFSLTIKVRNRDFFSLDYDSLVVSI 117

Query:   116 GYRGRELGSVRSHGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMV 175
             GYRGRELG V+S GG ++AR SSY++A+L+L+GLEV+HDVIYLI DL KGVIPFDT+  V
Sbjct:   118 GYRGRELGLVKSKGGHLKARDSSYIDATLELDGLEVVHDVIYLIGDLAKGVIPFDTIAQV 177

Query:   176 KGELGVLFFEIPLKAKVSCEVYVNTSNQTIVRQDCY 211
             +G+LGVL F IP++ KVSCEVYVN +NQ I  QDC+
Sbjct:   178 QGDLGVLLFNIPIQGKVSCEVYVNVNNQKISHQDCH 213




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0000902 "cell morphogenesis" evidence=RCA
GO:0016049 "cell growth" evidence=RCA
GO:0048193 "Golgi vesicle transport" evidence=RCA
TAIR|locus:2018154 AT1G52330 "AT1G52330" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2080320 AT3G54200 "AT3G54200" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2100177 AT3G44380 "AT3G44380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955821 AT3G05975 "AT3G05975" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
AT4G13270
unknown protein; unknown protein; FUNCTIONS IN- molecular_function unknown; INVOLVED IN- biological_process unknown; LOCATED IN- chloroplast; EXPRESSED IN- 23 plant structures; EXPRESSED DURING- 13 growth stages; BEST Arabidopsis thaliana protein match is- unknown protein (TAIR-AT1G52330.1); Has 109 Blast hits to 109 proteins in 10 species- Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes - 0 (source- NCBI BLink). (215 aa)
(Arabidopsis thaliana)
Predicted Functional Partners:
AT4G01960
unknown protein; unknown protein; FUNCTIONS IN- molecular_function unknown; INVOLVED IN- biolog [...] (236 aa)
       0.414

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query213
PLN03160219 PLN03160, PLN03160, uncharacterized protein; Provi 2e-15
pfam0316898 pfam03168, LEA_2, Late embryogenesis abundant prot 2e-07
>gnl|CDD|215609 PLN03160, PLN03160, uncharacterized protein; Provisional Back     alignment and domain information
 Score = 71.7 bits (176), Expect = 2e-15
 Identities = 46/191 (24%), Positives = 95/191 (49%), Gaps = 8/191 (4%)

Query: 29  PDLRRWRRRRNLSRCLCTAAAI---ASLLAVLVF-IFYPSDPYLQLARIHLNHIRVNSSP 84
             L++ RRR  +  C C  A +   A+ + VLVF +F   DP +++  + +  + + ++ 
Sbjct: 26  NHLKKTRRRNCIKCCGCITATLLILATTILVLVFTVFRVKDPVIKMNGVTVTKLELINNT 85

Query: 85  QPTLDLSFSLV--VKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARGSSYVNA 142
                 + +L+  V V N +  S  Y +   +I Y G  +G  R+  G+ +AR +  +N 
Sbjct: 86  TLRPGTNITLIADVSVKNPNVASFKYSNTTTTIYYGGTVVGEARTPPGKAKARRTMRMNV 145

Query: 143 SLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVL-FFEIPLKAKVSCEVYVNTS 201
           ++ +   + I  V  L+ D+  G++  ++ T + G++ +L   +  +  K++C + VN +
Sbjct: 146 TVDII-PDKILSVPGLLTDISSGLLNMNSYTRIGGKVKILKIIKKHVVVKMNCTMTVNIT 204

Query: 202 NQTIVRQDCYP 212
           +Q I  Q C  
Sbjct: 205 SQAIQGQKCKR 215


Length = 219

>gnl|CDD|217400 pfam03168, LEA_2, Late embryogenesis abundant protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 213
PLN03160219 uncharacterized protein; Provisional 100.0
PF03168101 LEA_2: Late embryogenesis abundant protein; InterP 99.53
PF07092238 DUF1356: Protein of unknown function (DUF1356); In 99.51
smart00769100 WHy Water Stress and Hypersensitive response. 99.04
COG5608161 LEA14-like dessication related protein [Defense me 98.35
PF12751387 Vac7: Vacuolar segregation subunit 7; InterPro: IP 97.58
PLN03160219 uncharacterized protein; Provisional 90.46
PF11837106 DUF3357: Domain of unknown function (DUF3357); Int 90.18
PRK05529255 cell division protein FtsQ; Provisional 81.17
PF14155112 DUF4307: Domain of unknown function (DUF4307) 81.05
PF11906149 DUF3426: Protein of unknown function (DUF3426); In 80.39
>PLN03160 uncharacterized protein; Provisional Back     alignment and domain information
Probab=100.00  E-value=2e-40  Score=276.51  Aligned_cols=159  Identities=21%  Similarity=0.421  Sum_probs=146.2

Q ss_pred             HhheeeEEecCCCeEEEEEEEEcceeecCC--CCceeeEEEEEEEEEEcCCceEEEEcceEEEEEECCEEEeeeEeCCCc
Q 028106           54 LAVLVFIFYPSDPYLQLARIHLNHIRVNSS--PQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGR  131 (213)
Q Consensus        54 l~~~~~v~~P~~P~~~V~~~~l~~f~~~~~--p~~~l~~~l~~~v~v~NPN~~~i~y~~~~~~v~Y~g~~lg~~~~p~f~  131 (213)
                      ++++|++||||+|+|+|+++++++|+++..  +...+|++++++++++|||+++|+|+++++.++|+|+.+|++.+|+|+
T Consensus        55 ~~l~~~vfrPk~P~~~v~~v~l~~~~~~~~~~~~~~~n~tl~~~v~v~NPN~~~~~Y~~~~~~v~Y~g~~vG~a~~p~g~  134 (219)
T PLN03160         55 LVLVFTVFRVKDPVIKMNGVTVTKLELINNTTLRPGTNITLIADVSVKNPNVASFKYSNTTTTIYYGGTVVGEARTPPGK  134 (219)
T ss_pred             HheeeEEEEccCCeEEEEEEEEeeeeeccCCCCceeEEEEEEEEEEEECCCceeEEEcCeEEEEEECCEEEEEEEcCCcc
Confidence            346899999999999999999999998652  246789999999999999999999999999999999999999999999


Q ss_pred             cCCCCeEEEEEEEEEcceeeccchHHHHhhhhcCeEeEEEEEEEEEEEEEE-EEeeceEEEEEEEEEEeCCcceeecCcC
Q 028106          132 VRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVL-FFEIPLKAKVSCEVYVNTSNQTIVRQDC  210 (213)
Q Consensus       132 ~~~r~tt~v~~~v~~~~~~l~~~~~~l~~D~~~G~v~l~~~~~v~grv~v~-~~~~~~~~~v~C~v~v~~~~~~i~~~~C  210 (213)
                      |++++++.+++++.+.+.++.++ .+|.+|+++|.++|+++++++||++++ +++++++++++|++.|+..++++++++|
T Consensus       135 ~~ar~T~~l~~tv~~~~~~~~~~-~~L~~D~~~G~v~l~~~~~v~gkVkv~~i~k~~v~~~v~C~v~V~~~~~~i~~~~C  213 (219)
T PLN03160        135 AKARRTMRMNVTVDIIPDKILSV-PGLLTDISSGLLNMNSYTRIGGKVKILKIIKKHVVVKMNCTMTVNITSQAIQGQKC  213 (219)
T ss_pred             cCCCCeEEEEEEEEEEeceeccc-hhHHHHhhCCeEEEEEEEEEEEEEEEEEEEEEEEEEEEEeEEEEECCCCEEeccEe
Confidence            99999999999998888776654 579999999999999999999999999 6688999999999999999999999999


Q ss_pred             ccC
Q 028106          211 YPE  213 (213)
Q Consensus       211 ~~~  213 (213)
                      +.+
T Consensus       214 ~~~  216 (219)
T PLN03160        214 KRH  216 (219)
T ss_pred             ccc
Confidence            875



>PF03168 LEA_2: Late embryogenesis abundant protein; InterPro: IPR004864 Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress [, ] Back     alignment and domain information
>PF07092 DUF1356: Protein of unknown function (DUF1356); InterPro: IPR009790 This family consists of several hypothetical mammalian proteins of around 250 residues in length Back     alignment and domain information
>smart00769 WHy Water Stress and Hypersensitive response Back     alignment and domain information
>COG5608 LEA14-like dessication related protein [Defense mechanisms] Back     alignment and domain information
>PF12751 Vac7: Vacuolar segregation subunit 7; InterPro: IPR024260 Vac7 is localised at the vacuole membrane, a location which is consistent with its involvement in vacuole morphology and inheritance [] Back     alignment and domain information
>PLN03160 uncharacterized protein; Provisional Back     alignment and domain information
>PF11837 DUF3357: Domain of unknown function (DUF3357); InterPro: IPR021792 This entry represents the N-terminal domain of beta-fructofuranosidase, whcih is involved in the hydrolysis of terminal non-reducing beta-D-fructofuranoside residues in beta-D-fructofuranosides Back     alignment and domain information
>PRK05529 cell division protein FtsQ; Provisional Back     alignment and domain information
>PF14155 DUF4307: Domain of unknown function (DUF4307) Back     alignment and domain information
>PF11906 DUF3426: Protein of unknown function (DUF3426); InterPro: IPR021834 This family of proteins are functionally uncharacterised Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query213
1yyc_A174 LEA protein, putative late embryogenesis abundant 98.83
1xo8_A151 AT1G01470; structural genomics, protein structure 98.75
3but_A136 Uncharacterized protein AF_0446; lipid binding pro 98.42
>1yyc_A LEA protein, putative late embryogenesis abundant protein; structural genomics, protein structure initiative, CESG; NMR {Arabidopsis thaliana} Back     alignment and structure
Probab=98.83  E-value=1.7e-08  Score=80.36  Aligned_cols=101  Identities=13%  Similarity=0.151  Sum_probs=81.4

Q ss_pred             CCCeEEEEEEEEcceeecCCCCceeeEEEEEEEEEEcCCceEEEEcceEEEEEECCEEEeeeEeCC-CccCCCCeEEEEE
Q 028106           64 SDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHG-GRVRARGSSYVNA  142 (213)
Q Consensus        64 ~~P~~~V~~~~l~~f~~~~~p~~~l~~~l~~~v~v~NPN~~~i~y~~~~~~v~Y~g~~lg~~~~p~-f~~~~r~tt~v~~  142 (213)
                      +.|++++.++++.+++.       ..++|.+.++++|||-+++.+.+++.++.-+|..++++.++. +.+++++++.+.+
T Consensus        43 ~~PeV~v~~v~~~~~~l-------~~~~~~l~LrV~NPN~~pLpi~gi~Y~L~vnG~~lasG~s~~~~tIpa~g~~~v~V  115 (174)
T 1yyc_A           43 PTPEATVDDVDFKGVTR-------DGVDYHAKVSVKNPYSQSIPICQISYILKSATRTIASGTIPDPGSLVGSGTTVLDV  115 (174)
T ss_dssp             CCCEEEEEEEEEEEECS-------SSEEEEEEEEEEECSSSCCBCCSEEEEEEESSSCEEEEEESCCCBCCSSEEEEEEE
T ss_pred             CCCEEEEEEeEEecccc-------ceEEEEEEEEEECCCCCCccccceEEEEEECCEEEEEEecCCCceECCCCcEEEEE
Confidence            78999999999998865       346789999999999999999999999999999999999875 7999999999999


Q ss_pred             EEEEcceeeccchHHHHhhhhc-CeEeEEEEEEE
Q 028106          143 SLKLNGLEVIHDVIYLIEDLIK-GVIPFDTVTMV  175 (213)
Q Consensus       143 ~v~~~~~~l~~~~~~l~~D~~~-G~v~l~~~~~v  175 (213)
                      ++++....+    .++++|+.. +.++.++++++
T Consensus       116 pv~v~~~~l----~~~~~~l~~~~~i~Y~L~g~L  145 (174)
T 1yyc_A          116 PVKVAYSIA----VSLMKDMCTDWDIDYQLDIGL  145 (174)
T ss_dssp             EEEESHHHH----HHTCCCCCSSEEECEEEEEEE
T ss_pred             EEEEEHHHH----HHHHHhcCCCCccceEEEEEE
Confidence            988775332    234455543 45665554443



>1xo8_A AT1G01470; structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG, unknown function; NMR {Arabidopsis thaliana} SCOP: b.1.25.1 Back     alignment and structure
>3but_A Uncharacterized protein AF_0446; lipid binding protein, beta barrel, protein structure initia PSI-2; 1.91A {Archaeoglobus fulgidus dsm 4304} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 213
d1xo8a_151 b.1.25.1 (A:) Putative dessication related protein 2e-04
>d1xo8a_ b.1.25.1 (A:) Putative dessication related protein LEA14 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 151 Back     information, alignment and structure

class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: LEA14-like
family: LEA14-like
domain: Putative dessication related protein LEA14
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 37.9 bits (88), Expect = 2e-04
 Identities = 21/126 (16%), Positives = 47/126 (37%), Gaps = 14/126 (11%)

Query: 66  PYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSV 125
           P   +  + L  +  +S         +   V V N    S+    +  +    GRE+G  
Sbjct: 22  PEGSVTDVDLKDVNRDSV-------EYLAKVSVTNPYSHSIPICEISFTFHSAGREIGKG 74

Query: 126 RS-HGGRVRARGSSYVNASLKLNGLEVIHDVIYLIEDLIKG-VIPFDTVTMVKGELGVLF 183
           +    G ++A+  + ++  + +        +  L  D+     I ++    +  +L V+ 
Sbjct: 75  KIPDPGSLKAKDMTALDIPVVVP----YSILFNLARDVGVDWDIDYELQIGLTIDLPVVG 130

Query: 184 -FEIPL 188
            F IP+
Sbjct: 131 EFTIPI 136


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query213
d1xo8a_151 Putative dessication related protein LEA14 {Thale 98.74
>d1xo8a_ b.1.25.1 (A:) Putative dessication related protein LEA14 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: LEA14-like
family: LEA14-like
domain: Putative dessication related protein LEA14
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=98.74  E-value=5.3e-09  Score=79.70  Aligned_cols=122  Identities=15%  Similarity=0.173  Sum_probs=88.3

Q ss_pred             cCCCeEEEEEEEEcceeecCCCCceeeEEEEEEEEEEcCCceEEEEcceEEEEEECCEEEeeeEeC-CCccCCCCeEEEE
Q 028106           63 PSDPYLQLARIHLNHIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSH-GGRVRARGSSYVN  141 (213)
Q Consensus        63 P~~P~~~V~~~~l~~f~~~~~p~~~l~~~l~~~v~v~NPN~~~i~y~~~~~~v~Y~g~~lg~~~~p-~f~~~~r~tt~v~  141 (213)
                      -+.|++++.++++.+++.       ...++.+.++++|||-+++...+.+.+++.+|..++++..+ ++.+++++++.+.
T Consensus        19 ~~kPev~l~~v~i~~v~~-------~~~~l~~~l~V~NPN~~~l~i~~l~y~l~~~g~~ia~G~~~~~~~ipa~~~~~v~   91 (151)
T d1xo8a_          19 IPKPEGSVTDVDLKDVNR-------DSVEYLAKVSVTNPYSHSIPICEISFTFHSAGREIGKGKIPDPGSLKAKDMTALD   91 (151)
T ss_dssp             CCSCCCBCSEEEECCCTT-------TEECEEEEEEEECSSSSCCCCEEEEEEEESSSSCEEEEEEEECCCCSSSSEEEEE
T ss_pred             CCCCeEEEEEEEeeeccc-------ceEEEEEEEEEECCCCCceeeeeEEEEEEECCEEEEeEecCCCcEEcCCCcEEEE
Confidence            467999999999998765       34568999999999999999999999999999999999875 5799999999999


Q ss_pred             EEEEEcceeeccchHHHHhhhhc-CeEeEEEEEEEEEEEEEE--EE-eeceEEEEEEEEEEe
Q 028106          142 ASLKLNGLEVIHDVIYLIEDLIK-GVIPFDTVTMVKGELGVL--FF-EIPLKAKVSCEVYVN  199 (213)
Q Consensus       142 ~~v~~~~~~l~~~~~~l~~D~~~-G~v~l~~~~~v~grv~v~--~~-~~~~~~~v~C~v~v~  199 (213)
                      +++++...    +...+.+|+.+ +.++.+    ++|++.+.  .+ ...+.+.-+-++.+.
T Consensus        92 vpv~v~~~----~l~~~~~~i~~~~~i~Y~----l~g~l~~d~pv~G~~tiP~~~~Gei~lP  145 (151)
T d1xo8a_          92 IPVVVPYS----ILFNLARDVGVDWDIDYE----LQIGLTIDLPVVGEFTIPISSKGEIKLP  145 (151)
T ss_dssp             ECCCEEHH----HHHHHHHHHHHHSEEEEE----EEEEEEECCTTTSSEEEEEEEEEEEECC
T ss_pred             EEEEEEHH----HHHHHHHhhccCCCccEE----EEEEEEEecCccCceEeeeccCCEEECC
Confidence            88776532    23345566543 455544    55555542  11 223444444455543