Citrus Sinensis ID: 041147


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220--
MEIEPTKPPVVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTLPGFGRSPLVRQLRITDSPFPLKDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYHHSWDR
ccccccccHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHccccHHHHHHcHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccc
ccccccccHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHccHHHHHHHHHHHHHHccccccccccccccccccccEEEEEcccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHccccccccccccccccccccccccccccccEEEEccccccccccccccccHHHHHHHHHHHHHHHHHHHHHcccccccccccc
meieptkppvvaKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIfhnhlsslscrsndahlsfispkeyefscsnspafnfsypftnyghnkrkhHHYHLQKAAkayhyddvaTVQAVQRVLEMLNTsdnnhavasaapsplvtlpgfgrsplvrqlritdspfplkdegdgqVDKAAEDFIKKFYKDLMLQKSMaafespyhhswdr
meieptkppvvakklWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTLPGFGRSPLVRQLRITdspfplkdegdgqVDKAAEDFIKKFYKDLMLQKSMaafespyhhswdr
MEIEPTKPPVVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTLPGFGRSPLVRQLRITDSPFPLKDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYHHSWDR
*********VVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHYDDVATVQAVQRVLEMLNT**************LVTLPGFGRSPLVRQLRI******************AEDFIKKFYKDLMLQK***************
*************KLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIG*******************************************************HYHLQKAAKAYHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTL*******************PLKDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYHHS*D*
********PVVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHN**********KAAKAYHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTLPGFGRSPLVRQLRITDSPFPLKDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFE*********
******KPPVVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSS**CRS*DAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHYDDVATVQAVQRVLEMLNT*************PLVTLPGFGRSPLVRQLRITDSPFPLKDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYH*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEIEPTKPPVVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHNHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTLPGFGRSPLVRQLRITDSPFPLKDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYHHSWDR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query222
255546355206 conserved hypothetical protein [Ricinus 0.927 1.0 0.662 2e-72
224142201209 predicted protein [Populus trichocarpa] 0.932 0.990 0.675 1e-71
225445114201 PREDICTED: uncharacterized protein LOC10 0.891 0.985 0.651 5e-71
449466266208 PREDICTED: uncharacterized protein LOC10 0.927 0.990 0.567 4e-60
388510770209 unknown [Lotus japonicus] 0.914 0.971 0.602 1e-59
147768061208 hypothetical protein VITISV_030073 [Viti 0.855 0.913 0.546 6e-55
225442493208 PREDICTED: uncharacterized protein LOC10 0.855 0.913 0.546 1e-54
351721474189 uncharacterized protein LOC100305871 [Gl 0.824 0.968 0.517 3e-49
357454807195 hypothetical protein MTR_2g101120 [Medic 0.828 0.943 0.507 6e-47
356556470203 PREDICTED: uncharacterized protein LOC10 0.873 0.955 0.492 1e-46
>gi|255546355|ref|XP_002514237.1| conserved hypothetical protein [Ricinus communis] gi|223546693|gb|EEF48191.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  277 bits (708), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 147/222 (66%), Positives = 172/222 (77%), Gaps = 16/222 (7%)

Query: 1   MEIEPTKPPVVAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFH 60
           ME+E + P V AKKLW IVR VFFM++ GISKS+IMVD HLMLKRGNKLAGKAIGNL++H
Sbjct: 1   MEVEASTPEV-AKKLWHIVRAVFFMLRKGISKSRIMVDLHLMLKRGNKLAGKAIGNLVYH 59

Query: 61  NHLSSLSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKA 120
           ++ SS SCRSNDA L+FISP+EYEFSCSNSPA  F YPF+ +      H H+      K+
Sbjct: 60  HNHSSFSCRSNDA-LNFISPREYEFSCSNSPA-TF-YPFSAHKRKHHHHLHF-----TKS 111

Query: 121 YHYDDVATVQAVQRVLEMLNTSDNNHAVASAAPSPLVTLPGFGRSPLVRQLRITDSPFPL 180
           Y Y+DV TV AVQ++LEMLN    N     A+P   + LPGFG+SP+VRQLRITDSPFPL
Sbjct: 112 YKYNDVTTVAAVQKMLEMLN----NEVQVEASP---MVLPGFGKSPMVRQLRITDSPFPL 164

Query: 181 KDEGDGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYHHSWDR 222
           KDEGD QVDKAAE+FIKKFYKDL LQK++AA ESPYH  W R
Sbjct: 165 KDEGDSQVDKAAEEFIKKFYKDLKLQKTVAALESPYHGMWGR 206




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224142201|ref|XP_002324447.1| predicted protein [Populus trichocarpa] gi|222865881|gb|EEF03012.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225445114|ref|XP_002283815.1| PREDICTED: uncharacterized protein LOC100266427 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449466266|ref|XP_004150847.1| PREDICTED: uncharacterized protein LOC101210555 [Cucumis sativus] gi|449488123|ref|XP_004157945.1| PREDICTED: uncharacterized protein LOC101229363 [Cucumis sativus] Back     alignment and taxonomy information
>gi|388510770|gb|AFK43451.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|147768061|emb|CAN64915.1| hypothetical protein VITISV_030073 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225442493|ref|XP_002283976.1| PREDICTED: uncharacterized protein LOC100248099 [Vitis vinifera] Back     alignment and taxonomy information
>gi|351721474|ref|NP_001237978.1| uncharacterized protein LOC100305871 [Glycine max] gi|255626845|gb|ACU13767.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357454807|ref|XP_003597684.1| hypothetical protein MTR_2g101120 [Medicago truncatula] gi|355486732|gb|AES67935.1| hypothetical protein MTR_2g101120 [Medicago truncatula] Back     alignment and taxonomy information
>gi|356556470|ref|XP_003546548.1| PREDICTED: uncharacterized protein LOC100786928 [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query222
TAIR|locus:2034071208 AT1G52140 "AT1G52140" [Arabido 0.851 0.908 0.448 1.5e-35
TAIR|locus:2119911200 AT4G29110 "AT4G29110" [Arabido 0.432 0.48 0.537 1.5e-33
TAIR|locus:2094932205 AT3G16330 "AT3G16330" [Arabido 0.824 0.892 0.435 1.1e-30
TAIR|locus:2134118199 AT4G32860 "AT4G32860" [Arabido 0.531 0.592 0.323 7.9e-13
TAIR|locus:2034071 AT1G52140 "AT1G52140" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 384 (140.2 bits), Expect = 1.5e-35, P = 1.5e-35
 Identities = 96/214 (44%), Positives = 128/214 (59%)

Query:    11 VAKKLWKIVRIVFFMIKAGISKSKIMVDFHLMLKRGNKLAGKAIGNLIFHN----HL-SS 65
             ++KKLW IVR + +MI+ G+SK+K++ DF+  LKRG         NL+FH     H  S+
Sbjct:     7 ISKKLWNIVRFLLYMIRKGVSKNKLIADFNATLKRGK--------NLMFHQRRRVHAGST 58

Query:    66 LSCRSNDAHLSFISPKEYEFSCSNSPAFNFSYPFTNYGHNKRKHHHYHLQKAAKAYHY-- 123
              S   N    +  S +EYEFSCSN+P  N+S+PF+N    ++K H+ +L    +      
Sbjct:    59 ASAALNATSATASSRQEYEFSCSNTP--NYSFPFSNMAFMRKKSHN-NLFTCGQTPQTLD 115

Query:   124 DDVATVQAVQRVLEMLNTSDN-NHAVASAAPSPLVTLPGFGRSPLVRQLRITDSPFPLKD 182
             DDVA  +AV  +L  +    N   A  + A SP    PGFG++PLVR LR+TDSPFPL  
Sbjct:   116 DDVAAARAVLELLNGVGEKGNVTPADLTVALSPY--FPGFGQTPLVRPLRVTDSPFPLTP 173

Query:   183 E-GD---GQVDKAAEDFIKKFYKDLMLQKSMAAF 212
             E GD   G VDKAA+DFIKKFYK+L  QK M  F
Sbjct:   174 ENGDVANGHVDKAADDFIKKFYKNLNQQKKMIEF 207




GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2119911 AT4G29110 "AT4G29110" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2094932 AT3G16330 "AT3G16330" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2134118 AT4G32860 "AT4G32860" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query222
pfam0555338 pfam05553, DUF761, Cotton fibre expressed protein 9e-06
>gnl|CDD|218633 pfam05553, DUF761, Cotton fibre expressed protein Back     alignment and domain information
 Score = 41.0 bits (97), Expect = 9e-06
 Identities = 12/33 (36%), Positives = 21/33 (63%), Gaps = 2/33 (6%)

Query: 184 GDGQVDKAAEDFIKKFYKDLMLQK--SMAAFES 214
            D +VD+ AE+FI +F + L LQ+  S+  ++ 
Sbjct: 1   SDDEVDRRAEEFIARFREQLRLQRQESLKRYQE 33


This family consists of several plant proteins of unknown function. Three of the sequences (from Gossypium hirsutum) in this family are described as cotton fibre expressed proteins. The remaining sequences, found in Arabidopsis thaliana, are uncharacterized. Length = 38

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 222
PF0555338 DUF761: Cotton fibre expressed protein; InterPro: 99.28
>PF05553 DUF761: Cotton fibre expressed protein; InterPro: IPR008480 This family consists of several plant proteins of unknown function Back     alignment and domain information
Probab=99.28  E-value=1.4e-12  Score=86.65  Aligned_cols=36  Identities=39%  Similarity=0.598  Sum_probs=33.5

Q ss_pred             ChhhhHHHHHHHHHHHHHHhhHHhhhhcCCccccccCC
Q 041147          185 DGQVDKAAEDFIKKFYKDLMLQKSMAAFESPYHHSWDR  222 (222)
Q Consensus       185 ~~~VD~~AeeFI~rFy~qlr~Q~~~~~~~~~~~~~~~~  222 (222)
                      +.+||++||+||++||+|||+|++.++.  .|.+||+|
T Consensus         2 ~~evd~rAe~FI~~f~~qlrlqr~~S~~--ry~eml~R   37 (38)
T PF05553_consen    2 DDEVDRRAEEFIAKFREQLRLQRQESLQ--RYQEMLAR   37 (38)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHhc
Confidence            4689999999999999999999999999  69999986



Three of the sequences from Gossypium hirsutum (Upland cotton) in this family are described as G. hirsutum fibre expressed proteins []. The remaining sequences, found in Arabidopsis thaliana, are uncharacterised.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query222
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-06
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 47.5 bits (112), Expect = 2e-06
 Identities = 21/101 (20%), Positives = 40/101 (39%), Gaps = 12/101 (11%)

Query: 109 HHHYHL--QKAAKAYHYDDVATV--QAVQRVLEMLNTSDNNHAVASAAP-SPLVTLPGFG 163
           HHH+H+  +     Y Y D+ +V   A     +  +  D   ++ S      ++      
Sbjct: 2   HHHHHMDFETGEHQYQYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSK--- 58

Query: 164 RSPLVRQLRITDSPFPLKDEGDGQVDKAAEDFIKKFYKDLM 204
              +   LR+  +   L  + +  V K  E+ ++  YK LM
Sbjct: 59  -DAVSGTLRLFWT---LLSKQEEMVQKFVEEVLRINYKFLM 95


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00