Citrus Sinensis ID: 031883


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-
MWLGNVKVLSMETLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEKSLAPKTRKKPKDNKMKFGC
cccccHHHHHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHcccccccHHHHHHHHHHHHcccEEEEEEccccccccEEEEccccccccccccccccccHHHHHHcccccccccccccccccc
ccccHHHHHHHHHHHHHHHHccccccccEEccccccHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHcccccccHHHHHHHHHHHHcccEEEEEEcccccccEEEEEccccccccccccccHHHHHHHHcccccccccccccccccccc
mwlgnvkvLSMETLVRYYSSSRKAAVnvrkinpkvptqQAYSIAQSLYGIIkqhgpltvsntwihakdagisglngkTHMKIMLKWMRGRKMLKLFCngvgsnkkflhstlpeepqtdqleipsevqlqdekslapktrkkpkdnkmkfgc
mwlgnvkvlsMETLVRYYSssrkaavnvrkinpkvptQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDagisglngktHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEepqtdqleipsevqlqdekslapktrkkpkdnkmkfgc
MWLGNVKVLSMETLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEKSLAPKTRKKPKDNKMKFGC
*WLGNVKVLSMETLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFL********************************************
***GNVKVLSMETLVRYY*****************PTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAG****NGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHST**************************************FGC
MWLGNVKVLSMETLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIP****************************
*WLGNVKVLSMETLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQ***********************************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MWLGNVKVLSMETLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEKSLAPKTRKKPKDNKMKFGC
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query151
255555853142 conserved hypothetical protein [Ricinus 0.781 0.830 0.691 1e-41
225426494141 PREDICTED: uncharacterized protein LOC10 0.695 0.744 0.775 2e-41
224061533144 predicted protein [Populus trichocarpa] 0.761 0.798 0.647 5e-37
18410491151 uncharacterized protein [Arabidopsis tha 0.774 0.774 0.630 1e-36
297842141148 hypothetical protein ARALYDRAFT_476532 [ 0.728 0.743 0.651 5e-36
449456295154 PREDICTED: uncharacterized protein LOC10 0.894 0.876 0.532 3e-34
125546480140 hypothetical protein OsI_14364 [Oryza sa 0.768 0.828 0.55 4e-29
27573361158 unknown protein [Oryza sativa Japonica G 0.768 0.734 0.55 4e-29
297601996174 Os03g0852500 [Oryza sativa Japonica Grou 0.768 0.666 0.55 6e-29
242032213160 hypothetical protein SORBIDRAFT_01g00088 0.662 0.625 0.598 3e-27
>gi|255555853|ref|XP_002518962.1| conserved hypothetical protein [Ricinus communis] gi|223541949|gb|EEF43495.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  174 bits (441), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 83/120 (69%), Positives = 101/120 (84%), Gaps = 2/120 (1%)

Query: 13  TLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGIS 72
           +++RY+S  R+ AVNVRKINPKVP  +A  I+QSLY IIK+ GPL+++NTW +AK+A IS
Sbjct: 17  SMLRYFS--RQRAVNVRKINPKVPFPEAALISQSLYNIIKERGPLSIANTWNYAKEANIS 74

Query: 73  GLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEK 132
           GLN KTHMKIMLKWMRGRKMLK FCN VGS+KKFLHSTLPEEP++DQ ++ SE  LQ EK
Sbjct: 75  GLNSKTHMKIMLKWMRGRKMLKFFCNHVGSSKKFLHSTLPEEPRSDQSKVLSEPNLQTEK 134




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225426494|ref|XP_002277461.1| PREDICTED: uncharacterized protein LOC100253102 [Vitis vinifera] gi|297742484|emb|CBI34633.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224061533|ref|XP_002300527.1| predicted protein [Populus trichocarpa] gi|222847785|gb|EEE85332.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|18410491|ref|NP_565076.1| uncharacterized protein [Arabidopsis thaliana] gi|12325135|gb|AAG52511.1|AC016662_5 unknown protein; 73656-74659 [Arabidopsis thaliana] gi|16323093|gb|AAL15281.1| At1g73940/F2P9_19 [Arabidopsis thaliana] gi|21360419|gb|AAM47325.1| At1g73940/F2P9_19 [Arabidopsis thaliana] gi|21592788|gb|AAM64737.1| unknown [Arabidopsis thaliana] gi|332197407|gb|AEE35528.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297842141|ref|XP_002888952.1| hypothetical protein ARALYDRAFT_476532 [Arabidopsis lyrata subsp. lyrata] gi|297334793|gb|EFH65211.1| hypothetical protein ARALYDRAFT_476532 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449456295|ref|XP_004145885.1| PREDICTED: uncharacterized protein LOC101211684 [Cucumis sativus] gi|449507219|ref|XP_004162966.1| PREDICTED: uncharacterized LOC101211684 [Cucumis sativus] Back     alignment and taxonomy information
>gi|125546480|gb|EAY92619.1| hypothetical protein OsI_14364 [Oryza sativa Indica Group] gi|125588672|gb|EAZ29336.1| hypothetical protein OsJ_13403 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|27573361|gb|AAO20079.1| unknown protein [Oryza sativa Japonica Group] gi|108712147|gb|ABF99942.1| expressed protein [Oryza sativa Japonica Group] gi|215697183|dbj|BAG91177.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|297601996|ref|NP_001051924.2| Os03g0852500 [Oryza sativa Japonica Group] gi|255675055|dbj|BAF13838.2| Os03g0852500, partial [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|242032213|ref|XP_002463501.1| hypothetical protein SORBIDRAFT_01g000880 [Sorghum bicolor] gi|241917355|gb|EER90499.1| hypothetical protein SORBIDRAFT_01g000880 [Sorghum bicolor] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query151
TAIR|locus:2031481151 AT1G73940 "AT1G73940" [Arabido 0.807 0.807 0.593 4.9e-37
TAIR|locus:2158034145 AT5G49410 "AT5G49410" [Arabido 0.728 0.758 0.385 1.9e-19
TAIR|locus:2031481 AT1G73940 "AT1G73940" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 398 (145.2 bits), Expect = 4.9e-37, P = 4.9e-37
 Identities = 76/128 (59%), Positives = 97/128 (75%)

Query:    14 LVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISG 73
             LVRY S  R+ AVNVRKINPKVP Q+AY+I+ SLY + K HGPL+V NTW+ A++AG+SG
Sbjct:    30 LVRYMS--RERAVNVRKINPKVPIQEAYAISNSLYDLFKLHGPLSVPNTWLRAQEAGVSG 87

Query:    74 LNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEKS 133
             LN KTHMK++LKWMRG+KMLKL CN VGS+KKF H+ LPE+P  +Q   P    ++++K 
Sbjct:    88 LNSKTHMKLLLKWMRGKKMLKLICNQVGSSKKFFHTVLPEDPLQEQPAAP----IENKKQ 143

Query:   134 LAPKTRKK 141
                K R K
Sbjct:   144 AVKKKRSK 151




GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0006364 "rRNA processing" evidence=RCA
TAIR|locus:2158034 AT5G49410 "AT5G49410" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00000464001
SubName- Full=Chromosome chr2 scaffold_105, whole genome shotgun sequence; (141 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 151
PF0506672 HARE-HTH: HB1, ASXL, restriction endonuclease HTH 95.78
PF06163127 DUF977: Bacterial protein of unknown function (DUF 90.99
PF03981141 Ubiq_cyt_C_chap: Ubiquinol-cytochrome C chaperone 83.93
PF03965115 Penicillinase_R: Penicillinase repressor; InterPro 83.01
PF06224 327 HTH_42: Winged helix DNA-binding domain; InterPro: 82.41
>PF05066 HARE-HTH: HB1, ASXL, restriction endonuclease HTH domain; InterPro: IPR007759 DNA-directed RNA polymerases 2 Back     alignment and domain information
Probab=95.78  E-value=0.013  Score=39.04  Aligned_cols=38  Identities=24%  Similarity=0.613  Sum_probs=28.4

Q ss_pred             HHHHHHHHHHHHhC-CcchhhhhHHHHHhccCCCCchhH
Q 031883           42 SIAQSLYGIIKQHG-PLTVSNTWIHAKDAGISGLNGKTH   79 (151)
Q Consensus        42 tItr~l~dIlkeHG-PLTVsntW~hake~gi~gL~SK~H   79 (151)
                      |+..+.+.||+++| |+|+.+.|+.+++.|+-.-.+|+=
T Consensus         2 t~~eaa~~vL~~~~~pm~~~eI~~~i~~~~~~~~~~k~p   40 (72)
T PF05066_consen    2 TFKEAAYEVLEEAGRPMTFKEIWEEIQERGLYKKSGKTP   40 (72)
T ss_dssp             -HHHHHHHHHHHH-S-EEHHHHHHHHHHHHTS---GGGG
T ss_pred             CHHHHHHHHHHhcCCCcCHHHHHHHHHHhCCCCcccCCH
Confidence            56788999999998 999999999999998876656653



7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The delta protein is a dispensable subunit of Bacillus subtilis RNA polymerase (RNAP) that has major effects on the biochemical properties of the purified enzyme. In the presence of delta, RNAP displays an increased specificity of transcription, a decreased affinity for nucleic acids, and an increased efficiency of RNA synthesis because of enhanced recycling []. The delta protein, contains two distinct regions, an N-terminal domain and a glutamate and aspartate residue-rich C-terminal region [].; GO: 0003677 DNA binding, 0006351 transcription, DNA-dependent; PDB: 2KRC_A.

>PF06163 DUF977: Bacterial protein of unknown function (DUF977); InterPro: IPR010382 This entry consists of several hypothetical bacterial proteins of unknown function Back     alignment and domain information
>PF03981 Ubiq_cyt_C_chap: Ubiquinol-cytochrome C chaperone ; InterPro: IPR021150 Saccharomyces cerevisiae ubiquinol-cytochrome C chaperone is required for assembly of coenzyme QF-2-cytochrome C reductase Back     alignment and domain information
>PF03965 Penicillinase_R: Penicillinase repressor; InterPro: IPR005650 Proteins in this entry are transcriptional regulators found in a variety of bacteria and a small number of archaea Back     alignment and domain information
>PF06224 HTH_42: Winged helix DNA-binding domain; InterPro: IPR009351 This is a family of conserved bacterial proteins with unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query151
2fu4_A83 Ferric uptake regulation protein; DNA binding doma 81.65
>2fu4_A Ferric uptake regulation protein; DNA binding domain, helix-turn-helix, DNA binding protein; 1.80A {Escherichia coli} Back     alignment and structure
Probab=81.65  E-value=4.5  Score=25.48  Aligned_cols=50  Identities=8%  Similarity=0.183  Sum_probs=40.6

Q ss_pred             HHHHHHHHHh--CCcchhhhhHHHHHhccCCCCchhHHHHHHHHhhccceeeEE
Q 031883           45 QSLYGIIKQH--GPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLF   96 (151)
Q Consensus        45 r~l~dIlkeH--GPLTVsntW~hake~gi~gL~SK~HMKimLkWMreRq~lKL~   96 (151)
                      ..|+++|.++  +|+|+.+.++.+++.+. ++ |+.=.-..|+.|.+...|..+
T Consensus        20 ~~IL~~l~~~~~~~~s~~el~~~l~~~~~-~i-s~~TVyR~L~~L~~~Glv~~~   71 (83)
T 2fu4_A           20 LKILEVLQEPDNHHVSAEDLYKRLIDMGE-EI-GLATVYRVLNQFDDAGIVTRH   71 (83)
T ss_dssp             HHHHHHHTSGGGSSBCHHHHHHHHHHTTC-CC-CHHHHHHHHHHHHHHTSEEEE
T ss_pred             HHHHHHHHhCCCCCCCHHHHHHHHHHhCC-CC-CHhhHHHHHHHHHHCCCeEEE
Confidence            4578888877  79999999999988753 33 677788889999999998755




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query151
d1p6ra_82 Penicillinase repressor BlaI {Bacillus licheniform 83.82
>d1p6ra_ a.4.5.39 (A:) Penicillinase repressor BlaI {Bacillus licheniformis [TaxId: 1402]} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: "Winged helix" DNA-binding domain
family: Penicillinase repressor
domain: Penicillinase repressor BlaI
species: Bacillus licheniformis [TaxId: 1402]
Probab=83.82  E-value=2.4  Score=27.00  Aligned_cols=62  Identities=13%  Similarity=0.178  Sum_probs=48.7

Q ss_pred             HHHHHHHHhCCcchhhhhHHHHHhccCCCCchhHHHHHHHHhhccceeeEEeeccCCCcceeeecCCCC
Q 031883           46 SLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEE  114 (151)
Q Consensus        46 ~l~dIlkeHGPLTVsntW~hake~gi~gL~SK~HMKimLkWMreRq~lKL~CnhvG~~KqFLytt~~~~  114 (151)
                      .|.+||=++||+|+.+.-+.+.+.  .++ +.+=.+-+|+=|-++..|..    ....+.|+|++--+.
T Consensus        13 ~IM~~lW~~~~~t~~ei~~~l~~~--~~~-~~tTv~T~L~rL~~KG~l~~----~k~gr~~~Y~p~is~   74 (82)
T d1p6ra_          13 EVMKVIWKHSSINTNEVIKELSKT--STW-SPKTIQTMLLRLIKKGALNH----HKEGRVFVYTPNIDE   74 (82)
T ss_dssp             HHHHHHHTSSSEEHHHHHHHHHHH--SCC-CHHHHHHHHHHHHHTTSEEE----EEETTEEEEEESCSS
T ss_pred             HHHHHHHhCCCCCHHHHHHHhccc--cCC-cHhHHHHHHHHHHHCCCeEE----EecCCeEEEEecCCH
Confidence            578888899999999999998774  344 46677888888999999874    345678999976553