Citrus Sinensis ID: 031883
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 151 | ||||||
| 255555853 | 142 | conserved hypothetical protein [Ricinus | 0.781 | 0.830 | 0.691 | 1e-41 | |
| 225426494 | 141 | PREDICTED: uncharacterized protein LOC10 | 0.695 | 0.744 | 0.775 | 2e-41 | |
| 224061533 | 144 | predicted protein [Populus trichocarpa] | 0.761 | 0.798 | 0.647 | 5e-37 | |
| 18410491 | 151 | uncharacterized protein [Arabidopsis tha | 0.774 | 0.774 | 0.630 | 1e-36 | |
| 297842141 | 148 | hypothetical protein ARALYDRAFT_476532 [ | 0.728 | 0.743 | 0.651 | 5e-36 | |
| 449456295 | 154 | PREDICTED: uncharacterized protein LOC10 | 0.894 | 0.876 | 0.532 | 3e-34 | |
| 125546480 | 140 | hypothetical protein OsI_14364 [Oryza sa | 0.768 | 0.828 | 0.55 | 4e-29 | |
| 27573361 | 158 | unknown protein [Oryza sativa Japonica G | 0.768 | 0.734 | 0.55 | 4e-29 | |
| 297601996 | 174 | Os03g0852500 [Oryza sativa Japonica Grou | 0.768 | 0.666 | 0.55 | 6e-29 | |
| 242032213 | 160 | hypothetical protein SORBIDRAFT_01g00088 | 0.662 | 0.625 | 0.598 | 3e-27 |
| >gi|255555853|ref|XP_002518962.1| conserved hypothetical protein [Ricinus communis] gi|223541949|gb|EEF43495.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 174 bits (441), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 83/120 (69%), Positives = 101/120 (84%), Gaps = 2/120 (1%)
Query: 13 TLVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGIS 72
+++RY+S R+ AVNVRKINPKVP +A I+QSLY IIK+ GPL+++NTW +AK+A IS
Sbjct: 17 SMLRYFS--RQRAVNVRKINPKVPFPEAALISQSLYNIIKERGPLSIANTWNYAKEANIS 74
Query: 73 GLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEK 132
GLN KTHMKIMLKWMRGRKMLK FCN VGS+KKFLHSTLPEEP++DQ ++ SE LQ EK
Sbjct: 75 GLNSKTHMKIMLKWMRGRKMLKFFCNHVGSSKKFLHSTLPEEPRSDQSKVLSEPNLQTEK 134
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225426494|ref|XP_002277461.1| PREDICTED: uncharacterized protein LOC100253102 [Vitis vinifera] gi|297742484|emb|CBI34633.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224061533|ref|XP_002300527.1| predicted protein [Populus trichocarpa] gi|222847785|gb|EEE85332.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|18410491|ref|NP_565076.1| uncharacterized protein [Arabidopsis thaliana] gi|12325135|gb|AAG52511.1|AC016662_5 unknown protein; 73656-74659 [Arabidopsis thaliana] gi|16323093|gb|AAL15281.1| At1g73940/F2P9_19 [Arabidopsis thaliana] gi|21360419|gb|AAM47325.1| At1g73940/F2P9_19 [Arabidopsis thaliana] gi|21592788|gb|AAM64737.1| unknown [Arabidopsis thaliana] gi|332197407|gb|AEE35528.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297842141|ref|XP_002888952.1| hypothetical protein ARALYDRAFT_476532 [Arabidopsis lyrata subsp. lyrata] gi|297334793|gb|EFH65211.1| hypothetical protein ARALYDRAFT_476532 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|449456295|ref|XP_004145885.1| PREDICTED: uncharacterized protein LOC101211684 [Cucumis sativus] gi|449507219|ref|XP_004162966.1| PREDICTED: uncharacterized LOC101211684 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|125546480|gb|EAY92619.1| hypothetical protein OsI_14364 [Oryza sativa Indica Group] gi|125588672|gb|EAZ29336.1| hypothetical protein OsJ_13403 [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
| >gi|27573361|gb|AAO20079.1| unknown protein [Oryza sativa Japonica Group] gi|108712147|gb|ABF99942.1| expressed protein [Oryza sativa Japonica Group] gi|215697183|dbj|BAG91177.1| unnamed protein product [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
| >gi|297601996|ref|NP_001051924.2| Os03g0852500 [Oryza sativa Japonica Group] gi|255675055|dbj|BAF13838.2| Os03g0852500, partial [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
| >gi|242032213|ref|XP_002463501.1| hypothetical protein SORBIDRAFT_01g000880 [Sorghum bicolor] gi|241917355|gb|EER90499.1| hypothetical protein SORBIDRAFT_01g000880 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 151 | ||||||
| TAIR|locus:2031481 | 151 | AT1G73940 "AT1G73940" [Arabido | 0.807 | 0.807 | 0.593 | 4.9e-37 | |
| TAIR|locus:2158034 | 145 | AT5G49410 "AT5G49410" [Arabido | 0.728 | 0.758 | 0.385 | 1.9e-19 |
| TAIR|locus:2031481 AT1G73940 "AT1G73940" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 398 (145.2 bits), Expect = 4.9e-37, P = 4.9e-37
Identities = 76/128 (59%), Positives = 97/128 (75%)
Query: 14 LVRYYSSSRKAAVNVRKINPKVPTQQAYSIAQSLYGIIKQHGPLTVSNTWIHAKDAGISG 73
LVRY S R+ AVNVRKINPKVP Q+AY+I+ SLY + K HGPL+V NTW+ A++AG+SG
Sbjct: 30 LVRYMS--RERAVNVRKINPKVPIQEAYAISNSLYDLFKLHGPLSVPNTWLRAQEAGVSG 87
Query: 74 LNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEEPQTDQLEIPSEVQLQDEKS 133
LN KTHMK++LKWMRG+KMLKL CN VGS+KKF H+ LPE+P +Q P ++++K
Sbjct: 88 LNSKTHMKLLLKWMRGKKMLKLICNQVGSSKKFFHTVLPEDPLQEQPAAP----IENKKQ 143
Query: 134 LAPKTRKK 141
K R K
Sbjct: 144 AVKKKRSK 151
|
|
| TAIR|locus:2158034 AT5G49410 "AT5G49410" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00000464001 | SubName- Full=Chromosome chr2 scaffold_105, whole genome shotgun sequence; (141 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 151 | |||
| PF05066 | 72 | HARE-HTH: HB1, ASXL, restriction endonuclease HTH | 95.78 | |
| PF06163 | 127 | DUF977: Bacterial protein of unknown function (DUF | 90.99 | |
| PF03981 | 141 | Ubiq_cyt_C_chap: Ubiquinol-cytochrome C chaperone | 83.93 | |
| PF03965 | 115 | Penicillinase_R: Penicillinase repressor; InterPro | 83.01 | |
| PF06224 | 327 | HTH_42: Winged helix DNA-binding domain; InterPro: | 82.41 |
| >PF05066 HARE-HTH: HB1, ASXL, restriction endonuclease HTH domain; InterPro: IPR007759 DNA-directed RNA polymerases 2 | Back alignment and domain information |
|---|
Probab=95.78 E-value=0.013 Score=39.04 Aligned_cols=38 Identities=24% Similarity=0.613 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHhC-CcchhhhhHHHHHhccCCCCchhH
Q 031883 42 SIAQSLYGIIKQHG-PLTVSNTWIHAKDAGISGLNGKTH 79 (151)
Q Consensus 42 tItr~l~dIlkeHG-PLTVsntW~hake~gi~gL~SK~H 79 (151)
|+..+.+.||+++| |+|+.+.|+.+++.|+-.-.+|+=
T Consensus 2 t~~eaa~~vL~~~~~pm~~~eI~~~i~~~~~~~~~~k~p 40 (72)
T PF05066_consen 2 TFKEAAYEVLEEAGRPMTFKEIWEEIQERGLYKKSGKTP 40 (72)
T ss_dssp -HHHHHHHHHHHH-S-EEHHHHHHHHHHHHTS---GGGG
T ss_pred CHHHHHHHHHHhcCCCcCHHHHHHHHHHhCCCCcccCCH
Confidence 56788999999998 999999999999998876656653
|
7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The delta protein is a dispensable subunit of Bacillus subtilis RNA polymerase (RNAP) that has major effects on the biochemical properties of the purified enzyme. In the presence of delta, RNAP displays an increased specificity of transcription, a decreased affinity for nucleic acids, and an increased efficiency of RNA synthesis because of enhanced recycling []. The delta protein, contains two distinct regions, an N-terminal domain and a glutamate and aspartate residue-rich C-terminal region [].; GO: 0003677 DNA binding, 0006351 transcription, DNA-dependent; PDB: 2KRC_A. |
| >PF06163 DUF977: Bacterial protein of unknown function (DUF977); InterPro: IPR010382 This entry consists of several hypothetical bacterial proteins of unknown function | Back alignment and domain information |
|---|
| >PF03981 Ubiq_cyt_C_chap: Ubiquinol-cytochrome C chaperone ; InterPro: IPR021150 Saccharomyces cerevisiae ubiquinol-cytochrome C chaperone is required for assembly of coenzyme QF-2-cytochrome C reductase | Back alignment and domain information |
|---|
| >PF03965 Penicillinase_R: Penicillinase repressor; InterPro: IPR005650 Proteins in this entry are transcriptional regulators found in a variety of bacteria and a small number of archaea | Back alignment and domain information |
|---|
| >PF06224 HTH_42: Winged helix DNA-binding domain; InterPro: IPR009351 This is a family of conserved bacterial proteins with unknown function | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 151 | |||
| 2fu4_A | 83 | Ferric uptake regulation protein; DNA binding doma | 81.65 |
| >2fu4_A Ferric uptake regulation protein; DNA binding domain, helix-turn-helix, DNA binding protein; 1.80A {Escherichia coli} | Back alignment and structure |
|---|
Probab=81.65 E-value=4.5 Score=25.48 Aligned_cols=50 Identities=8% Similarity=0.183 Sum_probs=40.6
Q ss_pred HHHHHHHHHh--CCcchhhhhHHHHHhccCCCCchhHHHHHHHHhhccceeeEE
Q 031883 45 QSLYGIIKQH--GPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLF 96 (151)
Q Consensus 45 r~l~dIlkeH--GPLTVsntW~hake~gi~gL~SK~HMKimLkWMreRq~lKL~ 96 (151)
..|+++|.++ +|+|+.+.++.+++.+. ++ |+.=.-..|+.|.+...|..+
T Consensus 20 ~~IL~~l~~~~~~~~s~~el~~~l~~~~~-~i-s~~TVyR~L~~L~~~Glv~~~ 71 (83)
T 2fu4_A 20 LKILEVLQEPDNHHVSAEDLYKRLIDMGE-EI-GLATVYRVLNQFDDAGIVTRH 71 (83)
T ss_dssp HHHHHHHTSGGGSSBCHHHHHHHHHHTTC-CC-CHHHHHHHHHHHHHHTSEEEE
T ss_pred HHHHHHHHhCCCCCCCHHHHHHHHHHhCC-CC-CHhhHHHHHHHHHHCCCeEEE
Confidence 4578888877 79999999999988753 33 677788889999999998755
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 151 | |||
| d1p6ra_ | 82 | Penicillinase repressor BlaI {Bacillus licheniform | 83.82 |
| >d1p6ra_ a.4.5.39 (A:) Penicillinase repressor BlaI {Bacillus licheniformis [TaxId: 1402]} | Back information, alignment and structure |
|---|
class: All alpha proteins fold: DNA/RNA-binding 3-helical bundle superfamily: "Winged helix" DNA-binding domain family: Penicillinase repressor domain: Penicillinase repressor BlaI species: Bacillus licheniformis [TaxId: 1402]
Probab=83.82 E-value=2.4 Score=27.00 Aligned_cols=62 Identities=13% Similarity=0.178 Sum_probs=48.7
Q ss_pred HHHHHHHHhCCcchhhhhHHHHHhccCCCCchhHHHHHHHHhhccceeeEEeeccCCCcceeeecCCCC
Q 031883 46 SLYGIIKQHGPLTVSNTWIHAKDAGISGLNGKTHMKIMLKWMRGRKMLKLFCNGVGSNKKFLHSTLPEE 114 (151)
Q Consensus 46 ~l~dIlkeHGPLTVsntW~hake~gi~gL~SK~HMKimLkWMreRq~lKL~CnhvG~~KqFLytt~~~~ 114 (151)
.|.+||=++||+|+.+.-+.+.+. .++ +.+=.+-+|+=|-++..|.. ....+.|+|++--+.
T Consensus 13 ~IM~~lW~~~~~t~~ei~~~l~~~--~~~-~~tTv~T~L~rL~~KG~l~~----~k~gr~~~Y~p~is~ 74 (82)
T d1p6ra_ 13 EVMKVIWKHSSINTNEVIKELSKT--STW-SPKTIQTMLLRLIKKGALNH----HKEGRVFVYTPNIDE 74 (82)
T ss_dssp HHHHHHHTSSSEEHHHHHHHHHHH--SCC-CHHHHHHHHHHHHHTTSEEE----EEETTEEEEEESCSS
T ss_pred HHHHHHHhCCCCCHHHHHHHhccc--cCC-cHhHHHHHHHHHHHCCCeEE----EecCCeEEEEecCCH
Confidence 578888899999999999998774 344 46677888888999999874 345678999976553
|