Citrus Sinensis ID: 047520
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 102 | ||||||
| 224126313 | 788 | predicted protein [Populus trichocarpa] | 1.0 | 0.129 | 0.496 | 2e-27 | |
| 357490617 | 892 | Zinc finger MYM-type protein [Medicago t | 1.0 | 0.114 | 0.480 | 2e-26 | |
| 15233987 | 664 | uncharacterized protein [Arabidopsis tha | 0.980 | 0.150 | 0.542 | 1e-25 | |
| 15221982 | 769 | TTF-type zinc finger protein with HAT di | 0.980 | 0.130 | 0.48 | 1e-24 | |
| 8954063 | 811 | Contains similarity to a transposable el | 0.980 | 0.123 | 0.48 | 2e-24 | |
| 357439181 | 665 | hypothetical protein MTR_1g040620 [Medic | 1.0 | 0.153 | 0.488 | 5e-24 | |
| 15230070 | 536 | General transcription factor 2-related z | 0.980 | 0.186 | 0.464 | 3e-23 | |
| 15230069 | 505 | General transcription factor 2-related z | 0.980 | 0.198 | 0.456 | 2e-22 | |
| 11994527 | 571 | unnamed protein product [Arabidopsis tha | 0.921 | 0.164 | 0.504 | 5e-22 | |
| 15222347 | 496 | General transcription factor 2-related z | 0.980 | 0.201 | 0.456 | 1e-21 |
| >gi|224126313|ref|XP_002319808.1| predicted protein [Populus trichocarpa] gi|222858184|gb|EEE95731.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats.
Identities = 63/127 (49%), Positives = 83/127 (65%), Gaps = 25/127 (19%)
Query: 1 MVVALRYADKNGYVFERFIGFKHVTCTTAISLKEALDQLFSKYGLSISRLRKQGYDGASN 60
M V LRY D NG++ ERF+G +HV TTA SLK A++ LFSK+GLSISRLR QGYDGASN
Sbjct: 298 MAVVLRYVDNNGHIIERFLGIQHVRDTTASSLKAAIEALFSKHGLSISRLRGQGYDGASN 357
Query: 61 IQGEFNA-------------------------IVAVAKKHDQINSFFNVIANVINVVGAS 95
++GEFN +VAV KKH+++ FN I+++IN+VGAS
Sbjct: 358 MRGEFNGLKALILNSNPSAYYVHCFAHRLQLTLVAVTKKHNEVGDVFNFISSIINIVGAS 417
Query: 96 CKRRDIL 102
CKR +++
Sbjct: 418 CKRMEVI 424
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|357490617|ref|XP_003615596.1| Zinc finger MYM-type protein [Medicago truncatula] gi|355516931|gb|AES98554.1| Zinc finger MYM-type protein [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|15233987|ref|NP_192704.1| uncharacterized protein [Arabidopsis thaliana] gi|4538896|emb|CAB39633.1| putative protein [Arabidopsis thaliana] gi|7267661|emb|CAB78089.1| putative protein [Arabidopsis thaliana] gi|332657378|gb|AEE82778.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|15221982|ref|NP_173360.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] gi|332191703|gb|AEE29824.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|8954063|gb|AAF82236.1|AC069143_12 Contains similarity to a transposable element Tip100 protein for transposase from Ipomoea purpurea gb|4063769 and is a member of the transmembrane 4 family PF|00335 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|357439181|ref|XP_003589867.1| hypothetical protein MTR_1g040620 [Medicago truncatula] gi|355478915|gb|AES60118.1| hypothetical protein MTR_1g040620 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|15230070|ref|NP_189621.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] gi|332644086|gb|AEE77607.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|15230069|ref|NP_189620.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] gi|332644085|gb|AEE77606.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|11994527|dbj|BAB02591.1| unnamed protein product [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|15222347|ref|NP_174954.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] gi|332193792|gb|AEE31913.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 102 | ||||||
| TAIR|locus:2202165 | 769 | AT1G19260 [Arabidopsis thalian | 0.725 | 0.096 | 0.567 | 4.8e-19 | |
| TAIR|locus:2829839 | 505 | AT3G29763 "AT3G29763" [Arabido | 0.745 | 0.150 | 0.526 | 7.3e-19 | |
| TAIR|locus:2829849 | 536 | AT3G29765 [Arabidopsis thalian | 0.745 | 0.141 | 0.526 | 7.4e-19 | |
| TAIR|locus:2118430 | 664 | AT4G09660 [Arabidopsis thalian | 0.980 | 0.150 | 0.457 | 4.2e-18 | |
| TAIR|locus:4515102803 | 592 | AT2G06541 [Arabidopsis thalian | 0.666 | 0.114 | 0.558 | 9.8e-17 | |
| TAIR|locus:2033523 | 496 | AT1G41920 "AT1G41920" [Arabido | 0.647 | 0.133 | 0.590 | 2.5e-16 | |
| TAIR|locus:2036700 | 206 | AT1G42710 "AT1G42710" [Arabido | 0.578 | 0.286 | 0.559 | 3.1e-12 | |
| TAIR|locus:5019474775 | 393 | AT3G31402 "AT3G31402" [Arabido | 0.558 | 0.145 | 0.543 | 2e-11 | |
| TAIR|locus:2093847 | 522 | AT3G29450 "AT3G29450" [Arabido | 0.372 | 0.072 | 0.684 | 3.1e-11 | |
| UNIPROTKB|E1BC73 | 1130 | ZMYM1 "Uncharacterized protein | 0.725 | 0.065 | 0.324 | 0.0001 |
| TAIR|locus:2202165 AT1G19260 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 215 (80.7 bits), Expect = 4.8e-19, Sum P(2) = 4.8e-19
Identities = 42/74 (56%), Positives = 55/74 (74%)
Query: 1 MVVALRYADKNGYVFERFIGFKHVTCTTAISLKEALDQLFSKYGLSISRLRKQGYDGASN 60
M V R+ DK G V ERFIG HV T+++SLK A+D LF+KYGLS+ +LR QGYDGASN
Sbjct: 280 MAVVFRFVDKYGVVKERFIGVIHVQETSSLSLKSAIDSLFAKYGLSLKKLRGQGYDGASN 339
Query: 61 IQGEFNAIVAVAKK 74
++GEFN + ++ K
Sbjct: 340 MKGEFNGLRSLILK 353
|
|
| TAIR|locus:2829839 AT3G29763 "AT3G29763" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2829849 AT3G29765 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2118430 AT4G09660 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:4515102803 AT2G06541 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2033523 AT1G41920 "AT1G41920" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2036700 AT1G42710 "AT1G42710" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:5019474775 AT3G31402 "AT3G31402" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2093847 AT3G29450 "AT3G29450" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|E1BC73 ZMYM1 "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| eugene3.00130851 | hypothetical protein (788 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 102 | |||
| pfam14291 | 179 | pfam14291, DUF4371, Domain of unknown function (DU | 1e-28 |
| >gnl|CDD|222655 pfam14291, DUF4371, Domain of unknown function (DUF4371) | Back alignment and domain information |
|---|
Score = 101 bits (253), Expect = 1e-28
Identities = 40/66 (60%), Positives = 52/66 (78%)
Query: 1 MVVALRYADKNGYVFERFIGFKHVTCTTAISLKEALDQLFSKYGLSISRLRKQGYDGASN 60
M V LRY DK G V ERF+G HV TT+++LK A+D L +K+GLS+S++R QGYDGASN
Sbjct: 113 MAVVLRYVDKKGQVKERFLGVVHVEETTSLTLKSAIDSLLAKHGLSLSKVRGQGYDGASN 172
Query: 61 IQGEFN 66
++GEFN
Sbjct: 173 MKGEFN 178
|
Length = 179 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 102 | |||
| PF14291 | 235 | DUF4371: Domain of unknown function (DUF4371) | 99.91 | |
| KOG1121 | 641 | consensus Tam3-transposase (Ac family) [Replicatio | 97.61 | |
| PF04937 | 153 | DUF659: Protein of unknown function (DUF 659); Int | 96.97 |
| >PF14291 DUF4371: Domain of unknown function (DUF4371) | Back alignment and domain information |
|---|
Probab=99.91 E-value=3.7e-25 Score=151.69 Aligned_cols=66 Identities=45% Similarity=0.739 Sum_probs=64.1
Q ss_pred CEEEEEEEcCCCeEEEEeeeeEEcCCCChHHHHHHHHHHHHhcCCCccceeeeeecCCcccccchh
Q 047520 1 MVVALRYADKNGYVFERFIGFKHVTCTTAISLKEALDQLFSKYGLSISRLRKQGYDGASNIQGEFN 66 (102)
Q Consensus 1 l~i~vryv~~~~~i~e~fl~~~~~~~~ta~~i~~~i~~~l~~~~L~~~~~~g~~~Dgas~m~g~~~ 66 (102)
|+++||||++++.++|+||+|++++++||++|++.|++.|+++|||+++|+||+|||||+|+|+++
T Consensus 169 l~i~vRyv~~~~~i~E~Fl~f~~~~~~ta~~l~~~i~~~L~~~~l~~~~~~gq~yDgas~M~G~~~ 234 (235)
T PF14291_consen 169 LSICVRYVDKDGKIKERFLGFVELEDTTAESLFNAIKDVLEKLGLDLSNCRGQCYDGASNMSGKHN 234 (235)
T ss_pred hhheeeeeccCcceeeeeeeeeccCCccHHHHHHHHHHHHHHcCCCHHHcCcccccChHhheeccC
Confidence 689999999888999999999999999999999999999999999999999999999999999986
|
|
| >KOG1121 consensus Tam3-transposase (Ac family) [Replication, recombination and repair] | Back alignment and domain information |
|---|
| >PF04937 DUF659: Protein of unknown function (DUF 659); InterPro: IPR007021 These are transposase-like proteins with no known function | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 102 | |||
| 2bw3_A | 534 | Transposase; DNA recombination, transposition; 2.0 | 99.63 |
| >2bw3_A Transposase; DNA recombination, transposition; 2.0A {Musca domestica} SCOP: a.270.1.1 c.55.3.12 PDB: 2bw3_B | Back alignment and structure |
|---|
Probab=99.63 E-value=1.8e-15 Score=111.90 Aligned_cols=98 Identities=16% Similarity=0.225 Sum_probs=85.8
Q ss_pred CEEEEEEEcCCCeEEEEeeeeEEcC--CCChHHHHHHHHHHHHhcCCCccceeeeeecCCcccccchh------------
Q 047520 1 MVVALRYADKNGYVFERFIGFKHVT--CTTAISLKEALDQLFSKYGLSISRLRKQGYDGASNIQGEFN------------ 66 (102)
Q Consensus 1 l~i~vryv~~~~~i~e~fl~~~~~~--~~ta~~i~~~i~~~l~~~~L~~~~~~g~~~Dgas~m~g~~~------------ 66 (102)
|++++||+ +++++++++|+|.+++ ++||+.|++.|.+++++|||+..++++..+|||+||.|..+
T Consensus 117 l~i~v~~i-~~~~~~~~~L~~~~~~~~~~t~~~i~~~i~~~l~~~~l~~~~~v~~vtDnasn~~~~~~~~~~i~C~~H~l 195 (534)
T 2bw3_A 117 LGVTLHYH-ENNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSIKFVTDRGANVVKSLANNIRINCSSHLL 195 (534)
T ss_dssp EEEEEEEE-ETTEEEEEEEEEEEECGGGCSHHHHHHHHHHHHHTTTCCCCTTCEEEECCCHHHHHHTTTSEEEECHHHHH
T ss_pred eEEEEEEe-eCCceEEEEEeEeecCCCCCCHHHHHHHHHHHHHHcCCCcccceeEECCCHHHHHHHHhcCcccccHHHHH
Confidence 57999999 4789999999999998 59999999999999999999866678888899999988754
Q ss_pred --HHHHHHhhchhHHHHHHHHHHHHHHhhccchhh
Q 047520 67 --AIVAVAKKHDQINSFFNVIANVINVVGASCKRR 99 (102)
Q Consensus 67 --~v~~~~~~~~~~~~~~~~~~~~~~~f~~S~kr~ 99 (102)
++++.++..+.+..+++.++++++||++|+.|.
T Consensus 196 nLi~~~~l~~~~~~~~~l~~~~~iv~~i~~s~~~~ 230 (534)
T 2bw3_A 196 SNVLENSFEETPELNMPILACKNIVKYFKKANLQH 230 (534)
T ss_dssp HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHTTCGG
T ss_pred HHHHHHHhhhhhhHHHHHHHHHHHHHHHhccHhhH
Confidence 445556667788999999999999999998875
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 102 | ||||
| d2bw3a2 | 447 | c.55.3.12 (A:163-609) Transposase Hermes, catalyti | 2e-08 |
| >d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} Length = 447 | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Ribonuclease H-like motif superfamily: Ribonuclease H-like family: Hermes transposase-like domain: Transposase Hermes, catalytic domain species: House fly (Musca domestica) [TaxId: 7370]
Score = 47.7 bits (112), Expect = 2e-08
Identities = 16/118 (13%), Positives = 43/118 (36%), Gaps = 17/118 (14%)
Query: 1 MVVALRYADKNGYVFERFIGFKHVTC--TTAISLKEALDQLFSKYGLSISRLRKQGYDGA 58
+ V L Y + N + + +G K + +TA ++ + L +FS++ + K D
Sbjct: 30 LGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSIKFVTDRG 88
Query: 59 SNIQGEFNAIVAV--------------AKKHDQINSFFNVIANVINVVGASCKRRDIL 102
+N+ + + ++ ++N N++ + + +
Sbjct: 89 ANVVKSLANNIRINCSSHLLSNVLENSFEETPELNMPILACKNIVKYFKKANLQHRLR 146
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 102 | |||
| d2bw3a2 | 447 | Transposase Hermes, catalytic domain {House fly (M | 99.81 |
| >d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Ribonuclease H-like motif superfamily: Ribonuclease H-like family: Hermes transposase-like domain: Transposase Hermes, catalytic domain species: House fly (Musca domestica) [TaxId: 7370]
Probab=99.81 E-value=7.7e-20 Score=128.36 Aligned_cols=100 Identities=16% Similarity=0.234 Sum_probs=91.2
Q ss_pred CEEEEEEEcCCCeEEEEeeeeEEcC--CCChHHHHHHHHHHHHhcCCCccceeeeeecCCcccccchh------------
Q 047520 1 MVVALRYADKNGYVFERFIGFKHVT--CTTAISLKEALDQLFSKYGLSISRLRKQGYDGASNIQGEFN------------ 66 (102)
Q Consensus 1 l~i~vryv~~~~~i~e~fl~~~~~~--~~ta~~i~~~i~~~l~~~~L~~~~~~g~~~Dgas~m~g~~~------------ 66 (102)
|+|++||++ ++.++|++|+|.+++ ++||++|++.|.+++++|||+.++++|++||||+||.|...
T Consensus 30 l~v~~~yi~-~~~~~~~~L~~~~~~~~~~tg~~i~~~l~~~l~~~~i~~~~~~~~~~D~a~n~~~~~~~~~~i~C~aH~L 108 (447)
T d2bw3a2 30 LGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSIKFVTDRGANVVKSLANNIRINCSSHLL 108 (447)
T ss_dssp EEEEEEEEE-TTEEEEEEEEEEEECGGGCSHHHHHHHHHHHHHTTTCCCCTTCEEEECCCHHHHHHTTTSEEEECHHHHH
T ss_pred EEEEEEEEE-CCceeEEEEEEEEeCCCCCcHHHHHHHHHHHHHHcCCCcceEEEEEcCCcHhhHHHhhccccccchhhhh
Confidence 589999998 578999999999985 58999999999999999999999999999999999998866
Q ss_pred --HHHHHHhhchhHHHHHHHHHHHHHHhhccchhhhc
Q 047520 67 --AIVAVAKKHDQINSFFNVIANVINVVGASCKRRDI 101 (102)
Q Consensus 67 --~v~~~~~~~~~~~~~~~~~~~~~~~f~~S~kr~~~ 101 (102)
++.+.++..+.+.++++.++++++||+.|++|.+.
T Consensus 109 nl~v~d~~~~~~~~~~~~~~~~~i~~~~~~s~~~~~~ 145 (447)
T d2bw3a2 109 SNVLENSFEETPELNMPILACKNIVKYFKKANLQHRL 145 (447)
T ss_dssp HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHTTCGGGG
T ss_pred hhhhHHHHhhhHHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 56677788899999999999999999999988754
|