Citrus Sinensis ID: 041241
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 145 | ||||||
| 224126313 | 788 | predicted protein [Populus trichocarpa] | 0.979 | 0.180 | 0.587 | 1e-38 | |
| 242080863 | 776 | hypothetical protein SORBIDRAFT_07g00579 | 0.889 | 0.166 | 0.519 | 2e-32 | |
| 18854992 | 1283 | putative transposase [Oryza sativa Japon | 0.979 | 0.110 | 0.475 | 4e-32 | |
| 32488762 | 897 | OSJNBb0016D16.6 [Oryza sativa Japonica G | 0.882 | 0.142 | 0.5 | 7e-32 | |
| 224042031 | 897 | putative transposase [Oryza sativa Japon | 0.882 | 0.142 | 0.5 | 8e-32 | |
| 15221982 | 769 | TTF-type zinc finger protein with HAT di | 0.889 | 0.167 | 0.534 | 3e-31 | |
| 8954063 | 811 | Contains similarity to a transposable el | 0.889 | 0.159 | 0.534 | 3e-31 | |
| 116317825 | 793 | OSIGBa0126J24.6 [Oryza sativa Indica Gro | 0.889 | 0.162 | 0.496 | 3e-31 | |
| 15230069 | 505 | General transcription factor 2-related z | 0.889 | 0.255 | 0.527 | 5e-31 | |
| 15233987 | 664 | uncharacterized protein [Arabidopsis tha | 0.889 | 0.194 | 0.527 | 1e-30 |
| >gi|224126313|ref|XP_002319808.1| predicted protein [Populus trichocarpa] gi|222858184|gb|EEE95731.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 164 bits (415), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/143 (58%), Positives = 107/143 (74%), Gaps = 1/143 (0%)
Query: 4 EIQANERIFLSF-YFLVDHNEDINAVTFDNAPENLQMTSNEIKKDIVSCAAVETTNIIIK 62
E +N+ +L +FL +NE I VTF AP + ++TS +I+KDI AA E TN+IIK
Sbjct: 216 ECSSNQGNYLELLHFLSRNNEAIKRVTFSEAPRHNKLTSPDIQKDITQAAAEEITNVIIK 275
Query: 63 EMGDILFSILIDESCDIFTKEQMAVVLRYVDKNGYVVEHFIGIEHVTSTTSISLKEALDK 122
++GD LFSILIDES DI KEQMAVVLRYVD NG+++E F+GI+HV TT+ SLK A++
Sbjct: 276 DLGDSLFSILIDESRDISIKEQMAVVLRYVDNNGHIIERFLGIQHVRDTTASSLKAAIEA 335
Query: 123 LFSRHGLSMSRLHRQGYDEASNM 145
LFS+HGLS+SRL QGYD ASNM
Sbjct: 336 LFSKHGLSISRLRGQGYDGASNM 358
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|242080863|ref|XP_002445200.1| hypothetical protein SORBIDRAFT_07g005790 [Sorghum bicolor] gi|241941550|gb|EES14695.1| hypothetical protein SORBIDRAFT_07g005790 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
| >gi|18854992|gb|AAL79684.1|AC087599_3 putative transposase [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
| >gi|32488762|emb|CAE04315.1| OSJNBb0016D16.6 [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
| >gi|224042031|gb|ACN38703.1| putative transposase [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
| >gi|15221982|ref|NP_173360.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] gi|332191703|gb|AEE29824.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|8954063|gb|AAF82236.1|AC069143_12 Contains similarity to a transposable element Tip100 protein for transposase from Ipomoea purpurea gb|4063769 and is a member of the transmembrane 4 family PF|00335 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|116317825|emb|CAH65861.1| OSIGBa0126J24.6 [Oryza sativa Indica Group] | Back alignment and taxonomy information |
|---|
| >gi|15230069|ref|NP_189620.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] gi|332644085|gb|AEE77606.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|15233987|ref|NP_192704.1| uncharacterized protein [Arabidopsis thaliana] gi|4538896|emb|CAB39633.1| putative protein [Arabidopsis thaliana] gi|7267661|emb|CAB78089.1| putative protein [Arabidopsis thaliana] gi|332657378|gb|AEE82778.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 145 | ||||||
| TAIR|locus:2829839 | 505 | AT3G29763 "AT3G29763" [Arabido | 0.979 | 0.281 | 0.503 | 2.3e-31 | |
| TAIR|locus:2118430 | 664 | AT4G09660 [Arabidopsis thalian | 0.979 | 0.213 | 0.503 | 2e-30 | |
| TAIR|locus:2202165 | 769 | AT1G19260 [Arabidopsis thalian | 0.979 | 0.184 | 0.503 | 3e-30 | |
| TAIR|locus:4515102803 | 592 | AT2G06541 [Arabidopsis thalian | 0.958 | 0.234 | 0.464 | 1.1e-26 | |
| TAIR|locus:2829849 | 536 | AT3G29765 [Arabidopsis thalian | 0.737 | 0.199 | 0.560 | 7.7e-26 | |
| TAIR|locus:2033523 | 496 | AT1G41920 "AT1G41920" [Arabido | 0.944 | 0.276 | 0.402 | 1.4e-19 | |
| TAIR|locus:5019474775 | 393 | AT3G31402 "AT3G31402" [Arabido | 0.772 | 0.284 | 0.375 | 2.2e-15 | |
| TAIR|locus:2036700 | 206 | AT1G42710 "AT1G42710" [Arabido | 0.434 | 0.305 | 0.555 | 3.4e-13 | |
| TAIR|locus:2093847 | 522 | AT3G29450 "AT3G29450" [Arabido | 0.558 | 0.155 | 0.426 | 9.6e-11 | |
| UNIPROTKB|Q5SW02 | 1067 | ZMYM1 "Zinc finger MYM-type pr | 0.724 | 0.098 | 0.284 | 2.2e-08 |
| TAIR|locus:2829839 AT3G29763 "AT3G29763" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 345 (126.5 bits), Expect = 2.3e-31, P = 2.3e-31
Identities = 72/143 (50%), Positives = 95/143 (66%)
Query: 4 EIQANERIFLSFY-FLVDHNEDINAVTFDNAPENLQMTSNEIKKDIVSCAAVETTNIIIK 62
E N+ FL ++ NE + V NAP+N QMTS I+KDIV C + E T II+
Sbjct: 66 EESTNKGNFLELLKYIAGQNEVVKKVVLKNAPKNNQMTSPPIQKDIVHCFSEEVTRSIIE 125
Query: 63 EMGDILFSILIDESCDIFTKEQMAVVLRYVDKNGYVVEHFIGIEHVTSTTSISLKEALDK 122
EM + +F +L+DES D KEQMAVV R+VDK G V E FIG+ HV T+S+SLK A+D
Sbjct: 126 EMDNDVFGLLVDESADASDKEQMAVVFRFVDKYGVVKERFIGVIHVQETSSLSLKSAIDS 185
Query: 123 LFSRHGLSMSRLHRQGYDEASNM 145
LF+++GLS+ +L +GYD ASNM
Sbjct: 186 LFAKYGLSLKKLRGKGYDGASNM 208
|
|
| TAIR|locus:2118430 AT4G09660 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2202165 AT1G19260 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:4515102803 AT2G06541 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2829849 AT3G29765 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2033523 AT1G41920 "AT1G41920" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:5019474775 AT3G31402 "AT3G31402" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2036700 AT1G42710 "AT1G42710" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2093847 AT3G29450 "AT3G29450" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5SW02 ZMYM1 "Zinc finger MYM-type protein 1" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 145 | |||
| pfam14291 | 179 | pfam14291, DUF4371, Domain of unknown function (DU | 4e-32 |
| >gnl|CDD|222655 pfam14291, DUF4371, Domain of unknown function (DUF4371) | Back alignment and domain information |
|---|
Score = 111 bits (281), Expect = 4e-32
Identities = 42/68 (61%), Positives = 53/68 (77%)
Query: 78 DIFTKEQMAVVLRYVDKNGYVVEHFIGIEHVTSTTSISLKEALDKLFSRHGLSMSRLHRQ 137
D+ KEQMAVVLRYVDK G V E F+G+ HV TTS++LK A+D L ++HGLS+S++ Q
Sbjct: 106 DVSQKEQMAVVLRYVDKKGQVKERFLGVVHVEETTSLTLKSAIDSLLAKHGLSLSKVRGQ 165
Query: 138 GYDEASNM 145
GYD ASNM
Sbjct: 166 GYDGASNM 173
|
Length = 179 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 145 | |||
| PF14291 | 235 | DUF4371: Domain of unknown function (DUF4371) | 100.0 | |
| PF04937 | 153 | DUF659: Protein of unknown function (DUF 659); Int | 98.03 | |
| KOG1121 | 641 | consensus Tam3-transposase (Ac family) [Replicatio | 96.8 |
| >PF14291 DUF4371: Domain of unknown function (DUF4371) | Back alignment and domain information |
|---|
Probab=100.00 E-value=7.9e-44 Score=272.86 Aligned_cols=140 Identities=33% Similarity=0.506 Sum_probs=134.7
Q ss_pred ccCCChHHHHHHhHhhcCHHHHHHHhhcCCCcccccCcchHHHHHHHHHHHHHHHHHhhhccceeEEEecccccccccce
Q 041241 5 IQANERIFLSFYFLVDHNEDINAVTFDNAPENLQMTSNEIKKDIVSCAAVETTNIIIKEMGDILFSILIDESCDIFTKEQ 84 (145)
Q Consensus 5 ~~~~gnf~~ll~l~~~~~~~l~~~~l~~~~~~~~y~S~~~~~~ii~~ia~~i~~~i~~~l~~~~fSi~~DettDis~~~q 84 (145)
-.|+|||++|+++++++||.+++|+.+..+.+..|+|+++|+++ +++|+.+++.|++++++++|||++|||||+++++|
T Consensus 90 s~n~GNFl~ll~l~~~~d~~l~~~~~~~~~~~~~~~s~~iq~~i-~~~a~~v~~~I~~~v~~~~FSii~DettDis~~eQ 168 (235)
T PF14291_consen 90 SLNNGNFLELLELLAKYDPELKKHLSKNAPKNAKYSSKTIQNEI-EILADHVRQSIVEEVKSKYFSIIVDETTDISNKEQ 168 (235)
T ss_pred ccccccHHHHHHHHHhhcccchhhhhcccccceeccHHHHHHHH-HHHHHHHHHHHHhhccccceeeeeeccccccccch
Confidence 35789999999999999999999966778888999999999998 99999999999999988999999999999999999
Q ss_pred eEEEEEEeecCCceeeeeecceecccCchhHHHHHHHHHHHHcCCCCCCeeEEeecCCCCC
Q 041241 85 MAVVLRYVDKNGYVVEHFIGIEHVTSTTSISLKEALDKLFSRHGLSMSRLHRQGYDEASNM 145 (145)
Q Consensus 85 l~i~vryv~~~~~i~e~fl~~~~~~~~tae~i~~~i~~~l~~~~l~~~~~~~~~~Dgas~M 145 (145)
|+|+||||+.++.|+|+||+|.+++++||++|+++|++.|.++|||+++|+||||||||+|
T Consensus 169 l~i~vRyv~~~~~i~E~Fl~f~~~~~~ta~~l~~~i~~~L~~~~l~~~~~~gq~yDgas~M 229 (235)
T PF14291_consen 169 LSICVRYVDKDGKIKERFLGFVELEDTTAESLFNAIKDVLEKLGLDLSNCRGQCYDGASNM 229 (235)
T ss_pred hhheeeeeccCcceeeeeeeeeccCCccHHHHHHHHHHHHHHcCCCHHHcCcccccChHhh
Confidence 9999999998889999999999999999999999999999999999999999999999998
|
|
| >PF04937 DUF659: Protein of unknown function (DUF 659); InterPro: IPR007021 These are transposase-like proteins with no known function | Back alignment and domain information |
|---|
| >KOG1121 consensus Tam3-transposase (Ac family) [Replication, recombination and repair] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 145 | |||
| 2bw3_A | 534 | Transposase; DNA recombination, transposition; 2.0 | 99.68 |
| >2bw3_A Transposase; DNA recombination, transposition; 2.0A {Musca domestica} SCOP: a.270.1.1 c.55.3.12 PDB: 2bw3_B | Back alignment and structure |
|---|
Probab=99.68 E-value=5.8e-17 Score=134.70 Aligned_cols=138 Identities=16% Similarity=0.220 Sum_probs=111.1
Q ss_pred ceeccCCChHHHHHHhHhhcCHHHHHHHhhcCCCcccccCc-chHHHHHHHHHHHHHHHHHhhhcc----ceeEEEeccc
Q 041241 2 SLEIQANERIFLSFYFLVDHNEDINAVTFDNAPENLQMTSN-EIKKDIVSCAAVETTNIIIKEMGD----ILFSILIDES 76 (145)
Q Consensus 2 ~~~~~~~gnf~~ll~l~~~~~~~l~~~~l~~~~~~~~y~S~-~~~~~ii~~ia~~i~~~i~~~l~~----~~fSi~~Det 76 (145)
.+++.++.+|+++++.+...+|.+.++ +. .+....|. ++... +..++..+++.++++++. .+|||++|+|
T Consensus 33 Pf~~ve~~~F~~~l~~l~~~~p~~~~~-~~---~~~~~ps~~~~~~~-i~~~~~~~~~~l~~~l~~~~~~~~~sl~~D~w 107 (534)
T 2bw3_A 33 PFSAVSGSGFIDMIKFFIKVKAEYGEH-VN---VEELLPSPITLSRK-VTSDAKEKKALIGREIKSAVEKDGASATIDLW 107 (534)
T ss_dssp CGGGGGCHHHHHHHHHHHHHHHHHCSC-BC---HHHHSCCHHHHHHH-HHHHHHHHHHHHHHHHHHHHHTTCCEEEEEEE
T ss_pred cchhhCCHHHHHHHHHHHhhhhhhccc-cc---hhhcCCCcccHHHH-HHHHHHHHHHHHHHHHHhhhcCCceEEEEecc
Confidence 467788889999999998888876443 11 00112223 34444 455688888999999886 5999999999
Q ss_pred ccc-cccceeEEEEEEeecCCceeeeeecceecc--cCchhHHHHHHHHHHHHcCCCCCCeeEEeecCCCCC
Q 041241 77 CDI-FTKEQMAVVLRYVDKNGYVVEHFIGIEHVT--STTSISLKEALDKLFSRHGLSMSRLHRQGYDEASNM 145 (145)
Q Consensus 77 tDi-s~~~ql~i~vryv~~~~~i~e~fl~~~~~~--~~tae~i~~~i~~~l~~~~l~~~~~~~~~~Dgas~M 145 (145)
||. .+.++++|++||++ ++.+++++|++.+++ ++||+.|++.|.++|.++|++..+++++.+|||++|
T Consensus 108 t~~~~~~~~l~i~v~~i~-~~~~~~~~L~~~~~~~~~~t~~~i~~~i~~~l~~~~l~~~~~v~~vtDnasn~ 178 (534)
T 2bw3_A 108 TDNYIKRNFLGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSIKFVTDRGANV 178 (534)
T ss_dssp EETTTTEEEEEEEEEEEE-TTEEEEEEEEEEEECGGGCSHHHHHHHHHHHHHTTTCCCCTTCEEEECCCHHH
T ss_pred cCCCCCcceeEEEEEEee-CCceEEEEEeEeecCCCCCCHHHHHHHHHHHHHHcCCCcccceeEECCCHHHH
Confidence 998 47899999999994 678999999999998 599999999999999999999767888888999875
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 145 | ||||
| d2bw3a2 | 447 | c.55.3.12 (A:163-609) Transposase Hermes, catalyti | 4e-06 |
| >d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} Length = 447 | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Ribonuclease H-like motif superfamily: Ribonuclease H-like family: Hermes transposase-like domain: Transposase Hermes, catalytic domain species: House fly (Musca domestica) [TaxId: 7370]
Score = 42.7 bits (99), Expect = 4e-06
Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 4/80 (5%)
Query: 69 FSILIDE-SCDIFTKEQMAVVLRYVDKNGYVVEHFIGIEHVT--STTSISLKEALDKLFS 125
S ID + + + + V L Y + N + + +G++ + +T+ ++ + L +FS
Sbjct: 13 ASATIDLWTDNYIKRNFLGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFS 71
Query: 126 RHGLSMSRLHRQGYDEASNM 145
+ + + D +N+
Sbjct: 72 QFNVEDLSSIKFVTDRGANV 91
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 145 | |||
| d2bw3a2 | 447 | Transposase Hermes, catalytic domain {House fly (M | 99.6 |
| >d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Ribonuclease H-like motif superfamily: Ribonuclease H-like family: Hermes transposase-like domain: Transposase Hermes, catalytic domain species: House fly (Musca domestica) [TaxId: 7370]
Probab=99.60 E-value=7.1e-16 Score=120.81 Aligned_cols=86 Identities=19% Similarity=0.227 Sum_probs=75.9
Q ss_pred HHHhhhccceeEEEeccccc-ccccceeEEEEEEeecCCceeeeeecceecc--cCchhHHHHHHHHHHHHcCCCCCCee
Q 041241 59 IIIKEMGDILFSILIDESCD-IFTKEQMAVVLRYVDKNGYVVEHFIGIEHVT--STTSISLKEALDKLFSRHGLSMSRLH 135 (145)
Q Consensus 59 ~i~~~l~~~~fSi~~DettD-is~~~ql~i~vryv~~~~~i~e~fl~~~~~~--~~tae~i~~~i~~~l~~~~l~~~~~~ 135 (145)
+|++.+.+..+|+.+|+||+ .++..||+|++||++ ++.+++++|++.++. .+||++|++.|.++|+++||++.+++
T Consensus 3 ~~k~~~~~~~~s~~~D~Wts~~~~~~~l~v~~~yi~-~~~~~~~~L~~~~~~~~~~tg~~i~~~l~~~l~~~~i~~~~~~ 81 (447)
T d2bw3a2 3 EIKSAVEKDGASATIDLWTDNYIKRNFLGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSI 81 (447)
T ss_dssp HHHHHHHTTCCEEEEEEEEETTTTEEEEEEEEEEEE-TTEEEEEEEEEEEECGGGCSHHHHHHHHHHHHHTTTCCCCTTC
T ss_pred hHHHHHHcCCEEEEEEeecCCCCCeEEEEEEEEEEE-CCceeEEEEEEEEeCCCCCcHHHHHHHHHHHHHHcCCCcceEE
Confidence 44555556789999999776 667899999999998 578999999999885 58999999999999999999999999
Q ss_pred EEeecCCCCC
Q 041241 136 RQGYDEASNM 145 (145)
Q Consensus 136 ~~~~Dgas~M 145 (145)
|+++|||++|
T Consensus 82 ~~~~D~a~n~ 91 (447)
T d2bw3a2 82 KFVTDRGANV 91 (447)
T ss_dssp EEEECCCHHH
T ss_pred EEEcCCcHhh
Confidence 9999999876
|