Citrus Sinensis ID: 043933


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------13
MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMIT
ccHHHHHHHHcHHHHHHHHHccccccccHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHccccccccccc
ccHHHHHHcccHHHHHHHHHHHHcccccHHHHccHHHHHHHHHHccccccHHHHHHHHHHHHcccHHHHHHHHHHccccccHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHccccEEEEc
mqlqelnnrfnesNTELLICLARLCPNDLFAAFDKEKLLRLVefypkdffAIDLIALEMQLDLYRIGELARTMVntkkdkvyPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRnrmgdqwlmit
mqlqelnnRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKnllrnrmgdqwlmit
MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMIT
**************TELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLM**
MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMI*
MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMIT
MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMIT
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQLDLYRIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMIT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query128
224126313 788 predicted protein [Populus trichocarpa] 0.976 0.158 0.546 2e-33
357490617 892 Zinc finger MYM-type protein [Medicago t 0.976 0.140 0.517 2e-32
8954063 811 Contains similarity to a transposable el 0.976 0.154 0.517 1e-31
15221982 769 TTF-type zinc finger protein with HAT di 0.976 0.162 0.517 1e-31
15230070 536 General transcription factor 2-related z 0.976 0.233 0.510 1e-28
242047832 641 hypothetical protein SORBIDRAFT_02g00613 0.968 0.193 0.510 1e-28
116308850 864 H1005F08.16 [Oryza sativa Indica Group] 0.968 0.143 0.536 1e-27
356536945 557 PREDICTED: uncharacterized protein LOC10 0.953 0.219 0.477 1e-27
357143537 809 PREDICTED: zinc finger MYM-type protein 0.960 0.152 0.452 2e-27
125605388 377 hypothetical protein OsJ_29039 [Oryza sa 0.968 0.328 0.463 3e-27
>gi|224126313|ref|XP_002319808.1| predicted protein [Populus trichocarpa] gi|222858184|gb|EEE95731.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  146 bits (369), Expect = 2e-33,   Method: Composition-based stats.
 Identities = 76/139 (54%), Positives = 98/139 (70%), Gaps = 14/139 (10%)

Query: 1   MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQ 60
           MQL EL++RF E++TELL+C+A L P+D F+AF+KEKLLRL  FYP +F  +DL+ L  Q
Sbjct: 615 MQLTELDDRFTETSTELLLCVACLNPSDSFSAFNKEKLLRLALFYPSEFSIVDLMILGDQ 674

Query: 61  LDLY--------------RIGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVF 106
           LD Y               I  LA  MV TKK+ ++PLVY L+ L+L+LPVA ATVERVF
Sbjct: 675 LDTYIIDLRGDDEFSDIEGIASLAEKMVKTKKNLIFPLVYMLIKLSLLLPVATATVERVF 734

Query: 107 SSMTFVKNLLRNRMGDQWL 125
           S+M  VK+ LRNRMGD+W+
Sbjct: 735 SAMHIVKSRLRNRMGDKWM 753




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|357490617|ref|XP_003615596.1| Zinc finger MYM-type protein [Medicago truncatula] gi|355516931|gb|AES98554.1| Zinc finger MYM-type protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|8954063|gb|AAF82236.1|AC069143_12 Contains similarity to a transposable element Tip100 protein for transposase from Ipomoea purpurea gb|4063769 and is a member of the transmembrane 4 family PF|00335 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|15221982|ref|NP_173360.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] gi|332191703|gb|AEE29824.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|15230070|ref|NP_189621.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] gi|332644086|gb|AEE77607.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|242047832|ref|XP_002461662.1| hypothetical protein SORBIDRAFT_02g006133 [Sorghum bicolor] gi|241925039|gb|EER98183.1| hypothetical protein SORBIDRAFT_02g006133 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|116308850|emb|CAH65987.1| H1005F08.16 [Oryza sativa Indica Group] Back     alignment and taxonomy information
>gi|356536945|ref|XP_003536992.1| PREDICTED: uncharacterized protein LOC100819269 [Glycine max] Back     alignment and taxonomy information
>gi|357143537|ref|XP_003572955.1| PREDICTED: zinc finger MYM-type protein 1-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|125605388|gb|EAZ44424.1| hypothetical protein OsJ_29039 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query128
TAIR|locus:2202165769 AT1G19260 [Arabidopsis thalian 0.976 0.162 0.517 4.6e-29
TAIR|locus:2829849536 AT3G29765 [Arabidopsis thalian 0.976 0.233 0.510 3.6e-28
TAIR|locus:4515102803592 AT2G06541 [Arabidopsis thalian 0.953 0.206 0.489 1.1e-24
TAIR|locus:2118430664 AT4G09660 [Arabidopsis thalian 0.851 0.164 0.504 2.2e-23
TAIR|locus:5019474775393 AT3G31402 "AT3G31402" [Arabido 0.945 0.307 0.404 5.8e-19
TAIR|locus:2033523496 AT1G41920 "AT1G41920" [Arabido 0.5 0.129 0.593 1.3e-13
TAIR|locus:2829839505 AT3G29763 "AT3G29763" [Arabido 0.320 0.081 0.609 3.3e-07
TAIR|locus:5019474771168 AT3G30235 "AT3G30235" [Arabido 0.304 0.232 0.564 7.4e-05
TAIR|locus:2202165 AT1G19260 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 332 (121.9 bits), Expect = 4.6e-29, P = 4.6e-29
 Identities = 72/139 (51%), Positives = 95/139 (68%)

Query:     1 MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQ 60
             MQLQE N+RF+E N+ELLIC++ L P D F  FDK  L+RL EFYP DF  ++  +L+ Q
Sbjct:   596 MQLQEFNDRFDEVNSELLICMSSLSPIDSFRQFDKSMLVRLTEFYPDDFSFVERRSLDHQ 655

Query:    61 LDLYR--------------IGELARTMVNTKKDKVYPLVYQLVTLALILPVAIATVERVF 106
             L++Y               +G+LAR MV T+K   +PLVY+L+ ++LILPVA ATVER F
Sbjct:   656 LEIYLDNVKNDERFTDLKCLGDLARVMVETRKHLSHPLVYRLLKVSLILPVATATVERCF 715

Query:   107 SSMTFVKNLLRNRMGDQWL 125
             S+M FVK  LRNR+GD +L
Sbjct:   716 SAMNFVKTTLRNRIGDMFL 734




GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0046983 "protein dimerization activity" evidence=IEA
GO:0009506 "plasmodesma" evidence=IDA
TAIR|locus:2829849 AT3G29765 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4515102803 AT2G06541 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118430 AT4G09660 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:5019474775 AT3G31402 "AT3G31402" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2033523 AT1G41920 "AT1G41920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2829839 AT3G29763 "AT3G29763" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:5019474771 AT3G30235 "AT3G30235" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query128
pfam0569982 pfam05699, Dimer_Tnp_hAT, hAT family dimerisation 8e-05
>gnl|CDD|218702 pfam05699, Dimer_Tnp_hAT, hAT family dimerisation domain Back     alignment and domain information
 Score = 38.4 bits (90), Expect = 8e-05
 Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 82  YPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQ 123
           YP + +L    L +PV+ A  ER FS  T  K LL +R   +
Sbjct: 31  YPNLSKLARDILSIPVSSAASERSFS--TLGKVLLESRNRLE 70


This dimerisation domain is found at the C terminus of the transposases of elements belonging to the Activator superfamily (hAT element superfamily). The isolated dimerisation domain forms extremely stable dimers in vitro. Length = 82

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 128
PF0569986 Dimer_Tnp_hAT: hAT family C-terminal dimerisation 99.3
KOG1121641 consensus Tam3-transposase (Ac family) [Replicatio 93.93
>PF05699 Dimer_Tnp_hAT: hAT family C-terminal dimerisation region; InterPro: IPR008906 This dimerisation domain is found at the C terminus of the transposases of elements belonging to the Activator superfamily (hAT element superfamily) Back     alignment and domain information
Probab=99.30  E-value=1.1e-12  Score=81.65  Aligned_cols=49  Identities=27%  Similarity=0.356  Sum_probs=40.1

Q ss_pred             cCCCchhHHHHHHHhhhcccccchhhhhhhhhhhhhhhhccCCCccccc
Q 043933           78 KDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLM  126 (128)
Q Consensus        78 ~~~~fpnl~~ll~i~ltlPvssa~~ERsFS~lk~iKt~lRstm~~~rL~  126 (128)
                      ....||+++++++.++++|+|+|.+||+||+++++++.-|++++.+.+.
T Consensus        29 ~~~~fP~L~~lA~~~Lsip~ss~~~ER~FS~~~~~~~~~r~~l~~~~~~   77 (86)
T PF05699_consen   29 NSSRFPNLAKLARKYLSIPASSASSERSFSAMGKILTRNRNRLSPENVE   77 (86)
T ss_dssp             TTTTSHHHHHHHHHHHTS-S-TTTTHHHHHHTHHHHH-TTT---HHHHH
T ss_pred             CchhchHHHHHHHHHHHhhccccccccccchhhcccccCccCCCHHHHH
Confidence            6778999999999999999999999999999999999999999987654



The isolated dimerisation domain forms extremely stable dimers in vitro [].; GO: 0046983 protein dimerization activity; PDB: 2BW3_A.

>KOG1121 consensus Tam3-transposase (Ac family) [Replication, recombination and repair] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query128
2bw3_A534 Transposase; DNA recombination, transposition; 2.0 98.52
>2bw3_A Transposase; DNA recombination, transposition; 2.0A {Musca domestica} SCOP: a.270.1.1 c.55.3.12 PDB: 2bw3_B Back     alignment and structure
Probab=98.52  E-value=3.3e-08  Score=78.13  Aligned_cols=50  Identities=32%  Similarity=0.385  Sum_probs=46.7

Q ss_pred             cCCCchhHHHHHHHhhhcccccchhhhhhhhhhhhhhhhccCCCcccccC
Q 043933           78 KDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMI  127 (128)
Q Consensus        78 ~~~~fpnl~~ll~i~ltlPvssa~~ERsFS~lk~iKt~lRstm~~~rL~~  127 (128)
                      ....||++.++++-+|++|+|++.|||.||..+++.+..||+++.++++.
T Consensus       472 ~~~~~p~L~~~a~~~l~~p~ss~~~er~fs~~~~~~~~~r~~l~~~~~~~  521 (534)
T 2bw3_A          472 NSKKYPKLSKLALSLLSIPASSAASERTFSLAGNIITEKRNRIGQQTVDS  521 (534)
T ss_dssp             TTTTSHHHHHHHHHHHTSCSCTTTTHHHHHHTHHHHHCTTTCCCHHHHHH
T ss_pred             CCccCChHHHHHHHHhcCCcccccccccccccccccCcccccCCHHHHHH
Confidence            35689999999999999999999999999999999999999999998753




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 128
d2bw3a2447 c.55.3.12 (A:163-609) Transposase Hermes, catalyti 5e-08
>d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} Length = 447 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Hermes transposase-like
domain: Transposase Hermes, catalytic domain
species: House fly (Musca domestica) [TaxId: 7370]
 Score = 47.7 bits (112), Expect = 5e-08
 Identities = 25/134 (18%), Positives = 45/134 (33%), Gaps = 9/134 (6%)

Query: 1   MQLQELNNRFNESNTELLICLARLCPNDLFAAFDKEKLLRLVEFYPKDFFAIDLIALEMQ 60
           ++L    + FNE +   L        N +      + +     F+P+             
Sbjct: 299 LELINRMSSFNELSATQLNQSDSNSHNSIDLTSHSKDISTTSFFFPQLTQNNSREPPVCP 358

Query: 61  LDLYRIGELARTMVNTKKD---------KVYPLVYQLVTLALILPVAIATVERVFSSMTF 111
            D +        +++             K YP + +L    L +P + A  ER FS    
Sbjct: 359 SDEFEFYRKEIVILSEDFKVMEWWNLNSKKYPKLSKLALSLLSIPASSAASERTFSLAGN 418

Query: 112 VKNLLRNRMGDQWL 125
           +    RNR+G Q +
Sbjct: 419 IITEKRNRIGQQTV 432


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query128
d2bw3a2447 Transposase Hermes, catalytic domain {House fly (M 99.42
>d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Hermes transposase-like
domain: Transposase Hermes, catalytic domain
species: House fly (Musca domestica) [TaxId: 7370]
Probab=99.42  E-value=2e-14  Score=107.52  Aligned_cols=50  Identities=32%  Similarity=0.385  Sum_probs=46.9

Q ss_pred             cCCCchhHHHHHHHhhhcccccchhhhhhhhhhhhhhhhccCCCcccccC
Q 043933           78 KDKVYPLVYQLVTLALILPVAIATVERVFSSMTFVKNLLRNRMGDQWLMI  127 (128)
Q Consensus        78 ~~~~fpnl~~ll~i~ltlPvssa~~ERsFS~lk~iKt~lRstm~~~rL~~  127 (128)
                      ....||+++++++.++++|+|+|+|||+||++++|||++||+|+++|++.
T Consensus       385 ~~~~yP~L~~lA~~~Lsip~tsa~~ER~FS~~~~i~t~~R~~L~~e~le~  434 (447)
T d2bw3a2         385 NSKKYPKLSKLALSLLSIPASSAASERTFSLAGNIITEKRNRIGQQTVDS  434 (447)
T ss_dssp             TTTTSHHHHHHHHHHHTSCSCTTTTHHHHHHTHHHHHCTTTCCCHHHHHH
T ss_pred             CCcccHHHHHHHHHHhCCCCcccHHHHHHhccccccCcccCCCCHHHHHH
Confidence            35679999999999999999999999999999999999999999999863