Citrus Sinensis ID: 033506


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------12
MGEELSGSSAAITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAEKVKRPKPIEALEAESKNEKEKPPKKSKASKRMRKMLELERRLQETEFDRAFRREFWPDNV
ccccccccccccccccHHHHHHHHcccccccccccccccccccEEEEEccccccccccccccccHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccc
ccccHcccccccccccHHHHHHHHccccccccccccccccEEEEEEEEEcccccccccccccccccccccHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccc
mgeelsgssaaitssnIGFQLLKkhgwkegtglgiaeqgrlepirthvkknkrgigaekvkrpkpieALEAEsknekekppkkskASKRMRKMLELERRLQETEFDRAFRREFWPDNV
mgeelsgssaaitssNIGFQLLKKHGWKEGTGlgiaeqgrlepirthvkknkrgigaekvkrpkpiealeaesknekekppkkskaskrmRKMLElerrlqetefdrafrrefwpdnv
MGEELSGSSAAITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAEKVKRPKPIEALeaesknekekppkkskaskrmrkmlelerrlQETEFDRAFRREFWPDNV
***************NIGFQLLKKHGWKEGTGLGIAE*********************************************************************************
*******************QLLKKHGWKEGTG**I**QGRLEPIRTHVKKNKRGI*************************************************FDRAFRREFWPDNV
**********AITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAEKVKRPKPIEALEA*******************RKMLELERRLQETEFDRAFRREFWPDNV
*************SSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNK**I*****************************KASKRMRKMLELERRLQETEFDRAFRREFWPDNV
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGEELSGSSAAITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAEKVKRPKPIEALEAESKNEKEKPPKKSKASKRMRKMLELERRLQETEFDRAFRREFWPDNV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query118 2.2.26 [Sep-21-2011]
Q6DGZ0262 Coiled-coil domain-contai yes no 0.762 0.343 0.413 2e-08
Q6C233812 Protein SQS1 OS=Yarrowia yes no 0.474 0.068 0.517 5e-07
A2A6A1 1505 G patch domain-containing yes no 0.381 0.029 0.555 7e-07
Q9UKJ3 1502 G patch domain-containing yes no 0.381 0.029 0.555 8e-07
Q09655371 G patch domain and ankyri no no 0.449 0.142 0.471 3e-06
Q2KI19260 Coiled-coil domain-contai no no 0.872 0.396 0.333 7e-06
Q9V7A7336 G patch domain and ankyri no no 0.406 0.142 0.416 1e-05
Q6DF57261 Coiled-coil domain-contai no no 0.516 0.233 0.426 1e-05
Q94C11443 SURP and G-patch domain-c no no 0.406 0.108 0.469 1e-05
Q8N954259 Coiled-coil domain-contai no no 0.872 0.397 0.342 1e-05
>sp|Q6DGZ0|CCD75_DANRE Coiled-coil domain-containing protein 75 OS=Danio rerio GN=ccdc75 PE=2 SV=1 Back     alignment and function desciption
 Score = 57.8 bits (138), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 55/92 (59%), Gaps = 2/92 (2%)

Query: 10  AAITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAEKVKRPKPIEAL 69
           ++I S N GF LL+K G+K G GLG    GR+EP+  ++K ++ GIG E+VK+ K  E L
Sbjct: 66  SSIGSQNKGFALLQKMGYKAGQGLGKEGAGRVEPVPLNIKTDRGGIGMEEVKKRKADEEL 125

Query: 70  EAESK--NEKEKPPKKSKASKRMRKMLELERR 99
           +   +  + K+   KKS    R+RK  E E R
Sbjct: 126 QNYRRKVHMKQHLEKKSIEDFRVRKRTEREER 157





Danio rerio (taxid: 7955)
>sp|Q6C233|SQS1_YARLI Protein SQS1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=SQS1 PE=3 SV=1 Back     alignment and function description
>sp|A2A6A1|GPTC8_MOUSE G patch domain-containing protein 8 OS=Mus musculus GN=Gpatch8 PE=2 SV=1 Back     alignment and function description
>sp|Q9UKJ3|GPTC8_HUMAN G patch domain-containing protein 8 OS=Homo sapiens GN=GPATCH8 PE=1 SV=2 Back     alignment and function description
>sp|Q09655|GPAN1_CAEEL G patch domain and ankyrin repeat-containing protein 1 homolog OS=Caenorhabditis elegans GN=ZK1320.7 PE=4 SV=3 Back     alignment and function description
>sp|Q2KI19|CCD75_BOVIN Coiled-coil domain-containing protein 75 OS=Bos taurus GN=CCDC75 PE=2 SV=2 Back     alignment and function description
>sp|Q9V7A7|GPAN1_DROME G patch domain and ankyrin repeat-containing protein 1 homolog OS=Drosophila melanogaster GN=CG8152 PE=2 SV=1 Back     alignment and function description
>sp|Q6DF57|CCD75_XENTR Coiled-coil domain-containing protein 75 OS=Xenopus tropicalis GN=ccdc75 PE=2 SV=1 Back     alignment and function description
>sp|Q94C11|SUGP1_ARATH SURP and G-patch domain-containing protein 1-like protein OS=Arabidopsis thaliana GN=At3g52120 PE=2 SV=1 Back     alignment and function description
>sp|Q8N954|CCD75_HUMAN Coiled-coil domain-containing protein 75 OS=Homo sapiens GN=CCDC75 PE=1 SV=3 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query118
356536473123 PREDICTED: coiled-coil domain-containing 0.983 0.943 0.622 2e-32
255544157118 nucleic acid binding protein, putative [ 0.966 0.966 0.601 2e-32
224100471133 predicted protein [Populus trichocarpa] 1.0 0.887 0.563 4e-32
388516687122 unknown [Medicago truncatula] 1.0 0.967 0.606 2e-31
449458728126 PREDICTED: G patch domain and ankyrin re 0.957 0.896 0.615 1e-30
225463787123 PREDICTED: G patch domain and ankyrin re 1.0 0.959 0.601 2e-29
357146779113 PREDICTED: coiled-coil domain-containing 0.864 0.902 0.605 4e-27
326527615113 predicted protein [Hordeum vulgare subsp 0.915 0.955 0.637 5e-27
294461428152 unknown [Picea sitchensis] 0.940 0.730 0.598 7e-27
18415848141 D111/G-patch domain-containing protein [ 0.889 0.744 0.623 3e-26
>gi|356536473|ref|XP_003536762.1| PREDICTED: coiled-coil domain-containing protein 75-like [Glycine max] Back     alignment and taxonomy information
 Score =  142 bits (359), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/122 (62%), Positives = 97/122 (79%), Gaps = 6/122 (4%)

Query: 3   EELSGSSAAITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAE---- 58
           EE+SG + AI SSNIGFQLLKKHGWKEGTGLG++EQGRLEP+ THVK NKRG+GA+    
Sbjct: 2   EEMSGLATAINSSNIGFQLLKKHGWKEGTGLGVSEQGRLEPVETHVKNNKRGLGADKAKK 61

Query: 59  KVKRPKPIEALEAESKNEKEK-PPKKSKA-SKRMRKMLELERRLQETEFDRAFRREFWPD 116
           KV + KP ++  ++  N+++  P KKSK  SKRMRKM E E++++E EF+RAF REFWP+
Sbjct: 62  KVVKAKPDQSDSSKGNNQQDHLPQKKSKTLSKRMRKMQEFEKKMREKEFERAFFREFWPE 121

Query: 117 NV 118
           NV
Sbjct: 122 NV 123




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255544157|ref|XP_002513141.1| nucleic acid binding protein, putative [Ricinus communis] gi|223548152|gb|EEF49644.1| nucleic acid binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224100471|ref|XP_002311890.1| predicted protein [Populus trichocarpa] gi|222851710|gb|EEE89257.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388516687|gb|AFK46405.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|449458728|ref|XP_004147099.1| PREDICTED: G patch domain and ankyrin repeat-containing protein 1 homolog [Cucumis sativus] Back     alignment and taxonomy information
>gi|225463787|ref|XP_002269816.1| PREDICTED: G patch domain and ankyrin repeats-containing protein 1 [Vitis vinifera] gi|297742720|emb|CBI35354.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357146779|ref|XP_003574108.1| PREDICTED: coiled-coil domain-containing protein 75-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|326527615|dbj|BAK08082.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|294461428|gb|ADE76275.1| unknown [Picea sitchensis] Back     alignment and taxonomy information
>gi|18415848|ref|NP_568199.1| D111/G-patch domain-containing protein [Arabidopsis thaliana] gi|26451562|dbj|BAC42878.1| unknown protein [Arabidopsis thaliana] gi|332003934|gb|AED91317.1| D111/G-patch domain-containing protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query118
TAIR|locus:2100549180 AT3G52350 [Arabidopsis thalian 0.542 0.355 0.656 2.1e-25
TAIR|locus:505006587141 AT5G08535 [Arabidopsis thalian 0.533 0.446 0.686 9.1e-25
WB|WBGene00013224511 Y55D9A.2 [Caenorhabditis elega 0.440 0.101 0.5 6.8e-10
ZFIN|ZDB-GENE-040718-89262 gpatch11 "G patch domain conta 0.508 0.229 0.466 1.9e-08
UNIPROTKB|F1RJ22 312 PINX1 "Uncharacterized protein 0.364 0.137 0.465 8.4e-08
TAIR|locus:2146839 301 AT5G26610 [Arabidopsis thalian 0.466 0.182 0.446 9.9e-08
UNIPROTKB|B0UZ51356 GPANK1 "G patch domain and ank 0.423 0.140 0.4 3.2e-07
UNIPROTKB|E1C8L8255 CCDC75 "Uncharacterized protei 0.516 0.239 0.409 5e-07
UNIPROTKB|O95872356 GPANK1 "G patch domain and ank 0.423 0.140 0.4 5.3e-07
WB|WBGene00014256371 ZK1320.7 [Caenorhabditis elega 0.466 0.148 0.482 5.4e-07
TAIR|locus:2100549 AT3G52350 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 215 (80.7 bits), Expect = 2.1e-25, Sum P(2) = 2.1e-25
 Identities = 42/64 (65%), Positives = 49/64 (76%)

Query:     4 ELSGSSAAITSSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAEKVKRP 63
             E S  S AI+SSNIGFQLLKKHGWKEGTGLGI EQG L P++   K NK+G+GAEK  + 
Sbjct:    69 ESSNPSTAISSSNIGFQLLKKHGWKEGTGLGITEQGILVPLQAEPKHNKQGLGAEKPAKR 128

Query:    64 KPIE 67
             KP +
Sbjct:   129 KPAQ 132


GO:0003676 "nucleic acid binding" evidence=IEA;ISS
GO:0005622 "intracellular" evidence=ISS
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:505006587 AT5G08535 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
WB|WBGene00013224 Y55D9A.2 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040718-89 gpatch11 "G patch domain containing 11" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|F1RJ22 PINX1 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
TAIR|locus:2146839 AT5G26610 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|B0UZ51 GPANK1 "G patch domain and ankyrin repeat-containing protein 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|E1C8L8 CCDC75 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|O95872 GPANK1 "G patch domain and ankyrin repeat-containing protein 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
WB|WBGene00014256 ZK1320.7 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00080018
hypothetical protein (133 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query118
pfam0158545 pfam01585, G-patch, G-patch domain 7e-15
smart0044347 smart00443, G_patch, glycine rich nucleic binding 7e-14
>gnl|CDD|144978 pfam01585, G-patch, G-patch domain Back     alignment and domain information
 Score = 62.9 bits (154), Expect = 7e-15
 Identities = 25/44 (56%), Positives = 33/44 (75%)

Query: 15 SNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAE 58
          SNIGF+LL+K GWK G GLG  EQG  EPI   ++ +++G+GAE
Sbjct: 2  SNIGFKLLQKMGWKPGQGLGKNEQGITEPIEAKIRPDRKGLGAE 45


This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines. Length = 45

>gnl|CDD|197727 smart00443, G_patch, glycine rich nucleic binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 118
PF0158545 G-patch: G-patch domain; InterPro: IPR000467 The D 99.62
smart0044347 G_patch glycine rich nucleic binding domain. A pre 99.52
KOG2384223 consensus Major histocompatibility complex protein 99.52
KOG2809 326 consensus Telomerase elongation inhibitor/RNA matu 99.27
KOG0965988 consensus Predicted RNA-binding protein, contains 98.94
PF1265677 G-patch_2: DExH-box splicing factor binding site 98.92
KOG2184 767 consensus Tuftelin-interacting protein TIP39, cont 98.83
KOG2185486 consensus Predicted RNA-processing protein, contai 98.36
KOG3673 845 consensus FtsJ-like RNA methyltransferase [RNA pro 98.14
KOG1996378 consensus mRNA splicing factor [RNA processing and 97.97
KOG0154573 consensus RNA-binding protein RBM5 and related pro 97.95
KOG1994268 consensus Predicted RNA binding protein, contains 97.85
KOG4315 455 consensus G-patch nucleic acid binding protein [Ge 97.79
KOG4368757 consensus Predicted RNA binding protein, contains 97.61
KOG2138 883 consensus Predicted RNA binding protein, contains 96.37
KOG1994 268 consensus Predicted RNA binding protein, contains 94.24
>PF01585 G-patch: G-patch domain; InterPro: IPR000467 The D111/G-patch domain [] is a short conserved region of about 40 amino acids which occurs in a number of putative RNA-binding proteins, including tumor suppressor and DNA-damage-repair proteins, suggesting that this domain may have an RNA binding function Back     alignment and domain information
Probab=99.62  E-value=5.9e-16  Score=93.51  Aligned_cols=45  Identities=56%  Similarity=1.006  Sum_probs=43.1

Q ss_pred             CCcHHHHHHHhcCCCCCCCCCCCCCCccCceeeeeeCCCcccccc
Q 033506           14 SSNIGFQLLKKHGWKEGTGLGIAEQGRLEPIRTHVKKNKRGIGAE   58 (118)
Q Consensus        14 ~s~~G~kmL~kmGW~~G~GLG~~~qGi~~PI~~~~k~d~~GLG~~   58 (118)
                      ++++|++||.+|||++|+|||++.+|+++||++..+.++.|||+.
T Consensus         1 t~~~g~~lm~kmGw~~G~GLGk~~~G~~~pi~~~~~~~~~GlG~~   45 (45)
T PF01585_consen    1 TSSIGFKLMKKMGWKPGQGLGKNGQGIAEPIEVKKKKDRKGLGAE   45 (45)
T ss_pred             CCcHHHHHHHHCCCCCCcCCCcCCccCCcceEEeeEcCCccccCC
Confidence            478999999999999999999999999999999999999999974



This domain has seven highly conserved glycines. A multiple alignment of a small subset of D111/G-patch domains is shown in Fig. 2b of [].; GO: 0003676 nucleic acid binding, 0005622 intracellular

>smart00443 G_patch glycine rich nucleic binding domain Back     alignment and domain information
>KOG2384 consensus Major histocompatibility complex protein BAT4, contains G-patch and ankyrin domains [General function prediction only] Back     alignment and domain information
>KOG2809 consensus Telomerase elongation inhibitor/RNA maturation protein PINX1 [RNA processing and modification; Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>KOG0965 consensus Predicted RNA-binding protein, contains SWAP and G-patch domains [General function prediction only] Back     alignment and domain information
>PF12656 G-patch_2: DExH-box splicing factor binding site Back     alignment and domain information
>KOG2184 consensus Tuftelin-interacting protein TIP39, contains G-patch domain [RNA processing and modification] Back     alignment and domain information
>KOG2185 consensus Predicted RNA-processing protein, contains G-patch domain [RNA processing and modification] Back     alignment and domain information
>KOG3673 consensus FtsJ-like RNA methyltransferase [RNA processing and modification] Back     alignment and domain information
>KOG1996 consensus mRNA splicing factor [RNA processing and modification] Back     alignment and domain information
>KOG0154 consensus RNA-binding protein RBM5 and related proteins, contain G-patch and RRM domains [General function prediction only] Back     alignment and domain information
>KOG1994 consensus Predicted RNA binding protein, contains G-patch and Zn-finger domains [RNA processing and modification] Back     alignment and domain information
>KOG4315 consensus G-patch nucleic acid binding protein [General function prediction only] Back     alignment and domain information
>KOG4368 consensus Predicted RNA binding protein, contains SWAP, RPR and G-patch domains [General function prediction only] Back     alignment and domain information
>KOG2138 consensus Predicted RNA binding protein, contains G-patch domain [RNA processing and modification] Back     alignment and domain information
>KOG1994 consensus Predicted RNA binding protein, contains G-patch and Zn-finger domains [RNA processing and modification] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00