Citrus Sinensis ID: 020999


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------32
MEATATREAVTSATTGVTVVGSDAPSEFHIAPRSENPTLTSAPPPPATQPPAPAPPPALPLKKKRGRPRKYGPDGTVTMALSPKPISSAAPSPPVIDFSAEKPRKVKPASSFSKSKYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKHKKQKNEPISIATPTAAIPISSADPKGSLTTSTFRGDGWSSLPSDSRNKPTDINASLPVG
cccccccccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccEEEEEEEcccccHHHHHHHHHHcccEEEEEEEEcccEEEEEEcccccccccEEEEEEEEEEEEEEEEcccccccccccccccEEEEccccccEEEEEcccEEEEcccEEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
cccccccEEEEcccccEEEEcccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHcccccccccccccccEEEEEcccHHHHHHHHHHHHccccEEEEEEcccEEEEEEEEcccccccEEEEEcEEEEEEEccccccccccccccccccEEEEEEcccccEEEccEHHHEEccccEEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
MEATATREAvtsattgvtvvgsdapsefhiaprsenptltsappppatqppapapppalplkkkrgrprkygpdgtvtmalspkpissaapsppvidfsaekprkvkpassfskskyevENIGEWVAcsvganftphiitvntgedvTMKIISFSQQGPRAICILSangvissvtlrqpdssggtltyegRFEILSlsgsfmpsdsggtrsrsggmsvslaspdgrvvgGGVAGLLVAASPVQVVVGSFlagnqheqkhkkqknepisiatptaaipissadpkgslttstfrgdgwsslpsdsrnkptdinaslpvg
meatatreavtsattgvtvvgsdAPSEFHIAPRSENPTLTSAPPPPATqppapapppalplkkkrgrprkYGPDGTVTMALSPKPISSAAPSPPVIDfsaekprkvkpassfskskyeVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISsvtlrqpdssggtLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEqkhkkqknepisiatptaaipissadpkGSLTTSTFrgdgwsslpsdsrnkptdinaslpvg
MEATATREavtsattgvtvvgsDAPSEFHIAPRSENptltsappppatqppapapppalplkkkrgrprkygpDGTVTMalspkpissaapsppVIDFSAEKPRKVKPASSFSKSKYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPsdsggtrsrsggmsvsLASPDGRvvgggvagllvaaSPVQVVVGSFLAGnqheqkhkkqknepisiatptaaipissaDPKGSLTTSTFRGDGWSSLPSDSRNKPTDINASLPVG
********************************************************************************************************************YEVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLR*******TLTYEGRFEILS*****************************RVVGGGVAGLLVAASPVQVVVGSFLA*******************************************************************
*********************************************************************************************************************************VGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSD*******SGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFL********************************************************************
*************TTGVTVVGSDAPSEFHIAPRSENPT****************PPPALPLKKKRGRPRKYGPDGTVTMALSPKPISSAAPSPPVIDFSAE**************KYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSF****************VSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGN***********EPISIATPTAAIPISSADPKGSLTTSTFRGDGWSSLPSDSRNKPTDINASLPVG
******REAVTSATTGVTVVGSDAPSEF***********************************************TVTM*************************************Y*VENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAG******************************************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEATATREAVTSATTGVTVVGSDAPSEFHIAPRSENPTLTSAPPPPATQPPAPAPPPALPLKKKRGRPRKYGPDGTVTMALSPKPISSAAPSPPVIDFSAEKPRKVKPASSFSKSKYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKHKKQKNEPISIATPTAAIPISSADPKGSLTTSTFRGDGWSSLPSDSRNKPTDINASLPVG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query318 2.2.26 [Sep-21-2011]
Q9S7C9311 Putative DNA-binding prot no no 0.345 0.353 0.368 2e-06
Q2JVA4621 Bifunctional protein GlmU yes no 0.298 0.152 0.345 0.0006
>sp|Q9S7C9|ESCA_ARATH Putative DNA-binding protein ESCAROLA OS=Arabidopsis thaliana GN=ESC PE=2 SV=1 Back     alignment and function desciption
 Score = 53.5 bits (127), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 72/122 (59%), Gaps = 12/122 (9%)

Query: 137 HIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQP---------DSSGGTLT 187
           H++ V+ G D+   + +++++  R + +L  NG +S+VTLRQP            GG +T
Sbjct: 116 HVLEVSPGADIVESVSTYARRRGRGVSVLGGNGTVSNVTLRQPVTPGNGGGVSGGGGVVT 175

Query: 188 YEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVG 247
             GRFEILSL+G+ +P         +GG+S+ LA   G+VVGG V   L+A++PV ++  
Sbjct: 176 LHGRFEILSLTGTVLPPP---APPGAGGLSIFLAGGQGQVVGGSVVAPLIASAPVILMAA 232

Query: 248 SF 249
           SF
Sbjct: 233 SF 234





Arabidopsis thaliana (taxid: 3702)
>sp|Q2JVA4|GLMU_SYNJA Bifunctional protein GlmU OS=Synechococcus sp. (strain JA-3-3Ab) GN=glmU PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query318
356514170327 PREDICTED: uncharacterized protein LOC10 0.977 0.951 0.723 1e-119
147794107 457 hypothetical protein VITISV_031923 [Viti 0.943 0.656 0.751 1e-118
225426649346 PREDICTED: uncharacterized protein LOC10 0.949 0.872 0.75 1e-117
449459666330 PREDICTED: uncharacterized protein LOC10 0.952 0.918 0.757 1e-117
356563280331 PREDICTED: uncharacterized protein LOC10 0.943 0.906 0.722 1e-116
255537127322 DNA binding protein, putative [Ricinus c 0.974 0.962 0.760 1e-114
297742667261 unnamed protein product [Vitis vinifera] 0.814 0.992 0.865 1e-113
224074727333 predicted protein [Populus trichocarpa] 0.990 0.945 0.708 1e-105
22328578356 AT-hook motif nuclear-localized protein 0.981 0.876 0.629 1e-104
4586113365 putative DNA-binding protein [Arabidopsi 0.981 0.854 0.607 1e-101
>gi|356514170|ref|XP_003525779.1| PREDICTED: uncharacterized protein LOC100801730 [Glycine max] Back     alignment and taxonomy information
 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 236/326 (72%), Positives = 272/326 (83%), Gaps = 15/326 (4%)

Query: 8   EAVTSATTGVTVVGSDAPSEFHIAPRSENPT----LTSAPPPPATQPPAPAPPP------ 57
           EA    ++GVTVVGSDAPS++H+APR++NP      T+  P  A    +P+ PP      
Sbjct: 2   EASGGVSSGVTVVGSDAPSDYHVAPRTDNPAPASGSTTQIPATAGSALSPSHPPHTAAME 61

Query: 58  ----ALPLKKKRGRPRKYGPDGTVTMALSPKPISSAAPSPPVIDFSAEKPRKVKPASSFS 113
                +P KKKRGRPRKY PDG+VTMALSPKPISS+AP PPVIDFS+EK  K+KPASS S
Sbjct: 62  AYPATMPAKKKRGRPRKYAPDGSVTMALSPKPISSSAPLPPVIDFSSEKRGKIKPASSVS 121

Query: 114 KSKYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISS 173
           K+K+E+EN+GEWVACSVGANFTPHIITVN+GEDVTMK+ISFSQQGPRAICILSANGVISS
Sbjct: 122 KAKFELENLGEWVACSVGANFTPHIITVNSGEDVTMKVISFSQQGPRAICILSANGVISS 181

Query: 174 VTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVA 233
           VTLRQPDSSGGTLTYEGRFEILSLSGSFMP++SGGTRSRSGGMSVSLASPDGRVVGGGVA
Sbjct: 182 VTLRQPDSSGGTLTYEGRFEILSLSGSFMPNESGGTRSRSGGMSVSLASPDGRVVGGGVA 241

Query: 234 GLLVAASPVQVVVGSFLAGNQHEQKHKKQKNEPISIATPTAAIPISSADPKGSLT-TSTF 292
           GLLVAASPVQVVVGSFLAGNQHEQK +KQK+E IS  TP A +PIS+ DP   L+  S+ 
Sbjct: 242 GLLVAASPVQVVVGSFLAGNQHEQKPRKQKHEVISSVTPAAVVPISTLDPVSILSAASSI 301

Query: 293 RGDGWSSLPSDSRNKPTDINASLPVG 318
           R D WS++P+++++KP DIN SLP G
Sbjct: 302 RNDNWSAMPAEAKDKPADINVSLPAG 327




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|147794107|emb|CAN62363.1| hypothetical protein VITISV_031923 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225426649|ref|XP_002274756.1| PREDICTED: uncharacterized protein LOC100244375 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449459666|ref|XP_004147567.1| PREDICTED: uncharacterized protein LOC101210208 [Cucumis sativus] gi|449523579|ref|XP_004168801.1| PREDICTED: uncharacterized LOC101210208 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356563280|ref|XP_003549892.1| PREDICTED: uncharacterized protein LOC100794202 [Glycine max] Back     alignment and taxonomy information
>gi|255537127|ref|XP_002509630.1| DNA binding protein, putative [Ricinus communis] gi|223549529|gb|EEF51017.1| DNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|297742667|emb|CBI34816.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224074727|ref|XP_002304442.1| predicted protein [Populus trichocarpa] gi|222841874|gb|EEE79421.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|22328578|ref|NP_192945.2| AT-hook motif nuclear-localized protein 1 [Arabidopsis thaliana] gi|17979485|gb|AAL50079.1| AT4g12080/F16J13_150 [Arabidopsis thaliana] gi|23506149|gb|AAN31086.1| At4g12080/F16J13_150 [Arabidopsis thaliana] gi|118420990|dbj|BAF37220.1| AT-hook motif nuclear localized protein 1 [Arabidopsis thaliana] gi|332657694|gb|AEE83094.1| AT-hook motif nuclear-localized protein 1 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|4586113|emb|CAB40949.1| putative DNA-binding protein [Arabidopsis thaliana] gi|7267909|emb|CAB78251.1| putative DNA-binding protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query318
TAIR|locus:2118091356 AHL1 "AT-hook motif nuclear-lo 0.694 0.620 0.565 1.4e-60
TAIR|locus:2132599334 AT4G22770 [Arabidopsis thalian 0.562 0.535 0.562 2.1e-54
TAIR|locus:2126946318 AT4G00200 [Arabidopsis thalian 0.459 0.459 0.591 3.7e-46
TAIR|locus:2122684404 AHL3 "AT-HOOK MOTIF NUCLEAR LO 0.443 0.349 0.560 1.7e-34
TAIR|locus:2153142419 AHL4 "AT-HOOK MOTIF NUCLEAR LO 0.386 0.293 0.585 3.7e-32
TAIR|locus:2167988404 AT5G62260 [Arabidopsis thalian 0.393 0.309 0.568 2e-31
TAIR|locus:2051038351 AT2G33620 [Arabidopsis thalian 0.449 0.407 0.493 1.3e-29
TAIR|locus:2050766348 AT2G45850 [Arabidopsis thalian 0.471 0.431 0.440 1.3e-29
TAIR|locus:2098861354 AT3G61310 [Arabidopsis thalian 0.471 0.423 0.434 2.4e-28
TAIR|locus:2031321378 AT1G63470 [Arabidopsis thalian 0.481 0.404 0.403 6.6e-26
TAIR|locus:2118091 AHL1 "AT-hook motif nuclear-localized protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 580 (209.2 bits), Expect = 1.4e-60, Sum P(2) = 1.4e-60
 Identities = 129/228 (56%), Positives = 145/228 (63%)

Query:    95 VIDFSA-EKPRKVKPASSFSKSKY--EVENIGEWVACSVGANFTPHIITVNTGEDVTMKI 151
             VIDFSA EK  KVKP +SF+++KY  +VEN+GEW  CSVG NFTPHIITVNTGEDVTMKI
Sbjct:   128 VIDFSASEKRSKVKPTNSFNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGEDVTMKI 187

Query:   152 ISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPXXXXXXXX 211
             ISFSQQGPR+IC+LSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMP        
Sbjct:   188 ISFSQQGPRSICVLSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRS 247

Query:   212 XXXXXXXXLASPDGRXXXXXXXXXXXXXSPVQVVVGSFLAGXXXXXXXXXXXXXXXXXXX 271
                     LASPDGR             SPVQVVVGSFLAG                   
Sbjct:   248 RTGGMSVSLASPDGRVVGGGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDFMLSS 307

Query:   272 XXXXX-XXXXXDPKG--SLTTSTFRGDGW-SSLPSDSRNKPTDINASL 315
                        D +   S+++     + W +SL SD RNK TDIN ++
Sbjct:   308 PTAAIPISSAADHRTIHSVSSLPVNNNTWQTSLASDPRNKHTDINVNV 355


GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0005739 "mitochondrion" evidence=IDA
GO:0005730 "nucleolus" evidence=IDA
GO:0005737 "cytoplasm" evidence=IDA
GO:0016132 "brassinosteroid biosynthetic process" evidence=RCA
TAIR|locus:2132599 AT4G22770 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2126946 AT4G00200 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2122684 AHL3 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2153142 AHL4 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2167988 AT5G62260 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051038 AT2G33620 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2050766 AT2G45850 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2098861 AT3G61310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031321 AT1G63470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00001020001
SubName- Full=Chromosome chr2 scaffold_112, whole genome shotgun sequence; (260 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query318
pfam03479120 pfam03479, DUF296, Domain of unknown function (DUF 4e-40
cd11378113 cd11378, DUF296, Domain of unknown function found 5e-36
COG1661141 COG1661, COG1661, Predicted DNA-binding protein wi 5e-05
PRK14951618 PRK14951, PRK14951, DNA polymerase III subunits ga 0.003
PRK14950585 PRK14950, PRK14950, DNA polymerase III subunits ga 0.004
>gnl|CDD|217587 pfam03479, DUF296, Domain of unknown function (DUF296) Back     alignment and domain information
 Score =  136 bits (344), Expect = 4e-40
 Identities = 53/123 (43%), Positives = 70/123 (56%), Gaps = 9/123 (7%)

Query: 134 FTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPD---SSGGTLTYEG 190
             PH++ +  GED+   + +F++Q      +LS  G +S+VTLRQPD    S G +T EG
Sbjct: 1   GRPHVLRLEPGEDLVESLEAFARQRGIGAAVLSGIGAVSNVTLRQPDEEAKSYGVVTLEG 60

Query: 191 RFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVA-GLLVAASPVQVVVGSF 249
           RFEILSLSG+  P         SG + VSLA PDG+VVGG +A G + A   V V   SF
Sbjct: 61  RFEILSLSGTISPGG-----KPSGHLHVSLADPDGQVVGGHLAEGTVFATGEVVVTELSF 115

Query: 250 LAG 252
              
Sbjct: 116 ENA 118


This putative domain is found in proteins that contain AT-hook motifs pfam02178, which strongly suggests a DNA-binding function for the proteins as a whole. There are three highly conserved histidine residues, eg at 117, 119 and 133 in Reut_B5223, which should be a structurally conserved metal-binding unit, based on structural comparison with known metal-binding structures. The proteins should work as trimers. Length = 120

>gnl|CDD|211390 cd11378, DUF296, Domain of unknown function found in archaea, bacteria, and plants Back     alignment and domain information
>gnl|CDD|224575 COG1661, COG1661, Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional Back     alignment and domain information
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 318
PF03479120 DUF296: Domain of unknown function (DUF296); Inter 99.95
COG1661141 Predicted DNA-binding protein with PD1-like DNA-bi 99.92
PF0217813 AT_hook: AT hook motif; InterPro: IPR017956 AT hoo 96.39
smart0038426 AT_hook DNA binding domain with preference for A/T 96.3
PF14621219 RFX5_DNA_bdg: RFX5 DNA-binding domain 83.46
>PF03479 DUF296: Domain of unknown function (DUF296); InterPro: IPR005175 This putative conserved domain is found in proteins that contain AT-hook motifs IPR000637 from INTERPRO, suggesting a DNA-binding function for the proteins as a whole, however, the function of this domain is unknown Back     alignment and domain information
Probab=99.95  E-value=1.3e-28  Score=205.64  Aligned_cols=111  Identities=32%  Similarity=0.469  Sum_probs=98.9

Q ss_pred             ceeEEEEecCCCcHHHHHHHHHHcCCccEEEEEeeceeeEEEEecCCC--CCCceeeecceeEEEeeeccccCCCCCCCC
Q 020999          134 FTPHIITVNTGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDS--SGGTLTYEGRFEILSLSGSFMPSDSGGTRS  211 (318)
Q Consensus       134 f~phVIrV~~GEDIve~I~~Faq~~~raicILSaiGaVsnVTLrq~~~--~~~~vt~eG~FEILSLSGsis~~e~g~~~~  211 (318)
                      ||+|++||++||||+++|++||+++.+..|+++++|+|++|+|++++.  ....++|+|+|||+||+|||...++    +
T Consensus         1 ~r~~~~rl~~Gedl~~~l~~~~~~~~i~~~~is~iGsl~~~~l~~~~~~~~~~~~~~~g~~Ei~sl~G~i~~~~g----~   76 (120)
T PF03479_consen    1 GRVFVIRLDPGEDLLESLEAFAREHGIRSGVISGIGSLSNVTLGYYDPPSYYEPLEFEGPFEIISLSGTISPEDG----K   76 (120)
T ss_dssp             EEEEEEEEETTSBHHHHHHHHHHHHT-SSEEEEEEEEEEEEEEEEEETTTEEEEEEEESEEEEEEEEEEEEEETT----E
T ss_pred             CcEEEEEECCCCHHHHHHHHHHHHCCCcEEEEEEEeEEeEEEEEEecccCCcceEEecccEEEEEeEEEEECCCC----C
Confidence            789999999999999999999999889899999999999999999954  3468899999999999999998444    5


Q ss_pred             CCCceEEEEeCCCCcEEeeecCCcceeecceEEEEEEc
Q 020999          212 RSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSF  249 (318)
Q Consensus       212 ~~~HLHVSLA~~dGqViGGHV~g~lIAastVEVVv~sF  249 (318)
                      ++.|||++|++.||+|+||||..+.+ ..++||+|..+
T Consensus        77 ~~~HlHisl~~~~g~v~gGHl~~g~v-~~t~Ev~i~~~  113 (120)
T PF03479_consen   77 PFVHLHISLADPDGQVFGGHLLEGTV-FATAEVVITEL  113 (120)
T ss_dssp             EEEEEEEEEE-TTSEEEEEEEEEEEE-EEEEEEEEEEE
T ss_pred             CcceEEEEEECCCCeEEeeEeCCCEE-eEEEEEEEEEe
Confidence            89999999999999999999996666 67999999984



Overexpression of a protein containing this domain, Q9S7C9 from SWISSPROT, in Arabidopsis thaliana causes late flowering and modified leaf development []. ; PDB: 2DT4_A 2P6Y_A 3HWU_A 3HTN_A 2NMU_A 2H6L_A 2HX0_A.

>COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions Back     alignment and domain information
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions Back     alignment and domain information
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query318
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 6e-29
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 1e-27
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 3e-18
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 1e-08
3htn_A149 Putative DNA binding protein; DUF269 family protei 1e-08
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 2e-08
>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Length = 154 Back     alignment and structure
 Score =  107 bits (269), Expect = 6e-29
 Identities = 26/128 (20%), Positives = 57/128 (44%), Gaps = 10/128 (7%)

Query: 123 GEWVACSVGANFTPHIITVNTGEDVTMKIISFSQQ-GPRAICILSANGVISSVTLRQPDS 181
              V+    +    + + +  G++V  ++ +F QQ   RA  I    G ++ V LR    
Sbjct: 7   SMTVSHHNASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQ 66

Query: 182 SGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASP 241
              T +  G FE++SL+G+   +           + ++++ P G ++GG +       + 
Sbjct: 67  -EATTSLTGTFEVISLNGTLELTGEH--------LHLAVSDPYGVMLGGHMMPGCTVRTT 117

Query: 242 VQVVVGSF 249
           +++V+G  
Sbjct: 118 LELVIGEL 125


>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Length = 142 Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Length = 143 Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Length = 146 Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} Length = 149 Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Length = 147 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query318
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 99.94
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 99.94
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 99.94
3htn_A149 Putative DNA binding protein; DUF269 family protei 99.94
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 99.93
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 99.91
2ezd_A26 High mobility group protein HMG-I/HMG-Y; DNA bindi 96.37
>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Back     alignment and structure
Probab=99.94  E-value=1.2e-26  Score=197.88  Aligned_cols=116  Identities=25%  Similarity=0.384  Sum_probs=102.2

Q ss_pred             ceeEEEEecCCCcHHHHHHHHHHc-CCccEEEEEeeceeeEEEEecCCCCCCceeeecceeEEEeeeccccCCCCCCCCC
Q 020999          134 FTPHIITVNTGEDVTMKIISFSQQ-GPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSR  212 (318)
Q Consensus       134 f~phVIrV~~GEDIve~I~~Faq~-~~raicILSaiGaVsnVTLrq~~~~~~~vt~eG~FEILSLSGsis~~e~g~~~~~  212 (318)
                      |++|++||++||||+++|.+||++ +.+++||++++|++++++|++++... +++|+|+|||+||+|||+++        
T Consensus         2 ~r~~~lrL~~Gedl~~~i~~~~~~~~i~~a~v~~~iGsl~~~~l~~~~~~~-~~~~~g~~EIlsl~Gti~~~--------   72 (142)
T 2p6y_A            2 IHLIALRLTRGMDLKQQIVQLVQQHRIHAGSIASCVGCLSTLHIRLADSVS-TLQVSAPFEILSLSGTLTYQ--------   72 (142)
T ss_dssp             CEEEEEEECTTCBHHHHHHHHHHHTTCSSEEEEEEEEEEEEEEEECTTSSC-EEEECSCEEEEEEEEEECSS--------
T ss_pred             CcEEEEEECCCCcHHHHHHHHHHHhCCCEEEEEEeEEEEEeEEEECCCCCc-cEecCCcEEEEEeEEEEeCC--------
Confidence            789999999999999999999976 55578999999999999999999864 78899999999999999985        


Q ss_pred             CCceEEEEeCCCCcEEeeecCCcceeecceEEEEEEccCCchhhhcc
Q 020999          213 SGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKH  259 (318)
Q Consensus       213 ~~HLHVSLA~~dGqViGGHV~g~lIAastVEVVv~sF~~~~~~~~k~  259 (318)
                      ..|||++|+|.||+++||||.+++++..++||+|.+| .+..++|++
T Consensus        73 ~~HlHisl~~~~G~v~GGHl~~g~~V~~t~Ev~i~~~-~~~~~~R~~  118 (142)
T 2p6y_A           73 HCHLHIAVADAQGRVWGGHLLEGNLINTTAELMIHHY-PQHHFTREF  118 (142)
T ss_dssp             CEEEEEEEECTTSCEEEEEECTTCEECC-EEEEEEEC-TTEEEEEEE
T ss_pred             CCEEEEEEECCCCCEEccccCCCCeEEEEEEEEEEEc-cCCeEEEee
Confidence            2599999999999999999998777789999999995 444566655



>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Back     alignment and structure
>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: d.290.1.0 Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Back     alignment and structure
>2ezd_A High mobility group protein HMG-I/HMG-Y; DNA binding protein, minor groove DNA binding, transcriptional CO-activator, architectural factor; HET: DNA; NMR {Homo sapiens} SCOP: j.10.1.1 PDB: 2eze_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 318
d2hx0a1136 d.290.1.3 (A:6-141) Hypothetical protein STM3071 { 2e-24
d2h6la1138 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {A 5e-17
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Length = 136 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
 Score = 94.2 bits (234), Expect = 2e-24
 Identities = 24/124 (19%), Positives = 55/124 (44%), Gaps = 11/124 (8%)

Query: 137 HIITVNTGEDVTMKIISFS-QQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEIL 195
           + + +  G++V  ++ +F  Q   RA  I    G ++ V LR       T +  G FE++
Sbjct: 9   YALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQ-EATTSLTGTFEVI 67

Query: 196 SLSGSFMPSDSGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQH 255
           SL+G+   +           + ++++ P G ++GG +       + +++V+G        
Sbjct: 68  SLNGTLELTGEH--------LHLAVSDPYGVMLGGHMMPGCTVRTTLELVIGEL-PALTF 118

Query: 256 EQKH 259
            ++ 
Sbjct: 119 SRQP 122


>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Length = 138 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query318
d2hx0a1136 Hypothetical protein STM3071 {Salmonella typhimuri 99.95
d2h6la1138 Hypothetical protein AF0104 {Archaeoglobus fulgidu 99.91
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
Probab=99.95  E-value=1.6e-27  Score=200.30  Aligned_cols=119  Identities=20%  Similarity=0.376  Sum_probs=107.4

Q ss_pred             CCCceeEEEEecCCCcHHHHHHHHHH-cCCccEEEEEeeceeeEEEEecCCCCCCceeeecceeEEEeeeccccCCCCCC
Q 020999          131 GANFTPHIITVNTGEDVTMKIISFSQ-QGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGT  209 (318)
Q Consensus       131 g~~f~phVIrV~~GEDIve~I~~Faq-~~~raicILSaiGaVsnVTLrq~~~~~~~vt~eG~FEILSLSGsis~~e~g~~  209 (318)
                      ++..|.|++||++||||+++|++||+ ++.+++||++++|++++|+|++|+.. ....++|+|||+||+|||...+.   
T Consensus         3 ~~~~R~~~lrl~~Gedl~~~i~~~~~~~~I~~a~V~~~iGs~~~~~~~~~~~~-~~~~~~g~~Ei~sl~G~I~~~~~---   78 (136)
T d2hx0a1           3 ASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQE-ATTSLTGTFEVISLNGTLELTGE---   78 (136)
T ss_dssp             CCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTCS-SCEEEEEEEEEEEEEEEEETTEE---
T ss_pred             CCCCcEEEEEECCCChHHHHHHHHHHHhCCCEEEEEEEeeeeEEEEEEeCCCC-CcEEecCcEEEEEEEEEeccCCC---
Confidence            56789999999999999999999995 56678999999999999999999876 57789999999999999987664   


Q ss_pred             CCCCCceEEEEeCCCCcEEeeecCCcceeecceEEEEEEccCCchhhhcc
Q 020999          210 RSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKH  259 (318)
Q Consensus       210 ~~~~~HLHVSLA~~dGqViGGHV~g~lIAastVEVVv~sF~~~~~~~~k~  259 (318)
                           |||++|+|.||+++||||+++++++.++||+|.+| .+..++|++
T Consensus        79 -----HlH~~~a~~~g~v~gGhL~~g~~v~~t~Eivi~~l-~~~~~~R~~  122 (136)
T d2hx0a1          79 -----HLHLAVSDPYGVMLGGHMMPGCTVRTTLELVIGEL-PALTFSRQP  122 (136)
T ss_dssp             -----EEEEEEECTTSCEEEEEECTTCEEEEEEEEEEEEC-TTEEEEEEE
T ss_pred             -----eEEEEEECCCCcEEeEEecCCcEEEEEEEEEEEEc-cCCceEEcc
Confidence                 99999999999999999999999999999999996 555666655



>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure