Citrus Sinensis ID: 017619
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 368 | ||||||
| 359481929 | 489 | PREDICTED: heparan-alpha-glucosaminide N | 0.842 | 0.633 | 0.744 | 1e-136 | |
| 147817637 | 511 | hypothetical protein VITISV_004036 [Viti | 0.842 | 0.606 | 0.696 | 1e-132 | |
| 255556868 | 519 | conserved hypothetical protein [Ricinus | 0.864 | 0.612 | 0.719 | 1e-132 | |
| 224069583 | 496 | predicted protein [Populus trichocarpa] | 0.839 | 0.622 | 0.700 | 1e-125 | |
| 356503734 | 508 | PREDICTED: heparan-alpha-glucosaminide N | 0.861 | 0.624 | 0.677 | 1e-123 | |
| 356570776 | 509 | PREDICTED: LOW QUALITY PROTEIN: heparan- | 0.850 | 0.614 | 0.674 | 1e-121 | |
| 356548323 | 419 | PREDICTED: heparan-alpha-glucosaminide N | 0.668 | 0.587 | 0.788 | 1e-116 | |
| 357152403 | 498 | PREDICTED: heparan-alpha-glucosaminide N | 0.834 | 0.616 | 0.619 | 1e-113 | |
| 449440411 | 488 | PREDICTED: heparan-alpha-glucosaminide N | 0.815 | 0.614 | 0.669 | 1e-113 | |
| 242067981 | 512 | hypothetical protein SORBIDRAFT_05g00697 | 0.801 | 0.576 | 0.609 | 1e-107 |
| >gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 236/317 (74%), Positives = 267/317 (84%), Gaps = 7/317 (2%)
Query: 3 DLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKR 62
D + VEEGLG ++D + + I K++ EQKGE Q L++QKSKR
Sbjct: 4 DAKRVEEGLGHVH-----KEDISEKADKIEKDESSATPAQSVEQKGEEQ--PLIKQKSKR 56
Query: 63 VATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
VATLDAFRGLT+VLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALALKK
Sbjct: 57 VATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKK 116
Query: 123 VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVV 182
+P+I+ AVKKI RTLKLLFWGI+LQGGYSHAPD LSYGVDMKHIRW GILQRIA+VY V
Sbjct: 117 IPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVYFV 176
Query: 183 VALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSD 242
VALIETLTTKRRP V++ H SI +AY+WQWIGGF+AF+IY+ITTY+LYVP+WSF D
Sbjct: 177 VALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVIDQD 236
Query: 243 HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
H K+Y VKCGMRGHLGPACNAVGYVDR++WGINHLYS PVW+RL+ACTLSSPNSGP RE
Sbjct: 237 HEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFRE 296
Query: 303 DAPSWCRAPFEPEGLLS 319
DAPSWC APFEPEGLLS
Sbjct: 297 DAPSWCYAPFEPEGLLS 313
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis] gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa] gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Brachypodium distachyon] | Back alignment and taxonomy information |
|---|
| >gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor] gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 368 | ||||||
| TAIR|locus:2180305 | 472 | AT5G27730 "AT5G27730" [Arabido | 0.701 | 0.546 | 0.601 | 6.4e-83 | |
| TAIR|locus:2160902 | 440 | AT5G47900 "AT5G47900" [Arabido | 0.809 | 0.677 | 0.466 | 1.3e-68 | |
| UNIPROTKB|F1NBK1 | 584 | HGSNAT "Uncharacterized protei | 0.722 | 0.455 | 0.291 | 1.2e-23 | |
| MGI|MGI:1196297 | 656 | Hgsnat "heparan-alpha-glucosam | 0.557 | 0.312 | 0.352 | 2e-22 | |
| UNIPROTKB|Q68CP4 | 663 | HGSNAT "Heparan-alpha-glucosam | 0.557 | 0.309 | 0.333 | 6.2e-19 | |
| UNIPROTKB|F1MF45 | 592 | HGSNAT "Uncharacterized protei | 0.654 | 0.407 | 0.317 | 6.5e-19 | |
| DICTYBASE|DDB_G0286315 | 675 | DDB_G0286315 "transmembrane pr | 0.326 | 0.177 | 0.418 | 1.2e-15 | |
| UNIPROTKB|Q489U3 | 358 | CPS_0413 "Putative membrane pr | 0.331 | 0.340 | 0.384 | 3e-14 | |
| TIGR_CMR|CPS_0413 | 358 | CPS_0413 "putative membrane pr | 0.331 | 0.340 | 0.384 | 3e-14 | |
| DICTYBASE|DDB_G0270192 | 426 | DDB_G0270192 "DUF1624 family p | 0.304 | 0.262 | 0.385 | 6.9e-10 |
| TAIR|locus:2180305 AT5G27730 "AT5G27730" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 831 (297.6 bits), Expect = 6.4e-83, P = 6.4e-83
Identities = 157/261 (60%), Positives = 187/261 (71%)
Query: 62 RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
R+A+LD FRGLTV LMILVDDAGG + I H+PWNGC LADFVMPFFLFIVGV+IAL+LK
Sbjct: 36 RLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLK 95
Query: 122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
++ A KK+ FRT KLLFWG++LQGG+SHAPD L+YGVD+ +R+CGILQRIAL Y+
Sbjct: 96 RISNKFEACKKVGFRTCKLLFWGLLLQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYL 155
Query: 182 VVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
VVAL+E T L SIF +Y W WI VIY+ T Y YVP+W F +
Sbjct: 156 VVALVEIFTKDSHEENLSTGRFSIFKSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYD 215
Query: 242 DHGV---KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
V K V CG+RG L P CNAVGYVDR++ GINH+Y P W R +ACT SP G
Sbjct: 216 KDSVLYGKILSVSCGVRGKLNPPCNAVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEG 275
Query: 299 PLREDAPSWCRAPFEPEGLLS 319
+R+DAPSWCRAPFEPEG+LS
Sbjct: 276 AIRQDAPSWCRAPFEPEGILS 296
|
|
| TAIR|locus:2160902 AT5G47900 "AT5G47900" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1NBK1 HGSNAT "Uncharacterized protein" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| MGI|MGI:1196297 Hgsnat "heparan-alpha-glucosaminide N-acetyltransferase" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q68CP4 HGSNAT "Heparan-alpha-glucosaminide N-acetyltransferase" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1MF45 HGSNAT "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0286315 DDB_G0286315 "transmembrane protein" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q489U3 CPS_0413 "Putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|CPS_0413 CPS_0413 "putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0270192 DDB_G0270192 "DUF1624 family protein" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 368 | |||
| COG4299 | 371 | COG4299, COG4299, Uncharacterized protein conserve | 4e-23 |
| >gnl|CDD|226749 COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Score = 98.8 bits (246), Expect = 4e-23
Identities = 64/227 (28%), Positives = 88/227 (38%), Gaps = 46/227 (20%)
Query: 58 QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
Q + R+ +LD FRGLTV+LMILV++AG Y ++ H+ W G TL D V P+FLF VG
Sbjct: 3 QPAFRLTSLDVFRGLTVLLMILVNNAGLGDSTYRQLSHAHWGGLTLTDLVFPWFLFCVGA 62
Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD-MKHIRWCGIL 173
A+ + K+ K N ++R F L G + + V R G+L
Sbjct: 63 AMPFSASKMNKANVTTWP-LYRRAAERFALGYLMGAF---VTVRDWSVTSHSLTRGMGVL 118
Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
QRIAL Y+ AL+ R WQ + + Y + P
Sbjct: 119 QRIALAYLFAALLVRQLRGR-----------------WQALLAAVLLAGYWLFLMFTPHP 161
Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYS 280
LG N D +HLYS
Sbjct: 162 AAP---------------------LGGIGNVGESADPLQILNDHLYS 187
|
Length = 371 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 368 | |||
| KOG4683 | 549 | consensus Uncharacterized conserved protein [Funct | 100.0 | |
| COG4299 | 371 | Uncharacterized protein conserved in bacteria [Fun | 100.0 | |
| PF07786 | 223 | DUF1624: Protein of unknown function (DUF1624); In | 99.8 | |
| COG3503 | 323 | Predicted membrane protein [Function unknown] | 99.43 | |
| COG2311 | 394 | Predicted membrane protein [Function unknown] | 99.33 | |
| PRK10835 | 373 | hypothetical protein; Provisional | 98.7 | |
| PF10129 | 358 | OpgC_C: OpgC protein; InterPro: IPR014550 There is | 97.53 | |
| PRK03854 | 375 | opgC glucans biosynthesis protein; Provisional | 97.17 | |
| COG4645 | 410 | Uncharacterized protein conserved in bacteria [Fun | 96.96 | |
| PF01757 | 340 | Acyl_transf_3: Acyltransferase family; InterPro: I | 96.82 | |
| COG3274 | 332 | Predicted O-acyltransferase [General function pred | 92.54 | |
| COG1835 | 386 | Predicted acyltransferases [Lipid metabolism] | 87.52 |
| >KOG4683 consensus Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.6e-62 Score=478.85 Aligned_cols=270 Identities=43% Similarity=0.753 Sum_probs=231.2
Q ss_pred hhhccCchhHHHHHHHHHHHHHHHHHhccCcccccccccCCCchhhHHHHHHHHHHHHHHHHHHHhhccCChhHHHHHHH
Q 017619 55 LLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKII 134 (368)
Q Consensus 55 ~~~~~~~Rl~SLD~lRGltm~lMIlVN~~g~~~~~l~Ha~WnG~t~aDlVfP~FlFivGvSi~ls~~r~~~k~~~~~kil 134 (368)
..+...+|+.|+|+|||+|+++||+||+.|+.||+++|++|||++++|+|||+|+||||+|+++|+|+..+|.+..+|.-
T Consensus 136 sla~~r~RL~SLD~FRGltValMIlVdd~GG~~p~I~HapWnG~~LADfVmPfFLfIvGVsials~K~~s~rf~a~rKa~ 215 (549)
T KOG4683|consen 136 SLATQRKRLRSLDTFRGLTVALMILVDDGGGGYPWIEHAPWNGLHLADFVMPFFLFIVGVSIALSVKSQSSRFSATRKAK 215 (549)
T ss_pred hcCCCchhhhhhhhhcCceEEEEEEEecCCCCchhhhcCCcCCccHHHHHHHHHHHHHHhhhhhhhhhhhhhhhHhHHHH
Confidence 34445689999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhhcCCCCCCcccccccCccceehhhHHHHHHHHHHHHHHHHHhhcCCCCCCCccchhhhhhhHHHHH
Q 017619 135 FRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214 (368)
Q Consensus 135 rRsl~LfllGl~l~~~~~~~~~~l~~~~~~~~~r~~GVLqrIgl~Yli~all~l~~~~~~~~~~~~~~~~i~~~~~~qw~ 214 (368)
.|+.+|+++|++++++|.+.++++++++|.+++|++|||||+|++|+++|++..+..+..... ..||-+
T Consensus 216 ~R~cklllwgLflqGgf~h~~~nLTygidve~lR~mGILQr~~~ayLVvAi~~~~~~~~~~~~-----------~S~~R~ 284 (549)
T KOG4683|consen 216 ARICKLLLWGLFLQGGFLHSMSNLTYGIDVEQLRIMGILQRFGVAYLVVAILHTLCCRPISPQ-----------RSWQRA 284 (549)
T ss_pred HHHHHHHHHHHHHhhhcccCcccccCCccHHHHHHHHHHHHhhHHHHHHHHHhhhccCCCccc-----------cchhhh
Confidence 999999999999999999999999999999999999999999999999999998887642211 123445
Q ss_pred HHHHHHHHHHHHHHhcccCCCCcc---ccCCCCcceeeeecccCCCCCc-cccHhhhhhhhhccccccCCCCcccccccc
Q 017619 215 GGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKCGMRGHLGP-ACNAVGYVDRELWGINHLYSDPVWSRLEAC 290 (368)
Q Consensus 215 ~~~~lL~~y~~l~~~l~VP~~~~~---~~~~~~~~~~~~~~g~~g~l~~-~~N~ag~IDR~vlG~~HlY~~Ptw~~~~~c 290 (368)
+....+..|.+..++-+||+|.+- ...|+.++.+.+.||.++...+ .||++||.||+++|++|||++|+|+|.+.|
T Consensus 285 V~~~~L~~~~~~~~~~~V~~~~~~~~~~~~~~~~r~~~~~~G~~~~~~~P~CnAvGy~DrqvLGi~HiY~hP~~~r~k~c 364 (549)
T KOG4683|consen 285 VHDVCLFSGELAVLLALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNAHPKCNAVGYADRQVLGIAHIYQHPTAKRVKDC 364 (549)
T ss_pred hhHHHHHHHHHHHHHHhhhhhhceecccccCCCCcccccCCcccccCCCCCccchhhhHHhhhhhHHHhcCchHHHhhhc
Confidence 555555555555555555554322 1223345555556777777764 599999999999999999999999999999
Q ss_pred cCCCCCCCCCCCCCCCcCCCCCCCccchhchhHHHHHHHHHHHHHhhhh
Q 017619 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSYSLWYNWYSLWTCFDSFQGS 339 (368)
Q Consensus 291 ~~~~~~~g~~~~~~~~w~~~~fDPEGLLstL~ai~~y~~~~~~~~~~~~ 339 (368)
+.++|+.|+++.|+|+||++||||||+||+|.|++. .+.|...|.
T Consensus 365 s~n~P~nG~l~~DAPSWCqapFdPEGilssi~avv~----~llG~h~Gh 409 (549)
T KOG4683|consen 365 SINYPNNGPLPPDAPSWCQAPFDPEGILSSILAVVQ----VLLGAHAGH 409 (549)
T ss_pred ccCCCCCCCCCCCCchhhcCCCChHHHHHHHHHHHH----HHHHhhcCe
Confidence 999999999999999999999999999999999977 778887773
|
|
| >COG4299 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >PF07786 DUF1624: Protein of unknown function (DUF1624); InterPro: IPR012429 These sequences are found in hypothetical proteins of unknown function expressed by bacterial and archaeal species | Back alignment and domain information |
|---|
| >COG3503 Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
| >COG2311 Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
| >PRK10835 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF10129 OpgC_C: OpgC protein; InterPro: IPR014550 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function | Back alignment and domain information |
|---|
| >PRK03854 opgC glucans biosynthesis protein; Provisional | Back alignment and domain information |
|---|
| >COG4645 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >PF01757 Acyl_transf_3: Acyltransferase family; InterPro: IPR002656 This entry contains a range of acyltransferase enzymes as well as yet uncharacterised proteins from Caenorhabditis elegans | Back alignment and domain information |
|---|
| >COG3274 Predicted O-acyltransferase [General function prediction only] | Back alignment and domain information |
|---|
| >COG1835 Predicted acyltransferases [Lipid metabolism] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00