Citrus Sinensis ID: 017018
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 379 | ||||||
| 255578131 | 368 | conserved hypothetical protein [Ricinus | 0.968 | 0.997 | 0.776 | 1e-180 | |
| 224106772 | 382 | predicted protein [Populus trichocarpa] | 0.989 | 0.981 | 0.777 | 1e-177 | |
| 359478357 | 380 | PREDICTED: uncharacterized protein LOC10 | 0.992 | 0.989 | 0.769 | 1e-174 | |
| 297849098 | 382 | hypothetical protein ARALYDRAFT_888027 [ | 0.989 | 0.981 | 0.742 | 1e-174 | |
| 449456016 | 393 | PREDICTED: uncharacterized protein LOC10 | 0.989 | 0.954 | 0.749 | 1e-173 | |
| 297826107 | 381 | hypothetical protein ARALYDRAFT_481676 [ | 0.981 | 0.976 | 0.752 | 1e-171 | |
| 30683843 | 374 | uncharacterized protein [Arabidopsis tha | 0.970 | 0.983 | 0.753 | 1e-171 | |
| 18390845 | 382 | uncharacterized protein [Arabidopsis tha | 0.989 | 0.981 | 0.750 | 1e-169 | |
| 297746271 | 356 | unnamed protein product [Vitis vinifera] | 0.928 | 0.988 | 0.773 | 1e-163 | |
| 356541916 | 391 | PREDICTED: uncharacterized protein LOC10 | 0.960 | 0.930 | 0.701 | 1e-163 |
| >gi|255578131|ref|XP_002529935.1| conserved hypothetical protein [Ricinus communis] gi|223530565|gb|EEF32443.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 636 bits (1641), Expect = e-180, Method: Compositional matrix adjust.
Identities = 292/376 (77%), Positives = 335/376 (89%), Gaps = 9/376 (2%)
Query: 1 MGTSHRSGFLKKTNDGGRLIITTFIGVLIGYFVGRSFQSVSFSKIHLPTSISTSFDIAGK 60
MG+ HRSG +KTND RLI+TT +GV+ G+FVG SF SVS +KIH+P+ +S D++
Sbjct: 1 MGSIHRSGLSRKTNDSARLIVTTILGVVFGFFVGISFPSVSLTKIHIPSGFISSTDMSST 60
Query: 61 EENINQISESPFSNGSSPRIYVSTNPRGAELLPPGIVVSESDFFLRRLWGEPSEDLKKKP 120
++NI G++P+IY NPRGAELLPPGIVVSESDF+LRRLWGEPSEDLKKKP
Sbjct: 61 DQNIR---------GNAPKIYAPRNPRGAELLPPGIVVSESDFYLRRLWGEPSEDLKKKP 111
Query: 121 KYLLTITVGLNQKKNIDRMVKKFSEDFQIMLFHYDGQTSQWDEFEWSKSAIHVSIRRQTK 180
KYL+T TVG +Q+ NI+ +KKFS+DF I+LFHYDG+ S+WD+FEWS++AIHVS+RRQTK
Sbjct: 112 KYLVTFTVGYDQRNNINAAIKKFSDDFTILLFHYDGRVSEWDQFEWSRTAIHVSVRRQTK 171
Query: 181 WWYAKRFLHPDVVAAYEYIFIWDEDLGVEHFNGDKYMELVKKHGLDISQPGLEPNNGLTW 240
WWYAKRFLHPD+VAAYEYIFIWDEDLGVEHFN DKY+ELVKKHGL+ISQPGLEPNNGLTW
Sbjct: 172 WWYAKRFLHPDIVAAYEYIFIWDEDLGVEHFNADKYIELVKKHGLEISQPGLEPNNGLTW 231
Query: 241 QMTKRRGDQEVHKVTEEKPGWCSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLVHG 300
QMTKRRGD+EVHK TEEKPGWCSDP LPPCAAFVEIMAPVFSREAWRCVWHM+QNDLVHG
Sbjct: 232 QMTKRRGDKEVHKDTEEKPGWCSDPRLPPCAAFVEIMAPVFSREAWRCVWHMVQNDLVHG 291
Query: 301 WGLDFALRRCVEPAHEKIGVVDSQWIIHQVIPSLGSQGQAENGKAPWEGVRARCKNEWSL 360
WGLDFALRRCVEPAHEKIGVVDSQWIIHQVIPSLG+QG++E GKAPWEGVRARCKNEWS+
Sbjct: 292 WGLDFALRRCVEPAHEKIGVVDSQWIIHQVIPSLGNQGKSEKGKAPWEGVRARCKNEWSM 351
Query: 361 FQNRLANADKAYLEEI 376
FQ+RLA+ADKAYL +I
Sbjct: 352 FQSRLADADKAYLSQI 367
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224106772|ref|XP_002314280.1| predicted protein [Populus trichocarpa] gi|222850688|gb|EEE88235.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|359478357|ref|XP_002285240.2| PREDICTED: uncharacterized protein LOC100267242 isoform 1 [Vitis vinifera] gi|359478359|ref|XP_003632112.1| PREDICTED: uncharacterized protein LOC100267242 isoform 2 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|297849098|ref|XP_002892430.1| hypothetical protein ARALYDRAFT_888027 [Arabidopsis lyrata subsp. lyrata] gi|297338272|gb|EFH68689.1| hypothetical protein ARALYDRAFT_888027 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|449456016|ref|XP_004145746.1| PREDICTED: uncharacterized protein LOC101216740 [Cucumis sativus] gi|449524368|ref|XP_004169195.1| PREDICTED: uncharacterized LOC101216740 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|297826107|ref|XP_002880936.1| hypothetical protein ARALYDRAFT_481676 [Arabidopsis lyrata subsp. lyrata] gi|297326775|gb|EFH57195.1| hypothetical protein ARALYDRAFT_481676 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|30683843|ref|NP_850118.1| uncharacterized protein [Arabidopsis thaliana] gi|30683846|ref|NP_850119.1| uncharacterized protein [Arabidopsis thaliana] gi|238479383|ref|NP_001154537.1| uncharacterized protein [Arabidopsis thaliana] gi|62320360|dbj|BAD94744.1| hypothetical protein [Arabidopsis thaliana] gi|330253009|gb|AEC08103.1| uncharacterized protein [Arabidopsis thaliana] gi|330253010|gb|AEC08104.1| uncharacterized protein [Arabidopsis thaliana] gi|330253011|gb|AEC08105.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|18390845|ref|NP_563805.1| uncharacterized protein [Arabidopsis thaliana] gi|79317279|ref|NP_001030994.1| uncharacterized protein [Arabidopsis thaliana] gi|15810567|gb|AAL07171.1| putative storage protein [Arabidopsis thaliana] gi|20259569|gb|AAM14127.1| putative storage protein [Arabidopsis thaliana] gi|332190112|gb|AEE28233.1| uncharacterized protein [Arabidopsis thaliana] gi|332190113|gb|AEE28234.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297746271|emb|CBI16327.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356541916|ref|XP_003539418.1| PREDICTED: uncharacterized protein LOC100785818 [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 379 | ||||||
| TAIR|locus:2205140 | 382 | AT1G08040 [Arabidopsis thalian | 0.989 | 0.981 | 0.750 | 4.2e-166 | |
| TAIR|locus:2057557 | 374 | AT2G28310 "AT2G28310" [Arabido | 0.970 | 0.983 | 0.753 | 8.1e-163 | |
| TAIR|locus:2008640 | 404 | AT1G67850 "AT1G67850" [Arabido | 0.992 | 0.930 | 0.664 | 2.9e-151 | |
| TAIR|locus:2086721 | 398 | AT3G27470 "AT3G27470" [Arabido | 0.815 | 0.776 | 0.729 | 2.1e-143 | |
| TAIR|locus:2031905 | 401 | AT1G13000 [Arabidopsis thalian | 0.989 | 0.935 | 0.617 | 8.3e-138 | |
| TAIR|locus:2079291 | 396 | AT3G26440 [Arabidopsis thalian | 0.989 | 0.946 | 0.619 | 1.3e-132 | |
| TAIR|locus:504956279 | 381 | AT1G24570 [Arabidopsis thalian | 0.963 | 0.958 | 0.589 | 4.3e-125 | |
| TAIR|locus:2124608 | 389 | AT4G18530 "AT4G18530" [Arabido | 0.802 | 0.781 | 0.429 | 8.2e-67 | |
| TAIR|locus:2008425 | 425 | AT1G61240 [Arabidopsis thalian | 0.686 | 0.611 | 0.479 | 4.1e-65 | |
| TAIR|locus:2202124 | 438 | AT1G11170 [Arabidopsis thalian | 0.643 | 0.557 | 0.494 | 1.6e-63 |
| TAIR|locus:2205140 AT1G08040 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1616 (573.9 bits), Expect = 4.2e-166, P = 4.2e-166
Identities = 283/377 (75%), Positives = 332/377 (88%)
Query: 1 MGTSHRSGFLKKTNDGGRLIITTFIGVLIGYFVGRSFQSVSFSKIHLPTSISTSFDIAGK 60
MGT HRSG ++TN+ +LIITT +GV+ G+FVG + SF KI LP+ + +S D+A
Sbjct: 1 MGTMHRSGAPRRTNENAKLIITTIVGVVFGFFVGITLPLGSFRKISLPSGLMSSLDVAMS 60
Query: 61 EENINQISESPFSNGS--SPRIYVSTNPRGAELLPPGIVVSESDFFLRRLWGEPSEDLKK 118
+ + SP GS SP+IYV TNP GAELLPPGI+V+E+DF+LRRLWGEPSEDLKK
Sbjct: 61 DGKLFSGGRSPEDIGSRKSPKIYVPTNPHGAELLPPGIIVAETDFYLRRLWGEPSEDLKK 120
Query: 119 KPKYLLTITVGLNQKKNIDRMVKKFSEDFQIMLFHYDGQTSQWDEFEWSKSAIHVSIRRQ 178
KPKYL+T TVG Q+ NI+ VKKFSEDFQI+LFHYDG+T++WD+FEWSK+AIH+S ++Q
Sbjct: 121 KPKYLVTFTVGFEQRNNINAAVKKFSEDFQILLFHYDGRTTEWDQFEWSKNAIHISAKKQ 180
Query: 179 TKWWYAKRFLHPDVVAAYEYIFIWDEDLGVEHFNGDKYMELVKKHGLDISQPGLEPNNGL 238
TKWWYAKRFLHPDVV+AYEYIFIWDEDLGVEHFN D+Y+ELVKKHGL+ISQPGLEPNNGL
Sbjct: 181 TKWWYAKRFLHPDVVSAYEYIFIWDEDLGVEHFNADRYVELVKKHGLEISQPGLEPNNGL 240
Query: 239 TWQMTKRRGDQEVHKVTEEKPGWCSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLV 298
TW+MTKRRGD++VHK T+EKPGWCSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLV
Sbjct: 241 TWEMTKRRGDRDVHKETKEKPGWCSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLV 300
Query: 299 HGWGLDFALRRCVEPAHEKIGVVDSQWIIHQVIPSLGSQGQAENGKAPWEGVRARCKNEW 358
HGWGLDFALRRCVEPAHEKIGVVDSQWIIHQVIPSLGSQG++E GK+PW+GVR RC+NEW
Sbjct: 301 HGWGLDFALRRCVEPAHEKIGVVDSQWIIHQVIPSLGSQGESEEGKSPWQGVRERCRNEW 360
Query: 359 SLFQNRLANADKAYLEE 375
++FQNR+A ADKAY+E+
Sbjct: 361 TMFQNRVAEADKAYMEQ 377
|
|
| TAIR|locus:2057557 AT2G28310 "AT2G28310" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008640 AT1G67850 "AT1G67850" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2086721 AT3G27470 "AT3G27470" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2031905 AT1G13000 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2079291 AT3G26440 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:504956279 AT1G24570 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2124608 AT4G18530 "AT4G18530" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008425 AT1G61240 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2202124 AT1G11170 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| estExt_fgenesh4_pm.C_LG_IX0699 | hypothetical protein (382 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 379 | |||
| pfam05212 | 294 | pfam05212, DUF707, Protein of unknown function (DU | 0.0 |
| >gnl|CDD|218501 pfam05212, DUF707, Protein of unknown function (DUF707) | Back alignment and domain information |
|---|
Score = 558 bits (1440), Expect = 0.0
Identities = 216/294 (73%), Positives = 254/294 (86%), Gaps = 1/294 (0%)
Query: 80 IYVSTNPRGAELLPPGIVVSESDFFLRRLWGEPSEDLKKKPKYLLTITVGLNQKKNIDRM 139
I+V TNPRGAE LPPGIVVSESDF+LRRLWG P ED+ KPKYLL TVG +QK N+D
Sbjct: 1 IWVPTNPRGAERLPPGIVVSESDFYLRRLWGSPEEDVASKPKYLLAFTVGYSQKANVDAC 60
Query: 140 VKKFSEDFQIMLFHYDGQTSQWDEFEWSKSAIHVSIRRQTKWWYAKRFLHPDVVAAYEYI 199
VKKFS++F I+LFHYDG+T++WDE EWSK AIHVS ++QTKWW+AKRFLHPD+VA YEYI
Sbjct: 61 VKKFSDNFTIVLFHYDGRTTEWDELEWSKKAIHVSAKKQTKWWFAKRFLHPDIVAPYEYI 120
Query: 200 FIWDEDLGVEHFNGDKYMELVKKHGLDISQPGLEPNNG-LTWQMTKRRGDQEVHKVTEEK 258
F+WDEDLGV++F+ ++Y+++VKKHGL+ISQPGL+P+ G +TW++TKRRGD EVHK T EK
Sbjct: 121 FLWDEDLGVDNFDAEEYIKIVKKHGLEISQPGLDPDRGQITWRITKRRGDGEVHKDTREK 180
Query: 259 PGWCSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLVHGWGLDFALRRCVEPAHEKI 318
C + PPC FVEIMAPVFSR+AWRCVWHMIQNDLVHGWGLDFALRRCVEPAHEKI
Sbjct: 181 GRCCDNSTGPPCTGFVEIMAPVFSRDAWRCVWHMIQNDLVHGWGLDFALRRCVEPAHEKI 240
Query: 319 GVVDSQWIIHQVIPSLGSQGQAENGKAPWEGVRARCKNEWSLFQNRLANADKAY 372
GVVDSQW++HQ IPSLGSQG AENGKAPW+GVR RCK EW++FQ R A A+K Y
Sbjct: 241 GVVDSQWVVHQGIPSLGSQGTAENGKAPWQGVRDRCKAEWTMFQRRWAAAEKEY 294
|
This family consists of several uncharacterized proteins from Arabidopsis thaliana. Length = 294 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 379 | |||
| PF05212 | 294 | DUF707: Protein of unknown function (DUF707); Inte | 100.0 | |
| cd02510 | 299 | pp-GalNAc-T pp-GalNAc-T initiates the formation of | 94.55 | |
| cd04186 | 166 | GT_2_like_c Subfamily of Glycosyltransferase Famil | 94.4 | |
| TIGR01556 | 281 | rhamnosyltran L-rhamnosyltransferase. Rhamnolipids | 94.32 | |
| cd02526 | 237 | GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr | 93.66 | |
| cd04185 | 202 | GT_2_like_b Subfamily of Glycosyltransferase Famil | 93.41 | |
| PF13641 | 228 | Glyco_tranf_2_3: Glycosyltransferase like family 2 | 91.63 | |
| cd02520 | 196 | Glucosylceramide_synthase Glucosylceramide synthas | 90.51 | |
| cd02525 | 249 | Succinoglycan_BP_ExoA ExoA is involved in the bios | 88.6 | |
| cd04195 | 201 | GT2_AmsE_like GT2_AmsE_like is involved in exopoly | 87.78 | |
| cd06433 | 202 | GT_2_WfgS_like WfgS and WfeV are involved in O-ant | 85.6 | |
| COG1216 | 305 | Predicted glycosyltransferases [General function p | 83.42 |
| >PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.4e-130 Score=940.78 Aligned_cols=293 Identities=65% Similarity=1.185 Sum_probs=287.2
Q ss_pred ccccCCCCCCCCCCCCccccCCccccccccCCCCcCcCCCCceEEEEEeccccccchhHHhhccccCcEEEEEEeCCCCC
Q 017018 80 IYVSTNPRGAELLPPGIVVSESDFFLRRLWGEPSEDLKKKPKYLLTITVGLNQKKNIDRMVKKFSEDFQIMLFHYDGQTS 159 (379)
Q Consensus 80 ~~~~~~p~g~e~lP~giv~~~sd~~l~~lwg~p~~~~~~~~kyLv~~~VG~~qk~~Vd~~v~kf~~~F~v~LfhYDG~~~ 159 (379)
||++++|+|+|+||+|||+++|||+||||||.|.+|...++|||||||||+|||++||++|+||++|||||||||||+++
T Consensus 1 ~~~~~~p~g~e~Lp~giv~~~sd~~~r~lw~~p~~~~~~~~k~Lla~~VG~kqk~~vd~~v~Kf~~nF~i~LfhYDg~vd 80 (294)
T PF05212_consen 1 IWVPCNPRGAERLPPGIVVRESDLELRPLWGNPSEDLPKKPKYLLAMTVGIKQKDNVDAIVKKFSDNFDIMLFHYDGRVD 80 (294)
T ss_pred CCcCCCCCccccCCCCccccCCCceeeecCCCccccccCCCceEEEEEecHHHHhhhhHHHhhhccCceEEEEEecCCcC
Confidence 68999999999999999999999999999999999999899999999999999999999999999999999999999999
Q ss_pred ccccccccCccEEEEEeccccccccccccCcccccCccEEEEecccccCCCCChHHHHHHHHHhCCcccCCCccCCCC-c
Q 017018 160 QWDEFEWSKSAIHVSIRRQTKWWYAKRFLHPDVVAAYEYIFIWDEDLGVEHFNGDKYMELVKKHGLDISQPGLEPNNG-L 238 (379)
Q Consensus 160 ~w~d~~Ws~~aihv~~~kqtKWw~akRfLHPdiva~YDYIflwDDDL~vd~f~i~ry~~Ivr~~gLeISQPALd~~S~-i 238 (379)
+|+|||||++||||+++|||||||||||||||+|++|||||||||||+||+|+|+|||+||++||||||||||+++++ +
T Consensus 81 ~w~~~~ws~~aiHv~~~kqtKww~akrfLHPdiv~~YdYiflwDeDL~vd~f~~~ry~~Ivk~~gLeISQPALd~~~~~~ 160 (294)
T PF05212_consen 81 EWDDFEWSDRAIHVSARKQTKWWFAKRFLHPDIVAPYDYIFLWDEDLGVDHFDINRYFEIVKKEGLEISQPALDPDSSEI 160 (294)
T ss_pred chhhcccccceEEEEeccceEEeehhhhcChhhhccceeEEecCCccCcCcCCHHHHHHHHHHhCCcccCcccCCCCcee
Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999875 9
Q ss_pred ceeeeeecCCcceeeeeccCCCCCCCCCCCCccceEEEecccccHHHHHHhhhhhcCCCcccchhhHhhhHhhcCCCCeE
Q 017018 239 TWQMTKRRGDQEVHKVTEEKPGWCSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLVHGWGLDFALRRCVEPAHEKI 318 (379)
Q Consensus 239 sh~iT~R~~~~~vHr~~~~~~g~C~~~~~ppcT~FVEiMAPVFSR~Awrcvw~miqnDLvhGWGLDf~~~~c~~~~~~ki 318 (379)
||++|+|++++++|+.+.+++++|+++|+||||||||||||||||+|||||||||||||+|||||||+|++|++.+++||
T Consensus 161 ~~~iT~R~~~~~vhr~~~~~~~~~~~~~~ppct~fVEiMAPVFSr~Awrcvw~miqNDLvhGWGLDf~~~~c~~~~~~ki 240 (294)
T PF05212_consen 161 HHPITKRRPDSEVHRKTRGGPRCCDDSTGPPCTGFVEIMAPVFSRAAWRCVWHMIQNDLVHGWGLDFKWGYCAGDRHKKI 240 (294)
T ss_pred eeeEEeecCCceeEeccCCCCCcCCCCCCCCcceEEEEecceechHHHHHHHhcccCCCccccchhhhHHHHhccccccE
Confidence 99999999999999988888888999999999999999999999999999999999999999999999999998889999
Q ss_pred EEEeeeeEEeeccCCCCCCCCCCCCCCchHHHHHhHHHHHHHHHHHHHHHHHHH
Q 017018 319 GVVDSQWIIHQVIPSLGSQGQAENGKAPWEGVRARCKNEWSLFQNRLANADKAY 372 (379)
Q Consensus 319 GVVDa~~VvH~g~PtLg~~g~~~~~~~~~~~vR~r~~~E~~~F~~R~~~A~k~~ 372 (379)
||||||||+|+|+||||+||.++.+.++|.+||+||++||++|++||++|+|+|
T Consensus 241 GVVDs~~VvH~gvptLG~~~~~~~~~~~~~~Vr~r~~~E~~~F~~R~~~a~~~~ 294 (294)
T PF05212_consen 241 GVVDSQYVVHTGVPTLGGQGNSEKGKDPREEVRRRSFAEMRIFQKRWANAVKEY 294 (294)
T ss_pred EEEeeEEEEEcCCCcCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 999999999999999999999999999999999999999999999999999987
|
|
| >cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans | Back alignment and domain information |
|---|
| >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >TIGR01556 rhamnosyltran L-rhamnosyltransferase | Back alignment and domain information |
|---|
| >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase | Back alignment and domain information |
|---|
| >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A | Back alignment and domain information |
|---|
| >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis | Back alignment and domain information |
|---|
| >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan | Back alignment and domain information |
|---|
| >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis | Back alignment and domain information |
|---|
| >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis | Back alignment and domain information |
|---|
| >COG1216 Predicted glycosyltransferases [General function prediction only] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 379 | |||
| 1qg8_A | 255 | Protein (spore coat polysaccharide biosynthesis P | 94.09 | |
| 3f1y_A | 387 | Mannosyl-3-phosphoglycerate synthase; GT-A type gl | 90.92 | |
| 2ffu_A | 501 | Ppgalnact-2, polypeptide N-acetylgalactosaminyltra | 89.44 | |
| 2d7i_A | 570 | Polypeptide N-acetylgalactosaminyltransferase 10; | 89.36 | |
| 1xhb_A | 472 | Polypeptide N-acetylgalactosaminyltransferase 1; g | 89.26 | |
| 2z86_A | 625 | Chondroitin synthase; GT-A, glycosyltransferase A, | 86.15 | |
| 2z86_A | 625 | Chondroitin synthase; GT-A, glycosyltransferase A, | 84.16 |
| >1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* | Back alignment and structure |
|---|
Probab=94.09 E-value=0.15 Score=44.24 Aligned_cols=125 Identities=10% Similarity=-0.074 Sum_probs=67.4
Q ss_pred cCccEEEEecccccCCCCChHHHHHHHHHh-CCcccCCCccCCCCcceeeeeecCCcceeeeeccCCCCCCCCCCCCccc
Q 017018 194 AAYEYIFIWDEDLGVEHFNGDKYMELVKKH-GLDISQPGLEPNNGLTWQMTKRRGDQEVHKVTEEKPGWCSDPHLPPCAA 272 (379)
Q Consensus 194 a~YDYIflwDDDL~vd~f~i~ry~~Ivr~~-gLeISQPALd~~S~ish~iT~R~~~~~vHr~~~~~~g~C~~~~~ppcT~ 272 (379)
+..|||++.|+|..++...++++++.++++ +..+..+....- .....+..+..........-. ....
T Consensus 87 a~g~~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~-----~~~~ 154 (255)
T 1qg8_A 87 AEGEYITYATDDNIYMPDRLLKMVRELDTHPEKAVIYSASKTY-------HLNENRDIVKETVRPAAQVTW-----NAPC 154 (255)
T ss_dssp CCCSEEEEEETTEEECTTHHHHHHHHHHHCTTCCEEEEEEEEE-------EEC---CEEEEEEECCCSCBS-----CCTT
T ss_pred cCCCEEEEeCCCCccChHHHHHHHHHHHhCCCceEEEeceEEE-------EeCCCCcchhhccCchHHHHH-----hhcC
Confidence 578999999999999988899999999876 666655543210 000001111110000000000 0011
Q ss_pred eEEEecccccHHHHHHhh-hh---hcCCCcccchhhHhhhHhhcCCCCeEEEEeeeeEEeecc
Q 017018 273 FVEIMAPVFSREAWRCVW-HM---IQNDLVHGWGLDFALRRCVEPAHEKIGVVDSQWIIHQVI 331 (379)
Q Consensus 273 FVEiMAPVFSR~Awrcvw-~m---iqnDLvhGWGLDf~~~~c~~~~~~kiGVVDa~~VvH~g~ 331 (379)
++-..+-+|+|++++.+- .+ +..+.....+-|+.+...+... .++..++...+.|+..
T Consensus 155 ~~~~~~~~~rr~~~~~~g~~f~~~~~~~~~~~~~eD~~~~~r~~~~-g~~~~~~~~~~~~r~~ 216 (255)
T 1qg8_A 155 AIDHCSVMHRYSVLEKVKEKFGSYWDESPAFYRIGDARFFWRVNHF-YPFYPLDEELDLNYIT 216 (255)
T ss_dssp TSCGGGEEEETHHHHHHHHHHSSSSCCCGGGSTTHHHHHHHHHTTT-CCBEEEEEEEEEEEEC
T ss_pred CCccceEEEEHHHHHhhcccccccccCCcccchHHHHHHHHHHHHh-CCEEEecCcEEEEEEc
Confidence 222234479999998774 33 1112222345576554444323 3699999998888743
|
| >3f1y_A Mannosyl-3-phosphoglycerate synthase; GT-A type glycosyltransferase, GT-81, mannosyl-3-phosphoglyc synthase, GDP-mannose, transferas; 2.20A {Rubrobacter xylanophilus} PDB: 3kia_A* 3o3p_A* | Back alignment and structure |
|---|
| >2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* | Back alignment and structure |
|---|
| >2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* | Back alignment and structure |
|---|
| >1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 | Back alignment and structure |
|---|
| >2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* | Back alignment and structure |
|---|
| >2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 379 | |||
| d1xhba2 | 328 | Polypeptide N-acetylgalactosaminyltransferase 1, N | 90.45 |
| >d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Nucleotide-diphospho-sugar transferases superfamily: Nucleotide-diphospho-sugar transferases family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain species: Mouse (Mus musculus) [TaxId: 10090]
Probab=90.45 E-value=0.49 Score=40.97 Aligned_cols=137 Identities=12% Similarity=0.080 Sum_probs=74.9
Q ss_pred ccCccEEEEecccccCCCCChHHHHHHHHHhCCcccCCCccCCCCcceeeeee--------cCCcceeeeecc---CCCC
Q 017018 193 VAAYEYIFIWDEDLGVEHFNGDKYMELVKKHGLDISQPGLEPNNGLTWQMTKR--------RGDQEVHKVTEE---KPGW 261 (379)
Q Consensus 193 va~YDYIflwDDDL~vd~f~i~ry~~Ivr~~gLeISQPALd~~S~ish~iT~R--------~~~~~vHr~~~~---~~g~ 261 (379)
.+.-|||++.|+|..++.-.++++++.+++..-.+..|.++.-..-++..... ......+..... ....
T Consensus 105 ~a~gd~i~flD~D~~~~p~~l~~l~~~~~~~~~~~v~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (328)
T d1xhba2 105 VSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRR 184 (328)
T ss_dssp HCCSSEEEEEESSEEECTTCHHHHHHHHHHCTTEEEEEEEEEECSSSCCEECCCTTEEEEECTTCCEEEEECCHHHHHHT
T ss_pred hhhcceeeecCcccccChhHHHHHHHHHhcCCCeEEecceeeeccCceeeccCCccccccccccccccccccchhhhhcc
Confidence 36789999999999999999999999999988877777653221111111100 001111111100 0001
Q ss_pred CCCCCCCCccceEEEecccccHHHHHHhhhhhcCCCcccch---hhHhhhHhhcCCCCeEEEEeeeeEEeeccCC
Q 017018 262 CSDPHLPPCAAFVEIMAPVFSREAWRCVWHMIQNDLVHGWG---LDFALRRCVEPAHEKIGVVDSQWIIHQVIPS 333 (379)
Q Consensus 262 C~~~~~ppcT~FVEiMAPVFSR~Awrcvw~miqnDLvhGWG---LDf~~~~c~~~~~~kiGVVDa~~VvH~g~Pt 333 (379)
..+...+..+.++...+=.++|++|..+=.+ ..+. ..|| +|+.+... ....+|..+-...|.|..+-.
T Consensus 185 ~~~~~~~~~~~~~~g~~~~irr~~f~~vGgf-De~~-~~~g~ED~Dl~~R~~--~~G~~i~~~p~~~v~H~~~~~ 255 (328)
T d1xhba2 185 KGDRTLPVRTPTMAGGLFSIDRDYFQEIGTY-DAGM-DIWGGENLEISFRIW--QCGGTLEIVTCSHVGHVFRKA 255 (328)
T ss_dssp TTCTTSCEECSBCCSSSEEEEHHHHHHTTSC-CTTS-CTTCCCCSHHHHHHH--HTTCEEEEEEEEEEEEEC---
T ss_pred ccccccccccceecceeeeeeHHHHHHhCCC-CCCC-cCcCchHHHHHHHHH--HhCCeEEEeCCeEEEEeCCCC
Confidence 1111222222333333335799999987333 2333 3565 34544433 345799999999999975444
|