Citrus Sinensis ID: 013577
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 440 | ||||||
| 356540241 | 399 | PREDICTED: uncharacterized protein LOC10 | 0.902 | 0.994 | 0.750 | 0.0 | |
| 359473149 | 491 | PREDICTED: uncharacterized protein LOC10 | 0.890 | 0.798 | 0.763 | 0.0 | |
| 356567576 | 399 | PREDICTED: uncharacterized protein LOC10 | 0.902 | 0.994 | 0.743 | 0.0 | |
| 356512666 | 397 | PREDICTED: uncharacterized protein LOC10 | 0.893 | 0.989 | 0.742 | 0.0 | |
| 356525441 | 397 | PREDICTED: uncharacterized protein LOC10 | 0.893 | 0.989 | 0.742 | 0.0 | |
| 296081357 | 385 | unnamed protein product [Vitis vinifera] | 0.868 | 0.992 | 0.766 | 0.0 | |
| 255578345 | 395 | conserved hypothetical protein [Ricinus | 0.893 | 0.994 | 0.722 | 0.0 | |
| 357463241 | 399 | hypothetical protein MTR_3g086610 [Medic | 0.9 | 0.992 | 0.718 | 0.0 | |
| 225452298 | 402 | PREDICTED: uncharacterized protein LOC10 | 0.904 | 0.990 | 0.717 | 0.0 | |
| 388518623 | 399 | unknown [Medicago truncatula] | 0.9 | 0.992 | 0.716 | 0.0 |
| >gi|356540241|ref|XP_003538598.1| PREDICTED: uncharacterized protein LOC100778486 [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/429 (75%), Positives = 357/429 (83%), Gaps = 32/429 (7%)
Query: 14 MGTAARSSINRKTNESMRLIVTTFVGVVFGFFIGVSFPTLSLTKLTLPSSLLPTVE--YV 71
MG RSS RK +E MR VTTF+ +V GFFIGVS PTLS+TKL LPS LLP+++ Y+
Sbjct: 1 MGIITRSSAGRKPSEIMRFFVTTFISIVLGFFIGVSVPTLSITKLNLPSGLLPSIDLSYI 60
Query: 72 EDQRSGLSTQRFLNVFNPLNRNNGSSTRAQVLNDTSKIWVPTNPRGAERLPPKIVRAESD 131
ED+ +G F+N N N SS++ Q+ NDTSKIWVP+NPRGAERLPP IV AESD
Sbjct: 61 EDRYTGRQAWSFMN-----NGNKRSSSQVQLSNDTSKIWVPSNPRGAERLPPGIVEAESD 115
Query: 132 FYLRRLWGNPNEDLTSQPKYLVTFTVGYDQKNNINAAIKKVGFCFPPTIFFLCVHLCVTN 191
+YLRRLWG P+EDLTS+P YLVTFTVGYDQKNNI+AA+KK
Sbjct: 116 YYLRRLWGKPSEDLTSRPNYLVTFTVGYDQKNNIDAAVKK-------------------- 155
Query: 192 ICSKQFSDNFTIVLFHYDGRTTEWNEFEWSKRAIHVSVRKQTKWWYAKRFLHPDIVASYD 251
FS NFTI+LFHYDGRTTEW+EFEWSK+AIHVSV KQTKWWYAKRFLHPDIVA YD
Sbjct: 156 -----FSGNFTILLFHYDGRTTEWDEFEWSKQAIHVSVHKQTKWWYAKRFLHPDIVAPYD 210
Query: 252 YIFIWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNKGLTWQMTKRRGDREVHKETEE 311
YIFIWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPN+GLTWQMTKRRGD+EVHK TEE
Sbjct: 211 YIFIWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNRGLTWQMTKRRGDQEVHKVTEE 270
Query: 312 KPGWCSNPHLPPCAAFVEIMAPVFSRDAWRCVWHMIQNDLVHGWGLDFALRKCVEPAHEK 371
KPGWCS+PHLPPCAAFVEIMAPVFSRDAWRCVWHMIQNDLVHGWGLDFALRKCVEPAHEK
Sbjct: 271 KPGWCSDPHLPPCAAFVEIMAPVFSRDAWRCVWHMIQNDLVHGWGLDFALRKCVEPAHEK 330
Query: 372 IGVVDSQWIVHQTVPSLGNQGESKDGKAPWQGVRERCKKEWTMFQGRMANAEKAYFQALG 431
IGVVDSQWIVHQ +PSLGNQGES+ GKAPWQGVRERC+KEWTMFQ R+ANAE AY ++G
Sbjct: 331 IGVVDSQWIVHQGIPSLGNQGESQTGKAPWQGVRERCRKEWTMFQSRLANAEYAYHNSIG 390
Query: 432 ADTTDSSAH 440
DT++S+AH
Sbjct: 391 IDTSNSTAH 399
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|359473149|ref|XP_002282407.2| PREDICTED: uncharacterized protein LOC100243914 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356567576|ref|XP_003551994.1| PREDICTED: uncharacterized protein LOC100811265 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356512666|ref|XP_003525038.1| PREDICTED: uncharacterized protein LOC100785650 isoform 1 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356525441|ref|XP_003531333.1| PREDICTED: uncharacterized protein LOC100805311 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|296081357|emb|CBI16790.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255578345|ref|XP_002530039.1| conserved hypothetical protein [Ricinus communis] gi|223530455|gb|EEF32339.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|357463241|ref|XP_003601902.1| hypothetical protein MTR_3g086610 [Medicago truncatula] gi|355490950|gb|AES72153.1| hypothetical protein MTR_3g086610 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|225452298|ref|XP_002272341.1| PREDICTED: uncharacterized protein LOC100264636 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|388518623|gb|AFK47373.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 440 | ||||||
| TAIR|locus:2008640 | 404 | AT1G67850 "AT1G67850" [Arabido | 0.559 | 0.608 | 0.849 | 3.9e-155 | |
| TAIR|locus:2205140 | 382 | AT1G08040 [Arabidopsis thalian | 0.540 | 0.623 | 0.810 | 3.8e-148 | |
| TAIR|locus:2057557 | 374 | AT2G28310 "AT2G28310" [Arabido | 0.543 | 0.639 | 0.803 | 2.7e-145 | |
| TAIR|locus:2086721 | 398 | AT3G27470 "AT3G27470" [Arabido | 0.559 | 0.618 | 0.760 | 1.3e-141 | |
| TAIR|locus:2031905 | 401 | AT1G13000 [Arabidopsis thalian | 0.55 | 0.603 | 0.772 | 7.4e-137 | |
| TAIR|locus:2079291 | 396 | AT3G26440 [Arabidopsis thalian | 0.559 | 0.621 | 0.747 | 6e-133 | |
| TAIR|locus:504956279 | 381 | AT1G24570 [Arabidopsis thalian | 0.559 | 0.645 | 0.754 | 3.9e-108 | |
| TAIR|locus:2008425 | 425 | AT1G61240 [Arabidopsis thalian | 0.497 | 0.515 | 0.491 | 2.9e-64 | |
| TAIR|locus:2124608 | 389 | AT4G18530 "AT4G18530" [Arabido | 0.456 | 0.516 | 0.509 | 2e-63 | |
| TAIR|locus:2202124 | 438 | AT1G11170 [Arabidopsis thalian | 0.497 | 0.5 | 0.495 | 2.8e-57 |
| TAIR|locus:2008640 AT1G67850 "AT1G67850" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1221 (434.9 bits), Expect = 3.9e-155, Sum P(2) = 3.9e-155
Identities = 209/246 (84%), Positives = 232/246 (94%)
Query: 193 CSKQFSDNFTIVLFHYDGRTTEWNEFEWSKRAIHVSVRKQTKWWYAKRFLHPDIVASYDY 252
C K+FS+NFTIVLFHYDGR TEW+EFEWSK AIH+SVRKQTKWWYAKRFLHPDIVA YDY
Sbjct: 154 CVKKFSENFTIVLFHYDGRVTEWDEFEWSKTAIHISVRKQTKWWYAKRFLHPDIVARYDY 213
Query: 253 IFIWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNKGLTWQMTKRRGDREVHKETEEK 312
IF+WDEDLGVEHFNAEEY+K+V+KHGLEISQPGLEPN+GLTWQMTKRRGD EVHK TEE+
Sbjct: 214 IFVWDEDLGVEHFNAEEYVKMVKKHGLEISQPGLEPNQGLTWQMTKRRGDMEVHKITEER 273
Query: 313 PGWCSNPHLPPCAAFVEIMAPVFSRDAWRCVWHMIQNDLVHGWGLDFALRKCVEPAHEKI 372
PGWCS+PHLPPCAAFVEIMAPVFSR+AWRCVWH+IQNDLVHGWGLDFALR+CVEPAHEKI
Sbjct: 274 PGWCSDPHLPPCAAFVEIMAPVFSRNAWRCVWHVIQNDLVHGWGLDFALRRCVEPAHEKI 333
Query: 373 GVVDSQWIVHQTVPSLGNQGESKDGKAPWQGVRERCKKEWTMFQGRMANAEKAYFQALGA 432
GVVDSQW+VHQ+ PSLGNQGE+ DGKAPWQGVR+RCKKEWTMFQ RMANAEK YF++L
Sbjct: 334 GVVDSQWVVHQSFPSLGNQGEATDGKAPWQGVRDRCKKEWTMFQSRMANAEKDYFKSLQV 393
Query: 433 DTTDSS 438
+ + +S
Sbjct: 394 EGSSNS 399
|
|
| TAIR|locus:2205140 AT1G08040 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2057557 AT2G28310 "AT2G28310" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2086721 AT3G27470 "AT3G27470" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2031905 AT1G13000 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2079291 AT3G26440 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:504956279 AT1G24570 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008425 AT1G61240 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2124608 AT4G18530 "AT4G18530" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2202124 AT1G11170 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00030765001 | SubName- Full=Chromosome chr1 scaffold_5, whole genome shotgun sequence; (385 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 440 | |||
| pfam05212 | 294 | pfam05212, DUF707, Protein of unknown function (DU | 0.0 |
| >gnl|CDD|218501 pfam05212, DUF707, Protein of unknown function (DUF707) | Back alignment and domain information |
|---|
Score = 580 bits (1497), Expect = 0.0
Identities = 230/319 (72%), Positives = 264/319 (82%), Gaps = 26/319 (8%)
Query: 109 IWVPTNPRGAERLPPKIVRAESDFYLRRLWGNPNEDLTSQPKYLVTFTVGYDQKNNINAA 168
IWVPTNPRGAERLPP IV +ESDFYLRRLWG+P ED+ S+PKYL+ FTVGY QK N++A
Sbjct: 1 IWVPTNPRGAERLPPGIVVSESDFYLRRLWGSPEEDVASKPKYLLAFTVGYSQKANVDAC 60
Query: 169 IKKVGFCFPPTIFFLCVHLCVTNICSKQFSDNFTIVLFHYDGRTTEWNEFEWSKRAIHVS 228
+KK FSDNFTIVLFHYDGRTTEW+E EWSK+AIHVS
Sbjct: 61 VKK-------------------------FSDNFTIVLFHYDGRTTEWDELEWSKKAIHVS 95
Query: 229 VRKQTKWWYAKRFLHPDIVASYDYIFIWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEP 288
+KQTKWW+AKRFLHPDIVA Y+YIF+WDEDLGV++F+AEEYIK+V+KHGLEISQPGL+P
Sbjct: 96 AKKQTKWWFAKRFLHPDIVAPYEYIFLWDEDLGVDNFDAEEYIKIVKKHGLEISQPGLDP 155
Query: 289 NKG-LTWQMTKRRGDREVHKETEEKPGWCSNPHLPPCAAFVEIMAPVFSRDAWRCVWHMI 347
++G +TW++TKRRGD EVHK+T EK C N PPC FVEIMAPVFSRDAWRCVWHMI
Sbjct: 156 DRGQITWRITKRRGDGEVHKDTREKGRCCDNSTGPPCTGFVEIMAPVFSRDAWRCVWHMI 215
Query: 348 QNDLVHGWGLDFALRKCVEPAHEKIGVVDSQWIVHQTVPSLGNQGESKDGKAPWQGVRER 407
QNDLVHGWGLDFALR+CVEPAHEKIGVVDSQW+VHQ +PSLG+QG +++GKAPWQGVR+R
Sbjct: 216 QNDLVHGWGLDFALRRCVEPAHEKIGVVDSQWVVHQGIPSLGSQGTAENGKAPWQGVRDR 275
Query: 408 CKKEWTMFQGRMANAEKAY 426
CK EWTMFQ R A AEK Y
Sbjct: 276 CKAEWTMFQRRWAAAEKEY 294
|
This family consists of several uncharacterized proteins from Arabidopsis thaliana. Length = 294 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 440 | |||
| PF05212 | 294 | DUF707: Protein of unknown function (DUF707); Inte | 100.0 | |
| cd02510 | 299 | pp-GalNAc-T pp-GalNAc-T initiates the formation of | 94.76 | |
| cd04185 | 202 | GT_2_like_b Subfamily of Glycosyltransferase Famil | 94.54 | |
| TIGR01556 | 281 | rhamnosyltran L-rhamnosyltransferase. Rhamnolipids | 94.43 | |
| cd04186 | 166 | GT_2_like_c Subfamily of Glycosyltransferase Famil | 94.4 | |
| cd02526 | 237 | GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr | 93.88 | |
| PF13641 | 228 | Glyco_tranf_2_3: Glycosyltransferase like family 2 | 91.81 | |
| cd02520 | 196 | Glucosylceramide_synthase Glucosylceramide synthas | 90.99 | |
| cd02525 | 249 | Succinoglycan_BP_ExoA ExoA is involved in the bios | 89.2 | |
| cd04195 | 201 | GT2_AmsE_like GT2_AmsE_like is involved in exopoly | 88.98 | |
| COG1216 | 305 | Predicted glycosyltransferases [General function p | 86.58 | |
| cd06433 | 202 | GT_2_WfgS_like WfgS and WfeV are involved in O-ant | 85.32 | |
| cd06421 | 234 | CESA_CelA_like CESA_CelA_like are involved in the | 83.4 | |
| cd06437 | 232 | CESA_CaSu_A2 Cellulose synthase catalytic subunit | 81.82 | |
| PLN02726 | 243 | dolichyl-phosphate beta-D-mannosyltransferase | 80.98 | |
| cd06442 | 224 | DPM1_like DPM1_like represents putative enzymes si | 80.05 | |
| PF01762 | 195 | Galactosyl_T: Galactosyltransferase; InterPro: IPR | 80.01 |
| >PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana | Back alignment and domain information |
|---|
Probab=100.00 E-value=6.4e-127 Score=931.05 Aligned_cols=293 Identities=65% Similarity=1.194 Sum_probs=287.3
Q ss_pred eeccCCCCCCCCCCCCceecCCCccccccCCCCCCCCCCCCceEEEEEeccccccchHHHHhhhccCCCCcccccccccc
Q 013577 109 IWVPTNPRGAERLPPKIVRAESDFYLRRLWGNPNEDLTSQPKYLVTFTVGYDQKNNINAAIKKVGFCFPPTIFFLCVHLC 188 (440)
Q Consensus 109 ~~~~~~p~g~e~lp~giv~~~sd~~~r~Lwg~~~~d~~~~~~~Lla~~vG~~qk~~v~~~v~k~~~~f~~~~~~~~~~~~ 188 (440)
||++++|+|+|+||||||+++|||+||||||+|++|...++|||||||||+|||++||++|+|
T Consensus 1 ~~~~~~p~g~e~Lp~giv~~~sd~~~r~lw~~p~~~~~~~~k~Lla~~VG~kqk~~vd~~v~K----------------- 63 (294)
T PF05212_consen 1 IWVPCNPRGAERLPPGIVVRESDLELRPLWGNPSEDLPKKPKYLLAMTVGIKQKDNVDAIVKK----------------- 63 (294)
T ss_pred CCcCCCCCccccCCCCccccCCCceeeecCCCccccccCCCceEEEEEecHHHHhhhhHHHhh-----------------
Confidence 689999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccccCCCCCCcEEEEEEECCCCCcccccccCCcceEEEeeccccccchhhccCcccccCccEEEEecCccccCCCCHH
Q 013577 189 VTNICSKQFSDNFTIVLFHYDGRTTEWNEFEWSKRAIHVSVRKQTKWWYAKRFLHPDIVASYDYIFIWDEDLGVEHFNAE 268 (440)
Q Consensus 189 ~~~~~s~~~~~nFdV~LFhYDG~v~eW~d~~Ws~~aiHVsa~kqtKWw~akRFLHPdiva~YDYIfLwDDDL~Vd~f~i~ 268 (440)
|++|||||||||||++++|++|+||++||||+++|||||||||||||||+|++|||||||||||+||+|+|+
T Consensus 64 --------f~~nF~i~LfhYDg~vd~w~~~~ws~~aiHv~~~kqtKww~akrfLHPdiv~~YdYiflwDeDL~vd~f~~~ 135 (294)
T PF05212_consen 64 --------FSDNFDIMLFHYDGRVDEWDDFEWSDRAIHVSARKQTKWWFAKRFLHPDIVAPYDYIFLWDEDLGVDHFDIN 135 (294)
T ss_pred --------hccCceEEEEEecCCcCchhhcccccceEEEEeccceEEeehhhhcChhhhccceeEEecCCccCcCcCCHH
Confidence 789999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHhCCcccCCCccCCCC-cceeeeeecCCcceeeecccCCCCCCCCCCCCccceEEeccccccHHHHHHhhhhh
Q 013577 269 EYIKLVRKHGLEISQPGLEPNKG-LTWQMTKRRGDREVHKETEEKPGWCSNPHLPPCAAFVEIMAPVFSRDAWRCVWHMI 347 (440)
Q Consensus 269 ryf~Ivr~~gLeISQPALd~~S~-ish~iT~R~~~~~vHr~~~~~~g~c~~~~~ppcTgFVEIMAPVFSR~Awrcvw~mi 347 (440)
|||+||++||||||||||+++++ +||++|+|+++.++||.+.+++++|+++++||||||||||||||||+|||||||||
T Consensus 136 ry~~Ivk~~gLeISQPALd~~~~~~~~~iT~R~~~~~vhr~~~~~~~~~~~~~~ppct~fVEiMAPVFSr~Awrcvw~mi 215 (294)
T PF05212_consen 136 RYFEIVKKEGLEISQPALDPDSSEIHHPITKRRPDSEVHRKTRGGPRCCDDSTGPPCTGFVEIMAPVFSRAAWRCVWHMI 215 (294)
T ss_pred HHHHHHHHhCCcccCcccCCCCceeeeeEEeecCCceeEeccCCCCCcCCCCCCCCcceEEEEecceechHHHHHHHhcc
Confidence 99999999999999999999875 99999999999999998888888999999999999999999999999999999999
Q ss_pred cCCCcceehHHHHhhHhhcCCCCcEEEEeeeeEEeeccCCCCCCCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH
Q 013577 348 QNDLVHGWGLDFALRKCVEPAHEKIGVVDSQWIVHQTVPSLGNQGESKDGKAPWQGVRERCKKEWTMFQGRMANAEKAY 426 (440)
Q Consensus 348 QNDLvhGWGLDf~w~~Ca~~~~~kIGVVDa~~VvH~g~PtLG~qG~~~~~~~~~~~VR~r~~~E~~~F~~R~~~A~k~~ 426 (440)
||||+|||||||+|++|++.+++||||||||||+|+++||||+||.++.+.++|.+||+||++||++|++||++|+|+|
T Consensus 216 qNDLvhGWGLDf~~~~c~~~~~~kiGVVDs~~VvH~gvptLG~~~~~~~~~~~~~~Vr~r~~~E~~~F~~R~~~a~~~~ 294 (294)
T PF05212_consen 216 QNDLVHGWGLDFKWGYCAGDRHKKIGVVDSQYVVHTGVPTLGGQGNSEKGKDPREEVRRRSFAEMRIFQKRWANAVKEY 294 (294)
T ss_pred cCCCccccchhhhHHHHhccccccEEEEeeEEEEEcCCCcCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 9999999999999999998889999999999999999999999999989999999999999999999999999999987
|
|
| >cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans | Back alignment and domain information |
|---|
| >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >TIGR01556 rhamnosyltran L-rhamnosyltransferase | Back alignment and domain information |
|---|
| >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase | Back alignment and domain information |
|---|
| >PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A | Back alignment and domain information |
|---|
| >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis | Back alignment and domain information |
|---|
| >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan | Back alignment and domain information |
|---|
| >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis | Back alignment and domain information |
|---|
| >COG1216 Predicted glycosyltransferases [General function prediction only] | Back alignment and domain information |
|---|
| >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis | Back alignment and domain information |
|---|
| >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose | Back alignment and domain information |
|---|
| >cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex | Back alignment and domain information |
|---|
| >PLN02726 dolichyl-phosphate beta-D-mannosyltransferase | Back alignment and domain information |
|---|
| >cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1 | Back alignment and domain information |
|---|
| >PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 440 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 1e-04 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 43.7 bits (102), Expect = 1e-04
Identities = 54/364 (14%), Positives = 101/364 (27%), Gaps = 132/364 (36%)
Query: 31 RLIVTT-------FVGVVFGFFIGVSFPTLSLTK----------LTLPSSLLPTVEYV-- 71
++++TT F+ I + +++LT L LP E +
Sbjct: 267 KILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLP-REVLTT 325
Query: 72 ---------EDQRSGLST-QRFLNV-FNPLNRNNGSSTRAQ-------------VLNDTS 107
E R GL+T + +V + L SS V ++
Sbjct: 326 NPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIESSLNVLEPAEYRKMFDRLSVFPPSA 385
Query: 108 KIWVPTNPRGAERLPPKIVRAESDFYLRRLWGNPNED--------------LTSQPKYLV 153
I P + L +W + + + QPK
Sbjct: 386 HI------------PTIL--------LSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKES- 424
Query: 154 TFTVGYD------QKNNINAAI-KKV--------GFC----FPPTI-----FFLCVHLCV 189
T ++ K A+ + + F PP + + HL
Sbjct: 425 TISI-PSIYLELKVKLENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQYFYSHIGHHL-- 481
Query: 190 TNICSKQFSDNFTIVLFHYDGRTTEWNEFEWSKRAIHVSVRKQTKWWYAKRFLHPDI--V 247
NI + F +V +F + ++ I R + W A + + +
Sbjct: 482 KNIEHPERMTLFRMVFL----------DFRFLEQKI----RHDSTAWNASGSILNTLQQL 527
Query: 248 ASY-DYIFIWDEDL-----GVEHF--NAEEYIKLVRKHGLEISQPGLEPNKGLTWQMTKR 299
Y YI D + F EE L+ ++ + L ++ +
Sbjct: 528 KFYKPYICDNDPKYERLVNAILDFLPKIEE--NLICSKYTDLLRIALMAEDEAIFEEAHK 585
Query: 300 RGDR 303
+ R
Sbjct: 586 QVQR 589
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 440 | |||
| 1qg8_A | 255 | Protein (spore coat polysaccharide biosynthesis P | 93.46 | |
| 3f1y_A | 387 | Mannosyl-3-phosphoglycerate synthase; GT-A type gl | 90.98 | |
| 2d7i_A | 570 | Polypeptide N-acetylgalactosaminyltransferase 10; | 90.68 | |
| 1xhb_A | 472 | Polypeptide N-acetylgalactosaminyltransferase 1; g | 89.9 | |
| 2ffu_A | 501 | Ppgalnact-2, polypeptide N-acetylgalactosaminyltra | 89.54 | |
| 2z86_A | 625 | Chondroitin synthase; GT-A, glycosyltransferase A, | 88.89 | |
| 2z86_A | 625 | Chondroitin synthase; GT-A, glycosyltransferase A, | 83.45 |
| >1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* | Back alignment and structure |
|---|
Probab=93.46 E-value=0.23 Score=43.66 Aligned_cols=120 Identities=10% Similarity=-0.036 Sum_probs=68.9
Q ss_pred cCccEEEEecCccccCCCCHHHHHHHHHHh-CCcccCCCcc-----CCCCcceeeeeecCCcceeeecccCCCCCCCCCC
Q 013577 248 ASYDYIFIWDEDLGVEHFNAEEYIKLVRKH-GLEISQPGLE-----PNKGLTWQMTKRRGDREVHKETEEKPGWCSNPHL 321 (440)
Q Consensus 248 a~YDYIfLwDDDL~Vd~f~i~ryf~Ivr~~-gLeISQPALd-----~~S~ish~iT~R~~~~~vHr~~~~~~g~c~~~~~ 321 (440)
+..|||++.|+|..++.-.++++++.++++ +..+..+... .+....+.. . .+...+..
T Consensus 87 a~g~~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~-------------- 150 (255)
T 1qg8_A 87 AEGEYITYATDDNIYMPDRLLKMVRELDTHPEKAVIYSASKTYHLNENRDIVKET-V-RPAAQVTW-------------- 150 (255)
T ss_dssp CCCSEEEEEETTEEECTTHHHHHHHHHHHCTTCCEEEEEEEEEEEC---CEEEEE-E-ECCCSCBS--------------
T ss_pred cCCCEEEEeCCCCccChHHHHHHHHHHHhCCCceEEEeceEEEEeCCCCcchhhc-c-CchHHHHH--------------
Confidence 578999999999999998899999999876 6666655442 111110100 0 01100000
Q ss_pred CCccceEEeccccccHHHHHHhh-hh---hcCCCcceehHHHHhhHhhcCCCCcEEEEeeeeEEeecc
Q 013577 322 PPCAAFVEIMAPVFSRDAWRCVW-HM---IQNDLVHGWGLDFALRKCVEPAHEKIGVVDSQWIVHQTV 385 (440)
Q Consensus 322 ppcTgFVEIMAPVFSR~Awrcvw-~m---iQNDLvhGWGLDf~w~~Ca~~~~~kIGVVDa~~VvH~g~ 385 (440)
....++-..+-+|+|++++.+- .+ +..+.....+-|+.+...+... .++..++...+.|+..
T Consensus 151 -~~~~~~~~~~~~~rr~~~~~~g~~f~~~~~~~~~~~~~eD~~~~~r~~~~-g~~~~~~~~~~~~r~~ 216 (255)
T 1qg8_A 151 -NAPCAIDHCSVMHRYSVLEKVKEKFGSYWDESPAFYRIGDARFFWRVNHF-YPFYPLDEELDLNYIT 216 (255)
T ss_dssp -CCTTTSCGGGEEEETHHHHHHHHHHSSSSCCCGGGSTTHHHHHHHHHTTT-CCBEEEEEEEEEEEEC
T ss_pred -hhcCCCccceEEEEHHHHHhhcccccccccCCcccchHHHHHHHHHHHHh-CCEEEecCcEEEEEEc
Confidence 0011222234589999999875 33 1112222345676655444333 3699999998888743
|
| >3f1y_A Mannosyl-3-phosphoglycerate synthase; GT-A type glycosyltransferase, GT-81, mannosyl-3-phosphoglyc synthase, GDP-mannose, transferas; 2.20A {Rubrobacter xylanophilus} PDB: 3kia_A* 3o3p_A* | Back alignment and structure |
|---|
| >2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* | Back alignment and structure |
|---|
| >1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 | Back alignment and structure |
|---|
| >2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* | Back alignment and structure |
|---|
| >2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* | Back alignment and structure |
|---|
| >2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 440 | |||
| d1xhba2 | 328 | Polypeptide N-acetylgalactosaminyltransferase 1, N | 91.19 |
| >d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Nucleotide-diphospho-sugar transferases superfamily: Nucleotide-diphospho-sugar transferases family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain species: Mouse (Mus musculus) [TaxId: 10090]
Probab=91.19 E-value=0.32 Score=42.77 Aligned_cols=139 Identities=9% Similarity=0.015 Sum_probs=75.6
Q ss_pred cCccEEEEecCccccCCCCHHHHHHHHHHhCCcccCCCccCCCCcceeeeee--------cCCcceeeeccc---CCCCC
Q 013577 248 ASYDYIFIWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNKGLTWQMTKR--------RGDREVHKETEE---KPGWC 316 (440)
Q Consensus 248 a~YDYIfLwDDDL~Vd~f~i~ryf~Ivr~~gLeISQPALd~~S~ish~iT~R--------~~~~~vHr~~~~---~~g~c 316 (440)
+.-|||++.|+|..++.-.++++++.+++..-.+..|.++.-..-++..... ......+..... .....
T Consensus 106 a~gd~i~flD~D~~~~p~~l~~l~~~~~~~~~~~v~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (328)
T d1xhba2 106 SRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 185 (328)
T ss_dssp CCSSEEEEEESSEEECTTCHHHHHHHHHHCTTEEEEEEEEEECSSSCCEECCCTTEEEEECTTCCEEEEECCHHHHHHTT
T ss_pred hhcceeeecCcccccChhHHHHHHHHHhcCCCeEEecceeeeccCceeeccCCccccccccccccccccccchhhhhccc
Confidence 6789999999999999999999999999988888777654221111110000 000111110000 00011
Q ss_pred CCCCCCCccceEEeccccccHHHHHHhhhhhcCCCcceeh-HHHHhhHhhcCCCCcEEEEeeeeEEeeccCCC
Q 013577 317 SNPHLPPCAAFVEIMAPVFSRDAWRCVWHMIQNDLVHGWG-LDFALRKCVEPAHEKIGVVDSQWIVHQTVPSL 388 (440)
Q Consensus 317 ~~~~~ppcTgFVEIMAPVFSR~Awrcvw~miQNDLvhGWG-LDf~w~~Ca~~~~~kIGVVDa~~VvH~g~PtL 388 (440)
.+...+..+.++...+=.++|++|..+=.+ . +....|| =|.-+..-+.....+|..+-...|.|..+-..
T Consensus 186 ~~~~~~~~~~~~~g~~~~irr~~f~~vGgf-D-e~~~~~g~ED~Dl~~R~~~~G~~i~~~p~~~v~H~~~~~~ 256 (328)
T d1xhba2 186 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTY-D-AGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKAT 256 (328)
T ss_dssp TCTTSCEECSBCCSSSEEEEHHHHHHTTSC-C-TTSCTTCCCCSHHHHHHHHTTCEEEEEEEEEEEEEC----
T ss_pred cccccccccceecceeeeeeHHHHHHhCCC-C-CCCcCcCchHHHHHHHHHHhCCeEEEeCCeEEEEeCCCCC
Confidence 112222223333333345799999987433 3 3234564 34444333333457999999999999754444
|