Citrus Sinensis ID: 021185
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 316 | ||||||
| 359486223 | 413 | PREDICTED: uncharacterized protein LOC10 | 0.977 | 0.748 | 0.709 | 1e-132 | |
| 240255999 | 389 | uncharacterized protein [Arabidopsis tha | 0.892 | 0.724 | 0.727 | 1e-126 | |
| 255575938 | 370 | conserved hypothetical protein [Ricinus | 0.889 | 0.759 | 0.683 | 1e-124 | |
| 356560121 | 385 | PREDICTED: uncharacterized protein LOC10 | 0.930 | 0.763 | 0.680 | 1e-123 | |
| 356541848 | 387 | PREDICTED: uncharacterized protein LOC10 | 0.936 | 0.764 | 0.686 | 1e-122 | |
| 357445211 | 406 | hypothetical protein MTR_2g005330 [Medic | 0.930 | 0.724 | 0.656 | 1e-121 | |
| 224120054 | 314 | predicted protein [Populus trichocarpa] | 0.952 | 0.958 | 0.676 | 1e-119 | |
| 224129464 | 299 | predicted protein [Populus trichocarpa] | 0.905 | 0.956 | 0.679 | 1e-118 | |
| 359476620 | 466 | PREDICTED: uncharacterized protein LOC10 | 0.990 | 0.671 | 0.644 | 1e-118 | |
| 449462816 | 414 | PREDICTED: uncharacterized protein LOC10 | 0.965 | 0.736 | 0.632 | 1e-118 |
| >gi|359486223|ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera] gi|297739491|emb|CBI29673.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 477 bits (1227), Expect = e-132, Method: Compositional matrix adjust.
Identities = 220/310 (70%), Positives = 256/310 (82%), Gaps = 1/310 (0%)
Query: 1 MRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWK 60
++PLW + P +LLA+A GIKQK+IV+QIV KF +FVVMLFHYDGVVDEW+
Sbjct: 97 VQPLWGATLNGEKSSPSKSLLAMAVGIKQKEIVNQIVEKFILSNFVVMLFHYDGVVDEWR 156
Query: 61 DLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKD 120
+ W+D AIHV+ NQTKWWFAKRFLHPDIVAEYNYIFLWDED+GVENF+P RY+SIV+D
Sbjct: 157 EFAWSDHAIHVTVVNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFHPGRYVSIVED 216
Query: 121 EGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMA 180
EGLEISQPALDP KS VHH ITAR RNS+ HRR YK++GSGRCDD STAPPC+GWVEMMA
Sbjct: 217 EGLEISQPALDPKKSRVHHQITARVRNSRVHRRTYKHRGSGRCDDQSTAPPCVGWVEMMA 276
Query: 181 PVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVT 240
PVFS+AAWRC W+MIQN+LIHAWG+D+QLGYCAQGDRTKNVGVVDSEY+VHL LPTLGV
Sbjct: 277 PVFSKAAWRCVWHMIQNELIHAWGVDMQLGYCAQGDRTKNVGVVDSEYVVHLALPTLGVL 336
Query: 241 TEPELNTVGQASDDL-EQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVED 299
E EL G L E++ VALA S+ + DNR VRRQS+IEMQIFR+RW +AV++
Sbjct: 337 DENELRGEGHDHSSLREKLPKSVALAQSEFHKVDNRSAVRRQSFIEMQIFRSRWANAVKE 396
Query: 300 DKCWVDPYGQ 309
DKCW+DPY Q
Sbjct: 397 DKCWIDPYAQ 406
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|240255999|ref|NP_193588.5| uncharacterized protein [Arabidopsis thaliana] gi|332658658|gb|AEE84058.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|255575938|ref|XP_002528866.1| conserved hypothetical protein [Ricinus communis] gi|223531717|gb|EEF33540.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|356560121|ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797710 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356541848|ref|XP_003539384.1| PREDICTED: uncharacterized protein LOC100526994 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357445211|ref|XP_003592883.1| hypothetical protein MTR_2g005330 [Medicago truncatula] gi|355481931|gb|AES63134.1| hypothetical protein MTR_2g005330 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|224120054|ref|XP_002331125.1| predicted protein [Populus trichocarpa] gi|222872853|gb|EEF09984.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224129464|ref|XP_002328723.1| predicted protein [Populus trichocarpa] gi|222839021|gb|EEE77372.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|359476620|ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244499 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449462816|ref|XP_004149136.1| PREDICTED: uncharacterized protein LOC101205845 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 316 | ||||||
| TAIR|locus:2124608 | 389 | AT4G18530 "AT4G18530" [Arabido | 0.936 | 0.760 | 0.701 | 3.7e-119 | |
| TAIR|locus:2202124 | 438 | AT1G11170 [Arabidopsis thalian | 0.946 | 0.682 | 0.554 | 1.1e-92 | |
| TAIR|locus:2008425 | 425 | AT1G61240 [Arabidopsis thalian | 0.958 | 0.712 | 0.542 | 2.7e-91 | |
| TAIR|locus:2031905 | 401 | AT1G13000 [Arabidopsis thalian | 0.765 | 0.603 | 0.506 | 3.2e-65 | |
| TAIR|locus:2086721 | 398 | AT3G27470 "AT3G27470" [Arabido | 0.743 | 0.590 | 0.5 | 2.9e-64 | |
| TAIR|locus:2205140 | 382 | AT1G08040 [Arabidopsis thalian | 0.75 | 0.620 | 0.495 | 4.2e-63 | |
| TAIR|locus:504956279 | 381 | AT1G24570 [Arabidopsis thalian | 0.734 | 0.608 | 0.485 | 5.4e-63 | |
| TAIR|locus:2079291 | 396 | AT3G26440 [Arabidopsis thalian | 0.734 | 0.585 | 0.5 | 7.9e-62 | |
| TAIR|locus:2057557 | 374 | AT2G28310 "AT2G28310" [Arabido | 0.731 | 0.617 | 0.487 | 7.9e-62 | |
| TAIR|locus:2008640 | 404 | AT1G67850 "AT1G67850" [Arabido | 0.731 | 0.571 | 0.495 | 2.1e-61 |
| TAIR|locus:2124608 AT4G18530 "AT4G18530" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1173 (418.0 bits), Expect = 3.7e-119, P = 3.7e-119
Identities = 214/305 (70%), Positives = 245/305 (80%)
Query: 4 LWS-SPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDL 62
LW+ +K M+LLA+A GIKQK++V+++++KFP +DF VMLFHYDGVVD+WK
Sbjct: 93 LWNYDDTKKRRPNHSMSLLAMAVGIKQKELVNKVIQKFPPRDFAVMLFHYDGVVDDWKQY 152
Query: 63 VWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEG 122
W + AIHVS NQTKWWFAKRFLHPDIVAEY YIFLWDED+GV +FNP+RYLSIVK+EG
Sbjct: 153 PWNNHAIHVSVMNQTKWWFAKRFLHPDIVAEYEYIFLWDEDLGVGHFNPQRYLSIVKEEG 212
Query: 123 LEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPV 182
LEISQPALD KSEVHHPITARR+ SK HRRMYKYKGSGRCDD+ST PPCIGWVEMMAPV
Sbjct: 213 LEISQPALDTSKSEVHHPITARRKKSKVHRRMYKYKGSGRCDDHSTNPPCIGWVEMMAPV 272
Query: 183 FSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTE 242
FSRAAWRC+WYMIQNDLIHAWGLD QLGYCAQGDR KNVGVVD+EYI+H GLPTLGV
Sbjct: 273 FSRAAWRCSWYMIQNDLIHAWGLDTQLGYCAQGDRKKNVGVVDAEYIIHYGLPTLGV--- 329
Query: 243 PELNTVGQASDDLEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKC 302
V AS L + + +SR DNRPEVR +S++EM+ F+ RWK AV DD C
Sbjct: 330 -----VETASSALRNETDSKSTESLESREVDNRPEVRMKSFVEMKRFKERWKKAVRDDTC 384
Query: 303 WVDPY 307
WVDPY
Sbjct: 385 WVDPY 389
|
|
| TAIR|locus:2202124 AT1G11170 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008425 AT1G61240 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2031905 AT1G13000 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2086721 AT3G27470 "AT3G27470" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2205140 AT1G08040 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:504956279 AT1G24570 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2079291 AT3G26440 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2057557 AT2G28310 "AT2G28310" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008640 AT1G67850 "AT1G67850" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00035311001 | SubName- Full=Chromosome undetermined scaffold_77, whole genome shotgun sequence; (411 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 316 | |||
| pfam05212 | 294 | pfam05212, DUF707, Protein of unknown function (DU | 1e-131 | |
| COG5096 | 757 | COG5096, COG5096, Vesicle coat complex, various su | 0.002 |
| >gnl|CDD|218501 pfam05212, DUF707, Protein of unknown function (DUF707) | Back alignment and domain information |
|---|
Score = 374 bits (961), Expect = e-131
Identities = 136/300 (45%), Positives = 173/300 (57%), Gaps = 31/300 (10%)
Query: 1 MRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWK 60
+R LW SP + + P LLA G QK VD V+KF S +F ++LFHYDG EW
Sbjct: 26 LRRLWGSPEE-DVASKPKYLLAFTVGYSQKANVDACVKKF-SDNFTIVLFHYDGRTTEWD 83
Query: 61 DLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKD 120
+L W+ +AIHVSA QTKWWFAKRFLHPDIVA Y YIFLWDED+GV+NF+ Y+ IVK
Sbjct: 84 ELEWSKKAIHVSAKKQTKWWFAKRFLHPDIVAPYEYIFLWDEDLGVDNFDAEEYIKIVKK 143
Query: 121 EGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMA 180
GLEISQP LDP + ++ IT RR + + H+ + ST PPC G+VE+MA
Sbjct: 144 HGLEISQPGLDPDRGQITWRITKRRGDGEVHKDTREKGRCCD---NSTGPPCTGFVEIMA 200
Query: 181 PVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVT 240
PVFSR AWRC W+MIQNDL+H WGLD L C + +GVVDS+++VH G+P+LG
Sbjct: 201 PVFSRDAWRCVWHMIQNDLVHGWGLDFALRRCV-EPAHEKIGVVDSQWVVHQGIPSLGSQ 259
Query: 241 TEPELNTVGQASDDLEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDD 300
E VR + E +F+ RW A ++
Sbjct: 260 GTAE-------------------------NGKAPWQGVRDRCKAEWTMFQRRWAAAEKEY 294
|
This family consists of several uncharacterized proteins from Arabidopsis thaliana. Length = 294 |
| >gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits [Intracellular trafficking and secretion] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 316 | |||
| PF05212 | 294 | DUF707: Protein of unknown function (DUF707); Inte | 100.0 | |
| cd04185 | 202 | GT_2_like_b Subfamily of Glycosyltransferase Famil | 95.24 | |
| cd04186 | 166 | GT_2_like_c Subfamily of Glycosyltransferase Famil | 94.62 | |
| TIGR01556 | 281 | rhamnosyltran L-rhamnosyltransferase. Rhamnolipids | 94.49 | |
| cd02510 | 299 | pp-GalNAc-T pp-GalNAc-T initiates the formation of | 92.42 | |
| cd02526 | 237 | GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr | 90.27 | |
| cd02520 | 196 | Glucosylceramide_synthase Glucosylceramide synthas | 89.69 | |
| cd02525 | 249 | Succinoglycan_BP_ExoA ExoA is involved in the bios | 88.22 | |
| PF13641 | 228 | Glyco_tranf_2_3: Glycosyltransferase like family 2 | 88.02 | |
| cd06421 | 234 | CESA_CelA_like CESA_CelA_like are involved in the | 85.5 | |
| cd04195 | 201 | GT2_AmsE_like GT2_AmsE_like is involved in exopoly | 84.28 | |
| COG1216 | 305 | Predicted glycosyltransferases [General function p | 83.81 | |
| cd06433 | 202 | GT_2_WfgS_like WfgS and WfeV are involved in O-ant | 81.3 | |
| cd06442 | 224 | DPM1_like DPM1_like represents putative enzymes si | 81.01 |
| >PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana | Back alignment and domain information |
|---|
Probab=100.00 E-value=6.1e-122 Score=865.89 Aligned_cols=268 Identities=65% Similarity=1.195 Sum_probs=256.9
Q ss_pred CCCCCCCCCCCCCCCCCCcEEEEEecccchhhHHHHHhhCCCCCcEEEEEEecCcccccccccccCceeEEEeecccchh
Q 021185 1 MRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWW 80 (316)
Q Consensus 1 ~r~Lwg~~~~~~~~~~~k~Ll~~~VG~kqk~~vd~~v~kf~~~nF~v~LfhYDg~vd~w~d~ews~~aiHv~a~kqtKWw 80 (316)
||||||+|+++. ..++|||||||||+|||++||++|+|| ++|||||||||||+||+|++||||++||||++.||||||
T Consensus 26 ~r~lw~~p~~~~-~~~~k~Lla~~VG~kqk~~vd~~v~Kf-~~nF~i~LfhYDg~vd~w~~~~ws~~aiHv~~~kqtKww 103 (294)
T PF05212_consen 26 LRPLWGNPSEDL-PKKPKYLLAMTVGIKQKDNVDAIVKKF-SDNFDIMLFHYDGRVDEWDDFEWSDRAIHVSARKQTKWW 103 (294)
T ss_pred eeecCCCccccc-cCCCceEEEEEecHHHHhhhhHHHhhh-ccCceEEEEEecCCcCchhhcccccceEEEEeccceEEe
Confidence 799999999985 567899999999999999999999999 899999999999999999999999999999999999999
Q ss_pred hhccccCccccccccEEEEecccccCCCCCHHHHHHHHHHhCCcccCCCcCCCCCccccccceeccCcccceeeecccCC
Q 021185 81 FAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGS 160 (316)
Q Consensus 81 ~akRfLHPdiv~~YDYIflwDDDL~vd~f~i~ry~~ivr~~gLeISQPALd~~s~~~sh~iT~r~~~~~vHr~~~~~~~~ 160 (316)
|||||||||||++|||||||||||+||+|+|+|||+||++||||||||||+++||++||+||+|++.++|||. .++.
T Consensus 104 ~akrfLHPdiv~~YdYiflwDeDL~vd~f~~~ry~~Ivk~~gLeISQPALd~~~~~~~~~iT~R~~~~~vhr~---~~~~ 180 (294)
T PF05212_consen 104 FAKRFLHPDIVAPYDYIFLWDEDLGVDHFDINRYFEIVKKEGLEISQPALDPDSSEIHHPITKRRPDSEVHRK---TRGG 180 (294)
T ss_pred ehhhhcChhhhccceeEEecCCccCcCcCCHHHHHHHHHHhCCcccCcccCCCCceeeeeEEeecCCceeEec---cCCC
Confidence 9999999999999999999999999999999999999999999999999999998999999999999999983 5777
Q ss_pred CCCCCCCCCCCccceEEEecccccHHHHHHHhhhhcCCCcccchhhHHhhhhhcCCCCCcEEEEeeceEEEccCCCCCCC
Q 021185 161 GRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVT 240 (316)
Q Consensus 161 ~~C~~~~~~ppcTgFVEiMaPVFSR~Awrcvw~miqNdlvhGWGLD~~w~~c~~g~~~~kiGVVDa~~V~H~~~ptlg~~ 240 (316)
+.|.+++++||||||||||||||||+|||||||||||||+|||||||+|+||+ +++++||||||||||+|+|+|||||+
T Consensus 181 ~~~~~~~~~ppct~fVEiMAPVFSr~Awrcvw~miqNDLvhGWGLDf~~~~c~-~~~~~kiGVVDs~~VvH~gvptLG~~ 259 (294)
T PF05212_consen 181 PRCCDDSTGPPCTGFVEIMAPVFSRAAWRCVWHMIQNDLVHGWGLDFKWGYCA-GDRHKKIGVVDSQYVVHTGVPTLGGQ 259 (294)
T ss_pred CCcCCCCCCCCcceEEEEecceechHHHHHHHhcccCCCccccchhhhHHHHh-ccccccEEEEeeEEEEEcCCCcCCCc
Confidence 88888999999999999999999999999999999999999999999999999 78999999999999999999999998
Q ss_pred CCcccccccCCCchhhhhccccccCCCCCCCCCChHHHHhhhHHHHHHHHHHHHHhHhc
Q 021185 241 TEPELNTVGQASDDLEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVED 299 (316)
Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Vr~r~~~E~~~f~~R~~~a~~~ 299 (316)
+.++. +.++|.+||+||++||++|++||++|++|
T Consensus 260 ~~~~~-------------------------~~~~~~~Vr~r~~~E~~~F~~R~~~a~~~ 293 (294)
T PF05212_consen 260 GNSEK-------------------------GKDPREEVRRRSFAEMRIFQKRWANAVKE 293 (294)
T ss_pred ccccc-------------------------CCchHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 86632 45789999999999999999999999986
|
|
| >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >TIGR01556 rhamnosyltran L-rhamnosyltransferase | Back alignment and domain information |
|---|
| >cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans | Back alignment and domain information |
|---|
| >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase | Back alignment and domain information |
|---|
| >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis | Back alignment and domain information |
|---|
| >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan | Back alignment and domain information |
|---|
| >PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A | Back alignment and domain information |
|---|
| >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose | Back alignment and domain information |
|---|
| >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis | Back alignment and domain information |
|---|
| >COG1216 Predicted glycosyltransferases [General function prediction only] | Back alignment and domain information |
|---|
| >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis | Back alignment and domain information |
|---|
| >cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1 | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 316 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 3e-04 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 41.8 bits (97), Expect = 3e-04
Identities = 54/406 (13%), Positives = 99/406 (24%), Gaps = 152/406 (37%)
Query: 28 KQKKIV----DQIVRKFPSK---DFVVMLF---HYDGVVDEWKDLVWADRAIHVSAANQT 77
+ K I+ D V F K D + D ++ + R + Q
Sbjct: 17 QYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQE 76
Query: 78 KWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNP----RRYLSIVKDEGLEISQPALDPV 133
+ ++F+ + V NY FL I E P R Y+ + L
Sbjct: 77 E--MVQKFV--EEVLRINYKFLMSP-IKTEQRQPSMMTRMYIEQR--DRLYNDNQVFAKY 129
Query: 134 KSEVHHPITARRRNSKAHRR--------MYKYKGSGR-------CDDYS--TAPPC-IGW 175
P R+ R + GSG+ C Y I W
Sbjct: 130 NVSRLQPYLKLRQALLELRPAKNVLIDGV---LGSGKTWVALDVCLSYKVQCKMDFKIFW 186
Query: 176 VEM------------------------------MAPVFSRAAW---------------RC 190
+ + + + R C
Sbjct: 187 LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENC 246
Query: 191 AWYMIQNDLIHAW---GLDIQLGYCAQGDR----TKNVGVVD---SEYIVHLGL-PTLGV 239
++ ++ +A ++ C + T+ V D + H+ L
Sbjct: 247 --LLVLLNVQNAKAWNAFNLS---C----KILLTTRFKQVTDFLSAATTTHISLDHHSMT 297
Query: 240 TTEPE-----LNTVGQASDDL-EQIA--NPVALA---------PSQSRRYDN-------- 274
T E L + DL ++ NP L+ + + +
Sbjct: 298 LTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTT 357
Query: 275 ---------RPEVRRQSYIEMQIFRNR-----------WKHAVEDD 300
P R+ + + +F W ++ D
Sbjct: 358 IIESSLNVLEPAEYRKMFDRLSVFPPSAHIPTILLSLIWFDVIKSD 403
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 316 | |||
| 2z86_A | 625 | Chondroitin synthase; GT-A, glycosyltransferase A, | 87.8 | |
| 1qg8_A | 255 | Protein (spore coat polysaccharide biosynthesis P | 85.36 | |
| 1xhb_A | 472 | Polypeptide N-acetylgalactosaminyltransferase 1; g | 84.39 |
| >2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* | Back alignment and structure |
|---|
Probab=87.80 E-value=1.8 Score=42.38 Aligned_cols=216 Identities=13% Similarity=0.127 Sum_probs=100.4
Q ss_pred CCcEEEEEecccchhhHHHHHhhC----CCCCcEEEEEEecCccccccc-cc-ccC-ceeEEEeecccchhhhccccCcc
Q 021185 17 PMNLLAIAAGIKQKKIVDQIVRKF----PSKDFVVMLFHYDGVVDEWKD-LV-WAD-RAIHVSAANQTKWWFAKRFLHPD 89 (316)
Q Consensus 17 ~k~Ll~~~VG~kqk~~vd~~v~kf----~~~nF~v~LfhYDg~vd~w~d-~e-ws~-~aiHv~a~kqtKWw~akRfLHPd 89 (316)
++.-|++|+=.. ...+...+... ...++.|+++. ||..|+=.+ ++ +.+ ..+++........-.+...-.-=
T Consensus 93 p~vsviIp~~n~-~~~l~~~l~sl~~q~~~~~~eiivvD-d~s~d~t~~~~~~~~~~~~i~~i~~~~~~~g~~~a~N~g~ 170 (625)
T 2z86_A 93 DGLSIVIPTYNR-AKILAITLACLCNQKTIYDYEVIVAD-DGSKENIEEIVREFESLLNIKYVRQKDYGYQLCAVRNLGL 170 (625)
T ss_dssp CCEEEEEEESSC-HHHHHHHHHHHHTCCCSSCEEEEEEE-ESCSSCHHHHHHTTTTTSCEEEEEECCCSCCHHHHHHHHH
T ss_pred CcEEEEEecCCc-HHHHHHHHHHHHhhccCCCeEEEEEe-CCCchhHHHHHHHhhhcCCeEEEEeCCCCcchhHHHHHHH
Confidence 345566666433 34444444432 22378887776 677664322 11 111 12333333221111111100111
Q ss_pred ccccccEEEEecccccCCCCCHHHHHHHHHHhC-CcccCCCcCCCCCccccc------cceec-cC----cccceeeecc
Q 021185 90 IVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEG-LEISQPALDPVKSEVHHP------ITARR-RN----SKAHRRMYKY 157 (316)
Q Consensus 90 iv~~YDYIflwDDDL~vd~f~i~ry~~ivr~~g-LeISQPALd~~s~~~sh~------iT~r~-~~----~~vHr~~~~~ 157 (316)
-.+.+|||++.|+|..++...+.++++.+.+.. .-+.+|..-...+..... -..+. +. ..+.......
T Consensus 171 ~~a~g~~v~~lD~D~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (625)
T 2z86_A 171 RAAKYNYVAILDCDMAPNPLWVQSYMELLAVDDNVALIGPRKYIDTSKHTYLDFLSQKSLINEIPEIITNNQVAGKVEQN 250 (625)
T ss_dssp HHCCSSEEEEECTTEEECTTHHHHHHHHHHHCTTEEEECCEEEECCTTCCHHHHHHSTTGGGTSCC--------------
T ss_pred HhCCcCEEEEECCCCCCCHHHHHHHHHHHhcCCceEEEEeeeeccCcccchhhcccchHHhhhcchhccCCchhhhhccC
Confidence 135789999999999999999999999998743 444444321111000000 00000 00 0000000000
Q ss_pred cCCCC------CC-CCCCCCCccceEEEecccccHHHHHHHhhhhcCCCcccch-hhHHhhhhhcCCCCCcEEEEeeceE
Q 021185 158 KGSGR------CD-DYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWG-LDIQLGYCAQGDRTKNVGVVDSEYI 229 (316)
Q Consensus 158 ~~~~~------C~-~~~~~ppcTgFVEiMaPVFSR~Awrcvw~miqNdlvhGWG-LD~~w~~c~~g~~~~kiGVVDa~~V 229 (316)
.+..+ .. ..........++=..+=+|+|+++..+ ..+ +....+|| =|+-+..-+. ..+.++.++....+
T Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~rr~~l~~i-Ggf-de~~~~~g~ED~dl~~Rl~-~~G~~i~~~p~~~v 327 (625)
T 2z86_A 251 KSVDWRIEHFKNTDNLRLCNTPFRFFSGGNVAFAKKWLFRA-GWF-DEEFTHWGGEDNEFGYRLY-REGCYFRSVEGAMA 327 (625)
T ss_dssp -CCCTHHHHHHHTTTTTTCSCGGGGCCTTEEEEETHHHHHH-CCC-CTTCSSCCCHHHHHHHHHH-HTTCEEEECGGGCE
T ss_pred CccccchhhhcccccccccCCceeEEeeceeeeEHHHHHHh-CCC-ccccccCCcchhhhhhhHH-hCCceEEEcccchh
Confidence 00000 00 000000111233334557899999988 333 45555664 5665543332 23469999999999
Q ss_pred EEccCCCC
Q 021185 230 VHLGLPTL 237 (316)
Q Consensus 230 ~H~~~ptl 237 (316)
.|...++.
T Consensus 328 ~H~~~~~~ 335 (625)
T 2z86_A 328 YHQEPPGK 335 (625)
T ss_dssp EEECCC--
T ss_pred hccCCccc
Confidence 99976543
|
| >1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* | Back alignment and structure |
|---|
| >1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 316 | |||
| d1xhba2 | 328 | Polypeptide N-acetylgalactosaminyltransferase 1, N | 86.56 |
| >d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Nucleotide-diphospho-sugar transferases superfamily: Nucleotide-diphospho-sugar transferases family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain species: Mouse (Mus musculus) [TaxId: 10090]
Probab=86.56 E-value=0.33 Score=41.13 Aligned_cols=215 Identities=13% Similarity=0.101 Sum_probs=103.4
Q ss_pred CCCcEEEEEecccchh----hHHHHHhhCCCCCcE-EEEEEecCcccccc--ccc-c---cCceeEEEeecccchhhhcc
Q 021185 16 PPMNLLAIAAGIKQKK----IVDQIVRKFPSKDFV-VMLFHYDGVVDEWK--DLV-W---ADRAIHVSAANQTKWWFAKR 84 (316)
Q Consensus 16 ~~k~Ll~~~VG~kqk~----~vd~~v~kf~~~nF~-v~LfhYDg~vd~w~--d~e-w---s~~aiHv~a~kqtKWw~akR 84 (316)
.|..=|++|+=..... -+++++..= ..++. =+++.=||+.|+-- .++ + ...-|++....+..=. +.-
T Consensus 21 ~P~vSIIIp~yNe~~~~l~~~l~Si~~qt-~~~~~~EIIvVDdgS~d~~~~~~l~~~~~~~~~~i~vi~~~~n~G~-~~a 98 (328)
T d1xhba2 21 LPTTSVVIVFHNEAWSTLLRTVHSVINRS-PRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGL-IRA 98 (328)
T ss_dssp CCCEEEEEEESSCCHHHHHHHHHHHHHSS-CGGGEEEEEEEECSCCCGGGTHHHHHHHHSSSSCEEEEECSSCCCH-HHH
T ss_pred CCCEEEEEeccCCcHHHHHHHHHHHHhcC-CCCCCeEEEEEECCCChhhHHHHHHHHHHhcCCCeEEEEecccccc-hHH
Confidence 3455567776543333 344555442 23442 24555688877521 111 1 1122555333222211 111
Q ss_pred ccCccccccccEEEEecccccCCCCCHHHHHHHHHHhCCcccCCCcCCCCCc-cc----cccceec--cCcccceeeecc
Q 021185 85 FLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSE-VH----HPITARR--RNSKAHRRMYKY 157 (316)
Q Consensus 85 fLHPdiv~~YDYIflwDDDL~vd~f~i~ry~~ivr~~gLeISQPALd~~s~~-~s----h~iT~r~--~~~~vHr~~~~~ 157 (316)
.-..=-.+.-|||++.|+|..++.-.++++++.+.+..-.+..|.++.-.+. .. ...+.-. .....+......
T Consensus 99 ~N~Gi~~a~gd~i~flD~D~~~~p~~l~~l~~~~~~~~~~~v~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (328)
T d1xhba2 99 RLKGAAVSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQ 178 (328)
T ss_dssp HHHHHHHCCSSEEEEEESSEEECTTCHHHHHHHHHHCTTEEEEEEEEEECSSSCCEECCCTTEEEEECTTCCEEEEECCH
T ss_pred HHHHHHhhhcceeeecCcccccChhHHHHHHHHHhcCCCeEEecceeeeccCceeeccCCccccccccccccccccccch
Confidence 1111224678999999999999999999999999998877777765321100 00 0000000 000000000000
Q ss_pred cCCCCCCCCCCCCCccceEEEecccccHHHHHHHhhhhcCCCcccch---hhHHhhhhhcCCCCCcEEEEeeceEEEccC
Q 021185 158 KGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWG---LDIQLGYCAQGDRTKNVGVVDSEYIVHLGL 234 (316)
Q Consensus 158 ~~~~~C~~~~~~ppcTgFVEiMaPVFSR~Awrcvw~miqNdlvhGWG---LD~~w~~c~~g~~~~kiGVVDa~~V~H~~~ 234 (316)
..........+.+..+.++..-+=.++|++|..+ ..+ +.....|| +|+-+..... +.+|..+-...|.|...
T Consensus 179 ~~~~~~~~~~~~~~~~~~~~g~~~~irr~~f~~v-Ggf-De~~~~~g~ED~Dl~~R~~~~---G~~i~~~p~~~v~H~~~ 253 (328)
T d1xhba2 179 REMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEI-GTY-DAGMDIWGGENLEISFRIWQC---GGTLEIVTCSHVGHVFR 253 (328)
T ss_dssp HHHHHTTTCTTSCEECSBCCSSSEEEEHHHHHHT-TSC-CTTSCTTCCCCSHHHHHHHHT---TCEEEEEEEEEEEEEC-
T ss_pred hhhhccccccccccccceecceeeeeeHHHHHHh-CCC-CCCCcCcCchHHHHHHHHHHh---CCeEEEeCCeEEEEeCC
Confidence 0000000011122222233222234789999988 433 55556665 4555554433 46899999999999754
Q ss_pred CCC
Q 021185 235 PTL 237 (316)
Q Consensus 235 ptl 237 (316)
.+.
T Consensus 254 ~~~ 256 (328)
T d1xhba2 254 KAT 256 (328)
T ss_dssp ---
T ss_pred CCC
Confidence 333
|