Citrus Sinensis ID: 021149
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 316 | ||||||
| 359486223 | 413 | PREDICTED: uncharacterized protein LOC10 | 0.930 | 0.711 | 0.735 | 1e-130 | |
| 240255999 | 389 | uncharacterized protein [Arabidopsis tha | 0.892 | 0.724 | 0.727 | 1e-126 | |
| 255575938 | 370 | conserved hypothetical protein [Ricinus | 0.854 | 0.729 | 0.688 | 1e-120 | |
| 356560121 | 385 | PREDICTED: uncharacterized protein LOC10 | 0.879 | 0.722 | 0.696 | 1e-120 | |
| 357445211 | 406 | hypothetical protein MTR_2g005330 [Medic | 0.886 | 0.689 | 0.669 | 1e-120 | |
| 356541848 | 387 | PREDICTED: uncharacterized protein LOC10 | 0.886 | 0.723 | 0.703 | 1e-120 | |
| 449462816 | 414 | PREDICTED: uncharacterized protein LOC10 | 0.933 | 0.712 | 0.641 | 1e-116 | |
| 449515369 | 402 | PREDICTED: uncharacterized LOC101205845, | 0.933 | 0.733 | 0.641 | 1e-116 | |
| 224120054 | 314 | predicted protein [Populus trichocarpa] | 0.927 | 0.933 | 0.681 | 1e-116 | |
| 359476620 | 466 | PREDICTED: uncharacterized protein LOC10 | 0.939 | 0.637 | 0.659 | 1e-115 |
| >gi|359486223|ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera] gi|297739491|emb|CBI29673.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 217/295 (73%), Positives = 250/295 (84%), Gaps = 1/295 (0%)
Query: 16 PPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAAN 75
P +LLA+A GIKQK+IV+QIV KF +FVVMLFHYDGVVDEW++ W+D AIHV+ N
Sbjct: 112 PSKSLLAMAVGIKQKEIVNQIVEKFILSNFVVMLFHYDGVVDEWREFAWSDHAIHVTVVN 171
Query: 76 QTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKS 135
QTKWWFAKRFLHPDIVAEYNYIFLWDED+GVENF+P RY+SIV+DEGLEISQPALDP KS
Sbjct: 172 QTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFHPGRYVSIVEDEGLEISQPALDPKKS 231
Query: 136 EVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMI 195
VHH ITAR RNS+ HRR YK++GSGRCDD STAPPC+GWVEMMAPVFS+AAWRC W+MI
Sbjct: 232 RVHHQITARVRNSRVHRRTYKHRGSGRCDDQSTAPPCVGWVEMMAPVFSKAAWRCVWHMI 291
Query: 196 QNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDL 255
QN+LIHAWG+D+QLGYCAQGDRTKNVGVVDSEY+VHL LPTLGV E EL G L
Sbjct: 292 QNELIHAWGVDMQLGYCAQGDRTKNVGVVDSEYVVHLALPTLGVLDENELRGEGHDHSSL 351
Query: 256 -EQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQ 309
E++ VALA S+ + DNR VRRQS+IEMQIFR+RW +AV++DKCW+DPY Q
Sbjct: 352 REKLPKSVALAQSEFHKVDNRSAVRRQSFIEMQIFRSRWANAVKEDKCWIDPYAQ 406
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|240255999|ref|NP_193588.5| uncharacterized protein [Arabidopsis thaliana] gi|332658658|gb|AEE84058.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|255575938|ref|XP_002528866.1| conserved hypothetical protein [Ricinus communis] gi|223531717|gb|EEF33540.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|356560121|ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797710 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357445211|ref|XP_003592883.1| hypothetical protein MTR_2g005330 [Medicago truncatula] gi|355481931|gb|AES63134.1| hypothetical protein MTR_2g005330 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|356541848|ref|XP_003539384.1| PREDICTED: uncharacterized protein LOC100526994 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449462816|ref|XP_004149136.1| PREDICTED: uncharacterized protein LOC101205845 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|449515369|ref|XP_004164722.1| PREDICTED: uncharacterized LOC101205845, partial [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|224120054|ref|XP_002331125.1| predicted protein [Populus trichocarpa] gi|222872853|gb|EEF09984.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|359476620|ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244499 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 316 | ||||||
| TAIR|locus:2124608 | 389 | AT4G18530 "AT4G18530" [Arabido | 0.905 | 0.735 | 0.719 | 1.3e-118 | |
| TAIR|locus:2202124 | 438 | AT1G11170 [Arabidopsis thalian | 0.892 | 0.643 | 0.569 | 4.5e-91 | |
| TAIR|locus:2008425 | 425 | AT1G61240 [Arabidopsis thalian | 0.901 | 0.670 | 0.559 | 4.6e-89 | |
| TAIR|locus:2031905 | 401 | AT1G13000 [Arabidopsis thalian | 0.718 | 0.566 | 0.523 | 3.7e-64 | |
| TAIR|locus:2086721 | 398 | AT3G27470 "AT3G27470" [Arabido | 0.693 | 0.550 | 0.515 | 3.3e-63 | |
| TAIR|locus:504956279 | 381 | AT1G24570 [Arabidopsis thalian | 0.693 | 0.574 | 0.497 | 2.6e-61 | |
| TAIR|locus:2205140 | 382 | AT1G08040 [Arabidopsis thalian | 0.712 | 0.589 | 0.502 | 7.1e-61 | |
| TAIR|locus:2079291 | 396 | AT3G26440 [Arabidopsis thalian | 0.683 | 0.545 | 0.518 | 7.1e-61 | |
| TAIR|locus:2008640 | 404 | AT1G67850 "AT1G67850" [Arabido | 0.683 | 0.534 | 0.513 | 5e-60 | |
| TAIR|locus:2057557 | 374 | AT2G28310 "AT2G28310" [Arabido | 0.693 | 0.585 | 0.497 | 6.4e-60 |
| TAIR|locus:2124608 AT4G18530 "AT4G18530" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1168 (416.2 bits), Expect = 1.3e-118, P = 1.3e-118
Identities = 213/296 (71%), Positives = 243/296 (82%)
Query: 14 QRP--PMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHV 71
+RP M+LLA+A GIKQK++V+++++KFP +DF VMLFHYDGVVD+WK W + AIHV
Sbjct: 102 RRPNHSMSLLAMAVGIKQKELVNKVIQKFPPRDFAVMLFHYDGVVDDWKQYPWNNHAIHV 161
Query: 72 SAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALD 131
S NQTKWWFAKRFLHPDIVAEY YIFLWDED+GV +FNP+RYLSIVK+EGLEISQPALD
Sbjct: 162 SVMNQTKWWFAKRFLHPDIVAEYEYIFLWDEDLGVGHFNPQRYLSIVKEEGLEISQPALD 221
Query: 132 PVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCA 191
KSEVHHPITARR+ SK HRRMYKYKGSGRCDD+ST PPCIGWVEMMAPVFSRAAWRC+
Sbjct: 222 TSKSEVHHPITARRKKSKVHRRMYKYKGSGRCDDHSTNPPCIGWVEMMAPVFSRAAWRCS 281
Query: 192 WYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQA 251
WYMIQNDLIHAWGLD QLGYCAQGDR KNVGVVD+EYI+H GLPTLGV V A
Sbjct: 282 WYMIQNDLIHAWGLDTQLGYCAQGDRKKNVGVVDAEYIIHYGLPTLGV--------VETA 333
Query: 252 SDDLEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPY 307
S L + + +SR DNRPEVR +S++EM+ F+ RWK AV DD CWVDPY
Sbjct: 334 SSALRNETDSKSTESLESREVDNRPEVRMKSFVEMKRFKERWKKAVRDDTCWVDPY 389
|
|
| TAIR|locus:2202124 AT1G11170 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008425 AT1G61240 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2031905 AT1G13000 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2086721 AT3G27470 "AT3G27470" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:504956279 AT1G24570 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2205140 AT1G08040 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2079291 AT3G26440 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008640 AT1G67850 "AT1G67850" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2057557 AT2G28310 "AT2G28310" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| AT4G18530 | unknown protein; unknown protein; FUNCTIONS IN- molecular_function unknown; INVOLVED IN- biological_process unknown; LOCATED IN- endomembrane system; EXPRESSED IN- 18 plant structures; EXPRESSED DURING- 10 growth stages; CONTAINS InterPro DOMAIN/s- Protein of unknown function DUF707 (InterPro-IPR007877); BEST Arabidopsis thaliana protein match is- unknown protein (TAIR-AT4G12840.2); Has 173 Blast hits to 173 proteins in 12 species- Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 169; Viruses - 0; Other Eukaryotes - 4 (source- NCBI BLink). (389 aa) | |||||||
(Arabidopsis thaliana) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 316 | |||
| pfam05212 | 294 | pfam05212, DUF707, Protein of unknown function (DU | 1e-127 | |
| COG5096 | 757 | COG5096, COG5096, Vesicle coat complex, various su | 0.003 |
| >gnl|CDD|218501 pfam05212, DUF707, Protein of unknown function (DUF707) | Back alignment and domain information |
|---|
Score = 365 bits (938), Expect = e-127
Identities = 131/286 (45%), Positives = 165/286 (57%), Gaps = 30/286 (10%)
Query: 15 RPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAA 74
P LLA G QK VD V+KF S +F ++LFHYDG EW +L W+ +AIHVSA
Sbjct: 39 SKPKYLLAFTVGYSQKANVDACVKKF-SDNFTIVLFHYDGRTTEWDELEWSKKAIHVSAK 97
Query: 75 NQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVK 134
QTKWWFAKRFLHPDIVA Y YIFLWDED+GV+NF+ Y+ IVK GLEISQP LDP +
Sbjct: 98 KQTKWWFAKRFLHPDIVAPYEYIFLWDEDLGVDNFDAEEYIKIVKKHGLEISQPGLDPDR 157
Query: 135 SEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYM 194
++ IT RR + + H+ + ST PPC G+VE+MAPVFSR AWRC W+M
Sbjct: 158 GQITWRITKRRGDGEVHKDTREKGRCCD---NSTGPPCTGFVEIMAPVFSRDAWRCVWHM 214
Query: 195 IQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDD 254
IQNDL+H WGLD L C + +GVVDS+++VH G+P+LG E
Sbjct: 215 IQNDLVHGWGLDFALRRCV-EPAHEKIGVVDSQWVVHQGIPSLGSQGTAE---------- 263
Query: 255 LEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDD 300
VR + E +F+ RW A ++
Sbjct: 264 ---------------NGKAPWQGVRDRCKAEWTMFQRRWAAAEKEY 294
|
This family consists of several uncharacterized proteins from Arabidopsis thaliana. Length = 294 |
| >gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits [Intracellular trafficking and secretion] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 316 | |||
| PF05212 | 294 | DUF707: Protein of unknown function (DUF707); Inte | 100.0 | |
| cd04185 | 202 | GT_2_like_b Subfamily of Glycosyltransferase Famil | 94.45 | |
| cd04186 | 166 | GT_2_like_c Subfamily of Glycosyltransferase Famil | 93.86 | |
| TIGR01556 | 281 | rhamnosyltran L-rhamnosyltransferase. Rhamnolipids | 93.08 | |
| cd02510 | 299 | pp-GalNAc-T pp-GalNAc-T initiates the formation of | 90.9 | |
| cd02525 | 249 | Succinoglycan_BP_ExoA ExoA is involved in the bios | 88.41 | |
| cd02526 | 237 | GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr | 88.36 | |
| cd02520 | 196 | Glucosylceramide_synthase Glucosylceramide synthas | 88.36 | |
| PF13641 | 228 | Glyco_tranf_2_3: Glycosyltransferase like family 2 | 86.55 | |
| cd06421 | 234 | CESA_CelA_like CESA_CelA_like are involved in the | 86.51 | |
| cd04195 | 201 | GT2_AmsE_like GT2_AmsE_like is involved in exopoly | 82.46 |
| >PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana | Back alignment and domain information |
|---|
Probab=100.00 E-value=4.7e-118 Score=839.37 Aligned_cols=260 Identities=65% Similarity=1.184 Sum_probs=248.6
Q ss_pred ccCCCCCCCcEEEEEecccchhhHHHHhhcCCCCCcEEEEEEecCccCccccccccCceeEEEeeccchhhhhhhccCcc
Q 021149 10 CQNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPD 89 (316)
Q Consensus 10 ~~~~~~~~k~Lva~~vG~kqk~~vd~~v~kf~~~nF~v~LfhYDg~v~~w~d~ews~~aiHv~a~kqtKWw~akRfLhPd 89 (316)
.++...++|||||||||+|||++||++|+|| ++|||||||||||+||+|++||||++||||++.|||||||||||||||
T Consensus 34 ~~~~~~~~k~Lla~~VG~kqk~~vd~~v~Kf-~~nF~i~LfhYDg~vd~w~~~~ws~~aiHv~~~kqtKww~akrfLHPd 112 (294)
T PF05212_consen 34 SEDLPKKPKYLLAMTVGIKQKDNVDAIVKKF-SDNFDIMLFHYDGRVDEWDDFEWSDRAIHVSARKQTKWWFAKRFLHPD 112 (294)
T ss_pred cccccCCCceEEEEEecHHHHhhhhHHHhhh-ccCceEEEEEecCCcCchhhcccccceEEEEeccceEEeehhhhcChh
Confidence 4556677899999999999999999999999 899999999999999999999999999999999999999999999999
Q ss_pred ccccccEEEEeccccccCCCChHHHHHHHHHhCCcccccccCCCCCceeccccccccCcccceeeeeccCCCCCCCCCCC
Q 021149 90 IVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTA 169 (316)
Q Consensus 90 iv~~YdYIflwDDDL~vd~f~i~ry~~Ivr~~gLeISQPALd~~s~~ish~iT~r~~~~~vHr~~~~~~~~~~C~~~~~~ 169 (316)
||++|||||||||||+||+|+|+|||+||++||||||||||+++|+++||+||+|++.++|||. .++++.|.+++++
T Consensus 113 iv~~YdYiflwDeDL~vd~f~~~ry~~Ivk~~gLeISQPALd~~~~~~~~~iT~R~~~~~vhr~---~~~~~~~~~~~~~ 189 (294)
T PF05212_consen 113 IVAPYDYIFLWDEDLGVDHFDINRYFEIVKKEGLEISQPALDPDSSEIHHPITKRRPDSEVHRK---TRGGPRCCDDSTG 189 (294)
T ss_pred hhccceeEEecCCccCcCcCCHHHHHHHHHHhCCcccCcccCCCCceeeeeEEeecCCceeEec---cCCCCCcCCCCCC
Confidence 9999999999999999999999999999999999999999999998999999999999999993 5778888899999
Q ss_pred CCccceEEEeccccchHHHHHHhhhccCCCcccchhhhhhhhhhcCCCCCcEEEEeeceEEeccCCCCCCCCCccccccc
Q 021149 170 PPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVG 249 (316)
Q Consensus 170 ppcTgFVEiMaPVFSR~Awrcvw~miqndlvhGWGLD~~w~~c~~g~~~~kiGVVDa~~V~H~~~ptlg~~~~~~~~~~~ 249 (316)
||||||||||||||||+|||||||||||||+|||||||+|+||+ +++++||||||||||+|+++|||||++.++.
T Consensus 190 ppct~fVEiMAPVFSr~Awrcvw~miqNDLvhGWGLDf~~~~c~-~~~~~kiGVVDs~~VvH~gvptLG~~~~~~~---- 264 (294)
T PF05212_consen 190 PPCTGFVEIMAPVFSRAAWRCVWHMIQNDLVHGWGLDFKWGYCA-GDRHKKIGVVDSQYVVHTGVPTLGGQGNSEK---- 264 (294)
T ss_pred CCcceEEEEecceechHHHHHHHhcccCCCccccchhhhHHHHh-ccccccEEEEeeEEEEEcCCCcCCCcccccc----
Confidence 99999999999999999999999999999999999999999999 6899999999999999999999999886632
Q ss_pred CCCchhhhhccccccCCCCCCCCCChhHHHhhhHHHHHHHHHHHHHhHhc
Q 021149 250 QASDDLEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVED 299 (316)
Q Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~Vr~r~~~E~~~f~~Rw~~A~~~ 299 (316)
+.++|.+||+||++||++|++||++|++|
T Consensus 265 ---------------------~~~~~~~Vr~r~~~E~~~F~~R~~~a~~~ 293 (294)
T PF05212_consen 265 ---------------------GKDPREEVRRRSFAEMRIFQKRWANAVKE 293 (294)
T ss_pred ---------------------CCchHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 45789999999999999999999999986
|
|
| >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function | Back alignment and domain information |
|---|
| >TIGR01556 rhamnosyltran L-rhamnosyltransferase | Back alignment and domain information |
|---|
| >cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans | Back alignment and domain information |
|---|
| >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan | Back alignment and domain information |
|---|
| >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase | Back alignment and domain information |
|---|
| >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis | Back alignment and domain information |
|---|
| >PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A | Back alignment and domain information |
|---|
| >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose | Back alignment and domain information |
|---|
| >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 316 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 2e-04 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 41.8 bits (97), Expect = 2e-04
Identities = 54/406 (13%), Positives = 99/406 (24%), Gaps = 152/406 (37%)
Query: 28 KQKKIV----DQIVRKFPSK---DFVVMLF---HYDGVVDEWKDLVWADRAIHVSAANQT 77
+ K I+ D V F K D + D ++ + R + Q
Sbjct: 17 QYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQE 76
Query: 78 KWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNP----RRYLSIVKDEGLEISQPALDPV 133
+ ++F+ + V NY FL I E P R Y+ + L
Sbjct: 77 E--MVQKFV--EEVLRINYKFLMSP-IKTEQRQPSMMTRMYIEQR--DRLYNDNQVFAKY 129
Query: 134 KSEVHHPITARRRNSKAHRR--------MYKYKGSGR-------CDDYS--TAPPC-IGW 175
P R+ R + GSG+ C Y I W
Sbjct: 130 NVSRLQPYLKLRQALLELRPAKNVLIDGV---LGSGKTWVALDVCLSYKVQCKMDFKIFW 186
Query: 176 VEM------------------------------MAPVFSRAAW---------------RC 190
+ + + + R C
Sbjct: 187 LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENC 246
Query: 191 AWYMIQNDLIHAW---GLDIQLGYCAQGDR----TKNVGVVD---SEYIVHLGL-PTLGV 239
++ ++ +A ++ C + T+ V D + H+ L
Sbjct: 247 --LLVLLNVQNAKAWNAFNLS---C----KILLTTRFKQVTDFLSAATTTHISLDHHSMT 297
Query: 240 TTEPE-----LNTVGQASDDL-EQIA--NPVALA---------PSQSRRYDN-------- 274
T E L + DL ++ NP L+ + + +
Sbjct: 298 LTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTT 357
Query: 275 ---------RPEVRRQSYIEMQIFRNR-----------WKHAVEDD 300
P R+ + + +F W ++ D
Sbjct: 358 IIESSLNVLEPAEYRKMFDRLSVFPPSAHIPTILLSLIWFDVIKSD 403
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 316 | |||
| 1qg8_A | 255 | Protein (spore coat polysaccharide biosynthesis P | 88.31 | |
| 1xhb_A | 472 | Polypeptide N-acetylgalactosaminyltransferase 1; g | 82.21 | |
| 2z86_A | 625 | Chondroitin synthase; GT-A, glycosyltransferase A, | 81.87 |
| >1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* | Back alignment and structure |
|---|
Probab=88.31 E-value=1 Score=37.99 Aligned_cols=119 Identities=8% Similarity=-0.073 Sum_probs=67.4
Q ss_pred ccccEEEEeccccccCCCChHHHHHHHHHh-CCccccccc-----CCCCCceeccccccccCcccceeeeeccCCCCCCC
Q 021149 92 AEYNYIFLWDEDIGVENFNPRRYLSIVKDE-GLEISQPAL-----DPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDD 165 (316)
Q Consensus 92 ~~YdYIflwDDDL~vd~f~i~ry~~Ivr~~-gLeISQPAL-----d~~s~~ish~iT~r~~~~~vHr~~~~~~~~~~C~~ 165 (316)
+..|||++.|+|..++...+.++++.+.+. +..+..+.. +.+ +...+.. . .+...+. .
T Consensus 87 a~g~~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~-~-~~~~~~~----------~--- 150 (255)
T 1qg8_A 87 AEGEYITYATDDNIYMPDRLLKMVRELDTHPEKAVIYSASKTYHLNEN-RDIVKET-V-RPAAQVT----------W--- 150 (255)
T ss_dssp CCCSEEEEEETTEEECTTHHHHHHHHHHHCTTCCEEEEEEEEEEEC----CEEEEE-E-ECCCSCB----------S---
T ss_pred cCCCEEEEeCCCCccChHHHHHHHHHHHhCCCceEEEeceEEEEeCCC-Ccchhhc-c-CchHHHH----------H---
Confidence 578999999999999998999999999876 666666553 221 1111110 0 0000000 0
Q ss_pred CCCCCCccceEEEeccccchHHHHHHh-hh--ccC-CCcccchhhhhhhhhhcCCCCCcEEEEeeceEEecc
Q 021149 166 YSTAPPCIGWVEMMAPVFSRAAWRCAW-YM--IQN-DLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLG 233 (316)
Q Consensus 166 ~~~~ppcTgFVEiMaPVFSR~Awrcvw-~m--iqn-dlvhGWGLD~~w~~c~~g~~~~kiGVVDa~~V~H~~ 233 (316)
....++-..+=+|+|++++.+- .+ ..+ +.....+-|+.+...+.. .+ ++..++...+.|+.
T Consensus 151 -----~~~~~~~~~~~~~rr~~~~~~g~~f~~~~~~~~~~~~~eD~~~~~r~~~-~g-~~~~~~~~~~~~r~ 215 (255)
T 1qg8_A 151 -----NAPCAIDHCSVMHRYSVLEKVKEKFGSYWDESPAFYRIGDARFFWRVNH-FY-PFYPLDEELDLNYI 215 (255)
T ss_dssp -----CCTTTSCGGGEEEETHHHHHHHHHHSSSSCCCGGGSTTHHHHHHHHHTT-TC-CBEEEEEEEEEEEE
T ss_pred -----hhcCCCccceEEEEHHHHHhhcccccccccCCcccchHHHHHHHHHHHH-hC-CEEEecCcEEEEEE
Confidence 0011111234478999998863 22 011 222334567766544432 23 69999998888775
|
| >1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 | Back alignment and structure |
|---|
| >2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 316 | |||
| d1xhba2 | 328 | Polypeptide N-acetylgalactosaminyltransferase 1, N | 82.77 |
| >d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Nucleotide-diphospho-sugar transferases superfamily: Nucleotide-diphospho-sugar transferases family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain species: Mouse (Mus musculus) [TaxId: 10090]
Probab=82.77 E-value=0.62 Score=39.31 Aligned_cols=214 Identities=14% Similarity=0.101 Sum_probs=103.3
Q ss_pred CCCcEEEEEecccchh----hHHHHhhcCCCCCcE-EEEEEecCccCccc-c-c-ccc---CceeEEEeeccch-hhhhh
Q 021149 16 PPMNLLAIAAGIKQKK----IVDQIVRKFPSKDFV-VMLFHYDGVVDEWK-D-L-VWA---DRAIHVSAANQTK-WWFAK 83 (316)
Q Consensus 16 ~~k~Lva~~vG~kqk~----~vd~~v~kf~~~nF~-v~LfhYDg~v~~w~-d-~-ews---~~aiHv~a~kqtK-Ww~ak 83 (316)
.|..=|++|+=..... -+++++.+- ..++. =+++.=||+.|+-- . + ++. ..-|++....+.. .-.+.
T Consensus 21 ~P~vSIIIp~yNe~~~~l~~~l~Si~~qt-~~~~~~EIIvVDdgS~d~~~~~~l~~~~~~~~~~i~vi~~~~n~G~~~a~ 99 (328)
T d1xhba2 21 LPTTSVVIVFHNEAWSTLLRTVHSVINRS-PRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRAR 99 (328)
T ss_dssp CCCEEEEEEESSCCHHHHHHHHHHHHHSS-CGGGEEEEEEEECSCCCGGGTHHHHHHHHSSSSCEEEEECSSCCCHHHHH
T ss_pred CCCEEEEEeccCCcHHHHHHHHHHHHhcC-CCCCCeEEEEEECCCChhhHHHHHHHHHHhcCCCeEEEEecccccchHHH
Confidence 3555677777543333 345555542 23432 24555688877521 1 1 111 1124543322222 11111
Q ss_pred hccCccccccccEEEEeccccccCCCChHHHHHHHHHhCCcccccccCC---CCCcee--ccccccccCccccee--eee
Q 021149 84 RFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDP---VKSEVH--HPITARRRNSKAHRR--MYK 156 (316)
Q Consensus 84 RfLhPdiv~~YdYIflwDDDL~vd~f~i~ry~~Ivr~~gLeISQPALd~---~s~~is--h~iT~r~~~~~vHr~--~~~ 156 (316)
. ..=-.+.-|||++.|+|..++.-.++++++.+.+..-.+..|.++. +...+. ...+.-.-...++.+ .+.
T Consensus 100 N--~Gi~~a~gd~i~flD~D~~~~p~~l~~l~~~~~~~~~~~v~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (328)
T d1xhba2 100 L--KGAAVSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVP 177 (328)
T ss_dssp H--HHHHHCCSSEEEEEESSEEECTTCHHHHHHHHHHCTTEEEEEEEEEECSSSCCEECCCTTEEEEECTTCCEEEEECC
T ss_pred H--HHHHhhhcceeeecCcccccChhHHHHHHHHHhcCCCeEEecceeeeccCceeeccCCccccccccccccccccccc
Confidence 1 1122467899999999999999999999999999888777776532 110000 000000000000000 000
Q ss_pred ccCCCCCCCCCCCCCccceEEEeccccchHHHHHHhhhccCCCcccch---hhhhhhhhhcCCCCCcEEEEeeceEEecc
Q 021149 157 YKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWG---LDIQLGYCAQGDRTKNVGVVDSEYIVHLG 233 (316)
Q Consensus 157 ~~~~~~C~~~~~~ppcTgFVEiMaPVFSR~Awrcvw~miqndlvhGWG---LD~~w~~c~~g~~~~kiGVVDa~~V~H~~ 233 (316)
...........+.+..+.++..-+=.++|++|..+= . ++.....|| +|+-+..... +.+|..+-...|.|..
T Consensus 178 ~~~~~~~~~~~~~~~~~~~~~g~~~~irr~~f~~vG-g-fDe~~~~~g~ED~Dl~~R~~~~---G~~i~~~p~~~v~H~~ 252 (328)
T d1xhba2 178 QREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIG-T-YDAGMDIWGGENLEISFRIWQC---GGTLEIVTCSHVGHVF 252 (328)
T ss_dssp HHHHHHTTTCTTSCEECSBCCSSSEEEEHHHHHHTT-S-CCTTSCTTCCCCSHHHHHHHHT---TCEEEEEEEEEEEEEC
T ss_pred hhhhhccccccccccccceecceeeeeeHHHHHHhC-C-CCCCCcCcCchHHHHHHHHHHh---CCeEEEeCCeEEEEeC
Confidence 000000001111222222332222246899999872 2 455555665 4555544433 4689999999999975
Q ss_pred CCCC
Q 021149 234 LPTL 237 (316)
Q Consensus 234 ~ptl 237 (316)
..+.
T Consensus 253 ~~~~ 256 (328)
T d1xhba2 253 RKAT 256 (328)
T ss_dssp ----
T ss_pred CCCC
Confidence 4333
|