Citrus Sinensis ID: 020804
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 321 | ||||||
| 15228739 | 361 | heme binding protein [Arabidopsis thalia | 0.573 | 0.509 | 0.793 | 3e-88 | |
| 297817576 | 361 | hypothetical protein ARALYDRAFT_907811 [ | 0.573 | 0.509 | 0.782 | 4e-88 | |
| 356499833 | 397 | PREDICTED: uncharacterized protein LOC10 | 0.573 | 0.463 | 0.793 | 2e-86 | |
| 359487547 | 362 | PREDICTED: uncharacterized protein LOC10 | 0.623 | 0.552 | 0.721 | 3e-86 | |
| 255548411 | 364 | conserved hypothetical protein [Ricinus | 0.573 | 0.505 | 0.793 | 3e-86 | |
| 147819855 | 362 | hypothetical protein VITISV_023420 [Viti | 0.623 | 0.552 | 0.717 | 7e-86 | |
| 224128266 | 362 | predicted protein [Populus trichocarpa] | 0.573 | 0.508 | 0.782 | 1e-85 | |
| 356534742 | 365 | PREDICTED: uncharacterized protein LOC10 | 0.573 | 0.504 | 0.777 | 2e-83 | |
| 449436413 | 375 | PREDICTED: uncharacterized protein LOC10 | 0.566 | 0.485 | 0.782 | 4e-83 | |
| 449507823 | 381 | PREDICTED: LOW QUALITY PROTEIN: uncharac | 0.566 | 0.477 | 0.782 | 5e-83 |
| >gi|15228739|ref|NP_191796.1| heme binding protein [Arabidopsis thaliana] gi|7340708|emb|CAB82951.1| putative protein [Arabidopsis thaliana] gi|19423876|gb|AAL87316.1| unknown protein [Arabidopsis thaliana] gi|22136956|gb|AAM91707.1| unknown protein [Arabidopsis thaliana] gi|332646823|gb|AEE80344.1| heme binding protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 146/184 (79%), Positives = 165/184 (89%)
Query: 125 ESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKM 184
ESG+WSCESDSEI+VLA+F+PG+ITLDGH DDW+DIDGSEF L PALDP ++HEY GKM
Sbjct: 23 ESGEWSCESDSEIQVLADFRPGIITLDGHNDDWKDIDGSEFPLRPALDPDSDHEYDAGKM 82
Query: 185 NVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCT 244
VKALHDG D+YFLL++DG Y Y KGEN +CPS+ALMFQIG+ ATYHNMGGCKEG SCT
Sbjct: 83 TVKALHDGRDIYFLLEIDGNYAYDKGENNKCPSVALMFQIGDQATYHNMGGCKEGTDSCT 142
Query: 245 SKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGM 304
SK C+G EVDIMHFSIG+AIPGRLYGGNP+DN EGNGGDRFGHLVD+YAW PHCRYLDG+
Sbjct: 143 SKACRGFEVDIMHFSIGNAIPGRLYGGNPIDNGEGNGGDRFGHLVDIYAWNPHCRYLDGL 202
Query: 305 GPSG 308
GPSG
Sbjct: 203 GPSG 206
|
Source: Arabidopsis thaliana Species: Arabidopsis thaliana Genus: Arabidopsis Family: Brassicaceae Order: Brassicales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|297817576|ref|XP_002876671.1| hypothetical protein ARALYDRAFT_907811 [Arabidopsis lyrata subsp. lyrata] gi|297322509|gb|EFH52930.1| hypothetical protein ARALYDRAFT_907811 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|356499833|ref|XP_003518741.1| PREDICTED: uncharacterized protein LOC100786799 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|359487547|ref|XP_002277687.2| PREDICTED: uncharacterized protein LOC100244357 [Vitis vinifera] gi|296089782|emb|CBI39601.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255548411|ref|XP_002515262.1| conserved hypothetical protein [Ricinus communis] gi|223545742|gb|EEF47246.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|147819855|emb|CAN71815.1| hypothetical protein VITISV_023420 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224128266|ref|XP_002320284.1| predicted protein [Populus trichocarpa] gi|222861057|gb|EEE98599.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356534742|ref|XP_003535911.1| PREDICTED: uncharacterized protein LOC100798285 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449436413|ref|XP_004135987.1| PREDICTED: uncharacterized protein LOC101219938 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|449507823|ref|XP_004163138.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101219938 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 321 | ||||||
| TAIR|locus:2096064 | 361 | AT3G62370 [Arabidopsis thalian | 0.573 | 0.509 | 0.793 | 2e-86 | |
| TAIR|locus:2132168 | 725 | AT4G02010 [Arabidopsis thalian | 0.383 | 0.169 | 0.422 | 1.9e-20 |
| TAIR|locus:2096064 AT3G62370 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 864 (309.2 bits), Expect = 2.0e-86, P = 2.0e-86
Identities = 146/184 (79%), Positives = 165/184 (89%)
Query: 125 ESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKM 184
ESG+WSCESDSEI+VLA+F+PG+ITLDGH DDW+DIDGSEF L PALDP ++HEY GKM
Sbjct: 23 ESGEWSCESDSEIQVLADFRPGIITLDGHNDDWKDIDGSEFPLRPALDPDSDHEYDAGKM 82
Query: 185 NVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCT 244
VKALHDG D+YFLL++DG Y Y KGEN +CPS+ALMFQIG+ ATYHNMGGCKEG SCT
Sbjct: 83 TVKALHDGRDIYFLLEIDGNYAYDKGENNKCPSVALMFQIGDQATYHNMGGCKEGTDSCT 142
Query: 245 SKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGM 304
SK C+G EVDIMHFSIG+AIPGRLYGGNP+DN EGNGGDRFGHLVD+YAW PHCRYLDG+
Sbjct: 143 SKACRGFEVDIMHFSIGNAIPGRLYGGNPIDNGEGNGGDRFGHLVDIYAWNPHCRYLDGL 202
Query: 305 GPSG 308
GPSG
Sbjct: 203 GPSG 206
|
|
| TAIR|locus:2132168 AT4G02010 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 321 | |||
| pfam09459 | 180 | pfam09459, EB_dh, Ethylbenzene dehydrogenase | 2e-20 | |
| smart00887 | 209 | smart00887, EB_dh, Ethylbenzene dehydrogenase | 8e-16 | |
| cd00241 | 158 | cd00241, DOMON_like, Domon-like ligand-binding dom | 1e-04 |
| >gnl|CDD|220250 pfam09459, EB_dh, Ethylbenzene dehydrogenase | Back alignment and domain information |
|---|
Score = 86.3 bits (214), Expect = 2e-20
Identities = 33/146 (22%), Positives = 48/146 (32%), Gaps = 30/146 (20%)
Query: 154 ADDWEDIDGSEFSLLPALDPHAEHEYKGGKMN--VKALHDGHDVYFLLQVDGE---YVYS 208
A DW E L P + + E + KG VKA +DG ++YF L
Sbjct: 1 APDWSKAPPVEIPLYPGPNVYPEPDPKGATKPVTVKAAYDGENIYFRLSWKDPTRSLEKQ 60
Query: 209 KGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRL 268
++ +A+MF G+ G F G+A GR
Sbjct: 61 GEDDYYEDKVAVMFPDGKVTPAAGAGCWLS-------------CHKDARFPAGAAGRGRK 107
Query: 269 YGGNPVDNSEGNGGDRFGHLVDVYAW 294
Y G+ G VD++ W
Sbjct: 108 YMGDS------------GQPVDLWHW 121
|
Eythylbenzene dehydrogenase is a heterotrimer of three subunits that catalyzes the anaerobic degradation of hydrocarbons. The alpha subunit contains the catalytic centre as a Molybdenum cofactor-complex. This removes an electron-pair from the hydrocarbon and passes it along an electron transport system involving iron-sulphur complexes held in the beta subunit and a Haem b molecule contained in the gamma subunit. The electron-pair is then subsequently passed to an as yet unknown receiver. The enzyme is found in a variety of different bacteria. Length = 180 |
| >gnl|CDD|214885 smart00887, EB_dh, Ethylbenzene dehydrogenase | Back alignment and domain information |
|---|
| >gnl|CDD|187675 cd00241, DOMON_like, Domon-like ligand-binding domains | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 321 | |||
| PF09459 | 261 | EB_dh: Ethylbenzene dehydrogenase; InterPro: IPR01 | 99.64 | |
| TIGR03477 | 205 | DMSO_red_II_gam DMSO reductase family type II enzy | 98.16 | |
| PF04478 | 154 | Mid2: Mid2 like cell wall stress sensor; InterPro: | 97.79 | |
| PTZ00382 | 96 | Variant-specific surface protein (VSP); Provisiona | 97.27 | |
| PF01102 | 122 | Glycophorin_A: Glycophorin A; InterPro: IPR001195 | 97.19 | |
| PF15102 | 146 | TMEM154: TMEM154 protein family | 97.04 | |
| PF13908 | 179 | Shisa: Wnt and FGF inhibitory regulator | 96.22 | |
| PF02480 | 439 | Herpes_gE: Alphaherpesvirus glycoprotein E; InterP | 96.2 | |
| PF08693 | 40 | SKG6: Transmembrane alpha-helix domain; InterPro: | 95.95 | |
| PF12273 | 130 | RCR: Chitin synthesis regulation, resistance to Co | 95.6 | |
| PF03302 | 397 | VSP: Giardia variant-specific surface protein; Int | 94.9 | |
| PF04478 | 154 | Mid2: Mid2 like cell wall stress sensor; InterPro: | 94.6 | |
| PF05454 | 290 | DAG1: Dystroglycan (Dystrophin-associated glycopro | 94.55 | |
| PF15345 | 233 | TMEM51: Transmembrane protein 51 | 94.54 | |
| PF14575 | 75 | EphA2_TM: Ephrin type-A receptor 2 transmembrane d | 94.1 | |
| PF06697 | 278 | DUF1191: Protein of unknown function (DUF1191); In | 94.07 | |
| PF01299 | 306 | Lamp: Lysosome-associated membrane glycoprotein (L | 93.86 | |
| PF07204 | 98 | Orthoreo_P10: Orthoreovirus membrane fusion protei | 93.77 | |
| PF08374 | 221 | Protocadherin: Protocadherin; InterPro: IPR013585 | 93.62 | |
| PF12768 | 281 | Rax2: Cortical protein marker for cell polarity | 93.19 | |
| PF06452 | 185 | DUF1083: Domain of unknown function (DUF1083); Int | 93.09 | |
| PF06679 | 163 | DUF1180: Protein of unknown function (DUF1180); In | 92.77 | |
| PF12877 | 684 | DUF3827: Domain of unknown function (DUF3827); Int | 92.72 | |
| PF01034 | 64 | Syndecan: Syndecan domain; InterPro: IPR001050 The | 92.52 | |
| PF13908 | 179 | Shisa: Wnt and FGF inhibitory regulator | 92.03 | |
| cd00005 | 186 | CBM9 Family 9 carbohydrate-binding module (CBM), p | 91.92 | |
| PF02439 | 38 | Adeno_E3_CR2: Adenovirus E3 region protein CR2; In | 91.86 | |
| PF07213 | 79 | DAP10: DAP10 membrane protein; InterPro: IPR009861 | 91.01 | |
| PF15069 | 143 | FAM163: FAM163 family | 89.79 | |
| PF05568 | 189 | ASFV_J13L: African swine fever virus J13L protein; | 89.22 | |
| PF10873 | 155 | DUF2668: Protein of unknown function (DUF2668); In | 89.08 | |
| PF15102 | 146 | TMEM154: TMEM154 protein family | 88.68 | |
| PF02480 | 439 | Herpes_gE: Alphaherpesvirus glycoprotein E; InterP | 86.34 | |
| COG3889 | 872 | Predicted solute binding protein [General function | 86.17 | |
| PF14610 | 189 | DUF4448: Protein of unknown function (DUF4448) | 86.04 | |
| PHA03265 | 402 | envelope glycoprotein D; Provisional | 85.71 | |
| PF06365 | 202 | CD34_antigen: CD34/Podocalyxin family; InterPro: I | 85.56 | |
| PF15065 | 350 | NCU-G1: Lysosomal transcription factor, NCU-G1 | 84.98 | |
| TIGR01478 | 295 | STEVOR variant surface antigen, stevor family. Thi | 83.58 | |
| PF02009 | 299 | Rifin_STEVOR: Rifin/stevor family; InterPro: IPR00 | 83.58 | |
| PF05393 | 94 | Hum_adeno_E3A: Human adenovirus early E3A glycopro | 83.49 | |
| PLN03150 | 623 | hypothetical protein; Provisional | 83.05 | |
| PF12191 | 129 | stn_TNFRSF12A: Tumour necrosis factor receptor stn | 82.89 | |
| PF01034 | 64 | Syndecan: Syndecan domain; InterPro: IPR001050 The | 82.86 | |
| PF05808 | 162 | Podoplanin: Podoplanin; InterPro: IPR008783 This f | 82.5 | |
| PF05083 | 74 | LST1: LST-1 protein; InterPro: IPR007775 B144/LST1 | 82.13 | |
| PF05454 | 290 | DAG1: Dystroglycan (Dystrophin-associated glycopro | 81.61 | |
| PTZ00370 | 296 | STEVOR; Provisional | 80.7 |
| >PF09459 EB_dh: Ethylbenzene dehydrogenase; InterPro: IPR019020 This entry represents a haem-binding domain found in cytochromes b558/566 (subunit A), c-551 and c-552, as well as in members of the type-II members of the microbial dimethyl sulphoxide (DMSO) reductase family | Back alignment and domain information |
|---|
Probab=99.64 E-value=5.3e-17 Score=149.46 Aligned_cols=127 Identities=32% Similarity=0.585 Sum_probs=60.5
Q ss_pred CCcccccCccccccccCCCC--CCCCCCCCeeEEEEeecCccEEEEEEecC---cee------eec--CCcccCCceeee
Q 020804 155 DDWEDIDGSEFSLLPALDPH--AEHEYKGGKMNVKALHDGHDVYFLLQVDG---EYV------YSK--GENTRCPSIALM 221 (321)
Q Consensus 155 ~dw~~i~~~~~~l~pal~~d--~~~~y~~g~~~vk~~hdg~~~ffll~v~g---~y~------~~~--~~~~~c~svalm 221 (321)
.+|++|+..+++|.|.+++. +..++....|+|||+|||++||||||.+. ++. |.+ ....-+..+|+|
T Consensus 2 ~~W~~~p~~~v~L~pg~~~~p~~~~~~~~~~v~VkAa~dg~~Iyfll~W~d~t~~~~~~p~~~~~~~~~~~~yeDk~Avm 81 (261)
T PF09459_consen 2 PDWSKAPPVEVPLYPGQSSYPEPPPKGGTIPVEVKAAHDGENIYFLLEWPDPTRSYERHPDGGWVQAGEDDYYEDKVAVM 81 (261)
T ss_dssp HHHHTS-EEEEE-EE--GGG----T-----EEEEEEEE-SSEEEEEEEEE-----S------------STT----EEEEE
T ss_pred chhccCCCeEEEECCCccCCccccCCCCcEEEEEEEEECCCeEEEEEEecCCCCCccccccccccccCCCCcCcceEEEE
Confidence 58999999999999996543 44588889999999999999999999988 233 223 567788999999
Q ss_pred eeecCCceeeecCCCCCCCCcccccccCCceeeEEEEEeccccCceecCCCCCCCCCCCCCCccccceeeeecCC-----
Q 020804 222 FQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTP----- 296 (321)
Q Consensus 222 f~~g~~a~y~nmggc~~~~~~c~~~~c~g~evdi~hf~i~~aipgr~yg~n~~d~~~g~g~d~~g~l~d~yawnp----- 296 (321)
|.+| ++.++.+.|| ..+|...+.|+.|+.+++ ||.|-+.. |+++|++.|++
T Consensus 82 f~~g-~v~~~~~~Gc--------~~~ch~~~~~~p~~~~~~---~~ky~~~~------------g~~vdlW~Wka~r~~~ 137 (261)
T PF09459_consen 82 FSDG-DVPYFGQDGC--------WHTCHKPLRDMPAAPIGR---GRKYMGDS------------GEPVDLWHWKASRSGM 137 (261)
T ss_dssp E----------------------------ESST--T--GG-------GT-BT------------TB-EEEEEEET-----
T ss_pred eeec-cccccccccc--------cccccCCcccccCCCccc---ceeeeCCC------------CeEEEEEEeccccccc
Confidence 9999 8888865554 678999999999998887 88898874 99999999999
Q ss_pred --------cccccCCCC
Q 020804 297 --------HCRYLDGMG 305 (321)
Q Consensus 297 --------hcr~~dg~~ 305 (321)
+|||.+|.|
T Consensus 138 ~~d~~~~~~r~~~~G~g 154 (261)
T PF09459_consen 138 ADDGYVFGKRRYDAGYG 154 (261)
T ss_dssp -----------------
T ss_pred ccccccccccccccccc
Confidence 899999998
|
The DMSO reductase family is a large and rapidly expanding group of enzymes found in bacteria and archaea that share a common form of molybdenum cofactor known as bis(molybdopterin guanine dinucleotide)Mo []. In addition to the molybdopterin subunit, these enzymes also contain an iron-sulphur subunit. These include two distinct but very closely related periplasmic proteins of anaerobic respiration: selenate reductase and chlorate reductase []. Other proteins containing this subunit include dimethyl sulphide dehydrogenase and ethylbenzene dehydrogenase [, , ]. One member of the DMSO reductase family is eythylbenzene dehydrogenase, which is a heterotrimer of three subunits that catalyses the anaerobic degradation of hydrocarbons (alpha, beta and gamma subunits). This entry matches the gamma subunit, whose structure is known []. The alpha subunit contains the catalytic centre as a Molybdenum cofactor-complex. This removes an electron-pair from the hydrocarbon and passes it along an electron transport system involving iron-sulphur complexes held in the beta subunit and a Haem b molecule contained in the gamma subunit. The electron-pair is then subsequently passed to an as yet unknown receiver. The enzyme is found in a variety of different bacteria.; GO: 0020037 heme binding; PDB: 2IVF_C. |
| >TIGR03477 DMSO_red_II_gam DMSO reductase family type II enzyme, heme b subunit | Back alignment and domain information |
|---|
| >PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region | Back alignment and domain information |
|---|
| >PTZ00382 Variant-specific surface protein (VSP); Provisional | Back alignment and domain information |
|---|
| >PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane | Back alignment and domain information |
|---|
| >PF15102 TMEM154: TMEM154 protein family | Back alignment and domain information |
|---|
| >PF13908 Shisa: Wnt and FGF inhibitory regulator | Back alignment and domain information |
|---|
| >PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein | Back alignment and domain information |
|---|
| >PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ] | Back alignment and domain information |
|---|
| >PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls | Back alignment and domain information |
|---|
| >PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein) | Back alignment and domain information |
|---|
| >PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region | Back alignment and domain information |
|---|
| >PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5 | Back alignment and domain information |
|---|
| >PF15345 TMEM51: Transmembrane protein 51 | Back alignment and domain information |
|---|
| >PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A | Back alignment and domain information |
|---|
| >PF06697 DUF1191: Protein of unknown function (DUF1191); InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function | Back alignment and domain information |
|---|
| >PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear | Back alignment and domain information |
|---|
| >PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences | Back alignment and domain information |
|---|
| >PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains | Back alignment and domain information |
|---|
| >PF12768 Rax2: Cortical protein marker for cell polarity | Back alignment and domain information |
|---|
| >PF06452 DUF1083: Domain of unknown function (DUF1083); InterPro: IPR010502 This entry represents the family 9 carbohydrate-binding module (CBD9), which exhibit an immunoglobulin-like beta-sandwich fold, with an additional beta-strand at the N terminus [] | Back alignment and domain information |
|---|
| >PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins | Back alignment and domain information |
|---|
| >PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ] | Back alignment and domain information |
|---|
| >PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ] | Back alignment and domain information |
|---|
| >PF13908 Shisa: Wnt and FGF inhibitory regulator | Back alignment and domain information |
|---|
| >cd00005 CBM9 Family 9 carbohydrate-binding module (CBM), plays a role in microbial degradation of cellulose and hemicellulose found in plants; previously called cellulose-binding domain; the binding sites of the CBMs for which structures have been determined are of two general types: flat surfaces comprising predominantly aromatic residues tryptophan and tyrosine and extended shallow grooves; this domain frequently occurs in tandem | Back alignment and domain information |
|---|
| >PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host [] | Back alignment and domain information |
|---|
| >PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins | Back alignment and domain information |
|---|
| >PF15069 FAM163: FAM163 family | Back alignment and domain information |
|---|
| >PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ] | Back alignment and domain information |
|---|
| >PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [] | Back alignment and domain information |
|---|
| >PF15102 TMEM154: TMEM154 protein family | Back alignment and domain information |
|---|
| >PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein | Back alignment and domain information |
|---|
| >COG3889 Predicted solute binding protein [General function prediction only] | Back alignment and domain information |
|---|
| >PF14610 DUF4448: Protein of unknown function (DUF4448) | Back alignment and domain information |
|---|
| >PHA03265 envelope glycoprotein D; Provisional | Back alignment and domain information |
|---|
| >PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins | Back alignment and domain information |
|---|
| >PF15065 NCU-G1: Lysosomal transcription factor, NCU-G1 | Back alignment and domain information |
|---|
| >TIGR01478 STEVOR variant surface antigen, stevor family | Back alignment and domain information |
|---|
| >PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world | Back alignment and domain information |
|---|
| >PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2 | Back alignment and domain information |
|---|
| >PLN03150 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins | Back alignment and domain information |
|---|
| >PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ] | Back alignment and domain information |
|---|
| >PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [] | Back alignment and domain information |
|---|
| >PF05083 LST1: LST-1 protein; InterPro: IPR007775 B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length | Back alignment and domain information |
|---|
| >PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5 | Back alignment and domain information |
|---|
| >PTZ00370 STEVOR; Provisional | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 321 | |||
| 2ivf_C | 214 | Ethylbenzene dehydrogenase gamma-subunit; anaerobi | 3e-18 |
| >2ivf_C Ethylbenzene dehydrogenase gamma-subunit; anaerobic hydrocarbon degradation, MOCO, Fe/S cluster, MO- B enzyme, DMSO reductase family; HET: MES MGD MD1 HEM; 1.88A {Aromatoleum aromaticum} Length = 214 | Back alignment and structure |
|---|
Score = 81.0 bits (199), Expect = 3e-18
Identities = 23/142 (16%), Positives = 44/142 (30%), Gaps = 24/142 (16%)
Query: 148 ITLDGHADDWEDIDGSEFSLLPA-------LDPHAEHEYKGG---KMNVKALHDGHDVYF 197
+ LD A W + + F + P + P G +++V ALH+G +
Sbjct: 12 LLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIAL 71
Query: 198 LLQ-VDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIM 256
L+ ++ N+ + MF + A MG G V+
Sbjct: 72 RLKWASEKHDKIVDLNSFVDGVGAMFPVARGAQAVTMGA-------------TGRPVNAW 118
Query: 257 HFSIGSAIPGRLYGGNPVDNSE 278
++ + P +
Sbjct: 119 YWKANANEPMEIVAEGFSAVRR 140
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 321 | |||
| 2ivf_C | 214 | Ethylbenzene dehydrogenase gamma-subunit; anaerobi | 99.85 | |
| 2k1k_A | 38 | Ephrin type-A receptor 1; EPHA1, receptor tyrosine | 96.84 | |
| 2ks1_B | 44 | Epidermal growth factor receptor; ERBB1, ERBB2, tr | 96.13 | |
| 1i82_A | 189 | Xylanase A, endo-1,4-beta-xylanase A; cellobiose c | 96.09 | |
| 2l2t_A | 44 | Receptor tyrosine-protein kinase ERBB-4; transmemb | 95.47 | |
| 2jwa_A | 44 | Receptor tyrosine-protein kinase ERBB-2; transmemb | 95.29 | |
| 1q55_A | 880 | EP-cadherin, C-cadherin; trans interaction, desmos | 92.36 | |
| 2k9y_A | 41 | Ephrin type-A receptor 2; receptor tyrosine kinase | 92.26 | |
| 2knc_A | 54 | Integrin alpha-IIB; transmembrane signaling, prote | 90.17 | |
| 2l8s_A | 54 | Integrin alpha-1; transmembrane region, detergent | 88.04 | |
| 2klu_A | 70 | T-cell surface glycoprotein CD4; cell membrane, di | 86.27 | |
| 2l34_A | 33 | TYRO protein tyrosine kinase-binding protein; immu | 82.72 | |
| 2k1a_A | 42 | Integrin alpha-IIB; single-PASS transmembrane segm | 80.75 |
| >2ivf_C Ethylbenzene dehydrogenase gamma-subunit; anaerobic hydrocarbon degradation, MOCO, Fe/S cluster, MO- B enzyme, DMSO reductase family; HET: MES MGD MD1 HEM; 1.88A {Aromatoleum aromaticum} | Back alignment and structure |
|---|
Probab=99.85 E-value=3.7e-22 Score=179.82 Aligned_cols=119 Identities=21% Similarity=0.305 Sum_probs=107.7
Q ss_pred eeeecCCCCCcccccCcccccccc-----CCCCC-----CCCCCCCeeEEEEeecCccEEEEEEecCceeeecCC-cccC
Q 020804 147 LITLDGHADDWEDIDGSEFSLLPA-----LDPHA-----EHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGE-NTRC 215 (321)
Q Consensus 147 ~~tldg~~~dw~~i~~~~~~l~pa-----l~~d~-----~~~y~~g~~~vk~~hdg~~~ffll~v~g~y~~~~~~-~~~c 215 (321)
.|++|+++.+|++++..+++|.|+ +|+++ ..++...+|+|||+|||++|||||+.+.+|++...+ ...|
T Consensus 11 ~~~~d~~~~~W~~ap~~~v~L~~~~~~~p~~~~~~~~~~~~~~~~~~v~VkAa~dg~~i~f~l~W~D~t~~~~~~~~~f~ 90 (214)
T 2ivf_C 11 ELLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIALRLKWASEKHDKIVDLNSFV 90 (214)
T ss_dssp HHHTCTTCHHHHTSCEEEEECEECCGGGGTTTCTTGGGCCSCCCCCEEEEEEEECSSEEEEEEEEECCCCCSCCSTTCCC
T ss_pred cccCCCChHHHhcCCceEEEccCCccccccccccccccccCCCCceEEEEEEEECCCeEEEEEEECCCCCCccccccccC
Confidence 568999999999999999999999 88887 788999999999999999999999999999998777 7779
Q ss_pred CceeeeeeecCCceeeecCCCCCCCCcccccccCCceeeEEEEEeccccCceecCCCCCCCCCCCCCCcccccee
Q 020804 216 PSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVD 290 (321)
Q Consensus 216 ~svalmf~~g~~a~y~nmggc~~~~~~c~~~~c~g~evdi~hf~i~~aipgr~yg~n~~d~~~g~g~d~~g~l~d 290 (321)
..||+||++|++++|+.||. .|++|||+||..+. |.++|..++| ||++.+
T Consensus 91 D~vAvmfp~~~~~~~~~MG~-------------~~~~vdiw~Wka~~---------~~~~~~~a~G---fgs~~~ 140 (214)
T 2ivf_C 91 DGVGAMFPVARGAQAVTMGA-------------TGRPVNAWYWKANA---------NEPMEIVAEG---FSAVRR 140 (214)
T ss_dssp CEEEEEEESSTTCCGGGTCB-------------TTBCEEEEEEETTC---------SSCEEEEESS---TTSEEE
T ss_pred ceEEEEeEcCCCCcccccCC-------------CCcEEEEEEEecCC---------CcceeeccCC---cccccc
Confidence 99999999999999999994 67999999999875 3577777766 999887
|
| >2k1k_A Ephrin type-A receptor 1; EPHA1, receptor tyrosine kinase, dimeric transmembrane domain, ATP-binding, glycoprotein, nucleotide-binding; NMR {Homo sapiens} PDB: 2k1l_A | Back alignment and structure |
|---|
| >2ks1_B Epidermal growth factor receptor; ERBB1, ERBB2, transmembrane, heterodimer, complex, tyrosine receptor, bicelles, transferase; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >1i82_A Xylanase A, endo-1,4-beta-xylanase A; cellobiose complex, hydrolase; HET: BGC; 1.90A {Thermotoga maritima} SCOP: b.1.9.2 PDB: 1i8a_A* 1i8u_A | Back alignment and structure |
|---|
| >2l2t_A Receptor tyrosine-protein kinase ERBB-4; transmembrane dimer, membrane domain, membrane protei; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >2jwa_A Receptor tyrosine-protein kinase ERBB-2; transmembrane helix dimer, protein kinase receptor membrane domain, ATP-binding, glycoprotein; NMR {Homo sapiens} PDB: 2ks1_A | Back alignment and structure |
|---|
| >1q55_A EP-cadherin, C-cadherin; trans interaction, desmosome, junction, adhesion, structural protein; HET: NAG NDG; 30.00A {Mus musculus} SCOP: i.20.1.1 PDB: 1q5a_A* 1q5b_A* 1q5c_A* | Back alignment and structure |
|---|
| >2k9y_A Ephrin type-A receptor 2; receptor tyrosine kinase, membrane protein, dimeric transmembrane domain, ephrin receptor, ATP-binding, glycoprotein; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >2knc_A Integrin alpha-IIB; transmembrane signaling, protein structure, cell A cleavage on PAIR of basic residues, disease mutation, disul bond, glycoprotein; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >2l8s_A Integrin alpha-1; transmembrane region, detergent micelle, CE adhesion; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >2klu_A T-cell surface glycoprotein CD4; cell membrane, disulfide bond, HOST- virus interaction, immune response, immunoglobulin domain, lipoprotein; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >2l34_A TYRO protein tyrosine kinase-binding protein; immunoreceptor, transmembrane assembly, DAP12, protein bindi; NMR {Homo sapiens} PDB: 2l35_B | Back alignment and structure |
|---|
| >2k1a_A Integrin alpha-IIB; single-PASS transmembrane segment, alternative splicing, calcium, cell adhesion, cleavage on PAIR of basic residues; NMR {Homo sapiens} PDB: 2k9j_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 321 | |||
| d1i8aa_ | 189 | Xylanase 10A {Thermotoga maritima [TaxId: 2336]} | 95.68 |
| >d1i8aa_ b.1.9.2 (A:) Xylanase 10A {Thermotoga maritima [TaxId: 2336]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: Immunoglobulin-like beta-sandwich superfamily: CBD9-like family: Family 9 carbohydrate-binding module, CBD9 domain: Xylanase 10A species: Thermotoga maritima [TaxId: 2336]
Probab=95.68 E-value=0.0059 Score=48.85 Aligned_cols=76 Identities=18% Similarity=0.289 Sum_probs=49.9
Q ss_pred eCceeeecCCCCC-cccccCccccccccCCCCCCCCCCCCeeEEEEeecCccEEEEEEecCceeeecC-CcccCCceeee
Q 020804 144 KPGLITLDGHADD-WEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKG-ENTRCPSIALM 221 (321)
Q Consensus 144 ~pg~~tldg~~~d-w~~i~~~~~~l~pal~~d~~~~y~~g~~~vk~~hdg~~~ffll~v~g~y~~~~~-~~~~c~svalm 221 (321)
+=|.|||||..++ |+..+-.. ++-...+..... -+-+||+++|.+.+||+++|--+...... .--..-+|.++
T Consensus 6 ~~g~p~IDG~lde~W~~a~~~~--~~~~~~~~~~~~---~~t~v~~~~D~~~LYv~~~~~D~~~~~~~~~~~~~D~vei~ 80 (189)
T d1i8aa_ 6 KYGTPVIDGEIDEIWNTTEEIE--TKAVAMGSLDKN---ATAKVRVLWDENYLYVLAIVKDPVLNKDNSNPWEQDSVEIF 80 (189)
T ss_dssp EECCCCSSSSCCGGGGGSCEEE--CCEEEESCTTTS---CEEEEEEEECSSEEEEEEEEECSSCCCCSSSGGGSSEEEEE
T ss_pred ccCCCEECccCChHHhcCcccc--cceeccCCCCCC---CcEEEEEEEecCeEEEEEEEEcCCcccccCCccCCCeEEEE
Confidence 3499999999999 99876543 333333333322 36799999999999999999765432211 11123567777
Q ss_pred eee
Q 020804 222 FQI 224 (321)
Q Consensus 222 f~~ 224 (321)
|.-
T Consensus 81 id~ 83 (189)
T d1i8aa_ 81 IDE 83 (189)
T ss_dssp EES
T ss_pred EcC
Confidence 754
|