Citrus Sinensis ID: 020804

Local Sequence Feature Prediction

Prediction and (Method)	Result

Residue Number Marker

Protein Sequence

Secondary Structure (PSIPRED)

Secondary Structure Prediction (SSPRO)

Coil and Loop (DISEMBL)

Flexible Loop (DISEMBL)

Low Complexity Region (SEG)

Disordered region (IsUnstruct)

Disordered Region (DISOPRED)

Disordered Region (DISEMBL)

Disordered Region (DISPRO)

Transmembrane Helix (TMHMM)

Transmembrane Helix (HMMTOP)

Transmembrane Helix (MEMSAT)

TM Helix, Signal Peptide (MEMSAT_SVM)

TM Helix, Signal Peptide (Phobius)

Signal Peptide (SignalP HMM Mode)

Signal Peptide (SignalP NN Mode)

Coiled Coils (COILS)

Positional Conservation

--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-

MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAPLASSPPMKAPTHRASPSLPSSTSNKGKHSNLILLFGIGTGLLITAIISVLIICSCAFRRRNSKASPKETESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK

ccccccccccccHHHHHHHcccccccEEEEEccEEccEEEEEEEEcccccccccccccccccccccccccccccccccccccccccEEEEEHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccEEEEEEEEccEEEEccccccccccccccccccccccccccccccccEEEEEEEEccccEEEEEEEcccEEEEcccccccccEEEEEEEccccEEEEccccccccccccccccccEEEEEEEEEEccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccccccc

ccccccccEEEcHHHHHHHHHHHHccEEEEcccEEccEEEEEEEccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccEEEEEEEEcccEEEEccccccHHccccccccccccccccccccccccEEEEEEEEcccEEEEEEEEcccEEEEccccccccEEEEEEEcccccEEEccccccccccccccccccccEEEEEEEEEcccccccEcccccccccccccccccccccEEEccccccccccccccccccHcccHHEEccc

mditphtgisfsasdAAAINFSLtthkvhfdstlvgdykllnftwfeppapsqaplassppmkapthraspslpsstsnkgkhsNLILLFGIGTGLLITAIISVLIICSCafrrrnskaspketesgkwscesdsEIRVLAEfkpglitldghaddwedidgsefsllpaldphaeheykggkmnvkalhdghDVYFLLQVdgeyvyskgentrcpsIALMFQIgedatyhnmggckegigsctsktckghevdimhfsigsaipgrlyggnpvdnsegnggdrfgHLVDVYawtphcryldgmgpsgiklnhfrsfishk

mditphtgisfsasDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAPLASSPPMKAPTHRASPSLPSSTSNKGKHSNLILLFGIGTGLLITAIISVLIICSCAFrrrnskaspketesgkwscesdsEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK

MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEppapsqaplassppMKAPTHRASPSLPSSTSNKGKHSNlillfgigtgllitaiiSVLIICSCAFRRRNSKASPKETESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK

*************SDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFE*************************************NLILLFGIGTGLLITAIISVLIICSCAFR**********************EIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVD****NGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSF****

*DITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPP*************************************ILLFGIGTGLLITAIISVLIICSCA****************************LAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHF*SFI***

MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAP*****************************GKHSNLILLFGIGTGLLITAIISVLIICSCAFRRR******************DSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK

MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAP***************************HSNLILLFGIGTGLLITAIISVLIICSCAFRR****************CESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISH*

oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii

oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

Original result of BLAST against SWISS-PROT Database

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST

Original result of BLAST against Nonredundant Database

GI	Alignment Graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		321
15228739		361	heme binding protein [Arabidopsis thalia	0.573	0.509	0.793	3e-88
297817576		361	hypothetical protein ARALYDRAFT_907811 [	0.573	0.509	0.782	4e-88
356499833		397	PREDICTED: uncharacterized protein LOC10	0.573	0.463	0.793	2e-86
359487547		362	PREDICTED: uncharacterized protein LOC10	0.623	0.552	0.721	3e-86
255548411		364	conserved hypothetical protein [Ricinus	0.573	0.505	0.793	3e-86
147819855		362	hypothetical protein VITISV_023420 [Viti	0.623	0.552	0.717	7e-86
224128266		362	predicted protein [Populus trichocarpa]	0.573	0.508	0.782	1e-85
356534742		365	PREDICTED: uncharacterized protein LOC10	0.573	0.504	0.777	2e-83
449436413		375	PREDICTED: uncharacterized protein LOC10	0.566	0.485	0.782	4e-83
449507823		381	PREDICTED: LOW QUALITY PROTEIN: uncharac	0.566	0.477	0.782	5e-83

>gi\|15228739\|ref\|NP_191796.1\| heme binding protein [Arabidopsis thaliana] gi\|7340708\|emb\|CAB82951.1\| putative protein [Arabidopsis thaliana] gi\|19423876\|gb\|AAL87316.1\| unknown protein [Arabidopsis thaliana] gi\|22136956\|gb\|AAM91707.1\| unknown protein [Arabidopsis thaliana] gi\|332646823\|gb\|AEE80344.1\| heme binding protein [Arabidopsis thaliana]	Back alignment and taxonomy information

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 146/184 (79%), Positives = 165/184 (89%)

Query: 125 ESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKM 184
           ESG+WSCESDSEI+VLA+F+PG+ITLDGH DDW+DIDGSEF L PALDP ++HEY  GKM
Sbjct: 23  ESGEWSCESDSEIQVLADFRPGIITLDGHNDDWKDIDGSEFPLRPALDPDSDHEYDAGKM 82

Query: 185 NVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCT 244
            VKALHDG D+YFLL++DG Y Y KGEN +CPS+ALMFQIG+ ATYHNMGGCKEG  SCT
Sbjct: 83  TVKALHDGRDIYFLLEIDGNYAYDKGENNKCPSVALMFQIGDQATYHNMGGCKEGTDSCT 142

Query: 245 SKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGM 304
           SK C+G EVDIMHFSIG+AIPGRLYGGNP+DN EGNGGDRFGHLVD+YAW PHCRYLDG+
Sbjct: 143 SKACRGFEVDIMHFSIGNAIPGRLYGGNPIDNGEGNGGDRFGHLVDIYAWNPHCRYLDGL 202

Query: 305 GPSG 308
           GPSG
Sbjct: 203 GPSG 206

Source: Arabidopsis thaliana

Species: Arabidopsis thaliana

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi\|297817576\|ref\|XP_002876671.1\| hypothetical protein ARALYDRAFT_907811 [Arabidopsis lyrata subsp. lyrata] gi\|297322509\|gb\|EFH52930.1\| hypothetical protein ARALYDRAFT_907811 [Arabidopsis lyrata subsp. lyrata]	Back alignment and taxonomy information

>gi\|356499833\|ref\|XP_003518741.1\| PREDICTED: uncharacterized protein LOC100786799 [Glycine max]	Back alignment and taxonomy information

>gi\|359487547\|ref\|XP_002277687.2\| PREDICTED: uncharacterized protein LOC100244357 [Vitis vinifera] gi\|296089782\|emb\|CBI39601.3\| unnamed protein product [Vitis vinifera]	Back alignment and taxonomy information

>gi\|255548411\|ref\|XP_002515262.1\| conserved hypothetical protein [Ricinus communis] gi\|223545742\|gb\|EEF47246.1\| conserved hypothetical protein [Ricinus communis]	Back alignment and taxonomy information

>gi\|147819855\|emb\|CAN71815.1\| hypothetical protein VITISV_023420 [Vitis vinifera]	Back alignment and taxonomy information

>gi\|224128266\|ref\|XP_002320284.1\| predicted protein [Populus trichocarpa] gi\|222861057\|gb\|EEE98599.1\| predicted protein [Populus trichocarpa]	Back alignment and taxonomy information

>gi\|356534742\|ref\|XP_003535911.1\| PREDICTED: uncharacterized protein LOC100798285 [Glycine max]	Back alignment and taxonomy information

>gi\|449436413\|ref\|XP_004135987.1\| PREDICTED: uncharacterized protein LOC101219938 [Cucumis sativus]	Back alignment and taxonomy information

>gi\|449507823\|ref\|XP_004163138.1\| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101219938 [Cucumis sativus]	Back alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

Original result of BLAST against Gene Ontology (AMIGO)

ID	Alignment graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		321
TAIR\|locus:2096064		361	AT3G62370 [Arabidopsis thalian	0.573	0.509	0.793	2e-86
TAIR\|locus:2132168		725	AT4G02010 [Arabidopsis thalian	0.383	0.169	0.422	1.9e-20

TAIR\|locus:2096064 AT3G62370 [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

 Score = 864 (309.2 bits), Expect = 2.0e-86, P = 2.0e-86
 Identities = 146/184 (79%), Positives = 165/184 (89%)

Query:   125 ESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKM 184
             ESG+WSCESDSEI+VLA+F+PG+ITLDGH DDW+DIDGSEF L PALDP ++HEY  GKM
Sbjct:    23 ESGEWSCESDSEIQVLADFRPGIITLDGHNDDWKDIDGSEFPLRPALDPDSDHEYDAGKM 82

Query:   185 NVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCT 244
              VKALHDG D+YFLL++DG Y Y KGEN +CPS+ALMFQIG+ ATYHNMGGCKEG  SCT
Sbjct:    83 TVKALHDGRDIYFLLEIDGNYAYDKGENNKCPSVALMFQIGDQATYHNMGGCKEGTDSCT 142

Query:   245 SKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGM 304
             SK C+G EVDIMHFSIG+AIPGRLYGGNP+DN EGNGGDRFGHLVD+YAW PHCRYLDG+
Sbjct:   143 SKACRGFEVDIMHFSIGNAIPGRLYGGNPIDNGEGNGGDRFGHLVDIYAWNPHCRYLDGL 202

Query:   305 GPSG 308
             GPSG
Sbjct:   203 GPSG 206

GO:0020037 "heme binding" evidence=IEA

GO:0005794 "Golgi apparatus" evidence=IDA

GO:0005768 "endosome" evidence=IDA

GO:0005802 "trans-Golgi network" evidence=IDA

TAIR\|locus:2132168 AT4G02010 [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

Original result of BLAST against SWISS-PROT

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server

Original result from Ezypred Server

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Original result from the STRING server

Fail to connect to STRING server

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

Original result of RPS-BLAST against CDD database part I

ID	Alignment Graph	Length	Definition	E-value
Query		321
pfam09459		180	pfam09459, EB_dh, Ethylbenzene dehydrogenase	2e-20
smart00887		209	smart00887, EB_dh, Ethylbenzene dehydrogenase	8e-16
cd00241		158	cd00241, DOMON_like, Domon-like ligand-binding dom	1e-04

>gnl\|CDD\|220250 pfam09459, EB_dh, Ethylbenzene dehydrogenase	Back alignment and domain information

 Score = 86.3 bits (214), Expect = 2e-20
 Identities = 33/146 (22%), Positives = 48/146 (32%), Gaps = 30/146 (20%)

Query: 154 ADDWEDIDGSEFSLLPALDPHAEHEYKGGKMN--VKALHDGHDVYFLLQVDGE---YVYS 208
           A DW      E  L P  + + E + KG      VKA +DG ++YF L            
Sbjct: 1   APDWSKAPPVEIPLYPGPNVYPEPDPKGATKPVTVKAAYDGENIYFRLSWKDPTRSLEKQ 60

Query: 209 KGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRL 268
             ++     +A+MF  G+       G                       F  G+A  GR 
Sbjct: 61  GEDDYYEDKVAVMFPDGKVTPAAGAGCWLS-------------CHKDARFPAGAAGRGRK 107

Query: 269 YGGNPVDNSEGNGGDRFGHLVDVYAW 294
           Y G+             G  VD++ W
Sbjct: 108 YMGDS------------GQPVDLWHW 121

Eythylbenzene dehydrogenase is a heterotrimer of three subunits that catalyzes the anaerobic degradation of hydrocarbons. The alpha subunit contains the catalytic centre as a Molybdenum cofactor-complex. This removes an electron-pair from the hydrocarbon and passes it along an electron transport system involving iron-sulphur complexes held in the beta subunit and a Haem b molecule contained in the gamma subunit. The electron-pair is then subsequently passed to an as yet unknown receiver. The enzyme is found in a variety of different bacteria. Length = 180

>gnl\|CDD\|214885 smart00887, EB_dh, Ethylbenzene dehydrogenase	Back alignment and domain information

>gnl\|CDD\|187675 cd00241, DOMON_like, Domon-like ligand-binding domains	Back alignment and domain information

Conserved Domains Detected by HHsearch

Original result of HHsearch against CDD database

ID	Alignment Graph	Length	Definition	Probability
Query		321
PF09459		261	EB_dh: Ethylbenzene dehydrogenase; InterPro: IPR01	99.64
TIGR03477		205	DMSO_red_II_gam DMSO reductase family type II enzy	98.16
PF04478		154	Mid2: Mid2 like cell wall stress sensor; InterPro:	97.79
PTZ00382		96	Variant-specific surface protein (VSP); Provisiona	97.27
PF01102		122	Glycophorin_A: Glycophorin A; InterPro: IPR001195	97.19
PF15102		146	TMEM154: TMEM154 protein family	97.04
PF13908		179	Shisa: Wnt and FGF inhibitory regulator	96.22
PF02480		439	Herpes_gE: Alphaherpesvirus glycoprotein E; InterP	96.2
PF08693		40	SKG6: Transmembrane alpha-helix domain; InterPro:	95.95
PF12273		130	RCR: Chitin synthesis regulation, resistance to Co	95.6
PF03302		397	VSP: Giardia variant-specific surface protein; Int	94.9
PF04478		154	Mid2: Mid2 like cell wall stress sensor; InterPro:	94.6
PF05454		290	DAG1: Dystroglycan (Dystrophin-associated glycopro	94.55
PF15345		233	TMEM51: Transmembrane protein 51	94.54
PF14575		75	EphA2_TM: Ephrin type-A receptor 2 transmembrane d	94.1
PF06697		278	DUF1191: Protein of unknown function (DUF1191); In	94.07
PF01299		306	Lamp: Lysosome-associated membrane glycoprotein (L	93.86
PF07204		98	Orthoreo_P10: Orthoreovirus membrane fusion protei	93.77
PF08374		221	Protocadherin: Protocadherin; InterPro: IPR013585	93.62
PF12768		281	Rax2: Cortical protein marker for cell polarity	93.19
PF06452		185	DUF1083: Domain of unknown function (DUF1083); Int	93.09
PF06679		163	DUF1180: Protein of unknown function (DUF1180); In	92.77
PF12877		684	DUF3827: Domain of unknown function (DUF3827); Int	92.72
PF01034		64	Syndecan: Syndecan domain; InterPro: IPR001050 The	92.52
PF13908		179	Shisa: Wnt and FGF inhibitory regulator	92.03
cd00005		186	CBM9 Family 9 carbohydrate-binding module (CBM), p	91.92
PF02439		38	Adeno_E3_CR2: Adenovirus E3 region protein CR2; In	91.86
PF07213		79	DAP10: DAP10 membrane protein; InterPro: IPR009861	91.01
PF15069		143	FAM163: FAM163 family	89.79
PF05568		189	ASFV_J13L: African swine fever virus J13L protein;	89.22
PF10873		155	DUF2668: Protein of unknown function (DUF2668); In	89.08
PF15102		146	TMEM154: TMEM154 protein family	88.68
PF02480		439	Herpes_gE: Alphaherpesvirus glycoprotein E; InterP	86.34
COG3889		872	Predicted solute binding protein [General function	86.17
PF14610		189	DUF4448: Protein of unknown function (DUF4448)	86.04
PHA03265		402	envelope glycoprotein D; Provisional	85.71
PF06365		202	CD34_antigen: CD34/Podocalyxin family; InterPro: I	85.56
PF15065		350	NCU-G1: Lysosomal transcription factor, NCU-G1	84.98
TIGR01478		295	STEVOR variant surface antigen, stevor family. Thi	83.58
PF02009		299	Rifin_STEVOR: Rifin/stevor family; InterPro: IPR00	83.58
PF05393		94	Hum_adeno_E3A: Human adenovirus early E3A glycopro	83.49
PLN03150		623	hypothetical protein; Provisional	83.05
PF12191		129	stn_TNFRSF12A: Tumour necrosis factor receptor stn	82.89
PF01034		64	Syndecan: Syndecan domain; InterPro: IPR001050 The	82.86
PF05808		162	Podoplanin: Podoplanin; InterPro: IPR008783 This f	82.5
PF05083		74	LST1: LST-1 protein; InterPro: IPR007775 B144/LST1	82.13
PF05454		290	DAG1: Dystroglycan (Dystrophin-associated glycopro	81.61
PTZ00370		296	STEVOR; Provisional	80.7

>PF09459 EB_dh: Ethylbenzene dehydrogenase; InterPro: IPR019020 This entry represents a haem-binding domain found in cytochromes b558/566 (subunit A), c-551 and c-552, as well as in members of the type-II members of the microbial dimethyl sulphoxide (DMSO) reductase family	Back alignment and domain information

Probab=99.64  E-value=5.3e-17  Score=149.46  Aligned_cols=127  Identities=32%  Similarity=0.585  Sum_probs=60.5

Q ss_pred             CCcccccCccccccccCCCC--CCCCCCCCeeEEEEeecCccEEEEEEecC---cee------eec--CCcccCCceeee
Q 020804          155 DDWEDIDGSEFSLLPALDPH--AEHEYKGGKMNVKALHDGHDVYFLLQVDG---EYV------YSK--GENTRCPSIALM  221 (321)
Q Consensus       155 ~dw~~i~~~~~~l~pal~~d--~~~~y~~g~~~vk~~hdg~~~ffll~v~g---~y~------~~~--~~~~~c~svalm  221 (321)
                      .+|++|+..+++|.|.+++.  +..++....|+|||+|||++||||||.+.   ++.      |.+  ....-+..+|+|
T Consensus         2 ~~W~~~p~~~v~L~pg~~~~p~~~~~~~~~~v~VkAa~dg~~Iyfll~W~d~t~~~~~~p~~~~~~~~~~~~yeDk~Avm   81 (261)
T PF09459_consen    2 PDWSKAPPVEVPLYPGQSSYPEPPPKGGTIPVEVKAAHDGENIYFLLEWPDPTRSYERHPDGGWVQAGEDDYYEDKVAVM   81 (261)
T ss_dssp             HHHHTS-EEEEE-EE--GGG----T-----EEEEEEEE-SSEEEEEEEEE-----S------------STT----EEEEE
T ss_pred             chhccCCCeEEEECCCccCCccccCCCCcEEEEEEEEECCCeEEEEEEecCCCCCccccccccccccCCCCcCcceEEEE
Confidence            58999999999999996543  44588889999999999999999999988   233      223  567788999999


Q ss_pred             eeecCCceeeecCCCCCCCCcccccccCCceeeEEEEEeccccCceecCCCCCCCCCCCCCCccccceeeeecCC-----
Q 020804          222 FQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTP-----  296 (321)
Q Consensus       222 f~~g~~a~y~nmggc~~~~~~c~~~~c~g~evdi~hf~i~~aipgr~yg~n~~d~~~g~g~d~~g~l~d~yawnp-----  296 (321)
                      |.+| ++.++.+.||        ..+|...+.|+.|+.+++   ||.|-+..            |+++|++.|++     
T Consensus        82 f~~g-~v~~~~~~Gc--------~~~ch~~~~~~p~~~~~~---~~ky~~~~------------g~~vdlW~Wka~r~~~  137 (261)
T PF09459_consen   82 FSDG-DVPYFGQDGC--------WHTCHKPLRDMPAAPIGR---GRKYMGDS------------GEPVDLWHWKASRSGM  137 (261)
T ss_dssp             E----------------------------ESST--T--GG-------GT-BT------------TB-EEEEEEET-----
T ss_pred             eeec-cccccccccc--------cccccCCcccccCCCccc---ceeeeCCC------------CeEEEEEEeccccccc
Confidence            9999 8888865554        678999999999998887   88898874            99999999999     


Q ss_pred             --------cccccCCCC
Q 020804          297 --------HCRYLDGMG  305 (321)
Q Consensus       297 --------hcr~~dg~~  305 (321)
                              +|||.+|.|
T Consensus       138 ~~d~~~~~~r~~~~G~g  154 (261)
T PF09459_consen  138 ADDGYVFGKRRYDAGYG  154 (261)
T ss_dssp             -----------------
T ss_pred             ccccccccccccccccc
Confidence                    899999998

The DMSO reductase family is a large and rapidly expanding group of enzymes found in bacteria and archaea that share a common form of molybdenum cofactor known as bis(molybdopterin guanine dinucleotide)Mo []. In addition to the molybdopterin subunit, these enzymes also contain an iron-sulphur subunit. These include two distinct but very closely related periplasmic proteins of anaerobic respiration: selenate reductase and chlorate reductase []. Other proteins containing this subunit include dimethyl sulphide dehydrogenase and ethylbenzene dehydrogenase [, , ]. One member of the DMSO reductase family is eythylbenzene dehydrogenase, which is a heterotrimer of three subunits that catalyses the anaerobic degradation of hydrocarbons (alpha, beta and gamma subunits). This entry matches the gamma subunit, whose structure is known []. The alpha subunit contains the catalytic centre as a Molybdenum cofactor-complex. This removes an electron-pair from the hydrocarbon and passes it along an electron transport system involving iron-sulphur complexes held in the beta subunit and a Haem b molecule contained in the gamma subunit. The electron-pair is then subsequently passed to an as yet unknown receiver. The enzyme is found in a variety of different bacteria.; GO: 0020037 heme binding; PDB: 2IVF_C.

>TIGR03477 DMSO_red_II_gam DMSO reductase family type II enzyme, heme b subunit	Back alignment and domain information

>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region	Back alignment and domain information

>PTZ00382 Variant-specific surface protein (VSP); Provisional	Back alignment and domain information

>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane	Back alignment and domain information

>PF15102 TMEM154: TMEM154 protein family	Back alignment and domain information

>PF13908 Shisa: Wnt and FGF inhibitory regulator	Back alignment and domain information

>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein	Back alignment and domain information

>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]	Back alignment and domain information

>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls	Back alignment and domain information

>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein)	Back alignment and domain information

>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region	Back alignment and domain information

>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5	Back alignment and domain information

>PF15345 TMEM51: Transmembrane protein 51	Back alignment and domain information

>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A	Back alignment and domain information

>PF06697 DUF1191: Protein of unknown function (DUF1191); InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function	Back alignment and domain information

>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear	Back alignment and domain information

>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences	Back alignment and domain information

>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains	Back alignment and domain information

>PF12768 Rax2: Cortical protein marker for cell polarity	Back alignment and domain information

>PF06452 DUF1083: Domain of unknown function (DUF1083); InterPro: IPR010502 This entry represents the family 9 carbohydrate-binding module (CBD9), which exhibit an immunoglobulin-like beta-sandwich fold, with an additional beta-strand at the N terminus []	Back alignment and domain information

>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins	Back alignment and domain information

>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]	Back alignment and domain information

>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]	Back alignment and domain information

>PF13908 Shisa: Wnt and FGF inhibitory regulator	Back alignment and domain information

>cd00005 CBM9 Family 9 carbohydrate-binding module (CBM), plays a role in microbial degradation of cellulose and hemicellulose found in plants; previously called cellulose-binding domain; the binding sites of the CBMs for which structures have been determined are of two general types: flat surfaces comprising predominantly aromatic residues tryptophan and tyrosine and extended shallow grooves; this domain frequently occurs in tandem

Back alignment and domain information

>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []	Back alignment and domain information

>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins	Back alignment and domain information

>PF15069 FAM163: FAM163 family	Back alignment and domain information

>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ]	Back alignment and domain information

>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []	Back alignment and domain information

>PF15102 TMEM154: TMEM154 protein family	Back alignment and domain information

>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein	Back alignment and domain information

>COG3889 Predicted solute binding protein [General function prediction only]	Back alignment and domain information

>PF14610 DUF4448: Protein of unknown function (DUF4448)	Back alignment and domain information

>PHA03265 envelope glycoprotein D; Provisional	Back alignment and domain information

>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins	Back alignment and domain information

>PF15065 NCU-G1: Lysosomal transcription factor, NCU-G1	Back alignment and domain information

>TIGR01478 STEVOR variant surface antigen, stevor family	Back alignment and domain information

>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world	Back alignment and domain information

>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2	Back alignment and domain information

>PLN03150 hypothetical protein; Provisional	Back alignment and domain information

>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins	Back alignment and domain information

>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]	Back alignment and domain information

>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes []	Back alignment and domain information

>PF05083 LST1: LST-1 protein; InterPro: IPR007775 B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length	Back alignment and domain information

>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5	Back alignment and domain information

>PTZ00370 STEVOR; Provisional	Back alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST

Original result of BLAST against Protein Data Bank

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

Original result of RPS-BLAST against PDB70 database

ID	Alignment Graph	Length	Definition	E-value
Query		321
2ivf_C		214	Ethylbenzene dehydrogenase gamma-subunit; anaerobi	3e-18

>2ivf_C Ethylbenzene dehydrogenase gamma-subunit; anaerobic hydrocarbon degradation, MOCO, Fe/S cluster, MO- B enzyme, DMSO reductase family; HET: MES MGD MD1 HEM; 1.88A {Aromatoleum aromaticum} Length = 214	Back alignment and structure

 Score = 81.0 bits (199), Expect = 3e-18
 Identities = 23/142 (16%), Positives = 44/142 (30%), Gaps = 24/142 (16%)

Query: 148 ITLDGHADDWEDIDGSEFSLLPA-------LDPHAEHEYKGG---KMNVKALHDGHDVYF 197
           + LD  A  W   + + F + P        + P        G   +++V ALH+G  +  
Sbjct: 12  LLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIAL 71

Query: 198 LLQ-VDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIM 256
            L+    ++      N+    +  MF +   A    MG               G  V+  
Sbjct: 72  RLKWASEKHDKIVDLNSFVDGVGAMFPVARGAQAVTMGA-------------TGRPVNAW 118

Query: 257 HFSIGSAIPGRLYGGNPVDNSE 278
           ++   +  P  +          
Sbjct: 119 YWKANANEPMEIVAEGFSAVRR 140

PyMOL of 2ivf

Structure Templates Detected by HHsearch

Original result of HHsearch against PDB70 database

ID	Alignment Graph	Length	Definition	Probability
Query		321
2ivf_C		214	Ethylbenzene dehydrogenase gamma-subunit; anaerobi	99.85
2k1k_A		38	Ephrin type-A receptor 1; EPHA1, receptor tyrosine	96.84
2ks1_B		44	Epidermal growth factor receptor; ERBB1, ERBB2, tr	96.13
1i82_A		189	Xylanase A, endo-1,4-beta-xylanase A; cellobiose c	96.09
2l2t_A		44	Receptor tyrosine-protein kinase ERBB-4; transmemb	95.47
2jwa_A		44	Receptor tyrosine-protein kinase ERBB-2; transmemb	95.29
1q55_A		880	EP-cadherin, C-cadherin; trans interaction, desmos	92.36
2k9y_A		41	Ephrin type-A receptor 2; receptor tyrosine kinase	92.26
2knc_A		54	Integrin alpha-IIB; transmembrane signaling, prote	90.17
2l8s_A		54	Integrin alpha-1; transmembrane region, detergent	88.04
2klu_A		70	T-cell surface glycoprotein CD4; cell membrane, di	86.27
2l34_A		33	TYRO protein tyrosine kinase-binding protein; immu	82.72
2k1a_A		42	Integrin alpha-IIB; single-PASS transmembrane segm	80.75

>2ivf_C Ethylbenzene dehydrogenase gamma-subunit; anaerobic hydrocarbon degradation, MOCO, Fe/S cluster, MO- B enzyme, DMSO reductase family; HET: MES MGD MD1 HEM; 1.88A {Aromatoleum aromaticum}	Back alignment and structure

Probab=99.85  E-value=3.7e-22  Score=179.82  Aligned_cols=119  Identities=21%  Similarity=0.305  Sum_probs=107.7

Q ss_pred             eeeecCCCCCcccccCcccccccc-----CCCCC-----CCCCCCCeeEEEEeecCccEEEEEEecCceeeecCC-cccC
Q 020804          147 LITLDGHADDWEDIDGSEFSLLPA-----LDPHA-----EHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGE-NTRC  215 (321)
Q Consensus       147 ~~tldg~~~dw~~i~~~~~~l~pa-----l~~d~-----~~~y~~g~~~vk~~hdg~~~ffll~v~g~y~~~~~~-~~~c  215 (321)
                      .|++|+++.+|++++..+++|.|+     +|+++     ..++...+|+|||+|||++|||||+.+.+|++...+ ...|
T Consensus        11 ~~~~d~~~~~W~~ap~~~v~L~~~~~~~p~~~~~~~~~~~~~~~~~~v~VkAa~dg~~i~f~l~W~D~t~~~~~~~~~f~   90 (214)
T 2ivf_C           11 ELLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIALRLKWASEKHDKIVDLNSFV   90 (214)
T ss_dssp             HHHTCTTCHHHHTSCEEEEECEECCGGGGTTTCTTGGGCCSCCCCCEEEEEEEECSSEEEEEEEEECCCCCSCCSTTCCC
T ss_pred             cccCCCChHHHhcCCceEEEccCCccccccccccccccccCCCCceEEEEEEEECCCeEEEEEEECCCCCCccccccccC
Confidence            568999999999999999999999     88887     788999999999999999999999999999998777 7779


Q ss_pred             CceeeeeeecCCceeeecCCCCCCCCcccccccCCceeeEEEEEeccccCceecCCCCCCCCCCCCCCcccccee
Q 020804          216 PSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVD  290 (321)
Q Consensus       216 ~svalmf~~g~~a~y~nmggc~~~~~~c~~~~c~g~evdi~hf~i~~aipgr~yg~n~~d~~~g~g~d~~g~l~d  290 (321)
                      ..||+||++|++++|+.||.             .|++|||+||..+.         |.++|..++|   ||++.+
T Consensus        91 D~vAvmfp~~~~~~~~~MG~-------------~~~~vdiw~Wka~~---------~~~~~~~a~G---fgs~~~  140 (214)
T 2ivf_C           91 DGVGAMFPVARGAQAVTMGA-------------TGRPVNAWYWKANA---------NEPMEIVAEG---FSAVRR  140 (214)
T ss_dssp             CEEEEEEESSTTCCGGGTCB-------------TTBCEEEEEEETTC---------SSCEEEEESS---TTSEEE
T ss_pred             ceEEEEeEcCCCCcccccCC-------------CCcEEEEEEEecCC---------CcceeeccCC---cccccc
Confidence            99999999999999999994             67999999999875         3577777766   999887

PyMOL of 2ivf

>2k1k_A Ephrin type-A receptor 1; EPHA1, receptor tyrosine kinase, dimeric transmembrane domain, ATP-binding, glycoprotein, nucleotide-binding; NMR {Homo sapiens} PDB: 2k1l_A	Back alignment and structure

>2ks1_B Epidermal growth factor receptor; ERBB1, ERBB2, transmembrane, heterodimer, complex, tyrosine receptor, bicelles, transferase; NMR {Homo sapiens}	Back alignment and structure

>1i82_A Xylanase A, endo-1,4-beta-xylanase A; cellobiose complex, hydrolase; HET: BGC; 1.90A {Thermotoga maritima} SCOP: b.1.9.2 PDB: 1i8a_A* 1i8u_A	Back alignment and structure

>2l2t_A Receptor tyrosine-protein kinase ERBB-4; transmembrane dimer, membrane domain, membrane protei; NMR {Homo sapiens}	Back alignment and structure

>2jwa_A Receptor tyrosine-protein kinase ERBB-2; transmembrane helix dimer, protein kinase receptor membrane domain, ATP-binding, glycoprotein; NMR {Homo sapiens} PDB: 2ks1_A	Back alignment and structure

>1q55_A EP-cadherin, C-cadherin; trans interaction, desmosome, junction, adhesion, structural protein; HET: NAG NDG; 30.00A {Mus musculus} SCOP: i.20.1.1 PDB: 1q5a_A* 1q5b_A* 1q5c_A*	Back alignment and structure

>2k9y_A Ephrin type-A receptor 2; receptor tyrosine kinase, membrane protein, dimeric transmembrane domain, ephrin receptor, ATP-binding, glycoprotein; NMR {Homo sapiens}	Back alignment and structure

>2knc_A Integrin alpha-IIB; transmembrane signaling, protein structure, cell A cleavage on PAIR of basic residues, disease mutation, disul bond, glycoprotein; NMR {Homo sapiens}	Back alignment and structure

>2l8s_A Integrin alpha-1; transmembrane region, detergent micelle, CE adhesion; NMR {Homo sapiens}	Back alignment and structure

>2klu_A T-cell surface glycoprotein CD4; cell membrane, disulfide bond, HOST- virus interaction, immune response, immunoglobulin domain, lipoprotein; NMR {Homo sapiens}	Back alignment and structure

>2l34_A TYRO protein tyrosine kinase-binding protein; immunoreceptor, transmembrane assembly, DAP12, protein bindi; NMR {Homo sapiens} PDB: 2l35_B	Back alignment and structure

>2k1a_A Integrin alpha-IIB; single-PASS transmembrane segment, alternative splicing, calcium, cell adhesion, cleavage on PAIR of basic residues; NMR {Homo sapiens} PDB: 2k9j_A	Back alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

Original result of RPS-BLAST against SCOP70(version1.75) database

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch

Original result of HHsearch against SCOP70(version1.75) database

ID	Alignment Graph	Length	Definition	Probability
Query		321
d1i8aa_		189	Xylanase 10A {Thermotoga maritima [TaxId: 2336]}	95.68

>d1i8aa_ b.1.9.2 (A:) Xylanase 10A {Thermotoga maritima [TaxId: 2336]}	Back information, alignment and structure

class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: CBD9-like
family: Family 9 carbohydrate-binding module, CBD9
domain: Xylanase 10A
species: Thermotoga maritima [TaxId: 2336]

Probab=95.68  E-value=0.0059  Score=48.85  Aligned_cols=76  Identities=18%  Similarity=0.289  Sum_probs=49.9

Q ss_pred             eCceeeecCCCCC-cccccCccccccccCCCCCCCCCCCCeeEEEEeecCccEEEEEEecCceeeecC-CcccCCceeee
Q 020804          144 KPGLITLDGHADD-WEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKG-ENTRCPSIALM  221 (321)
Q Consensus       144 ~pg~~tldg~~~d-w~~i~~~~~~l~pal~~d~~~~y~~g~~~vk~~hdg~~~ffll~v~g~y~~~~~-~~~~c~svalm  221 (321)
                      +=|.|||||..++ |+..+-..  ++-...+.....   -+-+||+++|.+.+||+++|--+...... .--..-+|.++
T Consensus         6 ~~g~p~IDG~lde~W~~a~~~~--~~~~~~~~~~~~---~~t~v~~~~D~~~LYv~~~~~D~~~~~~~~~~~~~D~vei~   80 (189)
T d1i8aa_           6 KYGTPVIDGEIDEIWNTTEEIE--TKAVAMGSLDKN---ATAKVRVLWDENYLYVLAIVKDPVLNKDNSNPWEQDSVEIF   80 (189)
T ss_dssp             EECCCCSSSSCCGGGGGSCEEE--CCEEEESCTTTS---CEEEEEEEECSSEEEEEEEEECSSCCCCSSSGGGSSEEEEE
T ss_pred             ccCCCEECccCChHHhcCcccc--cceeccCCCCCC---CcEEEEEEEecCeEEEEEEEEcCCcccccCCccCCCeEEEE
Confidence            3499999999999 99876543  333333333322   36799999999999999999765432211 11123567777


Q ss_pred             eee
Q 020804          222 FQI  224 (321)
Q Consensus       222 f~~  224 (321)
                      |.-
T Consensus        81 id~   83 (189)
T d1i8aa_          81 IDE   83 (189)
T ss_dssp             EES
T ss_pred             EcC
Confidence            754

PyMOL of d1i8aa_