Citrus Sinensis ID: 019150


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-----
MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAPLASSPPMKAPTHRASPSLPSSTSNKGKHSNLILLFGIGTGLLITAIISVLIICSCAFRRRNSKASPKETAKPRLLLLLFVLSTVSIGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK
ccccccccccccHHHHHHHHcccccEEEEEccEEEccEEEEEEEEEccccccccccccccccccccccccccccccccccccccccEEEHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccHHHHHHHHHHEEEEccccccccccccccccccEEEEEEEEcEEEEEccccccccccccccccccccccccccccccccEEEEEEEEccccEEEEEEEcccEEEEcccccccccEEEEEEEccccEEEEccccccccccccccccccEEEEEEEEEEccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccccccc
ccccccccEEEcHHHHHHHHHHHHccEEEEcccEEccEEEEEEEccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccEEEEEEEEccccccccccccccccEEcccccEEEEEEEEcccEEEEccccccHHccccccccccccccccccccccccEEEEEEEEcccEEEEEEEEcccEEEEccccccccEEEEEEEcccccEEEccccccccccccccccccccEEEEEEEEEcccccccEcccccccccccccccccccccEEEccccccccccccccccccccccHHEEccc
mditphtgisfsasdAAAINFSLtthkvhfdstlvgdykllnftwfeppapsqaplassppmkapthraspslpsstsnkgkhsNLILLFGIGTGLLITAIISVLIICSCafrrrnskaspketakPRLLLLLFVLSTVSigwvnsheesgkwscesdSEIRVLAEfkpglitldghaddwedidgsefsllpaldphaeheykggkmnvkalhdghDVYFLLQVdgeyvyskgentrcpsIALMFQIgedatyhnmggckegigsctsktckghevdimhfsigsaipgrlyggnpvdnsegnggdrfgHLVDVYawtphcryldgmgpsgiklnhfrsfishk
mditphtgisfsasDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAPLASSPPMKAPTHRASPSLPSSTSNKGKHSNLILLFGIGTGLLITAIISVLIICSCAFrrrnskaspketakprlLLLLFVLSTVSIGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK
MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEppapsqaplassppMKAPTHRASPSLPSSTSNKGKHSNlillfgigtgllitaiiSVLIICSCAFRRRNSKASPKETAKPRlllllFVLSTVSIGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK
*************SDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFE*************************************NLILLFGIGTGLLITAIISVLIICSCAFRR************PRLLLLLFVLSTVSIGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVD****NGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSF****
*DITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPS***********************************LLFGIGTGLLITAIISVLIICSC*********************LLFVLSTVSIGWVNSH************EIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHF*SFI***
MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAP*****************************GKHSNLILLFGIGTGLLITAIISVLIICSCAFRRR********TAKPRLLLLLFVLSTVSIGWVN************DSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK
MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAP***************************HSNLILLFGIGTGLLITAIISVLIICSCAFRRRNSKASPKETAKPRLLLLLFVLSTVSIGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISH*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDITPHTGISFSASDAAAINFSLTTHKVHFDSTLVGDYKLLNFTWFEPPAPSQAPLASSPPMKAPTHRASPSLPSSTSNKGKHSNLILLFGIGTGLLITAIISVLIICSCAFRRRNSKASPKETAKPRLLLLLFVLSTVSIGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLDGMGPSGIKLNHFRSFISHK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query345
224128266362 predicted protein [Populus trichocarpa] 0.556 0.530 0.786 1e-90
297817576361 hypothetical protein ARALYDRAFT_907811 [ 0.571 0.545 0.747 2e-90
255548411364 conserved hypothetical protein [Ricinus 0.556 0.527 0.786 3e-90
15228739361 heme binding protein [Arabidopsis thalia 0.571 0.545 0.752 3e-90
359487547362 PREDICTED: uncharacterized protein LOC10 0.582 0.555 0.736 2e-89
356499833397 PREDICTED: uncharacterized protein LOC10 0.547 0.476 0.793 5e-89
147819855362 hypothetical protein VITISV_023420 [Viti 0.582 0.555 0.731 6e-89
356534742365 PREDICTED: uncharacterized protein LOC10 0.547 0.517 0.777 8e-86
449436413375 PREDICTED: uncharacterized protein LOC10 0.542 0.498 0.777 3e-85
357442211365 hypothetical protein MTR_1g086810 [Medic 0.547 0.517 0.767 4e-85
>gi|224128266|ref|XP_002320284.1| predicted protein [Populus trichocarpa] gi|222861057|gb|EEE98599.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 151/192 (78%), Positives = 172/192 (89%)

Query: 141 IGWVNSHEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAE 200
           IGWVNSH+ESG+WSCESD EIR+ AEFKPG ITLDGHADDW+DIDG + SLLPALDP  +
Sbjct: 17  IGWVNSHQESGEWSCESDEEIRIEAEFKPGFITLDGHADDWKDIDGLDSSLLPALDPDDD 76

Query: 201 HEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGC 260
            +Y GGKM VKALHDG+D++FLLQVDG Y Y+KG+N +CPS+ALMF IG++ATYHNMGGC
Sbjct: 77  KKYTGGKMTVKALHDGNDMFFLLQVDGNYAYTKGDNKKCPSVALMFPIGDEATYHNMGGC 136

Query: 261 KEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTP 320
           KEG G+C  KTCKGHEVDIMHFSIG+AIPGRLYGGNP+DN EGNGGDRFGHLVD+Y+W P
Sbjct: 137 KEGTGTCNRKTCKGHEVDIMHFSIGNAIPGRLYGGNPLDNGEGNGGDRFGHLVDLYSWNP 196

Query: 321 HCRYLDGMGPSG 332
           HCRYLDG GPSG
Sbjct: 197 HCRYLDGTGPSG 208




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297817576|ref|XP_002876671.1| hypothetical protein ARALYDRAFT_907811 [Arabidopsis lyrata subsp. lyrata] gi|297322509|gb|EFH52930.1| hypothetical protein ARALYDRAFT_907811 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|255548411|ref|XP_002515262.1| conserved hypothetical protein [Ricinus communis] gi|223545742|gb|EEF47246.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|15228739|ref|NP_191796.1| heme binding protein [Arabidopsis thaliana] gi|7340708|emb|CAB82951.1| putative protein [Arabidopsis thaliana] gi|19423876|gb|AAL87316.1| unknown protein [Arabidopsis thaliana] gi|22136956|gb|AAM91707.1| unknown protein [Arabidopsis thaliana] gi|332646823|gb|AEE80344.1| heme binding protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|359487547|ref|XP_002277687.2| PREDICTED: uncharacterized protein LOC100244357 [Vitis vinifera] gi|296089782|emb|CBI39601.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356499833|ref|XP_003518741.1| PREDICTED: uncharacterized protein LOC100786799 [Glycine max] Back     alignment and taxonomy information
>gi|147819855|emb|CAN71815.1| hypothetical protein VITISV_023420 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356534742|ref|XP_003535911.1| PREDICTED: uncharacterized protein LOC100798285 [Glycine max] Back     alignment and taxonomy information
>gi|449436413|ref|XP_004135987.1| PREDICTED: uncharacterized protein LOC101219938 [Cucumis sativus] Back     alignment and taxonomy information
>gi|357442211|ref|XP_003591383.1| hypothetical protein MTR_1g086810 [Medicago truncatula] gi|355480431|gb|AES61634.1| hypothetical protein MTR_1g086810 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query345
TAIR|locus:2096064 361 AT3G62370 [Arabidopsis thalian 0.539 0.515 0.790 1.8e-87
TAIR|locus:2132168 725 AT4G02010 [Arabidopsis thalian 0.371 0.176 0.429 2.8e-22
TAIR|locus:2096064 AT3G62370 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
 Identities = 147/186 (79%), Positives = 167/186 (89%)

Query:   147 HEESGKWSCESDSEIRVLAEFKPGLITLDGHADDWEDIDGSEFSLLPALDPHAEHEYKGG 206
             H+ESG+WSCESDSEI+VLA+F+PG+ITLDGH DDW+DIDGSEF L PALDP ++HEY  G
Sbjct:    21 HQESGEWSCESDSEIQVLADFRPGIITLDGHNDDWKDIDGSEFPLRPALDPDSDHEYDAG 80

Query:   207 KMNVKALHDGHDVYFLLQVDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGS 266
             KM VKALHDG D+YFLL++DG Y Y KGEN +CPS+ALMFQIG+ ATYHNMGGCKEG  S
Sbjct:    81 KMTVKALHDGRDIYFLLEIDGNYAYDKGENNKCPSVALMFQIGDQATYHNMGGCKEGTDS 140

Query:   267 CTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTPHCRYLD 326
             CTSK C+G EVDIMHFSIG+AIPGRLYGGNP+DN EGNGGDRFGHLVD+YAW PHCRYLD
Sbjct:   141 CTSKACRGFEVDIMHFSIGNAIPGRLYGGNPIDNGEGNGGDRFGHLVDIYAWNPHCRYLD 200

Query:   327 GMGPSG 332
             G+GPSG
Sbjct:   201 GLGPSG 206




GO:0020037 "heme binding" evidence=IEA
GO:0005794 "Golgi apparatus" evidence=IDA
GO:0005768 "endosome" evidence=IDA
GO:0005802 "trans-Golgi network" evidence=IDA
TAIR|locus:2132168 AT4G02010 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00140641
hypothetical protein (363 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query345
pfam09459180 pfam09459, EB_dh, Ethylbenzene dehydrogenase 2e-20
smart00887209 smart00887, EB_dh, Ethylbenzene dehydrogenase 7e-16
cd00241158 cd00241, DOMON_like, Domon-like ligand-binding dom 9e-05
>gnl|CDD|220250 pfam09459, EB_dh, Ethylbenzene dehydrogenase Back     alignment and domain information
 Score = 86.7 bits (215), Expect = 2e-20
 Identities = 33/146 (22%), Positives = 48/146 (32%), Gaps = 30/146 (20%)

Query: 178 ADDWEDIDGSEFSLLPALDPHAEHEYKGGKMN--VKALHDGHDVYFLLQVDGE---YVYS 232
           A DW      E  L P  + + E + KG      VKA +DG ++YF L            
Sbjct: 1   APDWSKAPPVEIPLYPGPNVYPEPDPKGATKPVTVKAAYDGENIYFRLSWKDPTRSLEKQ 60

Query: 233 KGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRL 292
             ++     +A+MF  G+       G                       F  G+A  GR 
Sbjct: 61  GEDDYYEDKVAVMFPDGKVTPAAGAGCWLS-------------CHKDARFPAGAAGRGRK 107

Query: 293 YGGNPVDNSEGNGGDRFGHLVDVYAW 318
           Y G+             G  VD++ W
Sbjct: 108 YMGDS------------GQPVDLWHW 121


Eythylbenzene dehydrogenase is a heterotrimer of three subunits that catalyzes the anaerobic degradation of hydrocarbons. The alpha subunit contains the catalytic centre as a Molybdenum cofactor-complex. This removes an electron-pair from the hydrocarbon and passes it along an electron transport system involving iron-sulphur complexes held in the beta subunit and a Haem b molecule contained in the gamma subunit. The electron-pair is then subsequently passed to an as yet unknown receiver. The enzyme is found in a variety of different bacteria. Length = 180

>gnl|CDD|214885 smart00887, EB_dh, Ethylbenzene dehydrogenase Back     alignment and domain information
>gnl|CDD|187675 cd00241, DOMON_like, Domon-like ligand-binding domains Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 345
PF09459261 EB_dh: Ethylbenzene dehydrogenase; InterPro: IPR01 99.62
TIGR03477205 DMSO_red_II_gam DMSO reductase family type II enzy 98.25
PF04478154 Mid2: Mid2 like cell wall stress sensor; InterPro: 97.35
PF01102122 Glycophorin_A: Glycophorin A; InterPro: IPR001195 96.93
PTZ0038296 Variant-specific surface protein (VSP); Provisiona 96.73
PF0869340 SKG6: Transmembrane alpha-helix domain; InterPro: 95.55
PF02480439 Herpes_gE: Alphaherpesvirus glycoprotein E; InterP 94.34
PF03302397 VSP: Giardia variant-specific surface protein; Int 94.22
PF13908179 Shisa: Wnt and FGF inhibitory regulator 93.79
PF12273130 RCR: Chitin synthesis regulation, resistance to Co 93.63
PF06697278 DUF1191: Protein of unknown function (DUF1191); In 93.48
PF05454290 DAG1: Dystroglycan (Dystrophin-associated glycopro 93.41
PF15102146 TMEM154: TMEM154 protein family 93.14
PF06452185 DUF1083: Domain of unknown function (DUF1083); Int 91.58
PF04478154 Mid2: Mid2 like cell wall stress sensor; InterPro: 91.37
PF01299306 Lamp: Lysosome-associated membrane glycoprotein (L 91.0
PF10873155 DUF2668: Protein of unknown function (DUF2668); In 90.66
PF15069143 FAM163: FAM163 family 90.61
PF0721379 DAP10: DAP10 membrane protein; InterPro: IPR009861 90.43
PF02480439 Herpes_gE: Alphaherpesvirus glycoprotein E; InterP 90.35
PF12768281 Rax2: Cortical protein marker for cell polarity 90.31
PF1457575 EphA2_TM: Ephrin type-A receptor 2 transmembrane d 90.29
PF08374221 Protocadherin: Protocadherin; InterPro: IPR013585 90.2
PF05808162 Podoplanin: Podoplanin; InterPro: IPR008783 This f 90.09
PF0720498 Orthoreo_P10: Orthoreovirus membrane fusion protei 89.15
PF15345233 TMEM51: Transmembrane protein 51 88.78
PF0243938 Adeno_E3_CR2: Adenovirus E3 region protein CR2; In 88.17
cd00005186 CBM9 Family 9 carbohydrate-binding module (CBM), p 87.89
PF0103464 Syndecan: Syndecan domain; InterPro: IPR001050 The 87.47
PF0539394 Hum_adeno_E3A: Human adenovirus early E3A glycopro 86.74
COG3889872 Predicted solute binding protein [General function 86.51
PF02009299 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR00 83.09
PF13908179 Shisa: Wnt and FGF inhibitory regulator 82.53
PF05454290 DAG1: Dystroglycan (Dystrophin-associated glycopro 81.83
PF12877 684 DUF3827: Domain of unknown function (DUF3827); Int 81.51
PF0869340 SKG6: Transmembrane alpha-helix domain; InterPro: 80.84
PHA03265402 envelope glycoprotein D; Provisional 80.83
>PF09459 EB_dh: Ethylbenzene dehydrogenase; InterPro: IPR019020 This entry represents a haem-binding domain found in cytochromes b558/566 (subunit A), c-551 and c-552, as well as in members of the type-II members of the microbial dimethyl sulphoxide (DMSO) reductase family Back     alignment and domain information
Probab=99.62  E-value=1.2e-16  Score=150.12  Aligned_cols=128  Identities=32%  Similarity=0.580  Sum_probs=60.6

Q ss_pred             ccccccccCccccccccCCCC--CCCCCCCCeeEEEEeecCccEEEEEEecC---cee------eec--CCcccCCceee
Q 019150          178 ADDWEDIDGSEFSLLPALDPH--AEHEYKGGKMNVKALHDGHDVYFLLQVDG---EYV------YSK--GENTRCPSIAL  244 (345)
Q Consensus       178 ~~dwk~V~G~~~~~~~al~~~--~~~~y~~g~~~vk~~~d~~~~~f~~~v~g---~y~------~~~--~~~~~c~~~~~  244 (345)
                      +.+|++|+..+++|.|.+++.  +..++....|+|||+|||++||||||.+.   ++.      |.+  ....-|..+|+
T Consensus         1 ~~~W~~~p~~~v~L~pg~~~~p~~~~~~~~~~v~VkAa~dg~~Iyfll~W~d~t~~~~~~p~~~~~~~~~~~~yeDk~Av   80 (261)
T PF09459_consen    1 DPDWSKAPPVEVPLYPGQSSYPEPPPKGGTIPVEVKAAHDGENIYFLLEWPDPTRSYERHPDGGWVQAGEDDYYEDKVAV   80 (261)
T ss_dssp             -HHHHTS-EEEEE-EE--GGG----T-----EEEEEEEE-SSEEEEEEEEE-----S------------STT----EEEE
T ss_pred             CchhccCCCeEEEECCCccCCccccCCCCcEEEEEEEEECCCeEEEEEEecCCCCCccccccccccccCCCCcCcceEEE
Confidence            368999999999999996543  44588889999999999999999999988   233      223  56778899999


Q ss_pred             eeeecCCceeeecCCCCCCCCcccccccCCceeeEEEEEeccccCccccCCCCCCCCCCCCCCccccceeeeecCC----
Q 019150          245 MFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVDVYAWTP----  320 (345)
Q Consensus       245 m~~~g~~a~~~~mggc~~~~~~c~~~~c~~~~vdi~h~~~~~~~~g~~yg~n~~d~~~g~g~d~~~~~~d~y~~~p----  320 (345)
                      ||.+| ++.++.+.||        ..+|...+.|+.|+.+++   ||.|-+..            |+++|++.|++    
T Consensus        81 mf~~g-~v~~~~~~Gc--------~~~ch~~~~~~p~~~~~~---~~ky~~~~------------g~~vdlW~Wka~r~~  136 (261)
T PF09459_consen   81 MFSDG-DVPYFGQDGC--------WHTCHKPLRDMPAAPIGR---GRKYMGDS------------GEPVDLWHWKASRSG  136 (261)
T ss_dssp             EE----------------------------ESST--T--GG-------GT-BT------------TB-EEEEEEET----
T ss_pred             Eeeec-cccccccccc--------cccccCCcccccCCCccc---ceeeeCCC------------CeEEEEEEecccccc
Confidence            99999 8888866554        678999999999998887   88898874            99999999999    


Q ss_pred             ---------cccccCCCC
Q 019150          321 ---------HCRYLDGMG  329 (345)
Q Consensus       321 ---------~cr~~d~~~  329 (345)
                               +|||.+|.|
T Consensus       137 ~~~d~~~~~~r~~~~G~g  154 (261)
T PF09459_consen  137 MADDGYVFGKRRYDAGYG  154 (261)
T ss_dssp             ------------------
T ss_pred             cccccccccccccccccc
Confidence                     899999998



The DMSO reductase family is a large and rapidly expanding group of enzymes found in bacteria and archaea that share a common form of molybdenum cofactor known as bis(molybdopterin guanine dinucleotide)Mo []. In addition to the molybdopterin subunit, these enzymes also contain an iron-sulphur subunit. These include two distinct but very closely related periplasmic proteins of anaerobic respiration: selenate reductase and chlorate reductase []. Other proteins containing this subunit include dimethyl sulphide dehydrogenase and ethylbenzene dehydrogenase [, , ]. One member of the DMSO reductase family is eythylbenzene dehydrogenase, which is a heterotrimer of three subunits that catalyses the anaerobic degradation of hydrocarbons (alpha, beta and gamma subunits). This entry matches the gamma subunit, whose structure is known []. The alpha subunit contains the catalytic centre as a Molybdenum cofactor-complex. This removes an electron-pair from the hydrocarbon and passes it along an electron transport system involving iron-sulphur complexes held in the beta subunit and a Haem b molecule contained in the gamma subunit. The electron-pair is then subsequently passed to an as yet unknown receiver. The enzyme is found in a variety of different bacteria.; GO: 0020037 heme binding; PDB: 2IVF_C.

>TIGR03477 DMSO_red_II_gam DMSO reductase family type II enzyme, heme b subunit Back     alignment and domain information
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region Back     alignment and domain information
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane Back     alignment and domain information
>PTZ00382 Variant-specific surface protein (VSP); Provisional Back     alignment and domain information
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ] Back     alignment and domain information
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein Back     alignment and domain information
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein) Back     alignment and domain information
>PF13908 Shisa: Wnt and FGF inhibitory regulator Back     alignment and domain information
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls Back     alignment and domain information
>PF06697 DUF1191: Protein of unknown function (DUF1191); InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function Back     alignment and domain information
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5 Back     alignment and domain information
>PF15102 TMEM154: TMEM154 protein family Back     alignment and domain information
>PF06452 DUF1083: Domain of unknown function (DUF1083); InterPro: IPR010502 This entry represents the family 9 carbohydrate-binding module (CBD9), which exhibit an immunoglobulin-like beta-sandwich fold, with an additional beta-strand at the N terminus [] Back     alignment and domain information
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region Back     alignment and domain information
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear Back     alignment and domain information
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [] Back     alignment and domain information
>PF15069 FAM163: FAM163 family Back     alignment and domain information
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins Back     alignment and domain information
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein Back     alignment and domain information
>PF12768 Rax2: Cortical protein marker for cell polarity Back     alignment and domain information
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A Back     alignment and domain information
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains Back     alignment and domain information
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [] Back     alignment and domain information
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences Back     alignment and domain information
>PF15345 TMEM51: Transmembrane protein 51 Back     alignment and domain information
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host [] Back     alignment and domain information
>cd00005 CBM9 Family 9 carbohydrate-binding module (CBM), plays a role in microbial degradation of cellulose and hemicellulose found in plants; previously called cellulose-binding domain; the binding sites of the CBMs for which structures have been determined are of two general types: flat surfaces comprising predominantly aromatic residues tryptophan and tyrosine and extended shallow grooves; this domain frequently occurs in tandem Back     alignment and domain information
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ] Back     alignment and domain information
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2 Back     alignment and domain information
>COG3889 Predicted solute binding protein [General function prediction only] Back     alignment and domain information
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world Back     alignment and domain information
>PF13908 Shisa: Wnt and FGF inhibitory regulator Back     alignment and domain information
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5 Back     alignment and domain information
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ] Back     alignment and domain information
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ] Back     alignment and domain information
>PHA03265 envelope glycoprotein D; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query345
2ivf_C214 Ethylbenzene dehydrogenase gamma-subunit; anaerobi 4e-18
>2ivf_C Ethylbenzene dehydrogenase gamma-subunit; anaerobic hydrocarbon degradation, MOCO, Fe/S cluster, MO- B enzyme, DMSO reductase family; HET: MES MGD MD1 HEM; 1.88A {Aromatoleum aromaticum} Length = 214 Back     alignment and structure
 Score = 81.0 bits (199), Expect = 4e-18
 Identities = 23/142 (16%), Positives = 44/142 (30%), Gaps = 24/142 (16%)

Query: 172 ITLDGHADDWEDIDGSEFSLLPA-------LDPHAEHEYKGG---KMNVKALHDGHDVYF 221
           + LD  A  W   + + F + P        + P        G   +++V ALH+G  +  
Sbjct: 12  LLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIAL 71

Query: 222 LLQ-VDGEYVYSKGENTRCPSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIM 280
            L+    ++      N+    +  MF +   A    MG               G  V+  
Sbjct: 72  RLKWASEKHDKIVDLNSFVDGVGAMFPVARGAQAVTMGA-------------TGRPVNAW 118

Query: 281 HFSIGSAIPGRLYGGNPVDNSE 302
           ++   +  P  +          
Sbjct: 119 YWKANANEPMEIVAEGFSAVRR 140


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query345
2ivf_C214 Ethylbenzene dehydrogenase gamma-subunit; anaerobi 99.85
2k1k_A38 Ephrin type-A receptor 1; EPHA1, receptor tyrosine 96.54
2ks1_B44 Epidermal growth factor receptor; ERBB1, ERBB2, tr 96.16
2l2t_A44 Receptor tyrosine-protein kinase ERBB-4; transmemb 95.87
2jwa_A44 Receptor tyrosine-protein kinase ERBB-2; transmemb 95.35
1i82_A189 Xylanase A, endo-1,4-beta-xylanase A; cellobiose c 94.6
2k9y_A41 Ephrin type-A receptor 2; receptor tyrosine kinase 90.2
1q55_A880 EP-cadherin, C-cadherin; trans interaction, desmos 88.16
2klu_A70 T-cell surface glycoprotein CD4; cell membrane, di 83.31
2l34_A33 TYRO protein tyrosine kinase-binding protein; immu 81.95
>2ivf_C Ethylbenzene dehydrogenase gamma-subunit; anaerobic hydrocarbon degradation, MOCO, Fe/S cluster, MO- B enzyme, DMSO reductase family; HET: MES MGD MD1 HEM; 1.88A {Aromatoleum aromaticum} Back     alignment and structure
Probab=99.85  E-value=1.1e-21  Score=179.86  Aligned_cols=119  Identities=21%  Similarity=0.305  Sum_probs=107.8

Q ss_pred             eeecCCcccccccccCcccccccc-----CCCCC-----CCCCCCCeeEEEEeecCccEEEEEEecCceeeecCC-cccC
Q 019150          171 LITLDGHADDWEDIDGSEFSLLPA-----LDPHA-----EHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKGE-NTRC  239 (345)
Q Consensus       171 ~ItlDG~~~dwk~V~G~~~~~~~a-----l~~~~-----~~~y~~g~~~vk~~~d~~~~~f~~~v~g~y~~~~~~-~~~c  239 (345)
                      .|++|+++.+|+++++.+++|.|+     +|+++     ..++...+|+|||+|||++|||+|+.+.+|++...+ ...|
T Consensus        11 ~~~~d~~~~~W~~ap~~~v~L~~~~~~~p~~~~~~~~~~~~~~~~~~v~VkAa~dg~~i~f~l~W~D~t~~~~~~~~~f~   90 (214)
T 2ivf_C           11 ELLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIALRLKWASEKHDKIVDLNSFV   90 (214)
T ss_dssp             HHHTCTTCHHHHTSCEEEEECEECCGGGGTTTCTTGGGCCSCCCCCEEEEEEEECSSEEEEEEEEECCCCCSCCSTTCCC
T ss_pred             cccCCCChHHHhcCCceEEEccCCccccccccccccccccCCCCceEEEEEEEECCCeEEEEEEECCCCCCccccccccC
Confidence            468899999999999999999999     88887     788999999999999999999999999999998777 7779


Q ss_pred             CceeeeeeecCCceeeecCCCCCCCCcccccccCCceeeEEEEEeccccCccccCCCCCCCCCCCCCCcccccee
Q 019150          240 PSIALMFQIGEDATYHNMGGCKEGIGSCTSKTCKGHEVDIMHFSIGSAIPGRLYGGNPVDNSEGNGGDRFGHLVD  314 (345)
Q Consensus       240 ~~~~~m~~~g~~a~~~~mggc~~~~~~c~~~~c~~~~vdi~h~~~~~~~~g~~yg~n~~d~~~g~g~d~~~~~~d  314 (345)
                      ..||+||++|++++|+.||.             .||+|||+||..+.         |.++|..++|   ||++.+
T Consensus        91 D~vAvmfp~~~~~~~~~MG~-------------~~~~vdiw~Wka~~---------~~~~~~~a~G---fgs~~~  140 (214)
T 2ivf_C           91 DGVGAMFPVARGAQAVTMGA-------------TGRPVNAWYWKANA---------NEPMEIVAEG---FSAVRR  140 (214)
T ss_dssp             CEEEEEEESSTTCCGGGTCB-------------TTBCEEEEEEETTC---------SSCEEEEESS---TTSEEE
T ss_pred             ceEEEEeEcCCCCcccccCC-------------CCcEEEEEEEecCC---------CcceeeccCC---cccccc
Confidence            99999999999999999994             67999999999875         4577777766   999887



>2k1k_A Ephrin type-A receptor 1; EPHA1, receptor tyrosine kinase, dimeric transmembrane domain, ATP-binding, glycoprotein, nucleotide-binding; NMR {Homo sapiens} PDB: 2k1l_A Back     alignment and structure
>2ks1_B Epidermal growth factor receptor; ERBB1, ERBB2, transmembrane, heterodimer, complex, tyrosine receptor, bicelles, transferase; NMR {Homo sapiens} Back     alignment and structure
>2l2t_A Receptor tyrosine-protein kinase ERBB-4; transmembrane dimer, membrane domain, membrane protei; NMR {Homo sapiens} Back     alignment and structure
>2jwa_A Receptor tyrosine-protein kinase ERBB-2; transmembrane helix dimer, protein kinase receptor membrane domain, ATP-binding, glycoprotein; NMR {Homo sapiens} PDB: 2ks1_A Back     alignment and structure
>1i82_A Xylanase A, endo-1,4-beta-xylanase A; cellobiose complex, hydrolase; HET: BGC; 1.90A {Thermotoga maritima} SCOP: b.1.9.2 PDB: 1i8a_A* 1i8u_A Back     alignment and structure
>2k9y_A Ephrin type-A receptor 2; receptor tyrosine kinase, membrane protein, dimeric transmembrane domain, ephrin receptor, ATP-binding, glycoprotein; NMR {Homo sapiens} Back     alignment and structure
>1q55_A EP-cadherin, C-cadherin; trans interaction, desmosome, junction, adhesion, structural protein; HET: NAG NDG; 30.00A {Mus musculus} SCOP: i.20.1.1 PDB: 1q5a_A* 1q5b_A* 1q5c_A* Back     alignment and structure
>2klu_A T-cell surface glycoprotein CD4; cell membrane, disulfide bond, HOST- virus interaction, immune response, immunoglobulin domain, lipoprotein; NMR {Homo sapiens} Back     alignment and structure
>2l34_A TYRO protein tyrosine kinase-binding protein; immunoreceptor, transmembrane assembly, DAP12, protein bindi; NMR {Homo sapiens} PDB: 2l35_B Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query345
d1i8aa_189 Xylanase 10A {Thermotoga maritima [TaxId: 2336]} 94.67
>d1i8aa_ b.1.9.2 (A:) Xylanase 10A {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: CBD9-like
family: Family 9 carbohydrate-binding module, CBD9
domain: Xylanase 10A
species: Thermotoga maritima [TaxId: 2336]
Probab=94.67  E-value=0.021  Score=46.61  Aligned_cols=79  Identities=19%  Similarity=0.278  Sum_probs=51.4

Q ss_pred             eeeecceeecCCcccc-cccccCccccccccCCCCCCCCCCCCeeEEEEeecCccEEEEEEecCceeeecC-CcccCCce
Q 019150          165 AEFKPGLITLDGHADD-WEDIDGSEFSLLPALDPHAEHEYKGGKMNVKALHDGHDVYFLLQVDGEYVYSKG-ENTRCPSI  242 (345)
Q Consensus       165 aef~PG~ItlDG~~~d-wk~V~G~~~~~~~al~~~~~~~y~~g~~~vk~~~d~~~~~f~~~v~g~y~~~~~-~~~~c~~~  242 (345)
                      |..+-|.+++||..++ |+..+-..  ++-..++.....   -+-+||+++|.+.+||+++|--+...... .--..-+|
T Consensus         3 a~~~~g~p~IDG~lde~W~~a~~~~--~~~~~~~~~~~~---~~t~v~~~~D~~~LYv~~~~~D~~~~~~~~~~~~~D~v   77 (189)
T d1i8aa_           3 ATAKYGTPVIDGEIDEIWNTTEEIE--TKAVAMGSLDKN---ATAKVRVLWDENYLYVLAIVKDPVLNKDNSNPWEQDSV   77 (189)
T ss_dssp             EEEEECCCCSSSSCCGGGGGSCEEE--CCEEEESCTTTS---CEEEEEEEECSSEEEEEEEEECSSCCCCSSSGGGSSEE
T ss_pred             eecccCCCEECccCChHHhcCcccc--cceeccCCCCCC---CcEEEEEEEecCeEEEEEEEEcCCcccccCCccCCCeE
Confidence            4456699999999888 99876443  333333333322   36799999999999999999765432211 11123567


Q ss_pred             eeeeee
Q 019150          243 ALMFQI  248 (345)
Q Consensus       243 ~~m~~~  248 (345)
                      .++|.-
T Consensus        78 ei~id~   83 (189)
T d1i8aa_          78 EIFIDE   83 (189)
T ss_dssp             EEEEES
T ss_pred             EEEEcC
Confidence            777754