Citrus Sinensis ID: 012283


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------
MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA
ccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEccccHHHHHHHHHHHHHHHHHccccEEEEEEccccHHHHHHcccccEEEEEccccccccHHHHHHHHHHHHcccccEEEEEccccHHHHHHHHHHccccEEEcccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccccccccccccccccccHHHHHHHHHHHHHHccccccEEEEcccccccHHccccccccccccHHHHHHHHHHHHHHcccEEEEcccccHHHHHHHHcccccccccccHHHHHHHHHHccEEEEcccHHHHHHHHHcccEEEEEccccccccccccccccEEEEEEccccccccccHHHHHHHHHHHHHHcccc
cccccccccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccHccccccccccccEEEEEEcccccHHHHHHHHHHHHHHHccccEEEEEEcccHHHHHHHcccccEEEEEccccccccHHHHHHHHHHHHHccccEEEEccccHHHHHHHHHHccccEEEEEcccccccccccEEEEcccccccccHHHHHHHHHHHHHHHccccccccccccccccEEcccHHHHHHHHHHHHHcccccccEEEEEccccccccccHHHcHHHHHccHHHHHHHcccHHccccEEEEcccccHHHHHHHHHccccccccccHHHHHHHHHHccEEEEcccHHHHHHHHccccEEEEEcccccccccccccccccEEEEEcccccEccccHHHHHHHHHHHcHHHHcc
mattsllpkstspflsnplsslpsthfikpsplfnppdnhslliTTKRLSFQThakkknpwldpfddgedpdmqygslfadgkqdedprppddpnnpygflkfpmgfnpeiaslplkirgdvrrccciisggvyenllffPAIQllkdrypgvLIDVIASARGKQTFELNKNVRwanvydldddwpepaeytDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDrvsyiypnvnaagAGLLLSETFTAESMNLSERGYNMYEQMVDWlgrpfrsvprhpvpplrVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIesdskasmqsrgdtdsllpIQVWAEIANGlrefrplfviphekeregvedvvgddasivfITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSElkgrlfvpnaeekkctvissrtgklidtpVEAVLNAMQIFNESLALA
mattsllpkstspflsnplsSLPSTHFIKPSPLFNPPDNHSLLITTKRLSfqthakkknpwldpfddGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASargkqtfelnknvrwanvydldddwpepAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKyknagaeqgkyIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKgrlfvpnaeekkctvissrtgklidTPVEAVLNAMQIFNESLALA
MATTSLLPKstspflsnplsslpsTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQdedprppddpnnpYGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA
******************************************LITT***************************************************YGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGI****************LLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFN******
********KSTSPFLSNPLS**************************************NPWLDPFDDGEDPDMQY*************************LKFPMGFNPEIASL**KIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL*
************PFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWLDPFDDGEDPDMQYGSLFADG********PDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESD***********DSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA
*******PKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLA**
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query467
255543315466 conserved hypothetical protein [Ricinus 0.993 0.995 0.795 0.0
341833960463 hypothetical protein [Pyrus x bretschnei 0.982 0.991 0.743 0.0
357467927472 hypothetical protein MTR_4g007080 [Medic 0.963 0.953 0.730 0.0
224115860465 predicted protein [Populus trichocarpa] 0.993 0.997 0.732 0.0
363808214471 uncharacterized protein LOC100802136 [Gl 0.995 0.987 0.753 0.0
356523261467 PREDICTED: uncharacterized protein LOC10 0.991 0.991 0.754 0.0
359483160466 PREDICTED: uncharacterized protein LOC10 0.980 0.982 0.753 0.0
388501644460 unknown [Medicago truncatula] 0.937 0.952 0.710 0.0
297844518460 hypothetical protein ARALYDRAFT_471797 [ 0.978 0.993 0.729 0.0
18394307461 NDH-dependent cyclic electron flow 1 [Ar 0.967 0.980 0.736 0.0
>gi|255543315|ref|XP_002512720.1| conserved hypothetical protein [Ricinus communis] gi|223547731|gb|EEF49223.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  737 bits (1903), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/469 (79%), Positives = 412/469 (87%), Gaps = 5/469 (1%)

Query: 1   MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNP--PDNHSLLITTKRLSFQTHAKKK 58
           MA+TSLLPK T PFL+NP SSL STHF KPS   NP  PD+HS        +   HA KK
Sbjct: 1   MASTSLLPKIT-PFLTNP-SSLSSTHFTKPS-FLNPLSPDHHSACARKPTTTTVLHANKK 57

Query: 59  NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
           NPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPD+PNNPYGFLKFP G++ EIASLPLKI
Sbjct: 58  NPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDNPNNPYGFLKFPKGYSVEIASLPLKI 117

Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
           RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRWAN 
Sbjct: 118 RGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVLASPRGKQTYELNKNVRWAND 177

Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
           YD DDDWPEPAEY D+LG++KNRYYDMVLSTKLAGLGHA+FLFMT+ARDRVSYI+PNVNA
Sbjct: 178 YDPDDDWPEPAEYVDMLGILKNRYYDMVLSTKLAGLGHASFLFMTSARDRVSYIHPNVNA 237

Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
           AGAGLLLSETFT +  NL+E GY+MY QM+DWLGRPFRSVPRH VPPLRVSISR+LKE V
Sbjct: 238 AGAGLLLSETFTPDITNLAEGGYHMYHQMLDWLGRPFRSVPRHTVPPLRVSISRKLKEFV 297

Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
             KY+ AGAE+GKYIVIHGIESDSKASMQSRGDTDSLLP+QVWAEIA  +R F+P+FVIP
Sbjct: 298 EAKYRAAGAEKGKYIVIHGIESDSKASMQSRGDTDSLLPLQVWAEIAKDIRAFKPVFVIP 357

Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
           HEKERE VE++VGDD SIV ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 
Sbjct: 358 HEKERENVEEIVGDDTSIVMITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFG 417

Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
           S  KGRLF+PNAEEKKC+++SS+TGKL D  V  V  AMQI + SLALA
Sbjct: 418 SIEKGRLFIPNAEEKKCSIVSSKTGKLKDIDVGDVKQAMQILDLSLALA 466




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|341833960|gb|AEK94315.1| hypothetical protein [Pyrus x bretschneideri] Back     alignment and taxonomy information
>gi|357467927|ref|XP_003604248.1| hypothetical protein MTR_4g007080 [Medicago truncatula] gi|355505303|gb|AES86445.1| hypothetical protein MTR_4g007080 [Medicago truncatula] Back     alignment and taxonomy information
>gi|224115860|ref|XP_002332075.1| predicted protein [Populus trichocarpa] gi|222831961|gb|EEE70438.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|363808214|ref|NP_001242232.1| uncharacterized protein LOC100802136 [Glycine max] gi|255635425|gb|ACU18065.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356523261|ref|XP_003530259.1| PREDICTED: uncharacterized protein LOC100803710 [Glycine max] Back     alignment and taxonomy information
>gi|359483160|ref|XP_002264393.2| PREDICTED: uncharacterized protein LOC100241596 [Vitis vinifera] gi|298204672|emb|CBI25170.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|388501644|gb|AFK38888.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|297844518|ref|XP_002890140.1| hypothetical protein ARALYDRAFT_471797 [Arabidopsis lyrata subsp. lyrata] gi|297335982|gb|EFH66399.1| hypothetical protein ARALYDRAFT_471797 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18394307|ref|NP_563986.1| NDH-dependent cyclic electron flow 1 [Arabidopsis thaliana] gi|6587804|gb|AAF18495.1|AC010924_8 ESTs gb|T43280, gb|AA395195 come from this gene [Arabidopsis thaliana] gi|16323198|gb|AAL15333.1| At1g15980/T24D18_8 [Arabidopsis thaliana] gi|21537186|gb|AAM61527.1| unknown [Arabidopsis thaliana] gi|21700921|gb|AAM70584.1| At1g15980/T24D18_8 [Arabidopsis thaliana] gi|332191271|gb|AEE29392.1| NDH-dependent cyclic electron flow 1 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query467
TAIR|locus:2200452461 PnsB1 "Photosynthetic NDH subc 0.916 0.928 0.741 3.4e-171
TAIR|locus:2200452 PnsB1 "Photosynthetic NDH subcomplex B 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1664 (590.8 bits), Expect = 3.4e-171, P = 3.4e-171
 Identities = 319/430 (74%), Positives = 360/430 (83%)

Query:    39 NHSLLITTKRLSFQTHAKKKN-PWLDPFDDGEDPDMQYGSLFADGKQXXXXXXXXXXXXX 97
             N    +T++      + KKKN PWLDPFD GEDPD +YGSLFADGKQ             
Sbjct:    31 NFQTRLTSRSSDVSVNLKKKNNPWLDPFDSGEDPDNEYGSLFADGKQDEDPRPPDNPDNP 90

Query:    98 YGFLKFPMGFNPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDV 157
             YGFLKFP G+  E+ASLPLKIRGDVRRCCC+ISGGVYENLLFFP IQL+KDRYPGV +D+
Sbjct:    91 YGFLKFPKGYTVELASLPLKIRGDVRRCCCVISGGVYENLLFFPTIQLIKDRYPGVQVDI 150

Query:   158 IASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHA 217
             + + RGKQT+ELNKNVRWANVYD DD WPEPAEYTD++G++K RYYDMVLSTKLAGLGHA
Sbjct:   151 LTTERGKQTYELNKNVRWANVYDPDDHWPEPAEYTDMIGLLKGRYYDMVLSTKLAGLGHA 210

Query:   218 AFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRS 277
             AFLFMTTARDRVSYIYPNVN+AGAGL+LSETFTAE+ NLSE GY+MY QM DWLGRPFRS
Sbjct:   211 AFLFMTTARDRVSYIYPNVNSAGAGLMLSETFTAENTNLSELGYSMYTQMEDWLGRPFRS 270

Query:   278 VPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLP 337
             VPR P+ PLRVSISR++KEVVA KY+NAGA  GK+IVIHGIESDSKASMQS+GD DSLL 
Sbjct:   271 VPRTPLLPLRVSISRKVKEVVAAKYRNAGAVTGKFIVIHGIESDSKASMQSKGDADSLLS 330

Query:   338 IQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIA 397
             ++ WA+I  G+R F+P+FVIPHEKERE VED VGDD SIVFITTPGQLAALINDSAGVIA
Sbjct:   331 LEKWAKIIKGVRGFKPVFVIPHEKERENVEDFVGDDTSIVFITTPGQLAALINDSAGVIA 390

Query:   398 TNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNA 456
             TNTAAIQLANAR+KP I LFSSE KG+LFVP AEEK  C +I+S+TGKL D  +  V NA
Sbjct:   391 TNTAAIQLANARDKPCIGLFSSEEKGKLFVPYAEEKSNCVIIASKTGKLADIDIGTVKNA 450

Query:   457 MQIFNESLAL 466
             MQ+F  SLAL
Sbjct:   451 MQVFEGSLAL 460


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.136   0.404    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      467       439   0.00089  118 3  11 22  0.39    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  607 (65 KB)
  Total size of DFA:  256 KB (2137 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  33.57u 0.18s 33.75t   Elapsed:  00:00:01
  Total cpu time:  33.57u 0.18s 33.75t   Elapsed:  00:00:01
  Start:  Fri May 10 02:19:03 2013   End:  Fri May 10 02:19:04 2013


GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009773 "photosynthetic electron transport in photosystem I" evidence=IMP
GO:0010598 "NAD(P)H dehydrogenase complex (plastoquinone)" evidence=IMP;IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0015979 "photosynthesis" evidence=RCA
GO:0016117 "carotenoid biosynthetic process" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query467
COG0859334 COG0859, RfaF, ADP-heptose:LPS heptosyltransferase 3e-18
cd03789279 cd03789, GT1_LPS_heptosyltransferase, Lipopolysacc 1e-04
cd03789279 cd03789, GT1_LPS_heptosyltransferase, Lipopolysacc 0.003
TIGR02201344 TIGR02201, heptsyl_trn_III, lipopolysaccharide hep 0.003
>gnl|CDD|223928 COG0859, RfaF, ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
 Score = 85.1 bits (211), Expect = 3e-18
 Identities = 71/357 (19%), Positives = 126/357 (35%), Gaps = 38/357 (10%)

Query: 122 VRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL 181
           + +   I    + + +L  P ++ LK  YP   IDV+         +LN  +    + D 
Sbjct: 1   MMKILVIRLSKLGDVVLTLPLLRTLKKAYPNAKIDVLVPKGFAPILKLNPEIDKVIIIDK 60

Query: 182 DDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSYIYPNVNAAG 240
                   E   +L  ++   YD V+   L GL  +A L  +     R+ +   +     
Sbjct: 61  KKKGLGLKERLALLRTLRKERYDAVID--LQGLLKSALLALLLGIPFRIGFDKKSA---- 114

Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
             LLL++ +        +     Y  +++ LG      P+    PL             E
Sbjct: 115 RELLLNKFYPRLDKPEGQHVVERYLALLEDLGLYPPPEPQL-DFPLPRPP--------IE 165

Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE-FRPLFVIPH 359
             KN       YIVI+       +   ++       P++ +AE+A  L      + +   
Sbjct: 166 LAKNLAKFDRPYIVINP----GASRGSAKR-----WPLEHYAELAELLIAKGYQVVLFGG 216

Query: 360 EKEREGVEDVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
             E E  E++     + V +       +LAALI  +  VI  ++  + LA A   P+IAL
Sbjct: 217 PDEEERAEEIAKGLPNAVILAGKTSLEELAALIAGADLVIGNDSGPMHLAAALGTPTIAL 276

Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTG----KLIDTPVEAV--LNAMQIFNESLALA 467
           +          P   + K   IS        K      E +  +   ++   + AL 
Sbjct: 277 YGPTSPAFTPPP---DPKLPGISGNLDCSPCKPSGGHHECLKDIEPEKVLEAAEALL 330


Length = 334

>gnl|CDD|99964 cd03789, GT1_LPS_heptosyltransferase, Lipopolysaccharide heptosyltransferase is involved in the biosynthesis of lipooligosaccharide (LOS) Back     alignment and domain information
>gnl|CDD|99964 cd03789, GT1_LPS_heptosyltransferase, Lipopolysaccharide heptosyltransferase is involved in the biosynthesis of lipooligosaccharide (LOS) Back     alignment and domain information
>gnl|CDD|131256 TIGR02201, heptsyl_trn_III, lipopolysaccharide heptosyltransferase III, putative Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 467
PRK10422352 lipopolysaccharide core biosynthesis protein; Prov 100.0
TIGR02201344 heptsyl_trn_III lipopolysaccharide heptosyltransfe 100.0
PRK10916348 ADP-heptose:LPS heptosyltransferase II; Provisiona 100.0
TIGR02195334 heptsyl_trn_II lipopolysaccharide heptosyltransfer 100.0
PRK10964322 ADP-heptose:LPS heptosyl transferase I; Provisiona 100.0
TIGR02193319 heptsyl_trn_I lipopolysaccharide heptosyltransfera 100.0
COG0859334 RfaF ADP-heptose:LPS heptosyltransferase [Cell env 100.0
cd03789279 GT1_LPS_heptosyltransferase Lipopolysaccharide hep 100.0
PF01075247 Glyco_transf_9: Glycosyltransferase family 9 (hept 99.97
TIGR03568365 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, 99.02
PRK10017426 colanic acid biosynthesis protein; Provisional 98.84
PF04007335 DUF354: Protein of unknown function (DUF354); Inte 98.27
PRK00025380 lpxB lipid-A-disaccharide synthase; Reviewed 98.24
TIGR03590279 PseG pseudaminic acid biosynthesis-associated prot 98.01
PF13528318 Glyco_trans_1_3: Glycosyl transferase family 1 97.6
TIGR03609298 S_layer_CsaB polysaccharide pyruvyl transferase Cs 97.57
TIGR00236365 wecB UDP-N-acetylglucosamine 2-epimerase. Epimeras 97.54
PRK05749425 3-deoxy-D-manno-octulosonic-acid transferase; Revi 97.53
cd03786363 GT1_UDP-GlcNAc_2-Epimerase Bacterial members of th 97.45
COG3980318 spsG Spore coat polysaccharide biosynthesis protei 97.41
TIGR03492396 conserved hypothetical protein. This protein famil 97.39
cd03807365 GT1_WbnK_like This family is most closely related 97.27
COG0707357 MurG UDP-N-acetylglucosamine:LPS N-acetylglucosami 97.22
cd03785350 GT1_MurG MurG is an N-acetylglucosaminyltransferas 97.2
PRK12446352 undecaprenyldiphospho-muramoylpentapeptide beta-N- 97.16
TIGR01133348 murG undecaprenyldiphospho-muramoylpentapeptide be 97.13
PRK00726357 murG undecaprenyldiphospho-muramoylpentapeptide be 97.04
COG1817346 Uncharacterized protein conserved in archaea [Func 96.95
PF02684373 LpxB: Lipid-A-disaccharide synthetase; InterPro: I 96.88
cd03808359 GT1_cap1E_like This family is most closely related 96.85
TIGR00661321 MJ1255 conserved hypothetical protein. This model 96.82
PRK13609380 diacylglycerol glucosyltransferase; Provisional 96.8
COG1519419 KdtA 3-deoxy-D-manno-octulosonic-acid transferase 96.74
PRK01021608 lpxB lipid-A-disaccharide synthase; Reviewed 96.68
PRK13608391 diacylglycerol glucosyltransferase; Provisional 96.66
cd04951360 GT1_WbdM_like This family is most closely related 96.56
TIGR03088374 stp2 sugar transferase, PEP-CTERM/EpsH1 system ass 96.54
PF04230286 PS_pyruv_trans: Polysaccharide pyruvyl transferase 96.51
cd03784401 GT1_Gtf_like This family includes the Gtfs, a grou 96.43
cd03811353 GT1_WabH_like This family is most closely related 96.3
COG2327385 WcaK Polysaccharide pyruvyl transferase family pro 96.29
TIGR00215385 lpxB lipid-A-disaccharide synthase. Lipid-A precur 96.28
cd03819355 GT1_WavL_like This family is most closely related 96.25
cd04962371 GT1_like_5 This family is most closely related to 96.06
COG0763381 LpxB Lipid A disaccharide synthetase [Cell envelop 95.92
cd03820348 GT1_amsD_like This family is most closely related 95.56
cd03823359 GT1_ExpE7_like This family is most closely related 95.51
cd03801374 GT1_YqgM_like This family is most closely related 95.45
PRK14089347 ipid-A-disaccharide synthase; Provisional 95.41
cd03817374 GT1_UGDG_like This family is most closely related 95.21
cd03812358 GT1_CapH_like This family is most closely related 95.06
PLN02871465 UDP-sulfoquinovose:DAG sulfoquinovosyltransferase 95.05
cd03799355 GT1_amsK_like This is a family of GT1 glycosyltran 94.54
PLN02605382 monogalactosyldiacylglycerol synthase 93.76
cd03798377 GT1_wlbH_like This family is most closely related 93.26
cd03802335 GT1_AviGT4_like This family is most closely relate 93.24
cd03794394 GT1_wbuB_like This family is most closely related 92.94
PF02350346 Epimerase_2: UDP-N-acetylglucosamine 2-epimerase; 92.93
cd04955363 GT1_like_6 This family is most closely related to 92.27
PRK09922359 UDP-D-galactose:(glucosyl)lipopolysaccharide-1,6-D 91.6
TIGR01426392 MGT glycosyltransferase, MGT family. This model de 88.88
cd03795357 GT1_like_4 This family is most closely related to 88.28
COG0381383 WecB UDP-N-acetylglucosamine 2-epimerase [Cell env 87.9
cd03816415 GT1_ALG1_like This family is most closely related 87.61
PF13477139 Glyco_trans_4_2: Glycosyl transferase 4-like 87.1
PF04413186 Glycos_transf_N: 3-Deoxy-D-manno-octulosonic-acid 86.96
COG3660329 Predicted nucleoside-diphosphate-sugar epimerase [ 85.47
PF14595129 Thioredoxin_9: Thioredoxin; PDB: 1Z6N_A. 83.97
PF04464369 Glyphos_transf: CDP-Glycerol:Poly(glycerophosphate 82.3
PF05159269 Capsule_synth: Capsule polysaccharide biosynthesis 80.33
>PRK10422 lipopolysaccharide core biosynthesis protein; Provisional Back     alignment and domain information
Probab=100.00  E-value=1.3e-52  Score=428.07  Aligned_cols=321  Identities=17%  Similarity=0.149  Sum_probs=249.2

Q ss_pred             CCccEEEEEecCCchhHHhHHHHHHHHHHHCCCcEEEEEEcCCchhhhhcCCCCCEEEEecCCCC--CCChHHHHHHHHH
Q 012283          120 GDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD--WPEPAEYTDILGV  197 (467)
Q Consensus       120 ~~~~rILII~~~~IGD~Il~tP~l~aLk~~yP~a~I~ll~~~~~~~l~~~~p~Id~ii~~~~~~~--~~~~~~~~~l~~~  197 (467)
                      +++||||||++++|||+|+++|++++||++||+++|+|++.+.++++++.+|+||+|+.++++..  +..+..+++++++
T Consensus         3 ~~~~~ILii~~~~iGD~vl~~P~l~~Lk~~~P~a~I~~l~~~~~~~l~~~~P~id~vi~~~~~~~~~~~~~~~~~~l~~~   82 (352)
T PRK10422          3 KPFRRILIIKMRFHGDMLLTTPVISSLKKNYPDAKIDVLLYQDTIPILSENPEINALYGIKNKKAGASEKIKNFFSLIKV   82 (352)
T ss_pred             CCCceEEEEEecccCceeeHHHHHHHHHHHCCCCeEEEEeccChHHHhccCCCceEEEEeccccccHHHHHHHHHHHHHH
Confidence            46799999999999999999999999999999999999999999999999999999999987641  1234567888999


Q ss_pred             hHhCCCcEEEEcccCCchHHHHHHHhCCCeeEeccCCCCCccccccccceeecCCccccccchhhHHHHHHHHcCCCCCC
Q 012283          198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRS  277 (467)
Q Consensus       198 Lr~~~yDlvI~l~~~~~~~~ll~~l~gak~riG~~~~~~~~~~~~~~~t~~i~~~~~~~~~~~~h~~~~lL~~Lgi~~~~  277 (467)
                      ||+++||++||++. +++++++++++|++.||||....+....|..+|++.++...    .+....+.++++.+|+... 
T Consensus        83 lr~~~yD~vidl~~-~~~s~ll~~l~~a~~rig~~~~~~~~~~~~~~~~~~~~~~~----~h~~~~~~~ll~~lg~~~~-  156 (352)
T PRK10422         83 LRANKYDLIVNLTD-QWMVALLVRLLNARVKISQDYHHRQSAFWRKSFTHLVPLQG----GHIVESNLSVLTPLGLSSL-  156 (352)
T ss_pred             HhhCCCCEEEEccc-chHHHHHHHHhCCCeEEeeccccccchhHHHHhcccCCCCC----cchHHhhHhHHhhcCCCCC-
Confidence            99999999999996 57899999999999999998554332234446666553221    1223333457788887531 


Q ss_pred             CCCCCCCCceeecCHHHHHHHHHHHHHcCCCCCcEEEEecCCCCccccccCCCCCCCCCCHHHHHHHHHHhhhCC-CEEE
Q 012283          278 VPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR-PLFV  356 (467)
Q Consensus       278 v~~~~~p~~~l~l~~~~~~~a~~~l~~~~l~~~~~I~i~pgas~s~~~~~~r~~~K~rWP~e~~~~Li~~L~~~~-~Vvl  356 (467)
                           .+.+.+.+++++.+.+++.+...+.. +++|+||||++.         ..| +||.|+|++|++.|.+++ .|++
T Consensus       157 -----~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~i~i~pga~~---------~~K-~Wp~e~fa~l~~~L~~~~~~vvl  220 (352)
T PRK10422        157 -----VKETTMSYRPESWKRMRRQLDHLGVT-QNYVVIQPTARQ---------IFK-CWDNDKFSAVIDALQARGYEVVL  220 (352)
T ss_pred             -----CCcceeecCHHHHHHHHHHHHhcCCC-CCeEEEecCCCc---------ccc-CCCHHHHHHHHHHHHHCCCeEEE
Confidence                 23336667776666666666655543 689999997543         346 699999999999998776 5788


Q ss_pred             ecCccc--HHHHHHHHhcCC--CC---cccCCHHHHHHHHHhcCEEEeCCchHHHHHHhcCCCEEEEeCCCCCCCccccC
Q 012283          357 IPHEKE--REGVEDVVGDDA--SI---VFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPN  429 (467)
Q Consensus       357 ~g~~~e--~~~~~~i~~~~~--~~---~~~~sL~el~alI~~a~lvIg~DTG~~HLAaAlg~PtVaLFg~t~p~~~~~P~  429 (467)
                      +|+++|  .+.+++|.+.+.  ..   ..++||.|++++|++|+++||||||+||||+|+|+|||+|||+|++. .|+|+
T Consensus       221 ~ggp~e~e~~~~~~i~~~~~~~~~~~l~g~~sL~el~ali~~a~l~v~nDSGp~HlAaA~g~P~v~lfGpt~p~-~~~P~  299 (352)
T PRK10422        221 TSGPDKDDLACVNEIAQGCQTPPVTALAGKTTFPELGALIDHAQLFIGVDSAPAHIAAAVNTPLICLFGATDHI-FWRPW  299 (352)
T ss_pred             EcCCChHHHHHHHHHHHhcCCCccccccCCCCHHHHHHHHHhCCEEEecCCHHHHHHHHcCCCEEEEECCCCcc-ccCCC
Confidence            888754  445577776532  22   22789999999999999999999999999999999999999999864 68898


Q ss_pred             CCCCceEeec-------------CCCCCCCCCCHHHHHHHHHHHHHhh
Q 012283          430 AEEKKCTVIS-------------SRTGKLIDTPVEAVLNAMQIFNESL  464 (467)
Q Consensus       430 ~~~~~c~i~~-------------~~~~cm~~Is~e~V~~ai~~ll~~~  464 (467)
                       +++..++..             ....||++|+||+|++++++++.++
T Consensus       300 -~~~~~v~~~~~~~~~pc~~~~~~~~~Cm~~I~~~~V~~~~~~ll~~~  346 (352)
T PRK10422        300 -SDNMIQFWAGDYQEMPTRDELDRNEKYLSVIPAADVIAAVDKLLPSS  346 (352)
T ss_pred             -CCCeeEEECCCcccCcCcccCCccccHhhcCCHHHHHHHHHHHHhcc
Confidence             543322211             1246999999999999999998765



>TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative Back     alignment and domain information
>PRK10916 ADP-heptose:LPS heptosyltransferase II; Provisional Back     alignment and domain information
>TIGR02195 heptsyl_trn_II lipopolysaccharide heptosyltransferase II Back     alignment and domain information
>PRK10964 ADP-heptose:LPS heptosyl transferase I; Provisional Back     alignment and domain information
>TIGR02193 heptsyl_trn_I lipopolysaccharide heptosyltransferase I Back     alignment and domain information
>COG0859 RfaF ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>cd03789 GT1_LPS_heptosyltransferase Lipopolysaccharide heptosyltransferase is involved in the biosynthesis of lipooligosaccharide (LOS) Back     alignment and domain information
>PF01075 Glyco_transf_9: Glycosyltransferase family 9 (heptosyltransferase); InterPro: IPR002201 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases Back     alignment and domain information
>TIGR03568 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing Back     alignment and domain information
>PRK10017 colanic acid biosynthesis protein; Provisional Back     alignment and domain information
>PF04007 DUF354: Protein of unknown function (DUF354); InterPro: IPR007152 Members of this family are around 350 amino acids in length Back     alignment and domain information
>PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed Back     alignment and domain information
>TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG Back     alignment and domain information
>PF13528 Glyco_trans_1_3: Glycosyl transferase family 1 Back     alignment and domain information
>TIGR03609 S_layer_CsaB polysaccharide pyruvyl transferase CsaB Back     alignment and domain information
>TIGR00236 wecB UDP-N-acetylglucosamine 2-epimerase Back     alignment and domain information
>PRK05749 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed Back     alignment and domain information
>cd03786 GT1_UDP-GlcNAc_2-Epimerase Bacterial members of the UDP-N-Acetylglucosamine (GlcNAc) 2-Epimerase family are known to catalyze the reversible interconversion of UDP-GlcNAc and UDP-N-acetylmannosamine (UDP-ManNAc) Back     alignment and domain information
>COG3980 spsG Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>TIGR03492 conserved hypothetical protein Back     alignment and domain information
>cd03807 GT1_WbnK_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>COG0707 MurG UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>cd03785 GT1_MurG MurG is an N-acetylglucosaminyltransferase, the last enzyme involved in the intracellular phase of peptidoglycan biosynthesis Back     alignment and domain information
>PRK12446 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase; Reviewed Back     alignment and domain information
>TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase Back     alignment and domain information
>PRK00726 murG undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional Back     alignment and domain information
>COG1817 Uncharacterized protein conserved in archaea [Function unknown] Back     alignment and domain information
>PF02684 LpxB: Lipid-A-disaccharide synthetase; InterPro: IPR003835 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases Back     alignment and domain information
>cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>TIGR00661 MJ1255 conserved hypothetical protein Back     alignment and domain information
>PRK13609 diacylglycerol glucosyltransferase; Provisional Back     alignment and domain information
>COG1519 KdtA 3-deoxy-D-manno-octulosonic-acid transferase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>PRK01021 lpxB lipid-A-disaccharide synthase; Reviewed Back     alignment and domain information
>PRK13608 diacylglycerol glucosyltransferase; Provisional Back     alignment and domain information
>cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli Back     alignment and domain information
>TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated Back     alignment and domain information
>PF04230 PS_pyruv_trans: Polysaccharide pyruvyl transferase; InterPro: IPR007345 Pyruvyl-transferases are involved in peptidoglycan-associated polymer biosynthesis Back     alignment and domain information
>cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin Back     alignment and domain information
>cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>COG2327 WcaK Polysaccharide pyruvyl transferase family protein [Cell wall/membrane/envelope biogenesis] Back     alignment and domain information
>TIGR00215 lpxB lipid-A-disaccharide synthase Back     alignment and domain information
>cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>COG0763 LpxB Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known Back     alignment and domain information
>PRK14089 ipid-A-disaccharide synthase; Provisional Back     alignment and domain information
>cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>cd03812 GT1_CapH_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>PLN02871 UDP-sulfoquinovose:DAG sulfoquinovosyltransferase Back     alignment and domain information
>cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria Back     alignment and domain information
>PLN02605 monogalactosyldiacylglycerol synthase Back     alignment and domain information
>cd03798 GT1_wlbH_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>PF02350 Epimerase_2: UDP-N-acetylglucosamine 2-epimerase; InterPro: IPR003331 UDP-N-acetylglucosamine 2-epimerase 5 Back     alignment and domain information
>cd04955 GT1_like_6 This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>PRK09922 UDP-D-galactose:(glucosyl)lipopolysaccharide-1,6-D-galactosyltransferase; Provisional Back     alignment and domain information
>TIGR01426 MGT glycosyltransferase, MGT family Back     alignment and domain information
>cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>cd03816 GT1_ALG1_like This family is most closely related to the GT1 family of glycosyltransferases Back     alignment and domain information
>PF13477 Glyco_trans_4_2: Glycosyl transferase 4-like Back     alignment and domain information
>PF04413 Glycos_transf_N: 3-Deoxy-D-manno-octulosonic-acid transferase (kdotransferase); InterPro: IPR007507 This is a domain found in proteins that transfer activated sugars to a variety of substrates, including glycogen, fructose-6-phosphate and lipopolysaccharides Back     alignment and domain information
>COG3660 Predicted nucleoside-diphosphate-sugar epimerase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>PF14595 Thioredoxin_9: Thioredoxin; PDB: 1Z6N_A Back     alignment and domain information
>PF04464 Glyphos_transf: CDP-Glycerol:Poly(glycerophosphate) glycerophosphotransferase ; InterPro: IPR007554 Wall-associated teichoic acids are a heterogeneous class of phosphate-rich polymers that are covalently linked to the cell wall peptidoglycan of Gram-positive bacteria Back     alignment and domain information
>PF05159 Capsule_synth: Capsule polysaccharide biosynthesis protein; InterPro: IPR007833 This family includes export proteins involved in capsule polysaccharide biosynthesis, such as KpsS P42218 from SWISSPROT and LipB P57038 from SWISSPROT Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query467
2gt1_A326 Lipopolysaccharide heptosyltransferase-1; GT-B fol 2e-20
3tov_A349 Glycosyl transferase family 9; structural genomics 6e-16
1psw_A348 ADP-heptose LPS heptosyltransferase II; structural 8e-14
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
>2gt1_A Lipopolysaccharide heptosyltransferase-1; GT-B fold; 1.90A {Escherichia coli UTI89} PDB: 2h1f_A* 2h1h_A* Length = 326 Back     alignment and structure
 Score = 90.7 bits (226), Expect = 2e-20
 Identities = 48/339 (14%), Positives = 98/339 (28%), Gaps = 53/339 (15%)

Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNV---------RWANVYDLDDDWPE 187
           L   PA+   +   PG+  D +      Q    +  V         RW   +        
Sbjct: 15  LHTLPALTDAQQAIPGIKFDWVVEEGFAQIPSWHAAVERVIPVAIRRWRKAWFSAPIK-- 72

Query: 188 PAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSYIYPNVNAAGAGLLLS 246
            AE       ++ + YD V+     GL  +A L        +    +       A L  +
Sbjct: 73  -AERKAFREALQAKNYDAVID--AQGLVKSAALVTRLAHGVKHGMDWQTAREPLASLFYN 129

Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
                       +  +  E+  +   +      +                +      N  
Sbjct: 130 RKHHIA------KQQHAVERTRELFAKSLG-YSKPQTQGDY--------AIAQHFLTNLP 174

Query: 307 AEQGKYIVIHGIES-DSKASMQSRGDTDSLLPIQVWAEIANGLRE--FRPLFVIPHEKER 363
            + G+Y V     + D K             P + W E+   L +   R         E 
Sbjct: 175 TDAGEYAVFLHATTRDDK-----------HWPEEHWRELIGLLADSGIRIKLPWGAPHEE 223

Query: 364 EGVEDVVGDDASIVFI--TTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF--SS 419
           E  + +    A +  +   +   +A ++  +  V++ +T    L  A ++P+I ++  + 
Sbjct: 224 ERAKRLAEGFAYVEVLPKMSLEGVARVLAGAKFVVSVDTGLSHLTAALDRPNITVYGPTD 283

Query: 420 ELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ 458
                L     +     V  +   +L      AV   ++
Sbjct: 284 P---GLIGGYGK--NQMVCRAPGNELSQLTANAVKQFIE 317


>3tov_A Glycosyl transferase family 9; structural genomics, PSI-BIOL protein structure initiative, midwest center for structural genomics, MCSG; 2.98A {Veillonella parvula} Length = 349 Back     alignment and structure
>1psw_A ADP-heptose LPS heptosyltransferase II; structural genomics, NYSGXRC, LPS biosynthetic pathway, PSI, protein structure initiative; 2.00A {Escherichia coli} SCOP: c.87.1.7 Length = 348 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query467
3tov_A349 Glycosyl transferase family 9; structural genomics 100.0
2gt1_A326 Lipopolysaccharide heptosyltransferase-1; GT-B fol 100.0
1psw_A348 ADP-heptose LPS heptosyltransferase II; structural 100.0
3otg_A412 CALG1; calicheamicin, TDP, structural genomics, PS 98.1
4fzr_A398 SSFS6; structural genomics, PSI-biology, protein s 97.88
3dzc_A396 UDP-N-acetylglucosamine 2-epimerase; structural ge 97.75
3okp_A394 GDP-mannose-dependent alpha-(1-6)-phosphatidylino 97.72
3rsc_A415 CALG2; TDP, enediyne, structural genomics, PSI-2, 97.56
1vgv_A384 UDP-N-acetylglucosamine 2-epimerase; structural ge 97.55
3ot5_A403 UDP-N-acetylglucosamine 2-epimerase; structural ge 97.5
3oti_A398 CALG3; calicheamicin, TDP, structural genomics, PS 97.47
3ia7_A402 CALG4; glycosysltransferase, calicheamicin, enediy 97.43
2p6p_A384 Glycosyl transferase; X-RAY-diffraction,urdamycina 97.35
1f0k_A364 MURG, UDP-N-acetylglucosamine-N-acetylmuramyl- (pe 97.34
1v4v_A376 UDP-N-acetylglucosamine 2-epimerase; UDP-GLCNAC, t 97.33
3beo_A375 UDP-N-acetylglucosamine 2-epimerase; UDP-GLCNAC, a 97.19
4hwg_A385 UDP-N-acetylglucosamine 2-epimerase; ssgcid, struc 97.12
3tsa_A391 SPNG, NDP-rhamnosyltransferase; glycosyltransferas 97.03
2iyf_A430 OLED, oleandomycin glycosyltransferase; antibiotic 97.01
3hbm_A282 UDP-sugar hydrolase; PSEG; 1.80A {Campylobacter je 97.0
3h4t_A404 Glycosyltransferase GTFA, glycosyltransferase; van 96.96
3s2u_A365 UDP-N-acetylglucosamine--N-acetylmuramyl-(pentape 96.91
2gek_A406 Phosphatidylinositol mannosyltransferase (PIMA); G 96.78
3fro_A439 GLGA glycogen synthase; glycosyltransferase family 96.59
3c48_A438 Predicted glycosyltransferases; retaining glycosyl 96.56
2iya_A424 OLEI, oleandomycin glycosyltransferase; carbohydra 96.44
4amg_A400 Snogd; transferase, polyketide biosynthesis, GT1 f 96.44
3oy2_A413 Glycosyltransferase B736L; rossmann fold, GDP-mann 95.95
2jjm_A394 Glycosyl transferase, group 1 family protein; anth 95.89
2iuy_A342 Avigt4, glycosyltransferase; antibiotics, family G 95.83
2bfw_A200 GLGA glycogen synthase; glycosyltransferase family 95.08
2o6l_A170 UDP-glucuronosyltransferase 2B7; drug metabolism, 92.17
2f62_A161 Nucleoside 2-deoxyribosyltransferase; SGPP, struct 90.01
2xci_A374 KDO-transferase, 3-deoxy-D-manno-2-octulosonic aci 89.89
2f9f_A177 First mannosyl transferase (WBAZ-1); alpha-beta pr 88.54
2yjn_A441 ERYCIII, glycosyltransferase; transferase, cytochr 87.53
3qhp_A166 Type 1 capsular polysaccharide biosynthesis prote 86.47
1rrv_A416 Glycosyltransferase GTFD; GT-B, glycosyltransferas 85.76
2iw1_A374 Lipopolysaccharide core biosynthesis protein RFAG; 85.19
2vsy_A568 XCC0866; transferase, glycosyl transferase, GT-B, 84.65
2x6q_A416 Trehalose-synthase TRET; biosynthetic protein; 2.2 84.63
1iir_A415 Glycosyltransferase GTFB; rossmann fold; 1.80A {Am 80.15
>3tov_A Glycosyl transferase family 9; structural genomics, PSI-BIOL protein structure initiative, midwest center for structural genomics, MCSG; 2.98A {Veillonella parvula} Back     alignment and structure
Probab=100.00  E-value=4.6e-52  Score=422.17  Aligned_cols=318  Identities=15%  Similarity=0.142  Sum_probs=260.1

Q ss_pred             CCccEEEEEecCCchhHHhHHHHHHHHHHHCCCcEEEEEEcCCchhhhhcCCCCCEEEEecCCCCCCChHHHHHHHHHhH
Q 012283          120 GDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMK  199 (467)
Q Consensus       120 ~~~~rILII~~~~IGD~Il~tP~l~aLk~~yP~a~I~ll~~~~~~~l~~~~p~Id~ii~~~~~~~~~~~~~~~~l~~~Lr  199 (467)
                      -+.||||||+.++|||+|+++|++++||++||+++|++++.+.++++++.+|+||+|+.++++...+++..+++++++||
T Consensus         6 l~~~~iLvi~~~~lGD~i~~~P~l~~L~~~~P~a~I~~l~~~~~~~l~~~~p~vd~vi~~~~~~~~~~~~~~~~l~~~Lr   85 (349)
T 3tov_A            6 LDYKRIVVTFLMHLGDVILTTPFLEVLRKAAPHSHITYVIDEKLQQVMEYNPNIDELIVVDKKGRHNSISGLNEVAREIN   85 (349)
T ss_dssp             CTTCEEEEECCCCHHHHHTTHHHHHHHHHHCTTSEEEEEEEGGGGGGTSSCTTCSEEEEECCSSHHHHHHHHHHHHHHHH
T ss_pred             CCCCEEEEEecCcccHHHHHHHHHHHHHHHCCCCEEEEEECcchhHHHhcCCCccEEEEeCcccccccHHHHHHHHHHHh
Confidence            45789999999999999999999999999999999999999999999999999999999997642234567788999999


Q ss_pred             hCCC-cEEEEcccCCchHHHHHHHhCCCeeEeccCCCCCccccccccceeecCCccccccchhhHHHHHHHHcCCCCCCC
Q 012283          200 NRYY-DMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSV  278 (467)
Q Consensus       200 ~~~y-DlvI~l~~~~~~~~ll~~l~gak~riG~~~~~~~~~~~~~~~t~~i~~~~~~~~~~~~h~~~~lL~~Lgi~~~~v  278 (467)
                      +++| |++||++. +.+++++++++|++.|+||....     ++++|++.++.....  .|..+.+.++++.+|+...  
T Consensus        86 ~~~y~D~vidl~~-~~rs~~l~~~~~a~~riG~~~~~-----~~~~~~~~~~~~~~~--~h~v~r~~~ll~~lg~~~~--  155 (349)
T 3tov_A           86 AKGKTDIVINLHP-NERTSYLAWKIHAPITTGMSHFL-----FRPFMTKYTRLDRKT--RHAADMYINVLEQLGVTDT--  155 (349)
T ss_dssp             HHCCCCEEEECCC-SHHHHHHHHHHCCSEEEECCCTT-----TGGGCSEECCCCTTT--SCHHHHHHHHHHHTTCCCC--
T ss_pred             hCCCCeEEEECCC-ChHHHHHHHHhCCCeEEecCCCC-----ccccccccccCCCCC--ccHHHHHHHHHHHhCCCcc--
Confidence            9999 99999996 67999999999999999998432     235778766543220  1233444467888998631  


Q ss_pred             CCCCCCCceeecCHHHHHHHHHHHHHcCCC-CCcEEEEecCCCCccccccCCCCCCCCCCHHHHHHHHHHhhhCC-CEEE
Q 012283          279 PRHPVPPLRVSISRRLKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR-PLFV  356 (467)
Q Consensus       279 ~~~~~p~~~l~l~~~~~~~a~~~l~~~~l~-~~~~I~i~pgas~s~~~~~~r~~~K~rWP~e~~~~Li~~L~~~~-~Vvl  356 (467)
                         ..+.+++.+++++.+.+++++.+.|+. ++++|+||||++.         ..| +||.|+|++|++.|.+++ .|++
T Consensus       156 ---~~~~~~l~~~~~~~~~~~~~l~~~g~~~~~~~i~i~pga~~---------~~k-~wp~~~~~~l~~~l~~~g~~vvl  222 (349)
T 3tov_A          156 ---SNSGLHIEICEEWRCQAQEFYSSHGLTDTDILIGFNIGSAV---------PEK-RWPAERFAHVADYFGRLGYKTVF  222 (349)
T ss_dssp             ---CCCCCCCCCCHHHHHHHHHHHHHTTCCTTCCEEEEECCCSS---------GGG-CCCHHHHHHHHHHHHHHTCEEEE
T ss_pred             ---CCCceeeeCCHHHHHHHHHHHHHcCCCCCCCEEEEeCCCCC---------ccC-CCCHHHHHHHHHHHHhCCCeEEE
Confidence               112346777888888888888888775 3689999997543         345 699999999999998877 4778


Q ss_pred             ecCcccHHHHHHHHhcCCC-Cc---ccCCHHHHHHHHHhcCEEEeCCchHHHHHHhcCCCEEEEeCCCCCCCccccCCCC
Q 012283          357 IPHEKEREGVEDVVGDDAS-IV---FITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEE  432 (467)
Q Consensus       357 ~g~~~e~~~~~~i~~~~~~-~~---~~~sL~el~alI~~a~lvIg~DTG~~HLAaAlg~PtVaLFg~t~p~~~~~P~~~~  432 (467)
                      +|+++|++.++++.+.++. ..   .++||.|++++|++|+++||+|||+||||+|+|+|+|+|||+++|. .|+|+ ++
T Consensus       223 ~g~~~e~~~~~~i~~~~~~~~~~l~g~~sl~e~~ali~~a~~~i~~DsG~~HlAaa~g~P~v~lfg~t~p~-~~~P~-~~  300 (349)
T 3tov_A          223 FGGPMDLEMVQPVVEQMETKPIVATGKFQLGPLAAAMNRCNLLITNDSGPMHVGISQGVPIVALYGPSNPF-FYGPY-QA  300 (349)
T ss_dssp             CCCTTTHHHHHHHHHTCSSCCEECTTCCCHHHHHHHHHTCSEEEEESSHHHHHHHTTTCCEEEECSSCCHH-HHSCT-TC
T ss_pred             EeCcchHHHHHHHHHhcccccEEeeCCCCHHHHHHHHHhCCEEEECCCCHHHHHHhcCCCEEEEECCCCcc-ccCCC-CC
Confidence            9999999999999876643 22   2789999999999999999999999999999999999999999874 68898 54


Q ss_pred             CceEeec-----------------CCCCCCCCCCHHHHHHHHHHHHHh
Q 012283          433 KKCTVIS-----------------SRTGKLIDTPVEAVLNAMQIFNES  463 (467)
Q Consensus       433 ~~c~i~~-----------------~~~~cm~~Is~e~V~~ai~~ll~~  463 (467)
                      +. .++.                 ++..||++|+||+|+++++++|.+
T Consensus       301 ~~-~vl~~~~~C~C~~~~~~~C~~~~~~Cm~~I~~~~V~~a~~~lL~~  347 (349)
T 3tov_A          301 HA-IVLETMDSYEIGKSMKKIIKEGNYKGLSVISEEQVIKAAETLLLE  347 (349)
T ss_dssp             SE-EEECHHHHHHHHHHTTCCCCGGGCSTTTTSCHHHHHHHHHHHHHH
T ss_pred             Ce-EEEeCCCCcCccCCccCCCCCCccchhhcCCHHHHHHHHHHHHhh
Confidence            43 2221                 246799999999999999999875



>2gt1_A Lipopolysaccharide heptosyltransferase-1; GT-B fold; 1.90A {Escherichia coli UTI89} PDB: 2h1f_A* 2h1h_A* Back     alignment and structure
>1psw_A ADP-heptose LPS heptosyltransferase II; structural genomics, NYSGXRC, LPS biosynthetic pathway, PSI, protein structure initiative; 2.00A {Escherichia coli} SCOP: c.87.1.7 Back     alignment and structure
>3otg_A CALG1; calicheamicin, TDP, structural genomics, PSI-2, protein STRU initiative, center for eukaryotic structural genomics, CESG fold; HET: TYD; 2.08A {Micromonospora echinospora} PDB: 3oth_A* Back     alignment and structure
>4fzr_A SSFS6; structural genomics, PSI-biology, protein structure initiati enzyme discovery for natural product biosynthesis, natPro; 2.40A {Streptomyces SP} PDB: 4g2t_A* Back     alignment and structure
>3dzc_A UDP-N-acetylglucosamine 2-epimerase; structural genomics, in diseases, isomerase, center for structural genomics of INFE diseases, csgid; 2.35A {Vibrio cholerae} Back     alignment and structure
>3okp_A GDP-mannose-dependent alpha-(1-6)-phosphatidylino monomannoside mannosyltransferase...; GT-B fold, alpha-mannosyltransferase; HET: GDD; 2.00A {Corynebacterium glutamicum} PDB: 3okc_A* 3oka_A* Back     alignment and structure
>3rsc_A CALG2; TDP, enediyne, structural genomics, PSI-2, protein structure initiative, center for eukaryotic structural genomics; HET: TYD C0T; 2.19A {Micromonospora echinospora} PDB: 3iaa_A* Back     alignment and structure
>1vgv_A UDP-N-acetylglucosamine 2-epimerase; structural genomics, isomerase; HET: UD1; 2.31A {Escherichia coli} SCOP: c.87.1.3 PDB: 1f6d_A* Back     alignment and structure
>3ot5_A UDP-N-acetylglucosamine 2-epimerase; structural genomics, center for structural genomics of infec diseases, csgid, alpha beta; HET: PGE; 2.20A {Listeria monocytogenes} Back     alignment and structure
>3oti_A CALG3; calicheamicin, TDP, structural genomics, PSI-2, protein STRU initiative, center for eukaryotic structural genomics, CESG fold; HET: TYD C0T; 1.60A {Micromonospora echinospora} PDB: 3d0q_A* 3d0r_A* Back     alignment and structure
>3ia7_A CALG4; glycosysltransferase, calicheamicin, enediyne, transf; 1.91A {Micromonospora echinospora} Back     alignment and structure
>2p6p_A Glycosyl transferase; X-RAY-diffraction,urdamycina-biosynthesis; 1.88A {Streptomyces fradiae} Back     alignment and structure
>1f0k_A MURG, UDP-N-acetylglucosamine-N-acetylmuramyl- (pentapeptide) pyrophosphoryl-undecaprenol...; rossmann fold, transferase; 1.90A {Escherichia coli} SCOP: c.87.1.2 PDB: 1nlm_A* Back     alignment and structure
>1v4v_A UDP-N-acetylglucosamine 2-epimerase; UDP-GLCNAC, two domains, homodimer, riken structural genomics/proteomics initiative, RSGI; HET: MSE; 1.80A {Thermus thermophilus} SCOP: c.87.1.3 Back     alignment and structure
>3beo_A UDP-N-acetylglucosamine 2-epimerase; UDP-GLCNAC, allosteric, regulation, isomerase; HET: UD1 UDP; 1.70A {Bacillus anthracis} PDB: 1o6c_A Back     alignment and structure
>4hwg_A UDP-N-acetylglucosamine 2-epimerase; ssgcid, structural genomics, seattle structural genomics center for infectious disease, isomerase; 2.00A {Rickettsia bellii} Back     alignment and structure
>3tsa_A SPNG, NDP-rhamnosyltransferase; glycosyltransferase; HET: GLC; 1.70A {Saccharopolyspora spinosa} PDB: 3uyk_A* 3uyl_A* Back     alignment and structure
>2iyf_A OLED, oleandomycin glycosyltransferase; antibiotic resistance, glycosylation, enzyme, macrolide, carbohydrate; HET: ERY UDP; 1.7A {Streptomyces antibioticus} Back     alignment and structure
>3hbm_A UDP-sugar hydrolase; PSEG; 1.80A {Campylobacter jejuni subsp} PDB: 3hbn_A* Back     alignment and structure
>3h4t_A Glycosyltransferase GTFA, glycosyltransferase; vancomycin, teicoplanin, ORF1, natural products, antibiotic; HET: UDP; 1.15A {Amycolatopsis orientalis} SCOP: c.87.1.5 PDB: 3h4i_A* 1pn3_A* 1pnv_A* Back     alignment and structure
>3s2u_A UDP-N-acetylglucosamine--N-acetylmuramyl-(pentape pyrophosphoryl-undecaprenol N-acetylglucosamine...; N-acetylglucosaminyl transferase; HET: UD1; 2.23A {Pseudomonas aeruginosa} Back     alignment and structure
>2gek_A Phosphatidylinositol mannosyltransferase (PIMA); GT4 glycosyltransferase, rossmann fold, complex; HET: GDP; 2.40A {Mycobacterium smegmatis} PDB: 2gej_A* Back     alignment and structure
>3fro_A GLGA glycogen synthase; glycosyltransferase family, UDP/ADP-glucose-glycogen synthas rossman folds, transferase; HET: NHF; 2.50A {Pyrococcus abyssi} SCOP: c.87.1.8 PDB: 2bis_A* 3l01_A* Back     alignment and structure
>3c48_A Predicted glycosyltransferases; retaining glycosyltransferase, beta alpha beta, substrate AS catalysis; 2.10A {Corynebacterium glutamicum} PDB: 3c4v_A* 3c4q_A* Back     alignment and structure
>2iya_A OLEI, oleandomycin glycosyltransferase; carbohydrate, glycosylation, enzyme, macrolide; HET: UDP ZIO; 1.7A {Streptomyces antibioticus} Back     alignment and structure
>4amg_A Snogd; transferase, polyketide biosynthesis, GT1 family, nogalamyci; HET: MLY; 2.59A {Streptomyces nogalater} PDB: 4an4_A* 4amb_A* Back     alignment and structure
>3oy2_A Glycosyltransferase B736L; rossmann fold, GDP-mannose, sugar, VIRU proteins, viral protein,transferase; 2.31A {Paramecium bursaria chlorella virus NY} PDB: 3oy7_A* Back     alignment and structure
>2jjm_A Glycosyl transferase, group 1 family protein; anthrax, nucleotide, carbohydrate; 3.10A {Bacillus anthracis} PDB: 3mbo_A* Back     alignment and structure
>2iuy_A Avigt4, glycosyltransferase; antibiotics, family GT-4, avilamycin A; HET: MES; 2.1A {Streptomyces viridochromogenes} PDB: 2iv3_A* Back     alignment and structure
>2bfw_A GLGA glycogen synthase; glycosyltransferase family 5 UDP/ADP-glucose-glycogen syntha rossman folds, transferase; 1.8A {Pyrococcus abyssi} SCOP: c.87.1.8 Back     alignment and structure
>2o6l_A UDP-glucuronosyltransferase 2B7; drug metabolism, rossman, MAD, enzyme, nucleotide binding, sugar,UDP-glucuronosyltransferase, UGT; 1.80A {Homo sapiens} Back     alignment and structure
>2f62_A Nucleoside 2-deoxyribosyltransferase; SGPP, structural genomics, PSI, S genomics of pathogenic protozoa consortium; HET: 12M; 1.50A {Trypanosoma brucei} SCOP: c.23.14.1 PDB: 2a0k_A* 2f2t_A* 2f64_A* 2f67_A* Back     alignment and structure
>2xci_A KDO-transferase, 3-deoxy-D-manno-2-octulosonic acid transferase; KDTA, GSEA, glycosyltransferase superfamily B,; HET: PG4; 2.00A {Aquifex aeolicus} PDB: 2xcu_A* Back     alignment and structure
>2f9f_A First mannosyl transferase (WBAZ-1); alpha-beta protein, structural genomics, PSI, protein struct initiative; 1.80A {Archaeoglobus fulgidus} SCOP: c.87.1.8 Back     alignment and structure
>2yjn_A ERYCIII, glycosyltransferase; transferase, cytochrome P450; 3.09A {Saccharopolyspora erythraea} Back     alignment and structure
>3qhp_A Type 1 capsular polysaccharide biosynthesis prote (CAPJ); rossmann fold, glycosyltransferase, transferase; 1.50A {Helicobacter pylori} Back     alignment and structure
>1rrv_A Glycosyltransferase GTFD; GT-B, glycosyltransferase, rossmann fold, glycopeptide, VACO antibiotic, transferase-antibiotic complex; HET: OMZ GHP OMY 3FG TYD BGC; 2.00A {Amycolatopsis orientalis} SCOP: c.87.1.5 Back     alignment and structure
>2iw1_A Lipopolysaccharide core biosynthesis protein RFAG; transferase, lipopolysaccharide biosynthesis, family GT-4, glycosyltransferase, LPS; HET: U2F; 1.5A {Escherichia coli} SCOP: c.87.1.8 PDB: 2iv7_A* Back     alignment and structure
>2vsy_A XCC0866; transferase, glycosyl transferase, GT-B, OGT, protein O-GLCN; HET: NHE; 2.10A {Xanthomonas campestris PV} PDB: 2jlb_A* 2xgm_A* 2xgo_A* 2xgs_A* 2vsn_A* Back     alignment and structure
>2x6q_A Trehalose-synthase TRET; biosynthetic protein; 2.20A {Pyrococcus horikoshii} PDB: 2x6r_A 2xa1_A 2xa2_A* 2xa9_A* 2xmp_A* Back     alignment and structure
>1iir_A Glycosyltransferase GTFB; rossmann fold; 1.80A {Amycolatopsis orientalis} SCOP: c.87.1.5 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 467
d1pswa_348 c.87.1.7 (A:) ADP-heptose LPS heptosyltransferase 7e-07
>d1pswa_ c.87.1.7 (A:) ADP-heptose LPS heptosyltransferase II {Escherichia coli [TaxId: 562]} Length = 348 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: UDP-Glycosyltransferase/glycogen phosphorylase
superfamily: UDP-Glycosyltransferase/glycogen phosphorylase
family: ADP-heptose LPS heptosyltransferase II
domain: ADP-heptose LPS heptosyltransferase II
species: Escherichia coli [TaxId: 562]
 Score = 48.7 bits (114), Expect = 7e-07
 Identities = 44/335 (13%), Positives = 87/335 (25%), Gaps = 22/335 (6%)

Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILG 196
           ++     + L+ RYP  +IDV+A A  +        V  A    L     E  E   +  
Sbjct: 15  MMSQSLYRTLQARYPQAIIDVMAPAWCRPLLSRMPEVNEAIPMPLGHGALEIGERRKLGH 74

Query: 197 VMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNL 256
            ++ + YD             +F                        LL++    +    
Sbjct: 75  SLREKRYDRAYVL------PNSFKSALVPLFAGIPHRTGWRGEMRYGLLNDVRVLDKEAW 128

Query: 257 SERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIH 316
                       D            P+   ++ +S   K     ++  +           
Sbjct: 129 PLMVERYIALAYDKGIMRTAQDLPQPLLWPQLQVSEGEKSYTCNQFSLSSERPMIGFCPG 188

Query: 317 GIESDSKA-SMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDAS 375
                +K        +    L  + +  +  G  +                +    + A 
Sbjct: 189 AEFGPAKRWPHYHYAELAKQLIDEGYQVVLFGSAKDHEAGNEILAALNTEQQAWCRNLAG 248

Query: 376 IVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSEL---------KGRLF 426
                   Q   LI     ++  ++  + +A A  +P +AL+             K R+ 
Sbjct: 249 ETQ---LDQAVILIAACKAIVTNDSGLMHVAAALNRPLVALYGPSSPDFTPPLSHKARVI 305

Query: 427 VPNAEEKKCTVISSRTGK---LIDTPVEAVLNAMQ 458
                  K     +  G    LID   + VL  + 
Sbjct: 306 RLITGYHKVRKGDAAEGYHQSLIDITPQRVLEELN 340


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query467
d1pswa_348 ADP-heptose LPS heptosyltransferase II {Escherichi 100.0
d1o6ca_377 UDP-N-acetylglucosamine 2-epimerase {Bacillus subt 97.16
d1v4va_373 UDP-N-acetylglucosamine 2-epimerase {Thermus therm 97.0
d1f6da_376 UDP-N-acetylglucosamine 2-epimerase {Escherichia c 96.91
d1f0ka_351 Peptidoglycan biosynthesis glycosyltransferase Mur 96.61
d2bfwa1196 Glycogen synthase 1, GlgA {Pyrococcus abyssi [TaxI 93.44
d2f62a1152 Nucleoside 2-deoxyribosyltransferase {Trypanosoma 92.61
d2iw1a1370 Lipopolysaccharide core biosynthesis protein RfaG 86.93
d2f9fa1166 First mannosyl transferase WbaZ {Archaeoglobus ful 80.57
>d1pswa_ c.87.1.7 (A:) ADP-heptose LPS heptosyltransferase II {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: UDP-Glycosyltransferase/glycogen phosphorylase
superfamily: UDP-Glycosyltransferase/glycogen phosphorylase
family: ADP-heptose LPS heptosyltransferase II
domain: ADP-heptose LPS heptosyltransferase II
species: Escherichia coli [TaxId: 562]
Probab=100.00  E-value=1.6e-46  Score=375.73  Aligned_cols=316  Identities=17%  Similarity=0.155  Sum_probs=234.8

Q ss_pred             cEEEEEecCCchhHHhHHHHHHHHHHHCCCcEEEEEEcCCchhhhhcCCCCCEEEEecCCCCCCChHHHHHHHHHhHhCC
Q 012283          123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRY  202 (467)
Q Consensus       123 ~rILII~~~~IGD~Il~tP~l~aLk~~yP~a~I~ll~~~~~~~l~~~~p~Id~ii~~~~~~~~~~~~~~~~l~~~Lr~~~  202 (467)
                      ||||||++++|||+|+++|++++||+.||+++|+|+|++.++++++.+|+||+|+.++.+.....+..+.++++.+++++
T Consensus         1 MkILii~~~~iGD~il~~p~i~~Lk~~~P~~~I~~l~~~~~~~l~~~~p~id~v~~~~~~~~~~~~~~~~~l~~~l~~~~   80 (348)
T d1pswa_           1 MKILVIGPSWVGDMMMSQSLYRTLQARYPQAIIDVMAPAWCRPLLSRMPEVNEAIPMPLGHGALEIGERRKLGHSLREKR   80 (348)
T ss_dssp             CEEEEECCSSHHHHHHHHHHHHHHHHHSTTCEEEEEECGGGHHHHTTCTTEEEEEEC-------CHHHHHHHHHHTTTTT
T ss_pred             CeEEEEcCCChHHHHHHHHHHHHHHHHCCCCEEEEEEChhHHHHHhhCCCcCEEEEecCccccchhhhhhhHHHHhhhcc
Confidence            89999999999999999999999999999999999999999999999999999999987664467788899999999999


Q ss_pred             CcEEEEcccCCchHHHHHHHhCCCeeEeccCCCCCccccccccceeecCCccccccchhhHHHHHHH---HcCCCCC--C
Q 012283          203 YDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVD---WLGRPFR--S  277 (467)
Q Consensus       203 yDlvI~l~~~~~~~~ll~~l~gak~riG~~~~~~~~~~~~~~~t~~i~~~~~~~~~~~~h~~~~lL~---~Lgi~~~--~  277 (467)
                      ||++|+++. ++++++++++++++.++|+....+..     +++........    ...|...+.+.   .++....  .
T Consensus        81 ~D~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~  150 (348)
T d1pswa_          81 YDRAYVLPN-SFKSALVPLFAGIPHRTGWRGEMRYG-----LLNDVRVLDKE----AWPLMVERYIALAYDKGIMRTAQD  150 (348)
T ss_dssp             CSEEEECSC-CSGGGHHHHHTTCSEEEEECTTTCTT-----TCTEEECCCTT----TCCSHHHHHHHTTSCGGGCSSGGG
T ss_pred             cceEeeccc-ccchhhHHHhhccccccccccccccc-----ccccccccccc----cchhHHHHHHHHHHhhhccccccc
Confidence            999999996 68999999999999999997543322     33332222211    12344443332   2222110  0


Q ss_pred             CCCCCCCCceeecCHHHHHHHHHHHHHcCCC-CCcEEEEecCCCCccccccCCCCCCCCCCHHHHHHHHHHhhhCC-CEE
Q 012283          278 VPRHPVPPLRVSISRRLKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR-PLF  355 (467)
Q Consensus       278 v~~~~~p~~~l~l~~~~~~~a~~~l~~~~l~-~~~~I~i~pgas~s~~~~~~r~~~K~rWP~e~~~~Li~~L~~~~-~Vv  355 (467)
                      .+. ........+++++.......   .++. .+++|++++|++.        +..| +||.|+|++|+++|.+++ .++
T Consensus       151 ~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~~~~~--------~~~k-~wp~~~~~~L~~~l~~~~~~iv  217 (348)
T d1pswa_         151 LPQ-PLLWPQLQVSEGEKSYTCNQ---FSLSSERPMIGFCPGAEF--------GPAK-RWPHYHYAELAKQLIDEGYQVV  217 (348)
T ss_dssp             SCS-SCCCCCCCCCHHHHHHHHHH---TTCCSSSCEEEEECCCTT--------CGGG-SCCHHHHHHHHHHHHHTTCEEE
T ss_pred             ccc-ccccccccCCHHHHHHHHHH---hccccCCCeEEeccccch--------hhcc-ccchHHHhhhHHHHhhcCCccc
Confidence            000 00111344566555544333   3332 3678999986543        2335 699999999999999887 488


Q ss_pred             EecCcccHHHHHHHHhcCCC-----C---cccCCHHHHHHHHHhcCEEEeCCchHHHHHHhcCCCEEEEeCCCCCCCccc
Q 012283          356 VIPHEKEREGVEDVVGDDAS-----I---VFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFV  427 (467)
Q Consensus       356 l~g~~~e~~~~~~i~~~~~~-----~---~~~~sL~el~alI~~a~lvIg~DTG~~HLAaAlg~PtVaLFg~t~p~~~~~  427 (467)
                      ++|+++|++..+++.+.+..     .   ..++||.|++++|++|++|||+|||++|||+|+|+|+|+|||++++. .|+
T Consensus       218 l~g~~~e~~~~~~~~~~~~~~~~~~~~~l~g~~sl~el~~li~~a~l~I~~Dtg~~HlAaa~g~p~i~lfg~~~~~-~~~  296 (348)
T d1pswa_         218 LFGSAKDHEAGNEILAALNTEQQAWCRNLAGETQLDQAVILIAACKAIVTNDSGLMHVAAALNRPLVALYGPSSPD-FTP  296 (348)
T ss_dssp             ECCCGGGHHHHHHHHTTSCHHHHTTEEECTTTSCHHHHHHHHHTSSEEEEESSHHHHHHHHTTCCEEEEESSSCTT-SSC
T ss_pred             cccccchHHHHHHHHHhhhcccccccccccCCccHHHHHHHHhcceeEeecCccHHHHHHHcCCCEEEEECCCCHh-hhC
Confidence            89999999999988775532     1   12689999999999999999999999999999999999999999765 578


Q ss_pred             cCCCCCceEeec--------------CCCCCCCCCCHHHHHHHHHHHHHh
Q 012283          428 PNAEEKKCTVIS--------------SRTGKLIDTPVEAVLNAMQIFNES  463 (467)
Q Consensus       428 P~~~~~~c~i~~--------------~~~~cm~~Is~e~V~~ai~~ll~~  463 (467)
                      |+ ++...++..              ....||++|+||+|+++++++|.+
T Consensus       297 P~-~~~~~~l~~~~~c~~c~~~~~~~~~~~c~~~I~~~~V~~~~~~lL~~  345 (348)
T d1pswa_         297 PL-SHKARVIRLITGYHKVRKGDAAEGYHQSLIDITPQRVLEELNALLLQ  345 (348)
T ss_dssp             CC-CTTEEEEESSCC----------CCCCHHHHTSCHHHHHHHHHHHHHH
T ss_pred             CC-CCCeEEEecCCCCCcccCCCCCCCchhhHhCCCHHHHHHHHHHHhcc
Confidence            98 544422211              123489999999999999999875



>d1o6ca_ c.87.1.3 (A:) UDP-N-acetylglucosamine 2-epimerase {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure
>d1v4va_ c.87.1.3 (A:) UDP-N-acetylglucosamine 2-epimerase {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
>d1f6da_ c.87.1.3 (A:) UDP-N-acetylglucosamine 2-epimerase {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1f0ka_ c.87.1.2 (A:) Peptidoglycan biosynthesis glycosyltransferase MurG {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d2bfwa1 c.87.1.8 (A:218-413) Glycogen synthase 1, GlgA {Pyrococcus abyssi [TaxId: 29292]} Back     information, alignment and structure
>d2f62a1 c.23.14.1 (A:9-160) Nucleoside 2-deoxyribosyltransferase {Trypanosoma brucei [TaxId: 5691]} Back     information, alignment and structure
>d2iw1a1 c.87.1.8 (A:2-371) Lipopolysaccharide core biosynthesis protein RfaG {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d2f9fa1 c.87.1.8 (A:2-167) First mannosyl transferase WbaZ {Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure