BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>008551
MPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEF
RADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKEKHDLKYVYVWHAI
TGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYD
ELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRNNDIICCM
SHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDWDMFHSLHP
MAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPAR
DGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRV
AGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLV
KMFNSGGAIKELRYESEGTATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGL
VTLTLRVPKEELYLWNISFEL

High Scoring Gene Products

Symbol, full name Information P value
SIP1
AT1G55740
protein from Arabidopsis thaliana 1.4e-245
SIP2
AT3G57520
protein from Arabidopsis thaliana 3.8e-205
SIP1
AT5G40390
protein from Arabidopsis thaliana 2.5e-113
RFS
Galactinol--sucrose galactosyltransferase
protein from Oryza sativa Japonica Group 2.4e-108
STS1
Stachyose synthase
protein from Pisum sativum 3.6e-101
STS
AT4G01970
protein from Arabidopsis thaliana 1.8e-93
MGG_11554
Seed imbibition protein
protein from Magnaporthe oryzae 70-15 4.1e-35
galS
Alpha-galactosidase
protein from Sulfolobus solfataricus P2 1.6e-23
BT_3797
Possible alpha-galactosidase
protein from Bacteroides thetaiotaomicron VPI-5482 5.2e-20

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  008551
        (561 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  2366  1.4e-245  1
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  1832  3.8e-205  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  1118  2.5e-113  1
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  1071  2.4e-108  1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   821  3.6e-101  2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   756  1.8e-93   2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   384  1.3e-39   2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   337  4.1e-35   3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   234  1.6e-23   2
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  5.2e-20   3


>TAIR|locus:2020452 [details] [associations]
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 2366 (837.9 bits), Expect = 1.4e-245, P = 1.4e-245
 Identities = 427/559 (76%), Positives = 486/559 (86%)

Query:     1 MPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEF 60
             MPDMLNWFGWCTWDAFYT+VT + VKQGLES + GG+ PKF+IIDDGWQSVGMD +  EF
Sbjct:   193 MPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGWQSVGMDETSVEF 252

Query:    61 RADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKEKHDLKYVYVWHAI 120
              ADN ANFANRLTHIKENHKFQK+GKEG R +DP+L L H++T+IK  + LKYVYVWHAI
Sbjct:   253 NADNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIKSNNSLKYVYVWHAI 312

Query:   121 TGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYD 180
             TGYWGGV+PGV+GMEHYESK+ YPVSSPGV S+E C   +SI KNGLGLVNPEKVF FY+
Sbjct:   313 TGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNGLGLVNPEKVFSFYN 372

Query:   181 ELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRNNDIICCM 240
             +LHSYLAS G+DGVKVDVQNILETLGAGHGGRVKL++KYHQALEASI+RNF +N II CM
Sbjct:   373 DLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEASISRNFPDNGIISCM 432

Query:   241 SHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDWDMFHSLHP 300
             SHNTDGLYSAK++AVIRASDDFWPRDPASHTIHIASVAYNT+FLGEFMQPDWDMFHSLHP
Sbjct:   433 SHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGEFMQPDWDMFHSLHP 492

Query:   301 MAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPAR 360
             MAEYH AARAVGGCAIYVSDKPGQHDFNLLRKLVL DGSILRAKLPGRPT DC FSDP R
Sbjct:   493 MAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLPGRPTSDCFFSDPVR 552

Query:   361 DGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRV 420
             D KSLLKIWNLN+FTGV+GVFNCQGAGWC+  K+ LIHD++PGT +G +R  DV YL +V
Sbjct:   553 DNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTISGCVRTNDVHYLHKV 612

Query:   421 AGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLV 480
             A  EWTGD+I YSHL GE+ YLPK+ +LP+TL  REYEV+TVVPVKE S G++FAP+GL+
Sbjct:   613 AAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVKEFSDGSKFAPVGLM 672

Query:   481 KMFNSGGAIKELRYESEGTA-TVDMKVRGCGEFGAYSSAR-PRRIAVDSEEVQFGYEEES 538
             +MFNSGGAI  LRY+ EGT   V MK+RG G  G YSS R PR + VDS++V++ YE ES
Sbjct:   673 EMFNSGGAIVSLRYDDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPES 732

Query:   539 GLVTLTLRVPKEELYLWNI 557
             GLVT TL VP++ELYLW++
Sbjct:   733 GLVTFTLGVPEKELYLWDV 751


>TAIR|locus:2103488 [details] [associations]
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 1832 (650.0 bits), Expect = 3.8e-205, Sum P(2) = 3.8e-205
 Identities = 330/492 (67%), Positives = 394/492 (80%)

Query:     1 MPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEF 60
             +P  L+WFGWCTWDAFYTDVT EGV +GL+S  +GG PPKF+IIDDGWQ +         
Sbjct:   193 LPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGWQQIENKEKDENC 252

Query:    61 RADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKEKHDLKYVYVWHAI 120
                  A FA RL  IKEN KFQK+     +++    GL+ +V   K++H++K VY WHA+
Sbjct:   253 VVQEGAQFATRLVGIKENAKFQKS----DQKDTQVSGLKSVVDNAKQRHNVKQVYAWHAL 308

Query:   121 TGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYD 180
              GYWGGV+P  +GMEHY+S + YPV SPGV  N+P    DS+A +GLGLVNP+KVF+FY+
Sbjct:   309 AGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVNPKKVFNFYN 368

Query:   181 ELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRNNDIICCM 240
             ELHSYLAS GIDGVKVDVQNI+ETLGAG GGRV L+R Y QALEASIARNF +N  I CM
Sbjct:   369 ELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNFTDNGCISCM 428

Query:   241 SHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDWDMFHSLHP 300
              HNTDGLYSAK++A++RASDDF+PRDPASHTIHIASVAYN++FLGEFMQPDWDMFHSLHP
Sbjct:   429 CHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPDWDMFHSLHP 488

Query:   301 MAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPAR 360
              AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVLPDGS+LRAKLPGRPTRDCLF+DPAR
Sbjct:   489 TAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTRDCLFADPAR 548

Query:   361 DGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRV 420
             DG SLLKIWN+N FTG+VGVFNCQGAGWC+  KKN IHD  PGT TG IRA D D + +V
Sbjct:   549 DGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRADDADLISQV 608

Query:   421 AGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLV 480
             AG++W+GD+I Y++  GEV  LPK A++P+TLK  EYE++ + P+KE++    FAPIGLV
Sbjct:   609 AGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITENISFAPIGLV 668

Query:   481 KMFNSGGAIKEL 492
              MFNS GAI+ +
Sbjct:   669 DMFNSSGAIESI 680

 Score = 175 (66.7 bits), Expect = 3.8e-205, Sum P(2) = 3.8e-205
 Identities = 33/59 (55%), Positives = 42/59 (71%)

Query:   499 TATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 557
             TA V + VRGCG FGAYSS RP + AV+S E  F Y+ E GLVTL L V +EE++ W++
Sbjct:   711 TALVSVSVRGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHV 769


>TAIR|locus:2170528 [details] [associations]
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 1118 (398.6 bits), Expect = 2.5e-113, P = 2.5e-113
 Identities = 234/560 (41%), Positives = 333/560 (59%)

Query:     2 PDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFR 61
             P +++ FGWCTWDAFY  V  +GV +G++    GG PP  ++IDDGWQS+G D  G +  
Sbjct:   218 PGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDDGWQSIGHDSDGIDVE 277

Query:    62 ADNTA----NFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKEKHD-LKYVYV 116
               N          RL   +ENHKF K+    + + D  +G++  V ++K++   + Y+YV
Sbjct:   278 GMNITVAGEQMPCRLLKFEENHKF-KDYVSPKDQND--VGMKAFVRDLKDEFSTVDYIYV 334

Query:   117 WHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPEKVF 176
             WHA+ GYWGG+RP    +    S +  P  SPG++      A D I + G+G  +P+   
Sbjct:   335 WHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIETGIGFASPDLAK 392

Query:   177 HFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRNNDI 236
              FY+ LHS+L +AGIDGVKVDV +ILE L   +GGRV L++ Y +AL +S+ ++F  N +
Sbjct:   393 EFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALTSSVNKHFNGNGV 452

Query:   237 ICCMSHNTDGLYSAKRSAVI-RASDDFWPRDPASHT--------IHIASVAYNTIFLGEF 287
             I  M H  D ++    +  + R  DDFW  DP+            H+   AYN++++G F
Sbjct:   453 IASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMVHCAYNSLWMGNF 512

Query:   288 MQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPG 347
             +QPDWDMF S HP AE+H A+RA+ G  IY+SD  G+HDF+LL++LVLP+GSILR +   
Sbjct:   513 IQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVLPNGSILRCEYYA 572

Query:   348 RPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPGTTTG 407
              PTRD LF DP  DGK++LKIWNLN +TGV+G FNCQG GWCR  ++N    E   T T 
Sbjct:   573 LPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRNQCFSECVNTLTA 632

Query:   408 FIRAKDVDY----LP-RVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTV 462
                 KDV++     P  +A  E     ++ S    ++     N  L +TL+  ++E+ TV
Sbjct:   633 TTSPKDVEWNSGSSPISIANVEEFALFLSQSK---KLLLSGLNDDLELTLEPFKFELITV 689

Query:   463 VPVKELSSGT-RFAPIGLVKMFNSGGAIKELRYESEGTATVDMKVRGCGEFGAYSSARPR 521
              PV  +   + RFAPIGLV M N+ GAI+ L Y  E   +V++ V G GEF  Y+S +P 
Sbjct:   690 SPVVTIEGNSVRFAPIGLVNMLNTSGAIRSLVYNDE---SVEVGVFGAGEFRVYASKKPV 746

Query:   522 RIAVDSEEVQFGYEEESGLV 541
                +D E V+FGYE+   +V
Sbjct:   747 SCLIDGEVVEFGYEDSMVMV 766


>UNIPROTKB|Q5VQG4 [details] [associations]
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 1071 (382.1 bits), Expect = 2.4e-108, P = 2.4e-108
 Identities = 234/563 (41%), Positives = 327/563 (58%)

Query:     2 PDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDP----SG 57
             P +++ FGWCTWDAFY  V  EGV +G+     GG PP  ++IDDGWQS+  D     SG
Sbjct:   221 PPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLIDDGWQSICHDDDDLGSG 280

Query:    58 FEFRADNTAN--FANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKEKHD-LKYV 114
              E     +A      RL   +EN+KF++   +G        G+   V E+K     ++ V
Sbjct:   281 AEGMNRTSAGEQMPCRLIKFQENYKFREY--KG--------GMGGFVREMKAAFPTVEQV 330

Query:   115 YVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPEK 174
             YVWHA+ GYWGG+RPG  G+    +K+  P  SPG+Q      A D I  NG+GLV+P +
Sbjct:   331 YVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDKIVNNGVGLVDPRR 388

Query:   175 VFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRNN 234
                 Y+ LHS+L ++GIDGVKVDV ++LE +   +GGRV+L++ Y   L  S+ R+F  N
Sbjct:   389 ARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFAGLTESVRRHFNGN 448

Query:   235 DIICCMSHNTDG-LYSAKRSAVIRASDDFWPRDPASHT--------IHIASVAYNTIFLG 285
              +I  M H  D  L   +  A+ R  DDFW  DP+            H+   AYN++++G
Sbjct:   449 GVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGCHMVHCAYNSLWMG 508

Query:   286 EFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKL 345
              F+ PDWDMF S HP A +H A+RAV G  +YVSD  G HDF+LLR+L LPDG+ILR + 
Sbjct:   509 AFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRRLALPDGTILRCER 568

Query:   346 PGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPGTT 405
                PTRDCLF+DP  DGK++LKIWN+N F+GV+G FNCQG GW R  ++N+         
Sbjct:   569 YALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREARRNMCAAGFSVPV 628

Query:   406 TGFIRAKDVDYLPRVAGDEWTGDAIA-YSHLGGEVAYLPKNATLPITLKSREYEVYTVVP 464
             T      DV++     G    GD  A Y     ++  L ++ ++ +TL+   YE+  V P
Sbjct:   629 TARASPADVEWSHGGGG----GDRFAVYFVEARKLQLLRRDESVELTLEPFTYELLVVAP 684

Query:   465 VKELSS---GTRFAPIGLVKMFNSGGAIKELRY-ESEGTATVDMKVRGCGEFGAYSSARP 520
             V+ + S   G  FAPIGL  M N+GGA++       +G    ++ V+G GE  AYSSARP
Sbjct:   685 VRAIVSPELGIGFAPIGLANMLNAGGAVQGFEAARKDGDVAAEVAVKGAGEMVAYSSARP 744

Query:   521 RRIAVDSEEVQFGYEEESGLVTL 543
             R   V+ ++ +F YE+  G+VT+
Sbjct:   745 RLCKVNGQDAEFKYED--GIVTV 765


>UNIPROTKB|Q93XK2 [details] [associations]
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 821 (294.1 bits), Expect = 3.6e-101, Sum P(2) = 3.6e-101
 Identities = 180/460 (39%), Positives = 263/460 (57%)

Query:    88 GQREEDPA-LGLRHIVTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPV 145
             G++ E  +  GL+    +++ K   L  VYVWHA+ G WGGVRP  T   H ++K+    
Sbjct:   373 GEKSEMKSEYGLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HLDTKIVPCK 429

Query:   146 SSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETL 205
              SPG+       A   I+K  LGLV+P +    YD +HSYLA +GI GVKVDV + LE +
Sbjct:   430 LSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYV 489

Query:   206 GAGHGGRVKLSRKYHQALEASIARNFRNNDIICCMSHNTDGLY-SAKRSAVIRASDDFWP 264
                +GGRV L++ Y++ L  SI +NF  N +I  M H  D  +   K+ ++ R  DDFW 
Sbjct:   490 CDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFWF 549

Query:   265 RDPASHT--------IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAI 316
             +DP            +H+   +YN++++G+ +QPDWDMF S H  A++H  +RA+ G  I
Sbjct:   550 QDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGPI 609

Query:   317 YVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTG 376
             YVSD  G HDF+L++KLV PDG+I +      PTRDCLF +P  D  ++LKIWN N + G
Sbjct:   610 YVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYGG 669

Query:   377 VVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDA---IAYS 433
             V+G FNCQGAGW  + +K     E      G +   +V++  +       G A   + Y 
Sbjct:   670 VIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSH-LGKAEEYVVYL 728

Query:   434 HLGGEVAYLP-KNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKEL 492
             +   E++ +  K+  +  T++   +E+Y+ VPV +L  G +FAPIGL  MFNSGG + +L
Sbjct:   729 NQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNSGGTVIDL 788

Query:   493 RYESEGTATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQF 532
              Y   G     +KV+G G F AYSS  P++  ++  EV F
Sbjct:   789 EYVGNGAK---IKVKGGGSFLAYSSESPKKFQLNGCEVDF 825

 Score = 202 (76.2 bits), Expect = 3.6e-101, Sum P(2) = 3.6e-101
 Identities = 39/89 (43%), Positives = 55/89 (61%)

Query:     1 MPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEF 60
             +P++++ FGWCTWDAFY  V   G+  GL+ F KGG+ P+F+IIDDGWQS+  D  G++ 
Sbjct:   214 IPNLVDKFGWCTWDAFYLTVNPIGIFHGLDDFSKGGVEPRFVIIDDGWQSISFD--GYDP 271

Query:    61 RAD--NTA----NFANRLTHIKENHKFQK 83
               D  N        + RL    E +KF+K
Sbjct:   272 NEDAKNLVLGGEQMSGRLHRFDECYKFRK 300


>TAIR|locus:2141425 [details] [associations]
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 756 (271.2 bits), Expect = 1.8e-93, Sum P(2) = 1.8e-93
 Identities = 167/463 (36%), Positives = 261/463 (56%)

Query:   114 VYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPE 173
             +YVWHA+ G W GVRP    M   ++K+     SP + +     A D + + G+GLV+P 
Sbjct:   416 IYVWHALCGAWNGVRPET--MMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473

Query:   174 KVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRN 233
             K   FYD +HSYLAS G+ G K+DV   LE+L   HGGRV+L++ Y+  L  S+ +NF  
Sbjct:   474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533

Query:   234 NDIICCMSHNTDGLYSA-KRSAVIRASDDFWPRDPASHT--------IHIASVAYNTIFL 284
              D+I  M    +  + A K+ ++ R  DDFW +DP            +H+   +YN+I++
Sbjct:   534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593

Query:   285 GEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQ--HDFNLLRKLVLPDGSILR 342
             G+ +QPDWDMF S H  AEYH A+RA+ G  +Y+SD  G+  H+F+L++KL   DG+I R
Sbjct:   594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653

Query:   343 AKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQP 402
                   PTRD LF +P  D +S+LKI+N N F GV+G FNCQGAGW     +   + E  
Sbjct:   654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713

Query:   403 GTTTGFIRAKDV--DYLPRVAGDE--WTGDAIAYSHLGGEVAYL-PKNATLPITLKSREY 457
              T +G +   D+  D  P  AG +  +TGD + Y     E+ ++  K+  + ITL+   +
Sbjct:   714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773

Query:   458 EVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELR-YESEGTATVDMKVRGCGEFGAYS 516
             ++ + VPV EL S         + + N    +  ++  +  G  ++ + V+G G F AYS
Sbjct:   774 DLLSFVPVTELVSSG--VRFAPLGLINMFNCVGTVQDMKVTGDNSIRVDVKGEGRFMAYS 831

Query:   517 SARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISF 559
             S+ P +  ++ +E +F +EEE+G ++  +   +E   + ++SF
Sbjct:   832 SSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEESGGISHLSF 874

 Score = 194 (73.4 bits), Expect = 1.8e-93, Sum P(2) = 1.8e-93
 Identities = 40/92 (43%), Positives = 53/92 (57%)

Query:     1 MPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEF 60
             +P +++ FGWCTWDA Y  V    +  G++ FE GG+ PKF+IIDDGWQS+  D    + 
Sbjct:   228 LPKIVDKFGWCTWDACYLTVDPATIWTGVKEFEDGGVCPKFVIIDDGWQSINFDGDELDK 287

Query:    61 RADNTA----NFANRLTHIKENHKFQKNGKEG 88
              A+N          RLT  KE  KF +N K G
Sbjct:   288 DAENLVLGGEQMTARLTSFKECKKF-RNYKGG 318


>ASPGD|ASPL0000010056 [details] [associations]
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 384 (140.2 bits), Expect = 1.3e-39, Sum P(2) = 1.3e-39
 Identities = 131/450 (29%), Positives = 218/450 (48%)

Query:    97 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEP 155
             GL   VT I+E+H +++Y+ VWHA+ GYWGG+ P  +    Y+++               
Sbjct:   384 GLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTREV------------- 430

Query:   156 CDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKL 215
               A +S  +  +  ++P  +  FY++ +++L+ +GI GVK D Q+ L+ L A    R   
Sbjct:   431 --ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRSY 487

Query:   216 SRKYHQALEASIARNFRNNDIICCMSHNTDGLYSA-----KRSAVIRASDDFWPRDPASH 270
             +  Y  A   S  R+F     I CMS     ++ +     K + V+R S+DF+P    SH
Sbjct:   488 ANAYQDAWTISSLRHF-GPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546

Query:   271 TIHIASVAYNTIFLGEFMQ--PDWDMFHSLHP----MAEYHGAARAVGGCAIYVSDKPGQ 324
             T H+   A+N + L  ++   PDWDMF +L       A +H AAR + G  IY++DKPGQ
Sbjct:   547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605

Query:   325 HDFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSL-LKIWN--LNDFTGV 377
             HD  L++++      G+   LR  +  R T D ++ D  ++G  L +  ++      +G+
Sbjct:   606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662

Query:   378 VGVFNCQG---AGWCRVGKKNLIHDEQPGTTTGFI-RAKDVDYLPRVAGDEWTGDAIAYS 433
             +GVFN      +    V     I+D+Q    TG+I RA       R+ G+  +  A++ +
Sbjct:   663 IGVFNVSNRVESVIIPVADFPGIYDDQE--ETGYIVRAHRTG---RIVGELHSSSAVSVT 717

Query:   434 --HLGGEV--AYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAI 489
                   EV  AY  K  T  +  K +E E  + +P  ++S     A +GL++      A+
Sbjct:   718 LNERRWEVLTAYPVKTLTFKMNSKDKENE--SSMPTADVSVDV--AILGLLRKMTGVAAL 773

Query:   490 --KELRYESEGTATVDMKVRGCGEFGAYSS 517
                ++  E  G   VD+ ++  G  G Y S
Sbjct:   774 VSSDIYIEDTGRLRVDVGIKALGVLGIYFS 803

 Score = 104 (41.7 bits), Expect = 1.3e-39, Sum P(2) = 1.3e-39
 Identities = 25/86 (29%), Positives = 44/86 (51%)

Query:     7 WF---GWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGF----- 58
             W+    +CTW+    D++ E +   L+  +  GI  + +IIDD WQS+  + +G      
Sbjct:   312 WYDGLAYCTWNGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNEGAGSWHRAL 371

Query:    59 -EFRADNTA---NFANRLTHIKENHK 80
              +F A++ A     A  +T I+E H+
Sbjct:   372 TQFEANSKAFPNGLAKAVTTIREQHR 397


>UNIPROTKB|G4NBB7 [details] [associations]
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 337 (123.7 bits), Expect = 4.1e-35, Sum P(3) = 4.1e-35
 Identities = 101/326 (30%), Positives = 159/326 (48%)

Query:    97 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRP-GVTGMEHYESKMQYPVSSPGVQSNE 154
             GL+ +V+EI++++  ++ + VWH I GYWGG+ P G    ++   K+Q    +  VQ   
Sbjct:   404 GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKIQLRDEAE-VQ--- 459

Query:   155 PCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVK 214
             P D FD    +G      E V   YD+ +++LA  G+   KVD Q  L+   A    R  
Sbjct:   460 PKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFLD-YPAHANDRKN 511

Query:   215 LSRKYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSA-------VIRASDDFWPRDP 267
             L R Y  A  A+ +++F    I C        L+S  +         + R SDDF+P + 
Sbjct:   512 LIRPYQDAWTAAASKHFGGRAIACMAQTPQSILHSLLQQGRSEGPMLMARNSDDFFPDEV 571

Query:   268 ASHTIHIASVAYNTIFLGEF-MQPDWDMFHSLHPM-AEYHGAARAVGGCAIYVSDKPGQH 325
              SHT H+   A+N + +    +  DWDMF +  P  A  H  AR++ G  IY++D PG+H
Sbjct:   572 GSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMSGGPIYITDAPGEH 631

Query:   326 DFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVF 381
             D  L++++     DG    LRA  PGR     L+       + LL++ + +   G++GVF
Sbjct:   632 DVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRVRSGHQGVGMLGVF 687

Query:   382 NCQGAGWCRVGKKNLIHDEQPGTTTG 407
             N    G   +G++  + D   G   G
Sbjct:   688 NVCNRG-SLLGEQVRLDDIFDGEKAG 712

 Score = 88 (36.0 bits), Expect = 4.1e-35, Sum P(3) = 4.1e-35
 Identities = 18/54 (33%), Positives = 29/54 (53%)

Query:     3 DMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPS 56
             D  + F +CTW++   D++ + +   L    + GI    +IIDD WQS+  D S
Sbjct:   331 DWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGS 384

 Score = 70 (29.7 bits), Expect = 4.1e-35, Sum P(3) = 4.1e-35
 Identities = 21/89 (23%), Positives = 41/89 (46%)

Query:   437 GE-VAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELRYE 495
             GE +A   +   + + L+   +E++T  P+ +L  G   A +GLV    +  A+  + Y 
Sbjct:   724 GEMIAPASRETVIEVGLEEGGFEIFTAYPITKLG-GLAVATLGLVGKMATAAAVSHVSYS 782

Query:   496 S--EGTATVDMKV----RGCGEFGAYSSA 518
                EG   V ++V    +  G  G ++ +
Sbjct:   783 KHHEGFIPVGVEVSVSLKALGTLGIFAQS 811


>UNIPROTKB|Q97U94 [details] [associations]
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 234 (87.4 bits), Expect = 1.6e-23, Sum P(2) = 1.6e-23
 Identities = 67/199 (33%), Positives = 97/199 (48%)

Query:   171 NPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARN 230
             N E    FY      +     D VKVD Q ++  +       +  SR    AL+ S+ + 
Sbjct:   342 NLEDAIGFYKAFDGNILR-DFDLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGK- 398

Query:   231 FRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQP 290
                 D+I CMS N +   +   S V+R S D+ P       +HI   AYN++     + P
Sbjct:   399 ----DVINCMSMNPENYCNYFYSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYP 454

Query:   291 DWDMFHSLHPMAEYHGAARAVGGCAIYVSDK-PGQHDFNLLRKLVLPDGSILRAKLPGRP 349
             D+DMF S  P A+ H  AR   G  IY++D+ P + +  LLR  VLP+G ++R   P   
Sbjct:   455 DYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALI 514

Query:   350 TRDCLFSDPARDGKSLLKI 368
             T D LF DP R+ + LLK+
Sbjct:   515 TEDLLFKDPLRE-RVLLKL 532

 Score = 115 (45.5 bits), Expect = 1.6e-23, Sum P(2) = 1.6e-23
 Identities = 27/80 (33%), Positives = 42/80 (52%)

Query:     2 PD-MLNWFGWCTWDAFYT-DVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFE 59
             PD ++N  GWC+W+AF T D+  E + + ++   + G+   ++IIDDGWQ    D +   
Sbjct:   217 PDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNNDRAIRS 276

Query:    60 FRADNTA---NFANRLTHIK 76
                DN      F N +  IK
Sbjct:   277 LNPDNKKFPNGFKNTVRAIK 296

 Score = 79 (32.9 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 32/105 (30%), Positives = 45/105 (42%)

Query:    97 GLRHIVTEIKEKHDLKYVYVWHAITGYWGGVRP------GVTGMEHYESKMQYPVSSPGV 150
             G ++ V  IK    +KYV +WHAI  +WGG+         V G  ++ + +   V SP +
Sbjct:   287 GFKNTVRAIKSL-GVKYVGLWHAINAHWGGMSQELMKSLNVNG--YFTNFLNSYVPSPNL 343

Query:   151 QSNEPC-DAFDSIAKNGLGLVNPEK--VFH-FYDELHSYLASAGI 191
             +       AFD        LV  +   V H  YD     LAS  I
Sbjct:   344 EDAIGFYKAFDGNILRDFDLVKVDNQWVIHAIYDSFPIGLASRNI 388


>UNIPROTKB|Q8A170 [details] [associations]
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 196 (74.1 bits), Expect = 5.2e-20, Sum P(3) = 5.2e-20
 Identities = 53/193 (27%), Positives = 91/193 (47%)

Query:   173 EKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFR 232
             EK+  +Y+     +   G D +K+D Q+    L  G    ++ ++  + ALE    R   
Sbjct:   348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHR--M 405

Query:   233 NNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDW 292
                ++ CM+ N   +     S+V RAS D+   D      H+     NT+ LG+ + PD 
Sbjct:   406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465

Query:   293 DMFHSLHPMA-EYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTR 351
             DMFHS   +       ++A+ G  +Y+SD P +   + +R L+   G I R   P  PT 
Sbjct:   466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525

Query:   352 DCLFSDPARDGKS 364
             + + ++P + GK+
Sbjct:   526 ESILTNPLQSGKA 538

 Score = 107 (42.7 bits), Expect = 5.2e-20, Sum P(3) = 5.2e-20
 Identities = 14/42 (33%), Positives = 28/42 (66%)

Query:     6 NWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 47
             ++ GWCTW+ ++ D+    +   +++ E  GIP ++++IDDG
Sbjct:   228 DYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG 269

 Score = 58 (25.5 bits), Expect = 5.2e-20, Sum P(3) = 5.2e-20
 Identities = 6/22 (27%), Positives = 17/22 (77%)

Query:   106 KEKHDLKYVYVWHAITGYWGGV 127
             K+   ++++ +W++++GYW G+
Sbjct:   299 KQADKIRWIGLWYSLSGYWMGI 320


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.138   0.431    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      561       561   0.00099  119 3  11 22  0.41    34
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  626 (67 KB)
  Total size of DFA:  357 KB (2177 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  45.23u 0.15s 45.38t   Elapsed:  00:00:02
  Total cpu time:  45.23u 0.15s 45.38t   Elapsed:  00:00:02
  Start:  Fri May 10 01:33:24 2013   End:  Fri May 10 01:33:26 2013

Back to top