BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>004371
MTVGAGISVSDGNLMVKGSCVLANVKENIVVTPAAGGALVDGAFIGVTSDQLGSRRVFPV
GKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSAL
YTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAV
KTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFII
IDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVT
EIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA
KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL
EASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIF
LGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRA
KLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPG
TTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVV
PVKELSSGTRFAPIGLVKMFNSGGAIKELRYESEGTATVDMKVRGCGEFGAYSSARPRRI
AVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISFEL

High Scoring Gene Products

Symbol, full name Information P value
SIP1
AT1G55740
protein from Arabidopsis thaliana 0.
SIP2
AT3G57520
protein from Arabidopsis thaliana 5.5e-270
SIP1
AT5G40390
protein from Arabidopsis thaliana 3.1e-147
RFS
Galactinol--sucrose galactosyltransferase
protein from Oryza sativa Japonica Group 6.5e-138
STS1
Stachyose synthase
protein from Pisum sativum 9.4e-122
STS
AT4G01970
protein from Arabidopsis thaliana 4.7e-116
galS
Alpha-galactosidase
protein from Sulfolobus solfataricus P2 1.7e-29
MGG_11554
Seed imbibition protein
protein from Magnaporthe oryzae 70-15 6.9e-29
BT_3797
Possible alpha-galactosidase
protein from Bacteroides thetaiotaomicron VPI-5482 4.4e-20

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  004371
        (758 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  3121  0.        1
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  2445  5.5e-270  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  1438  3.1e-147  1
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  1350  6.5e-138  1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   821  9.4e-122  2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   756  4.7e-116  2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   384  5.7e-40   2
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   234  1.7e-29   2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   337  6.9e-29   3
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  4.4e-20   3


>TAIR|locus:2020452 [details] [associations]
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 3121 (1103.7 bits), Expect = 0., P = 0.
 Identities = 576/756 (76%), Positives = 653/756 (86%)

Query:     1 MTVGAGISVSDGNLMVKGSCVLANVKENIVVTPAAGGALVDGAFIGVTSDQLGSRRVFPV 60
             MTVGAGISV+D +L+V G  VL  V EN++VTPA+G AL+DGAFIGVTSDQ GS RVF +
Sbjct:     1 MTVGAGISVTDSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHRVFSL 60

Query:    61 GKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSAL 120
             GKLE LRFMCVFRFK+WWMTQRMG  G+++P ETQFL+VEA +GS  D G   G +QS+ 
Sbjct:    61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGS--DLG---GRDQSSS 115

Query:   121 YTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAV 180
             Y VFLPILEGDFRAVLQGNE NELEICLESGDP VD+FEGSHLVFVAAGSDPFDVIT AV
Sbjct:   116 YVVFLPILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAV 175

Query:   181 KTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFII 240
             K VE+HL TFSHRERKKMPDMLNWFGWCTWDAFYT+VT + VKQGLES + GG+ PKF+I
Sbjct:   176 KAVEQHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVI 235

Query:   241 IDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVT 300
             IDDGWQSVGMD +  EF ADN ANFANRLTHIKENHKFQK+GKEG R +DP+L L H++T
Sbjct:   236 IDDGWQSVGMDETSVEFNADNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVIT 295

Query:   301 EIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA 360
             +IK  + LKYVYVWHAITGYWGGV+PGV+GMEHYESK+ YPVSSPGV S+E C   +SI 
Sbjct:   296 DIKSNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESIT 355

Query:   361 KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL 420
             KNGLGLVNPEKVF FY++LHSYLAS G+DGVKVDVQNILETLGAGHGGRVKL++KYHQAL
Sbjct:   356 KNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQAL 415

Query:   421 EASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIF 480
             EASI+RNF +N II CMSHNTDGLYSAK++AVIRASDDFWPRDPASHTIHIASVAYNT+F
Sbjct:   416 EASISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLF 475

Query:   481 LGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRA 540
             LGEFMQPDWDMFHSLHPMAEYH AARAVGGCAIYVSDKPGQHDFNLLRKLVL DGSILRA
Sbjct:   476 LGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRA 535

Query:   541 KLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPG 600
             KLPGRPT DC FSDP RD KSLLKIWNLN+FTGV+GVFNCQGAGWC+  K+ LIHD++PG
Sbjct:   536 KLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPG 595

Query:   601 TTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVV 660
             T +G +R  DV YL +VA  EWTGD+I YSHL GE+ YLPK+ +LP+TL  REYEV+TVV
Sbjct:   596 TISGCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVV 655

Query:   661 PVKELSSGTRFAPIGLVKMFNSGGAIKELRYESEGTA-TVDMKVRGCGEFGAYSSAR-PR 718
             PVKE S G++FAP+GL++MFNSGGAI  LRY+ EGT   V MK+RG G  G YSS R PR
Sbjct:   656 PVKEFSDGSKFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRMKLRGSGLVGVYSSVRRPR 715

Query:   719 RIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 754
              + VDS++V++ YE ESGLVT TL VP++ELYLW++
Sbjct:   716 SVTVDSDDVEYRYEPESGLVTFTLGVPEKELYLWDV 751


>TAIR|locus:2103488 [details] [associations]
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 2445 (865.7 bits), Expect = 5.5e-270, Sum P(2) = 5.5e-270
 Identities = 441/689 (64%), Positives = 543/689 (78%)

Query:     1 MTVGAGISVSDGNLMVKGSCVLANVKENIVVTPAAGGALVDGAFIGVTSDQLGSRRVFPV 60
             MT+ + ISV + NL+V+G  +L  + +NI++TP  G   V G+FIG T +Q  S  VFP+
Sbjct:     1 MTITSNISVQNDNLVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQSKSLHVFPI 60

Query:    61 GKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSAL 120
             G LEGLRFMC FRFK+WWMTQRMG+CG+D+P ETQF+++E++     DE    G++   +
Sbjct:    61 GVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESK-----DEVEGNGDDAPTV 115

Query:   121 YTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAV 180
             YTVFLP+LEG FRAVLQGNE+NE+EIC ESGD  V+  +G+HLV+V AG++PF+VI  +V
Sbjct:   116 YTVFLPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSV 175

Query:   181 KTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFII 240
             K VERH+ TF HRE+KK+P  L+WFGWCTWDAFYTDVT EGV +GL+S  +GG PPKF+I
Sbjct:   176 KAVERHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLI 235

Query:   241 IDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVT 300
             IDDGWQ +              A FA RL  IKEN KFQK+     +++    GL+ +V 
Sbjct:   236 IDDGWQQIENKEKDENCVVQEGAQFATRLVGIKENAKFQKS----DQKDTQVSGLKSVVD 291

Query:   301 EIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA 360
               K++H++K VY WHA+ GYWGGV+P  +GMEHY+S + YPV SPGV  N+P    DS+A
Sbjct:   292 NAKQRHNVKQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLA 351

Query:   361 KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL 420
              +GLGLVNP+KVF+FY+ELHSYLAS GIDGVKVDVQNI+ETLGAG GGRV L+R Y QAL
Sbjct:   352 VHGLGLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQAL 411

Query:   421 EASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIF 480
             EASIARNF +N  I CM HNTDGLYSAK++A++RASDDF+PRDPASHTIHIASVAYN++F
Sbjct:   412 EASIARNFTDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLF 471

Query:   481 LGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRA 540
             LGEFMQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVLPDGS+LRA
Sbjct:   472 LGEFMQPDWDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA 531

Query:   541 KLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPG 600
             KLPGRPTRDCLF+DPARDG SLLKIWN+N FTG+VGVFNCQGAGWC+  KKN IHD  PG
Sbjct:   532 KLPGRPTRDCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPG 591

Query:   601 TTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVV 660
             T TG IRA D D + +VAG++W+GD+I Y++  GEV  LPK A++P+TLK  EYE++ + 
Sbjct:   592 TLTGSIRADDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHIS 651

Query:   661 PVKELSSGTRFAPIGLVKMFNSGGAIKEL 689
             P+KE++    FAPIGLV MFNS GAI+ +
Sbjct:   652 PLKEITENISFAPIGLVDMFNSSGAIESI 680

 Score = 175 (66.7 bits), Expect = 5.5e-270, Sum P(2) = 5.5e-270
 Identities = 33/59 (55%), Positives = 42/59 (71%)

Query:   696 TATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 754
             TA V + VRGCG FGAYSS RP + AV+S E  F Y+ E GLVTL L V +EE++ W++
Sbjct:   711 TALVSVSVRGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHV 769


>TAIR|locus:2170528 [details] [associations]
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 1438 (511.3 bits), Expect = 3.1e-147, P = 3.1e-147
 Identities = 305/759 (40%), Positives = 444/759 (58%)

Query:     9 VSDGNLMVKGSCVLANVKENIVVTPAA-----GGALVD---GAFIGVTSD-QLGSRRVFP 59
             + D  L+  G  VL +V  N+ +T +       G  +D   G+FIG   D +  S  V  
Sbjct:    24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83

Query:    60 VGKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSA 119
             +GKL+ +RFM +FRFK+WW T  +G+ G+D+  ETQ ++++ + GS    GS  G     
Sbjct:    84 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRP--- 139

Query:   120 LYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNA 179
              Y + LP+LEG FR+  Q  E +++ +C+ESG  +V   E   +V+V AG DPF ++ +A
Sbjct:   140 -YVLLLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDA 198

Query:   180 VKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFI 239
             +K +  H+ TF   E K  P +++ FGWCTWDAFY  V  +GV +G++    GG PP  +
Sbjct:   199 MKVIRVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLV 258

Query:   240 IIDDGWQSVGMDPSGFEFRADNTA----NFANRLTHIKENHKFQKNGKEGQREEDPALGL 295
             +IDDGWQS+G D  G +    N          RL   +ENHKF K+    + + D  +G+
Sbjct:   259 LIDDGWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKF-KDYVSPKDQND--VGM 315

Query:   296 RHIVTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCD 354
             +  V ++K++   + Y+YVWHA+ GYWGG+RP    +    S +  P  SPG++      
Sbjct:   316 KAFVRDLKDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDL 373

Query:   355 AFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSR 414
             A D I + G+G  +P+    FY+ LHS+L +AGIDGVKVDV +ILE L   +GGRV L++
Sbjct:   374 AVDKIIETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAK 433

Query:   415 KYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSAVI-RASDDFWPRDPASHT----- 468
              Y +AL +S+ ++F  N +I  M H  D ++    +  + R  DDFW  DP+        
Sbjct:   434 AYFKALTSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFW 493

Query:   469 ---IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFN 525
                 H+   AYN++++G F+QPDWDMF S HP AE+H A+RA+ G  IY+SD  G+HDF+
Sbjct:   494 LQGCHMVHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFD 553

Query:   526 LLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGW 585
             LL++LVLP+GSILR +    PTRD LF DP  DGK++LKIWNLN +TGV+G FNCQG GW
Sbjct:   554 LLKRLVLPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGW 613

Query:   586 CRVGKKNLIHDEQPGTTTGFIRAKDVDY----LP-RVAGDEWTGDAIAYSHLGGEVAYLP 640
             CR  ++N    E   T T     KDV++     P  +A  E     ++ S    ++    
Sbjct:   614 CRETRRNQCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFALFLSQSK---KLLLSG 670

Query:   641 KNATLPITLKSREYEVYTVVPVKELSSGT-RFAPIGLVKMFNSGGAIKELRYESEGTATV 699
              N  L +TL+  ++E+ TV PV  +   + RFAPIGLV M N+ GAI+ L Y  E   +V
Sbjct:   671 LNDDLELTLEPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSLVYNDE---SV 727

Query:   700 DMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLV 738
             ++ V G GEF  Y+S +P    +D E V+FGYE+   +V
Sbjct:   728 EVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMV 766


>UNIPROTKB|Q5VQG4 [details] [associations]
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 1350 (480.3 bits), Expect = 6.5e-138, P = 6.5e-138
 Identities = 305/757 (40%), Positives = 431/757 (56%)

Query:    13 NLMVKGSCVLANVKENIVVTPAAG-------GALVDGAFIGVTSDQLGSRRVFPVGKLEG 65
             +L V G   L +V  NI +TPA+         A   G+F+G  +     R V P+GKL  
Sbjct:    34 DLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHVVPIGKLRD 93

Query:    66 LRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVFL 125
              RFM +FRFK+WW T  +G  G+DV  ETQ ++++ + G+   + S  G      Y + L
Sbjct:    94 TRFMSIFRFKVWWTTHWVGTNGRDVENETQMMILD-QSGT---KSSPTGPRP---YVLLL 146

Query:   126 PILEGDFRAVLQ-GNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVE 184
             PI+EG FRA L+ G  ++ + + LESG   V        V++ AG DPFD++ +A++ V 
Sbjct:   147 PIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMRVVR 206

Query:   185 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 244
              HL TF   E K  P +++ FGWCTWDAFY  V  EGV +G+     GG PP  ++IDDG
Sbjct:   207 AHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLIDDG 266

Query:   245 WQSVGMDP----SGFEFRADNTAN--FANRLTHIKENHKFQKNGKEGQREEDPALGLRHI 298
             WQS+  D     SG E     +A      RL   +EN+KF++   +G        G+   
Sbjct:   267 WQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFREY--KG--------GMGGF 316

Query:   299 VTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFD 357
             V E+K     ++ VYVWHA+ GYWGG+RPG  G+    +K+  P  SPG+Q      A D
Sbjct:   317 VREMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVD 374

Query:   358 SIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYH 417
              I  NG+GLV+P +    Y+ LHS+L ++GIDGVKVDV ++LE +   +GGRV+L++ Y 
Sbjct:   375 KIVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYF 434

Query:   418 QALEASIARNFRNNDIICCMSHNTDG-LYSAKRSAVIRASDDFWPRDPASHT-------- 468
               L  S+ R+F  N +I  M H  D  L   +  A+ R  DDFW  DP+           
Sbjct:   435 AGLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQG 494

Query:   469 IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLR 528
              H+   AYN++++G F+ PDWDMF S HP A +H A+RAV G  +YVSD  G HDF+LLR
Sbjct:   495 CHMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLR 554

Query:   529 KLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRV 588
             +L LPDG+ILR +    PTRDCLF+DP  DGK++LKIWN+N F+GV+G FNCQG GW R 
Sbjct:   555 RLALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSRE 614

Query:   589 GKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDAIA-YSHLGGEVAYLPKNATLPI 647
              ++N+         T      DV++     G    GD  A Y     ++  L ++ ++ +
Sbjct:   615 ARRNMCAAGFSVPVTARASPADVEWSHGGGG----GDRFAVYFVEARKLQLLRRDESVEL 670

Query:   648 TLKSREYEVYTVVPVKELSS---GTRFAPIGLVKMFNSGGAIKELRY-ESEGTATVDMKV 703
             TL+   YE+  V PV+ + S   G  FAPIGL  M N+GGA++       +G    ++ V
Sbjct:   671 TLEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFEAARKDGDVAAEVAV 730

Query:   704 RGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTL 740
             +G GE  AYSSARPR   V+ ++ +F YE+  G+VT+
Sbjct:   731 KGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTV 765


>UNIPROTKB|Q93XK2 [details] [associations]
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 821 (294.1 bits), Expect = 9.4e-122, Sum P(2) = 9.4e-122
 Identities = 180/460 (39%), Positives = 263/460 (57%)

Query:   285 GQREEDPA-LGLRHIVTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPV 342
             G++ E  +  GL+    +++ K   L  VYVWHA+ G WGGVRP  T   H ++K+    
Sbjct:   373 GEKSEMKSEYGLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HLDTKIVPCK 429

Query:   343 SSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETL 402
              SPG+       A   I+K  LGLV+P +    YD +HSYLA +GI GVKVDV + LE +
Sbjct:   430 LSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYV 489

Query:   403 GAGHGGRVKLSRKYHQALEASIARNFRNNDIICCMSHNTDGLY-SAKRSAVIRASDDFWP 461
                +GGRV L++ Y++ L  SI +NF  N +I  M H  D  +   K+ ++ R  DDFW 
Sbjct:   490 CDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFWF 549

Query:   462 RDPASHT--------IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAI 513
             +DP            +H+   +YN++++G+ +QPDWDMF S H  A++H  +RA+ G  I
Sbjct:   550 QDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGPI 609

Query:   514 YVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTG 573
             YVSD  G HDF+L++KLV PDG+I +      PTRDCLF +P  D  ++LKIWN N + G
Sbjct:   610 YVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYGG 669

Query:   574 VVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDA---IAYS 630
             V+G FNCQGAGW  + +K     E      G +   +V++  +       G A   + Y 
Sbjct:   670 VIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSH-LGKAEEYVVYL 728

Query:   631 HLGGEVAYLP-KNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKEL 689
             +   E++ +  K+  +  T++   +E+Y+ VPV +L  G +FAPIGL  MFNSGG + +L
Sbjct:   729 NQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNSGGTVIDL 788

Query:   690 RYESEGTATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQF 729
              Y   G     +KV+G G F AYSS  P++  ++  EV F
Sbjct:   789 EYVGNGAK---IKVKGGGSFLAYSSESPKKFQLNGCEVDF 825

 Score = 397 (144.8 bits), Expect = 9.4e-122, Sum P(2) = 9.4e-122
 Identities = 85/245 (34%), Positives = 129/245 (52%)

Query:    42 GAFIGVTSDQLGSRRVFPVGKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEA 101
             G F G + +    R +  +G   G  F+ +FRFK WW TQ +G  G D+  ETQ++++E 
Sbjct:    72 GGFFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131

Query:   102 REGSHFDEGSQYGEEQSALYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGS 161
              E              +  Y V +PI+E  FR+ L     + ++I  ESG   V E   +
Sbjct:   132 PE--------------TKSYVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFN 177

Query:   162 HLVFVAAGSDPFDVITNAVKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEG 221
              + +V    +P+D++  A   +  HL +F   E K +P++++ FGWCTWDAFY  V   G
Sbjct:   178 SIAYVHFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIG 237

Query:   222 VKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFRAD--NTA----NFANRLTHIKEN 275
             +  GL+ F KGG+ P+F+IIDDGWQS+  D  G++   D  N        + RL    E 
Sbjct:   238 IFHGLDDFSKGGVEPRFVIIDDGWQSISFD--GYDPNEDAKNLVLGGEQMSGRLHRFDEC 295

Query:   276 HKFQK 280
             +KF+K
Sbjct:   296 YKFRK 300


>TAIR|locus:2141425 [details] [associations]
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 756 (271.2 bits), Expect = 4.7e-116, Sum P(2) = 4.7e-116
 Identities = 167/463 (36%), Positives = 261/463 (56%)

Query:   311 VYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPE 370
             +YVWHA+ G W GVRP    M   ++K+     SP + +     A D + + G+GLV+P 
Sbjct:   416 IYVWHALCGAWNGVRPET--MMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473

Query:   371 KVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRN 430
             K   FYD +HSYLAS G+ G K+DV   LE+L   HGGRV+L++ Y+  L  S+ +NF  
Sbjct:   474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533

Query:   431 NDIICCMSHNTDGLYSA-KRSAVIRASDDFWPRDPASHT--------IHIASVAYNTIFL 481
              D+I  M    +  + A K+ ++ R  DDFW +DP            +H+   +YN+I++
Sbjct:   534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593

Query:   482 GEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQ--HDFNLLRKLVLPDGSILR 539
             G+ +QPDWDMF S H  AEYH A+RA+ G  +Y+SD  G+  H+F+L++KL   DG+I R
Sbjct:   594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653

Query:   540 AKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQP 599
                   PTRD LF +P  D +S+LKI+N N F GV+G FNCQGAGW     +   + E  
Sbjct:   654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713

Query:   600 GTTTGFIRAKDV--DYLPRVAGDE--WTGDAIAYSHLGGEVAYL-PKNATLPITLKSREY 654
              T +G +   D+  D  P  AG +  +TGD + Y     E+ ++  K+  + ITL+   +
Sbjct:   714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773

Query:   655 EVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELR-YESEGTATVDMKVRGCGEFGAYS 713
             ++ + VPV EL S         + + N    +  ++  +  G  ++ + V+G G F AYS
Sbjct:   774 DLLSFVPVTELVSSG--VRFAPLGLINMFNCVGTVQDMKVTGDNSIRVDVKGEGRFMAYS 831

Query:   714 SARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISF 756
             S+ P +  ++ +E +F +EEE+G ++  +   +E   + ++SF
Sbjct:   832 SSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEESGGISHLSF 874

 Score = 408 (148.7 bits), Expect = 4.7e-116, Sum P(2) = 4.7e-116
 Identities = 99/299 (33%), Positives = 150/299 (50%)

Query:     8 SVSDGNLMVKGSC-VLANVKENIVVTPAAGGAL-VD---------------GAFIGVTSD 50
             ++S+G+L  K S  +L +V +N+  TP +  ++  D               G F+G T +
Sbjct:    35 NLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILLRVQANAHKGGFLGFTKE 94

Query:    51 QLGSRRVFPVGKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEG 110
                 R    +G+ E   F+ +FRFKMWW T  +G  G D+  ETQ+++++  E    D  
Sbjct:    95 SPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIPE---IDS- 150

Query:   111 SQYGEEQSALYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGS 170
                       Y   +P +EG FRA L   E+  + IC ESG   V E     + ++    
Sbjct:   151 ----------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIAYIHICD 200

Query:   171 DPFDVITNAVKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFE 230
             +P++++  A   +  H+ TF   E KK+P +++ FGWCTWDA Y  V    +  G++ FE
Sbjct:   201 NPYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFE 260

Query:   231 KGGIPPKFIIIDDGWQSVGMDPSGFEFRADNTA----NFANRLTHIKENHKFQKNGKEG 285
              GG+ PKF+IIDDGWQS+  D    +  A+N          RLT  KE  KF +N K G
Sbjct:   261 DGGVCPKFVIIDDGWQSINFDGDELDKDAENLVLGGEQMTARLTSFKECKKF-RNYKGG 318


>ASPGD|ASPL0000010056 [details] [associations]
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 384 (140.2 bits), Expect = 5.7e-40, Sum P(2) = 5.7e-40
 Identities = 131/450 (29%), Positives = 218/450 (48%)

Query:   294 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEP 352
             GL   VT I+E+H +++Y+ VWHA+ GYWGG+ P  +    Y+++               
Sbjct:   384 GLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTREV------------- 430

Query:   353 CDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKL 412
               A +S  +  +  ++P  +  FY++ +++L+ +GI GVK D Q+ L+ L A    R   
Sbjct:   431 --ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRSY 487

Query:   413 SRKYHQALEASIARNFRNNDIICCMSHNTDGLYSA-----KRSAVIRASDDFWPRDPASH 467
             +  Y  A   S  R+F     I CMS     ++ +     K + V+R S+DF+P    SH
Sbjct:   488 ANAYQDAWTISSLRHF-GPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546

Query:   468 TIHIASVAYNTIFLGEFMQ--PDWDMFHSLHP----MAEYHGAARAVGGCAIYVSDKPGQ 521
             T H+   A+N + L  ++   PDWDMF +L       A +H AAR + G  IY++DKPGQ
Sbjct:   547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605

Query:   522 HDFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSL-LKIWN--LNDFTGV 574
             HD  L++++      G+   LR  +  R T D ++ D  ++G  L +  ++      +G+
Sbjct:   606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662

Query:   575 VGVFNCQG---AGWCRVGKKNLIHDEQPGTTTGFI-RAKDVDYLPRVAGDEWTGDAIAYS 630
             +GVFN      +    V     I+D+Q    TG+I RA       R+ G+  +  A++ +
Sbjct:   663 IGVFNVSNRVESVIIPVADFPGIYDDQE--ETGYIVRAHRTG---RIVGELHSSSAVSVT 717

Query:   631 --HLGGEV--AYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAI 686
                   EV  AY  K  T  +  K +E E  + +P  ++S     A +GL++      A+
Sbjct:   718 LNERRWEVLTAYPVKTLTFKMNSKDKENE--SSMPTADVSVDV--AILGLLRKMTGVAAL 773

Query:   687 --KELRYESEGTATVDMKVRGCGEFGAYSS 714
                ++  E  G   VD+ ++  G  G Y S
Sbjct:   774 VSSDIYIEDTGRLRVDVGIKALGVLGIYFS 803

 Score = 122 (48.0 bits), Expect = 5.7e-40, Sum P(2) = 5.7e-40
 Identities = 43/173 (24%), Positives = 77/173 (44%)

Query:   120 LYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNA 179
             ++ V L +   D   VL      E+ I  ++ +     F+    V  A  +D F+V T+A
Sbjct:   230 VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQNDNATPSRFQ----VLAATAAD-FEVATSA 284

Query:   180 VKTVERHLLTFSHRERKKMPD---MLNWF---GWCTWDAFYTDVTGEGVKQGLESFEKGG 233
             +    R L+       +  P    +  W+    +CTW+    D++ E +   L+  +  G
Sbjct:   285 LIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKTAG 344

Query:   234 IPPKFIIIDDGWQSVGMDPSGF------EFRADNTA---NFANRLTHIKENHK 277
             I  + +IIDD WQS+  + +G       +F A++ A     A  +T I+E H+
Sbjct:   345 IRIRTLIIDDNWQSLDNEGAGSWHRALTQFEANSKAFPNGLAKAVTTIREQHR 397


>UNIPROTKB|Q97U94 [details] [associations]
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 234 (87.4 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
 Identities = 67/199 (33%), Positives = 97/199 (48%)

Query:   368 NPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARN 427
             N E    FY      +     D VKVD Q ++  +       +  SR    AL+ S+ + 
Sbjct:   342 NLEDAIGFYKAFDGNILR-DFDLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGK- 398

Query:   428 FRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQP 487
                 D+I CMS N +   +   S V+R S D+ P       +HI   AYN++     + P
Sbjct:   399 ----DVINCMSMNPENYCNYFYSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYP 454

Query:   488 DWDMFHSLHPMAEYHGAARAVGGCAIYVSDK-PGQHDFNLLRKLVLPDGSILRAKLPGRP 546
             D+DMF S  P A+ H  AR   G  IY++D+ P + +  LLR  VLP+G ++R   P   
Sbjct:   455 DYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALI 514

Query:   547 TRDCLFSDPARDGKSLLKI 565
             T D LF DP R+ + LLK+
Sbjct:   515 TEDLLFKDPLRE-RVLLKL 532

 Score = 177 (67.4 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
 Identities = 38/125 (30%), Positives = 63/125 (50%)

Query:   154 DVDEFEGSHLVFVAAGSDPFDVITNAVKTVERHLLTFSHRERKKMPD-MLNWFGWCTWDA 212
             + DE + S+ + +    +P+  I NA+    +   TF  R+ K  PD ++N  GWC+W+A
Sbjct:   172 NTDEIKRSYFLSIGTSDNPYKAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNA 231

Query:   213 FYT-DVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFRADNTA---NFANR 268
             F T D+  E + + ++   + G+   ++IIDDGWQ    D +      DN      F N 
Sbjct:   232 FLTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNNDRAIRSLNPDNKKFPNGFKNT 291

Query:   269 LTHIK 273
             +  IK
Sbjct:   292 VRAIK 296

 Score = 79 (32.9 bits), Expect = 5.0e-13, Sum P(2) = 5.0e-13
 Identities = 32/105 (30%), Positives = 45/105 (42%)

Query:   294 GLRHIVTEIKEKHDLKYVYVWHAITGYWGGVRP------GVTGMEHYESKMQYPVSSPGV 347
             G ++ V  IK    +KYV +WHAI  +WGG+         V G  ++ + +   V SP +
Sbjct:   287 GFKNTVRAIKSL-GVKYVGLWHAINAHWGGMSQELMKSLNVNG--YFTNFLNSYVPSPNL 343

Query:   348 QSNEPC-DAFDSIAKNGLGLVNPEK--VFH-FYDELHSYLASAGI 388
             +       AFD        LV  +   V H  YD     LAS  I
Sbjct:   344 EDAIGFYKAFDGNILRDFDLVKVDNQWVIHAIYDSFPIGLASRNI 388


>UNIPROTKB|G4NBB7 [details] [associations]
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 337 (123.7 bits), Expect = 6.9e-29, Sum P(3) = 6.9e-29
 Identities = 101/326 (30%), Positives = 159/326 (48%)

Query:   294 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRP-GVTGMEHYESKMQYPVSSPGVQSNE 351
             GL+ +V+EI++++  ++ + VWH I GYWGG+ P G    ++   K+Q    +  VQ   
Sbjct:   404 GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKIQLRDEAE-VQ--- 459

Query:   352 PCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVK 411
             P D FD    +G      E V   YD+ +++LA  G+   KVD Q  L+   A    R  
Sbjct:   460 PKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFLD-YPAHANDRKN 511

Query:   412 LSRKYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSA-------VIRASDDFWPRDP 464
             L R Y  A  A+ +++F    I C        L+S  +         + R SDDF+P + 
Sbjct:   512 LIRPYQDAWTAAASKHFGGRAIACMAQTPQSILHSLLQQGRSEGPMLMARNSDDFFPDEV 571

Query:   465 ASHTIHIASVAYNTIFLGEF-MQPDWDMFHSLHPM-AEYHGAARAVGGCAIYVSDKPGQH 522
              SHT H+   A+N + +    +  DWDMF +  P  A  H  AR++ G  IY++D PG+H
Sbjct:   572 GSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMSGGPIYITDAPGEH 631

Query:   523 DFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVF 578
             D  L++++     DG    LRA  PGR     L+       + LL++ + +   G++GVF
Sbjct:   632 DVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRVRSGHQGVGMLGVF 687

Query:   579 NCQGAGWCRVGKKNLIHDEQPGTTTG 604
             N    G   +G++  + D   G   G
Sbjct:   688 NVCNRG-SLLGEQVRLDDIFDGEKAG 712

 Score = 151 (58.2 bits), Expect = 1.6e-08, Sum P(3) = 1.6e-08
 Identities = 45/157 (28%), Positives = 82/157 (52%)

Query:   185 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 244
             +H LT   + R ++ D  + F +CTW++   D++ + +   L    + GI    +IIDD 
Sbjct:   319 KHSLT---QARAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDN 375

Query:   245 WQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKE 304
             WQS  +D  G +         A+R    +   +F+ N ++G  +     GL+ +V+EI++
Sbjct:   376 WQS--LDGDGSD---------ASR----RRWERFEAN-QQGFPQ-----GLKGLVSEIRK 414

Query:   305 KH-DLKYVYVWHAITGYWGGVRP-GVTGMEHYESKMQ 339
             ++  ++ + VWH I GYWGG+ P G    ++   K+Q
Sbjct:   415 QNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKIQ 451

 Score = 70 (29.7 bits), Expect = 6.9e-29, Sum P(3) = 6.9e-29
 Identities = 21/89 (23%), Positives = 41/89 (46%)

Query:   634 GE-VAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELRYE 692
             GE +A   +   + + L+   +E++T  P+ +L  G   A +GLV    +  A+  + Y 
Sbjct:   724 GEMIAPASRETVIEVGLEEGGFEIFTAYPITKLG-GLAVATLGLVGKMATAAAVSHVSYS 782

Query:   693 S--EGTATVDMKV----RGCGEFGAYSSA 715
                EG   V ++V    +  G  G ++ +
Sbjct:   783 KHHEGFIPVGVEVSVSLKALGTLGIFAQS 811

 Score = 41 (19.5 bits), Expect = 6.9e-29, Sum P(3) = 6.9e-29
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:   116 EQSALYTVFLPILEGDFRAVLQG 138
             E+ A +TV +P LE +   V+ G
Sbjct:    23 EKDATFTVGVPALELEHGGVING 45


>UNIPROTKB|Q8A170 [details] [associations]
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 196 (74.1 bits), Expect = 4.4e-20, Sum P(3) = 4.4e-20
 Identities = 53/193 (27%), Positives = 91/193 (47%)

Query:   370 EKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFR 429
             EK+  +Y+     +   G D +K+D Q+    L  G    ++ ++  + ALE    R   
Sbjct:   348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHR--M 405

Query:   430 NNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDW 489
                ++ CM+ N   +     S+V RAS D+   D      H+     NT+ LG+ + PD 
Sbjct:   406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465

Query:   490 DMFHSLHPMA-EYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTR 548
             DMFHS   +       ++A+ G  +Y+SD P +   + +R L+   G I R   P  PT 
Sbjct:   466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525

Query:   549 DCLFSDPARDGKS 561
             + + ++P + GK+
Sbjct:   526 ESILTNPLQSGKA 538

 Score = 114 (45.2 bits), Expect = 4.4e-20, Sum P(3) = 4.4e-20
 Identities = 21/84 (25%), Positives = 46/84 (54%)

Query:   163 LVFVAAGSDPFDVITNAVKTV--ERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGE 220
             L+F    S  + V ++A  ++  ++ +     R  K+  +  ++ GWCTW+ ++ D+   
Sbjct:   187 LIF-RKSSSVYHVFSDAYDSLIADKAVSALRKRADKQYFNAFDYLGWCTWEHYHYDIDET 245

Query:   221 GVKQGLESFEKGGIPPKFIIIDDG 244
              +   +++ E  GIP ++++IDDG
Sbjct:   246 KILNDIDAIEASGIPVRYVLIDDG 269

 Score = 58 (25.5 bits), Expect = 4.4e-20, Sum P(3) = 4.4e-20
 Identities = 6/22 (27%), Positives = 17/22 (77%)

Query:   303 KEKHDLKYVYVWHAITGYWGGV 324
             K+   ++++ +W++++GYW G+
Sbjct:   299 KQADKIRWIGLWYSLSGYWMGI 320

 Score = 55 (24.4 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 21/96 (21%), Positives = 40/96 (41%)

Query:   176 ITNAVKTVERHLLTFSHRERKKMPDMLNWFG-WCTWDAFYTDVTGEG-----VKQGLESF 229
             +T+ V   +R    +S   ++K  D + W G W +   ++  ++ E      ++Q L S+
Sbjct:   278 LTSLVPDKKRFPNGWSRIMKRKQADKIRWIGLWYSLSGYWMGISAENDFPPEIRQVLHSY 337

Query:   230 EKGGIPPKFIIIDDGWQSV---GMDPSGFEF-RADN 261
                 +P       + W       M   GF+F + DN
Sbjct:   338 NGSLLPGTSTEKIETWYEYYVRTMKEYGFDFLKIDN 373


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.138   0.426    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      758       758   0.00091  121 3  11 22  0.40    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  628 (67 KB)
  Total size of DFA:  421 KB (2202 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  59.73u 0.09s 59.82t   Elapsed:  00:00:03
  Total cpu time:  59.74u 0.09s 59.83t   Elapsed:  00:00:03
  Start:  Tue May 21 09:25:42 2013   End:  Tue May 21 09:25:45 2013

Back to top