BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>046494
MTVTAKATIIKDGCLMVRGNVVLTGVPQNVVVSPSSFIGATSAAPPSSRHVFTLGVLPDG
YRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTFYILLLPVL
DGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSIKILEKHKG
TFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQET
INEFCKDGEPLIEGTQFAIRLVDIKENCKFNSSGSDNSCNDLHEFIDEIKEKYGLKYVYM
WHALAGYWGGVLPSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVK
VDVQSLMETLGSGYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSMKSAV
ARASEDFMPGEPTFQTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCA
VYVSDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLS
GVIGVFNCQGAGSWPMKEDMHRKPASPLSISGHVCPLDIEFLERVAGENWNGDCAVYAFN
SGVLTKLPKKGNLEVSLATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFEYI
MDLSKYIIKIKGKGCGRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLPGECTLRDI
EFVY
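
The query above is given in FASTA format: a `>` header line followed by wrapped residue lines. As a minimal sketch (the function name `read_fasta` is a hypothetical helper, not part of any BLAST tool), the residue count can be checked against the "(724 letters)" figure reported in the raw BLAST data by joining the wrapped lines:

```python
def read_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into {record id: sequence}; hypothetical helper."""
    chunks: dict[str, list[str]] = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record id is the first whitespace-delimited token after '>'
            tokens = line[1:].split()
            name = tokens[0] if tokens else ""
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    # Join the wrapped residue lines of each record into one string
    return {key: "".join(parts) for key, parts in chunks.items()}


# Shortened excerpt for illustration only; paste the full record to check its length.
excerpt = ">046494\nMTVTAKATII\nKDGCLMVRGN\n"
lengths = {key: len(seq) for key, seq in read_fasta(excerpt).items()}
```

Running the same function on the complete record gives the number of letters to compare with the report.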

High Scoring Gene Products

Symbol     Full name                                  Information                                         P value
SIP2       AT3G57520                                  protein from Arabidopsis thaliana                   2.3e-204
SIP1       AT1G55740                                  protein from Arabidopsis thaliana                   2.6e-184
SIP1       AT5G40390                                  protein from Arabidopsis thaliana                   4.6e-135
RFS        Galactinol--sucrose galactosyltransferase  protein from Oryza sativa Japonica Group            3.3e-130
STS        AT4G01970                                  protein from Arabidopsis thaliana                   5.0e-119
STS1       Stachyose synthase                         protein from Pisum sativum                          4.5e-105
galS       Alpha-galactosidase                        protein from Sulfolobus solfataricus P2             6.9e-34
MGG_11554  Seed imbibition protein                    protein from Magnaporthe oryzae 70-15               2.0e-31
BT_3797    Possible alpha-galactosidase               protein from Bacteroides thetaiotaomicron VPI-5482  9.6e-19


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  046494
        (724 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  1077  2.3e-204  3
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  1079  2.6e-184  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...   734  4.6e-135  2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...   771  3.3e-130  2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   650  5.0e-119  4
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   677  4.5e-105  2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   414  1.4e-35   1
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   291  6.9e-34   2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   377  2.0e-31   1
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   219  9.6e-19   2
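
Each line of the hit table above has the same shape: database identifier, truncated description, high score, smallest sum probability P(N), and N. A minimal Python sketch for extracting those columns from pasted lines follows. The regular expression and the names `HIT_LINE`, `parse_hits`, and `THRESHOLD` are illustrative assumptions rather than anything provided by WU-BLAST, and the 0.001 cutoff simply mirrors the Threshold parameter listed earlier in this report.

```python
import re

# Assumed layout: "<db|id> - <description>  <score>  <P(N)>  <N>"
HIT_LINE = re.compile(r"^(\S+) - (.*?)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")

THRESHOLD = 0.001  # mirrors the "Threshold" parameter of this report


def parse_hits(lines):
    """Return (identifier, score, p_value, n) tuples for lines that match."""
    hits = []
    for line in lines:
        match = HIT_LINE.match(line.strip())
        if match is None:
            continue  # not a hit-table row (header, blank line, ...)
        ident, _description, score, p_value, n = match.groups()
        hits.append((ident, int(score), float(p_value), int(n)))
    return hits


sample = [
    'TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  1077  2.3e-204  3',
    'UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   219  9.6e-19   2',
]
hits = parse_hits(sample)
# Keep only hits whose P(N) is at or below the reporting threshold
significant = [hit for hit in hits if hit[2] <= THRESHOLD]
```

Note that the table is not sorted by score: rows are ordered by P(N), so the second row can carry a higher score than the first.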


>TAIR|locus:2103488
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 1077 (384.2 bits), Expect = 2.3e-204, Sum P(3) = 2.3e-204
 Identities = 196/341 (57%), Positives = 264/341 (77%)

Query:   321 DIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLL 380
             DI MDSL  +G+G+++P+K+F+FYN+LHSYLA+ G+DGVKVDVQ+++ETLG+G GGRV L
Sbjct:   344 DIVMDSLAVHGLGLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSL 403

Query:   381 TRQYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIA 440
             TR YQQALE S+A NF DN  I CM HN+  LYS+ ++A+ RAS+DF P +P   T+HIA
Sbjct:   404 TRSYQQALEASIARNFTDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIA 463

Query:   441 SVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYVSDKPGVHDFKILKRLVL 500
             SVA+NSL LGE + PDWDMF S H TAE+HA ARA+GGCA+YVSDKPG H+F +L++LVL
Sbjct:   464 SVAYNSLFLGEFMQPDWDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVL 523

Query:   501 PDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFNCQGAGSW---PMK 557
             PDGSVLRA+  GRPTRDCLF DP  DG SLLKIWN+NK +G++GVFNCQGAG W     K
Sbjct:   524 PDGSVLRAKLPGRPTRDCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAG-WCKETKK 582

Query:   558 EDMHRKPASPLSISGHVCPLDIEFLERVAGENWNGDCAVYAFNSGVLTKLPKKGNLEVSL 617
               +H    SP +++G +   D + + +VAGE+W+GD  VYA+ SG + +LPK  ++ ++L
Sbjct:   583 NQIH--DTSPGTLTGSIRADDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTL 640

Query:   618 ATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFE 658
               L+ E++ I P++ + +++ FAPIGL+DM+NS GA+ES +
Sbjct:   641 KVLEYELFHISPLKEITENISFAPIGLVDMFNSSGAIESID 681

 Score = 826 (295.8 bits), Expect = 2.3e-204, Sum P(3) = 2.3e-204
 Identities = 156/329 (47%), Positives = 218/329 (66%)

Query:     1 MTVTAKATIIKDGCLMVRGNVVLTGVPQNVVVSP--------SSFIGATSAAPPSSRHVF 52
             MT+T+  ++  D  L+V+G  +LT +P N++++P         SFIGAT      S HVF
Sbjct:     1 MTITSNISVQNDN-LVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQS-KSLHVF 58

Query:    53 TLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTF 112
              +GVL +G RF+C FRFK+WWM  R+G    ++P+ETQ +LLE++++   + D A   T 
Sbjct:    59 PIGVL-EGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAP--TV 115

Query:   113 YILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSI 172
             Y + LP+L+GQFRA LQG   N+++ C ESGD +V+TS+    V++++G NPFE+I+ S+
Sbjct:   116 YTVFLPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSV 175

Query:   173 KILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLV 232
             K +E+H  TF H E KK+P  LDWFGWCTWDAFY  V  +G+ EGL S  EGG  P+FL+
Sbjct:   176 KAVERHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLI 235

Query:   233 IDDGWQETINEFCKDGEPLI-EGTQFAIRLVDIKENCKFNSSGS-DNSCNDLHEFIDEIK 290
             IDDGWQ+  N+  KD   ++ EG QFA RLV IKEN KF  S   D   + L   +D  K
Sbjct:   236 IDDGWQQIENKE-KDENCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAK 294

Query:   291 EKYGLKYVYMWHALAGYWGGVLPSSDIMK 319
             +++ +K VY WHALAGYWGGV P++  M+
Sbjct:   295 QRHNVKQVYAWHALAGYWGGVKPAASGME 323

 Score = 113 (44.8 bits), Expect = 2.3e-204, Sum P(3) = 2.3e-204
 Identities = 21/36 (58%), Positives = 28/36 (77%)

Query:   677 RFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP 712
             RFGAYSS +P  C V++ E +FTY+AE GL+T+ LP
Sbjct:   723 RFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLP 758
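
The "Identities" and "Positives" lines for the three segments above pair a raw fraction with a whole-number percentage. Those percentages are consistent with truncation rather than rounding (28/36 is 77.8%, printed as 77%). The sketch below, in Python, re-derives them under that assumption; `STATS`, `truncated_percent`, and `check_stats_line` are hypothetical names introduced only for this illustration.

```python
import re

# Matches e.g. "Identities = 21/36 (58%), Positives = 28/36 (77%)"
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def truncated_percent(matches: int, length: int) -> int:
    """Percentage with the fractional part dropped (assumed report behaviour)."""
    return 100 * matches // length


def check_stats_line(line: str) -> bool:
    """True if both printed percentages equal the truncated fractions."""
    found = STATS.search(line)
    if found is None:
        raise ValueError("not an Identities/Positives line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = map(int, found.groups())
    return (
        truncated_percent(ident, ident_len) == ident_pct
        and truncated_percent(pos, pos_len) == pos_pct
    )


ok = check_stats_line(" Identities = 21/36 (58%), Positives = 28/36 (77%)")
```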


>TAIR|locus:2020452
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 1079 (384.9 bits), Expect = 2.6e-184, Sum P(2) = 2.6e-184
 Identities = 205/390 (52%), Positives = 272/390 (69%)

Query:   324 MDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLLTRQ 383
             ++S+ K G+G+++P+K+F FYNDLHSYLA+ GVDGVKVDVQ+++ETLG+G+GGRV L ++
Sbjct:   351 LESITKNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKK 410

Query:   384 YQQALEQSVAWNFKDNNLICCMSHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIASVA 443
             Y QALE S++ NF DN +I CMSHN+  LYS+ K+AV RAS+DF P +P   T+HIASVA
Sbjct:   411 YHQALEASISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVA 470

Query:   444 FNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYVSDKPGVHDFKILKRLVLPDG 503
             +N+L LGE + PDWDMF S H  AE+HA ARA+GGCA+YVSDKPG HDF +L++LVL DG
Sbjct:   471 YNTLFLGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDG 530

Query:   504 SVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFNCQGAGSWPMKEDMHR- 562
             S+LRA+  GRPT DC F DPV D KSLLKIWNLN+ +GVIGVFNCQGAG W   E  +  
Sbjct:   531 SILRAKLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAG-WCKNEKRYLI 589

Query:   563 KPASPLSISGHVCPLDIEFLERVAGENWNGDCAVYAFNSGVLTKLPKKGNLEVSLATLKC 622
                 P +ISG V   D+ +L +VA   W GD  VY+   G L  LPK  +L V+L   + 
Sbjct:   590 HDQEPGTISGCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREY 649

Query:   623 EIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFEYIMDLSKYXXXXXXXXXXRFGAYS 682
             E++T+ P++       FAP+GL++M+NSGGA+ S  Y  + +K+            G YS
Sbjct:   650 EVFTVVPVKEFSDGSKFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRMKLRGSGLVGVYS 709

Query:   683 S-SKPKCCMVDTKEEEFTYNAEDGLLTVKL 711
             S  +P+   VD+ + E+ Y  E GL+T  L
Sbjct:   710 SVRRPRSVTVDSDDVEYRYEPESGLVTFTL 739

 Score = 731 (262.4 bits), Expect = 2.6e-184, Sum P(2) = 2.6e-184
 Identities = 153/331 (46%), Positives = 203/331 (61%)

Query:     1 MTVTAKATIIKDGCLMVRGNVVLTGVPQNVVVSPSS--------FIGATSAAPPSSRHVF 52
             MTV A  ++  D  L+V G+ VL GVP+NV+V+P+S        FIG TS    S R VF
Sbjct:     1 MTVGAGISVT-DSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHR-VF 58

Query:    53 TLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTF 112
             +LG L D  RF+C+FRFK+WWM  R+G +  E+P ETQ L++EA + S  D      ++ 
Sbjct:    59 SLGKLED-LRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGS--DLGGRDQSSS 115

Query:   113 YILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSI 172
             Y++ LP+L+G FRA LQG   N+L+ C+ESGD +V   E    VF+ +G +PF++I  ++
Sbjct:   116 YVVFLPILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAV 175

Query:   173 KILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLV 232
             K +E+H  TFSH E KK+P  L+WFGWCTWDAFY  V  + +K+GL S   GG +P+F++
Sbjct:   176 KAVEQHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVI 235

Query:   233 IDDGWQ-----ETINEFCKDGEPLIEGTQFAIRLVDIKENCKFNSSGS-----DNSCNDL 282
             IDDGWQ     ET  EF  D         FA RL  IKEN KF   G      D+    L
Sbjct:   236 IDDGWQSVGMDETSVEFNADN-----AANFANRLTHIKENHKFQKDGKEGHRVDDPSLSL 290

Query:   283 HEFIDEIKEKYGLKYVYMWHALAGYWGGVLP 313
                I +IK    LKYVY+WHA+ GYWGGV P
Sbjct:   291 GHVITDIKSNNSLKYVYVWHAITGYWGGVKP 321


>TAIR|locus:2170528
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 734 (263.4 bits), Expect = 4.6e-135, Sum P(2) = 4.6e-135
 Identities = 168/429 (39%), Positives = 244/429 (56%)

Query:   313 PSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGS 372
             P   +  +D+A+D + + G+G   P    +FY  LHS+L N+G+DGVKVDV  ++E L  
Sbjct:   364 PGLKLTMEDLAVDKIIETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQ 423

Query:   373 GYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM--- 428
              YGGRV L + Y +AL  SV  +F  N +I  M H N +    +   ++ R  +DF    
Sbjct:   424 KYGGRVDLAKAYFKALTSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTD 483

Query:   429 P-GEP--TF--QTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYV 483
             P G+P  TF  Q  H+   A+NSL +G  + PDWDMFQS H  AEFHA +RA+ G  +Y+
Sbjct:   484 PSGDPNGTFWLQGCHMVHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYI 543

Query:   484 SDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVI 543
             SD  G HDF +LKRLVLP+GS+LR  +   PTRD LFEDP+ DGK++LKIWNLNK +GVI
Sbjct:   544 SDCVGKHDFDLLKRLVLPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVI 603

Query:   544 GVFNCQGAGSWPMKEDMHRKPASPL--SISGHVCPLDIEF---LERVAGENWNGDCAVYA 598
             G FNCQG G W  +E    +  S    +++    P D+E+      ++  N   + A++ 
Sbjct:   604 GAFNCQGGG-W-CRETRRNQCFSECVNTLTATTSPKDVEWNSGSSPISIANVE-EFALFL 660

Query:   599 FNSGVLTKLPKKGNLEVSLATLKCEIYTICPIRVL-GQDLLFAPIGLLDMYNSGGAVESF 657
               S  L       +LE++L   K E+ T+ P+  + G  + FAPIGL++M N+ GA+ S 
Sbjct:   661 SQSKKLLLSGLNDDLELTLEPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL 720

Query:   658 EYIMDLSKYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP--GEC 715
              Y    +             F  Y+S KP  C++D +  EF Y  ED ++ V++P  G  
Sbjct:   721 VY----NDESVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGY--EDSMVMVQVPWSGPD 774

Query:   716 TLRDIEFVY 724
              L  I++++
Sbjct:   775 GLSSIQYLF 783

 Score = 610 (219.8 bits), Expect = 4.6e-135, Sum P(2) = 4.6e-135
 Identities = 128/331 (38%), Positives = 192/331 (58%)

Query:    10 IKDGCLMVRGNVVLTGVPQNVV----------------VSPSSFIGATSAAPPSSRHVFT 53
             ++D  L+  G VVLT VP NV                 VS  SFIG      P S HV +
Sbjct:    24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83

Query:    54 LGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTFY 113
             +G L +  RF+ +FRFK+WW    VG +  ++  ETQ+++L+ +  S     + S    Y
Sbjct:    84 IGKLKN-IRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRP-Y 140

Query:   114 ILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSIK 173
             +LLLP+L+G FR++ Q    +D+  CVESG + V  SE  + V++++GD+PF+L+KD++K
Sbjct:   141 VLLLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMK 200

Query:   174 ILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVI 233
             ++  H  TF  LE K  P  +D FGWCTWDAFY  VNP G+ +G+   ++GGC P  ++I
Sbjct:   201 VIRVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLI 260

Query:   234 DDGWQETINEFCKDG---EPL---IEGTQFAIRLVDIKENCKFNSSGSDNSCND--LHEF 285
             DDGWQ   ++   DG   E +   + G Q   RL+  +EN KF    S    ND  +  F
Sbjct:   261 DDGWQSIGHD--SDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAF 318

Query:   286 IDEIKEKYG-LKYVYMWHALAGYWGGVLPSS 315
             + ++K+++  + Y+Y+WHAL GYWGG+ P +
Sbjct:   319 VRDLKDEFSTVDYIYVWHALCGYWGGLRPEA 349


>UNIPROTKB|Q5VQG4
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 771 (276.5 bits), Expect = 3.3e-130, Sum P(2) = 3.3e-130
 Identities = 175/423 (41%), Positives = 242/423 (57%)

Query:   320 KDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVL 379
             +D+A+D +   GVG++DP++  + Y  LHS+L  SG+DGVKVDV  L+E +   YGGRV 
Sbjct:   369 EDLAVDKIVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVE 428

Query:   380 LTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM---P-GEP-- 432
             L + Y   L +SV  +F  N +I  M H N + L  +   A+ R  +DF    P G+P  
Sbjct:   429 LAKAYFAGLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDG 488

Query:   433 TF--QTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYVSDKPGVH 490
             TF  Q  H+   A+NSL +G  + PDWDMFQS H  A FHA +RA+ G  VYVSD  G H
Sbjct:   489 TFWLQGCHMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCH 548

Query:   491 DFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFNCQG 550
             DF +L+RL LPDG++LR      PTRDCLF DP+ DGK++LKIWN+NK SGV+G FNCQG
Sbjct:   549 DFDLLRRLALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQG 608

Query:   551 AGSWPMKEDMHRKPASPLSI--SGHVCPLDIEFLERVAGENWNGD-CAVYAFNSGVLTKL 607
              G W  +E      A+  S+  +    P D+E+     G    GD  AVY   +  L  L
Sbjct:   609 GG-WS-REARRNMCAAGFSVPVTARASPADVEWSHGGGG----GDRFAVYFVEARKLQLL 662

Query:   608 PKKGNLEVSLATLKCEIYTICPIRVLGQDLL---FAPIGLLDMYNSGGAVESFEYIMDLS 664
              +  ++E++L     E+  + P+R +    L   FAPIGL +M N+GGAV+ FE      
Sbjct:   663 RRDESVELTLEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFEAARKDG 722

Query:   665 KYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP--GECT-LRDIE 721
                            AYSS++P+ C V+ ++ EF Y  EDG++TV +P  G    L  +E
Sbjct:   723 DVAAEVAVKGAGEMVAYSSARPRLCKVNGQDAEFKY--EDGIVTVDVPWTGSSKKLSRVE 780

Query:   722 FVY 724
             + Y
Sbjct:   781 YFY 783

 Score = 527 (190.6 bits), Expect = 3.3e-130, Sum P(2) = 3.3e-130
 Identities = 119/328 (36%), Positives = 183/328 (55%)

Query:    10 IKDGCLMVRGNVVLTGVPQNVVVSPSSFIGATSAAP------------PSS--RHVFTLG 55
             +K   L V G+  L  VP N+ ++P+S +   S  P            P++  RHV  +G
Sbjct:    30 LKGKDLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHVVPIG 89

Query:    56 VLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTFYIL 115
              L D  RF+ +FRFK+WW    VG +  +V  ETQM++L+    S   +        Y+L
Sbjct:    90 KLRDT-RFMSIFRFKVWWTTHWVGTNGRDVENETQMMILD---QSGTKSSPTGPRP-YVL 144

Query:   116 LLPVLDGQFRATLQGTPTND-LQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSIKI 174
             LLP+++G FRA L+     D +   +ESG S+V+ S    AV++++GD+PF+L+KD++++
Sbjct:   145 LLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMRV 204

Query:   175 LEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVID 234
             +  H GTF  +E K  P  +D FGWCTWDAFY +V+P+G+ EG+    +GGC P  ++ID
Sbjct:   205 VRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLID 264

Query:   235 DGWQETINE---FCKDGEPLIE---GTQFAIRLVDIKENCKFNSSGSDNSCNDLHEFIDE 288
             DGWQ   ++        E +     G Q   RL+  +EN KF           +  F+ E
Sbjct:   265 DGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFREYKGG-----MGGFVRE 319

Query:   289 IKEKYG-LKYVYMWHALAGYWGGVLPSS 315
             +K  +  ++ VY+WHAL GYWGG+ P +
Sbjct:   320 MKAAFPTVEQVYVWHALCGYWGGLRPGA 347


>TAIR|locus:2141425
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 650 (233.9 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
 Identities = 148/417 (35%), Positives = 231/417 (55%)

Query:   313 PSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGS 372
             PS      D+A+D + + G+G++ P K  +FY+ +HSYLA+ GV G K+DV   +E+L  
Sbjct:   448 PSLGATMADLAVDKVVEAGIGLVHPSKAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAE 507

Query:   373 GYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM--- 428
              +GGRV L + Y   L +S+  NF   ++I  M   N +   ++ + ++ R  +DF    
Sbjct:   508 EHGGRVELAKAYYDGLTESMIKNFNGTDVIASMQQCNEFFFLATKQISIGRVGDDFWWQD 567

Query:   429 P-GEPT----FQTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYV 483
             P G+P      Q +H+   ++NS+ +G+++ PDWDMFQS H  AE+HA +RA+ G  VY+
Sbjct:   568 PYGDPQGVYWLQGVHMIHCSYNSIWMGQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYL 627

Query:   484 SDKPGV--HDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSG 541
             SD  G   H+F ++K+L   DG++ R  H   PTRD LF++P+ D +S+LKI+N NK  G
Sbjct:   628 SDHLGKASHNFDLIKKLAFFDGTIPRCVHYALPTRDSLFKNPLFDKESILKIFNFNKFGG 687

Query:   542 VIGVFNCQGAGSWPMKEDMHRKPASPLSISGHVCPLDIEFLER--VAGEN--WNGDCAVY 597
             VIG FNCQGAG  P +           ++SG V   DIE+ +    AG    + GD  VY
Sbjct:   688 VIGTFNCQGAGWSPEEHRFKGYKECYTTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVY 747

Query:   598 AFNSGVLTKLPKKGN-LEVSLATLKCEIYTICPI-RVLGQDLLFAPIGLLDMYNSGGAVE 655
                S  +  +  K   ++++L     ++ +  P+  ++   + FAP+GL++M+N  G V+
Sbjct:   748 KQQSEEILFMNSKSEAMKITLEPSAFDLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQ 807

Query:   656 SFEYIMDLSKYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP 712
               +   D S            RF AYSSS P  C ++ KE EF +  E G L+  +P
Sbjct:   808 DMKVTGDNS---IRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVP 861

 Score = 433 (157.5 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
 Identities = 102/262 (38%), Positives = 144/262 (54%)

Query:    37 FIGATSAAPPSSRHVFTLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEA 96
             F+G T  +P S R   +LG   D   FL LFRFK+WW    +GKS S++  ETQ ++L+ 
Sbjct:    88 FLGFTKESP-SDRLTNSLGRFEDR-EFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKI 145

Query:    97 REDSPLDADAASDNTFYILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAV 156
              E   +D+        Y+ ++P ++G FRA+L      ++  C ESG + V+ S      
Sbjct:   146 PE---IDS--------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIA 194

Query:   157 FINSGDNPFELIKDSIKILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKE 216
             +I+  DNP+ L+K++   L  H  TF  LE KK+P+ +D FGWCTWDA Y  V+P  I  
Sbjct:   195 YIHICDNPYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWT 254

Query:   217 GLHSFLEGGCSPRFLVIDDGWQETIN----EFCKDGEPLI-EGTQFAIRLVDIKENCKF- 270
             G+  F +GG  P+F++IDDGWQ +IN    E  KD E L+  G Q   RL   KE  KF 
Sbjct:   255 GVKEFEDGGVCPKFVIIDDGWQ-SINFDGDELDKDAENLVLGGEQMTARLTSFKECKKFR 313

Query:   271 NSSGSDNSCNDLHEFIDEIKEK 292
             N  G     +D   F + +K K
Sbjct:   314 NYKGGSFITSDASHF-NPLKPK 334

 Score = 85 (35.0 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
 Identities = 20/58 (34%), Positives = 34/58 (58%)

Query:   269 KFNSSGSDN-SCNDLHEFIDEIKEKY-GLKYVYMWHALAGYWGGVLPSSDI-MKKDIA 323
             K  S GSD+ S + +  F  +++ ++  L  +Y+WHAL G W GV P + + +K  +A
Sbjct:   385 KEESLGSDDVSGSGMAAFTKDLRLRFKSLDDIYVWHALCGAWNGVRPETMMDLKAKVA 442

 Score = 49 (22.3 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query:    10 IKDGCLMVRGNV-VLTGVPQNVVVSPSSFIGATSAAP 45
             + +G L  + +  +L  VPQNV  +P S    ++ AP
Sbjct:    36 LSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAP 72


>UNIPROTKB|Q93XK2
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 677 (243.4 bits), Expect = 4.5e-105, Sum P(2) = 4.5e-105
 Identities = 155/427 (36%), Positives = 237/427 (55%)

Query:   313 PSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGS 372
             P  D   +D+A+  + K  +G++ P +  + Y+ +HSYLA SG+ GVKVDV   +E +  
Sbjct:   432 PGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYVCD 491

Query:   373 GYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM--- 428
              YGGRV L + Y + L +S+  NF  N +I  M H N +    + + ++ R  +DF    
Sbjct:   492 EYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFWFQD 551

Query:   429 P-GEP--TF--QTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYV 483
             P G+P  +F  Q +H+   ++NSL +G+++ PDWDMFQS H  A+FHA +RA+ G  +YV
Sbjct:   552 PNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGPIYV 611

Query:   484 SDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVI 543
             SD  G HDF ++K+LV PDG++ +  +   PTRDCLF++P+ D  ++LKIWN NK  GVI
Sbjct:   612 SDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYGGVI 671

Query:   544 GVFNCQGAGSWPMKEDMHRKPASPLSISG--HVCPLDIEFLERVAGENWNGDCAVYAFNS 601
             G FNCQGAG  P+ +     P     I G  HV  ++ +  E  +      +  VY   +
Sbjct:   672 GAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKAEEYVVYLNQA 731

Query:   602 GVLTKLPKKGN-LEVSLATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFEYI 660
               L+ +  K   ++ ++     E+Y+  P+  L   + FAPIGL +M+NSGG V   EY+
Sbjct:   732 EELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNSGGTVIDLEYV 791

Query:   661 MDLSKYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLPG---ECTL 717
              + +K            F AYSS  PK   ++  E +F +   DG L V +P     C +
Sbjct:   792 GNGAKIKVKGGGS----FLAYSSESPKKFQLNGCEVDFEWLG-DGKLCVNVPWIEEACGV 846

Query:   718 RDIEFVY 724
              D+E  +
Sbjct:   847 SDMEIFF 853

 Score = 383 (139.9 bits), Expect = 4.5e-105, Sum P(2) = 4.5e-105
 Identities = 87/238 (36%), Positives = 132/238 (55%)

Query:    37 FIGATSAAPPSSRHVFTLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEA 96
             F G  S   PS R + ++G   +G  FL +FRFK WW    +GKS S++ METQ +L+E 
Sbjct:    74 FFGF-SHETPSDRLMNSIGSF-NGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131

Query:    97 REDSPLDADAASDNTFYILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAV 156
              E              Y++++P+++  FR+ L     + ++   ESG + V+ S      
Sbjct:   132 PETKS-----------YVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIA 180

Query:   157 FINSGDNPFELIKDSIKILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKE 216
             +++  +NP++L+K++   +  H  +F  LE K IP  +D FGWCTWDAFY  VNP GI  
Sbjct:   181 YVHFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFH 240

Query:   217 GLHSFLEGGCSPRFLVIDDGWQE-TINEFC--KDGEPLI-EGTQFAIRLVDIKENCKF 270
             GL  F +GG  PRF++IDDGWQ  + + +   +D + L+  G Q + RL    E  KF
Sbjct:   241 GLDDFSKGGVEPRFVIIDDGWQSISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKF 298

 Score = 238 (88.8 bits), Expect = 3.0e-52, Sum P(2) = 3.0e-52
 Identities = 53/143 (37%), Positives = 78/143 (54%)

Query:   282 LHEFIDEIKEKY-GLKYVYMWHALAGYWGGVLPSS----------------DIMKKDIAM 324
             L  F  +++ K+ GL  VY+WHAL G WGGV P +                D   +D+A+
Sbjct:   384 LKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETTHLDTKIVPCKLSPGLDGTMEDLAV 443

Query:   325 DSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLLTRQY 384
               + K  +G++ P +  + Y+ +HSYLA SG+ GVKVDV   +E +   YGGRV L + Y
Sbjct:   444 VEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYVCDEYGGRVDLAKVY 503

Query:   385 QQALEQSVAWNFKDNNLICCMSH 407
              + L +S+  NF  N +I  M H
Sbjct:   504 YEGLTKSIVKNFNGNGMIASMQH 526


>ASPGD|ASPL0000010056
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 414 (150.8 bits), Expect = 1.4e-35, P = 1.4e-35
 Identities = 139/482 (28%), Positives = 223/482 (46%)

Query:    94 LEAREDSPLDADAASDNTFYILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAF 153
             L   ED+ L +   +D    +LL   +D        G P  ++   ++S + +  T   F
Sbjct:   213 LNFTEDAILLSFLRTDGVHVVLLGVTVDDTLTVLGSG-PAGEV--VIKSQNDNA-TPSRF 268

Query:   154 EAVFINSGDNPFE-----LIKDSIKILEKHKGTFSHLENKK-IPRHLDWFGWCTWDAFYK 207
             + +   + D  FE     LI ++ +++  ++ T       + +    D   +CTW+   +
Sbjct:   269 QVLAATAAD--FEVATSALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTWNGLGQ 326

Query:   208 QVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQETINEFCKDGEPLIEGTQFAIRLVDIKEN 267
              ++ + I   L      G   R L+IDD WQ   NE        +  TQF          
Sbjct:   327 DLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNEGAGSWHRAL--TQF---------- 374

Query:   268 CKFNSSGSDNSCNDLHEFIDEIKEKY-GLKYVYMWHALAGYWGGVLPSSD---IMK-KDI 322
              + NS    N    L + +  I+E++  ++Y+ +WHAL GYWGG+ P      I K +++
Sbjct:   375 -EANSKAFPNG---LAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTREV 430

Query:   323 AMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLLTR 382
             A++S  +  +  IDP  I  FYND +++L+ SG+ GVK D QS ++ L      R     
Sbjct:   431 ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLLADPEDRRSY-AN 489

Query:   383 QYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSM-----KSAVARASEDFMPGEPTFQTL 437
              YQ A   S   +F     I CMS    +++ S       + V R S DF P      T 
Sbjct:   490 AYQDAWTISSLRHFGPK-AISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSHTW 548

Query:   438 HIASVAFNSLLLGEIV-VPDWDMFQSKHET----AEFHATARALGGCAVYVSDKPGVHDF 492
             H+   A N+LL   +  +PDWDMFQ+  E     A FHA AR + G  +Y++DKPG HD 
Sbjct:   549 HVFCNAHNALLTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQHDI 608

Query:   493 KILKRLVLP--DGSVLRARH--AGRPTRDCLFEDPVMDGKSL-LKIWN--LNKLSGVIGV 545
              ++K++      G+ +  R   A R T D ++ D + +G  L +  ++      SG+IGV
Sbjct:   609 PLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGIIGV 665

Query:   546 FN 547
             FN
Sbjct:   666 FN 667


>UNIPROTKB|Q97U94
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 291 (107.5 bits), Expect = 6.9e-34, Sum P(2) = 6.9e-34
 Identities = 92/274 (33%), Positives = 133/274 (48%)

Query:   280 NDLHEFIDEIKEKYGLKYVYMWHALAGYWGGVLPSSDIMKK----DIAMDSLEKYGVGII 335
             N     +  IK   G+KYV +WHA+  +WGG+  S ++MK         + L  Y V   
Sbjct:   286 NGFKNTVRAIKS-LGVKYVGLWHAINAHWGGM--SQELMKSLNVNGYFTNFLNSY-VPSP 341

Query:   336 DPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYG-GRVLLTRQYQQALEQSVAW 394
             + +    FY      +     D VKVD Q ++  +   +  G  L +R  Q AL+ SV  
Sbjct:   342 NLEDAIGFYKAFDGNILRD-FDLVKVDNQWVIHAIYDSFPIG--LASRNIQIALQYSVG- 397

Query:   395 NFKDNNLICCMSHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIASVAFNSLLLGEIVV 454
               KD  +I CMS N  +  +   S V R S D++P       LHI   A+NSLL   IV 
Sbjct:   398 --KD--VINCMSMNPENYCNYFYSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVY 453

Query:   455 PDWDMFQSKHETAEFHATARALGGCAVYVSDK-PGVHDFKILKRLVLPDGSVLRARHAGR 513
             PD+DMF S    A+ H  AR   G  +Y++D+ P   + ++L+  VLP+G V+R      
Sbjct:   454 PDYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPAL 513

Query:   514 PTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFN 547
              T D LF+DP+ + + LLK+    K    I  FN
Sbjct:   514 ITEDLLFKDPLRE-RVLLKLKGKVKGYNAIAFFN 546

 Score = 157 (60.3 bits), Expect = 6.9e-34, Sum P(2) = 6.9e-34
 Identities = 32/99 (32%), Positives = 57/99 (57%)

Query:   149 TSEAFEAVFINSG--DNPFELIKDSIKILEKHKGTFSHLENKKIP-RHLDWFGWCTWDAF 205
             T E   + F++ G  DNP++ I+++I I  K   TF   + K  P + ++  GWC+W+AF
Sbjct:   173 TDEIKRSYFLSIGTSDNPYKAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAF 232

Query:   206 Y-KQVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQETINE 243
               K +N + + + +   +E G    +++IDDGWQ+  N+
Sbjct:   233 LTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNND 271


>UNIPROTKB|G4NBB7
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 377 (137.8 bits), Expect = 2.0e-31, P = 2.0e-31
 Identities = 137/487 (28%), Positives = 218/487 (44%)

Query:   195 DWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQETINEFCKDGEPLIEG 254
             D F +CTW++  + ++   I   L    E G +   L+IDD WQ        DG+    G
Sbjct:   334 DGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSL------DGD----G 383

Query:   255 TQFAIRLVDIKENCKFNSSGSDNSCNDLHEFIDEI-KEKYGLKYVYMWHALAGYWGGVLP 313
             +  + R     E  + N  G       L   + EI K+   ++ + +WH + GYWGG+ P
Sbjct:   384 SDASRRRW---ERFEANQQGFPQGLKGL---VSEIRKQNPQIRNIAVWHGIFGYWGGMSP 437

Query:   314 SSDI-----MKKDIAMDSLE----KYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQ 364
             S  +     M+K    D  E     +    +D + +   Y+D +++LA+ GV   KVD Q
Sbjct:   438 SGPMASKYKMRKIQLRDEAEVQPKDFDFYTVDGEDVHKMYDDFYAFLADCGVSAAKVDTQ 497

Query:   365 SLMETLGSGYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSM----KSA- 419
               ++        R  L R YQ A   + + +F     I CM+    S+  S+    +S  
Sbjct:   498 GFLDYPAHA-NDRKNLIRPYQDAWTAAASKHF-GGRAIACMAQTPQSILHSLLQQGRSEG 555

Query:   420 ---VARASEDFMPGEPTFQTLHIASVAFNSLLLGEI-VVPDWDMFQSKH-ETAEFHATAR 474
                +AR S+DF P E    T H+   A N+LL+  + V+ DWDMFQ+   + A  HA AR
Sbjct:   556 PMLMARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVAR 615

Query:   475 ALGGCAVYVSDKPGVHDFKILKRLVLPDGSVLR-ARHAGRPTRDCLFEDPVMDGKSLLKI 533
             ++ G  +Y++D PG HD +++K++          A  A  P R  L+       + LL++
Sbjct:   616 SMSGGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT-LWPYGGHGEQRLLRV 674

Query:   534 WNLNKLSGVIGVFNCQGAGSWPMKEDMHRKPASPLSISGHVCPLDIEFLERVAGENWNGD 593
              + ++  G++GVFN    GS                + G    LD  F    AGE   G 
Sbjct:   675 RSGHQGVGMLGVFNVCNRGS----------------LLGEQVRLDDIFDGEKAGE---GS 715

Query:   594 CAVYAFNSG-VLTKLPKKGNLEVSLATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGG 652
               +  F++G ++    ++  +EV L     EI+T  PI  LG  L  A +GL+    +  
Sbjct:   716 FVISRFSTGEMIAPASRETVIEVGLEEGGFEIFTAYPITKLG-GLAVATLGLVGKMATAA 774

Query:   653 AVESFEY 659
             AV    Y
Sbjct:   775 AVSHVSY 781


>UNIPROTKB|Q8A170
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 219 (82.2 bits), Expect = 9.6e-19, Sum P(2) = 9.6e-19
 Identities = 64/245 (26%), Positives = 113/245 (46%)

Query:   290 KEKYGLKYVYMWHALAGYWGGVLPSSDIMKKDIAMDSLEKYGVGII---DPQKIFDFYND 346
             K+   ++++ +W++L+GYW G+   +D   +      L  Y   ++     +KI  +Y  
Sbjct:   299 KQADKIRWIGLWYSLSGYWMGISAENDFPPE--IRQVLHSYNGSLLPGTSTEKIETWYEY 356

Query:   347 LHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLL-TRQYQQALEQSVAWNFKDNNLICCM 405
                 +   G D +K+D QS    L  G G +V+   +    ALE     +     L+ CM
Sbjct:   357 YVRTMKEYGFDFLKIDNQSFTLPLYMG-GTQVIRQAKDCNLALEHQT--HRMQMGLMNCM 413

Query:   406 SHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIASVAFNSLLLGEIVVPDWDMFQSKHE 465
             + N  ++  ++ S+V RAS D+   +      H+     N+L+LG+ V PD DMF S   
Sbjct:   414 AQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDHDMFHSCDT 473

Query:   466 TA-EFHATARALGGCAVYVSDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPV 524
                   A ++A+ G  VY+SD P       ++ L+   G + R      PT + +  +P+
Sbjct:   474 VCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTPESILTNPL 533

Query:   525 MDGKS 529
               GK+
Sbjct:   534 QSGKA 538

 Score = 90 (36.7 bits), Expect = 9.6e-19, Sum P(2) = 9.6e-19
 Identities = 18/67 (26%), Positives = 33/67 (49%)

Query:   170 DSIKILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPR 229
             DS+ I +K         +K+     D+ GWCTW+ ++  ++   I   + +    G   R
Sbjct:   204 DSL-IADKAVSALRKRADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVR 262

Query:   230 FLVIDDG 236
             +++IDDG
Sbjct:   263 YVLIDDG 269


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.138   0.426    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      724       714   0.00084  121 3  11 22  0.39    34
                                                     36  0.45    37
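
Note: the bit scores reported in the alignments above appear consistent with the standard Karlin-Altschul conversion, bits = (Lambda*S - ln K) / ln 2, using the gapped (Q=9,R=2) Lambda and K from the table rather than the ungapped BLOSUM62 values. The sketch below is an illustration and not part of the BLAST output. The function name bit_score is hypothetical, and Lambda and K are the rounded figures printed above, so agreement with the reported bits should only be expected to about one decimal place.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of the table.
# These are rounded as printed, so results are approximate.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores and reported bit scores copied from the alignments in this report.
reported = {414: 150.8, 377: 137.8, 291: 107.5, 157: 60.3, 90: 36.7}

for raw, bits in reported.items():
    print(f"S={raw:3d}  computed={bit_score(raw):6.1f}  reported={bits}")
```

Substituting the ungapped values (Lambda=0.320, K=0.138) into the same function gives noticeably larger numbers, which is why the gapped row is assumed here.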


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  628 (67 KB)
  Total size of DFA:  403 KB (2194 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  56.13u 0.14s 56.27t   Elapsed:  00:00:02
  Total cpu time:  56.13u 0.14s 56.27t   Elapsed:  00:00:02
  Start:  Sat May 11 14:23:34 2013   End:  Sat May 11 14:23:36 2013
