BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>004032
MAPSLSKNVLDAIGLLDSQIPPSISLEGSNFLANGHPIFTQVPINIIATPSPFTSANKTK
HTAGCFVGFDADESSDRHVVPIGKLNGIRFMSIFRFKAWWTTHWVGNSGKDMEHETHLMI
LDKNDLGRPYVLLLPILEGPFRASLQPGTDNYVDMCVESGSSQIRCSSFRSCLYMRVGDD
PYSLVKEAMKVVRVHLGTFKLLEEKTVPGIVDKFGWCTWDAFYLQVHPKGVYEGVKGLVE
GGCPPGLVLIDDGWQSICHDDEPIIDQEGMNRTSAGEQMPCRLIDFEENYKFRDYKSPRV
PSNKGMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPESRLIAPKLSQGLQTT
MEDLAVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLEMVAEDFGGRV
ELAKAYYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRVGDDFWCSDPKGVKN
GTFWLQGCHMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDSVGN
HNFDLLKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNKHTGVLGLFNCQ
GGGWCSVTRKNVGFSMFSNTLTCLASPNDIEWNNGKDPISVKGVDVFAVYKFQENKLKLL
KFSDDLEVTVEPFNFELLTVSPVTVLPKGSIQFAPIGLVNMLNTGGAVQSLAFDDDENLV
RIEVKGCGEMKVFASEKPLMCKVDGASAEFSYEDQMATVQVPWPNNSSKLTVVEFLFE

High Scoring Gene Products

Symbol     Full name                                  Information                                         P value
SIP1       AT5G40390                                  protein from Arabidopsis thaliana                   1.0e-283
RFS        Galactinol--sucrose galactosyltransferase  protein from Oryza sativa Japonica Group            1.6e-276
STS1       Stachyose synthase                         protein from Pisum sativum                          5.4e-195
STS        AT4G01970                                  protein from Arabidopsis thaliana                   4.5e-189
SIP2       AT3G57520                                  protein from Arabidopsis thaliana                   1.1e-150
SIP1       AT1G55740                                  protein from Arabidopsis thaliana                   2.2e-146
galS       Alpha-galactosidase                        protein from Sulfolobus solfataricus P2             3.9e-28
MGG_11554  Seed imbibition protein                    protein from Magnaporthe oryzae 70-15               1.1e-27
BT_3797    Possible alpha-galactosidase               protein from Bacteroides thetaiotaomicron VPI-5482  1.0e-10

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  004032
        (778 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  2726  1.0e-283  1
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  2658  1.6e-276  1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...  1291  5.4e-195  2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...  1235  4.5e-189  2
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  1396  1.1e-150  2
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  1430  2.2e-146  1
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   224  3.9e-28   3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   290  1.1e-27   3
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   278  1.0e-23   2
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   186  1.0e-10   1
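
The rows of the summary table above have a regular layout: an accession, a truncated description, then the high score, the smallest sum probability P(N), and N. The following is a minimal Python sketch for reading one such row, assuming that layout holds for every row. The names `Hit` and `parse_hit_row` are hypothetical helpers introduced here for illustration and are not part of any BLAST distribution.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing High-scoring Segment Pairs' table."""
    accession: str
    description: str
    score: int
    p_value: float
    n: int


# Assumed row layout: "<accession> - <description>  <score>  <P(N)>  <N>".
# The description is matched lazily so that the last three whitespace-separated
# tokens on the line are taken as score, P(N) and N.
_ROW = re.compile(
    r"^(\S+) - (.*?)\s+(\d+)\s+(\d+(?:\.\d*)?(?:[eE][+-]?\d+)?)\s+(\d+)\s*$"
)


def parse_hit_row(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line is not a row."""
    match = _ROW.match(line)
    if match is None:
        return None
    accession, description, score, p_value, n = match.groups()
    return Hit(accession, description, int(score), float(p_value), int(n))


if __name__ == "__main__":
    row = 'TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  2726  1.0e-283  1'
    print(parse_hit_row(row))
```

Header and blank lines return `None`, so the function can be mapped over every line of the report and the results filtered.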


>TAIR|locus:2170528
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 2726 (964.7 bits), Expect = 1.0e-283, P = 1.0e-283
 Identities = 500/788 (63%), Positives = 611/788 (77%)

Query:     2 APSLSKNVLDAIGLLDSQIPPSISLEGSNFLANGHPIFTQVPINIIATPSPFT---SANK 58
             +P L+K+  D+ G+          LE S  LANG  + T VP+N+  T SP+        
Sbjct:     3 SPCLTKS--DS-GINGVDFTEKFRLEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVP 59

Query:    59 TKHTAGCFVGFDAD-ESSDRHVVPIGKLNGIRFMSIFRFKAWWTTHWVGNSGKDMEHETH 117
                +AG F+GF+ D E    HV  IGKL  IRFMSIFRFK WWTTHWVG++G+D+E+ET 
Sbjct:    60 LDVSAGSFIGFNLDGEPKSHHVASIGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQ 119

Query:   118 LMILDKNDL--------GRPYVLLLPILEGPFRASLQPGTDNYVDMCVESGSSQIRCSSF 169
             ++ILD++          GRPYVLLLP+LEG FR+S Q G D+ V +CVESGS+++  S F
Sbjct:   120 IIILDQSGSDSGPGSGSGRPYVLLLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEF 179

Query:   170 RSCLYMRVGDDPYSLVKEAMKVVRVHLGTFKLLEEKTVPGIVDKFGWCTWDAFYLQVHPK 229
             R  +Y+  GDDP+ LVK+AMKV+RVH+ TFKLLEEK+ PGIVDKFGWCTWDAFYL V+P 
Sbjct:   180 RQIVYVHAGDDPFKLVKDAMKVIRVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPD 239

Query:   230 GVYEGVKGLVEGGCPPGLVLIDDGWQSICHDDEPIIDQEGMNRTSAGEQMPCRLIDFEEN 289
             GV++GVK LV+GGCPPGLVLIDDGWQSI HD + I D EGMN T AGEQMPCRL+ FEEN
Sbjct:   240 GVHKGVKCLVDGGCPPGLVLIDDGWQSIGHDSDGI-DVEGMNITVAGEQMPCRLLKFEEN 298

Query:   290 YKFRDYKSPRVPSNKGMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPESRLI 349
             +KF+DY SP+  ++ GM AFVRDLKDEF +V+++YVWHALCGYWGG+RP    +P S +I
Sbjct:   299 HKFKDYVSPKDQNDVGMKAFVRDLKDEFSTVDYIYVWHALCGYWGGLRPEAPALPPSTII 358

Query:   350 APKLSQGLQTTMEDLAVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLL 409
              P+LS GL+ TMEDLAV+KI++ G+G   P+L +  YEGLHSHL++ GIDGVKVDVIH+L
Sbjct:   359 RPELSPGLKLTMEDLAVDKIIETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHIL 418

Query:   410 EMVAEDFGGRVELAKAYYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRVGDD 469
             EM+ + +GGRV+LAKAY+KALT+SV KHF GNGVIASMEHCNDFM+LGTE ISLGRVGDD
Sbjct:   419 EMLCQKYGGRVDLAKAYFKALTSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDD 478

Query:   470 FWCSDPKGVKNGTFWLQGCHMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRAISG 529
             FWC+DP G  NGTFWLQGCHMVHCAYNSLWMGN IQPDWDMFQSTHPCAEFHAASRAISG
Sbjct:   479 FWCTDPSGDPNGTFWLQGCHMVHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISG 538

Query:   530 GPIYISDSVGNHNFDLLKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNK 589
             GPIYISD VG H+FDLLK LV+P+GSILRC++YALPTRD LFE+PLHDGKT+LKIWNLNK
Sbjct:   539 GPIYISDCVGKHDFDLLKRLVLPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNK 598

Query:   590 HTGVLGLFNCQGGGWCSVTRKNVGFSMFSNTLTCLASPNDIEWNNGKDPISVKGVDVFAV 649
             +TGV+G FNCQGGGWC  TR+N  FS   NTLT   SP D+EWN+G  PIS+  V+ FA+
Sbjct:   599 YTGVIGAFNCQGGGWCRETRRNQCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL 658

Query:   650 YKFQENKLKLLKFSDDLEVTVEPFNFELLTVSPVTVLPKGSIQFAPIGLVNMLNTGGAVQ 709
             +  Q  KL L   +DDLE+T+EPF FEL+TVSPV  +   S++FAPIGLVNMLNT GA++
Sbjct:   659 FLSQSKKLLLSGLNDDLELTLEPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIR 718

Query:   710 SLAFDDDENLVRIEVKGCGEMKVFASEKPLMCKVDGASAEFSYEDQMATVQVPWPNNSSK 769
             SL ++D+   V + V G GE +V+AS+KP+ C +DG   EF YED M  VQVPW +    
Sbjct:   719 SLVYNDES--VEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPW-SGPDG 775

Query:   770 LTVVEFLF 777
             L+ +++LF
Sbjct:   776 LSSIQYLF 783
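
Each alignment header reports identities and positives as a count over the alignment length, followed by a percentage. In this report the percentages appear to be truncated rather than rounded (for example, 611/788 is about 77.5% and is shown as 77%). That is an observation from the figures here, not a documented guarantee. The Python sketch below re-derives the percentages under that assumption; `check_alignment_stats` is a hypothetical helper name.

```python
import re

# Matches e.g. " Identities = 500/788 (63%), Positives = 611/788 (77%)"
_STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def check_alignment_stats(line: str) -> dict:
    """Parse an Identities/Positives line and compare the printed
    percentages with truncated (floor) percentages of the raw counts."""
    match = _STATS.search(line)
    if match is None:
        raise ValueError("not an Identities/Positives line")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = map(int, match.groups())
    return {
        "identity_fraction": id_n / id_len,
        "positive_fraction": pos_n / pos_len,
        # Assumption: percentages are integer-truncated, not rounded.
        "identity_pct_matches": id_n * 100 // id_len == id_pct,
        "positive_pct_matches": pos_n * 100 // pos_len == pos_pct,
    }


if __name__ == "__main__":
    line = " Identities = 500/788 (63%), Positives = 611/788 (77%)"
    print(check_alignment_stats(line))
```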


>UNIPROTKB|Q5VQG4
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 2658 (940.7 bits), Expect = 1.6e-276, P = 1.6e-276
 Identities = 500/795 (62%), Positives = 604/795 (75%)

Query:     1 MAPSLSKNVLDAIG---LLDSQI-PPSISLEGSNFLANGHPIFTQVPINIIATPSPFTSA 56
             MAP+LSK   D IG    +D  I PP  +L+G +   +GHP    VP NI  TP+     
Sbjct:     1 MAPNLSKAKDDLIGDVVAVDGLIKPPRFTLKGKDLAVDGHPFLLDVPANIRLTPASTLVP 60

Query:    57 NKT--KHTAGCFVGFDADESSDRHVVPIGKLNGIRFMSIFRFKAWWTTHWVGNSGKDMEH 114
             N       AG F+GFDA  + DRHVVPIGKL   RFMSIFRFK WWTTHWVG +G+D+E+
Sbjct:    61 NSDVPAAAAGSFLGFDAPAAKDRHVVPIGKLRDTRFMSIFRFKVWWTTHWVGTNGRDVEN 120

Query:   115 ETHLMILDKNDL-----G-RPYVLLLPILEGPFRASLQPG-TDNYVDMCVESGSSQIRCS 167
             ET +MILD++       G RPYVLLLPI+EGPFRA L+ G  ++YV M +ESGSS +R S
Sbjct:   121 ETQMMILDQSGTKSSPTGPRPYVLLLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGS 180

Query:   168 SFRSCLYMRVGDDPYSLVKEAMKVVRVHLGTFKLLEEKTVPGIVDKFGWCTWDAFYLQVH 227
              FRS +Y+  GDDP+ LVK+AM+VVR HLGTF+L+EEKT P IVDKFGWCTWDAFYL+VH
Sbjct:   181 VFRSAVYLHAGDDPFDLVKDAMRVVRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVH 240

Query:   228 PKGVYEGVKGLVEGGCPPGLVLIDDGWQSICHDDEPI-IDQEGMNRTSAGEQMPCRLIDF 286
             P+GV+EGV+ L +GGCPPGLVLIDDGWQSICHDD+ +    EGMNRTSAGEQMPCRLI F
Sbjct:   241 PEGVWEGVRRLADGGCPPGLVLIDDGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKF 300

Query:   287 EENYKFRDYKSPRVPSNKGMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPES 346
             +ENYKFR+YK        GMG FVR++K  F +VE VYVWHALCGYWGG+RP   G+P +
Sbjct:   301 QENYKFREYKG-------GMGGFVREMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLPPA 353

Query:   347 RLIAPKLSQGLQTTMEDLAVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVI 406
             +++AP+LS GLQ TMEDLAV+KIV+NGVGLV P   + LYEGLHSHL++ GIDGVKVDVI
Sbjct:   354 KVVAPRLSPGLQRTMEDLAVDKIVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVI 413

Query:   407 HLLEMVAEDFGGRVELAKAYYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRV 466
             HLLEMV E++GGRVELAKAY+  LT SVR+HF GNGVIASMEHCNDFM LGTE ++LGRV
Sbjct:   414 HLLEMVCEEYGGRVELAKAYFAGLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRV 473

Query:   467 GDDFWCSDPKGVKNGTFWLQGCHMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRA 526
             GDDFWC+DP G  +GTFWLQGCHMVHCAYNSLWMG  I PDWDMFQSTHPCA FHAASRA
Sbjct:   474 GDDFWCTDPSGDPDGTFWLQGCHMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRA 533

Query:   527 ISGGPIYISDSVGNHNFDLLKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWN 586
             +SGGP+Y+SD+VG H+FDLL+ L +PDG+ILRC+ YALPTRDCLF +PLHDGKT+LKIWN
Sbjct:   534 VSGGPVYVSDAVGCHDFDLLRRLALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWN 593

Query:   587 LNKHTGVLGLFNCQGGGWCSVTRKNVGFSMFSNTLTCLASPNDIEWNNGKDPISVKGVDV 646
             +NK +GVLG FNCQGGGW    R+N+  + FS  +T  ASP D+EW++G       G D 
Sbjct:   594 VNKFSGVLGAFNCQGGGWSREARRNMCAAGFSVPVTARASPADVEWSHGGG-----GGDR 648

Query:   647 FAVYKFQENKLKLLKFSDDLEVTVEPFNFELLTVSPVTVL--PKGSIQFAPIGLVNMLNT 704
             FAVY  +  KL+LL+  + +E+T+EPF +ELL V+PV  +  P+  I FAPIGL NMLN 
Sbjct:   649 FAVYFVEARKLQLLRRDESVELTLEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNA 708

Query:   705 GGAVQSL--AFDDDENLVRIEVKGCGEMKVFASEKPLMCKVDGASAEFSYEDQMATVQVP 762
             GGAVQ    A  D +    + VKG GEM  ++S +P +CKV+G  AEF YED + TV VP
Sbjct:   709 GGAVQGFEAARKDGDVAAEVAVKGAGEMVAYSSARPRLCKVNGQDAEFKYEDGIVTVDVP 768

Query:   763 WPNNSSKLTVVEFLF 777
             W  +S KL+ VE+ +
Sbjct:   769 WTGSSKKLSRVEYFY 783


>UNIPROTKB|Q93XK2
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 1291 (459.5 bits), Expect = 5.4e-195, Sum P(2) = 5.4e-195
 Identities = 242/489 (49%), Positives = 327/489 (66%)

Query:   291 KFRDYKSPRVPSNKGMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPESRLIA 350
             +F   +   + S  G+ AF +DL+ +FK ++ VYVWHALCG WGG+RP    + +++++ 
Sbjct:   369 QFSSGEKSEMKSEYGLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETTHL-DTKIVP 427

Query:   351 PKLSQGLQTTMEDLAVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLE 410
              KLS GL  TMEDLAV +I    +GLV P     LY+ +HS+L   GI GVKVDVIH LE
Sbjct:   428 CKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLE 487

Query:   411 MVAEDFGGRVELAKAYYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRVGDDF 470
              V +++GGRV+LAK YY+ LT S+ K+F GNG+IASM+HCNDF +LGT+ IS+GRVGDDF
Sbjct:   488 YVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDF 547

Query:   471 WCSDPKGVKNGTFWLQGCHMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRAISGG 530
             W  DP G   G+FWLQG HM+HC+YNSLWMG +IQPDWDMFQS H CA+FHA SRAI GG
Sbjct:   548 WFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGG 607

Query:   531 PIYISDSVGNHNFDLLKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNKH 590
             PIY+SD+VG+H+FDL+K LV PDG+I +C ++ LPTRDCLF+NPL D  TVLKIWN NK+
Sbjct:   608 PIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKY 667

Query:   591 TGVLGLFNCQGGGWCSVTRKNVGFSMFSNTLTCLASPNDIEWNNGKDPISVKGVDVFAVY 650
              GV+G FNCQG GW  + +K  GF      +       ++EW+  ++   +   + + VY
Sbjct:   668 GGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKAEEYVVY 727

Query:   651 KFQENKLKLLKF-SDDLEVTVEPFNFELLTVSPVTVLPKGSIQFAPIGLVNMLNTGGAVQ 709
               Q  +L L+   S+ ++ T++P  FEL +  PVT L  G I+FAPIGL NM N+GG V 
Sbjct:   728 LNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLC-GGIKFAPIGLTNMFNSGGTVI 786

Query:   710 SLAFDDDENLVRIEVKGCGEMKVFASEKPLMCKVDGASAEFSY-EDQMATVQVPWPNNSS 768
              L +    N  +I+VKG G    ++SE P   +++G   +F +  D    V VPW   + 
Sbjct:   787 DLEYVG--NGAKIKVKGGGSFLAYSSESPKKFQLNGCEVDFEWLGDGKLCVNVPWIEEAC 844

Query:   769 KLTVVEFLF 777
              ++ +E  F
Sbjct:   845 GVSDMEIFF 853

 Score = 620 (223.3 bits), Expect = 5.4e-195, Sum P(2) = 5.4e-195
 Identities = 131/312 (41%), Positives = 186/312 (59%)

Query:     1 MAPSLSKNVLDAIGLLDSQIPPSISLEGSNFLANGHPIFTQVPINI-------IATPS-- 51
             MAP L+    + I     +      L    F   G P+F  VP N+       I  PS  
Sbjct:     1 MAPPLNSTTSNLI-----KTESIFDLSERKFKVKGFPLFHDVPENVSFRSFSSICKPSES 55

Query:    52 --PFTSANKT---KHTAGCFVGFDADESSDRHVVPIGKLNGIRFMSIFRFKAWWTTHWVG 106
               P +   K     H  G F GF  +  SDR +  IG  NG  F+SIFRFK WW+T W+G
Sbjct:    56 NAPPSLLQKVLAYSHKGG-FFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIG 114

Query:   107 NSGKDMEHETHLMILDKNDLGRPYVLLLPILEGPFRASLQPGTDNYVDMCVESGSSQIRC 166
              SG D++ ET  ++++  +  + YV+++PI+E  FR++L PG +++V +  ESGS++++ 
Sbjct:   115 KSGSDLQMETQWILIEVPET-KSYVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKE 173

Query:   167 SSFRSCLYMRVGDDPYSLVKEAMKVVRVHLGTFKLLEEKTVPGIVDKFGWCTWDAFYLQV 226
             S+F S  Y+   ++PY L+KEA   +RVHL +F+LLEEKT+P +VDKFGWCTWDAFYL V
Sbjct:   174 STFNSIAYVHFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTV 233

Query:   227 HPKGVYEGVKGLVEGGCPPGLVLIDDGWQSICHDD-EPIIDQEGMNRTSAGEQMPCRLID 285
             +P G++ G+    +GG  P  V+IDDGWQSI  D  +P  +++  N    GEQM  RL  
Sbjct:   234 NPIGIFHGLDDFSKGGVEPRFVIIDDGWQSISFDGYDP--NEDAKNLVLGGEQMSGRLHR 291

Query:   286 FEENYKFRDYKS 297
             F+E YKFR Y+S
Sbjct:   292 FDECYKFRKYES 303


>TAIR|locus:2141425
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 1235 (439.8 bits), Expect = 4.5e-189, Sum P(2) = 4.5e-189
 Identities = 238/500 (47%), Positives = 321/500 (64%)

Query:   286 FEENYKFRDYKSPRVPSNKGMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPE 345
             F+E  K     S  V S  GM AF +DL+  FKS++ +YVWHALCG W G+RP    M  
Sbjct:   380 FDEVEKEESLGSDDV-SGSGMAAFTKDLRLRFKSLDDIYVWHALCGAWNGVRPETM-MDL 437

Query:   346 SRLIAP-KLSQGLQTTMEDLAVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVD 404
                +AP +LS  L  TM DLAV+K+V+ G+GLV P      Y+ +HS+L SVG+ G K+D
Sbjct:   438 KAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPSKAHEFYDSMHSYLASVGVTGAKID 497

Query:   405 VIHLLEMVAEDFGGRVELAKAYYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLG 464
             V   LE +AE+ GGRVELAKAYY  LT S+ K+F G  VIASM+ CN+F +L T+ IS+G
Sbjct:   498 VFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNGTDVIASMQQCNEFFFLATKQISIG 557

Query:   465 RVGDDFWCSDPKGVKNGTFWLQGCHMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAAS 524
             RVGDDFW  DP G   G +WLQG HM+HC+YNS+WMG +IQPDWDMFQS H CAE+HAAS
Sbjct:   558 RVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWMGQMIQPDWDMFQSDHVCAEYHAAS 617

Query:   525 RAISGGPIYISDSVG--NHNFDLLKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVL 582
             RAI GGP+Y+SD +G  +HNFDL+K L   DG+I RC  YALPTRD LF+NPL D +++L
Sbjct:   618 RAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPRCVHYALPTRDSLFKNPLFDKESIL 677

Query:   583 KIWNLNKHTGVLGLFNCQGGGWCSVTRKNVGFSMFSNTLTCLASPNDIEWNNGKDPIS-- 640
             KI+N NK  GV+G FNCQG GW     +  G+     T++     +DIEW+   +     
Sbjct:   678 KIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECYTTVSGTVHVSDIEWDQNPEAAGSQ 737

Query:   641 VKGVDVFAVYKFQENKLKLLKF-SDDLEVTVEPFNFELLTVSPVTVLPKGSIQFAPIGLV 699
             V     + VYK Q  ++  +   S+ +++T+EP  F+LL+  PVT L    ++FAP+GL+
Sbjct:   738 VTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAFDLLSFVPVTELVSSGVRFAPLGLI 797

Query:   700 NMLNTGGAVQSLAFDDDENLVRIEVKGCGEMKVFASEKPLMCKVDGASAEFSYEDQMATV 759
             NM N  G VQ +    D N +R++VKG G    ++S  P+ C ++   AEF +E++   +
Sbjct:   798 NMFNCVGTVQDMKVTGD-NSIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKL 856

Query:   760 Q--VPWPNNSSKLTVVEFLF 777
                VPW   S  ++ + F F
Sbjct:   857 SFFVPWVEESGGISHLSFTF 876

 Score = 620 (223.3 bits), Expect = 4.5e-189, Sum P(2) = 4.5e-189
 Identities = 129/288 (44%), Positives = 176/288 (61%)

Query:    21 PPSISL-EGSNFLANGHPIFTQVPINIIATP---------SPFTSANKTKHTA--GCFVG 68
             P S +L EGS    +  PI   VP N+  TP         +P     + +  A  G F+G
Sbjct:    31 PNSFNLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILLRVQANAHKGGFLG 90

Query:    69 FDADESSDRHVVPIGKLNGIRFMSIFRFKAWWTTHWVGNSGKDMEHETHLMILDKNDLGR 128
             F  +  SDR    +G+     F+S+FRFK WW+T W+G SG D++ ET  ++L   ++  
Sbjct:    91 FTKESPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIPEIDS 150

Query:   129 PYVLLLPILEGPFRASLQPGTDNYVDMCVESGSSQIRCSSFRSCLYMRVGDDPYSLVKEA 188
              YV ++P +EG FRASL PG    V +C ESGS++++ SSF+S  Y+ + D+PY+L+KEA
Sbjct:   151 -YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIAYIHICDNPYNLMKEA 209

Query:   189 MKVVRVHLGTFKLLEEKTVPGIVDKFGWCTWDAFYLQVHPKGVYEGVKGLVEGGCPPGLV 248
                +RVH+ TFKLLEEK +P IVDKFGWCTWDA YL V P  ++ GVK   +GG  P  V
Sbjct:   210 FSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFEDGGVCPKFV 269

Query:   249 LIDDGWQSICHDDEPIIDQEGMNRTSAGEQMPCRLIDFEENYKFRDYK 296
             +IDDGWQSI  D + + D++  N    GEQM  RL  F+E  KFR+YK
Sbjct:   270 IIDDGWQSINFDGDEL-DKDAENLVLGGEQMTARLTSFKECKKFRNYK 316


>TAIR|locus:2103488
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 1396 (496.5 bits), Expect = 1.1e-150, Sum P(2) = 1.1e-150
 Identities = 293/706 (41%), Positives = 417/706 (59%)

Query:    20 IPPSISLEGSNFLANGHPIFTQVPINIIATPSPFTSANKTKHTAGCFVGFDADESSDRHV 79
             I  +IS++  N +  G  I T++P NII TP    + N     +G F+G   ++S   HV
Sbjct:     3 ITSNISVQNDNLVVQGKTILTKIPDNIILTP---VTGNG--FVSGSFIGATFEQSKSLHV 57

Query:    80 VPIGKLNGIRFMSIFRFKAWWTTHWVGNSGKDMEHETHLMILDKNDL--GR----P--YV 131
              PIG L G+RFM  FRFK WW T  +G+ GKD+  ET  M+L+  D   G     P  Y 
Sbjct:    58 FPIGVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAPTVYT 117

Query:   132 LLLPILEGPFRASLQPGTDNYVDMCVESGSSQIRCSSFRSCLYMRVGDDPYSLVKEAMKV 191
             + LP+LEG FRA LQ    N +++C ESG   +  S     +Y+  G +P+ ++++++K 
Sbjct:   118 VFLPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKA 177

Query:   192 VRVHLGTFKLLEEKTVPGIVDKFGWCTWDAFYLQVHPKGVYEGVKGLVEGGCPPGLVLID 251
             V  H+ TF   E+K +P  +D FGWCTWDAFY  V  +GV EG+K L EGG PP  ++ID
Sbjct:   178 VERHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIID 237

Query:   252 DGWQSICHD--DEPIIDQEGMNRTSAGEQMPCRLIDFEENYKFR--DYKSPRVPSNKGMG 307
             DGWQ I +   DE  + QEG        Q   RL+  +EN KF+  D K  +V    G+ 
Sbjct:   238 DGWQQIENKEKDENCVVQEGA-------QFATRLVGIKENAKFQKSDQKDTQV---SGLK 287

Query:   308 AFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMP--ESRLIAPKLSQGLQTTMEDLA 365
             + V + K    +V+ VY WHAL GYWGG++P  +GM   +S L  P  S G+     D+ 
Sbjct:   288 SVVDNAKQRH-NVKQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIV 346

Query:   366 VEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLEMVAEDFGGRVELAKA 425
             ++ +  +G+GLV P+ V N Y  LHS+L S GIDGVKVDV +++E +    GGRV L ++
Sbjct:   347 MDSLAVHGLGLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRS 406

Query:   426 YYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRVGDDFWCSDPKGVKNGTFWL 485
             Y +AL AS+ ++F  NG I+ M H  D +Y   +T ++ R  DDF+  DP          
Sbjct:   407 YQQALEASIARNFTDNGCISCMCHNTDGLYSAKQT-AIVRASDDFYPRDPAS-------- 457

Query:   486 QGCHMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDSVGNHNFDL 545
                H+   AYNSL++G  +QPDWDMF S HP AE+HAA+RA+ G  IY+SD  GNHNFDL
Sbjct:   458 HTIHIASVAYNSLFLGEFMQPDWDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDL 517

Query:   546 LKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNKHTGVLGLFNCQGGGWC 605
             L+ LV+PDGS+LR +    PTRDCLF +P  DG ++LKIWN+NK TG++G+FNCQG GWC
Sbjct:   518 LRKLVLPDGSVLRAKLPGRPTRDCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWC 577

Query:   606 SVTRKNVGFSMFSNTLTCLASPNDIEWNNGKDPISVKGVDVFAVYKFQENKLKLLKFSDD 665
               T+KN        TLT     +D +  +        G  +  VY ++  ++  L     
Sbjct:   578 KETKKNQIHDTSPGTLTGSIRADDADLISQVAGEDWSGDSI--VYAYRSGEVVRLPKGAS 635

Query:   666 LEVTVEPFNFELLTVSPVTVLPKGSIQFAPIGLVNMLNTGGAVQSL 711
             + +T++   +EL  +SP+  + + +I FAPIGLV+M N+ GA++S+
Sbjct:   636 IPLTLKVLEYELFHISPLKEITE-NISFAPIGLVDMFNSSGAIESI 680

 Score = 96 (38.9 bits), Expect = 1.1e-150, Sum P(2) = 1.1e-150
 Identities = 15/46 (32%), Positives = 28/46 (60%)

Query:   719 LVRIEVKGCGEMKVFASEKPLMCKVDGASAEFSYEDQMATVQVPWP 764
             LV + V+GCG    ++S++PL C V+    +F+Y+ ++  V +  P
Sbjct:   713 LVSVSVRGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLP 758


>TAIR|locus:2020452
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 1430 (508.4 bits), Expect = 2.2e-146, P = 2.2e-146
 Identities = 304/752 (40%), Positives = 441/752 (58%)

Query:    24 ISLEGSNFLANGHPIFTQVPINIIATPSPFTSANKTKHTAGCFVGFDADESSDRHVVPIG 83
             IS+  S+ +  GH +   VP N++ TP+   S N      G F+G  +D++    V  +G
Sbjct:     7 ISVTDSDLVVLGHRVLHGVPENVLVTPA---SGNAL--IDGAFIGVTSDQTGSHRVFSLG 61

Query:    84 KLNGIRFMSIFRFKAWWTTHWVGNSGKDMEHETHLMILDKN---DLG-----RPYVLLLP 135
             KL  +RFM +FRFK WW T  +G +GK++  ET  +I++ N   DLG       YV+ LP
Sbjct:    62 KLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGSDLGGRDQSSSYVVFLP 121

Query:   136 ILEGPFRASLQPGTDNYVDMCVESGSSQIRCSSFRSCLYMRVGDDPYSLVKEAMKVVRVH 195
             ILEG FRA LQ    N +++C+ESG   +        +++  G DP+ ++ +A+K V  H
Sbjct:   122 ILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAVKAVEQH 181

Query:   196 LGTFKLLEEKTVPGIVDKFGWCTWDAFYLQVHPKGVYEGVKGLVEGGCPPGLVLIDDGWQ 255
             L TF   E K +P +++ FGWCTWDAFY  V  K V +G++ L  GG  P  V+IDDGWQ
Sbjct:   182 LQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGWQ 241

Query:   256 SICHDDEPIIDQEGMNRTSAGEQMPCRLIDFEENYKF-RDYKSP-RVPS-NKGMGAFVRD 312
             S+  D+  +   E  N  +A      RL   +EN+KF +D K   RV   +  +G  + D
Sbjct:   242 SVGMDETSV---E-FNADNAAN-FANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITD 296

Query:   313 LKDEFKSVEHVYVWHALCGYWGGIRPNVAGMP--ESRLIAPKLSQGLQTTMEDLAVEKIV 370
             +K    S+++VYVWHA+ GYWGG++P V+GM   ES++  P  S G+ ++     +E I 
Sbjct:   297 IKSN-NSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESIT 355

Query:   371 DNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLEMVAEDFGGRVELAKAYYKAL 430
              NG+GLV PE V + Y  LHS+L SVG+DGVKVDV ++LE +    GGRV+LAK Y++AL
Sbjct:   356 KNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQAL 415

Query:   431 TASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRVGDDFWCSDPKGVKNGTFWLQGCHM 490
              AS+ ++F  NG+I+ M H  D +Y   +T  + R  DDFW  DP             H+
Sbjct:   416 EASISRNFPDNGIISCMSHNTDGLYSAKKTAVI-RASDDFWPRDPAS--------HTIHI 466

Query:   491 VHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDSVGNHNFDLLKALV 550
                AYN+L++G  +QPDWDMF S HP AE+HAA+RA+ G  IY+SD  G H+F+LL+ LV
Sbjct:   467 ASVAYNTLFLGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLV 526

Query:   551 MPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNKHTGVLGLFNCQGGGWCSVTRK 610
             + DGSILR +    PT DC F +P+ D K++LKIWNLN+ TGV+G+FNCQG GWC   ++
Sbjct:   527 LRDGSILRAKLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKR 586

Query:   611 NVGFSMFSNTLTCLASPNDIEWNNGKDPISVKGVDVFAVYKFQENKLKLLKFSDDLEVTV 670
              +       T++     ND+ + +        G  +  VY     +L  L     L VT+
Sbjct:   587 YLIHDQEPGTISGCVRTNDVHYLHKVAAFEWTGDSI--VYSHLRGELVYLPKDTSLPVTL 644

Query:   671 EPFNFELLTVSPVTVLPKGSIQFAPIGLVNMLNTGGAVQSLAFDDDEN--LVRIEVKGCG 728
              P  +E+ TV PV     GS +FAP+GL+ M N+GGA+ SL +DD+    +VR++++G G
Sbjct:   645 MPREYEVFTVVPVKEFSDGS-KFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRMKLRGSG 703

Query:   729 EMKVFAS-EKPLMCKVDGASAEFSYEDQMATV 759
              + V++S  +P    VD    E+ YE +   V
Sbjct:   704 LVGVYSSVRRPRSVTVDSDDVEYRYEPESGLV 735


>UNIPROTKB|Q97U94
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 224 (83.9 bits), Expect = 3.9e-28, Sum P(3) = 3.9e-28
 Identities = 48/123 (39%), Positives = 70/123 (56%)

Query:   483 FWLQGC--HMVHCAYNSLWMGNVIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDSVGN 540
             FW  G   H++  AYNSL   +++ PD+DMF S  P A+ H  +R  SGGPIYI+D    
Sbjct:   429 FWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPE 488

Query:   541 H-NFDLLKALVMPDGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNKHTGVLGLFNC 599
               N +LL+  V+P+G ++R    AL T D LF++PL + + +LK+    K    +  FN 
Sbjct:   489 RTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRE-RVLLKLKGKVKGYNAIAFFNL 547

Query:   600 QGG 602
               G
Sbjct:   548 NSG 550

 Score = 158 (60.7 bits), Expect = 3.9e-28, Sum P(3) = 3.9e-28
 Identities = 31/91 (34%), Positives = 55/91 (60%)

Query:   174 YMRVG--DDPYSLVKEAMKVVRVHLGTFKLLEEKTVPG-IVDKFGWCTWDAFYLQ-VHPK 229
             ++ +G  D+PY  ++ A+ +      TFKL +EK  P  +++  GWC+W+AF  + ++ +
Sbjct:   181 FLSIGTSDNPYKAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEE 240

Query:   230 GVYEGVKGLVEGGCPPGLVLIDDGWQSICHD 260
              + + VKG++E G     V+IDDGWQ   +D
Sbjct:   241 NLIKVVKGIIERGLRLNWVIIDDGWQDQNND 271

 Score = 61 (26.5 bits), Expect = 3.9e-28, Sum P(3) = 3.9e-28
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query:   301 PSNK----GMGAFVRDLKDEFKSVEHVYVWHALCGYWGGI 336
             P NK    G    VR +K     V++V +WHA+  +WGG+
Sbjct:   279 PDNKKFPNGFKNTVRAIKS--LGVKYVGLWHAINAHWGGM 316


>UNIPROTKB|G4NBB7
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 290 (107.1 bits), Expect = 1.1e-27, Sum P(3) = 1.1e-27
 Identities = 90/310 (29%), Positives = 150/310 (48%)

Query:   304 KGMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPESRLIAPKLSQGLQTTMED 363
             +G+   V +++ +   + ++ VWH + GYWGG+ P+  G   S+    K+       + D
Sbjct:   403 QGLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPS--GPMASKYKMRKIQ------LRD 454

Query:   364 LAVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLEMVAEDFGGRVELA 423
              A  +  D     V  E V  +Y+  ++ L   G+   KVD    L+  A     R  L 
Sbjct:   455 EAEVQPKDFDFYTVDGEDVHKMYDDFYAFLADCGVSAAKVDTQGFLDYPAHA-NDRKNLI 513

Query:   424 KAYYKALTASVRKHFKGNGVIASMEHCNDFMYL----GTET--ISLGRVGDDFWCSDPKG 477
             + Y  A TA+  KHF G  +    +     ++     G     + + R  DDF+   P  
Sbjct:   514 RPYQDAWTAAASKHFGGRAIACMAQTPQSILHSLLQQGRSEGPMLMARNSDDFF---PDE 570

Query:   478 VKNGTFWLQGCHMVHCAYNSLWMGNV-IQPDWDMFQSTHP-CAEFHAASRAISGGPIYIS 535
             V + T W   C+    A+N+L M ++ +  DWDMFQ+T P  A  HA +R++SGGPIYI+
Sbjct:   571 VGSHT-WHVFCN----AHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMSGGPIYIT 625

Query:   536 DSVGNHNFDLLKALVMP--DGSILRCQFYALPTRDCLFENPLHDGKTVLKIWNLNKHTGV 593
             D+ G H+ +L+K +     DG  +  +    P R  L+    H  + +L++ + ++  G+
Sbjct:   626 DAPGEHDVELIKQMTAQTADGRTIALRADE-PGRT-LWPYGGHGEQRLLRVRSGHQGVGM 683

Query:   594 LGLFN-CQGG 602
             LG+FN C  G
Sbjct:   684 LGVFNVCNRG 693

 Score = 78 (32.5 bits), Expect = 1.1e-27, Sum P(3) = 1.1e-27
 Identities = 25/76 (32%), Positives = 38/76 (50%)

Query:   666 LEVTVEPFNFELLTVSPVTVLPKGSIQFAPIGLVNMLNTGGAVQSLAFDDD-ENL--VRI 722
             +EV +E   FE+ T  P+T L  G +  A +GLV  + T  AV  +++    E    V +
Sbjct:   736 IEVGLEEGGFEIFTAYPITKL--GGLAVATLGLVGKMATAAAVSHVSYSKHHEGFIPVGV 793

Query:   723 EV----KGCGEMKVFA 734
             EV    K  G + +FA
Sbjct:   794 EVSVSLKALGTLGIFA 809

 Score = 73 (30.8 bits), Expect = 1.1e-27, Sum P(3) = 1.1e-27
 Identities = 15/49 (30%), Positives = 24/49 (48%)

Query:   212 DKFGWCTWDAFYLQVHPKGVYEGVKGLVEGGCPPGLVLIDDGWQSICHD 260
             D F +CTW++    +    +   +  L E G     ++IDD WQS+  D
Sbjct:   334 DGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGD 382


>ASPGD|ASPL0000010056
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 278 (102.9 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
 Identities = 86/255 (33%), Positives = 126/255 (49%)

Query:   305 GMGAFVRDLKDEFKSVEHVYVWHALCGYWGGIRPNVAGMPESRLIAPKLSQGLQTTMEDL 364
             G+   V  ++++ +++E++ VWHAL GYWGGI P      E  L A      +  T E +
Sbjct:   384 GLAKAVTTIREQHRNIEYIVVWHALFGYWGGISP------EGSLAA------IYKTRE-V 430

Query:   365 AVEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLEMVAEDFGGRVELAK 424
             A+       +  + P  +Q  Y   ++ L   GI GVK D    L+++A D   R   A 
Sbjct:   431 ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLLA-DPEDRRSYAN 489

Query:   425 AYYKALTASVRKHFKGNGVIASMEHCNDFMY---LGTE--TISLGRVGDDFWCSDPKGVK 479
             AY  A T S  +HF G   I+ M      ++   L T   TI + R  +DF+   P  + 
Sbjct:   490 AYQDAWTISSLRHF-GPKAISCMSQIPQTIFHSQLPTNKPTIVV-RNSNDFF---PD-ID 543

Query:   480 NGTFWLQGCHMVHCAYNSLWMGNVIQPDWDMFQSTHP-----CAEFHAASRAISGGPIYI 534
             +   W   C+  H A  + ++  +  PDWDMFQ T P      A FHAA+R ISGGPIYI
Sbjct:   544 DSHTWHVFCN-AHNALLTRYLNGL--PDWDMFQ-TLPENGLDYASFHAAARCISGGPIYI 599

Query:   535 SDSVGNHNFDLLKAL 549
             +D  G H+  L+K +
Sbjct:   600 TDKPGQHDIPLIKQM 614

 Score = 80 (33.2 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
 Identities = 46/202 (22%), Positives = 86/202 (42%)

Query:    72 DESSDRHV----VPIGKLNGI-RFMSIFRFKAWWTTHWVG-NSGKDMEHETH-LMILDKN 124
             +E+ D H     +P+G  + + RF ++ R +    T W+G   GKD  + T   ++L   
Sbjct:   170 EEARDGHSGLLRLPLGTPSSMSRFFALARVE----TSWLGPRQGKDKLNFTEDAILLSFL 225

Query:   125 DLGRPYVLLLPILEGPFRASLQPGTDNYVDMCVESGSSQIRCSSFRSCLYMRVGDDPYS- 183
                  +V+LL +        L  G     ++ ++S +     S F+  L     D   + 
Sbjct:   226 RTDGVHVVLLGVTVDDTLTVLGSGPAG--EVVIKSQNDNATPSRFQ-VLAATAADFEVAT 282

Query:   184 --LVKEAMKVVRVHLGTFKL-LEEKTVPGIVDKFGWCTWDAFYLQVHPKGVYEGVKGLVE 240
               L+ EA ++VR +  T +     + +    D   +CTW+     +  + +   +  L  
Sbjct:   283 SALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKT 342

Query:   241 GGCPPGLVLIDDGWQSICHDDE 262
              G     ++IDD WQS+  D+E
Sbjct:   343 AGIRIRTLIIDDNWQSL--DNE 362

 Score = 37 (18.1 bits), Expect = 3.2e-19, Sum P(2) = 3.2e-19
 Identities = 6/14 (42%), Positives = 9/14 (64%)

Query:   258 CHDDEPIIDQEGMN 271
             CHD E +I   G++
Sbjct:   113 CHDGELVIVSRGLS 126


>UNIPROTKB|Q8A170
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 186 (70.5 bits), Expect = 1.0e-10, P = 1.0e-10
 Identities = 87/335 (25%), Positives = 136/335 (40%)

Query:   253 GWQSICHDDEPIIDQEGMNRTSAGEQ--MPCRLIDFEENY---KFRDYKSPRVPSNKGM- 306
             GW +  H    I + + +N   A E   +P R +  ++ +   K R   S  VP  K   
Sbjct:   231 GWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDGHIANKNRQLTS-LVPDKKRFP 289

Query:   307 GAFVRDLK-DEFKSVEHVYVWHALCGYWGGIRPNVAGMPESRLIAPKLSQGLQTTMEDLA 365
               + R +K  +   +  + +W++L GYW GI       PE R +    +  L   +   +
Sbjct:   290 NGWSRIMKRKQADKIRWIGLWYSLSGYWMGISAENDFPPEIRQVLHSYNGSL---LPGTS 346

Query:   366 VEKIVDNGVGLVPPELVQNLYEGLHSHLESVGIDGVKVDVIHLLEMVAEDFGGRVELAKA 425
              EKI             +  YE     ++  G D +K+D  +    +    GG   + +A
Sbjct:   347 TEKI-------------ETWYEYYVRTMKEYGFDFLKID--NQSFTLPLYMGGTQVIRQA 391

Query:   426 YYKALTASVRKHFKGNGVIASMEHCNDFMYLGTETISLGRVGDDFWCSDPKGVKNGTFWL 485
                 L    + H    G++  M   N      T   S+ R   D+   D    K+     
Sbjct:   392 KDCNLALEHQTHRMQMGLMNCMAQ-NVLNIDHTLYSSVTRASIDYKKYDENMAKS----- 445

Query:   486 QGCHMVHCAYNSLWMGNVIQPDWDMFQSTHP-CAEFHAASRAISGGPIYISDSVGNHNFD 544
                H+     N+L +G  + PD DMF S    C    A S+AISGGP+Y+SDS      D
Sbjct:   446 ---HLFQSYTNTLILGQTVWPDHDMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIAD 502

Query:   545 LLKALVMPDGSILRCQFYALPTRDCLFENPLHDGK 579
              ++ L+   G I R    A+PT + +  NPL  GK
Sbjct:   503 NIRPLIDETGKIFRPAAPAIPTPESILTNPLQSGK 537


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.138   0.437    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      778       778   0.00094  121 3  11 22  0.39    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  630 (67 KB)
  Total size of DFA:  443 KB (2209 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  63.08u 0.09s 63.17t   Elapsed:  00:00:03
  Total cpu time:  63.08u 0.09s 63.17t   Elapsed:  00:00:03
  Start:  Fri May 10 19:08:37 2013   End:  Fri May 10 19:08:40 2013
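
Note on reading the scores: each HSP above reports a raw score followed by a bit score, for example "Score = 224 (83.9 bits)". The bit values appear consistent with the standard Karlin-Altschul conversion, bits = (lambda * S - ln K) / ln 2, applied with the gapped parameters from the "Q=9,R=2" row of the Parameters table (Lambda = 0.244, K = 0.0300). The report does not state which row was used, so that choice is an inference. The percentages on the "Identities" lines also appear to be truncated rather than rounded (15/40 is printed as 37%). The short Python sketch below checks both observations; the names bit_score and percent_truncated are illustrative and not part of any BLAST tool.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of the
# Parameters table. Using this row for the bit-score conversion is an
# assumption inferred from the reported values, not stated in the report.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def percent_truncated(matches, aligned_length):
    """Integer percentage, truncated, as the 'Identities' lines seem to print it."""
    return (100 * matches) // aligned_length


if __name__ == "__main__":
    # Raw scores taken from HSPs listed in this report.
    for raw in (224, 158, 61, 290, 186):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")

    # Identity counts taken from the same HSPs: (matches, aligned length).
    for matches, length in ((48, 123), (15, 40), (87, 335)):
        print(f"Identities = {matches}/{length} ({percent_truncated(matches, length)}%)")
```

For the raw scores 224, 158, 61, 290 and 186 this gives 83.9, 60.7, 26.5, 107.1 and 70.5 bits, matching the values printed in the alignments.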
