BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>003897
MAPSISKVASGVRTLVDGSDNQSTNIDITLEDSKLHANGHVFLSDVPDNVTLTPSTATAT
EKSVFSNVGSFIGFDSFEPKSRHVVPIGKLKNIRFMSIFRFKVWWTTHWVGSNGRDLENE
TQLVILDNSTDTGRPYVLLLPIVEGPFRASLQPGADDYVDVCVESGSTKVTGDSFRSVVY
VHLGDDPFKLVKDAMRVVRSHLGTFKLLDEKTPPPIVDKFGWCTWDAFYLTVQPHGVMEG
VKGLVDGGCPPGLVLIDDGWQSISHDEDPIDSEGINRTAAGEQMPCRLLRYQENFKFRDY
VSPNGGDSSDNKGMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPEKTTVVKP
KLSPGLELTMEDLAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEI
LCENYGGRVDLAKAYYKALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIALGRVGDDFW
CTDPSGDPNGTFWLQGCHMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHAASRAISGGP
IYVSDCVGKHNFPLLKRLSMPDGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYT
GVIGAFNCQGGGWCREARRNTCASQFSQKVTAKTNPNDIEWNSGKNPISIEGVQVFAMYL
QEAKKLVLSKPYENIEISLEPFSFELITVSAVTLLPGGTSPSVQFAPIGLVNMLNTGGAI
QSLSYDDDENSVEIGVKGSGEMRVFASEKPRACKIDGNEVAFEYEGHMVAIQVPWSSPSG
LSVIEYLF

High Scoring Gene Products

Symbol     Full name                                  Information                                         P value
SIP1       AT5G40390                                  protein from Arabidopsis thaliana                   1.5e-314
RFS        Galactinol--sucrose galactosyltransferase  protein from Oryza sativa Japonica Group            4.5e-288
STS1       Stachyose synthase                         protein from Pisum sativum                          1.8e-196
STS        AT4G01970                                  protein from Arabidopsis thaliana                   2.5e-188
SIP1       AT1G55740                                  protein from Arabidopsis thaliana                   5.3e-143
SIP2       AT3G57520                                  protein from Arabidopsis thaliana                   4.4e-90
MGG_11554  Seed imbibition protein                    protein from Magnaporthe oryzae 70-15               6.5e-28
galS       Alpha-galactosidase                        protein from Sulfolobus solfataricus P2             7.4e-28
BT_3797    Possible alpha-galactosidase               protein from Bacteroides thetaiotaomicron VPI-5482  2.3e-10

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  003897
        (788 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  3017  1.5e-314  1
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  2767  4.5e-288  1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...  1303  1.8e-196  2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...  1210  2.5e-188  2
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  1398  5.3e-143  1
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...   845  4.4e-90   2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   284  6.5e-28   3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   219  7.4e-28   3
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   269  5.3e-25   2
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   142  2.3e-10   3
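The one-line hit summaries above have a regular layout: a database identifier, a description, and then three trailing columns (High Score, Smallest Sum Probability P(N), and N). The following is a minimal Python sketch of how such lines could be read into records and checked against the reported threshold of 0.001. The names `Hit`, `parse_hit_line`, and `SAMPLE_LINES` are hypothetical and exist only for this illustration; the sketch assumes the three numeric columns are always the last whitespace-separated tokens on the line.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    db_id: str        # e.g. "TAIR|locus:2170528"
    description: str  # truncated description, kept exactly as printed
    score: int        # High Score column
    p_value: float    # Smallest Sum Probability P(N)
    n: int            # number of segment pairs used for the sum statistic


def parse_hit_line(line: str) -> Hit:
    """Split one summary line into its identifier, description and numeric columns."""
    # The last three whitespace-separated tokens are Score, P(N) and N.
    head, score, p_value, n = line.rsplit(None, 3)
    # The identifier and the description are separated by " - ".
    db_id, description = head.split(" - ", 1)
    return Hit(db_id.strip(), description.strip(), int(score), float(p_value), int(n))


# A few lines copied from the summary above (hypothetical variable name).
SAMPLE_LINES = [
    'TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  3017  1.5e-314  1',
    'UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...  1303  1.8e-196  2',
    'TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  1398  5.3e-143  1',
    'UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   142  2.3e-10   3',
]

THRESHOLD = 0.001  # value listed under Parameters

hits = [parse_hit_line(line) for line in SAMPLE_LINES]
significant = [h for h in hits if h.p_value <= THRESHOLD]

for h in significant:
    print(h.db_id, h.score, h.p_value, h.n)
```

Note that the summary is ordered by P(N), not by score: the STS1 hit (score 1303, two segment pairs) is listed before the AT1G55740 hit (score 1398, one segment pair).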


>TAIR|locus:2170528
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 3017 (1067.1 bits), Expect = 1.5e-314, P = 1.5e-314
 Identities = 565/784 (72%), Positives = 644/784 (82%)

Query:    19 SDNQSTNIDIT----LEDSKLHANGHVFLSDVPDNVTLTPSTATATEKSVFSNV--GSFI 72
             SD+    +D T    LEDS L ANG V L+DVP NVTLT S     +  V  +V  GSFI
Sbjct:     9 SDSGINGVDFTEKFRLEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFI 68

Query:    73 GFD-SFEPKSRHVVPIGKLKNIRFMSIFRFKVWWTTHWVGSNGRDLENETQLVILDNS-T 130
             GF+   EPKS HV  IGKLKNIRFMSIFRFKVWWTTHWVGSNGRD+ENETQ++ILD S +
Sbjct:    69 GFNLDGEPKSHHVASIGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILDQSGS 128

Query:   131 DTG------RPYVLLLPIVEGPFRASLQPGADDYVDVCVESGSTKVTGDSFRSVVYVHLG 184
             D+G      RPYVLLLP++EG FR+S Q G DD V VCVESGST+VTG  FR +VYVH G
Sbjct:   129 DSGPGSGSGRPYVLLLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAG 188

Query:   185 DDPFKLVKDAMRVVRSHLGTFKLLDEKTPPPIVDKFGWCTWDAFYLTVQPHGVMEGVKGL 244
             DDPFKLVKDAM+V+R H+ TFKLL+EK+PP IVDKFGWCTWDAFYLTV P GV +GVK L
Sbjct:   189 DDPFKLVKDAMKVIRVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCL 248

Query:   245 VDGGCPPGLVLIDDGWQSISHDEDPIDSEGINRTAAGEQMPCRLLRYQENFKFRDYVSPN 304
             VDGGCPPGLVLIDDGWQSI HD D ID EG+N T AGEQMPCRLL+++EN KF+DYVSP 
Sbjct:   249 VDGGCPPGLVLIDDGWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPK 308

Query:   305 GGDSSDNKGMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPEKTTVVKPKLSP 364
               D +D  GM AF+RDLKDEF TVD +YVWHALCGYWGGLRP  P LP  +T+++P+LSP
Sbjct:   309 --DQND-VGMKAFVRDLKDEFSTVDYIYVWHALCGYWGGLRPEAPALPP-STIIRPELSP 364

Query:   365 GLELTMEDLAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEILCEN 424
             GL+LTMEDLAVDKI+  G+GF  P+L  + YEGLHSHL+  GIDGVKVDVIH+LE+LC+ 
Sbjct:   365 GLKLTMEDLAVDKIIETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQK 424

Query:   425 YGGRVDLAKAYYKALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIALGRVGDDFWCTDP 484
             YGGRVDLAKAY+KALT+SV KHF GNGVIASMEHCNDFM LGTEAI+LGRVGDDFWCTDP
Sbjct:   425 YGGRVDLAKAYFKALTSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDP 484

Query:   485 SGDPNGTFWLQGCHMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHAASRAISGGPIYVS 544
             SGDPNGTFWLQGCHMVHCAYNSLWMGNFI PDWDMFQSTHPCAEFHAASRAISGGPIY+S
Sbjct:   485 SGDPNGTFWLQGCHMVHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYIS 544

Query:   545 DCVGKHNFPLLKRLSMPDGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYTGVIG 604
             DCVGKH+F LLKRL +P+GSILRCEYYALPTRD LF DPLHDGKTMLKIWNLNKYTGVIG
Sbjct:   545 DCVGKHDFDLLKRLVLPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIG 604

Query:   605 AFNCQGGGWCREARRNTCASQFSQKVTAKTNPNDIEWNSGKNPISIEGVQVFAMYLQEAK 664
             AFNCQGGGWCRE RRN C S+    +TA T+P D+EWNSG +PISI  V+ FA++L ++K
Sbjct:   605 AFNCQGGGWCRETRRNQCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFALFLSQSK 664

Query:   665 KLVLSKPYENIEISLEPFSFELITVSAVTLLPGGTSPSVQFAPIGLVNMLNTGGAIQSLS 724
             KL+LS   +++E++LEPF FELITVS V  + G    SV+FAPIGLVNMLNT GAI+SL 
Sbjct:   665 KLLLSGLNDDLELTLEPFKFELITVSPVVTIEGN---SVRFAPIGLVNMLNTSGAIRSLV 721

Query:   725 YDDDENSVEIGVKGSGEMRVFASEKPRACKIDGNEVAFEYEGHMVAIQVPWSSPSGLSVI 784
             Y+D+  SVE+GV G+GE RV+AS+KP +C IDG  V F YE  MV +QVPWS P GLS I
Sbjct:   722 YNDE--SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSGPDGLSSI 779

Query:   785 EYLF 788
             +YLF
Sbjct:   780 QYLF 783
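Each alignment is preceded by an Identities and Positives line giving matched positions over alignment length, followed by a percentage. The sketch below, in Python, shows one way to extract those counts and recompute the percentage. Judging only from the values printed in this report (for example, 521/802 is shown as 64%), the percentages appear to be truncated rather than rounded; that is an observation about this output, not a documented guarantee. The names `parse_alignment_stats` and `truncated_percent` are hypothetical.

```python
import re

# Matches e.g. "Identities = 565/784 (72%)" or "Positives = 644/784 (82%)".
STAT_RE = re.compile(r"(Identities|Positives) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line: str) -> dict:
    """Return {name: (matched, alignment_length, printed_percent)} for one stats line."""
    stats = {}
    for name, matched, length, pct in STAT_RE.findall(line):
        stats[name] = (int(matched), int(length), int(pct))
    return stats


def truncated_percent(matched: int, length: int) -> int:
    """Integer percentage with the fractional part dropped (assumed convention)."""
    return matched * 100 // length


line = " Identities = 565/784 (72%), Positives = 644/784 (82%)"
stats = parse_alignment_stats(line)

for name, (matched, length, printed) in stats.items():
    print(name, matched, length, printed, truncated_percent(matched, length))
```

Recomputing the percentage this way gives a quick consistency check when the counts are copied into other tools.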


>UNIPROTKB|Q5VQG4
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 2767 (979.1 bits), Expect = 4.5e-288, P = 4.5e-288
 Identities = 521/802 (64%), Positives = 622/802 (77%)

Query:     1 MAPSISKVASGVRTLVDGSDNQSTNIDITLEDSKLHANGHVFLSDVPDNVTLTPSTATAT 60
             MAP++SK    +   V   D        TL+   L  +GH FL DVP N+ LTP++    
Sbjct:     1 MAPNLSKAKDDLIGDVVAVDGLIKPPRFTLKGKDLAVDGHPFLLDVPANIRLTPASTLVP 60

Query:    61 EKSV-FSNVGSFIGFDSFEPKSRHVVPIGKLKNIRFMSIFRFKVWWTTHWVGSNGRDLEN 119
                V  +  GSF+GFD+   K RHVVPIGKL++ RFMSIFRFKVWWTTHWVG+NGRD+EN
Sbjct:    61 NSDVPAAAAGSFLGFDAPAAKDRHVVPIGKLRDTRFMSIFRFKVWWTTHWVGTNGRDVEN 120

Query:   120 ETQLVILDNS----TDTG-RPYVLLLPIVEGPFRASLQPG-ADDYVDVCVESGSTKVTGD 173
             ETQ++ILD S    + TG RPYVLLLPIVEGPFRA L+ G A+DYV + +ESGS+ V G 
Sbjct:   121 ETQMMILDQSGTKSSPTGPRPYVLLLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGS 180

Query:   174 SFRSVVYVHLGDDPFKLVKDAMRVVRSHLGTFKLLDEKTPPPIVDKFGWCTWDAFYLTVQ 233
              FRS VY+H GDDPF LVKDAMRVVR+HLGTF+L++EKTPPPIVDKFGWCTWDAFYL V 
Sbjct:   181 VFRSAVYLHAGDDPFDLVKDAMRVVRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVH 240

Query:   234 PHGVMEGVKGLVDGGCPPGLVLIDDGWQSISHDEDPIDS--EGINRTAAGEQMPCRLLRY 291
             P GV EGV+ L DGGCPPGLVLIDDGWQSI HD+D + S  EG+NRT+AGEQMPCRL+++
Sbjct:   241 PEGVWEGVRRLADGGCPPGLVLIDDGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKF 300

Query:   292 QENFKFRDYVSPNGGDSSDNKGMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGL 351
             QEN+KFR+Y    GG       MG F+R++K  F TV+QVYVWHALCGYWGGLRP  PGL
Sbjct:   301 QENYKFREY---KGG-------MGGFVREMKAAFPTVEQVYVWHALCGYWGGLRPGAPGL 350

Query:   352 PEKTTVVKPKLSPGLELTMEDLAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVK 411
             P    VV P+LSPGL+ TMEDLAVDKIVNNGVG V P    ++YEGLHSHL+  GIDGVK
Sbjct:   351 PP-AKVVAPRLSPGLQRTMEDLAVDKIVNNGVGLVDPRRARELYEGLHSHLQASGIDGVK 409

Query:   412 VDVIHLLEILCENYGGRVDLAKAYYKALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIA 471
             VDVIHLLE++CE YGGRV+LAKAY+  LT SVR+HF GNGVIASMEHCNDFMLLGTEA+A
Sbjct:   410 VDVIHLLEMVCEEYGGRVELAKAYFAGLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVA 469

Query:   472 LGRVGDDFWCTDPSGDPNGTFWLQGCHMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHA 531
             LGRVGDDFWCTDPSGDP+GTFWLQGCHMVHCAYNSLWMG FIHPDWDMFQSTHPCA FHA
Sbjct:   470 LGRVGDDFWCTDPSGDPDGTFWLQGCHMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHA 529

Query:   532 ASRAISGGPIYVSDCVGKHNFPLLKRLSMPDGSILRCEYYALPTRDCLFADPLHDGKTML 591
             ASRA+SGGP+YVSD VG H+F LL+RL++PDG+ILRCE YALPTRDCLFADPLHDGKTML
Sbjct:   530 ASRAVSGGPVYVSDAVGCHDFDLLRRLALPDGTILRCERYALPTRDCLFADPLHDGKTML 589

Query:   592 KIWNLNKYTGVIGAFNCQGGGWCREARRNTCASQFSQKVTAKTNPNDIEWNSGKNPISIE 651
             KIWN+NK++GV+GAFNCQGGGW REARRN CA+ FS  VTA+ +P D+EW+ G       
Sbjct:   590 KIWNVNKFSGVLGAFNCQGGGWSREARRNMCAAGFSVPVTARASPADVEWSHGGG----- 644

Query:   652 GVQVFAMYLQEAKKLVLSKPYENIEISLEPFSFELITVSAVTLLPGGTSPS--VQFAPIG 709
             G   FA+Y  EA+KL L +  E++E++LEPF++EL+ V+ V  +    SP   + FAPIG
Sbjct:   645 GGDRFAVYFVEARKLQLLRRDESVELTLEPFTYELLVVAPVRAI---VSPELGIGFAPIG 701

Query:   710 LVNMLNTGGAIQSL--SYDDDENSVEIGVKGSGEMRVFASEKPRACKIDGNEVAFEYEGH 767
             L NMLN GGA+Q    +  D + + E+ VKG+GEM  ++S +PR CK++G +  F+YE  
Sbjct:   702 LANMLNAGGAVQGFEAARKDGDVAAEVAVKGAGEMVAYSSARPRLCKVNGQDAEFKYEDG 761

Query:   768 MVAIQVPWSSPSG-LSVIEYLF 788
             +V + VPW+  S  LS +EY +
Sbjct:   762 IVTVDVPWTGSSKKLSRVEYFY 783


>UNIPROTKB|Q93XK2
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 1303 (463.7 bits), Expect = 1.8e-196, Sum P(2) = 1.8e-196
 Identities = 248/491 (50%), Positives = 334/491 (68%)

Query:   304 NGGDSSDNK---GMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPEKTTVVKP 360
             + G+ S+ K   G+ AF +DL+ +FK +D VYVWHALCG WGG+RP    L  K  +V  
Sbjct:   371 SSGEKSEMKSEYGLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETTHLDTK--IVPC 428

Query:   361 KLSPGLELTMEDLAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEI 420
             KLSPGL+ TMEDLAV +I    +G V P   +++Y+ +HS+L + GI GVKVDVIH LE 
Sbjct:   429 KLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEY 488

Query:   421 LCENYGGRVDLAKAYYKALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIALGRVGDDFW 480
             +C+ YGGRVDLAK YY+ LT S+ K+F GNG+IASM+HCNDF  LGT+ I++GRVGDDFW
Sbjct:   489 VCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFW 548

Query:   481 CTDPSGDPNGTFWLQGCHMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHAASRAISGGP 540
               DP+GDP G+FWLQG HM+HC+YNSLWMG  I PDWDMFQS H CA+FHA SRAI GGP
Sbjct:   549 FQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGP 608

Query:   541 IYVSDCVGKHNFPLLKRLSMPDGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYT 600
             IYVSD VG H+F L+K+L  PDG+I +C Y+ LPTRDCLF +PL D  T+LKIWN NKY 
Sbjct:   609 IYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYG 668

Query:   601 GVIGAFNCQGGGWCREARRNTCASQFSQKVTAKTNPNDIEWNSGKNPISIEGVQVFAMYL 660
             GVIGAFNCQG GW    ++     +  + +    +  ++EW+  +    +   + + +YL
Sbjct:   669 GVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKAEEYVVYL 728

Query:   661 QEAKKL-VLSKPYENIEISLEPFSFELITVSAVTLLPGGTSPSVQFAPIGLVNMLNTGGA 719
              +A++L +++   E I+ +++P +FEL +   VT L GG    ++FAPIGL NM N+GG 
Sbjct:   729 NQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGG----IKFAPIGLTNMFNSGGT 784

Query:   720 IQSLSYDDDENSVEIGVKGSGEMRVFASEKPRACKIDGNEVAFEYEGH-MVAIQVPWSSP 778
             +  L Y    N  +I VKG G    ++SE P+  +++G EV FE+ G   + + VPW   
Sbjct:   785 VIDLEYVG--NGAKIKVKGGGSFLAYSSESPKKFQLNGCEVDFEWLGDGKLCVNVPWIEE 842

Query:   779 S-GLSVIEYLF 788
             + G+S +E  F
Sbjct:   843 ACGVSDMEIFF 853

 Score = 622 (224.0 bits), Expect = 1.8e-196, Sum P(2) = 1.8e-196
 Identities = 127/286 (44%), Positives = 175/286 (61%)

Query:    30 LEDSKLHANGHVFLSDVPDNVTLT-------PSTATAT----EKSV-FSNVGSFIGFDSF 77
             L + K    G     DVP+NV+         PS + A     +K + +S+ G F GF   
Sbjct:    21 LSERKFKVKGFPLFHDVPENVSFRSFSSICKPSESNAPPSLLQKVLAYSHKGGFFGFSHE 80

Query:    78 EPKSRHVVPIGKLKNIRFMSIFRFKVWWTTHWVGSNGRDLENETQLVILDNSTDTGRPYV 137
              P  R +  IG      F+SIFRFK WW+T W+G +G DL+ ETQ ++++   +T + YV
Sbjct:    81 TPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIE-VPET-KSYV 138

Query:   138 LLLPIVEGPFRASLQPGADDYVDVCVESGSTKVTGDSFRSVVYVHLGDDPFKLVKDAMRV 197
             +++PI+E  FR++L PG +D+V +  ESGSTKV   +F S+ YVH  ++P+ L+K+A   
Sbjct:   139 VIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIAYVHFSENPYDLMKEAYSA 198

Query:   198 VRSHLGTFKLLDEKTPPPIVDKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLID 257
             +R HL +F+LL+EKT P +VDKFGWCTWDAFYLTV P G+  G+     GG  P  V+ID
Sbjct:   199 IRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGLDDFSKGGVEPRFVIID 258

Query:   258 DGWQSISHDE-DPIDSEGINRTAAGEQMPCRLLRYQENFKFRDYVS 302
             DGWQSIS D  DP + +  N    GEQM  RL R+ E +KFR Y S
Sbjct:   259 DGWQSISFDGYDP-NEDAKNLVLGGEQMSGRLHRFDECYKFRKYES 303


>TAIR|locus:2141425
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 1210 (431.0 bits), Expect = 2.5e-188, Sum P(2) = 2.5e-188
 Identities = 235/495 (47%), Positives = 320/495 (64%)

Query:   305 GGDSSDNKGMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPEKTTVVKPKLSP 364
             G D     GM AF +DL+  FK++D +YVWHALCG W G+RP    +  K  V   +LSP
Sbjct:   390 GSDDVSGSGMAAFTKDLRLRFKSLDDIYVWHALCGAWNGVRPETM-MDLKAKVAPFELSP 448

Query:   365 GLELTMEDLAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEILCEN 424
              L  TM DLAVDK+V  G+G V P    + Y+ +HS+L  VG+ G K+DV   LE L E 
Sbjct:   449 SLGATMADLAVDKVVEAGIGLVHPSKAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEE 508

Query:   425 YGGRVDLAKAYYKALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIALGRVGDDFWCTDP 484
             +GGRV+LAKAYY  LT S+ K+F G  VIASM+ CN+F  L T+ I++GRVGDDFW  DP
Sbjct:   509 HGGRVELAKAYYDGLTESMIKNFNGTDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDP 568

Query:   485 SGDPNGTFWLQGCHMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHAASRAISGGPIYVS 544
              GDP G +WLQG HM+HC+YNS+WMG  I PDWDMFQS H CAE+HAASRAI GGP+Y+S
Sbjct:   569 YGDPQGVYWLQGVHMIHCSYNSIWMGQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLS 628

Query:   545 DCVGK--HNFPLLKRLSMPDGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYTGV 602
             D +GK  HNF L+K+L+  DG+I RC +YALPTRD LF +PL D +++LKI+N NK+ GV
Sbjct:   629 DHLGKASHNFDLIKKLAFFDGTIPRCVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGV 688

Query:   603 IGAFNCQGGGWCREARRNTCASQFSQKVTAKTNPNDIEWNSGKNPISIEGVQV-----FA 657
             IG FNCQG GW  E  R     +    V+   + +DIEW+  +NP    G QV     + 
Sbjct:   689 IGTFNCQGAGWSPEEHRFKGYKECYTTVSGTVHVSDIEWD--QNP-EAAGSQVTYTGDYL 745

Query:   658 MYLQEAKKLV-LSKPYENIEISLEPFSFELITVSAVTLLPGGTSPSVQFAPIGLVNMLNT 716
             +Y Q++++++ ++   E ++I+LEP +F+L++   VT L    S  V+FAP+GL+NM N 
Sbjct:   746 VYKQQSEEILFMNSKSEAMKITLEPSAFDLLSFVPVTEL---VSSGVRFAPLGLINMFNC 802

Query:   717 GGAIQSLSYDDDENSVEIGVKGSGEMRVFASEKPRACKIDGNEVAFEYEGHM--VAIQVP 774
              G +Q +    D NS+ + VKG G    ++S  P  C ++  E  F++E     ++  VP
Sbjct:   803 VGTVQDMKVTGD-NSIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVP 861

Query:   775 WSSPSG-LSVIEYLF 788
             W   SG +S + + F
Sbjct:   862 WVEESGGISHLSFTF 876

 Score = 638 (229.6 bits), Expect = 2.5e-188, Sum P(2) = 2.5e-188
 Identities = 129/269 (47%), Positives = 170/269 (63%)

Query:    43 LSDVPDNVTLTP--STATATEKS------VFSNV--GSFIGFDSFEPKSRHVVPIGKLKN 92
             L DVP NVT TP  S + +T+        V +N   G F+GF    P  R    +G+ ++
Sbjct:    50 LFDVPQNVTFTPFSSHSISTDAPLPILLRVQANAHKGGFLGFTKESPSDRLTNSLGRFED 109

Query:    93 IRFMSIFRFKVWWTTHWVGSNGRDLENETQLVILD-NSTDTGRPYVLLLPIVEGPFRASL 151
               F+S+FRFK+WW+T W+G +G DL+ ETQ V+L     D+   YV ++P +EG FRASL
Sbjct:   110 REFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIPEIDS---YVAIIPTIEGAFRASL 166

Query:   152 QPGADDYVDVCVESGSTKVTGDSFRSVVYVHLGDDPFKLVKDAMRVVRSHLGTFKLLDEK 211
              PG    V +C ESGSTKV   SF+S+ Y+H+ D+P+ L+K+A   +R H+ TFKLL+EK
Sbjct:   167 TPGEKGNVLICAESGSTKVKESSFKSIAYIHICDNPYNLMKEAFSALRVHMNTFKLLEEK 226

Query:   212 TPPPIVDKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLIDDGWQSISHDEDPID 271
               P IVDKFGWCTWDA YLTV P  +  GVK   DGG  P  V+IDDGWQSI+ D D +D
Sbjct:   227 KLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFEDGGVCPKFVIIDDGWQSINFDGDELD 286

Query:   272 SEGINRTAAGEQMPCRLLRYQENFKFRDY 300
              +  N    GEQM  RL  ++E  KFR+Y
Sbjct:   287 KDAENLVLGGEQMTARLTSFKECKKFRNY 315


>TAIR|locus:2020452
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 1398 (497.2 bits), Expect = 5.3e-143, P = 5.3e-143
 Identities = 301/752 (40%), Positives = 438/752 (58%)

Query:    28 ITLEDSKLHANGHVFLSDVPDNVTLTPSTATATEKSVFSNVGSFIGFDSFEPKSRHVVPI 87
             I++ DS L   GH  L  VP+NV +TP++  A         G+FIG  S +  S  V  +
Sbjct:     7 ISVTDSDLVVLGHRVLHGVPENVLVTPASGNALID------GAFIGVTSDQTGSHRVFSL 60

Query:    88 GKLKNIRFMSIFRFKVWWTTHWVGSNGRDLENETQLVILD--NSTDTG-----RPYVLLL 140
             GKL+++RFM +FRFK+WW T  +G+NG+++  ETQ +I++    +D G       YV+ L
Sbjct:    61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGSDLGGRDQSSSYVVFL 120

Query:   141 PIVEGPFRASLQPGADDYVDVCVESGSTKVTGDSFRS--VVYVHLGDDPFKLVKDAMRVV 198
             PI+EG FRA LQ    + +++C+ESG   V  D F    +V+V  G DPF ++  A++ V
Sbjct:   121 PILEGDFRAVLQGNEANELEICLESGDPTV--DQFEGSHLVFVAAGSDPFDVITKAVKAV 178

Query:   199 RSHLGTFKLLDEKTPPPIVDKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLIDD 258
               HL TF   + K  P +++ FGWCTWDAFY  V    V +G++ L  GG  P  V+IDD
Sbjct:   179 EQHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDD 238

Query:   259 GWQSISHDEDPIDSEGINRTAAGEQMPCRLLRYQENFKFRDYVSPNGGDSSDNKGMGAFI 318
             GWQS+  DE  ++    N  AA      RL   +EN KF+            +  +G  I
Sbjct:   239 GWQSVGMDETSVEFNADN--AAN--FANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVI 294

Query:   319 RDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPE-KTTVVKPKLSPGLELTMEDLA-VD 376
              D+K    ++  VYVWHA+ GYWGG++P + G+   ++ V  P  SPG+ ++ E+   ++
Sbjct:   295 TDIKSN-NSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGV-MSSENCGCLE 352

Query:   377 KIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEILCENYGGRVDLAKAYY 436
              I  NG+G V PE V   Y  LHS+L  VG+DGVKVDV ++LE L   +GGRV LAK Y+
Sbjct:   353 SITKNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYH 412

Query:   437 KALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIALGRVGDDFWCTDPSGDPNGTFWLQG 496
             +AL AS+ ++F  NG+I+ M H  D  L   +  A+ R  DDFW  DP+           
Sbjct:   413 QALEASISRNFPDNGIISCMSHNTDG-LYSAKKTAVIRASDDFWPRDPASHT-------- 463

Query:   497 CHMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHAASRAISGGPIYVSDCVGKHNFPLLK 556
              H+   AYN+L++G F+ PDWDMF S HP AE+HAA+RA+ G  IYVSD  G+H+F LL+
Sbjct:   464 IHIASVAYNTLFLGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLR 523

Query:   557 RLSMPDGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRE 616
             +L + DGSILR +    PT DC F+DP+ D K++LKIWNLN++TGVIG FNCQG GWC+ 
Sbjct:   524 KLVLRDGSILRAKLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKN 583

Query:   617 ARRNTCASQFSQKVTAKTNPNDIEWNSGKNPISIEGVQVFAMYLQEAKKLVLSKPYENIE 676
              +R     Q    ++     ND+ +          G  +   +L+   +LV      ++ 
Sbjct:   584 EKRYLIHDQEPGTISGCVRTNDVHYLHKVAAFEWTGDSIVYSHLRG--ELVYLPKDTSLP 641

Query:   677 ISLEPFSFELITVSAVTLLPGGTSPSVQFAPIGLVNMLNTGGAIQSLSYDDDENS--VEI 734
             ++L P  +E+ TV  V     G+    +FAP+GL+ M N+GGAI SL YDD+     V +
Sbjct:   642 VTLMPREYEVFTVVPVKEFSDGS----KFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRM 697

Query:   735 GVKGSGEMRVFAS-EKPRACKIDGNEVAFEYE 765
              ++GSG + V++S  +PR+  +D ++V + YE
Sbjct:   698 KLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYE 729


>TAIR|locus:2103488
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 845 (302.5 bits), Expect = 4.4e-90, Sum P(2) = 4.4e-90
 Identities = 183/469 (39%), Positives = 263/469 (56%)

Query:    27 DITLEDSKLHANGHVFLSDVPDNVTLTPSTATATEKSVFSNVGSFIGFDSFEPKSRHVVP 86
             +I++++  L   G   L+ +PDN+ LTP T        F + GSFIG    + KS HV P
Sbjct:     6 NISVQNDNLVVQGKTILTKIPDNIILTPVTGNG-----FVS-GSFIGATFEQSKSLHVFP 59

Query:    87 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDLENETQLVILD-------NSTDTGRPYVLL 139
             IG L+ +RFM  FRFK+WW T  +GS G+D+  ETQ ++L+       N  D    Y + 
Sbjct:    60 IGVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAPTVYTVF 119

Query:   140 LPIVEGPFRASLQPGADDYVDVCVESGSTKVTGDSFRSVVYVHLGDDPFKLVKDAMRVVR 199
             LP++EG FRA LQ    + +++C ESG   V       +VYVH G +PF++++ +++ V 
Sbjct:   120 LPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKAVE 179

Query:   200 SHLGTFKLLDEKTPPPIVDKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLIDDG 259
              H+ TF   ++K  P  +D FGWCTWDAFY  V   GV EG+K L +GG PP  ++IDDG
Sbjct:   180 RHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDG 239

Query:   260 WQSISHDEDPIDSEGINRTAAGEQMPCRLLRYQENFKFRDYVSPNGGDSSDNK--GMGAF 317
             WQ I + E   D   +     G Q   RL+  +EN KF+        D  D +  G+ + 
Sbjct:   240 WQQIENKEK--DENCV--VQEGAQFATRLVGIKENAKFQK------SDQKDTQVSGLKSV 289

Query:   318 IRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPE-KTTVVKPKLSPGLELTMEDLAVD 376
             + + K     V QVY WHAL GYWGG++P   G+    + +  P  SPG+     D+ +D
Sbjct:   290 VDNAKQRHN-VKQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMD 348

Query:   377 KIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEILCENYGGRVDLAKAYY 436
              +  +G+G V P+ V   Y  LHS+L   GIDGVKVDV +++E L    GGRV L ++Y 
Sbjct:   349 SLAVHGLGLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQ 408

Query:   437 KALTASVRKHFKGNGVIASMEHCNDFMLLGTEAIALGRVGDDFWCTDPS 485
             +AL AS+ ++F  NG I+ M H  D  L   +  A+ R  DDF+  DP+
Sbjct:   409 QALEASIARNFTDNGCISCMCHNTDG-LYSAKQTAIVRASDDFYPRDPA 456

 Score = 840 (300.8 bits), Expect = 1.5e-89, Sum P(2) = 1.5e-89
 Identities = 180/455 (39%), Positives = 261/455 (57%)

Query:   281 GEQMPCRLLRYQENFKFRDYVSPNGGDSSDNK--GMGAFIRDLKDEFKTVDQVYVWHALC 338
             G Q   RL+  +EN KF+        D  D +  G+ + + + K     V QVY WHAL 
Sbjct:   257 GAQFATRLVGIKENAKFQK------SDQKDTQVSGLKSVVDNAKQRHN-VKQVYAWHALA 309

Query:   339 GYWGGLRPNIPGLPE-KTTVVKPKLSPGLELTMEDLAVDKIVNNGVGFVPPELVDQMYEG 397
             GYWGG++P   G+    + +  P  SPG+     D+ +D +  +G+G V P+ V   Y  
Sbjct:   310 GYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVNPKKVFNFYNE 369

Query:   398 LHSHLEKVGIDGVKVDVIHLLEILCENYGGRVDLAKAYYKALTASVRKHFKGNGVIASME 457
             LHS+L   GIDGVKVDV +++E L    GGRV L ++Y +AL AS+ ++F  NG I+ M 
Sbjct:   370 LHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNFTDNGCISCMC 429

Query:   458 HCNDFMLLGTEAIALGRVGDDFWCTDPSGDPNGTFWLQGCHMVHCAYNSLWMGNFIHPDW 517
             H  D  L   +  A+ R  DDF+  DP+            H+   AYNSL++G F+ PDW
Sbjct:   430 HNTDG-LYSAKQTAIVRASDDFYPRDPASHT--------IHIASVAYNSLFLGEFMQPDW 480

Query:   518 DMFQSTHPCAEFHAASRAISGGPIYVSDCVGKHNFPLLKRLSMPDGSILRCEYYALPTRD 577
             DMF S HP AE+HAA+RA+ G  IYVSD  G HNF LL++L +PDGS+LR +    PTRD
Sbjct:   481 DMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTRD 540

Query:   578 CLFADPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCREARRNTCASQFSQKVTAKTNPN 637
             CLFADP  DG ++LKIWN+NK+TG++G FNCQG GWC+E ++N         +T     +
Sbjct:   541 CLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRAD 600

Query:   638 DIEWNSGKNPISIEGVQVFAMYLQEAKKLVLSKPYENIEISLEPFSFELITVSAVTLLPG 697
             D +  S        G  +  +Y   + ++V      +I ++L+   +EL  +S +  +  
Sbjct:   601 DADLISQVAGEDWSGDSI--VYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEI-- 656

Query:   698 GTSPSVQFAPIGLVNMLNTGGAIQSLSYDD--DEN 730
               + ++ FAPIGLV+M N+ GAI+S+  +   D+N
Sbjct:   657 --TENISFAPIGLVDMFNSSGAIESIDINHVTDKN 689

 Score = 73 (30.8 bits), Expect = 4.4e-90, Sum P(2) = 4.4e-90
 Identities = 12/45 (26%), Positives = 24/45 (53%)

Query:   732 VEIGVKGSGEMRVFASEKPRACKIDGNEVAFEYEGH--MVAIQVP 774
             V + V+G G    ++S++P  C ++  E  F Y+    +V + +P
Sbjct:   714 VSVSVRGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLP 758


>UNIPROTKB|G4NBB7
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 284 (105.0 bits), Expect = 6.5e-28, Sum P(3) = 6.5e-28
 Identities = 92/319 (28%), Positives = 155/319 (48%)

Query:   312 KGMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPEKTTVVKPKLSPGLELTME 371
             +G+   + +++ +   +  + VWH + GYWGG+ P+ P +  K  + K +L    E+  +
Sbjct:   403 QGLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGP-MASKYKMRKIQLRDEAEVQPK 461

Query:   372 DLAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEILCENYGGRVDL 431
             D   D    +G      E V +MY+  ++ L   G+   KVD    L+    +   R +L
Sbjct:   462 DF--DFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFLDYPA-HANDRKNL 512

Query:   432 AKAYYKALTASVRKHFKGNGVIASMEHCNDFM--LL--G-TEA-IALGRVGDDFWCTDPS 485
              + Y  A TA+  KHF G  +    +     +  LL  G +E  + + R  DDF+  D  
Sbjct:   513 IRPYQDAWTAAASKHFGGRAIACMAQTPQSILHSLLQQGRSEGPMLMARNSDDFF-PDEV 571

Query:   486 GDPNGTFWLQGCHMVHCAYNSLWMGNF-IHPDWDMFQSTHP-CAEFHAASRAISGGPIYV 543
             G      W   C+    A+N+L M +  +  DWDMFQ+T P  A  HA +R++SGGPIY+
Sbjct:   572 GSHT---WHVFCN----AHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMSGGPIYI 624

Query:   544 SDCVGKHNFPLLKRLSMP--DGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYTG 601
             +D  G+H+  L+K+++    DG  +       P R  L+    H  + +L++ + ++  G
Sbjct:   625 TDAPGEHDVELIKQMTAQTADGRTIALRADE-PGRT-LWPYGGHGEQRLLRVRSGHQGVG 682

Query:   602 VIGAFN-CQGGGWCREARR 619
             ++G FN C  G    E  R
Sbjct:   683 MLGVFNVCNRGSLLGEQVR 701

 Score = 81 (33.6 bits), Expect = 6.5e-28, Sum P(3) = 6.5e-28
 Identities = 23/74 (31%), Positives = 37/74 (50%)

Query:   675 IEISLEPFSFELITVSAVTLLPGGTSPSVQFAPIGLVNMLNTGGAIQSLSYDDD-ENSVE 733
             IE+ LE   FE+ T   +T L GG +     A +GLV  + T  A+  +SY    E  + 
Sbjct:   736 IEVGLEEGGFEIFTAYPITKL-GGLA----VATLGLVGKMATAAAVSHVSYSKHHEGFIP 790

Query:   734 IGVKGSGEMRVFAS 747
             +GV+ S  ++   +
Sbjct:   791 VGVEVSVSLKALGT 804

 Score = 79 (32.9 bits), Expect = 6.5e-28, Sum P(3) = 6.5e-28
 Identities = 17/66 (25%), Positives = 29/66 (43%)

Query:   218 DKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLIDDGWQSISHDEDPIDSEGINR 277
             D F +CTW++    +    ++  +  L + G     ++IDD WQS+  D          R
Sbjct:   334 DGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSDASRRRWER 393

Query:   278 TAAGEQ 283
               A +Q
Sbjct:   394 FEANQQ 399


>UNIPROTKB|Q97U94
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 219 (82.2 bits), Expect = 7.4e-28, Sum P(3) = 7.4e-28
 Identities = 48/123 (39%), Positives = 69/123 (56%)

Query:   492 FWLQGC--HMVHCAYNSLWMGNFIHPDWDMFQSTHPCAEFHAASRAISGGPIYVSDCVGK 549
             FW  G   H++  AYNSL   + ++PD+DMF S  P A+ H  +R  SGGPIY++D   +
Sbjct:   429 FWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPE 488

Query:   550 H-NFPLLKRLSMPDGSILRCEYYALPTRDCLFADPLHDGKTMLKIWNLNKYTGVIGAFNC 608
               N  LL+   +P+G ++R +  AL T D LF DPL + + +LK+    K    I  FN 
Sbjct:   489 RTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRE-RVLLKLKGKVKGYNAIAFFNL 547

Query:   609 QGG 611
               G
Sbjct:   548 NSG 550

 Score = 166 (63.5 bits), Expect = 7.4e-28, Sum P(3) = 7.4e-28
 Identities = 38/121 (31%), Positives = 70/121 (57%)

Query:   155 ADDYVDVCVESGSTKV-TG---DSFRSVVYVHLG--DDPFKLVKDAMRVVRSHLGTFKLL 208
             +++YV   +   S ++ TG   D  +   ++ +G  D+P+K +++A+ +      TFKL 
Sbjct:   152 SNNYVTAYLFGDSVRLYTGFNTDEIKRSYFLSIGTSDNPYKAIENAINIASKETFTFKLR 211

Query:   209 DEKT-PPPIVDKFGWCTWDAFYLT--VQPHGVMEGVKGLVDGGCPPGLVLIDDGWQSISH 265
              EK  P  +++  GWC+W+AF LT  +    +++ VKG+++ G     V+IDDGWQ  ++
Sbjct:   212 KEKGFPDKVMNGLGWCSWNAF-LTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNN 270

Query:   266 D 266
             D
Sbjct:   271 D 271

 Score = 56 (24.8 bits), Expect = 7.4e-28, Sum P(3) = 7.4e-28
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query:   310 DNKGMGAFIRDLKDEFKTVDQVYV--WHALCGYWGGL 344
             DNK      ++     K++   YV  WHA+  +WGG+
Sbjct:   280 DNKKFPNGFKNTVRAIKSLGVKYVGLWHAINAHWGGM 316

 Score = 39 (18.8 bits), Expect = 1.2e-07, Sum P(2) = 1.2e-07
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query:   598 KYTGVIGAFNCQGGGWCREARRN 620
             KY G+  A N   GG  +E  ++
Sbjct:   301 KYVGLWHAINAHWGGMSQELMKS 323


>ASPGD|ASPL0000010056
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 269 (99.8 bits), Expect = 5.3e-25, Sum P(2) = 5.3e-25
 Identities = 78/256 (30%), Positives = 126/256 (49%)

Query:   313 GMGAFIRDLKDEFKTVDQVYVWHALCGYWGGLRPNIPGLPEKTTVVKPKLSPGLELTMED 372
             G+   +  ++++ + ++ + VWHAL GYWGG+ P   G      + K +          +
Sbjct:   384 GLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPE--G--SLAAIYKTR----------E 429

Query:   373 LAVDKIVNNGVGFVPPELVDQMYEGLHSHLEKVGIDGVKVDVIHLLEILCENYGGRVDLA 432
             +A++      +  + P  + + Y   ++ L + GI GVK D    L++L +    R   A
Sbjct:   430 VALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLLADPEDRR-SYA 488

Query:   433 KAYYKALTASVRKHFKGNGVIASMEHCNDFML---LGT-EAIALGRVGDDFWCTDPSGDP 488
              AY  A T S  +HF G   I+ M      +    L T +   + R  +DF+   P  D 
Sbjct:   489 NAYQDAWTISSLRHF-GPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFF---PDIDD 544

Query:   489 NGTFWLQGCHMVHCAYNSLWMGNFIHPDWDMFQSTHP-----CAEFHAASRAISGGPIYV 543
             + T W   C+  H A  + ++     PDWDMFQ T P      A FHAA+R ISGGPIY+
Sbjct:   545 SHT-WHVFCN-AHNALLTRYLNGL--PDWDMFQ-TLPENGLDYASFHAAARCISGGPIYI 599

Query:   544 SDCVGKHNFPLLKRLS 559
             +D  G+H+ PL+K+++
Sbjct:   600 TDKPGQHDIPLIKQMT 615

 Score = 102 (41.0 bits), Expect = 5.3e-25, Sum P(2) = 5.3e-25
 Identities = 45/189 (23%), Positives = 83/189 (43%)

Query:    85 VPIGKLKNI-RFMSIFRFKVWWTTHWVGSN-GRDLENETQLVILDNSTDTGRPYVLLLPI 142
             +P+G   ++ RF ++ R +    T W+G   G+D  N T+  IL +   T   +V+LL +
Sbjct:   182 LPLGTPSSMSRFFALARVE----TSWLGPRQGKDKLNFTEDAILLSFLRTDGVHVVLLGV 237

Query:   143 VEGPFRASLQPGADDYVDVCVESGSTKVTGDSFRSVVYVHLGDDPFK---LVKDAMRVVR 199
                     L  G+    +V ++S +   T   F+ V+     D       L+ +A R+VR
Sbjct:   238 TVDDTLTVL--GSGPAGEVVIKSQNDNATPSRFQ-VLAATAADFEVATSALIYEARRLVR 294

Query:   200 SHLGTFKLLDEKTP--PPIVDKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLID 257
              +  T +    +T       D   +CTW+     +    ++  +  L   G     ++ID
Sbjct:   295 PYENTAQG-GPRTQWLSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKTAGIRIRTLIID 353

Query:   258 DGWQSISHD 266
             D WQS+ ++
Sbjct:   354 DNWQSLDNE 362


>UNIPROTKB|Q8A170
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 142 (55.0 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
 Identities = 33/92 (35%), Positives = 47/92 (51%)

Query:   498 HMVHCAYNSLWMGNFIHPDWDMFQSTHP-CAEFHAASRAISGGPIYVSDCVGKHNFPLLK 556
             H+     N+L +G  + PD DMF S    C    A S+AISGGP+Y+SD   +     ++
Sbjct:   446 HLFQSYTNTLILGQTVWPDHDMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIR 505

Query:   557 RLSMPDGSILRCEYYALPTRDCLFADPLHDGK 588
              L    G I R    A+PT + +  +PL  GK
Sbjct:   506 PLIDETGKIFRPAAPAIPTPESILTNPLQSGK 537

 Score = 88 (36.0 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query:   218 DKFGWCTWDAFYLTVQPHGVMEGVKGLVDGGCPPGLVLIDDG 259
             D  GWCTW+ ++  +    ++  +  +   G P   VLIDDG
Sbjct:   228 DYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG 269

 Score = 47 (21.6 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
 Identities = 6/12 (50%), Positives = 10/12 (83%)

Query:   333 VWHALCGYWGGL 344
             +W++L GYW G+
Sbjct:   309 LWYSLSGYWMGI 320


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.138   0.432    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      788       788   0.00095  121 3  11 22  0.40    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  629 (67 KB)
  Total size of DFA:  444 KB (2210 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:01
  No. of threads or processors used:  24
  Search cpu time:  65.09u 0.10s 65.19t   Elapsed:  00:00:05
  Total cpu time:  65.10u 0.10s 65.20t   Elapsed:  00:00:06
  Start:  Fri May 10 04:08:05 2013   End:  Fri May 10 04:08:11 2013
