BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>004090
MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM
GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL
PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK
YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW
QQIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNV
KYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVH
PKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNF
PDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPD
WDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR
DCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRV
TDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSN
ISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVR
GCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV

High Scoring Gene Products

Symbol, full name Information P value
SIP2
AT3G57520
protein from Arabidopsis thaliana 0.
SIP1
AT1G55740
protein from Arabidopsis thaliana 1.6e-259
RFS
Galactinol--sucrose galactosyltransferase
protein from Oryza sativa Japonica Group 1.2e-138
SIP1
AT5G40390
protein from Arabidopsis thaliana 7.4e-137
STS1
Stachyose synthase
protein from Pisum sativum 4.0e-124
STS
AT4G01970
protein from Arabidopsis thaliana 1.0e-123
MGG_11554
Seed imbibition protein
protein from Magnaporthe oryzae 70-15 2.1e-31
galS
Alpha-galactosidase
protein from Sulfolobus solfataricus P2 3.6e-31
BT_3797
Possible alpha-galactosidase
protein from Bacteroides thetaiotaomicron VPI-5482 1.9e-23

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  004090
        (774 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  3379  0.        1
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  2376  1.6e-259  2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  1306  1.2e-138  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  1340  7.4e-137  1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   783  4.0e-124  3
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   734  1.0e-123  3
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   340  7.2e-37   3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   337  2.1e-31   3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   238  3.6e-31   3
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  1.9e-23   4


>TAIR|locus:2103488 [details] [associations]
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 3379 (1194.5 bits), Expect = 0., P = 0.
 Identities = 618/775 (79%), Positives = 690/775 (89%)

Query:     1 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 60
             MT+  NIS+ + NLVV GKTILT +PDNIILTP  G G V+G+FIGAT   SKSLHVFP+
Sbjct:     1 MTITSNISVQNDNLVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQSKSLHVFPI 60

Query:    61 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 120
             GVLE LRFMCCFRFKLWWMTQRMG+CGKD+PLETQFML+ESKD  E + DD PT+YTVFL
Sbjct:    61 GVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAPTVYTVFL 120

Query:   121 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 180
             PLLEGQFR+ LQGNE NEIEIC ESGD AVET+QG +LVY HAG NPFEVI Q+VKAVE+
Sbjct:   121 PLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKAVER 180

Query:   181 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 240
             +MQTF HREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLS GGTPPKFLIIDDGW
Sbjct:   181 HMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGW 240

Query:   241 QQIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNV 300
             QQIENK K+E NC+VQEGAQFA+RL GIKEN+KFQK  Q   QVSGLK VVD +KQ HNV
Sbjct:   241 QQIENKEKDE-NCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAKQRHNV 299

Query:   301 KYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVH 360
             K VY WHALAGYWGGVKPAA GMEHYD+ALAYPV SPGV+GNQPDIVMDSLAVHGLGLV+
Sbjct:   300 KQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVN 359

Query:   361 PKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNF 420
             PKKVFNFYNELH+YLASCG+DGVKVDVQNIIETLGAG GGRVSLTRSY QALEASIARNF
Sbjct:   360 PKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNF 419

Query:   421 PDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPD 480
              DNGCISCMCHNTDG+YS+KQTA++RASDD+YPRDPASHTIHI+SVAYN+LFLGEFMQPD
Sbjct:   420 TDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPD 479

Query:   481 WDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 540
             WDMFHSLHP AEYH AARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA+LPGRPTR
Sbjct:   480 WDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTR 539

Query:   541 DCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRV 600
             DCLFADPARDG SLLK+WN+NK +G+VGVFNCQGAGWCK TKK +IHD SPGTLT S+R 
Sbjct:   540 DCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRA 599

Query:   601 TDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSN 660
              D + ++Q+AG  W+GD+IVYA+RSGEVVRLPKGAS+P+TLKVLEYELFH  PLKEI+ N
Sbjct:   600 DDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITEN 659

Query:   661 ISFAAIGLLDMFNSGGAVENVEV-HMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKV 719
             ISFA IGL+DMFNS GA+E++++ H+++K P+ FDGE+SS  + +LSDNRSPTA +S+ V
Sbjct:   660 ISFAPIGLVDMFNSSGAIESIDINHVTDKNPEFFDGEISSA-SPALSDNRSPTALVSVSV 718

Query:   720 RGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV 774
             RGCGRFG YSSQRPLKC V S +TDFTYD+  GL+T+ LPV  EEM+RW VEI V
Sbjct:   719 RGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHVEILV 773


>TAIR|locus:2020452 [details] [associations]
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 2376 (841.5 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
 Identities = 433/683 (63%), Positives = 539/683 (78%)

Query:     1 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 60
             MTV   IS++D +LVV G  +L GVP+N+++TP +G  L+ GAFIG T+  + S  VF +
Sbjct:     1 MTVGAGISVTDSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHRVFSL 60

Query:    61 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 120
             G LEDLRFMC FRFKLWWMTQRMGT GK++P ETQF++VE+   S+    D  + Y VFL
Sbjct:    61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGSDLGGRDQSSSYVVFL 120

Query:   121 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 180
             P+LEG FR+ LQGNE NE+EICLESGD  V+  +G +LV+  AG +PF+VI++AVKAVE+
Sbjct:   121 PILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAVKAVEQ 180

Query:   181 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 240
             ++QTF+HRE+KK+P  L+WFGWCTWDAFYT+VTA+ V +GL+SL AGG  PKF+IIDDGW
Sbjct:   181 HLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGW 240

Query:   241 QQIE-NKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS----GLKHVVDESK 295
             Q +  ++   E N      A FA+RLT IKEN KFQK  +   +V      L HV+ + K
Sbjct:   241 QSVGMDETSVEFNA--DNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIK 298

Query:   296 QNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHG 355
              N+++KYVYVWHA+ GYWGGVKP   GMEHY++ +AYPV+SPGVM ++    ++S+  +G
Sbjct:   299 SNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNG 358

Query:   356 LGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEAS 415
             LGLV+P+KVF+FYN+LH+YLAS GVDGVKVDVQNI+ETLGAGHGGRV L + YHQALEAS
Sbjct:   359 LGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEAS 418

Query:   416 IARNFPDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGE 475
             I+RNFPDNG ISCM HNTDG+YS+K+TAVIRASDD++PRDPASHTIHI+SVAYNTLFLGE
Sbjct:   419 ISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGE 478

Query:   476 FMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLP 535
             FMQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVL DGS+LRA+LP
Sbjct:   479 FMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLP 538

Query:   536 GRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLT 595
             GRPT DC F+DP RD  SLLK+WN+N+ +GV+GVFNCQGAGWCK  K+  IHD+ PGT++
Sbjct:   539 GRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTIS 598

Query:   596 ASVRVTDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLK 655
               VR  DV  + ++A   W GD+IVY+H  GE+V LPK  S+PVTL   EYE+F   P+K
Sbjct:   599 GCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVK 658

Query:   656 EISSNISFAAIGLLDMFNSGGAV 678
             E S    FA +GL++MFNSGGA+
Sbjct:   659 EFSDGSKFAPVGLMEMFNSGGAI 681

 Score = 145 (56.1 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
 Identities = 29/68 (42%), Positives = 42/68 (61%)

Query:   707 DNRSPTATISLKVRGCGRFGIYSS-QRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEM 765
             D+      + +K+RG G  G+YSS +RP   TV S   ++ Y+  +GL+T TL VPE+E+
Sbjct:   687 DDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKEL 746

Query:   766 YRWPVEIQ 773
             Y W V IQ
Sbjct:   747 YLWDVVIQ 754


>UNIPROTKB|Q5VQG4 [details] [associations]
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 1306 (464.8 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
 Identities = 289/724 (39%), Positives = 407/724 (56%)

Query:     5 PNISISDGNLVVHGKTILTGVPDNIILTPGNGV-------GLVAGAFIGATASHSKSLHV 57
             P  ++   +L V G   L  VP NI LTP + +          AG+F+G  A  +K  HV
Sbjct:    26 PRFTLKGKDLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHV 85

Query:    58 FPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYT 117
              P+G L D RFM  FRFK+WW T  +GT G+DV  ETQ M+++      S    GP  Y 
Sbjct:    86 VPIGKLRDTRFMSIFRFKVWWTTHWVGTNGRDVENETQMMILDQSGTKSSPT--GPRPYV 143

Query:   118 VFLPLLEGQFRSALQ-GNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVK 176
             + LP++EG FR+ L+ G   + + + LESG + V  +     VY HAG +PF+++  A++
Sbjct:   144 LLLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMR 203

Query:   177 AVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLII 236
              V  ++ TF   E+K  P  +D FGWCTWDAFY  V  EGV EG++ L+ GG PP  ++I
Sbjct:   204 VVRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLI 263

Query:   237 DDGWQQIENKPKE-----ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVV 291
             DDGWQ I +   +     E       G Q   RL   +EN KF+      E   G+   V
Sbjct:   264 DDGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFR------EYKGGMGGFV 317

Query:   292 DESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDS 350
              E K     V+ VYVWHAL GYWGG++P A G+      +  P  SPG+     D+ +D 
Sbjct:   318 REMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDK 375

Query:   351 LAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQ 410
             +  +G+GLV P++    Y  LH++L + G+DGVKVDV +++E +   +GGRV L ++Y  
Sbjct:   376 IVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFA 435

Query:   411 ALEASIARNFPDNGCISCMCHNTDG-IYSSKQTAVIRASDDYYPRDPASHT--------I 461
              L  S+ R+F  NG I+ M H  D  +  ++  A+ R  DD++  DP+            
Sbjct:   436 GLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGC 495

Query:   462 HISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRK 521
             H+   AYN+L++G F+ PDWDMF S HP A +H A+RAV G  +YVSD  G H+FDLLR+
Sbjct:   496 HMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRR 555

Query:   522 LVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKIT 581
             L LPDG++LR +    PTRDCLFADP  DG ++LK+WNVNK SGV+G FNCQG GW +  
Sbjct:   556 LALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREA 615

Query:   582 KKTRIHDESPGTLTASVRVTDVENMAQIAGAGWNGDAI-VYAHRSGEVVRLPKGASVPVT 640
             ++          +TA     DVE     +  G  GD   VY   + ++  L +  SV +T
Sbjct:   616 RRNMCAAGFSVPVTARASPADVE----WSHGGGGGDRFAVYFVEARKLQLLRRDESVELT 671

Query:   641 LKVLEYELFHFCPLKEISS---NISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEV 697
             L+   YEL    P++ I S    I FA IGL +M N+GGAV+  E   + +K    DG+V
Sbjct:   672 LEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFE---AARK----DGDV 724

Query:   698 SSEL 701
             ++E+
Sbjct:   725 AAEV 728

 Score = 72 (30.4 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query:   719 VRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLP 759
             V+G G    YSS RP  C V     +F Y+   G++T+ +P
Sbjct:   730 VKGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTVDVP 768


>TAIR|locus:2170528 [details] [associations]
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 1340 (476.8 bits), Expect = 7.4e-137, P = 7.4e-137
 Identities = 289/782 (36%), Positives = 443/782 (56%)

Query:     9 ISDGNLVVHGKTILTGVPDNIILTPG------NGVGL--VAGAFIGATAS-HSKSLHVFP 59
             + D  L+ +G+ +LT VP N+ LT        +GV L   AG+FIG       KS HV  
Sbjct:    24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83

Query:    60 MGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGP-TIYTV 118
             +G L+++RFM  FRFK+WW T  +G+ G+D+  ETQ ++++ +  S+S    G    Y +
Sbjct:    84 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRPYVL 142

Query:   119 FLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV 178
              LPLLEG FRS+ Q  E++++ +C+ESG   V  ++   +VY HAG +PF+++  A+K +
Sbjct:   143 LLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMKVI 202

Query:   179 EKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDD 238
               +M TF   E+K  P  +D FGWCTWDAFY  V  +GV +G+K L  GG PP  ++IDD
Sbjct:   203 RVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDD 262

Query:   239 GWQQIENKPKE---ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS-GLKHVVDES 294
             GWQ I +       E   I   G Q   RL   +EN KF+      +Q   G+K  V + 
Sbjct:   263 GWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAFVRDL 322

Query:   295 KQNHN-VKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAV 353
             K   + V Y+YVWHAL GYWGG++P A  +    + +  P  SPG+     D+ +D +  
Sbjct:   323 KDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIE 380

Query:   354 HGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALE 413
              G+G   P     FY  LH++L + G+DGVKVDV +I+E L   +GGRV L ++Y +AL 
Sbjct:   381 TGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALT 440

Query:   414 ASIARNFPDNGCISCMCHNTDGIYSSKQTAVI-RASDDYYPRDPASHT--------IHIS 464
             +S+ ++F  NG I+ M H  D ++   +   + R  DD++  DP+            H+ 
Sbjct:   441 SSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMV 500

Query:   465 SVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVL 524
               AYN+L++G F+QPDWDMF S HP AE+H A+RA+ G  IY+SD  G H+FDLL++LVL
Sbjct:   501 HCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVL 560

Query:   525 PDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKT 584
             P+GS+LR +    PTRD LF DP  DG ++LK+WN+NK +GV+G FNCQG GWC+ T++ 
Sbjct:   561 PNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRN 620

Query:   585 RIHDESPGTLTASVRVTDVE---NMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTL 641
             +   E   TLTA+    DVE     + I+ A     A+ +  +S +++       + +TL
Sbjct:   621 QCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL-FLSQSKKLLLSGLNDDLELTL 679

Query:   642 KVLEYELFHFCPLKEISSN-ISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSE 700
             +  ++EL    P+  I  N + FA IGL++M N+ GA+ ++          +++ E    
Sbjct:   680 EPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL----------VYNDE---- 725

Query:   701 LTTSLSDNRSPTATISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPV 760
                          ++ + V G G F +Y+S++P+ C +     +F Y+ +  ++ +    
Sbjct:   726 -------------SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSG 772

Query:   761 PE 762
             P+
Sbjct:   773 PD 774


>UNIPROTKB|Q93XK2 [details] [associations]
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 783 (280.7 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
 Identities = 180/488 (36%), Positives = 273/488 (55%)

Query:   209 YTDVTAEGVD-EGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGAQFASRLTG 267
             +TD+  +G++ E L+         K         +IE+K K+    +V+E       L G
Sbjct:   319 FTDLILKGIEHEKLRKKREEAISSK----SSDLAEIESKIKK----VVKE----IDDLFG 366

Query:   268 IKENSKFQKKCQNSEQVSGLKHVVDESKQNHN-VKYVYVWHALAGYWGGVKPAADGMEHY 326
              ++ S  +K    SE   GLK    + +     +  VYVWHAL G WGGV+P      H 
Sbjct:   367 GEQFSSGEKSEMKSEY--GLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HL 421

Query:   327 DTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVD 386
             DT +     SPG+ G   D+ +  ++   LGLVHP +    Y+ +H+YLA  G+ GVKVD
Sbjct:   422 DTKIVPCKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVD 481

Query:   387 VQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIY-SSKQTAVI 445
             V + +E +   +GGRV L + Y++ L  SI +NF  NG I+ M H  D  +  +KQ ++ 
Sbjct:   482 VIHSLEYVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMG 541

Query:   446 RASDDYYPRDPASHT--------IHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAA 497
             R  DD++ +DP            +H+   +YN+L++G+ +QPDWDMF S H  A++H  +
Sbjct:   542 RVGDDFWFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGS 601

Query:   498 RAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKV 557
             RA+ G  IYVSD  G+H+FDL++KLV PDG++ +      PTRDCLF +P  D T++LK+
Sbjct:   602 RAICGGPIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKI 661

Query:   558 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVE--NMAQIAGAGWN 615
             WN NK  GV+G FNCQGAGW  I +K R   E    +  +V VT+VE     + +  G  
Sbjct:   662 WNFNKYGGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKA 721

Query:   616 GDAIVYAHRSGEVVRLP-KGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNS 674
              + +VY +++ E+  +  K   +  T++   +EL+ F P+ ++   I FA IGL +MFNS
Sbjct:   722 EEYVVYLNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNS 781

Query:   675 GGAVENVE 682
             GG V ++E
Sbjct:   782 GGTVIDLE 789

 Score = 406 (148.0 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
 Identities = 85/238 (35%), Positives = 126/238 (52%)

Query:    42 GAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVES 101
             G F G +        +  +G      F+  FRFK WW TQ +G  G D+ +ETQ++L+E 
Sbjct:    72 GGFFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131

Query:   102 KDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYT 161
              +            Y V +P++E  FRSAL    N+ ++I  ESG   V+ +    + Y 
Sbjct:   132 PETKS---------YVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIAYV 182

Query:   162 HAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGL 221
             H   NP++++ +A  A+  ++ +F   E+K +P+ +D FGWCTWDAFY  V   G+  GL
Sbjct:   183 HFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGL 242

Query:   222 KSLSAGGTPPKFLIIDDGWQQIE---NKPKEESNCIVQEGAQFASRLTGIKENSKFQK 276
                S GG  P+F+IIDDGWQ I      P E++  +V  G Q + RL    E  KF+K
Sbjct:   243 DDFSKGGVEPRFVIIDDGWQSISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300

 Score = 66 (28.3 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
 Identities = 16/47 (34%), Positives = 25/47 (53%)

Query:   717 LKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 763
             +KV+G G F  YSS+ P K  +   + DF +    G + + +P  EE
Sbjct:   797 IKVKGGGSFLAYSSESPKKFQLNGCEVDFEW-LGDGKLCVNVPWIEE 842


>TAIR|locus:2141425 [details] [associations]
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 734 (263.4 bits), Expect = 1.0e-123, Sum P(3) = 1.0e-123
 Identities = 152/398 (38%), Positives = 237/398 (59%)

Query:   303 VYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPK 362
             +YVWHAL G W GV+P  + M      +A    SP +     D+ +D +   G+GLVHP 
Sbjct:   416 IYVWHALCGAWNGVRP--ETMMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473

Query:   363 KVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPD 422
             K   FY+ +H+YLAS GV G K+DV   +E+L   HGGRV L ++Y+  L  S+ +NF  
Sbjct:   474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533

Query:   423 NGCISCMCHNTDGIY-SSKQTAVIRASDDYYPRDPASHT--------IHISSVAYNTLFL 473
                I+ M    +  + ++KQ ++ R  DD++ +DP            +H+   +YN++++
Sbjct:   534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593

Query:   474 GEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPG--NHNFDLLRKLVLPDGSVLR 531
             G+ +QPDWDMF S H  AEYH A+RA+ G  +Y+SD  G  +HNFDL++KL   DG++ R
Sbjct:   594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653

Query:   532 AQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESP 591
                   PTRD LF +P  D  S+LK++N NK  GV+G FNCQGAGW     + + + E  
Sbjct:   654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713

Query:   592 GTLTASVRVTDVE--NMAQIAGAG--WNGDAIVYAHRSGEVVRL-PKGASVPVTLKVLEY 646
              T++ +V V+D+E     + AG+   + GD +VY  +S E++ +  K  ++ +TL+   +
Sbjct:   714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773

Query:   647 ELFHFCPLKE-ISSNISFAAIGLLDMFNSGGAVENVEV 683
             +L  F P+ E +SS + FA +GL++MFN  G V++++V
Sbjct:   774 DLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQDMKV 811

 Score = 425 (154.7 bits), Expect = 1.0e-123, Sum P(3) = 1.0e-123
 Identities = 106/307 (34%), Positives = 148/307 (48%)

Query:     5 PN-ISISDGNLVVHGKT-ILTGVPDNIILTP--GNGVGLVA--------------GAFIG 46
             PN  ++S+G+L     T IL  VP N+  TP   + +   A              G F+G
Sbjct:    31 PNSFNLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILLRVQANAHKGGFLG 90

Query:    47 ATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSE 106
              T           +G  ED  F+  FRFK+WW T  +G  G D+  ETQ+++++     E
Sbjct:    91 FTKESPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIP---E 147

Query:   107 SDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPN 166
              D       Y   +P +EG FR++L   E   + IC ESG   V+ +    + Y H   N
Sbjct:   148 IDS------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIAYIHICDN 201

Query:   167 PFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSA 226
             P+ ++ +A  A+  +M TF   E+KKLP  +D FGWCTWDA Y  V    +  G+K    
Sbjct:   202 PYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFED 261

Query:   227 GGTPPKFLIIDDGWQQI----ENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSE 282
             GG  PKF+IIDDGWQ I    +   K+  N +V  G Q  +RLT  KE  KF+     S 
Sbjct:   262 GGVCPKFVIIDDGWQSINFDGDELDKDAEN-LVLGGEQMTARLTSFKECKKFRNYKGGSF 320

Query:   283 QVSGLKH 289
               S   H
Sbjct:   321 ITSDASH 327

 Score = 92 (37.4 bits), Expect = 1.0e-123, Sum P(3) = 1.0e-123
 Identities = 18/50 (36%), Positives = 30/50 (60%)

Query:   714 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 763
             +I + V+G GRF  YSS  P+KC +   + +F ++  TG ++  +P  EE
Sbjct:   816 SIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEE 865


>ASPGD|ASPL0000010056 [details] [associations]
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 340 (124.7 bits), Expect = 7.2e-37, Sum P(3) = 7.2e-37
 Identities = 94/305 (30%), Positives = 152/305 (49%)

Query:   285 SGLKHVVDESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQ 343
             +GL   V   ++ H N++Y+ VWHAL GYWGG+ P       Y T               
Sbjct:   383 NGLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTR-------------- 428

Query:   344 PDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVS 403
              ++ ++S     +  + P  +  FYN+ +A+L+  G+ GVK D Q+ ++ L A    R S
Sbjct:   429 -EVALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRS 486

Query:   404 LTRSYHQALEASIARNFPDNG--CISCMCHNT--DGIYSSKQTAVIRASDDYYPRDPASH 459
                +Y  A   S  R+F      C+S +        + ++K T V+R S+D++P    SH
Sbjct:   487 YANAYQDAWTISSLRHFGPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546

Query:   460 TIHISSVAYNTLFLGEFMQ--PDWDMFHSLHPA----AEYHGAARAVGGCAIYVSDKPGN 513
             T H+   A+N L L  ++   PDWDMF +L       A +H AAR + G  IY++DKPG 
Sbjct:   547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605

Query:   514 HNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSL-LKVWN--VNKCSGV 566
             H+  L++++      G+   LR  +  R T D ++ D  ++G  L +  ++      SG+
Sbjct:   606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662

Query:   567 VGVFN 571
             +GVFN
Sbjct:   663 IGVFN 667

 Score = 123 (48.4 bits), Expect = 7.2e-37, Sum P(3) = 7.2e-37
 Identities = 57/221 (25%), Positives = 93/221 (42%)

Query:    33 PGNGVGLVAGAFIGATASHSKSLHVFPMGVLEDL-RFMCCFRFKLWWMTQRMGTCGKDVP 91
             PG  +  ++G    A   HS  L + P+G    + RF    R +  W+  R G   KD  
Sbjct:   158 PGAALWNISGPVEEARDGHSGLLRL-PLGTPSSMSRFFALARVETSWLGPRQG---KDKL 213

Query:    92 LETQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVE 151
               T+  ++ S   +     DG  ++ V L +      + L      E+ I  ++ DNA  
Sbjct:   214 NFTEDAILLSFLRT-----DG--VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQN-DNATP 265

Query:   152 TNQGLYLVYTHAGPNPFEVISQAV-----KAVEKYMQTFTHREKKK-LPSFLDWFGWCTW 205
             +   + L  T A    FEV + A+     + V  Y  T     + + L  + D   +CTW
Sbjct:   266 SRFQV-LAATAAD---FEVATSALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTW 321

Query:   206 DAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENK 246
             +    D++ E +   L  L   G   + LIIDD WQ ++N+
Sbjct:   322 NGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNE 362

 Score = 61 (26.5 bits), Expect = 7.2e-37, Sum P(3) = 7.2e-37
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query:   619 IVYAHRSGEVV-RLPKGASVPVTLKVLEYELFHFCPLKEIS 658
             IV AHR+G +V  L   ++V VTL    +E+    P+K ++
Sbjct:   695 IVRAHRTGRIVGELHSSSAVSVTLNERRWEVLTAYPVKTLT 735

 Score = 41 (19.5 bits), Expect = 8.6e-35, Sum P(3) = 8.6e-35
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query:   667 GLLDMFNSGGAVENVEVHMSEKKPDLFDGE 696
             G++ +FN    VE+V + +++  P ++D +
Sbjct:   661 GIIGVFNVSNRVESVIIPVADF-PGIYDDQ 689


>UNIPROTKB|G4NBB7 [details] [associations]
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 337 (123.7 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
 Identities = 103/331 (31%), Positives = 156/331 (47%)

Query:   273 KFQKKCQNSEQVSGLKHVVDE-SKQNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALA 331
             +F+   Q   Q  GLK +V E  KQN  ++ + VWH + GYWGG+ P+      Y     
Sbjct:   393 RFEANQQGFPQ--GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKI 450

Query:   332 YPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNII 391
                    V   QP    D   V G      + V   Y++ +A+LA CGV   KVD Q  +
Sbjct:   451 QLRDEAEV---QPKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFL 500

Query:   392 ETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS--KQ------TA 443
             +   A    R +L R Y  A  A+ +++F     I+CM      I  S  +Q        
Sbjct:   501 D-YPAHANDRKNLIRPYQDAWTAAASKHFGGRA-IACMAQTPQSILHSLLQQGRSEGPML 558

Query:   444 VIRASDDYYPRDPASHTIHISSVAYNTLFLGEF-MQPDWDMFHSLHPA-AEYHGAARAVG 501
             + R SDD++P +  SHT H+   A+N L +    +  DWDMF +  P  A  H  AR++ 
Sbjct:   559 MARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMS 618

Query:   502 GCAIYVSDKPGNHNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSLLKV 557
             G  IY++D PG H+ +L++++     DG    LRA  PGR     L+         LL+V
Sbjct:   619 GGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRV 674

Query:   558 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 588
              + ++  G++GVFN    G   + ++ R+ D
Sbjct:   675 RSGHQGVGMLGVFNVCNRG-SLLGEQVRLDD 704

 Score = 90 (36.7 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
 Identities = 18/62 (29%), Positives = 32/62 (51%)

Query:   190 KKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKE 249
             + ++  + D F +CTW++   D++ + +   L  LS  G     LIIDD WQ ++    +
Sbjct:   326 RAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSD 385

Query:   250 ES 251
              S
Sbjct:   386 AS 387

 Score = 46 (21.3 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query:   666 IGLLDMFN--SGGAVENVEVHMSEKKPDLFDGEVSSE 700
             +G+L +FN  + G++   +V +     D+FDGE + E
Sbjct:   681 VGMLGVFNVCNRGSLLGEQVRLD----DIFDGEKAGE 713


>UNIPROTKB|Q97U94 [details] [associations]
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 238 (88.8 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
 Identities = 67/192 (34%), Positives = 96/192 (50%)

Query:   381 DGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSSK 440
             D VKVD Q +I  +       ++ +R+   AL+ S+ ++      I+CM  N +   +  
Sbjct:   362 DLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGKDV-----INCMSMNPENYCNYF 415

Query:   441 QTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAV 500
              + V+R S DY P       +HI   AYN+L     + PD+DMF S  P A+ H  AR  
Sbjct:   416 YSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVF 475

Query:   501 GGCAIYVSDK-PGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN 559
              G  IY++D+ P   N +LLR  VLP+G V+R   P   T D LF DP R+   LLK+  
Sbjct:   476 SGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRERV-LLKLKG 534

Query:   560 VNKCSGVVGVFN 571
               K    +  FN
Sbjct:   535 KVKGYNAIAFFN 546

 Score = 156 (60.0 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
 Identities = 42/139 (30%), Positives = 64/139 (46%)

Query:   116 YTVFLPLLEGQFRSALQGNENNEI-------EICLESGDNAVETNQGLYLVYTHAGPNPF 168
             YTVF  +  G    A     NN +        + L +G N  E  +  Y +      NP+
Sbjct:   133 YTVFALVKSGNSYEAFFTLSNNYVTAYLFGDSVRLYTGFNTDEIKRS-YFLSIGTSDNPY 191

Query:   169 EVISQAVKAVEKYMQTFTHREKKKLPS-FLDWFGWCTWDAFYT-DVTAEGVDEGLKSLSA 226
             + I  A+    K   TF  R++K  P   ++  GWC+W+AF T D+  E + + +K +  
Sbjct:   192 KAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIE 251

Query:   227 GGTPPKFLIIDDGWQQIEN 245
              G    ++IIDDGWQ   N
Sbjct:   252 RGLRLNWVIIDDGWQDQNN 270

 Score = 77 (32.2 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
 Identities = 21/63 (33%), Positives = 32/63 (50%)

Query:   254 IVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYW 313
             I+ +G Q  +    I+  +   KK  N     G K+ V   K +  VKYV +WHA+  +W
Sbjct:   260 IIDDGWQDQNNDRAIRSLNPDNKKFPN-----GFKNTVRAIK-SLGVKYVGLWHAINAHW 313

Query:   314 GGV 316
             GG+
Sbjct:   314 GGM 316


>UNIPROTKB|Q8A170 [details] [associations]
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 196 (74.1 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
 Identities = 54/191 (28%), Positives = 84/191 (43%)

Query:   362 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 421
             +K+  +Y      +   G D +K+D Q+    L  G    +   +  + ALE    R   
Sbjct:   348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHRM-- 405

Query:   422 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 481
               G ++CM  N   I  +  ++V RAS DY   D      H+     NTL LG+ + PD 
Sbjct:   406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465

Query:   482 DMFHSLHPAA-EYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 540
             DMFHS           ++A+ G  +Y+SD P     D +R L+   G + R   P  PT 
Sbjct:   466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525

Query:   541 DCLFADPARDG 551
             + +  +P + G
Sbjct:   526 ESILTNPLQSG 536

 Score = 130 (50.8 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
 Identities = 34/151 (22%), Positives = 79/151 (52%)

Query:   129 SALQGNENNEIEICLES-GDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV--EKYMQTF 185
             S  Q N++  + + + + G++A+ T +   L++  +  + + V S A  ++  +K +   
Sbjct:   158 SWFQVNQDGTLTLYVSTLGEDAL-TGRLPLLIFRKSS-SVYHVFSDAYDSLIADKAVSAL 215

Query:   186 THREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIEN 245
               R  K+  +  D+ GWCTW+ ++ D+    +   + ++ A G P ++++IDDG   I N
Sbjct:   216 RKRADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG--HIAN 273

Query:   246 KPKEESNCIVQEGAQFASRLTGIKENSKFQK 276
             K ++ ++ +V +  +F +  + I +  +  K
Sbjct:   274 KNRQLTS-LVPDKKRFPNGWSRIMKRKQADK 303

 Score = 66 (28.3 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query:   295 KQNHNVKYVYVWHALAGYWGGVKPAAD 321
             KQ   ++++ +W++L+GYW G+    D
Sbjct:   299 KQADKIRWIGLWYSLSGYWMGISAEND 325

 Score = 50 (22.7 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
 Identities = 19/80 (23%), Positives = 37/80 (46%)

Query:   648 LFHFCPLKEISSNISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSD 707
             LFH CP+++      +A IG+ + + S   V+ ++   +EK   + D   +  L    +D
Sbjct:   621 LFHLCPIRK-----GWAVIGIQEKYLSPATVQILK-RTTEKL--ILDVHCTGTLRI-WAD 671

Query:   708 NRSPTATISLKVRGCGRFGI 727
             +       S+ ++  GR  I
Sbjct:   672 SHGKQELRSIPIKKAGRIEI 691


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.135   0.417    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      774       774   0.00093  121 3  11 22  0.41    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  624 (66 KB)
  Total size of DFA:  424 KB (2203 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  63.93u 0.11s 64.04t   Elapsed:  00:00:03
  Total cpu time:  63.94u 0.11s 64.05t   Elapsed:  00:00:03
  Start:  Sat May 11 00:40:47 2013   End:  Sat May 11 00:40:50 2013

Back to top