BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>003500
MLRTAHVQLQQLKWFPALWKSRGHHRISFQNYKPLVLRRSKMTVAPNISISDGNLVVHGK
TILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWM
TQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEI
EICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDW
FGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGA
QFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYWGGVKPA
ADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCG
VDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS
KQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARA
VGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN
VNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVENMAQIAGAGWNGDAI
VYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNSGGAVE
NVEVHMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVRGCGRFGIYSSQRPLKCTVG
SIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV

High Scoring Gene Products

Symbol     Full name                                  Information                                         P value
SIP2       AT3G57520                                  protein from Arabidopsis thaliana                   0.
SIP1       AT1G55740                                  protein from Arabidopsis thaliana                   1.6e-259
RFS        Galactinol--sucrose galactosyltransferase  protein from Oryza sativa Japonica Group            1.2e-138
SIP1       AT5G40390                                  protein from Arabidopsis thaliana                   7.4e-137
STS        AT4G01970                                  protein from Arabidopsis thaliana                   3.5e-125
STS1       Stachyose synthase                         protein from Pisum sativum                          4.0e-124
MGG_11554  Seed imbibition protein                    protein from Magnaporthe oryzae 70-15               3.0e-31
galS       Alpha-galactosidase                        protein from Sulfolobus solfataricus P2             4.6e-31
BT_3797    Possible alpha-galactosidase               protein from Bacteroides thetaiotaomicron VPI-5482  2.6e-23

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  003500
        (815 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  3379  0.        1
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  2376  1.6e-259  2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  1306  1.2e-138  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  1340  7.4e-137  1
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   734  3.5e-125  3
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   783  4.0e-124  3
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   340  1.0e-36   3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   337  3.0e-31   3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   238  4.6e-31   3
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  2.6e-23   4
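
The summary rows above have a regular layout: a "DATABASE|accession" identifier, a truncated description, and then three whitespace-separated columns (High Score, Smallest Sum Probability P(N), and N). The following is a minimal Python sketch of how such rows could be read into records. The names Hit and parse_summary_row are hypothetical and not part of any BLAST tool, and applying the 0.001 threshold from the Parameters section to P(N) is an assumption made for illustration.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing High-scoring Segment Pairs' table."""
    database: str
    accession: str
    score: int
    p_value: float
    n: int


def parse_summary_row(line: str) -> Hit:
    """Parse one summary row (hypothetical helper, not part of BLAST itself).

    The last three whitespace-separated fields are Score, P(N) and N;
    everything before them is the identifier plus a truncated description.
    """
    head, score, p_value, n = line.rsplit(None, 3)
    identifier = head.split(" - ", 1)[0]          # e.g. "TAIR|locus:2103488"
    database, accession = identifier.split("|", 1)
    # float() accepts both "0." and "1.2e-138" as written in the report.
    return Hit(database, accession, int(score), float(p_value), int(n))


# Three rows copied from the table above.
rows = [
    'TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  3379  0.        1',
    'UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  1306  1.2e-138  2',
    'UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  2.6e-23   4',
]
hits = [parse_summary_row(row) for row in rows]

# Assumption: the report's "Threshold: 0.001" is treated as a cutoff on P(N).
THRESHOLD = 0.001
significant = [hit for hit in hits if hit.p_value <= THRESHOLD]
```

Splitting from the right keeps the sketch independent of the description text, which is truncated with "..." in the report and may itself contain spaces, digits, and quotes.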


>TAIR|locus:2103488
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 3379 (1194.5 bits), Expect = 0., P = 0.
 Identities = 618/775 (79%), Positives = 690/775 (89%)

Query:    42 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 101
             MT+  NIS+ + NLVV GKTILT +PDNIILTP  G G V+G+FIGAT   SKSLHVFP+
Sbjct:     1 MTITSNISVQNDNLVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQSKSLHVFPI 60

Query:   102 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 161
             GVLE LRFMCCFRFKLWWMTQRMG+CGKD+PLETQFML+ESKD  E + DD PT+YTVFL
Sbjct:    61 GVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAPTVYTVFL 120

Query:   162 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 221
             PLLEGQFR+ LQGNE NEIEIC ESGD AVET+QG +LVY HAG NPFEVI Q+VKAVE+
Sbjct:   121 PLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKAVER 180

Query:   222 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 281
             +MQTF HREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLS GGTPPKFLIIDDGW
Sbjct:   181 HMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGW 240

Query:   282 QQIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNV 341
             QQIENK K+E NC+VQEGAQFA+RL GIKEN+KFQK  Q   QVSGLK VVD +KQ HNV
Sbjct:   241 QQIENKEKDE-NCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAKQRHNV 299

Query:   342 KYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVH 401
             K VY WHALAGYWGGVKPAA GMEHYD+ALAYPV SPGV+GNQPDIVMDSLAVHGLGLV+
Sbjct:   300 KQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVN 359

Query:   402 PKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNF 461
             PKKVFNFYNELH+YLASCG+DGVKVDVQNIIETLGAG GGRVSLTRSY QALEASIARNF
Sbjct:   360 PKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNF 419

Query:   462 PDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPD 521
              DNGCISCMCHNTDG+YS+KQTA++RASDD+YPRDPASHTIHI+SVAYN+LFLGEFMQPD
Sbjct:   420 TDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPD 479

Query:   522 WDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 581
             WDMFHSLHP AEYH AARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA+LPGRPTR
Sbjct:   480 WDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTR 539

Query:   582 DCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRV 641
             DCLFADPARDG SLLK+WN+NK +G+VGVFNCQGAGWCK TKK +IHD SPGTLT S+R 
Sbjct:   540 DCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRA 599

Query:   642 TDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSN 701
              D + ++Q+AG  W+GD+IVYA+RSGEVVRLPKGAS+P+TLKVLEYELFH  PLKEI+ N
Sbjct:   600 DDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITEN 659

Query:   702 ISFAAIGLLDMFNSGGAVENVEV-HMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKV 760
             ISFA IGL+DMFNS GA+E++++ H+++K P+ FDGE+SS  + +LSDNRSPTA +S+ V
Sbjct:   660 ISFAPIGLVDMFNSSGAIESIDINHVTDKNPEFFDGEISSA-SPALSDNRSPTALVSVSV 718

Query:   761 RGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV 815
             RGCGRFG YSSQRPLKC V S +TDFTYD+  GL+T+ LPV  EEM+RW VEI V
Sbjct:   719 RGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHVEILV 773


>TAIR|locus:2020452
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 2376 (841.5 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
 Identities = 433/683 (63%), Positives = 539/683 (78%)

Query:    42 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 101
             MTV   IS++D +LVV G  +L GVP+N+++TP +G  L+ GAFIG T+  + S  VF +
Sbjct:     1 MTVGAGISVTDSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHRVFSL 60

Query:   102 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 161
             G LEDLRFMC FRFKLWWMTQRMGT GK++P ETQF++VE+   S+    D  + Y VFL
Sbjct:    61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGSDLGGRDQSSSYVVFL 120

Query:   162 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 221
             P+LEG FR+ LQGNE NE+EICLESGD  V+  +G +LV+  AG +PF+VI++AVKAVE+
Sbjct:   121 PILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAVKAVEQ 180

Query:   222 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 281
             ++QTF+HRE+KK+P  L+WFGWCTWDAFYT+VTA+ V +GL+SL AGG  PKF+IIDDGW
Sbjct:   181 HLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGW 240

Query:   282 QQIE-NKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS----GLKHVVDESK 336
             Q +  ++   E N      A FA+RLT IKEN KFQK  +   +V      L HV+ + K
Sbjct:   241 QSVGMDETSVEFNA--DNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIK 298

Query:   337 QNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHG 396
              N+++KYVYVWHA+ GYWGGVKP   GMEHY++ +AYPV+SPGVM ++    ++S+  +G
Sbjct:   299 SNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNG 358

Query:   397 LGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEAS 456
             LGLV+P+KVF+FYN+LH+YLAS GVDGVKVDVQNI+ETLGAGHGGRV L + YHQALEAS
Sbjct:   359 LGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEAS 418

Query:   457 IARNFPDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGE 516
             I+RNFPDNG ISCM HNTDG+YS+K+TAVIRASDD++PRDPASHTIHI+SVAYNTLFLGE
Sbjct:   419 ISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGE 478

Query:   517 FMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLP 576
             FMQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVL DGS+LRA+LP
Sbjct:   479 FMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLP 538

Query:   577 GRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLT 636
             GRPT DC F+DP RD  SLLK+WN+N+ +GV+GVFNCQGAGWCK  K+  IHD+ PGT++
Sbjct:   539 GRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTIS 598

Query:   637 ASVRVTDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLK 696
               VR  DV  + ++A   W GD+IVY+H  GE+V LPK  S+PVTL   EYE+F   P+K
Sbjct:   599 GCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVK 658

Query:   697 EISSNISFAAIGLLDMFNSGGAV 719
             E S    FA +GL++MFNSGGA+
Sbjct:   659 EFSDGSKFAPVGLMEMFNSGGAI 681

 Score = 145 (56.1 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
 Identities = 29/68 (42%), Positives = 42/68 (61%)

Query:   748 DNRSPTATISLKVRGCGRFGIYSS-QRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEM 806
             D+      + +K+RG G  G+YSS +RP   TV S   ++ Y+  +GL+T TL VPE+E+
Sbjct:   687 DDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKEL 746

Query:   807 YRWPVEIQ 814
             Y W V IQ
Sbjct:   747 YLWDVVIQ 754


>UNIPROTKB|Q5VQG4
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 1306 (464.8 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
 Identities = 289/724 (39%), Positives = 407/724 (56%)

Query:    46 PNISISDGNLVVHGKTILTGVPDNIILTPGNGV-------GLVAGAFIGATASHSKSLHV 98
             P  ++   +L V G   L  VP NI LTP + +          AG+F+G  A  +K  HV
Sbjct:    26 PRFTLKGKDLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHV 85

Query:    99 FPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYT 158
              P+G L D RFM  FRFK+WW T  +GT G+DV  ETQ M+++      S    GP  Y 
Sbjct:    86 VPIGKLRDTRFMSIFRFKVWWTTHWVGTNGRDVENETQMMILDQSGTKSSPT--GPRPYV 143

Query:   159 VFLPLLEGQFRSALQ-GNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVK 217
             + LP++EG FR+ L+ G   + + + LESG + V  +     VY HAG +PF+++  A++
Sbjct:   144 LLLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMR 203

Query:   218 AVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLII 277
              V  ++ TF   E+K  P  +D FGWCTWDAFY  V  EGV EG++ L+ GG PP  ++I
Sbjct:   204 VVRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLI 263

Query:   278 DDGWQQIENKPKE-----ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVV 332
             DDGWQ I +   +     E       G Q   RL   +EN KF+      E   G+   V
Sbjct:   264 DDGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFR------EYKGGMGGFV 317

Query:   333 DESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDS 391
              E K     V+ VYVWHAL GYWGG++P A G+      +  P  SPG+     D+ +D 
Sbjct:   318 REMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDK 375

Query:   392 LAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQ 451
             +  +G+GLV P++    Y  LH++L + G+DGVKVDV +++E +   +GGRV L ++Y  
Sbjct:   376 IVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFA 435

Query:   452 ALEASIARNFPDNGCISCMCHNTDG-IYSSKQTAVIRASDDYYPRDPASHT--------I 502
              L  S+ R+F  NG I+ M H  D  +  ++  A+ R  DD++  DP+            
Sbjct:   436 GLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGC 495

Query:   503 HISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRK 562
             H+   AYN+L++G F+ PDWDMF S HP A +H A+RAV G  +YVSD  G H+FDLLR+
Sbjct:   496 HMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRR 555

Query:   563 LVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKIT 622
             L LPDG++LR +    PTRDCLFADP  DG ++LK+WNVNK SGV+G FNCQG GW +  
Sbjct:   556 LALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREA 615

Query:   623 KKTRIHDESPGTLTASVRVTDVENMAQIAGAGWNGDAI-VYAHRSGEVVRLPKGASVPVT 681
             ++          +TA     DVE     +  G  GD   VY   + ++  L +  SV +T
Sbjct:   616 RRNMCAAGFSVPVTARASPADVE----WSHGGGGGDRFAVYFVEARKLQLLRRDESVELT 671

Query:   682 LKVLEYELFHFCPLKEISS---NISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEV 738
             L+   YEL    P++ I S    I FA IGL +M N+GGAV+  E   + +K    DG+V
Sbjct:   672 LEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFE---AARK----DGDV 724

Query:   739 SSEL 742
             ++E+
Sbjct:   725 AAEV 728

 Score = 72 (30.4 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query:   760 VRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLP 800
             V+G G    YSS RP  C V     +F Y+   G++T+ +P
Sbjct:   730 VKGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTVDVP 768


>TAIR|locus:2170528
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 1340 (476.8 bits), Expect = 7.4e-137, P = 7.4e-137
 Identities = 289/782 (36%), Positives = 443/782 (56%)

Query:    50 ISDGNLVVHGKTILTGVPDNIILTPG------NGVGL--VAGAFIGATAS-HSKSLHVFP 100
             + D  L+ +G+ +LT VP N+ LT        +GV L   AG+FIG       KS HV  
Sbjct:    24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83

Query:   101 MGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGP-TIYTV 159
             +G L+++RFM  FRFK+WW T  +G+ G+D+  ETQ ++++ +  S+S    G    Y +
Sbjct:    84 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRPYVL 142

Query:   160 FLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV 219
              LPLLEG FRS+ Q  E++++ +C+ESG   V  ++   +VY HAG +PF+++  A+K +
Sbjct:   143 LLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMKVI 202

Query:   220 EKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDD 279
               +M TF   E+K  P  +D FGWCTWDAFY  V  +GV +G+K L  GG PP  ++IDD
Sbjct:   203 RVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDD 262

Query:   280 GWQQIENKPKE---ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS-GLKHVVDES 335
             GWQ I +       E   I   G Q   RL   +EN KF+      +Q   G+K  V + 
Sbjct:   263 GWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAFVRDL 322

Query:   336 KQNHN-VKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAV 394
             K   + V Y+YVWHAL GYWGG++P A  +    + +  P  SPG+     D+ +D +  
Sbjct:   323 KDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIE 380

Query:   395 HGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALE 454
              G+G   P     FY  LH++L + G+DGVKVDV +I+E L   +GGRV L ++Y +AL 
Sbjct:   381 TGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALT 440

Query:   455 ASIARNFPDNGCISCMCHNTDGIYSSKQTAVI-RASDDYYPRDPASHT--------IHIS 505
             +S+ ++F  NG I+ M H  D ++   +   + R  DD++  DP+            H+ 
Sbjct:   441 SSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMV 500

Query:   506 SVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVL 565
               AYN+L++G F+QPDWDMF S HP AE+H A+RA+ G  IY+SD  G H+FDLL++LVL
Sbjct:   501 HCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVL 560

Query:   566 PDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKT 625
             P+GS+LR +    PTRD LF DP  DG ++LK+WN+NK +GV+G FNCQG GWC+ T++ 
Sbjct:   561 PNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRN 620

Query:   626 RIHDESPGTLTASVRVTDVE---NMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTL 682
             +   E   TLTA+    DVE     + I+ A     A+ +  +S +++       + +TL
Sbjct:   621 QCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL-FLSQSKKLLLSGLNDDLELTL 679

Query:   683 KVLEYELFHFCPLKEISSN-ISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSE 741
             +  ++EL    P+  I  N + FA IGL++M N+ GA+ ++          +++ E    
Sbjct:   680 EPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL----------VYNDE---- 725

Query:   742 LTTSLSDNRSPTATISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPV 801
                          ++ + V G G F +Y+S++P+ C +     +F Y+ +  ++ +    
Sbjct:   726 -------------SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSG 772

Query:   802 PE 803
             P+
Sbjct:   773 PD 774


>TAIR|locus:2141425
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 734 (263.4 bits), Expect = 3.5e-125, Sum P(3) = 3.5e-125
 Identities = 152/398 (38%), Positives = 237/398 (59%)

Query:   344 VYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPK 403
             +YVWHAL G W GV+P  + M      +A    SP +     D+ +D +   G+GLVHP 
Sbjct:   416 IYVWHALCGAWNGVRP--ETMMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473

Query:   404 KVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPD 463
             K   FY+ +H+YLAS GV G K+DV   +E+L   HGGRV L ++Y+  L  S+ +NF  
Sbjct:   474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533

Query:   464 NGCISCMCHNTDGIY-SSKQTAVIRASDDYYPRDPASHT--------IHISSVAYNTLFL 514
                I+ M    +  + ++KQ ++ R  DD++ +DP            +H+   +YN++++
Sbjct:   534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593

Query:   515 GEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPG--NHNFDLLRKLVLPDGSVLR 572
             G+ +QPDWDMF S H  AEYH A+RA+ G  +Y+SD  G  +HNFDL++KL   DG++ R
Sbjct:   594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653

Query:   573 AQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESP 632
                   PTRD LF +P  D  S+LK++N NK  GV+G FNCQGAGW     + + + E  
Sbjct:   654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713

Query:   633 GTLTASVRVTDVE--NMAQIAGAG--WNGDAIVYAHRSGEVVRL-PKGASVPVTLKVLEY 687
              T++ +V V+D+E     + AG+   + GD +VY  +S E++ +  K  ++ +TL+   +
Sbjct:   714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773

Query:   688 ELFHFCPLKE-ISSNISFAAIGLLDMFNSGGAVENVEV 724
             +L  F P+ E +SS + FA +GL++MFN  G V++++V
Sbjct:   774 DLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQDMKV 811

 Score = 439 (159.6 bits), Expect = 3.5e-125, Sum P(3) = 3.5e-125
 Identities = 110/320 (34%), Positives = 155/320 (48%)

Query:    33 KPLVLRRSKMTVAPN-ISISDGNLVVHGKT-ILTGVPDNIILTP--GNGVGLVA------ 82
             KPL +  +K  + PN  ++S+G+L     T IL  VP N+  TP   + +   A      
Sbjct:    18 KPLFVPITKPILQPNSFNLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILL 77

Query:    83 --------GAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLE 134
                     G F+G T           +G  ED  F+  FRFK+WW T  +G  G D+  E
Sbjct:    78 RVQANAHKGGFLGFTKESPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAE 137

Query:   135 TQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETN 194
             TQ+++++     E D       Y   +P +EG FR++L   E   + IC ESG   V+ +
Sbjct:   138 TQWVMLKIP---EIDS------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKES 188

Query:   195 QGLYLVYTHAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVT 254
                 + Y H   NP+ ++ +A  A+  +M TF   E+KKLP  +D FGWCTWDA Y  V 
Sbjct:   189 SFKSIAYIHICDNPYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVD 248

Query:   255 AEGVDEGLKSLSAGGTPPKFLIIDDGWQQI----ENKPKEESNCIVQEGAQFASRLTGIK 310
                +  G+K    GG  PKF+IIDDGWQ I    +   K+  N +V  G Q  +RLT  K
Sbjct:   249 PATIWTGVKEFEDGGVCPKFVIIDDGWQSINFDGDELDKDAEN-LVLGGEQMTARLTSFK 307

Query:   311 ENSKFQKKCQNSEQVSGLKH 330
             E  KF+     S   S   H
Sbjct:   308 ECKKFRNYKGGSFITSDASH 327

 Score = 92 (37.4 bits), Expect = 3.5e-125, Sum P(3) = 3.5e-125
 Identities = 18/50 (36%), Positives = 30/50 (60%)

Query:   755 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 804
             +I + V+G GRF  YSS  P+KC +   + +F ++  TG ++  +P  EE
Sbjct:   816 SIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEE 865


>UNIPROTKB|Q93XK2
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 783 (280.7 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
 Identities = 180/488 (36%), Positives = 273/488 (55%)

Query:   250 YTDVTAEGVD-EGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGAQFASRLTG 308
             +TD+  +G++ E L+         K         +IE+K K+    +V+E       L G
Sbjct:   319 FTDLILKGIEHEKLRKKREEAISSK----SSDLAEIESKIKK----VVKE----IDDLFG 366

Query:   309 IKENSKFQKKCQNSEQVSGLKHVVDESKQNHN-VKYVYVWHALAGYWGGVKPAADGMEHY 367
              ++ S  +K    SE   GLK    + +     +  VYVWHAL G WGGV+P      H 
Sbjct:   367 GEQFSSGEKSEMKSEY--GLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HL 421

Query:   368 DTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVD 427
             DT +     SPG+ G   D+ +  ++   LGLVHP +    Y+ +H+YLA  G+ GVKVD
Sbjct:   422 DTKIVPCKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVD 481

Query:   428 VQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIY-SSKQTAVI 486
             V + +E +   +GGRV L + Y++ L  SI +NF  NG I+ M H  D  +  +KQ ++ 
Sbjct:   482 VIHSLEYVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMG 541

Query:   487 RASDDYYPRDPASHT--------IHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAA 538
             R  DD++ +DP            +H+   +YN+L++G+ +QPDWDMF S H  A++H  +
Sbjct:   542 RVGDDFWFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGS 601

Query:   539 RAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKV 598
             RA+ G  IYVSD  G+H+FDL++KLV PDG++ +      PTRDCLF +P  D T++LK+
Sbjct:   602 RAICGGPIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKI 661

Query:   599 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVE--NMAQIAGAGWN 656
             WN NK  GV+G FNCQGAGW  I +K R   E    +  +V VT+VE     + +  G  
Sbjct:   662 WNFNKYGGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKA 721

Query:   657 GDAIVYAHRSGEVVRLP-KGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNS 715
              + +VY +++ E+  +  K   +  T++   +EL+ F P+ ++   I FA IGL +MFNS
Sbjct:   722 EEYVVYLNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNS 781

Query:   716 GGAVENVE 723
             GG V ++E
Sbjct:   782 GGTVIDLE 789

 Score = 406 (148.0 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
 Identities = 85/238 (35%), Positives = 126/238 (52%)

Query:    83 GAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVES 142
             G F G +        +  +G      F+  FRFK WW TQ +G  G D+ +ETQ++L+E 
Sbjct:    72 GGFFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131

Query:   143 KDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYT 202
              +            Y V +P++E  FRSAL    N+ ++I  ESG   V+ +    + Y 
Sbjct:   132 PETKS---------YVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIAYV 182

Query:   203 HAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGL 262
             H   NP++++ +A  A+  ++ +F   E+K +P+ +D FGWCTWDAFY  V   G+  GL
Sbjct:   183 HFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGL 242

Query:   263 KSLSAGGTPPKFLIIDDGWQQIE---NKPKEESNCIVQEGAQFASRLTGIKENSKFQK 317
                S GG  P+F+IIDDGWQ I      P E++  +V  G Q + RL    E  KF+K
Sbjct:   243 DDFSKGGVEPRFVIIDDGWQSISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300

 Score = 66 (28.3 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
 Identities = 16/47 (34%), Positives = 25/47 (53%)

Query:   758 LKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 804
             +KV+G G F  YSS+ P K  +   + DF +    G + + +P  EE
Sbjct:   797 IKVKGGGSFLAYSSESPKKFQLNGCEVDFEW-LGDGKLCVNVPWIEE 842


>ASPGD|ASPL0000010056
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 340 (124.7 bits), Expect = 1.0e-36, Sum P(3) = 1.0e-36
 Identities = 94/305 (30%), Positives = 152/305 (49%)

Query:   326 SGLKHVVDESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQ 384
             +GL   V   ++ H N++Y+ VWHAL GYWGG+ P       Y T               
Sbjct:   383 NGLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTR-------------- 428

Query:   385 PDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVS 444
              ++ ++S     +  + P  +  FYN+ +A+L+  G+ GVK D Q+ ++ L A    R S
Sbjct:   429 -EVALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRS 486

Query:   445 LTRSYHQALEASIARNFPDNG--CISCMCHNT--DGIYSSKQTAVIRASDDYYPRDPASH 500
                +Y  A   S  R+F      C+S +        + ++K T V+R S+D++P    SH
Sbjct:   487 YANAYQDAWTISSLRHFGPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546

Query:   501 TIHISSVAYNTLFLGEFMQ--PDWDMFHSLHPA----AEYHGAARAVGGCAIYVSDKPGN 554
             T H+   A+N L L  ++   PDWDMF +L       A +H AAR + G  IY++DKPG 
Sbjct:   547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605

Query:   555 HNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSL-LKVWN--VNKCSGV 607
             H+  L++++      G+   LR  +  R T D ++ D  ++G  L +  ++      SG+
Sbjct:   606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662

Query:   608 VGVFN 612
             +GVFN
Sbjct:   663 IGVFN 667

 Score = 123 (48.4 bits), Expect = 1.0e-36, Sum P(3) = 1.0e-36
 Identities = 57/221 (25%), Positives = 93/221 (42%)

Query:    74 PGNGVGLVAGAFIGATASHSKSLHVFPMGVLEDL-RFMCCFRFKLWWMTQRMGTCGKDVP 132
             PG  +  ++G    A   HS  L + P+G    + RF    R +  W+  R G   KD  
Sbjct:   158 PGAALWNISGPVEEARDGHSGLLRL-PLGTPSSMSRFFALARVETSWLGPRQG---KDKL 213

Query:   133 LETQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVE 192
               T+  ++ S   +     DG  ++ V L +      + L      E+ I  ++ DNA  
Sbjct:   214 NFTEDAILLSFLRT-----DG--VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQN-DNATP 265

Query:   193 TNQGLYLVYTHAGPNPFEVISQAV-----KAVEKYMQTFTHREKKK-LPSFLDWFGWCTW 246
             +   + L  T A    FEV + A+     + V  Y  T     + + L  + D   +CTW
Sbjct:   266 SRFQV-LAATAAD---FEVATSALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTW 321

Query:   247 DAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENK 287
             +    D++ E +   L  L   G   + LIIDD WQ ++N+
Sbjct:   322 NGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNE 362

 Score = 61 (26.5 bits), Expect = 1.0e-36, Sum P(3) = 1.0e-36
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query:   660 IVYAHRSGEVV-RLPKGASVPVTLKVLEYELFHFCPLKEIS 699
             IV AHR+G +V  L   ++V VTL    +E+    P+K ++
Sbjct:   695 IVRAHRTGRIVGELHSSSAVSVTLNERRWEVLTAYPVKTLT 735

 Score = 41 (19.5 bits), Expect = 1.2e-34, Sum P(3) = 1.2e-34
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query:   708 GLLDMFNSGGAVENVEVHMSEKKPDLFDGE 737
             G++ +FN    VE+V + +++  P ++D +
Sbjct:   661 GIIGVFNVSNRVESVIIPVADF-PGIYDDQ 689


>UNIPROTKB|G4NBB7 [details] [associations]
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 337 (123.7 bits), Expect = 3.0e-31, Sum P(3) = 3.0e-31
 Identities = 103/331 (31%), Positives = 156/331 (47%)

Query:   314 KFQKKCQNSEQVSGLKHVVDE-SKQNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALA 372
             +F+   Q   Q  GLK +V E  KQN  ++ + VWH + GYWGG+ P+      Y     
Sbjct:   393 RFEANQQGFPQ--GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKI 450

Query:   373 YPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNII 432
                    V   QP    D   V G      + V   Y++ +A+LA CGV   KVD Q  +
Sbjct:   451 QLRDEAEV---QPKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFL 500

Query:   433 ETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS--KQ------TA 484
             +   A    R +L R Y  A  A+ +++F     I+CM      I  S  +Q        
Sbjct:   501 D-YPAHANDRKNLIRPYQDAWTAAASKHFGGRA-IACMAQTPQSILHSLLQQGRSEGPML 558

Query:   485 VIRASDDYYPRDPASHTIHISSVAYNTLFLGEF-MQPDWDMFHSLHPA-AEYHGAARAVG 542
             + R SDD++P +  SHT H+   A+N L +    +  DWDMF +  P  A  H  AR++ 
Sbjct:   559 MARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMS 618

Query:   543 GCAIYVSDKPGNHNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSLLKV 598
             G  IY++D PG H+ +L++++     DG    LRA  PGR     L+         LL+V
Sbjct:   619 GGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRV 674

Query:   599 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 629
              + ++  G++GVFN    G   + ++ R+ D
Sbjct:   675 RSGHQGVGMLGVFNVCNRG-SLLGEQVRLDD 704

 Score = 90 (36.7 bits), Expect = 3.0e-31, Sum P(3) = 3.0e-31
 Identities = 18/62 (29%), Positives = 32/62 (51%)

Query:   231 KKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKE 290
             + ++  + D F +CTW++   D++ + +   L  LS  G     LIIDD WQ ++    +
Sbjct:   326 RAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSD 385

Query:   291 ES 292
              S
Sbjct:   386 AS 387

 Score = 46 (21.3 bits), Expect = 3.0e-31, Sum P(3) = 3.0e-31
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query:   707 IGLLDMFN--SGGAVENVEVHMSEKKPDLFDGEVSSE 741
             +G+L +FN  + G++   +V +     D+FDGE + E
Sbjct:   681 VGMLGVFNVCNRGSLLGEQVRLD----DIFDGEKAGE 713


>UNIPROTKB|Q97U94 [details] [associations]
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 238 (88.8 bits), Expect = 4.6e-31, Sum P(3) = 4.6e-31
 Identities = 67/192 (34%), Positives = 96/192 (50%)

Query:   422 DGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSSK 481
             D VKVD Q +I  +       ++ +R+   AL+ S+ ++      I+CM  N +   +  
Sbjct:   362 DLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGKDV-----INCMSMNPENYCNYF 415

Query:   482 QTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAV 541
              + V+R S DY P       +HI   AYN+L     + PD+DMF S  P A+ H  AR  
Sbjct:   416 YSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVF 475

Query:   542 GGCAIYVSDK-PGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN 600
              G  IY++D+ P   N +LLR  VLP+G V+R   P   T D LF DP R+   LLK+  
Sbjct:   476 SGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRERV-LLKLKG 534

Query:   601 VNKCSGVVGVFN 612
               K    +  FN
Sbjct:   535 KVKGYNAIAFFN 546

 Score = 156 (60.0 bits), Expect = 4.6e-31, Sum P(3) = 4.6e-31
 Identities = 42/139 (30%), Positives = 64/139 (46%)

Query:   157 YTVFLPLLEGQFRSALQGNENNEI-------EICLESGDNAVETNQGLYLVYTHAGPNPF 209
             YTVF  +  G    A     NN +        + L +G N  E  +  Y +      NP+
Sbjct:   133 YTVFALVKSGNSYEAFFTLSNNYVTAYLFGDSVRLYTGFNTDEIKRS-YFLSIGTSDNPY 191

Query:   210 EVISQAVKAVEKYMQTFTHREKKKLPS-FLDWFGWCTWDAFYT-DVTAEGVDEGLKSLSA 267
             + I  A+    K   TF  R++K  P   ++  GWC+W+AF T D+  E + + +K +  
Sbjct:   192 KAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIE 251

Query:   268 GGTPPKFLIIDDGWQQIEN 286
              G    ++IIDDGWQ   N
Sbjct:   252 RGLRLNWVIIDDGWQDQNN 270

 Score = 77 (32.2 bits), Expect = 4.6e-31, Sum P(3) = 4.6e-31
 Identities = 21/63 (33%), Positives = 32/63 (50%)

Query:   295 IVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYW 354
             I+ +G Q  +    I+  +   KK  N     G K+ V   K +  VKYV +WHA+  +W
Sbjct:   260 IIDDGWQDQNNDRAIRSLNPDNKKFPN-----GFKNTVRAIK-SLGVKYVGLWHAINAHW 313

Query:   355 GGV 357
             GG+
Sbjct:   314 GGM 316


>UNIPROTKB|Q8A170 [details] [associations]
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 196 (74.1 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
 Identities = 54/191 (28%), Positives = 84/191 (43%)

Query:   403 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 462
             +K+  +Y      +   G D +K+D Q+    L  G    +   +  + ALE    R   
Sbjct:   348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHRM-- 405

Query:   463 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 522
               G ++CM  N   I  +  ++V RAS DY   D      H+     NTL LG+ + PD 
Sbjct:   406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465

Query:   523 DMFHSLHPAA-EYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 581
             DMFHS           ++A+ G  +Y+SD P     D +R L+   G + R   P  PT 
Sbjct:   466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525

Query:   582 DCLFADPARDG 592
             + +  +P + G
Sbjct:   526 ESILTNPLQSG 536

 Score = 130 (50.8 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
 Identities = 34/151 (22%), Positives = 79/151 (52%)

Query:   170 SALQGNENNEIEICLES-GDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV--EKYMQTF 226
             S  Q N++  + + + + G++A+ T +   L++  +  + + V S A  ++  +K +   
Sbjct:   158 SWFQVNQDGTLTLYVSTLGEDAL-TGRLPLLIFRKSS-SVYHVFSDAYDSLIADKAVSAL 215

Query:   227 THREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIEN 286
               R  K+  +  D+ GWCTW+ ++ D+    +   + ++ A G P ++++IDDG   I N
Sbjct:   216 RKRADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG--HIAN 273

Query:   287 KPKEESNCIVQEGAQFASRLTGIKENSKFQK 317
             K ++ ++ +V +  +F +  + I +  +  K
Sbjct:   274 KNRQLTS-LVPDKKRFPNGWSRIMKRKQADK 303

 Score = 66 (28.3 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query:   336 KQNHNVKYVYVWHALAGYWGGVKPAAD 362
             KQ   ++++ +W++L+GYW G+    D
Sbjct:   299 KQADKIRWIGLWYSLSGYWMGISAEND 325

 Score = 50 (22.7 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
 Identities = 19/80 (23%), Positives = 37/80 (46%)

Query:   689 LFHFCPLKEISSNISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSD 748
             LFH CP+++      +A IG+ + + S   V+ ++   +EK   + D   +  L    +D
Sbjct:   621 LFHLCPIRK-----GWAVIGIQEKYLSPATVQILK-RTTEKL--ILDVHCTGTLRI-WAD 671

Query:   749 NRSPTATISLKVRGCGRFGI 768
             +       S+ ++  GR  I
Sbjct:   672 SHGKQELRSIPIKKAGRIEI 691


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.135   0.419    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      815       815   0.00099  121 3  11 22  0.40    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  625 (66 KB)
  Total size of DFA:  445 KB (2211 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  67.81u 0.08s 67.89t   Elapsed:  00:00:03
  Total cpu time:  67.81u 0.08s 67.89t   Elapsed:  00:00:03
  Start:  Tue May 21 14:41:12 2013   End:  Tue May 21 14:41:15 2013
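
Note on the bit scores: the "Score = S (B bits)" lines above appear to follow the standard Karlin-Altschul conversion B = (Lambda*S - ln K) / ln 2. The sketch below is not part of the BLAST output. It assumes that the gapped row of the Parameters table (Q=9,R=2; Lambda = 0.244, K = 0.0300) is the one used for the reported bits, and the function name bit_score is hypothetical. Under that assumption it reproduces the reported values, for example 340 -> 124.7 bits and 337 -> 123.7 bits.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of the
# Parameters table. Assumption: this row is the one behind the reported bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the first HSP of each hit shown in this section.
    for raw in (340, 337, 238, 196):
        print(raw, round(bit_score(raw), 1))
```

The ungapped BLOSUM62 row (Lambda = 0.319, K = 0.135) does not reproduce the reported bits, which is why the gapped row is assumed here.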
