BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>007685
MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ
QIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVK
YVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHP
KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP
DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW
DMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRD
CLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVT
DVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSNI
SFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVRG
CGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV

High Scoring Gene Products

Symbol, full name Information P value
SIP2
AT3G57520
protein from Arabidopsis thaliana 4.9e-275
SIP1
AT1G55740
protein from Arabidopsis thaliana 1.9e-201
SIP1
AT5G40390
protein from Arabidopsis thaliana 1.9e-106
RFS
Galactinol--sucrose galactosyltransferase
protein from Oryza sativa Japonica Group 3.5e-105
STS
AT4G01970
protein from Arabidopsis thaliana 8.0e-102
STS1
Stachyose synthase
protein from Pisum sativum 8.3e-83
MGG_11554
Seed imbibition protein
protein from Magnaporthe oryzae 70-15 1.7e-32
galS
Alpha-galactosidase
protein from Sulfolobus solfataricus P2 5.6e-24
BT_3797
Possible alpha-galactosidase
protein from Bacteroides thetaiotaomicron VPI-5482 1.2e-22

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  007685
        (593 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  2644  4.9e-275  1
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  1827  1.9e-201  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  1005  1.9e-106  2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...   989  3.5e-105  2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   734  8.0e-102  3
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   783  8.3e-83   2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   340  2.3e-35   3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   337  1.7e-32   3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   238  5.6e-24   2
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  1.2e-22   4


>TAIR|locus:2103488 [details] [associations]
            symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
            "raffinose catabolic process" evidence=IDA] [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            [GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
            [GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
            GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
            EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
            GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
            IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
            RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
            ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
            EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
            TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
            ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
            BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
        Length = 773

 Score = 2644 (935.8 bits), Expect = 4.9e-275, P = 4.9e-275
 Identities = 482/594 (81%), Positives = 536/594 (90%)

Query:     1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
             MQTF HREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLS GGTPPKFLIIDDGWQ
Sbjct:   182 MQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGWQ 241

Query:    61 QIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVK 120
             QIENK K+E NC+VQEGAQFA+RL GIKEN+KFQK  Q   QVSGLK VVD +KQ HNVK
Sbjct:   242 QIENKEKDE-NCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAKQRHNVK 300

Query:   121 YVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHP 180
              VY WHALAGYWGGVKPAA GMEHYD+ALAYPV SPGV+GNQPDIVMDSLAVHGLGLV+P
Sbjct:   301 QVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVNP 360

Query:   181 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 240
             KKVFNFYNELH+YLASCG+DGVKVDVQNIIETLGAG GGRVSLTRSY QALEASIARNF 
Sbjct:   361 KKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNFT 420

Query:   241 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 300
             DNGCISCMCHNTDG+YS+KQTA++RASDD+YPRDPASHTIHI+SVAYN+LFLGEFMQPDW
Sbjct:   421 DNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPDW 480

Query:   301 DMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRD 360
             DMFHSLHP AEYH AARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA+LPGRPTRD
Sbjct:   481 DMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTRD 540

Query:   361 CLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVT 420
             CLFADPARDG SLLK+WN+NK +G+VGVFNCQGAGWCK TKK +IHD SPGTLT S+R  
Sbjct:   541 CLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRAD 600

Query:   421 DVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSNI 480
             D + ++Q+AG  W+GD+IVYA+RSGEVVRLPKGAS+P+TLKVLEYELFH  PLKEI+ NI
Sbjct:   601 DADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITENI 660

Query:   481 SFAAIGLLDMFNSGGAVENVEV-HMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVR 539
             SFA IGL+DMFNS GA+E++++ H+++K P+ FDGE+SS  + +LSDNRSPTA +S+ VR
Sbjct:   661 SFAPIGLVDMFNSSGAIESIDINHVTDKNPEFFDGEISSA-SPALSDNRSPTALVSVSVR 719

Query:   540 GCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV 593
             GCGRFG YSSQRPLKC V S +TDFTYD+  GL+T+ LPV  EEM+RW VEI V
Sbjct:   720 GCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHVEILV 773


>TAIR|locus:2020452 [details] [associations]
            symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
            IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
            UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
            PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
            KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
            InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
            ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
            Uniprot:Q84VX0
        Length = 754

 Score = 1827 (648.2 bits), Expect = 1.9e-201, Sum P(2) = 1.9e-201
 Identities = 330/502 (65%), Positives = 403/502 (80%)

Query:     1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
             +QTF+HRE+KK+P  L+WFGWCTWDAFYT+VTA+ V +GL+SL AGG  PKF+IIDDGWQ
Sbjct:   182 LQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGWQ 241

Query:    61 QIE-NKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS----GLKHVVDESKQ 115
              +  ++   E N      A FA+RLT IKEN KFQK  +   +V      L HV+ + K 
Sbjct:   242 SVGMDETSVEFNA--DNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIKS 299

Query:   116 NHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGL 175
             N+++KYVYVWHA+ GYWGGVKP   GMEHY++ +AYPV+SPGVM ++    ++S+  +GL
Sbjct:   300 NNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNGL 359

Query:   176 GLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASI 235
             GLV+P+KVF+FYN+LH+YLAS GVDGVKVDVQNI+ETLGAGHGGRV L + YHQALEASI
Sbjct:   360 GLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEASI 419

Query:   236 ARNFPDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEF 295
             +RNFPDNG ISCM HNTDG+YS+K+TAVIRASDD++PRDPASHTIHI+SVAYNTLFLGEF
Sbjct:   420 SRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGEF 479

Query:   296 MQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPG 355
             MQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVL DGS+LRA+LPG
Sbjct:   480 MQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLPG 539

Query:   356 RPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTA 415
             RPT DC F+DP RD  SLLK+WN+N+ +GV+GVFNCQGAGWCK  K+  IHD+ PGT++ 
Sbjct:   540 RPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTISG 599

Query:   416 SVRVTDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKE 475
              VR  DV  + ++A   W GD+IVY+H  GE+V LPK  S+PVTL   EYE+F   P+KE
Sbjct:   600 CVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVKE 659

Query:   476 ISSNISFAAIGLLDMFNSGGAV 497
              S    FA +GL++MFNSGGA+
Sbjct:   660 FSDGSKFAPVGLMEMFNSGGAI 681

 Score = 145 (56.1 bits), Expect = 1.9e-201, Sum P(2) = 1.9e-201
 Identities = 29/68 (42%), Positives = 42/68 (61%)

Query:   526 DNRSPTATISLKVRGCGRFGIYSS-QRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEM 584
             D+      + +K+RG G  G+YSS +RP   TV S   ++ Y+  +GL+T TL VPE+E+
Sbjct:   687 DDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKEL 746

Query:   585 YRWPVEIQ 592
             Y W V IQ
Sbjct:   747 YLWDVVIQ 754


>TAIR|locus:2170528 [details] [associations]
            symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0005986 "sucrose biosynthetic process" evidence=IMP]
            [GO:0010325 "raffinose family oligosaccharide biosynthetic process"
            evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
            evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
            activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0006979 "response to oxidative stress" evidence=IEP]
            [GO:0009414 "response to water deprivation" evidence=IEP]
            [GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
            InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
            CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
            EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
            EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
            UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
            PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
            KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
            InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
            Uniprot:Q9FND9
        Length = 783

 Score = 1005 (358.8 bits), Expect = 1.9e-106, Sum P(2) = 1.9e-106
 Identities = 208/518 (40%), Positives = 302/518 (58%)

Query:     1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
             M TF   E+K  P  +D FGWCTWDAFY  V  +GV +G+K L  GG PP  ++IDDGWQ
Sbjct:   206 MNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDDGWQ 265

Query:    61 QIENKPKE---ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS-GLKHVVDESKQN 116
              I +       E   I   G Q   RL   +EN KF+      +Q   G+K  V + K  
Sbjct:   266 SIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAFVRDLKDE 325

Query:   117 HN-VKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGL 175
              + V Y+YVWHAL GYWGG++P A  +    + +  P  SPG+     D+ +D +   G+
Sbjct:   326 FSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIETGI 383

Query:   176 GLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASI 235
             G   P     FY  LH++L + G+DGVKVDV +I+E L   +GGRV L ++Y +AL +S+
Sbjct:   384 GFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALTSSV 443

Query:   236 ARNFPDNGCISCMCHNTDGIYSSKQTAVI-RASDDYYPRDPASHT--------IHISSVA 286
              ++F  NG I+ M H  D ++   +   + R  DD++  DP+            H+   A
Sbjct:   444 NKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMVHCA 503

Query:   287 YNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDG 346
             YN+L++G F+QPDWDMF S HP AE+H A+RA+ G  IY+SD  G H+FDLL++LVLP+G
Sbjct:   504 YNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVLPNG 563

Query:   347 SVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIH 406
             S+LR +    PTRD LF DP  DG ++LK+WN+NK +GV+G FNCQG GWC+ T++ +  
Sbjct:   564 SILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRNQCF 623

Query:   407 DESPGTLTASVRVTDVE---NMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVL 463
              E   TLTA+    DVE     + I+ A     A+ +  +S +++       + +TL+  
Sbjct:   624 SECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL-FLSQSKKLLLSGLNDDLELTLEPF 682

Query:   464 EYELFHFCPLKEISSN-ISFAAIGLLDMFNSGGAVENV 500
             ++EL    P+  I  N + FA IGL++M N+ GA+ ++
Sbjct:   683 KFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL 720

 Score = 68 (29.0 bits), Expect = 1.9e-106, Sum P(2) = 1.9e-106
 Identities = 11/49 (22%), Positives = 27/49 (55%)

Query:   533 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPE 581
             ++ + V G G F +Y+S++P+ C +     +F Y+ +  ++ +    P+
Sbjct:   726 SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSGPD 774


>UNIPROTKB|Q5VQG4 [details] [associations]
            symbol:RFS "Galactinol--sucrose galactosyltransferase"
            species:39947 "Oryza sativa Japonica Group" [GO:0047274
            "galactinol-sucrose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
            SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
            EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
            eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
            UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
            KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
            Uniprot:Q5VQG4
        Length = 783

 Score = 989 (353.2 bits), Expect = 3.5e-105, Sum P(2) = 3.5e-105
 Identities = 220/537 (40%), Positives = 305/537 (56%)

Query:     3 TFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQI 62
             TF   E+K  P  +D FGWCTWDAFY  V  EGV EG++ L+ GG PP  ++IDDGWQ I
Sbjct:   211 TFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLIDDGWQSI 270

Query:    63 ENKPKE-----ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNH 117
              +   +     E       G Q   RL   +EN KF+      E   G+   V E K   
Sbjct:   271 CHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFR------EYKGGMGGFVREMKAAF 324

Query:   118 -NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLG 176
               V+ VYVWHAL GYWGG++P A G+      +  P  SPG+     D+ +D +  +G+G
Sbjct:   325 PTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDKIVNNGVG 382

Query:   177 LVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIA 236
             LV P++    Y  LH++L + G+DGVKVDV +++E +   +GGRV L ++Y   L  S+ 
Sbjct:   383 LVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFAGLTESVR 442

Query:   237 RNFPDNGCISCMCHNTDG-IYSSKQTAVIRASDDYYPRDPASHT--------IHISSVAY 287
             R+F  NG I+ M H  D  +  ++  A+ R  DD++  DP+            H+   AY
Sbjct:   443 RHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGCHMVHCAY 502

Query:   288 NTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGS 347
             N+L++G F+ PDWDMF S HP A +H A+RAV G  +YVSD  G H+FDLLR+L LPDG+
Sbjct:   503 NSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRRLALPDGT 562

Query:   348 VLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 407
             +LR +    PTRDCLFADP  DG ++LK+WNVNK SGV+G FNCQG GW +  ++     
Sbjct:   563 ILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREARRNMCAA 622

Query:   408 ESPGTLTASVRVTDVENMAQIAGAGWNGDAI-VYAHRSGEVVRLPKGASVPVTLKVLEYE 466
                  +TA     DVE     +  G  GD   VY   + ++  L +  SV +TL+   YE
Sbjct:   623 GFSVPVTARASPADVE----WSHGGGGGDRFAVYFVEARKLQLLRRDESVELTLEPFTYE 678

Query:   467 LFHFCPLKEISS---NISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSEL 520
             L    P++ I S    I FA IGL +M N+GGAV+  E   + +K    DG+V++E+
Sbjct:   679 LLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFE---AARK----DGDVAAEV 728

 Score = 72 (30.4 bits), Expect = 3.5e-105, Sum P(2) = 3.5e-105
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query:   538 VRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLP 578
             V+G G    YSS RP  C V     +F Y+   G++T+ +P
Sbjct:   730 VKGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTVDVP 768


>TAIR|locus:2141425 [details] [associations]
            symbol:STS "AT4G01970" species:3702 "Arabidopsis
            thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
            compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
            [GO:0047268 "galactinol-raffinose galactosyltransferase activity"
            evidence=ISS] [GO:0006979 "response to oxidative stress"
            evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
            InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
            GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
            EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
            UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
            PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
            KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
            InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
            Uniprot:Q9SYJ4
        Length = 876

 Score = 734 (263.4 bits), Expect = 8.0e-102, Sum P(3) = 8.0e-102
 Identities = 152/398 (38%), Positives = 237/398 (59%)

Query:   122 VYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPK 181
             +YVWHAL G W GV+P  + M      +A    SP +     D+ +D +   G+GLVHP 
Sbjct:   416 IYVWHALCGAWNGVRP--ETMMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473

Query:   182 KVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPD 241
             K   FY+ +H+YLAS GV G K+DV   +E+L   HGGRV L ++Y+  L  S+ +NF  
Sbjct:   474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533

Query:   242 NGCISCMCHNTDGIY-SSKQTAVIRASDDYYPRDPASHT--------IHISSVAYNTLFL 292
                I+ M    +  + ++KQ ++ R  DD++ +DP            +H+   +YN++++
Sbjct:   534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593

Query:   293 GEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPG--NHNFDLLRKLVLPDGSVLR 350
             G+ +QPDWDMF S H  AEYH A+RA+ G  +Y+SD  G  +HNFDL++KL   DG++ R
Sbjct:   594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653

Query:   351 AQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESP 410
                   PTRD LF +P  D  S+LK++N NK  GV+G FNCQGAGW     + + + E  
Sbjct:   654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713

Query:   411 GTLTASVRVTDVE--NMAQIAGAG--WNGDAIVYAHRSGEVVRL-PKGASVPVTLKVLEY 465
              T++ +V V+D+E     + AG+   + GD +VY  +S E++ +  K  ++ +TL+   +
Sbjct:   714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773

Query:   466 ELFHFCPLKE-ISSNISFAAIGLLDMFNSGGAVENVEV 502
             +L  F P+ E +SS + FA +GL++MFN  G V++++V
Sbjct:   774 DLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQDMKV 811

 Score = 217 (81.4 bits), Expect = 8.0e-102, Sum P(3) = 8.0e-102
 Identities = 49/112 (43%), Positives = 59/112 (52%)

Query:     1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
             M TF   E+KKLP  +D FGWCTWDA Y  V    +  G+K    GG  PKF+IIDDGWQ
Sbjct:   217 MNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFEDGGVCPKFVIIDDGWQ 276

Query:    61 QI----ENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKH 108
              I    +   K+  N +V  G Q  +RLT  KE  KF+     S   S   H
Sbjct:   277 SINFDGDELDKDAEN-LVLGGEQMTARLTSFKECKKFRNYKGGSFITSDASH 327

 Score = 92 (37.4 bits), Expect = 8.0e-102, Sum P(3) = 8.0e-102
 Identities = 18/50 (36%), Positives = 30/50 (60%)

Query:   533 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 582
             +I + V+G GRF  YSS  P+KC +   + +F ++  TG ++  +P  EE
Sbjct:   816 SIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEE 865


>UNIPROTKB|Q93XK2 [details] [associations]
            symbol:STS1 "Stachyose synthase" species:3888 "Pisum
            sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
            "oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
            "galactinol-raffinose galactosyltransferase activity" evidence=IDA]
            InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
            Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
            InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
            EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
            BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
            Uniprot:Q93XK2
        Length = 853

 Score = 783 (280.7 bits), Expect = 8.3e-83, Sum P(2) = 8.3e-83
 Identities = 180/488 (36%), Positives = 273/488 (55%)

Query:    28 YTDVTAEGVD-EGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGAQFASRLTG 86
             +TD+  +G++ E L+         K         +IE+K K+    +V+E       L G
Sbjct:   319 FTDLILKGIEHEKLRKKREEAISSK----SSDLAEIESKIKK----VVKE----IDDLFG 366

Query:    87 IKENSKFQKKCQNSEQVSGLKHVVDESKQNHN-VKYVYVWHALAGYWGGVKPAADGMEHY 145
              ++ S  +K    SE   GLK    + +     +  VYVWHAL G WGGV+P      H 
Sbjct:   367 GEQFSSGEKSEMKSEY--GLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HL 421

Query:   146 DTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVD 205
             DT +     SPG+ G   D+ +  ++   LGLVHP +    Y+ +H+YLA  G+ GVKVD
Sbjct:   422 DTKIVPCKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVD 481

Query:   206 VQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIY-SSKQTAVI 264
             V + +E +   +GGRV L + Y++ L  SI +NF  NG I+ M H  D  +  +KQ ++ 
Sbjct:   482 VIHSLEYVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMG 541

Query:   265 RASDDYYPRDPASHT--------IHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAA 316
             R  DD++ +DP            +H+   +YN+L++G+ +QPDWDMF S H  A++H  +
Sbjct:   542 RVGDDFWFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGS 601

Query:   317 RAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKV 376
             RA+ G  IYVSD  G+H+FDL++KLV PDG++ +      PTRDCLF +P  D T++LK+
Sbjct:   602 RAICGGPIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKI 661

Query:   377 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVE--NMAQIAGAGWN 434
             WN NK  GV+G FNCQGAGW  I +K R   E    +  +V VT+VE     + +  G  
Sbjct:   662 WNFNKYGGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKA 721

Query:   435 GDAIVYAHRSGEVVRLP-KGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNS 493
              + +VY +++ E+  +  K   +  T++   +EL+ F P+ ++   I FA IGL +MFNS
Sbjct:   722 EEYVVYLNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNS 781

Query:   494 GGAVENVE 501
             GG V ++E
Sbjct:   782 GGTVIDLE 789

 Score = 217 (81.4 bits), Expect = 5.0e-16, Sum P(2) = 5.0e-16
 Identities = 43/98 (43%), Positives = 57/98 (58%)

Query:     1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
             + +F   E+K +P+ +D FGWCTWDAFY  V   G+  GL   S GG  P+F+IIDDGWQ
Sbjct:   203 LNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGLDDFSKGGVEPRFVIIDDGWQ 262

Query:    61 QIE---NKPKEESNCIVQEGAQFASRLTGIKENSKFQK 95
              I      P E++  +V  G Q + RL    E  KF+K
Sbjct:   263 SISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300

 Score = 66 (28.3 bits), Expect = 8.3e-83, Sum P(2) = 8.3e-83
 Identities = 16/47 (34%), Positives = 25/47 (53%)

Query:   536 LKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 582
             +KV+G G F  YSS+ P K  +   + DF +    G + + +P  EE
Sbjct:   797 IKVKGGGSFLAYSSESPKKFQLNGCEVDFEW-LGDGKLCVNVPWIEE 842


>ASPGD|ASPL0000010056 [details] [associations]
            symbol:aglF species:162425 "Emericella nidulans"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
            CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
            EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
            GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
            OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
        Length = 863

 Score = 340 (124.7 bits), Expect = 2.3e-35, Sum P(3) = 2.3e-35
 Identities = 94/305 (30%), Positives = 152/305 (49%)

Query:   104 SGLKHVVDESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQ 162
             +GL   V   ++ H N++Y+ VWHAL GYWGG+ P       Y T               
Sbjct:   383 NGLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTR-------------- 428

Query:   163 PDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVS 222
              ++ ++S     +  + P  +  FYN+ +A+L+  G+ GVK D Q+ ++ L A    R S
Sbjct:   429 -EVALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRS 486

Query:   223 LTRSYHQALEASIARNFPDNG--CISCMCHNT--DGIYSSKQTAVIRASDDYYPRDPASH 278
                +Y  A   S  R+F      C+S +        + ++K T V+R S+D++P    SH
Sbjct:   487 YANAYQDAWTISSLRHFGPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546

Query:   279 TIHISSVAYNTLFLGEFMQ--PDWDMFHSLHPA----AEYHGAARAVGGCAIYVSDKPGN 332
             T H+   A+N L L  ++   PDWDMF +L       A +H AAR + G  IY++DKPG 
Sbjct:   547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605

Query:   333 HNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSL-LKVWN--VNKCSGV 385
             H+  L++++      G+   LR  +  R T D ++ D  ++G  L +  ++      SG+
Sbjct:   606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662

Query:   386 VGVFN 390
             +GVFN
Sbjct:   663 IGVFN 667

 Score = 98 (39.6 bits), Expect = 2.3e-35, Sum P(3) = 2.3e-35
 Identities = 18/54 (33%), Positives = 28/54 (51%)

Query:    12 LPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENK 65
             L  + D   +CTW+    D++ E +   L  L   G   + LIIDD WQ ++N+
Sbjct:   309 LSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNE 362

 Score = 61 (26.5 bits), Expect = 2.3e-35, Sum P(3) = 2.3e-35
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query:   438 IVYAHRSGEVV-RLPKGASVPVTLKVLEYELFHFCPLKEIS 477
             IV AHR+G +V  L   ++V VTL    +E+    P+K ++
Sbjct:   695 IVRAHRTGRIVGELHSSSAVSVTLNERRWEVLTAYPVKTLT 735

 Score = 41 (19.5 bits), Expect = 2.7e-33, Sum P(3) = 2.7e-33
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query:   486 GLLDMFNSGGAVENVEVHMSEKKPDLFDGE 515
             G++ +FN    VE+V + +++  P ++D +
Sbjct:   661 GIIGVFNVSNRVESVIIPVADF-PGIYDDQ 689


>UNIPROTKB|G4NBB7 [details] [associations]
            symbol:MGG_11554 "Seed imbibition protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0052051 "interaction with host via protein
            secreted by type II secretion system" evidence=IDA]
            InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
            Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
            EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
            Uniprot:G4NBB7
        Length = 908

 Score = 337 (123.7 bits), Expect = 1.7e-32, Sum P(3) = 1.7e-32
 Identities = 103/331 (31%), Positives = 156/331 (47%)

Query:    92 KFQKKCQNSEQVSGLKHVVDE-SKQNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALA 150
             +F+   Q   Q  GLK +V E  KQN  ++ + VWH + GYWGG+ P+      Y     
Sbjct:   393 RFEANQQGFPQ--GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKI 450

Query:   151 YPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNII 210
                    V   QP    D   V G      + V   Y++ +A+LA CGV   KVD Q  +
Sbjct:   451 QLRDEAEV---QPKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFL 500

Query:   211 ETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS--KQ------TA 262
             +   A    R +L R Y  A  A+ +++F     I+CM      I  S  +Q        
Sbjct:   501 D-YPAHANDRKNLIRPYQDAWTAAASKHFGGRA-IACMAQTPQSILHSLLQQGRSEGPML 558

Query:   263 VIRASDDYYPRDPASHTIHISSVAYNTLFLGEF-MQPDWDMFHSLHPA-AEYHGAARAVG 320
             + R SDD++P +  SHT H+   A+N L +    +  DWDMF +  P  A  H  AR++ 
Sbjct:   559 MARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMS 618

Query:   321 GCAIYVSDKPGNHNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSLLKV 376
             G  IY++D PG H+ +L++++     DG    LRA  PGR     L+         LL+V
Sbjct:   619 GGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRV 674

Query:   377 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 407
              + ++  G++GVFN    G   + ++ R+ D
Sbjct:   675 RSGHQGVGMLGVFNVCNRG-SLLGEQVRLDD 704

 Score = 90 (36.7 bits), Expect = 1.7e-32, Sum P(3) = 1.7e-32
 Identities = 18/62 (29%), Positives = 32/62 (51%)

Query:     9 KKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKE 68
             + ++  + D F +CTW++   D++ + +   L  LS  G     LIIDD WQ ++    +
Sbjct:   326 RAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSD 385

Query:    69 ES 70
              S
Sbjct:   386 AS 387

 Score = 46 (21.3 bits), Expect = 1.7e-32, Sum P(3) = 1.7e-32
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query:   485 IGLLDMFN--SGGAVENVEVHMSEKKPDLFDGEVSSE 519
             +G+L +FN  + G++   +V +     D+FDGE + E
Sbjct:   681 VGMLGVFNVCNRGSLLGEQVRLD----DIFDGEKAGE 713


>UNIPROTKB|Q97U94 [details] [associations]
            symbol:galS "Alpha-galactosidase" species:273057
            "Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
            activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
            [GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
            "glycosylceramide catabolic process" evidence=ISS]
            InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
            GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
            EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
            ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
            KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
            ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
            InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
        Length = 648

 Score = 238 (88.8 bits), Expect = 5.6e-24, Sum P(2) = 5.6e-24
 Identities = 67/192 (34%), Positives = 96/192 (50%)

Query:   200 DGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSSK 259
             D VKVD Q +I  +       ++ +R+   AL+ S+ ++      I+CM  N +   +  
Sbjct:   362 DLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGKDV-----INCMSMNPENYCNYF 415

Query:   260 QTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAV 319
              + V+R S DY P       +HI   AYN+L     + PD+DMF S  P A+ H  AR  
Sbjct:   416 YSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVF 475

Query:   320 GGCAIYVSDK-PGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN 378
              G  IY++D+ P   N +LLR  VLP+G V+R   P   T D LF DP R+   LLK+  
Sbjct:   476 SGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRERV-LLKLKG 534

Query:   379 VNKCSGVVGVFN 390
               K    +  FN
Sbjct:   535 KVKGYNAIAFFN 546

 Score = 116 (45.9 bits), Expect = 5.6e-24, Sum P(2) = 5.6e-24
 Identities = 24/64 (37%), Positives = 37/64 (57%)

Query:     3 TFTHREKKKLPS-FLDWFGWCTWDAFYT-DVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
             TF  R++K  P   ++  GWC+W+AF T D+  E + + +K +   G    ++IIDDGWQ
Sbjct:   207 TFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQ 266

Query:    61 QIEN 64
                N
Sbjct:   267 DQNN 270

 Score = 77 (32.2 bits), Expect = 6.6e-20, Sum P(2) = 6.6e-20
 Identities = 21/63 (33%), Positives = 32/63 (50%)

Query:    73 IVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYW 132
             I+ +G Q  +    I+  +   KK  N     G K+ V   K +  VKYV +WHA+  +W
Sbjct:   260 IIDDGWQDQNNDRAIRSLNPDNKKFPN-----GFKNTVRAIK-SLGVKYVGLWHAINAHW 313

Query:   133 GGV 135
             GG+
Sbjct:   314 GGM 316


>UNIPROTKB|Q8A170 [details] [associations]
            symbol:BT_3797 "Possible alpha-galactosidase"
            species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
            "alpha-galactosidase activity" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
            process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
            evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
            evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
            InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
            GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
            EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
            ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
            PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
            ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
            Uniprot:Q8A170
        Length = 693

 Score = 196 (74.1 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
 Identities = 54/191 (28%), Positives = 84/191 (43%)

Query:   181 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 240
             +K+  +Y      +   G D +K+D Q+    L  G    +   +  + ALE    R   
Sbjct:   348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHRM-- 405

Query:   241 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 300
               G ++CM  N   I  +  ++V RAS DY   D      H+     NTL LG+ + PD 
Sbjct:   406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465

Query:   301 DMFHSLHPAA-EYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 359
             DMFHS           ++A+ G  +Y+SD P     D +R L+   G + R   P  PT 
Sbjct:   466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525

Query:   360 DCLFADPARDG 370
             + +  +P + G
Sbjct:   526 ESILTNPLQSG 536

 Score = 115 (45.5 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
 Identities = 23/89 (25%), Positives = 49/89 (55%)

Query:     7 REKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKP 66
             R  K+  +  D+ GWCTW+ ++ D+    +   + ++ A G P ++++IDDG   I NK 
Sbjct:   218 RADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG--HIANKN 275

Query:    67 KEESNCIVQEGAQFASRLTGIKENSKFQK 95
             ++ ++ +V +  +F +  + I +  +  K
Sbjct:   276 RQLTS-LVPDKKRFPNGWSRIMKRKQADK 303

 Score = 66 (28.3 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query:   114 KQNHNVKYVYVWHALAGYWGGVKPAAD 140
             KQ   ++++ +W++L+GYW G+    D
Sbjct:   299 KQADKIRWIGLWYSLSGYWMGISAEND 325

 Score = 50 (22.7 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
 Identities = 19/80 (23%), Positives = 37/80 (46%)

Query:   467 LFHFCPLKEISSNISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSD 526
             LFH CP+++      +A IG+ + + S   V+ ++   +EK   + D   +  L    +D
Sbjct:   621 LFHLCPIRK-----GWAVIGIQEKYLSPATVQILK-RTTEKL--ILDVHCTGTLRI-WAD 671

Query:   527 NRSPTATISLKVRGCGRFGI 546
             +       S+ ++  GR  I
Sbjct:   672 SHGKQELRSIPIKKAGRIEI 691


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.134   0.418    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      593       593   0.00084  120 3  11 22  0.41    34
                                                     36  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  623 (66 KB)
  Total size of DFA:  361 KB (2179 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  48.76u 0.17s 48.93t   Elapsed:  00:00:02
  Total cpu time:  48.76u 0.17s 48.93t   Elapsed:  00:00:02
  Start:  Fri May 10 09:00:05 2013   End:  Fri May 10 09:00:07 2013

Back to top