Your job contains 1 sequence.
>046494
MTVTAKATIIKDGCLMVRGNVVLTGVPQNVVVSPSSFIGATSAAPPSSRHVFTLGVLPDG
YRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTFYILLLPVL
DGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSIKILEKHKG
TFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQET
INEFCKDGEPLIEGTQFAIRLVDIKENCKFNSSGSDNSCNDLHEFIDEIKEKYGLKYVYM
WHALAGYWGGVLPSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVK
VDVQSLMETLGSGYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSMKSAV
ARASEDFMPGEPTFQTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCA
VYVSDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLS
GVIGVFNCQGAGSWPMKEDMHRKPASPLSISGHVCPLDIEFLERVAGENWNGDCAVYAFN
SGVLTKLPKKGNLEVSLATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFEYI
MDLSKYIIKIKGKGCGRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLPGECTLRDI
EFVY
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 046494
(724 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702... 1077 2.3e-204 3
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702... 1079 2.6e-184 2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702... 734 4.6e-135 2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact... 771 3.3e-130 2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ... 650 5.0e-119 4
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci... 677 4.5e-105 2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric... 414 1.4e-35 1
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec... 291 6.9e-34 2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot... 377 2.0e-31 1
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto... 219 9.6e-19 2
>TAIR|locus:2103488 [details] [associations]
symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
"raffinose catabolic process" evidence=IDA] [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
[GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
[GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
Length = 773
Score = 1077 (384.2 bits), Expect = 2.3e-204, Sum P(3) = 2.3e-204
Identities = 196/341 (57%), Positives = 264/341 (77%)
Query: 321 DIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLL 380
DI MDSL +G+G+++P+K+F+FYN+LHSYLA+ G+DGVKVDVQ+++ETLG+G GGRV L
Sbjct: 344 DIVMDSLAVHGLGLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSL 403
Query: 381 TRQYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIA 440
TR YQQALE S+A NF DN I CM HN+ LYS+ ++A+ RAS+DF P +P T+HIA
Sbjct: 404 TRSYQQALEASIARNFTDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIA 463
Query: 441 SVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYVSDKPGVHDFKILKRLVL 500
SVA+NSL LGE + PDWDMF S H TAE+HA ARA+GGCA+YVSDKPG H+F +L++LVL
Sbjct: 464 SVAYNSLFLGEFMQPDWDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVL 523
Query: 501 PDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFNCQGAGSW---PMK 557
PDGSVLRA+ GRPTRDCLF DP DG SLLKIWN+NK +G++GVFNCQGAG W K
Sbjct: 524 PDGSVLRAKLPGRPTRDCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAG-WCKETKK 582
Query: 558 EDMHRKPASPLSISGHVCPLDIEFLERVAGENWNGDCAVYAFNSGVLTKLPKKGNLEVSL 617
+H SP +++G + D + + +VAGE+W+GD VYA+ SG + +LPK ++ ++L
Sbjct: 583 NQIH--DTSPGTLTGSIRADDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTL 640
Query: 618 ATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFE 658
L+ E++ I P++ + +++ FAPIGL+DM+NS GA+ES +
Sbjct: 641 KVLEYELFHISPLKEITENISFAPIGLVDMFNSSGAIESID 681
Score = 826 (295.8 bits), Expect = 2.3e-204, Sum P(3) = 2.3e-204
Identities = 156/329 (47%), Positives = 218/329 (66%)
Query: 1 MTVTAKATIIKDGCLMVRGNVVLTGVPQNVVVSP--------SSFIGATSAAPPSSRHVF 52
MT+T+ ++ D L+V+G +LT +P N++++P SFIGAT S HVF
Sbjct: 1 MTITSNISVQNDN-LVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQS-KSLHVF 58
Query: 53 TLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTF 112
+GVL +G RF+C FRFK+WWM R+G ++P+ETQ +LLE++++ + D A T
Sbjct: 59 PIGVL-EGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAP--TV 115
Query: 113 YILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSI 172
Y + LP+L+GQFRA LQG N+++ C ESGD +V+TS+ V++++G NPFE+I+ S+
Sbjct: 116 YTVFLPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSV 175
Query: 173 KILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLV 232
K +E+H TF H E KK+P LDWFGWCTWDAFY V +G+ EGL S EGG P+FL+
Sbjct: 176 KAVERHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLI 235
Query: 233 IDDGWQETINEFCKDGEPLI-EGTQFAIRLVDIKENCKFNSSGS-DNSCNDLHEFIDEIK 290
IDDGWQ+ N+ KD ++ EG QFA RLV IKEN KF S D + L +D K
Sbjct: 236 IDDGWQQIENKE-KDENCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAK 294
Query: 291 EKYGLKYVYMWHALAGYWGGVLPSSDIMK 319
+++ +K VY WHALAGYWGGV P++ M+
Sbjct: 295 QRHNVKQVYAWHALAGYWGGVKPAASGME 323
Score = 113 (44.8 bits), Expect = 2.3e-204, Sum P(3) = 2.3e-204
Identities = 21/36 (58%), Positives = 28/36 (77%)
Query: 677 RFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP 712
RFGAYSS +P C V++ E +FTY+AE GL+T+ LP
Sbjct: 723 RFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLP 758
>TAIR|locus:2020452 [details] [associations]
symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
Uniprot:Q84VX0
Length = 754
Score = 1079 (384.9 bits), Expect = 2.6e-184, Sum P(2) = 2.6e-184
Identities = 205/390 (52%), Positives = 272/390 (69%)
Query: 324 MDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLLTRQ 383
++S+ K G+G+++P+K+F FYNDLHSYLA+ GVDGVKVDVQ+++ETLG+G+GGRV L ++
Sbjct: 351 LESITKNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKK 410
Query: 384 YQQALEQSVAWNFKDNNLICCMSHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIASVA 443
Y QALE S++ NF DN +I CMSHN+ LYS+ K+AV RAS+DF P +P T+HIASVA
Sbjct: 411 YHQALEASISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVA 470
Query: 444 FNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYVSDKPGVHDFKILKRLVLPDG 503
+N+L LGE + PDWDMF S H AE+HA ARA+GGCA+YVSDKPG HDF +L++LVL DG
Sbjct: 471 YNTLFLGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDG 530
Query: 504 SVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFNCQGAGSWPMKEDMHR- 562
S+LRA+ GRPT DC F DPV D KSLLKIWNLN+ +GVIGVFNCQGAG W E +
Sbjct: 531 SILRAKLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAG-WCKNEKRYLI 589
Query: 563 KPASPLSISGHVCPLDIEFLERVAGENWNGDCAVYAFNSGVLTKLPKKGNLEVSLATLKC 622
P +ISG V D+ +L +VA W GD VY+ G L LPK +L V+L +
Sbjct: 590 HDQEPGTISGCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREY 649
Query: 623 EIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFEYIMDLSKYXXXXXXXXXXRFGAYS 682
E++T+ P++ FAP+GL++M+NSGGA+ S Y + +K+ G YS
Sbjct: 650 EVFTVVPVKEFSDGSKFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRMKLRGSGLVGVYS 709
Query: 683 S-SKPKCCMVDTKEEEFTYNAEDGLLTVKL 711
S +P+ VD+ + E+ Y E GL+T L
Sbjct: 710 SVRRPRSVTVDSDDVEYRYEPESGLVTFTL 739
Score = 731 (262.4 bits), Expect = 2.6e-184, Sum P(2) = 2.6e-184
Identities = 153/331 (46%), Positives = 203/331 (61%)
Query: 1 MTVTAKATIIKDGCLMVRGNVVLTGVPQNVVVSPSS--------FIGATSAAPPSSRHVF 52
MTV A ++ D L+V G+ VL GVP+NV+V+P+S FIG TS S R VF
Sbjct: 1 MTVGAGISVT-DSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHR-VF 58
Query: 53 TLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTF 112
+LG L D RF+C+FRFK+WWM R+G + E+P ETQ L++EA + S D ++
Sbjct: 59 SLGKLED-LRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGS--DLGGRDQSSS 115
Query: 113 YILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSI 172
Y++ LP+L+G FRA LQG N+L+ C+ESGD +V E VF+ +G +PF++I ++
Sbjct: 116 YVVFLPILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAV 175
Query: 173 KILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLV 232
K +E+H TFSH E KK+P L+WFGWCTWDAFY V + +K+GL S GG +P+F++
Sbjct: 176 KAVEQHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVI 235
Query: 233 IDDGWQ-----ETINEFCKDGEPLIEGTQFAIRLVDIKENCKFNSSGS-----DNSCNDL 282
IDDGWQ ET EF D FA RL IKEN KF G D+ L
Sbjct: 236 IDDGWQSVGMDETSVEFNADN-----AANFANRLTHIKENHKFQKDGKEGHRVDDPSLSL 290
Query: 283 HEFIDEIKEKYGLKYVYMWHALAGYWGGVLP 313
I +IK LKYVY+WHA+ GYWGGV P
Sbjct: 291 GHVITDIKSNNSLKYVYVWHAITGYWGGVKP 321
>TAIR|locus:2170528 [details] [associations]
symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0005986 "sucrose biosynthetic process" evidence=IMP]
[GO:0010325 "raffinose family oligosaccharide biosynthetic process"
evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0009414 "response to water deprivation" evidence=IEP]
[GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
Uniprot:Q9FND9
Length = 783
Score = 734 (263.4 bits), Expect = 4.6e-135, Sum P(2) = 4.6e-135
Identities = 168/429 (39%), Positives = 244/429 (56%)
Query: 313 PSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGS 372
P + +D+A+D + + G+G P +FY LHS+L N+G+DGVKVDV ++E L
Sbjct: 364 PGLKLTMEDLAVDKIIETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQ 423
Query: 373 GYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM--- 428
YGGRV L + Y +AL SV +F N +I M H N + + ++ R +DF
Sbjct: 424 KYGGRVDLAKAYFKALTSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTD 483
Query: 429 P-GEP--TF--QTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYV 483
P G+P TF Q H+ A+NSL +G + PDWDMFQS H AEFHA +RA+ G +Y+
Sbjct: 484 PSGDPNGTFWLQGCHMVHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYI 543
Query: 484 SDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVI 543
SD G HDF +LKRLVLP+GS+LR + PTRD LFEDP+ DGK++LKIWNLNK +GVI
Sbjct: 544 SDCVGKHDFDLLKRLVLPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVI 603
Query: 544 GVFNCQGAGSWPMKEDMHRKPASPL--SISGHVCPLDIEF---LERVAGENWNGDCAVYA 598
G FNCQG G W +E + S +++ P D+E+ ++ N + A++
Sbjct: 604 GAFNCQGGG-W-CRETRRNQCFSECVNTLTATTSPKDVEWNSGSSPISIANVE-EFALFL 660
Query: 599 FNSGVLTKLPKKGNLEVSLATLKCEIYTICPIRVL-GQDLLFAPIGLLDMYNSGGAVESF 657
S L +LE++L K E+ T+ P+ + G + FAPIGL++M N+ GA+ S
Sbjct: 661 SQSKKLLLSGLNDDLELTLEPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL 720
Query: 658 EYIMDLSKYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP--GEC 715
Y + F Y+S KP C++D + EF Y ED ++ V++P G
Sbjct: 721 VY----NDESVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGY--EDSMVMVQVPWSGPD 774
Query: 716 TLRDIEFVY 724
L I++++
Sbjct: 775 GLSSIQYLF 783
Score = 610 (219.8 bits), Expect = 4.6e-135, Sum P(2) = 4.6e-135
Identities = 128/331 (38%), Positives = 192/331 (58%)
Query: 10 IKDGCLMVRGNVVLTGVPQNVV----------------VSPSSFIGATSAAPPSSRHVFT 53
++D L+ G VVLT VP NV VS SFIG P S HV +
Sbjct: 24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83
Query: 54 LGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTFY 113
+G L + RF+ +FRFK+WW VG + ++ ETQ+++L+ + S + S Y
Sbjct: 84 IGKLKN-IRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRP-Y 140
Query: 114 ILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSIK 173
+LLLP+L+G FR++ Q +D+ CVESG + V SE + V++++GD+PF+L+KD++K
Sbjct: 141 VLLLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMK 200
Query: 174 ILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVI 233
++ H TF LE K P +D FGWCTWDAFY VNP G+ +G+ ++GGC P ++I
Sbjct: 201 VIRVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLI 260
Query: 234 DDGWQETINEFCKDG---EPL---IEGTQFAIRLVDIKENCKFNSSGSDNSCND--LHEF 285
DDGWQ ++ DG E + + G Q RL+ +EN KF S ND + F
Sbjct: 261 DDGWQSIGHD--SDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAF 318
Query: 286 IDEIKEKYG-LKYVYMWHALAGYWGGVLPSS 315
+ ++K+++ + Y+Y+WHAL GYWGG+ P +
Sbjct: 319 VRDLKDEFSTVDYIYVWHALCGYWGGLRPEA 349
>UNIPROTKB|Q5VQG4 [details] [associations]
symbol:RFS "Galactinol--sucrose galactosyltransferase"
species:39947 "Oryza sativa Japonica Group" [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
Uniprot:Q5VQG4
Length = 783
Score = 771 (276.5 bits), Expect = 3.3e-130, Sum P(2) = 3.3e-130
Identities = 175/423 (41%), Positives = 242/423 (57%)
Query: 320 KDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVL 379
+D+A+D + GVG++DP++ + Y LHS+L SG+DGVKVDV L+E + YGGRV
Sbjct: 369 EDLAVDKIVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVE 428
Query: 380 LTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM---P-GEP-- 432
L + Y L +SV +F N +I M H N + L + A+ R +DF P G+P
Sbjct: 429 LAKAYFAGLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDG 488
Query: 433 TF--QTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYVSDKPGVH 490
TF Q H+ A+NSL +G + PDWDMFQS H A FHA +RA+ G VYVSD G H
Sbjct: 489 TFWLQGCHMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCH 548
Query: 491 DFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFNCQG 550
DF +L+RL LPDG++LR PTRDCLF DP+ DGK++LKIWN+NK SGV+G FNCQG
Sbjct: 549 DFDLLRRLALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQG 608
Query: 551 AGSWPMKEDMHRKPASPLSI--SGHVCPLDIEFLERVAGENWNGD-CAVYAFNSGVLTKL 607
G W +E A+ S+ + P D+E+ G GD AVY + L L
Sbjct: 609 GG-WS-REARRNMCAAGFSVPVTARASPADVEWSHGGGG----GDRFAVYFVEARKLQLL 662
Query: 608 PKKGNLEVSLATLKCEIYTICPIRVLGQDLL---FAPIGLLDMYNSGGAVESFEYIMDLS 664
+ ++E++L E+ + P+R + L FAPIGL +M N+GGAV+ FE
Sbjct: 663 RRDESVELTLEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFEAARKDG 722
Query: 665 KYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP--GECT-LRDIE 721
AYSS++P+ C V+ ++ EF Y EDG++TV +P G L +E
Sbjct: 723 DVAAEVAVKGAGEMVAYSSARPRLCKVNGQDAEFKY--EDGIVTVDVPWTGSSKKLSRVE 780
Query: 722 FVY 724
+ Y
Sbjct: 781 YFY 783
Score = 527 (190.6 bits), Expect = 3.3e-130, Sum P(2) = 3.3e-130
Identities = 119/328 (36%), Positives = 183/328 (55%)
Query: 10 IKDGCLMVRGNVVLTGVPQNVVVSPSSFIGATSAAP------------PSS--RHVFTLG 55
+K L V G+ L VP N+ ++P+S + S P P++ RHV +G
Sbjct: 30 LKGKDLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHVVPIG 89
Query: 56 VLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEAREDSPLDADAASDNTFYIL 115
L D RF+ +FRFK+WW VG + +V ETQM++L+ S + Y+L
Sbjct: 90 KLRDT-RFMSIFRFKVWWTTHWVGTNGRDVENETQMMILD---QSGTKSSPTGPRP-YVL 144
Query: 116 LLPVLDGQFRATLQGTPTND-LQFCVESGDSSVQTSEAFEAVFINSGDNPFELIKDSIKI 174
LLP+++G FRA L+ D + +ESG S+V+ S AV++++GD+PF+L+KD++++
Sbjct: 145 LLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMRV 204
Query: 175 LEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVID 234
+ H GTF +E K P +D FGWCTWDAFY +V+P+G+ EG+ +GGC P ++ID
Sbjct: 205 VRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLID 264
Query: 235 DGWQETINE---FCKDGEPLIE---GTQFAIRLVDIKENCKFNSSGSDNSCNDLHEFIDE 288
DGWQ ++ E + G Q RL+ +EN KF + F+ E
Sbjct: 265 DGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFREYKGG-----MGGFVRE 319
Query: 289 IKEKYG-LKYVYMWHALAGYWGGVLPSS 315
+K + ++ VY+WHAL GYWGG+ P +
Sbjct: 320 MKAAFPTVEQVYVWHALCGYWGGLRPGA 347
>TAIR|locus:2141425 [details] [associations]
symbol:STS "AT4G01970" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0047268 "galactinol-raffinose galactosyltransferase activity"
evidence=ISS] [GO:0006979 "response to oxidative stress"
evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
Uniprot:Q9SYJ4
Length = 876
Score = 650 (233.9 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
Identities = 148/417 (35%), Positives = 231/417 (55%)
Query: 313 PSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGS 372
PS D+A+D + + G+G++ P K +FY+ +HSYLA+ GV G K+DV +E+L
Sbjct: 448 PSLGATMADLAVDKVVEAGIGLVHPSKAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAE 507
Query: 373 GYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM--- 428
+GGRV L + Y L +S+ NF ++I M N + ++ + ++ R +DF
Sbjct: 508 EHGGRVELAKAYYDGLTESMIKNFNGTDVIASMQQCNEFFFLATKQISIGRVGDDFWWQD 567
Query: 429 P-GEPT----FQTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYV 483
P G+P Q +H+ ++NS+ +G+++ PDWDMFQS H AE+HA +RA+ G VY+
Sbjct: 568 PYGDPQGVYWLQGVHMIHCSYNSIWMGQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYL 627
Query: 484 SDKPGV--HDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSG 541
SD G H+F ++K+L DG++ R H PTRD LF++P+ D +S+LKI+N NK G
Sbjct: 628 SDHLGKASHNFDLIKKLAFFDGTIPRCVHYALPTRDSLFKNPLFDKESILKIFNFNKFGG 687
Query: 542 VIGVFNCQGAGSWPMKEDMHRKPASPLSISGHVCPLDIEFLER--VAGEN--WNGDCAVY 597
VIG FNCQGAG P + ++SG V DIE+ + AG + GD VY
Sbjct: 688 VIGTFNCQGAGWSPEEHRFKGYKECYTTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVY 747
Query: 598 AFNSGVLTKLPKKGN-LEVSLATLKCEIYTICPI-RVLGQDLLFAPIGLLDMYNSGGAVE 655
S + + K ++++L ++ + P+ ++ + FAP+GL++M+N G V+
Sbjct: 748 KQQSEEILFMNSKSEAMKITLEPSAFDLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQ 807
Query: 656 SFEYIMDLSKYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLP 712
+ D S RF AYSSS P C ++ KE EF + E G L+ +P
Sbjct: 808 DMKVTGDNS---IRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVP 861
Score = 433 (157.5 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
Identities = 102/262 (38%), Positives = 144/262 (54%)
Query: 37 FIGATSAAPPSSRHVFTLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEA 96
F+G T +P S R +LG D FL LFRFK+WW +GKS S++ ETQ ++L+
Sbjct: 88 FLGFTKESP-SDRLTNSLGRFEDR-EFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKI 145
Query: 97 REDSPLDADAASDNTFYILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAV 156
E +D+ Y+ ++P ++G FRA+L ++ C ESG + V+ S
Sbjct: 146 PE---IDS--------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIA 194
Query: 157 FINSGDNPFELIKDSIKILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKE 216
+I+ DNP+ L+K++ L H TF LE KK+P+ +D FGWCTWDA Y V+P I
Sbjct: 195 YIHICDNPYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWT 254
Query: 217 GLHSFLEGGCSPRFLVIDDGWQETIN----EFCKDGEPLI-EGTQFAIRLVDIKENCKF- 270
G+ F +GG P+F++IDDGWQ +IN E KD E L+ G Q RL KE KF
Sbjct: 255 GVKEFEDGGVCPKFVIIDDGWQ-SINFDGDELDKDAENLVLGGEQMTARLTSFKECKKFR 313
Query: 271 NSSGSDNSCNDLHEFIDEIKEK 292
N G +D F + +K K
Sbjct: 314 NYKGGSFITSDASHF-NPLKPK 334
Score = 85 (35.0 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
Identities = 20/58 (34%), Positives = 34/58 (58%)
Query: 269 KFNSSGSDN-SCNDLHEFIDEIKEKY-GLKYVYMWHALAGYWGGVLPSSDI-MKKDIA 323
K S GSD+ S + + F +++ ++ L +Y+WHAL G W GV P + + +K +A
Sbjct: 385 KEESLGSDDVSGSGMAAFTKDLRLRFKSLDDIYVWHALCGAWNGVRPETMMDLKAKVA 442
Score = 49 (22.3 bits), Expect = 5.0e-119, Sum P(4) = 5.0e-119
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 10 IKDGCLMVRGNV-VLTGVPQNVVVSPSSFIGATSAAP 45
+ +G L + + +L VPQNV +P S ++ AP
Sbjct: 36 LSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAP 72
>UNIPROTKB|Q93XK2 [details] [associations]
symbol:STS1 "Stachyose synthase" species:3888 "Pisum
sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
"oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
"galactinol-raffinose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
Uniprot:Q93XK2
Length = 853
Score = 677 (243.4 bits), Expect = 4.5e-105, Sum P(2) = 4.5e-105
Identities = 155/427 (36%), Positives = 237/427 (55%)
Query: 313 PSSDIMKKDIAMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGS 372
P D +D+A+ + K +G++ P + + Y+ +HSYLA SG+ GVKVDV +E +
Sbjct: 432 PGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYVCD 491
Query: 373 GYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSH-NSYSLYSSMKSAVARASEDFM--- 428
YGGRV L + Y + L +S+ NF N +I M H N + + + ++ R +DF
Sbjct: 492 EYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFWFQD 551
Query: 429 P-GEP--TF--QTLHIASVAFNSLLLGEIVVPDWDMFQSKHETAEFHATARALGGCAVYV 483
P G+P +F Q +H+ ++NSL +G+++ PDWDMFQS H A+FHA +RA+ G +YV
Sbjct: 552 PNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGPIYV 611
Query: 484 SDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPVMDGKSLLKIWNLNKLSGVI 543
SD G HDF ++K+LV PDG++ + + PTRDCLF++P+ D ++LKIWN NK GVI
Sbjct: 612 SDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYGGVI 671
Query: 544 GVFNCQGAGSWPMKEDMHRKPASPLSISG--HVCPLDIEFLERVAGENWNGDCAVYAFNS 601
G FNCQGAG P+ + P I G HV ++ + E + + VY +
Sbjct: 672 GAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKAEEYVVYLNQA 731
Query: 602 GVLTKLPKKGN-LEVSLATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGGAVESFEYI 660
L+ + K ++ ++ E+Y+ P+ L + FAPIGL +M+NSGG V EY+
Sbjct: 732 EELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNSGGTVIDLEYV 791
Query: 661 MDLSKYXXXXXXXXXXRFGAYSSSKPKCCMVDTKEEEFTYNAEDGLLTVKLPG---ECTL 717
+ +K F AYSS PK ++ E +F + DG L V +P C +
Sbjct: 792 GNGAKIKVKGGGS----FLAYSSESPKKFQLNGCEVDFEWLG-DGKLCVNVPWIEEACGV 846
Query: 718 RDIEFVY 724
D+E +
Sbjct: 847 SDMEIFF 853
Score = 383 (139.9 bits), Expect = 4.5e-105, Sum P(2) = 4.5e-105
Identities = 87/238 (36%), Positives = 132/238 (55%)
Query: 37 FIGATSAAPPSSRHVFTLGVLPDGYRFLCLFRFKIWWMIPRVGKSASEVPMETQMLLLEA 96
F G S PS R + ++G +G FL +FRFK WW +GKS S++ METQ +L+E
Sbjct: 74 FFGF-SHETPSDRLMNSIGSF-NGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131
Query: 97 REDSPLDADAASDNTFYILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAFEAV 156
E Y++++P+++ FR+ L + ++ ESG + V+ S
Sbjct: 132 PETKS-----------YVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIA 180
Query: 157 FINSGDNPFELIKDSIKILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKE 216
+++ +NP++L+K++ + H +F LE K IP +D FGWCTWDAFY VNP GI
Sbjct: 181 YVHFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFH 240
Query: 217 GLHSFLEGGCSPRFLVIDDGWQE-TINEFC--KDGEPLI-EGTQFAIRLVDIKENCKF 270
GL F +GG PRF++IDDGWQ + + + +D + L+ G Q + RL E KF
Sbjct: 241 GLDDFSKGGVEPRFVIIDDGWQSISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKF 298
Score = 238 (88.8 bits), Expect = 3.0e-52, Sum P(2) = 3.0e-52
Identities = 53/143 (37%), Positives = 78/143 (54%)
Query: 282 LHEFIDEIKEKY-GLKYVYMWHALAGYWGGVLPSS----------------DIMKKDIAM 324
L F +++ K+ GL VY+WHAL G WGGV P + D +D+A+
Sbjct: 384 LKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETTHLDTKIVPCKLSPGLDGTMEDLAV 443
Query: 325 DSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLLTRQY 384
+ K +G++ P + + Y+ +HSYLA SG+ GVKVDV +E + YGGRV L + Y
Sbjct: 444 VEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYVCDEYGGRVDLAKVY 503
Query: 385 QQALEQSVAWNFKDNNLICCMSH 407
+ L +S+ NF N +I M H
Sbjct: 504 YEGLTKSIVKNFNGNGMIASMQH 526
>ASPGD|ASPL0000010056 [details] [associations]
symbol:aglF species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
Length = 863
Score = 414 (150.8 bits), Expect = 1.4e-35, P = 1.4e-35
Identities = 139/482 (28%), Positives = 223/482 (46%)
Query: 94 LEAREDSPLDADAASDNTFYILLLPVLDGQFRATLQGTPTNDLQFCVESGDSSVQTSEAF 153
L ED+ L + +D +LL +D G P ++ ++S + + T F
Sbjct: 213 LNFTEDAILLSFLRTDGVHVVLLGVTVDDTLTVLGSG-PAGEV--VIKSQNDNA-TPSRF 268
Query: 154 EAVFINSGDNPFE-----LIKDSIKILEKHKGTFSHLENKK-IPRHLDWFGWCTWDAFYK 207
+ + + D FE LI ++ +++ ++ T + + D +CTW+ +
Sbjct: 269 QVLAATAAD--FEVATSALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTWNGLGQ 326
Query: 208 QVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQETINEFCKDGEPLIEGTQFAIRLVDIKEN 267
++ + I L G R L+IDD WQ NE + TQF
Sbjct: 327 DLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNEGAGSWHRAL--TQF---------- 374
Query: 268 CKFNSSGSDNSCNDLHEFIDEIKEKY-GLKYVYMWHALAGYWGGVLPSSD---IMK-KDI 322
+ NS N L + + I+E++ ++Y+ +WHAL GYWGG+ P I K +++
Sbjct: 375 -EANSKAFPNG---LAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTREV 430
Query: 323 AMDSLEKYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLLTR 382
A++S + + IDP I FYND +++L+ SG+ GVK D QS ++ L R
Sbjct: 431 ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLLADPEDRRSY-AN 489
Query: 383 QYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSM-----KSAVARASEDFMPGEPTFQTL 437
YQ A S +F I CMS +++ S + V R S DF P T
Sbjct: 490 AYQDAWTISSLRHFGPK-AISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSHTW 548
Query: 438 HIASVAFNSLLLGEIV-VPDWDMFQSKHET----AEFHATARALGGCAVYVSDKPGVHDF 492
H+ A N+LL + +PDWDMFQ+ E A FHA AR + G +Y++DKPG HD
Sbjct: 549 HVFCNAHNALLTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQHDI 608
Query: 493 KILKRLVLP--DGSVLRARH--AGRPTRDCLFEDPVMDGKSL-LKIWN--LNKLSGVIGV 545
++K++ G+ + R A R T D ++ D + +G L + ++ SG+IGV
Sbjct: 609 PLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGIIGV 665
Query: 546 FN 547
FN
Sbjct: 666 FN 667
>UNIPROTKB|Q97U94 [details] [associations]
symbol:galS "Alpha-galactosidase" species:273057
"Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
[GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
"glycosylceramide catabolic process" evidence=ISS]
InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
Length = 648
Score = 291 (107.5 bits), Expect = 6.9e-34, Sum P(2) = 6.9e-34
Identities = 92/274 (33%), Positives = 133/274 (48%)
Query: 280 NDLHEFIDEIKEKYGLKYVYMWHALAGYWGGVLPSSDIMKK----DIAMDSLEKYGVGII 335
N + IK G+KYV +WHA+ +WGG+ S ++MK + L Y V
Sbjct: 286 NGFKNTVRAIKS-LGVKYVGLWHAINAHWGGM--SQELMKSLNVNGYFTNFLNSY-VPSP 341
Query: 336 DPQKIFDFYNDLHSYLANSGVDGVKVDVQSLMETLGSGYG-GRVLLTRQYQQALEQSVAW 394
+ + FY + D VKVD Q ++ + + G L +R Q AL+ SV
Sbjct: 342 NLEDAIGFYKAFDGNILRD-FDLVKVDNQWVIHAIYDSFPIG--LASRNIQIALQYSVG- 397
Query: 395 NFKDNNLICCMSHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIASVAFNSLLLGEIVV 454
KD +I CMS N + + S V R S D++P LHI A+NSLL IV
Sbjct: 398 --KD--VINCMSMNPENYCNYFYSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVY 453
Query: 455 PDWDMFQSKHETAEFHATARALGGCAVYVSDK-PGVHDFKILKRLVLPDGSVLRARHAGR 513
PD+DMF S A+ H AR G +Y++D+ P + ++L+ VLP+G V+R
Sbjct: 454 PDYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPAL 513
Query: 514 PTRDCLFEDPVMDGKSLLKIWNLNKLSGVIGVFN 547
T D LF+DP+ + + LLK+ K I FN
Sbjct: 514 ITEDLLFKDPLRE-RVLLKLKGKVKGYNAIAFFN 546
Score = 157 (60.3 bits), Expect = 6.9e-34, Sum P(2) = 6.9e-34
Identities = 32/99 (32%), Positives = 57/99 (57%)
Query: 149 TSEAFEAVFINSG--DNPFELIKDSIKILEKHKGTFSHLENKKIP-RHLDWFGWCTWDAF 205
T E + F++ G DNP++ I+++I I K TF + K P + ++ GWC+W+AF
Sbjct: 173 TDEIKRSYFLSIGTSDNPYKAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAF 232
Query: 206 Y-KQVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQETINE 243
K +N + + + + +E G +++IDDGWQ+ N+
Sbjct: 233 LTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNND 271
>UNIPROTKB|G4NBB7 [details] [associations]
symbol:MGG_11554 "Seed imbibition protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0052051 "interaction with host via protein
secreted by type II secretion system" evidence=IDA]
InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
Uniprot:G4NBB7
Length = 908
Score = 377 (137.8 bits), Expect = 2.0e-31, P = 2.0e-31
Identities = 137/487 (28%), Positives = 218/487 (44%)
Query: 195 DWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPRFLVIDDGWQETINEFCKDGEPLIEG 254
D F +CTW++ + ++ I L E G + L+IDD WQ DG+ G
Sbjct: 334 DGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSL------DGD----G 383
Query: 255 TQFAIRLVDIKENCKFNSSGSDNSCNDLHEFIDEI-KEKYGLKYVYMWHALAGYWGGVLP 313
+ + R E + N G L + EI K+ ++ + +WH + GYWGG+ P
Sbjct: 384 SDASRRRW---ERFEANQQGFPQGLKGL---VSEIRKQNPQIRNIAVWHGIFGYWGGMSP 437
Query: 314 SSDI-----MKKDIAMDSLE----KYGVGIIDPQKIFDFYNDLHSYLANSGVDGVKVDVQ 364
S + M+K D E + +D + + Y+D +++LA+ GV KVD Q
Sbjct: 438 SGPMASKYKMRKIQLRDEAEVQPKDFDFYTVDGEDVHKMYDDFYAFLADCGVSAAKVDTQ 497
Query: 365 SLMETLGSGYGGRVLLTRQYQQALEQSVAWNFKDNNLICCMSHNSYSLYSSM----KSA- 419
++ R L R YQ A + + +F I CM+ S+ S+ +S
Sbjct: 498 GFLDYPAHA-NDRKNLIRPYQDAWTAAASKHF-GGRAIACMAQTPQSILHSLLQQGRSEG 555
Query: 420 ---VARASEDFMPGEPTFQTLHIASVAFNSLLLGEI-VVPDWDMFQSKH-ETAEFHATAR 474
+AR S+DF P E T H+ A N+LL+ + V+ DWDMFQ+ + A HA AR
Sbjct: 556 PMLMARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVAR 615
Query: 475 ALGGCAVYVSDKPGVHDFKILKRLVLPDGSVLR-ARHAGRPTRDCLFEDPVMDGKSLLKI 533
++ G +Y++D PG HD +++K++ A A P R L+ + LL++
Sbjct: 616 SMSGGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT-LWPYGGHGEQRLLRV 674
Query: 534 WNLNKLSGVIGVFNCQGAGSWPMKEDMHRKPASPLSISGHVCPLDIEFLERVAGENWNGD 593
+ ++ G++GVFN GS + G LD F AGE G
Sbjct: 675 RSGHQGVGMLGVFNVCNRGS----------------LLGEQVRLDDIFDGEKAGE---GS 715
Query: 594 CAVYAFNSG-VLTKLPKKGNLEVSLATLKCEIYTICPIRVLGQDLLFAPIGLLDMYNSGG 652
+ F++G ++ ++ +EV L EI+T PI LG L A +GL+ +
Sbjct: 716 FVISRFSTGEMIAPASRETVIEVGLEEGGFEIFTAYPITKLG-GLAVATLGLVGKMATAA 774
Query: 653 AVESFEY 659
AV Y
Sbjct: 775 AVSHVSY 781
>UNIPROTKB|Q8A170 [details] [associations]
symbol:BT_3797 "Possible alpha-galactosidase"
species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
"alpha-galactosidase activity" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
Uniprot:Q8A170
Length = 693
Score = 219 (82.2 bits), Expect = 9.6e-19, Sum P(2) = 9.6e-19
Identities = 64/245 (26%), Positives = 113/245 (46%)
Query: 290 KEKYGLKYVYMWHALAGYWGGVLPSSDIMKKDIAMDSLEKYGVGII---DPQKIFDFYND 346
K+ ++++ +W++L+GYW G+ +D + L Y ++ +KI +Y
Sbjct: 299 KQADKIRWIGLWYSLSGYWMGISAENDFPPE--IRQVLHSYNGSLLPGTSTEKIETWYEY 356
Query: 347 LHSYLANSGVDGVKVDVQSLMETLGSGYGGRVLL-TRQYQQALEQSVAWNFKDNNLICCM 405
+ G D +K+D QS L G G +V+ + ALE + L+ CM
Sbjct: 357 YVRTMKEYGFDFLKIDNQSFTLPLYMG-GTQVIRQAKDCNLALEHQT--HRMQMGLMNCM 413
Query: 406 SHNSYSLYSSMKSAVARASEDFMPGEPTFQTLHIASVAFNSLLLGEIVVPDWDMFQSKHE 465
+ N ++ ++ S+V RAS D+ + H+ N+L+LG+ V PD DMF S
Sbjct: 414 AQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDHDMFHSCDT 473
Query: 466 TA-EFHATARALGGCAVYVSDKPGVHDFKILKRLVLPDGSVLRARHAGRPTRDCLFEDPV 524
A ++A+ G VY+SD P ++ L+ G + R PT + + +P+
Sbjct: 474 VCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTPESILTNPL 533
Query: 525 MDGKS 529
GK+
Sbjct: 534 QSGKA 538
Score = 90 (36.7 bits), Expect = 9.6e-19, Sum P(2) = 9.6e-19
Identities = 18/67 (26%), Positives = 33/67 (49%)
Query: 170 DSIKILEKHKGTFSHLENKKIPRHLDWFGWCTWDAFYKQVNPQGIKEGLHSFLEGGCSPR 229
DS+ I +K +K+ D+ GWCTW+ ++ ++ I + + G R
Sbjct: 204 DSL-IADKAVSALRKRADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVR 262
Query: 230 FLVIDDG 236
+++IDDG
Sbjct: 263 YVLIDDG 269
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.320 0.138 0.426 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 724 714 0.00084 121 3 11 22 0.39 34
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 10
No. of states in DFA: 628 (67 KB)
Total size of DFA: 403 KB (2194 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 56.13u 0.14s 56.27t Elapsed: 00:00:02
Total cpu time: 56.13u 0.14s 56.27t Elapsed: 00:00:02
Start: Sat May 11 14:23:34 2013 End: Sat May 11 14:23:36 2013