Your job contains 1 sequence.
>005020
MELLLVLLPISWAVAESFLLANLSMGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVV
EAREGSHFDEGSQYGEEQSALYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFE
GSHLVFVAAGSDPFDVITNAVKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTG
EGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQ
KNGKEGQREEDPALGLRHIVTEIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQ
YPVSSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNIL
ETLGAGHGGRVKLSRKYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDF
WPRDPASHTIHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKP
GQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFN
CQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYL
PKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELRYESEGTATV
DMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISFEL
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 005020
(719 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs: High Score, Smallest Sum Probability P(N), N
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702... 2903 1.8e-302 1
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702... 2289 1.7e-253 2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702... 1379 5.5e-141 1
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact... 1289 1.9e-131 1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci... 821 3.2e-119 2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ... 756 7.9e-112 2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric... 384 4.0e-40 2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot... 337 5.3e-35 4
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec... 234 1.4e-29 2
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto... 196 3.5e-20 3
>TAIR|locus:2020452
symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
Uniprot:Q84VX0
Length = 754
Score = 2903 (1027.0 bits), Expect = 1.8e-302, P = 1.8e-302
Identities = 534/700 (76%), Positives = 603/700 (86%)
Query: 18 FLLANLSMGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEE 77
F L L LRFMCVFRFK+WWMTQRMG G+++P ETQFL+VEA +GS D G G +
Sbjct: 58 FSLGKLE-DLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGS--DLG---GRD 111
Query: 78 QSALYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVI 137
QS+ Y VFLPILEGDFRAVLQGNE NELEICLESGDP VD+FEGSHLVFVAAGSDPFDVI
Sbjct: 112 QSSSYVVFLPILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVI 171
Query: 138 TNAVKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPP 197
T AVK VE+HL TFSHRERKKMPDMLNWFGWCTWDAFYT+VT + VKQGLES + GG+ P
Sbjct: 172 TKAVKAVEQHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTP 231
Query: 198 KFIIIDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLR 257
KF+IIDDGWQSVGMD + EF ADN ANFANRLTHIKENHKFQK+GKEG R +DP+L L
Sbjct: 232 KFVIIDDGWQSVGMDETSVEFNADNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLG 291
Query: 258 HIVTEIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAF 317
H++T+IK + LKYVYVWHAITGYWGGV+PGV+GMEHYESK+ YPVSSPGV S+E C
Sbjct: 292 HVITDIKSNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCL 351
Query: 318 DSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKY 377
+SI KNGLGLVNPEKVF FY++LHSYLAS G+DGVKVDVQNILETLGAGHGGRVKL++KY
Sbjct: 352 ESITKNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKY 411
Query: 378 HQALEASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAY 437
HQALEASI+RNF +N II CMSHNTDGLYSAK++AVIRASDDFWPRDPASHTIHIASVAY
Sbjct: 412 HQALEASISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAY 471
Query: 438 NTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGS 497
NT+FLGEFMQPDWDMFHSLHPMAEYH AARAVGGCAIYVSDKPGQHDFNLLRKLVL DGS
Sbjct: 472 NTLFLGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGS 531
Query: 498 ILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHD 557
ILRAKLPGRPT DC FSDP RD KSLLKIWNLN+FTGV+GVFNCQGAGWC+ K+ LIHD
Sbjct: 532 ILRAKLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHD 591
Query: 558 EQPGTTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEV 617
++PGT +G +R DV YL +VA EWTGD+I YSHL GE+ YLPK+ +LP+TL REYEV
Sbjct: 592 QEPGTISGCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEV 651
Query: 618 YTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELRYESEGTA-TVDMKVRGCGEFGAYSSA 676
+TVVPVKE S G++FAP+GL++MFNSGGAI LRY+ EGT V MK+RG G G YSS
Sbjct: 652 FTVVPVKEFSDGSKFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRMKLRGSGLVGVYSSV 711
Query: 677 R-PRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 715
R PR + VDS++V++ YE ESGLVT TL VP++ELYLW++
Sbjct: 712 RRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKELYLWDV 751
>TAIR|locus:2103488
symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
"raffinose catabolic process" evidence=IDA] [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
[GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
[GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
Length = 773
Score = 2289 (810.8 bits), Expect = 1.7e-253, Sum P(2) = 1.7e-253
Identities = 412/625 (65%), Positives = 501/625 (80%)
Query: 26 GLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVF 85
GLRFMC FRFK+WWMTQRMG+CG+D+P ETQF+++E++ DE G++ +YTVF
Sbjct: 65 GLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESK-----DEVEGNGDDAPTVYTVF 119
Query: 86 LPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVE 145
LP+LEG FRAVLQGNE+NE+EIC ESGD V+ +G+HLV+V AG++PF+VI +VK VE
Sbjct: 120 LPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKAVE 179
Query: 146 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 205
RH+ TF HRE+KK+P L+WFGWCTWDAFYTDVT EGV +GL+S +GG PPKF+IIDDG
Sbjct: 180 RHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDG 239
Query: 206 WQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKE 265
WQ + A FA RL IKEN KFQK+ +++ GL+ +V K+
Sbjct: 240 WQQIENKEKDENCVVQEGAQFATRLVGIKENAKFQKS----DQKDTQVSGLKSVVDNAKQ 295
Query: 266 KHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGL 325
+H++K VY WHA+ GYWGGV+P +GMEHY+S + YPV SPGV N+P DS+A +GL
Sbjct: 296 RHNVKQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGL 355
Query: 326 GLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASI 385
GLVNP+KVF+FY+ELHSYLAS GIDGVKVDVQNI+ETLGAG GGRV L+R Y QALEASI
Sbjct: 356 GLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASI 415
Query: 386 ARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEF 445
ARNF +N I CM HNTDGLYSAK++A++RASDDF+PRDPASHTIHIASVAYN++FLGEF
Sbjct: 416 ARNFTDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEF 475
Query: 446 MQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPG 505
MQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVLPDGS+LRAKLPG
Sbjct: 476 MQPDWDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPG 535
Query: 506 RPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPGTTTG 565
RPTRDCLF+DPARDG SLLKIWN+N FTG+VGVFNCQGAGWC+ KKN IHD PGT TG
Sbjct: 536 RPTRDCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTG 595
Query: 566 FIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVVPVKE 625
IRA D D + +VAG++W+GD+I Y++ GEV LPK A++P+TLK EYE++ + P+KE
Sbjct: 596 SIRADDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKE 655
Query: 626 LSSGTRFAPIGLVKMFNSGGAIKEL 650
++ FAPIGLV MFNS GAI+ +
Sbjct: 656 ITENISFAPIGLVDMFNSSGAIESI 680
Score = 175 (66.7 bits), Expect = 1.7e-253, Sum P(2) = 1.7e-253
Identities = 33/59 (55%), Positives = 42/59 (71%)
Query: 657 TATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 715
TA V + VRGCG FGAYSS RP + AV+S E F Y+ E GLVTL L V +EE++ W++
Sbjct: 711 TALVSVSVRGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHV 769
>TAIR|locus:2170528
symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0005986 "sucrose biosynthetic process" evidence=IMP]
[GO:0010325 "raffinose family oligosaccharide biosynthetic process"
evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0009414 "response to water deprivation" evidence=IEP]
[GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
Uniprot:Q9FND9
Length = 783
Score = 1379 (490.5 bits), Expect = 5.5e-141, P = 5.5e-141
Identities = 285/693 (41%), Positives = 413/693 (59%)
Query: 27 LRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVFL 86
+RFM +FRFK+WW T +G+ G+D+ ETQ ++++ + GS GS G Y + L
Sbjct: 90 IRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRP----YVLLL 144
Query: 87 PILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVER 146
P+LEG FR+ Q E +++ +C+ESG +V E +V+V AG DPF ++ +A+K +
Sbjct: 145 PLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMKVIRV 204
Query: 147 HLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGW 206
H+ TF E K P +++ FGWCTWDAFY V +GV +G++ GG PP ++IDDGW
Sbjct: 205 HMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDDGW 264
Query: 207 QSVGMDPSGFEFRADNTA----NFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTE 262
QS+G D G + N RL +ENHKF K+ + + D +G++ V +
Sbjct: 265 QSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKF-KDYVSPKDQND--VGMKAFVRD 321
Query: 263 IKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA 321
+K++ + Y+YVWHA+ GYWGG+RP + S + P SPG++ A D I
Sbjct: 322 LKDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKII 379
Query: 322 KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL 381
+ G+G +P+ FY+ LHS+L +AGIDGVKVDV +ILE L +GGRV L++ Y +AL
Sbjct: 380 ETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKAL 439
Query: 382 EASIARNFRNNDIICCMSHNTDGLYSAKRSAVI-RASDDFWPRDPASHT--------IHI 432
+S+ ++F N +I M H D ++ + + R DDFW DP+ H+
Sbjct: 440 TSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHM 499
Query: 433 ASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLV 492
AYN++++G F+QPDWDMF S HP AE+H A+RA+ G IY+SD G+HDF+LL++LV
Sbjct: 500 VHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLV 559
Query: 493 LPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKK 552
LP+GSILR + PTRD LF DP DGK++LKIWNLN +TGV+G FNCQG GWCR ++
Sbjct: 560 LPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRR 619
Query: 553 NLIHDEQPGTTTGFIRAKDVDY----LP-RVAGDEWTGDAIAYSHLGGEVAYLPKNATLP 607
N E T T KDV++ P +A E ++ S ++ N L
Sbjct: 620 NQCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFALFLSQSK---KLLLSGLNDDLE 676
Query: 608 ITLKSREYEVYTVVPVKELSSGT-RFAPIGLVKMFNSGGAIKELRYESEGTATVDMKVRG 666
+TL+ ++E+ TV PV + + RFAPIGLV M N+ GAI+ L Y E +V++ V G
Sbjct: 677 LTLEPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSLVYNDE---SVEVGVFG 733
Query: 667 CGEFGAYSSARPRRIAVDSEEVQFGYEEESGLV 699
GEF Y+S +P +D E V+FGYE+ +V
Sbjct: 734 AGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMV 766
>UNIPROTKB|Q5VQG4
symbol:RFS "Galactinol--sucrose galactosyltransferase"
species:39947 "Oryza sativa Japonica Group" [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
Uniprot:Q5VQG4
Length = 783
Score = 1289 (458.8 bits), Expect = 1.9e-131, P = 1.9e-131
Identities = 285/696 (40%), Positives = 403/696 (57%)
Query: 28 RFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVFLP 87
RFM +FRFK+WW T +G G+DV ETQ ++++ + G+ + S G Y + LP
Sbjct: 95 RFMSIFRFKVWWTTHWVGTNGRDVENETQMMILD-QSGT---KSSPTGPRP---YVLLLP 147
Query: 88 ILEGDFRAVLQ-GNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVER 146
I+EG FRA L+ G ++ + + LESG V V++ AG DPFD++ +A++ V
Sbjct: 148 IVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMRVVRA 207
Query: 147 HLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGW 206
HL TF E K P +++ FGWCTWDAFY V EGV +G+ GG PP ++IDDGW
Sbjct: 208 HLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLIDDGW 267
Query: 207 QSVGMDP----SGFEFRADNTAN--FANRLTHIKENHKFQKNGKEGQREEDPALGLRHIV 260
QS+ D SG E +A RL +EN+KF++ +G G+ V
Sbjct: 268 QSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFREY--KG--------GMGGFV 317
Query: 261 TEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDS 319
E+K ++ VYVWHA+ GYWGG+RPG G+ +K+ P SPG+Q A D
Sbjct: 318 REMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDK 375
Query: 320 IAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQ 379
I NG+GLV+P + Y+ LHS+L ++GIDGVKVDV ++LE + +GGRV+L++ Y
Sbjct: 376 IVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFA 435
Query: 380 ALEASIARNFRNNDIICCMSHNTDG-LYSAKRSAVIRASDDFWPRDPASHT--------I 430
L S+ R+F N +I M H D L + A+ R DDFW DP+
Sbjct: 436 GLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGC 495
Query: 431 HIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRK 490
H+ AYN++++G F+ PDWDMF S HP A +H A+RAV G +YVSD G HDF+LLR+
Sbjct: 496 HMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRR 555
Query: 491 LVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVG 550
L LPDG+ILR + PTRDCLF+DP DGK++LKIWN+N F+GV+G FNCQG GW R
Sbjct: 556 LALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREA 615
Query: 551 KKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDAIA-YSHLGGEVAYLPKNATLPIT 609
++N+ T DV++ G GD A Y ++ L ++ ++ +T
Sbjct: 616 RRNMCAAGFSVPVTARASPADVEWSHGGGG----GDRFAVYFVEARKLQLLRRDESVELT 671
Query: 610 LKSREYEVYTVVPVKELSS---GTRFAPIGLVKMFNSGGAIKELRY-ESEGTATVDMKVR 665
L+ YE+ V PV+ + S G FAPIGL M N+GGA++ +G ++ V+
Sbjct: 672 LEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFEAARKDGDVAAEVAVK 731
Query: 666 GCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTL 701
G GE AYSSARPR V+ ++ +F YE+ G+VT+
Sbjct: 732 GAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTV 765
>UNIPROTKB|Q93XK2
symbol:STS1 "Stachyose synthase" species:3888 "Pisum
sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
"oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
"galactinol-raffinose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
Uniprot:Q93XK2
Length = 853
Score = 821 (294.1 bits), Expect = 3.2e-119, Sum P(2) = 3.2e-119
Identities = 180/460 (39%), Positives = 263/460 (57%)
Query: 246 GQREEDPA-LGLRHIVTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPV 303
G++ E + GL+ +++ K L VYVWHA+ G WGGVRP T H ++K+
Sbjct: 373 GEKSEMKSEYGLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HLDTKIVPCK 429
Query: 304 SSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETL 363
SPG+ A I+K LGLV+P + YD +HSYLA +GI GVKVDV + LE +
Sbjct: 430 LSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYV 489
Query: 364 GAGHGGRVKLSRKYHQALEASIARNFRNNDIICCMSHNTDGLY-SAKRSAVIRASDDFWP 422
+GGRV L++ Y++ L SI +NF N +I M H D + K+ ++ R DDFW
Sbjct: 490 CDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFWF 549
Query: 423 RDPASHT--------IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAI 474
+DP +H+ +YN++++G+ +QPDWDMF S H A++H +RA+ G I
Sbjct: 550 QDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGPI 609
Query: 475 YVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTG 534
YVSD G HDF+L++KLV PDG+I + PTRDCLF +P D ++LKIWN N + G
Sbjct: 610 YVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYGG 669
Query: 535 VVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDA---IAYS 591
V+G FNCQGAGW + +K E G + +V++ + G A + Y
Sbjct: 670 VIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSH-LGKAEEYVVYL 728
Query: 592 HLGGEVAYLP-KNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKEL 650
+ E++ + K+ + T++ +E+Y+ VPV +L G +FAPIGL MFNSGG + +L
Sbjct: 729 NQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNSGGTVIDL 788
Query: 651 RYESEGTATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQF 690
Y G +KV+G G F AYSS P++ ++ EV F
Sbjct: 789 EYVGNGAK---IKVKGGGSFLAYSSESPKKFQLNGCEVDF 825
Score = 373 (136.4 bits), Expect = 3.2e-119, Sum P(2) = 3.2e-119
Identities = 80/222 (36%), Positives = 120/222 (54%)
Query: 26 GLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVF 85
G F+ +FRFK WW TQ +G G D+ ETQ++++E E + Y V
Sbjct: 95 GKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEVPE--------------TKSYVVI 140
Query: 86 LPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVE 145
+PI+E FR+ L + ++I ESG V E + + +V +P+D++ A +
Sbjct: 141 IPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIAYVHFSENPYDLMKEAYSAIR 200
Query: 146 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 205
HL +F E K +P++++ FGWCTWDAFY V G+ GL+ F KGG+ P+F+IIDDG
Sbjct: 201 VHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGLDDFSKGGVEPRFVIIDDG 260
Query: 206 WQSVGMDPSGFEFRAD--NTA----NFANRLTHIKENHKFQK 241
WQS+ D G++ D N + RL E +KF+K
Sbjct: 261 WQSISFD--GYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300
>TAIR|locus:2141425
symbol:STS "AT4G01970" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0047268 "galactinol-raffinose galactosyltransferase activity"
evidence=ISS] [GO:0006979 "response to oxidative stress"
evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
Uniprot:Q9SYJ4
Length = 876
Score = 756 (271.2 bits), Expect = 7.9e-112, Sum P(2) = 7.9e-112
Identities = 167/463 (36%), Positives = 261/463 (56%)
Query: 272 VYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPE 331
+YVWHA+ G W GVRP M ++K+ SP + + A D + + G+GLV+P
Sbjct: 416 IYVWHALCGAWNGVRPET--MMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473
Query: 332 KVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRN 391
K FYD +HSYLAS G+ G K+DV LE+L HGGRV+L++ Y+ L S+ +NF
Sbjct: 474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533
Query: 392 NDIICCMSHNTDGLYSA-KRSAVIRASDDFWPRDPASHT--------IHIASVAYNTIFL 442
D+I M + + A K+ ++ R DDFW +DP +H+ +YN+I++
Sbjct: 534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593
Query: 443 GEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQ--HDFNLLRKLVLPDGSILR 500
G+ +QPDWDMF S H AEYH A+RA+ G +Y+SD G+ H+F+L++KL DG+I R
Sbjct: 594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653
Query: 501 AKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQP 560
PTRD LF +P D +S+LKI+N N F GV+G FNCQGAGW + + E
Sbjct: 654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713
Query: 561 GTTTGFIRAKDV--DYLPRVAGDE--WTGDAIAYSHLGGEVAYL-PKNATLPITLKSREY 615
T +G + D+ D P AG + +TGD + Y E+ ++ K+ + ITL+ +
Sbjct: 714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773
Query: 616 EVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELR-YESEGTATVDMKVRGCGEFGAYS 674
++ + VPV EL S + + N + ++ + G ++ + V+G G F AYS
Sbjct: 774 DLLSFVPVTELVSSG--VRFAPLGLINMFNCVGTVQDMKVTGDNSIRVDVKGEGRFMAYS 831
Query: 675 SARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISF 717
S+ P + ++ +E +F +EEE+G ++ + +E + ++SF
Sbjct: 832 SSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEESGGISHLSF 874
Score = 368 (134.6 bits), Expect = 7.9e-112, Sum P(2) = 7.9e-112
Identities = 81/222 (36%), Positives = 117/222 (52%)
Query: 29 FMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVFLPI 88
F+ +FRFKMWW T +G G D+ ETQ+++++ E D Y +P
Sbjct: 112 FLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIPE---IDS-----------YVAIIPT 157
Query: 89 LEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVERHL 148
+EG FRA L E+ + IC ESG V E + ++ +P++++ A + H+
Sbjct: 158 IEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIAYIHICDNPYNLMKEAFSALRVHM 217
Query: 149 LTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDGWQS 208
TF E KK+P +++ FGWCTWDA Y V + G++ FE GG+ PKF+IIDDGWQS
Sbjct: 218 NTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFEDGGVCPKFVIIDDGWQS 277
Query: 209 VGMDPSGFEFRADNTA----NFANRLTHIKENHKFQKNGKEG 246
+ D + A+N RLT KE KF +N K G
Sbjct: 278 INFDGDELDKDAENLVLGGEQMTARLTSFKECKKF-RNYKGG 318
>ASPGD|ASPL0000010056
symbol:aglF species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
Length = 863
Score = 384 (140.2 bits), Expect = 4.0e-40, Sum P(2) = 4.0e-40
Identities = 131/450 (29%), Positives = 218/450 (48%)
Query: 255 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEP 313
GL VT I+E+H +++Y+ VWHA+ GYWGG+ P + Y+++
Sbjct: 384 GLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTREV------------- 430
Query: 314 CDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKL 373
A +S + + ++P + FY++ +++L+ +GI GVK D Q+ L+ L A R
Sbjct: 431 --ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRSY 487
Query: 374 SRKYHQALEASIARNFRNNDIICCMSHNTDGLYSA-----KRSAVIRASDDFWPRDPASH 428
+ Y A S R+F I CMS ++ + K + V+R S+DF+P SH
Sbjct: 488 ANAYQDAWTISSLRHF-GPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546
Query: 429 TIHIASVAYNTIFLGEFMQ--PDWDMFHSLHP----MAEYHGAARAVGGCAIYVSDKPGQ 482
T H+ A+N + L ++ PDWDMF +L A +H AAR + G IY++DKPGQ
Sbjct: 547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605
Query: 483 HDFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSL-LKIWN--LNDFTGV 535
HD L++++ G+ LR + R T D ++ D ++G L + ++ +G+
Sbjct: 606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662
Query: 536 VGVFNCQG---AGWCRVGKKNLIHDEQPGTTTGFI-RAKDVDYLPRVAGDEWTGDAIAYS 591
+GVFN + V I+D+Q TG+I RA R+ G+ + A++ +
Sbjct: 663 IGVFNVSNRVESVIIPVADFPGIYDDQE--ETGYIVRAHRTG---RIVGELHSSSAVSVT 717
Query: 592 --HLGGEV--AYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAI 647
EV AY K T + K +E E + +P ++S A +GL++ A+
Sbjct: 718 LNERRWEVLTAYPVKTLTFKMNSKDKENE--SSMPTADVSVDV--AILGLLRKMTGVAAL 773
Query: 648 --KELRYESEGTATVDMKVRGCGEFGAYSS 675
++ E G VD+ ++ G G Y S
Sbjct: 774 VSSDIYIEDTGRLRVDVGIKALGVLGIYFS 803
Score = 122 (48.0 bits), Expect = 4.0e-40, Sum P(2) = 4.0e-40
Identities = 43/173 (24%), Positives = 77/173 (44%)
Query: 81 LYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNA 140
++ V L + D VL E+ I ++ + F+ V A +D F+V T+A
Sbjct: 230 VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQNDNATPSRFQ----VLAATAAD-FEVATSA 284
Query: 141 VKTVERHLLTFSHRERKKMPD---MLNWF---GWCTWDAFYTDVTGEGVKQGLESFEKGG 194
+ R L+ + P + W+ +CTW+ D++ E + L+ + G
Sbjct: 285 LIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKTAG 344
Query: 195 IPPKFIIIDDGWQSVGMDPSGF------EFRADNTA---NFANRLTHIKENHK 238
I + +IIDD WQS+ + +G +F A++ A A +T I+E H+
Sbjct: 345 IRIRTLIIDDNWQSLDNEGAGSWHRALTQFEANSKAFPNGLAKAVTTIREQHR 397
>UNIPROTKB|G4NBB7
symbol:MGG_11554 "Seed imbibition protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0052051 "interaction with host via protein
secreted by type II secretion system" evidence=IDA]
InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
Uniprot:G4NBB7
Length = 908
Score = 337 (123.7 bits), Expect = 5.3e-35, Sum P(4) = 5.3e-35
Identities = 101/326 (30%), Positives = 159/326 (48%)
Query: 255 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRP-GVTGMEHYESKMQYPVSSPGVQSNE 312
GL+ +V+EI++++ ++ + VWH I GYWGG+ P G ++ K+Q + VQ
Sbjct: 404 GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKIQLRDEAE-VQ--- 459
Query: 313 PCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVK 372
P D FD +G E V YD+ +++LA G+ KVD Q L+ A R
Sbjct: 460 PKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFLD-YPAHANDRKN 511
Query: 373 LSRKYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSA-------VIRASDDFWPRDP 425
L R Y A A+ +++F I C L+S + + R SDDF+P +
Sbjct: 512 LIRPYQDAWTAAASKHFGGRAIACMAQTPQSILHSLLQQGRSEGPMLMARNSDDFFPDEV 571
Query: 426 ASHTIHIASVAYNTIFLGEF-MQPDWDMFHSLHPM-AEYHGAARAVGGCAIYVSDKPGQH 483
SHT H+ A+N + + + DWDMF + P A H AR++ G IY++D PG+H
Sbjct: 572 GSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMSGGPIYITDAPGEH 631
Query: 484 DFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVF 539
D L++++ DG LRA PGR L+ + LL++ + + G++GVF
Sbjct: 632 DVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRVRSGHQGVGMLGVF 687
Query: 540 NCQGAGWCRVGKKNLIHDEQPGTTTG 565
N G +G++ + D G G
Sbjct: 688 NVCNRG-SLLGEQVRLDDIFDGEKAG 712
Score = 97 (39.2 bits), Expect = 5.3e-35, Sum P(4) = 5.3e-35
Identities = 22/69 (31%), Positives = 37/69 (53%)
Query: 146 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 205
+H LT + R ++ D + F +CTW++ D++ + + L + GI +IIDD
Sbjct: 319 KHSLT---QARAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDN 375
Query: 206 WQSVGMDPS 214
WQS+ D S
Sbjct: 376 WQSLDGDGS 384
Score = 70 (29.7 bits), Expect = 5.3e-35, Sum P(4) = 5.3e-35
Identities = 21/89 (23%), Positives = 41/89 (46%)
Query: 595 GE-VAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELRYE 653
GE +A + + + L+ +E++T P+ +L G A +GLV + A+ + Y
Sbjct: 724 GEMIAPASRETVIEVGLEEGGFEIFTAYPITKLG-GLAVATLGLVGKMATAAAVSHVSYS 782
Query: 654 S--EGTATVDMKV----RGCGEFGAYSSA 676
EG V ++V + G G ++ +
Sbjct: 783 KHHEGFIPVGVEVSVSLKALGTLGIFAQS 811
Score = 41 (19.5 bits), Expect = 5.3e-35, Sum P(4) = 5.3e-35
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 77 EQSALYTVFLPILEGDFRAVLQG 99
E+ A +TV +P LE + V+ G
Sbjct: 23 EKDATFTVGVPALELEHGGVING 45
>UNIPROTKB|Q97U94
symbol:galS "Alpha-galactosidase" species:273057
"Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
[GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
"glycosylceramide catabolic process" evidence=ISS]
InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
Length = 648
Score = 234 (87.4 bits), Expect = 1.4e-29, Sum P(2) = 1.4e-29
Identities = 67/199 (33%), Positives = 97/199 (48%)
Query: 329 NPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARN 388
N E FY + D VKVD Q ++ + + SR AL+ S+ +
Sbjct: 342 NLEDAIGFYKAFDGNILR-DFDLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGK- 398
Query: 389 FRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQP 448
D+I CMS N + + S V+R S D+ P +HI AYN++ + P
Sbjct: 399 ----DVINCMSMNPENYCNYFYSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYP 454
Query: 449 DWDMFHSLHPMAEYHGAARAVGGCAIYVSDK-PGQHDFNLLRKLVLPDGSILRAKLPGRP 507
D+DMF S P A+ H AR G IY++D+ P + + LLR VLP+G ++R P
Sbjct: 455 DYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALI 514
Query: 508 TRDCLFSDPARDGKSLLKI 526
T D LF DP R+ + LLK+
Sbjct: 515 TEDLLFKDPLRE-RVLLKL 532
Score = 177 (67.4 bits), Expect = 1.4e-29, Sum P(2) = 1.4e-29
Identities = 38/125 (30%), Positives = 63/125 (50%)
Query: 115 DVDEFEGSHLVFVAAGSDPFDVITNAVKTVERHLLTFSHRERKKMPD-MLNWFGWCTWDA 173
+ DE + S+ + + +P+ I NA+ + TF R+ K PD ++N GWC+W+A
Sbjct: 172 NTDEIKRSYFLSIGTSDNPYKAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNA 231
Query: 174 FYT-DVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFRADNTA---NFANR 229
F T D+ E + + ++ + G+ ++IIDDGWQ D + DN F N
Sbjct: 232 FLTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNNDRAIRSLNPDNKKFPNGFKNT 291
Query: 230 LTHIK 234
+ IK
Sbjct: 292 VRAIK 296
Score = 79 (32.9 bits), Expect = 4.3e-13, Sum P(2) = 4.3e-13
Identities = 32/105 (30%), Positives = 45/105 (42%)
Query: 255 GLRHIVTEIKEKHDLKYVYVWHAITGYWGGVRP------GVTGMEHYESKMQYPVSSPGV 308
G ++ V IK +KYV +WHAI +WGG+ V G ++ + + V SP +
Sbjct: 287 GFKNTVRAIKSL-GVKYVGLWHAINAHWGGMSQELMKSLNVNG--YFTNFLNSYVPSPNL 343
Query: 309 QSNEPC-DAFDSIAKNGLGLVNPEK--VFH-FYDELHSYLASAGI 349
+ AFD LV + V H YD LAS I
Sbjct: 344 EDAIGFYKAFDGNILRDFDLVKVDNQWVIHAIYDSFPIGLASRNI 388
>UNIPROTKB|Q8A170
symbol:BT_3797 "Possible alpha-galactosidase"
species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
"alpha-galactosidase activity" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
Uniprot:Q8A170
Length = 693
Score = 196 (74.1 bits), Expect = 3.5e-20, Sum P(3) = 3.5e-20
Identities = 53/193 (27%), Positives = 91/193 (47%)
Query: 331 EKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFR 390
EK+ +Y+ + G D +K+D Q+ L G ++ ++ + ALE R
Sbjct: 348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHR--M 405
Query: 391 NNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDW 450
++ CM+ N + S+V RAS D+ D H+ NT+ LG+ + PD
Sbjct: 406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465
Query: 451 DMFHSLHPMA-EYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTR 509
DMFHS + ++A+ G +Y+SD P + + +R L+ G I R P PT
Sbjct: 466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525
Query: 510 DCLFSDPARDGKS 522
+ + ++P + GK+
Sbjct: 526 ESILTNPLQSGKA 538
Score = 114 (45.2 bits), Expect = 3.5e-20, Sum P(3) = 3.5e-20
Identities = 21/84 (25%), Positives = 46/84 (54%)
Query: 124 LVFVAAGSDPFDVITNAVKTV--ERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGE 181
L+F S + V ++A ++ ++ + R K+ + ++ GWCTW+ ++ D+
Sbjct: 187 LIF-RKSSSVYHVFSDAYDSLIADKAVSALRKRADKQYFNAFDYLGWCTWEHYHYDIDET 245
Query: 182 GVKQGLESFEKGGIPPKFIIIDDG 205
+ +++ E GIP ++++IDDG
Sbjct: 246 KILNDIDAIEASGIPVRYVLIDDG 269
Score = 58 (25.5 bits), Expect = 3.5e-20, Sum P(3) = 3.5e-20
Identities = 6/22 (27%), Positives = 17/22 (77%)
Query: 264 KEKHDLKYVYVWHAITGYWGGV 285
K+ ++++ +W++++GYW G+
Sbjct: 299 KQADKIRWIGLWYSLSGYWMGI 320
Score = 55 (24.4 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
Identities = 21/96 (21%), Positives = 40/96 (41%)
Query: 137 ITNAVKTVERHLLTFSHRERKKMPDMLNWFG-WCTWDAFYTDVTGEG-----VKQGLESF 190
+T+ V +R +S ++K D + W G W + ++ ++ E ++Q L S+
Sbjct: 278 LTSLVPDKKRFPNGWSRIMKRKQADKIRWIGLWYSLSGYWMGISAENDFPPEIRQVLHSY 337
Query: 191 EKGGIPPKFIIIDDGWQSV---GMDPSGFEF-RADN 222
+P + W M GF+F + DN
Sbjct: 338 NGSLLPGTSTEKIETWYEYYVRTMKEYGFDFLKIDN 373
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query                        ----- As Used -----     ----- Computed ----
Frame  MatID  Matrix name    Lambda  K       H       Lambda  K       H
+0     0      BLOSUM62       0.320   0.138   0.429   same    same    same
              Q=9,R=2        0.244   0.0300  0.180   n/a     n/a     n/a
Query
Frame  MatID  Length  Eff.Length  E        S    W  T   X   E2    S2
+0     0      719     719         0.00085  121  3  11  22  0.39  34
                                                       36  0.45  37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 10
No. of states in DFA: 630 (67 KB)
Total size of DFA: 417 KB (2200 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 57.00u 0.10s 57.10t Elapsed: 00:00:03
Total cpu time: 57.00u 0.10s 57.10t Elapsed: 00:00:03
Start: Sat May 11 01:18:24 2013 End: Sat May 11 01:18:27 2013
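
Note on the bit scores: the values in parentheses after each raw Score appear consistent with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the gapped values from the Q=9,R=2 row of the Parameters block (Lambda = 0.244, K = 0.0300). The sketch below is an illustration only; the function name `bit_score` is hypothetical and is not part of WU-BLAST, and the choice of the gapped row is an assumption checked only against the numbers in this report.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of the
# Parameters block in this report (assumed to be the ones used for scoring).
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score as printed in the report) for the top three hits.
reported = [(2903, 1027.0), (2289, 810.8), (1379, 490.5)]

for raw, bits in reported:
    # Print the recomputed value next to the value from the report.
    print(raw, round(bit_score(raw), 1), bits)
```

Rounded to one decimal place, the recomputed values agree with the report for these three hits, which supports reading the parenthesised figures as normalized bit scores.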