Your job contains 1 sequence.
>004371
MTVGAGISVSDGNLMVKGSCVLANVKENIVVTPAAGGALVDGAFIGVTSDQLGSRRVFPV
GKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSAL
YTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAV
KTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFII
IDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVT
EIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA
KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL
EASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIF
LGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRA
KLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPG
TTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVV
PVKELSSGTRFAPIGLVKMFNSGGAIKELRYESEGTATVDMKVRGCGEFGAYSSARPRRI
AVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISFEL
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 004371
(758 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702... 3121 0. 1
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702... 2445 5.5e-270 2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702... 1438 3.1e-147 1
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact... 1350 6.5e-138 1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci... 821 9.4e-122 2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ... 756 4.7e-116 2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric... 384 5.7e-40 2
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec... 234 1.7e-29 2
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot... 337 6.9e-29 3
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto... 196 4.4e-20 3
>TAIR|locus:2020452 [details] [associations]
symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
Uniprot:Q84VX0
Length = 754
Score = 3121 (1103.7 bits), Expect = 0., P = 0.
Identities = 576/756 (76%), Positives = 653/756 (86%)
Query: 1 MTVGAGISVSDGNLMVKGSCVLANVKENIVVTPAAGGALVDGAFIGVTSDQLGSRRVFPV 60
MTVGAGISV+D +L+V G VL V EN++VTPA+G AL+DGAFIGVTSDQ GS RVF +
Sbjct: 1 MTVGAGISVTDSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHRVFSL 60
Query: 61 GKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSAL 120
GKLE LRFMCVFRFK+WWMTQRMG G+++P ETQFL+VEA +GS D G G +QS+
Sbjct: 61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGS--DLG---GRDQSSS 115
Query: 121 YTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAV 180
Y VFLPILEGDFRAVLQGNE NELEICLESGDP VD+FEGSHLVFVAAGSDPFDVIT AV
Sbjct: 116 YVVFLPILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAV 175
Query: 181 KTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFII 240
K VE+HL TFSHRERKKMPDMLNWFGWCTWDAFYT+VT + VKQGLES + GG+ PKF+I
Sbjct: 176 KAVEQHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVI 235
Query: 241 IDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVT 300
IDDGWQSVGMD + EF ADN ANFANRLTHIKENHKFQK+GKEG R +DP+L L H++T
Sbjct: 236 IDDGWQSVGMDETSVEFNADNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVIT 295
Query: 301 EIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA 360
+IK + LKYVYVWHAITGYWGGV+PGV+GMEHYESK+ YPVSSPGV S+E C +SI
Sbjct: 296 DIKSNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESIT 355
Query: 361 KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL 420
KNGLGLVNPEKVF FY++LHSYLAS G+DGVKVDVQNILETLGAGHGGRVKL++KYHQAL
Sbjct: 356 KNGLGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQAL 415
Query: 421 EASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIF 480
EASI+RNF +N II CMSHNTDGLYSAK++AVIRASDDFWPRDPASHTIHIASVAYNT+F
Sbjct: 416 EASISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLF 475
Query: 481 LGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRA 540
LGEFMQPDWDMFHSLHPMAEYH AARAVGGCAIYVSDKPGQHDFNLLRKLVL DGSILRA
Sbjct: 476 LGEFMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRA 535
Query: 541 KLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPG 600
KLPGRPT DC FSDP RD KSLLKIWNLN+FTGV+GVFNCQGAGWC+ K+ LIHD++PG
Sbjct: 536 KLPGRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPG 595
Query: 601 TTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVV 660
T +G +R DV YL +VA EWTGD+I YSHL GE+ YLPK+ +LP+TL REYEV+TVV
Sbjct: 596 TISGCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVV 655
Query: 661 PVKELSSGTRFAPIGLVKMFNSGGAIKELRYESEGTA-TVDMKVRGCGEFGAYSSAR-PR 718
PVKE S G++FAP+GL++MFNSGGAI LRY+ EGT V MK+RG G G YSS R PR
Sbjct: 656 PVKEFSDGSKFAPVGLMEMFNSGGAIVSLRYDDEGTKFVVRMKLRGSGLVGVYSSVRRPR 715
Query: 719 RIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 754
+ VDS++V++ YE ESGLVT TL VP++ELYLW++
Sbjct: 716 SVTVDSDDVEYRYEPESGLVTFTLGVPEKELYLWDV 751
>TAIR|locus:2103488 [details] [associations]
symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
"raffinose catabolic process" evidence=IDA] [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
[GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
[GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
Length = 773
Score = 2445 (865.7 bits), Expect = 5.5e-270, Sum P(2) = 5.5e-270
Identities = 441/689 (64%), Positives = 543/689 (78%)
Query: 1 MTVGAGISVSDGNLMVKGSCVLANVKENIVVTPAAGGALVDGAFIGVTSDQLGSRRVFPV 60
MT+ + ISV + NL+V+G +L + +NI++TP G V G+FIG T +Q S VFP+
Sbjct: 1 MTITSNISVQNDNLVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQSKSLHVFPI 60
Query: 61 GKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSAL 120
G LEGLRFMC FRFK+WWMTQRMG+CG+D+P ETQF+++E++ DE G++ +
Sbjct: 61 GVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESK-----DEVEGNGDDAPTV 115
Query: 121 YTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAV 180
YTVFLP+LEG FRAVLQGNE+NE+EIC ESGD V+ +G+HLV+V AG++PF+VI +V
Sbjct: 116 YTVFLPLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSV 175
Query: 181 KTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFII 240
K VERH+ TF HRE+KK+P L+WFGWCTWDAFYTDVT EGV +GL+S +GG PPKF+I
Sbjct: 176 KAVERHMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLI 235
Query: 241 IDDGWQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVT 300
IDDGWQ + A FA RL IKEN KFQK+ +++ GL+ +V
Sbjct: 236 IDDGWQQIENKEKDENCVVQEGAQFATRLVGIKENAKFQKS----DQKDTQVSGLKSVVD 291
Query: 301 EIKEKHDLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIA 360
K++H++K VY WHA+ GYWGGV+P +GMEHY+S + YPV SPGV N+P DS+A
Sbjct: 292 NAKQRHNVKQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLA 351
Query: 361 KNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQAL 420
+GLGLVNP+KVF+FY+ELHSYLAS GIDGVKVDVQNI+ETLGAG GGRV L+R Y QAL
Sbjct: 352 VHGLGLVNPKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQAL 411
Query: 421 EASIARNFRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIF 480
EASIARNF +N I CM HNTDGLYSAK++A++RASDDF+PRDPASHTIHIASVAYN++F
Sbjct: 412 EASIARNFTDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLF 471
Query: 481 LGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRA 540
LGEFMQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVLPDGS+LRA
Sbjct: 472 LGEFMQPDWDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA 531
Query: 541 KLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQPG 600
KLPGRPTRDCLF+DPARDG SLLKIWN+N FTG+VGVFNCQGAGWC+ KKN IHD PG
Sbjct: 532 KLPGRPTRDCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPG 591
Query: 601 TTTGFIRAKDVDYLPRVAGDEWTGDAIAYSHLGGEVAYLPKNATLPITLKSREYEVYTVV 660
T TG IRA D D + +VAG++W+GD+I Y++ GEV LPK A++P+TLK EYE++ +
Sbjct: 592 TLTGSIRADDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHIS 651
Query: 661 PVKELSSGTRFAPIGLVKMFNSGGAIKEL 689
P+KE++ FAPIGLV MFNS GAI+ +
Sbjct: 652 PLKEITENISFAPIGLVDMFNSSGAIESI 680
Score = 175 (66.7 bits), Expect = 5.5e-270, Sum P(2) = 5.5e-270
Identities = 33/59 (55%), Positives = 42/59 (71%)
Query: 696 TATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNI 754
TA V + VRGCG FGAYSS RP + AV+S E F Y+ E GLVTL L V +EE++ W++
Sbjct: 711 TALVSVSVRGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHV 769
>TAIR|locus:2170528 [details] [associations]
symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0005986 "sucrose biosynthetic process" evidence=IMP]
[GO:0010325 "raffinose family oligosaccharide biosynthetic process"
evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0009414 "response to water deprivation" evidence=IEP]
[GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
Uniprot:Q9FND9
Length = 783
Score = 1438 (511.3 bits), Expect = 3.1e-147, P = 3.1e-147
Identities = 305/759 (40%), Positives = 444/759 (58%)
Query: 9 VSDGNLMVKGSCVLANVKENIVVTPAA-----GGALVD---GAFIGVTSD-QLGSRRVFP 59
+ D L+ G VL +V N+ +T + G +D G+FIG D + S V
Sbjct: 24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83
Query: 60 VGKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSA 119
+GKL+ +RFM +FRFK+WW T +G+ G+D+ ETQ ++++ + GS GS G
Sbjct: 84 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRP--- 139
Query: 120 LYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNA 179
Y + LP+LEG FR+ Q E +++ +C+ESG +V E +V+V AG DPF ++ +A
Sbjct: 140 -YVLLLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDA 198
Query: 180 VKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFI 239
+K + H+ TF E K P +++ FGWCTWDAFY V +GV +G++ GG PP +
Sbjct: 199 MKVIRVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLV 258
Query: 240 IIDDGWQSVGMDPSGFEFRADNTA----NFANRLTHIKENHKFQKNGKEGQREEDPALGL 295
+IDDGWQS+G D G + N RL +ENHKF K+ + + D +G+
Sbjct: 259 LIDDGWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKF-KDYVSPKDQND--VGM 315
Query: 296 RHIVTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCD 354
+ V ++K++ + Y+YVWHA+ GYWGG+RP + S + P SPG++
Sbjct: 316 KAFVRDLKDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDL 373
Query: 355 AFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSR 414
A D I + G+G +P+ FY+ LHS+L +AGIDGVKVDV +ILE L +GGRV L++
Sbjct: 374 AVDKIIETGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAK 433
Query: 415 KYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSAVI-RASDDFWPRDPASHT----- 468
Y +AL +S+ ++F N +I M H D ++ + + R DDFW DP+
Sbjct: 434 AYFKALTSSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFW 493
Query: 469 ---IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFN 525
H+ AYN++++G F+QPDWDMF S HP AE+H A+RA+ G IY+SD G+HDF+
Sbjct: 494 LQGCHMVHCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFD 553
Query: 526 LLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGW 585
LL++LVLP+GSILR + PTRD LF DP DGK++LKIWNLN +TGV+G FNCQG GW
Sbjct: 554 LLKRLVLPNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGW 613
Query: 586 CRVGKKNLIHDEQPGTTTGFIRAKDVDY----LP-RVAGDEWTGDAIAYSHLGGEVAYLP 640
CR ++N E T T KDV++ P +A E ++ S ++
Sbjct: 614 CRETRRNQCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFALFLSQSK---KLLLSG 670
Query: 641 KNATLPITLKSREYEVYTVVPVKELSSGT-RFAPIGLVKMFNSGGAIKELRYESEGTATV 699
N L +TL+ ++E+ TV PV + + RFAPIGLV M N+ GAI+ L Y E +V
Sbjct: 671 LNDDLELTLEPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSLVYNDE---SV 727
Query: 700 DMKVRGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLV 738
++ V G GEF Y+S +P +D E V+FGYE+ +V
Sbjct: 728 EVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMV 766
>UNIPROTKB|Q5VQG4 [details] [associations]
symbol:RFS "Galactinol--sucrose galactosyltransferase"
species:39947 "Oryza sativa Japonica Group" [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
Uniprot:Q5VQG4
Length = 783
Score = 1350 (480.3 bits), Expect = 6.5e-138, P = 6.5e-138
Identities = 305/757 (40%), Positives = 431/757 (56%)
Query: 13 NLMVKGSCVLANVKENIVVTPAAG-------GALVDGAFIGVTSDQLGSRRVFPVGKLEG 65
+L V G L +V NI +TPA+ A G+F+G + R V P+GKL
Sbjct: 34 DLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHVVPIGKLRD 93
Query: 66 LRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEGSQYGEEQSALYTVFL 125
RFM +FRFK+WW T +G G+DV ETQ ++++ + G+ + S G Y + L
Sbjct: 94 TRFMSIFRFKVWWTTHWVGTNGRDVENETQMMILD-QSGT---KSSPTGPRP---YVLLL 146
Query: 126 PILEGDFRAVLQ-GNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNAVKTVE 184
PI+EG FRA L+ G ++ + + LESG V V++ AG DPFD++ +A++ V
Sbjct: 147 PIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMRVVR 206
Query: 185 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 244
HL TF E K P +++ FGWCTWDAFY V EGV +G+ GG PP ++IDDG
Sbjct: 207 AHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLIDDG 266
Query: 245 WQSVGMDP----SGFEFRADNTAN--FANRLTHIKENHKFQKNGKEGQREEDPALGLRHI 298
WQS+ D SG E +A RL +EN+KF++ +G G+
Sbjct: 267 WQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFREY--KG--------GMGGF 316
Query: 299 VTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFD 357
V E+K ++ VYVWHA+ GYWGG+RPG G+ +K+ P SPG+Q A D
Sbjct: 317 VREMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVD 374
Query: 358 SIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYH 417
I NG+GLV+P + Y+ LHS+L ++GIDGVKVDV ++LE + +GGRV+L++ Y
Sbjct: 375 KIVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYF 434
Query: 418 QALEASIARNFRNNDIICCMSHNTDG-LYSAKRSAVIRASDDFWPRDPASHT-------- 468
L S+ R+F N +I M H D L + A+ R DDFW DP+
Sbjct: 435 AGLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQG 494
Query: 469 IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQHDFNLLR 528
H+ AYN++++G F+ PDWDMF S HP A +H A+RAV G +YVSD G HDF+LLR
Sbjct: 495 CHMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLR 554
Query: 529 KLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRV 588
+L LPDG+ILR + PTRDCLF+DP DGK++LKIWN+N F+GV+G FNCQG GW R
Sbjct: 555 RLALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSRE 614
Query: 589 GKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDAIA-YSHLGGEVAYLPKNATLPI 647
++N+ T DV++ G GD A Y ++ L ++ ++ +
Sbjct: 615 ARRNMCAAGFSVPVTARASPADVEWSHGGGG----GDRFAVYFVEARKLQLLRRDESVEL 670
Query: 648 TLKSREYEVYTVVPVKELSS---GTRFAPIGLVKMFNSGGAIKELRY-ESEGTATVDMKV 703
TL+ YE+ V PV+ + S G FAPIGL M N+GGA++ +G ++ V
Sbjct: 671 TLEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFEAARKDGDVAAEVAV 730
Query: 704 RGCGEFGAYSSARPRRIAVDSEEVQFGYEEESGLVTL 740
+G GE AYSSARPR V+ ++ +F YE+ G+VT+
Sbjct: 731 KGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTV 765
>UNIPROTKB|Q93XK2 [details] [associations]
symbol:STS1 "Stachyose synthase" species:3888 "Pisum
sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
"oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
"galactinol-raffinose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
Uniprot:Q93XK2
Length = 853
Score = 821 (294.1 bits), Expect = 9.4e-122, Sum P(2) = 9.4e-122
Identities = 180/460 (39%), Positives = 263/460 (57%)
Query: 285 GQREEDPA-LGLRHIVTEIKEKHD-LKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPV 342
G++ E + GL+ +++ K L VYVWHA+ G WGGVRP T H ++K+
Sbjct: 373 GEKSEMKSEYGLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HLDTKIVPCK 429
Query: 343 SSPGVQSNEPCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETL 402
SPG+ A I+K LGLV+P + YD +HSYLA +GI GVKVDV + LE +
Sbjct: 430 LSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVDVIHSLEYV 489
Query: 403 GAGHGGRVKLSRKYHQALEASIARNFRNNDIICCMSHNTDGLY-SAKRSAVIRASDDFWP 461
+GGRV L++ Y++ L SI +NF N +I M H D + K+ ++ R DDFW
Sbjct: 490 CDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMGRVGDDFWF 549
Query: 462 RDPASHT--------IHIASVAYNTIFLGEFMQPDWDMFHSLHPMAEYHGAARAVGGCAI 513
+DP +H+ +YN++++G+ +QPDWDMF S H A++H +RA+ G I
Sbjct: 550 QDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGSRAICGGPI 609
Query: 514 YVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTG 573
YVSD G HDF+L++KLV PDG+I + PTRDCLF +P D ++LKIWN N + G
Sbjct: 610 YVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKIWNFNKYGG 669
Query: 574 VVGVFNCQGAGWCRVGKKNLIHDEQPGTTTGFIRAKDVDYLPRVAGDEWTGDA---IAYS 630
V+G FNCQGAGW + +K E G + +V++ + G A + Y
Sbjct: 670 VIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSH-LGKAEEYVVYL 728
Query: 631 HLGGEVAYLP-KNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKEL 689
+ E++ + K+ + T++ +E+Y+ VPV +L G +FAPIGL MFNSGG + +L
Sbjct: 729 NQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNSGGTVIDL 788
Query: 690 RYESEGTATVDMKVRGCGEFGAYSSARPRRIAVDSEEVQF 729
Y G +KV+G G F AYSS P++ ++ EV F
Sbjct: 789 EYVGNGAK---IKVKGGGSFLAYSSESPKKFQLNGCEVDF 825
Score = 397 (144.8 bits), Expect = 9.4e-122, Sum P(2) = 9.4e-122
Identities = 85/245 (34%), Positives = 129/245 (52%)
Query: 42 GAFIGVTSDQLGSRRVFPVGKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEA 101
G F G + + R + +G G F+ +FRFK WW TQ +G G D+ ETQ++++E
Sbjct: 72 GGFFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131
Query: 102 REGSHFDEGSQYGEEQSALYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGS 161
E + Y V +PI+E FR+ L + ++I ESG V E +
Sbjct: 132 PE--------------TKSYVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFN 177
Query: 162 HLVFVAAGSDPFDVITNAVKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEG 221
+ +V +P+D++ A + HL +F E K +P++++ FGWCTWDAFY V G
Sbjct: 178 SIAYVHFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIG 237
Query: 222 VKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFRAD--NTA----NFANRLTHIKEN 275
+ GL+ F KGG+ P+F+IIDDGWQS+ D G++ D N + RL E
Sbjct: 238 IFHGLDDFSKGGVEPRFVIIDDGWQSISFD--GYDPNEDAKNLVLGGEQMSGRLHRFDEC 295
Query: 276 HKFQK 280
+KF+K
Sbjct: 296 YKFRK 300
>TAIR|locus:2141425 [details] [associations]
symbol:STS "AT4G01970" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0047268 "galactinol-raffinose galactosyltransferase activity"
evidence=ISS] [GO:0006979 "response to oxidative stress"
evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
Uniprot:Q9SYJ4
Length = 876
Score = 756 (271.2 bits), Expect = 4.7e-116, Sum P(2) = 4.7e-116
Identities = 167/463 (36%), Positives = 261/463 (56%)
Query: 311 VYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEPCDAFDSIAKNGLGLVNPE 370
+YVWHA+ G W GVRP M ++K+ SP + + A D + + G+GLV+P
Sbjct: 416 IYVWHALCGAWNGVRPET--MMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473
Query: 371 KVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFRN 430
K FYD +HSYLAS G+ G K+DV LE+L HGGRV+L++ Y+ L S+ +NF
Sbjct: 474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533
Query: 431 NDIICCMSHNTDGLYSA-KRSAVIRASDDFWPRDPASHT--------IHIASVAYNTIFL 481
D+I M + + A K+ ++ R DDFW +DP +H+ +YN+I++
Sbjct: 534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593
Query: 482 GEFMQPDWDMFHSLHPMAEYHGAARAVGGCAIYVSDKPGQ--HDFNLLRKLVLPDGSILR 539
G+ +QPDWDMF S H AEYH A+RA+ G +Y+SD G+ H+F+L++KL DG+I R
Sbjct: 594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653
Query: 540 AKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVFNCQGAGWCRVGKKNLIHDEQP 599
PTRD LF +P D +S+LKI+N N F GV+G FNCQGAGW + + E
Sbjct: 654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713
Query: 600 GTTTGFIRAKDV--DYLPRVAGDE--WTGDAIAYSHLGGEVAYL-PKNATLPITLKSREY 654
T +G + D+ D P AG + +TGD + Y E+ ++ K+ + ITL+ +
Sbjct: 714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773
Query: 655 EVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELR-YESEGTATVDMKVRGCGEFGAYS 713
++ + VPV EL S + + N + ++ + G ++ + V+G G F AYS
Sbjct: 774 DLLSFVPVTELVSSG--VRFAPLGLINMFNCVGTVQDMKVTGDNSIRVDVKGEGRFMAYS 831
Query: 714 SARPRRIAVDSEEVQFGYEEESGLVTLTLRVPKEELYLWNISF 756
S+ P + ++ +E +F +EEE+G ++ + +E + ++SF
Sbjct: 832 SSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEESGGISHLSF 874
Score = 408 (148.7 bits), Expect = 4.7e-116, Sum P(2) = 4.7e-116
Identities = 99/299 (33%), Positives = 150/299 (50%)
Query: 8 SVSDGNLMVKGSC-VLANVKENIVVTPAAGGAL-VD---------------GAFIGVTSD 50
++S+G+L K S +L +V +N+ TP + ++ D G F+G T +
Sbjct: 35 NLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILLRVQANAHKGGFLGFTKE 94
Query: 51 QLGSRRVFPVGKLEGLRFMCVFRFKMWWMTQRMGNCGQDVPFETQFLVVEAREGSHFDEG 110
R +G+ E F+ +FRFKMWW T +G G D+ ETQ+++++ E D
Sbjct: 95 SPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIPE---IDS- 150
Query: 111 SQYGEEQSALYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGS 170
Y +P +EG FRA L E+ + IC ESG V E + ++
Sbjct: 151 ----------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIAYIHICD 200
Query: 171 DPFDVITNAVKTVERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFE 230
+P++++ A + H+ TF E KK+P +++ FGWCTWDA Y V + G++ FE
Sbjct: 201 NPYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFE 260
Query: 231 KGGIPPKFIIIDDGWQSVGMDPSGFEFRADNTA----NFANRLTHIKENHKFQKNGKEG 285
GG+ PKF+IIDDGWQS+ D + A+N RLT KE KF +N K G
Sbjct: 261 DGGVCPKFVIIDDGWQSINFDGDELDKDAENLVLGGEQMTARLTSFKECKKF-RNYKGG 318
>ASPGD|ASPL0000010056 [details] [associations]
symbol:aglF species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
Length = 863
Score = 384 (140.2 bits), Expect = 5.7e-40, Sum P(2) = 5.7e-40
Identities = 131/450 (29%), Positives = 218/450 (48%)
Query: 294 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRPGVTGMEHYESKMQYPVSSPGVQSNEP 352
GL VT I+E+H +++Y+ VWHA+ GYWGG+ P + Y+++
Sbjct: 384 GLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTREV------------- 430
Query: 353 CDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKL 412
A +S + + ++P + FY++ +++L+ +GI GVK D Q+ L+ L A R
Sbjct: 431 --ALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRSY 487
Query: 413 SRKYHQALEASIARNFRNNDIICCMSHNTDGLYSA-----KRSAVIRASDDFWPRDPASH 467
+ Y A S R+F I CMS ++ + K + V+R S+DF+P SH
Sbjct: 488 ANAYQDAWTISSLRHF-GPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546
Query: 468 TIHIASVAYNTIFLGEFMQ--PDWDMFHSLHP----MAEYHGAARAVGGCAIYVSDKPGQ 521
T H+ A+N + L ++ PDWDMF +L A +H AAR + G IY++DKPGQ
Sbjct: 547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605
Query: 522 HDFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSL-LKIWN--LNDFTGV 574
HD L++++ G+ LR + R T D ++ D ++G L + ++ +G+
Sbjct: 606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662
Query: 575 VGVFNCQG---AGWCRVGKKNLIHDEQPGTTTGFI-RAKDVDYLPRVAGDEWTGDAIAYS 630
+GVFN + V I+D+Q TG+I RA R+ G+ + A++ +
Sbjct: 663 IGVFNVSNRVESVIIPVADFPGIYDDQE--ETGYIVRAHRTG---RIVGELHSSSAVSVT 717
Query: 631 --HLGGEV--AYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAI 686
EV AY K T + K +E E + +P ++S A +GL++ A+
Sbjct: 718 LNERRWEVLTAYPVKTLTFKMNSKDKENE--SSMPTADVSVDV--AILGLLRKMTGVAAL 773
Query: 687 --KELRYESEGTATVDMKVRGCGEFGAYSS 714
++ E G VD+ ++ G G Y S
Sbjct: 774 VSSDIYIEDTGRLRVDVGIKALGVLGIYFS 803
Score = 122 (48.0 bits), Expect = 5.7e-40, Sum P(2) = 5.7e-40
Identities = 43/173 (24%), Positives = 77/173 (44%)
Query: 120 LYTVFLPILEGDFRAVLQGNEQNELEICLESGDPDVDEFEGSHLVFVAAGSDPFDVITNA 179
++ V L + D VL E+ I ++ + F+ V A +D F+V T+A
Sbjct: 230 VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQNDNATPSRFQ----VLAATAAD-FEVATSA 284
Query: 180 VKTVERHLLTFSHRERKKMPD---MLNWF---GWCTWDAFYTDVTGEGVKQGLESFEKGG 233
+ R L+ + P + W+ +CTW+ D++ E + L+ + G
Sbjct: 285 LIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKTAG 344
Query: 234 IPPKFIIIDDGWQSVGMDPSGF------EFRADNTA---NFANRLTHIKENHK 277
I + +IIDD WQS+ + +G +F A++ A A +T I+E H+
Sbjct: 345 IRIRTLIIDDNWQSLDNEGAGSWHRALTQFEANSKAFPNGLAKAVTTIREQHR 397
>UNIPROTKB|Q97U94 [details] [associations]
symbol:galS "Alpha-galactosidase" species:273057
"Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
[GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
"glycosylceramide catabolic process" evidence=ISS]
InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
Length = 648
Score = 234 (87.4 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
Identities = 67/199 (33%), Positives = 97/199 (48%)
Query: 368 NPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARN 427
N E FY + D VKVD Q ++ + + SR AL+ S+ +
Sbjct: 342 NLEDAIGFYKAFDGNILR-DFDLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGK- 398
Query: 428 FRNNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQP 487
D+I CMS N + + S V+R S D+ P +HI AYN++ + P
Sbjct: 399 ----DVINCMSMNPENYCNYFYSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYP 454
Query: 488 DWDMFHSLHPMAEYHGAARAVGGCAIYVSDK-PGQHDFNLLRKLVLPDGSILRAKLPGRP 546
D+DMF S P A+ H AR G IY++D+ P + + LLR VLP+G ++R P
Sbjct: 455 DYDMFMSYDPYAKVHLVARVFSGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALI 514
Query: 547 TRDCLFSDPARDGKSLLKI 565
T D LF DP R+ + LLK+
Sbjct: 515 TEDLLFKDPLRE-RVLLKL 532
Score = 177 (67.4 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
Identities = 38/125 (30%), Positives = 63/125 (50%)
Query: 154 DVDEFEGSHLVFVAAGSDPFDVITNAVKTVERHLLTFSHRERKKMPD-MLNWFGWCTWDA 212
+ DE + S+ + + +P+ I NA+ + TF R+ K PD ++N GWC+W+A
Sbjct: 172 NTDEIKRSYFLSIGTSDNPYKAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNA 231
Query: 213 FYT-DVTGEGVKQGLESFEKGGIPPKFIIIDDGWQSVGMDPSGFEFRADNTA---NFANR 268
F T D+ E + + ++ + G+ ++IIDDGWQ D + DN F N
Sbjct: 232 FLTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQDQNNDRAIRSLNPDNKKFPNGFKNT 291
Query: 269 LTHIK 273
+ IK
Sbjct: 292 VRAIK 296
Score = 79 (32.9 bits), Expect = 5.0e-13, Sum P(2) = 5.0e-13
Identities = 32/105 (30%), Positives = 45/105 (42%)
Query: 294 GLRHIVTEIKEKHDLKYVYVWHAITGYWGGVRP------GVTGMEHYESKMQYPVSSPGV 347
G ++ V IK +KYV +WHAI +WGG+ V G ++ + + V SP +
Sbjct: 287 GFKNTVRAIKSL-GVKYVGLWHAINAHWGGMSQELMKSLNVNG--YFTNFLNSYVPSPNL 343
Query: 348 QSNEPC-DAFDSIAKNGLGLVNPEK--VFH-FYDELHSYLASAGI 388
+ AFD LV + V H YD LAS I
Sbjct: 344 EDAIGFYKAFDGNILRDFDLVKVDNQWVIHAIYDSFPIGLASRNI 388
>UNIPROTKB|G4NBB7 [details] [associations]
symbol:MGG_11554 "Seed imbibition protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0052051 "interaction with host via protein
secreted by type II secretion system" evidence=IDA]
InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
Uniprot:G4NBB7
Length = 908
Score = 337 (123.7 bits), Expect = 6.9e-29, Sum P(3) = 6.9e-29
Identities = 101/326 (30%), Positives = 159/326 (48%)
Query: 294 GLRHIVTEIKEKH-DLKYVYVWHAITGYWGGVRP-GVTGMEHYESKMQYPVSSPGVQSNE 351
GL+ +V+EI++++ ++ + VWH I GYWGG+ P G ++ K+Q + VQ
Sbjct: 404 GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKIQLRDEAE-VQ--- 459
Query: 352 PCDAFDSIAKNGLGLVNPEKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVK 411
P D FD +G E V YD+ +++LA G+ KVD Q L+ A R
Sbjct: 460 PKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFLD-YPAHANDRKN 511
Query: 412 LSRKYHQALEASIARNFRNNDIICCMSHNTDGLYSAKRSA-------VIRASDDFWPRDP 464
L R Y A A+ +++F I C L+S + + R SDDF+P +
Sbjct: 512 LIRPYQDAWTAAASKHFGGRAIACMAQTPQSILHSLLQQGRSEGPMLMARNSDDFFPDEV 571
Query: 465 ASHTIHIASVAYNTIFLGEF-MQPDWDMFHSLHPM-AEYHGAARAVGGCAIYVSDKPGQH 522
SHT H+ A+N + + + DWDMF + P A H AR++ G IY++D PG+H
Sbjct: 572 GSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMSGGPIYITDAPGEH 631
Query: 523 DFNLLRKLVLP--DGSI--LRAKLPGRPTRDCLFSDPARDGKSLLKIWNLNDFTGVVGVF 578
D L++++ DG LRA PGR L+ + LL++ + + G++GVF
Sbjct: 632 DVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRVRSGHQGVGMLGVF 687
Query: 579 NCQGAGWCRVGKKNLIHDEQPGTTTG 604
N G +G++ + D G G
Sbjct: 688 NVCNRG-SLLGEQVRLDDIFDGEKAG 712
Score = 151 (58.2 bits), Expect = 1.6e-08, Sum P(3) = 1.6e-08
Identities = 45/157 (28%), Positives = 82/157 (52%)
Query: 185 RHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGEGVKQGLESFEKGGIPPKFIIIDDG 244
+H LT + R ++ D + F +CTW++ D++ + + L + GI +IIDD
Sbjct: 319 KHSLT---QARAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDN 375
Query: 245 WQSVGMDPSGFEFRADNTANFANRLTHIKENHKFQKNGKEGQREEDPALGLRHIVTEIKE 304
WQS +D G + A+R + +F+ N ++G + GL+ +V+EI++
Sbjct: 376 WQS--LDGDGSD---------ASR----RRWERFEAN-QQGFPQ-----GLKGLVSEIRK 414
Query: 305 KH-DLKYVYVWHAITGYWGGVRP-GVTGMEHYESKMQ 339
++ ++ + VWH I GYWGG+ P G ++ K+Q
Sbjct: 415 QNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKIQ 451
Score = 70 (29.7 bits), Expect = 6.9e-29, Sum P(3) = 6.9e-29
Identities = 21/89 (23%), Positives = 41/89 (46%)
Query: 634 GE-VAYLPKNATLPITLKSREYEVYTVVPVKELSSGTRFAPIGLVKMFNSGGAIKELRYE 692
GE +A + + + L+ +E++T P+ +L G A +GLV + A+ + Y
Sbjct: 724 GEMIAPASRETVIEVGLEEGGFEIFTAYPITKLG-GLAVATLGLVGKMATAAAVSHVSYS 782
Query: 693 S--EGTATVDMKV----RGCGEFGAYSSA 715
EG V ++V + G G ++ +
Sbjct: 783 KHHEGFIPVGVEVSVSLKALGTLGIFAQS 811
Score = 41 (19.5 bits), Expect = 6.9e-29, Sum P(3) = 6.9e-29
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 116 EQSALYTVFLPILEGDFRAVLQG 138
E+ A +TV +P LE + V+ G
Sbjct: 23 EKDATFTVGVPALELEHGGVING 45
>UNIPROTKB|Q8A170 [details] [associations]
symbol:BT_3797 "Possible alpha-galactosidase"
species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
"alpha-galactosidase activity" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
Uniprot:Q8A170
Length = 693
Score = 196 (74.1 bits), Expect = 4.4e-20, Sum P(3) = 4.4e-20
Identities = 53/193 (27%), Positives = 91/193 (47%)
Query: 370 EKVFHFYDELHSYLASAGIDGVKVDVQNILETLGAGHGGRVKLSRKYHQALEASIARNFR 429
EK+ +Y+ + G D +K+D Q+ L G ++ ++ + ALE R
Sbjct: 348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHR--M 405
Query: 430 NNDIICCMSHNTDGLYSAKRSAVIRASDDFWPRDPASHTIHIASVAYNTIFLGEFMQPDW 489
++ CM+ N + S+V RAS D+ D H+ NT+ LG+ + PD
Sbjct: 406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465
Query: 490 DMFHSLHPMA-EYHGAARAVGGCAIYVSDKPGQHDFNLLRKLVLPDGSILRAKLPGRPTR 548
DMFHS + ++A+ G +Y+SD P + + +R L+ G I R P PT
Sbjct: 466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525
Query: 549 DCLFSDPARDGKS 561
+ + ++P + GK+
Sbjct: 526 ESILTNPLQSGKA 538
Score = 114 (45.2 bits), Expect = 4.4e-20, Sum P(3) = 4.4e-20
Identities = 21/84 (25%), Positives = 46/84 (54%)
Query: 163 LVFVAAGSDPFDVITNAVKTV--ERHLLTFSHRERKKMPDMLNWFGWCTWDAFYTDVTGE 220
L+F S + V ++A ++ ++ + R K+ + ++ GWCTW+ ++ D+
Sbjct: 187 LIF-RKSSSVYHVFSDAYDSLIADKAVSALRKRADKQYFNAFDYLGWCTWEHYHYDIDET 245
Query: 221 GVKQGLESFEKGGIPPKFIIIDDG 244
+ +++ E GIP ++++IDDG
Sbjct: 246 KILNDIDAIEASGIPVRYVLIDDG 269
Score = 58 (25.5 bits), Expect = 4.4e-20, Sum P(3) = 4.4e-20
Identities = 6/22 (27%), Positives = 17/22 (77%)
Query: 303 KEKHDLKYVYVWHAITGYWGGV 324
K+ ++++ +W++++GYW G+
Sbjct: 299 KQADKIRWIGLWYSLSGYWMGI 320
Score = 55 (24.4 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
Identities = 21/96 (21%), Positives = 40/96 (41%)
Query: 176 ITNAVKTVERHLLTFSHRERKKMPDMLNWFG-WCTWDAFYTDVTGEG-----VKQGLESF 229
+T+ V +R +S ++K D + W G W + ++ ++ E ++Q L S+
Sbjct: 278 LTSLVPDKKRFPNGWSRIMKRKQADKIRWIGLWYSLSGYWMGISAENDFPPEIRQVLHSY 337
Query: 230 EKGGIPPKFIIIDDGWQSV---GMDPSGFEF-RADN 261
+P + W M GF+F + DN
Sbjct: 338 NGSLLPGTSTEKIETWYEYYVRTMKEYGFDFLKIDN 373
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.138 0.426 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 758 758 0.00091 121 3 11 22 0.40 34
37 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 10
No. of states in DFA: 628 (67 KB)
Total size of DFA: 421 KB (2202 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 59.73u 0.09s 59.82t Elapsed: 00:00:03
Total cpu time: 59.74u 0.09s 59.83t Elapsed: 00:00:03
Start: Tue May 21 09:25:42 2013 End: Tue May 21 09:25:45 2013