Your job contains 1 sequence.
>003500
MLRTAHVQLQQLKWFPALWKSRGHHRISFQNYKPLVLRRSKMTVAPNISISDGNLVVHGK
TILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWM
TQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEI
EICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDW
FGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGA
QFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYWGGVKPA
ADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCG
VDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS
KQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARA
VGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN
VNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVENMAQIAGAGWNGDAI
VYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNSGGAVE
NVEVHMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVRGCGRFGIYSSQRPLKCTVG
SIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 003500
(815 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                     Smallest
                                                                        Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702...  3379  0.        1
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702...  2376  1.6e-259  2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact...  1306  1.2e-138  2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702...  1340  7.4e-137  1
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ...   734  3.5e-125  3
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci...   783  4.0e-124  3
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric...   340  1.0e-36   3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot...   337  3.0e-31   3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec...   238  4.6e-31   3
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto...   196  2.6e-23   4
>TAIR|locus:2103488
symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
"raffinose catabolic process" evidence=IDA] [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
[GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
[GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
Length = 773
Score = 3379 (1194.5 bits), Expect = 0., P = 0.
Identities = 618/775 (79%), Positives = 690/775 (89%)
Query: 42 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 101
MT+ NIS+ + NLVV GKTILT +PDNIILTP G G V+G+FIGAT SKSLHVFP+
Sbjct: 1 MTITSNISVQNDNLVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQSKSLHVFPI 60
Query: 102 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 161
GVLE LRFMCCFRFKLWWMTQRMG+CGKD+PLETQFML+ESKD E + DD PT+YTVFL
Sbjct: 61 GVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAPTVYTVFL 120
Query: 162 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 221
PLLEGQFR+ LQGNE NEIEIC ESGD AVET+QG +LVY HAG NPFEVI Q+VKAVE+
Sbjct: 121 PLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKAVER 180
Query: 222 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 281
+MQTF HREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLS GGTPPKFLIIDDGW
Sbjct: 181 HMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGW 240
Query: 282 QQIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNV 341
QQIENK K+E NC+VQEGAQFA+RL GIKEN+KFQK Q QVSGLK VVD +KQ HNV
Sbjct: 241 QQIENKEKDE-NCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAKQRHNV 299
Query: 342 KYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVH 401
K VY WHALAGYWGGVKPAA GMEHYD+ALAYPV SPGV+GNQPDIVMDSLAVHGLGLV+
Sbjct: 300 KQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVN 359
Query: 402 PKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNF 461
PKKVFNFYNELH+YLASCG+DGVKVDVQNIIETLGAG GGRVSLTRSY QALEASIARNF
Sbjct: 360 PKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNF 419
Query: 462 PDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPD 521
DNGCISCMCHNTDG+YS+KQTA++RASDD+YPRDPASHTIHI+SVAYN+LFLGEFMQPD
Sbjct: 420 TDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPD 479
Query: 522 WDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 581
WDMFHSLHP AEYH AARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA+LPGRPTR
Sbjct: 480 WDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTR 539
Query: 582 DCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRV 641
DCLFADPARDG SLLK+WN+NK +G+VGVFNCQGAGWCK TKK +IHD SPGTLT S+R
Sbjct: 540 DCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRA 599
Query: 642 TDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSN 701
D + ++Q+AG W+GD+IVYA+RSGEVVRLPKGAS+P+TLKVLEYELFH PLKEI+ N
Sbjct: 600 DDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITEN 659
Query: 702 ISFAAIGLLDMFNSGGAVENVEV-HMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKV 760
ISFA IGL+DMFNS GA+E++++ H+++K P+ FDGE+SS + +LSDNRSPTA +S+ V
Sbjct: 660 ISFAPIGLVDMFNSSGAIESIDINHVTDKNPEFFDGEISSA-SPALSDNRSPTALVSVSV 718
Query: 761 RGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV 815
RGCGRFG YSSQRPLKC V S +TDFTYD+ GL+T+ LPV EEM+RW VEI V
Sbjct: 719 RGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHVEILV 773
>TAIR|locus:2020452
symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
Uniprot:Q84VX0
Length = 754
Score = 2376 (841.5 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
Identities = 433/683 (63%), Positives = 539/683 (78%)
Query: 42 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 101
MTV IS++D +LVV G +L GVP+N+++TP +G L+ GAFIG T+ + S VF +
Sbjct: 1 MTVGAGISVTDSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHRVFSL 60
Query: 102 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 161
G LEDLRFMC FRFKLWWMTQRMGT GK++P ETQF++VE+ S+ D + Y VFL
Sbjct: 61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGSDLGGRDQSSSYVVFL 120
Query: 162 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 221
P+LEG FR+ LQGNE NE+EICLESGD V+ +G +LV+ AG +PF+VI++AVKAVE+
Sbjct: 121 PILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAVKAVEQ 180
Query: 222 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 281
++QTF+HRE+KK+P L+WFGWCTWDAFYT+VTA+ V +GL+SL AGG PKF+IIDDGW
Sbjct: 181 HLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGW 240
Query: 282 QQIE-NKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS----GLKHVVDESK 336
Q + ++ E N A FA+RLT IKEN KFQK + +V L HV+ + K
Sbjct: 241 QSVGMDETSVEFNA--DNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIK 298
Query: 337 QNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHG 396
N+++KYVYVWHA+ GYWGGVKP GMEHY++ +AYPV+SPGVM ++ ++S+ +G
Sbjct: 299 SNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNG 358
Query: 397 LGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEAS 456
LGLV+P+KVF+FYN+LH+YLAS GVDGVKVDVQNI+ETLGAGHGGRV L + YHQALEAS
Sbjct: 359 LGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEAS 418
Query: 457 IARNFPDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGE 516
I+RNFPDNG ISCM HNTDG+YS+K+TAVIRASDD++PRDPASHTIHI+SVAYNTLFLGE
Sbjct: 419 ISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGE 478
Query: 517 FMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLP 576
FMQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVL DGS+LRA+LP
Sbjct: 479 FMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLP 538
Query: 577 GRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLT 636
GRPT DC F+DP RD SLLK+WN+N+ +GV+GVFNCQGAGWCK K+ IHD+ PGT++
Sbjct: 539 GRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTIS 598
Query: 637 ASVRVTDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLK 696
VR DV + ++A W GD+IVY+H GE+V LPK S+PVTL EYE+F P+K
Sbjct: 599 GCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVK 658
Query: 697 EISSNISFAAIGLLDMFNSGGAV 719
E S FA +GL++MFNSGGA+
Sbjct: 659 EFSDGSKFAPVGLMEMFNSGGAI 681
Score = 145 (56.1 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
Identities = 29/68 (42%), Positives = 42/68 (61%)
Query: 748 DNRSPTATISLKVRGCGRFGIYSS-QRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEM 806
D+ + +K+RG G G+YSS +RP TV S ++ Y+ +GL+T TL VPE+E+
Sbjct: 687 DDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKEL 746
Query: 807 YRWPVEIQ 814
Y W V IQ
Sbjct: 747 YLWDVVIQ 754
>UNIPROTKB|Q5VQG4
symbol:RFS "Galactinol--sucrose galactosyltransferase"
species:39947 "Oryza sativa Japonica Group" [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
Uniprot:Q5VQG4
Length = 783
Score = 1306 (464.8 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
Identities = 289/724 (39%), Positives = 407/724 (56%)
Query: 46 PNISISDGNLVVHGKTILTGVPDNIILTPGNGV-------GLVAGAFIGATASHSKSLHV 98
P ++ +L V G L VP NI LTP + + AG+F+G A +K HV
Sbjct: 26 PRFTLKGKDLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHV 85
Query: 99 FPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYT 158
P+G L D RFM FRFK+WW T +GT G+DV ETQ M+++ S GP Y
Sbjct: 86 VPIGKLRDTRFMSIFRFKVWWTTHWVGTNGRDVENETQMMILDQSGTKSSPT--GPRPYV 143
Query: 159 VFLPLLEGQFRSALQ-GNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVK 217
+ LP++EG FR+ L+ G + + + LESG + V + VY HAG +PF+++ A++
Sbjct: 144 LLLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMR 203
Query: 218 AVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLII 277
V ++ TF E+K P +D FGWCTWDAFY V EGV EG++ L+ GG PP ++I
Sbjct: 204 VVRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLI 263
Query: 278 DDGWQQIENKPKE-----ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVV 332
DDGWQ I + + E G Q RL +EN KF+ E G+ V
Sbjct: 264 DDGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFR------EYKGGMGGFV 317
Query: 333 DESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDS 391
E K V+ VYVWHAL GYWGG++P A G+ + P SPG+ D+ +D
Sbjct: 318 REMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDK 375
Query: 392 LAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQ 451
+ +G+GLV P++ Y LH++L + G+DGVKVDV +++E + +GGRV L ++Y
Sbjct: 376 IVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFA 435
Query: 452 ALEASIARNFPDNGCISCMCHNTDG-IYSSKQTAVIRASDDYYPRDPASHT--------I 502
L S+ R+F NG I+ M H D + ++ A+ R DD++ DP+
Sbjct: 436 GLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGC 495
Query: 503 HISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRK 562
H+ AYN+L++G F+ PDWDMF S HP A +H A+RAV G +YVSD G H+FDLLR+
Sbjct: 496 HMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRR 555
Query: 563 LVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKIT 622
L LPDG++LR + PTRDCLFADP DG ++LK+WNVNK SGV+G FNCQG GW +
Sbjct: 556 LALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREA 615
Query: 623 KKTRIHDESPGTLTASVRVTDVENMAQIAGAGWNGDAI-VYAHRSGEVVRLPKGASVPVT 681
++ +TA DVE + G GD VY + ++ L + SV +T
Sbjct: 616 RRNMCAAGFSVPVTARASPADVE----WSHGGGGGDRFAVYFVEARKLQLLRRDESVELT 671
Query: 682 LKVLEYELFHFCPLKEISS---NISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEV 738
L+ YEL P++ I S I FA IGL +M N+GGAV+ E + +K DG+V
Sbjct: 672 LEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFE---AARK----DGDV 724
Query: 739 SSEL 742
++E+
Sbjct: 725 AAEV 728
Score = 72 (30.4 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 760 VRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLP 800
V+G G YSS RP C V +F Y+ G++T+ +P
Sbjct: 730 VKGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTVDVP 768
>TAIR|locus:2170528
symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0005986 "sucrose biosynthetic process" evidence=IMP]
[GO:0010325 "raffinose family oligosaccharide biosynthetic process"
evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0009414 "response to water deprivation" evidence=IEP]
[GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
Uniprot:Q9FND9
Length = 783
Score = 1340 (476.8 bits), Expect = 7.4e-137, P = 7.4e-137
Identities = 289/782 (36%), Positives = 443/782 (56%)
Query: 50 ISDGNLVVHGKTILTGVPDNIILTPG------NGVGL--VAGAFIGATAS-HSKSLHVFP 100
+ D L+ +G+ +LT VP N+ LT +GV L AG+FIG KS HV
Sbjct: 24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83
Query: 101 MGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGP-TIYTV 159
+G L+++RFM FRFK+WW T +G+ G+D+ ETQ ++++ + S+S G Y +
Sbjct: 84 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRPYVL 142
Query: 160 FLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV 219
LPLLEG FRS+ Q E++++ +C+ESG V ++ +VY HAG +PF+++ A+K +
Sbjct: 143 LLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMKVI 202
Query: 220 EKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDD 279
+M TF E+K P +D FGWCTWDAFY V +GV +G+K L GG PP ++IDD
Sbjct: 203 RVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDD 262
Query: 280 GWQQIENKPKE---ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS-GLKHVVDES 335
GWQ I + E I G Q RL +EN KF+ +Q G+K V +
Sbjct: 263 GWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAFVRDL 322
Query: 336 KQNHN-VKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAV 394
K + V Y+YVWHAL GYWGG++P A + + + P SPG+ D+ +D +
Sbjct: 323 KDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIE 380
Query: 395 HGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALE 454
G+G P FY LH++L + G+DGVKVDV +I+E L +GGRV L ++Y +AL
Sbjct: 381 TGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALT 440
Query: 455 ASIARNFPDNGCISCMCHNTDGIYSSKQTAVI-RASDDYYPRDPASHT--------IHIS 505
+S+ ++F NG I+ M H D ++ + + R DD++ DP+ H+
Sbjct: 441 SSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMV 500
Query: 506 SVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVL 565
AYN+L++G F+QPDWDMF S HP AE+H A+RA+ G IY+SD G H+FDLL++LVL
Sbjct: 501 HCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVL 560
Query: 566 PDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKT 625
P+GS+LR + PTRD LF DP DG ++LK+WN+NK +GV+G FNCQG GWC+ T++
Sbjct: 561 PNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRN 620
Query: 626 RIHDESPGTLTASVRVTDVE---NMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTL 682
+ E TLTA+ DVE + I+ A A+ + +S +++ + +TL
Sbjct: 621 QCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL-FLSQSKKLLLSGLNDDLELTL 679
Query: 683 KVLEYELFHFCPLKEISSN-ISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSE 741
+ ++EL P+ I N + FA IGL++M N+ GA+ ++ +++ E
Sbjct: 680 EPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL----------VYNDE---- 725
Query: 742 LTTSLSDNRSPTATISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPV 801
++ + V G G F +Y+S++P+ C + +F Y+ + ++ +
Sbjct: 726 -------------SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSG 772
Query: 802 PE 803
P+
Sbjct: 773 PD 774
>TAIR|locus:2141425
symbol:STS "AT4G01970" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0047268 "galactinol-raffinose galactosyltransferase activity"
evidence=ISS] [GO:0006979 "response to oxidative stress"
evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
Uniprot:Q9SYJ4
Length = 876
Score = 734 (263.4 bits), Expect = 3.5e-125, Sum P(3) = 3.5e-125
Identities = 152/398 (38%), Positives = 237/398 (59%)
Query: 344 VYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPK 403
+YVWHAL G W GV+P + M +A SP + D+ +D + G+GLVHP
Sbjct: 416 IYVWHALCGAWNGVRP--ETMMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473
Query: 404 KVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPD 463
K FY+ +H+YLAS GV G K+DV +E+L HGGRV L ++Y+ L S+ +NF
Sbjct: 474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533
Query: 464 NGCISCMCHNTDGIY-SSKQTAVIRASDDYYPRDPASHT--------IHISSVAYNTLFL 514
I+ M + + ++KQ ++ R DD++ +DP +H+ +YN++++
Sbjct: 534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593
Query: 515 GEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPG--NHNFDLLRKLVLPDGSVLR 572
G+ +QPDWDMF S H AEYH A+RA+ G +Y+SD G +HNFDL++KL DG++ R
Sbjct: 594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653
Query: 573 AQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESP 632
PTRD LF +P D S+LK++N NK GV+G FNCQGAGW + + + E
Sbjct: 654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713
Query: 633 GTLTASVRVTDVE--NMAQIAGAG--WNGDAIVYAHRSGEVVRL-PKGASVPVTLKVLEY 687
T++ +V V+D+E + AG+ + GD +VY +S E++ + K ++ +TL+ +
Sbjct: 714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773
Query: 688 ELFHFCPLKE-ISSNISFAAIGLLDMFNSGGAVENVEV 724
+L F P+ E +SS + FA +GL++MFN G V++++V
Sbjct: 774 DLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQDMKV 811
Score = 439 (159.6 bits), Expect = 3.5e-125, Sum P(3) = 3.5e-125
Identities = 110/320 (34%), Positives = 155/320 (48%)
Query: 33 KPLVLRRSKMTVAPN-ISISDGNLVVHGKT-ILTGVPDNIILTP--GNGVGLVA------ 82
KPL + +K + PN ++S+G+L T IL VP N+ TP + + A
Sbjct: 18 KPLFVPITKPILQPNSFNLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILL 77
Query: 83 --------GAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLE 134
G F+G T +G ED F+ FRFK+WW T +G G D+ E
Sbjct: 78 RVQANAHKGGFLGFTKESPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAE 137
Query: 135 TQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETN 194
TQ+++++ E D Y +P +EG FR++L E + IC ESG V+ +
Sbjct: 138 TQWVMLKIP---EIDS------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKES 188
Query: 195 QGLYLVYTHAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVT 254
+ Y H NP+ ++ +A A+ +M TF E+KKLP +D FGWCTWDA Y V
Sbjct: 189 SFKSIAYIHICDNPYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVD 248
Query: 255 AEGVDEGLKSLSAGGTPPKFLIIDDGWQQI----ENKPKEESNCIVQEGAQFASRLTGIK 310
+ G+K GG PKF+IIDDGWQ I + K+ N +V G Q +RLT K
Sbjct: 249 PATIWTGVKEFEDGGVCPKFVIIDDGWQSINFDGDELDKDAEN-LVLGGEQMTARLTSFK 307
Query: 311 ENSKFQKKCQNSEQVSGLKH 330
E KF+ S S H
Sbjct: 308 ECKKFRNYKGGSFITSDASH 327
Score = 92 (37.4 bits), Expect = 3.5e-125, Sum P(3) = 3.5e-125
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 755 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 804
+I + V+G GRF YSS P+KC + + +F ++ TG ++ +P EE
Sbjct: 816 SIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEE 865
>UNIPROTKB|Q93XK2
symbol:STS1 "Stachyose synthase" species:3888 "Pisum
sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
"oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
"galactinol-raffinose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
Uniprot:Q93XK2
Length = 853
Score = 783 (280.7 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
Identities = 180/488 (36%), Positives = 273/488 (55%)
Query: 250 YTDVTAEGVD-EGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGAQFASRLTG 308
+TD+ +G++ E L+ K +IE+K K+ +V+E L G
Sbjct: 319 FTDLILKGIEHEKLRKKREEAISSK----SSDLAEIESKIKK----VVKE----IDDLFG 366
Query: 309 IKENSKFQKKCQNSEQVSGLKHVVDESKQNHN-VKYVYVWHALAGYWGGVKPAADGMEHY 367
++ S +K SE GLK + + + VYVWHAL G WGGV+P H
Sbjct: 367 GEQFSSGEKSEMKSEY--GLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HL 421
Query: 368 DTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVD 427
DT + SPG+ G D+ + ++ LGLVHP + Y+ +H+YLA G+ GVKVD
Sbjct: 422 DTKIVPCKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVD 481
Query: 428 VQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIY-SSKQTAVI 486
V + +E + +GGRV L + Y++ L SI +NF NG I+ M H D + +KQ ++
Sbjct: 482 VIHSLEYVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMG 541
Query: 487 RASDDYYPRDPASHT--------IHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAA 538
R DD++ +DP +H+ +YN+L++G+ +QPDWDMF S H A++H +
Sbjct: 542 RVGDDFWFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGS 601
Query: 539 RAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKV 598
RA+ G IYVSD G+H+FDL++KLV PDG++ + PTRDCLF +P D T++LK+
Sbjct: 602 RAICGGPIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKI 661
Query: 599 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVE--NMAQIAGAGWN 656
WN NK GV+G FNCQGAGW I +K R E + +V VT+VE + + G
Sbjct: 662 WNFNKYGGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKA 721
Query: 657 GDAIVYAHRSGEVVRLP-KGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNS 715
+ +VY +++ E+ + K + T++ +EL+ F P+ ++ I FA IGL +MFNS
Sbjct: 722 EEYVVYLNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNS 781
Query: 716 GGAVENVE 723
GG V ++E
Sbjct: 782 GGTVIDLE 789
Score = 406 (148.0 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
Identities = 85/238 (35%), Positives = 126/238 (52%)
Query: 83 GAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVES 142
G F G + + +G F+ FRFK WW TQ +G G D+ +ETQ++L+E
Sbjct: 72 GGFFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131
Query: 143 KDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYT 202
+ Y V +P++E FRSAL N+ ++I ESG V+ + + Y
Sbjct: 132 PETKS---------YVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIAYV 182
Query: 203 HAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGL 262
H NP++++ +A A+ ++ +F E+K +P+ +D FGWCTWDAFY V G+ GL
Sbjct: 183 HFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGL 242
Query: 263 KSLSAGGTPPKFLIIDDGWQQIE---NKPKEESNCIVQEGAQFASRLTGIKENSKFQK 317
S GG P+F+IIDDGWQ I P E++ +V G Q + RL E KF+K
Sbjct: 243 DDFSKGGVEPRFVIIDDGWQSISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300
Score = 66 (28.3 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
Identities = 16/47 (34%), Positives = 25/47 (53%)
Query: 758 LKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 804
+KV+G G F YSS+ P K + + DF + G + + +P EE
Sbjct: 797 IKVKGGGSFLAYSSESPKKFQLNGCEVDFEW-LGDGKLCVNVPWIEE 842
>ASPGD|ASPL0000010056
symbol:aglF species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
Length = 863
Score = 340 (124.7 bits), Expect = 1.0e-36, Sum P(3) = 1.0e-36
Identities = 94/305 (30%), Positives = 152/305 (49%)
Query: 326 SGLKHVVDESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQ 384
+GL V ++ H N++Y+ VWHAL GYWGG+ P Y T
Sbjct: 383 NGLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTR-------------- 428
Query: 385 PDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVS 444
++ ++S + + P + FYN+ +A+L+ G+ GVK D Q+ ++ L A R S
Sbjct: 429 -EVALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRS 486
Query: 445 LTRSYHQALEASIARNFPDNG--CISCMCHNT--DGIYSSKQTAVIRASDDYYPRDPASH 500
+Y A S R+F C+S + + ++K T V+R S+D++P SH
Sbjct: 487 YANAYQDAWTISSLRHFGPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546
Query: 501 TIHISSVAYNTLFLGEFMQ--PDWDMFHSLHPA----AEYHGAARAVGGCAIYVSDKPGN 554
T H+ A+N L L ++ PDWDMF +L A +H AAR + G IY++DKPG
Sbjct: 547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605
Query: 555 HNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSL-LKVWN--VNKCSGV 607
H+ L++++ G+ LR + R T D ++ D ++G L + ++ SG+
Sbjct: 606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662
Query: 608 VGVFN 612
+GVFN
Sbjct: 663 IGVFN 667
Score = 123 (48.4 bits), Expect = 1.0e-36, Sum P(3) = 1.0e-36
Identities = 57/221 (25%), Positives = 93/221 (42%)
Query: 74 PGNGVGLVAGAFIGATASHSKSLHVFPMGVLEDL-RFMCCFRFKLWWMTQRMGTCGKDVP 132
PG + ++G A HS L + P+G + RF R + W+ R G KD
Sbjct: 158 PGAALWNISGPVEEARDGHSGLLRL-PLGTPSSMSRFFALARVETSWLGPRQG---KDKL 213
Query: 133 LETQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVE 192
T+ ++ S + DG ++ V L + + L E+ I ++ DNA
Sbjct: 214 NFTEDAILLSFLRT-----DG--VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQN-DNATP 265
Query: 193 TNQGLYLVYTHAGPNPFEVISQAV-----KAVEKYMQTFTHREKKK-LPSFLDWFGWCTW 246
+ + L T A FEV + A+ + V Y T + + L + D +CTW
Sbjct: 266 SRFQV-LAATAAD---FEVATSALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTW 321
Query: 247 DAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENK 287
+ D++ E + L L G + LIIDD WQ ++N+
Sbjct: 322 NGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNE 362
Score = 61 (26.5 bits), Expect = 1.0e-36, Sum P(3) = 1.0e-36
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 660 IVYAHRSGEVV-RLPKGASVPVTLKVLEYELFHFCPLKEIS 699
IV AHR+G +V L ++V VTL +E+ P+K ++
Sbjct: 695 IVRAHRTGRIVGELHSSSAVSVTLNERRWEVLTAYPVKTLT 735
Score = 41 (19.5 bits), Expect = 1.2e-34, Sum P(3) = 1.2e-34
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 708 GLLDMFNSGGAVENVEVHMSEKKPDLFDGE 737
G++ +FN VE+V + +++ P ++D +
Sbjct: 661 GIIGVFNVSNRVESVIIPVADF-PGIYDDQ 689
>UNIPROTKB|G4NBB7
symbol:MGG_11554 "Seed imbibition protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0052051 "interaction with host via protein
secreted by type II secretion system" evidence=IDA]
InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
Uniprot:G4NBB7
Length = 908
Score = 337 (123.7 bits), Expect = 3.0e-31, Sum P(3) = 3.0e-31
Identities = 103/331 (31%), Positives = 156/331 (47%)
Query: 314 KFQKKCQNSEQVSGLKHVVDE-SKQNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALA 372
+F+ Q Q GLK +V E KQN ++ + VWH + GYWGG+ P+ Y
Sbjct: 393 RFEANQQGFPQ--GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKI 450
Query: 373 YPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNII 432
V QP D V G + V Y++ +A+LA CGV KVD Q +
Sbjct: 451 QLRDEAEV---QPKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFL 500
Query: 433 ETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS--KQ------TA 484
+ A R +L R Y A A+ +++F I+CM I S +Q
Sbjct: 501 D-YPAHANDRKNLIRPYQDAWTAAASKHFGGRA-IACMAQTPQSILHSLLQQGRSEGPML 558
Query: 485 VIRASDDYYPRDPASHTIHISSVAYNTLFLGEF-MQPDWDMFHSLHPA-AEYHGAARAVG 542
+ R SDD++P + SHT H+ A+N L + + DWDMF + P A H AR++
Sbjct: 559 MARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMS 618
Query: 543 GCAIYVSDKPGNHNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSLLKV 598
G IY++D PG H+ +L++++ DG LRA PGR L+ LL+V
Sbjct: 619 GGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRV 674
Query: 599 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 629
+ ++ G++GVFN G + ++ R+ D
Sbjct: 675 RSGHQGVGMLGVFNVCNRG-SLLGEQVRLDD 704
Score = 90 (36.7 bits), Expect = 3.0e-31, Sum P(3) = 3.0e-31
Identities = 18/62 (29%), Positives = 32/62 (51%)
Query: 231 KKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKE 290
+ ++ + D F +CTW++ D++ + + L LS G LIIDD WQ ++ +
Sbjct: 326 RAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSD 385
Query: 291 ES 292
S
Sbjct: 386 AS 387
Score = 46 (21.3 bits), Expect = 3.0e-31, Sum P(3) = 3.0e-31
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 707 IGLLDMFN--SGGAVENVEVHMSEKKPDLFDGEVSSE 741
+G+L +FN + G++ +V + D+FDGE + E
Sbjct: 681 VGMLGVFNVCNRGSLLGEQVRLD----DIFDGEKAGE 713
>UNIPROTKB|Q97U94
symbol:galS "Alpha-galactosidase" species:273057
"Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
[GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
"glycosylceramide catabolic process" evidence=ISS]
InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
Length = 648
Score = 238 (88.8 bits), Expect = 4.6e-31, Sum P(3) = 4.6e-31
Identities = 67/192 (34%), Positives = 96/192 (50%)
Query: 422 DGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSSK 481
D VKVD Q +I + ++ +R+ AL+ S+ ++ I+CM N + +
Sbjct: 362 DLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGKDV-----INCMSMNPENYCNYF 415
Query: 482 QTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAV 541
+ V+R S DY P +HI AYN+L + PD+DMF S P A+ H AR
Sbjct: 416 YSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVF 475
Query: 542 GGCAIYVSDK-PGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN 600
G IY++D+ P N +LLR VLP+G V+R P T D LF DP R+ LLK+
Sbjct: 476 SGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRERV-LLKLKG 534
Query: 601 VNKCSGVVGVFN 612
K + FN
Sbjct: 535 KVKGYNAIAFFN 546
Score = 156 (60.0 bits), Expect = 4.6e-31, Sum P(3) = 4.6e-31
Identities = 42/139 (30%), Positives = 64/139 (46%)
Query: 157 YTVFLPLLEGQFRSALQGNENNEI-------EICLESGDNAVETNQGLYLVYTHAGPNPF 209
YTVF + G A NN + + L +G N E + Y + NP+
Sbjct: 133 YTVFALVKSGNSYEAFFTLSNNYVTAYLFGDSVRLYTGFNTDEIKRS-YFLSIGTSDNPY 191
Query: 210 EVISQAVKAVEKYMQTFTHREKKKLPS-FLDWFGWCTWDAFYT-DVTAEGVDEGLKSLSA 267
+ I A+ K TF R++K P ++ GWC+W+AF T D+ E + + +K +
Sbjct: 192 KAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIE 251
Query: 268 GGTPPKFLIIDDGWQQIEN 286
G ++IIDDGWQ N
Sbjct: 252 RGLRLNWVIIDDGWQDQNN 270
Score = 77 (32.2 bits), Expect = 4.6e-31, Sum P(3) = 4.6e-31
Identities = 21/63 (33%), Positives = 32/63 (50%)
Query: 295 IVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYW 354
I+ +G Q + I+ + KK N G K+ V K + VKYV +WHA+ +W
Sbjct: 260 IIDDGWQDQNNDRAIRSLNPDNKKFPN-----GFKNTVRAIK-SLGVKYVGLWHAINAHW 313
Query: 355 GGV 357
GG+
Sbjct: 314 GGM 316
>UNIPROTKB|Q8A170
symbol:BT_3797 "Possible alpha-galactosidase"
species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
"alpha-galactosidase activity" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
Uniprot:Q8A170
Length = 693
Score = 196 (74.1 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
Identities = 54/191 (28%), Positives = 84/191 (43%)
Query: 403 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 462
+K+ +Y + G D +K+D Q+ L G + + + ALE R
Sbjct: 348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHRM-- 405
Query: 463 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 522
G ++CM N I + ++V RAS DY D H+ NTL LG+ + PD
Sbjct: 406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465
Query: 523 DMFHSLHPAA-EYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 581
DMFHS ++A+ G +Y+SD P D +R L+ G + R P PT
Sbjct: 466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525
Query: 582 DCLFADPARDG 592
+ + +P + G
Sbjct: 526 ESILTNPLQSG 536
Score = 130 (50.8 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
Identities = 34/151 (22%), Positives = 79/151 (52%)
Query: 170 SALQGNENNEIEICLES-GDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV--EKYMQTF 226
S Q N++ + + + + G++A+ T + L++ + + + V S A ++ +K +
Sbjct: 158 SWFQVNQDGTLTLYVSTLGEDAL-TGRLPLLIFRKSS-SVYHVFSDAYDSLIADKAVSAL 215
Query: 227 THREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIEN 286
R K+ + D+ GWCTW+ ++ D+ + + ++ A G P ++++IDDG I N
Sbjct: 216 RKRADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG--HIAN 273
Query: 287 KPKEESNCIVQEGAQFASRLTGIKENSKFQK 317
K ++ ++ +V + +F + + I + + K
Sbjct: 274 KNRQLTS-LVPDKKRFPNGWSRIMKRKQADK 303
Score = 66 (28.3 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 336 KQNHNVKYVYVWHALAGYWGGVKPAAD 362
KQ ++++ +W++L+GYW G+ D
Sbjct: 299 KQADKIRWIGLWYSLSGYWMGISAEND 325
Score = 50 (22.7 bits), Expect = 2.6e-23, Sum P(4) = 2.6e-23
Identities = 19/80 (23%), Positives = 37/80 (46%)
Query: 689 LFHFCPLKEISSNISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSD 748
LFH CP+++ +A IG+ + + S V+ ++ +EK + D + L +D
Sbjct: 621 LFHLCPIRK-----GWAVIGIQEKYLSPATVQILK-RTTEKL--ILDVHCTGTLRI-WAD 671
Query: 749 NRSPTATISLKVRGCGRFGI 768
+ S+ ++ GR I
Sbjct: 672 SHGKQELRSIPIKKAGRIEI 691
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.135 0.419 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 815 815 0.00099 121 3 11 22 0.40 34
37 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 10
No. of states in DFA: 625 (66 KB)
Total size of DFA: 445 KB (2211 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 67.81u 0.08s 67.89t Elapsed: 00:00:03
Total cpu time: 67.81u 0.08s 67.89t Elapsed: 00:00:03
Start: Tue May 21 14:41:12 2013 End: Tue May 21 14:41:15 2013