Your job contains 1 sequence.
>004090
MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM
GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL
PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK
YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW
QQIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNV
KYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVH
PKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNF
PDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPD
WDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR
DCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRV
TDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSN
ISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVR
GCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 004090
(774 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702... 3379 0. 1
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702... 2376 1.6e-259 2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact... 1306 1.2e-138 2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702... 1340 7.4e-137 1
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci... 783 4.0e-124 3
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ... 734 1.0e-123 3
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric... 340 7.2e-37 3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot... 337 2.1e-31 3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec... 238 3.6e-31 3
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto... 196 1.9e-23 4
>TAIR|locus:2103488 [details] [associations]
symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
"raffinose catabolic process" evidence=IDA] [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
[GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
[GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
Length = 773
Score = 3379 (1194.5 bits), Expect = 0., P = 0.
Identities = 618/775 (79%), Positives = 690/775 (89%)
Query: 1 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 60
MT+ NIS+ + NLVV GKTILT +PDNIILTP G G V+G+FIGAT SKSLHVFP+
Sbjct: 1 MTITSNISVQNDNLVVQGKTILTKIPDNIILTPVTGNGFVSGSFIGATFEQSKSLHVFPI 60
Query: 61 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 120
GVLE LRFMCCFRFKLWWMTQRMG+CGKD+PLETQFML+ESKD E + DD PT+YTVFL
Sbjct: 61 GVLEGLRFMCCFRFKLWWMTQRMGSCGKDIPLETQFMLLESKDEVEGNGDDAPTVYTVFL 120
Query: 121 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 180
PLLEGQFR+ LQGNE NEIEIC ESGD AVET+QG +LVY HAG NPFEVI Q+VKAVE+
Sbjct: 121 PLLEGQFRAVLQGNEKNEIEICFESGDKAVETSQGTHLVYVHAGTNPFEVIRQSVKAVER 180
Query: 181 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 240
+MQTF HREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLS GGTPPKFLIIDDGW
Sbjct: 181 HMQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGW 240
Query: 241 QQIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNV 300
QQIENK K+E NC+VQEGAQFA+RL GIKEN+KFQK Q QVSGLK VVD +KQ HNV
Sbjct: 241 QQIENKEKDE-NCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAKQRHNV 299
Query: 301 KYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVH 360
K VY WHALAGYWGGVKPAA GMEHYD+ALAYPV SPGV+GNQPDIVMDSLAVHGLGLV+
Sbjct: 300 KQVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVN 359
Query: 361 PKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNF 420
PKKVFNFYNELH+YLASCG+DGVKVDVQNIIETLGAG GGRVSLTRSY QALEASIARNF
Sbjct: 360 PKKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNF 419
Query: 421 PDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPD 480
DNGCISCMCHNTDG+YS+KQTA++RASDD+YPRDPASHTIHI+SVAYN+LFLGEFMQPD
Sbjct: 420 TDNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPD 479
Query: 481 WDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 540
WDMFHSLHP AEYH AARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA+LPGRPTR
Sbjct: 480 WDMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTR 539
Query: 541 DCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRV 600
DCLFADPARDG SLLK+WN+NK +G+VGVFNCQGAGWCK TKK +IHD SPGTLT S+R
Sbjct: 540 DCLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRA 599
Query: 601 TDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSN 660
D + ++Q+AG W+GD+IVYA+RSGEVVRLPKGAS+P+TLKVLEYELFH PLKEI+ N
Sbjct: 600 DDADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITEN 659
Query: 661 ISFAAIGLLDMFNSGGAVENVEV-HMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKV 719
ISFA IGL+DMFNS GA+E++++ H+++K P+ FDGE+SS + +LSDNRSPTA +S+ V
Sbjct: 660 ISFAPIGLVDMFNSSGAIESIDINHVTDKNPEFFDGEISSA-SPALSDNRSPTALVSVSV 718
Query: 720 RGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV 774
RGCGRFG YSSQRPLKC V S +TDFTYD+ GL+T+ LPV EEM+RW VEI V
Sbjct: 719 RGCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHVEILV 773
>TAIR|locus:2020452 [details] [associations]
symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
Uniprot:Q84VX0
Length = 754
Score = 2376 (841.5 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
Identities = 433/683 (63%), Positives = 539/683 (78%)
Query: 1 MTVAPNISISDGNLVVHGKTILTGVPDNIILTPGNGVGLVAGAFIGATASHSKSLHVFPM 60
MTV IS++D +LVV G +L GVP+N+++TP +G L+ GAFIG T+ + S VF +
Sbjct: 1 MTVGAGISVTDSDLVVLGHRVLHGVPENVLVTPASGNALIDGAFIGVTSDQTGSHRVFSL 60
Query: 61 GVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYTVFL 120
G LEDLRFMC FRFKLWWMTQRMGT GK++P ETQF++VE+ S+ D + Y VFL
Sbjct: 61 GKLEDLRFMCVFRFKLWWMTQRMGTNGKEIPCETQFLIVEANQGSDLGGRDQSSSYVVFL 120
Query: 121 PLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAVEK 180
P+LEG FR+ LQGNE NE+EICLESGD V+ +G +LV+ AG +PF+VI++AVKAVE+
Sbjct: 121 PILEGDFRAVLQGNEANELEICLESGDPTVDQFEGSHLVFVAAGSDPFDVITKAVKAVEQ 180
Query: 181 YMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGW 240
++QTF+HRE+KK+P L+WFGWCTWDAFYT+VTA+ V +GL+SL AGG PKF+IIDDGW
Sbjct: 181 HLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGW 240
Query: 241 QQIE-NKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS----GLKHVVDESK 295
Q + ++ E N A FA+RLT IKEN KFQK + +V L HV+ + K
Sbjct: 241 QSVGMDETSVEFNA--DNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIK 298
Query: 296 QNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHG 355
N+++KYVYVWHA+ GYWGGVKP GMEHY++ +AYPV+SPGVM ++ ++S+ +G
Sbjct: 299 SNNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNG 358
Query: 356 LGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEAS 415
LGLV+P+KVF+FYN+LH+YLAS GVDGVKVDVQNI+ETLGAGHGGRV L + YHQALEAS
Sbjct: 359 LGLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEAS 418
Query: 416 IARNFPDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGE 475
I+RNFPDNG ISCM HNTDG+YS+K+TAVIRASDD++PRDPASHTIHI+SVAYNTLFLGE
Sbjct: 419 ISRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGE 478
Query: 476 FMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLP 535
FMQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVL DGS+LRA+LP
Sbjct: 479 FMQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLP 538
Query: 536 GRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLT 595
GRPT DC F+DP RD SLLK+WN+N+ +GV+GVFNCQGAGWCK K+ IHD+ PGT++
Sbjct: 539 GRPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTIS 598
Query: 596 ASVRVTDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLK 655
VR DV + ++A W GD+IVY+H GE+V LPK S+PVTL EYE+F P+K
Sbjct: 599 GCVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVK 658
Query: 656 EISSNISFAAIGLLDMFNSGGAV 678
E S FA +GL++MFNSGGA+
Sbjct: 659 EFSDGSKFAPVGLMEMFNSGGAI 681
Score = 145 (56.1 bits), Expect = 1.6e-259, Sum P(2) = 1.6e-259
Identities = 29/68 (42%), Positives = 42/68 (61%)
Query: 707 DNRSPTATISLKVRGCGRFGIYSS-QRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEM 765
D+ + +K+RG G G+YSS +RP TV S ++ Y+ +GL+T TL VPE+E+
Sbjct: 687 DDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKEL 746
Query: 766 YRWPVEIQ 773
Y W V IQ
Sbjct: 747 YLWDVVIQ 754
>UNIPROTKB|Q5VQG4 [details] [associations]
symbol:RFS "Galactinol--sucrose galactosyltransferase"
species:39947 "Oryza sativa Japonica Group" [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
Uniprot:Q5VQG4
Length = 783
Score = 1306 (464.8 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
Identities = 289/724 (39%), Positives = 407/724 (56%)
Query: 5 PNISISDGNLVVHGKTILTGVPDNIILTPGNGV-------GLVAGAFIGATASHSKSLHV 57
P ++ +L V G L VP NI LTP + + AG+F+G A +K HV
Sbjct: 26 PRFTLKGKDLAVDGHPFLLDVPANIRLTPASTLVPNSDVPAAAAGSFLGFDAPAAKDRHV 85
Query: 58 FPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGPTIYT 117
P+G L D RFM FRFK+WW T +GT G+DV ETQ M+++ S GP Y
Sbjct: 86 VPIGKLRDTRFMSIFRFKVWWTTHWVGTNGRDVENETQMMILDQSGTKSSPT--GPRPYV 143
Query: 118 VFLPLLEGQFRSALQ-GNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVK 176
+ LP++EG FR+ L+ G + + + LESG + V + VY HAG +PF+++ A++
Sbjct: 144 LLLPIVEGPFRACLESGKAEDYVHMVLESGSSTVRGSVFRSAVYLHAGDDPFDLVKDAMR 203
Query: 177 AVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLII 236
V ++ TF E+K P +D FGWCTWDAFY V EGV EG++ L+ GG PP ++I
Sbjct: 204 VVRAHLGTFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLI 263
Query: 237 DDGWQQIENKPKE-----ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVV 291
DDGWQ I + + E G Q RL +EN KF+ E G+ V
Sbjct: 264 DDGWQSICHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFR------EYKGGMGGFV 317
Query: 292 DESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDS 350
E K V+ VYVWHAL GYWGG++P A G+ + P SPG+ D+ +D
Sbjct: 318 REMKAAFPTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDK 375
Query: 351 LAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQ 410
+ +G+GLV P++ Y LH++L + G+DGVKVDV +++E + +GGRV L ++Y
Sbjct: 376 IVNNGVGLVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFA 435
Query: 411 ALEASIARNFPDNGCISCMCHNTDG-IYSSKQTAVIRASDDYYPRDPASHT--------I 461
L S+ R+F NG I+ M H D + ++ A+ R DD++ DP+
Sbjct: 436 GLTESVRRHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGC 495
Query: 462 HISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRK 521
H+ AYN+L++G F+ PDWDMF S HP A +H A+RAV G +YVSD G H+FDLLR+
Sbjct: 496 HMVHCAYNSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRR 555
Query: 522 LVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKIT 581
L LPDG++LR + PTRDCLFADP DG ++LK+WNVNK SGV+G FNCQG GW +
Sbjct: 556 LALPDGTILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREA 615
Query: 582 KKTRIHDESPGTLTASVRVTDVENMAQIAGAGWNGDAI-VYAHRSGEVVRLPKGASVPVT 640
++ +TA DVE + G GD VY + ++ L + SV +T
Sbjct: 616 RRNMCAAGFSVPVTARASPADVE----WSHGGGGGDRFAVYFVEARKLQLLRRDESVELT 671
Query: 641 LKVLEYELFHFCPLKEISS---NISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEV 697
L+ YEL P++ I S I FA IGL +M N+GGAV+ E + +K DG+V
Sbjct: 672 LEPFTYELLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFE---AARK----DGDV 724
Query: 698 SSEL 701
++E+
Sbjct: 725 AAEV 728
Score = 72 (30.4 bits), Expect = 1.2e-138, Sum P(2) = 1.2e-138
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 719 VRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLP 759
V+G G YSS RP C V +F Y+ G++T+ +P
Sbjct: 730 VKGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTVDVP 768
>TAIR|locus:2170528 [details] [associations]
symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0005986 "sucrose biosynthetic process" evidence=IMP]
[GO:0010325 "raffinose family oligosaccharide biosynthetic process"
evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0009414 "response to water deprivation" evidence=IEP]
[GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
Uniprot:Q9FND9
Length = 783
Score = 1340 (476.8 bits), Expect = 7.4e-137, P = 7.4e-137
Identities = 289/782 (36%), Positives = 443/782 (56%)
Query: 9 ISDGNLVVHGKTILTGVPDNIILTPG------NGVGL--VAGAFIGATAS-HSKSLHVFP 59
+ D L+ +G+ +LT VP N+ LT +GV L AG+FIG KS HV
Sbjct: 24 LEDSTLLANGQVVLTDVPVNVTLTSSPYLVDKDGVPLDVSAGSFIGFNLDGEPKSHHVAS 83
Query: 60 MGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSESDQDDGP-TIYTV 118
+G L+++RFM FRFK+WW T +G+ G+D+ ETQ ++++ + S+S G Y +
Sbjct: 84 IGKLKNIRFMSIFRFKVWWTTHWVGSNGRDIENETQIIILD-QSGSDSGPGSGSGRPYVL 142
Query: 119 FLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV 178
LPLLEG FRS+ Q E++++ +C+ESG V ++ +VY HAG +PF+++ A+K +
Sbjct: 143 LLPLLEGSFRSSFQSGEDDDVAVCVESGSTEVTGSEFRQIVYVHAGDDPFKLVKDAMKVI 202
Query: 179 EKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDD 238
+M TF E+K P +D FGWCTWDAFY V +GV +G+K L GG PP ++IDD
Sbjct: 203 RVHMNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDD 262
Query: 239 GWQQIENKPKE---ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS-GLKHVVDES 294
GWQ I + E I G Q RL +EN KF+ +Q G+K V +
Sbjct: 263 GWQSIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAFVRDL 322
Query: 295 KQNHN-VKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAV 353
K + V Y+YVWHAL GYWGG++P A + + + P SPG+ D+ +D +
Sbjct: 323 KDEFSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIE 380
Query: 354 HGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALE 413
G+G P FY LH++L + G+DGVKVDV +I+E L +GGRV L ++Y +AL
Sbjct: 381 TGIGFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALT 440
Query: 414 ASIARNFPDNGCISCMCHNTDGIYSSKQTAVI-RASDDYYPRDPASHT--------IHIS 464
+S+ ++F NG I+ M H D ++ + + R DD++ DP+ H+
Sbjct: 441 SSVNKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMV 500
Query: 465 SVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVL 524
AYN+L++G F+QPDWDMF S HP AE+H A+RA+ G IY+SD G H+FDLL++LVL
Sbjct: 501 HCAYNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVL 560
Query: 525 PDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKT 584
P+GS+LR + PTRD LF DP DG ++LK+WN+NK +GV+G FNCQG GWC+ T++
Sbjct: 561 PNGSILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRN 620
Query: 585 RIHDESPGTLTASVRVTDVE---NMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTL 641
+ E TLTA+ DVE + I+ A A+ + +S +++ + +TL
Sbjct: 621 QCFSECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL-FLSQSKKLLLSGLNDDLELTL 679
Query: 642 KVLEYELFHFCPLKEISSN-ISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSE 700
+ ++EL P+ I N + FA IGL++M N+ GA+ ++ +++ E
Sbjct: 680 EPFKFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL----------VYNDE---- 725
Query: 701 LTTSLSDNRSPTATISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPV 760
++ + V G G F +Y+S++P+ C + +F Y+ + ++ +
Sbjct: 726 -------------SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSG 772
Query: 761 PE 762
P+
Sbjct: 773 PD 774
>UNIPROTKB|Q93XK2 [details] [associations]
symbol:STS1 "Stachyose synthase" species:3888 "Pisum
sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
"oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
"galactinol-raffinose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
Uniprot:Q93XK2
Length = 853
Score = 783 (280.7 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
Identities = 180/488 (36%), Positives = 273/488 (55%)
Query: 209 YTDVTAEGVD-EGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGAQFASRLTG 267
+TD+ +G++ E L+ K +IE+K K+ +V+E L G
Sbjct: 319 FTDLILKGIEHEKLRKKREEAISSK----SSDLAEIESKIKK----VVKE----IDDLFG 366
Query: 268 IKENSKFQKKCQNSEQVSGLKHVVDESKQNHN-VKYVYVWHALAGYWGGVKPAADGMEHY 326
++ S +K SE GLK + + + VYVWHAL G WGGV+P H
Sbjct: 367 GEQFSSGEKSEMKSEY--GLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HL 421
Query: 327 DTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVD 386
DT + SPG+ G D+ + ++ LGLVHP + Y+ +H+YLA G+ GVKVD
Sbjct: 422 DTKIVPCKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVD 481
Query: 387 VQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIY-SSKQTAVI 445
V + +E + +GGRV L + Y++ L SI +NF NG I+ M H D + +KQ ++
Sbjct: 482 VIHSLEYVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMG 541
Query: 446 RASDDYYPRDPASHT--------IHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAA 497
R DD++ +DP +H+ +YN+L++G+ +QPDWDMF S H A++H +
Sbjct: 542 RVGDDFWFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGS 601
Query: 498 RAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKV 557
RA+ G IYVSD G+H+FDL++KLV PDG++ + PTRDCLF +P D T++LK+
Sbjct: 602 RAICGGPIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKI 661
Query: 558 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVE--NMAQIAGAGWN 615
WN NK GV+G FNCQGAGW I +K R E + +V VT+VE + + G
Sbjct: 662 WNFNKYGGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKA 721
Query: 616 GDAIVYAHRSGEVVRLP-KGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNS 674
+ +VY +++ E+ + K + T++ +EL+ F P+ ++ I FA IGL +MFNS
Sbjct: 722 EEYVVYLNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNS 781
Query: 675 GGAVENVE 682
GG V ++E
Sbjct: 782 GGTVIDLE 789
Score = 406 (148.0 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
Identities = 85/238 (35%), Positives = 126/238 (52%)
Query: 42 GAFIGATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVES 101
G F G + + +G F+ FRFK WW TQ +G G D+ +ETQ++L+E
Sbjct: 72 GGFFGFSHETPSDRLMNSIGSFNGKDFLSIFRFKTWWSTQWIGKSGSDLQMETQWILIEV 131
Query: 102 KDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYT 161
+ Y V +P++E FRSAL N+ ++I ESG V+ + + Y
Sbjct: 132 PETKS---------YVVIIPIIEKCFRSALFPGFNDHVKIIAESGSTKVKESTFNSIAYV 182
Query: 162 HAGPNPFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGL 221
H NP++++ +A A+ ++ +F E+K +P+ +D FGWCTWDAFY V G+ GL
Sbjct: 183 HFSENPYDLMKEAYSAIRVHLNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGL 242
Query: 222 KSLSAGGTPPKFLIIDDGWQQIE---NKPKEESNCIVQEGAQFASRLTGIKENSKFQK 276
S GG P+F+IIDDGWQ I P E++ +V G Q + RL E KF+K
Sbjct: 243 DDFSKGGVEPRFVIIDDGWQSISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300
Score = 66 (28.3 bits), Expect = 4.0e-124, Sum P(3) = 4.0e-124
Identities = 16/47 (34%), Positives = 25/47 (53%)
Query: 717 LKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 763
+KV+G G F YSS+ P K + + DF + G + + +P EE
Sbjct: 797 IKVKGGGSFLAYSSESPKKFQLNGCEVDFEW-LGDGKLCVNVPWIEE 842
>TAIR|locus:2141425 [details] [associations]
symbol:STS "AT4G01970" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0047268 "galactinol-raffinose galactosyltransferase activity"
evidence=ISS] [GO:0006979 "response to oxidative stress"
evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
Uniprot:Q9SYJ4
Length = 876
Score = 734 (263.4 bits), Expect = 1.0e-123, Sum P(3) = 1.0e-123
Identities = 152/398 (38%), Positives = 237/398 (59%)
Query: 303 VYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPK 362
+YVWHAL G W GV+P + M +A SP + D+ +D + G+GLVHP
Sbjct: 416 IYVWHALCGAWNGVRP--ETMMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473
Query: 363 KVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPD 422
K FY+ +H+YLAS GV G K+DV +E+L HGGRV L ++Y+ L S+ +NF
Sbjct: 474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533
Query: 423 NGCISCMCHNTDGIY-SSKQTAVIRASDDYYPRDPASHT--------IHISSVAYNTLFL 473
I+ M + + ++KQ ++ R DD++ +DP +H+ +YN++++
Sbjct: 534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593
Query: 474 GEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPG--NHNFDLLRKLVLPDGSVLR 531
G+ +QPDWDMF S H AEYH A+RA+ G +Y+SD G +HNFDL++KL DG++ R
Sbjct: 594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653
Query: 532 AQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESP 591
PTRD LF +P D S+LK++N NK GV+G FNCQGAGW + + + E
Sbjct: 654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713
Query: 592 GTLTASVRVTDVE--NMAQIAGAG--WNGDAIVYAHRSGEVVRL-PKGASVPVTLKVLEY 646
T++ +V V+D+E + AG+ + GD +VY +S E++ + K ++ +TL+ +
Sbjct: 714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773
Query: 647 ELFHFCPLKE-ISSNISFAAIGLLDMFNSGGAVENVEV 683
+L F P+ E +SS + FA +GL++MFN G V++++V
Sbjct: 774 DLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQDMKV 811
Score = 425 (154.7 bits), Expect = 1.0e-123, Sum P(3) = 1.0e-123
Identities = 106/307 (34%), Positives = 148/307 (48%)
Query: 5 PN-ISISDGNLVVHGKT-ILTGVPDNIILTP--GNGVGLVA--------------GAFIG 46
PN ++S+G+L T IL VP N+ TP + + A G F+G
Sbjct: 31 PNSFNLSEGSLCAKDSTPILFDVPQNVTFTPFSSHSISTDAPLPILLRVQANAHKGGFLG 90
Query: 47 ATASHSKSLHVFPMGVLEDLRFMCCFRFKLWWMTQRMGTCGKDVPLETQFMLVESKDNSE 106
T +G ED F+ FRFK+WW T +G G D+ ETQ+++++ E
Sbjct: 91 FTKESPSDRLTNSLGRFEDREFLSLFRFKMWWSTAWIGKSGSDLQAETQWVMLKIP---E 147
Query: 107 SDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVETNQGLYLVYTHAGPN 166
D Y +P +EG FR++L E + IC ESG V+ + + Y H N
Sbjct: 148 IDS------YVAIIPTIEGAFRASLTPGEKGNVLICAESGSTKVKESSFKSIAYIHICDN 201
Query: 167 PFEVISQAVKAVEKYMQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSA 226
P+ ++ +A A+ +M TF E+KKLP +D FGWCTWDA Y V + G+K
Sbjct: 202 PYNLMKEAFSALRVHMNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFED 261
Query: 227 GGTPPKFLIIDDGWQQI----ENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSE 282
GG PKF+IIDDGWQ I + K+ N +V G Q +RLT KE KF+ S
Sbjct: 262 GGVCPKFVIIDDGWQSINFDGDELDKDAEN-LVLGGEQMTARLTSFKECKKFRNYKGGSF 320
Query: 283 QVSGLKH 289
S H
Sbjct: 321 ITSDASH 327
Score = 92 (37.4 bits), Expect = 1.0e-123, Sum P(3) = 1.0e-123
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 714 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 763
+I + V+G GRF YSS P+KC + + +F ++ TG ++ +P EE
Sbjct: 816 SIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEE 865
>ASPGD|ASPL0000010056 [details] [associations]
symbol:aglF species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
Length = 863
Score = 340 (124.7 bits), Expect = 7.2e-37, Sum P(3) = 7.2e-37
Identities = 94/305 (30%), Positives = 152/305 (49%)
Query: 285 SGLKHVVDESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQ 343
+GL V ++ H N++Y+ VWHAL GYWGG+ P Y T
Sbjct: 383 NGLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTR-------------- 428
Query: 344 PDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVS 403
++ ++S + + P + FYN+ +A+L+ G+ GVK D Q+ ++ L A R S
Sbjct: 429 -EVALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRS 486
Query: 404 LTRSYHQALEASIARNFPDNG--CISCMCHNT--DGIYSSKQTAVIRASDDYYPRDPASH 459
+Y A S R+F C+S + + ++K T V+R S+D++P SH
Sbjct: 487 YANAYQDAWTISSLRHFGPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546
Query: 460 TIHISSVAYNTLFLGEFMQ--PDWDMFHSLHPA----AEYHGAARAVGGCAIYVSDKPGN 513
T H+ A+N L L ++ PDWDMF +L A +H AAR + G IY++DKPG
Sbjct: 547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605
Query: 514 HNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSL-LKVWN--VNKCSGV 566
H+ L++++ G+ LR + R T D ++ D ++G L + ++ SG+
Sbjct: 606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662
Query: 567 VGVFN 571
+GVFN
Sbjct: 663 IGVFN 667
Score = 123 (48.4 bits), Expect = 7.2e-37, Sum P(3) = 7.2e-37
Identities = 57/221 (25%), Positives = 93/221 (42%)
Query: 33 PGNGVGLVAGAFIGATASHSKSLHVFPMGVLEDL-RFMCCFRFKLWWMTQRMGTCGKDVP 91
PG + ++G A HS L + P+G + RF R + W+ R G KD
Sbjct: 158 PGAALWNISGPVEEARDGHSGLLRL-PLGTPSSMSRFFALARVETSWLGPRQG---KDKL 213
Query: 92 LETQFMLVESKDNSESDQDDGPTIYTVFLPLLEGQFRSALQGNENNEIEICLESGDNAVE 151
T+ ++ S + DG ++ V L + + L E+ I ++ DNA
Sbjct: 214 NFTEDAILLSFLRT-----DG--VHVVLLGVTVDDTLTVLGSGPAGEVVIKSQN-DNATP 265
Query: 152 TNQGLYLVYTHAGPNPFEVISQAV-----KAVEKYMQTFTHREKKK-LPSFLDWFGWCTW 205
+ + L T A FEV + A+ + V Y T + + L + D +CTW
Sbjct: 266 SRFQV-LAATAAD---FEVATSALIYEARRLVRPYENTAQGGPRTQWLSEWYDGLAYCTW 321
Query: 206 DAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENK 246
+ D++ E + L L G + LIIDD WQ ++N+
Sbjct: 322 NGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNE 362
Score = 61 (26.5 bits), Expect = 7.2e-37, Sum P(3) = 7.2e-37
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 619 IVYAHRSGEVV-RLPKGASVPVTLKVLEYELFHFCPLKEIS 658
IV AHR+G +V L ++V VTL +E+ P+K ++
Sbjct: 695 IVRAHRTGRIVGELHSSSAVSVTLNERRWEVLTAYPVKTLT 735
Score = 41 (19.5 bits), Expect = 8.6e-35, Sum P(3) = 8.6e-35
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 667 GLLDMFNSGGAVENVEVHMSEKKPDLFDGE 696
G++ +FN VE+V + +++ P ++D +
Sbjct: 661 GIIGVFNVSNRVESVIIPVADF-PGIYDDQ 689
>UNIPROTKB|G4NBB7 [details] [associations]
symbol:MGG_11554 "Seed imbibition protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0052051 "interaction with host via protein
secreted by type II secretion system" evidence=IDA]
InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
Uniprot:G4NBB7
Length = 908
Score = 337 (123.7 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
Identities = 103/331 (31%), Positives = 156/331 (47%)
Query: 273 KFQKKCQNSEQVSGLKHVVDE-SKQNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALA 331
+F+ Q Q GLK +V E KQN ++ + VWH + GYWGG+ P+ Y
Sbjct: 393 RFEANQQGFPQ--GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKI 450
Query: 332 YPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNII 391
V QP D V G + V Y++ +A+LA CGV KVD Q +
Sbjct: 451 QLRDEAEV---QPKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFL 500
Query: 392 ETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS--KQ------TA 443
+ A R +L R Y A A+ +++F I+CM I S +Q
Sbjct: 501 D-YPAHANDRKNLIRPYQDAWTAAASKHFGGRA-IACMAQTPQSILHSLLQQGRSEGPML 558
Query: 444 VIRASDDYYPRDPASHTIHISSVAYNTLFLGEF-MQPDWDMFHSLHPA-AEYHGAARAVG 501
+ R SDD++P + SHT H+ A+N L + + DWDMF + P A H AR++
Sbjct: 559 MARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMS 618
Query: 502 GCAIYVSDKPGNHNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSLLKV 557
G IY++D PG H+ +L++++ DG LRA PGR L+ LL+V
Sbjct: 619 GGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRV 674
Query: 558 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 588
+ ++ G++GVFN G + ++ R+ D
Sbjct: 675 RSGHQGVGMLGVFNVCNRG-SLLGEQVRLDD 704
Score = 90 (36.7 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
Identities = 18/62 (29%), Positives = 32/62 (51%)
Query: 190 KKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKE 249
+ ++ + D F +CTW++ D++ + + L LS G LIIDD WQ ++ +
Sbjct: 326 RAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSD 385
Query: 250 ES 251
S
Sbjct: 386 AS 387
Score = 46 (21.3 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 666 IGLLDMFN--SGGAVENVEVHMSEKKPDLFDGEVSSE 700
+G+L +FN + G++ +V + D+FDGE + E
Sbjct: 681 VGMLGVFNVCNRGSLLGEQVRLD----DIFDGEKAGE 713
>UNIPROTKB|Q97U94 [details] [associations]
symbol:galS "Alpha-galactosidase" species:273057
"Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
[GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
"glycosylceramide catabolic process" evidence=ISS]
InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
Length = 648
Score = 238 (88.8 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
Identities = 67/192 (34%), Positives = 96/192 (50%)
Query: 381 DGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSSK 440
D VKVD Q +I + ++ +R+ AL+ S+ ++ I+CM N + +
Sbjct: 362 DLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGKDV-----INCMSMNPENYCNYF 415
Query: 441 QTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAV 500
+ V+R S DY P +HI AYN+L + PD+DMF S P A+ H AR
Sbjct: 416 YSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVF 475
Query: 501 GGCAIYVSDK-PGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN 559
G IY++D+ P N +LLR VLP+G V+R P T D LF DP R+ LLK+
Sbjct: 476 SGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRERV-LLKLKG 534
Query: 560 VNKCSGVVGVFN 571
K + FN
Sbjct: 535 KVKGYNAIAFFN 546
Score = 156 (60.0 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
Identities = 42/139 (30%), Positives = 64/139 (46%)
Query: 116 YTVFLPLLEGQFRSALQGNENNEI-------EICLESGDNAVETNQGLYLVYTHAGPNPF 168
YTVF + G A NN + + L +G N E + Y + NP+
Sbjct: 133 YTVFALVKSGNSYEAFFTLSNNYVTAYLFGDSVRLYTGFNTDEIKRS-YFLSIGTSDNPY 191
Query: 169 EVISQAVKAVEKYMQTFTHREKKKLPS-FLDWFGWCTWDAFYT-DVTAEGVDEGLKSLSA 226
+ I A+ K TF R++K P ++ GWC+W+AF T D+ E + + +K +
Sbjct: 192 KAIENAINIASKETFTFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIE 251
Query: 227 GGTPPKFLIIDDGWQQIEN 245
G ++IIDDGWQ N
Sbjct: 252 RGLRLNWVIIDDGWQDQNN 270
Score = 77 (32.2 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
Identities = 21/63 (33%), Positives = 32/63 (50%)
Query: 254 IVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYW 313
I+ +G Q + I+ + KK N G K+ V K + VKYV +WHA+ +W
Sbjct: 260 IIDDGWQDQNNDRAIRSLNPDNKKFPN-----GFKNTVRAIK-SLGVKYVGLWHAINAHW 313
Query: 314 GGV 316
GG+
Sbjct: 314 GGM 316
>UNIPROTKB|Q8A170 [details] [associations]
symbol:BT_3797 "Possible alpha-galactosidase"
species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
"alpha-galactosidase activity" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
Uniprot:Q8A170
Length = 693
Score = 196 (74.1 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
Identities = 54/191 (28%), Positives = 84/191 (43%)
Query: 362 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 421
+K+ +Y + G D +K+D Q+ L G + + + ALE R
Sbjct: 348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHRM-- 405
Query: 422 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 481
G ++CM N I + ++V RAS DY D H+ NTL LG+ + PD
Sbjct: 406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465
Query: 482 DMFHSLHPAA-EYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 540
DMFHS ++A+ G +Y+SD P D +R L+ G + R P PT
Sbjct: 466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525
Query: 541 DCLFADPARDG 551
+ + +P + G
Sbjct: 526 ESILTNPLQSG 536
Score = 130 (50.8 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
Identities = 34/151 (22%), Positives = 79/151 (52%)
Query: 129 SALQGNENNEIEICLES-GDNAVETNQGLYLVYTHAGPNPFEVISQAVKAV--EKYMQTF 185
S Q N++ + + + + G++A+ T + L++ + + + V S A ++ +K +
Sbjct: 158 SWFQVNQDGTLTLYVSTLGEDAL-TGRLPLLIFRKSS-SVYHVFSDAYDSLIADKAVSAL 215
Query: 186 THREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIEN 245
R K+ + D+ GWCTW+ ++ D+ + + ++ A G P ++++IDDG I N
Sbjct: 216 RKRADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG--HIAN 273
Query: 246 KPKEESNCIVQEGAQFASRLTGIKENSKFQK 276
K ++ ++ +V + +F + + I + + K
Sbjct: 274 KNRQLTS-LVPDKKRFPNGWSRIMKRKQADK 303
Score = 66 (28.3 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 295 KQNHNVKYVYVWHALAGYWGGVKPAAD 321
KQ ++++ +W++L+GYW G+ D
Sbjct: 299 KQADKIRWIGLWYSLSGYWMGISAEND 325
Score = 50 (22.7 bits), Expect = 1.9e-23, Sum P(4) = 1.9e-23
Identities = 19/80 (23%), Positives = 37/80 (46%)
Query: 648 LFHFCPLKEISSNISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSD 707
LFH CP+++ +A IG+ + + S V+ ++ +EK + D + L +D
Sbjct: 621 LFHLCPIRK-----GWAVIGIQEKYLSPATVQILK-RTTEKL--ILDVHCTGTLRI-WAD 671
Query: 708 NRSPTATISLKVRGCGRFGI 727
+ S+ ++ GR I
Sbjct: 672 SHGKQELRSIPIKKAGRIEI 691
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.318 0.135 0.417 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 774 774 0.00093 121 3 11 22 0.41 34
37 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 10
No. of states in DFA: 624 (66 KB)
Total size of DFA: 424 KB (2203 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 63.93u 0.11s 64.04t Elapsed: 00:00:03
Total cpu time: 63.94u 0.11s 64.05t Elapsed: 00:00:03
Start: Sat May 11 00:40:47 2013 End: Sat May 11 00:40:50 2013