Your job contains 1 sequence.
>007685
MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ
QIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVK
YVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHP
KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP
DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW
DMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRD
CLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVT
DVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSNI
SFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVRG
CGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 007685
(593 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2103488 - symbol:SIP2 "AT3G57520" species:3702... 2644 4.9e-275 1
TAIR|locus:2020452 - symbol:SIP1 "AT1G55740" species:3702... 1827 1.9e-201 2
TAIR|locus:2170528 - symbol:SIP1 "AT5G40390" species:3702... 1005 1.9e-106 2
UNIPROTKB|Q5VQG4 - symbol:RFS "Galactinol--sucrose galact... 989 3.5e-105 2
TAIR|locus:2141425 - symbol:STS "AT4G01970" species:3702 ... 734 8.0e-102 3
UNIPROTKB|Q93XK2 - symbol:STS1 "Stachyose synthase" speci... 783 8.3e-83 2
ASPGD|ASPL0000010056 - symbol:aglF species:162425 "Emeric... 340 2.3e-35 3
UNIPROTKB|G4NBB7 - symbol:MGG_11554 "Seed imbibition prot... 337 1.7e-32 3
UNIPROTKB|Q97U94 - symbol:galS "Alpha-galactosidase" spec... 238 5.6e-24 2
UNIPROTKB|Q8A170 - symbol:BT_3797 "Possible alpha-galacto... 196 1.2e-22 4
>TAIR|locus:2103488 [details] [associations]
symbol:SIP2 "AT3G57520" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0080167 "response to karrikin" evidence=IEP] [GO:0034484
"raffinose catabolic process" evidence=IDA] [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
[GO:0052692 "raffinose alpha-galactosidase activity" evidence=IDA]
[GO:0009506 "plasmodesma" evidence=IDA] InterPro:IPR013785
GO:GO:0009506 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0080167
EMBL:AL133248 GO:GO:0034484 CAZy:GH36 GO:GO:0052692 eggNOG:NOG06986
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 KO:K06617
GO:GO:0047274 EMBL:AY050772 EMBL:AK226370 IPI:IPI00526258
IPI:IPI00541537 IPI:IPI00544535 PIR:T46188 RefSeq:NP_191311.1
RefSeq:NP_850715.1 UniGene:At.22207 UniGene:At.30900
ProteinModelPortal:Q94A08 STRING:Q94A08 PaxDb:Q94A08 PRIDE:Q94A08
EnsemblPlants:AT3G57520.1 GeneID:824919 KEGG:ath:AT3G57520
TAIR:At3g57520 InParanoid:Q9SCM1 OMA:FHHREKK PhylomeDB:Q94A08
ProtClustDB:PLN02219 BioCyc:ARA:AT3G57520-MONOMER
BioCyc:MetaCyc:AT3G57520-MONOMER Uniprot:Q94A08
Length = 773
Score = 2644 (935.8 bits), Expect = 4.9e-275, P = 4.9e-275
Identities = 482/594 (81%), Positives = 536/594 (90%)
Query: 1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
MQTF HREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLS GGTPPKFLIIDDGWQ
Sbjct: 182 MQTFHHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDDGWQ 241
Query: 61 QIENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVK 120
QIENK K+E NC+VQEGAQFA+RL GIKEN+KFQK Q QVSGLK VVD +KQ HNVK
Sbjct: 242 QIENKEKDE-NCVVQEGAQFATRLVGIKENAKFQKSDQKDTQVSGLKSVVDNAKQRHNVK 300
Query: 121 YVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHP 180
VY WHALAGYWGGVKPAA GMEHYD+ALAYPV SPGV+GNQPDIVMDSLAVHGLGLV+P
Sbjct: 301 QVYAWHALAGYWGGVKPAASGMEHYDSALAYPVQSPGVLGNQPDIVMDSLAVHGLGLVNP 360
Query: 181 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 240
KKVFNFYNELH+YLASCG+DGVKVDVQNIIETLGAG GGRVSLTRSY QALEASIARNF
Sbjct: 361 KKVFNFYNELHSYLASCGIDGVKVDVQNIIETLGAGLGGRVSLTRSYQQALEASIARNFT 420
Query: 241 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 300
DNGCISCMCHNTDG+YS+KQTA++RASDD+YPRDPASHTIHI+SVAYN+LFLGEFMQPDW
Sbjct: 421 DNGCISCMCHNTDGLYSAKQTAIVRASDDFYPRDPASHTIHIASVAYNSLFLGEFMQPDW 480
Query: 301 DMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRD 360
DMFHSLHP AEYH AARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRA+LPGRPTRD
Sbjct: 481 DMFHSLHPTAEYHAAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAKLPGRPTRD 540
Query: 361 CLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVT 420
CLFADPARDG SLLK+WN+NK +G+VGVFNCQGAGWCK TKK +IHD SPGTLT S+R
Sbjct: 541 CLFADPARDGISLLKIWNMNKFTGIVGVFNCQGAGWCKETKKNQIHDTSPGTLTGSIRAD 600
Query: 421 DVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKEISSNI 480
D + ++Q+AG W+GD+IVYA+RSGEVVRLPKGAS+P+TLKVLEYELFH PLKEI+ NI
Sbjct: 601 DADLISQVAGEDWSGDSIVYAYRSGEVVRLPKGASIPLTLKVLEYELFHISPLKEITENI 660
Query: 481 SFAAIGLLDMFNSGGAVENVEV-HMSEKKPDLFDGEVSSELTTSLSDNRSPTATISLKVR 539
SFA IGL+DMFNS GA+E++++ H+++K P+ FDGE+SS + +LSDNRSPTA +S+ VR
Sbjct: 661 SFAPIGLVDMFNSSGAIESIDINHVTDKNPEFFDGEISSA-SPALSDNRSPTALVSVSVR 719
Query: 540 GCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEMYRWPVEIQV 593
GCGRFG YSSQRPLKC V S +TDFTYD+ GL+T+ LPV EEM+RW VEI V
Sbjct: 720 GCGRFGAYSSQRPLKCAVESTETDFTYDAEVGLVTLNLPVTREEMFRWHVEILV 773
>TAIR|locus:2020452 [details] [associations]
symbol:SIP1 "AT1G55740" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
InterPro:IPR013785 EMBL:CP002684 GenomeReviews:CT485782_GR
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0005975
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AC002328 PIR:C96599 EMBL:BT004640 EMBL:AK227977
IPI:IPI00543579 RefSeq:NP_175970.1 UniGene:At.47524
UniGene:At.67212 ProteinModelPortal:Q84VX0 IntAct:Q84VX0
PaxDb:Q84VX0 PRIDE:Q84VX0 EnsemblPlants:AT1G55740.1 GeneID:842023
KEGG:ath:AT1G55740 TAIR:At1g55740 HOGENOM:HOG000237551
InParanoid:Q84VX0 KO:K06617 OMA:LTHIKEN PhylomeDB:Q84VX0
ProtClustDB:PLN02355 Genevestigator:Q84VX0 GO:GO:0047274
Uniprot:Q84VX0
Length = 754
Score = 1827 (648.2 bits), Expect = 1.9e-201, Sum P(2) = 1.9e-201
Identities = 330/502 (65%), Positives = 403/502 (80%)
Query: 1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
+QTF+HRE+KK+P L+WFGWCTWDAFYT+VTA+ V +GL+SL AGG PKF+IIDDGWQ
Sbjct: 182 LQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAKDVKQGLESLKAGGVTPKFVIIDDGWQ 241
Query: 61 QIE-NKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS----GLKHVVDESKQ 115
+ ++ E N A FA+RLT IKEN KFQK + +V L HV+ + K
Sbjct: 242 SVGMDETSVEFNA--DNAANFANRLTHIKENHKFQKDGKEGHRVDDPSLSLGHVITDIKS 299
Query: 116 NHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGL 175
N+++KYVYVWHA+ GYWGGVKP GMEHY++ +AYPV+SPGVM ++ ++S+ +GL
Sbjct: 300 NNSLKYVYVWHAITGYWGGVKPGVSGMEHYESKVAYPVSSPGVMSSENCGCLESITKNGL 359
Query: 176 GLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASI 235
GLV+P+KVF+FYN+LH+YLAS GVDGVKVDVQNI+ETLGAGHGGRV L + YHQALEASI
Sbjct: 360 GLVNPEKVFSFYNDLHSYLASVGVDGVKVDVQNILETLGAGHGGRVKLAKKYHQALEASI 419
Query: 236 ARNFPDNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEF 295
+RNFPDNG ISCM HNTDG+YS+K+TAVIRASDD++PRDPASHTIHI+SVAYNTLFLGEF
Sbjct: 420 SRNFPDNGIISCMSHNTDGLYSAKKTAVIRASDDFWPRDPASHTIHIASVAYNTLFLGEF 479
Query: 296 MQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPG 355
MQPDWDMFHSLHP AEYH AARAVGGCAIYVSDKPG H+F+LLRKLVL DGS+LRA+LPG
Sbjct: 480 MQPDWDMFHSLHPMAEYHAAARAVGGCAIYVSDKPGQHDFNLLRKLVLRDGSILRAKLPG 539
Query: 356 RPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTA 415
RPT DC F+DP RD SLLK+WN+N+ +GV+GVFNCQGAGWCK K+ IHD+ PGT++
Sbjct: 540 RPTSDCFFSDPVRDNKSLLKIWNLNEFTGVIGVFNCQGAGWCKNEKRYLIHDQEPGTISG 599
Query: 416 SVRVTDVENMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVLEYELFHFCPLKE 475
VR DV + ++A W GD+IVY+H GE+V LPK S+PVTL EYE+F P+KE
Sbjct: 600 CVRTNDVHYLHKVAAFEWTGDSIVYSHLRGELVYLPKDTSLPVTLMPREYEVFTVVPVKE 659
Query: 476 ISSNISFAAIGLLDMFNSGGAV 497
S FA +GL++MFNSGGA+
Sbjct: 660 FSDGSKFAPVGLMEMFNSGGAI 681
Score = 145 (56.1 bits), Expect = 1.9e-201, Sum P(2) = 1.9e-201
Identities = 29/68 (42%), Positives = 42/68 (61%)
Query: 526 DNRSPTATISLKVRGCGRFGIYSS-QRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEEEM 584
D+ + +K+RG G G+YSS +RP TV S ++ Y+ +GL+T TL VPE+E+
Sbjct: 687 DDEGTKFVVRMKLRGSGLVGVYSSVRRPRSVTVDSDDVEYRYEPESGLVTFTLGVPEKEL 746
Query: 585 YRWPVEIQ 592
Y W V IQ
Sbjct: 747 YLWDVVIQ 754
>TAIR|locus:2170528 [details] [associations]
symbol:SIP1 "AT5G40390" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0005986 "sucrose biosynthetic process" evidence=IMP]
[GO:0010325 "raffinose family oligosaccharide biosynthetic process"
evidence=IMP] [GO:0019593 "mannitol biosynthetic process"
evidence=IMP] [GO:0047274 "galactinol-sucrose galactosyltransferase
activity" evidence=ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0006979 "response to oxidative stress" evidence=IEP]
[GO:0009414 "response to water deprivation" evidence=IEP]
[GO:0009737 "response to abscisic acid stimulus" evidence=IDA]
InterPro:IPR013785 GO:GO:0009737 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0009507 GO:GO:0006979
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0009414
CAZy:GH36 InterPro:IPR008811 Pfam:PF05691 GO:GO:0006012
EMBL:AB006702 HOGENOM:HOG000237551 KO:K06617 GO:GO:0047274
EMBL:AY062781 EMBL:AY081645 IPI:IPI00530152 RefSeq:NP_198855.1
UniGene:At.8441 ProteinModelPortal:Q9FND9 STRING:Q9FND9
PaxDb:Q9FND9 PRIDE:Q9FND9 EnsemblPlants:AT5G40390.1 GeneID:834037
KEGG:ath:AT5G40390 TAIR:At5g40390 eggNOG:NOG287560
InParanoid:Q9FND9 OMA:ETRRNQC PhylomeDB:Q9FND9 ProtClustDB:PLN02711
Uniprot:Q9FND9
Length = 783
Score = 1005 (358.8 bits), Expect = 1.9e-106, Sum P(2) = 1.9e-106
Identities = 208/518 (40%), Positives = 302/518 (58%)
Query: 1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
M TF E+K P +D FGWCTWDAFY V +GV +G+K L GG PP ++IDDGWQ
Sbjct: 206 MNTFKLLEEKSPPGIVDKFGWCTWDAFYLTVNPDGVHKGVKCLVDGGCPPGLVLIDDGWQ 265
Query: 61 QIENKPKE---ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVS-GLKHVVDESKQN 116
I + E I G Q RL +EN KF+ +Q G+K V + K
Sbjct: 266 SIGHDSDGIDVEGMNITVAGEQMPCRLLKFEENHKFKDYVSPKDQNDVGMKAFVRDLKDE 325
Query: 117 HN-VKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGL 175
+ V Y+YVWHAL GYWGG++P A + + + P SPG+ D+ +D + G+
Sbjct: 326 FSTVDYIYVWHALCGYWGGLRPEAPALP--PSTIIRPELSPGLKLTMEDLAVDKIIETGI 383
Query: 176 GLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASI 235
G P FY LH++L + G+DGVKVDV +I+E L +GGRV L ++Y +AL +S+
Sbjct: 384 GFASPDLAKEFYEGLHSHLQNAGIDGVKVDVIHILEMLCQKYGGRVDLAKAYFKALTSSV 443
Query: 236 ARNFPDNGCISCMCHNTDGIYSSKQTAVI-RASDDYYPRDPASHT--------IHISSVA 286
++F NG I+ M H D ++ + + R DD++ DP+ H+ A
Sbjct: 444 NKHFNGNGVIASMEHCNDFMFLGTEAISLGRVGDDFWCTDPSGDPNGTFWLQGCHMVHCA 503
Query: 287 YNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDG 346
YN+L++G F+QPDWDMF S HP AE+H A+RA+ G IY+SD G H+FDLL++LVLP+G
Sbjct: 504 YNSLWMGNFIQPDWDMFQSTHPCAEFHAASRAISGGPIYISDCVGKHDFDLLKRLVLPNG 563
Query: 347 SVLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIH 406
S+LR + PTRD LF DP DG ++LK+WN+NK +GV+G FNCQG GWC+ T++ +
Sbjct: 564 SILRCEYYALPTRDRLFEDPLHDGKTMLKIWNLNKYTGVIGAFNCQGGGWCRETRRNQCF 623
Query: 407 DESPGTLTASVRVTDVE---NMAQIAGAGWNGDAIVYAHRSGEVVRLPKGASVPVTLKVL 463
E TLTA+ DVE + I+ A A+ + +S +++ + +TL+
Sbjct: 624 SECVNTLTATTSPKDVEWNSGSSPISIANVEEFAL-FLSQSKKLLLSGLNDDLELTLEPF 682
Query: 464 EYELFHFCPLKEISSN-ISFAAIGLLDMFNSGGAVENV 500
++EL P+ I N + FA IGL++M N+ GA+ ++
Sbjct: 683 KFELITVSPVVTIEGNSVRFAPIGLVNMLNTSGAIRSL 720
Score = 68 (29.0 bits), Expect = 1.9e-106, Sum P(2) = 1.9e-106
Identities = 11/49 (22%), Positives = 27/49 (55%)
Query: 533 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPE 581
++ + V G G F +Y+S++P+ C + +F Y+ + ++ + P+
Sbjct: 726 SVEVGVFGAGEFRVYASKKPVSCLIDGEVVEFGYEDSMVMVQVPWSGPD 774
>UNIPROTKB|Q5VQG4 [details] [associations]
symbol:RFS "Galactinol--sucrose galactosyltransferase"
species:39947 "Oryza sativa Japonica Group" [GO:0047274
"galactinol-sucrose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 Gene3D:3.20.20.70 InterPro:IPR017853
SUPFAM:SSF51445 GO:GO:0005975 InterPro:IPR008811 Pfam:PF05691
EMBL:AP008207 EMBL:CM000138 EMBL:AP003282 KO:K06617 GO:GO:0047274
eggNOG:NOG287560 EMBL:AP003339 RefSeq:NP_001042137.1
UniGene:Os.61038 ProteinModelPortal:Q5VQG4 GeneID:4325200
KEGG:dosa:Os01t0170000-01 KEGG:osa:4325200 Gramene:Q5VQG4
Uniprot:Q5VQG4
Length = 783
Score = 989 (353.2 bits), Expect = 3.5e-105, Sum P(2) = 3.5e-105
Identities = 220/537 (40%), Positives = 305/537 (56%)
Query: 3 TFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQI 62
TF E+K P +D FGWCTWDAFY V EGV EG++ L+ GG PP ++IDDGWQ I
Sbjct: 211 TFRLMEEKTPPPIVDKFGWCTWDAFYLKVHPEGVWEGVRRLADGGCPPGLVLIDDGWQSI 270
Query: 63 ENKPKE-----ESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNH 117
+ + E G Q RL +EN KF+ E G+ V E K
Sbjct: 271 CHDDDDLGSGAEGMNRTSAGEQMPCRLIKFQENYKFR------EYKGGMGGFVREMKAAF 324
Query: 118 -NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLG 176
V+ VYVWHAL GYWGG++P A G+ + P SPG+ D+ +D + +G+G
Sbjct: 325 PTVEQVYVWHALCGYWGGLRPGAPGLP--PAKVVAPRLSPGLQRTMEDLAVDKIVNNGVG 382
Query: 177 LVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIA 236
LV P++ Y LH++L + G+DGVKVDV +++E + +GGRV L ++Y L S+
Sbjct: 383 LVDPRRARELYEGLHSHLQASGIDGVKVDVIHLLEMVCEEYGGRVELAKAYFAGLTESVR 442
Query: 237 RNFPDNGCISCMCHNTDG-IYSSKQTAVIRASDDYYPRDPASHT--------IHISSVAY 287
R+F NG I+ M H D + ++ A+ R DD++ DP+ H+ AY
Sbjct: 443 RHFNGNGVIASMEHCNDFMLLGTEAVALGRVGDDFWCTDPSGDPDGTFWLQGCHMVHCAY 502
Query: 288 NTLFLGEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGS 347
N+L++G F+ PDWDMF S HP A +H A+RAV G +YVSD G H+FDLLR+L LPDG+
Sbjct: 503 NSLWMGAFIHPDWDMFQSTHPCAAFHAASRAVSGGPVYVSDAVGCHDFDLLRRLALPDGT 562
Query: 348 VLRAQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 407
+LR + PTRDCLFADP DG ++LK+WNVNK SGV+G FNCQG GW + ++
Sbjct: 563 ILRCERYALPTRDCLFADPLHDGKTMLKIWNVNKFSGVLGAFNCQGGGWSREARRNMCAA 622
Query: 408 ESPGTLTASVRVTDVENMAQIAGAGWNGDAI-VYAHRSGEVVRLPKGASVPVTLKVLEYE 466
+TA DVE + G GD VY + ++ L + SV +TL+ YE
Sbjct: 623 GFSVPVTARASPADVE----WSHGGGGGDRFAVYFVEARKLQLLRRDESVELTLEPFTYE 678
Query: 467 LFHFCPLKEISS---NISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSEL 520
L P++ I S I FA IGL +M N+GGAV+ E + +K DG+V++E+
Sbjct: 679 LLVVAPVRAIVSPELGIGFAPIGLANMLNAGGAVQGFE---AARK----DGDVAAEV 728
Score = 72 (30.4 bits), Expect = 3.5e-105, Sum P(2) = 3.5e-105
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 538 VRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLP 578
V+G G YSS RP C V +F Y+ G++T+ +P
Sbjct: 730 VKGAGEMVAYSSARPRLCKVNGQDAEFKYED--GIVTVDVP 768
>TAIR|locus:2141425 [details] [associations]
symbol:STS "AT4G01970" species:3702 "Arabidopsis
thaliana" [GO:0004553 "hydrolase activity, hydrolyzing O-glycosyl
compounds" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISM]
[GO:0047268 "galactinol-raffinose galactosyltransferase activity"
evidence=ISS] [GO:0006979 "response to oxidative stress"
evidence=IEP] [GO:0080167 "response to karrikin" evidence=IEP]
InterPro:IPR013785 EMBL:CP002687 GenomeReviews:CT486007_GR
GO:GO:0006979 Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445
GO:GO:0005975 GO:GO:0080167 EMBL:AC007138 EMBL:AL161493 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 HOGENOM:HOG000237551 GO:GO:0047274
EMBL:AK229121 IPI:IPI00852301 PIR:C85025 RefSeq:NP_192106.3
UniGene:At.34347 ProteinModelPortal:Q9SYJ4 PaxDb:Q9SYJ4
PRIDE:Q9SYJ4 EnsemblPlants:AT4G01970.1 GeneID:828186
KEGG:ath:AT4G01970 TAIR:At4g01970 eggNOG:NOG318101
InParanoid:Q0WPF3 KO:K06611 OMA:IASMQQC GO:GO:0047268
Uniprot:Q9SYJ4
Length = 876
Score = 734 (263.4 bits), Expect = 8.0e-102, Sum P(3) = 8.0e-102
Identities = 152/398 (38%), Positives = 237/398 (59%)
Query: 122 VYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPK 181
+YVWHAL G W GV+P + M +A SP + D+ +D + G+GLVHP
Sbjct: 416 IYVWHALCGAWNGVRP--ETMMDLKAKVAPFELSPSLGATMADLAVDKVVEAGIGLVHPS 473
Query: 182 KVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPD 241
K FY+ +H+YLAS GV G K+DV +E+L HGGRV L ++Y+ L S+ +NF
Sbjct: 474 KAHEFYDSMHSYLASVGVTGAKIDVFQTLESLAEEHGGRVELAKAYYDGLTESMIKNFNG 533
Query: 242 NGCISCMCHNTDGIY-SSKQTAVIRASDDYYPRDPASHT--------IHISSVAYNTLFL 292
I+ M + + ++KQ ++ R DD++ +DP +H+ +YN++++
Sbjct: 534 TDVIASMQQCNEFFFLATKQISIGRVGDDFWWQDPYGDPQGVYWLQGVHMIHCSYNSIWM 593
Query: 293 GEFMQPDWDMFHSLHPAAEYHGAARAVGGCAIYVSDKPG--NHNFDLLRKLVLPDGSVLR 350
G+ +QPDWDMF S H AEYH A+RA+ G +Y+SD G +HNFDL++KL DG++ R
Sbjct: 594 GQMIQPDWDMFQSDHVCAEYHAASRAICGGPVYLSDHLGKASHNFDLIKKLAFFDGTIPR 653
Query: 351 AQLPGRPTRDCLFADPARDGTSLLKVWNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESP 410
PTRD LF +P D S+LK++N NK GV+G FNCQGAGW + + + E
Sbjct: 654 CVHYALPTRDSLFKNPLFDKESILKIFNFNKFGGVIGTFNCQGAGWSPEEHRFKGYKECY 713
Query: 411 GTLTASVRVTDVE--NMAQIAGAG--WNGDAIVYAHRSGEVVRL-PKGASVPVTLKVLEY 465
T++ +V V+D+E + AG+ + GD +VY +S E++ + K ++ +TL+ +
Sbjct: 714 TTVSGTVHVSDIEWDQNPEAAGSQVTYTGDYLVYKQQSEEILFMNSKSEAMKITLEPSAF 773
Query: 466 ELFHFCPLKE-ISSNISFAAIGLLDMFNSGGAVENVEV 502
+L F P+ E +SS + FA +GL++MFN G V++++V
Sbjct: 774 DLLSFVPVTELVSSGVRFAPLGLINMFNCVGTVQDMKV 811
Score = 217 (81.4 bits), Expect = 8.0e-102, Sum P(3) = 8.0e-102
Identities = 49/112 (43%), Positives = 59/112 (52%)
Query: 1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
M TF E+KKLP +D FGWCTWDA Y V + G+K GG PKF+IIDDGWQ
Sbjct: 217 MNTFKLLEEKKLPKIVDKFGWCTWDACYLTVDPATIWTGVKEFEDGGVCPKFVIIDDGWQ 276
Query: 61 QI----ENKPKEESNCIVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKH 108
I + K+ N +V G Q +RLT KE KF+ S S H
Sbjct: 277 SINFDGDELDKDAEN-LVLGGEQMTARLTSFKECKKFRNYKGGSFITSDASH 327
Score = 92 (37.4 bits), Expect = 8.0e-102, Sum P(3) = 8.0e-102
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 533 TISLKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 582
+I + V+G GRF YSS P+KC + + +F ++ TG ++ +P EE
Sbjct: 816 SIRVDVKGEGRFMAYSSSAPVKCYLNDKEAEFKWEEETGKLSFFVPWVEE 865
>UNIPROTKB|Q93XK2 [details] [associations]
symbol:STS1 "Stachyose synthase" species:3888 "Pisum
sativum" [GO:0005737 "cytoplasm" evidence=NAS] [GO:0009312
"oligosaccharide biosynthetic process" evidence=IDA] [GO:0047268
"galactinol-raffinose galactosyltransferase activity" evidence=IDA]
InterPro:IPR013785 UniPathway:UPA00925 GO:GO:0005737
Gene3D:3.20.20.70 InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36
InterPro:IPR008811 Pfam:PF05691 GO:GO:0009312 GO:GO:0047268
EMBL:AJ311087 EMBL:AJ512932 ProteinModelPortal:Q93XK2
BioCyc:MetaCyc:MONOMER-12485 BRENDA:2.4.1.67 GO:GO:0033532
Uniprot:Q93XK2
Length = 853
Score = 783 (280.7 bits), Expect = 8.3e-83, Sum P(2) = 8.3e-83
Identities = 180/488 (36%), Positives = 273/488 (55%)
Query: 28 YTDVTAEGVD-EGLKSLSAGGTPPKFLIIDDGWQQIENKPKEESNCIVQEGAQFASRLTG 86
+TD+ +G++ E L+ K +IE+K K+ +V+E L G
Sbjct: 319 FTDLILKGIEHEKLRKKREEAISSK----SSDLAEIESKIKK----VVKE----IDDLFG 366
Query: 87 IKENSKFQKKCQNSEQVSGLKHVVDESKQNHN-VKYVYVWHALAGYWGGVKPAADGMEHY 145
++ S +K SE GLK + + + VYVWHAL G WGGV+P H
Sbjct: 367 GEQFSSGEKSEMKSEY--GLKAFTKDLRTKFKGLDDVYVWHALCGAWGGVRPETT---HL 421
Query: 146 DTALAYPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVD 205
DT + SPG+ G D+ + ++ LGLVHP + Y+ +H+YLA G+ GVKVD
Sbjct: 422 DTKIVPCKLSPGLDGTMEDLAVVEISKASLGLVHPSQANELYDSMHSYLAESGITGVKVD 481
Query: 206 VQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIY-SSKQTAVI 264
V + +E + +GGRV L + Y++ L SI +NF NG I+ M H D + +KQ ++
Sbjct: 482 VIHSLEYVCDEYGGRVDLAKVYYEGLTKSIVKNFNGNGMIASMQHCNDFFFLGTKQISMG 541
Query: 265 RASDDYYPRDPASHT--------IHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAA 316
R DD++ +DP +H+ +YN+L++G+ +QPDWDMF S H A++H +
Sbjct: 542 RVGDDFWFQDPNGDPMGSFWLQGVHMIHCSYNSLWMGQMIQPDWDMFQSDHVCAKFHAGS 601
Query: 317 RAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKV 376
RA+ G IYVSD G+H+FDL++KLV PDG++ + PTRDCLF +P D T++LK+
Sbjct: 602 RAICGGPIYVSDNVGSHDFDLIKKLVFPDGTIPKCIYFPLPTRDCLFKNPLFDHTTVLKI 661
Query: 377 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHDESPGTLTASVRVTDVE--NMAQIAGAGWN 434
WN NK GV+G FNCQGAGW I +K R E + +V VT+VE + + G
Sbjct: 662 WNFNKYGGVIGAFNCQGAGWDPIMQKFRGFPECYKPIPGTVHVTEVEWDQKEETSHLGKA 721
Query: 435 GDAIVYAHRSGEVVRLP-KGASVPVTLKVLEYELFHFCPLKEISSNISFAAIGLLDMFNS 493
+ +VY +++ E+ + K + T++ +EL+ F P+ ++ I FA IGL +MFNS
Sbjct: 722 EEYVVYLNQAEELSLMTLKSEPIQFTIQPSTFELYSFVPVTKLCGGIKFAPIGLTNMFNS 781
Query: 494 GGAVENVE 501
GG V ++E
Sbjct: 782 GGTVIDLE 789
Score = 217 (81.4 bits), Expect = 5.0e-16, Sum P(2) = 5.0e-16
Identities = 43/98 (43%), Positives = 57/98 (58%)
Query: 1 MQTFTHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
+ +F E+K +P+ +D FGWCTWDAFY V G+ GL S GG P+F+IIDDGWQ
Sbjct: 203 LNSFRLLEEKTIPNLVDKFGWCTWDAFYLTVNPIGIFHGLDDFSKGGVEPRFVIIDDGWQ 262
Query: 61 QIE---NKPKEESNCIVQEGAQFASRLTGIKENSKFQK 95
I P E++ +V G Q + RL E KF+K
Sbjct: 263 SISFDGYDPNEDAKNLVLGGEQMSGRLHRFDECYKFRK 300
Score = 66 (28.3 bits), Expect = 8.3e-83, Sum P(2) = 8.3e-83
Identities = 16/47 (34%), Positives = 25/47 (53%)
Query: 536 LKVRGCGRFGIYSSQRPLKCTVGSIQTDFTYDSATGLMTMTLPVPEE 582
+KV+G G F YSS+ P K + + DF + G + + +P EE
Sbjct: 797 IKVKGGGSFLAYSSESPKKFQLNGCEVDFEW-LGDGKLCVNVPWIEE 842
>ASPGD|ASPL0000010056 [details] [associations]
symbol:aglF species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 EMBL:BN001302
CAZy:GH36 eggNOG:NOG06986 InterPro:IPR008811 Pfam:PF05691
EMBL:AACD01000062 RefSeq:XP_661478.1 EnsemblFungi:CADANIAT00004829
GeneID:2873297 KEGG:ani:AN3874.2 HOGENOM:HOG000189235 OMA:AISCMSQ
OrthoDB:EOG4B2X59 Uniprot:Q5B6F6
Length = 863
Score = 340 (124.7 bits), Expect = 2.3e-35, Sum P(3) = 2.3e-35
Identities = 94/305 (30%), Positives = 152/305 (49%)
Query: 104 SGLKHVVDESKQNH-NVKYVYVWHALAGYWGGVKPAADGMEHYDTALAYPVTSPGVMGNQ 162
+GL V ++ H N++Y+ VWHAL GYWGG+ P Y T
Sbjct: 383 NGLAKAVTTIREQHRNIEYIVVWHALFGYWGGISPEGSLAAIYKTR-------------- 428
Query: 163 PDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVS 222
++ ++S + + P + FYN+ +A+L+ G+ GVK D Q+ ++ L A R S
Sbjct: 429 -EVALNSTTRPSMLTIDPSDIQRFYNDFYAFLSRSGISGVKTDAQSFLDLL-ADPEDRRS 486
Query: 223 LTRSYHQALEASIARNFPDNG--CISCMCHNT--DGIYSSKQTAVIRASDDYYPRDPASH 278
+Y A S R+F C+S + + ++K T V+R S+D++P SH
Sbjct: 487 YANAYQDAWTISSLRHFGPKAISCMSQIPQTIFHSQLPTNKPTIVVRNSNDFFPDIDDSH 546
Query: 279 TIHISSVAYNTLFLGEFMQ--PDWDMFHSLHPA----AEYHGAARAVGGCAIYVSDKPGN 332
T H+ A+N L L ++ PDWDMF +L A +H AAR + G IY++DKPG
Sbjct: 547 TWHVFCNAHNAL-LTRYLNGLPDWDMFQTLPENGLDYASFHAAARCISGGPIYITDKPGQ 605
Query: 333 HNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSL-LKVWN--VNKCSGV 385
H+ L++++ G+ LR + R T D ++ D ++G L + ++ SG+
Sbjct: 606 HDIPLIKQMTASTIQGTTITLRPDIAAR-TLD-MYHD-IKEGHILCVGTYHGRAGSGSGI 662
Query: 386 VGVFN 390
+GVFN
Sbjct: 663 IGVFN 667
Score = 98 (39.6 bits), Expect = 2.3e-35, Sum P(3) = 2.3e-35
Identities = 18/54 (33%), Positives = 28/54 (51%)
Query: 12 LPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENK 65
L + D +CTW+ D++ E + L L G + LIIDD WQ ++N+
Sbjct: 309 LSEWYDGLAYCTWNGLGQDLSEEKILSALDDLKTAGIRIRTLIIDDNWQSLDNE 362
Score = 61 (26.5 bits), Expect = 2.3e-35, Sum P(3) = 2.3e-35
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 438 IVYAHRSGEVV-RLPKGASVPVTLKVLEYELFHFCPLKEIS 477
IV AHR+G +V L ++V VTL +E+ P+K ++
Sbjct: 695 IVRAHRTGRIVGELHSSSAVSVTLNERRWEVLTAYPVKTLT 735
Score = 41 (19.5 bits), Expect = 2.7e-33, Sum P(3) = 2.7e-33
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 486 GLLDMFNSGGAVENVEVHMSEKKPDLFDGE 515
G++ +FN VE+V + +++ P ++D +
Sbjct: 661 GIIGVFNVSNRVESVIIPVADF-PGIYDDQ 689
>UNIPROTKB|G4NBB7 [details] [associations]
symbol:MGG_11554 "Seed imbibition protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0052051 "interaction with host via protein
secreted by type II secretion system" evidence=IDA]
InterPro:IPR013785 GO:GO:0003824 Gene3D:3.20.20.70 EMBL:CM001235
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0008152 InterPro:IPR008811
Pfam:PF05691 GO:GO:0052051 RefSeq:XP_003718463.1
EnsemblFungi:MGG_11554T0 GeneID:2675080 KEGG:mgr:MGG_11554
Uniprot:G4NBB7
Length = 908
Score = 337 (123.7 bits), Expect = 1.7e-32, Sum P(3) = 1.7e-32
Identities = 103/331 (31%), Positives = 156/331 (47%)
Query: 92 KFQKKCQNSEQVSGLKHVVDE-SKQNHNVKYVYVWHALAGYWGGVKPAADGMEHYDTALA 150
+F+ Q Q GLK +V E KQN ++ + VWH + GYWGG+ P+ Y
Sbjct: 393 RFEANQQGFPQ--GLKGLVSEIRKQNPQIRNIAVWHGIFGYWGGMSPSGPMASKYKMRKI 450
Query: 151 YPVTSPGVMGNQPDIVMDSLAVHGLGLVHPKKVFNFYNELHAYLASCGVDGVKVDVQNII 210
V QP D V G + V Y++ +A+LA CGV KVD Q +
Sbjct: 451 QLRDEAEV---QPKD-FDFYTVDG------EDVHKMYDDFYAFLADCGVSAAKVDTQGFL 500
Query: 211 ETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSS--KQ------TA 262
+ A R +L R Y A A+ +++F I+CM I S +Q
Sbjct: 501 D-YPAHANDRKNLIRPYQDAWTAAASKHFGGRA-IACMAQTPQSILHSLLQQGRSEGPML 558
Query: 263 VIRASDDYYPRDPASHTIHISSVAYNTLFLGEF-MQPDWDMFHSLHPA-AEYHGAARAVG 320
+ R SDD++P + SHT H+ A+N L + + DWDMF + P A H AR++
Sbjct: 559 MARNSDDFFPDEVGSHTWHVFCNAHNALLMRHLGVLLDWDMFQTTTPKYAALHAVARSMS 618
Query: 321 GCAIYVSDKPGNHNFDLLRKLVLP--DGSV--LRAQLPGRPTRDCLFADPARDGTSLLKV 376
G IY++D PG H+ +L++++ DG LRA PGR L+ LL+V
Sbjct: 619 GGPIYITDAPGEHDVELIKQMTAQTADGRTIALRADEPGRT----LWPYGGHGEQRLLRV 674
Query: 377 WNVNKCSGVVGVFNCQGAGWCKITKKTRIHD 407
+ ++ G++GVFN G + ++ R+ D
Sbjct: 675 RSGHQGVGMLGVFNVCNRG-SLLGEQVRLDD 704
Score = 90 (36.7 bits), Expect = 1.7e-32, Sum P(3) = 1.7e-32
Identities = 18/62 (29%), Positives = 32/62 (51%)
Query: 9 KKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKPKE 68
+ ++ + D F +CTW++ D++ + + L LS G LIIDD WQ ++ +
Sbjct: 326 RAQIDDWNDGFAYCTWNSLGQDLSHDKILGALTRLSESGINIANLIIDDNWQSLDGDGSD 385
Query: 69 ES 70
S
Sbjct: 386 AS 387
Score = 46 (21.3 bits), Expect = 1.7e-32, Sum P(3) = 1.7e-32
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 485 IGLLDMFN--SGGAVENVEVHMSEKKPDLFDGEVSSE 519
+G+L +FN + G++ +V + D+FDGE + E
Sbjct: 681 VGMLGVFNVCNRGSLLGEQVRLD----DIFDGEKAGE 713
>UNIPROTKB|Q97U94 [details] [associations]
symbol:galS "Alpha-galactosidase" species:273057
"Sulfolobus solfataricus P2" [GO:0004557 "alpha-galactosidase
activity" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0009311 "oligosaccharide metabolic process" evidence=ISS]
[GO:0016139 "glycoside catabolic process" evidence=ISS] [GO:0046477
"glycosylceramide catabolic process" evidence=ISS]
InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 CAZy:GH36 GO:GO:0004557
GO:GO:0052692 GO:GO:0016139 GO:GO:0046477 GO:GO:0009311
EMBL:AE006641 PIR:D90496 RefSeq:NP_344437.1
ProteinModelPortal:Q97U94 GeneID:1453146 GenomeReviews:AE006641_GR
KEGG:sso:SSO3127 eggNOG:NOG06986 HOGENOM:HOG000014928 OMA:YNAIAFF
ProtClustDB:CLSK883881 BRENDA:3.2.1.22 SABIO-RK:Q97U94
InterPro:IPR008811 Pfam:PF05691 Uniprot:Q97U94
Length = 648
Score = 238 (88.8 bits), Expect = 5.6e-24, Sum P(2) = 5.6e-24
Identities = 67/192 (34%), Positives = 96/192 (50%)
Query: 200 DGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFPDNGCISCMCHNTDGIYSSK 259
D VKVD Q +I + ++ +R+ AL+ S+ ++ I+CM N + +
Sbjct: 362 DLVKVDNQWVIHAIYDSFPIGLA-SRNIQIALQYSVGKDV-----INCMSMNPENYCNYF 415
Query: 260 QTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDWDMFHSLHPAAEYHGAARAV 319
+ V+R S DY P +HI AYN+L + PD+DMF S P A+ H AR
Sbjct: 416 YSNVMRNSIDYVPFWKDGTKLHIMFNAYNSLLTSHIVYPDYDMFMSYDPYAKVHLVARVF 475
Query: 320 GGCAIYVSDK-PGNHNFDLLRKLVLPDGSVLRAQLPGRPTRDCLFADPARDGTSLLKVWN 378
G IY++D+ P N +LLR VLP+G V+R P T D LF DP R+ LLK+
Sbjct: 476 SGGPIYITDRHPERTNIELLRMAVLPNGEVIRVDEPALITEDLLFKDPLRERV-LLKLKG 534
Query: 379 VNKCSGVVGVFN 390
K + FN
Sbjct: 535 KVKGYNAIAFFN 546
Score = 116 (45.9 bits), Expect = 5.6e-24, Sum P(2) = 5.6e-24
Identities = 24/64 (37%), Positives = 37/64 (57%)
Query: 3 TFTHREKKKLPS-FLDWFGWCTWDAFYT-DVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQ 60
TF R++K P ++ GWC+W+AF T D+ E + + +K + G ++IIDDGWQ
Sbjct: 207 TFKLRKEKGFPDKVMNGLGWCSWNAFLTKDLNEENLIKVVKGIIERGLRLNWVIIDDGWQ 266
Query: 61 QIEN 64
N
Sbjct: 267 DQNN 270
Score = 77 (32.2 bits), Expect = 6.6e-20, Sum P(2) = 6.6e-20
Identities = 21/63 (33%), Positives = 32/63 (50%)
Query: 73 IVQEGAQFASRLTGIKENSKFQKKCQNSEQVSGLKHVVDESKQNHNVKYVYVWHALAGYW 132
I+ +G Q + I+ + KK N G K+ V K + VKYV +WHA+ +W
Sbjct: 260 IIDDGWQDQNNDRAIRSLNPDNKKFPN-----GFKNTVRAIK-SLGVKYVGLWHAINAHW 313
Query: 133 GGV 135
GG+
Sbjct: 314 GGM 316
>UNIPROTKB|Q8A170 [details] [associations]
symbol:BT_3797 "Possible alpha-galactosidase"
species:226186 "Bacteroides thetaiotaomicron VPI-5482" [GO:0004557
"alpha-galactosidase activity" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0009311 "oligosaccharide metabolic
process" evidence=ISS] [GO:0016139 "glycoside catabolic process"
evidence=ISS] [GO:0046477 "glycosylceramide catabolic process"
evidence=ISS] InterPro:IPR013785 GO:GO:0005737 Gene3D:3.20.20.70
InterPro:IPR017853 SUPFAM:SSF51445 GO:GO:0004557 GO:GO:0016139
GO:GO:0046477 GO:GO:0009311 InterPro:IPR008811 Pfam:PF05691
EMBL:AE015928 GenomeReviews:AE015928_GR RefSeq:NP_812708.1
ProteinModelPortal:Q8A170 GeneID:1072651 KEGG:bth:BT_3797
PATRIC:21062607 HOGENOM:HOG000291022 OMA:YPDYDMW
ProtClustDB:CLSK2757476 BioCyc:BTHE226186:GJXV-3866-MONOMER
Uniprot:Q8A170
Length = 693
Score = 196 (74.1 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
Identities = 54/191 (28%), Positives = 84/191 (43%)
Query: 181 KKVFNFYNELHAYLASCGVDGVKVDVQNIIETLGAGHGGRVSLTRSYHQALEASIARNFP 240
+K+ +Y + G D +K+D Q+ L G + + + ALE R
Sbjct: 348 EKIETWYEYYVRTMKEYGFDFLKIDNQSFTLPLYMGGTQVIRQAKDCNLALEHQTHRM-- 405
Query: 241 DNGCISCMCHNTDGIYSSKQTAVIRASDDYYPRDPASHTIHISSVAYNTLFLGEFMQPDW 300
G ++CM N I + ++V RAS DY D H+ NTL LG+ + PD
Sbjct: 406 QMGLMNCMAQNVLNIDHTLYSSVTRASIDYKKYDENMAKSHLFQSYTNTLILGQTVWPDH 465
Query: 301 DMFHSLHPAA-EYHGAARAVGGCAIYVSDKPGNHNFDLLRKLVLPDGSVLRAQLPGRPTR 359
DMFHS ++A+ G +Y+SD P D +R L+ G + R P PT
Sbjct: 466 DMFHSCDTVCGSLMARSKAISGGPVYLSDSPSEFIADNIRPLIDETGKIFRPAAPAIPTP 525
Query: 360 DCLFADPARDG 370
+ + +P + G
Sbjct: 526 ESILTNPLQSG 536
Score = 115 (45.5 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
Identities = 23/89 (25%), Positives = 49/89 (55%)
Query: 7 REKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSAGGTPPKFLIIDDGWQQIENKP 66
R K+ + D+ GWCTW+ ++ D+ + + ++ A G P ++++IDDG I NK
Sbjct: 218 RADKQYFNAFDYLGWCTWEHYHYDIDETKILNDIDAIEASGIPVRYVLIDDG--HIANKN 275
Query: 67 KEESNCIVQEGAQFASRLTGIKENSKFQK 95
++ ++ +V + +F + + I + + K
Sbjct: 276 RQLTS-LVPDKKRFPNGWSRIMKRKQADK 303
Score = 66 (28.3 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 114 KQNHNVKYVYVWHALAGYWGGVKPAAD 140
KQ ++++ +W++L+GYW G+ D
Sbjct: 299 KQADKIRWIGLWYSLSGYWMGISAEND 325
Score = 50 (22.7 bits), Expect = 1.2e-22, Sum P(4) = 1.2e-22
Identities = 19/80 (23%), Positives = 37/80 (46%)
Query: 467 LFHFCPLKEISSNISFAAIGLLDMFNSGGAVENVEVHMSEKKPDLFDGEVSSELTTSLSD 526
LFH CP+++ +A IG+ + + S V+ ++ +EK + D + L +D
Sbjct: 621 LFHLCPIRK-----GWAVIGIQEKYLSPATVQILK-RTTEKL--ILDVHCTGTLRI-WAD 671
Query: 527 NRSPTATISLKVRGCGRFGI 546
+ S+ ++ GR I
Sbjct: 672 SHGKQELRSIPIKKAGRIEI 691
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.318 0.134 0.418 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 593 593 0.00084 120 3 11 22 0.41 34
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 10
No. of states in DFA: 623 (66 KB)
Total size of DFA: 361 KB (2179 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 48.76u 0.17s 48.93t Elapsed: 00:00:02
Total cpu time: 48.76u 0.17s 48.93t Elapsed: 00:00:02
Start: Fri May 10 09:00:05 2013 End: Fri May 10 09:00:07 2013