Your job contains 1 sequence.
>038979
SAAGVQLHEQNNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRGIEATNY
KVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLKGWPGFQ
LFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRERGHKLAYEEAVLLPNGEVDDKYQYS
MDAKDIRVHGWISTDSTVGFWQILPSSESRSFGPLKQFLTSHTGPISINTFHSTHYVGEN
FGMKFKDGEAWKKIFGPFLVYVNSVAGKGDRQMLWRDANRQFMNEVKSWPYKFPASKDFA
RSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQTECKGYQFWTVANEGGN
FSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLVYKAPRNGPTLWEIGIP
DRSAAEFYIPNPNPKYINKLYVKHDRFRQYGLWERYAELHRKRDLVYEVWANNYRKDWYF
AQNTRKKGNKYEGSTWQIQFKLEGVVKKATYKLRVAVAAAHGAELQVRVNSRSARRPLFS
SGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLSQPRKLDAFTGIMYDYLRFE
GPDPNS
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 038979
(606 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs:  High Score  Smallest Sum Probability P(N)  N
TAIR|locus:2024331 - symbol:AT1G09910 species:3702 "Arabi... 1711 5.0e-190 2
TAIR|locus:2024427 - symbol:AT1G09890 species:3702 "Arabi... 1740 4.5e-189 2
TAIR|locus:2024417 - symbol:AT1G09880 species:3702 "Arabi... 1715 4.2e-184 2
TAIR|locus:2136007 - symbol:AT4G24430 species:3702 "Arabi... 1661 1.1e-183 2
TAIR|locus:2066040 - symbol:AT2G22620 species:3702 "Arabi... 1622 9.7e-167 1
TAIR|locus:2121095 - symbol:AT4G38030 species:3702 "Arabi... 1588 3.9e-163 1
TAIR|locus:2121090 - symbol:AT4G37950 species:3702 "Arabi... 1518 1.0e-155 1
TAIR|locus:2200390 - symbol:AT1G65210 "AT1G65210" species... 498 3.8e-56 2
ASPGD|ASPL0000007043 - symbol:rglB species:162425 "Emeric... 203 7.8e-13 2
UNIPROTKB|Q5AZ85 - symbol:rglB "Rhamnogalacturonate lyase... 203 7.8e-13 2
ASPGD|ASPL0000094461 - symbol:AN12098 species:162425 "Eme... 150 1.9e-06 2
ASPGD|ASPL0000100087 - symbol:rglC species:162425 "Emeric... 150 1.9e-06 2
>TAIR|locus:2024331
symbol:AT1G09910 species:3702 "Arabidopsis thaliana"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0005576
"extracellular region" evidence=ISM] [GO:0005975 "carbohydrate
metabolic process" evidence=IEA] [GO:0016829 "lyase activity"
evidence=ISS] [GO:0030246 "carbohydrate binding" evidence=IEA]
InterPro:IPR011013 InterPro:IPR013784 InterPro:IPR014766
EMBL:CP002684 GO:GO:0005975 GO:GO:0030246 GO:GO:0004180
Gene3D:2.60.40.1120 InterPro:IPR008979 SUPFAM:SSF49785
SUPFAM:SSF74650 GO:GO:0016829 SUPFAM:SSF49452 InterPro:IPR010325
Pfam:PF06045 IPI:IPI00522987 RefSeq:NP_172462.2 UniGene:At.10238
ProteinModelPortal:F4I2M8 PRIDE:F4I2M8 EnsemblPlants:AT1G09910.1
GeneID:837523 KEGG:ath:AT1G09910 OMA:AKIKMMA Uniprot:F4I2M8
Length = 675
Score = 1711 (607.4 bits), Expect = 5.0e-190, Sum P(2) = 5.0e-190
Identities = 315/569 (55%), Positives = 409/569 (71%)
Query: 48 NEEDNRGI----EATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSG 103
NE +GI ++VI+ T+EQVE+SF R W P G P+NIDKRF+MLRGSSG
Sbjct: 105 NEPGGKGIFDVISGVTFRVIVETEEQVEISFLRTWDPSLEGKYIPLNIDKRFIMLRGSSG 164
Query: 104 FYSYAIYKRLKGWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRERGH--KLA 161
YSY IY+ LK WPGF+L R+ FK DKFHYM ++ +R+R MP D +G L
Sbjct: 165 VYSYGIYEHLKDWPGFELGETRIAFKLRKDKFHYMAVADDRKRIMPFPDDLCKGRCQTLD 224
Query: 162 YEEAVLL-----P--NGEVDDKYQYSMDAKDIRVHGWISTDSTVGFWQILPSSESRSFGP 214
Y+EA LL P GEVDDKYQYS + KD+RVHGWIS D VGFWQI PS+E RS GP
Sbjct: 225 YQEASLLTAPCDPRLQGEVDDKYQYSCENKDLRVHGWISFDPPVGFWQITPSNEFRSGGP 284
Query: 215 LKQFLTSHTGPISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKGDRQML 274
LKQ LTSH GP ++ FHSTHY G+ +F+ GE WKK++GP +Y+NS A D L
Sbjct: 285 LKQNLTSHVGPTTLAVFHSTHYAGKTMMPRFEHGEPWKKVYGPVFIYLNSTANGDDPLCL 344
Query: 275 WRDANRQFMNEVKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLA 334
W DA + M EV+ WPY F AS D+ +S +RG+ GRL+++DR+++ I+A+GAYVGLA
Sbjct: 345 WDDAKIKMMAEVERWPYSFVASDDYPKSEERGTARGRLLIRDRFINNDLISARGAYVGLA 404
Query: 335 KPGRAGSWQTECKGYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITA 394
PG +GSWQ ECKGYQFW +A+E G FSI NV G YNLYAW+P FIGD+ +R+T+
Sbjct: 405 PPGDSGSWQIECKGYQFWAIADEAGYFSIGNVRPGEYNLYAWVPSFIGDYHNGTIVRVTS 464
Query: 395 GSAKQIGNLVYKAPRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVKH-DRFRQYGLW 453
G ++G++VY+ PR+GPTLWEIGIPDR A+EF+IP+P+P +N++ V H DRFRQYGLW
Sbjct: 465 GCMIEMGDIVYEPPRDGPTLWEIGIPDRKASEFFIPDPDPTLVNRVLVHHQDRFRQYGLW 524
Query: 454 ERYAELHRKRDLVYEVWANNYRKDWYFAQNTRKKGNKYEGSTWQIQFKLEGVVKKATYKL 513
++Y +++ DLVY V ++YR+DW+FA RKKG+ +EG+TWQI F LE + +KA YKL
Sbjct: 525 KKYTDMYPNDDLVYTVGVSDYRRDWFFAHVPRKKGDVHEGTTWQIIFNLENIDQKANYKL 584
Query: 514 RVAVAAAHGAELQVRVNSRSARRPLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRK 573
RVA+A+A AELQ+R+N A RPLF++G +GR+N+IARHGIHGVY L+ V++PG L +
Sbjct: 585 RVAIASATLAELQIRINDAEAIRPLFTTGLIGRDNSIARHGIHGVYMLYAVNIPGNRLVQ 644
Query: 574 GNNTIYLSQPRKLDAFTGIMYDYLRFEGP 602
G+NTI+L QPR F GIMYDY+R EGP
Sbjct: 645 GDNTIFLKQPRCNGPFQGIMYDYIRLEGP 673
Score = 153 (58.9 bits), Expect = 5.0e-190, Sum P(2) = 5.0e-190
Identities = 29/54 (53%), Positives = 40/54 (74%)
Query: 1 SAAGVQLHEQNNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG 54
S+ GV LH + +VVM+NGILQV++S P G + GI+Y G N+L V+N+E NRG
Sbjct: 45 SSHGVHLHVHDRYVVMDNGILQVTLSKPGGIITGIEYNGIDNVLEVRNKETNRG 98
>TAIR|locus:2024427
symbol:AT1G09890 species:3702 "Arabidopsis thaliana"
[GO:0005576 "extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0016829 "lyase activity"
evidence=ISS] [GO:0030246 "carbohydrate binding" evidence=IEA]
InterPro:IPR013784 InterPro:IPR014766 EMBL:CP002684 GO:GO:0030246
GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 GO:GO:0016829 SUPFAM:SSF49452 IPI:IPI00516498
RefSeq:NP_172460.6 UniGene:At.49870 ProteinModelPortal:F4I2M6
PRIDE:F4I2M6 EnsemblPlants:AT1G09890.1 GeneID:837521
KEGG:ath:AT1G09890 OMA:AGSWQRE InterPro:IPR010325 Pfam:PF06045
Uniprot:F4I2M6
Length = 617
Score = 1740 (617.6 bits), Expect = 4.5e-189, Sum P(2) = 4.5e-189
Identities = 329/561 (58%), Positives = 409/561 (72%)
Query: 55 IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLK 114
I+ +N++VI++ +EQ+ELSFTR W P G P+NIDKRF+ML GSSGFY+YAIY+ LK
Sbjct: 57 IKGSNFEVIVKNEEQIELSFTRKWDPSQEGKAVPLNIDKRFVMLSGSSGFYTYAIYEHLK 116
Query: 115 GWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDR--ERGHKLAYEEAVLLPN-- 170
WP F L R+ FK +KFHYM ++ +RQR MPL DR +RG LAY EAVLL N
Sbjct: 117 EWPAFSLAETRIAFKLRKEKFHYMAVTDDRQRFMPLPDDRLPDRGQALAYPEAVLLVNPL 176
Query: 171 -----GEVDDKYQYSMDAKDIRVHGWISTDS-TVGFWQILPSSESRSFGPLKQFLTSHTG 224
GEVDDKYQYS + KDI VHGWI T+ +VGFW I PS E R+ GP KQ LTSH G
Sbjct: 177 ESQFKGEVDDKYQYSCENKDITVHGWICTEQPSVGFWLITPSHEYRTGGPQKQNLTSHVG 236
Query: 225 PISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGK-GDRQMLWRDANRQFM 283
P ++ F S HY GE+ KF +GEAWKK+FGP VY+NS D LW+DA Q
Sbjct: 237 PTALAVFISAHYTGEDLVPKFSEGEAWKKVFGPVFVYLNSSTDDDNDPLWLWQDAKSQMN 296
Query: 284 NEVKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQ 343
E +SWPY FPAS D+ ++ +RG++ GRL+V+DRYV + IAA YVGLA PG AGSWQ
Sbjct: 297 VEAESWPYSFPASDDYVKTEQRGNVVGRLLVQDRYVDKDFIAANRGYVGLAVPGAAGSWQ 356
Query: 344 TECKGYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNL 403
ECK YQFWT +E G F I + G YNLYAWIPGFIGD+KY I IT+G + +L
Sbjct: 357 RECKEYQFWTRTDEEGFFYISGIRPGQYNLYAWIPGFIGDYKYDDVITITSGCYIYVEDL 416
Query: 404 VYKAPRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVKH-DRFRQYGLWERYAELHRK 462
VY+ PRNG TLWEIG PDRSAAEFY+P+PNPKYIN LY H DRFRQYGLWERYAEL+
Sbjct: 417 VYQPPRNGATLWEIGFPDRSAAEFYVPDPNPKYINNLYQNHPDRFRQYGLWERYAELYPD 476
Query: 463 RDLVYEVWANNYRKDWYFAQNTRKKGNK-YEGSTWQIQFKLEGVVKKATYKLRVAVAAAH 521
+DLVY V +++YRKDW++AQ TRKK NK Y+G+TWQI+F+L+ + K +Y LRVA+A+A
Sbjct: 477 KDLVYVVGSSDYRKDWFYAQVTRKKDNKTYQGTTWQIKFELKNIDKNHSYTLRVAIASAT 536
Query: 522 GAELQVRVNSRSARRPLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLS 581
+ELQ+RVN+ +A P+F+SG +GR+N+IARHGIHG+Y LFNV+V G L +G NT++L+
Sbjct: 537 FSELQIRVNNANAS-PMFTSGLIGRDNSIARHGIHGLYWLFNVEVAGSKLLEGENTLFLT 595
Query: 582 QPRKLDAFTGIMYDYLRFEGP 602
QPR F GIMYDY+RFE P
Sbjct: 596 QPRSTSPFQGIMYDYIRFEAP 616
Score = 115 (45.5 bits), Expect = 4.5e-189, Sum P(2) = 4.5e-189
Identities = 23/39 (58%), Positives = 28/39 (71%)
Query: 16 MNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG 54
M+NGI +V++S P G V GI+Y G NLL V NEE NRG
Sbjct: 1 MDNGIARVTLSKPDGIVTGIEYNGIDNLLEVLNEEVNRG 39
>TAIR|locus:2024417
symbol:AT1G09880 species:3702 "Arabidopsis thaliana"
[GO:0005576 "extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0016829 "lyase activity"
evidence=ISS] [GO:0030246 "carbohydrate binding" evidence=IEA]
InterPro:IPR013784 InterPro:IPR014766 EMBL:CP002684
GenomeReviews:CT485782_GR GO:GO:0030246 GO:GO:0004180
Gene3D:2.60.40.1120 InterPro:IPR008979 SUPFAM:SSF49785
GO:GO:0016829 SUPFAM:SSF49452 EMBL:AC000132 CAZy:PL4
InterPro:IPR010325 Pfam:PF06045 IPI:IPI00540834 PIR:B86233
RefSeq:NP_172459.1 UniGene:At.51541 ProteinModelPortal:O04510
PaxDb:O04510 PRIDE:O04510 EnsemblPlants:AT1G09880.1 GeneID:837520
KEGG:ath:AT1G09880 TAIR:At1g09880 eggNOG:NOG244712
HOGENOM:HOG000243636 InParanoid:O04510 OMA:GSTLWEI PhylomeDB:O04510
ProtClustDB:CLSN2912685 Genevestigator:O04510 Uniprot:O04510
Length = 631
Score = 1715 (608.8 bits), Expect = 4.2e-184, Sum P(2) = 4.2e-184
Identities = 325/562 (57%), Positives = 411/562 (73%)
Query: 55 IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLK 114
IEAT +VI + E++ELSFTR W ++ T PVNIDKRF+ML+ SSGFYSYAI++RL+
Sbjct: 63 IEATKMEVITQNDEKIELSFTRTWNT-SSTTAVPVNIDKRFVMLQNSSGFYSYAIFERLQ 121
Query: 115 GWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDR--ERGHKLAYEEAVLL---- 168
GWP +L N R+VFK N KFHYM IS +RQR MP+ DR RG LAY EAV L
Sbjct: 122 GWPAVELDNMRLVFKLNKKKFHYMAISDDRQRYMPVPDDRVPPRGQPLAYPEAVQLLDPI 181
Query: 169 -P--NGEVDDKYQYSMDAKDIRVHGWISTDSTVGFWQILPSSESRSFGPLKQFLTSHTGP 225
P GEVDDKY+YSM++KDI+VHGWIST+ +VGFWQI PS+E RS GPLKQFL SH GP
Sbjct: 182 EPEFKGEVDDKYEYSMESKDIKVHGWISTNDSVGFWQITPSNEFRSAGPLKQFLGSHVGP 241
Query: 226 ISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKGDRQMLWRDANRQFMNE 285
++ FHSTHYVG + M FK+GEAWKK+FGP +Y+NS D +LW +A Q E
Sbjct: 242 TNLAVFHSTHYVGADLIMSFKNGEAWKKVFGPVFIYLNSFPKGVDPLLLWHEAKNQTKIE 301
Query: 286 VKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQTE 345
+ WPY F AS DF S++RGS+SGRL+V+DR++S I A G+YVGLA PG GSWQ E
Sbjct: 302 EEKWPYNFTASDDFPASDQRGSVSGRLLVRDRFISSEDIPANGSYVGLAAPGDVGSWQRE 361
Query: 346 CKGYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLVY 405
CKGYQFW+ A+E G+FSI NV G YNLYA+ PGFIGD+ I+ GS +G+LVY
Sbjct: 362 CKGYQFWSKADENGSFSINNVRSGRYNLYAFAPGFIGDYHNDTVFDISPGSKISLGDLVY 421
Query: 406 KAPRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVKH-DRFRQYGLWERYAELHRKRD 464
+ PR+G TLWEIG+PDRSAAEFYIP+PNP ++NKLY+ H D++RQYGLWERY+EL+ D
Sbjct: 422 EPPRDGSTLWEIGVPDRSAAEFYIPDPNPSFVNKLYLNHSDKYRQYGLWERYSELYPDED 481
Query: 465 LVYEVWANNYRKDWYFAQNTRKKGNK-YEGSTWQIQFKLEGVVKKAT--YKLRVAVAAAH 521
+VY V ++Y K+W+F Q TRK+ N Y+G+TWQI+F+ + +K T +KLR+A+A ++
Sbjct: 482 MVYNVDIDDYSKNWFFMQVTRKQANGGYKGTTWQIRFQFDDKMKNVTGNFKLRIALATSN 541
Query: 522 GAELQVRVNSRSARRPLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLS 581
AELQVRVN SA PLF + +GR+N IARHGIHG+Y L++V+VP L GNNTIYL+
Sbjct: 542 VAELQVRVNDLSADPPLFRTEQIGRDNTIARHGIHGLYWLYSVNVPAASLHVGNNTIYLT 601
Query: 582 QPRKLDAFTGIMYDYLRFEGPD 603
Q F G+MYDY+R E PD
Sbjct: 602 QALATSPFQGLMYDYIRLEYPD 623
Score = 93 (37.8 bits), Expect = 4.2e-184, Sum P(2) = 4.2e-184
Identities = 20/40 (50%), Positives = 27/40 (67%)
Query: 16 MNNGILQVSISTPQGFVIGIQYKGNKNLLNVQ-NEEDNRG 54
M N LQ+++S P+GFV GIQY G N+L N+E +RG
Sbjct: 1 MENRFLQLTLSNPEGFVTGIQYNGIDNVLAYYTNKEYDRG 40
>TAIR|locus:2136007
symbol:AT4G24430 species:3702 "Arabidopsis thaliana"
[GO:0005576 "extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0016829 "lyase activity"
evidence=ISS] [GO:0030246 "carbohydrate binding" evidence=IEA]
[GO:0005774 "vacuolar membrane" evidence=IDA] InterPro:IPR013784
InterPro:IPR014766 GO:GO:0005774 EMBL:CP002687 GO:GO:0030246
GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 EMBL:AL161561 GO:GO:0016829 SUPFAM:SSF49452
EMBL:AL078637 CAZy:PL4 InterPro:IPR010325 Pfam:PF06045
HOGENOM:HOG000243636 IPI:IPI00533871 PIR:T09906 RefSeq:NP_567703.4
UniGene:At.26232 ProteinModelPortal:Q9STV1 PRIDE:Q9STV1
EnsemblPlants:AT4G24430.1 GeneID:828545 KEGG:ath:AT4G24430
TAIR:At4g24430 InParanoid:Q9STV1 OMA:PISLAMF PhylomeDB:Q9STV1
ProtClustDB:CLSN2927432 ArrayExpress:Q9STV1 Genevestigator:Q9STV1
Uniprot:Q9STV1
Length = 646
Score = 1661 (589.8 bits), Expect = 1.1e-183, Sum P(2) = 1.1e-183
Identities = 311/567 (54%), Positives = 412/567 (72%)
Query: 55 IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLK 114
I+ T+++V++ +E VE+SF+R W +IAP+N+DKRF+M + +GFYSYAI++ L
Sbjct: 77 IKGTSFEVVVENEELVEISFSRKWDSSLQDSIAPINVDKRFIMRKDVTGFYSYAIFEHLA 136
Query: 115 GWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDR--ERGHKLAYEEAVLL--P- 169
WP F L R+V+K DKF YM I+ NRQR+MPL +DR +RG LAY EAVLL P
Sbjct: 137 EWPAFNLPQTRIVYKLRKDKFKYMAIADNRQRKMPLPEDRLGKRGRPLAYPEAVLLVHPV 196
Query: 170 ----NGEVDDKYQYSMDAKDIRVHGWISTDSTVGFWQILPSSESRSFGPLKQFLTSHTGP 225
GEVDDKY+YS + KD++VHGWIS + +G WQI+PS+E RS G KQ LTSH GP
Sbjct: 197 EDEFKGEVDDKYEYSSENKDLKVHGWISHNLDLGCWQIIPSNEFRSGGLSKQNLTSHVGP 256
Query: 226 ISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGK-GDRQMLWRDANRQFMN 284
IS+ F S HY GE+ MK K G++WKK+FGP Y+N + K D LW+DA Q +
Sbjct: 257 ISLAMFLSAHYAGEDMVMKVKAGDSWKKVFGPVFTYLNCLPDKTSDPLSLWQDAKNQMLT 316
Query: 285 EVKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQT 344
EV+SWPY FPAS+DF S+KRG ISGRL+V D+++S + A GA+VGLA PG GSWQ
Sbjct: 317 EVQSWPYDFPASEDFPVSDKRGCISGRLLVCDKFLSDDFLPANGAFVGLAPPGEVGSWQL 376
Query: 345 ECKGYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLV 404
E KGYQFWT A+ G F+I ++ G YNL ++ G+IGD++Y I ITAG +GN+V
Sbjct: 377 ESKGYQFWTEADSDGYFAINDIREGEYNLNGYVTGWIGDYQYEQLINITAGCDIDVGNIV 436
Query: 405 YKAPRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVKH-DRFRQYGLWERYAELHRKR 463
Y+ PR+GPT+WEIGIPDRSAAEF++P+PNPKYINKLY+ H DRFRQYGLWERY EL+ K
Sbjct: 437 YEPPRDGPTVWEIGIPDRSAAEFFVPDPNPKYINKLYIGHPDRFRQYGLWERYTELYPKE 496
Query: 464 DLVYEVWANNYRKDWYFAQNTRKKGNK-YEGSTWQIQFKLEGVVKKATYKLRVAVAAAHG 522
DLV+ + ++Y+KDW+FA TRK G+ Y+ +TWQI+FKLE V K TYK+R+A+A A+
Sbjct: 497 DLVFTIGVSDYKKDWFFAHVTRKMGDDTYQKTTWQIKFKLENVQKSCTYKIRIALATANV 556
Query: 523 AELQVRVNSRSARR--PLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYL 580
AELQVR+N + P+F++G +G +NAIARHGIHG+Y+L+NVDVP + L +G+NT++L
Sbjct: 557 AELQVRMNDDDTEKTTPIFTTGVIGHDNAIARHGIHGIYRLYNVDVPSEKLVEGDNTLFL 616
Query: 581 SQPRKLD-AFTGIMYDYLRFEGPDPNS 606
+Q AF G+MYDY+R EGP +S
Sbjct: 617 TQTMTTTGAFNGLMYDYIRLEGPPLDS 643
Score = 143 (55.4 bits), Expect = 1.1e-183, Sum P(2) = 1.1e-183
Identities = 29/50 (58%), Positives = 35/50 (70%)
Query: 5 VQLHEQNNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG 54
VQL Q +HVVM NG ++V+IS P GFV GI Y+G NLL NE+ NRG
Sbjct: 7 VQLDVQESHVVMGNGKVKVTISKPDGFVTGISYQGVDNLLETHNEDFNRG 56
>TAIR|locus:2066040
symbol:AT2G22620 species:3702 "Arabidopsis thaliana"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0005576
"extracellular region" evidence=ISM] [GO:0005975 "carbohydrate
metabolic process" evidence=IEA] [GO:0016829 "lyase activity"
evidence=ISS] [GO:0030246 "carbohydrate binding" evidence=IEA]
InterPro:IPR011013 InterPro:IPR013784 InterPro:IPR014766
EMBL:CP002685 GO:GO:0005975 GO:GO:0030246 GO:GO:0004180
Gene3D:2.60.40.1120 InterPro:IPR008979 SUPFAM:SSF49785
SUPFAM:SSF74650 EMBL:AC006340 GO:GO:0016829 SUPFAM:SSF49452
CAZy:PL4 InterPro:IPR010325 Pfam:PF06045 ProtClustDB:CLSN2913234
IPI:IPI00518524 PIR:G84614 RefSeq:NP_179847.1 UniGene:At.39414
UniGene:At.39416 ProteinModelPortal:Q9ZQ51 PRIDE:Q9ZQ51
EnsemblPlants:AT2G22620.1 GeneID:816793 KEGG:ath:AT2G22620
TAIR:At2g22620 InParanoid:Q9ZQ51 OMA:VWAIGQA PhylomeDB:Q9ZQ51
Genevestigator:Q9ZQ51 Uniprot:Q9ZQ51
Length = 677
Score = 1622 (576.0 bits), Expect = 9.7e-167, P = 9.7e-167
Identities = 301/558 (53%), Positives = 391/558 (70%)
Query: 55 IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLK 114
+E T +++I + +EQ+E+SFTR W G++ P+N+DKR+++ G SG Y Y I +RL+
Sbjct: 118 LEGTKFEIITQNEEQIEISFTRTWTISRRGSLVPLNVDKRYIIRSGVSGLYMYGILERLE 177
Query: 115 GWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRERGHKLAYEEAVLLPN---- 170
GWP + R+VFK NP KF +M IS +RQR MP DRE LAY+EAVLL N
Sbjct: 178 GWPDVDMDQIRIVFKLNPKKFDFMAISDDRQRSMPSMADRENSKSLAYKEAVLLTNPSNP 237
Query: 171 ---GEVDDKYQYSMDAKDIRVHGWISTDSTVGFWQILPSSESRSFGPLKQFLTSHTGPIS 227
GEVDDKY YSM+ KD VHGWIS+D VGFW I PS E R GP+KQ LTSH GPI+
Sbjct: 238 MFKGEVDDKYMYSMEDKDNNVHGWISSDPPVGFWMITPSDEFRLGGPIKQDLTSHAGPIT 297
Query: 228 INTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKGDRQMLWRDANRQFMNEVK 287
++ F STHY G+ M +++GE WKK+FGP L Y+NSV+ K LWRDA RQ EVK
Sbjct: 298 LSMFTSTHYAGKEMRMDYRNGEPWKKVFGPVLAYLNSVSPKDSTLRLWRDAKRQMAAEVK 357
Query: 288 SWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQTECK 347
SWPY F S+D+ ++RG++ G+ ++KD YVSR I K A+VGLA G AGSWQTE K
Sbjct: 358 SWPYDFITSEDYPLRHQRGTLEGQFLIKDSYVSRLKIYGKFAFVGLAPIGEAGSWQTESK 417
Query: 348 GYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLVYKA 407
GYQFWT A+ G F I+NV GNY+LYAW GFIGD+KY I IT GS +G LVY+
Sbjct: 418 GYQFWTKADRRGRFIIENVRAGNYSLYAWGSGFIGDYKYEQNITITPGSEMNVGPLVYEP 477
Query: 408 PRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVK--HDRFRQYGLWERYAELHRKRDL 465
PRNGPTLWEIG+PDR+A EFYIP+P P +NKLYV DRFRQYGLW+RYA+L+ + DL
Sbjct: 478 PRNGPTLWEIGVPDRTAGEFYIPDPYPTLMNKLYVNPLQDRFRQYGLWDRYADLYPQNDL 537
Query: 466 VYEVWANNYRKDWYFAQNTRKKGNK-YEGSTWQIQFKLEGVVKKATYKLRVAVAAAHGAE 524
VY + ++YR DW+FA R GN Y+ +TWQI F L+ V + Y LR+A+A+A +E
Sbjct: 538 VYTIGVSDYRSDWFFAHVARNVGNDTYQPTTWQIIFNLKNVNRIGRYTLRIALASAADSE 597
Query: 525 LQVRVNSRSARRPLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLSQPR 584
LQ+R+N + +F++G +G++NAIARHGIHG+Y+L+++DV G +L G+NTI+L+Q R
Sbjct: 598 LQIRINDPKSDA-IFTTGFIGKDNAIARHGIHGLYRLYSIDVAGNLLSVGDNTIFLTQTR 656
Query: 585 KLDAFTGIMYDYLRFEGP 602
F GIMYDY+R E P
Sbjct: 657 SRTPFQGIMYDYIRLESP 674
Score = 365 (133.5 bits), Expect = 9.4e-31, P = 9.4e-31
Identities = 84/218 (38%), Positives = 126/218 (57%)
Query: 2 AAGVQLHEQNNH-VVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG------ 54
A VQL + VV++NGI+QV+ S P+G + GI+Y G N+L+ ++ D+RG
Sbjct: 49 ALTVQLRRVGHDTVVVDNGIVQVTFSNPEGLITGIKYHGIDNVLD--DKIDDRGYWDVVW 106
Query: 55 -----------IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSG 103
+E T +++I + +EQ+E+SFTR W G++ P+N+DKR+++ G SG
Sbjct: 107 YEPEKKQKTDKLEGTKFEIITQNEEQIEISFTRTWTISRRGSLVPLNVDKRYIIRSGVSG 166
Query: 104 FYSYAIYKRLKGWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRERGHKLAYE 163
Y Y I +RL+GWP + R+VFK NP KF +M IS +RQR MP DRE LAY+
Sbjct: 167 LYMYGILERLEGWPDVDMDQIRIVFKLNPKKFDFMAISDDRQRSMPSMADRENSKSLAYK 226
Query: 164 EAVLLPNGEVDDKYQYSMDAKDIRVHGWISTDSTVGFW 201
EAVLL N + ++ +D K ++ D+ V W
Sbjct: 227 EAVLLTNPS-NPMFKGEVDDK--YMYSMEDKDNNVHGW 261
>TAIR|locus:2121095
symbol:AT4G38030 species:3702 "Arabidopsis thaliana"
[GO:0005576 "extracellular region" evidence=ISM] [GO:0016829 "lyase
activity" evidence=ISS] [GO:0030246 "carbohydrate binding"
evidence=IEA] InterPro:IPR013784 InterPro:IPR014766 EMBL:CP002687
GO:GO:0030246 GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 GO:GO:0016829 SUPFAM:SSF49452 InterPro:IPR010325
Pfam:PF06045 IPI:IPI00526557 RefSeq:NP_195516.2 UniGene:At.31211
UniGene:At.54646 ProteinModelPortal:F4JSW8
EnsemblPlants:AT4G38030.1 GeneID:829959 KEGG:ath:AT4G38030
OMA:RHGIHGM Uniprot:F4JSW8
Length = 667
Score = 1588 (564.1 bits), Expect = 3.9e-163, P = 3.9e-163
Identities = 298/563 (52%), Positives = 398/563 (70%)
Query: 55 IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLK 114
IE TN+++I +T+EQVE+SF+R W+ +G I P+N+DKR+++ R +SG Y Y I++RL
Sbjct: 111 IEGTNFRIITQTQEQVEISFSRTWE---DGHI-PLNVDKRYIIRRNTSGIYMYGIFERLP 166
Query: 115 GWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRE--RGHK--LAYEEAVLL-- 168
WP + R+ FK NP++FHYM ++ NRQREMP + DR+ RGH L Y+EAV L
Sbjct: 167 EWPELDMGLIRIAFKLNPERFHYMAVADNRQREMPTEDDRDIKRGHAKALGYKEAVQLTH 226
Query: 169 PNG-----EVDDKYQYSMDAKDIRVHGWISTDSTVGFWQILPSSESRSFGPLKQFLTSHT 223
P+ +VDDKYQY+ + KD +VHGWIST S VGFW I PS E RS GP+KQ LTSH
Sbjct: 227 PHNSMFKNQVDDKYQYTCEIKDNKVHGWISTKSRVGFWIISPSGEYRSGGPIKQELTSHV 286
Query: 224 GPISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKGDR--QMLWRDANRQ 281
GP +I TF S HYVG + ++ GEAWKK+ GP +Y+NS + ++ +LW DA +Q
Sbjct: 287 GPTAITTFISGHYVGADMEAHYRPGEAWKKVLGPVFIYLNSDSTSNNKPQDLLWEDAKQQ 346
Query: 282 FMNEVKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKGAYVGLAKPGRAGS 341
EVK+WPY F AS D+ +RGS++GRL+V DR+++ K AYVGLA PG AGS
Sbjct: 347 SEKEVKAWPYDFVASSDYLSRRERGSVTGRLLVNDRFLT----PGKSAYVGLAPPGEAGS 402
Query: 342 WQTECKGYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIG 401
WQT KGYQFWT NE G F+I+NV G YNLY W+PGFIGDF+Y + + AGS +G
Sbjct: 403 WQTNTKGYQFWTKTNETGYFTIENVRPGTYNLYGWVPGFIGDFRYQNLVNVAAGSVISLG 462
Query: 402 NLVYKAPRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVKH-DRFRQYGLWERYAELH 460
+VYK PRNGPTLWEIG+PDR+A E++IP P +N LY+ H D+FRQYGLW+RY EL+
Sbjct: 463 RVVYKPPRNGPTLWEIGVPDRTAREYFIPEPYKDTMNPLYLNHTDKFRQYGLWQRYTELY 522
Query: 461 RKRDLVYEVWANNYRKDWYFAQNTRKKGN-KYEGSTWQIQFKLEGVVKKATYKLRVAVAA 519
DLVY V +NY +DW++AQ TRK G+ Y + WQI F L V + +Y L+VA+A+
Sbjct: 523 PTHDLVYTVGVSNYSQDWFYAQVTRKTGDLTYVPTIWQIVFHLPYVNSRGSYTLQVALAS 582
Query: 520 AHGAELQVRVNSRSARRPLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIY 579
A A LQVR+N++++ P FS+G++G++NAIARHGIHG+Y+L+ +DVPG++LR G NTIY
Sbjct: 583 AAWANLQVRINNQNSW-PFFSTGTIGKDNAIARHGIHGMYRLYTIDVPGRLLRTGTNTIY 641
Query: 580 LSQPRKLDAFTGIMYDYLRFEGP 602
L QP+ F G+MYDY+R E P
Sbjct: 642 LRQPKAQGPFEGLMYDYIRLEEP 664
Score = 361 (132.1 bits), Expect = 2.5e-30, P = 2.5e-30
Identities = 80/197 (40%), Positives = 121/197 (61%)
Query: 8 HEQNNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNV------------QNEEDNRG- 54
H V+++NGI++VS S PQG + GI+YKG N+L+ Q +G
Sbjct: 48 HHGTREVIVDNGIIRVSFSNPQGLITGIKYKGIDNVLHPHLRDRGYWDITWQGTNIRQGG 107
Query: 55 ---IEATNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYK 111
IE TN+++I +T+EQVE+SF+R W+ +G I P+N+DKR+++ R +SG Y Y I++
Sbjct: 108 LDRIEGTNFRIITQTQEQVEISFSRTWE---DGHI-PLNVDKRYIIRRNTSGIYMYGIFE 163
Query: 112 RLKGWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRE--RGHK--LAYEEAVL 167
RL WP + R+ FK NP++FHYM ++ NRQREMP + DR+ RGH L Y+EAV
Sbjct: 164 RLPEWPELDMGLIRIAFKLNPERFHYMAVADNRQREMPTEDDRDIKRGHAKALGYKEAVQ 223
Query: 168 LPNGEVDDKYQYSMDAK 184
L + + ++ +D K
Sbjct: 224 LTHPH-NSMFKNQVDDK 239
>TAIR|locus:2121090
symbol:AT4G37950 species:3702 "Arabidopsis thaliana"
[GO:0005576 "extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0016829 "lyase activity"
evidence=ISS] [GO:0030246 "carbohydrate binding" evidence=IEA]
InterPro:IPR013784 InterPro:IPR014766 EMBL:CP002687 GO:GO:0030246
GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 GO:GO:0016829 SUPFAM:SSF49452 CAZy:PL4
InterPro:IPR010325 Pfam:PF06045 HOGENOM:HOG000243636 EMBL:BT004121
EMBL:AK228481 IPI:IPI00517709 RefSeq:NP_195508.2 UniGene:At.48950
UniGene:At.67725 ProteinModelPortal:Q84W85
EnsemblPlants:AT4G37950.1 GeneID:829951 KEGG:ath:AT4G37950
TAIR:At4g37950 InParanoid:Q84W85 OMA:GLWDRYS PhylomeDB:Q84W85
ProtClustDB:CLSN2913234 Genevestigator:Q84W85 Uniprot:Q84W85
Length = 678
Score = 1518 (539.4 bits), Expect = 1.0e-155, P = 1.0e-155
Identities = 287/562 (51%), Positives = 388/562 (69%)
Query: 58 TNYKVIMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRLKGWP 117
T + ++ +T EQ+E+SF+R + +G P+N+DKR+++ RG SG Y YA+ +RL GWP
Sbjct: 120 TKFDIVNQTSEQIEISFSRTFSQRGSGI--PLNVDKRYIIRRGVSGIYMYAVLERLIGWP 177
Query: 118 GFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRE----RGHKLAYEEAVLLPN--- 170
+ R+VFK N KF +M +S NRQ+ MP DR+ R LAY+EAV L N
Sbjct: 178 DVDMDQIRIVFKLNTTKFDFMAVSDNRQKIMPFDTDRDITKGRASPLAYKEAVHLINPQN 237
Query: 171 ----GEVDDKYQYSMDAKDIRVHGWISTDSTVGFWQILPSSESRSFGPLKQFLTSHTGPI 226
G+VDDKY YS++ KD +VHGWIS+D +GFW I PS E + GP+KQ LTSH GP
Sbjct: 238 HMLKGQVDDKYMYSVENKDNKVHGWISSDQRIGFWMITPSDEFHACGPIKQDLTSHVGPT 297
Query: 227 SINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKGDRQMLWRDANRQFMNEV 286
+++ F S HY G++ +K E WKK+FGP VY+NS + R +LW DA RQ ++EV
Sbjct: 298 TLSMFTSVHYAGKDMNTNYKSKEPWKKVFGPVFVYLNSASS---RNLLWTDAKRQMVSEV 354
Query: 287 KSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAG-IAAKGAYVGLAKPGRAGSWQTE 345
+SWPY F S D+ ++RG++ G+L V DRY+ + + A+VGLA PG AGSWQTE
Sbjct: 355 QSWPYDFVKSVDYPLHHQRGTVKGQLFVIDRYIKNVTYLFGQFAFVGLALPGEAGSWQTE 414
Query: 346 CKGYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLVY 405
KGYQFWT A++ G F+I NV G Y+LYAW+ GFIGD+KY I IT G +G++VY
Sbjct: 415 NKGYQFWTRADKMGMFTIANVRPGTYSLYAWVSGFIGDYKYVRDITITPGREIDVGHIVY 474
Query: 406 KAPRNGPTLWEIGIPDRSAAEFYIPNPNPKYINKLYVKH----DRFRQYGLWERYAELHR 461
PRNGPTLWEIG PDR+AAEFYIP+P+P KLY+ + DRFRQYGLW+RY+ L+
Sbjct: 475 VPPRNGPTLWEIGQPDRTAAEFYIPDPDPTLFTKLYLNYSNPQDRFRQYGLWDRYSVLYP 534
Query: 462 KRDLVYEVWANNYRKDWYFAQNTRKKGN-KYEGSTWQIQFKLEGVVKKATYKLRVAVAAA 520
+ DLV+ ++Y+KDW++A RK GN Y+ +TWQI+F L+ V++ Y LR+A+AAA
Sbjct: 535 RNDLVFTAGVSDYKKDWFYAHVNRKAGNGTYKATTWQIKFNLKAVIQTRIYTLRIALAAA 594
Query: 521 HGAELQVRVNSRSARRPLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYL 580
+L V VN ++ PLF +G +GR+NAIARHGIHG+YKL+N+DV GK+LR GNNTI+L
Sbjct: 595 STIDLLVWVNEVDSK-PLFITGLIGRDNAIARHGIHGLYKLYNIDVHGKLLRVGNNTIFL 653
Query: 581 SQPRKLDAFTGIMYDYLRFEGP 602
+ R D+F+G+MYDYLR EGP
Sbjct: 654 THGRNSDSFSGVMYDYLRLEGP 675
Score = 293 (108.2 bits), Expect = 1.2e-22, P = 1.2e-22
Identities = 66/162 (40%), Positives = 98/162 (60%)
Query: 11 NNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG-----------IEATN 59
+N VV++NGI+ V+ S+PQG + I+Y G N+LN Q E NRG + +T+
Sbjct: 58 DNQVVVDNGIIDVTFSSPQGLITRIKYNGLNNVLNDQIE--NRGYWDVVWYKPGQVSSTD 115
Query: 60 YKV------IMRTKEQVELSFTRMWQPYTNGTIAPVNIDKRFLMLRGSSGFYSYAIYKRL 113
Y V + +T EQ+E+SF+R + +G P+N+DKR+++ RG SG Y YA+ +RL
Sbjct: 116 YLVGTKFDIVNQTSEQIEISFSRTFSQRGSGI--PLNVDKRYIIRRGVSGIYMYAVLERL 173
Query: 114 KGWPGFQLFNNRMVFKPNPDKFHYMIISGNRQREMPLQQDRE 155
GWP + R+VFK N KF +M +S NRQ+ MP DR+
Sbjct: 174 IGWPDVDMDQIRIVFKLNTTKFDFMAVSDNRQKIMPFDTDRD 215
>TAIR|locus:2200390
symbol:AT1G65210 "AT1G65210" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR
InterPro:IPR008979 SUPFAM:SSF49785 EMBL:AC007230 InterPro:IPR010325
Pfam:PF06045 IPI:IPI00517846 PIR:B96676 RefSeq:NP_176700.1
UniGene:At.66104 ProteinModelPortal:Q9S9K1 PRIDE:Q9S9K1
EnsemblPlants:AT1G65210.1 GeneID:842828 KEGG:ath:AT1G65210
TAIR:At1g65210 eggNOG:NOG282477 InParanoid:Q9S9K1 OMA:ISITQTR
PhylomeDB:Q9S9K1 Genevestigator:Q9S9K1 Uniprot:Q9S9K1
Length = 248
Score = 498 (180.4 bits), Expect = 3.8e-56, Sum P(2) = 3.8e-56
Identities = 91/185 (49%), Positives = 129/185 (69%)
Query: 420 PDRSAAEFYIPNPNPKYINKLYVKH-DRFRQYGLWERYAELHRKRDLVYEVWANNYRKDW 478
P + A E+++P P +N LY+ H D+FRQYGLW+RY EL+ DL+Y + +NY KDW
Sbjct: 62 PHQRAREYFVPEPYKNTMNPLYLNHTDKFRQYGLWQRYTELYPNHDLIYTIGVSNYSKDW 121
Query: 479 YFAQNTRKKGNK-YEGSTWQIQFKLEGVVKKATYKLRVAVAAAHGAELQVRVNSRSARRP 537
+++Q TRK G+ Y +TWQ F L V + +Y L++A+A+A A LQVR N+ R P
Sbjct: 122 FYSQVTRKIGDSTYTPTTWQTVFHLPYVNMRGSYTLQLALASAAWANLQVRFNNEYTR-P 180
Query: 538 LFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLSQPRKLDAFTGIMYDYL 597
FS+G +GR+NAIARHGIHG+Y+L++++VPG++LR G NTIYL Q + G+MYDY+
Sbjct: 181 FFSTGYIGRDNAIARHGIHGLYRLYSINVPGRLLRTGTNTIYLRQAKASGPLEGVMYDYI 240
Query: 598 RFEGP 602
R E P
Sbjct: 241 RLEEP 245
Score = 98 (39.6 bits), Expect = 3.8e-56, Sum P(2) = 3.8e-56
Identities = 17/32 (53%), Positives = 26/32 (81%)
Query: 14 VVMNNGILQVSISTPQGFVIGIQYKGNKNLLN 45
V+++NGI+ VS S+PQG + GI+YKG N+L+
Sbjct: 30 VIVDNGIISVSFSSPQGLITGIKYKGVNNVLH 61
>ASPGD|ASPL0000007043
symbol:rglB species:162425 "Emericella nidulans"
[GO:0016837 "carbon-oxygen lyase activity, acting on
polysaccharides" evidence=IDA] [GO:0045490 "pectin catabolic
process" evidence=IDA] [GO:0030246 "carbohydrate binding"
evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR011013 InterPro:IPR013784 InterPro:IPR014766
GO:GO:0005576 GO:GO:0030246 GO:GO:0045490 EMBL:BN001301
GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 SUPFAM:SSF74650 SUPFAM:SSF49452 EMBL:AACD01000108
GO:GO:0016837 EMBL:DQ490501 RefSeq:XP_663999.1 mycoCLAP:RGL4B_EMENI
EnsemblFungi:CADANIAT00006584 GeneID:2871281 KEGG:ani:AN6395.2
eggNOG:NOG43733 HOGENOM:HOG000217023 OrthoDB:EOG4ZGSMR
Uniprot:Q5AZ85
Length = 660
Score = 203 (76.5 bits), Expect = 7.8e-13, Sum P(2) = 7.8e-13
Identities = 96/371 (25%), Positives = 151/371 (40%)
Query: 83 NGTIAPVNID-KRFLMLR-GSSGFYSYAIYKRLKGWPGF--QLFNNRMVFKPNPDKFHYM 138
N T P + +++ LR G +GF+ ++ F L R +F+PN D + ++
Sbjct: 109 NDTYTPTGQEFQQYWFLRDGETGFHMFSRLAYYNETTPFLRNLQELRTLFRPNTDLWTHL 168
Query: 139 IISGNRQREMPLQQDRERGHKLAYEEAVLLPNGEVDD-----------KYQYSMDAKDIR 187
S + Q PL D ++ ++A N DD KY +S +D
Sbjct: 169 T-SSDLQTA-PLPSDEAIAEQIVVQDATWRLNNTPDDAYYQQFSEYFTKYTFSNHWRDND 226
Query: 188 VHGWISTDST-----VGFWQILPSSESRSFGPLKQFLTSHTGPISINTFHSTHYVGENFG 242
VHG + ST G W ++ + ++ GPL LT I N S H+ GE
Sbjct: 227 VHGLYADGSTSDGTTYGAWLVMNTKDTYYGGPLHSDLT--VDGIIYNYIVSNHH-GEGTP 283
Query: 243 MKFKDGEAWKKIFGPFLVYVNSVAGKGDRQMLWRDANRQFMNEVKSWPYKFPAS--KD-- 298
+G + + FGP N G + L RD R + SW F S K
Sbjct: 284 -NITNG--FDRTFGPQFYLFNG-GGSSSLEEL-RDEARSLASP--SWNADFYDSIAKHVI 336
Query: 299 -FARSNKRGSISGRL-IVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQTECKGYQFWTVAN 356
+ S++RGS+ G + + K+ A + G Y + A +Q+W +
Sbjct: 337 GYVPSSQRGSVKGTIKLPKNAKSPIAVLTVDGHYF---QDNSAVP-----SSHQYWADID 388
Query: 357 EGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLVYKAPRNGPTLWE 416
+ G FSI V+ G Y L + G GDF + + A + I +K G +W
Sbjct: 389 KNGRFSIDRVVAGKYRLTVYADGIFGDFTRDGIV-VKARKSTSIKE-TWKPESAGTEIWR 446
Query: 417 IGIPDRSAAEF 427
+G PD+S+ EF
Sbjct: 447 LGTPDKSSGEF 457
Score = 47 (21.6 bits), Expect = 7.8e-13, Sum P(2) = 7.8e-13
Identities = 33/139 (23%), Positives = 51/139 (36%)
Query: 472 NNYRKDWYFAQNTRKKGNKYEGSTWQIQFKLEGVVKKATYKLRVAVAAAHGAELQVRVNS 531
+N DW + K + T + +L G K A+ V A+ A L +R
Sbjct: 523 SNTTHDWRIRFDLSAK-QLHARKTATLTIQLAGA-KAASGNTDVYNASEPYANLPLRSYI 580
Query: 532 RSARRPL-FSSGSVGRENAIARHGIHGVYKLFNV-DVPGKVLRKGNNTIYLSQPRK---- 585
PL G + I R + Y++ + P L++G+N + LS P
Sbjct: 581 NEQEEPLTMVIGYDQSSSCIVRSAV-SCYQVREKWEFPASWLKEGSNLLRLSLPTNGTNY 639
Query: 586 ----LDAFTGIMYDYLRFE 600
L + YD LR E
Sbjct: 640 ESAVLPTSVYVQYDALRLE 658
>UNIPROTKB|Q5AZ85
symbol:rglB "Rhamnogalacturonate lyase B" species:227321
"Aspergillus nidulans FGSC A4" [GO:0016837 "carbon-oxygen lyase
activity, acting on polysaccharides" evidence=IDA] [GO:0045490
"pectin catabolic process" evidence=IDA] InterPro:IPR011013
InterPro:IPR013784 InterPro:IPR014766 GO:GO:0005576 GO:GO:0030246
GO:GO:0045490 EMBL:BN001301 GO:GO:0004180 Gene3D:2.60.40.1120
InterPro:IPR008979 SUPFAM:SSF49785 SUPFAM:SSF74650 SUPFAM:SSF49452
EMBL:AACD01000108 GO:GO:0016837 EMBL:DQ490501 RefSeq:XP_663999.1
mycoCLAP:RGL4B_EMENI EnsemblFungi:CADANIAT00006584 GeneID:2871281
KEGG:ani:AN6395.2 eggNOG:NOG43733 HOGENOM:HOG000217023
OrthoDB:EOG4ZGSMR Uniprot:Q5AZ85
Length = 660
Score = 203 (76.5 bits), Expect = 7.8e-13, Sum P(2) = 7.8e-13
Identities = 96/371 (25%), Positives = 151/371 (40%)
Query: 83 NGTIAPVNID-KRFLMLR-GSSGFYSYAIYKRLKGWPGF--QLFNNRMVFKPNPDKFHYM 138
N T P + +++ LR G +GF+ ++ F L R +F+PN D + ++
Sbjct: 109 NDTYTPTGQEFQQYWFLRDGETGFHMFSRLAYYNETTPFLRNLQELRTLFRPNTDLWTHL 168
Query: 139 IISGNRQREMPLQQDRERGHKLAYEEAVLLPNGEVDD-----------KYQYSMDAKDIR 187
S + Q PL D ++ ++A N DD KY +S +D
Sbjct: 169 T-SSDLQTA-PLPSDEAIAEQIVVQDATWRLNNTPDDAYYQQFSEYFTKYTFSNHWRDND 226
Query: 188 VHGWISTDST-----VGFWQILPSSESRSFGPLKQFLTSHTGPISINTFHSTHYVGENFG 242
VHG + ST G W ++ + ++ GPL LT I N S H+ GE
Sbjct: 227 VHGLYADGSTSDGTTYGAWLVMNTKDTYYGGPLHSDLT--VDGIIYNYIVSNHH-GEGTP 283
Query: 243 MKFKDGEAWKKIFGPFLVYVNSVAGKGDRQMLWRDANRQFMNEVKSWPYKFPAS--KD-- 298
+G + + FGP N G + L RD R + SW F S K
Sbjct: 284 -NITNG--FDRTFGPQFYLFNG-GGSSSLEEL-RDEARSLASP--SWNADFYDSIAKHVI 336
Query: 299 -FARSNKRGSISGRL-IVKDRYVSRAGIAAKGAYVGLAKPGRAGSWQTECKGYQFWTVAN 356
+ S++RGS+ G + + K+ A + G Y + A +Q+W +
Sbjct: 337 GYVPSSQRGSVKGTIKLPKNAKSPIAVLTVDGHYF---QDNSAVP-----SSHQYWADID 388
Query: 357 EGGNFSIKNVLIGNYNLYAWIPGFIGDFKYHAAIRITAGSAKQIGNLVYKAPRNGPTLWE 416
+ G FSI V+ G Y L + G GDF + + A + I +K G +W
Sbjct: 389 KNGRFSIDRVVAGKYRLTVYADGIFGDFTRDGIV-VKARKSTSIKE-TWKPESAGTEIWR 446
Query: 417 IGIPDRSAAEF 427
+G PD+S+ EF
Sbjct: 447 LGTPDKSSGEF 457
Score = 47 (21.6 bits), Expect = 7.8e-13, Sum P(2) = 7.8e-13
Identities = 33/139 (23%), Positives = 51/139 (36%)
Query: 472 NNYRKDWYFAQNTRKKGNKYEGSTWQIQFKLEGVVKKATYKLRVAVAAAHGAELQVRVNS 531
+N DW + K + T + +L G K A+ V A+ A L +R
Sbjct: 523 SNTTHDWRIRFDLSAK-QLHARKTATLTIQLAGA-KAASGNTDVYNASEPYANLPLRSYI 580
Query: 532 RSARRPL-FSSGSVGRENAIARHGIHGVYKLFNV-DVPGKVLRKGNNTIYLSQPRK---- 585
PL G + I R + Y++ + P L++G+N + LS P
Sbjct: 581 NEQEEPLTMVIGYDQSSSCIVRSAV-SCYQVREKWEFPASWLKEGSNLLRLSLPTNGTNY 639
Query: 586 ----LDAFTGIMYDYLRFE 600
L + YD LR E
Sbjct: 640 ESAVLPTSVYVQYDALRLE 658
>ASPGD|ASPL0000094461 [details] [associations]
symbol:AN12098 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR011013 InterPro:IPR013784
InterPro:IPR014766 GO:GO:0005576 GO:GO:0000272 GO:GO:0030246
EMBL:BN001302 GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 SUPFAM:SSF74650 GO:GO:0016829 EMBL:AACD01000067
SUPFAM:SSF49452 eggNOG:NOG72373 RefSeq:XP_661743.1 PRIDE:Q5B5P1
DNASU:2873563 EnsemblFungi:CADANIAT00004533 GeneID:2873563
KEGG:ani:AN4139.2 OrthoDB:EOG4MPMZC Uniprot:Q5B5P1
Length = 1041
Score = 150 (57.9 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
Identities = 131/566 (23%), Positives = 213/566 (37%)
Query: 1 SAAGVQLHEQNNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG----IE 56
SAA + + H ++N V+++ G V+ G ++LL + +G
Sbjct: 19 SAAALTTTSNSTHYTISNSRFSVAVAKSNGHVVDANLDG-QDLLGPLSGNSGKGPYLDCS 77
Query: 57 ATNYKVIMRTKEQVELSFT-RMWQPYTN----GTIAPVNID-KRFLMLRGS-SGFYSYA- 108
T E ++ T PY T N ++L LRG +G ++++
Sbjct: 78 CTPEGFWTPGAEPALVNGTDSTGTPYVGVIMTDTYETTNQTLSQYLFLRGEETGLHAFSR 137
Query: 109 -IYKRLKGWPGFQLFNNRMVFKPNPDKF-HYMIISGNRQREMPLQQDRERGHKLAYEEAV 166
Y + L R +F+PN + + H+ GN MPL K+ ++A
Sbjct: 138 VTYYNESDYFLRGLGELRTLFRPNTNLWTHFSGSEGN-YGPMPLSSTE----KITVQDAT 192
Query: 167 LLPNGEVDDKY--QYS---------MDAKDIRVHGWISTDST------VGFWQILPSSES 209
DD Y QYS +D VHG S ST G W + + E+
Sbjct: 193 TYLGDTTDDPYVSQYSDYFTKYTLTESWRDHDVHGHFSNGSTSGDGNTYGAWLVHNTRET 252
Query: 210 RSFGPLKQFLTSHTGPISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKG 269
GPL L I N S HY N + + + FGP + NS G G
Sbjct: 253 YYGGPLHADLV--VDGIVYNYIVSGHYGAPNPNLT----HGFDRTFGPQYYHFNS-GGPG 305
Query: 270 DR-QMLWRDANRQFMNEVKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKG 328
+ L DA Q+ + W +F S N S +GR + + G AK
Sbjct: 306 TTLEELRADA-AQYASP--EWNAEFYDSIAKHIPNYVPS-TGRTTFRGKVNLPKG--AKK 359
Query: 329 AYVGLAKPGRAGSWQTECK-GYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYH 387
+ L++ + K Q+W + G F+I V+ G Y + + G F
Sbjct: 360 PIIVLSENEQDFQLNVFKKDSLQYWAEIDGSGAFTIPRVVKGTYRVTIYADEIFGWF-IK 418
Query: 388 AAIRITAGSAKQIGNLVYKAPRNGPTLWEIGIPDRSAAEF---YIPNPNPKYINKLYVKH 444
+++ +A +K G +W IG+PD+S+ EF Y P+ + ++
Sbjct: 419 DNVKVIGSNAH---TFTWKEETAGKEIWRIGVPDKSSGEFLHGYAPDTSKP------LQP 469
Query: 445 DRFRQYGLWERYAELHRKRDLV-YEVWANNYRKD-----WYF--AQNTRKKGNKY--EGS 494
+++R Y W +Y + V Y V ++ KD W F +Q + Y +
Sbjct: 470 EQYRIY--WGKYDYPSDFPEGVNYHVGKSDPAKDLNYIHWSFFPSQGNHLRNEPYYQNVN 527
Query: 495 TWQIQFKLEGV----VKKATYKLRVA 516
W I F L K AT+ +++A
Sbjct: 528 NWTITFDLTASQLRNTKTATFTVQLA 553
Score = 46 (21.3 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
Identities = 18/47 (38%), Positives = 22/47 (46%)
Query: 537 PLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLSQP 583
P + SGS G + + +K F D GK LRKG N LS P
Sbjct: 591 PYWRSGSCGVRSGVQCQNTE--HK-FVFDA-GK-LRKGRNEFVLSLP 632
>ASPGD|ASPL0000100087 [details] [associations]
symbol:rglC species:162425 "Emericella nidulans"
[GO:0005575 "cellular_component" evidence=ND] [GO:0003824
"catalytic activity" evidence=IEA] [GO:0005975 "carbohydrate
metabolic process" evidence=IEA] [GO:0030246 "carbohydrate binding"
evidence=IEA] InterPro:IPR011013 InterPro:IPR013784
InterPro:IPR014766 GO:GO:0005576 GO:GO:0000272 GO:GO:0030246
EMBL:BN001302 GO:GO:0004180 Gene3D:2.60.40.1120 InterPro:IPR008979
SUPFAM:SSF49785 SUPFAM:SSF74650 GO:GO:0016829 EMBL:AACD01000067
SUPFAM:SSF49452 eggNOG:NOG72373 RefSeq:XP_661743.1 PRIDE:Q5B5P1
DNASU:2873563 EnsemblFungi:CADANIAT00004533 GeneID:2873563
KEGG:ani:AN4139.2 OrthoDB:EOG4MPMZC Uniprot:Q5B5P1
Length = 1041
Score = 150 (57.9 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
Identities = 131/566 (23%), Positives = 213/566 (37%)
Query: 1 SAAGVQLHEQNNHVVMNNGILQVSISTPQGFVIGIQYKGNKNLLNVQNEEDNRG----IE 56
SAA + + H ++N V+++ G V+ G ++LL + +G
Sbjct: 19 SAAALTTTSNSTHYTISNSRFSVAVAKSNGHVVDANLDG-QDLLGPLSGNSGKGPYLDCS 77
Query: 57 ATNYKVIMRTKEQVELSFT-RMWQPYTN----GTIAPVNID-KRFLMLRGS-SGFYSYA- 108
T E ++ T PY T N ++L LRG +G ++++
Sbjct: 78 CTPEGFWTPGAEPALVNGTDSTGTPYVGVIMTDTYETTNQTLSQYLFLRGEETGLHAFSR 137
Query: 109 -IYKRLKGWPGFQLFNNRMVFKPNPDKF-HYMIISGNRQREMPLQQDRERGHKLAYEEAV 166
Y + L R +F+PN + + H+ GN MPL K+ ++A
Sbjct: 138 VTYYNESDYFLRGLGELRTLFRPNTNLWTHFSGSEGN-YGPMPLSSTE----KITVQDAT 192
Query: 167 LLPNGEVDDKY--QYS---------MDAKDIRVHGWISTDST------VGFWQILPSSES 209
DD Y QYS +D VHG S ST G W + + E+
Sbjct: 193 TYLGDTTDDPYVSQYSDYFTKYTLTESWRDHDVHGHFSNGSTSGDGNTYGAWLVHNTRET 252
Query: 210 RSFGPLKQFLTSHTGPISINTFHSTHYVGENFGMKFKDGEAWKKIFGPFLVYVNSVAGKG 269
GPL L I N S HY N + + + FGP + NS G G
Sbjct: 253 YYGGPLHADLV--VDGIVYNYIVSGHYGAPNPNLT----HGFDRTFGPQYYHFNS-GGPG 305
Query: 270 DR-QMLWRDANRQFMNEVKSWPYKFPASKDFARSNKRGSISGRLIVKDRYVSRAGIAAKG 328
+ L DA Q+ + W +F S N S +GR + + G AK
Sbjct: 306 TTLEELRADA-AQYASP--EWNAEFYDSIAKHIPNYVPS-TGRTTFRGKVNLPKG--AKK 359
Query: 329 AYVGLAKPGRAGSWQTECK-GYQFWTVANEGGNFSIKNVLIGNYNLYAWIPGFIGDFKYH 387
+ L++ + K Q+W + G F+I V+ G Y + + G F
Sbjct: 360 PIIVLSENEQDFQLNVFKKDSLQYWAEIDGSGAFTIPRVVKGTYRVTIYADEIFGWF-IK 418
Query: 388 AAIRITAGSAKQIGNLVYKAPRNGPTLWEIGIPDRSAAEF---YIPNPNPKYINKLYVKH 444
+++ +A +K G +W IG+PD+S+ EF Y P+ + ++
Sbjct: 419 DNVKVIGSNAH---TFTWKEETAGKEIWRIGVPDKSSGEFLHGYAPDTSKP------LQP 469
Query: 445 DRFRQYGLWERYAELHRKRDLV-YEVWANNYRKD-----WYF--AQNTRKKGNKY--EGS 494
+++R Y W +Y + V Y V ++ KD W F +Q + Y +
Sbjct: 470 EQYRIY--WGKYDYPSDFPEGVNYHVGKSDPAKDLNYIHWSFFPSQGNHLRNEPYYQNVN 527
Query: 495 TWQIQFKLEGV----VKKATYKLRVA 516
W I F L K AT+ +++A
Sbjct: 528 NWTITFDLTASQLRNTKTATFTVQLA 553
Score = 46 (21.3 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
Identities = 18/47 (38%), Positives = 22/47 (46%)
Query: 537 PLFSSGSVGRENAIARHGIHGVYKLFNVDVPGKVLRKGNNTIYLSQP 583
P + SGS G + + +K F D GK LRKG N LS P
Sbjct: 591 PYWRSGSCGVRSGVQCQNTE--HK-FVFDA-GK-LRKGRNEFVLSLP 632
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.137 0.423 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 606 606 0.00086 120 3 11 22 0.39 34
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 12
No. of states in DFA: 628 (67 KB)
Total size of DFA: 371 KB (2180 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:01
No. of threads or processors used: 24
Search cpu time: 47.82u 0.11s 47.93t Elapsed: 00:00:02
Total cpu time: 47.82u 0.11s 47.93t Elapsed: 00:00:03
Start: Fri May 10 03:44:33 2013 End: Fri May 10 03:44:36 2013
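
Note on the bit scores reported above: the values in parentheses after each raw score (for example "Score = 203 (76.5 bits)") appear to follow the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, using the gapped parameters from the "Q=9,R=2" row of the Parameters block (Lambda = 0.244, K = 0.0300). The sketch below is only an illustration of that arithmetic. The function name `bit_score` is hypothetical and not part of any BLAST tool. The choice of the gapped row is an assumption, made because it reproduces the reported numbers.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of the
# Parameters block in this report. Assumption: these are the values the
# program used when converting raw scores to bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the four HSPs shown for the Aspergillus/Emericella hits.
for raw in (203, 47, 150, 46):
    print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

Rounded to one decimal place, the four raw scores 203, 47, 150 and 46 give 76.5, 21.6, 57.9 and 21.3 bits, matching the values printed in the alignments. Lambda and K are themselves printed to only three significant figures, so agreement beyond the reported precision should not be expected.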