Your job contains 1 sequence.
>009852
MEILSYNCPPNCQVVNLRWKLVKKASQSCESKPPSFREHRILCIRRDFRSGFSGYSISSW
CFSKNLDREKGSNSSNAVQGFRNLNSQVLSGSYDGYVIGGEEDAGSFPKEREAIPKVLIP
GLPDEYNGESGAPITSCFWEWKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLK
DLGKDYRAWAIDFLGQGMSLPDEDPTPRSKEGDSTEEKNFLWGFGDKAQPWASELAYSVD
LWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPHLVKGVTLLNATPFWGFSPNPI
RSPKLARILPWSGTFPLPASVRKLIEFIWQKISDPESIAEVLKQVYADHATNVDTVFTRI
LETTQHPAAAASFASIMFAPQGNLSFREALSRCQMNGVPICLIYGKEDPWVKPVWGLQVK
RQVPEAPYYEISPAGHCPHDEVPEVVNYLLRGWIKNLESQGSVALPLLDDEENIQYVIAR
DLEFVREESKKSVRVRIYGSRFSLWNRIGSFIKSRFRKVEMNSS
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 009852
(524 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2159033 - symbol:PPH "AT5G13800" species:3702 ... 1556 4.6e-163 2
TAIR|locus:2115440 - symbol:AT4G36530 species:3702 "Arabi... 282 4.5e-34 2
TAIR|locus:2832896 - symbol:AT5G19850 species:3702 "Arabi... 199 1.0e-23 2
TAIR|locus:2122654 - symbol:AT4G25290 species:3702 "Arabi... 193 2.3e-15 2
TAIR|locus:2159823 - symbol:AT5G38520 species:3702 "Arabi... 166 2.4e-09 1
UNIPROTKB|Q48LN2 - symbol:catD1 "3-oxoadipate enol-lacton... 95 1.8e-05 2
UNIPROTKB|Q74EB1 - symbol:GSU1052 "Hydrolase or acyltrans... 103 0.00013 2
TIGR_CMR|GSU_1052 - symbol:GSU_1052 "hydrolase, alpha/bet... 103 0.00013 2
ZFIN|ZDB-GENE-080227-1 - symbol:ephx4 "epoxide hydrolase ... 84 0.00018 2
MGI|MGI:2686228 - symbol:Ephx4 "epoxide hydrolase 4" spec... 90 0.00029 2
TAIR|locus:2135843 - symbol:AT4G12830 species:3702 "Arabi... 99 0.00029 3
UNIPROTKB|F1N3G0 - symbol:EPHX4 "Uncharacterized protein"... 87 0.00064 2
>TAIR|locus:2159033 [details] [associations]
symbol:PPH "AT5G13800" species:3702 "Arabidopsis
thaliana" [GO:0016787 "hydrolase activity" evidence=ISS]
[GO:0009507 "chloroplast" evidence=IDA] [GO:0015996 "chlorophyll
catabolic process" evidence=RCA;IMP] [GO:0080124 "pheophytinase
activity" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0009507
eggNOG:COG0596 EMBL:AB005230 GO:GO:0015996 EMBL:AY056386
EMBL:AY070736 EMBL:AY149934 EMBL:AK317089 IPI:IPI00542352
RefSeq:NP_196884.1 RefSeq:NP_850815.1 UniGene:At.23976
ProteinModelPortal:Q9FFZ1 SMR:Q9FFZ1 MEROPS:S33.A34 PRIDE:Q9FFZ1
EnsemblPlants:AT5G13800.1 EnsemblPlants:AT5G13800.2 GeneID:831225
KEGG:ath:AT5G13800 TAIR:At5g13800 HOGENOM:HOG000006184
InParanoid:Q9FFZ1 OMA:VGSFHYE PhylomeDB:Q9FFZ1
ProtClustDB:CLSN2687043 BioCyc:ARA:AT5G13800-MONOMER
BioCyc:MetaCyc:AT5G13800-MONOMER Genevestigator:Q9FFZ1
GO:GO:0080124 Uniprot:Q9FFZ1
Length = 484
Score = 1556 (552.8 bits), Expect = 4.6e-163, Sum P(2) = 4.6e-163
Identities = 281/466 (60%), Positives = 348/466 (74%)
Query: 56 SISSWCFSKNLDREKGSNSSNAVQGFRNLNSQVLSGSYDGYVIGGEEDAGSFPKEREAIP 115
S+ +W R + SS G + + SG+ DGYV+G +D G + E+
Sbjct: 13 SVVTWSSKLATKRLVPNRSSLLFSGVKKSRLVIRSGNSDGYVVGENDDLGRIARRGESTS 72
Query: 116 KVLIPGLPDEYNGESGAPITSCFWEWKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHY 175
KVLIPGLPDE NGE A I+ EWKPK VHYEKAGC+N+++P VLFLPGFGVGSFHY
Sbjct: 73 KVLIPGLPDESNGEIAARISHSHCEWKPKLRVHYEKAGCDNLDAPAVLFLPGFGVGSFHY 132
Query: 176 EKQLKDLGKDYRAWAIDFLGQGMSLPDEDPTPRSKEGDSTEEKNFLWGFGDKAQPWASEL 235
EKQL DLG+DYR WAIDFLGQG+SLP EDPT ++E S+E+K WGFGDK +PWA +L
Sbjct: 133 EKQLTDLGRDYRVWAIDFLGQGLSLPTEDPTTMTEETSSSEDKEPFWGFGDKTEPWADQL 192
Query: 236 AYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPHLVKGVTLLNATPFWGF 295
+S+DLW+DQV YF++EVI EPVY+ GNSLGG+VA+YFAA +PHLVKGVTLLNATPFWGF
Sbjct: 193 VFSLDLWRDQVQYFVEEVIGEPVYIAGNSLGGYVALYFAATHPHLVKGVTLLNATPFWGF 252
Query: 296 SPNPIRSPKLARILPWSGTFPLPASVRKLIEFIWQKISDPESIAEVLKQVYADHATNVDT 355
PNP+RSPKLAR+ PW G FPLP V+K+ E +WQKISDPESIAE+LKQVY DH+ NVD
Sbjct: 253 FPNPVRSPKLARLFPWPGAFPLPERVKKITELVWQKISDPESIAEILKQVYTDHSINVDK 312
Query: 356 VFTRILETTQHPXXXXXXXXXXXXPQGNLSFREALSRCQMNGVPICLIYGKEDPWVKPVW 415
VF+RI+E TQHP P G LSF EALSRC+ N V ICL+YG+EDPWV+P+W
Sbjct: 313 VFSRIVEVTQHPAAAASFASIMLAPGGELSFSEALSRCKENNVQICLMYGREDPWVRPLW 372
Query: 416 GLQVKRQVPEAPYYEISPAGHCPHDEVPEVVNYLLRGWIKNLESQGSVALPLLDD-EENI 474
G ++K+++P APYYEISPAGHCPHDEVPEVVNYL+RGWIK+LES G ALPLL+D EE+
Sbjct: 373 GKKIKKEIPNAPYYEISPAGHCPHDEVPEVVNYLMRGWIKHLESGGFEALPLLEDTEEDW 432
Query: 475 QYV-IARDLEFVREESKKSVRVRIYGSRFSLWNRIGSFIKSRFRKV 519
+ I R++EF R+ KK+V + +YGS ++ W + +S F +V
Sbjct: 433 EESRIGREIEFPRDGWKKAVNLWLYGSNYTYWRGVRESFRSSFIRV 478
Score = 53 (23.7 bits), Expect = 4.6e-163, Sum P(2) = 4.6e-163
Identities = 12/24 (50%), Positives = 13/24 (54%)
Query: 1 MEILSYNCPPNCQVVNLRWKLVKK 24
MEI+S N P C VV KL K
Sbjct: 1 MEIISLNVVPQCSVVTWSSKLATK 24
>TAIR|locus:2115440 [details] [associations]
symbol:AT4G36530 species:3702 "Arabidopsis thaliana"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0005737
"cytoplasm" evidence=ISM] [GO:0009507 "chloroplast"
evidence=ISM;IDA] [GO:0016787 "hydrolase activity" evidence=ISS]
[GO:0009941 "chloroplast envelope" evidence=IDA] [GO:0000023
"maltose metabolic process" evidence=RCA] [GO:0019252 "starch
biosynthetic process" evidence=RCA] [GO:0043085 "positive
regulation of catalytic activity" evidence=RCA] InterPro:IPR000639
PRINTS:PR00412 InterPro:IPR000073 EMBL:CP002687 GO:GO:0009941
GO:GO:0016787 PRINTS:PR00111 EMBL:AL161589 EMBL:Z99708 HSSP:O31243
EMBL:AY087588 EMBL:AK227038 IPI:IPI00542770 PIR:C85431
RefSeq:NP_195371.1 UniGene:At.4614 ProteinModelPortal:O23227
SMR:O23227 STRING:O23227 MEROPS:S33.A01 PRIDE:O23227
EnsemblPlants:AT4G36530.2 GeneID:829805 KEGG:ath:AT4G36530
TAIR:At4g36530 InParanoid:O23227 OMA:FENLRRP PhylomeDB:O23227
ProtClustDB:PLN02578 ArrayExpress:O23227 Genevestigator:O23227
InterPro:IPR026129 PANTHER:PTHR10992:SF20 Uniprot:O23227
Length = 378
Score = 282 (104.3 bits), Expect = 4.5e-34, Sum P(2) = 4.5e-34
Identities = 81/245 (33%), Positives = 112/245 (45%)
Query: 221 LWGFGDKAQPWASE--LAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNP 278
L GFG W+ + + Y +W DQV F+KEV++EP VVGNSLGGF A+ A P
Sbjct: 133 LLGFG-----WSDKALIEYDAMVWTDQVIDFMKEVVKEPAVVVGNSLGGFTALSVAVGLP 187
Query: 279 HLVKGVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVRKLI-EFIWQKISDPES 337
V GV LLN+ + R ++ PL ++++ F++ + P
Sbjct: 188 EQVTGVALLNSAGQFAAESRK-REEADETVITKFIVKPLKEIFQRVVLGFLFWQAKQPSR 246
Query: 338 IAEVLKQVYADHATNVDTVFTR-ILETTQHPXXXXXXXXXXXXPQGNLSFREALSRCQMN 396
I VLK VY D +TNVD I + P N S S
Sbjct: 247 IESVLKSVYID-STNVDDYLVESISKPATDPNAGEVYYRLMTRFLTNQSRYTLDSVLSKM 305
Query: 397 GVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEISPAGHCPHDEVPEVVNYLLRGWIK- 455
P+ L++G DPWV P ++K + + AGHCPHDEVPE VN L W+
Sbjct: 306 TCPLLLVWGDLDPWVGPAKAEKIKAFYSNSSLVHLQ-AGHCPHDEVPEAVNKALLDWLSI 364
Query: 456 NLESQ 460
N+ S+
Sbjct: 365 NIASK 369
Score = 115 (45.5 bits), Expect = 4.5e-34, Sum P(2) = 4.5e-34
Identities = 22/62 (35%), Positives = 33/62 (53%)
Query: 138 FWEWKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDFLGQG 197
FWEW+ +HY G P++ + GFG FH+ + +L K Y+ +A+D LG G
Sbjct: 83 FWEWRGH-KIHYVVQG----EGSPLVLIHGFGASVFHWRYNIPELAKKYKVYALDLLGFG 137
Query: 198 MS 199
S
Sbjct: 138 WS 139
>TAIR|locus:2832896 [details] [associations]
symbol:AT5G19850 species:3702 "Arabidopsis thaliana"
[GO:0005737 "cytoplasm" evidence=ISM] [GO:0016787 "hydrolase
activity" evidence=ISS] InterPro:IPR000073 EMBL:CP002688
GO:GO:0016787 PRINTS:PR00111 IPI:IPI00529562 RefSeq:NP_568381.1
UniGene:At.31247 UniGene:At.66697 ProteinModelPortal:F4K2M9
SMR:F4K2M9 PRIDE:F4K2M9 EnsemblPlants:AT5G19850.1 GeneID:832105
KEGG:ath:AT5G19850 OMA:YTFETWG Uniprot:F4K2M9
Length = 359
Score = 199 (75.1 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
Identities = 58/236 (24%), Positives = 107/236 (45%)
Query: 222 WGFGDKAQP--WASELAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPH 279
+G+ DK P + E Y+ + W +Q+ F +V+++ + + NS+GG V + A P
Sbjct: 131 YGYSDKPNPREFGGEPFYTFETWGEQLNDFCLDVVKDEAFFICNSIGGLVGLQAAVSKPE 190
Query: 280 LVKGVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVRKLIEFIWQKISDPESIA 339
+ +G+ L+N + + P + R P+ +F + + ++ I+ PE++
Sbjct: 191 ICRGLMLINISLRMLHIK---KQPFIGR--PFIKSFQNLLRNTPVGKLFFKSIAKPETVK 245
Query: 340 EVLKQVYADHATNVDTVFTRILETTQHPXXXXXXXXXXXXPQGNLSFREALSRCQMNGVP 399
+L Q Y D + D + IL P G L E L + P
Sbjct: 246 SILCQCYHDSSQVTDELVEAILRPGLEPGAVDVFLEFICYSGGPLP--EDL--LPLVKCP 301
Query: 400 ICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEISP-AGHCPHDEVPEVVNYLLRGWI 454
+ + +G++DPW +P+ + + + P AGHCP DE PE+VN L+ ++
Sbjct: 302 VLIAWGEKDPW-EPIELGRAYSNFDAVEDFVVLPDAGHCPQDEKPEMVNPLIESFV 356
Score = 141 (54.7 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
Identities = 37/133 (27%), Positives = 62/133 (46%)
Query: 81 FRNLNSQVLSGSYDGYVIGGEEDAGSFPKEREAIPKVLIPGLPDEYNGESGAPITSCFWE 140
FR NS ++S S ++ A + I V + DE + E + + W+
Sbjct: 21 FRVKNSSIISFSETHFLRQSISTAIVRSPTKRGIVSVSCSSVTDEASSEE-LQVRTLTWK 79
Query: 141 WKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDFLGQGMSL 200
WK +++ Y+ C + P ++ + GFG S H+ K LGK +R ++ID +G G S
Sbjct: 80 WKG-YSIRYQ---CAGTSGPALVLVHGFGANSDHWRKNTPILGKTHRVYSIDLIGYGYS- 134
Query: 201 PDEDPTPRSKEGD 213
+ P PR G+
Sbjct: 135 --DKPNPREFGGE 145
>TAIR|locus:2122654 [details] [associations]
symbol:AT4G25290 species:3702 "Arabidopsis thaliana"
[GO:0003913 "DNA photolyase activity" evidence=IEA;ISS] [GO:0006281
"DNA repair" evidence=IEA;ISS] InterPro:IPR006050
InterPro:IPR000073 Pfam:PF00875 EMBL:CP002687 GO:GO:0006281
PRINTS:PR00111 Gene3D:3.40.50.620 InterPro:IPR014729 GO:GO:0003913
SUPFAM:SSF52425 IPI:IPI00518108 RefSeq:NP_194259.4 UniGene:At.44762
ProteinModelPortal:F4JSJ6 SMR:F4JSJ6 PRIDE:F4JSJ6
EnsemblPlants:AT4G25290.1 GeneID:828632 KEGG:ath:AT4G25290
OMA:VHGFGAF Uniprot:F4JSJ6
Length = 692
Score = 193 (73.0 bits), Expect = 2.3e-15, Sum P(2) = 2.3e-15
Identities = 70/255 (27%), Positives = 111/255 (43%)
Query: 208 RSKEGDSTEEKNFLW-----GFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVG 262
R + KN +W GFG +P + Y+ LW + + F+ EV+ EP + VG
Sbjct: 433 RDNVDNIVNSKNRVWTITVLGFGKSEKP---NIIYTELLWAELLRDFMAEVVGEPAHCVG 489
Query: 263 NSLGGFVAVYFAACNPHLVKGVTLLNATP--FWGFSPNPIRSPKLARILPWSGTFPLPAS 320
NS+GG+ A P LVK V L+N+ G+SP PI + R+ P+ F
Sbjct: 490 NSIGGYFVALMAFLWPALVKSVVLVNSAGNVVPGYSPLPISRER--RV-PFGAQFG---- 542
Query: 321 VRKLIEFIWQKISDPESIAEVLKQVYADHATNVDT-VFTRILETTQHPXXXXXXXXXXXX 379
+L+ F Q ++ ++LK Y D + T +L ++ P
Sbjct: 543 -SRLLLFFLQL-----NVKKLLKDCYPVKPERADDFLVTEMLRASRDPGVVMVLESIFGF 596
Query: 380 PQGNLSFREALSRCQMNGVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEISPAGHCPH 439
+L L + + +I G EDP P + + +++ A + AGHCPH
Sbjct: 597 DL-SLPLNYLLKGFEEKTL---VIQGMEDPISDPQKKVALLKELCPAMVIKKVKAGHCPH 652
Query: 440 DEVPEVVNYLLRGWI 454
DE+ E VN ++ WI
Sbjct: 653 DEISEEVNPIICEWI 667
Score = 80 (33.2 bits), Expect = 2.3e-15, Sum P(2) = 2.3e-15
Identities = 21/62 (33%), Positives = 27/62 (43%)
Query: 139 WEWKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDL-GKDYRAWAIDFLGQG 197
W WK + + Y G E P VL + GFG HY + ++ R W I LG G
Sbjct: 400 WRWKG-YLIQYTVVGNEG---PAVLLVHGFGAFLEHYRDNVDNIVNSKNRVWTITVLGFG 455
Query: 198 MS 199
S
Sbjct: 456 KS 457
>TAIR|locus:2159823 [details] [associations]
symbol:AT5G38520 species:3702 "Arabidopsis thaliana"
[GO:0009507 "chloroplast" evidence=ISM;IDA] [GO:0016787 "hydrolase
activity" evidence=ISS] [GO:0009941 "chloroplast envelope"
evidence=IDA] [GO:0009534 "chloroplast thylakoid" evidence=IDA]
[GO:0006364 "rRNA processing" evidence=RCA] [GO:0009657 "plastid
organization" evidence=RCA] [GO:0010207 "photosystem II assembly"
evidence=RCA] [GO:0010264 "myo-inositol hexakisphosphate
biosynthetic process" evidence=RCA] [GO:0019761 "glucosinolate
biosynthetic process" evidence=RCA] [GO:0043085 "positive
regulation of catalytic activity" evidence=RCA] EMBL:CP002688
GO:GO:0009941 GO:GO:0016787 GO:GO:0009534 IPI:IPI00846134
RefSeq:NP_001078682.1 UniGene:At.20746 ProteinModelPortal:F4KBJ3
SMR:F4KBJ3 PRIDE:F4KBJ3 EnsemblPlants:AT5G38520.2 GeneID:833840
KEGG:ath:AT5G38520 OMA:TPILLIW Uniprot:F4KBJ3
Length = 374
Score = 166 (63.5 bits), Expect = 2.4e-09, P = 2.4e-09
Identities = 63/285 (22%), Positives = 116/285 (40%)
Query: 195 GQGMSLPDEDPTPRSKEGDSTEEKNFLWGFGDKAQPWASELAYSVDLWQDQVCYFIKEVI 254
G G S+P + + T L GFG +P +Y+++ W + + F++EV+
Sbjct: 98 GFGASIPHWRRNINALSKNHTVYAIDLLGFGASDKP--PGFSYTMESWAELILNFLEEVV 155
Query: 255 REPVYVVGNSLGGFVAVYFAA-------------CNPHLVKGVTLLNATPFWGFSPNPIR 301
++P ++GNS+G V A+ LVKG+ LLN G + +
Sbjct: 156 QKPTILIGNSVGSLACVIAASGTKFLIYLEKKTESRGDLVKGLVLLNCAG--GMNNKAVF 213
Query: 302 SP-KLARILPWSGTFPLPASVRKLIEFIWQKISDPESIAEVLKQVYADHATNVDTVFTRI 360
++ ++P R + ++ ++ D E++ +L VY + NVD I
Sbjct: 214 DDWRIKLLMPLLLLIDFLLKQRGIASALFNRVKDRENLKNILTNVYGNK-DNVDDTLVEI 272
Query: 361 LETTQHPXXXXXXXXXXXX-PQGNLSFREALSRCQMNGVPICLIYGKED---PWVKPV-- 414
+ + P G + + P+ +++G +D P PV
Sbjct: 273 IAGPANTEGALDAFVSILTGPPGPNPIKLIPEITK----PVLVLWGDQDGLTPLDGPVGK 328
Query: 415 WGLQVKRQVPEAPYYEISPAGHCPHDEVPEVVNYLLRGWIKNLES 459
+ + Q+P Y + GHCP D+ P++V+ L W+ L S
Sbjct: 329 YFTSLPDQLPNFNLYVLQGVGHCPQDDRPDLVHERLLPWLAQLSS 373
>UNIPROTKB|Q48LN2 [details] [associations]
symbol:catD1 "3-oxoadipate enol-lactonase" species:264730
"Pseudomonas syringae pv. phaseolicola 1448A" [GO:0019614
"catechol-containing compound catabolic process" evidence=ISS]
[GO:0042952 "beta-ketoadipate pathway" evidence=ISS] [GO:0047570
"3-oxoadipate enol-lactonase activity" evidence=ISS]
InterPro:IPR000073 eggNOG:COG0596 PRINTS:PR00111 EMBL:CP000058
GenomeReviews:CP000058_GR HOGENOM:HOG000028072 GO:GO:0042952
GO:GO:0019614 GO:GO:0047570 RefSeq:YP_273690.1
ProteinModelPortal:Q48LN2 STRING:Q48LN2 GeneID:3558560
KEGG:psp:PSPPH_1434 PATRIC:19972005 KO:K01055 OMA:FDAIVGW
ProtClustDB:CLSK867833 Uniprot:Q48LN2
Length = 262
Score = 95 (38.5 bits), Expect = 1.8e-05, Sum P(2) = 1.8e-05
Identities = 20/69 (28%), Positives = 36/69 (52%)
Query: 223 GFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPHLVK 282
G G +P+ YS+ + V I+ + PV+++G S+GG + A PHL+K
Sbjct: 55 GHGRSDKPYGR---YSIQAMSNDVEALIEHLHLGPVHLIGLSMGGMIGFQLAVDQPHLLK 111
Query: 283 GVTLLNATP 291
+ ++N+ P
Sbjct: 112 SLCIVNSAP 120
Score = 76 (31.8 bits), Expect = 1.8e-05, Sum P(2) = 1.8e-05
Identities = 19/54 (35%), Positives = 26/54 (48%)
Query: 146 NVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDFLGQGMS 199
++HYE+ G PVL L G G +E Q+ L + YR +D G G S
Sbjct: 10 SLHYEEYGL----GEPVLLLHGLGSSCQDWEYQIPALARQYRVIVMDMRGHGRS 59
>UNIPROTKB|Q74EB1 [details] [associations]
symbol:GSU1052 "Hydrolase or acyltransferase, alpha/beta
fold family" species:243231 "Geobacter sulfurreducens PCA"
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR000073
GO:GO:0016787 PRINTS:PR00111 EMBL:AE017180
GenomeReviews:AE017180_GR GO:GO:0016746 RefSeq:NP_952105.1
ProteinModelPortal:Q74EB1 GeneID:2685731 KEGG:gsu:GSU1052
PATRIC:22024870 HOGENOM:HOG000065081 OMA:GHNPQEE
ProtClustDB:CLSK743106 BioCyc:GSUL243231:GH27-1039-MONOMER
Uniprot:Q74EB1
Length = 302
Score = 103 (41.3 bits), Expect = 0.00013, Sum P(2) = 0.00013
Identities = 61/259 (23%), Positives = 99/259 (38%)
Query: 203 EDPTPRSKEGDSTEEKNFLWGFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVG 262
+D P G T L GFG ++P +YS+ V FI+ V + G
Sbjct: 44 DDLVPLFSPGRFTLYLIDLKGFGFSSKPRRG--SYSLAEQAAVVTAFIQTQGLRQVVLAG 101
Query: 263 NSLGGFVAVYFAACNPHLVKGVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVR 322
+SLGG +A+ AA + L++ + P R P+ R+L G +P R
Sbjct: 102 HSLGGGIAL-LAALRANERGDDGLIDRLILLDCAAYPQRLPRFMRLL---G---VPVLAR 154
Query: 323 KLIEFIWQKISDPESIAEVLKQVYADHATNVDTVFTRILETTQHPXXXXXXXXXXXXPQG 382
+ I ++ + L+ V+ D T + R ET
Sbjct: 155 LGMALIPVRLI----VKSTLRAVFED-PTAITAERIRRYETCFGRRGIARVLIRTVRELS 209
Query: 383 NLSFREALSRCQMNGVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEISPAGHCPHDEV 442
+ R + +I+G+ D V+P G ++ +P A I GH PH+E
Sbjct: 210 RTDVSAVIQRYGEIAIRTLIIWGENDRIVRPTQGRRLVDALPSARLAVIGACGHNPHEEQ 269
Query: 443 PEVVNYLLRGWIKNLESQG 461
P L+R +I+ E +G
Sbjct: 270 PLRTYELMREFIEEGEDKG 288
Score = 61 (26.5 bits), Expect = 0.00013, Sum P(2) = 0.00013
Identities = 19/71 (26%), Positives = 33/71 (46%)
Query: 131 GAPITSCFWEWKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEK--QLKDLGKDYRA 188
G+ + F+ + P ++HY GC V+ V+F+ GF ++ L G+ +
Sbjct: 2 GSTSSGSFFAYSPDTSIHYRIHGCGPVS---VVFIHGFAAALTTWDDLVPLFSPGR-FTL 57
Query: 189 WAIDFLGQGMS 199
+ ID G G S
Sbjct: 58 YLIDLKGFGFS 68
>TIGR_CMR|GSU_1052 [details] [associations]
symbol:GSU_1052 "hydrolase, alpha/beta fold family"
species:243231 "Geobacter sulfurreducens PCA" [GO:0008150
"biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
evidence=ISS] InterPro:IPR000073 GO:GO:0016787 PRINTS:PR00111
EMBL:AE017180 GenomeReviews:AE017180_GR GO:GO:0016746
RefSeq:NP_952105.1 ProteinModelPortal:Q74EB1 GeneID:2685731
KEGG:gsu:GSU1052 PATRIC:22024870 HOGENOM:HOG000065081 OMA:GHNPQEE
ProtClustDB:CLSK743106 BioCyc:GSUL243231:GH27-1039-MONOMER
Uniprot:Q74EB1
Length = 302
Score = 103 (41.3 bits), Expect = 0.00013, Sum P(2) = 0.00013
Identities = 61/259 (23%), Positives = 99/259 (38%)
Query: 203 EDPTPRSKEGDSTEEKNFLWGFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVG 262
+D P G T L GFG ++P +YS+ V FI+ V + G
Sbjct: 44 DDLVPLFSPGRFTLYLIDLKGFGFSSKPRRG--SYSLAEQAAVVTAFIQTQGLRQVVLAG 101
Query: 263 NSLGGFVAVYFAACNPHLVKGVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVR 322
+SLGG +A+ AA + L++ + P R P+ R+L G +P R
Sbjct: 102 HSLGGGIAL-LAALRANERGDDGLIDRLILLDCAAYPQRLPRFMRLL---G---VPVLAR 154
Query: 323 KLIEFIWQKISDPESIAEVLKQVYADHATNVDTVFTRILETTQHPXXXXXXXXXXXXPQG 382
+ I ++ + L+ V+ D T + R ET
Sbjct: 155 LGMALIPVRLI----VKSTLRAVFED-PTAITAERIRRYETCFGRRGIARVLIRTVRELS 209
Query: 383 NLSFREALSRCQMNGVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEISPAGHCPHDEV 442
+ R + +I+G+ D V+P G ++ +P A I GH PH+E
Sbjct: 210 RTDVSAVIQRYGEIAIRTLIIWGENDRIVRPTQGRRLVDALPSARLAVIGACGHNPHEEQ 269
Query: 443 PEVVNYLLRGWIKNLESQG 461
P L+R +I+ E +G
Sbjct: 270 PLRTYELMREFIEEGEDKG 288
Score = 61 (26.5 bits), Expect = 0.00013, Sum P(2) = 0.00013
Identities = 19/71 (26%), Positives = 33/71 (46%)
Query: 131 GAPITSCFWEWKPKFNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEK--QLKDLGKDYRA 188
G+ + F+ + P ++HY GC V+ V+F+ GF ++ L G+ +
Sbjct: 2 GSTSSGSFFAYSPDTSIHYRIHGCGPVS---VVFIHGFAAALTTWDDLVPLFSPGR-FTL 57
Query: 189 WAIDFLGQGMS 199
+ ID G G S
Sbjct: 58 YLIDLKGFGFS 68
>ZFIN|ZDB-GENE-080227-1 [details] [associations]
symbol:ephx4 "epoxide hydrolase 4" species:7955
"Danio rerio" [GO:0003824 "catalytic activity" evidence=IEA]
InterPro:IPR000639 PRINTS:PR00412 InterPro:IPR000073
ZFIN:ZDB-GENE-080227-1 GO:GO:0003824 GO:GO:0008152 PRINTS:PR00111
GeneTree:ENSGT00530000063213 EMBL:BX927081 IPI:IPI00806359
RefSeq:XP_002662469.1 Ensembl:ENSDART00000086929 GeneID:100331939
KEGG:dre:100331939 Uniprot:E7EZD2
Length = 370
Score = 84 (34.6 bits), Expect = 0.00018, Sum P(2) = 0.00018
Identities = 52/245 (21%), Positives = 100/245 (40%)
Query: 223 GFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPHLVK 282
G+G+ P ++E +Y +D + ++ + ++VG+ GG +A A P +V
Sbjct: 133 GYGESDLPSSTE-SYRLDYLVTDIKDIVEYLGYNRCFLVGHDWGGIIAWLCAIHYPEMVT 191
Query: 283 GVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVRKLIEFIWQKISDPESIAEVL 342
+ +LN+ F+ +R P + F LP ++ I+D + L
Sbjct: 192 KLIVLNSPHPCVFTDYALRHPSQMLKSSYYFFFQLPYFPELMLS-----IND----FKAL 242
Query: 343 KQVYADHATNVDTVFTRILETTQHPXXXXXXXXXXXXPQGNLS-FREA-----LSRCQMN 396
K ++ +T + R L TT+ G L+ FR LS ++
Sbjct: 243 KSLFTSRSTGISCK-GRWL-TTEDLEAYLYALSQPGALTGALNYFRNVFSVLPLSHSEVK 300
Query: 397 GVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEI-SPAGHCPHDEVPEVVNYLLRGWIK 455
P+ L++G+ D +++ + + I S A H + P++VN L+ +IK
Sbjct: 301 S-PVLLLWGERDAFLEQDMAEACRLYIRNLFRLNIISGASHWLQQDQPDIVNKLIWTFIK 359
Query: 456 NLESQ 460
E +
Sbjct: 360 EGEGR 364
Score = 83 (34.3 bits), Expect = 0.00018, Sum P(2) = 0.00018
Identities = 26/76 (34%), Positives = 35/76 (46%)
Query: 135 TSCFWEWKPK-FNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDF 193
T C+ K HY AG E P +LFL GF F + QL++ ++R A+D
Sbjct: 74 THCYVRIKESGLRFHYVAAG-ER-GKPLMLFLHGFPEFWFSWRHQLREFKSEFRVVAVDM 131
Query: 194 LGQGMS-LPDEDPTPR 208
G G S LP + R
Sbjct: 132 RGYGESDLPSSTESYR 147
>MGI|MGI:2686228 [details] [associations]
symbol:Ephx4 "epoxide hydrolase 4" species:10090 "Mus
musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0003824
"catalytic activity" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
[GO:0016020 "membrane" evidence=IEA] [GO:0016021 "integral to
membrane" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR000639 PRINTS:PR00412 InterPro:IPR000073
MGI:MGI:2686228 GO:GO:0016021 GO:GO:0008152 eggNOG:COG0596
GO:GO:0016787 PRINTS:PR00111 EMBL:AC126598 HOGENOM:HOG000028073
GeneTree:ENSGT00530000063213 CTD:253152 OMA:RDAFMEV
OrthoDB:EOG469QTX EMBL:BN000367 IPI:IPI00420889 IPI:IPI01027710
RefSeq:NP_001001804.2 UniGene:Mm.489863 HSSP:P34913
ProteinModelPortal:Q6IE26 PhosphoSite:Q6IE26 PRIDE:Q6IE26
DNASU:384214 Ensembl:ENSMUST00000049146 GeneID:384214
KEGG:mmu:384214 UCSC:uc008yme.1 InParanoid:Q6IE26 NextBio:404440
CleanEx:MM_ABHD7 Genevestigator:Q6IE26 Uniprot:Q6IE26
Length = 359
Score = 90 (36.7 bits), Expect = 0.00029, Sum P(2) = 0.00029
Identities = 47/243 (19%), Positives = 100/243 (41%)
Query: 221 LWGFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPHL 280
L G+G+ P A + +Y +D + + + ++G+ GG +A A C P +
Sbjct: 125 LRGYGESDAP-AHQESYKLDCLIADIKDILDSLGYSKCVLIGHDWGGMIAWLIAVCYPEM 183
Query: 281 VKGVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVRKLIEFIWQKISDPESIAE 340
+ + ++N F+ +R P A++ + +F + + EF++ I+D +
Sbjct: 184 IMKLIVINFPHPSVFTEYILRHP--AQL--FRSSFYYFFQIPRFPEFMFS-IND----FK 234
Query: 341 VLKQVYADHATNVDTVFTRILETTQHPXXXXXXXXXXXXPQGNLSFREALSRC-----QM 395
LK ++ +T + ++ TT+ G ++ + C M
Sbjct: 235 ALKHLFTSQSTGIGRKGRQL--TTEDLEAYVYVFSQPGALSGPINHYRNIFSCLPLKHHM 292
Query: 396 NGVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEI---SPAGHCPHDEVPEVVNYLLRG 452
P L++G+ED +++ K V Y+ + S H + P++VN L+
Sbjct: 293 VTTPTLLLWGEEDAFMEVEMAEVTKIYVKN--YFRLTILSEGSHWLQQDQPDIVNGLIWA 350
Query: 453 WIK 455
++K
Sbjct: 351 FLK 353
Score = 74 (31.1 bits), Expect = 0.00029, Sum P(2) = 0.00029
Identities = 22/66 (33%), Positives = 30/66 (45%)
Query: 135 TSCFWEWKPK-FNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDF 193
T C+ K HY AG E P +L L GF + + QL++ +YR A+D
Sbjct: 68 THCYVRIKDSGLRFHYVAAG-ER-GKPLMLLLHGFPEFWYSWRHQLREFKSEYRVVALDL 125
Query: 194 LGQGMS 199
G G S
Sbjct: 126 RGYGES 131
>TAIR|locus:2135843 [details] [associations]
symbol:AT4G12830 species:3702 "Arabidopsis thaliana"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0009507
"chloroplast" evidence=ISM;IDA] [GO:0016787 "hydrolase activity"
evidence=ISS] [GO:0016556 "mRNA modification" evidence=RCA]
InterPro:IPR000639 PRINTS:PR00412 InterPro:IPR000073 GO:GO:0009507
EMBL:CP002687 GO:GO:0016787 PRINTS:PR00111 EMBL:AY056437
EMBL:AY090325 IPI:IPI00539162 RefSeq:NP_567394.1 UniGene:At.3098
ProteinModelPortal:Q93ZN4 SMR:Q93ZN4 STRING:Q93ZN4 PRIDE:Q93ZN4
EnsemblPlants:AT4G12830.1 GeneID:826895 KEGG:ath:AT4G12830
TAIR:At4g12830 InParanoid:Q93ZN4 OMA:HEFADCG PhylomeDB:Q93ZN4
ProtClustDB:PLN03084 Genevestigator:Q93ZN4 Uniprot:Q93ZN4
Length = 393
Score = 99 (39.9 bits), Expect = 0.00029, Sum P(3) = 0.00029
Identities = 20/44 (45%), Positives = 29/44 (65%)
Query: 156 NVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDFLGQGMS 199
+V+SPPV+ + GF ++ Y K + L K+YRA A D+LG G S
Sbjct: 130 SVDSPPVILIHGFPSQAYSYRKTIPVLSKNYRAIAFDWLGFGFS 173
Score = 64 (27.6 bits), Expect = 0.00029, Sum P(3) = 0.00029
Identities = 22/71 (30%), Positives = 31/71 (43%)
Query: 220 FLW-GFGDKAQPWASE-LAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACN 277
F W GFG +P A Y++D + + FI EV V +V V +A
Sbjct: 165 FDWLGFGFSDKPQAGYGFNYTMDEFVSSLESFIDEVTTSKVSLVVQGYFSAAVVKYARNR 224
Query: 278 PHLVKGVTLLN 288
P +K + LLN
Sbjct: 225 PDKIKNLILLN 235
Score = 39 (18.8 bits), Expect = 0.00029, Sum P(3) = 0.00029
Identities = 11/47 (23%), Positives = 22/47 (46%)
Query: 398 VPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEISPAGHCPHDEVPE 444
+PI + +G+ D W+ G++ + E+ AGH ++ E
Sbjct: 332 IPITVCWGQRDRWLS-YEGVEEFCKSSGHNLVELPNAGHHVQEDCGE 377
>UNIPROTKB|F1N3G0 [details] [associations]
symbol:EPHX4 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0003824 "catalytic activity" evidence=IEA]
InterPro:IPR000639 PRINTS:PR00412 InterPro:IPR000073 GO:GO:0003824
GO:GO:0008152 PRINTS:PR00111 GeneTree:ENSGT00530000063213
CTD:253152 OMA:RDAFMEV EMBL:DAAA02007909 IPI:IPI00842845
RefSeq:NP_001069323.2 UniGene:Bt.63275 Ensembl:ENSBTAT00000007633
GeneID:524246 KEGG:bta:524246 NextBio:20873927 Uniprot:F1N3G0
Length = 362
Score = 87 (35.7 bits), Expect = 0.00064, Sum P(2) = 0.00064
Identities = 50/243 (20%), Positives = 96/243 (39%)
Query: 221 LWGFGDKAQPWASELAYSVDLWQDQVCYFIKEVIREPVYVVGNSLGGFVAVYFAACNPHL 280
L G+G+ P E Y +D + ++ + ++G+ GG +A A C P +
Sbjct: 127 LRGYGESDAPVHRE-NYKLDCLITDIKDILESLGYSKCVLIGHDWGGMIAWLIAICYPEM 185
Query: 281 VKGVTLLNATPFWGFSPNPIRSPKLARILPWSGTFPLPASVRKLIEFIWQKISDPESIAE 340
V + ++N F+ +R P + F +P EF++ I+D +
Sbjct: 186 VMKLIVINFPHPNVFTEYILRHPAQLFKSSYYYFFQIPW----FPEFMFS-IND----FK 236
Query: 341 VLKQVYADHATNVDTVFTRILETTQHPXXXXXXXXXXXXPQGNLSFREALSRC-----QM 395
LK ++ H+T + R+ TT+ G ++ + C M
Sbjct: 237 ALKHLFTSHSTGIGRKGCRL--TTEDLEAYIYVFSQPGALSGPINHYRNIFSCLPLKHHM 294
Query: 396 NGVPICLIYGKEDPWVKPVWGLQVKRQVPEAPYYEI---SPAGHCPHDEVPEVVNYLLRG 452
P L++G++D +++ K V Y+ + S H E P++VN L+
Sbjct: 295 VITPTLLLWGEKDAFMEVEMAEVTKIYVKN--YFRLTILSETSHWLQQEQPDIVNKLIWT 352
Query: 453 WIK 455
++K
Sbjct: 353 FLK 355
Score = 74 (31.1 bits), Expect = 0.00064, Sum P(2) = 0.00064
Identities = 22/66 (33%), Positives = 30/66 (45%)
Query: 135 TSCFWEWKPK-FNVHYEKAGCENVNSPPVLFLPGFGVGSFHYEKQLKDLGKDYRAWAIDF 193
T C+ K HY AG E P +L L GF + + QL++ +YR A+D
Sbjct: 70 THCYVRIKDSGLRFHYVAAG-ER-GKPLMLLLHGFPEFWYSWRHQLREFKSEYRVVALDL 127
Query: 194 LGQGMS 199
G G S
Sbjct: 128 RGYGES 133
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.137 0.437 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 524 512 0.00087 119 3 11 22 0.41 34
35 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 12
No. of states in DFA: 630 (67 KB)
Total size of DFA: 348 KB (2172 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 42.05u 0.08s 42.13t Elapsed: 00:00:04
Total cpu time: 42.05u 0.08s 42.13t Elapsed: 00:00:04
Start: Sat May 11 08:31:03 2013 End: Sat May 11 08:31:07 2013