Your job contains 1 sequence.
>017706
MAGVSLKCGDCGALLRSVQEAQEHAELTSHSNFSESTEAVLNLVCATCGKPCRSKTETDL
HRKRTGHTDFVDKTSEAAKPISLEVPKATADSEEAIDVDMSGSQPEEMVEPEVDKELLKE
LEAMGFPVARATRALHYSGNANVEAAVNWVVEHENDPDIDEMPMVPVSGGGGASKSSLTP
EEIKLKAQELRERARKKKEEEEKRMEREREKERIRIGKELLEAKRIEEENERKRILALRK
AEKEEEKRAREKIRQKLEEDKAERRRRLGLPPEDPATTKSSAPVVEEKKSMLPIRPATKV
EQMRECLRSLKQNHKDDDAKVKRAFQTLLTYIGNVAKNPNEEKFRKIRLSNQTFQVWNFF
TIDLDTC
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 017706
(367 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                      Smallest
                                                                        Sum
                                                               High  Probability
Sequences producing High-scoring Segment Pairs:               Score  P(N)      N
TAIR|locus:2010637 - symbol:AT1G04850 species:3702 "Arabi...    628  1.2e-106  2
TAIR|locus:2156489 - symbol:AT5G48690 "AT5G48690" species...    249  1.1e-29   2
UNIPROTKB|G4MXB0 - symbol:MGG_08270 "Ubiquitin carboxyl-t...    127  5.3e-05   1
UNIPROTKB|G4N023 - symbol:MGG_06184 "Uncharacterized prot...    105  0.00015   2
WB|WBGene00017733 - symbol:ubxn-1 species:6239 "Caenorhab...    101  0.00059   2
UNIPROTKB|Q9TXH9 - symbol:ubxn-1 "Protein UBXN-1" species...    101  0.00059   2
>TAIR|locus:2010637
symbol:AT1G04850 species:3702 "Arabidopsis thaliana"
[GO:0005737 "cytoplasm" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0005829 "cytosol" evidence=IDA] [GO:0006486
"protein glycosylation" evidence=RCA] [GO:0006623 "protein
targeting to vacuole" evidence=RCA] InterPro:IPR000449
InterPro:IPR006567 InterPro:IPR007087 InterPro:IPR009060
InterPro:IPR015880 Pfam:PF00627 PROSITE:PS00028 SMART:SM00355
SMART:SM00580 EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005829
GO:GO:0008270 InterPro:IPR015940 SMART:SM00165 PROSITE:PS50030
EMBL:AC004809 SUPFAM:SSF46934 InterPro:IPR018997 Pfam:PF09409
eggNOG:NOG126397 HOGENOM:HOG000239091 EMBL:AY062609 EMBL:AY088435
EMBL:BT000147 IPI:IPI00534720 PIR:G86181 RefSeq:NP_563718.1
UniGene:At.21872 ProteinModelPortal:Q9MAT3 SMR:Q9MAT3 IntAct:Q9MAT3
PaxDb:Q9MAT3 PRIDE:Q9MAT3 EnsemblPlants:AT1G04850.1 GeneID:839399
KEGG:ath:AT1G04850 TAIR:At1g04850 InParanoid:Q9MAT3 OMA:HYSGNAS
PhylomeDB:Q9MAT3 ProtClustDB:CLSN2687675 ArrayExpress:Q9MAT3
Genevestigator:Q9MAT3 Uniprot:Q9MAT3
Length = 413
Score = 628 (226.1 bits), Expect = 1.2e-106, Sum P(2) = 1.2e-106
Identities = 123/188 (65%), Positives = 140/188 (74%)
Query: 1 MAGVSLKCGDCGALLRSVQEAQEHAELTSHSNFSESTEAVLNLVCATCGKPCRSKTETDL 60
MAGVSLKCGDCG LL+SV+EAQEHAELTSHSNF+ESTEAVLNLVC TC KPCRSK E+DL
Sbjct: 1 MAGVSLKCGDCGTLLKSVEEAQEHAELTSHSNFAESTEAVLNLVCTTCTKPCRSKIESDL 60
Query: 61 HRKRTGHTDFVDKTSEAAKPISLEVPKATADSEEAIDVDMSGSQPEEMVEPEVDKELLKE 120
H KRTGHT+FVDKT E KPISLE PK + ++ + SG EEMV P+VD +L+E
Sbjct: 61 HTKRTGHTEFVDKTLETIKPISLEAPKVAMEIDD--NASGSGEAAEEMVVPDVDNNILEE 118
Query: 121 LEAMGFPVARATRALHYSGXXXXXXXXXXXXXXXXDPDIDEMPMVPVSGGGGASKSSLTP 180
LEAMGFP ARATRALHYSG DPD+DEMP VP + G +K +LTP
Sbjct: 119 LEAMGFPKARATRALHYSGNASLEAAVNWVVEHENDPDVDEMPKVPSNSNVGPAKPALTP 178
Query: 181 EEIKLKAQ 188
EE+KLKAQ
Sbjct: 179 EEVKLKAQ 186
Score = 447 (162.4 bits), Expect = 1.2e-106, Sum P(2) = 1.2e-106
Identities = 89/103 (86%), Positives = 94/103 (91%)
Query: 255 QKLEEDKAERRRRLGLPPEDPATT--KSSAPVVEEKKSMLPIRPATKVEQMRECLRSLKQ 312
QKLEEDKAERRR+LGLPPEDPAT K S PVVEEKK LPIRPATK EQMRECLRSLKQ
Sbjct: 253 QKLEEDKAERRRKLGLPPEDPATAAAKPSVPVVEEKKVTLPIRPATKTEQMRECLRSLKQ 312
Query: 313 NHKDDDAKVKRAFQTLLTYIGNVAKNPNEEKFRKIRLSNQTFQ 355
HK+DDAKVKRAFQTLLTY+GNVAKNP+EEKFRKIRL+NQTFQ
Sbjct: 313 AHKEDDAKVKRAFQTLLTYMGNVAKNPDEEKFRKIRLTNQTFQ 355
Score = 57 (25.1 bits), Expect = 1.6e-65, Sum P(2) = 1.6e-65
Identities = 18/69 (26%), Positives = 36/69 (52%)
Query: 255 QKLEEDKAERRRRLGLPPEDPATTKSSAPVVEEKKSMLPIRPATKVEQMRECLRSLKQNH 314
+++E ++ + R R+G ++ K V E K+ M +R A K E+ R ++Q
Sbjct: 201 KRMEREREKERIRIG---KELLEAKRMEEVNERKRLMF-LRKAEKEEEKR-AREKIRQKL 255
Query: 315 KDDDAKVKR 323
++D A+ +R
Sbjct: 256 EEDKAERRR 264
Score = 38 (18.4 bits), Expect = 1.6e-63, Sum P(2) = 1.6e-63
Identities = 17/70 (24%), Positives = 32/70 (45%)
Query: 280 SSAPVVEEKKSMLPIRPATKVEQMRECLRSLKQNHKDDDAKVKRAFQTLLTYIGNV---A 336
S++ V K ++ P K +++RE R K+ +++ +++R + IG A
Sbjct: 165 SNSNVGPAKPALTPEEVKLKAQELRERARKKKE---EEEKRMEREREKERIRIGKELLEA 221
Query: 337 KNPNEEKFRK 346
K E RK
Sbjct: 222 KRMEEVNERK 231
>TAIR|locus:2156489
symbol:AT5G48690 "AT5G48690" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005737
"cytoplasm" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR006567 InterPro:IPR009060 SMART:SM00580
EMBL:CP002688 InterPro:IPR015940 PROSITE:PS50030 SUPFAM:SSF46934
InterPro:IPR018997 Pfam:PF09409 EMBL:AY735713 EMBL:AY924856
IPI:IPI00544841 RefSeq:NP_199680.2 UniGene:At.51085
ProteinModelPortal:Q5XV09 SMR:Q5XV09 PRIDE:Q5XV09
EnsemblPlants:AT5G48690.1 GeneID:834927 KEGG:ath:AT5G48690
TAIR:At5g48690 HOGENOM:HOG000239091 InParanoid:Q5XV09 OMA:VEFNIEI
PhylomeDB:Q5XV09 ProtClustDB:CLSN2918881 Genevestigator:Q5XV09
Uniprot:Q5XV09
Length = 323
Score = 249 (92.7 bits), Expect = 1.1e-29, Sum P(2) = 1.1e-29
Identities = 50/103 (48%), Positives = 76/103 (73%)
Query: 256 KLEEDKAERRRRLGLPPE-DPATTKSSAPVVEEKKSMLPIRPA--TKVEQMRECLRSLKQ 312
K+ DK ER+RRLGLP E + A+T + ++ K+ ++ P+ +K E+MRECLRSL++
Sbjct: 156 KVNADKLERKRRLGLPTETESASTSNPVSPLDPKRIVMS-SPSVVSKAEEMRECLRSLRR 214
Query: 313 NHKDDDAKV-KRAFQTLLTYIGNVAKNPNEEKFRKIRLSNQTF 354
NHKD+D ++ +R F+TLLT + NVAK P+EE++R+IRL N+ F
Sbjct: 215 NHKDEDPRITRRVFETLLTIVRNVAKKPDEERYRRIRLKNRLF 257
Score = 95 (38.5 bits), Expect = 1.1e-29, Sum P(2) = 1.1e-29
Identities = 22/54 (40%), Positives = 28/54 (51%)
Query: 112 EVDKELLKELEAMGFPVARATRALHYSGXXXXXXXXXXXXXXXXDPDIDEMPMV 165
EV+ LLKELE MGF +ARA ALH+SG + + MP+V
Sbjct: 11 EVNHGLLKELEDMGFSMARAAWALHHSGNSSLEAAVNWIIDHENESQFENMPLV 64
>UNIPROTKB|G4MXB0
symbol:MGG_08270 "Ubiquitin carboxyl-terminal hydrolase"
species:242507 "Magnaporthe oryzae 70-15" [GO:0005575
"cellular_component" evidence=ND] InterPro:IPR000449
InterPro:IPR001394 InterPro:IPR001607 InterPro:IPR009060
InterPro:IPR016652 InterPro:IPR018200 Pfam:PF00443 Pfam:PF00627
Pfam:PF02148 PIRSF:PIRSF016308 PROSITE:PS00972 PROSITE:PS00973
PROSITE:PS50235 PROSITE:PS50271 SMART:SM00290 GO:GO:0046872
GO:GO:0008270 GO:GO:0008234 InterPro:IPR015940 SMART:SM00165
PROSITE:PS50030 Gene3D:3.30.40.10 InterPro:IPR013083 GO:GO:0006511
EMBL:CM001232 SUPFAM:SSF46934 GO:GO:0008242 GO:GO:0004221 KO:K11836
RefSeq:XP_003715815.1 ProteinModelPortal:G4MXB0
EnsemblFungi:MGG_08270T0 GeneID:2678560 KEGG:mgr:MGG_08270
Uniprot:G4MXB0
Length = 787
Score = 127 (49.8 bits), Expect = 5.3e-05, P = 5.3e-05
Identities = 46/163 (28%), Positives = 68/163 (41%)
Query: 36 STEAVLNLVCATCG-KPCRSKTETDLHRKRTGHTDFVDKTSEAAKPISLEVPKATADSEE 94
+ E ++ L C++CG K +K + + T PI ++VP D
Sbjct: 499 TAEEIVELTCSSCGSKAGYTKRSCFKTLPQVLVVNARKMTIVNWVPIKVDVPVLVNDDPY 558
Query: 95 AIDVDMS-GSQPEEMVEPE-----------VDKELLKELEAMGFPVARATRALHYSGXXX 142
+D +S G QP E + PE + E + +LEAMGFP R +ALH +G
Sbjct: 559 LLDNYLSQGQQPGEELLPEDASSSSAPAFTPNPEAVAQLEAMGFPRNRCEKALHATGNSD 618
Query: 143 XXXXXXXXXXXXXDPDIDEMPMVPVSGGGGASKSSLTPEEIKL 185
DPDIDE P+ G GA + P I++
Sbjct: 619 ANTAMEWLFGHMEDPDIDE-PLKLEGSGDGAGGFTADPSSIEM 660
>UNIPROTKB|G4N023
symbol:MGG_06184 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001012 InterPro:IPR007087
InterPro:IPR009060 InterPro:IPR015880 Pfam:PF00789 PROSITE:PS00028
PROSITE:PS50033 PROSITE:PS50157 SMART:SM00166 SMART:SM00355
EMBL:CM001233 GO:GO:0008270 GO:GO:0005622 InterPro:IPR015940
SMART:SM00165 PROSITE:PS50030 SUPFAM:SSF46934 RefSeq:XP_003712068.1
ProteinModelPortal:G4N023 EnsemblFungi:MGG_06184T0 GeneID:2684306
KEGG:mgr:MGG_06184 Uniprot:G4N023
Length = 316
Score = 105 (42.0 bits), Expect = 0.00015, Sum P(2) = 0.00015
Identities = 36/119 (30%), Positives = 57/119 (47%)
Query: 5 SLKCGDCGALLRSVQEAQEHAELTSHSNFSESTEAVLNLVCATCGKPCRSKTETDLHRKR 64
SL C +CG R+ A HA + H++FSESTEA+ +L K ++ L K+
Sbjct: 72 SLVCNECGKKFRNSDSATFHATKSGHTDFSESTEAIAHLT-EEQKKERLAELREQLKAKK 130
Query: 65 TGHTDFVDKTSEAAKPISLEVPKATADSEEAIDVDMSGSQPEEMVEPEVDKELLKELEA 123
++ DK E AK KAT ++++ + Q +E + +K L +LEA
Sbjct: 131 AAQSE-ADK--EDAKRNEKIRQKATRETQDLKEEIARKEQIKEAAKKRQEK--LDDLEA 184
Score = 51 (23.0 bits), Expect = 0.00015, Sum P(2) = 0.00015
Identities = 17/54 (31%), Positives = 22/54 (40%)
Query: 256 KLEEDKAERRRRLG-LPPEDPATTKSSAPVVEEKKSMLPIRPATKVEQMRECLR 308
K+E DKAER+R+ TT +S+ S P P V LR
Sbjct: 191 KIEADKAERKRKADEAKAARDGTTVASSSASPAPASSAPAAPKPSVSHNDARLR 244
>WB|WBGene00017733
symbol:ubxn-1 species:6239 "Caenorhabditis elegans"
[GO:0004190 "aspartic-type endopeptidase activity" evidence=IEA]
[GO:0006508 "proteolysis" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] InterPro:IPR000449 InterPro:IPR001012
InterPro:IPR007087 InterPro:IPR009060 InterPro:IPR015880
Pfam:PF00627 Pfam:PF00789 PROSITE:PS00028 PROSITE:PS50033
SMART:SM00166 SMART:SM00355 GO:GO:0008270 GO:GO:0005622
SUPFAM:SSF46934 EMBL:FO081207 GeneTree:ENSGT00530000063500
HOGENOM:HOG000188321 eggNOG:NOG126397 OMA:KTKHENF PIR:T33830
RefSeq:NP_490978.1 ProteinModelPortal:Q9TXH9 SMR:Q9TXH9
DIP:DIP-27462N IntAct:Q9TXH9 MINT:MINT-1037613 STRING:Q9TXH9
PaxDb:Q9TXH9 EnsemblMetazoa:F23C8.4 GeneID:171804
KEGG:cel:CELE_F23C8.4 UCSC:F23C8.4 CTD:171804 WormBase:F23C8.4
InParanoid:Q9TXH9 NextBio:872765 Uniprot:Q9TXH9
Length = 299
Score = 101 (40.6 bits), Expect = 0.00059, Sum P(2) = 0.00059
Identities = 29/92 (31%), Positives = 41/92 (44%)
Query: 5 SLKCGDCGALLRSVQEAQEHAELTSHSNFSESTEAVLNLVCAT-CGKPCRSKTETDLHRK 63
S KC DCG LL + HA T H NFSES+EA+ L K + + +H+
Sbjct: 66 SFKCDDCGKLLANDDAIMFHASKTKHENFSESSEAIKPLTAEEKAAKVLEIREKIKVHQA 125
Query: 64 RTGHTDFVDKTSEAAKPISLEVPKATADSEEA 95
+ + ++ E K E KA +EA
Sbjct: 126 KKAKLE-AEENREKEKK-RREDGKAMISHKEA 155
Score = 49 (22.3 bits), Expect = 0.00059, Sum P(2) = 0.00059
Identities = 9/43 (20%), Positives = 23/43 (53%)
Query: 286 EEKKSMLPIRPATKVEQMRECLRSLKQNHKDDDAKVKRAFQTL 328
E+ K+M+ + A + ++RE + ++ +D+ KR + +
Sbjct: 144 EDGKAMISHKEAARDREIREAAQDRRREKNEDEIARKRVLEQI 186
>UNIPROTKB|Q9TXH9
symbol:ubxn-1 "Protein UBXN-1" species:6239 "Caenorhabditis
elegans" [GO:0005515 "protein binding" evidence=IPI]
InterPro:IPR000449 InterPro:IPR001012 InterPro:IPR007087
InterPro:IPR009060 InterPro:IPR015880 Pfam:PF00627 Pfam:PF00789
PROSITE:PS00028 PROSITE:PS50033 SMART:SM00166 SMART:SM00355
GO:GO:0008270 GO:GO:0005622 SUPFAM:SSF46934 EMBL:FO081207
GeneTree:ENSGT00530000063500 HOGENOM:HOG000188321 eggNOG:NOG126397
OMA:KTKHENF PIR:T33830 RefSeq:NP_490978.1 ProteinModelPortal:Q9TXH9
SMR:Q9TXH9 DIP:DIP-27462N IntAct:Q9TXH9 MINT:MINT-1037613
STRING:Q9TXH9 PaxDb:Q9TXH9 EnsemblMetazoa:F23C8.4 GeneID:171804
KEGG:cel:CELE_F23C8.4 UCSC:F23C8.4 CTD:171804 WormBase:F23C8.4
InParanoid:Q9TXH9 NextBio:872765 Uniprot:Q9TXH9
Length = 299
Score = 101 (40.6 bits), Expect = 0.00059, Sum P(2) = 0.00059
Identities = 29/92 (31%), Positives = 41/92 (44%)
Query: 5 SLKCGDCGALLRSVQEAQEHAELTSHSNFSESTEAVLNLVCAT-CGKPCRSKTETDLHRK 63
S KC DCG LL + HA T H NFSES+EA+ L K + + +H+
Sbjct: 66 SFKCDDCGKLLANDDAIMFHASKTKHENFSESSEAIKPLTAEEKAAKVLEIREKIKVHQA 125
Query: 64 RTGHTDFVDKTSEAAKPISLEVPKATADSEEA 95
+ + ++ E K E KA +EA
Sbjct: 126 KKAKLE-AEENREKEKK-RREDGKAMISHKEA 155
Score = 49 (22.3 bits), Expect = 0.00059, Sum P(2) = 0.00059
Identities = 9/43 (20%), Positives = 23/43 (53%)
Query: 286 EEKKSMLPIRPATKVEQMRECLRSLKQNHKDDDAKVKRAFQTL 328
E+ K+M+ + A + ++RE + ++ +D+ KR + +
Sbjct: 144 EDGKAMISHKEAARDREIREAAQDRRREKNEDEIARKRVLEQI 186
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
  Query                       -----  As Used  -----    -----  Computed  ----
  Frame  MatID  Matrix name   Lambda  K       H        Lambda  K       H
    +0     0    BLOSUM62      0.312   0.128   0.361    same    same    same
                Q=9,R=2       0.244   0.0300  0.180    n/a     n/a     n/a
  Query
  Frame  MatID  Length  Eff.Length          E    S  W   T   X    E2  S2
    +0     0       367         285    0.00085  115  3  11  23  0.44  34
                                                           33  0.42  37
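The bit scores printed next to each raw score in this report (for example "Score = 628 (226.1 bits)") appear to follow the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the Lambda and K values from the "Q=9,R=2" row above. That this is exactly how the program derives them is an assumption; the sketch below only shows that the formula reproduces the values reported here. The function and constant names are illustrative and not part of any BLAST tool.

```python
import math

# Lambda and K copied from the "Q=9,R=2" row of the parameters table.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the report; compare with the printed bit values.
    for raw in (628, 447, 249, 127):
        print(raw, round(bit_score(raw), 1))
```

Running it prints 226.1, 162.4, 92.7 and 49.8 for the four raw scores, matching the bit values shown in the corresponding "Score =" lines.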
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 6
No. of states in DFA: 606 (64 KB)
Total size of DFA: 190 KB (2108 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 27.10u 0.15s 27.25t Elapsed: 00:00:02
Total cpu time: 27.10u 0.15s 27.25t Elapsed: 00:00:02
Start: Thu May 9 16:11:21 2013 End: Thu May 9 16:11:23 2013
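The one-line hit summary near the top of this report (the rows under "Sequences producing High-scoring Segment Pairs") can be pulled into structured records with a small parser. The following is a minimal sketch, assuming each row has the shape `<id> - <description> <score> <P(N)> <N>` as it does here; the `Hit` type, the regular expression, and the function name are hypothetical and do not come from any BLAST library.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    seq_id: str
    description: str
    score: int
    p_value: float
    n: int


# id, " - ", free-text description, then three trailing numeric columns.
HIT_RE = re.compile(
    r"^(\S+)\s+-\s+(.*?)\s+(\d+)\s+(\d[\d.]*(?:[eE][+-]?\d+)?)\s+(\d+)$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line does not match."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    seq_id, desc, score, p_value, n = m.groups()
    return Hit(seq_id, desc, int(score), float(p_value), int(n))


if __name__ == "__main__":
    row = 'TAIR|locus:2010637 - symbol:AT1G04850 species:3702 "Arabi... 628 1.2e-106 2'
    print(parse_hit_line(row))
```

Because the pattern matches runs of whitespace rather than fixed columns, it should accept the rows whether or not their column padding has been preserved.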