Your job contains 1 sequence.
>006998
MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVGVSNSEGGGSYLDMWQK
AVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERDRI
QRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPFVK
NSESNGTAEVPERDSSGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRD
LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSE
TNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGV
VCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ
GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWH
ERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGE
RWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR
KPSEFQEEPFEIQDKRSELQEP
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 006998
(622 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2079001 - symbol:AT3G55760 species:3702 "Arabi... 1612 1.1e-165 1
TAIR|locus:2035898 - symbol:AT1G42430 "AT1G42430" species... 634 4.8e-62 1
TAIR|locus:2179979 - symbol:KTF1 "AT5G04290" species:3702... 141 0.00030 3
>TAIR|locus:2079001 [details] [associations]
symbol:AT3G55760 species:3702 "Arabidopsis thaliana"
[GO:0009507 "chloroplast" evidence=IDA] [GO:0009570 "chloroplast
stroma" evidence=IDA] GO:GO:0009570 EMBL:CP002686
GenomeReviews:BA000014_GR EMBL:BT020590 IPI:IPI00536838
RefSeq:NP_001190098.1 RefSeq:NP_191135.1 RefSeq:NP_850708.1
UniGene:At.1705 ProteinModelPortal:Q5EAH9 IntAct:Q5EAH9
STRING:Q5EAH9 PaxDb:Q5EAH9 PRIDE:Q5EAH9 EnsemblPlants:AT3G55760.1
EnsemblPlants:AT3G55760.2 EnsemblPlants:AT3G55760.3 GeneID:824742
KEGG:ath:AT3G55760 TAIR:At3g55760 eggNOG:NOG137712
HOGENOM:HOG000243874 InParanoid:Q5EAH9 OMA:GWVHKYG PhylomeDB:Q5EAH9
ProtClustDB:CLSN2683991 Genevestigator:Q5EAH9 Uniprot:Q5EAH9
Length = 578
Score = 1612 (572.5 bits), Expect = 1.1e-165, P = 1.1e-165
Identities = 279/452 (61%), Positives = 335/452 (74%)
Query: 153 SSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERDSSGALSAGIFVPRSGTPG 212
++ +R + ++ S E P N+ ++ E P+ G S ++VPRS T G
Sbjct: 132 AAAAISAARAILASNNSGDGKEGFPNEDNTVTSEVTETPKNAKLGMWSRTVYVPRSETSG 191
Query: 213 NRTPAPGPDFWSWSPPEXXXXXXXXXXXLQMAEKSSVYPTPVNPVVEKARSVDILPIPFE 272
TP GPDFWSW+PP+ LQ EK + +PT NPV+EK +S D L IP+E
Sbjct: 192 TETP--GPDFWSWTPPQGSEISSVD---LQAVEKPAEFPTLPNPVLEKDKSADSLSIPYE 246
Query: 273 SKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSXXXXXXXXXLDKV 332
S LS + +PPF+SL+ V KE +ET + +L E DL + S LD +
Sbjct: 247 SMLSSERHSFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSL 304
Query: 333 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 392
DE +T G++ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD +EWQ+K+WEA+D+ G K
Sbjct: 305 DESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFK 364
Query: 393 ELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 452
ELGSEKSGRDATGNVWREFW ESM Q G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+
Sbjct: 365 ELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDAT 424
Query: 453 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKW 512
GK+EKWAHKWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER GDGW KW
Sbjct: 425 GKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKW 484
Query: 513 GDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHE 572
GDKWDENF+P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH
Sbjct: 485 GDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHV 544
Query: 573 QQETWYERFPHFGFYHCFDNSVQLREVRKPSE 604
QETWYE+FPHFGF+HCFDNSVQLR V+KPS+
Sbjct: 545 PQETWYEKFPHFGFFHCFDNSVQLRAVKKPSD 576
Score = 383 (139.9 bits), Expect = 1.9e-33, P = 1.9e-33
Identities = 161/577 (27%), Positives = 237/577 (41%)
Query: 22 FTTRRTTPQQINFWSRRTGAKV-GVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAE 80
FT T+ + + RTG ++ VSN EG SYLDMW+ AVDR++KE F+KIA ++
Sbjct: 35 FTAPVTSRRSLR--GSRTGVRILRVSN-EGRESYLDMWKNAVDREKKEKAFEKIAENVVA 91
Query: 81 SXXXXXXXXXXXXXLTEQLEKKSEEFSKILDVSKEERDRIQRLQVIDXXXXXXXXXXXXL 140
LEKKS+EF KIL+VS EERDRIQR+QV+D L
Sbjct: 92 VDGEKEKGG--------DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAIL 143
Query: 141 EEKNGSVVKNG----ESSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERDSS 196
N K G +++ T+EV+ KN++ G + +V SE++GT E P D
Sbjct: 144 ASNNSGDGKEGFPNEDNTVTSEVTE-TPKNAKL-GMWSRTVYVPRSETSGT-ETPGPD-- 198
Query: 197 GALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEXXXXXXXXXXXLQMAEKSSVYPTPVNP 256
F S TP + D + P L+ + + P
Sbjct: 199 -------FW--SWTPPQGSEISSVDLQAVEKP--AEFPTLPNPVLEKDKSADSLSIPYES 247
Query: 257 VVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEE--VSETNLET-----PSLE 309
++ R +P PFES L E + + P L E + +S N E SL+
Sbjct: 248 MLSSERHSFTIP-PFES-LIEVRKEAETKPSSETLSTEHDLDLISSANAEEVARVLDSLD 305
Query: 310 EERDLGALFSXXXXXXXXXLDKVDE------LATRGINPDGSRWWKETGIEQRPD-GVVC 362
E G ++K + RG+ DG W++ E D G
Sbjct: 306 ESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKE 365
Query: 363 RWTMTRGVSADEALEWQEKFWEAA--DELG--HKELGSEKSGRDATGNVWREFWTESMWQ 418
+ G A + W+E FW + E G H E ++K G+ G+ W+E W E +
Sbjct: 366 LGSEKSGRDATGNV-WRE-FWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEH-YD 422
Query: 419 NQGLVHLEKTADKW---GKN-----GNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQ 470
G EK A KW +N G+ W E+W E YD G + K+ KW
Sbjct: 423 ATG--KSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDG 480
Query: 471 LDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQG 530
D W ++W E ++ +K + W E GD W++ W E + + K G
Sbjct: 481 WDK-----WGDKWDENFNPSAQGVKQGETWWEGKHGDRWNR---SWGEGHNGSGWVHKYG 532
Query: 531 ETWWAGKYGERWN-----RTWGERHNGSGWVHKYGKS 562
++ GE W+ TW E+ G+ H + S
Sbjct: 533 KS----SSGEHWDTHVPQETWYEKFPHFGFFHCFDNS 565
>TAIR|locus:2035898 [details] [associations]
symbol:AT1G42430 "AT1G42430" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] EMBL:CP002684 IPI:IPI00525753 RefSeq:NP_174971.5
UniGene:At.39108 PRIDE:F4I9G2 DNASU:840847
EnsemblPlants:AT1G42430.1 GeneID:840847 KEGG:ath:AT1G42430
OMA:ANEKDWG Uniprot:F4I9G2
Length = 426
Score = 634 (228.2 bits), Expect = 4.8e-62, P = 4.8e-62
Identities = 123/280 (43%), Positives = 171/280 (61%)
Query: 329 LDKVDELATR-GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAAD 387
+D ++E G N DGS W++E+G + +G CRW+ G S D + EW E +WE +D
Sbjct: 134 IDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSD 193
Query: 388 ELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEK 444
G+KELG EKSG+++ G+ W E W E + Q++ L +E++A K K+G + W EK
Sbjct: 194 WTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEK 253
Query: 445 WWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC 504
WWE YDA G EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE
Sbjct: 254 WWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETE 304
Query: 505 EGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSS 564
G +KWGDKW+E F + G +QGETW +RW+RTWGE H G+G VHKYGKS++
Sbjct: 305 LG---TKWGDKWEEKFF-SGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTT 360
Query: 565 GELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSE 604
GE WD +ET+YE PH+G+ +S QL ++ P E
Sbjct: 361 GESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ-PRE 399
>TAIR|locus:2179979 [details] [associations]
symbol:KTF1 "AT5G04290" species:3702 "Arabidopsis
thaliana" [GO:0000166 "nucleotide binding" evidence=ISS]
[GO:0005634 "nucleus" evidence=ISM] [GO:0006306 "DNA methylation"
evidence=IMP] [GO:0030422 "production of siRNA involved in RNA
interference" evidence=IMP] InterPro:IPR017071 EMBL:CP002688
GO:GO:0006357 GO:GO:0030422 GO:GO:0006306 GO:GO:0032784
InterPro:IPR005824 SMART:SM00739 InterPro:IPR005100
PANTHER:PTHR11125:SF7 Pfam:PF03439 IPI:IPI00544683
RefSeq:NP_196049.1 UniGene:At.54715 ProteinModelPortal:F4JW79
SMR:F4JW79 IntAct:F4JW79 PRIDE:F4JW79 EnsemblPlants:AT5G04290.1
GeneID:830308 KEGG:ath:AT5G04290 OMA:SSWGKKD Uniprot:F4JW79
Length = 1493
Score = 141 (54.7 bits), Expect = 0.00031, Sum P(3) = 0.00030
Identities = 64/253 (25%), Positives = 102/253 (40%)
Query: 331 KVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELG 390
K D A+ G DG W K+ + DG G D W++KF + G
Sbjct: 947 KGDGAASWGKKDDGGSWGKKDD-GNKDDGGSSWGKKDDGQKDDGGSSWEKKF-DGGSSWG 1004
Query: 391 HKELGSEKSGR-DATGNVW-REFWTESMW--QNQGLVHLEKTAD---KWGKNGNGDE-WQ 442
K+ G G+ D G++W ++ S W ++ G K D WGK +G+ W
Sbjct: 1005 KKDDGGSSWGKKDDGGSLWGKKDDGGSSWGKEDDGGSLWGKKDDGESSWGKKDDGESSWG 1064
Query: 443 EKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAE 502
+K + + GK ++ + + D + G R G G G S ++ A
Sbjct: 1065 KKD-DGGSSWGKKDEGGYSEQTFDRGGR-GFGGRRGGGRRGGRDQFGRGSSFGNSEDPAP 1122
Query: 503 RCEGDGWSKWGDKWDENFDPNSHGVKQ----GETWWAGKYGERWNRTWGERHNGSGWVHK 558
+ G S WG K D + +S G + G +W GK +WG++++GSG
Sbjct: 1123 WSKPSGGSSWG-KQDGDGGGSSWGKENDAGGGSSW--GKQDNGVGSSWGKQNDGSGGGSS 1179
Query: 559 YGKSSS---GELW 568
+GK + G W
Sbjct: 1180 WGKQNDAGGGSSW 1192
Score = 42 (19.8 bits), Expect = 0.00031, Sum P(3) = 0.00030
Identities = 14/56 (25%), Positives = 28/56 (50%)
Query: 145 GSVVKNGESSGT-AEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERDSSGAL 199
G+V G++S + E S + K+ + +S A++ + + S+G + E G L
Sbjct: 784 GTVSGWGDTSASNVEASSWEKQGASTSNVADLGSWGTHGGSSGGNKQDEDSVWGKL 839
Score = 39 (18.8 bits), Expect = 0.00031, Sum P(3) = 0.00030
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 101 KKSEEFSKILDVSKEERDRI 120
K S F K D+++EE DRI
Sbjct: 92 KSSFVFPKEEDLNEEEFDRI 111
Score = 38 (18.4 bits), Expect = 0.00075, Sum P(3) = 0.00075
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 167 SESSGAAEISPFVKNSESNGTAEVPERDSSGALS 200
SESS E S + K S+G + +D + + S
Sbjct: 843 SESSQKKEESSWGKKGGSDGESSWGNKDGNSSAS 876
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.311 0.130 0.417 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 622 577 0.00081 120 3 11 23 0.37 35
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 3
No. of states in DFA: 629 (67 KB)
Total size of DFA: 431 KB (2199 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 48.94u 0.10s 49.04t Elapsed: 00:00:02
Total cpu time: 48.94u 0.10s 49.04t Elapsed: 00:00:02
Start: Sat May 11 09:55:43 2013 End: Sat May 11 09:55:45 2013