BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>008574
MKQHQELSKTNNMSHSTAATTTFRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPAD
VKTNNISKSRRALILNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDAN
PGKIEDGLMDKKKKEFEEKLMLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVE
DLVAAEAKIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSINTPPS
EPKIPIRNAAGVERKPQAYPSMPAPLPPPPPPRPPARAAATQKTPSFAQLYHSLTKQVEK
KDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIETKGGFINSLIQKVLAAAYTN
IEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMREAAVEYRDLKQLENEISSYRD
DTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDCKIPVDWMLDSGIISKIKQA
SMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYRAHQFVGGLDSETLCAFEEI
RQRVPQHLGGSHKLLAGISSS

High Scoring Gene Products

Symbol, full name Information P value
AT1G48280 protein from Arabidopsis thaliana 8.2e-123
CHUP1
CHLOROPLAST UNUSUAL POSITIONING 1
protein from Arabidopsis thaliana 5.9e-77
AT4G18570 protein from Arabidopsis thaliana 5.4e-65
AT1G07120 protein from Arabidopsis thaliana 2.3e-58

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  008574
        (561 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2007755 - symbol:AT1G48280 "AT1G48280" species...   943  8.2e-123  2
TAIR|locus:2102385 - symbol:CHUP1 "CHLOROPLAST UNUSUAL PO...   704  5.9e-77   3
TAIR|locus:2831359 - symbol:AT4G18570 species:3702 "Arabi...   635  5.4e-65   2
TAIR|locus:2007477 - symbol:AT1G07120 "AT1G07120" species...   560  2.3e-58   2


>TAIR|locus:2007755 [details] [associations]
            symbol:AT1G48280 "AT1G48280" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM] EMBL:CP002684 EMBL:AC007932
            IPI:IPI00530542 PIR:G96522 RefSeq:NP_564524.1 UniGene:At.26293
            PRIDE:Q9SX62 EnsemblPlants:AT1G48280.1 GeneID:841248
            KEGG:ath:AT1G48280 TAIR:At1g48280 HOGENOM:HOG000153560
            InParanoid:Q9SX62 OMA:GEIQNRS PhylomeDB:Q9SX62
            ProtClustDB:CLSN2721717 Genevestigator:Q9SX62 Uniprot:Q9SX62
        Length = 558

 Score = 943 (337.0 bits), Expect = 8.2e-123, Sum P(2) = 8.2e-123
 Identities = 183/276 (66%), Positives = 224/276 (81%)

Query:   282 QKTPSFAQLYHSLTKQVEKKDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIET 341
             QK+P  +QL+  L KQ   ++L   VN  +  V+ AH+SIVGEIQNRSAHL+AIKADIET
Sbjct:   280 QKSPPVSQLFQLLNKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIET 339

Query:   342 KGGFINSLIQKVLAAAYTNIEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMREA 401
             KG FIN LIQKVL   ++++ED+++FVDWLDKEL++LADERAVLKHFKWPEKKAD ++EA
Sbjct:   340 KGEFINDLIQKVLTTCFSDMEDVMKFVDWLDKELATLADERAVLKHFKWPEKKADTLQEA 399

Query:   402 AVEYRDLKQLENEISSYRDDTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDC 461
             AVEYR+LK+LE E+SSY DD N+ +G ALKKMA+LLDKSE+ I+RLV+LR S M SY+D 
Sbjct:   400 AVEYRELKKLEKELSSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGSSMRSYQDF 459

Query:   462 KIPVDWMLDSGIISKIKQASMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYR 521
             KIPV+WMLDSG+I KIK+AS+KLA+ YM RV  EL+   N DREST+EALLLQG+ FAYR
Sbjct:   460 KIPVEWMLDSGMICKIKRASIKLAKTYMNRVANELQSARNLDRESTKEALLLQGVRFAYR 519

Query:   522 AHQFVGGLDSETLCAFEEIRQRVPQHLGGSHKLLAG 557
              HQF GGLD ETLCA EEI+QRVP HL  +   +AG
Sbjct:   520 THQFAGGLDPETLCALEEIKQRVPSHLRLARGNMAG 555

 Score = 285 (105.4 bits), Expect = 8.2e-123, Sum P(2) = 8.2e-123
 Identities = 93/234 (39%), Positives = 131/234 (55%)

Query:    16 STAATTTFRLRA-NSKTRESPKQEA-GINGVSLSPELKARAKSVPADVKTNNISKSRRAL 73
             ST +TT  R+RA NS      K  A   NG++       + KS   DVK N+ +K RR++
Sbjct:     5 STTSTTPSRVRAANSHYSVISKPRAQDDNGLT-----GGKPKSSGYDVK-NDPAK-RRSI 57

Query:    74 ILNKPKSAEGAVGSHKDDEVKVFGRSLNRP-VVEQFARPRR------QRIVDANPGKIED 126
             +L + KSAE  +            RS+NRP VVEQF  PRR      +  V A     ED
Sbjct:    58 LLKRAKSAEEEMAVLAPQRA----RSVNRPAVVEQFGCPRRPISRKSEETVMATAAA-ED 112

Query:   127 GLMDXXXXXXXXXLMLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAE 186
                          L+++E+L+KDLQ +V  LK E  +A++ N ELE  N+KL +DLV+AE
Sbjct:   113 EKRKRMEELEEK-LVVNESLIKDLQLQVLNLKTELEEARNSNVELELNNRKLSQDLVSAE 171

Query:   187 AKIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSINTPPS 240
             AKI+SLSS ++     E+Q+ +FKD+Q+LIA+KLE   V  +   E+S  +PPS
Sbjct:   172 AKISSLSSNDK--PAKEHQNSRFKDIQRLIASKLEQPKVKKEVAVESSRLSPPS 223


>TAIR|locus:2102385 [details] [associations]
            symbol:CHUP1 "CHLOROPLAST UNUSUAL POSITIONING 1"
            species:3702 "Arabidopsis thaliana" [GO:0005634 "nucleus"
            evidence=ISM] [GO:0009507 "chloroplast" evidence=IDA] [GO:0009707
            "chloroplast outer membrane" evidence=IDA] [GO:0009902 "chloroplast
            relocation" evidence=RCA;IMP] [GO:0006364 "rRNA processing"
            evidence=RCA] [GO:0010027 "thylakoid membrane organization"
            evidence=RCA] [GO:0010207 "photosystem II assembly" evidence=RCA]
            [GO:0019684 "photosynthesis, light reaction" evidence=RCA]
            [GO:0034660 "ncRNA metabolic process" evidence=RCA] [GO:0035304
            "regulation of protein dephosphorylation" evidence=RCA] [GO:0042793
            "transcription from plastid promoter" evidence=RCA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=RCA]
            EMBL:CP002686 GenomeReviews:BA000014_GR EMBL:AP001313 GO:GO:0009707
            EMBL:AB087408 IPI:IPI00541711 RefSeq:NP_001189974.1
            RefSeq:NP_189197.2 UniGene:At.27741 ProteinModelPortal:Q9LI74
            STRING:Q9LI74 PaxDb:Q9LI74 PRIDE:Q9LI74 EnsemblPlants:AT3G25690.1
            EnsemblPlants:AT3G25690.2 GeneID:822157 KEGG:ath:AT3G25690
            TAIR:At3g25690 eggNOG:NOG310144 HOGENOM:HOG000242560
            InParanoid:Q9LI74 OMA:ELRNYQT PhylomeDB:Q9LI74
            ProtClustDB:CLSN2680868 Genevestigator:Q9LI74 GO:GO:0009902
            Uniprot:Q9LI74
        Length = 1004

 Score = 704 (252.9 bits), Expect = 5.9e-77, Sum P(3) = 5.9e-77
 Identities = 136/271 (50%), Positives = 193/271 (71%)

Query:   283 KTPSFAQLYHSLTKQVEKKD-LPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIET 341
             + P   + Y SL K+  KK+  PS ++      S A ++++GEI+NRS  LLA+KAD+ET
Sbjct:   720 RAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVET 779

Query:   342 KGGFINSLIQKVLAAAYTNIEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMREA 401
             +G F+ SL  +V A+++T+IEDLL FV WLD+ELS L DERAVLKHF WPE KADA+REA
Sbjct:   780 QGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFDWPEGKADALREA 839

Query:   402 AVEYRDLKQLENEISSYRDDTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDC 461
             A EY+DL +LE +++S+ DD N+    ALKKM  LL+K E+S+  L++ R+  +  YK+ 
Sbjct:   840 AFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDMAISRYKEF 899

Query:   462 KIPVDWMLDSGIISKIKQASMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYR 521
              IPVDW+ D+G++ KIK +S++LA+ YMKRV  EL+ V  SD++  +E LLLQG+ FA+R
Sbjct:   900 GIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFR 959

Query:   522 AHQFVGGLDSETLCAFEEIRQRVPQHLGGSH 552
              HQF GG D+E++ AFEE+R R     G ++
Sbjct:   960 VHQFAGGFDAESMKAFEELRSRAKTESGDNN 990

 Score = 70 (29.7 bits), Expect = 5.9e-77, Sum P(3) = 5.9e-77
 Identities = 43/174 (24%), Positives = 81/174 (46%)

Query:    36 KQEAGINGVSLSPELKARAKSVPA-DVKTNNISKSRRALILNKPKSAEGAVGSHKDDEVK 94
             +QE+ I  V L  +LK +   +   ++  N++   R+ L   +  S  G V   K+ EV 
Sbjct:   160 EQESDI--VELQRQLKIKTVEIDMLNITINSLQAERKKL--QEELSQNGIV--RKELEVA 213

Query:    95 VFGRSLNRPVVEQFARPRRQRIVDANPGKIEDGLMDXXXXXXXXXLMLSENLVKDLQSE- 153
                    R  +++    +RQ  +DAN  K +  L+          +   E + KD + E 
Sbjct:   214 -------RNKIKEL---QRQIQLDANQTKGQ--LLLLKQHVSSLQMKEEEAMNKDTEVER 261

Query:   154 ----VFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEAKIASLSSREQREAVGE 203
                 V  L+ + ++ +  N EL+ + ++L   L +AEA+IA+LS+  + + V +
Sbjct:   262 KLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAK 315

 Score = 43 (20.2 bits), Expect = 1.0e-71, Sum P(2) = 1.0e-71
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query:   229 AISETSINTPPSEPKIPIRNAAGVERK 255
             A++   +N  PS+P  P  N  G +++
Sbjct:    15 AVTVKRLNVKPSKPSKPSDNGEGGDKE 41

 Score = 37 (18.1 bits), Expect = 5.9e-77, Sum P(3) = 5.9e-77
 Identities = 7/14 (50%), Positives = 9/14 (64%)

Query:   238 PPSEPKIPIRNAAG 251
             PP  P+ P R+A G
Sbjct:   648 PPRVPRPPPRSAGG 661


>TAIR|locus:2831359 [details] [associations]
            symbol:AT4G18570 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] EMBL:CP002687 EMBL:AY128285
            EMBL:BT004523 IPI:IPI00524553 RefSeq:NP_193591.2 UniGene:At.23905
            UniGene:At.70231 ProteinModelPortal:Q8L7S5 PRIDE:Q8L7S5
            EnsemblPlants:AT4G18570.1 GeneID:827588 KEGG:ath:AT4G18570
            TAIR:At4g18570 HOGENOM:HOG000070980 InParanoid:Q8L7S5 OMA:FEWPEQK
            PhylomeDB:Q8L7S5 ProtClustDB:CLSN2918131 Genevestigator:Q8L7S5
            Uniprot:Q8L7S5
        Length = 642

 Score = 635 (228.6 bits), Expect = 5.4e-65, Sum P(2) = 5.4e-65
 Identities = 131/268 (48%), Positives = 185/268 (69%)

Query:   282 QKTPSFAQLYHSLTKQVE---KKDLPSPVNQKRPAVSIAHSS---IVGEIQNRSAHLLAI 335
             ++ P   + YHSL ++     ++D     N    A+ +A+S+   ++GEI+NRS +LLAI
Sbjct:   354 RRVPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEAI-LANSNARDMIGEIENRSVYLLAI 412

Query:   336 KADIETKGGFINSLIQKVLAAAYTNIEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKA 395
             K D+ET+G FI  LI++V  AA+++IED++ FV WLD ELS L DERAVLKHF+WPE+KA
Sbjct:   413 KTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLVDERAVLKHFEWPEQKA 472

Query:   396 DAMREAAVEYRDLKQLENEISSYRDDTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVM 455
             DA+REAA  Y DLK+L +E S +R+D      +ALKKM +L +K E  +  L ++R S  
Sbjct:   473 DALREAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALFEKLEHGVYSLSRMRESAA 532

Query:   456 HSYKDCKIPVDWMLDSGIISKIKQASMKLAQMYMKRVTRELELVHNSDRESTQEALLLQG 515
               +K  +IPVDWML++GI S+IK AS+KLA  YMKRV+ ELE +     E  +E L++QG
Sbjct:   533 TKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELEAIEGGGPE--EEELIVQG 590

Query:   516 LHFAYRAHQFVGGLDSETLCAFEEIRQR 543
             + FA+R HQF GG D+ET+ AFEE+R +
Sbjct:   591 VRFAFRVHQFAGGFDAETMKAFEELRDK 618

 Score = 45 (20.9 bits), Expect = 5.4e-65, Sum P(2) = 5.4e-65
 Identities = 21/91 (23%), Positives = 43/91 (47%)

Query:   140 LMLSENL-VKDLQSEVFA---LKAEFVKAQSLNAELEKQNKKLVEDLVAAEAKI-ASLSS 194
             L+ +ENL VK L+  V     L+++         EL K+  +L ED      +   S   
Sbjct:   103 LLKTENLEVKLLRESVSVIPLLESQIADKNGEIDELRKETARLAEDNERLRREFDRSEEM 162

Query:   195 REQREAVGEYQSPKFKDVQKLIANKLE-HSI 224
             R + E   +    +  +++KL++++ + H++
Sbjct:   163 RRECETREKEMEAEIVELRKLVSSESDDHAL 193

 Score = 37 (18.1 bits), Expect = 3.7e-64, Sum P(2) = 3.7e-64
 Identities = 6/17 (35%), Positives = 10/17 (58%)

Query:   291 YHSLTKQVEKKDLPSPV 307
             +H      + KD+PSP+
Sbjct:    12 FHKSPSTKKTKDMPSPL 28


>TAIR|locus:2007477 [details] [associations]
            symbol:AT1G07120 "AT1G07120" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0009941 "chloroplast envelope" evidence=IDA]
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0009941 EMBL:AC067971
            IPI:IPI00537623 PIR:A86206 RefSeq:NP_172192.1 UniGene:At.28020
            EnsemblPlants:AT1G07120.1 GeneID:837222 KEGG:ath:AT1G07120
            TAIR:At1g07120 eggNOG:NOG239801 HOGENOM:HOG000070965
            InParanoid:Q9LMK4 OMA:MREGDAC PhylomeDB:Q9LMK4
            ProtClustDB:CLSN2682542 Genevestigator:Q9LMK4 Uniprot:Q9LMK4
        Length = 392

 Score = 560 (202.2 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 115/262 (43%), Positives = 171/262 (65%)

Query:   282 QKTPSFAQLYHSLTKQVEKKDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIET 341
             ++ P   + Y +LTK+  +  + + +NQ        + +++GEI+NRS +L  IK+D + 
Sbjct:   129 RRAPEVVEFYRALTKR--ESHMGNKINQNGVLSPAFNRNMIGEIENRSKYLSDIKSDTDR 186

Query:   342 KGGFINSLIQKVLAAAYTNIEDLLEFVDWLDKELSSLADERAVLKHF-KWPEKKADAMRE 400
                 I+ LI KV AA +T+I ++  FV W+D+ELSSL DERAVLKHF KWPE+K D++RE
Sbjct:   187 HRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVLKHFPKWPERKVDSLRE 246

Query:   401 AAVEYRDLKQLENEISSYRDDTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKD 460
             AA  Y+  K L NEI S++D+       AL+++ SL D+ E S+    K+R+S    YKD
Sbjct:   247 AACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQDRLEESVNNTEKMRDSTGKRYKD 306

Query:   461 CKIPVDWMLDSGIISKIKQASMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAY 520
              +IP +WMLD+G+I ++K +S++LAQ YMKR+ +ELE  + S +E     L+LQG+ FAY
Sbjct:   307 FQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIAKELES-NGSGKEGN---LMLQGVRFAY 362

Query:   521 RAHQFVGGLDSETLCAFEEIRQ 542
               HQF GG D ETL  F E+++
Sbjct:   363 TIHQFAGGFDGETLSIFHELKK 384

 Score = 57 (25.1 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 17/54 (31%), Positives = 33/54 (61%)

Query:   149 DLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEAKIASLSSRE-QREAV 201
             DL   V  L+A  V+    N +LEK+N +L +++    A++++L S E +R+++
Sbjct:    10 DLLRLVKELQAYLVR----NDKLEKENHELRQEVARLRAQVSNLKSHENERKSM 59


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.127   0.344    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      561       527   0.00091  119 3  11 23  0.44    34
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  608 (65 KB)
  Total size of DFA:  249 KB (2134 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  53.87u 0.12s 53.99t   Elapsed:  00:00:03
  Total cpu time:  53.87u 0.12s 53.99t   Elapsed:  00:00:03
  Start:  Tue May 21 10:42:18 2013   End:  Tue May 21 10:42:21 2013

Back to top