BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>036026
MSKSVEGVLAEKYPAYKDRHKLALEREKQIKEKAEKARAYRFRDNSNFDSKHPTLPPKLA
LLKEKPIVSGDSSDQSHDDRAAESQTISKMKFSQIEKRPPRVFRPPPKPSGGAPAGTNAN
PSSGTPPAPPPPPGATPPPPPPPPPGGPPPPPPPPGSLPRGVGSGDKVQRAPELVEFYQT
LMKREAKKDTSSLISSTSNTSDARSNMIGEIENKSSFLLAVKADVETQGDFVQSLAAEVR
AASFTTVEDLVVFVNWLDEELSFLVDERAVLKHFDWPEGKADALREAAFEYQDLVKLEKQ
VSSFVDDPGLPCESALKKMYKLLEKVEQSVYALLRTRDMAISRYREFGIPVDWLLDTGVV
GKIKLSSVQLARKYMKRVSTELEAMSRPEKEPNREFLLLQGVRFAFRVHQFAGGFDAESM
KAFEVLRSRVHKQTVEDNKQEA

High Scoring Gene Products

Symbol, full name Information P value
CHUP1
CHLOROPLAST UNUSUAL POSITIONING 1
protein from Arabidopsis thaliana 3.7e-133
AT4G18570 protein from Arabidopsis thaliana 5.0e-76
AT1G48280 protein from Arabidopsis thaliana 2.3e-64
AT1G07120 protein from Arabidopsis thaliana 2.5e-56

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  036026
        (442 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2102385 - symbol:CHUP1 "CHLOROPLAST UNUSUAL PO...  1131  3.7e-133  2
TAIR|locus:2831359 - symbol:AT4G18570 species:3702 "Arabi...   766  5.0e-76   1
TAIR|locus:2007755 - symbol:AT1G48280 "AT1G48280" species...   656  2.3e-64   1
TAIR|locus:2007477 - symbol:AT1G07120 "AT1G07120" species...   580  2.5e-56   1


>TAIR|locus:2102385 [details] [associations]
            symbol:CHUP1 "CHLOROPLAST UNUSUAL POSITIONING 1"
            species:3702 "Arabidopsis thaliana" [GO:0005634 "nucleus"
            evidence=ISM] [GO:0009507 "chloroplast" evidence=IDA] [GO:0009707
            "chloroplast outer membrane" evidence=IDA] [GO:0009902 "chloroplast
            relocation" evidence=RCA;IMP] [GO:0006364 "rRNA processing"
            evidence=RCA] [GO:0010027 "thylakoid membrane organization"
            evidence=RCA] [GO:0010207 "photosystem II assembly" evidence=RCA]
            [GO:0019684 "photosynthesis, light reaction" evidence=RCA]
            [GO:0034660 "ncRNA metabolic process" evidence=RCA] [GO:0035304
            "regulation of protein dephosphorylation" evidence=RCA] [GO:0042793
            "transcription from plastid promoter" evidence=RCA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=RCA]
            EMBL:CP002686 GenomeReviews:BA000014_GR EMBL:AP001313 GO:GO:0009707
            EMBL:AB087408 IPI:IPI00541711 RefSeq:NP_001189974.1
            RefSeq:NP_189197.2 UniGene:At.27741 ProteinModelPortal:Q9LI74
            STRING:Q9LI74 PaxDb:Q9LI74 PRIDE:Q9LI74 EnsemblPlants:AT3G25690.1
            EnsemblPlants:AT3G25690.2 GeneID:822157 KEGG:ath:AT3G25690
            TAIR:At3g25690 eggNOG:NOG310144 HOGENOM:HOG000242560
            InParanoid:Q9LI74 OMA:ELRNYQT PhylomeDB:Q9LI74
            ProtClustDB:CLSN2680868 Genevestigator:Q9LI74 GO:GO:0009902
            Uniprot:Q9LI74
        Length = 1004

 Score = 1131 (403.2 bits), Expect = 3.7e-133, Sum P(2) = 3.7e-133
 Identities = 224/284 (78%), Positives = 250/284 (88%)

Query:   157 SLPRGVGSGDKVQRAPELVEFYQTLMKREAKKDXXXXXXXXXXXXD--ARSNMIGEIENK 214
             +L RG G G+KV RAPELVEFYQ+LMKRE+KK+               AR+NMIGEIEN+
Sbjct:   707 ALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENR 766

Query:   215 SSFLLAVKADVETQGDFVQSLAAEVRAASFTTVEDLVVFVNWLDEELSFLVDERAVLKHF 274
             S+FLLAVKADVETQGDFVQSLA EVRA+SFT +EDL+ FV+WLDEELSFLVDERAVLKHF
Sbjct:   767 STFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHF 826

Query:   275 DWPEGKADALREAAFEYQDLVKLEKQVSSFVDDPGLPCESALKKMYKLLEKVEQSVYALL 334
             DWPEGKADALREAAFEYQDL+KLEKQV+SFVDDP L CE ALKKMYKLLEKVEQSVYALL
Sbjct:   827 DWPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALL 886

Query:   335 RTRDMAISRYREFGIPVDWLLDTGVVGKIKLSSVQLARKYMKRVSTELEAMSRPEKEPNR 394
             RTRDMAISRY+EFGIPVDWL DTGVVGKIKLSSVQLA+KYMKRV+ EL+++S  +K+PNR
Sbjct:   887 RTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNR 946

Query:   395 EFLLLQGVRFAFRVHQFAGGFDAESMKAFEVLRSRVHKQTVEDN 438
             EFLLLQGVRFAFRVHQFAGGFDAESMKAFE LRSR   ++ ++N
Sbjct:   947 EFLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKTESGDNN 990

 Score = 195 (73.7 bits), Expect = 3.7e-133, Sum P(2) = 3.7e-133
 Identities = 46/100 (46%), Positives = 60/100 (60%)

Query:     1 MSKSVEGVLAEKYPAYKDRHKLALEREKQIKEKAEKARAYRFRDNSNFDSXXXXXXXXXX 60
             MSKSV+ VL EKYPAYKDRHKLA+EREK IK KA++ARA RF  N               
Sbjct:   546 MSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRV 605

Query:    61 XXXXXXIVSGDSSDQSHDD---RAAESQ-TISKMKFSQIE 96
                     +GD S++S++    +A+E+  T++KMK   IE
Sbjct:   606 VVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIE 645


>TAIR|locus:2831359 [details] [associations]
            symbol:AT4G18570 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] EMBL:CP002687 EMBL:AY128285
            EMBL:BT004523 IPI:IPI00524553 RefSeq:NP_193591.2 UniGene:At.23905
            UniGene:At.70231 ProteinModelPortal:Q8L7S5 PRIDE:Q8L7S5
            EnsemblPlants:AT4G18570.1 GeneID:827588 KEGG:ath:AT4G18570
            TAIR:At4g18570 HOGENOM:HOG000070980 InParanoid:Q8L7S5 OMA:FEWPEQK
            PhylomeDB:Q8L7S5 ProtClustDB:CLSN2918131 Genevestigator:Q8L7S5
            Uniprot:Q8L7S5
        Length = 642

 Score = 766 (274.7 bits), Expect = 5.0e-76, P = 5.0e-76
 Identities = 159/280 (56%), Positives = 200/280 (71%)

Query:   167 KVQRAPELVEFYQTLMKREA---KKDXXXXXXXXXXXXDARSN---MIGEIENKSSFLLA 220
             KV+R PE+VEFY +LM+R++   ++D             A SN   MIGEIEN+S +LLA
Sbjct:   352 KVRRVPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNARDMIGEIENRSVYLLA 411

Query:   221 VKADVETQGDFVQSLAAEVRAASFTTVEDLVVFVNWLDEELSFLVDERAVLKHFDWPEGK 280
             +K DVETQGDF++ L  EV  A+F+ +ED+V FV WLD+ELS+LVDERAVLKHF+WPE K
Sbjct:   412 IKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLVDERAVLKHFEWPEQK 471

Query:   281 ADALREAAFEYQDLVKLEKQVSSFVDDPGLPCESALKKMYKLLEKVEQSVYALLRTRDMA 340
             ADALREAAF Y DL KL  + S F +DP     SALKKM  L EK+E  VY+L R R+ A
Sbjct:   472 ADALREAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALFEKLEHGVYSLSRMRESA 531

Query:   341 ISRYREFGIPVDWLLDTGVVGKIKLSSVQLARKYMKRVSTELEAMSRPEKEPNREFLLLQ 400
              ++++ F IPVDW+L+TG+  +IKL+SV+LA KYMKRVS ELEA+      P  E L++Q
Sbjct:   532 ATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELEAIEGGG--PEEEELIVQ 589

Query:   401 GVRFAFRVHQFAGGFDAESMKAFEVLRSRVHKQTVEDNKQ 440
             GVRFAFRVHQFAGGFDAE+MKAFE LR +     V+   Q
Sbjct:   590 GVRFAFRVHQFAGGFDAETMKAFEELRDKARSCHVQCQSQ 629


>TAIR|locus:2007755 [details] [associations]
            symbol:AT1G48280 "AT1G48280" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM] EMBL:CP002684 EMBL:AC007932
            IPI:IPI00530542 PIR:G96522 RefSeq:NP_564524.1 UniGene:At.26293
            PRIDE:Q9SX62 EnsemblPlants:AT1G48280.1 GeneID:841248
            KEGG:ath:AT1G48280 TAIR:At1g48280 HOGENOM:HOG000153560
            InParanoid:Q9SX62 OMA:GEIQNRS PhylomeDB:Q9SX62
            ProtClustDB:CLSN2721717 Genevestigator:Q9SX62 Uniprot:Q9SX62
        Length = 558

 Score = 656 (236.0 bits), Expect = 2.3e-64, P = 2.3e-64
 Identities = 124/273 (45%), Positives = 191/273 (69%)

Query:   159 PRGVGSGDKVQRAPELVEFYQTLMKREAKKDXXXXXXXXXXXXD-ARSNMIGEIENKSSF 217
             PR +    + Q++P + + +Q L K++  ++            + A ++++GEI+N+S+ 
Sbjct:   270 PRPLAKAARAQKSPPVSQLFQLLNKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAH 329

Query:   218 LLAVKADVETQGDFVQSLAAEVRAASFTTVEDLVVFVNWLDEELSFLVDERAVLKHFDWP 277
             L+A+KAD+ET+G+F+  L  +V    F+ +ED++ FV+WLD+EL+ L DERAVLKHF WP
Sbjct:   330 LIAIKADIETKGEFINDLIQKVLTTCFSDMEDVMKFVDWLDKELATLADERAVLKHFKWP 389

Query:   278 EGKADALREAAFEYQDLVKLEKQVSSFVDDPGLPCESALKKMYKLLEKVEQSVYALLRTR 337
             E KAD L+EAA EY++L KLEK++SS+ DDP +    ALKKM  LL+K EQ +  L+R R
Sbjct:   390 EKKADTLQEAAVEYRELKKLEKELSSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLR 449

Query:   338 DMAISRYREFGIPVDWLLDTGVVGKIKLSSVQLARKYMKRVSTELEAMSRPEKEPNREFL 397
               ++  Y++F IPV+W+LD+G++ KIK +S++LA+ YM RV+ EL++    ++E  +E L
Sbjct:   450 GSSMRSYQDFKIPVEWMLDSGMICKIKRASIKLAKTYMNRVANELQSARNLDRESTKEAL 509

Query:   398 LLQGVRFAFRVHQFAGGFDAESMKAFEVLRSRV 430
             LLQGVRFA+R HQFAGG D E++ A E ++ RV
Sbjct:   510 LLQGVRFAYRTHQFAGGLDPETLCALEEIKQRV 542


>TAIR|locus:2007477 [details] [associations]
            symbol:AT1G07120 "AT1G07120" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0009941 "chloroplast envelope" evidence=IDA]
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0009941 EMBL:AC067971
            IPI:IPI00537623 PIR:A86206 RefSeq:NP_172192.1 UniGene:At.28020
            EnsemblPlants:AT1G07120.1 GeneID:837222 KEGG:ath:AT1G07120
            TAIR:At1g07120 eggNOG:NOG239801 HOGENOM:HOG000070965
            InParanoid:Q9LMK4 OMA:MREGDAC PhylomeDB:Q9LMK4
            ProtClustDB:CLSN2682542 Genevestigator:Q9LMK4 Uniprot:Q9LMK4
        Length = 392

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 121/261 (46%), Positives = 173/261 (66%)

Query:   168 VQRAPELVEFYQTLMKREAKKDXXXXXXXXXXXXDARSNMIGEIENKSSFLLAVKADVET 227
             V+RAPE+VEFY+ L KRE+                 R NMIGEIEN+S +L  +K+D + 
Sbjct:   128 VRRAPEVVEFYRALTKRESHMGNKINQNGVLSPAFNR-NMIGEIENRSKYLSDIKSDTDR 186

Query:   228 QGDFVQSLAAEVRAASFTTVEDLVVFVNWLDEELSFLVDERAVLKHFD-WPEGKADALRE 286
               D +  L ++V AA+FT + ++  FV W+DEELS LVDERAVLKHF  WPE K D+LRE
Sbjct:   187 HRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVLKHFPKWPERKVDSLRE 246

Query:   287 AAFEYQDLVKLEKQVSSFVDDPGLPCESALKKMYKLLEKVEQSVYALLRTRDMAISRYRE 346
             AA  Y+    L  ++ SF D+P      AL+++  L +++E+SV    + RD    RY++
Sbjct:   247 AACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQDRLEESVNNTEKMRDSTGKRYKD 306

Query:   347 FGIPVDWLLDTGVVGKIKLSSVQLARKYMKRVSTELEAMSRPEKEPNREFLLLQGVRFAF 406
             F IP +W+LDTG++G++K SS++LA++YMKR++ ELE+ +   KE N   L+LQGVRFA+
Sbjct:   307 FQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIAKELES-NGSGKEGN---LMLQGVRFAY 362

Query:   407 RVHQFAGGFDAESMKAFEVLR 427
              +HQFAGGFD E++  F  L+
Sbjct:   363 TIHQFAGGFDGETLSIFHELK 383


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.130   0.356    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      442       358   0.00081  117 3  11 22  0.42    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  595 (63 KB)
  Total size of DFA:  200 KB (2113 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.98u 0.08s 28.06t   Elapsed:  00:00:01
  Total cpu time:  27.98u 0.08s 28.06t   Elapsed:  00:00:01
  Start:  Mon May 20 21:49:21 2013   End:  Mon May 20 21:49:22 2013

Back to top