BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>047848
MAPEDDDSRIDSFQKERDARIALLEKENFELRQEVLRLKAQISSLKAHDNERKSMLWKKL
QNPNTDTSPQKQTDFVKTQEFQNLDGETFRPRPGFQELEAGKERSMKIQTPVAFPAPPPP
PLPSKFLAGSKTVRRVPEVVELYRSLTRKDAHMENRSNTTAAPVIAFTRNMIGEIENRST
YLSAIKTDVKKQKEFINFLIKEVESAVFDQISEVEAFVKWLDGELSSLVDERAVLKHFPQ
WPERKADTLREAACNYRDLKNLEQEVSSFEDNQKESLPQATRKMQALQDRRACWSKGTGK
KYRDFQIPCDWMMDSGLIGQMKVSSLRLAKEYMKRFAGGFDAETIQAFEELKKVGLSSHK

High Scoring Gene Products

Symbol, full name Information P value
AT1G07120 protein from Arabidopsis thaliana 4.5e-82
AT4G18570 protein from Arabidopsis thaliana 1.3e-57
CHUP1
CHLOROPLAST UNUSUAL POSITIONING 1
protein from Arabidopsis thaliana 6.8e-51
AT1G48280 protein from Arabidopsis thaliana 1.9e-50

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  047848
        (360 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2007477 - symbol:AT1G07120 "AT1G07120" species...   823  4.5e-82   1
TAIR|locus:2831359 - symbol:AT4G18570 species:3702 "Arabi...   524  1.3e-57   2
TAIR|locus:2102385 - symbol:CHUP1 "CHLOROPLAST UNUSUAL PO...   465  6.8e-51   3
TAIR|locus:2007755 - symbol:AT1G48280 "AT1G48280" species...   437  1.9e-50   3


>TAIR|locus:2007477 [details] [associations]
            symbol:AT1G07120 "AT1G07120" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0009941 "chloroplast envelope" evidence=IDA]
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0009941 EMBL:AC067971
            IPI:IPI00537623 PIR:A86206 RefSeq:NP_172192.1 UniGene:At.28020
            EnsemblPlants:AT1G07120.1 GeneID:837222 KEGG:ath:AT1G07120
            TAIR:At1g07120 eggNOG:NOG239801 HOGENOM:HOG000070965
            InParanoid:Q9LMK4 OMA:MREGDAC PhylomeDB:Q9LMK4
            ProtClustDB:CLSN2682542 Genevestigator:Q9LMK4 Uniprot:Q9LMK4
        Length = 392

 Score = 823 (294.8 bits), Expect = 4.5e-82, P = 4.5e-82
 Identities = 187/355 (52%), Positives = 235/355 (66%)

Query:     1 MAPE-DDDSRIDSFQKERDARIAL---LEKENFELRQEVLRLKAQISSLKAHDNERKSML 56
             M P  +DDS +    KE  A +     LEKEN ELRQEV RL+AQ+S+LK+H+NERKSML
Sbjct:     1 MLPNGEDDSDLLRLVKELQAYLVRNDKLEKENHELRQEVARLRAQVSNLKSHENERKSML 60

Query:    57 WKKLQNP----NTDTSPQKQTDFVKTQEFQNLDGETFR---PRPGFQELEAGKERSMKIQ 109
             WKKLQ+     NTD S  K  + VK+    N  G+  R   P+P  Q    G+  + K  
Sbjct:    61 WKKLQSSYDGSNTDGSNLKAPESVKS----NTKGQEVRNPNPKPTIQ----GQSTATKPP 112

Query:   110 TXXXXXXXXXXXXXSKFLAGSKTVRRVPEVVELYRSLTRKDAHMENRSNTTAAPVIAFTR 169
                           SK   G ++VRR PEVVE YR+LT++++HM N+ N       AF R
Sbjct:   113 PPPPLP--------SKRTLGKRSVRRAPEVVEFYRALTKRESHMGNKINQNGVLSPAFNR 164

Query:   170 NMIGEIENRSTYLSAIKTDVKKQKEFINFLIKEVESAVFDQISEVEAFVKWLDGELSSLV 229
             NMIGEIENRS YLS IK+D  + ++ I+ LI +VE+A F  ISEVE FVKW+D ELSSLV
Sbjct:   165 NMIGEIENRSKYLSDIKSDTDRHRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLV 224

Query:   230 DERAVLKHFPQWPERKADTLREAACNYRDLKNLEQEVSSFEDNQKESLPQATRKMQALQD 289
             DERAVLKHFP+WPERK D+LREAACNY+  KNL  E+ SF+DN K+SL QA +++Q+LQD
Sbjct:   225 DERAVLKHFPKWPERKVDSLREAACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQD 284

Query:   290 R-------RACWSKGTGKKYRDFQIPCDWMMDSGLIGQMKVSSLRLAKEYMKRFA 337
             R              TGK+Y+DFQIP +WM+D+GLIGQ+K SSLRLA+EYMKR A
Sbjct:   285 RLEESVNNTEKMRDSTGKRYKDFQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIA 339

 Score = 121 (47.7 bits), Expect = 0.00014, P = 0.00014
 Identities = 49/194 (25%), Positives = 93/194 (47%)

Query:   183 SAIKTDVKKQKEFINFLIKEVESAVFDQISEVEAFVKWLDGELSSLVDERAVLKHFPQ-- 240
             +A  TD+ + + F+ ++ +E+ S V D+ + ++ F KW + ++ SL +     K  P+  
Sbjct:   200 AATFTDISEVETFVKWIDEELSSLV-DERAVLKHFPKWPERKVDSLREAACNYKR-PKNL 257

Query:   241 ------WPERKADTLREAACNYRDLKN-LEQEVSSFEDNQKESLPQATRKMQALQDRRAC 293
                   + +   D+L +A    + L++ LE+ V++ E   ++S  +  +  Q   +    
Sbjct:   258 GNEILSFKDNPKDSLTQALQRIQSLQDRLEESVNNTE-KMRDSTGKRYKDFQIPWE---- 312

Query:   294 WSKGTGK----KYRDFQIPCDWMM---------DSGLIGQMKVSSLRLAKEYMKRFAGGF 340
             W   TG     KY   ++  ++M           SG  G + +  +R A   + +FAGGF
Sbjct:   313 WMLDTGLIGQLKYSSLRLAQEYMKRIAKELESNGSGKEGNLMLQGVRFAYT-IHQFAGGF 371

Query:   341 DAETIQAFEELKKV 354
             D ET+  F ELKK+
Sbjct:   372 DGETLSIFHELKKI 385


>TAIR|locus:2831359 [details] [associations]
            symbol:AT4G18570 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] EMBL:CP002687 EMBL:AY128285
            EMBL:BT004523 IPI:IPI00524553 RefSeq:NP_193591.2 UniGene:At.23905
            UniGene:At.70231 ProteinModelPortal:Q8L7S5 PRIDE:Q8L7S5
            EnsemblPlants:AT4G18570.1 GeneID:827588 KEGG:ath:AT4G18570
            TAIR:At4g18570 HOGENOM:HOG000070980 InParanoid:Q8L7S5 OMA:FEWPEQK
            PhylomeDB:Q8L7S5 ProtClustDB:CLSN2918131 Genevestigator:Q8L7S5
            Uniprot:Q8L7S5
        Length = 642

 Score = 524 (189.5 bits), Expect = 1.3e-57, Sum P(2) = 1.3e-57
 Identities = 118/230 (51%), Positives = 154/230 (66%)

Query:   127 LAGSKTVRRVPEVVELYRSLTRKDAHMENRSNT----TAAPVI---AFTRNMIGEIENRS 179
             +A +K VRRVPEVVE Y SL R+D+    R +T     AA  I   +  R+MIGEIENRS
Sbjct:   348 IASAK-VRRVPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNARDMIGEIENRS 406

Query:   180 TYLSAIKTDVKKQKEFINFLIKEVESAVFDQISEVEAFVKWLDGELSSLVDERAVLKHFP 239
              YL AIKTDV+ Q +FI FLIKEV +A F  I +V  FVKWLD ELS LVDERAVLKHF 
Sbjct:   407 VYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLVDERAVLKHF- 465

Query:   240 QWPERKADTLREAACNYRDLKNLEQEVSSFEDNQKESLPQATRKMQALQDR--RACWS-- 295
             +WPE+KAD LREAA  Y DLK L  E S F ++ ++S   A +KMQAL ++     +S  
Sbjct:   466 EWPEQKADALREAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALFEKLEHGVYSLS 525

Query:   296 ---KGTGKKYRDFQIPCDWMMDSGLIGQMKVSSLRLAKEYMKRFAGGFDA 342
                +    K++ FQIP DWM+++G+  Q+K++S++LA +YMKR +   +A
Sbjct:   526 RMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELEA 575

 Score = 86 (35.3 bits), Expect = 1.3e-57, Sum P(2) = 1.3e-57
 Identities = 19/40 (47%), Positives = 27/40 (67%)

Query:   320 QMKVSSLRLAKEYMKRFAGGFDAETIQAFEELKKVGLSSH 359
             ++ V  +R A   + +FAGGFDAET++AFEEL+    S H
Sbjct:   585 ELIVQGVRFAFR-VHQFAGGFDAETMKAFEELRDKARSCH 623


>TAIR|locus:2102385 [details] [associations]
            symbol:CHUP1 "CHLOROPLAST UNUSUAL POSITIONING 1"
            species:3702 "Arabidopsis thaliana" [GO:0005634 "nucleus"
            evidence=ISM] [GO:0009507 "chloroplast" evidence=IDA] [GO:0009707
            "chloroplast outer membrane" evidence=IDA] [GO:0009902 "chloroplast
            relocation" evidence=RCA;IMP] [GO:0006364 "rRNA processing"
            evidence=RCA] [GO:0010027 "thylakoid membrane organization"
            evidence=RCA] [GO:0010207 "photosystem II assembly" evidence=RCA]
            [GO:0019684 "photosynthesis, light reaction" evidence=RCA]
            [GO:0034660 "ncRNA metabolic process" evidence=RCA] [GO:0035304
            "regulation of protein dephosphorylation" evidence=RCA] [GO:0042793
            "transcription from plastid promoter" evidence=RCA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=RCA]
            EMBL:CP002686 GenomeReviews:BA000014_GR EMBL:AP001313 GO:GO:0009707
            EMBL:AB087408 IPI:IPI00541711 RefSeq:NP_001189974.1
            RefSeq:NP_189197.2 UniGene:At.27741 ProteinModelPortal:Q9LI74
            STRING:Q9LI74 PaxDb:Q9LI74 PRIDE:Q9LI74 EnsemblPlants:AT3G25690.1
            EnsemblPlants:AT3G25690.2 GeneID:822157 KEGG:ath:AT3G25690
            TAIR:At3g25690 eggNOG:NOG310144 HOGENOM:HOG000242560
            InParanoid:Q9LI74 OMA:ELRNYQT PhylomeDB:Q9LI74
            ProtClustDB:CLSN2680868 Genevestigator:Q9LI74 GO:GO:0009902
            Uniprot:Q9LI74
        Length = 1004

 Score = 465 (168.7 bits), Expect = 6.8e-51, Sum P(3) = 6.8e-51
 Identities = 101/224 (45%), Positives = 146/224 (65%)

Query:   129 GSKTVRRVPEVVELYRSLTRKDAHMENRSNTTAAPV---IAFTRNMIGEIENRSTYLSAI 185
             G   V R PE+VE Y+SL ++++  E   +  ++      A   NMIGEIENRST+L A+
Sbjct:   714 GGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAV 773

Query:   186 KTDVKKQKEFINFLIKEVESAVFDQISEVEAFVKWLDGELSSLVDERAVLKHFPQWPERK 245
             K DV+ Q +F+  L  EV ++ F  I ++ AFV WLD ELS LVDERAVLKHF  WPE K
Sbjct:   774 KADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHF-DWPEGK 832

Query:   246 ADTLREAACNYRDLKNLEQEVSSFEDNQKESLPQATRKMQALQDR--RACWSKGTGK--- 300
             AD LREAA  Y+DL  LE++V+SF D+   S   A +KM  L ++  ++ ++    +   
Sbjct:   833 ADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDMA 892

Query:   301 --KYRDFQIPCDWMMDSGLIGQMKVSSLRLAKEYMKRFAGGFDA 342
               +Y++F IP DW+ D+G++G++K+SS++LAK+YMKR A   D+
Sbjct:   893 ISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDS 936

 Score = 73 (30.8 bits), Expect = 6.8e-51, Sum P(3) = 6.8e-51
 Identities = 13/18 (72%), Positives = 18/18 (100%)

Query:   335 RFAGGFDAETIQAFEELK 352
             +FAGGFDAE+++AFEEL+
Sbjct:   962 QFAGGFDAESMKAFEELR 979

 Score = 49 (22.3 bits), Expect = 6.8e-51, Sum P(3) = 6.8e-51
 Identities = 20/84 (23%), Positives = 39/84 (46%)

Query:     5 DDDSRIDSFQKERDARIALLEKENFELRQEVLRLKAQISSLKAHDNERKSML--WKKLQN 62
             DDD+ ++  +KER   + +   +      E+ RLK  +  L+  + + +  L  +  L+ 
Sbjct:   106 DDDNNLEKAEKERKYEVEMAYNDG-----ELERLKQLVKELEEREVKLEGELLEYYGLKE 160

Query:    63 PNTD-TSPQKQTDFVKTQEFQNLD 85
               +D    Q+Q   +KT E   L+
Sbjct:   161 QESDIVELQRQLK-IKTVEIDMLN 183


>TAIR|locus:2007755 [details] [associations]
            symbol:AT1G48280 "AT1G48280" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM] EMBL:CP002684 EMBL:AC007932
            IPI:IPI00530542 PIR:G96522 RefSeq:NP_564524.1 UniGene:At.26293
            PRIDE:Q9SX62 EnsemblPlants:AT1G48280.1 GeneID:841248
            KEGG:ath:AT1G48280 TAIR:At1g48280 HOGENOM:HOG000153560
            InParanoid:Q9SX62 OMA:GEIQNRS PhylomeDB:Q9SX62
            ProtClustDB:CLSN2721717 Genevestigator:Q9SX62 Uniprot:Q9SX62
        Length = 558

 Score = 437 (158.9 bits), Expect = 1.9e-50, Sum P(3) = 1.9e-50
 Identities = 95/220 (43%), Positives = 144/220 (65%)

Query:   127 LAGSKTVRRVPEVVELYRSLTRKD--AHMENRSNTTAAPVIAFTRNMIGEIENRSTYLSA 184
             LA +   ++ P V +L++ L ++D   ++    N   + V +   +++GEI+NRS +L A
Sbjct:   273 LAKAARAQKSPPVSQLFQLLNKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIA 332

Query:   185 IKTDVKKQKEFINFLIKEVESAVFDQISEVEAFVKWLDGELSSLVDERAVLKHFPQWPER 244
             IK D++ + EFIN LI++V +  F  + +V  FV WLD EL++L DERAVLKHF +WPE+
Sbjct:   333 IKADIETKGEFINDLIQKVLTTCFSDMEDVMKFVDWLDKELATLADERAVLKHF-KWPEK 391

Query:   245 KADTLREAACNYRDLKNLEQEVSSFEDNQKESLPQATRKMQALQD------RRACWSKGT 298
             KADTL+EAA  YR+LK LE+E+SS+ D+       A +KM  L D      RR    +G+
Sbjct:   392 KADTLQEAAVEYRELKKLEKELSSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGS 451

Query:   299 G-KKYRDFQIPCDWMMDSGLIGQMKVSSLRLAKEYMKRFA 337
               + Y+DF+IP +WM+DSG+I ++K +S++LAK YM R A
Sbjct:   452 SMRSYQDFKIPVEWMLDSGMICKIKRASIKLAKTYMNRVA 491

 Score = 60 (26.2 bits), Expect = 1.9e-50, Sum P(3) = 1.9e-50
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query:   335 RFAGGFDAETIQAFEELKKVGLSSH 359
             +FAGG D ET+ A EE+K+  + SH
Sbjct:   522 QFAGGLDPETLCALEEIKQ-RVPSH 545

 Score = 56 (24.8 bits), Expect = 1.9e-50, Sum P(3) = 1.9e-50
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query:    14 QKERDARIALLEKENFELRQEVLRLKAQISSLKAHDNERK 53
             ++ R++ + L E  N +L Q+++  +A+ISSL ++D   K
Sbjct:   147 EEARNSNVEL-ELNNRKLSQDLVSAEAKISSLSSNDKPAK 185


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.129   0.364    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      360       347   0.00098  116 3  11 22  0.42    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  612 (65 KB)
  Total size of DFA:  219 KB (2120 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  34.83u 0.13s 34.96t   Elapsed:  00:00:02
  Total cpu time:  34.83u 0.13s 34.96t   Elapsed:  00:00:02
  Start:  Thu May  9 21:03:49 2013   End:  Thu May  9 21:03:51 2013

Back to top