BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004618
         (741 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q6R0H1|LHY_ARATH Protein LHY OS=Arabidopsis thaliana GN=LHY PE=1 SV=2
          Length = 645

 Score =  347 bits (889), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 322/744 (43%), Positives = 405/744 (54%), Gaps = 156/744 (20%)

Query: 13  LSRLFFLCFTWFRAE-HIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPR 71
           L  L      W R E HIGTK AVQIRSHAQKFF+KLEKEA  KG+P+ QA+DI+IPPPR
Sbjct: 36  LEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFFTKLEKEAEVKGIPVCQALDIEIPPPR 95

Query: 72  PKRKPRNPYPRKTCTNAPMS-QIG-AKDGKLRSSVSSLRCNQV-LDLEKEPICDRPNGDE 128
           PKRKP  PYPRK   N   S Q+  AKD KL SS SS + NQ  LDLEK P        E
Sbjct: 96  PKRKPNTPYPRKPGNNGTSSSQVSSAKDAKLVSSASSSQLNQAFLDLEKMPFS------E 149

Query: 129 KPTYTIESQKDNCSEVFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVN 188
           K +   E+Q +NCS             VS+VNK  +PT                 K+V  
Sbjct: 150 KTSTGKENQDENCS------------GVSTVNKYPLPT-----------------KQVSG 180

Query: 189 QDETRESYVTVELKANEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVAD 248
             ET                             SK   ++NA     + V  + +D   D
Sbjct: 181 DIET-----------------------------SKTSTVDNAV----QDVPKKNKD--KD 205

Query: 249 ALPTAEVQATQNYPRHVNVHILDGSLGTGTQS-PSDMPMQESIFHPIGE-VHGNPNLFTN 306
                 V + QNYP H +  I++G++    Q+ PS M  Q+ +FHP+ E  HG+ NL   
Sbjct: 206 GNDGTTVHSMQNYPWHFHADIVNGNIAKCPQNHPSGMVSQDFMFHPMREETHGHANLQAT 265

Query: 307 PAASATTENESNVPKSTHQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNP 366
            A++ TT        ++HQ+FP  H         QDDYRSFL ISS FS+LI+STLLQNP
Sbjct: 266 TASATTT--------ASHQAFPACH--------SQDDYRSFLQISSTFSNLIMSTLLQNP 309

Query: 367 AAHAAASFAATFWPYTNMETSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAA 426
           AAHAAA+FAA+ WPY ++  S DS         +   SSPPS+ AIAAATVAAATAWWA+
Sbjct: 310 AAHAAATFAASVWPYASVGNSGDS--------STPMSSSPPSITAIAAATVAAATAWWAS 361

Query: 427 HGLLPLCAPFHAPFICPPACATAVSSMDTAQVPA-AKTDKNDNTPPNPALKDHQPDLEYS 485
           HGLLP+CAP  AP  C P    AV +      PA  + D  +NT         QP  + +
Sbjct: 362 HGLLPVCAP--APITCVPFSTVAVPT------PAMTEMDTVENT---------QPFEKQN 404

Query: 486 EALQAQNSASKSLTVLTSDSEESASAKLKMELKATDH--EKNSEENELHDSNKTKNRKQV 543
            ALQ QN ASKS    + DS+E+   KL  + K  D   E+      +HDSN  + +  V
Sbjct: 405 TALQDQNLASKSPASSSDDSDETGVTKLNADSKTNDDKIEEVVVTAAVHDSNTAQKKNLV 464

Query: 544 DRSSCGSNTPSSSEVETDALEKQEKGKEESKEIDPSLP-ATDPGNRRSRSSSN------I 596
           DRSSCGSNTPS S+ ETDAL+K EK KE+ KE D + P   +  NR+ +   N       
Sbjct: 465 DRSSCGSNTPSGSDAETDALDKMEKDKEDVKETDENQPDVIELNNRKIKMRDNNSNNNAT 524

Query: 597 TDSWKEVSEVGRLAFQALFSREVLPQSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSL-F 655
           TDSWKEVSE GR+AFQALF+RE LPQSFSPP            +  +N N+K  D S+  
Sbjct: 525 TDSWKEVSEEGRIAFQALFARERLPQSFSPP------------QVAENVNRKQSDTSMPL 572

Query: 656 DLNSKTWGSCFGHQEVEKSTVSGVENNGGEGLLTIGLGHGK-LKARRTGFKPYKRCSVEA 714
             N K+  SC   Q               EG++ IG+G  K LK R+TGFKPYKRCS+E 
Sbjct: 573 APNFKSQDSCAADQ---------------EGVVMIGVGTCKSLKTRQTGFKPYKRCSMEV 617

Query: 715 KENRILNTGNQAEEKCPKRIRVEG 738
           KE+++ N  NQ++EK  KR+R+EG
Sbjct: 618 KESQVGNINNQSDEKVCKRLRLEG 641


>sp|P92973|CCA1_ARATH Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1
          Length = 608

 Score =  256 bits (654), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 270/731 (36%), Positives = 358/731 (48%), Gaps = 198/731 (27%)

Query: 27  EHIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRKTCT 86
           EH+ TK AVQIRSHAQKFFSK+EKEA +KGV +GQA+DI IPPPRPKRKP NPYPRKT +
Sbjct: 51  EHVATKTAVQIRSHAQKFFSKVEKEAEAKGVAMGQALDIAIPPPRPKRKPNNPYPRKTGS 110

Query: 87  NAP-MSQIGAKDGKLRSSVSSLRCNQVLDLEKEPICDRPNGDEKPTYTIES--QKDNCSE 143
               MS+ G  DGK           + L  EK    +  N D + +   E   Q+DNCS+
Sbjct: 111 GTILMSKTGVNDGK-----------ESLGSEKVSHPEMANEDRQQSKPEEKTLQEDNCSD 159

Query: 144 VFILHQEAHCSSVSSVNKNSMPTPVGLRDSCNLREFVPSLKEVVNQDETRESYVTVELKA 203
            F  HQ  + S+ SS+NK+ + T     ++   REF+PS +E    +  R+         
Sbjct: 160 CFT-HQ--YLSAASSMNKSCIET----SNASTFREFLPSREEGSQNNRVRKE-------- 204

Query: 204 NEKFGKPDAKLALQDNGSSKPLNLENACPSHEKSVHGEKRDDVADALPTAEVQATQNYPR 263
                            S+  LN                    A +L     Q  Q YP 
Sbjct: 205 -----------------SNSDLN--------------------AKSLENGNEQGPQTYPM 227

Query: 264 HVNVHILDGSLGTGTQSPSDMPMQESIFHPIGEVHGNPNLFTNPAASATTENESNVPKST 323
           H+ V +  GS            +  S+ HP  E   +P+                     
Sbjct: 228 HIPVLVPLGS-----------SITSSLSHPPSEPDSHPHTVAG----------------D 260

Query: 324 HQSFPTFHPPFTQFRHDQDDYRSFLHISSAFSSLIVSTLLQNPAAHAAASFAATFWPYTN 383
           +QSFP                             I+STLLQ PA + AA+FA++FWP   
Sbjct: 261 YQSFPNH---------------------------IMSTLLQTPALYTAATFASSFWP--- 290

Query: 384 METSADSPTCPQGGFLSRQMSSPPSMAAIAAATVAAATAWWAAHGLLPLCAPF------- 436
            ++S  SP            +SPP++AA+AAATVAAA+AWWAA+GLLPLCAP        
Sbjct: 291 PDSSGGSPV---------PGNSPPNLAAMAAATVAAASAWWAANGLLPLCAPLSSGGFTS 341

Query: 437 HAPFICPPACATAVSSMDTAQVPAAKTDKNDNTPPNPALKDHQPDLEYSEALQAQNSASK 496
           H P    P+C    +   T Q  + ++ +                 E+SEA +A++S   
Sbjct: 342 HPPSTFGPSCDVEYTKASTLQHGSVQSREQ----------------EHSEASKARSSL-- 383

Query: 497 SLTVLTSDSEESASAKLKMELKATDHEKNSE--ENELHDSNKTKNRKQVDRSSCGSNTPS 554
                  DSE+     ++ + K   HE+ S   E++   S+   +RKQVDRSSCGSNTPS
Sbjct: 384 -------DSED-----VENKSKPVCHEQPSATPESDAKGSDGAGDRKQVDRSSCGSNTPS 431

Query: 555 SSE-VETDALEKQEKGKE-ESKEI--DPSLPATDPGN-RRSRSSSNITDSWKEVSEVGRL 609
           SS+ VE DA E+QE G   E KE   D + P T   N RRSR SSNITD WK VS+ GR+
Sbjct: 432 SSDDVEADASERQEDGTNGEVKETNEDTNKPQTSESNARRSRISSNITDPWKSVSDEGRI 491

Query: 610 AFQALFSREVLPQSFSPPHDLKDKMQQDNVEDKQNGNKKDGDKSLFDLNSKTWGSCFGHQ 669
           AFQALFSREVLPQSF+          ++   +++   ++       DLN     +    Q
Sbjct: 492 AFQALFSREVLPQSFT---------YREEHREEEQQQQEQRYPMALDLNFTAQLTPVDDQ 542

Query: 670 EVEKSTVSGVENNGGEGLLTIGLGHGKLKAR-RTGFKPYKRCSVEAKENRILNTGN--QA 726
           E +++T          G L IGL   KL +R RTGFKPYKRCS+EAKE+RILN       
Sbjct: 543 EEKRNT----------GFLGIGLDASKLMSRGRTGFKPYKRCSMEAKESRILNNNPIIHV 592

Query: 727 EEKCPKRIRVE 737
           E+K PKR+R+E
Sbjct: 593 EQKDPKRMRLE 603


>sp|Q54IF9|MYBG_DICDI Myb-like protein G OS=Dictyostelium discoideum GN=mybG PE=3 SV=1
          Length = 423

 Score = 55.8 bits (133), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/62 (45%), Positives = 37/62 (59%), Gaps = 11/62 (17%)

Query: 23  WFRAE-HIGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYP 81
           W + E  +G+K  +QIRSHAQK+F K++K    +           IPPPRPKRK   PYP
Sbjct: 63  WKKIESFVGSKTVIQIRSHAQKYFIKVQKNNTGE----------RIPPPRPKRKSIQPYP 112

Query: 82  RK 83
           +K
Sbjct: 113 QK 114


>sp|Q6R0H0|ASG4_ARATH Transcription factor ASG4 OS=Arabidopsis thaliana GN=ASG4 PE=2 SV=1
          Length = 287

 Score = 54.3 bits (129), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 10/55 (18%)

Query: 29  IGTKKAVQIRSHAQKFFSKLEKEALSKGVPIGQAIDIDIPPPRPKRKPRNPYPRK 83
           +G+K  +QIRSHAQK+F K++K    +           +PPPRPKRK  +PYP+K
Sbjct: 90  VGSKTVIQIRSHAQKYFLKVQKNGTKE----------HLPPPRPKRKANHPYPQK 134


>sp|Q54Z40|MYBH_DICDI Myb-like protein H OS=Dictyostelium discoideum GN=mybH PE=3 SV=1
          Length = 1217

 Score = 38.5 bits (88), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 6/45 (13%)

Query: 15  RLFFLCFTWFRA------EHIGTKKAVQIRSHAQKFFSKLEKEAL 53
           RLF   +  +        EH+ TK  +Q+RSHAQKF  KLEK  +
Sbjct: 159 RLFVEAYKLYDKDNKKIQEHVKTKTILQVRSHAQKFALKLEKNGI 203


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.311    0.127    0.371 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 282,030,187
Number of Sequences: 539616
Number of extensions: 12259548
Number of successful extensions: 37539
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 64
Number of HSP's successfully gapped in prelim test: 492
Number of HSP's that attempted gapping in prelim test: 35454
Number of HSP's gapped (non-prelim): 1782
length of query: 741
length of database: 191,569,459
effective HSP length: 125
effective length of query: 616
effective length of database: 124,117,459
effective search space: 76456354744
effective search space used: 76456354744
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 65 (29.6 bits)