BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 046709
         (109 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255538972|ref|XP_002510551.1| conserved hypothetical protein [Ricinus communis]
 gi|223551252|gb|EEF52738.1| conserved hypothetical protein [Ricinus communis]
          Length = 111

 Score =  185 bits (470), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 92/111 (82%), Positives = 100/111 (90%), Gaps = 2/111 (1%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MA+SDAVIGNLTTIYVAVIAGIKAYGLV GRSFS   VLILST++VG++LIG+LTWDVS+
Sbjct: 1   MAVSDAVIGNLTTIYVAVIAGIKAYGLVCGRSFSGGFVLILSTSVVGLILIGSLTWDVSR 60

Query: 61  KAVYAISRDHHVHNHEMCKGGICWHGVAVRSPASQVRFRLPQ--QVPYGAL 109
           KA YAISRDHH+H HEMCKGGICWHGVAVRSPASQVRFRLPQ  Q  YG L
Sbjct: 61  KATYAISRDHHIHVHEMCKGGICWHGVAVRSPASQVRFRLPQHHQPHYGPL 111


>gi|224083071|ref|XP_002306941.1| predicted protein [Populus trichocarpa]
 gi|222856390|gb|EEE93937.1| predicted protein [Populus trichocarpa]
          Length = 102

 Score =  171 bits (433), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 85/103 (82%), Positives = 91/103 (88%), Gaps = 1/103 (0%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MALSDAVIGNLTTIYVAVIAGIK YGLV GRSFS   VLILST +VG++L GTL WD+S+
Sbjct: 1   MALSDAVIGNLTTIYVAVIAGIKVYGLVCGRSFSGGFVLILSTIVVGLILTGTLAWDISR 60

Query: 61  KAVYAISRDHHVHNHEMCKGGICWHGVAVRSPASQVRFRLPQQ 103
           KA YAISRD HV+ HEMCKGGICWHGVAVRSPASQVRFRLPQ 
Sbjct: 61  KATYAISRD-HVNVHEMCKGGICWHGVAVRSPASQVRFRLPQH 102


>gi|18410939|ref|NP_565118.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6721099|gb|AAF26753.1|AC007396_2 T4O12.4 [Arabidopsis thaliana]
 gi|37202064|gb|AAQ89647.1| At1g75810 [Arabidopsis thaliana]
 gi|51971341|dbj|BAD44335.1| hypothetical protein [Arabidopsis thaliana]
 gi|332197640|gb|AEE35761.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 125

 Score =  155 bits (393), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 79/122 (64%), Positives = 93/122 (76%), Gaps = 13/122 (10%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MALSDAV+GNL TIYVAVI  IKAYGL+TGRSFS+  V+++S T VGV+L  TL WDVS+
Sbjct: 4   MALSDAVLGNLATIYVAVIIAIKAYGLITGRSFSAGFVVVVSITAVGVLLAVTLAWDVSR 63

Query: 61  KAVYAISR----------DHHVHNHE---MCKGGICWHGVAVRSPASQVRFRLPQQVPYG 107
           +A  A+SR           HH H+H+   +CKGGICWHGVAVRSPASQVRFRLPQ +PYG
Sbjct: 64  RAAEAVSRYNRVGGEEDLSHHRHHHDGGAICKGGICWHGVAVRSPASQVRFRLPQHIPYG 123

Query: 108 AL 109
           A 
Sbjct: 124 AF 125


>gi|297839461|ref|XP_002887612.1| hypothetical protein ARALYDRAFT_476728 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333453|gb|EFH63871.1| hypothetical protein ARALYDRAFT_476728 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 122

 Score =  155 bits (392), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 79/122 (64%), Positives = 93/122 (76%), Gaps = 13/122 (10%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MALSDAV+GNL TIYVAVI  IKAYGL+TGRSFS+  V+++S T VGV+L  TL WDVS+
Sbjct: 1   MALSDAVLGNLATIYVAVIIAIKAYGLITGRSFSAGFVVVVSITAVGVLLAVTLAWDVSR 60

Query: 61  KAVYAISR----------DHHVHNHE---MCKGGICWHGVAVRSPASQVRFRLPQQVPYG 107
           +A  A+SR           HH H+H+   +CKGGICWHGVAVRSPASQVRFRLPQ +PYG
Sbjct: 61  RAADAVSRYNRVGGEEDLSHHRHHHDGGAICKGGICWHGVAVRSPASQVRFRLPQHIPYG 120

Query: 108 AL 109
           A 
Sbjct: 121 AF 122


>gi|225457935|ref|XP_002273690.1| PREDICTED: uncharacterized protein LOC100245337 [Vitis vinifera]
          Length = 106

 Score =  154 bits (390), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 75/105 (71%), Positives = 85/105 (80%), Gaps = 3/105 (2%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MA+SDAV+ NLTTIY+  IA IKAYGL TGRSFS   VL+ ST  V ++LIGTL WDVS+
Sbjct: 1   MAISDAVVANLTTIYLVAIAAIKAYGLATGRSFSGGFVLVTSTAAVVLILIGTLAWDVSR 60

Query: 61  KAVYAISRDHHVHNHEMCKGGICWHGVAVRSPASQVRFRLPQQVP 105
           KA YA+SRD   H HEMC+GGICWHGVAV+S ASQVRFRLPQQ P
Sbjct: 61  KATYALSRD---HVHEMCRGGICWHGVAVKSSASQVRFRLPQQPP 102


>gi|356553369|ref|XP_003545029.1| PREDICTED: uncharacterized protein LOC100779434 [Glycine max]
          Length = 113

 Score =  151 bits (381), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 74/113 (65%), Positives = 86/113 (76%), Gaps = 4/113 (3%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MA+SDAV+ NLT +YVAVIA IKAYG + GRSFS   VL++ST  V ++L+ TLTWDVS+
Sbjct: 1   MAVSDAVMSNLTVLYVAVIACIKAYGFLCGRSFSGWFVLVVSTAAVALILVATLTWDVSR 60

Query: 61  KAVYAISRDH----HVHNHEMCKGGICWHGVAVRSPASQVRFRLPQQVPYGAL 109
           KA YA + D       H HE CKGGICWHGVAVRSPASQVRFRLPQ +PY  L
Sbjct: 61  KATYAFATDQPPPPPHHTHEACKGGICWHGVAVRSPASQVRFRLPQNLPYTTL 113


>gi|356562415|ref|XP_003549467.1| PREDICTED: uncharacterized protein LOC100787853 [Glycine max]
          Length = 117

 Score =  146 bits (369), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 71/117 (60%), Positives = 83/117 (70%), Gaps = 8/117 (6%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MA+SDAV+ NLT +YVA+I  IKAYG + GRSFS   VL+ ST  V ++L+ TLTWDVS+
Sbjct: 1   MAVSDAVVSNLTLLYVALILSIKAYGFLCGRSFSGCFVLVASTAAVALILVATLTWDVSR 60

Query: 61  KAVYAISRDHHV--------HNHEMCKGGICWHGVAVRSPASQVRFRLPQQVPYGAL 109
           KA YA +             H HE CKGGICWHGVAVRSPASQVRFRLPQ +PY  L
Sbjct: 61  KATYAFAAVQPPPPPPPPPHHTHETCKGGICWHGVAVRSPASQVRFRLPQNLPYTTL 117


>gi|356517879|ref|XP_003527613.1| PREDICTED: uncharacterized protein LOC100814019 [Glycine max]
          Length = 109

 Score =  128 bits (322), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 61/109 (55%), Positives = 76/109 (69%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           MA+SDAV  N +TI+V ++A  +AYG  +GR FS   V+I+ST  V + L+ TLTWDVS+
Sbjct: 1   MAISDAVARNFSTIFVGLMACFEAYGFASGRRFSGGYVVIVSTAAVILFLVATLTWDVSR 60

Query: 61  KAVYAISRDHHVHNHEMCKGGICWHGVAVRSPASQVRFRLPQQVPYGAL 109
           KA  A  RDH     ++CKGGICWHGVA RSPASQVR RLP  +P   L
Sbjct: 61  KATCAFHRDHADSQEDVCKGGICWHGVAPRSPASQVRVRLPSHLPLAPL 109


>gi|357438469|ref|XP_003589510.1| hypothetical protein MTR_1g025490 [Medicago truncatula]
 gi|355478558|gb|AES59761.1| hypothetical protein MTR_1g025490 [Medicago truncatula]
          Length = 112

 Score =  109 bits (272), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 55/111 (49%), Positives = 73/111 (65%), Gaps = 5/111 (4%)

Query: 1   MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
           M + D V  NLT IYVAV + +KAY ++ GR FS   V+ +STT+V ++L+ TL WD+S+
Sbjct: 1   MGVCDVVASNLTIIYVAVTSFVKAYEVLYGRRFSGVFVVTVSTTLVALILVATLMWDLSR 60

Query: 61  KAVYAISRDHHV----HN-HEMCKGGICWHGVAVRSPASQVRFRLPQQVPY 106
           K    +  D H     HN    CKGGICWHGVA RSPASQ+ F+LP  +P+
Sbjct: 61  KISKCVFVDQHPRTSQHNTMSYCKGGICWHGVADRSPASQLHFKLPLHLPH 111


>gi|224065799|ref|XP_002301964.1| predicted protein [Populus trichocarpa]
 gi|222843690|gb|EEE81237.1| predicted protein [Populus trichocarpa]
          Length = 81

 Score =  102 bits (254), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 53/69 (76%), Positives = 62/69 (89%), Gaps = 1/69 (1%)

Query: 1  MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
          MALSDAVIGNLTTIYVAVIAGIKAYGLV GRSFS   VLI+ST +VG++LIGTLTWD+S+
Sbjct: 1  MALSDAVIGNLTTIYVAVIAGIKAYGLVCGRSFSGVFVLIVSTAVVGLILIGTLTWDISR 60

Query: 61 KAV-YAISR 68
          KA+ Y ++R
Sbjct: 61 KAMRYLLAR 69


>gi|4105698|gb|AAD02500.1| small hydrophobic protein [Arabidopsis thaliana]
          Length = 141

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 43/62 (69%), Positives = 52/62 (83%)

Query: 1  MALSDAVIGNLTTIYVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQ 60
          MALSDAV+GNL TIYVAVI  IKAYGL+TGRSFS+  V+++S T VGV+L  TL WDVS+
Sbjct: 4  MALSDAVLGNLATIYVAVIIAIKAYGLITGRSFSAGFVVVVSITAVGVLLAVTLAWDVSR 63

Query: 61 KA 62
          +A
Sbjct: 64 RA 65


>gi|240273878|gb|EER37397.1| predicted protein [Ajellomyces capsulatus H143]
 gi|325094688|gb|EGC47998.1| predicted protein [Ajellomyces capsulatus H88]
          Length = 112

 Score = 35.0 bits (79), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 20/81 (24%), Positives = 33/81 (40%)

Query: 15  YVAVIAGIKAYGLVTGRSFSSAIVLILSTTIVGVVLIGTLTWDVSQKAVYAISRDHHVHN 74
           Y     GI   G+   R     ++ +L      V+L+ TL  D+ +           VH+
Sbjct: 27  YTTTKVGIMKLGVPLDRRARDGMLSLLEAAASLVILLETLNSDIFETTTILKDGKRRVHH 86

Query: 75  HEMCKGGICWHGVAVRSPASQ 95
             +  G  C+ G A R P+S+
Sbjct: 87  FSLVDGIYCFQGPAARVPSSR 107


>gi|258541615|ref|YP_003187048.1| hypothetical protein APA01_05180 [Acetobacter pasteurianus IFO
           3283-01]
 gi|384041536|ref|YP_005480280.1| hypothetical protein APA12_05180 [Acetobacter pasteurianus IFO
           3283-12]
 gi|384050051|ref|YP_005477114.1| hypothetical protein APA03_05180 [Acetobacter pasteurianus IFO
           3283-03]
 gi|384053161|ref|YP_005486255.1| hypothetical protein APA07_05180 [Acetobacter pasteurianus IFO
           3283-07]
 gi|384056393|ref|YP_005489060.1| hypothetical protein APA22_05180 [Acetobacter pasteurianus IFO
           3283-22]
 gi|384059034|ref|YP_005498162.1| hypothetical protein APA26_05180 [Acetobacter pasteurianus IFO
           3283-26]
 gi|384062328|ref|YP_005482970.1| hypothetical protein APA32_05180 [Acetobacter pasteurianus IFO
           3283-32]
 gi|384118404|ref|YP_005501028.1| hypothetical protein APA42C_05180 [Acetobacter pasteurianus IFO
           3283-01-42C]
 gi|256632693|dbj|BAH98668.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01]
 gi|256635750|dbj|BAI01719.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-03]
 gi|256638805|dbj|BAI04767.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-07]
 gi|256641859|dbj|BAI07814.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-22]
 gi|256644914|dbj|BAI10862.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-26]
 gi|256647969|dbj|BAI13910.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-32]
 gi|256651022|dbj|BAI16956.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01-42C]
 gi|256654013|dbj|BAI19940.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-12]
          Length = 180

 Score = 34.7 bits (78), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 44  TIVGVVLIGTLTWDVSQKAVYAISRDHHVHNHE 76
           T+ G+  +G L     Q+ V A++R+HH H HE
Sbjct: 77  TVAGLACVGQLNLPTCQRMVAALAREHHHHQHE 109


>gi|421852242|ref|ZP_16284932.1| hypothetical protein APS_0737 [Acetobacter pasteurianus subsp.
           pasteurianus LMG 1262 = NBRC 106471]
 gi|371479609|dbj|GAB30135.1| hypothetical protein APS_0737 [Acetobacter pasteurianus subsp.
           pasteurianus LMG 1262 = NBRC 106471]
          Length = 176

 Score = 34.7 bits (78), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 40  ILSTTIVGVVLIGTLTWDVSQKAVYAISRDHHVHNHE 76
           +   T+ G+  +G L     Q+ V A++R+HH H HE
Sbjct: 69  VWRGTVAGLACVGQLNPPTCQRMVAALAREHHHHQHE 105


>gi|421850324|ref|ZP_16283286.1| hypothetical protein APT_1979 [Acetobacter pasteurianus NBRC
           101655]
 gi|371458835|dbj|GAB28489.1| hypothetical protein APT_1979 [Acetobacter pasteurianus NBRC
           101655]
          Length = 180

 Score = 34.7 bits (78), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 44  TIVGVVLIGTLTWDVSQKAVYAISRDHHVHNHE 76
           T+ G+  +G L     Q+ V A++R+HH H HE
Sbjct: 77  TVAGLACVGQLNPPTCQRMVAALAREHHHHQHE 109


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.325    0.137    0.416 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,555,468,887
Number of Sequences: 23463169
Number of extensions: 52625636
Number of successful extensions: 208478
Number of sequences better than 100.0: 18
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 208457
Number of HSP's gapped (non-prelim): 18
length of query: 109
length of database: 8,064,228,071
effective HSP length: 77
effective length of query: 32
effective length of database: 6,257,564,058
effective search space: 200242049856
effective search space used: 200242049856
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 69 (31.2 bits)