BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030213
         (181 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224137270|ref|XP_002327084.1| predicted protein [Populus trichocarpa]
 gi|222835399|gb|EEE73834.1| predicted protein [Populus trichocarpa]
          Length = 322

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 85/182 (46%), Positives = 113/182 (62%), Gaps = 7/182 (3%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
           ME+QSVC G+  PK + S    LS++ +S  AS  G R   I   N  K      P  K+
Sbjct: 1   MEMQSVCTGITAPKLRLSTNRLLSNEIIS--ASHVGLRCPRILLKNPSKLCVK--PGFKN 56

Query: 63  KIEGYSAWSSKRGAILCASNSGSDTK---LGFPSGENSSLPVAIFNGPEPFRGKSGSVSF 119
             +G S  S  RG ++C S+S +      L     ++S +P +  NG EPF GKSGSVSF
Sbjct: 57  IFKGISWSSLNRGVVICGSSSSNSNADADLKISRRKSSGVPASSSNGVEPFNGKSGSVSF 116

Query: 120 CGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMG 179
            GLTHQ VEEG+L+SAPF E+KGSF+W+L PV L+ SLI+PQ FLG+ IE F+KD +L+ 
Sbjct: 117 HGLTHQSVEEGRLVSAPFNEEKGSFVWLLGPVALVLSLIVPQFFLGSAIEAFLKDEVLVE 176

Query: 180 IS 181
           I+
Sbjct: 177 IA 178


>gi|225458432|ref|XP_002283812.1| PREDICTED: uncharacterized protein LOC100245429 [Vitis vinifera]
 gi|302142414|emb|CBI19617.3| unnamed protein product [Vitis vinifera]
          Length = 355

 Score =  137 bits (346), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 86/178 (48%), Positives = 107/178 (60%), Gaps = 7/178 (3%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
           ME+QS+   L  P  +      LSH+ VS    ++  + Q  RF    K   +  P LK 
Sbjct: 1   MELQSISCWLPSPDMRLCIHRPLSHRIVSTGHVRW--KDQRFRFEKVLK--SSLMPGLKK 56

Query: 63  KIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCGL 122
           +    S  S  RGA LC+ NS SD KL +   E +S+    F+G EPFRGKSGSVSF GL
Sbjct: 57  EC---SLSSFGRGATLCSLNSNSDAKLKYSGREITSVSGNGFDGVEPFRGKSGSVSFHGL 113

Query: 123 THQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
           THQ VEE KL S+PF+E  GSFLW+LAPV LISSL+LPQ FL + IE  +KD +L  I
Sbjct: 114 THQRVEESKLASSPFKEGTGSFLWVLAPVALISSLVLPQFFLDSAIEAILKDEVLAEI 171


>gi|449476766|ref|XP_004154828.1| PREDICTED: uncharacterized LOC101207359 [Cucumis sativus]
          Length = 345

 Score =  129 bits (323), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 63/112 (56%), Positives = 79/112 (70%)

Query: 67  YSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQL 126
           ++    KR A +C ++S    +L     EN +L  +  NG EPF GK GSVSF GLTHQL
Sbjct: 48  WAVLKRKRWAFMCVADSNKSPQLELSGEENHALYASRLNGVEPFHGKCGSVSFHGLTHQL 107

Query: 127 VEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLM 178
           VEE KLMSAPF+E+KGS LW+LAPV  ISSLILPQ+FLG +IE F K+ +L+
Sbjct: 108 VEESKLMSAPFREEKGSILWVLAPVAFISSLILPQVFLGGLIEAFFKNRILV 159


>gi|449460131|ref|XP_004147799.1| PREDICTED: uncharacterized protein LOC101207359 [Cucumis sativus]
          Length = 345

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 62/112 (55%), Positives = 79/112 (70%)

Query: 67  YSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQL 126
           ++    KR A +C ++S    +L     EN ++  +  NG EPF GK GSVSF GLTHQL
Sbjct: 48  WAVLKRKRWAFMCVADSNKSPQLELSGEENHAMYASRLNGVEPFHGKCGSVSFHGLTHQL 107

Query: 127 VEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLM 178
           VEE KLMSAPF+E+KGS LW+LAPV  ISSLILPQ+FLG +IE F K+ +L+
Sbjct: 108 VEESKLMSAPFREEKGSILWVLAPVAFISSLILPQVFLGGLIEAFFKNRILV 159


>gi|356510304|ref|XP_003523879.1| PREDICTED: uncharacterized protein LOC100805065 [Glycine max]
          Length = 356

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/181 (44%), Positives = 106/181 (58%), Gaps = 12/181 (6%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPS---ASQFGSRSQSIRFGNAYKFLGNTFPN 59
           ME+Q++C  L      SSN   LS    SP+    + F ++ Q   FGNA K L      
Sbjct: 1   MELQTLCCALPSQPLHSSNHRILS----SPAFGIRAPFTNQRQRFHFGNASKCLKWVI-- 54

Query: 60  LKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSF 119
              K       S  +  ++ +S+S  +  L     ++SS+PV  FNG EPF GKSGSVSF
Sbjct: 55  ---KEPSLLGLSKFQRLLVHSSDSNLNGGLEVHPKQSSSVPVTTFNGAEPFHGKSGSVSF 111

Query: 120 CGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMG 179
           CGLTHQ VEEGKL S+PF+   GSFLW+LAP   I+SLILPQ F+ NV+E F+ + +L+ 
Sbjct: 112 CGLTHQSVEEGKLESSPFEGKGGSFLWVLAPAAFIASLILPQFFVDNVVEAFLNNVILID 171

Query: 180 I 180
           I
Sbjct: 172 I 172


>gi|255647675|gb|ACU24299.1| unknown [Glycine max]
          Length = 356

 Score =  124 bits (312), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 10/180 (5%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
           ME+Q++C  L      S    F +H+ +S  A  FG+ +      + ++F GN    LK 
Sbjct: 1   MELQTLCCAL-----PSQPLHFSNHRILSSPA--FGTHAPLTNQIHRFRF-GNPSKCLKW 52

Query: 63  KIEGYS--AWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFC 120
            I+G S    S  +  ++ AS+S  +  L     ++SS+PV  FNG EPF GKSGSVSFC
Sbjct: 53  VIKGTSLLGLSESKKLLVHASDSNLNGGLEVHPKQSSSVPVTTFNGVEPFHGKSGSVSFC 112

Query: 121 GLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
           GLTHQ VEEGKL S+PF    GSFLW+LAP   I+SL LPQ F+ NV+E F+ + +L+ I
Sbjct: 113 GLTHQSVEEGKLESSPFGVKGGSFLWVLAPAAFIASLTLPQFFVDNVVEAFLNNVMLIDI 172


>gi|356519084|ref|XP_003528204.1| PREDICTED: uncharacterized protein LOC100780682 [Glycine max]
          Length = 356

 Score =  122 bits (306), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 78/180 (43%), Positives = 107/180 (59%), Gaps = 10/180 (5%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
           ME+Q++C  L      S    F +H+ +S  A  FG+ +      + ++F GN    LK 
Sbjct: 1   MELQTLCCAL-----PSQPLHFSNHRILSSPA--FGTHAPLTNQIHRFRF-GNPSKCLKW 52

Query: 63  KIEGYS--AWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSVSFC 120
            I+  S    S  +  ++ AS+S  +  L     ++SS+PV  FNG EPF GKSGSVSFC
Sbjct: 53  VIKETSLLGLSESKKLLVHASDSNLNGGLEVHPKQSSSVPVTTFNGVEPFHGKSGSVSFC 112

Query: 121 GLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
           GLTHQ VEEGKL S+PF    GSFLW+LAP   I+SL LPQ F+ NV+E F+ + +L+ I
Sbjct: 113 GLTHQSVEEGKLESSPFGVKGGSFLWVLAPAAFIASLTLPQFFVDNVVEAFLNNVMLIDI 172


>gi|255538514|ref|XP_002510322.1| conserved hypothetical protein [Ricinus communis]
 gi|223551023|gb|EEF52509.1| conserved hypothetical protein [Ricinus communis]
          Length = 339

 Score =  122 bits (305), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 81/180 (45%), Positives = 104/180 (57%), Gaps = 27/180 (15%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRSQSIRFGNAYKFLGNTFPNLKH 62
           ME+QSVC GL  PK +      L+H+      S    ++ S++ G A K           
Sbjct: 1   MELQSVCHGLTAPKLR------LTHRL-----SSHQHQTVSLKIGIASK----------- 38

Query: 63  KIEGYSAWSSKRGAILCASNSGSDTK-LGFPSGENSSLPVAIFNGPEPFRGKSGSVSFCG 121
               Y + S+KR  I  +S++ S    L     E + +P A FN  EPFRGKSGS+SF G
Sbjct: 39  ---SYLSSSNKRVTIRASSSNSSRNTNLELSDREGTGVPDASFNELEPFRGKSGSISFYG 95

Query: 122 LTHQLVEEGKLMSAPFQ-EDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLLMGI 180
           LTHQ VE GKL+SAPF  ED GSFLWIL P  LI+SL +PQ F+ NVIE F+KD +L+ I
Sbjct: 96  LTHQSVEAGKLVSAPFNGEDNGSFLWILGPAALIASLFIPQFFISNVIEAFLKDEILVEI 155


>gi|8778686|gb|AAF79694.1|AC020889_2 T1N15.7 [Arabidopsis thaliana]
          Length = 343

 Score =  111 bits (277), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/126 (51%), Positives = 83/126 (65%), Gaps = 11/126 (8%)

Query: 61  KHKIEGYSA--WSS---KRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSG 115
           +HK++      W+     R  + C S+S SD++      + S+     +   E FRGKSG
Sbjct: 38  RHKLDSRQTLLWNKPQLSRVRVAC-SSSQSDSRPEKKQSDKSN-----YARAELFRGKSG 91

Query: 116 SVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDN 175
           SVSF GLTHQLVEE KL+SAPFQE+KGSFLW+LAPVVLISSLILPQ FL  +IE   K++
Sbjct: 92  SVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKND 151

Query: 176 LLMGIS 181
            + G S
Sbjct: 152 TVAGRS 157


>gi|15221197|ref|NP_175279.1| uncharacterized protein [Arabidopsis thaliana]
 gi|56381889|gb|AAV85663.1| At1g48460 [Arabidopsis thaliana]
 gi|57222220|gb|AAW39017.1| At1g48460 [Arabidopsis thaliana]
 gi|332194175|gb|AEE32296.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 340

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 64/125 (51%), Positives = 82/125 (65%), Gaps = 11/125 (8%)

Query: 61  KHKIEGYSA--WSS---KRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSG 115
           +HK++      W+     R  + C S+S SD++      + S+     +   E FRGKSG
Sbjct: 38  RHKLDSRQTLLWNKPQLSRVRVAC-SSSQSDSRPEKKQSDKSN-----YARAELFRGKSG 91

Query: 116 SVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDN 175
           SVSF GLTHQLVEE KL+SAPFQE+KGSFLW+LAPVVLISSLILPQ FL  +IE   K++
Sbjct: 92  SVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEATFKND 151

Query: 176 LLMGI 180
            +  I
Sbjct: 152 TVAEI 156


>gi|297847108|ref|XP_002891435.1| hypothetical protein ARALYDRAFT_473986 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337277|gb|EFH67694.1| hypothetical protein ARALYDRAFT_473986 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 331

 Score =  109 bits (273), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 75/183 (40%), Positives = 97/183 (53%), Gaps = 41/183 (22%)

Query: 3   MEVQSVCRGLQLPKFKSSNCGFLSHQFVSPSASQFGSRS-----QSIRFGNAYKFLGNTF 57
           ME +++C G   P+   S+   LS       +S F +R      Q++R+           
Sbjct: 1   MESKAICLGFLPPRLGFSSPHLLSRSPFLQGSSTFATRQKLDSRQTLRWNK--------- 51

Query: 58  PNLKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFNGPEPFRGKSGSV 117
           P L             RG + C+S           SG+++      +   E FRGKSGSV
Sbjct: 52  PQLS------------RGHVACSSKQ---------SGKSN------YARAELFRGKSGSV 84

Query: 118 SFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDFIKDNLL 177
           SF GLTHQLVEE KL+SAPFQE+KGSFLW+LAPVVLISSLILPQ FL   IE   K++ +
Sbjct: 85  SFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGAIEASFKNDTV 144

Query: 178 MGI 180
             I
Sbjct: 145 AEI 147


>gi|148909474|gb|ABR17835.1| unknown [Picea sitchensis]
          Length = 284

 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 39/67 (58%)

Query: 110 FRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIE 169
           F GK G V+F G  HQ ++EG+  S   +E   S +W+L P+ L++S++LP  +L    E
Sbjct: 132 FDGKPGFVAFGGGNHQHLKEGEFSSTSSEEGNRSLIWLLGPIALVASVVLPPFYLRKSFE 191

Query: 170 DFIKDNL 176
               D+L
Sbjct: 192 AVFGDSL 198


>gi|224119486|ref|XP_002318085.1| predicted protein [Populus trichocarpa]
 gi|222858758|gb|EEE96305.1| predicted protein [Populus trichocarpa]
          Length = 329

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 5/136 (3%)

Query: 42  QSIRFGNAYKFLGNTFPNLKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPV 101
            S  F +    L   FP  +H I   S   S+   I   ++   D      +   +S  +
Sbjct: 12  HSCHFIDDLPKLKGKFPRRQHIIFAASQDQSQYSEIKPDASEEEDDHPTAEALPTTSSSI 71

Query: 102 AIFNGPEPFRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQ 161
             F G +   GK G +SF    +++V+E  ++ +  Q +  S LW + P VL++S I P 
Sbjct: 72  NHFPGTD---GKPGLISFYNRPYKMVDE--VIISNVQRNPSSLLWFIGPAVLVASFIFPS 126

Query: 162 MFLGNVIEDFIKDNLL 177
           ++L  ++    +D+LL
Sbjct: 127 LYLRKILSAVFEDSLL 142


>gi|255540607|ref|XP_002511368.1| conserved hypothetical protein [Ricinus communis]
 gi|223550483|gb|EEF51970.1| conserved hypothetical protein [Ricinus communis]
          Length = 367

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 93  SGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVV 152
           + EN +   + FN  +   GK G VSF    +++  E  +     Q+++ S LW   P V
Sbjct: 99  TAENLAPTSSFFNHLQGTDGKPGLVSFYNRPYKM--EDDVSKTNVQKNQSSLLWFAGPSV 156

Query: 153 LISSLILPQMFLGNVIEDFIKDNLL 177
           L++S I P ++L  ++    +D+LL
Sbjct: 157 LVASFIFPSLYLRRILSTVFEDSLL 181


>gi|449477656|ref|XP_004155083.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101208061
           [Cucumis sativus]
          Length = 366

 Score = 44.3 bits (103), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 92  PSGENSSLPVAIFNGPEPFRGKSGSVSFCGLTHQLVEEGKLMS-APFQEDKGSFLWILAP 150
           P+ E+ SL    F   E   GK G +SF    +   +EGK +  +  Q ++  FLW + P
Sbjct: 97  PTDEDISLENNSFLHYEGTGGKPGFISF----YNHSKEGKRIPLSSVQSNQYKFLWFVGP 152

Query: 151 VVLISSLILPQMFLGNVIEDFIKDNLL 177
            VL++S I P ++L  ++ +  +D+LL
Sbjct: 153 AVLVASFIFPSLYLRKLLSNIFEDSLL 179


>gi|359807649|ref|NP_001241168.1| uncharacterized protein LOC100803499 [Glycine max]
 gi|255634981|gb|ACU17849.1| unknown [Glycine max]
          Length = 371

 Score = 43.9 bits (102), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    H+   + K++    +  + S LW L P VL++S I P ++L  V+   
Sbjct: 121 GKPGLISFYSRPHR--RDSKILLPNSERSQNSILWFLGPAVLVASFIFPSLYLRKVLSII 178

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 179 FEDSLL 184


>gi|21592953|gb|AAM64903.1| unknown [Arabidopsis thaliana]
          Length = 367

 Score = 42.7 bits (99), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   E  ++    Q   G  LW++ P VL+SS ILP ++L  ++   
Sbjct: 121 GKPGFISFYNPRNK--TEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAV 178

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 179 FEDSLL 184


>gi|18424658|ref|NP_568963.1| uncharacterized protein [Arabidopsis thaliana]
 gi|89000983|gb|ABD59081.1| At5g63040 [Arabidopsis thaliana]
 gi|332010306|gb|AED97689.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 366

 Score = 42.4 bits (98), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   E  ++    Q   G  LW++ P VL+SS ILP ++L  ++   
Sbjct: 120 GKPGFISFYNPRNK--TEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAV 177

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 178 FEDSLL 183


>gi|297832432|ref|XP_002884098.1| hypothetical protein ARALYDRAFT_343424 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329938|gb|EFH60357.1| hypothetical protein ARALYDRAFT_343424 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 407

 Score = 42.4 bits (98), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 5/74 (6%)

Query: 104 FNGPEPFRGKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMF 163
           +NG +   GK G +SF    ++   E  ++    Q   G  LW++ P VL+SS ILP ++
Sbjct: 156 YNGND---GKPGFISFYNPRNK--TEDIIVPPESQSTWGRLLWLIGPAVLVSSFILPPIY 210

Query: 164 LGNVIEDFIKDNLL 177
           L  ++    +D+LL
Sbjct: 211 LRRIVSAVFEDSLL 224


>gi|42573776|ref|NP_974984.1| uncharacterized protein [Arabidopsis thaliana]
 gi|9758304|dbj|BAB08847.1| unnamed protein product [Arabidopsis thaliana]
 gi|332010307|gb|AED97690.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 366

 Score = 42.4 bits (98), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   E  ++    Q   G  LW++ P VL+SS ILP ++L  ++   
Sbjct: 120 GKPGFISFYNPRNK--TEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAV 177

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 178 FEDSLL 183


>gi|225456967|ref|XP_002278631.1| PREDICTED: uncharacterized protein LOC100251871 [Vitis vinifera]
          Length = 319

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++  +E  L+S P   ++ + LW + P +L++S I P ++L  ++   
Sbjct: 69  GKPGLISFYNRPYKRADE-DLISGP-PRNQNNLLWFIGPSILVASFIFPSLYLRRILSTV 126

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 127 FEDSLL 132


>gi|297733750|emb|CBI14997.3| unnamed protein product [Vitis vinifera]
          Length = 356

 Score = 41.2 bits (95), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++  +E  L+S P   ++ + LW + P +L++S I P ++L  ++   
Sbjct: 106 GKPGLISFYNRPYKRADE-DLISGP-PRNQNNLLWFIGPSILVASFIFPSLYLRRILSTV 163

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 164 FEDSLL 169


>gi|356508691|ref|XP_003523088.1| PREDICTED: uncharacterized protein LOC100796720 [Glycine max]
          Length = 372

 Score = 41.2 bits (95), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   + K++    +  + S LW L P VL++S I P ++L  V+   
Sbjct: 122 GKPGLLSFYNRPYR--RDSKILLPNSERSQNSILWFLGPAVLVASFIFPSLYLRKVLSII 179

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 180 FEDSLL 185


>gi|222619131|gb|EEE55263.1| hypothetical protein OsJ_03171 [Oryza sativa Japonica Group]
          Length = 372

 Score = 41.2 bits (95), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
           GKSG +SF G ++Q+     + S P    + S  +W + P +L++ L+LP ++L  V+  
Sbjct: 124 GKSGFISFQGSSYQM---KSVESVPHPGKEASRLVWFVGPTILVAFLVLPSLYLRKVLSA 180

Query: 171 FIKDNLL 177
             +D+LL
Sbjct: 181 VFEDSLL 187


>gi|168009397|ref|XP_001757392.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691515|gb|EDQ77877.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 252

 Score = 40.4 bits (93), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 11/72 (15%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLM--SAPFQE----DKGSFLWILAPVVLISSLILPQMFLG 165
           GK G VSFC         GK    S  +QE     +GS LW+  P+ L+ S++ P ++L 
Sbjct: 1   GKPGFVSFCS-----AGSGKEQPESTEYQELKPSQRGSLLWLFGPLALVFSVVGPPLYLR 55

Query: 166 NVIEDFIKDNLL 177
            V E  ++D+LL
Sbjct: 56  RVFESILEDSLL 67


>gi|218188924|gb|EEC71351.1| hypothetical protein OsI_03428 [Oryza sativa Indica Group]
          Length = 372

 Score = 39.7 bits (91), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
           GKSG +SF G ++++     + S P    + S  +W + P +L++ L+LP ++L  V+  
Sbjct: 124 GKSGFISFQGSSYRM---KSVESVPHPGKEASRLVWFVGPTILVAFLVLPSLYLRKVLSA 180

Query: 171 FIKDNLL 177
             +D+LL
Sbjct: 181 VFEDSLL 187


>gi|357461909|ref|XP_003601236.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
 gi|355490284|gb|AES71487.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
          Length = 377

 Score = 38.9 bits (89), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   E  L+++  +  + + LW + P VL++S I P ++L  ++   
Sbjct: 127 GKPGFISFYNRPYRRDSEILLLNS--ERSQNNILWFVGPAVLVASFIFPSLYLRKLLSII 184

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 185 FEDSLL 190


>gi|388517087|gb|AFK46605.1| unknown [Medicago truncatula]
          Length = 376

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   E  L+++  +  + + LW + P VL++S I P ++L  ++   
Sbjct: 126 GKPGFISFYNRPYRRDSEILLLNS--ERSQNNILWFVGPAVLVASFIFPSLYLRKLLSII 183

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 184 FEDSLL 189


>gi|357461911|ref|XP_003601237.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
 gi|355490285|gb|AES71488.1| hypothetical protein MTR_3g077500 [Medicago truncatula]
          Length = 330

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGSFLWILAPVVLISSLILPQMFLGNVIEDF 171
           GK G +SF    ++   E  L+++  +  + + LW + P VL++S I P ++L  ++   
Sbjct: 80  GKPGFISFYNRPYRRDSEILLLNS--ERSQNNILWFVGPAVLVASFIFPSLYLRKLLSII 137

Query: 172 IKDNLL 177
            +D+LL
Sbjct: 138 FEDSLL 143


>gi|326496917|dbj|BAJ98485.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 409

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 37/67 (55%), Gaps = 4/67 (5%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
           GK G +SF G + Q +    + S P    + S  +W + P +L++ L+LP ++L  V+  
Sbjct: 161 GKPGFISFHGGSSQTIS---VESVPHPGKEASRLVWFVGPTILVAFLVLPSLYLRKVLSA 217

Query: 171 FIKDNLL 177
             +D+LL
Sbjct: 218 VFEDSLL 224


>gi|441671766|ref|XP_003281660.2| PREDICTED: LOW QUALITY PROTEIN: nephrocystin-4 [Nomascus
           leucogenys]
          Length = 1583

 Score = 37.0 bits (84), Expect = 3.5,   Method: Composition-based stats.
 Identities = 27/93 (29%), Positives = 42/93 (45%), Gaps = 4/93 (4%)

Query: 9   CRGLQLPKFKSSNCGFLSHQFV--SPSASQFGSRSQSIRFGNAYKFLGNTFPNLKHKIEG 66
           CRGL  P   +S+CGF         P+A+  G RS+S+       +L +  P+ +     
Sbjct: 31  CRGLPAPPNPASSCGFRQQHLHLDGPAAAPIGWRSRSVALDLGSDWLRS--PDGESDARR 88

Query: 67  YSAWSSKRGAILCASNSGSDTKLGFPSGENSSL 99
               S +RGA L  +     +  G P G++S L
Sbjct: 89  LPGLSVRRGAPLSEAPPAGGSGRGAPFGQSSVL 121


>gi|414880826|tpg|DAA57957.1| TPA: hypothetical protein ZEAMMB73_152959 [Zea mays]
          Length = 384

 Score = 36.2 bits (82), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
           GK G +SF   T +   +  + S P    + S  +W + P +L+S L+LP ++L  V+  
Sbjct: 136 GKPGFISFQESTFR---KSTVESVPHPGKEASRLVWFIGPTILVSFLVLPSLYLRKVLSA 192

Query: 171 FIKDNLL 177
             +D+LL
Sbjct: 193 VFEDSLL 199


>gi|195646440|gb|ACG42688.1| hypothetical protein [Zea mays]
          Length = 375

 Score = 36.2 bits (82), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
           GK G +SF   T +   +  + S P    + S  +W + P +L+S L+LP ++L  V+  
Sbjct: 127 GKPGFISFQESTFR---KSTVESVPHPGKEASRLVWFIGPTILVSFLVLPSLYLRKVLSA 183

Query: 171 FIKDNLL 177
             +D+LL
Sbjct: 184 VFEDSLL 190


>gi|219363149|ref|NP_001136799.1| uncharacterized protein LOC100216945 [Zea mays]
 gi|194697152|gb|ACF82660.1| unknown [Zea mays]
 gi|414880825|tpg|DAA57956.1| TPA: hypothetical protein ZEAMMB73_152959 [Zea mays]
          Length = 375

 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 112 GKSGSVSFCGLTHQLVEEGKLMSAPFQEDKGS-FLWILAPVVLISSLILPQMFLGNVIED 170
           GK G +SF   T +   +  + S P    + S  +W + P +L+S L+LP ++L  V+  
Sbjct: 127 GKPGFISFQESTFR---KSTVESVPHPGKEASRLVWFIGPTILVSFLVLPSLYLRKVLSA 183

Query: 171 FIKDNLL 177
             +D+LL
Sbjct: 184 VFEDSLL 190


>gi|310830389|ref|YP_003965490.1| Minor extracellular protease epr [Paenibacillus polymyxa SC2]
 gi|309249856|gb|ADO59422.1| Minor extracellular protease epr [Paenibacillus polymyxa SC2]
          Length = 1035

 Score = 35.4 bits (80), Expect = 9.5,   Method: Composition-based stats.
 Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 5/103 (4%)

Query: 48  NAYKFLGNTF--PNLKHKIEGYSAWSSKRGAILCASNSGSDTKLGFPSGENSSLPVAIFN 105
           N Y ++  +    N+++K+EG +AW    G+IL  +N+    +     G  S +   + N
Sbjct: 310 NIYAYINASLDSTNIQYKLEGDTAWKPYNGSILLTNNTTIYAQTSNSGGYQSEVVSLVVN 369

Query: 106 GPEPFRGKSGSVSFCGLTHQLV---EEGKLMSAPFQEDKGSFL 145
             +     S +VS  G    ++   +   + S  +Q D GS+L
Sbjct: 370 NIDKTAPTSPTVSIVGNKLTIIPGSDPSGISSTKYQIDNGSWL 412


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,956,907,940
Number of Sequences: 23463169
Number of extensions: 124727463
Number of successful extensions: 235997
Number of sequences better than 100.0: 38
Number of HSP's better than 100.0 without gapping: 22
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 235969
Number of HSP's gapped (non-prelim): 38
length of query: 181
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 48
effective length of database: 9,238,593,890
effective search space: 443452506720
effective search space used: 443452506720
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)