BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033081
         (128 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359484646|ref|XP_003633137.1| PREDICTED: uncharacterized protein LOC100854031 [Vitis vinifera]
          Length = 121

 Score =  148 bits (374), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 78/125 (62%), Positives = 94/125 (75%), Gaps = 12/125 (9%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPEN- 59
           MA ++++FRP G++  A      + D  RR + SSNWW+PLFGWS + DYID E+K  N 
Sbjct: 3   MATDLISFRPIGIRACAA-----SCDSHRR-STSSNWWTPLFGWSPDPDYIDPETKTPNP 56

Query: 60  ----RSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDF 115
               RSE DLE K+SRS R++PGCFTEEKAKQLRL+T  T+SFHDAMYHSAIASRLASDF
Sbjct: 57  VAQSRSEQDLETKTSRS-RYAPGCFTEEKAKQLRLVTTSTSSFHDAMYHSAIASRLASDF 115

Query: 116 KIRSD 120
           K RSD
Sbjct: 116 KKRSD 120


>gi|315937290|gb|ADU56195.1| hypothetical protein [Jatropha curcas]
          Length = 128

 Score =  144 bits (362), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/129 (59%), Positives = 94/129 (72%), Gaps = 9/129 (6%)

Query: 1   MAANILTFRPAGVQVYATWSGHR-NPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPEN 59
           MAANIL      ++  A     R +P G++  ++S NWW PLFGWSSEADYI SE+KP++
Sbjct: 1   MAANILPLNSVTIRARAGPGSQRPDPHGRKSSSSSGNWWXPLFGWSSEADYIGSENKPQD 60

Query: 60  -------RSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
                  RSESDL +K  RS RF+PG FTEEKA+QLR+MT +T+SFHD MYHSAIASRLA
Sbjct: 61  MQEKEDGRSESDLSSKPVRS-RFAPGGFTEEKARQLRMMTNETSSFHDVMYHSAIASRLA 119

Query: 113 SDFKIRSDR 121
           SDFK RS+R
Sbjct: 120 SDFKHRSER 128


>gi|358248696|ref|NP_001239669.1| uncharacterized protein LOC100780553 [Glycine max]
 gi|255640770|gb|ACU20669.1| unknown [Glycine max]
          Length = 123

 Score =  131 bits (329), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 76/128 (59%), Positives = 92/128 (71%), Gaps = 14/128 (10%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSE------ 54
           MAA+ +  R AG+Q  AT SGHR  D  RR+ +S++WW+PLFGWSSE DYIDS       
Sbjct: 1   MAASSIPIRVAGIQPCAT-SGHRRTDPDRRRASSTSWWTPLFGWSSEPDYIDSNNRASSL 59

Query: 55  --SKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
             +KPE  + +  E+K+ R  RF+ GCFTEEKAKQLR+MT  + SFHD MYHSAIASRLA
Sbjct: 60  QRTKPEPVTAA-TESKAPR-PRFA-GCFTEEKAKQLRMMT--SKSFHDTMYHSAIASRLA 114

Query: 113 SDFKIRSD 120
           SDFK RSD
Sbjct: 115 SDFKARSD 122


>gi|255566909|ref|XP_002524437.1| conserved hypothetical protein [Ricinus communis]
 gi|223536225|gb|EEF37877.1| conserved hypothetical protein [Ricinus communis]
          Length = 125

 Score =  129 bits (324), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 73/126 (57%), Positives = 89/126 (70%), Gaps = 6/126 (4%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENR 60
           MA N+L   P  ++         +P  ++  + SSNWW+PLFG SSEADYI SE K + R
Sbjct: 1   MATNLLPSTPLMIRASTGPVQKPDPSRRKSSSYSSNWWTPLFGMSSEADYIGSEPKADGR 60

Query: 61  ----SESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDF- 115
               SESDL+ K +RS RF+PG FTEEKAKQLR++T +T+SFHD MYHSAIASRLASDF 
Sbjct: 61  KEETSESDLDPKPARS-RFTPGAFTEEKAKQLRMLTTETSSFHDVMYHSAIASRLASDFN 119

Query: 116 KIRSDR 121
           K RSDR
Sbjct: 120 KRRSDR 125


>gi|357477921|ref|XP_003609246.1| hypothetical protein MTR_4g113600 [Medicago truncatula]
 gi|355510301|gb|AES91443.1| hypothetical protein MTR_4g113600 [Medicago truncatula]
          Length = 113

 Score =  124 bits (310), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 71/102 (69%), Positives = 79/102 (77%), Gaps = 4/102 (3%)

Query: 20  SGHRNPDGKRRKNASS-NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPG 78
           SGHR PD  RR+N+SS NWW+PLFG SSE DYIDS++K   +SESD  +K S+  RFS G
Sbjct: 14  SGHRIPDPDRRRNSSSTNWWTPLFGMSSEPDYIDSDNK-APKSESDPSSKPSQL-RFS-G 70

Query: 79  CFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDFKIRSD 120
             TEEKAKQLR MT  T SFHD MYHSAIASRLASDFK RSD
Sbjct: 71  GLTEEKAKQLRKMTVGTESFHDTMYHSAIASRLASDFKARSD 112


>gi|225442809|ref|XP_002285278.1| PREDICTED: uncharacterized protein LOC100244679 [Vitis vinifera]
          Length = 119

 Score =  119 bits (298), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 64/118 (54%), Positives = 78/118 (66%), Gaps = 7/118 (5%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENR 60
           MA+ +  F P G++  A  SG R  D  RR+ +   WWSP+FGWSS+ DYI  ES    +
Sbjct: 1   MASTLFVFNPTGIRASAV-SGPRKLDQNRRRASQPTWWSPIFGWSSDPDYIVGESA--EK 57

Query: 61  SESDLEAKSSRS----SRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASD 114
           S   LE K S +    SRF+ GCFTEEKAKQLR  T ++ASFHD MYHS+IASRLASD
Sbjct: 58  SAGALEGKDSETARSRSRFTLGCFTEEKAKQLRRKTMESASFHDIMYHSSIASRLASD 115


>gi|351725191|ref|NP_001235804.1| uncharacterized protein LOC100306235 [Glycine max]
 gi|255627965|gb|ACU14327.1| unknown [Glycine max]
          Length = 123

 Score =  116 bits (291), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/128 (58%), Positives = 90/128 (70%), Gaps = 14/128 (10%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSE------ 54
           MAA  +  R AG+Q  AT SG R  D  RR+ +S+NWW+PLFGWSSE DYIDS       
Sbjct: 1   MAATSIPIRVAGIQACAT-SGRRRADPDRRRASSANWWTPLFGWSSEPDYIDSNNKASSL 59

Query: 55  --SKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
             +KPE  + ++ E+K+ R  RF+ G FTEEKAKQLR+MT +  SFHD MYHSAIASRLA
Sbjct: 60  QPAKPEPVT-AETESKAPR-PRFA-GGFTEEKAKQLRMMTSE--SFHDTMYHSAIASRLA 114

Query: 113 SDFKIRSD 120
           SDFK RSD
Sbjct: 115 SDFKSRSD 122


>gi|147768005|emb|CAN71657.1| hypothetical protein VITISV_030821 [Vitis vinifera]
          Length = 119

 Score =  115 bits (288), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 63/119 (52%), Positives = 77/119 (64%), Gaps = 7/119 (5%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENR 60
           MA+ +  F P G++  A  SG R  D  RR+ +   WWSP+FGWSS+ DYI  ES    +
Sbjct: 1   MASTLSVFNPTGIRASAV-SGPRKLDQNRRRASQPTWWSPIFGWSSDPDYIVGESA--EK 57

Query: 61  SESDLEAKSSRS----SRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDF 115
           S    E K S +    SRF+ GCFTEEKAKQLR  T ++ASFHD MYHS+IASRLASD 
Sbjct: 58  SAGAQEGKDSETARSRSRFTLGCFTEEKAKQLRRKTMESASFHDIMYHSSIASRLASDV 116


>gi|388497552|gb|AFK36842.1| unknown [Medicago truncatula]
          Length = 113

 Score =  115 bits (288), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 66/102 (64%), Positives = 72/102 (70%), Gaps = 4/102 (3%)

Query: 20  SGHRNPDGKRRKNASS-NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPG 78
           SGHR PD  RR+N+SS NWW+PLFG SSE DYIDS++K          +  S   RFS G
Sbjct: 14  SGHRIPDPDRRRNSSSTNWWTPLFGMSSEPDYIDSDNKAPKSESD--PSSKSSQLRFS-G 70

Query: 79  CFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDFKIRSD 120
             TEEKAKQLR MT  T SFHD MYHSAIASRLASDFK RSD
Sbjct: 71  GLTEEKAKQLRKMTVGTESFHDTMYHSAIASRLASDFKARSD 112


>gi|356526179|ref|XP_003531697.1| PREDICTED: uncharacterized protein LOC100778751 [Glycine max]
          Length = 120

 Score =  112 bits (279), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 58/115 (50%), Positives = 73/115 (63%), Gaps = 3/115 (2%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENR 60
           MA     FRP  ++  A   G   PD   RK  SSNWW+PLFGW ++ DYI    K   +
Sbjct: 1   MATTFTFFRPTMIRASAGSPGR--PDPSNRKPVSSNWWAPLFGWPADPDYIGPSQKTSYQ 58

Query: 61  SESDLEAKSSRS-SRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASD 114
           S+ + E    R  S+F+ GCFTE+KAK+LR  T +T++FHD MYHSAIASRLASD
Sbjct: 59  SDPEREVGPGRPRSKFAAGCFTEKKAKELRKKTVETSTFHDIMYHSAIASRLASD 113


>gi|255553047|ref|XP_002517566.1| conserved hypothetical protein [Ricinus communis]
 gi|223543198|gb|EEF44730.1| conserved hypothetical protein [Ricinus communis]
          Length = 126

 Score =  110 bits (274), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/127 (50%), Positives = 82/127 (64%), Gaps = 6/127 (4%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSN-WWSPLFGWSSEADYIDSESKPEN 59
           MA+ ++        + A+    R PD   RK ASS  WW+PLFGWSS+ DYI++ S   N
Sbjct: 1   MASTLVFNCSLAAPIRASSRSARKPDPNSRKTASSTTWWAPLFGWSSDPDYINTGSDTVN 60

Query: 60  R----SESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDF 115
           +    SES+  +  +RS +FS GCFTEEKA+QLR  T +++SFHD MYHSAIASRLASD 
Sbjct: 61  KQAEISESESGSDGARS-KFSLGCFTEEKARQLRKKTAESSSFHDIMYHSAIASRLASDI 119

Query: 116 KIRSDRE 122
             RS  E
Sbjct: 120 SGRSGNE 126


>gi|356519399|ref|XP_003528360.1| PREDICTED: uncharacterized protein LOC100776958 [Glycine max]
          Length = 122

 Score =  108 bits (269), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 55/117 (47%), Positives = 74/117 (63%), Gaps = 5/117 (4%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENR 60
           MA  ++ FRP    + A+      PD + RK  SSNWW+PLFGW ++ DY+    K   +
Sbjct: 1   MATTLMFFRP--TMIRASAGSPAKPDPRNRKPVSSNWWAPLFGWPADPDYMGPLQKASEK 58

Query: 61  SESDLEAKSSRS---SRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASD 114
           S+ + E         S+F+ GCFTE+KAK+LR  T +T++FHD MYHSAIASRLASD
Sbjct: 59  SDPEREVGLDLGRPRSKFTAGCFTEKKAKELRKKTVETSTFHDIMYHSAIASRLASD 115


>gi|255640540|gb|ACU20555.1| unknown [Glycine max]
          Length = 120

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 56/115 (48%), Positives = 71/115 (61%), Gaps = 3/115 (2%)

Query: 1   MAANILTFRPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENR 60
           MA     FRP  ++  A   G   PD   RK  SSNWW+P FGW ++ DYI    K   +
Sbjct: 1   MATTFTFFRPTMIRASAGSPGR--PDPSNRKPVSSNWWAPPFGWPADPDYIGPSQKTSYQ 58

Query: 61  SESDLEAKSSRS-SRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASD 114
           S+ + E    R  S+F+ GCFT +KAK+LR  T +T++FHD MYHSAIASRLASD
Sbjct: 59  SDPEREVGPGRPRSKFAAGCFTGKKAKELRKKTVETSTFHDIMYHSAIASRLASD 113


>gi|224054146|ref|XP_002298114.1| predicted protein [Populus trichocarpa]
 gi|222845372|gb|EEE82919.1| predicted protein [Populus trichocarpa]
          Length = 131

 Score =  102 bits (253), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 65/133 (48%), Positives = 79/133 (59%), Gaps = 17/133 (12%)

Query: 3   ANILTFR---PAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSES---- 55
           A++LTF    PA V+  A+    R PD   +K+ SS WWSPLFGWSS  DY++  S    
Sbjct: 2   ASVLTFACSVPAPVR--ASSGSPRTPDPYGKKSGSSTWWSPLFGWSSSPDYLNGNSIAGG 59

Query: 56  -------KPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIA 108
                  K    S SD E    RS RF+ G FTEEKAKQLR  T + ++FHD MYHSAIA
Sbjct: 60  TGDDVPDKESGLSGSDQEPGRPRS-RFALGSFTEEKAKQLRRKTLEGSTFHDMMYHSAIA 118

Query: 109 SRLASDFKIRSDR 121
           SRLASD   R ++
Sbjct: 119 SRLASDGSGRQEK 131


>gi|18404156|ref|NP_564614.1| uncharacterized protein [Arabidopsis thaliana]
 gi|5903034|gb|AAD55593.1|AC008016_3 F6D8.3 [Arabidopsis thaliana]
 gi|21554100|gb|AAM63180.1| unknown [Arabidopsis thaliana]
 gi|28466819|gb|AAO44018.1| At1g52720 [Arabidopsis thaliana]
 gi|110736534|dbj|BAF00234.1| hypothetical protein [Arabidopsis thaliana]
 gi|332194723|gb|AEE32844.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 117

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 53/101 (52%), Positives = 67/101 (66%), Gaps = 7/101 (6%)

Query: 20  SGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPE--NRSESDLEAKSSRSSRFSP 77
           SG  NPD  R+K+A+  WW+PLFG  S+ DY++ ES     N  ++D+   S    +F  
Sbjct: 19  SGSLNPDQNRKKSAA--WWAPLFGLPSDPDYLNIESSCSTVNPDKTDI---SGSGQKFRR 73

Query: 78  GCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDFKIR 118
           GCFTEEKAKQLR  T + ++FHD MYHSAIASRLASD   R
Sbjct: 74  GCFTEEKAKQLRRKTAEASTFHDVMYHSAIASRLASDITGR 114


>gi|224070847|ref|XP_002303262.1| predicted protein [Populus trichocarpa]
 gi|222840694|gb|EEE78241.1| predicted protein [Populus trichocarpa]
          Length = 131

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 57/111 (51%), Positives = 67/111 (60%), Gaps = 12/111 (10%)

Query: 15  VYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSES-----------KPENRSES 63
           V A+    R PD   +K+ SS WW+PLFGWSS  DYI+S S           K    S S
Sbjct: 15  VRASSGSSRKPDPCGKKSGSSTWWAPLFGWSSSPDYINSGSAGGSTSDDIPYKESGVSGS 74

Query: 64  DLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASD 114
           D E    RS RF+ G FTEEKA++LR  T +  +FHD MYHSAIASRLASD
Sbjct: 75  DQEPGRPRS-RFALGSFTEEKARRLRRKTVEGITFHDMMYHSAIASRLASD 124


>gi|326501614|dbj|BAK02596.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 129

 Score = 93.2 bits (230), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/88 (52%), Positives = 57/88 (64%), Gaps = 3/88 (3%)

Query: 27  GKRRKNASSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSR-FSPGCFTEEKA 85
           G R + A   WW+PL GWS + DYID++  P   +E +  A     +R F  G  TEEKA
Sbjct: 35  GARFRGADGKWWAPLLGWSGQPDYIDAQPAPAPATEEERAAGCGAGARRF--GVLTEEKA 92

Query: 86  KQLRLMTKDTASFHDAMYHSAIASRLAS 113
           +QLR+   +T SFHDAMYHSAIASRLAS
Sbjct: 93  RQLRMRMMETESFHDAMYHSAIASRLAS 120


>gi|297834436|ref|XP_002885100.1| hypothetical protein ARALYDRAFT_479015 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330940|gb|EFH61359.1| hypothetical protein ARALYDRAFT_479015 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 107

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 52/94 (55%), Positives = 64/94 (68%), Gaps = 8/94 (8%)

Query: 26  DGKRRKNASS-NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEK 84
           D  R+K  SS +WW+PLFG SSE DY++   K  N  ESDL+    RS R    C TEEK
Sbjct: 20  DPSRKKPVSSVSWWAPLFGMSSEPDYVN---KTVNL-ESDLDKTDKRSLR---CCLTEEK 72

Query: 85  AKQLRLMTKDTASFHDAMYHSAIASRLASDFKIR 118
           AKQLR  T + ++FHD MYHSAIASRLASD +++
Sbjct: 73  AKQLRRKTAEASTFHDVMYHSAIASRLASDVRVK 106


>gi|18400843|ref|NP_566521.1| uncharacterized protein [Arabidopsis thaliana]
 gi|11994335|dbj|BAB02294.1| unnamed protein product [Arabidopsis thaliana]
 gi|15529234|gb|AAK97711.1| AT3g15630/MSJ11_3 [Arabidopsis thaliana]
 gi|16974391|gb|AAL31121.1| AT3g15630/MSJ11_3 [Arabidopsis thaliana]
 gi|332642184|gb|AEE75705.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 107

 Score = 92.0 bits (227), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 52/94 (55%), Positives = 64/94 (68%), Gaps = 8/94 (8%)

Query: 26  DGKRRKNASS-NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEK 84
           D  R+K  SS +WW+PLFG SSE DY++   K  N  ESDL+    RS R    C TEEK
Sbjct: 20  DPARKKPVSSVSWWAPLFGMSSEPDYVN---KTVNL-ESDLDKAEKRSLR---CCLTEEK 72

Query: 85  AKQLRLMTKDTASFHDAMYHSAIASRLASDFKIR 118
           AKQLR  T + ++FHD MYHSAIASRLASD +++
Sbjct: 73  AKQLRRKTAEASTFHDVMYHSAIASRLASDVRVK 106


>gi|297847660|ref|XP_002891711.1| hypothetical protein ARALYDRAFT_892289 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337553|gb|EFH67970.1| hypothetical protein ARALYDRAFT_892289 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 121

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 64/101 (63%), Gaps = 11/101 (10%)

Query: 24  NPDGKRRKNASSNWWSPLFGWSSEADYIDSES-----KPE-NRSESDLEAKSSRSSRFSP 77
           NP   R+K+AS  WW+PLFG  S+ +Y++ ES      PE N  + D+   S    +F  
Sbjct: 23  NPGPNRKKSAS--WWAPLFGLPSDPNYLNIESSGSAVNPESNPGKPDI---SGSGQKFRR 77

Query: 78  GCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLASDFKIR 118
           GC TEEKAKQLR  T + ++FHD MYHSAIASRLASD   R
Sbjct: 78  GCLTEEKAKQLRRKTAEASTFHDVMYHSAIASRLASDISDR 118


>gi|357135828|ref|XP_003569510.1| PREDICTED: uncharacterized protein LOC100821923 [Brachypodium
           distachyon]
          Length = 149

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/87 (54%), Positives = 54/87 (62%), Gaps = 5/87 (5%)

Query: 30  RKNASSN--WWSPLFGWSSEADYID-SESKPENRSESDLEAKSSRSSRFSPGCFTEEKAK 86
           R  A+ N  WW+PL GWS + DYID +   P    E +  AK     RF  G  TEEKA+
Sbjct: 50  RAEATGNGRWWAPLLGWSGQPDYIDGAPPAPPREEEDERAAKGQGQRRF--GVLTEEKAR 107

Query: 87  QLRLMTKDTASFHDAMYHSAIASRLAS 113
           QLR    +T SFHDAMYHSAIASRLAS
Sbjct: 108 QLRARMMETESFHDAMYHSAIASRLAS 134


>gi|242051889|ref|XP_002455090.1| hypothetical protein SORBIDRAFT_03g004190 [Sorghum bicolor]
 gi|241927065|gb|EES00210.1| hypothetical protein SORBIDRAFT_03g004190 [Sorghum bicolor]
          Length = 130

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/83 (49%), Positives = 56/83 (67%), Gaps = 3/83 (3%)

Query: 32  NASSNWWSPLFGWSSEADYIDSE-SKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRL 90
           + ++ WW+PL GWS + DYID+  +  E   E   +  S+ + RF  G  TE+KA++LR+
Sbjct: 43  SGAAKWWAPLLGWSGQPDYIDARPAASEEEPEQTRQRASASARRF--GVLTEDKARRLRM 100

Query: 91  MTKDTASFHDAMYHSAIASRLAS 113
              +T SFHDAMYHSAIASRLAS
Sbjct: 101 QMMETESFHDAMYHSAIASRLAS 123


>gi|414876077|tpg|DAA53208.1| TPA: hypothetical protein ZEAMMB73_919517 [Zea mays]
          Length = 119

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/80 (52%), Positives = 54/80 (67%), Gaps = 3/80 (3%)

Query: 35  SNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSR-FSPGCFTEEKAKQLRLMTK 93
           + WW+PL GWS + DYID+    E+       A +S S+R F  G  TE+KA++LR+   
Sbjct: 35  AKWWAPLLGWSGQPDYIDARPASEDEPLPRGPASASASARRF--GVLTEDKARRLRVQMM 92

Query: 94  DTASFHDAMYHSAIASRLAS 113
           +T SFHDAMYHSAIASRLAS
Sbjct: 93  ETESFHDAMYHSAIASRLAS 112


>gi|226506616|ref|NP_001144682.1| uncharacterized protein LOC100277712 [Zea mays]
 gi|195645624|gb|ACG42280.1| hypothetical protein [Zea mays]
          Length = 119

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/80 (52%), Positives = 54/80 (67%), Gaps = 3/80 (3%)

Query: 35  SNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSR-FSPGCFTEEKAKQLRLMTK 93
           + WW+PL GWS + DYID+    E+       A +S S+R F  G  TE+KA++LR+   
Sbjct: 35  AKWWAPLLGWSGQPDYIDARPASEDEPLPRGPASASASARRF--GVLTEDKARRLRVQMM 92

Query: 94  DTASFHDAMYHSAIASRLAS 113
           +T SFHDAMYHSAIASRLAS
Sbjct: 93  ETESFHDAMYHSAIASRLAS 112


>gi|115434822|ref|NP_001042169.1| Os01g0175100 [Oryza sativa Japonica Group]
 gi|55296799|dbj|BAD68125.1| unknown protein [Oryza sativa Japonica Group]
 gi|113531700|dbj|BAF04083.1| Os01g0175100 [Oryza sativa Japonica Group]
 gi|215700989|dbj|BAG92413.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 138

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 51/84 (60%), Gaps = 10/84 (11%)

Query: 36  NWWSPLFGWSSEADYIDSESKPENRSES------DLEAKSSRSSRFSPGCFTEEKAKQLR 89
            WW+PL GWS + DYID++  P  R E+        E +     RF  G  TEEKA+QLR
Sbjct: 50  KWWAPLLGWSGQPDYIDAQ--PAAREEARPNPMAAAEQRGGGGKRF--GVLTEEKARQLR 105

Query: 90  LMTKDTASFHDAMYHSAIASRLAS 113
               +T SFHD MYHSAIASRLAS
Sbjct: 106 ARMMETESFHDCMYHSAIASRLAS 129


>gi|357134434|ref|XP_003568822.1| PREDICTED: uncharacterized protein LOC100844174 [Brachypodium
           distachyon]
          Length = 115

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/104 (48%), Positives = 59/104 (56%), Gaps = 6/104 (5%)

Query: 9   RPAGVQVYATWSGHRNPDGKRRKNASSNWWSPLFGWSSEADYIDSESKPENRSESDLEAK 68
           RPA V V    +      G+ R      WW+PL GWS  ADYI  E+     +   +E K
Sbjct: 15  RPATVVVRPRAASVSAGAGQVRAE-GGKWWAPLVGWSGRADYI--EAAVPAPAPVLVEEK 71

Query: 69  SSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
           + RS  F  G  TEEKA+QLR    +T SFHDAMYHSAIASRLA
Sbjct: 72  AGRS--FV-GGLTEEKARQLRARMVETESFHDAMYHSAIASRLA 112


>gi|195652245|gb|ACG45590.1| hypothetical protein [Zea mays]
          Length = 119

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/76 (53%), Positives = 51/76 (67%), Gaps = 3/76 (3%)

Query: 37  WWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTA 96
           WW+PL GWS +ADY+++ + P      D +A ++R  R   G  TEEKA+QLR     T 
Sbjct: 44  WWAPLLGWSGKADYLEAPA-PAPVVPQDDDAAAAR--RQFVGGLTEEKARQLRARMAQTE 100

Query: 97  SFHDAMYHSAIASRLA 112
           SFHDAMYHSAIASRLA
Sbjct: 101 SFHDAMYHSAIASRLA 116


>gi|326504048|dbj|BAK02810.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514538|dbj|BAJ96256.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 112

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 49/77 (63%), Gaps = 8/77 (10%)

Query: 36  NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDT 95
            WW+PL GWS +ADYI++ +         + A+  +  R   G  TEEKA+QLR    +T
Sbjct: 41  KWWAPLVGWSGKADYIEAAA--------PVVAEEEKPGRSFVGGLTEEKARQLRARMSET 92

Query: 96  ASFHDAMYHSAIASRLA 112
            SFHDAMYHSAIASRLA
Sbjct: 93  ESFHDAMYHSAIASRLA 109


>gi|242089729|ref|XP_002440697.1| hypothetical protein SORBIDRAFT_09g005330 [Sorghum bicolor]
 gi|241945982|gb|EES19127.1| hypothetical protein SORBIDRAFT_09g005330 [Sorghum bicolor]
          Length = 125

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/76 (53%), Positives = 52/76 (68%), Gaps = 3/76 (3%)

Query: 37  WWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTA 96
           WW+PL GWS +ADY+++ +     ++ D EA  +R  +F  G  TEEKA+QLR     T 
Sbjct: 50  WWAPLLGWSGKADYLEAPAPAPAVAQDD-EA-GARGRQFV-GGLTEEKARQLRARMAQTE 106

Query: 97  SFHDAMYHSAIASRLA 112
           SFHDAMYHSAIASRLA
Sbjct: 107 SFHDAMYHSAIASRLA 122


>gi|413944671|gb|AFW77320.1| hypothetical protein ZEAMMB73_662050 [Zea mays]
          Length = 119

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/76 (53%), Positives = 51/76 (67%), Gaps = 3/76 (3%)

Query: 37  WWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTA 96
           WW+PL GWS +ADY+++ + P      D +A ++R  R   G  TEEKA+QLR     T 
Sbjct: 44  WWAPLLGWSGKADYLEAPA-PVPAVLQDDDAAAAR--RQFVGGLTEEKARQLRARMAQTE 100

Query: 97  SFHDAMYHSAIASRLA 112
           SFHDAMYHSAIASRLA
Sbjct: 101 SFHDAMYHSAIASRLA 116


>gi|226503623|ref|NP_001142734.1| uncharacterized protein LOC100275074 [Zea mays]
 gi|195608922|gb|ACG26291.1| hypothetical protein [Zea mays]
          Length = 119

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/79 (50%), Positives = 50/79 (63%), Gaps = 3/79 (3%)

Query: 34  SSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTK 93
              WW+PL GWS +ADY+++ +      + D  A + R  +F  G  TEEKA+QLR    
Sbjct: 41  GGRWWAPLLGWSGKADYLEAPAPAPAVPQDDDAAAARR--QFV-GGLTEEKARQLRARMA 97

Query: 94  DTASFHDAMYHSAIASRLA 112
            T SFHDAMYHSAIASRLA
Sbjct: 98  QTESFHDAMYHSAIASRLA 116


>gi|242089731|ref|XP_002440698.1| hypothetical protein SORBIDRAFT_09g005340 [Sorghum bicolor]
 gi|241945983|gb|EES19128.1| hypothetical protein SORBIDRAFT_09g005340 [Sorghum bicolor]
          Length = 114

 Score = 75.1 bits (183), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 38/76 (50%), Positives = 48/76 (63%), Gaps = 5/76 (6%)

Query: 37  WWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTA 96
           WW+PL GWS +ADY+++ +        D  A+     R   G  TEEKA++LR+    T 
Sbjct: 41  WWAPLLGWSGKADYLEAPAPAPVPVAQDEAAR-----RQFVGVMTEEKARELRVRMAQTE 95

Query: 97  SFHDAMYHSAIASRLA 112
           SFHDAMYHSAIASRLA
Sbjct: 96  SFHDAMYHSAIASRLA 111


>gi|115462385|ref|NP_001054792.1| Os05g0176700 [Oryza sativa Japonica Group]
 gi|50511423|gb|AAT77346.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578343|dbj|BAF16706.1| Os05g0176700 [Oryza sativa Japonica Group]
 gi|215701233|dbj|BAG92657.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 118

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/79 (50%), Positives = 51/79 (64%), Gaps = 4/79 (5%)

Query: 34  SSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTK 93
           S  WW+PL GWS +ADYI++ +     +  + EA+     R   G  TEEKA++LR    
Sbjct: 41  SGKWWAPLLGWSGKADYIEAPAPAVVAAAEESEARR----RPFVGGLTEEKARELRARMV 96

Query: 94  DTASFHDAMYHSAIASRLA 112
           +T SFHDAMYHSAIASRLA
Sbjct: 97  ETESFHDAMYHSAIASRLA 115


>gi|125551027|gb|EAY96736.1| hypothetical protein OsI_18655 [Oryza sativa Indica Group]
          Length = 116

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/79 (50%), Positives = 51/79 (64%), Gaps = 4/79 (5%)

Query: 34  SSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTK 93
           S  WW+PL GWS +ADYI++ +     +  + EA+     R   G  TEEKA++LR    
Sbjct: 39  SGKWWAPLLGWSGKADYIEAPAPAVVAAAEESEARR----RPFVGGLTEEKARELRARMV 94

Query: 94  DTASFHDAMYHSAIASRLA 112
           +T SFHDAMYHSAIASRLA
Sbjct: 95  ETESFHDAMYHSAIASRLA 113


>gi|226501544|ref|NP_001144674.1| uncharacterized protein LOC100277700 [Zea mays]
 gi|195645556|gb|ACG42246.1| hypothetical protein [Zea mays]
          Length = 113

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/119 (39%), Positives = 65/119 (54%), Gaps = 23/119 (19%)

Query: 2   AANILTF--RPAGVQVYATWSGHRNPDGKRRKNA------SSNWWSPLFGWSSEADYIDS 53
           AA +++F  RPA     A+ +G R   G    +A       + WW+PL GWS +ADY+++
Sbjct: 7   AACVVSFPARPA-----ASTAGPRAAAGTGAVHAVAGSAEGAKWWAPLLGWSGKADYLEA 61

Query: 54  ESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
            +  ++ +            R   G  TEEKA++LR     T SFHDAMYHSAIASRLA
Sbjct: 62  PAPAQDEAPR----------RQFVGVMTEEKARELRARMAQTESFHDAMYHSAIASRLA 110


>gi|222630385|gb|EEE62517.1| hypothetical protein OsJ_17315 [Oryza sativa Japonica Group]
          Length = 116

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/79 (49%), Positives = 51/79 (64%), Gaps = 4/79 (5%)

Query: 34  SSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTK 93
           +  WW+PL GWS +ADYI++ +     +  + EA+     R   G  TEEKA++LR    
Sbjct: 39  TGKWWAPLLGWSGKADYIEAPAPAVVAAAEESEARR----RPFVGGLTEEKARELRARMV 94

Query: 94  DTASFHDAMYHSAIASRLA 112
           +T SFHDAMYHSAIASRLA
Sbjct: 95  ETESFHDAMYHSAIASRLA 113


>gi|357134438|ref|XP_003568824.1| PREDICTED: uncharacterized protein LOC100844774 [Brachypodium
           distachyon]
          Length = 138

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 41/77 (53%), Positives = 48/77 (62%), Gaps = 4/77 (5%)

Query: 36  NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDT 95
            WW+PL GWS  ADYI++ +      E +      RSS F  G  TE KA+QLR    +T
Sbjct: 63  KWWAPLVGWSGRADYIETAAAAVEEEEEEARI---RSSSF-VGGLTEAKARQLRARMVET 118

Query: 96  ASFHDAMYHSAIASRLA 112
            SFHDAMYHSAIASRLA
Sbjct: 119 ESFHDAMYHSAIASRLA 135


>gi|147775323|emb|CAN65834.1| hypothetical protein VITISV_040003 [Vitis vinifera]
          Length = 119

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 41/71 (57%), Positives = 48/71 (67%), Gaps = 6/71 (8%)

Query: 49  DYIDSESKPENRSESDLEAKSSRS----SRFSPGCFTEEKAKQLRLMTKDTASFHDAMYH 104
           +YI  ES    +S    E K S +    SRF+ GCFTEEKAKQLR  T ++ASFHD MYH
Sbjct: 48  NYIVGESA--EKSAGAXEGKDSETARSRSRFTLGCFTEEKAKQLRRKTMESASFHDIMYH 105

Query: 105 SAIASRLASDF 115
           S+IASRLASD 
Sbjct: 106 SSIASRLASDV 116


>gi|357134436|ref|XP_003568823.1| PREDICTED: uncharacterized protein LOC100844466 [Brachypodium
           distachyon]
          Length = 125

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 50/89 (56%), Gaps = 5/89 (5%)

Query: 26  DGKRRKNA--SSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEE 83
            G+ R  A     WW+PL GWS  ADYI + +        + E  + R   F  G  TE 
Sbjct: 37  SGRARAGAPEGGKWWAPLVGWSGRADYIKAAAPAALAVVEEEEEGAVRP--FV-GGLTEA 93

Query: 84  KAKQLRLMTKDTASFHDAMYHSAIASRLA 112
           KA+QLR    +T SFHDAMYHSAIASRLA
Sbjct: 94  KARQLRARMVETESFHDAMYHSAIASRLA 122


>gi|413944670|gb|AFW77319.1| hypothetical protein ZEAMMB73_320155 [Zea mays]
          Length = 117

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 48/78 (61%), Gaps = 6/78 (7%)

Query: 35  SNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKD 94
           + WW+PL GWS +ADY+++ +     ++ +         R   G  TEEKA++LR     
Sbjct: 43  AKWWAPLLGWSGKADYLEAPAPAPAGAQDEA------PRRQFVGVMTEEKARELRARMAQ 96

Query: 95  TASFHDAMYHSAIASRLA 112
           T SFHDAMYHSAIASRLA
Sbjct: 97  TESFHDAMYHSAIASRLA 114


>gi|383125662|gb|AFG43394.1| Pinus taeda anonymous locus 2_3379_02 genomic sequence
 gi|383125666|gb|AFG43396.1| Pinus taeda anonymous locus 2_3379_02 genomic sequence
          Length = 122

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 55/94 (58%), Gaps = 12/94 (12%)

Query: 31  KNASSNWWSPLFGWSSEAD---YIDSESK-PENRS-------ESDLEAKSSRSSRFSPGC 79
           K ++  WW PLFGWS +AD   + D   K P+ R+       E ++    ++ S+F  G 
Sbjct: 17  KRSTMRWWLPLFGWS-QADREPWADCGIKEPKARAMEKISSDEKEMGEAPAKRSKFEMGR 75

Query: 80  FTEEKAKQLRLMTKDTASFHDAMYHSAIASRLAS 113
            T EKAK LR   +DT  FHD MYHSAIASRLAS
Sbjct: 76  LTPEKAKLLRKNLRDTTMFHDIMYHSAIASRLAS 109


>gi|326523453|dbj|BAJ92897.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 113

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/77 (49%), Positives = 46/77 (59%), Gaps = 10/77 (12%)

Query: 36  NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDT 95
            WW+PL GWS  ADY+++ + P    E +         R   G  TEEKA+QLR    + 
Sbjct: 44  KWWAPLVGWSGRADYMEA-AAPTPVVEEE--------GRTFVG-LTEEKARQLRARMSEM 93

Query: 96  ASFHDAMYHSAIASRLA 112
            SFHDAMYHSAIASRLA
Sbjct: 94  DSFHDAMYHSAIASRLA 110


>gi|326496003|dbj|BAJ90623.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 113

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 44/77 (57%), Gaps = 10/77 (12%)

Query: 36  NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDT 95
            WW+PL GWS  ADY+++        E   E K+           TEEKA+QLR    + 
Sbjct: 44  KWWAPLVGWSGRADYMEAAVPTPVVEE---EGKTFMG-------LTEEKARQLRARMSEM 93

Query: 96  ASFHDAMYHSAIASRLA 112
            SFHDAMYHSAIASRLA
Sbjct: 94  DSFHDAMYHSAIASRLA 110


>gi|383125664|gb|AFG43395.1| Pinus taeda anonymous locus 2_3379_02 genomic sequence
          Length = 122

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 55/94 (58%), Gaps = 12/94 (12%)

Query: 31  KNASSNWWSPLFGWSSEAD---YIDSESK-PENRS-------ESDLEAKSSRSSRFSPGC 79
           K ++  WW PLFGWS +AD   + +   K P+ R+       E ++    ++ S+F  G 
Sbjct: 17  KRSTMRWWLPLFGWS-QADREPWANCGIKEPKARAMEKISSDEKEMGEAPTKRSKFEMGR 75

Query: 80  FTEEKAKQLRLMTKDTASFHDAMYHSAIASRLAS 113
            T EKAK LR   +DT  FHD MYHSAIASRLAS
Sbjct: 76  LTPEKAKLLRKNLRDTTMFHDIMYHSAIASRLAS 109


>gi|116779241|gb|ABK21196.1| unknown [Picea sitchensis]
          Length = 182

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 50/88 (56%), Gaps = 11/88 (12%)

Query: 37  WWSPLFGWSSE-----ADYIDSESKP---ENRSESD---LEAKSSRSSRFSPGCFTEEKA 85
           WW  LFGWS       AD    ESK    + RS+ D   +    ++ S+F  G  T EKA
Sbjct: 82  WWLHLFGWSQTDQERWADGGIKESKTTAMDKRSDDDEKEMGEAPAKRSKFVRGLLTTEKA 141

Query: 86  KQLRLMTKDTASFHDAMYHSAIASRLAS 113
           K LR   +DT+ FHD MYHSAIASRLAS
Sbjct: 142 KLLRKNLRDTSMFHDIMYHSAIASRLAS 169


>gi|116782515|gb|ABK22536.1| unknown [Picea sitchensis]
          Length = 103

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 51/89 (57%), Gaps = 11/89 (12%)

Query: 36  NWWSPLFGWSSE-----ADYIDSESKP---ENRSESD---LEAKSSRSSRFSPGCFTEEK 84
           +WW  LFGWS       AD    ESK    + RS+ D   +    ++ S+F  G  T EK
Sbjct: 2   SWWLHLFGWSQTDEERWADGGIKESKTTAMDKRSDDDEKEMGEAPAKRSKFVRGLLTAEK 61

Query: 85  AKQLRLMTKDTASFHDAMYHSAIASRLAS 113
           AK LR   +DT+ FHD MYHSAIASRLAS
Sbjct: 62  AKLLRKNLRDTSMFHDIMYHSAIASRLAS 90


>gi|125551125|gb|EAY96834.1| hypothetical protein OsI_18756 [Oryza sativa Indica Group]
          Length = 68

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 44/86 (51%), Gaps = 24/86 (27%)

Query: 27  GKRRKNASSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAK 86
           G+     S  WW+PL GWS + DYI++ +                          +EKA+
Sbjct: 4   GRAVAGESGKWWAPLLGWSGKVDYIEAPTP------------------------AKEKAR 39

Query: 87  QLRLMTKDTASFHDAMYHSAIASRLA 112
           +LR    +T SFHDAMYHSAIASRLA
Sbjct: 40  ELRARMAETESFHDAMYHSAIASRLA 65


>gi|125569224|gb|EAZ10739.1| hypothetical protein OsJ_00576 [Oryza sativa Japonica Group]
          Length = 423

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/71 (50%), Positives = 42/71 (59%), Gaps = 10/71 (14%)

Query: 49  DYIDSESKPENRSES------DLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAM 102
           DYID++  P  R E+        E +     RF  G  TEEKA+QLR    +T SFHD M
Sbjct: 348 DYIDAQ--PAAREEARPNPMAAAEQRGGGGKRF--GVLTEEKARQLRARMMETESFHDCM 403

Query: 103 YHSAIASRLAS 113
           YHSAIASRLAS
Sbjct: 404 YHSAIASRLAS 414


>gi|222630467|gb|EEE62599.1| hypothetical protein OsJ_17402 [Oryza sativa Japonica Group]
          Length = 70

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 24/79 (30%)

Query: 34  SSNWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTK 93
           S  WW+ L GWS +ADYI++ +                          +EKA++LR    
Sbjct: 13  SGKWWALLLGWSGKADYIEAPTP------------------------AKEKARELRARMA 48

Query: 94  DTASFHDAMYHSAIASRLA 112
           +T SFHDAMYHSAIASRLA
Sbjct: 49  ETESFHDAMYHSAIASRLA 67


>gi|218187605|gb|EEC70032.1| hypothetical protein OsI_00607 [Oryza sativa Indica Group]
          Length = 59

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 32/48 (66%), Gaps = 2/48 (4%)

Query: 66  EAKSSRSSRFSPGCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLAS 113
           E +     RF  G  TEEKA+QLR    +T SFHD MYHSAIASRLAS
Sbjct: 5   EQRGGGGKRF--GVLTEEKARQLRARMMETESFHDCMYHSAIASRLAS 50


>gi|148906154|gb|ABR16234.1| unknown [Picea sitchensis]
          Length = 167

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 16/92 (17%)

Query: 37  WWSPLFGWSSEAD-----------YIDSESKPENRSESDLEAKSS-----RSSRFSPGCF 80
           WWSPLF    E +            + +E++ E R+ ++     +     R++R S   F
Sbjct: 65  WWSPLFDIIREPEPSSFTASSSLSSMATEAEAETRNVANSSPNPNPNPSPRAARRSGVGF 124

Query: 81  TEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
           T EKA+ LR   + T SFHD MYHSAIASRLA
Sbjct: 125 TAEKARLLRKELRATESFHDIMYHSAIASRLA 156


>gi|302790121|ref|XP_002976828.1| hypothetical protein SELMODRAFT_19119 [Selaginella moellendorffii]
 gi|300155306|gb|EFJ21938.1| hypothetical protein SELMODRAFT_19119 [Selaginella moellendorffii]
          Length = 72

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 5/77 (6%)

Query: 37  WWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTEEKAKQLRLMTKDTA 96
           WWSPLF +S E   +D+ S  E   E       SR     P    EEK K LR   +++ 
Sbjct: 1   WWSPLFDFSREVYDLDA-STDEIGDEIQAGKAPSREVPLEP----EEKFKLLRSQLRESE 55

Query: 97  SFHDAMYHSAIASRLAS 113
           S+HD MYHSAIASRLA+
Sbjct: 56  SWHDTMYHSAIASRLAT 72


>gi|349499565|emb|CCD17880.1| hypothetical protein [Micrasterias denticulata]
          Length = 154

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 25/35 (71%)

Query: 78  GCFTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
           G FT EK K LR  T+++ S HD MYHS +A+RLA
Sbjct: 120 GKFTPEKVKLLRQKTRESESHHDLMYHSGLAARLA 154


>gi|168040270|ref|XP_001772618.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676173|gb|EDQ62660.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 257

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 26/34 (76%)

Query: 80  FTEEKAKQLRLMTKDTASFHDAMYHSAIASRLAS 113
           FT EKAK+LR   + T +FHD  YHSAIASRLA+
Sbjct: 127 FTAEKAKRLRKENRATQTFHDQWYHSAIASRLAT 160


>gi|168018039|ref|XP_001761554.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687238|gb|EDQ73622.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 531

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 22/33 (66%), Positives = 25/33 (75%)

Query: 80  FTEEKAKQLRLMTKDTASFHDAMYHSAIASRLA 112
           FT EKAK+LR   + T +FHD  YHSAIASRLA
Sbjct: 124 FTPEKAKRLRKENRATQTFHDQWYHSAIASRLA 156


>gi|409042150|gb|EKM51634.1| hypothetical protein PHACADRAFT_199137 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 320

 Score = 38.5 bits (88), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 12/101 (11%)

Query: 24  NPDGKRRKNASS-NWWSPLFGWSSEADYIDSESKPENRSESDLEAKSSRSSRFSPGCFTE 82
           N DG R    S+ + WSP      +  Y+ +E       +S +E K++ SSRF PG F+ 
Sbjct: 4   NHDGPRIATTSAAHAWSPNVAGVHQDPYVQTE-------DSAIETKTAVSSRFWPGGFSG 56

Query: 83  EKAKQLRLMTKDTASFHDAMYHSAIASRLASDFK--IRSDR 121
             ++ L L++ D  SF+  ++ + I +  ++ F   IRS R
Sbjct: 57  PASEDLVLVSCDRISFY--VHSTVILAASSNSFADLIRSTR 95


>gi|424852651|ref|ZP_18277048.1| transposase [Rhodococcus opacus PD630]
 gi|356667316|gb|EHI47387.1| transposase [Rhodococcus opacus PD630]
          Length = 138

 Score = 35.0 bits (79), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 8/72 (11%)

Query: 15  VYATWSGHRNPDGKRRKNASSNWWSPLFGWSSE--------ADYIDSESKPENRSESDLE 66
           V A +  H  PD  R+    +++WSP +  +S         A+YI ++ +PE   ESD  
Sbjct: 60  VSARYLRHEFPDHIRKYLWGNHFWSPSYFAASAGGAPLAIIAEYITNQKRPEPTKESDPA 119

Query: 67  AKSSRSSRFSPG 78
            +S R ++  PG
Sbjct: 120 TRSRRQNQLPPG 131


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.312    0.124    0.369 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,897,544,825
Number of Sequences: 23463169
Number of extensions: 67276558
Number of successful extensions: 163057
Number of sequences better than 100.0: 67
Number of HSP's better than 100.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 162930
Number of HSP's gapped (non-prelim): 73
length of query: 128
length of database: 8,064,228,071
effective HSP length: 94
effective length of query: 34
effective length of database: 10,153,657,481
effective search space: 345224354354
effective search space used: 345224354354
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 71 (32.0 bits)