BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 031892
         (151 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224098489|ref|XP_002311193.1| predicted protein [Populus trichocarpa]
 gi|222851013|gb|EEE88560.1| predicted protein [Populus trichocarpa]
          Length = 150

 Score =  150 bits (380), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/158 (60%), Positives = 113/158 (71%), Gaps = 15/158 (9%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATT--GGFGSIFSPPTAV-QGRGSSHSGSI 57
           MEN RQ  ASS      FDHLFGPKDSSSS++   G FGSIF PP+ V  GR S  +G+ 
Sbjct: 1   MENNRQVGASS------FDHLFGPKDSSSSSSASSGIFGSIFPPPSKVPAGRDSGTTGNH 54

Query: 58  IGS---LRKQGESSKG-GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKA 113
           +G+   +     + K  GES+  + K +SS+YQ+E+ EPCYFSSSIYYGGQENYSPRTK 
Sbjct: 55  VGNETYVNPDNATRKAKGESSGISGKGQSSVYQNETPEPCYFSSSIYYGGQENYSPRTK- 113

Query: 114 TTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
            +ES  HV KKD  +DDPNGN+ NSASRGNWWQGSLYY
Sbjct: 114 NSES-QHVFKKDYGKDDPNGNDPNSASRGNWWQGSLYY 150


>gi|255566540|ref|XP_002524255.1| conserved hypothetical protein [Ricinus communis]
 gi|223536532|gb|EEF38179.1| conserved hypothetical protein [Ricinus communis]
          Length = 149

 Score =  143 bits (360), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/159 (60%), Positives = 109/159 (68%), Gaps = 18/159 (11%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGG--FGSIFSPPTAVQGR------GSS 52
           MENKRQ  ASS       DHLFGPKDSSSS+++    FGSIFSPP+   GR      G+ 
Sbjct: 1   MENKRQTGASS------LDHLFGPKDSSSSSSSSSSVFGSIFSPPSMGMGRDSAGIMGNQ 54

Query: 53  HSGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTK 112
              S  G      +S+KG  S I   KD SS YQ+E+AEPCYFSSSIYYGGQENYSPRT+
Sbjct: 55  AGNSKYGIPDNVTQSNKGSNSGIM-GKDSSSFYQNETAEPCYFSSSIYYGGQENYSPRTR 113

Query: 113 ATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
               S  HV+KKD  +DDPNGN+SNSASRGNWWQGSLYY
Sbjct: 114 ---NSESHVLKKDDKKDDPNGNSSNSASRGNWWQGSLYY 149


>gi|296086145|emb|CBI31586.3| unnamed protein product [Vitis vinifera]
          Length = 160

 Score =  142 bits (358), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 96/166 (57%), Positives = 112/166 (67%), Gaps = 21/166 (12%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGS--IFSPPTAVQGRGSSHSGSII 58
           MEN +Q  +SSS   I  D LFGPKDSSSS++        IF PP++V GR SSH+G+I 
Sbjct: 1   MENTKQVGSSSS---ITSD-LFGPKDSSSSSSLSSGLFGSIFQPPSSVPGRNSSHTGNI- 55

Query: 59  GSLRKQGESSK------GGESNIFN------SKDKSSIYQSESAEPCYFSSSIYYGGQEN 106
           GS RKQG  ++      G   NI        SKD+ SIY +E+ EPCYFSSSIYYGGQE 
Sbjct: 56  GSFRKQGSGNQVGNGKHGTPDNITQICSGESSKDRGSIYHNETVEPCYFSSSIYYGGQEV 115

Query: 107 YSPRTKATTESHHHV-VKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           YSP  +  TESH  V +KKDG EDD NG+NSNSASRGNWWQGSLYY
Sbjct: 116 YSPNAR-NTESHPIVTMKKDGGEDDLNGSNSNSASRGNWWQGSLYY 160


>gi|224112647|ref|XP_002316250.1| predicted protein [Populus trichocarpa]
 gi|222865290|gb|EEF02421.1| predicted protein [Populus trichocarpa]
          Length = 165

 Score =  127 bits (319), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 83/156 (53%), Positives = 100/156 (64%), Gaps = 18/156 (11%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATT--GGFGSIFSPPTAV-QGRGS----SH 53
           MENKRQ  ASS      FDHLFGPKDSSSS+    G F SIF  P+ V  GR S    +H
Sbjct: 1   MENKRQVVASS------FDHLFGPKDSSSSSPASSGIFESIFPHPSKVPAGRDSGVMENH 54

Query: 54  SGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKA 113
            G    + R        GE +    K KS ++Q+E+ EPCY SSSIYYGGQENYSPRTK 
Sbjct: 55  GGRGKYANRDNVAQKAKGEGS---GKGKSVVFQNETPEPCYLSSSIYYGGQENYSPRTKN 111

Query: 114 TTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSL 149
           +   + HV KKD ++++PNGN+ NSASRGNWWQG +
Sbjct: 112 S--EYQHVFKKDDEQENPNGNDPNSASRGNWWQGIM 145


>gi|255558924|ref|XP_002520485.1| conserved hypothetical protein [Ricinus communis]
 gi|223540327|gb|EEF41898.1| conserved hypothetical protein [Ricinus communis]
          Length = 160

 Score =  126 bits (316), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 71/144 (49%), Positives = 92/144 (63%), Gaps = 14/144 (9%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKG---------- 70
           LFG K+S  S++TG F SIF PP+ V GR SS S  +IGS ++Q   ++G          
Sbjct: 18  LFGTKESPPSSSTGIFASIFPPPSKVLGRKSSGS-EVIGSWQRQSSENQGWNTKRGAPAM 76

Query: 71  -GESNIFN--SKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGD 127
             E+  +N  +KD++S    E  EPC+ SSS+YYGGQ+NYS    +     + + KKDG 
Sbjct: 77  SSEAASYNMPNKDRNSTLMEERGEPCHLSSSLYYGGQDNYSQPPSSRNSGSYPIFKKDGV 136

Query: 128 EDDPNGNNSNSASRGNWWQGSLYY 151
           +DDPNGNNSN ASRGNWWQGSLYY
Sbjct: 137 DDDPNGNNSNGASRGNWWQGSLYY 160


>gi|225434778|ref|XP_002281905.1| PREDICTED: uncharacterized protein LOC100263768 [Vitis vinifera]
 gi|297745986|emb|CBI16042.3| unnamed protein product [Vitis vinifera]
          Length = 154

 Score =  123 bits (308), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 77/165 (46%), Positives = 96/165 (58%), Gaps = 25/165 (15%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME +RQ  +SSS +      LFG K+S  S ++G F SIF PP+   GR  ++S      
Sbjct: 1   MEGRRQVGSSSSFT----SELFGTKESPPSNSSGIFASIFPPPSTAVGRNPANS-----E 51

Query: 61  LR----------KQGESSKG----GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQEN 106
           +R          KQG    G    G S+I  +KDK+ I+Q E  EPC+ SSS+YYGGQ+ 
Sbjct: 52  MRGLSGNQTWDTKQGTPDMGKSGVGASSIIPNKDKNQIFQEERVEPCHLSSSLYYGGQDI 111

Query: 107 YSPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           YS  +   T   + V KK  DEDDPNGNNS  ASRGNWWQGSLYY
Sbjct: 112 YSNSSSTQTSGSYPVFKK--DEDDPNGNNSQGASRGNWWQGSLYY 154


>gi|224106371|ref|XP_002314144.1| predicted protein [Populus trichocarpa]
 gi|222850552|gb|EEE88099.1| predicted protein [Populus trichocarpa]
          Length = 162

 Score =  119 bits (297), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 85/143 (59%), Gaps = 13/143 (9%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKG---------- 70
           LFG  +S+  ++ G F S+F PP+ V GR SS S  + GS +KQ   ++           
Sbjct: 21  LFGTTESAPVSSAGIFASMFPPPSTVVGRKSSGS-EVTGSWQKQSYGNQAWYPKQGSPAN 79

Query: 71  --GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDE 128
               S     KD++S+ Q E  EPC+ SSS+YYGGQENYS          + + KKDG E
Sbjct: 80  SQAASYSMPDKDRNSVIQEERVEPCHLSSSLYYGGQENYSQSPSTQMAGSYPIFKKDGGE 139

Query: 129 DDPNGNNSNSASRGNWWQGSLYY 151
           DDPNG+N +SASRGNWWQGSLYY
Sbjct: 140 DDPNGSNPHSASRGNWWQGSLYY 162


>gi|356569336|ref|XP_003552858.1| PREDICTED: uncharacterized protein LOC100797772 [Glycine max]
          Length = 161

 Score =  114 bits (284), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 110/157 (70%), Gaps = 13/157 (8%)

Query: 1   MENKRQESASSSSSP--INFDHLFGPKDSSSSATTGG--FGSIFSPPTAVQGRGSSHSGS 56
           MEN +++    SSS    NFDHLFGPKD S+++++    FGSIF PP+ V GR S+   +
Sbjct: 1   MENNKKKVGGGSSSSSFTNFDHLFGPKDPSTTSSSSTSIFGSIFPPPSTVGGRDSTKQDT 60

Query: 57  ------IIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR 110
                   G+   +GESS  G SN  N+   ++ +Q+E+ EP Y+SSSI+YGGQENYSPR
Sbjct: 61  GCKNYGAPGNYSYKGESSASGVSNN-NNTTSTNYHQNETVEPSYYSSSIHYGGQENYSPR 119

Query: 111 TKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
           T+ TTES HH+ KKD D+DDPNGN+SNSASRGNWWQG
Sbjct: 120 TR-TTES-HHIFKKDKDDDDPNGNDSNSASRGNWWQG 154


>gi|356537962|ref|XP_003537475.1| PREDICTED: uncharacterized protein LOC100795477 [Glycine max]
          Length = 203

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 85/160 (53%), Positives = 107/160 (66%), Gaps = 14/160 (8%)

Query: 1   MENKRQESAS---SSSSPINFDHLFGPKDSSSSATTGG--FGSIFSPP-TAVQGRGSSHS 54
           MEN +++      SSSS  NFDHLFGPKD S+++++    FGSIF PP + V GR S+  
Sbjct: 1   MENNKKKVVGGGPSSSSFTNFDHLFGPKDPSTTSSSSTSIFGSIFPPPPSTVGGRDSTKQ 60

Query: 55  GSIIGSLRKQGESSKGGESN----IFNSKDKSSI---YQSESAEPCYFSSSIYYGGQENY 107
                +    G  S  GES+    + NSK+ +S    +Q+E+ EP Y+SSSIYYGGQENY
Sbjct: 61  DMGCKNYGAPGNYSYKGESSASGVVSNSKNTTSTNYHHQNETVEPSYYSSSIYYGGQENY 120

Query: 108 SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
           SPRT+ TTESHH   K   D+DDPNGN+SNSASRGNWWQG
Sbjct: 121 SPRTR-TTESHHSFKKDKKDDDDPNGNDSNSASRGNWWQG 159


>gi|357505741|ref|XP_003623159.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
 gi|355498174|gb|AES79377.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
 gi|388492720|gb|AFK34426.1| unknown [Medicago truncatula]
          Length = 156

 Score =  102 bits (253), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 71/162 (43%), Positives = 90/162 (55%), Gaps = 17/162 (10%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME ++Q  +SSS +      LFG  D   S+ +G F SIFSP + V GR S  S ++ G 
Sbjct: 1   MEGRKQFGSSSSETSSFTSELFGSNDLHKSSASGIFDSIFSPSSKVFGRESLRS-AMNGK 59

Query: 61  LRKQGESSK-----------GGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSP 109
              +G +SK            GE+    +KD S IYQ +  +PC+ SSSIYYGGQ+ YS 
Sbjct: 60  TATEGSNSKIDTQEYISKGNDGETQNKTNKDMSYIYQEQRVQPCHLSSSIYYGGQDVYSR 119

Query: 110 RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
              A     + + KKDG EDD     S SASRGNWWQGSLYY
Sbjct: 120 PQSARDSGLNTMYKKDGGEDD-----SGSASRGNWWQGSLYY 156


>gi|357492467|ref|XP_003616522.1| hypothetical protein MTR_5g081360 [Medicago truncatula]
 gi|355517857|gb|AES99480.1| hypothetical protein MTR_5g081360 [Medicago truncatula]
          Length = 145

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 77/147 (52%), Positives = 99/147 (67%), Gaps = 7/147 (4%)

Query: 3   NKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPT--AVQGRGSSHSGSIIGS 60
           NK+Q  +SSSSS + FD LFGPKD SS++++  FGSIF PP   +V+GRGS        +
Sbjct: 4   NKKQAWSSSSSSTVKFDQLFGPKDPSSASSSSLFGSIFPPPPTPSVEGRGSRTQEVGSKN 63

Query: 61  LRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHH 120
           L   G +   G   I  +K+  + YQ+E+ EP Y+SSSIYYGGQENYSPR + T    HH
Sbjct: 64  LGATGTTPSDG---ISYNKNTCTNYQNETMEPSYYSSSIYYGGQENYSPRNRTTEP--HH 118

Query: 121 VVKKDGDEDDPNGNNSNSASRGNWWQG 147
           V KKD +  D NGNNS+SASRG+WW+G
Sbjct: 119 VFKKDKNHGDHNGNNSSSASRGDWWEG 145


>gi|297816874|ref|XP_002876320.1| hypothetical protein ARALYDRAFT_485995 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322158|gb|EFH52579.1| hypothetical protein ARALYDRAFT_485995 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 139

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 71/152 (46%), Positives = 93/152 (61%), Gaps = 17/152 (11%)

Query: 2   ENKRQESASSSSSPINFDHLFGPK--DSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIG 59
           +NK+++  S+SS    FDH+FGP+   SSSS+ TG F SIF PP+A Q        S  G
Sbjct: 3   KNKKKKIVSASS----FDHIFGPRVSSSSSSSATGLFKSIFPPPSADQLGRQVDFASQGG 58

Query: 60  SLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHH 119
            ++ Q  + +G  SN    K+K S Y  E+  PC+ SSS+YYGGQE YS  T  T ++ +
Sbjct: 59  HVKYQSPNERGERSN---KKEKKSYYNEETEPPCHLSSSLYYGGQEKYSSTTTITHDTTY 115

Query: 120 HVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
              KKDG+E D     S SASRGNWW+GSLYY
Sbjct: 116 ---KKDGEEGD-----SKSASRGNWWEGSLYY 139


>gi|449460363|ref|XP_004147915.1| PREDICTED: uncharacterized protein LOC101215701 [Cucumis sativus]
          Length = 158

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/164 (43%), Positives = 94/164 (57%), Gaps = 19/164 (11%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGS--SHSGSI- 57
           ME K+     SSSS +  D LFG  ++S S+TTG FGSIF+P + V GR S  SH+    
Sbjct: 1   MEGKKHVGLGSSSSSLTTD-LFGSNETSYSSTTGIFGSIFAPSSKVLGRESLLSHTKERE 59

Query: 58  IGSLRKQGESSKGGESNIFN----------SKDKSSIYQSESAEPCYFSSSIYYGGQENY 107
             S+ +    + G + +  N          +KD SSIYQ + A+PC+ SSSIYYGGQ+ Y
Sbjct: 60  RNSVNEPWNPNAGAQDDNANHTQKESQETKNKDMSSIYQDQRAQPCHLSSSIYYGGQDVY 119

Query: 108 SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           +    +     +   KK+G EDD     S SASRGNWWQGSLYY
Sbjct: 120 THPQNSYNSGANSAYKKEGGEDD-----SGSASRGNWWQGSLYY 158


>gi|79315178|ref|NP_001030865.1| uncharacterized protein [Arabidopsis thaliana]
 gi|98961793|gb|ABF59226.1| unknown protein [Arabidopsis thaliana]
 gi|332645894|gb|AEE79415.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 146

 Score = 90.9 bits (224), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/136 (47%), Positives = 81/136 (59%), Gaps = 14/136 (10%)

Query: 18  FDHLFGPK--DSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNI 75
           FDH+FGP+   SSSS+ TG F SIF PP+A Q        S  G ++ Q  ++KG  SN 
Sbjct: 23  FDHIFGPRVSSSSSSSATGLFKSIFPPPSADQLGRQVDFASQGGHVKYQSPNAKGERSN- 81

Query: 76  FNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNN 135
              K+K S Y  E+  PC+ SSS+YYGGQE       +TT + H   KKDG+E D     
Sbjct: 82  --KKEKKSYYNEETEPPCHLSSSLYYGGQET----YSSTTTTTHDTYKKDGEEGD----- 130

Query: 136 SNSASRGNWWQGSLYY 151
           S  ASRGNWW+GSLYY
Sbjct: 131 SKRASRGNWWEGSLYY 146


>gi|388494636|gb|AFK35384.1| unknown [Lotus japonicus]
          Length = 153

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 68/162 (41%), Positives = 86/162 (53%), Gaps = 20/162 (12%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPT-AVQGRGSSHSGSIIG 59
           M+ K+Q+  SSS +      LFG K+S  S ++G FGSIFSPP+  V GR S  S   + 
Sbjct: 1   MDEKKQKGTSSSFTA----ELFGSKESHPSPSSGIFGSIFSPPSPKVLGRESLRSELSVK 56

Query: 60  SLRK--------QGESSKG--GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSP 109
           +  +        Q + SKG  GE+    +KD S IYQ +   PC  SSSIYYGGQ+ Y  
Sbjct: 57  TANETWSSKIGIQDDVSKGNYGEAQHTENKDMSFIYQHQRIPPCPLSSSIYYGGQDTYYH 116

Query: 110 RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
                    + + K D   DD     S  ASRGNWWQGSLYY
Sbjct: 117 SQSTQNAGSNTLHKNDVGNDD-----SGMASRGNWWQGSLYY 153


>gi|449455074|ref|XP_004145278.1| PREDICTED: uncharacterized protein LOC101214982 [Cucumis sativus]
 gi|449473730|ref|XP_004153966.1| PREDICTED: uncharacterized protein LOC101213402 [Cucumis sativus]
 gi|449510713|ref|XP_004163741.1| PREDICTED: uncharacterized protein LOC101228934 [Cucumis sativus]
          Length = 139

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 72/135 (53%), Gaps = 16/135 (11%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSKD 80
           LF  K++  S+ +G F +IF  P    GR SS S   +    KQ   S+   +   NS  
Sbjct: 17  LFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWL----KQTNGSQPHHTRQGNSGG 72

Query: 81  KSSIYQSESAEPCYFSSSIYYGGQENYSPRTKA----TTESHHHVVKKDGDEDDPNGNNS 136
                   S EPC+ SSS+YYGGQ+ YS  T A         H + K  G +DDPNGNNS
Sbjct: 73  --------SLEPCHLSSSLYYGGQDGYSQATSAGPSPLPPPPHTMKKSGGQQDDPNGNNS 124

Query: 137 NSASRGNWWQGSLYY 151
             ASRGNWWQGSLYY
Sbjct: 125 QPASRGNWWQGSLYY 139


>gi|357481999|ref|XP_003611285.1| hypothetical protein MTR_5g012280 [Medicago truncatula]
 gi|355512620|gb|AES94243.1| hypothetical protein MTR_5g012280 [Medicago truncatula]
          Length = 144

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/153 (44%), Positives = 92/153 (60%), Gaps = 11/153 (7%)

Query: 1   MENKRQESASSSSS--PINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSII 58
           M+N + ++ SSS+S   +NF+ LF  KDS++ A+   F     P  +V+G+GS      +
Sbjct: 1   MDNNKNQTCSSSNSSSTVNFNQLFSSKDSTT-ASFSSFFDSLFPQPSVEGKGSRTEE--V 57

Query: 59  GSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESH 118
           GS  K  E+     S+I N    S+ YQ+E+ EP YFSSSI+YGG E+Y     +T E  
Sbjct: 58  GS--KSLEAPGTPNSDISNKNTNSTDYQNETVEPTYFSSSIHYGGCEDYFSE-GSTIEPR 114

Query: 119 HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           H     +GD    NGNNSNSASRG+WWQGSLYY
Sbjct: 115 HVKYTNNGD---LNGNNSNSASRGDWWQGSLYY 144


>gi|42571137|ref|NP_973642.1| uncharacterized protein [Arabidopsis thaliana]
 gi|28207122|gb|AAO37201.1| hypothetical protein [Arabidopsis thaliana]
 gi|330254647|gb|AEC09741.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 142

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 82/143 (57%), Gaps = 27/143 (18%)

Query: 18  FDHLFGPK--DSSSSATTGGFGSIFSPPTAV-QGRGSSHSGSI------IGSLRKQGESS 68
            DH+FGP+   S SS+TTG F SIF PP+AV QG  +S +G+         +  ++GE S
Sbjct: 18  LDHIFGPRVSHSYSSSTTGLFKSIFPPPSAVTQGNLTSRNGAAKYQPTNFETPNERGERS 77

Query: 69  KGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDE 128
           K        +K++ S    E+  PC  SSSIYYGGQ+NYS  T     ++    KKDG+E
Sbjct: 78  K--------NKERKSYQSEETQPPCNLSSSIYYGGQDNYSSST-----TNPDAYKKDGEE 124

Query: 129 DDPNGNNSNSASRGNWWQGSLYY 151
            D     S SASRGNWW+GS  Y
Sbjct: 125 GD-----SESASRGNWWEGSFNY 142


>gi|255586976|ref|XP_002534085.1| conserved hypothetical protein [Ricinus communis]
 gi|223525878|gb|EEF28301.1| conserved hypothetical protein [Ricinus communis]
          Length = 146

 Score = 89.4 bits (220), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 75/157 (47%), Positives = 94/157 (59%), Gaps = 17/157 (10%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME K+Q+  SSSSS    + LFG KDSSSS++        +PP  V GRGS  S  ++G 
Sbjct: 1   MEGKKQQLGSSSSSSFTSE-LFGSKDSSSSSSGIFGSIF-APPPKVLGRGSLRS-EMVG- 56

Query: 61  LRKQ----GESSKGGESNIFN--SKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKAT 114
            +KQ     E+SK  ES   N  S+D  S YQ +  +PC+ SSSIYYGGQ+ Y    + +
Sbjct: 57  -KKQDSANAETSKSNESEGQNIPSRDTGSFYQDQKVQPCHLSSSIYYGGQDIYH-HPQNS 114

Query: 115 TESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
             S H + KK+G EDD     S SASRGNWWQGSLYY
Sbjct: 115 QSSSHSMFKKEGTEDD-----SGSASRGNWWQGSLYY 146


>gi|15241665|ref|NP_195822.1| uncharacterized protein [Arabidopsis thaliana]
 gi|7340676|emb|CAB82975.1| putative protein [Arabidopsis thaliana]
 gi|16648748|gb|AAL25566.1| AT5g02020/T7H20_70 [Arabidopsis thaliana]
 gi|20334906|gb|AAM16209.1| AT5g02020/T7H20_70 [Arabidopsis thaliana]
 gi|332003037|gb|AED90420.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 149

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/138 (42%), Positives = 79/138 (57%), Gaps = 16/138 (11%)

Query: 20  HLFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNS 78
            LFG +++ SS ++ G  GSIF PP+ V GR S    ++ G    +  S  GG  N+  +
Sbjct: 22  ELFGSRENPSSPSSSGILGSIFPPPSKVLGRESVRQETVTGGCWNEKTSKTGG--NVDRN 79

Query: 79  KDK-----SSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNG 133
           +++     S   Q +  +PC+ SSSIYYGG + Y     +T+ S +   KKDG EDD   
Sbjct: 80  REQQENHGSGYQQDQRVQPCHLSSSIYYGGPDVYFQPQNSTSNSTN---KKDGGEDD--- 133

Query: 134 NNSNSASRGNWWQGSLYY 151
             S SASRGNWWQGSLYY
Sbjct: 134 --SGSASRGNWWQGSLYY 149


>gi|209967469|gb|ACJ02359.1| hypothetical protein [Populus tremula x Populus alba]
          Length = 157

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 79/146 (54%), Gaps = 28/146 (19%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPT-AVQGRGS------------------SHSGSIIGSL 61
           LFG K+ SSS+T G FGSIF+P +  V GR S                  + SG+    L
Sbjct: 18  LFGSKEYSSSST-GIFGSIFAPASPKVLGRESLRFEVDEKKQDSADDAWNTKSGTPTSDL 76

Query: 62  RKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHV 121
             +      GES    +KD SSIYQ +  +PC+ SSSIYYGGQ+ Y     A T S + +
Sbjct: 77  TSKMNE---GESQSIPNKDMSSIYQEQRVQPCHLSSSIYYGGQDIYHHPQTAHTSSKNPM 133

Query: 122 VKKDGDEDDPNGNNSNSASRGNWWQG 147
            KKDG EDD     + SASRGNWWQG
Sbjct: 134 FKKDGPEDD-----TGSASRGNWWQG 154


>gi|388492094|gb|AFK34113.1| unknown [Medicago truncatula]
          Length = 172

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 85/157 (54%), Gaps = 18/157 (11%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME ++Q  +S S +      LFG  D   S+ +G F SIFSP + V GR S  S ++ G 
Sbjct: 1   MEGRKQFGSSFSETSSFTSELFGSNDLHKSSASGIFDSIFSPSSKVFGRESLRS-AMNGK 59

Query: 61  LRKQGESSK-----------GGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS- 108
              +G +SK            GE+    +KD S IYQ +  +PC+ SSSIYYGGQ+ YS 
Sbjct: 60  TATEGSNSKIDTQEYISKGNDGETQNKTNKDMSYIYQEQRVQPCHLSSSIYYGGQDVYSR 119

Query: 109 PRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWW 145
           P++   +  +    KKDG EDD     S SASRGNWW
Sbjct: 120 PQSARDSGLNTMQYKKDGGEDD-----SGSASRGNWW 151


>gi|449464468|ref|XP_004149951.1| PREDICTED: cucumisin-like [Cucumis sativus]
          Length = 866

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/83 (56%), Positives = 61/83 (73%), Gaps = 6/83 (7%)

Query: 65  GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKK 124
           G+   GG S     KD+SS+YQ+E+ EP YFSSSI+YGGQENYSPRT A ++SH +  KK
Sbjct: 764 GKKGTGGASG----KDESSMYQNETMEPSYFSSSIFYGGQENYSPRTNA-SQSHPN-FKK 817

Query: 125 DGDEDDPNGNNSNSASRGNWWQG 147
           +  ++D N +NSN ASRGNWW+ 
Sbjct: 818 EVKDNDANESNSNPASRGNWWKA 840


>gi|449516501|ref|XP_004165285.1| PREDICTED: cucumisin-like, partial [Cucumis sativus]
          Length = 795

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 61/82 (74%), Gaps = 6/82 (7%)

Query: 65  GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKK 124
           G+   GG S     KD+SS+YQ+E+ EP YFSSSI+YGGQENYSPRT A ++SH +  KK
Sbjct: 720 GKKGTGGASG----KDESSMYQNETMEPSYFSSSIFYGGQENYSPRTNA-SQSHPN-FKK 773

Query: 125 DGDEDDPNGNNSNSASRGNWWQ 146
           +  ++D N +NSN ASRGNWW+
Sbjct: 774 EVKDNDANESNSNPASRGNWWK 795


>gi|225440524|ref|XP_002275314.1| PREDICTED: uncharacterized protein LOC100252790 isoform 1 [Vitis
           vinifera]
 gi|359481721|ref|XP_003632663.1| PREDICTED: uncharacterized protein LOC100252790 isoform 2 [Vitis
           vinifera]
 gi|297740289|emb|CBI30471.3| unnamed protein product [Vitis vinifera]
          Length = 159

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 82/147 (55%), Gaps = 22/147 (14%)

Query: 21  LFGPKDSS--SSATTGGFGSIFSPPTAVQGRGSSH---------SGSII-----GSLRKQ 64
           LFG K+SS  S ++TG F SIFS  + V GR S           SG+ +     G+    
Sbjct: 19  LFGSKESSYPSPSSTGIFASIFSTSSKVLGRESLRPDLTKKKQDSGNEVWNAKPGTTENA 78

Query: 65  GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKK 124
            + S+G ES   +++D  S YQ +  +PC+ SSSIYYGGQ+ Y     + +     ++KK
Sbjct: 79  LQQSEG-ESQSISNRDTGSFYQEQRVQPCHLSSSIYYGGQDIYFHPQNSQSSGMPSMLKK 137

Query: 125 DGDEDDPNGNNSNSASRGNWWQGSLYY 151
           D  EDD     + SASRGNWWQGSLYY
Sbjct: 138 DSGEDD-----TGSASRGNWWQGSLYY 159


>gi|297823891|ref|XP_002879828.1| hypothetical protein ARALYDRAFT_321676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325667|gb|EFH56087.1| hypothetical protein ARALYDRAFT_321676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 142

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 68/160 (42%), Positives = 90/160 (56%), Gaps = 27/160 (16%)

Query: 1   MENKRQESASSSSSPINFDHLFGPK--DSSSSATTGGFGSIFSPPTAV-------QGRGS 51
           M+NK+  S SSSSS  + DH+FGP+   SSSS+TTG F SIF  P+ V       +   +
Sbjct: 1   MDNKKSVSGSSSSSSSSLDHIFGPRVSSSSSSSTTGLFQSIFPLPSTVMQANSASRNEAA 60

Query: 52  SHSGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRT 111
            +  +  G   ++GE SK        +K++ S    E+  PC  SSSIYYGGQ+NYS  T
Sbjct: 61  KYQTTNFGIPNERGERSK--------NKERKSYQSEETQPPCNLSSSIYYGGQDNYSSST 112

Query: 112 KATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
                ++    KKDG+E D     S SASRGNWW+GS  Y
Sbjct: 113 -----TNPDAYKKDGEEGD-----SESASRGNWWEGSFNY 142


>gi|209967471|gb|ACJ02360.1| hypothetical protein [Populus euphratica]
          Length = 157

 Score = 81.6 bits (200), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 62/146 (42%), Positives = 79/146 (54%), Gaps = 28/146 (19%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTA-VQGRGSSHSGSIIGSLRKQGES-----SKGG--- 71
           LFG K++S S + G FGSIF+P +  V GR    S     + +KQ  +     +K G   
Sbjct: 18  LFGSKENSPS-SMGIFGSIFAPASPKVLGR---ESLRFEVAEKKQDSADDAWNTKSGIPA 73

Query: 72  ----------ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHV 121
                     ES    +KD SSIYQ +  +PC+ SSSIYYGGQ+ Y     A T S + +
Sbjct: 74  SDLTSKMNEDESQSVPNKDMSSIYQEQRVQPCHLSSSIYYGGQDIYHHPQTAHTSSMNPM 133

Query: 122 VKKDGDEDDPNGNNSNSASRGNWWQG 147
            KKDG EDD     + SASRGNWWQG
Sbjct: 134 FKKDGPEDD-----TGSASRGNWWQG 154


>gi|224087557|ref|XP_002308189.1| predicted protein [Populus trichocarpa]
 gi|222854165|gb|EEE91712.1| predicted protein [Populus trichocarpa]
          Length = 174

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 77/160 (48%), Gaps = 39/160 (24%)

Query: 21  LFGPKDSSSSATTGGFGSIF---SPPTAVQGRGSSHSGSII------GSLRKQGESSKG- 70
           LFG K++S S + G FGSIF   SP   +  R +     ++       SLR +    K  
Sbjct: 18  LFGSKENSPS-SMGIFGSIFAPASPKVLIIARRAKFIVLVLIQVLGRESLRFEVAEKKQD 76

Query: 71  -----------------------GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENY 107
                                  GES    +KD SSIYQ +  +PC+ SSSIYYGGQ+ Y
Sbjct: 77  SADDAWNTKSGTPASDLTSKMNEGESQSVPNKDMSSIYQEQRVQPCHLSSSIYYGGQDIY 136

Query: 108 SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
                A T S + + KKDG EDD     + SASRGNWWQG
Sbjct: 137 HHPQTAHTSSINPMFKKDGPEDD-----TGSASRGNWWQG 171


>gi|297793471|ref|XP_002864620.1| hypothetical protein ARALYDRAFT_496048 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310455|gb|EFH40879.1| hypothetical protein ARALYDRAFT_496048 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 135

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/135 (42%), Positives = 74/135 (54%), Gaps = 24/135 (17%)

Query: 21  LFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSK 79
           LFG KD S  +++ G F +IF  P+   GR  S+S        K G  ++  ES   N+ 
Sbjct: 21  LFGSKDPSPPSSSSGIFSTIFPHPSKGSGRDGSNS--------KHGSQAQRRES--LNA- 69

Query: 80  DKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVK---KDGDEDDPNGNNS 136
                 Q +  EPC+ SSS+YYGGQ+ YS   ++TT   +  VK   +   EDD NG NS
Sbjct: 70  ------QEDRVEPCHLSSSLYYGGQDVYS---RSTTNQTYPAVKNERRRSGEDDANGQNS 120

Query: 137 NSASRGNWWQGSLYY 151
              SRGNWWQGSLYY
Sbjct: 121 QDVSRGNWWQGSLYY 135


>gi|357481929|ref|XP_003611250.1| hypothetical protein MTR_5g011930 [Medicago truncatula]
 gi|355512585|gb|AES94208.1| hypothetical protein MTR_5g011930 [Medicago truncatula]
          Length = 113

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 75/153 (49%), Gaps = 42/153 (27%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           MEN +++++S SS+ +NFD    PKDS           I   P +V+  GS         
Sbjct: 1   MENNKKQTSSPSSTTVNFDQPLSPKDS-----------ICPSPPSVEDIGSR-------- 41

Query: 61  LRKQGESSKG--GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESH 118
           +++ G+ S G  GE+ +F                   SSSI YGG+E+ +P     T   
Sbjct: 42  IQEVGKKSLGTPGEATLF-------------------SSSILYGGREDDTPI--GNTAEP 80

Query: 119 HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
             V KKD D  D N NN NSASRG+WW GSLYY
Sbjct: 81  CDVFKKDTDNADLNCNNPNSASRGDWWLGSLYY 113


>gi|356494879|ref|XP_003516310.1| PREDICTED: uncharacterized protein LOC100775733 [Glycine max]
          Length = 153

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 22/163 (13%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTA-VQGRGSSHSGSIIG 59
           ME  +Q+ ASSS +      LFG K+S  S+++G FGSIFSPP++ V GR S  S  +  
Sbjct: 1   MEGNKQKGASSSFT----SELFGSKESRPSSSSGIFGSIFSPPSSKVLGRESLRS-EMSS 55

Query: 60  SLRKQGESSKGGESNIFN-----------SKDKSSIYQSESAEPCYFSSSIYYGGQENYS 108
            +  +  SS+ G  + F+           +KD SSIYQ +  +PC+ SSSIYYGGQ+ YS
Sbjct: 56  KIANETWSSRIGIQDNFSKGYGSEAPNAVNKDMSSIYQDQRVQPCHLSSSIYYGGQDIYS 115

Query: 109 PRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
                  E  + + + DG EDD     S  ASRG+WWQG LYY
Sbjct: 116 NPQSTQNEGFNSMHRNDGGEDD-----SEFASRGDWWQGGLYY 153


>gi|15237814|ref|NP_200716.1| uncharacterized protein [Arabidopsis thaliana]
 gi|10177635|dbj|BAB10783.1| unnamed protein product [Arabidopsis thaliana]
 gi|17380960|gb|AAL36292.1| unknown protein [Arabidopsis thaliana]
 gi|20465393|gb|AAM20121.1| unknown protein [Arabidopsis thaliana]
 gi|110742611|dbj|BAE99218.1| hypothetical protein [Arabidopsis thaliana]
 gi|332009755|gb|AED97138.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 135

 Score = 75.1 bits (183), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 72/135 (53%), Gaps = 24/135 (17%)

Query: 21  LFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSK 79
           LFG KD S  +++ G F ++F  P+    R  S+S        K G  ++  ES   N+ 
Sbjct: 21  LFGSKDPSPPSSSSGIFSTMFPHPSKGSARDGSNS--------KHGSQAQRRES--LNA- 69

Query: 80  DKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKD---GDEDDPNGNNS 136
                 Q +  EPC+ SSS+YYGGQ+ Y+   ++TT   +  VK D     EDD NG N 
Sbjct: 70  ------QEDRVEPCHLSSSLYYGGQDVYA---RSTTNQTYPPVKNDRRRSGEDDANGQNP 120

Query: 137 NSASRGNWWQGSLYY 151
              SRGNWWQGSLYY
Sbjct: 121 QDVSRGNWWQGSLYY 135


>gi|388493012|gb|AFK34572.1| unknown [Medicago truncatula]
          Length = 161

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 95/161 (59%), Gaps = 10/161 (6%)

Query: 1   MENKRQE-----SASSSSSPINFDHLFGPKDSSSSATTGGFGSIFS--PPTAVQGRGSSH 53
           MENK  +     + SSSS+P NFDHLFGPKD S+++++          PP+    R S+ 
Sbjct: 1   MENKNNKKKVGGAGSSSSTPTNFDHLFGPKDPSTTSSSSNSIFSSIFPPPSTGGWRDSTR 60

Query: 54  SGSIIGSLRKQGE-SSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYY-GGQENY-SPR 110
                 +    G  S+KGG S   N+   ++ YQ+E+ EP  + SS  Y GGQENY SP 
Sbjct: 61  HDMGSNTYGVPGNYSNKGGNSANKNTGSDTTYYQNETVEPSNYYSSSIYYGGQENYNSPT 120

Query: 111 TKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           T  TT++ H   K   D+DD NG++S SASRGNWW+GSLYY
Sbjct: 121 TCKTTQTPHFTKKNKDDDDDSNGSDSGSASRGNWWKGSLYY 161


>gi|449439960|ref|XP_004137753.1| PREDICTED: uncharacterized protein LOC101220657 [Cucumis sativus]
 gi|449511090|ref|XP_004163859.1| PREDICTED: uncharacterized LOC101220657 [Cucumis sativus]
          Length = 145

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 72/136 (52%), Gaps = 13/136 (9%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGS--IIGSLRKQGESSKGGESN--IF 76
           LFG +D+SS ++   FG +FS      G+ S   G    +    K   +SK  E      
Sbjct: 18  LFGIRDTSSLSSNHIFGPVFSSSFKPSGQHSEVGGKSPALSYFPKTEGNSKYKECKNGST 77

Query: 77  NSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKD-GDEDDPNGNN 135
           +S++  S YQ +   PC+ SSSIYYGGQ+ Y   T+      +  +K+D G EDD  G  
Sbjct: 78  SSREMGSFYQEQRTNPCHLSSSIYYGGQDVY---TQNPATGFNSPLKRDHGGEDDSGG-- 132

Query: 136 SNSASRGNWWQGSLYY 151
              ASRGNWWQGSLYY
Sbjct: 133 ---ASRGNWWQGSLYY 145


>gi|351722410|ref|NP_001237499.1| uncharacterized protein LOC100306610 [Glycine max]
 gi|255629059|gb|ACU14874.1| unknown [Glycine max]
          Length = 145

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/159 (36%), Positives = 79/159 (49%), Gaps = 22/159 (13%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME +++  +SS +S      L G      SA +G F S+F PP+ V GR S  S     +
Sbjct: 1   MEGRKKMGSSSFTS-----ELLGSNQFQKSAASGIFESMFPPPSKVLGRESLRSEVCEKT 55

Query: 61  LRKQGES-----SKG--GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRTK 112
             ++  S     SKG  GE+     KD SSIYQ +  +PC  SSSI+YGGQ+  S P++ 
Sbjct: 56  ANERWSSKIDYISKGSDGETQSTTHKDMSSIYQEQRLQPCQLSSSIHYGGQDICSCPKST 115

Query: 113 ATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
             +  +  + KKDG EDD            N WQG  YY
Sbjct: 116 QDSGYNSLLYKKDGVEDDLG---------SNLWQGGPYY 145


>gi|356566589|ref|XP_003551513.1| PREDICTED: uncharacterized protein LOC100791771 [Glycine max]
          Length = 175

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 14/143 (9%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME +++  +SSS +      LFG      S+ +G F S+F PP  V GRGS  S  + G 
Sbjct: 1   MEGRKKMGSSSSFT----SELFGSNQFQKSSASGIFESMFPPPPKVLGRGSLCS-EVSGK 55

Query: 61  LRKQGESSKG--------GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRT 111
              +G + K         GE+     KD SSIYQ +  +PC  SSSI+YGGQ+ YS P++
Sbjct: 56  SSSEGWNYKFDYISMGNHGETQNTAQKDMSSIYQEQRLQPCQLSSSIHYGGQDIYSCPKS 115

Query: 112 KATTESHHHVVKKDGDEDDPNGN 134
              +  +  + KKDG EDD   N
Sbjct: 116 TQDSGYNSLLYKKDGVEDDLGSN 138


>gi|297810301|ref|XP_002873034.1| hypothetical protein ARALYDRAFT_908064 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318871|gb|EFH49293.1| hypothetical protein ARALYDRAFT_908064 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 154

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/116 (41%), Positives = 63/116 (54%), Gaps = 15/116 (12%)

Query: 42  PPTAVQGRGSSHSGSIIGSLR----KQGESSKGG-ESNIFNSKDKSSIYQSESAEPCYFS 96
           PP+ V GR S    ++ G+      K  E   G  + N    ++  S YQ +  +PC+ S
Sbjct: 48  PPSQVLGRESMRQETVAGAWNDKTSKTVEDFAGDVDRNREQQENHGSGYQDQRVQPCHLS 107

Query: 97  SSIYYGGQENY-SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           SSIYYGG + Y  P+  +   ++    KKDG EDD     S SASRGNWWQGSLYY
Sbjct: 108 SSIYYGGPDVYFQPQNSSINSTN----KKDGGEDD-----SGSASRGNWWQGSLYY 154


>gi|42570391|ref|NP_850323.2| uncharacterized protein [Arabidopsis thaliana]
 gi|28207124|gb|AAO37202.1| hypothetical protein [Arabidopsis thaliana]
 gi|330254646|gb|AEC09740.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 151

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/100 (44%), Positives = 59/100 (59%), Gaps = 17/100 (17%)

Query: 18  FDHLFGPK--DSSSSATTGGFGSIFSPPTAV-QGRGSSHSGSI------IGSLRKQGESS 68
            DH+FGP+   S SS+TTG F SIF PP+AV QG  +S +G+         +  ++GE S
Sbjct: 18  LDHIFGPRVSHSYSSSTTGLFKSIFPPPSAVTQGNLTSRNGAAKYQPTNFETPNERGERS 77

Query: 69  KGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS 108
           K        +K++ S    E+  PC  SSSIYYGGQ+NYS
Sbjct: 78  K--------NKERKSYQSEETQPPCNLSSSIYYGGQDNYS 109


>gi|297815854|ref|XP_002875810.1| hypothetical protein ARALYDRAFT_485059 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321648|gb|EFH52069.1| hypothetical protein ARALYDRAFT_485059 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 165

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 76/154 (49%), Gaps = 40/154 (25%)

Query: 12  SSSPINFD-HLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKG 70
           SSSP ++   +FGPK+ SSSA    F SIF PP+                     +S +G
Sbjct: 9   SSSPSSYTAKIFGPKEPSSSA---NFNSIFPPPS---------------------KSIQG 44

Query: 71  GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-----------PRTKATTESH- 118
              NI +SK  S   + ESA  C  SSS+YYGGQ+ YS             TK+ T  + 
Sbjct: 45  TSRNILSSKYGSLDQRKESAT-CNLSSSLYYGGQDVYSQSTHNHTHTTVKETKSLTLQNV 103

Query: 119 -HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
             H + KD + D+ N  NS+ ASRGNWWQ  + Y
Sbjct: 104 VSHQINKDQNRDN-NDANSSDASRGNWWQDDVCY 136


>gi|357505743|ref|XP_003623160.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
 gi|355498175|gb|AES79378.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
          Length = 148

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 48/119 (40%), Positives = 65/119 (54%), Gaps = 12/119 (10%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
           ME ++Q  +SSS +      LFG  D   S+ +G F SIFSP + V GR S  S ++ G 
Sbjct: 1   MEGRKQFGSSSSETSSFTSELFGSNDLHKSSASGIFDSIFSPSSKVFGRESLRS-AMNGK 59

Query: 61  LRKQGESSK-----------GGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS 108
              +G +SK            GE+    +KD S IYQ +  +PC+ SSSIYYGGQ+ YS
Sbjct: 60  TATEGSNSKIDTQEYISKGNDGETQNKTNKDMSYIYQEQRVQPCHLSSSIYYGGQDVYS 118


>gi|357461049|ref|XP_003600806.1| hypothetical protein MTR_3g069570 [Medicago truncatula]
 gi|355489854|gb|AES71057.1| hypothetical protein MTR_3g069570 [Medicago truncatula]
          Length = 217

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 90/156 (57%), Gaps = 10/156 (6%)

Query: 1   MENKRQE-----SASSSSSPINFDHLFGPKDSSSSATTGGFGSIFS--PPTAVQGRGSSH 53
           MENK  +     + SSSS+P NFDHLFGPKD S+++++          PP+    R S+ 
Sbjct: 1   MENKNNKKKVGGAGSSSSTPTNFDHLFGPKDPSTTSSSSNSIFSSIFPPPSTGGWRDSTR 60

Query: 54  SGSIIGSLRKQGE-SSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYY-GGQENY-SPR 110
                 +    G  S+KGG S   N+   ++ YQ+E+ EP  + SS  Y GGQENY SP 
Sbjct: 61  HDMGSNTYGAPGNYSNKGGNSANKNTGSDTTYYQNETVEPSNYYSSSIYYGGQENYNSPT 120

Query: 111 TKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQ 146
           T  TT++ H   K   D+DD NG++S SASRGNWW+
Sbjct: 121 TCKTTQTPHFTKKNKDDDDDSNGSDSGSASRGNWWK 156


>gi|194692056|gb|ACF80112.1| unknown [Zea mays]
 gi|195646162|gb|ACG42549.1| hypothetical protein [Zea mays]
 gi|268083689|gb|ACY95285.1| unknown [Zea mays]
 gi|413949056|gb|AFW81705.1| hypothetical protein ZEAMMB73_746072 [Zea mays]
          Length = 136

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 69/124 (55%), Gaps = 16/124 (12%)

Query: 34  GGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQ---GESSKGGESNIFNS-KDKSSIYQSES 89
           G F ++FS PTA     + H+  +   L KQ   G++ + G+S+   + KD    + +E 
Sbjct: 23  GYFSTVFSAPTAGSASDAKHA-DLYTMLNKQSSRGQNGRDGKSHSRPTYKDGKHAHPNEP 81

Query: 90  AEPCYFSSSIYYGGQENYSP--RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
           +E  YF SS++YGG+E YS   R +   E H      D   D+P+G    SA+RG+WWQG
Sbjct: 82  SESPYFGSSVHYGGREFYSSVLRKQPANEPH-----TDYKGDNPDG----SATRGDWWQG 132

Query: 148 SLYY 151
           SLYY
Sbjct: 133 SLYY 136


>gi|212722470|ref|NP_001131619.1| uncharacterized protein LOC100192973 [Zea mays]
 gi|195646590|gb|ACG42763.1| hypothetical protein [Zea mays]
          Length = 136

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 69/124 (55%), Gaps = 16/124 (12%)

Query: 34  GGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQ---GESSKGGESNIFNS-KDKSSIYQSES 89
           G F ++FS PTA     + H+  +   L KQ   G++ + G+S+   + KD    + +E 
Sbjct: 23  GYFSTVFSAPTAGSASDAKHA-DLYTMLNKQSSRGQNGRDGKSHSRPTYKDGKHAHPNEP 81

Query: 90  AEPCYFSSSIYYGGQENYSP--RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
           +E  YF SS++YGG+E YS   R +   E H      D   D+P+G    SA+RG+WWQG
Sbjct: 82  SESPYFGSSVHYGGREFYSSVLRKQPANEPH-----TDYKGDNPHG----SATRGDWWQG 132

Query: 148 SLYY 151
           SLYY
Sbjct: 133 SLYY 136


>gi|357119628|ref|XP_003561537.1| PREDICTED: uncharacterized protein LOC100840587 [Brachypodium
           distachyon]
          Length = 150

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 86/162 (53%), Gaps = 23/162 (14%)

Query: 1   MENKRQESASSSSSPIN----FDHLFGPKDSSSSATTGGFG---SIFSPPTAVQGRGSSH 53
           ME K++ +A+ + S  N     D LFGPK  +S++++       ++F  P+   G+ +S 
Sbjct: 1   MEGKKKPAAAGAGSKTNGSSIVDDLFGPKPDASNSSSSSASYFSTVFPTPSEAAGKDAS- 59

Query: 54  SGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYF-SSSIYYGGQENYSPRTK 112
                    ++ + + G +S+   +  +   Y S S E  YF SSS++YGG++ Y+P   
Sbjct: 60  ---------RRAQYASGNKSSSSWTGQQQQQYGSGSPESPYFGSSSVHYGGRDLYTPGNV 110

Query: 113 ATTESHHH---VVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
              +S  H    ++K    DD  G+ S +A+RG+WWQGSLYY
Sbjct: 111 NGDQSKTHHGGPIRKHNKVDD--GDTSGAATRGDWWQGSLYY 150


>gi|22326622|ref|NP_680147.1| uncharacterized protein [Arabidopsis thaliana]
 gi|52354445|gb|AAU44543.1| hypothetical protein AT5G05965 [Arabidopsis thaliana]
 gi|332003563|gb|AED90946.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 131

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 88/153 (57%), Gaps = 24/153 (15%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDS-SSSATTGGFGSIFSPPTAVQGRGSSHSGSIIG 59
           M+ K+ ++ASSS S  +FD+LFGPK S S+S+ +    SIF PP A + +GS  +  + G
Sbjct: 1   MDIKKTKAASSSHS-SSFDNLFGPKGSASASSCSTILDSIFPPPVA-RKKGSHTAPEVQG 58

Query: 60  SLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRTKATTESH 118
                         NI  + D+ S +  E  E  Y+SSSIYYGGQ++YS PRT  ++ S 
Sbjct: 59  --------------NISQATDERSSH--EKRESSYYSSSIYYGGQQHYSPPRTDGSSTSP 102

Query: 119 HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
            H  K+  D  D     + S SRGNWW+GSLYY
Sbjct: 103 SHQPKETNDRTD----ITTSTSRGNWWKGSLYY 131


>gi|242090741|ref|XP_002441203.1| hypothetical protein SORBIDRAFT_09g022260 [Sorghum bicolor]
 gi|241946488|gb|EES19633.1| hypothetical protein SORBIDRAFT_09g022260 [Sorghum bicolor]
          Length = 138

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 45/73 (61%), Gaps = 7/73 (9%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           KD    Y +ES+E  YF SS++YGG+E YS   +    +  H   K   ED+P+G    S
Sbjct: 73  KDGKQSYPNESSESPYFGSSVHYGGREFYSSTLQKQPANEPHTNYK---EDNPDG----S 125

Query: 139 ASRGNWWQGSLYY 151
           A+RG+WWQGSLYY
Sbjct: 126 ATRGDWWQGSLYY 138


>gi|326498025|dbj|BAJ94875.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 137

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 81/152 (53%), Gaps = 16/152 (10%)

Query: 1   MENKRQESASSSSSPINFDHLFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIG 59
           M+ K++ +A++ SS +  D LFGPK+ ++S ++ G F ++F  P+A  G+          
Sbjct: 1   MDGKKKPAAATGSSIV--DDLFGPKNGAASPSSAGYFSTVFPTPSAATGK---------- 48

Query: 60  SLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHH 119
              ++  ++ G +S+    + +   +      P + SSS++YGG++ Y+    A ++  +
Sbjct: 49  DASRRAAAAAGNKSSSAGHQQQGKQHGGSPESPYFGSSSVHYGGRDFYA---GAESQRQY 105

Query: 120 HVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
                       +  ++++A+RG+WWQGSLYY
Sbjct: 106 PPAAAAPAPKHKDDGDTSAATRGDWWQGSLYY 137


>gi|357133435|ref|XP_003568330.1| PREDICTED: uncharacterized protein LOC100823157 [Brachypodium
           distachyon]
          Length = 146

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 65/131 (49%), Gaps = 22/131 (16%)

Query: 34  GGFGSIFSPPTAVQGRGSSHS--GSIIGSLRKQGESSKGGESNIFNS---------KDKS 82
           G F S+FS   A   R +  +   +I+     +G +S G     F           KD  
Sbjct: 25  GYFSSVFSASPAANPRDARETDLNTILNRQNPKGPTSGGIADGKFQGSPTKGRVTYKDGQ 84

Query: 83  SIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSAS 140
             Y +E +E  YF SS++YGG++ Y  SP  +A+  S  +       +D+P+G   + A+
Sbjct: 85  QFYPNEPSESPYFGSSVHYGGRDFYDSSPHKQASESSRSY------KDDNPDG---SLAT 135

Query: 141 RGNWWQGSLYY 151
           RG+WWQGSLYY
Sbjct: 136 RGDWWQGSLYY 146


>gi|30679498|ref|NP_850752.1| uncharacterized protein [Arabidopsis thaliana]
 gi|222422999|dbj|BAH19483.1| AT5G02020 [Arabidopsis thaliana]
 gi|332003038|gb|AED90421.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 137

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 8/94 (8%)

Query: 20  HLFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNS 78
            LFG +++ SS ++ G  GSIF PP+ V GR S    ++ G    +  S  GG  N+  +
Sbjct: 22  ELFGSRENPSSPSSSGILGSIFPPPSKVLGRESVRQETVTGGCWNEKTSKTGG--NVDRN 79

Query: 79  KDK-----SSIYQSESAEPCYFSSSIYYGGQENY 107
           +++     S   Q +  +PC+ SSSIYYGG + Y
Sbjct: 80  REQQENHGSGYQQDQRVQPCHLSSSIYYGGPDVY 113


>gi|195657743|gb|ACG48339.1| hypothetical protein [Zea mays]
          Length = 137

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 7/73 (9%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           KD    Y ++S+E  YF SS++YGG+E YS   +    +      K   ED P+G    S
Sbjct: 72  KDGKHAYPNDSSESPYFGSSVHYGGREFYSNTLQKQPANEPQTKYK---EDKPDG----S 124

Query: 139 ASRGNWWQGSLYY 151
           A+RG+WWQGSLYY
Sbjct: 125 ATRGDWWQGSLYY 137


>gi|147862480|emb|CAN84007.1| hypothetical protein VITISV_033736 [Vitis vinifera]
          Length = 158

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 57/103 (55%), Gaps = 17/103 (16%)

Query: 21  LFGPKDSS--SSATTGGFGSIFSPPTAVQGRGSSH---------SGSII-----GSLRKQ 64
           LFG K+SS  S ++TG F SIFS  + V GR S           SG+ +     G+    
Sbjct: 19  LFGSKESSYPSPSSTGIFASIFSTSSKVLGRESLRPDLTKKKQDSGNEVWNAKPGTTENA 78

Query: 65  GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENY 107
            + S+G ES   +++D  S YQ +  +PC+ SSSIYYGGQ+ Y
Sbjct: 79  LQQSEG-ESQSISNRDTGSFYQEQRVQPCHLSSSIYYGGQDIY 120


>gi|326524530|dbj|BAK00648.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 142

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 67/141 (47%), Gaps = 30/141 (21%)

Query: 28  SSSATTGGFGSIFSPPTAVQGRGSSH---------------SGSIIGSLRKQGESSKGGE 72
           S++A  G F S+FS   A   R +                 SG      + QG  +KG  
Sbjct: 15  SAAAGNGYFSSVFSASPAANPRDARQMDLYTMLNKQNPKGQSGGGTADGKSQGSPAKG-- 72

Query: 73  SNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDD 130
                 KD    Y +ES+E  YF SS++YGG++ Y  SP  +A   S ++       ED+
Sbjct: 73  --RVAYKDGKQFYPNESSESPYFGSSVHYGGRDFYDSSPHKQANESSRNY------KEDN 124

Query: 131 PNGNNSNSASRGNWWQGSLYY 151
            +G   + A+RG+WWQGSLYY
Sbjct: 125 TDG---SLATRGDWWQGSLYY 142


>gi|413949057|gb|AFW81706.1| hypothetical protein ZEAMMB73_746072 [Zea mays]
          Length = 89

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 44/75 (58%), Gaps = 11/75 (14%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSP--RTKATTESHHHVVKKDGDEDDPNGNNS 136
           KD    + +E +E  YF SS++YGG+E YS   R +   E H      D   D+P+G   
Sbjct: 24  KDGKHAHPNEPSESPYFGSSVHYGGREFYSSVLRKQPANEPH-----TDYKGDNPDG--- 75

Query: 137 NSASRGNWWQGSLYY 151
            SA+RG+WWQGSLYY
Sbjct: 76  -SATRGDWWQGSLYY 89


>gi|388511371|gb|AFK43747.1| unknown [Lotus japonicus]
          Length = 115

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 51/102 (50%), Gaps = 23/102 (22%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGG--------- 71
           LFGP +S  S+    F S+FSP   V GR S  S  + G +  +G S+K G         
Sbjct: 18  LFGPSESHPSS---AFESMFSPSPKVTGRVSLRS-EVSGKIGGEGSSTKFGTPPDSSHIN 73

Query: 72  ----------ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGG 103
                     E+    +KD SSIYQ +  +PC+ SSSIYYGG
Sbjct: 74  ADFISKGSDSETQSKANKDMSSIYQEQRVQPCHLSSSIYYGG 115


>gi|212720725|ref|NP_001131984.1| hypothetical protein [Zea mays]
 gi|194693102|gb|ACF80635.1| unknown [Zea mays]
 gi|413945571|gb|AFW78220.1| hypothetical protein ZEAMMB73_376464 [Zea mays]
          Length = 137

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 42/73 (57%), Gaps = 7/73 (9%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           KD    Y ++S+E  YF SS++YGG+E YS   +    +      K   ED  +G    S
Sbjct: 72  KDGKHAYPNDSSESPYFGSSVHYGGREFYSSTLQKQPANEPQTKYK---EDKSDG----S 124

Query: 139 ASRGNWWQGSLYY 151
           A+RG+WWQGSLYY
Sbjct: 125 ATRGDWWQGSLYY 137


>gi|357470595|ref|XP_003605582.1| hypothetical protein MTR_4g034100 [Medicago truncatula]
 gi|355506637|gb|AES87779.1| hypothetical protein MTR_4g034100 [Medicago truncatula]
          Length = 213

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 37/68 (54%), Gaps = 8/68 (11%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           K  SSI+Q +   PC  SSSIYYGGQ+   P   A   S   + K D + DD     S  
Sbjct: 145 KVTSSIFQDQIIGPCNLSSSIYYGGQDILYPAQNARLTS---LKKYDWENDD-----SGV 196

Query: 139 ASRGNWWQ 146
           ASRG+WW+
Sbjct: 197 ASRGDWWK 204


>gi|357470589|ref|XP_003605579.1| hypothetical protein MTR_4g034060 [Medicago truncatula]
 gi|355506634|gb|AES87776.1| hypothetical protein MTR_4g034060 [Medicago truncatula]
          Length = 159

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           K  SSI+Q +   PC  SSSIYYGGQ+   P           + K D +      N+S  
Sbjct: 86  KIASSIFQDQITGPCNLSSSIYYGGQDILYPAQSTKIARLTLLTKYDWE-----NNDSEV 140

Query: 139 ASRGNWWQ 146
           ASRG+WW+
Sbjct: 141 ASRGDWWK 148


>gi|297810695|ref|XP_002873231.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319068|gb|EFH49490.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 134

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/139 (40%), Positives = 78/139 (56%), Gaps = 24/139 (17%)

Query: 18  FDHLFGPKDSSSSATTGG----FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGES 73
           FD+LFGPK S+S+++         SIF PP A + +GS  +  + GS+            
Sbjct: 15  FDNLFGPKGSASASSASSCSTILDSIFPPPVA-RKKGSHTAPEVQGSI------------ 61

Query: 74  NIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRTKATTESHHHVVKKDGDEDDPN 132
            +  +KD+ S +  E  E  Y+SSSIYYGGQ++YS PRT  ++ S  H  K    E D  
Sbjct: 62  TLATAKDERSSH--EKRESSYYSSSIYYGGQQHYSPPRTDGSSTSPSHQPK----ETDDR 115

Query: 133 GNNSNSASRGNWWQGSLYY 151
            + + S SRGNWW+GSLYY
Sbjct: 116 TDTTTSTSRGNWWKGSLYY 134


>gi|116793306|gb|ABK26698.1| unknown [Picea sitchensis]
          Length = 116

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 55/116 (47%), Gaps = 18/116 (15%)

Query: 52  SHSGSIIGSLRKQGESSKGGESNIFNSKD----KSSIYQS---------ESAEPCYFSSS 98
           S S  ++GS +KQ  S +  ES +   K+    K  + Q           + E C++ SS
Sbjct: 3   SKSCEVLGSTQKQEHSEEHQESTLNGVKEINMEKEPLQQLLNHVKGKSFANPENCFWGSS 62

Query: 99  IYYGG-QENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           +YYGG  E+Y  S   KA  +     VK+  +  DP   N     RGNWW GSLYY
Sbjct: 63  LYYGGPDEDYATSQVIKANQDEIKTEVKQLEEAMDPC--NLEYTVRGNWWNGSLYY 116


>gi|115464241|ref|NP_001055720.1| Os05g0454500 [Oryza sativa Japonica Group]
 gi|53749346|gb|AAU90205.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579271|dbj|BAF17634.1| Os05g0454500 [Oryza sativa Japonica Group]
 gi|215768458|dbj|BAH00687.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218196903|gb|EEC79330.1| hypothetical protein OsI_20183 [Oryza sativa Indica Group]
 gi|222631813|gb|EEE63945.1| hypothetical protein OsJ_18770 [Oryza sativa Japonica Group]
          Length = 144

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 42/73 (57%), Gaps = 5/73 (6%)

Query: 79  KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           KD    Y +ES+E  YF SS++YG +E Y        ++     K+   E +P+G   + 
Sbjct: 77  KDGKQSYPNESSESPYFGSSVHYGAREFYGNTPPKQGDASPGNQKE--QEQNPDG---SL 131

Query: 139 ASRGNWWQGSLYY 151
           A+RG+WWQGSLYY
Sbjct: 132 ATRGDWWQGSLYY 144


>gi|15232675|ref|NP_190272.1| uncharacterized protein [Arabidopsis thaliana]
 gi|5541671|emb|CAB51177.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644693|gb|AEE78214.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 153

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 35/88 (39%), Positives = 41/88 (46%), Gaps = 29/88 (32%)

Query: 21  LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSKD 80
           LFGPK+ SSSA    F SIF P                         SKG   NI +SK 
Sbjct: 19  LFGPKEPSSSA---NFNSIFPP-------------------------SKGTARNILSSKH 50

Query: 81  KSSIYQSESAEPCYFSSSIYYGGQENYS 108
             S+ Q + A  C  SSS+YYGGQ+ YS
Sbjct: 51  -GSLEQRKEAGTCNLSSSLYYGGQDVYS 77


>gi|116790009|gb|ABK25469.1| unknown [Picea sitchensis]
          Length = 161

 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 11/96 (11%)

Query: 62  RKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGG-QENYS--PRTKATTESH 118
           R +G   K G+ ++     K   ++S   E C  +SS+YYGG  E+Y     T+   +  
Sbjct: 71  RVKGGKLKKGQLHLTEYVKK---HRSADPEHCSLASSLYYGGPDEHYDSLQTTQVDQDEQ 127

Query: 119 HHV---VKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
             +   VK+  +  DP   N   A+RGNWW+GSLYY
Sbjct: 128 QKIKMEVKQLQEAMDPF--NLEYATRGNWWKGSLYY 161


>gi|116793491|gb|ABK26767.1| unknown [Picea sitchensis]
          Length = 161

 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 11/96 (11%)

Query: 62  RKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGG-QENYS--PRTKATTESH 118
           R +G   K G+ ++     K   ++S   E C  +SS+YYGG  E+Y     T+   +  
Sbjct: 71  RVKGGKLKKGQLHLTEYVKK---HRSADPEHCSLASSLYYGGPDEHYDSLQTTQVDQDEQ 127

Query: 119 HHV---VKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
             +   VK+  +  DP   N   A+RGNWW+GSLYY
Sbjct: 128 QKIKMEVKQLQEAMDPF--NLEYATRGNWWKGSLYY 161


>gi|223950247|gb|ACN29207.1| unknown [Zea mays]
          Length = 155

 Score = 40.0 bits (92), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 54/125 (43%), Gaps = 17/125 (13%)

Query: 36  FGSIFSPPTAVQGRGSSHSGSII---GSLRKQGESSKGGESNIFNSKDKSSIYQSESAEP 92
             S+F PP            SII    S   Q  ++  G+     SK  ++  +     P
Sbjct: 39  LSSLFYPPAPAANGKDQELYSIIYKGQSGSAQAGATGNGKPQWAPSKSHTAYAKDGKHSP 98

Query: 93  CY----FSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQ 146
            Y    F SS++YGG++ +  S  TK  TES        GD+ DP  +     S G+WWQ
Sbjct: 99  PYDTSCFGSSVHYGGRDYFYGSSTTKKATES---STDYKGDKKDPVAD-----SHGDWWQ 150

Query: 147 GSLYY 151
           GS YY
Sbjct: 151 GSFYY 155


>gi|242054979|ref|XP_002456635.1| hypothetical protein SORBIDRAFT_03g039870 [Sorghum bicolor]
 gi|241928610|gb|EES01755.1| hypothetical protein SORBIDRAFT_03g039870 [Sorghum bicolor]
          Length = 155

 Score = 39.7 bits (91), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 33/59 (55%), Gaps = 10/59 (16%)

Query: 95  FSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           F SS++YGG+E +  S  TK  +ES        GD+ DP  +     S G+WWQGS YY
Sbjct: 105 FGSSVHYGGREYFYGSSTTKKASES---STDYKGDKKDPAAD-----SHGDWWQGSFYY 155


>gi|398803908|ref|ZP_10562914.1| citrate synthase I, hexameric type [Polaromonas sp. CF318]
 gi|398095334|gb|EJL85673.1| citrate synthase I, hexameric type [Polaromonas sp. CF318]
          Length = 436

 Score = 38.9 bits (89), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 13/102 (12%)

Query: 30  SATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSES 89
           +A   G   ++ P     G  +  + +++G ++KQG   K GE  I   KDKSS      
Sbjct: 255 AAIAAGVACLWGP---AHGGANEAALNMLGDIQKQGGVEKIGEF-IKQVKDKSS-----G 305

Query: 90  AEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDP 131
            +   F   +Y    +NY PR K   E+   V+++ G E+DP
Sbjct: 306 VKLMGFGHRVY----KNYDPRAKLMQETCKEVLQEMGLENDP 343


>gi|326501634|dbj|BAK02606.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 195

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 13/73 (17%)

Query: 81  KSSIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           K + Y S     C F SS++YGG++ Y  S   +A   S + V KKD   D         
Sbjct: 134 KDNKYDSVGTSSC-FGSSVHYGGRDYYGSSAPKQAAEYSDYKVDKKDPVAD--------- 183

Query: 139 ASRGNWWQGSLYY 151
            S G+WWQGS YY
Sbjct: 184 -SHGDWWQGSFYY 195


>gi|326489436|dbj|BAK01699.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 195

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 13/73 (17%)

Query: 81  KSSIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
           K + Y S     C F SS++YGG++ Y  S   +A   S + V KKD   D         
Sbjct: 134 KDNKYDSVGTSSC-FGSSVHYGGRDYYGSSAPKQAAEYSDYKVDKKDPVAD--------- 183

Query: 139 ASRGNWWQGSLYY 151
            S G+WWQGS YY
Sbjct: 184 -SHGDWWQGSFYY 195


>gi|222619540|gb|EEE55672.1| hypothetical protein OsJ_04080 [Oryza sativa Japonica Group]
          Length = 156

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 72  ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR-TKATTESHHHVVKKDGDEDD 130
           +S    +KD     Q +S +   F SS+ YGG++ Y     K +TES+ +      D+ D
Sbjct: 85  KSRTVYTKDNKQSNQYDSVDTSCFGSSVNYGGRDYYGISGHKQSTESNDYKA----DKKD 140

Query: 131 PNGNNSNSASRGNWWQGSLYY 151
           P+ +     S G+WWQGS YY
Sbjct: 141 PSTD-----SHGDWWQGSFYY 156


>gi|297597964|ref|NP_001044806.2| Os01g0848900 [Oryza sativa Japonica Group]
 gi|56784045|dbj|BAD82673.1| unknown protein [Oryza sativa Japonica Group]
 gi|255673876|dbj|BAF06720.2| Os01g0848900 [Oryza sativa Japonica Group]
          Length = 156

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 72  ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR-TKATTESHHHVVKKDGDEDD 130
           +S    +KD     Q +S +   F SS+ YGG++ Y     K +TES+ +      D+ D
Sbjct: 85  KSRTVYTKDNKQSNQYDSVDTSCFGSSVNYGGRDYYGISGHKQSTESNDYKA----DKKD 140

Query: 131 PNGNNSNSASRGNWWQGSLYY 151
           P+ +     S G+WWQGS YY
Sbjct: 141 PSTD-----SHGDWWQGSFYY 156


>gi|125528383|gb|EAY76497.1| hypothetical protein OsI_04437 [Oryza sativa Indica Group]
          Length = 156

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 72  ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR-TKATTESHHHVVKKDGDEDD 130
           +S    +KD     Q +S +   F SS+ YGG++ Y     K +TES+ +      D+ D
Sbjct: 85  KSRTVYTKDNKQSNQYDSVDTSCFGSSVNYGGRDYYGISGHKQSTESNDYKA----DKKD 140

Query: 131 PNGNNSNSASRGNWWQGSLYY 151
           P+ +     S G+WWQGS YY
Sbjct: 141 PSTD-----SHGDWWQGSFYY 156


>gi|357125868|ref|XP_003564611.1| PREDICTED: uncharacterized protein LOC100824578 [Brachypodium
           distachyon]
          Length = 152

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 13/60 (21%)

Query: 95  FSSSIYYGGQENY---SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
           F SS++YGG++ Y   S   +AT  + + V  KD   D          S G+WWQGS YY
Sbjct: 103 FGSSVHYGGRDYYGSCSAPKQATEYNDYKVDNKDPTTD----------SHGDWWQGSFYY 152


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.303    0.122    0.352 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,538,584,021
Number of Sequences: 23463169
Number of extensions: 104909039
Number of successful extensions: 265381
Number of sequences better than 100.0: 626
Number of HSP's better than 100.0 without gapping: 198
Number of HSP's successfully gapped in prelim test: 428
Number of HSP's that attempted gapping in prelim test: 248814
Number of HSP's gapped (non-prelim): 13341
length of query: 151
length of database: 8,064,228,071
effective HSP length: 114
effective length of query: 37
effective length of database: 9,684,394,101
effective search space: 358322581737
effective search space used: 358322581737
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.8 bits)
S2: 71 (32.0 bits)