BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024641
         (265 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|296089775|emb|CBI39594.3| unnamed protein product [Vitis vinifera]
          Length = 471

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/246 (73%), Positives = 212/246 (86%), Gaps = 5/246 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           M+  +KRVQ L+FV+GI+ALS+TAEKCRQL+GED SSQSGKFT+LNCFDM SGT+AC VK
Sbjct: 1   MNLPRKRVQLLIFVVGIVALSITAEKCRQLLGEDGSSQSGKFTLLNCFDMSSGTLACTVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIRA HVE+AR+ AIE A+ DAL+QGL++ DAAK AQKEGAKAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRAVHVEKARHHAIEGALSDALTQGLNAKDAAKHAQKEGAKAAKLATRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFGAY GGFLGE+RLGRFGYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTLTEGFLRGTGTLFGAYTGGFLGEQRLGRFGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLMIYDV NGV FLLQ VQ EE+     P++   +  ED  + E+P++ +SEA
Sbjct: 181 SWVGGRIGLMIYDVANGVQFLLQSVQPEET-----PMDMSSEVPEDPGSYESPAYESSEA 235

Query: 241 SEDSKA 246
            EDS++
Sbjct: 236 QEDSES 241


>gi|224123740|ref|XP_002330196.1| predicted protein [Populus trichocarpa]
 gi|222871652|gb|EEF08783.1| predicted protein [Populus trichocarpa]
          Length = 259

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 177/246 (71%), Positives = 209/246 (84%), Gaps = 5/246 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF K+RV FL+F++G IALS+TAEKCRQLVG+D SSQSGKFTI NCFDMGSGT+AC VK
Sbjct: 1   MDFQKRRVLFLVFIVGTIALSITAEKCRQLVGDDYSSQSGKFTIFNCFDMGSGTLACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR++HVERARN+AIE++++D + QG+ + D AK AQKEGAKAAKLAKRQ K
Sbjct: 61  EGVKLYFYNIRSSHVERARNLAIERSLLDTIGQGMPAQDVAKTAQKEGAKAAKLAKRQTK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGP+I++GWDFFEA+YYGGT+TEGF+RG+GTL GAYAGGF GEERLGR GYLVGSHLG
Sbjct: 121 RIIGPVISSGWDFFEALYYGGTVTEGFLRGSGTLAGAYAGGFFGEERLGRVGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLM+YDVVNGVH+LLQFVQ E+ E H+ P  E F+  EDS       + + EA
Sbjct: 181 SWVGGRIGLMVYDVVNGVHYLLQFVQGEDGEVHETPTYENFEVSEDS-----QGYTSYEA 235

Query: 241 SEDSKA 246
           S+DS  
Sbjct: 236 SKDSNV 241


>gi|356575349|ref|XP_003555804.1| PREDICTED: uncharacterized protein LOC100810395 [Glycine max]
          Length = 266

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/271 (69%), Positives = 218/271 (80%), Gaps = 11/271 (4%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +R+  L+ V+ I+ALS TAEKCR+LVGE+ SSQSGKFTILNCFDMGSGTVAC VK
Sbjct: 1   MDFESRRLHLLISVVAIVALSFTAEKCRELVGEEGSSQSGKFTILNCFDMGSGTVACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR++H ERAR+ AIE A+VDA++QG+S  D+AK AQKEG KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSSHAERARSQAIESALVDAVAQGMSPTDSAKHAQKEGKKAAKLASRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFG YAGGFLGE+RLGR GYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTVTEGFLRGTGTLFGTYAGGFLGEQRLGRIGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLM+YDVVNGVH LLQFVQ+ E+E     V EK +  E S+  E P +  SE 
Sbjct: 181 SWVGGRIGLMVYDVVNGVHLLLQFVQTAETE-----VREKSERSESSFFGETPVYDGSEG 235

Query: 241 S---EDSKASDSSLYENSES---ETYENSEL 265
           S   E     +S  Y+++E    ETYE+SEL
Sbjct: 236 SSVYESPLDEESYAYKSTERQSYETYEDSEL 266


>gi|255542892|ref|XP_002512509.1| conserved hypothetical protein [Ricinus communis]
 gi|223548470|gb|EEF49961.1| conserved hypothetical protein [Ricinus communis]
          Length = 264

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 195/265 (73%), Positives = 226/265 (85%), Gaps = 4/265 (1%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
            DF KKRVQ L+F+IGI+ALS+TA+KCRQLVGED+SSQSGKFTI NCFDM +GT+AC VK
Sbjct: 3   FDFQKKRVQLLIFMIGIVALSITADKCRQLVGEDSSSQSGKFTIFNCFDMSTGTLACTVK 62

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+AHVE ARN+AIE+A++DAL QG+++ DAAKQAQKEGAKAAKLA RQAK
Sbjct: 63  EGVKLYFYNIRSAHVESARNLAIERALLDALGQGMAAKDAAKQAQKEGAKAAKLATRQAK 122

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEA+YYGGTITEGF+RGTGTL GAYAGGFLGE RLGRFGYLVGSHLG
Sbjct: 123 RIIGPIISSGWDFFEALYYGGTITEGFLRGTGTLGGAYAGGFLGEARLGRFGYLVGSHLG 182

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLM+YDVVNGVH+LLQ  Q+ +SE H+    EK    EDS   E+  + NSEA
Sbjct: 183 SWVGGRIGLMVYDVVNGVHYLLQVFQAGDSEVHEN--YEKVVVSEDSPVYESSEYTNSEA 240

Query: 241 SEDSKASDSSLYENSESETYENSEL 265
           SEDS   ++  YE+SES  Y+NSE 
Sbjct: 241 SEDSNVYEAPPYESSES--YDNSEF 263


>gi|388496142|gb|AFK36137.1| unknown [Lotus japonicus]
          Length = 282

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/270 (69%), Positives = 217/270 (80%), Gaps = 6/270 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +R Q LL V  I+ LS+TAEKCRQLVGE+ SSQSGKFTILNCFDMGSGTVACGVK
Sbjct: 1   MDFQNRRFQVLLAVAAIVVLSITAEKCRQLVGEEGSSQSGKFTILNCFDMGSGTVACGVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+AHVERAR+ AIE A+VDA+SQG+S  D+A   QKE  KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSAHVERARHRAIESALVDAVSQGMSPKDSATYVQKESKKAAKLASRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFGAY GGFLGE++LGRFGYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTVTEGFLRGTGTLFGAYGGGFLGEQKLGRFGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLM+YDV NGV+ LLQFVQ+ E E H A   E  +  E  +  E P + +SE 
Sbjct: 181 SWVGGRIGLMVYDVFNGVNLLLQFVQTGEIEVHKASANENSEAPEGYFFGEIPVYDSSEG 240

Query: 241 S---EDSKASDSSLYENS---ESETYENSE 264
           S   E S + +S++YE+S   ES  YE++E
Sbjct: 241 SNIYESSPSEESNIYESSPSEESNAYESTE 270


>gi|225450593|ref|XP_002282180.1| PREDICTED: uncharacterized protein LOC100249443 [Vitis vinifera]
          Length = 242

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/244 (74%), Positives = 210/244 (86%), Gaps = 5/244 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           M+  +KRVQ L+FV+GI+ALS+TAEKCRQL+GED SSQSGKFT+LNCFDM SGT+AC VK
Sbjct: 1   MNLPRKRVQLLIFVVGIVALSITAEKCRQLLGEDGSSQSGKFTLLNCFDMSSGTLACTVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIRA HVE+AR+ AIE A+ DAL+QGL++ DAAK AQKEGAKAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRAVHVEKARHHAIEGALSDALTQGLNAKDAAKHAQKEGAKAAKLATRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFGAY GGFLGE+RLGRFGYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTLTEGFLRGTGTLFGAYTGGFLGEQRLGRFGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLMIYDV NGV FLLQ VQ EE+     P++   +  ED  + E+P++ +SEA
Sbjct: 181 SWVGGRIGLMIYDVANGVQFLLQSVQPEET-----PMDMSSEVPEDPGSYESPAYESSEA 235

Query: 241 SEDS 244
            EDS
Sbjct: 236 QEDS 239


>gi|224125018|ref|XP_002319482.1| predicted protein [Populus trichocarpa]
 gi|222857858|gb|EEE95405.1| predicted protein [Populus trichocarpa]
          Length = 230

 Score =  375 bits (964), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 173/227 (76%), Positives = 205/227 (90%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF KKRVQ L F++GIIALS+TAEKCRQLVG+D SSQSGKFTI +CFDMGSGT+AC VK
Sbjct: 1   MDFQKKRVQLLAFIVGIIALSITAEKCRQLVGDDNSSQSGKFTIFDCFDMGSGTLACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY YNIR+AHVERARN+AIE++++DA+ QG+S  DAAK AQKEG KAAKLAK+QAK
Sbjct: 61  EGVKLYVYNIRSAHVERARNLAIERSLLDAVGQGMSPQDAAKTAQKEGTKAAKLAKQQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GP+I++GWDFFEA+YYGGTITEGF+RG+GTL GAYAGGFLG+ERLGR GYLVGSHLG
Sbjct: 121 RIVGPVISSGWDFFEALYYGGTITEGFLRGSGTLVGAYAGGFLGDERLGRVGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDS 227
           SW GGRIGLM+YDVV+GVH+LLQFVQ E+SE +++P +E  + YE S
Sbjct: 181 SWVGGRIGLMVYDVVDGVHYLLQFVQGEDSEVYESPPDESPESYEHS 227


>gi|255646276|gb|ACU23622.1| unknown [Glycine max]
          Length = 266

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/271 (68%), Positives = 217/271 (80%), Gaps = 11/271 (4%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +R+  L+ V+ I ALS TAEKCR+LVGE+ SSQSGKFTILNCFDMGSGTVAC VK
Sbjct: 1   MDFESRRLHLLISVVAIGALSFTAEKCRELVGEEGSSQSGKFTILNCFDMGSGTVACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR++H ERAR+ AI+ A+VDA++QG+S  D+AK AQKEG KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSSHAERARSQAIKSALVDAVAQGMSPTDSAKHAQKEGKKAAKLASRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFG YAGGFLGE+RLGR GYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTVTEGFLRGTGTLFGTYAGGFLGEQRLGRIGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLM+YDVVNGVH LLQFVQ+ E+E     V EK +  E S+  E P +  SE 
Sbjct: 181 SWVGGRIGLMVYDVVNGVHLLLQFVQTAETE-----VREKSERSESSFFGETPVYDGSEG 235

Query: 241 S---EDSKASDSSLYENSES---ETYENSEL 265
           S   E     +S  Y+++E    ETYE+SEL
Sbjct: 236 SSVYESPLDEESYAYKSTERQSYETYEDSEL 266


>gi|358248534|ref|NP_001239898.1| uncharacterized protein LOC100786994 precursor [Glycine max]
 gi|255639709|gb|ACU20148.1| unknown [Glycine max]
          Length = 267

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 185/271 (68%), Positives = 217/271 (80%), Gaps = 10/271 (3%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +R+  L+ V+ IIALS TAEKCR+LVGE+ SSQSGKFTILNCFDMGSGTVAC VK
Sbjct: 1   MDFESRRLHLLISVVAIIALSFTAEKCRELVGEEGSSQSGKFTILNCFDMGSGTVACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR++H ERAR+ AIE A+VDA++QG+S  D+AK AQKEG KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSSHAERARSQAIESALVDAVTQGMSPTDSAKHAQKEGKKAAKLASRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFG YAGGFLGE+RLGR GYL+GSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTVTEGFLRGTGTLFGTYAGGFLGEQRLGRIGYLLGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGRIGLM+YDVVNGVH LLQFVQ+ E E H+     +      S+  E P + +SE 
Sbjct: 181 SWVGGRIGLMVYDVVNGVHLLLQFVQTGEIEVHEKSKRSE----SSSFFGETPVYDSSEG 236

Query: 241 S---EDSKASDSSLYENSES---ETYENSEL 265
           S   E   + +S  YE++E    ETYE+SEL
Sbjct: 237 SGVYESPPSEESYAYESTERQSYETYEDSEL 267


>gi|449454014|ref|XP_004144751.1| PREDICTED: uncharacterized protein LOC101204135 [Cucumis sativus]
 gi|449524962|ref|XP_004169490.1| PREDICTED: uncharacterized protein LOC101228818 [Cucumis sativus]
          Length = 271

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 183/268 (68%), Positives = 216/268 (80%), Gaps = 5/268 (1%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
            DF KKR+Q L+F+IG I LS TAEKCR LVGE+ASSQSGKFT LNCFDMGSG+VACGVK
Sbjct: 5   FDFEKKRIQLLVFIIGTIVLSFTAEKCRHLVGEEASSQSGKFTFLNCFDMGSGSVACGVK 64

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+AHVE  R+ A+E A+ DA++QG+S+ +AAK AQKEG KAAKLAKRQAK
Sbjct: 65  EGVKLYFYNIRSAHVESVRHTALETALADAITQGMSAKEAAKHAQKEGVKAAKLAKRQAK 124

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEA+YYGGTITEGF+RG+GTLFGAYAGGF+G++RLGRFGYL+GSHLG
Sbjct: 125 RIIGPIISSGWDFFEALYYGGTITEGFLRGSGTLFGAYAGGFIGDQRLGRFGYLIGSHLG 184

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQ---SEESEAHDAPVEEKFQDYEDSYANEAPSFWN 237
           SW GGRIGLM+YDVVNGVHFLL FVQ     E    +A   E     + S+ N+AP + N
Sbjct: 185 SWVGGRIGLMVYDVVNGVHFLLNFVQGEEESEVHEKEAAYVENEASSDGSHVNDAPIYNN 244

Query: 238 SEASEDSKASDSSLYENSESETYENSEL 265
            E  E+S   +SS   + ES  +ENSE 
Sbjct: 245 LEDIEESYHYESS--PSDESLDHENSEF 270


>gi|356505771|ref|XP_003521663.1| PREDICTED: uncharacterized protein LOC100794770 [Glycine max]
          Length = 218

 Score =  344 bits (882), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 165/211 (78%), Positives = 186/211 (88%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF KKRVQFL FV  IIALS+TAEKCRQLVGE ASSQSGKFT LNCFDM SGT+AC VK
Sbjct: 1   MDFQKKRVQFLAFVAAIIALSITAEKCRQLVGEKASSQSGKFTFLNCFDMSSGTLACSVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           E VKLYFYNIRAAHVE AR+ A++ A+VDAL QG+S + +AK A+KEG KAAKLA R+A+
Sbjct: 61  ESVKLYFYNIRAAHVEGARHDALQSALVDALKQGMSQSASAKYAKKEGDKAAKLASRKAR 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEA+YYGGT+TEGF+RG+GTLFG YAGGFLGE+RLGRFGYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAVYYGGTLTEGFLRGSGTLFGTYAGGFLGEQRLGRFGYLVGSHLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESE 211
           SW GGRIGLM+YDV NGVHFLL  VQS  S+
Sbjct: 181 SWIGGRIGLMLYDVANGVHFLLHSVQSLGSD 211


>gi|297848154|ref|XP_002891958.1| hypothetical protein ARALYDRAFT_474805 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337800|gb|EFH68217.1| hypothetical protein ARALYDRAFT_474805 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 264

 Score =  339 bits (870), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 158/263 (60%), Positives = 208/263 (79%), Gaps = 5/263 (1%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF ++RVQF+LF IG+IALS+TAEKCR+LVG++A+S+SG+FT LNCFDM SGT+AC VK
Sbjct: 1   MDFRRRRVQFILFAIGLIALSMTAEKCRELVGQEAASKSGQFTFLNCFDMSSGTLACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+ HVE+ARNVAIEKA+ +AL   + + +AAK+ Q+ G KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSIHVEKARNVAIEKALHEALVNSMPAKEAAKEVQRAGEKAAKLASRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPI+AAGWDFFEA+Y+GGT+TEGF+RG+GT+ GAY+GG++GE+R GRFGYLVGS LG
Sbjct: 121 RIIGPIVAAGWDFFEALYFGGTLTEGFLRGSGTMVGAYSGGYVGEQRFGRFGYLVGSTLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEE----SEAHDAPVEEK-FQDYEDSYANEAPSF 235
           +W G R+GLM+YDVVNGV+F L+  QS E       +++P ++  F+  +D   +E+P  
Sbjct: 181 NWVGARVGLMVYDVVNGVNFFLESSQSGEIYKGQSTYESPADQSTFESPKDQSTSESPED 240

Query: 236 WNSEASEDSKASDSSLYENSESE 258
            ++  S   +  + S YE S  E
Sbjct: 241 QSTYESPQDRPENQSTYETSSDE 263


>gi|186491357|ref|NP_001117508.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332195272|gb|AEE33393.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 270

 Score =  337 bits (865), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/269 (62%), Positives = 207/269 (76%), Gaps = 9/269 (3%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +RVQFLLF IG+IALS+TAEKCR+LVG++A+S+SG+FT LNCFDM SGT+AC VK
Sbjct: 1   MDFRSRRVQFLLFAIGLIALSMTAEKCRELVGQEAASKSGQFTFLNCFDMSSGTLACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+ HVE+ARNVAIEKA+ +AL  G+ + +AAK+ Q+ G KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSIHVEKARNVAIEKALHEALDNGMLAKEAAKEGQRAGEKAAKLATRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPI+AAGWDFFEA+Y+GGT+TEGF+RGTGT+ GAY+GG++GE+R GRFGYLVGS LG
Sbjct: 121 RIIGPIVAAGWDFFEALYFGGTLTEGFLRGTGTMVGAYSGGYVGEQRFGRFGYLVGSTLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           +W G R+GLM+YDVVNGV+F  +  QS E    D    E  +D     + E PS +  E+
Sbjct: 181 NWVGARVGLMVYDVVNGVNFFYETYQSGEI-YEDQSTNESPEDRSTYESREDPSTY--ES 237

Query: 241 SEDSKA----SDSSLYENSESE--TYENS 263
            ED        D S YE+ E +  TYE S
Sbjct: 238 PEDRSTYEIREDQSTYESPEEDQSTYETS 266


>gi|116830703|gb|ABK28309.1| unknown [Arabidopsis thaliana]
          Length = 271

 Score =  337 bits (865), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/269 (62%), Positives = 207/269 (76%), Gaps = 9/269 (3%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +RVQFLLF IG+IALS+TAEKCR+LVG++A+S+SG+FT LNCFDM SGT+AC VK
Sbjct: 1   MDFRSRRVQFLLFAIGLIALSMTAEKCRELVGQEAASKSGQFTFLNCFDMSSGTLACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+ HVE+ARNVAIEKA+ +AL  G+ + +AAK+ Q+ G KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSIHVEKARNVAIEKALHEALDNGMLAKEAAKEGQRAGEKAAKLATRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPI+AAGWDFFEA+Y+GGT+TEGF+RGTGT+ GAY+GG++GE+R GRFGYLVGS LG
Sbjct: 121 RIIGPIVAAGWDFFEALYFGGTLTEGFLRGTGTMVGAYSGGYVGEQRFGRFGYLVGSTLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           +W G R+GLM+YDVVNGV+F  +  QS E    D    E  +D     + E PS +  E+
Sbjct: 181 NWVGARVGLMVYDVVNGVNFFYETYQSGEI-YEDQSTNESPEDRSTYESREDPSTY--ES 237

Query: 241 SEDSKA----SDSSLYENSESE--TYENS 263
            ED        D S YE+ E +  TYE S
Sbjct: 238 PEDRSTYEIREDQSTYESPEEDQSTYETS 266


>gi|357441883|ref|XP_003591219.1| hypothetical protein MTR_1g084040 [Medicago truncatula]
 gi|355480267|gb|AES61470.1| hypothetical protein MTR_1g084040 [Medicago truncatula]
          Length = 271

 Score =  333 bits (853), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 174/273 (63%), Positives = 208/273 (76%), Gaps = 12/273 (4%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  ++ +  + V  I+ALS+TAEKCR+L+GE+ SSQSGKFT+LNCFDMGSGT+AC VK
Sbjct: 3   MDFQNRKARLFVIVAAIVALSITAEKCRELIGEEGSSQSGKFTLLNCFDMGSGTLACAVK 62

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFY+IR+ HVE+AR  AIE A+VDA+SQG+   DAAK AQKE  KAAKLA RQAK
Sbjct: 63  EGVKLYFYSIRSTHVEKARQRAIESALVDAVSQGMPPTDAAKHAQKESKKAAKLASRQAK 122

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGTITEGF+RGTGTLFGAY GGF GE+ LGR GYLVGSH+G
Sbjct: 123 RIIGPIISSGWDFFEAIYYGGTITEGFLRGTGTLFGAYGGGFFGEQSLGRIGYLVGSHMG 182

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSF----- 235
           SW GGRIGLMIYDV+NGVH LL+FVQ+ E E     V E       S+  E P+F     
Sbjct: 183 SWVGGRIGLMIYDVINGVHLLLEFVQTGEIEVRKTLVHENSDAPYTSFDGEVPTFDRSER 242

Query: 236 ---WNSEASEDSKASDSSLYENSESETYENSEL 265
              + S  +E+S A +S  Y++ E+    NSEL
Sbjct: 243 SSSYESSPTEESNADESMEYQSYET----NSEL 271


>gi|356571073|ref|XP_003553705.1| PREDICTED: uncharacterized protein LOC100796894 [Glycine max]
          Length = 252

 Score =  330 bits (847), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 155/203 (76%), Positives = 180/203 (88%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF KKRVQFL FV  IIALS+TAEKCRQLVGE ASSQSGKFT LNCFDM SGT+AC VK
Sbjct: 1   MDFQKKRVQFLAFVAAIIALSITAEKCRQLVGEKASSQSGKFTFLNCFDMTSGTLACSVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           E VKLY YNIRAAHVERAR+ A++ A+VDAL QG+S + +AK A+KEG KAAKLA R+A+
Sbjct: 61  ESVKLYCYNIRAAHVERARHDAMKSALVDALKQGMSQSASAKYAKKEGDKAAKLASRKAR 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPI+++GWDFFEA+YYGGT+TEGF+RG+GTLFG YAGGFLG++RLGRFGYLVGSH+G
Sbjct: 121 RIIGPILSSGWDFFEAVYYGGTLTEGFLRGSGTLFGTYAGGFLGDQRLGRFGYLVGSHMG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQ 203
           SW GGR+GLM+YDV NGVHF  +
Sbjct: 181 SWIGGRMGLMLYDVANGVHFFYE 203


>gi|326501782|dbj|BAK06383.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524454|dbj|BAK00610.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 261

 Score =  315 bits (808), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 143/206 (69%), Positives = 182/206 (88%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RVQ LLFV G++ALS+TAEKCR+LVG++A+S+SG+FT +NCFDMGSG++AC  K
Sbjct: 2   LDIQKRRVQLLLFVTGVLALSMTAEKCRELVGKEAASKSGQFTFMNCFDMGSGSLACAGK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY  NIR+AH+ER R  A+EKA+ DAL++GLS  +AAKQAQK GAKA K+A RQAK
Sbjct: 62  EGVKLYVNNIRSAHMERVRQRALEKALADALTEGLSPAEAAKQAQKVGAKATKVAARQAK 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII++GWDFFEA+Y+GG++TEGF+RGTGTLFG YAGGF GEERLG+ GYL GS LG
Sbjct: 122 RILGPIISSGWDFFEAMYFGGSMTEGFLRGTGTLFGTYAGGFHGEERLGKLGYLAGSQLG 181

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQ 206
           SW GGRIGLM+YDV+NG++++L+FV+
Sbjct: 182 SWVGGRIGLMVYDVINGLNYMLEFVR 207


>gi|357112569|ref|XP_003558081.1| PREDICTED: uncharacterized protein LOC100833733 [Brachypodium
           distachyon]
          Length = 254

 Score =  315 bits (808), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 148/238 (62%), Positives = 195/238 (81%), Gaps = 5/238 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RVQ LLF+ G++ALS+TAEK R+LVG++A+S+SG+FT +NCFDMGSG++AC  K
Sbjct: 2   LDIQKRRVQLLLFITGVLALSMTAEKFRELVGKEAASKSGQFTFMNCFDMGSGSLACAGK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY  N+R+AH+E  R  AIEKA+ DA+++GLS  +AAKQAQK GAKA K+A RQAK
Sbjct: 62  EGVKLYVNNLRSAHMEMVRQRAIEKALADAVTEGLSPAEAAKQAQKVGAKAMKVAARQAK 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII++GWDFFEA+Y+GG++TEGF+RGTGTLFG Y GGF GEERLG+ GYL GS LG
Sbjct: 122 RILGPIISSGWDFFEAMYFGGSMTEGFLRGTGTLFGTYVGGFHGEERLGKLGYLAGSQLG 181

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYAN---EAPSF 235
           SW GGRIGLMIYDVVNG++++LQFV+ E+  +  +  E+   +Y D+Y +   E P++
Sbjct: 182 SWVGGRIGLMIYDVVNGLNYMLQFVRPEQYRSSYSSGEDS--EYADNYRSTETEEPTY 237


>gi|116781183|gb|ABK21994.1| unknown [Picea sitchensis]
          Length = 249

 Score =  314 bits (804), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 157/257 (61%), Positives = 195/257 (75%), Gaps = 8/257 (3%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MD   +RVQ +LFV G++ LS+TAEKCRQLVGE+ SS+SGKFT +NCFDMGSGT+AC  K
Sbjct: 1   MDIQNRRVQLVLFVAGLVILSMTAEKCRQLVGEETSSKSGKFTWINCFDMGSGTLACAAK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY YN+RAAHVE  R  AIE A+ +ALS GLS + A KQAQKEGAKAAKLA RQA+
Sbjct: 61  EGVKLYVYNLRAAHVESTRQRAIENALNEALSGGLSVSAATKQAQKEGAKAAKLASRQAR 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPI+++GWDFFEA+YYGG+I EG +RG GTL GAY GGF GE+RLGRFGYLVGS LG
Sbjct: 121 RIIGPILSSGWDFFEALYYGGSIIEGCMRGAGTLVGAYIGGFQGEQRLGRFGYLVGSQLG 180

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSEESEAHDAPVEEKFQDYEDSYANEAPSFWNSEA 240
           SW GGR+GLM YD+++G  +L+  V S   + +++  E  +   ED Y+ +     N E+
Sbjct: 181 SWFGGRVGLMFYDIISGAQYLMH-VASGNQDNYESS-ETSYITSEDMYSQD-----NYES 233

Query: 241 SEDSKASDSSLYENSES 257
           SE S  +   +Y NSE+
Sbjct: 234 SETSYTTSEDMY-NSEN 249


>gi|226510026|ref|NP_001142914.1| uncharacterized protein LOC100275347 [Zea mays]
 gi|194698106|gb|ACF83137.1| unknown [Zea mays]
 gi|195611374|gb|ACG27517.1| hypothetical protein [Zea mays]
 gi|414866453|tpg|DAA45010.1| TPA: hypothetical protein ZEAMMB73_682809 [Zea mays]
          Length = 251

 Score =  310 bits (795), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 150/265 (56%), Positives = 196/265 (73%), Gaps = 24/265 (9%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RV+ LLF+  ++ALS+TAEK R+LVG++A+S+SG+FT +NCFDMGSG++AC  K
Sbjct: 2   LDIQKRRVRLLLFITAVLALSITAEKFRELVGKEAASKSGQFTFMNCFDMGSGSLACTAK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY  NIR+AH+E  R  A+EKA+ DA+++GL+ ++AAKQAQK  AKA KLA RQAK
Sbjct: 62  EGVKLYVNNIRSAHLEMVRQRAMEKALADAVTEGLTPSEAAKQAQKISAKATKLAARQAK 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII+ GWDFFEA+Y+GG++TEGF+RGTGTLFG Y GGF GEER G+ GYL GSHLG
Sbjct: 122 RILGPIISCGWDFFEAMYFGGSMTEGFLRGTGTLFGTYMGGFHGEERFGKLGYLAGSHLG 181

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSE---ESEAHDAPVEEKFQDYEDSYANEAPSFWN 237
           SW GGRIGLMIYD+++G+ F+ Q +Q E    S A D P      +Y +SY N+      
Sbjct: 182 SWGGGRIGLMIYDIISGLKFMFQSIQPEYESSSYAEDGP------EYAESYTNQ------ 229

Query: 238 SEASEDSKASDSSLYENSESETYEN 262
                     DS+ YE SE +  E+
Sbjct: 230 ---------EDSTYYETSEEKQEES 245


>gi|383100960|emb|CCD74504.1| similar to XP_002891958 hypothetical protein [A.lyrata]
           [Arabidopsis halleri subsp. halleri]
          Length = 255

 Score =  307 bits (787), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 152/269 (56%), Positives = 200/269 (74%), Gaps = 24/269 (8%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF ++RVQF+LF IG+IALS+TAEKCR+LVG++A+S+SG+FT LNCFDM SGT+AC VK
Sbjct: 1   MDFRRRRVQFILFAIGLIALSMTAEKCRELVGQEAASKSGQFTFLNCFDMSSGTLACAVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           E              E+ARNVAIEKA+ DAL  G+ + +AAK+ Q+ G KAAKLA RQAK
Sbjct: 61  E--------------EKARNVAIEKALHDALVNGMPAKEAAKEVQRAGEKAAKLASRQAK 106

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPI+AAGWDFFEA+Y+GGT+TEGF+RG+GT+ GAY+GG++GE+R GRFGYLVGS LG
Sbjct: 107 RIIGPIVAAGWDFFEALYFGGTLTEGFLRGSGTMIGAYSGGYVGEQRFGRFGYLVGSTLG 166

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQS----EESEAHDAPVEEK-FQDYEDSYANEAPSF 235
           +W G R+GLM+YDVVNGV+F L+  QS    ++   +++P ++  ++  ED    E+P  
Sbjct: 167 NWVGARVGLMVYDVVNGVNFFLETYQSGEIYKDQSTYESPEDQSTYESREDQSIYESP-- 224

Query: 236 WNSEASEDSKASDSSLYENSESE-TYENS 263
              + S D  + D S Y  SE + TYE S
Sbjct: 225 --EDQSTDESSEDRSTYVTSEDQSTYETS 251


>gi|388514183|gb|AFK45153.1| unknown [Lotus japonicus]
          Length = 186

 Score =  307 bits (786), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 146/186 (78%), Positives = 164/186 (88%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           MDF  +R Q LL V  I+ LS+TAE+C+QLVGE+ SSQSGKFTILNCFDMGSGTVACGVK
Sbjct: 1   MDFQNRRFQVLLAVAAIVVLSITAERCQQLVGEEGSSQSGKFTILNCFDMGSGTVACGVK 60

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLYFYNIR+AHVERAR+ AIE A+VDA+SQG+S  D+A   QKE  KAAKLA RQAK
Sbjct: 61  EGVKLYFYNIRSAHVERARHRAIESALVDAVSQGMSPKDSATYVQKESKKAAKLASRQAK 120

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFGA  GGFLGE++LGRFGYLVGSHLG
Sbjct: 121 RIIGPIISSGWDFFEAIYYGGTVTEGFLRGTGTLFGACGGGFLGEQKLGRFGYLVGSHLG 180

Query: 181 SWAGGR 186
           SW GGR
Sbjct: 181 SWVGGR 186


>gi|413955948|gb|AFW88597.1| hypothetical protein ZEAMMB73_190055 [Zea mays]
          Length = 252

 Score =  290 bits (741), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 141/249 (56%), Positives = 189/249 (75%), Gaps = 7/249 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RV+ LLF+ GI+ALS+TAEK R+LVG++A+S+SG+ T +NCFD GSG++AC  K
Sbjct: 2   LDMQKRRVRLLLFITGILALSMTAEKSRELVGKEAASKSGQSTFMNCFDKGSGSLACTSK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY  NIR+AH+E  R  A+ KA+ DA+++GL+ ++AAKQAQK  AKA K+A RQA 
Sbjct: 62  EGVKLYVNNIRSAHLEMVRQRAMGKALADAVAEGLTPSEAAKQAQKVSAKATKIAARQAN 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII+ GWDFFEA+Y GG++ EGF+RGTGTLFG Y GGF GEER G+ GYL GSH+G
Sbjct: 122 RILGPIISCGWDFFEAMYSGGSMMEGFLRGTGTLFGTYVGGFHGEERFGKLGYLAGSHVG 181

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSE-ESEAHDAPVEEKFQDYEDSYAN-EAPSFW-- 236
           SW GGRIGLMIYDV++G+ ++   VQ + ES ++     E   +Y   Y N E P+++  
Sbjct: 182 SWGGGRIGLMIYDVISGLKYMFMSVQPKYESSSY---ASEDGPEYAKRYTNQEEPTYYEP 238

Query: 237 NSEASEDSK 245
           + E  E+SK
Sbjct: 239 SKEKQEESK 247


>gi|297600810|ref|NP_001049892.2| Os03g0307000 [Oryza sativa Japonica Group]
 gi|108707740|gb|ABF95535.1| expressed protein [Oryza sativa Japonica Group]
 gi|215769254|dbj|BAH01483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218192660|gb|EEC75087.1| hypothetical protein OsI_11239 [Oryza sativa Indica Group]
 gi|222624783|gb|EEE58915.1| hypothetical protein OsJ_10562 [Oryza sativa Japonica Group]
 gi|255674449|dbj|BAF11806.2| Os03g0307000 [Oryza sativa Japonica Group]
          Length = 254

 Score =  290 bits (741), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 152/253 (60%), Positives = 204/253 (80%), Gaps = 1/253 (0%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RVQ LLF++G++ALS+TAEK R+LVG++ +S+SG+FT +NCFDMGSG++AC VK
Sbjct: 2   LDIQKRRVQLLLFIVGVLALSMTAEKFRELVGKEEASKSGQFTFMNCFDMGSGSLACAVK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EG+KLY YN++ AH ER R+ AIEKA+ DA+++GLS+ +AAKQAQK GAKAAK+A RQAK
Sbjct: 62  EGIKLYVYNLQTAHTERVRHRAIEKALADAVTEGLSAAEAAKQAQKVGAKAAKVAARQAK 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII++GWDFFEA+Y+GG++TEGF+RGTGTLFG Y GGF GEERLGRFGYL GSHLG
Sbjct: 122 RILGPIISSGWDFFEAMYFGGSMTEGFLRGTGTLFGTYVGGFHGEERLGRFGYLTGSHLG 181

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSE-ESEAHDAPVEEKFQDYEDSYANEAPSFWNSE 239
           SW GGRIGLMIYDV+NG+ ++LQFV+ E E+ A+ +    ++     S   E P+++ + 
Sbjct: 182 SWVGGRIGLMIYDVINGLKYMLQFVKPEYEASAYYSKESTEYAYSYRSGEREEPTYYETS 241

Query: 240 ASEDSKASDSSLY 252
                ++   SL+
Sbjct: 242 EENQEESQGFSLF 254


>gi|226494279|ref|NP_001144331.1| uncharacterized protein LOC100277228 precursor [Zea mays]
 gi|195640228|gb|ACG39582.1| hypothetical protein [Zea mays]
          Length = 252

 Score =  288 bits (738), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 140/249 (56%), Positives = 188/249 (75%), Gaps = 7/249 (2%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RV+ LLF+ GI+ALS+TAEK R+LVG++A+S+SG+ T +NCFD GSG++AC  K
Sbjct: 2   LDMQKRRVRLLLFITGILALSMTAEKSRELVGKEAASKSGQSTFMNCFDKGSGSLACTSK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY  NIR+AH+E  R   + KA+ DA+++GL+ ++AAKQAQK  AKA K+A RQA 
Sbjct: 62  EGVKLYVNNIRSAHLEMVRQRPMGKALADAVAEGLTPSEAAKQAQKVSAKATKIAARQAN 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII+ GWDFFEA+Y GG++ EGF+RGTGTLFG Y GGF GEER G+ GYL GSH+G
Sbjct: 122 RILGPIISCGWDFFEAMYSGGSMMEGFLRGTGTLFGTYVGGFHGEERFGKLGYLAGSHVG 181

Query: 181 SWAGGRIGLMIYDVVNGVHFLLQFVQSE-ESEAHDAPVEEKFQDYEDSYAN-EAPSFW-- 236
           SW GGRIGLMIYDV++G+ ++   VQ + ES ++     E   +Y   Y N E P+++  
Sbjct: 182 SWGGGRIGLMIYDVISGLKYMFMSVQPKYESSSY---ASEDGPEYAKRYTNQEEPTYYEP 238

Query: 237 NSEASEDSK 245
           + E  E+SK
Sbjct: 239 SKEKQEESK 247


>gi|326529789|dbj|BAK04841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 242

 Score =  286 bits (733), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 130/187 (69%), Positives = 164/187 (87%)

Query: 20  LSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVKEGVKLYFYNIRAAHVERAR 79
           L   AEKCR+LVG++A+S+SG+FT +NCFDMGSG++AC  KEGVKLY  NIR+AH+ER R
Sbjct: 2   LQFAAEKCRELVGKEAASKSGQFTFMNCFDMGSGSLACAGKEGVKLYVNNIRSAHMERVR 61

Query: 80  NVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIGPIIAAGWDFFEAIYY 139
             A+EKA+ DAL++GLS  +AAKQAQK GAKA K+A RQAKRI+GPII++GWDFFEA+Y+
Sbjct: 62  QRALEKALADALTEGLSPAEAAKQAQKVGAKATKVAARQAKRILGPIISSGWDFFEAMYF 121

Query: 140 GGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSWAGGRIGLMIYDVVNGVH 199
           GG++TEGF+RGTGTLFG YAGGF GEERLG+ GYL GS LGSW GGRIGLM+YDV+NG++
Sbjct: 122 GGSMTEGFLRGTGTLFGTYAGGFHGEERLGKLGYLAGSQLGSWVGGRIGLMVYDVINGLN 181

Query: 200 FLLQFVQ 206
           ++L+FV+
Sbjct: 182 YMLEFVR 188


>gi|147778257|emb|CAN65139.1| hypothetical protein VITISV_020492 [Vitis vinifera]
          Length = 195

 Score =  278 bits (710), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 135/177 (76%), Positives = 154/177 (87%), Gaps = 1/177 (0%)

Query: 50  MGSGTVACGVKEGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGA 109
           M SGT+AC VKEGVKLYFYNIRA HVE+AR+ AIE A+ DAL+QGL++ DAAK AQKEGA
Sbjct: 1   MSSGTLACTVKEGVKLYFYNIRAVHVEKARHHAIEGALSDALTQGLNAKDAAKHAQKEGA 60

Query: 110 KAAKLAKRQAKRIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLG 169
           KAAKLA RQAKRIIGPII++GWDFFEAIYYGGT+TEGF+RGTGTLFGAY GGFLGE+RLG
Sbjct: 61  KAAKLATRQAKRIIGPIISSGWDFFEAIYYGGTLTEGFLRGTGTLFGAYTGGFLGEQRLG 120

Query: 170 RFGYLVGSHLGSWAGGRIGLMIYDVVNGVHFLLQFVQSEESEA-HDAPVEEKFQDYE 225
           RFGYLVGSHLGSW GGRIGLM+YDV NGV FLLQ VQ EE+     + ++E+  D+E
Sbjct: 121 RFGYLVGSHLGSWVGGRIGLMVYDVANGVQFLLQSVQPEETPMDMSSEMKEENNDWE 177


>gi|242041173|ref|XP_002467981.1| hypothetical protein SORBIDRAFT_01g037490 [Sorghum bicolor]
 gi|241921835|gb|EER94979.1| hypothetical protein SORBIDRAFT_01g037490 [Sorghum bicolor]
          Length = 218

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 126/192 (65%), Positives = 160/192 (83%)

Query: 1   MDFGKKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVK 60
           +D  K+RV+ LL + G++ALS+TAEK R+LVG++A+S+SG+FT +NCFDMGSG++AC  K
Sbjct: 2   LDIQKRRVRLLLLITGVVALSMTAEKFRELVGKEAASKSGQFTFMNCFDMGSGSLACTAK 61

Query: 61  EGVKLYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAK 120
           EGVKLY  NIR+AH+E  R  A+EKA+ DA++QGL+  +AAKQAQK  AKA K+A RQA 
Sbjct: 62  EGVKLYVNNIRSAHLEMVRQRAMEKALADAVTQGLTPGEAAKQAQKVSAKATKVAARQAN 121

Query: 121 RIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLG 180
           RI+GPII+ GWDFFEA+Y+GG++TEGF+RGTGTLFG Y GGF GEER G+ GYL GS +G
Sbjct: 122 RILGPIISCGWDFFEAMYFGGSMTEGFLRGTGTLFGTYVGGFHGEERFGKLGYLAGSQIG 181

Query: 181 SWAGGRIGLMIY 192
           SW GGRIGLMIY
Sbjct: 182 SWGGGRIGLMIY 193


>gi|296086365|emb|CBI31954.3| unnamed protein product [Vitis vinifera]
          Length = 591

 Score =  237 bits (605), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 118/201 (58%), Positives = 151/201 (75%), Gaps = 2/201 (0%)

Query: 11  LLFVIGIIALSLTAEKCRQLVGEDASSQS--GKFTILNCFDMGSGTVACGVKEGVKLYFY 68
           LLF+  II LS+ AEK R LVGE+ SS+S  GKFT+ NCFD+G+GT+AC VKE V++Y +
Sbjct: 67  LLFLASIILLSIAAEKSRLLVGEENSSKSWTGKFTVFNCFDVGTGTIACVVKEVVRIYLH 126

Query: 69  NIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIGPIIA 128
            IRA HV + R  A  +A+ +  SQGLS  D++KQA ++G  AA+ A  QAK I+G +I+
Sbjct: 127 YIRAVHVRKVRAEAETEALNEGFSQGLSYEDSSKQACEKGDAAARRAYLQAKHIMGHLIS 186

Query: 129 AGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSWAGGRIG 188
           +GWD FE +Y GGTITEG IRG+GTL G++AGG++GE+RLG  G LVGSH+GSW GGRIG
Sbjct: 187 SGWDVFETLYVGGTITEGLIRGSGTLLGSHAGGYIGEQRLGWIGSLVGSHMGSWVGGRIG 246

Query: 189 LMIYDVVNGVHFLLQFVQSEE 209
           LM YDV NGV +LLQ V  E 
Sbjct: 247 LMAYDVGNGVQYLLQLVGKEH 267


>gi|225425678|ref|XP_002269829.1| PREDICTED: uncharacterized protein LOC100246544 [Vitis vinifera]
          Length = 223

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 118/201 (58%), Positives = 151/201 (75%), Gaps = 2/201 (0%)

Query: 11  LLFVIGIIALSLTAEKCRQLVGEDASSQS--GKFTILNCFDMGSGTVACGVKEGVKLYFY 68
           LLF+  II LS+ AEK R LVGE+ SS+S  GKFT+ NCFD+G+GT+AC VKE V++Y +
Sbjct: 10  LLFLASIILLSIAAEKSRLLVGEENSSKSWTGKFTVFNCFDVGTGTIACVVKEVVRIYLH 69

Query: 69  NIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIGPIIA 128
            IRA HV + R  A  +A+ +  SQGLS  D++KQA ++G  AA+ A  QAK I+G +I+
Sbjct: 70  YIRAVHVRKVRAEAETEALNEGFSQGLSYEDSSKQACEKGDAAARRAYLQAKHIMGHLIS 129

Query: 129 AGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSWAGGRIG 188
           +GWD FE +Y GGTITEG IRG+GTL G++AGG++GE+RLG  G LVGSH+GSW GGRIG
Sbjct: 130 SGWDVFETLYVGGTITEGLIRGSGTLLGSHAGGYIGEQRLGWIGSLVGSHMGSWVGGRIG 189

Query: 189 LMIYDVVNGVHFLLQFVQSEE 209
           LM YDV NGV +LLQ V  E 
Sbjct: 190 LMAYDVGNGVQYLLQLVGKEH 210


>gi|302769498|ref|XP_002968168.1| hypothetical protein SELMODRAFT_89585 [Selaginella moellendorffii]
 gi|302773962|ref|XP_002970398.1| hypothetical protein SELMODRAFT_93221 [Selaginella moellendorffii]
 gi|300161914|gb|EFJ28528.1| hypothetical protein SELMODRAFT_93221 [Selaginella moellendorffii]
 gi|300163812|gb|EFJ30422.1| hypothetical protein SELMODRAFT_89585 [Selaginella moellendorffii]
          Length = 205

 Score =  220 bits (560), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 110/181 (60%), Positives = 141/181 (77%)

Query: 21  SLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVKEGVKLYFYNIRAAHVERARN 80
           S TAEK RQ+VGE+ SSQSGKF+ ++CFD+GSGT+AC +KEGVKLY YNIR+  VE++R+
Sbjct: 1   SSTAEKSRQIVGEERSSQSGKFSWMDCFDLGSGTLACSIKEGVKLYTYNIRSGIVEKSRH 60

Query: 81  VAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIGPIIAAGWDFFEAIYYG 140
            A E A+ DAL +GLS  +A++ A   G KAAK++ R+A+RI GP+IAA WDFFEAIYYG
Sbjct: 61  KAYEIALEDALHEGLSIQEASRAAAVAGKKAAKISSRKARRITGPVIAAAWDFFEAIYYG 120

Query: 141 GTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSWAGGRIGLMIYDVVNGVHF 200
           G   E  +RG GT+ GA+ GG  GE +LGR GYL+GSHLG+W GGR+GLM+YD+ N   F
Sbjct: 121 GGPIEATMRGAGTMCGAWMGGIEGERKLGRIGYLIGSHLGNWIGGRVGLMLYDIGNAAWF 180

Query: 201 L 201
           L
Sbjct: 181 L 181


>gi|413955946|gb|AFW88595.1| hypothetical protein ZEAMMB73_231594 [Zea mays]
          Length = 166

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 91/164 (55%), Positives = 118/164 (71%), Gaps = 1/164 (0%)

Query: 18  IALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVKEGVKLYFYNIRAAHVER 77
           I      EK R+LVG++A+S+SG+ T +NCFD GS  +A   KEGVKLY  NIR+AH+E 
Sbjct: 4   IGFYFIVEKSRELVGKEAASKSGQSTFMNCFDKGSSRLASTSKEGVKLYVNNIRSAHLEM 63

Query: 78  ARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIGPIIAAGWDFFEAI 137
            R   + KA+ D +++GL+S +A KQA K  AKA  +A  QA RI+GPII+ GWDFFEA+
Sbjct: 64  VRQRTMGKALADDVAEGLTS-EATKQAHKVCAKATNIAAPQANRILGPIISCGWDFFEAM 122

Query: 138 YYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGS 181
           Y GG++  GF+RGTG LFG Y GGFLGEE+ G+ GYL GSH+ S
Sbjct: 123 YSGGSMMGGFLRGTGALFGTYVGGFLGEEQFGKLGYLAGSHVES 166


>gi|168046058|ref|XP_001775492.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673162|gb|EDQ59689.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 279

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 129/202 (63%)

Query: 5   KKRVQFLLFVIGIIALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVKEGVK 64
           +  V +  F++    L L A++ R+ +G++ +S++GK    +C D+G G++ C VK+G K
Sbjct: 16  RNLVLWTAFILVFYLLHLGAKEARRQLGDEIASKTGKDDFGDCLDLGFGSLTCAVKQGSK 75

Query: 65  LYFYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIG 124
           LY  NIRA+ VE ++  A + A+   +S GL   +AA++AQ     AAK  K+QA+RI G
Sbjct: 76  LYTNNIRASIVEHSKQRAYQNALQALISDGLGMTEAARKAQSIADLAAKDTKKQARRIFG 135

Query: 125 PIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSWAG 184
           P+ AA WD  E +YYGG+  E  +R TGTL G + GG LGE RLGR GYL+GS +GSWAG
Sbjct: 136 PLFAAVWDGLEVLYYGGSFAEVSMRATGTLCGTWYGGVLGENRLGRVGYLLGSQVGSWAG 195

Query: 185 GRIGLMIYDVVNGVHFLLQFVQ 206
            R+ LM YD++  +  +   V+
Sbjct: 196 SRVALMTYDIIKAIQLITSEVK 217


>gi|58038378|ref|YP_190347.1| putative pilin accessory protein [Gluconobacter oxydans 621H]
 gi|410945172|ref|ZP_11376913.1| putative pilin accessory protein [Gluconobacter frateurii NBRC
           101659]
 gi|58000792|gb|AAW59691.1| Putative pilin accessory protein [Gluconobacter oxydans 621H]
          Length = 394

 Score = 40.4 bits (93), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 49/116 (42%), Gaps = 13/116 (11%)

Query: 50  MGSGTVACGVKEGVKLY---FYNIRAAHVERARNVAIEKAVVDALSQGLSSNDAAKQAQK 106
           +G+G VACGV  G+ LY    +NIR A  +R + + +E   +  L+    +         
Sbjct: 164 IGAGIVACGVGIGLSLYTWHLHNIRLA-ADRVQQMRVESDRLKRLANAQVTTVTPDHWIN 222

Query: 107 EGAKAAKLAKRQAKRIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGF 162
              K AK           P+ AAGW   E +     +   ++R  GTL  A  G +
Sbjct: 223 ACLKTAKAV---------PLFAAGWAQMEWVCNSTGVEITWLRAGGTLADAPPGKY 269


>gi|424776672|ref|ZP_18203651.1| acetoin catabolism regulatory protein [Alcaligenes sp. HPC1271]
 gi|422888204|gb|EKU30594.1| acetoin catabolism regulatory protein [Alcaligenes sp. HPC1271]
          Length = 651

 Score = 40.4 bits (93), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 15/137 (10%)

Query: 60  KEGVKLYFYNIRAAHVERARNVAIEKAVVDA-LSQ---GLSSNDAAKQAQKEGAKAAKLA 115
           ++G  L+ + I    V   R+V ++ A VDA L Q   GL+  DAA Q+     KAA+LA
Sbjct: 300 RDGSVLFAHAIEPRRVLSNRSVGMDGAAVDAALPQALRGLAGTDAAMQSML--LKAARLA 357

Query: 116 KRQAKRIIGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGR--FGY 173
            R+   ++     +G ++     +  +       G G  F A     L E  +    FGY
Sbjct: 358 TREMPVLLQGETGSGKEYLARAIHTAS-------GRGGNFVAVNCAALPESLIESELFGY 410

Query: 174 LVGSHLGSWAGGRIGLM 190
           L G++ G  A GR GL+
Sbjct: 411 LAGTYTGGAARGRTGLI 427


>gi|303230417|ref|ZP_07317178.1| thiamine-phosphate diphosphorylase [Veillonella atypica
           ACS-049-V-Sch6]
 gi|302514956|gb|EFL56937.1| thiamine-phosphate diphosphorylase [Veillonella atypica
           ACS-049-V-Sch6]
          Length = 505

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 11/90 (12%)

Query: 85  KAVVDALSQGLSSNDAAKQAQKEGAK---AAKLAKRQAKRIIGPIIAAGWDFFE------ 135
           KAV DALS   ++N++      +GA+   +A +A R A+ I  P++A G + +       
Sbjct: 137 KAVYDALSHDDTANNSTAGKGVDGAQVEDSAVIAYRLARLINCPVVATGAEDYVSDGTRV 196

Query: 136 -AIYYGGTITEGFIRGTGTLFGAYAGGFLG 164
            A+ +G ++    + GTG L GA    FLG
Sbjct: 197 FAVPHGHSLMTA-VTGTGCLLGAVLAAFLG 225


>gi|393760843|ref|ZP_10349645.1| acetoin catabolism regulatory protein [Alcaligenes faecalis subsp.
           faecalis NCIB 8687]
 gi|393160945|gb|EJC61017.1| acetoin catabolism regulatory protein [Alcaligenes faecalis subsp.
           faecalis NCIB 8687]
          Length = 650

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 17/138 (12%)

Query: 60  KEGVKLYFYNIRAAHVERARNVAIEKAVVDA-LSQ---GLSSNDAAKQAQKEGAKAAKLA 115
           ++G  L+ + I    V   R+V ++ A VDA L Q   GL+  DAA Q+     KAA+LA
Sbjct: 300 RDGSVLFAHAIEPRRVLNNRSVGMDGAAVDAALPQALRGLAGTDAAMQSML--LKAARLA 357

Query: 116 KRQAKRIIGPIIAAGWDFF-EAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGR--FG 172
            R+   ++     +G ++   AI+     T    RG+   F A     L E  +    FG
Sbjct: 358 TREMPVLLQGETGSGKEYLARAIH-----TASGRRGS---FVAVNCAALPESLIESELFG 409

Query: 173 YLVGSHLGSWAGGRIGLM 190
           YL G++ G  A GR GL+
Sbjct: 410 YLAGTYTGGAARGRTGLI 427


>gi|228473441|ref|ZP_04058194.1| collagenase [Capnocytophaga gingivalis ATCC 33624]
 gi|228275048|gb|EEK13851.1| collagenase [Capnocytophaga gingivalis ATCC 33624]
          Length = 414

 Score = 37.4 bits (85), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 43/182 (23%), Positives = 67/182 (36%), Gaps = 22/182 (12%)

Query: 18  IALSLTAEKCRQLVGEDASSQSGKFTILNCFDMGSGTVACGVKEGVKLYFYNIRAAHVER 77
           ++L    + C Q+  E     SG    +  F  G+  +A   K  + L+ +N  A     
Sbjct: 145 LSLRQIKKICEQIEREQVKGPSGNLVEVEIFGHGALCMAVSGKCYLSLHSHNSSANRGAC 204

Query: 78  ARNVAIEKAVVDALS--------------QGLSSNDAAKQAQKEGAKAAKLAKR-QAKRI 122
            +N   +  V+D  S              + L + D   Q    GAK  K+  R +A   
Sbjct: 205 KQNCRKKYTVIDQESGFEIELDNEYMMSPKDLCTLDFLDQVIDTGAKVLKIEGRGRAPEY 264

Query: 123 IGPIIAAGWDFFEAIYYGGTITEGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSW 182
           +  +I    +  +A Y G    E   +    L   Y  GF G       GY +G  LG W
Sbjct: 265 VATVIRTYREAIDAYYEGTYTKEKVEKWMEALATVYNRGFWG-------GYYLGQKLGEW 317

Query: 183 AG 184
           +G
Sbjct: 318 SG 319


>gi|390630689|ref|ZP_10258666.1| Putative uncharacterized protein [Weissella confusa LBAE C39-2]
 gi|390484032|emb|CCF31014.1| Putative uncharacterized protein [Weissella confusa LBAE C39-2]
          Length = 1358

 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 51/128 (39%), Gaps = 17/128 (13%)

Query: 78  ARNVAIEKAVVDALSQGLSSNDAAKQAQKEGAKAAKLAKRQAKRIIGPIIAAGWDFFEAI 137
           ARN    K+    L  G+ S     QA K G   A   K  +   +G  +  G      +
Sbjct: 619 ARNTGFFKSTAGKLLSGIGSAGTTVQATKFGGALATATKGLSS--VGKFLGKGMPLMNGL 676

Query: 138 YYG------------GTIT--EGFIRGTGTLFGAYAGGFLGEERLGRFGYLVGSHLGSWA 183
           + G            G++   +G  +G GT  GA  GG LG   LG  G + GS  G W 
Sbjct: 677 FAGVDVMTTMASTKTGSLARHKGVGQGVGTGIGATVGGALGS-FLGPLGTIGGSMAGGWL 735

Query: 184 GGRIGLMI 191
           GG+ G  I
Sbjct: 736 GGKAGSWI 743


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.134    0.385 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,047,141,623
Number of Sequences: 23463169
Number of extensions: 175572553
Number of successful extensions: 576603
Number of sequences better than 100.0: 110
Number of HSP's better than 100.0 without gapping: 40
Number of HSP's successfully gapped in prelim test: 70
Number of HSP's that attempted gapping in prelim test: 576251
Number of HSP's gapped (non-prelim): 304
length of query: 265
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 125
effective length of database: 9,074,351,707
effective search space: 1134293963375
effective search space used: 1134293963375
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 75 (33.5 bits)