BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 020825
         (321 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225429560|ref|XP_002279873.1| PREDICTED: uncharacterized protein LOC100255893 [Vitis vinifera]
 gi|296081666|emb|CBI20671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  436 bits (1122), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 207/321 (64%), Positives = 256/321 (79%), Gaps = 1/321 (0%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS+ LVLYGDGLARF+ PSH HLH LAS+  CGFLSL  +PPSE+E+ERIVREFA 
Sbjct: 1   MADKPSRGLVLYGDGLARFINPSHTHLHDLASRGICGFLSLSNSPPSETEDERIVREFAQ 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+D+ +  L       T L+ ++KS   TIS+RFMG++AA++  NS L SFG  LGF  +
Sbjct: 61  LLDSCEASLGVNVGGATNLECQQKSLMPTISERFMGMRAAIVANNSSLTSFGGTLGFTVM 120

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           Q +ELI+ +  LS  P+DVVASE LKLLG Q GK  E SQFDL+ +HIGAGEK N  + +
Sbjct: 121 QTDELIKNNHPLSEPPVDVVASESLKLLGFQEGKTLETSQFDLIFMHIGAGEKANG-QTE 179

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
            +A+D+EYIN LV  I+Q A+P +E+GSRLHLS+V+SYG V + D+ +LSVL++ +E +S
Sbjct: 180 IIANDVEYINGLVGRIMQTAEPGSEIGSRLHLSLVMSYGAVSKDDDPSLSVLVTKNENNS 239

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
           DLS+L PRQSYTMKG  PR+++RHHCPMLI+QWQYAVTRKDMA+TFSF+DFKEHGG LTI
Sbjct: 240 DLSSLSPRQSYTMKGGNPRDNIRHHCPMLIAQWQYAVTRKDMAKTFSFEDFKEHGGNLTI 299

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRFLHEVAFKLWKAPKYGA
Sbjct: 300 PADRFLHEVAFKLWKAPKYGA 320


>gi|224092198|ref|XP_002309505.1| predicted protein [Populus trichocarpa]
 gi|222855481|gb|EEE93028.1| predicted protein [Populus trichocarpa]
          Length = 309

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 212/321 (66%), Positives = 241/321 (75%), Gaps = 12/321 (3%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADK S+ LV+YGDGLA F+ PSHAHLHSLASKA CGFL+LP APPSESE+ERIVREFA 
Sbjct: 1   MADKASRGLVIYGDGLASFINPSHAHLHSLASKAFCGFLTLPNAPPSESEDERIVREFAY 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+DA + Y                +   TIS+RFMG+KAA+IT N  LKSFG KLG    
Sbjct: 61  LLDACEAY------------QNVSTSIPTISERFMGMKAAIITNNPGLKSFGGKLGLTVF 108

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
             N+L     SLSGS  D V  ELLKLLG Q GK  E SQFDLV VH+GAGE+ N +  K
Sbjct: 109 PFNDLKGNEFSLSGSSTDFVTFELLKLLGFQEGKTLETSQFDLVFVHVGAGERVNAEGHK 168

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
            +A D+EYI+ALV  I+++AQP +E+GSRLHLS+V+SYG V E D  +LS+L S DE   
Sbjct: 169 TIAIDVEYIDALVDGIMRIAQPGSEIGSRLHLSLVMSYGYVTEGDGRDLSILTSKDEMDP 228

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
            LS LFP QSYTMKGE PRND+R+HCPMLISQWQYAVTR DMAETFSFKDFKEHGG L I
Sbjct: 229 ALSKLFPLQSYTMKGEKPRNDIRYHCPMLISQWQYAVTRVDMAETFSFKDFKEHGGNLVI 288

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRFLHEVAFKLWKAPKYGA
Sbjct: 289 PADRFLHEVAFKLWKAPKYGA 309


>gi|255550804|ref|XP_002516450.1| conserved hypothetical protein [Ricinus communis]
 gi|223544270|gb|EEF45791.1| conserved hypothetical protein [Ricinus communis]
          Length = 313

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 196/321 (61%), Positives = 237/321 (73%), Gaps = 8/321 (2%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS+ALVLYGDGLA+ ++ SH H+HSLASKA+CGFLSLP AP SESE ERIVREFA 
Sbjct: 1   MADKPSRALVLYGDGLAQLIDESHTHIHSLASKASCGFLSLPNAPSSESENERIVREFAH 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+DA   Y         E+  E+ +   TIS RFM +KAA+IT N  L+SFG KLGF  L
Sbjct: 61  LVDAYDAY--------QEMSNEKCTLIPTISQRFMEMKAAIITDNPSLQSFGSKLGFTVL 112

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
               L  T  S   S  + + SELLKLLG Q GK  E SQFDLV++H GA E  N D  +
Sbjct: 113 SFKCLNGTHKSHYESSFEDLTSELLKLLGFQEGKTIEASQFDLVILHTGASELLNSDSSE 172

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
           A   D  Y+N++V+  +  A+P +E+GSRLHLS+V+SYG   + D++NLSVL+S DE +S
Sbjct: 173 ATECDAHYLNSMVKETMYNAKPGSEIGSRLHLSLVMSYGNASKIDSTNLSVLVSKDEMNS 232

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
           DLS L PRQSYTMKGE  R++VRHHCPMLI+QWQ+AVTRKD AET+S+K+FKEHG  L I
Sbjct: 233 DLSVLVPRQSYTMKGEKLRDNVRHHCPMLIAQWQHAVTRKDTAETYSYKEFKEHGVNLVI 292

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRF+HEVAFKLWKAPKYGA
Sbjct: 293 PADRFIHEVAFKLWKAPKYGA 313


>gi|356564061|ref|XP_003550275.1| PREDICTED: uncharacterized protein LOC100780200 [Glycine max]
          Length = 308

 Score =  392 bits (1008), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 196/321 (61%), Positives = 242/321 (75%), Gaps = 13/321 (4%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS++LVL+GDGLAR ++ SH+H HSLAS ++CGFLSLP +PPSESE+ER VREFAV
Sbjct: 1   MADKPSRSLVLFGDGLARSIDSSHSHFHSLASLSSCGFLSLPNSPPSESEDERTVREFAV 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+DA   Y S +    ++   +E  P QT+ DRF+G+KAA++T NS LKSF  KLGF+ L
Sbjct: 61  LLDACHTY-SNMKGQISDRDNQEDPPKQTLPDRFIGMKAAILTNNSSLKSFSAKLGFSVL 119

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +L+EL+       G   +VVA ELLK LG + GK+ +   FDLV  HIGAGE+      K
Sbjct: 120 KLDELV------GGGSAEVVALELLKFLGFKEGKVLDSDHFDLVFFHIGAGEQ------K 167

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
            VA D+ Y++ALV  ++  AQP +++ SRLHLSVV+SYG VLEAD+S  SVL  +DEK+S
Sbjct: 168 VVAADVGYMDALVGGVMSQAQPGSDISSRLHLSVVMSYGNVLEADDSKFSVLKRVDEKNS 227

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
            LS L+P QSYTMKG  PR DVRH+ PMLI+QWQ AVTRKD AE FSF+DF EHGG LTI
Sbjct: 228 HLSMLYPLQSYTMKGGIPRKDVRHYSPMLIAQWQSAVTRKDNAERFSFEDFMEHGGNLTI 287

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRFLHE+AFKLWKAPKYGA
Sbjct: 288 PADRFLHEIAFKLWKAPKYGA 308


>gi|255647370|gb|ACU24151.1| unknown [Glycine max]
          Length = 308

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 195/321 (60%), Positives = 242/321 (75%), Gaps = 13/321 (4%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS++LVL+GDGLAR ++ SH+H HSLAS ++CGFLSLP +PPSESE+ER VREFAV
Sbjct: 1   MADKPSRSLVLFGDGLARSIDSSHSHFHSLASLSSCGFLSLPNSPPSESEDERTVREFAV 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+DA   Y S +    ++   +E  P QT+ DRF+G+KAA++T NS LKSF  KLGF+ L
Sbjct: 61  LLDACHTY-SNMKGQISDRDNQEDPPKQTLPDRFIGMKAAILTNNSSLKSFSAKLGFSVL 119

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +L+EL+       G   +VVA ELLK LG + GK+ +   FDLV  HIGAGE+      K
Sbjct: 120 KLDELV------GGGSAEVVALELLKFLGFKEGKVLDSDHFDLVFFHIGAGEQ------K 167

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
            VA D+ Y++ALV  ++  AQP +++ SRLHLSVV+SYG VLEAD+   SVL  +DEK+S
Sbjct: 168 VVAADVGYMDALVGGVMSQAQPGSDISSRLHLSVVMSYGNVLEADDFKFSVLKRVDEKNS 227

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
            LS L+P QSYTMKG  PR DVRH+ PMLI+QWQ+AVTRKD AE FSF+DF EHGG LTI
Sbjct: 228 HLSMLYPLQSYTMKGGIPRKDVRHYSPMLIAQWQFAVTRKDNAERFSFEDFMEHGGNLTI 287

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRFLHE+AFKLWKAPKYGA
Sbjct: 288 PADRFLHEIAFKLWKAPKYGA 308


>gi|449450351|ref|XP_004142926.1| PREDICTED: uncharacterized protein LOC101217985 [Cucumis sativus]
          Length = 315

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 190/321 (59%), Positives = 247/321 (76%), Gaps = 6/321 (1%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS+AL ++GDGLARF++ SH++LH+LAS A+CGFLSLP APPSESEE+R++RE  +
Sbjct: 1   MADKPSRALAVFGDGLARFLDHSHSNLHALASLASCGFLSLPNAPPSESEEKRMIRELEL 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L DA+   +S  G        E  S  ++I +RFMG++AAL+T NS ++S G +LG + L
Sbjct: 61  LFDASDSDVSKNGGGC-----EGSSQKKSIPERFMGMRAALLTNNSVVQSLGSELGISLL 115

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +L+EL++  + L   P+D +ASELLK LG Q GK+++ S+FDL+ VHIG G+K N +K +
Sbjct: 116 RLDELMQM-NHLGLPPVDFLASELLKRLGFQDGKIQDASEFDLLFVHIGVGDKVNGEKDR 174

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
             + +++YI+ALV  ILQ  QP +E+GSRLHLS+++SYG V E D ++LSVL S   K+S
Sbjct: 175 TASDEMKYIDALVGDILQKTQPGSEIGSRLHLSLLMSYGDVFEDDENSLSVLTSNCGKNS 234

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
           DLS LFPRQSYTM GE  RND+RHH PML +QWQY VTRKD AETFSFK+FKEHG  L I
Sbjct: 235 DLSVLFPRQSYTMMGEVQRNDIRHHSPMLAAQWQYGVTRKDKAETFSFKEFKEHGSNLVI 294

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRF+HEVAFKLWKAPKYGA
Sbjct: 295 PADRFIHEVAFKLWKAPKYGA 315


>gi|449494427|ref|XP_004159543.1| PREDICTED: uncharacterized LOC101217985 [Cucumis sativus]
          Length = 315

 Score =  390 bits (1001), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 189/321 (58%), Positives = 247/321 (76%), Gaps = 6/321 (1%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS+AL ++GDGLARF++ SH++LH+LAS A+CGFLSLP APPSESEE+R++RE  +
Sbjct: 1   MADKPSRALAVFGDGLARFLDQSHSNLHALASLASCGFLSLPNAPPSESEEKRMIRELEL 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L DA+   ++  G        E  S  ++I +RFMG++AAL+T NS ++S G +LG + L
Sbjct: 61  LFDASDSDVNKNGGGC-----EGSSQKKSIPERFMGMRAALLTNNSVVQSLGSELGISLL 115

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +L+EL++  + L   P+D +ASELLK LG Q GK+++ S+FDL+ VHIG G+K N +K +
Sbjct: 116 RLDELMQM-NHLGLPPVDFLASELLKRLGFQDGKIQDASEFDLLFVHIGVGDKVNGEKDR 174

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
             + +++YI+ALV  ILQ  QP +E+GSRLHLS+++SYG V E D ++LSVL S   K+S
Sbjct: 175 TASDEMKYIDALVGDILQKTQPGSEIGSRLHLSLLMSYGDVFEDDENSLSVLTSNCGKNS 234

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
           DLS LFPRQSYTM GE  RND+RHH PML +QWQY VTRKD AETFSFK+FKEHG  L I
Sbjct: 235 DLSVLFPRQSYTMMGEVQRNDIRHHSPMLAAQWQYGVTRKDKAETFSFKEFKEHGSNLVI 294

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRF+HEVAFKLWKAPKYGA
Sbjct: 295 PADRFIHEVAFKLWKAPKYGA 315


>gi|356552344|ref|XP_003544528.1| PREDICTED: uncharacterized protein LOC100817041 [Glycine max]
          Length = 307

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 195/321 (60%), Positives = 242/321 (75%), Gaps = 14/321 (4%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS++LVL+GDGLARF++ SH+H HSLAS ++CGFLSLP +PPSESE+ER VRE AV
Sbjct: 1   MADKPSRSLVLFGDGLARFIDSSHSHFHSLASLSSCGFLSLPNSPPSESEDERTVREVAV 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+DA   YL+ +G+  ++   +E SP QT+ DRFMG+KAA++T NS LKSF  KLGF+ L
Sbjct: 61  LLDACPIYLN-MGK-ISDSDNQEDSPKQTLPDRFMGMKAAILTNNSSLKSFSAKLGFSVL 118

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +L+EL+       G   +VVA ELLKLLG Q GK+ +   FDLV  H GAGE+      K
Sbjct: 119 KLDELV------VGGSAEVVALELLKLLGFQEGKVLDSDHFDLVFFHNGAGEQ------K 166

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
            VA D+ Y++ALV  ++   QP +++ SRLHLSVV+SYG +LE D+S  SV   +DEK+S
Sbjct: 167 VVAADMGYMDALVGGVMSQMQPGSDISSRLHLSVVVSYGNILEGDDSKFSVSKRVDEKNS 226

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
            LS L+P QSY MKG  PR DVRH+ PMLI+QWQ AVTRKD AE FSF+DF EHGG LTI
Sbjct: 227 HLSVLYPLQSYAMKGGIPRKDVRHYSPMLIAQWQSAVTRKDNAERFSFEDFMEHGGNLTI 286

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
           PADRFLHE+AFKLWKAPKYGA
Sbjct: 287 PADRFLHEIAFKLWKAPKYGA 307


>gi|388507454|gb|AFK41793.1| unknown [Lotus japonicus]
          Length = 304

 Score =  362 bits (929), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 190/322 (59%), Positives = 234/322 (72%), Gaps = 19/322 (5%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS++LVL+GDGLAR ++PSH+HLHSLAS ++CGFLSLP +PPSESE++R VREFAV
Sbjct: 1   MADKPSRSLVLFGDGLARSIDPSHSHLHSLASLSSCGFLSLPNSPPSESEDQRTVREFAV 60

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSP-FQTISDRFMGLKAALITTNSRLKSFGDKLGFAT 119
           L+DA       +    ++   ++ SP  QT+ DRFMG+KAA++T +S LKSF  KLGF  
Sbjct: 61  LLDA-----PNVNVQVSDDDSQKDSPEKQTLPDRFMGMKAAILTNHSGLKSFSSKLGFNV 115

Query: 120 LQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKG 179
           LQL+ L +  D+      DVVA ELLKLLG Q GK+++ + FDLV  HIGAGEK      
Sbjct: 116 LQLDALAKDYDA-GLHENDVVALELLKLLGFQEGKVQDSNNFDLVFFHIGAGEK------ 168

Query: 180 KAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKS 239
                ++EYINALV  ++  AQP +++ SRLHLS+V+SYG+V E D S  SV    DEKS
Sbjct: 169 -----NVEYINALVGGLMNQAQPGSDISSRLHLSLVMSYGKVSEDDESKFSVSKRADEKS 223

Query: 240 SDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLT 299
             LS L+P QSY MKG  PR DVR H P+LI+QWQ AVT KD AE F F+DF EHGG LT
Sbjct: 224 Y-LSLLYPLQSYAMKGGFPRKDVRLHSPLLIAQWQDAVTHKDNAERFYFQDFMEHGGNLT 282

Query: 300 IPADRFLHEVAFKLWKAPKYGA 321
           IPADRFLHE+AFKLWKAPKYGA
Sbjct: 283 IPADRFLHEIAFKLWKAPKYGA 304


>gi|357437483|ref|XP_003589017.1| hypothetical protein MTR_1g016410 [Medicago truncatula]
 gi|355478065|gb|AES59268.1| hypothetical protein MTR_1g016410 [Medicago truncatula]
 gi|388492878|gb|AFK34505.1| unknown [Medicago truncatula]
          Length = 306

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 180/327 (55%), Positives = 237/327 (72%), Gaps = 27/327 (8%)

Query: 1   MADKPSKALVLYGDGLARFVEPS--HAHLHSLASKAACGFLSLPIAPPS----ESEEERI 54
           MADKPS++L+LYGDGLARF++PS  H +LHSLAS ++C FL+L  +  +    E+E++RI
Sbjct: 1   MADKPSRSLILYGDGLARFIDPSSSHTNLHSLASLSSCAFLTLSNSNSNFNPSETEDDRI 60

Query: 55  VREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDK 114
           VREF +L+DA+Q +             +E    QT+ DRFMG+KAA++T +S LKSF  K
Sbjct: 61  VREFTLLLDASQIH-------------DE----QTLPDRFMGMKAAIMTNHSGLKSFSTK 103

Query: 115 LGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKT 174
           LGF+ +QL+ L++   S S    D+VA E+LKLLG Q GK+ + SQFDLV +H+GAGEK 
Sbjct: 104 LGFSVVQLDGLVKQHASESQDS-DIVAVEILKLLGFQEGKVLDNSQFDLVFIHVGAGEKD 162

Query: 175 NDDKGKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLIS 234
           N  +  A   D+E+++ALV  I++ AQP ++VGSRLHLSVV+SYG+VLE D S  SV   
Sbjct: 163 NGSEQNA---DVEFVDALVGAIMRQAQPGSDVGSRLHLSVVMSYGKVLEDDESKFSVSKK 219

Query: 235 IDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEH 294
           +DEK S    L+P QSY MKG  PR DVR H PMLI+Q Q+AVTRKD  + F+F+DF ++
Sbjct: 220 VDEKDSCFLTLYPLQSYAMKGGVPRKDVRLHSPMLIAQLQHAVTRKDNVQRFTFQDFMKN 279

Query: 295 GGYLTIPADRFLHEVAFKLWKAPKYGA 321
           GG LTIPADRFLHE+AFK+WKAPKYGA
Sbjct: 280 GGNLTIPADRFLHEIAFKMWKAPKYGA 306


>gi|297807235|ref|XP_002871501.1| hypothetical protein ARALYDRAFT_488018 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317338|gb|EFH47760.1| hypothetical protein ARALYDRAFT_488018 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 307

 Score =  330 bits (847), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 172/322 (53%), Positives = 226/322 (70%), Gaps = 16/322 (4%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS++LVLYGDGLARFV+PS+ H+HSLAS  +CGFLSLP APP E+E ER VREFA 
Sbjct: 1   MADKPSRSLVLYGDGLARFVDPSNTHIHSLASVGSCGFLSLPNAPP-ETENERTVREFAH 59

Query: 61  LIDANQDYLSGIGEHTTELKWE-EKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFAT 119
           L+DA++ Y    G     LK +   +   T+++RFMGLKAAL+T +S L SFG  +G   
Sbjct: 60  LMDASEAYSIASG-----LKPKGNGNDISTLAERFMGLKAALVTDSSTLTSFGKLIGLDV 114

Query: 120 LQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKG 179
           LQL+E+ + SDS    P D  +S+LLKLLG + GK  +VS +D V VHIG       D+ 
Sbjct: 115 LQLSEICQESDSF---PSDATSSKLLKLLGFEGGKCLDVSPYDSVFVHIGV------DQY 165

Query: 180 KAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKS 239
               +++  I++L+  I++MAQP +EV SRLHLS+VLSYG V + D S   +    ++ +
Sbjct: 166 NNGNNNMGIIDSLIGSIMRMAQPGSEVVSRLHLSLVLSYGSVTDKDVSVFPIKTPQEDIN 225

Query: 240 SDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLT 299
              + L PRQSYTM+GE  R+DVRH+CPML++QWQ+AVTRKD+ +T SF+  K+  G L 
Sbjct: 226 PAFAGLVPRQSYTMRGEKTRDDVRHYCPMLVAQWQHAVTRKDLVDTLSFETLKKLCGNLV 285

Query: 300 IPADRFLHEVAFKLWKAPKYGA 321
           IP DRF+HEV FKLWKAPKYGA
Sbjct: 286 IPTDRFIHEVTFKLWKAPKYGA 307


>gi|15239788|ref|NP_196742.1| uncharacterized protein [Arabidopsis thaliana]
 gi|7573395|emb|CAB87699.1| putative protein [Arabidopsis thaliana]
 gi|26452379|dbj|BAC43275.1| unknown protein [Arabidopsis thaliana]
 gi|108385324|gb|ABF85772.1| At5g11810 [Arabidopsis thaliana]
 gi|332004340|gb|AED91723.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 306

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 170/322 (52%), Positives = 225/322 (69%), Gaps = 17/322 (5%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADK S++L+LYGDGLARFV+PS+ ++HSLAS A CGFLSLP APP E+E ERIVREF+ 
Sbjct: 1   MADKSSRSLILYGDGLARFVDPSNTNIHSLASVATCGFLSLPNAPP-ETENERIVREFSH 59

Query: 61  LIDANQDYLSGIGEHTTELKWE-EKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFAT 119
           L+DA++ Y    G     LK +   +   T+++RFMGLKAAL+T +S L SFG  +G   
Sbjct: 60  LLDASEAYSIASG-----LKPKGNGNDISTLAERFMGLKAALVTDSSTLTSFGKLIGLDV 114

Query: 120 LQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKG 179
           LQL+E+ + SDS    P D  +S+LLKLLG + GK  +V+ +DLV VH G  E  N    
Sbjct: 115 LQLSEICQESDSF---PSDATSSKLLKLLGFEGGKCLDVNLYDLVFVHFGVDEYNN---- 167

Query: 180 KAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKS 239
               +++  +++L+  I+ MAQP +E+ SRLHLSVVLSYG V + D S   +    ++ +
Sbjct: 168 ---GNNMGILDSLIGSIMGMAQPGSEILSRLHLSVVLSYGSVTDKDVSVFPIKTPQEDIN 224

Query: 240 SDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLT 299
                L PRQSYTM+GE  R+DVRH+CPML++QWQ+ VTRKD+ +T SF+  K+  G L 
Sbjct: 225 PAFIGLVPRQSYTMRGEKTRDDVRHYCPMLVAQWQHGVTRKDLVDTLSFEALKKLCGNLV 284

Query: 300 IPADRFLHEVAFKLWKAPKYGA 321
           IPADRF+HEVAFKLWKAPKYGA
Sbjct: 285 IPADRFIHEVAFKLWKAPKYGA 306


>gi|397787620|gb|AFO66525.1| putative Myo28B1 [Brassica napus]
          Length = 303

 Score =  326 bits (836), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 170/324 (52%), Positives = 223/324 (68%), Gaps = 24/324 (7%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS+ALVLYGDG AR V+PS+ H+HSLAS   CGFLSLP APP+E+E+ERIVREFA 
Sbjct: 1   MADKPSRALVLYGDGSARSVDPSNTHIHSLASVGTCGFLSLPHAPPAETEDERIVREFAH 60

Query: 61  LIDANQDY--LSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFA 118
           L+DA++ Y   SG+    +E      +   T+S+RFMG+KAAL+T +S L SFG  +G  
Sbjct: 61  LLDASEAYSIASGLKAKGSE------NEISTLSERFMGIKAALVTDSSSLTSFGKLIGLD 114

Query: 119 TLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDK 178
            LQLNE+ + SD     P D  A+ELL+LLG + GK  ++S +D V VHIG         
Sbjct: 115 VLQLNEIHQKSDD--SFPSDATATELLRLLGFEGGKCLDMSLYDSVFVHIGG-------- 164

Query: 179 GKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEK 238
                 ++  IN+L+  I++M+QP +E+  RLHLS+VLSYG V + D S+ +V  +  E 
Sbjct: 165 -----DNVRTINSLIGSIMRMSQPGSEIAPRLHLSLVLSYGSVTDKDVSDFNVKKTPQEG 219

Query: 239 -SSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGY 297
            +     L PRQSYTM+GE  R++VR +CPML++QWQ AVTRKD+ +T SF+  K+  G 
Sbjct: 220 INPAFQGLVPRQSYTMRGEKTRDNVRDYCPMLVAQWQDAVTRKDLVDTLSFEALKKLSGN 279

Query: 298 LTIPADRFLHEVAFKLWKAPKYGA 321
           L IPADRF+HEVAFKLWKAPKYGA
Sbjct: 280 LVIPADRFIHEVAFKLWKAPKYGA 303


>gi|397787592|gb|AFO66498.1| putative rhomboid family protein [Brassica napus]
          Length = 271

 Score =  260 bits (664), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 150/324 (46%), Positives = 197/324 (60%), Gaps = 56/324 (17%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKPS+ALVLYGD                                +E+E+ERIVREFA 
Sbjct: 1   MADKPSRALVLYGD--------------------------------AETEDERIVREFAH 28

Query: 61  LIDANQDYL--SGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFA 118
           L+DA++ Y   SG+  + +E      +   T+S+RFMG+ AAL+T NS L SFG  +G  
Sbjct: 29  LLDASEAYTLASGLKANGSE------NEISTLSERFMGIMAALVTDNSSLTSFGKMIGLD 82

Query: 119 TLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDK 178
            LQLNE+ + SD    S  D  A+ELL+LLG + GK  ++S +D V VHIG      D+ 
Sbjct: 83  VLQLNEIHQKSDDSFSS--DATATELLRLLGFEGGKCLDMSLYDSVFVHIGG-----DNV 135

Query: 179 GKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEK 238
           G         +++L+  I++M+QP +E  SRLHLSVVLSYG V + D S   V  +  E 
Sbjct: 136 GT--------VDSLIGSIMRMSQPGSETASRLHLSVVLSYGSVTDKDVSVFPVKNTPQEG 187

Query: 239 -SSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGY 297
            +     L PRQSYTM+GE  R++VR +CPML++QWQ AVTRKD+ +T SF+  K+  G 
Sbjct: 188 INPAFQGLVPRQSYTMRGEKTRDNVRDYCPMLVAQWQDAVTRKDLVDTLSFEALKKLSGS 247

Query: 298 LTIPADRFLHEVAFKLWKAPKYGA 321
           L IPADRF+HEVAFKLWKAPKYGA
Sbjct: 248 LVIPADRFIHEVAFKLWKAPKYGA 271


>gi|115444077|ref|NP_001045818.1| Os02g0135600 [Oryza sativa Japonica Group]
 gi|42408840|dbj|BAD10100.1| unknown protein [Oryza sativa Japonica Group]
 gi|42409110|dbj|BAD10360.1| unknown protein [Oryza sativa Japonica Group]
 gi|113535349|dbj|BAF07732.1| Os02g0135600 [Oryza sativa Japonica Group]
 gi|125537989|gb|EAY84384.1| hypothetical protein OsI_05760 [Oryza sativa Indica Group]
 gi|125580726|gb|EAZ21657.1| hypothetical protein OsJ_05290 [Oryza sativa Japonica Group]
 gi|215697639|dbj|BAG91633.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 337

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 143/340 (42%), Positives = 197/340 (57%), Gaps = 22/340 (6%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHA------------HLHSLASKAACGFLSLPIAPPSE 48
           MADKPS+ LVLY  G A  + P+              HL + AS A+CGFLSL   P + 
Sbjct: 1   MADKPSRGLVLYAAGHAALLPPASGGGGGAGGGGGGSHLDAFASLASCGFLSLRSPPLAS 60

Query: 49  SEEER--IVREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQ----TISDRFMGLKAALI 102
            EE+R   + E A L+D   D      E T + +  +  P Q     +S+RFMG++AA++
Sbjct: 61  GEEKRDSTILELAQLLDVYDDLFPAKTEKTGQ-ETAQVDPLQLAVPKLSERFMGIRAAMV 119

Query: 103 TTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFD 162
           T    + SF   LGF      +L+  S S + S    + S+ L LLG   G ++E S+FD
Sbjct: 120 TNCPLVSSFAANLGFHVSGTEDLVAQSGSSAASKEAGIISQALSLLGFSEGNVQETSEFD 179

Query: 163 LVLVHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQV 221
           LV +H+ A E TN   GK  +  DL  ++ LV  I++ A  ++ + SR+ +SV+LSYG  
Sbjct: 180 LVFLHV-AMENTNSKLGKLGMKTDLNRLDKLVGAIMEAAPISSAIASRILVSVMLSYGSA 238

Query: 222 LEADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKD 281
            E +    S+  S  E  SDL+ L PRQSYTMK     +DVR+H P+L++QWQ  VTR D
Sbjct: 239 AE-NKDEFSISKSSTEIDSDLNLLRPRQSYTMKAGNTLDDVRNHHPILLAQWQEGVTRSD 297

Query: 282 MAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           + E FSF++F + GG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 LTEGFSFEEFIKRGGNLAMLAERFLHEVAFKLWKAPKYGA 337


>gi|242063964|ref|XP_002453271.1| hypothetical protein SORBIDRAFT_04g002870 [Sorghum bicolor]
 gi|241933102|gb|EES06247.1| hypothetical protein SORBIDRAFT_04g002870 [Sorghum bicolor]
          Length = 334

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 200/338 (59%), Gaps = 21/338 (6%)

Query: 1   MADKPSKALVLYGDGLARFVEP------SHAHLHSLASKAACGFLSL---PIAPPSESEE 51
           MADKPS+ALVLY  G A  + P      + +HL + AS+A+CG L+L   P + P+   E
Sbjct: 1   MADKPSRALVLYAAGHATVLTPPAGSAAAGSHLDAFASRASCGLLTLRSPPASHPTTGAE 60

Query: 52  ER--IVREFAVLIDANQDYLSG----IGEHTTELKWEEKSPFQTISDRFMGLKAALITTN 105
           +    + E A L+D       G     G+   ++  +E      +S+RFMGL+AAL+T+ 
Sbjct: 61  DNSSTILELAQLLDVYDHLYPGKNADSGQEVAQVDPQELV-VPKLSERFMGLRAALVTSC 119

Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDV-VASELLKLLGLQRGKMEEVSQFDLV 164
            R+ SF   LGF   Q N+    S S SG   +V + +    LLG   G ++E S+FDLV
Sbjct: 120 PRVSSFAANLGFQVFQTNDFAPQSAS-SGVTKEVGLINRAFDLLGFSDGNVQETSEFDLV 178

Query: 165 LVHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLE 223
            +H+ A E T+   GK  +  DL  +  LV  +++ A   + + SR+H+SV+LSYG    
Sbjct: 179 FMHV-AMENTSSKLGKLGMKTDLNRLEKLVGAVMESAPVGSAIASRIHVSVILSYGSA-S 236

Query: 224 ADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMA 283
            +    S+L S  E  SDL+ L PRQSYTMK     +DVR H P+L++QWQ  VTR D+A
Sbjct: 237 GNKDEFSLLTSSTEADSDLNLLRPRQSYTMKAGHTLDDVRLHHPILLAQWQEGVTRVDLA 296

Query: 284 ETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           + FSF++F +HGG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 297 KGFSFEEFIKHGGNLAMLAERFLHEVAFKLWKAPKYGA 334


>gi|212275488|ref|NP_001130063.1| uncharacterized protein LOC100191155 [Zea mays]
 gi|194688200|gb|ACF78184.1| unknown [Zea mays]
 gi|413935502|gb|AFW70053.1| hypothetical protein ZEAMMB73_997782 [Zea mays]
          Length = 334

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 137/337 (40%), Positives = 196/337 (58%), Gaps = 19/337 (5%)

Query: 1   MADKPSKALVLYGDGLARFVEP------SHAHLHSLASKAACGFLSL---PIAPPSESEE 51
           MADKPS+ALV+Y  G A  + P      + +HL + AS+A CG L+L   P + P+   E
Sbjct: 1   MADKPSRALVVYAAGHAAALIPPPGSAAAGSHLDAFASRACCGLLTLRSPPTSHPTTGTE 60

Query: 52  ER--IVREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTIS----DRFMGLKAALITTN 105
           +    + E A L+D       G  E + + +  +  P + +     +RFMGL+AAL+T+ 
Sbjct: 61  DNSSTILELAQLLDVYDHLYPGKNEESGQ-EVAQVDPQELVVPKLFERFMGLRAALVTSC 119

Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVL 165
             + SF   LGF   + N+    S S S +  D +      LLG   G ++E S+FDLV 
Sbjct: 120 PHVSSFAANLGFHVFKTNDFAPQSGSSSITKEDGLIKRAFDLLGFSDGDVQETSEFDLVF 179

Query: 166 VHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEA 224
           +H+ A E T+   GK  +  D+  ++ LV  +L+ A   + + SR+H+SV+LSYG     
Sbjct: 180 MHV-AMENTSSKLGKLGMKTDINRLDKLVGAVLESAPVGSAIASRIHVSVILSYGSA-SG 237

Query: 225 DNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAE 284
           +    S+L S  E  SDL+ L PRQSYTMK     +DVR H P+L++QWQ  VTR D+A+
Sbjct: 238 NKDEFSLLTSSTETDSDLNLLRPRQSYTMKTGRTLDDVRLHHPILLAQWQDGVTRVDLAK 297

Query: 285 TFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
            FSF++F + GG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 GFSFEEFMKRGGNLAMLAERFLHEVAFKLWKAPKYGA 334


>gi|195636240|gb|ACG37588.1| hypothetical protein [Zea mays]
          Length = 334

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 137/337 (40%), Positives = 195/337 (57%), Gaps = 19/337 (5%)

Query: 1   MADKPSKALVLYGDGLARFVEP------SHAHLHSLASKAACGFLSL---PIAPPSESEE 51
           MADKPS+ALV+Y  G A  + P      + +HL + AS+A CG L+L   P + P+   E
Sbjct: 1   MADKPSRALVVYAAGHAAALIPPPGSAAAGSHLDAFASRACCGLLTLRSPPTSHPTTGTE 60

Query: 52  ER--IVREFAVLIDANQDYLSG----IGEHTTELKWEEKSPFQTISDRFMGLKAALITTN 105
           +    + E A L+D       G     G+   ++  +E      + +RFMGL+AAL+T+ 
Sbjct: 61  DNSSTILELAQLLDVYDHLYPGKNAESGQEVAQVDPQELV-VPKLFERFMGLRAALVTSC 119

Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVL 165
             + SF   LGF   + N+    S S S +  D +      LLG   G ++E S+FDLV 
Sbjct: 120 PHVSSFAANLGFHVFKTNDFAPQSGSSSITKEDGLIKRAFDLLGFSDGDVQETSEFDLVF 179

Query: 166 VHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEA 224
           +H+ A E T+   GK  +  D+  ++ LV  +L+ A   + + SR+H+SV+LSYG     
Sbjct: 180 MHV-AMENTSSKLGKLGMKTDINRLDKLVGAVLESAPVGSAIASRIHVSVILSYGSA-SG 237

Query: 225 DNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAE 284
           +    S+L S  E  SDL+ L PRQSYTMK     +DVR H P+L++QWQ  VTR D+A+
Sbjct: 238 NKDEFSLLTSSTETDSDLNLLRPRQSYTMKTGRTLDDVRLHHPILLAQWQDRVTRVDLAK 297

Query: 285 TFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
            FSF++F + GG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 GFSFEEFMKRGGNLAMLAERFLHEVAFKLWKAPKYGA 334


>gi|357147081|ref|XP_003574214.1| PREDICTED: uncharacterized protein LOC100836573 [Brachypodium
           distachyon]
          Length = 335

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 192/338 (56%), Gaps = 20/338 (5%)

Query: 1   MADKPSKALVLYGDGLARFVEPS---------HAHLHSLASKAACGFLSL--PIAPPSES 49
           MADKPS+ALVLY  G A  +  +          +HL + AS A+CGFLS+  P       
Sbjct: 1   MADKPSRALVLYAAGHAALLPAAAAAAGGGGGKSHLDAFASLASCGFLSVRTPAVNDEGD 60

Query: 50  EEERIVREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTI----SDRFMGLKAALITTN 105
           +    + E A L+D   D L    +  T L+     P + +    S+RFMG++AA++T  
Sbjct: 61  KNSATIVELAQLLDV-YDALYPAKDAQTGLETARVDPQELVVPKLSERFMGMRAAMVTNC 119

Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDV-VASELLKLLGLQRGKMEEVSQFDLV 164
             + SF   LGF   +  + +  S  L GS  D+ + +    LLG   G +++ S+FDLV
Sbjct: 120 PDVSSFTANLGFHVFRTEDFVAHS-GLGGSSKDIGIINRAFGLLGFSEGSVQDASEFDLV 178

Query: 165 LVHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLE 223
            VH+ A E T    GK  +  DL  ++ LV  +++ A   + + +R+H+SV+LSYG   E
Sbjct: 179 FVHV-AMENTASKLGKLGMKTDLNRLDKLVAAVMEAAPVKSAIAARVHVSVILSYGSATE 237

Query: 224 ADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMA 283
               +  +L S  E  SDL  L PRQSYTMK     +DVR+H PML++QWQ  VTR D+A
Sbjct: 238 NKEESCLILNSSTETDSDLKLLHPRQSYTMKAGRTLDDVRNHHPMLLAQWQEGVTRSDLA 297

Query: 284 ETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           + FSF++F + GG   + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 KGFSFEEFIKRGGNFAMLAERFLHEVAFKLWKAPKYGA 335


>gi|326488863|dbj|BAJ98043.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 326

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 132/328 (40%), Positives = 180/328 (54%), Gaps = 9/328 (2%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHL---HSLASKAACGFLSLPIAPPSES---EEERI 54
           MADKPS+ALVLY  G A  +           + AS A+CGFLS+     +E    ++   
Sbjct: 1   MADKPSRALVLYAAGHAALLAGGGGGKGHLDAFASIASCGFLSVRTPAVNEDGGDKDSDT 60

Query: 55  VREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDK 114
           + E A L+D   D L    +  T     ++     +S+RFMGL+AA++T    + SF   
Sbjct: 61  ILELAQLLDV-YDGLCPAKDGETSRVDPQQLVVPKLSERFMGLRAAMVTNCPGVSSFAAN 119

Query: 115 LGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKT 174
           LGF      +    S S S S    +      LLG   G +++ S+FDLV VH  A E T
Sbjct: 120 LGFHVFGTEDFAAQSGSGSSSKDSRIIDRAFTLLGFVEGNVQDASEFDLVFVH-AAMENT 178

Query: 175 NDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLI 233
               GK  +  DL  ++ LV  +++ A   + V +R+H+SVVLSYG   E       ++ 
Sbjct: 179 TSKLGKLGMKTDLNRLDKLVASVMEAAPVGSAVAARIHVSVVLSYGSATENKEEACLIVN 238

Query: 234 SIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKE 293
           S  E  SDL  L PRQSYTMK     +DVR+H PML++QWQ  VTR D+A+ FSF++F +
Sbjct: 239 SSTETDSDLKLLRPRQSYTMKAGKTLDDVRNHHPMLLAQWQQGVTRSDLAKEFSFEEFIK 298

Query: 294 HGGYLTIPADRFLHEVAFKLWKAPKYGA 321
             G   + A+RFLHEVAFKLWKAPKYGA
Sbjct: 299 RAGNFAMLAERFLHEVAFKLWKAPKYGA 326


>gi|168006913|ref|XP_001756153.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692663|gb|EDQ79019.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 314

 Score =  203 bits (516), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 132/331 (39%), Positives = 181/331 (54%), Gaps = 27/331 (8%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MADKP++ALV YGDGL   V+P HAHLH LA   +CGFL+L   P  ES   R V E A 
Sbjct: 1   MADKPARALVCYGDGLMPAVKPHHAHLHQLAVSGSCGFLALRSLPDDES---RPVLELAQ 57

Query: 61  LID---------ANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSF 111
           L+D           +D  SG G    EL+ ++  P  ++++RFMG KA  I  +    + 
Sbjct: 58  LLDVYDVYSETACKEDLASGNGSAGAELRQDDLLP--SMAERFMGTKATFIANSKSAVAL 115

Query: 112 GDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGL-QRGKMEEVSQFDLVLVHIGA 170
           G + GFAT  L E  + S SL  +     AS+ L LLG  +    +E+++  LV +H+ A
Sbjct: 116 GRRCGFATDCLAE--DGSSSLPDA--SATASKALGLLGFDESSDAKEMNE--LVFLHLDA 169

Query: 171 GEKTNDDKGKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLS 230
                 D G +    +E++++LV  +   ++  T    RL   VVL YG         + 
Sbjct: 170 ------DNGTSAECGIEFLDSLVGSVQDASKEGTSAYGRLFFVVVLGYGDAKANLGDAVP 223

Query: 231 VLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKD 290
            + S ++ S +L++L PRQSYTMK   P   VR   P+L    Q AVTR+D  E F F+D
Sbjct: 224 SIKSKEKLSVELASLRPRQSYTMKTGKPVEGVREEYPLLAVYNQVAVTRRDEVERFMFED 283

Query: 291 FKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           F+   G L +  DRFL+EVAFKLWKAPKYGA
Sbjct: 284 FQNRAGNLAMLVDRFLYEVAFKLWKAPKYGA 314


>gi|168012392|ref|XP_001758886.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690023|gb|EDQ76392.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 314

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 175/336 (52%), Gaps = 37/336 (11%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSL-----PIAPPSESEEERIV 55
           MADKPS+ALV+YGDGLA  VE  H+HLH LA+  +CGFL+L     P    S    ER V
Sbjct: 1   MADKPSRALVIYGDGLADAVESHHSHLHQLAASGSCGFLALRTLSTPHPGSSADGSERTV 60

Query: 56  REFAVLID--------ANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSR 107
            E A L+D        A ++  SGI +   +   EE S   ++++RFMG+K+  IT +  
Sbjct: 61  LELAQLLDVYDIYTEKARKEDASGITDAGVD---EESSKLPSMAERFMGMKSTFITNSKP 117

Query: 108 LKSFGDKLGFATLQLNELIETSDSLSGSP-IDVVASELLKLLGL-QRGKMEEVSQFDLVL 165
             +   + GF+   L E        +G P  +  AS +L LLG  +    +E ++  LV 
Sbjct: 118 ALALSKRAGFSVKPLTED-------NGPPHANATASTVLSLLGFCESSDAKETNE--LVF 168

Query: 166 VHIGAGEKTNDDKGKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEAD 225
           +H+          G +    +EY+N+L+  +   ++  +     L L VVL YG      
Sbjct: 169 LHL---------DGSSSGTGIEYLNSLIGCVQDASKEGSLAYGHLFLVVVLGYGDAALGA 219

Query: 226 NSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAET 285
           N   S    I    S+L+AL P QSY MK       +R   P+L    Q AVTR+D  + 
Sbjct: 220 NGGRSFKEEI-ALPSELAALRPVQSYKMKTGKAVEGIREDYPLLAVYNQVAVTRRDEVQK 278

Query: 286 FSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           FSF DFK+  G L +  DRFL+EV+FKLWKAPKYGA
Sbjct: 279 FSFDDFKKRAGNLVMLVDRFLYEVSFKLWKAPKYGA 314


>gi|302771948|ref|XP_002969392.1| hypothetical protein SELMODRAFT_410426 [Selaginella moellendorffii]
 gi|300162868|gb|EFJ29480.1| hypothetical protein SELMODRAFT_410426 [Selaginella moellendorffii]
          Length = 281

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 119/321 (37%), Positives = 170/321 (52%), Gaps = 40/321 (12%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MAD PS+++V++GDG    V   H++LHSLAS   CGFL+L    P+ S+  R       
Sbjct: 1   MADGPSRSVVIFGDGFLPHVAAQHSNLHSLASDGCCGFLAL--RSPAASDGNRSAEALIQ 58

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+D    Y  G G             FQT+S+RFMG+ AAL+T + +  + G K GF   
Sbjct: 59  LLDLYDAYKEGKG-------------FQTVSERFMGMNAALVTNSEQAVAVGSKAGFVVS 105

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +  +L E      G   + + S+ L ++G+  G       FDL+ +H+ A    ++D  K
Sbjct: 106 RFQDLHE------GIGAEDMPSKFLGMVGV--GDSAGGKPFDLLFLHLVA----DNDLEK 153

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
           + A   E++++LV  +     PA      L L ++L YG  L    S + +   ID++  
Sbjct: 154 SPAISTEWMDSLVGKLKN--APAKN----LLLVLILGYGNAL----SEVEIPELIDQQ-- 201

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
            L  L PRQSY++KG  P  D+   C +L    Q AVTR+D        +F++  G LTI
Sbjct: 202 -LRQLRPRQSYSIKGGKPVEDISKDCSLLAVFHQKAVTRRDHCTCLQLGEFRQKCGNLTI 260

Query: 301 PADRFLHEVAFKLWKAPKYGA 321
            ADRFLHEVAFKLWKAPKYGA
Sbjct: 261 LADRFLHEVAFKLWKAPKYGA 281


>gi|302774625|ref|XP_002970729.1| hypothetical protein SELMODRAFT_411538 [Selaginella moellendorffii]
 gi|300161440|gb|EFJ28055.1| hypothetical protein SELMODRAFT_411538 [Selaginella moellendorffii]
          Length = 313

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 151/300 (50%), Gaps = 40/300 (13%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
           MAD PS+++V++GDG    V   H++LHSLAS   CGFL+L    P+ S+  R       
Sbjct: 1   MADGPSRSVVIFGDGFLPHVAAQHSNLHSLASDGCCGFLAL--RSPAASDGNRSAEALIQ 58

Query: 61  LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
           L+D              +   E KS FQT+S+RFMG+ AAL+T + +  + G K GF   
Sbjct: 59  LLD------------LYDADKEGKS-FQTVSERFMGMNAALVTNSEQAVAVGSKAGFVVS 105

Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
           +  +L E      G   + + S+ L ++G+  G       FDL+ +H+ A    ++D  K
Sbjct: 106 RFQDLHE------GIGAEDMPSKFLGMVGV--GDSAGGKPFDLLFLHLVA----DNDLEK 153

Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
           + A   E++++LV  +     PA      L L ++L YG  L    S + +   ID++  
Sbjct: 154 SPAISTEWMDSLVGKLKNA--PAK----NLLLVLILGYGNAL----SEVEIPEFIDQQ-- 201

Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
            L  L PRQSY++KG  P  D+   C +L    Q AVTR+D       ++F++ G  + +
Sbjct: 202 -LRQLRPRQSYSIKGGKPVEDISKDCSLLAVFHQKAVTRRDHCTCLQLEEFRQTGSSMRL 260


>gi|224141629|ref|XP_002324169.1| predicted protein [Populus trichocarpa]
 gi|222865603|gb|EEF02734.1| predicted protein [Populus trichocarpa]
          Length = 79

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 41/51 (80%), Positives = 45/51 (88%)

Query: 262 VRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFK 312
           + +HCP+LI+QWQYAVTR DMAETFSFKD KEHGG L IPADRFL EVAFK
Sbjct: 27  ISYHCPILIAQWQYAVTRLDMAETFSFKDLKEHGGNLVIPADRFLREVAFK 77


>gi|300681520|emb|CBH32615.1| conserved hypothetical protein, expressed [Triticum aestivum]
          Length = 364

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 77/249 (30%), Positives = 112/249 (44%), Gaps = 23/249 (9%)

Query: 1   MADKPSKALVLYGDGLARFVEPSHA---HLHSLASKAACGFLSLPIAPPSESEEER---I 54
           MADKPS+ALVLY  G A  +         L + AS A+C FLS+     +E   +R    
Sbjct: 1   MADKPSRALVLYAAGHAALLAGGGGGKGQLDAFASIASCSFLSVRTPVVNEDGGDRNSDS 60

Query: 55  VREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDK 114
           + E A L+D   D L  + +        ++ P   +S+RF      LI   S L  FG +
Sbjct: 61  ILELAQLLDVC-DALYSVKDGEIARVDPQELPVPKLSERF-----DLIRGESWLPCFGTE 114

Query: 115 LGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKT 174
             FA  Q      + D+        +      LLG   G +++ S+FDLV VH    E T
Sbjct: 115 -DFAA-QSGSGSSSKDTR-------IIDRAFTLLGFAEGNVQDASEFDLVFVH-ATMENT 164

Query: 175 NDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLI 233
               GK  +   L  ++ LV  +++ A   + V +R+H+SV+LSYG           +L 
Sbjct: 165 TSKLGKLGMKTYLNRLDKLVASVMEAAPVGSAVAARIHVSVILSYGSATANKEEACLILN 224

Query: 234 SIDEKSSDL 242
           S  E  SDL
Sbjct: 225 SSIETDSDL 233


>gi|290996720|ref|XP_002680930.1| hypothetical protein NAEGRDRAFT_46748 [Naegleria gruberi]
 gi|284094552|gb|EFC48186.1| hypothetical protein NAEGRDRAFT_46748 [Naegleria gruberi]
          Length = 300

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 105/249 (42%), Gaps = 43/249 (17%)

Query: 78  ELKWEEKSPFQTISDRFMGLKAALITTNSRL----KSFGDKLGFATLQLNELIETSDSLS 133
           EL+ E++S F     +F  LK  + TTN+ +    KS  D   F    +N   E  D++ 
Sbjct: 90  ELELEKQSLFC----KFKKLKVEMWTTNNNVLNIFKSKKDNAKFLNFIVNNK-EEEDAIC 144

Query: 134 GSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVAHDLEYINALV 193
            + +  + S                   D++++H       N  K + +     +I  + 
Sbjct: 145 ENAVQAMLSN------------------DILILH------DNITKLQTIEEKRNHIQQID 180

Query: 194 RVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSALFPRQSYTM 253
            ++ Q+    T+  ++  +     Y  +L +   N SVL S ++KSS      P QSY  
Sbjct: 181 HLLKQVNNLFTQKQNQFKI-----YNVILVSTPFN-SVLPSFNKKSSPFR---PLQSYEQ 231

Query: 254 KGETPRNDVRHHCPMLIS-QWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFK 312
           K  T   D+ HH P LI  Q     TR D     + K   E  G   I  D  + E+A+K
Sbjct: 232 KNGTILTDIVHHEPSLICVQHCEGETRSDTNTQLTEKSSIELNGNCAILIDFVISEMAYK 291

Query: 313 LWKAPKYGA 321
           L K+PKYGA
Sbjct: 292 LRKSPKYGA 300


>gi|66807053|ref|XP_637249.1| hypothetical protein DDB_G0287343 [Dictyostelium discoideum AX4]
 gi|60465660|gb|EAL63739.1| hypothetical protein DDB_G0287343 [Dictyostelium discoideum AX4]
          Length = 278

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 58/245 (23%), Positives = 103/245 (42%), Gaps = 36/245 (14%)

Query: 79  LKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPID 138
            K +EK   Q+  +++M LK  LI+   +                EL   SDS+      
Sbjct: 68  FKHDEKLTEQSFKEKYMDLKIKLISNIEK----------------ELCIFSDSVKLDTQK 111

Query: 139 VVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVAHDLEYINALVRVILQ 198
               ELL  +          S   L+++H      + +++ K + +    I+ ++  +L+
Sbjct: 112 QTNQELLDTIS--------ESNEQLIMIHYQTINNSLEERRKTLFN----IDLILSNLLK 159

Query: 199 MAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSALF--PRQSYTMKGE 256
             Q         ++++V+ Y Q     N  + +     ++S+++   F  P+QSYT+   
Sbjct: 160 KNQNDGNNNKNYYINIVIGYSQT----NIEIPLTAQQQQQSTNIPNWFNLPKQSYTLSNS 215

Query: 257 TPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKA 316
            P  D R   P+    +    TRKD  ++F   +F  +G    I   +   E+AFKL K 
Sbjct: 216 LPY-DPRLTSPLFTIHYNSIFTRKDNTKSFCESEFI-NGSNGKILLFQHFRELAFKLGKV 273

Query: 317 PKYGA 321
           PKYGA
Sbjct: 274 PKYGA 278


>gi|384253165|gb|EIE26640.1| hypothetical protein COCSUDRAFT_59159 [Coccomyxa subellipsoidea
           C-169]
          Length = 78

 Score = 48.1 bits (113), Expect = 0.006,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 247 PRQSYTMKG-ETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRF 305
           PRQS+   G ET R  +RH   ++ +Q    V R+D +  FS +  K+ GG   I  +R 
Sbjct: 6   PRQSFEFAGLETVR--IRHDASVMFAQRLKGVVRQDSSTAFSAEQCKQGGGG-GILLERL 62

Query: 306 LHEVAFKLWKAPKYGA 321
           + E+A+KL +A KYGA
Sbjct: 63  MPEIAYKLGRAHKYGA 78


>gi|452823400|gb|EME30411.1| hypothetical protein Gasu_23170 [Galdieria sulphuraria]
          Length = 284

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 77/320 (24%), Positives = 131/320 (40%), Gaps = 41/320 (12%)

Query: 6   SKALVLYGDGLARFVEPS-HAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAVLIDA 64
           S   +L+G  L+ F+  S + +L  +A     G  +L        E E +  +F++  D+
Sbjct: 2   SIVWILHGLDLSNFLLKSQYENLDRIAELGCSGICALY----EYDELENVTPQFSI-KDS 56

Query: 65  NQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLG-FATLQLN 123
               +SG+       +++ K  F   SD   GL+ A++TT+  L     +L  F  L L 
Sbjct: 57  CFLQVSGLSRWN---RYQHKKNFP--SDTIFGLRTAVLTTDENLLEHKSQLSPFNLLLLA 111

Query: 124 ELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVA 183
           E  +  DSL+     +  S    L      + E +++FDLV +++               
Sbjct: 112 E--KDEDSLTQR---LWHSCCQLLCHTYEKEYERIAKFDLVFIYLK-------------- 152

Query: 184 HDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLS 243
                 N   +VIL++A   T  G  L +   + +  VL  D         +     +  
Sbjct: 153 ------NIPNQVILRVADVVT--GMLLSIPNEIPFLSVLGFDPRWNKSFEDLFPSRKEGI 204

Query: 244 ALFPRQSYTMKGET--PRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIP 301
              PRQS+    E    R+   H+  +  S ++   TR+D    F  +D    G    I 
Sbjct: 205 PFMPRQSFQSIDELILERHCFHHNTHLWASFYKEGFTRRDKCCRFLPEDCFSLGCNRVIL 264

Query: 302 ADRFLHEVAFKLWKAPKYGA 321
           A+  ++E+AF L K PKYG+
Sbjct: 265 AEHLIYEIAFLLEKVPKYGS 284


>gi|325185878|emb|CCA20384.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 249

 Score = 44.3 bits (103), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 37/75 (49%)

Query: 247 PRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFL 306
           PRQS+T       +D       ++S +Q+   RKD  + +   +F    GY ++     +
Sbjct: 175 PRQSFTKVDGAYVDDNTAPQYFVLSYFQHNRIRKDDVKRYELSEFAHRNGYGSMNLRALI 234

Query: 307 HEVAFKLWKAPKYGA 321
            E+AF+L   PKYGA
Sbjct: 235 KEIAFRLGCTPKYGA 249


>gi|440793375|gb|ELR14561.1| hypothetical protein ACA1_077920 [Acanthamoeba castellanii str.
           Neff]
          Length = 298

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 31/53 (58%)

Query: 269 LISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           L + +    TR+D  + F+  + K  G    I A++F+ E+AFKL K PKYGA
Sbjct: 246 LATYYHEGSTRRDDVQRFTEAEVKSKGANGNIAAEQFVAELAFKLGKTPKYGA 298


>gi|384491511|gb|EIE82707.1| hypothetical protein RO3G_07412 [Rhizopus delemar RA 99-880]
          Length = 214

 Score = 42.4 bits (98), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 1/107 (0%)

Query: 215 VLSYGQVLEADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQ 274
           +++ G  L+   + + ++  I++     +   P+QSY MKG     ++     ++ S + 
Sbjct: 109 IINNGCALDRPETLVCIVSEINQNEERKAEFMPQQSYWMKGGVKVEEIEKGRSLIYSYFH 168

Query: 275 YAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
              TR+D  E F  +D     G   I A  FL E+  KL    KYG+
Sbjct: 169 CGSTRRDSVEHFG-QDIVRLSGNKKILAWHFLAEIGNKLGFVAKYGS 214


>gi|281207162|gb|EFA81345.1| hypothetical protein PPL_05328 [Polysphondylium pallidum PN500]
          Length = 278

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 59/251 (23%), Positives = 107/251 (42%), Gaps = 51/251 (20%)

Query: 82  EEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVA 141
           + K  + T  +++  L   LIT++  L    +  GF ++ +N              +V  
Sbjct: 68  DHKVTYNTFKEKYSNLSLRLITSSQSLLQDCNTKGFDSILVN--------------NVNC 113

Query: 142 SELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVAHDLEYINALVRVILQMAQ 201
            +LL  +      M+E  + ++V++H+           + V+  LE   AL+  + Q  Q
Sbjct: 114 KDLLNEIN---KPMKE--KVEMVIIHL-----------EPVSKGLEDRRALLTTLDQTVQ 157

Query: 202 PATEVGS-----RLHLSVVLSYGQVLEADNSNLSVLIS--IDEKSSDLSALF--PRQSYT 252
              +          ++++V+S+      D+ N++V ++         + + F  P+QS  
Sbjct: 158 SLKQQQEQQQQFETYINIVISF------DSPNVTVPLTEQQQSSQQSIPSTFIPPKQSTQ 211

Query: 253 MKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKE--HGGYLTIPADRFLHEVA 310
                P  + R   PM         TRKD +++F    F    +G  LTI   ++L E+A
Sbjct: 212 FYNGKPV-EPRQTSPMFAIYSHAPWTRKDESKSFRLDQFIRGCNGKILTI---QYLKELA 267

Query: 311 FKLWKAPKYGA 321
           FKL K PKYGA
Sbjct: 268 FKLGKIPKYGA 278


>gi|374371858|ref|ZP_09629777.1| hypothetical protein OR16_40849 [Cupriavidus basilensis OR16]
 gi|373096602|gb|EHP37804.1| hypothetical protein OR16_40849 [Cupriavidus basilensis OR16]
          Length = 696

 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 45/101 (44%)

Query: 70  SGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETS 129
           S +G +T  LK  E +PF  +     G  A L++ ++   + G KL      L   I   
Sbjct: 82  SALGTYTCTLKTGEVAPFFVVVTDPTGNTAPLVSVSTVTPAAGAKLTLNATPLTTAIVAQ 141

Query: 130 DSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGA 170
            +  G+P+  V+ + +    LQ+     V+Q   VL  IGA
Sbjct: 142 LAADGNPLTFVSGKTVDAAALQKVTANVVAQLASVLAAIGA 182


>gi|301104856|ref|XP_002901512.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100516|gb|EEY58568.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 204

 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 51/113 (45%), Gaps = 23/113 (20%)

Query: 225 DNSNLSVLISIDEKSS-------DLSALFPRQSYTM---------KGETPRNDVRHHCPM 268
           D+ +  V +SI + +S       +   L P+QSY            GE PR        M
Sbjct: 99  DDGSSKVFVSIVKTASQRVVEPSEPHPLRPQQSYHKYDHKYPEHDSGEAPRR------LM 152

Query: 269 LISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
             S +Q   TR+D  +TF      + GGY  +    F+ E+AF+L  APKYGA
Sbjct: 153 FASLYQ-DQTRRDAVQTFDEAQMDKLGGYGAMDVRVFVKEMAFRLGYAPKYGA 204


>gi|418872319|ref|ZP_13426661.1| urease accessory protein UreE, N-terminal domain protein
           [Staphylococcus aureus subsp. aureus IS-125]
 gi|418947816|ref|ZP_13500158.1| urease accessory protein UreE, N-terminal domain protein
           [Staphylococcus aureus subsp. aureus IS-157]
 gi|375367276|gb|EHS71241.1| urease accessory protein UreE, N-terminal domain protein
           [Staphylococcus aureus subsp. aureus IS-125]
 gi|375374329|gb|EHS77963.1| urease accessory protein UreE, N-terminal domain protein
           [Staphylococcus aureus subsp. aureus IS-157]
          Length = 150

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 10/98 (10%)

Query: 186 LEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSAL 245
           LE  + + R+   +    TE+G RL   + L YG +L AD+ N   +I +D  S DL  +
Sbjct: 27  LENSDLVKRIQRVVTDHGTEIGIRLKQPIDLQYGDILYADDHN---MIIVDVNSEDLLVI 83

Query: 246 FPRQSYTMKGETPRNDVRHHCPM------LISQWQYAV 277
            PR    M G+       HH P       ++ Q+ Y V
Sbjct: 84  QPRTLQEM-GDIAHQLGNHHLPAQFTETEMLVQYDYLV 120


>gi|348689196|gb|EGZ29010.1| hypothetical protein PHYSODRAFT_476234 [Phytophthora sojae]
          Length = 184

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 29/44 (65%)

Query: 278 TRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
           TR+D  ++F+ ++  + G Y  + A  F+ E+AF+L  APKYGA
Sbjct: 141 TRRDGVQSFTEEEIDKLGAYGAMDARVFMKEMAFRLGCAPKYGA 184


>gi|206889471|ref|YP_002249056.1| dihydroorotase [Thermodesulfovibrio yellowstonii DSM 11347]
 gi|206741409|gb|ACI20466.1| dihydroorotase [Thermodesulfovibrio yellowstonii DSM 11347]
          Length = 435

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 22/154 (14%)

Query: 26  HLHSLASKAACGF-LSLPIAPPSESEEERIVREFAVLIDANQDYLSGIGEHTTELKWEEK 84
           H  +L  +A  G+  +  + PP  SEE+R   + A L D   D ++   +H      E+K
Sbjct: 269 HYFTLTEEAVSGYNTNAKVNPPLRSEEDREAIKNA-LSDGIIDVIAT--DHAPHHIDEKK 325

Query: 85  SPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASEL 144
            PFQ   +   GL+ A   +   LK   DK+    L L +LIE    L+ +P        
Sbjct: 326 VPFQEALNGISGLETAFAIS---LKLVHDKI----LNLRKLIE---KLTLNP-------- 367

Query: 145 LKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDK 178
            K+L L++  ++E +Q DL+LV I        DK
Sbjct: 368 SKILNLKKEGIKEGAQGDLILVDINKNWTVKKDK 401


>gi|418564108|ref|ZP_13128533.1| urease accessory protein UreE, N-terminal domain protein
           [Staphylococcus aureus subsp. aureus 21264]
 gi|371977208|gb|EHO94486.1| urease accessory protein UreE, N-terminal domain protein
           [Staphylococcus aureus subsp. aureus 21264]
          Length = 150

 Score = 37.7 bits (86), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 10/98 (10%)

Query: 186 LEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSAL 245
           LE  + + R+   +    TE+G RL   + L YG +L AD+ N   +I +D  S DL  +
Sbjct: 27  LENSDLVKRIQRVVTDHGTEIGIRLKQPIDLQYGDILYADDHN---MIVVDVNSEDLLVI 83

Query: 246 FPRQSYTMKGETPRNDVRHHCPM------LISQWQYAV 277
            PR    M G+        H P       ++ Q+ Y V
Sbjct: 84  HPRTLQEM-GDIAHQLGNRHLPAQFTETEMLVQYDYLV 120


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.379 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,895,686,194
Number of Sequences: 23463169
Number of extensions: 194162792
Number of successful extensions: 460540
Number of sequences better than 100.0: 59
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 460434
Number of HSP's gapped (non-prelim): 61
length of query: 321
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 179
effective length of database: 9,027,425,369
effective search space: 1615909141051
effective search space used: 1615909141051
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)