BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017999
         (362 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225454022|ref|XP_002281030.1| PREDICTED: uncharacterized protein LOC100259142 [Vitis vinifera]
 gi|296089202|emb|CBI38905.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  567 bits (1461), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 276/388 (71%), Positives = 306/388 (78%), Gaps = 40/388 (10%)

Query: 15  MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLP 74
           MR C G RR L+ LP V F+PY LSVLELH+ S +E   +K+ +KFDHL+LGPAAGQ L 
Sbjct: 1   MRVCSGWRRRLYCLPFVLFIPYFLSVLELHQSSTIEGSQKKHSKKFDHLVLGPAAGQGLH 60

Query: 75  NRLQCQ----------------------------------------DSRASNMVTVGNAS 94
           +RLQCQ                                        D R+S++VTVGNAS
Sbjct: 61  DRLQCQGTKALNKTHIATSSHESNFGESIALITVFTIYNSSLALHADGRSSDLVTVGNAS 120

Query: 95  YSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLML 154
           YSK ERSMAILNVFINFIQ TMP+S+V ILTDPAS+ S+ R  VTIYPI GEYSRDKLML
Sbjct: 121 YSKMERSMAILNVFINFIQATMPQSNVIILTDPASEFSLHRDRVTIYPIQGEYSRDKLML 180

Query: 155 QRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNK 214
           QRIRSYI FLE ++ EHSQG GHINHY+FTDSDIAVVDDLG IF  + NFH+ALTFRNNK
Sbjct: 181 QRIRSYIVFLETKLEEHSQGHGHINHYIFTDSDIAVVDDLGQIFQSHPNFHVALTFRNNK 240

Query: 215 DQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFD 274
           +QPLNSGFIAVRGTPDGI RAK+FL+EVL+VYSS++MNASRMLGDQLALAWVVKSHP FD
Sbjct: 241 EQPLNSGFIAVRGTPDGILRAKLFLQEVLKVYSSRFMNASRMLGDQLALAWVVKSHPYFD 300

Query: 275 ARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 334
            +RF+K Q F+EDI G SVLFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE
Sbjct: 301 TKRFSKPQAFLEDIGGTSVLFLPCAIYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 360

Query: 335 SWNFFSSSSDISDMLCLILMSGRTKYDF 362
           SWNFF SSSDISDMLCLILMSGRTKYDF
Sbjct: 361 SWNFFISSSDISDMLCLILMSGRTKYDF 388


>gi|224129974|ref|XP_002320717.1| predicted protein [Populus trichocarpa]
 gi|222861490|gb|EEE99032.1| predicted protein [Populus trichocarpa]
          Length = 369

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 262/364 (71%), Positives = 293/364 (80%), Gaps = 35/364 (9%)

Query: 34  LPYL-LSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQ------------ 80
           +P+L  SVLELH+    +  P+K   KFDHL+LGPAAGQ LPNRLQCQ            
Sbjct: 6   IPFLSFSVLELHQNPAAQPPPKKMNTKFDHLVLGPAAGQGLPNRLQCQGTKALNKTHTRS 65

Query: 81  ----------------------DSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPK 118
                                 DSR SN VTVGNASY+K ERSMA+LNVF+NFI+VTMP+
Sbjct: 66  SSNAGESVSFVTVFTVYNTSLADSRLSNFVTVGNASYTKMERSMAVLNVFVNFIKVTMPR 125

Query: 119 SDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHI 178
           S+V ILTDPASDLS+    VT+YPI G+YSRDKLMLQRIRSYITFLE R+ E +Q  GHI
Sbjct: 126 SNVVILTDPASDLSLFGNSVTVYPIQGDYSRDKLMLQRIRSYITFLETRLEELAQNPGHI 185

Query: 179 NHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIF 238
           NHY+FTDSDIAVVDDLGH+F+D+ NFHLALTFRNNK+QPLNSGFIAVRGT D I RAKIF
Sbjct: 186 NHYIFTDSDIAVVDDLGHLFNDHPNFHLALTFRNNKEQPLNSGFIAVRGTTDAILRAKIF 245

Query: 239 LEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPC 298
           L+EVL+VYSSK+M+ASRMLGDQLALAW +KSHP FD RRFTKAQ F+E+I GASVLFLPC
Sbjct: 246 LQEVLKVYSSKFMSASRMLGDQLALAWAIKSHPGFDLRRFTKAQAFLENIGGASVLFLPC 305

Query: 299 ATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRT 358
           ATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF SSSSDI DMLCL+L+SGRT
Sbjct: 306 ATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFLSSSSDIFDMLCLVLLSGRT 365

Query: 359 KYDF 362
           KYDF
Sbjct: 366 KYDF 369


>gi|255562808|ref|XP_002522409.1| conserved hypothetical protein [Ricinus communis]
 gi|223538294|gb|EEF39901.1| conserved hypothetical protein [Ricinus communis]
          Length = 388

 Score =  529 bits (1362), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 267/391 (68%), Positives = 304/391 (77%), Gaps = 46/391 (11%)

Query: 15  MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNR-QKFDHLILGPAAGQ 71
           MR   G RRF+  FFL LV F  ++ SVLELH  SV E  P+KNR +K DHL+LGPAAGQ
Sbjct: 1   MRTWSGWRRFILCFFLLLVIF--HIFSVLELHSNSVTE-APQKNRNKKSDHLVLGPAAGQ 57

Query: 72  RLPNRLQCQ----------------------------------------DSRASNMVTVG 91
            LP+RLQC+                                        D R+SN+VTVG
Sbjct: 58  GLPDRLQCEGSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSIPDDRSSNLVTVG 117

Query: 92  NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 151
           N SYSK ERSMAILNVFINFIQVTMP+S+V ILTDPASDLS+ R  VT+YPI GEYSR+K
Sbjct: 118 NVSYSKMERSMAILNVFINFIQVTMPRSNVIILTDPASDLSLQRYKVTLYPIQGEYSREK 177

Query: 152 LMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 211
           LMLQRI+SYI FL+ +++E ++   H +HY+FTDSDIAVVDDLG IFH+Y NFH+ALTFR
Sbjct: 178 LMLQRIKSYINFLDMKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYPNFHIALTFR 237

Query: 212 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHP 271
           NNK+QPLNSGFIAVRGT + I RAKIFL+ VL VY+SKYMNASRMLGDQLALAWV++SHP
Sbjct: 238 NNKEQPLNSGFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASRMLGDQLALAWVIRSHP 297

Query: 272 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 331
            FD RRF KAQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL
Sbjct: 298 GFDLRRFRKAQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 357

Query: 332 MLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           MLESWNFF S+SDISDMLCLILMSGRTKYDF
Sbjct: 358 MLESWNFFRSASDISDMLCLILMSGRTKYDF 388


>gi|255541282|ref|XP_002511705.1| conserved hypothetical protein [Ricinus communis]
 gi|223548885|gb|EEF50374.1| conserved hypothetical protein [Ricinus communis]
          Length = 388

 Score =  526 bits (1356), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 266/391 (68%), Positives = 304/391 (77%), Gaps = 46/391 (11%)

Query: 15  MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNR-QKFDHLILGPAAGQ 71
           MR   G RRF+  FFL LV F  ++ SVLELH  SV E  P+KNR +K DHL+LGPAAGQ
Sbjct: 1   MRTWSGWRRFILSFFLLLVIF--HIFSVLELHSNSVTE-APQKNRNKKSDHLVLGPAAGQ 57

Query: 72  RLPNRLQCQDSRA----------------------------------------SNMVTVG 91
            LP+RLQC+ S+A                                        SN+VTVG
Sbjct: 58  GLPDRLQCEGSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSLPNDRSSNLVTVG 117

Query: 92  NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 151
           N SYSKTERSMAILNVFINFIQVTMP+S+V ILTDPASDL + R  VT+YPI GEYSR+K
Sbjct: 118 NVSYSKTERSMAILNVFINFIQVTMPQSNVIILTDPASDLLLQRDKVTLYPIQGEYSREK 177

Query: 152 LMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 211
           LMLQRIRSYI FL+ +++E ++   H +HY+FTDSDIAVVDDLG IFH+Y+NFH+ALTFR
Sbjct: 178 LMLQRIRSYINFLDTKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYRNFHIALTFR 237

Query: 212 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHP 271
           NNK+QPLNSGFIAVRGT + I RAKIFL+ VL VY+SKYMNAS+MLGDQLALAWV++SHP
Sbjct: 238 NNKEQPLNSGFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASQMLGDQLALAWVIRSHP 297

Query: 272 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 331
            FD  RF KAQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL
Sbjct: 298 GFDLWRFRKAQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 357

Query: 332 MLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           MLESWNFF S+SDISDMLCLILMSGRTKYDF
Sbjct: 358 MLESWNFFRSASDISDMLCLILMSGRTKYDF 388


>gi|356545145|ref|XP_003541005.1| PREDICTED: uncharacterized protein LOC100785469 [Glycine max]
          Length = 432

 Score =  508 bits (1307), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 252/384 (65%), Positives = 289/384 (75%), Gaps = 36/384 (9%)

Query: 15  MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLP 74
           M+   G  RF+F LPL+F L +L SV ELH  S +E+  ++  +K DHL+LGPAAGQ L 
Sbjct: 49  MKIFSGWHRFVFGLPLIFLLTHLFSVRELHTNSKMEEPRKQLNKKLDHLVLGPAAGQGLS 108

Query: 75  NRLQCQDSRASNMV------------------------------------TVGNASYSKT 98
           NRLQCQ +++ N +                                     VGNASY+K 
Sbjct: 109 NRLQCQGTKSLNRIHSSNSRSGVDGSITFVTVFTIYNSSLNDVDDKSLNTIVGNASYNKF 168

Query: 99  ERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIR 158
            RSMA+LNVFINFIQV M +S V ILTDP SDLS+ R GV++YPI GEYSRDKLMLQRIR
Sbjct: 169 GRSMALLNVFINFIQVAMRQSKVIILTDPVSDLSVQRNGVSLYPIEGEYSRDKLMLQRIR 228

Query: 159 SYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPL 218
           SYITFLE R++  SQ   +I HY+FTDSD+AVVDDLG IFHD+ NFH+ALTFRNNK QPL
Sbjct: 229 SYITFLETRLQNLSQKPKNITHYIFTDSDMAVVDDLGQIFHDHPNFHVALTFRNNKAQPL 288

Query: 219 NSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRF 278
           NSGFIAVRGTP+ I RAK+FL+EVL+VY++KY NASRMLGDQLALAWVVKS P FDA RF
Sbjct: 289 NSGFIAVRGTPEAILRAKLFLQEVLKVYTTKYKNASRMLGDQLALAWVVKSKPHFDASRF 348

Query: 279 TKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF 338
            KA  F EDI G SVLFLPC+ YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF
Sbjct: 349 AKAPAFSEDIGGTSVLFLPCSLYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF 408

Query: 339 FSSSSDISDMLCLILMSGRTKYDF 362
           +SSS ++SDMLCLIL SGRTKYDF
Sbjct: 409 YSSSLEVSDMLCLILGSGRTKYDF 432


>gi|449432261|ref|XP_004133918.1| PREDICTED: uncharacterized protein LOC101215082 [Cucumis sativus]
 gi|449480062|ref|XP_004155788.1| PREDICTED: uncharacterized protein LOC101230110 [Cucumis sativus]
          Length = 387

 Score =  483 bits (1242), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 238/350 (68%), Positives = 271/350 (77%), Gaps = 40/350 (11%)

Query: 53  PRKNRQKFDHLILGPAAGQRLPNRLQC--------------------------------- 79
           P K  +KFDHLILGPA GQ L +RLQC                                 
Sbjct: 38  PDKRSKKFDHLILGPATGQGLSDRLQCSGTKALNNTHLPDTSNSADSGDSIHFVTVFTIY 97

Query: 80  ---QDS----RASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLS 132
              QDS    R++++V VG+ASY+K ERSMA+LNVFINFIQV+MP+S+V ILTDPASDL 
Sbjct: 98  NASQDSKVIGRSTDVVKVGDASYNKVERSMAVLNVFINFIQVSMPQSNVVILTDPASDLP 157

Query: 133 MPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVD 192
           + R  V ++PI GEYSRD LMLQRIRSYI+FL+ ++ E  QG  HINHY+FTDSD+AVV 
Sbjct: 158 VRRNRVAVFPIQGEYSRDTLMLQRIRSYISFLDAKLDEQRQGTTHINHYIFTDSDMAVVG 217

Query: 193 DLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMN 252
           DLG IFH +  FHLALTFRNNK QPLNSGFIAVRGT DGI RAK FLEEVL++YSS++M 
Sbjct: 218 DLGEIFHKHPKFHLALTFRNNKAQPLNSGFIAVRGTEDGIRRAKTFLEEVLKIYSSRFMK 277

Query: 253 ASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQF 312
           ASRMLGDQLALAWVV+S+PSFDAR+F+K + FVE+I GASVLFLPCA YNWTPPEGAGQF
Sbjct: 278 ASRMLGDQLALAWVVRSNPSFDARKFSKPETFVEEINGASVLFLPCALYNWTPPEGAGQF 337

Query: 313 HGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           HGMPL+VKVVHFKGSRKRLMLESWNFF SSS ISDMLCLIL SGRTKYDF
Sbjct: 338 HGMPLNVKVVHFKGSRKRLMLESWNFFQSSSSISDMLCLILSSGRTKYDF 387


>gi|357471691|ref|XP_003606130.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
 gi|355507185|gb|AES88327.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
          Length = 350

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 239/341 (70%), Positives = 266/341 (78%), Gaps = 36/341 (10%)

Query: 58  QKFDHLILGPAAGQRLPNRLQCQDSRASN----------------MVTV----------- 90
           +KFDHL+LGPAAGQ L NRLQCQ S+A N                 VTV           
Sbjct: 10  KKFDHLVLGPAAGQGLSNRLQCQGSKALNRTHSSNGRFGVDGSITFVTVFTIYNSSLNRV 69

Query: 91  ---------GNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIY 141
                    GNASY+K ERSMA+LNVFI+FIQV MP+S+V ILTDP SDLS+ R  V++Y
Sbjct: 70  DDKSSNTFVGNASYNKVERSMAVLNVFIDFIQVVMPQSEVIILTDPVSDLSVHRNRVSLY 129

Query: 142 PIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDY 201
           PI GEYSRDKLMLQRIRSYITFLE R+++ SQ    I HY+FTDSDIAVVDDLG IF D+
Sbjct: 130 PIQGEYSRDKLMLQRIRSYITFLETRLQKLSQNPKDITHYIFTDSDIAVVDDLGQIFRDH 189

Query: 202 QNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQL 261
            NFH+ALTFRNNK QPLNSGFIAV+GTPDGI RAK+FL+EVL+VY SKYM+ASRMLGDQL
Sbjct: 190 PNFHMALTFRNNKAQPLNSGFIAVKGTPDGILRAKLFLQEVLKVYVSKYMSASRMLGDQL 249

Query: 262 ALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKV 321
           ALAWVVKS P FDA RF K   F +DI G S+LFLPCA YNWTPPEGAGQFHGMPLDVKV
Sbjct: 250 ALAWVVKSKPQFDASRFAKTVAFSDDIGGTSILFLPCALYNWTPPEGAGQFHGMPLDVKV 309

Query: 322 VHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           VHFKGSRKRLMLESWNF+SS+ DI+DMLCLIL SGRTKYDF
Sbjct: 310 VHFKGSRKRLMLESWNFYSSTPDIADMLCLILGSGRTKYDF 350


>gi|356517294|ref|XP_003527323.1| PREDICTED: uncharacterized protein LOC100794487 [Glycine max]
          Length = 352

 Score =  472 bits (1215), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 237/352 (67%), Positives = 268/352 (76%), Gaps = 38/352 (10%)

Query: 49  VEDLPRKN-RQKFDHLILGPAAGQRLPNRLQCQDSRASNMV------------------- 88
           +E+ PRK   +K +HL+LGPAAGQ L NRLQCQ ++A N +                   
Sbjct: 1   MEEPPRKQLNKKLNHLVLGPAAGQGLSNRLQCQGTKALNRIHSSNSRSGVDGSITFVTVF 60

Query: 89  -----------------TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDL 131
                             VGNASY+K  RS A+LNVFINFIQV MP+S V ILTDP SDL
Sbjct: 61  TIYNSSLNDVDDKSSNTVVGNASYNKFGRSTALLNVFINFIQVAMPQSKVIILTDPVSDL 120

Query: 132 SMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ-GHINHYVFTDSDIAV 190
           S+ R GV++YPI GEYSRDKLMLQRIRSYITFLE R++  SQ +  +I HY+FTDSDIAV
Sbjct: 121 SVLRNGVSLYPIEGEYSRDKLMLQRIRSYITFLETRLQNLSQKKPKNITHYIFTDSDIAV 180

Query: 191 VDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKY 250
           VDDLG IF D+ NFH+ALTFRNNK QPLNSGFIAVRGTP+ I RAK+FL+EVL+VYS+KY
Sbjct: 181 VDDLGQIFRDHPNFHVALTFRNNKAQPLNSGFIAVRGTPEAILRAKLFLQEVLKVYSTKY 240

Query: 251 MNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAG 310
            NASRMLGDQLALAWVVKS P FDA RF KA  F EDI G SV+FLPC+ YNWTPPEGAG
Sbjct: 241 RNASRMLGDQLALAWVVKSKPHFDASRFGKALAFSEDIGGTSVVFLPCSLYNWTPPEGAG 300

Query: 311 QFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           QFHGMPLD KVVHFKGSRKRLMLESWNF+SSS ++SDMLCLIL SGRTKYDF
Sbjct: 301 QFHGMPLDAKVVHFKGSRKRLMLESWNFYSSSLEVSDMLCLILGSGRTKYDF 352


>gi|297828243|ref|XP_002882004.1| hypothetical protein ARALYDRAFT_903971 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327843|gb|EFH58263.1| hypothetical protein ARALYDRAFT_903971 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 391

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 238/385 (61%), Positives = 283/385 (73%), Gaps = 40/385 (10%)

Query: 15  MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
           MR+C G RR L  +P++F LP+L S+L +   S   +  R    +K DHL+LGP AGQ L
Sbjct: 10  MRSCSGWRRILLLIPVLFLLPHLSSLLGISSDSAKRNDARTIPNKKLDHLVLGPVAGQGL 69

Query: 74  PNRLQCQDSRA-----------------------------------SNMVT-VGNASYSK 97
            +RL C+ ++A                                   SNMV+ VGN +YSK
Sbjct: 70  SDRLHCRGTKALNKTHGSSHVSGAGNGVSFVTVFTVYNTSLGNAKSSNMVSVVGNVTYSK 129

Query: 98  TERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRI 157
            ERSMA+LN F  FIQVTMPKS+V ILTDPASDLS+ +  V + P+ G+YSR  LMLQRI
Sbjct: 130 PERSMAVLNAFAYFIQVTMPKSNVVILTDPASDLSIQQSNVMVQPVQGDYSRGNLMLQRI 189

Query: 158 RSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQP 217
           RSYITFLE ++ ++   +G INHY+FTDSDIAVVDD+  IF  + +FHLALTFRNNKDQP
Sbjct: 190 RSYITFLEMKLEKN---EGGINHYIFTDSDIAVVDDIRAIFDKHPSFHLALTFRNNKDQP 246

Query: 218 LNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARR 277
           LNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL WVVKSHPSFDA+R
Sbjct: 247 LNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVWVVKSHPSFDAKR 306

Query: 278 FTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWN 337
           FTK Q F ++I GASVLFLPC  YNWTPPEGAGQFHGMPLDVK+VHFKGSRKRLMLE+WN
Sbjct: 307 FTKPQAFTQEIAGASVLFLPCVLYNWTPPEGAGQFHGMPLDVKIVHFKGSRKRLMLEAWN 366

Query: 338 FFSSSSDISDMLCLILMSGRTKYDF 362
           F+ S+S+I DMLCL+L SGRTKYDF
Sbjct: 367 FYKSTSNIPDMLCLVLGSGRTKYDF 391


>gi|30689992|ref|NP_850432.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330255444|gb|AEC10538.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 392

 Score =  459 bits (1182), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 235/386 (60%), Positives = 280/386 (72%), Gaps = 41/386 (10%)

Query: 15  MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
           MR+C G RR L  +P++F LP+L S+++    S   +  R    +K DHL+LGP AGQ L
Sbjct: 10  MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69

Query: 74  PNRLQCQDSRASN-------------------------------------MVTVGNASYS 96
            +R  C+ ++A N                                     +  VGN +YS
Sbjct: 70  SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLGNVKSSNPVSVVGNVTYS 129

Query: 97  KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 156
           K ERSMA+LN F NFIQVTMPKS+V ILTDPASDLS+ +  V + P+ G+YSR  LMLQR
Sbjct: 130 KPERSMAVLNAFANFIQVTMPKSNVVILTDPASDLSIQQSNVILQPVQGDYSRGNLMLQR 189

Query: 157 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ 216
           IRSYITFLE ++ ++   +G INHY+FTDSDIAVVDD+G IF  + +FHLALTFRNNKDQ
Sbjct: 190 IRSYITFLEMKLEKN---EGGINHYIFTDSDIAVVDDVGTIFDKHSSFHLALTFRNNKDQ 246

Query: 217 PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDAR 276
           PLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL  VVKSH SFDA+
Sbjct: 247 PLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVSVVKSHASFDAK 306

Query: 277 RFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESW 336
           RFTK Q F E+I GASVLFLPCA YNWTPPEGAGQFHGMPLDVK+VHFKGSRKRLMLE+W
Sbjct: 307 RFTKPQAFTEEIAGASVLFLPCALYNWTPPEGAGQFHGMPLDVKIVHFKGSRKRLMLEAW 366

Query: 337 NFFSSSSDISDMLCLILMSGRTKYDF 362
           NF+ S+S+I DMLCL+L SGRTKYDF
Sbjct: 367 NFYKSTSNIPDMLCLVLGSGRTKYDF 392


>gi|147854152|emb|CAN83830.1| hypothetical protein VITISV_003973 [Vitis vinifera]
          Length = 321

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 216/283 (76%), Positives = 238/283 (84%), Gaps = 18/283 (6%)

Query: 81  DSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTI 140
           D R+S++VTVGNASYSK ERSMAILNVFINFIQ TMP+S+V ILTDPAS+ S+ R  VTI
Sbjct: 56  DGRSSDLVTVGNASYSKMERSMAILNVFINFIQATMPQSNVIILTDPASEFSLHRDRVTI 115

Query: 141 YPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD 200
           YPI GEYSRDKLMLQRIRSYI FLE ++ EHSQG GHINHY+FTDSDIAVVDDLG IF  
Sbjct: 116 YPIQGEYSRDKLMLQRIRSYIVFLETKLEEHSQGHGHINHYIFTDSDIAVVDDLGQIFQS 175

Query: 201 YQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFL-EEVLRVYSSKYMNASRMLGD 259
           + NFH+ALTFRNNK+QPL                 K ++  +VL+VYSS++MNASRMLGD
Sbjct: 176 HPNFHVALTFRNNKEQPL-----------------KFWIYSKVLKVYSSRFMNASRMLGD 218

Query: 260 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 319
           QLALAWVVKSHP FD +RF+K Q F+EDI G SVLFLPCA YNWTPPEGAGQFHGMPLDV
Sbjct: 219 QLALAWVVKSHPYFDTKRFSKPQAFLEDIGGTSVLFLPCAIYNWTPPEGAGQFHGMPLDV 278

Query: 320 KVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           KVVHFKGSRKRLMLESWNFF SSSDISDMLCLILMSGRTKYDF
Sbjct: 279 KVVHFKGSRKRLMLESWNFFISSSDISDMLCLILMSGRTKYDF 321


>gi|294462546|gb|ADE76819.1| unknown [Picea sitchensis]
          Length = 391

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 214/390 (54%), Positives = 266/390 (68%), Gaps = 44/390 (11%)

Query: 17  ACGGCRRFLFFLPLVFFLPYLLSVLEL----HEKSVVEDLPRKNRQKFDHLILGPAAGQR 72
           A  G  RF+ FLP +  LP++ S  +L    + K   +      R+KFD+++LGPAAGQ 
Sbjct: 2   ASSGKWRFIRFLPFILILPFIFSGFQLSRLQNSKPKGDGSVGVGRKKFDYIVLGPAAGQG 61

Query: 73  LPNRLQCQ---------------------------------------DSRASNMVTVGNA 93
           LPNR+QCQ                                       D + S  V+VGN+
Sbjct: 62  LPNRIQCQGLKAVKRRPLPSFHLSLVKEKISFVTVFTIYNQSLQISFDQKVSTNVSVGNS 121

Query: 94  SYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLM 153
           +Y KT+RSMAILNVF NFI+V MP+S++FILTDPAS+  +      +  I G+YSR+ LM
Sbjct: 122 TYDKTQRSMAILNVFANFIKVAMPRSNIFILTDPASNFPVVPSNAVVMHIPGDYSRNNLM 181

Query: 154 LQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 213
           LQRI+SYI FLE R+  H   Q  ++H++FTDSDIAVVDDLG +  +Y +FH+ LTFRNN
Sbjct: 182 LQRIKSYIDFLEARLSGHIGKQNQVDHFIFTDSDIAVVDDLGDVVENYPDFHIGLTFRNN 241

Query: 214 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 273
           KDQPLNSGFI VRGT + +S+AK FLEEVL +Y S +M A+RMLGDQLALAW+VK+ P F
Sbjct: 242 KDQPLNSGFILVRGTDEAVSKAKAFLEEVLEIYKSMFMKAARMLGDQLALAWIVKNQPLF 301

Query: 274 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLML 333
           DA+RF   + FV ++  A VLFLPCA YNWTPPEGAGQFHGMP DVKV+HFKGSRKRLM+
Sbjct: 302 DAQRFRNPKAFVAEVHRAQVLFLPCAIYNWTPPEGAGQFHGMPEDVKVIHFKGSRKRLMM 361

Query: 334 ESWNFFSSSS-DISDMLCLILMSGRTKYDF 362
           ESWNFF+S   D SDM+CLIL SGR KYDF
Sbjct: 362 ESWNFFNSHPVDFSDMMCLILKSGRVKYDF 391


>gi|293336758|ref|NP_001169994.1| uncharacterized protein LOC100383899 precursor [Zea mays]
 gi|224032791|gb|ACN35471.1| unknown [Zea mays]
 gi|413924636|gb|AFW64568.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
          Length = 388

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 216/374 (57%), Positives = 262/374 (70%), Gaps = 43/374 (11%)

Query: 30  LVFFLPYLLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRA- 84
           L+F +P + SV  L     EK V    P   ++  DHL+LGPAAGQ  P+RLQC+  RA 
Sbjct: 17  LLFLVPLIYSVSRLQPWAPEKGVCLPPPTAPKRP-DHLVLGPAAGQDRPDRLQCRGLRAL 75

Query: 85  ------------------------------------SNMVTVGNASYSKTERSMAILNVF 108
                                               S+ VTVGN+SYSK ERSMAILN F
Sbjct: 76  NKIGISSEENYSGEHVSFATVFTTYNSVSAGDDNVPSDSVTVGNSSYSKIERSMAILNTF 135

Query: 109 INFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRI 168
           I+FI+V+MP+SD+ ILTDP S +S+ +   T+ P+ G YSR  LMLQRI++YI FLE+++
Sbjct: 136 ISFIKVSMPRSDLIILTDPGSKISVNQGTATLLPVEGNYSRGNLMLQRIKTYIAFLEQKL 195

Query: 169 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 228
            E  + +  +NH+V TDSDIAVV DLGHIF  Y +FHLA+TFRNNK QPLNSGF+AVRGT
Sbjct: 196 VEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRNNKGQPLNSGFVAVRGT 254

Query: 229 PDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDI 288
            DGI+ A  FL++VL  YS +YM ASRMLGDQLALAWVVKSH      +F+K + F  ++
Sbjct: 255 RDGITNAVEFLKQVLGTYSLRYMKASRMLGDQLALAWVVKSHLPSAFGKFSKNEAFTGEV 314

Query: 289 IGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDM 348
            G SVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF+SS+S +SDM
Sbjct: 315 NGTSVLFLPCAVYNWTPPEGAGQFHGVPLDVKVVHFKGSRKRLMLEAWNFYSSTSKLSDM 374

Query: 349 LCLILMSGRTKYDF 362
           LCLIL SGRTKYDF
Sbjct: 375 LCLILRSGRTKYDF 388


>gi|413954355|gb|AFW87004.1| hypothetical protein ZEAMMB73_846695 [Zea mays]
          Length = 414

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 206/347 (59%), Positives = 250/347 (72%), Gaps = 38/347 (10%)

Query: 53  PRKNRQKFDHLILGPAAGQRLPNRLQCQDSRASNM------------------------- 87
           P    ++ DHL+LGPAAGQ  P+RLQC+  RA N                          
Sbjct: 69  PPTAPKRPDHLVLGPAAGQGRPDRLQCRGLRALNKIGLSSEENYSGEHVSFVTVFTTYNS 128

Query: 88  ------------VTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR 135
                       VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S  S+ +
Sbjct: 129 VSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVNQ 188

Query: 136 KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLG 195
              T+ PI G YSR  LMLQRI++YI FLE+++ E  + +  +NH+V TDSDIAVVDDLG
Sbjct: 189 GSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLG 247

Query: 196 HIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASR 255
           HIF  Y +FHLA+TFRNNK QPLNSGF+AVRGT DGI++A  FL++VL+ YS +Y+ A+R
Sbjct: 248 HIFEKYPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITKAAEFLKQVLKAYSLRYIKAAR 307

Query: 256 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 315
           MLGDQLALAWVVKSH      +F+K + F  ++ G SVLFLPCA YNWTPPEGAGQFHG+
Sbjct: 308 MLGDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGI 367

Query: 316 PLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 368 PLDVKVVHFKGSRKRLMLEAWNFYNSTSKMSDMLCLILRSGRTKYDF 414


>gi|413954354|gb|AFW87003.1| hypothetical protein ZEAMMB73_846695 [Zea mays]
          Length = 389

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 206/347 (59%), Positives = 250/347 (72%), Gaps = 38/347 (10%)

Query: 53  PRKNRQKFDHLILGPAAGQRLPNRLQCQDSRASNM------------------------- 87
           P    ++ DHL+LGPAAGQ  P+RLQC+  RA N                          
Sbjct: 44  PPTAPKRPDHLVLGPAAGQGRPDRLQCRGLRALNKIGLSSEENYSGEHVSFVTVFTTYNS 103

Query: 88  ------------VTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR 135
                       VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S  S+ +
Sbjct: 104 VSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVNQ 163

Query: 136 KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLG 195
              T+ PI G YSR  LMLQRI++YI FLE+++ E  + +  +NH+V TDSDIAVVDDLG
Sbjct: 164 GSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLG 222

Query: 196 HIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASR 255
           HIF  Y +FHLA+TFRNNK QPLNSGF+AVRGT DGI++A  FL++VL+ YS +Y+ A+R
Sbjct: 223 HIFEKYPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITKAAEFLKQVLKAYSLRYIKAAR 282

Query: 256 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 315
           MLGDQLALAWVVKSH      +F+K + F  ++ G SVLFLPCA YNWTPPEGAGQFHG+
Sbjct: 283 MLGDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGI 342

Query: 316 PLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 343 PLDVKVVHFKGSRKRLMLEAWNFYNSTSKMSDMLCLILRSGRTKYDF 389


>gi|357471693|ref|XP_003606131.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
 gi|355507186|gb|AES88328.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
          Length = 251

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 196/251 (78%), Positives = 218/251 (86%)

Query: 112 IQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREH 171
           +QV MP+S+V ILTDP SDLS+ R  V++YPI GEYSRDKLMLQRIRSYITFLE R+++ 
Sbjct: 1   MQVVMPQSEVIILTDPVSDLSVHRNRVSLYPIQGEYSRDKLMLQRIRSYITFLETRLQKL 60

Query: 172 SQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDG 231
           SQ    I HY+FTDSDIAVVDDLG IF D+ NFH+ALTFRNNK QPLNSGFIAV+GTPDG
Sbjct: 61  SQNPKDITHYIFTDSDIAVVDDLGQIFRDHPNFHMALTFRNNKAQPLNSGFIAVKGTPDG 120

Query: 232 ISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGA 291
           I RAK+FL+EVL+VY SKYM+ASRMLGDQLALAWVVKS P FDA RF K   F +DI G 
Sbjct: 121 ILRAKLFLQEVLKVYVSKYMSASRMLGDQLALAWVVKSKPQFDASRFAKTVAFSDDIGGT 180

Query: 292 SVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCL 351
           S+LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF+SS+ DI+DMLCL
Sbjct: 181 SILFLPCALYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFYSSTPDIADMLCL 240

Query: 352 ILMSGRTKYDF 362
           IL SGRTKYDF
Sbjct: 241 ILGSGRTKYDF 251


>gi|255541260|ref|XP_002511694.1| conserved hypothetical protein [Ricinus communis]
 gi|223548874|gb|EEF50363.1| conserved hypothetical protein [Ricinus communis]
          Length = 554

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 220/382 (57%), Positives = 252/382 (65%), Gaps = 97/382 (25%)

Query: 29  PLVFFLPYLL-------SVLELHEKSVVEDLPRKN-RQKFDHLILGPAAGQRLPNRLQCQ 80
           PL+F + YL        SVLELH  SV E  P+KN  +K DHL++GPAAGQ LP+RLQC+
Sbjct: 222 PLLFCVMYLACYTMHVASVLELHWNSVTE-APQKNWNKKSDHLVIGPAAGQGLPDRLQCE 280

Query: 81  ----------------------------------------DSRASNMVTVGNASYSKTER 100
                                                   D R+SN+VTVGN SYSKTER
Sbjct: 281 GSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSLPDDRSSNLVTVGNVSYSKTER 340

Query: 101 SMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSY 160
           SMAILNVFINFIQ                                               
Sbjct: 341 SMAILNVFINFIQ----------------------------------------------- 353

Query: 161 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNS 220
             FL+ +++E ++   H +HY+FTDSDIAVVDDLG IFH+Y NFH+ALTFRNNK+QPLNS
Sbjct: 354 -NFLDTKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYPNFHIALTFRNNKEQPLNS 412

Query: 221 GFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTK 280
           GFIAVRGT + I RAKIFL+ VL VY+SKYMNASRMLGDQLALAWV++SHP FD +RF K
Sbjct: 413 GFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASRMLGDQLALAWVIRSHPGFDLQRFRK 472

Query: 281 AQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFS 340
           AQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRK LMLESWNFF 
Sbjct: 473 AQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKHLMLESWNFFR 532

Query: 341 SSSDISDMLCLILMSGRTKYDF 362
           S+SDISDMLCLILMSGRTKYDF
Sbjct: 533 SASDISDMLCLILMSGRTKYDF 554


>gi|357124057|ref|XP_003563723.1| PREDICTED: uncharacterized protein LOC100833864 [Brachypodium
           distachyon]
          Length = 391

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 205/339 (60%), Positives = 242/339 (71%), Gaps = 38/339 (11%)

Query: 61  DHLILGPAAGQRLPNRLQCQDSRASN---------------------------------- 86
           D L+LGPAAGQ  P+RLQCQ  +A N                                  
Sbjct: 54  DRLVLGPAAGQGRPDRLQCQGLKAVNKIILSSETTHYGERVSFVTVFTTYNSDPDKASKM 113

Query: 87  ---MVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPI 143
              +VTVGN SYSK ERS+A+LN FI+FIQV+MP+S+V ILTDP S+LS+ +    I PI
Sbjct: 114 SSGLVTVGNHSYSKVERSIAVLNTFISFIQVSMPRSNVIILTDPKSNLSIDQGNAVILPI 173

Query: 144 HGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN 203
            G YSR  LMLQRI+SYI FLE +  E  Q      H+VFTDSDIAVV+ LGHIF  Y +
Sbjct: 174 EGNYSRGNLMLQRIKSYIAFLELKFVEL-QRVDRFTHFVFTDSDIAVVEGLGHIFKRYPH 232

Query: 204 FHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLAL 263
            HLALTFRNN  QPLNSGF+AVRGT DGIS+A  F +EVL+ Y+SKYM ASRMLGDQLAL
Sbjct: 233 CHLALTFRNNNGQPLNSGFVAVRGTSDGISKATEFFKEVLKAYNSKYMKASRMLGDQLAL 292

Query: 264 AWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVH 323
           AWVVKS+      +F++ + F  ++ GAS+LFLPCA YNWTPPEGAGQFHGMPLDVKV+H
Sbjct: 293 AWVVKSYLPSAFGKFSRHEEFTGEVNGASILFLPCAVYNWTPPEGAGQFHGMPLDVKVIH 352

Query: 324 FKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           FKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 353 FKGSRKRLMLEAWNFYNSTSHLSDMLCLILKSGRTKYDF 391


>gi|242093388|ref|XP_002437184.1| hypothetical protein SORBIDRAFT_10g022560 [Sorghum bicolor]
 gi|241915407|gb|EER88551.1| hypothetical protein SORBIDRAFT_10g022560 [Sorghum bicolor]
          Length = 377

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/367 (57%), Positives = 248/367 (67%), Gaps = 55/367 (14%)

Query: 37  LLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRA-------- 84
           + SV  LH    EK V    P   ++  D L+LGPAAGQ  P+RLQCQ  RA        
Sbjct: 25  IYSVSRLHPWVPEKGVCLPPPTAPKRP-DRLVLGPAAGQGRPDRLQCQGLRALNKIGLSS 83

Query: 85  -----------------------------SNMVTVGNASYSKTERSMAILNVFINFIQVT 115
                                        S+ VTVGN SYSK ERSMAILN FI+FI+V+
Sbjct: 84  EEIYSGEHISFVTVFTTYNSVSAGDGNVPSDSVTVGNHSYSKIERSMAILNTFISFIKVS 143

Query: 116 MPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ 175
           MP+S+V ILTDP S +S+ +   T+ PI G YSR  LMLQRI++YI              
Sbjct: 144 MPRSNVIILTDPGSKISVNQGSATLLPIEGNYSRGNLMLQRIQTYI-------------D 190

Query: 176 GHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRA 235
           G +  +  TDSDIAVVDDLGHIF  Y + HLALTFRNNK QPLNSGF+AVRGT DGI++A
Sbjct: 191 GGVESFFLTDSDIAVVDDLGHIFKKYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGITKA 250

Query: 236 KIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLF 295
             FL++VL  YSS+Y+ ASRMLGDQLALAWVVKSH      +F+K + F  ++ GASVLF
Sbjct: 251 VEFLKQVLGAYSSRYIKASRMLGDQLALAWVVKSHLPSAFGKFSKHEAFTGEVNGASVLF 310

Query: 296 LPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMS 355
           LPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL S
Sbjct: 311 LPCAVYNWTPPEGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKLSDMLCLILRS 370

Query: 356 GRTKYDF 362
           GRTKYDF
Sbjct: 371 GRTKYDF 377


>gi|242060516|ref|XP_002451547.1| hypothetical protein SORBIDRAFT_04g003580 [Sorghum bicolor]
 gi|241931378|gb|EES04523.1| hypothetical protein SORBIDRAFT_04g003580 [Sorghum bicolor]
          Length = 346

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 190/289 (65%), Positives = 232/289 (80%), Gaps = 2/289 (0%)

Query: 75  NRLQCQDSRA-SNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSM 133
           N +   D +  S+ VTVGN SYSKTERSMAIL+ FI+FI+V+MP+S+V ILTDP S +S+
Sbjct: 59  NSVSAGDGKVPSDSVTVGNHSYSKTERSMAILSTFISFIRVSMPRSNVIILTDPGSKISV 118

Query: 134 PRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDD 193
            +   T+ PI G YSR  LMLQRI++YI FLE+++ E    +G +NH+V TDSDIA+VDD
Sbjct: 119 NQGSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDSMEG-LNHFVLTDSDIALVDD 177

Query: 194 LGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNA 253
           LGHIF  Y + HLALTFRNNK QPLNSGF+AVRGT DGI++A  FL++VL  Y  +Y+ A
Sbjct: 178 LGHIFKKYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGITKAVEFLKQVLEAYCLRYIKA 237

Query: 254 SRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFH 313
           SRMLGDQLALAWVVKSH      +F+K + F  ++ GASVLFLPCA YNWTPPEGAGQFH
Sbjct: 238 SRMLGDQLALAWVVKSHLPSAFGKFSKHEAFTGEVNGASVLFLPCAVYNWTPPEGAGQFH 297

Query: 314 GMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           G+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLC+IL SGRTKYDF
Sbjct: 298 GIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKLSDMLCIILRSGRTKYDF 346


>gi|218198407|gb|EEC80834.1| hypothetical protein OsI_23435 [Oryza sativa Indica Group]
          Length = 344

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 196/296 (66%), Positives = 223/296 (75%), Gaps = 13/296 (4%)

Query: 67  PAAGQRLPNRLQCQDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 126
           PA   +LP          SN+VTVG  SYSK  RSMAILN FI FIQV+MP+S+V ILTD
Sbjct: 62  PAEASKLP----------SNVVTVGKHSYSKVGRSMAILNTFIGFIQVSMPRSNVIILTD 111

Query: 127 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 186
           P S L+       I PI G YSR  LMLQRIRSYI FLE+R+ E    +  INH +FTDS
Sbjct: 112 PNSKLT--HGSAVILPIEGNYSRGNLMLQRIRSYIAFLEQRLEELETVED-INHLIFTDS 168

Query: 187 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 246
           DIAVV DLGHIF  Y + HLALTFRNNK QPLNSGF+AVRGT DGI +A  F +EVL  Y
Sbjct: 169 DIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGIFKAIEFFKEVLEAY 228

Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
             KYM ASRMLGDQLALAWVVKS+      +F+K + F  ++ G S+LFLPCA YNWTPP
Sbjct: 229 HLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKHEAFTGEVNGTSILFLPCAVYNWTPP 288

Query: 307 EGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           EGAGQFHGMPLDVKVVHFKGSRKRLMLE+WNF++S+S++SDMLCLIL SGRTKYDF
Sbjct: 289 EGAGQFHGMPLDVKVVHFKGSRKRLMLEAWNFYNSTSELSDMLCLILRSGRTKYDF 344


>gi|222635777|gb|EEE65909.1| hypothetical protein OsJ_21755 [Oryza sativa Japonica Group]
          Length = 344

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 195/296 (65%), Positives = 222/296 (75%), Gaps = 13/296 (4%)

Query: 67  PAAGQRLPNRLQCQDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 126
           PA   +LP          SN+VTVG  SYSK  RSMAILN FI FIQV+MP+S+V ILTD
Sbjct: 62  PAEASKLP----------SNVVTVGKHSYSKVGRSMAILNTFIGFIQVSMPRSNVIILTD 111

Query: 127 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 186
           P S L+       I PI G YSR  LM QRIRSYI FLE+R+ E    +  INH +FTDS
Sbjct: 112 PNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIAFLEQRLEELETVE-DINHLIFTDS 168

Query: 187 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 246
           DIAVV DLGHIF  Y + HLALTFRNNK QPLNSGF+AVRGT DGI +A  F +EVL  Y
Sbjct: 169 DIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGIFKAIEFFKEVLEAY 228

Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
             KYM ASRMLGDQLALAWVVKS+      +F+K + F  ++ G S+LFLPCA YNWTPP
Sbjct: 229 YLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKHEAFTGEVNGTSILFLPCAVYNWTPP 288

Query: 307 EGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           EGAGQFHGMPLDVKVVHFKGSRKRLMLE+WNF++S+S++SDMLCLIL SGRTKYDF
Sbjct: 289 EGAGQFHGMPLDVKVVHFKGSRKRLMLEAWNFYNSTSELSDMLCLILRSGRTKYDF 344


>gi|224101407|ref|XP_002334278.1| predicted protein [Populus trichocarpa]
 gi|222870580|gb|EEF07711.1| predicted protein [Populus trichocarpa]
          Length = 274

 Score =  365 bits (936), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 194/330 (58%), Positives = 221/330 (66%), Gaps = 62/330 (18%)

Query: 34  LPYL-LSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRASNMVTVGN 92
           +P+L  SVLELH+    +  P+K   KFDHL+LGPAAGQ LPNRLQCQ            
Sbjct: 6   IPFLSFSVLELHQNPAAQPPPKKMNTKFDHLVLGPAAGQGLPNRLQCQGD---------- 55

Query: 93  ASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKL 152
                   S+ I   ++ F QVTMP+S+V ILTDPASDLS+ R  VT+YPI G+YSRDKL
Sbjct: 56  --------SVQIHFSYVCF-QVTMPQSNVVILTDPASDLSLHRNSVTVYPIQGDYSRDKL 106

Query: 153 MLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRN 212
           MLQRIRSYITFLE R+ + +Q  G I+HY+ TDSDIAVVDDLGH+F+D+  F    TFR+
Sbjct: 107 MLQRIRSYITFLETRLEKLAQNPGPISHYILTDSDIAVVDDLGHLFNDHPTFTRLFTFRD 166

Query: 213 NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPS 272
           NK+QPLNSGFIAV GT D I R                                      
Sbjct: 167 NKEQPLNSGFIAVWGTADAILR-------------------------------------- 188

Query: 273 FDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 332
               RFTKAQ F+E+I G SVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM
Sbjct: 189 ----RFTKAQAFLENIGGTSVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 244

Query: 333 LESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           LESWNF SSSSDI  MLCL+L SGRTKYDF
Sbjct: 245 LESWNFLSSSSDIFGMLCLVLSSGRTKYDF 274


>gi|223949095|gb|ACN28631.1| unknown [Zea mays]
 gi|413924639|gb|AFW64571.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
          Length = 209

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 148/210 (70%), Positives = 174/210 (82%), Gaps = 1/210 (0%)

Query: 153 MLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRN 212
           MLQRI++YI FLE+++ E  + +  +NH+V TDSDIAVV DLGHIF  Y +FHLA+TFRN
Sbjct: 1   MLQRIKTYIAFLEQKLVEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRN 59

Query: 213 NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPS 272
           NK QPLNSGF+AVRGT DGI+ A  FL++VL  YS +YM ASRMLGDQLALAWVVKSH  
Sbjct: 60  NKGQPLNSGFVAVRGTRDGITNAVEFLKQVLGTYSLRYMKASRMLGDQLALAWVVKSHLP 119

Query: 273 FDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 332
               +F+K + F  ++ G SVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLM
Sbjct: 120 SAFGKFSKNEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGVPLDVKVVHFKGSRKRLM 179

Query: 333 LESWNFFSSSSDISDMLCLILMSGRTKYDF 362
           LE+WNF+SS+S +SDMLCLIL SGRTKYDF
Sbjct: 180 LEAWNFYSSTSKLSDMLCLILRSGRTKYDF 209


>gi|2583122|gb|AAB82631.1| hypothetical protein [Arabidopsis thaliana]
          Length = 304

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 170/346 (49%), Positives = 204/346 (58%), Gaps = 89/346 (25%)

Query: 15  MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
           MR+C G RR L  +P++F LP+L S+++    S   +  R    +K DHL+LGP AGQ L
Sbjct: 10  MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69

Query: 74  PNRLQCQDSRASN-------------------------------------MVTVGNASYS 96
            +R  C+ ++A N                                     +  VGN +YS
Sbjct: 70  SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLGNVKSSNPVSVVGNVTYS 129

Query: 97  KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 156
           K ERSMA+LN F NFIQ                                           
Sbjct: 130 KPERSMAVLNAFANFIQ------------------------------------------- 146

Query: 157 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ 216
                TFLE ++ ++   +G INHY+FTDSDIAVVDD+G IF  + +FHLALTFRNNKDQ
Sbjct: 147 -----TFLEMKLEKN---EGGINHYIFTDSDIAVVDDVGTIFDKHSSFHLALTFRNNKDQ 198

Query: 217 PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDAR 276
           PLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL  VVKSH SFDA+
Sbjct: 199 PLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVSVVKSHASFDAK 258

Query: 277 RFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVV 322
           RFTK Q F E+I GASVLFLPCA YNWTPPEGAGQFHGMPLDVKV+
Sbjct: 259 RFTKPQAFTEEIAGASVLFLPCALYNWTPPEGAGQFHGMPLDVKVL 304


>gi|54291155|dbj|BAD61827.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|54291236|dbj|BAD61931.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 288

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 152/254 (59%), Positives = 170/254 (66%), Gaps = 27/254 (10%)

Query: 67  PAAGQRLPNRLQCQDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 126
           PA   +LP          SN+VTVG  SYSK  RSMAILN FI FIQV+MP+S+V ILTD
Sbjct: 62  PAEASKLP----------SNVVTVGKHSYSKVGRSMAILNTFIGFIQVSMPRSNVIILTD 111

Query: 127 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 186
           P S L+       I PI G YSR  LM QRIRSYI FLE+R+ E    +  INH +FTDS
Sbjct: 112 PNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIAFLEQRLEELETVED-INHLIFTDS 168

Query: 187 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 246
           DIAVV DLGHIF  Y + HLALTFRNNK QPLNSGF+AVRGT DGI +A  F +EVL  Y
Sbjct: 169 DIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGIFKAIEFFKEVLEAY 228

Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
             KYM ASRMLGDQLALAWVVKS+      +F+K + F                YNWTPP
Sbjct: 229 YLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKHEAF--------------TVYNWTPP 274

Query: 307 EGAGQFHGMPLDVK 320
           EGAGQFHGMPLDVK
Sbjct: 275 EGAGQFHGMPLDVK 288


>gi|302769952|ref|XP_002968395.1| hypothetical protein SELMODRAFT_65657 [Selaginella moellendorffii]
 gi|300164039|gb|EFJ30649.1| hypothetical protein SELMODRAFT_65657 [Selaginella moellendorffii]
          Length = 280

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 131/280 (46%), Positives = 185/280 (66%), Gaps = 7/280 (2%)

Query: 89  TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR--KGVTIYPIHGE 146
            VG       E+  ++L VF+   ++ MP S   +LTDPA+ +S  R   G++   + G 
Sbjct: 2   VVGGRVLRGLEKGYSVLRVFVESARLAMPNSQQLVLTDPAAAISTERLPAGISFQRVPGN 61

Query: 147 YSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHL 206
           YSR  LMLQR+ SYI FL+ +I++  +    + H++F DSD+ VV DLG +F ++ +F +
Sbjct: 62  YSRGNLMLQRLDSYIAFLDDQIKQVGKADS-LQHFIFADSDMIVVGDLGCVFLEFPSFDV 120

Query: 207 ALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWV 266
           ALTFRNNK+QP+NSG I VRG+ DG+++ K+ L+ V+  Y   +  ASRM+GDQLA AWV
Sbjct: 121 ALTFRNNKEQPINSGMIFVRGSKDGLAKGKLLLQSVVDSYRRDFFRASRMMGDQLAFAWV 180

Query: 267 VKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKG 326
           V+         F + + F   + G  VLFLPC++YNWTP EGAGQFHGMPLDVK +HFKG
Sbjct: 181 VRHFADPLEDSFKQGKVFKSQVKGVEVLFLPCSSYNWTPAEGAGQFHGMPLDVKAIHFKG 240

Query: 327 SRKRLMLESWNF----FSSSSDISDMLCLILMSGRTKYDF 362
           SRKRLMLE+W+      +++ D+  + C +L SGR+KYDF
Sbjct: 241 SRKRLMLEAWDSHKHQVAATKDLLPLQCFVLKSGRSKYDF 280


>gi|302774282|ref|XP_002970558.1| hypothetical protein SELMODRAFT_65658 [Selaginella moellendorffii]
 gi|300162074|gb|EFJ28688.1| hypothetical protein SELMODRAFT_65658 [Selaginella moellendorffii]
          Length = 331

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/282 (47%), Positives = 189/282 (67%), Gaps = 11/282 (3%)

Query: 89  TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR--KGVTIYPIHGE 146
            VG       E+  ++L VF+   ++ MP S   +LTDPA+ +S  R   G++   + G 
Sbjct: 53  VVGGRVLRGLEKGYSVLRVFVESARLAMPNSQQLVLTDPAAVISTERLPAGISFQRVPGN 112

Query: 147 YSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHL 206
           YSR  LMLQR+ SYI FL+ +I++  +    + H++F DSD+ VV DLG +F ++ +F +
Sbjct: 113 YSRGNLMLQRLDSYIAFLDDQIKQVGKAD-SLQHFIFADSDMIVVGDLGCVFLEFPSFDV 171

Query: 207 ALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWV 266
           ALTFRNNK+QP+NSG I VRG+ DG+++ K+ L+ V+  Y   +  ASRM+GDQLA AWV
Sbjct: 172 ALTFRNNKEQPINSGMIFVRGSKDGLAKGKLLLQSVVDSYRRDFFRASRMMGDQLAFAWV 231

Query: 267 VK--SHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHF 324
           V+  S P  D+  F + + F   + G  VLFLPC++YNWTP EGAGQFHGMPLDVK +HF
Sbjct: 232 VRHFSDPLEDS--FKQGKVFKSQVKGVEVLFLPCSSYNWTPAEGAGQFHGMPLDVKAIHF 289

Query: 325 KGSRKRLMLESWNF----FSSSSDISDMLCLILMSGRTKYDF 362
           KGSRKRLMLE+W+      +++ D+  + C +L SGR+KYDF
Sbjct: 290 KGSRKRLMLEAWDSHKHQVAATKDLLPLQCFVLKSGRSKYDF 331


>gi|413938025|gb|AFW72576.1| hypothetical protein ZEAMMB73_448315 [Zea mays]
          Length = 450

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 122/184 (66%), Positives = 148/184 (80%), Gaps = 1/184 (0%)

Query: 160 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 219
           +  FLE+++ E  + +  +NH+V TDSDIAVVDDLGHIF  Y +FHLA+TFRNNK QPLN
Sbjct: 262 FTAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIFEKYPHFHLAVTFRNNKGQPLN 320

Query: 220 SGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFT 279
           SGF+AVRGT DGI++A  FL++VL  YS +Y+ ASRMLGDQLALAWVVK H      +F+
Sbjct: 321 SGFVAVRGTSDGITKAVEFLKQVLGTYSLRYIKASRMLGDQLALAWVVKFHLPSALGKFS 380

Query: 280 KAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFF 339
           K + F  ++ G SVLFLPCA YNWT PEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF+
Sbjct: 381 KHEAFTGEVNGTSVLFLPCAVYNWTQPEGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFY 440

Query: 340 SSSS 343
           +S S
Sbjct: 441 NSFS 444


>gi|414866083|tpg|DAA44640.1| TPA: putative serine/threonine protein phosphatase superfamily
           protein [Zea mays]
          Length = 470

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 118/183 (64%), Positives = 143/183 (78%), Gaps = 1/183 (0%)

Query: 138 VTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHI 197
            T+ PI G YSR  LMLQRI++YI FLE+++ E  + +  +NH+V TDSDIAVVDDLGHI
Sbjct: 210 ATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHI 268

Query: 198 FHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRML 257
           F  Y +FHLA+TF NNK QPLNSGF+AVRGT DGI++A  FL++VL  YS +Y+ ASRML
Sbjct: 269 FEKYPHFHLAVTFCNNKGQPLNSGFVAVRGTRDGITKAVEFLKQVLGTYSLRYIKASRML 328

Query: 258 GDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPL 317
           GDQLALAWVVKSH      +F+K + F  ++ G SVLFLPC  YNWTPPEGAGQFHG+PL
Sbjct: 329 GDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCVVYNWTPPEGAGQFHGIPL 388

Query: 318 DVK 320
           DVK
Sbjct: 389 DVK 391


>gi|168008832|ref|XP_001757110.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691608|gb|EDQ77969.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 121/271 (44%), Positives = 170/271 (62%), Gaps = 7/271 (2%)

Query: 97  KTERSMAILNVFINFIQ-VTMP-KSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLML 154
           K  R  A+L  F+  IQ V+MP  + V I+T+         + +   P    +SR  LM+
Sbjct: 40  KKSRQDAVLRAFLESIQQVSMPGTTRVTIITNHNKLRGELPQDIDWKPTSRHFSRRNLMI 99

Query: 155 QRIRSYITFLERRIREHSQGQGH-INHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 213
           QR++SYI  L+  I +        ++H +F+D D+ VVDDLG +F ++ +F +A TFRNN
Sbjct: 100 QRLQSYIELLDSMIEDRKNNSSSPVSHAIFSDFDMIVVDDLGCVFKEFPHFDIAFTFRNN 159

Query: 214 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 273
           + QP+NSG I VRGT   +SRA   L+EV+++Y +K+ +A  +LGDQLALA +VK   + 
Sbjct: 160 QRQPINSGVIMVRGTFGSLSRATQLLKEVVKIYLAKFRHAFGVLGDQLALADIVKG--TL 217

Query: 274 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLML 333
            AR F +  P    ++    LFLPC  YNWTPPEGAGQF GMP +VKV+HFKG RKRLM+
Sbjct: 218 QARAFQEGVPVEATVMTTKTLFLPCVIYNWTPPEGAGQFQGMPTEVKVLHFKGRRKRLMI 277

Query: 334 ESWNFFSSSS--DISDMLCLILMSGRTKYDF 362
           ++W F+      D   M CL+L SGR+KYD+
Sbjct: 278 QAWYFYKKQGVLDFYKMKCLVLKSGRSKYDY 308


>gi|413924637|gb|AFW64569.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
 gi|413924638|gb|AFW64570.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
          Length = 260

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 122/246 (49%), Positives = 156/246 (63%), Gaps = 43/246 (17%)

Query: 30  LVFFLPYLLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRA- 84
           L+F +P + SV  L     EK V    P   ++  DHL+LGPAAGQ  P+RLQC+  RA 
Sbjct: 17  LLFLVPLIYSVSRLQPWAPEKGVCLPPPTAPKRP-DHLVLGPAAGQDRPDRLQCRGLRAL 75

Query: 85  ------------------------------------SNMVTVGNASYSKTERSMAILNVF 108
                                               S+ VTVGN+SYSK ERSMAILN F
Sbjct: 76  NKIGISSEENYSGEHVSFATVFTTYNSVSAGDDNVPSDSVTVGNSSYSKIERSMAILNTF 135

Query: 109 INFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRI 168
           I+FI+V+MP+SD+ ILTDP S +S+ +   T+ P+ G YSR  LMLQRI++YI FLE+++
Sbjct: 136 ISFIKVSMPRSDLIILTDPGSKISVNQGTATLLPVEGNYSRGNLMLQRIKTYIAFLEQKL 195

Query: 169 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 228
            E  + +  +NH+V TDSDIAVV DLGHIF  Y +FHLA+TFRNNK QPLNSGF+AVRGT
Sbjct: 196 VEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRNNKGQPLNSGFVAVRGT 254

Query: 229 PDGISR 234
            DGI++
Sbjct: 255 RDGITK 260


>gi|255562810|ref|XP_002522410.1| hypothetical protein RCOM_0835860 [Ricinus communis]
 gi|223538295|gb|EEF39902.1| hypothetical protein RCOM_0835860 [Ricinus communis]
          Length = 202

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 103/130 (79%), Positives = 112/130 (86%), Gaps = 6/130 (4%)

Query: 233 SRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGAS 292
            RAKIFL+ VL VY+SKYMNASR      ALAWV++SHP FD RRF KAQ F++++ GAS
Sbjct: 79  CRAKIFLQHVLEVYTSKYMNASR------ALAWVIRSHPGFDLRRFHKAQAFMDEMGGAS 132

Query: 293 VLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLI 352
            LFLPCA YNWTPPEGAGQFH MPLDVKVVHFKGSRKRLMLESWNFF S+SDISDMLCLI
Sbjct: 133 ALFLPCAIYNWTPPEGAGQFHRMPLDVKVVHFKGSRKRLMLESWNFFRSASDISDMLCLI 192

Query: 353 LMSGRTKYDF 362
           LMSGRTKYDF
Sbjct: 193 LMSGRTKYDF 202


>gi|414587497|tpg|DAA38068.1| TPA: hypothetical protein ZEAMMB73_303828 [Zea mays]
          Length = 258

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 109/216 (50%), Positives = 140/216 (64%), Gaps = 35/216 (16%)

Query: 53  PRKNRQKFDHLILGPAAGQRLPNR----------LQCQDSRAS----------------- 85
           P    ++ DHL+LGPAAGQ  P+R          L  +++ +                  
Sbjct: 44  PPTAPKRPDHLVLGPAAGQGRPDRRRLRALNKIGLSSEENYSGEHVPFVTVFTTYNSVSA 103

Query: 86  -------NMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGV 138
                  + VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S  S+ +   
Sbjct: 104 GDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVNQGSA 163

Query: 139 TIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIF 198
           T+ PI G YSR  LMLQRI++YI FLE+++ E  + +  +NH+V TDSDIAVVDDLGHIF
Sbjct: 164 TLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIF 222

Query: 199 HDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISR 234
               +FHLA+TFRNNK QPLNSGF+AVRGT DGI++
Sbjct: 223 EKNPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITK 258


>gi|357520759|ref|XP_003630668.1| hypothetical protein MTR_8g102040 [Medicago truncatula]
 gi|355524690|gb|AET05144.1| hypothetical protein MTR_8g102040 [Medicago truncatula]
          Length = 105

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 82/129 (63%), Positives = 94/129 (72%), Gaps = 25/129 (19%)

Query: 234 RAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASV 293
           + K+FL+EVL+VY SKYM+ ++ L                         PF +DI G S+
Sbjct: 2   QGKLFLQEVLKVYVSKYMSVAKTL-------------------------PFSDDIGGTSI 36

Query: 294 LFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLIL 353
           LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF+SS+ DI+DMLCLIL
Sbjct: 37  LFLPCALYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFYSSTPDIADMLCLIL 96

Query: 354 MSGRTKYDF 362
            SGRTKYDF
Sbjct: 97  GSGRTKYDF 105


>gi|51968566|dbj|BAD42975.1| hypothetical protein [Arabidopsis thaliana]
          Length = 200

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 81/185 (43%), Positives = 105/185 (56%), Gaps = 38/185 (20%)

Query: 15  MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
           MR+C G RR L  +P++F LP+L S+++    S   +  R    +K DHL+LGP AGQ L
Sbjct: 10  MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69

Query: 74  PNRLQCQDSRASN-------------------------------------MVTVGNASYS 96
            +R  C+ ++A N                                     +  VGN +YS
Sbjct: 70  SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLGNVKSSNPVSVVGNVTYS 129

Query: 97  KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 156
           K ERSMA+LN F NFIQVTMPKS+V ILTDPASDLS+ +  V + P+ G+YSR  LMLQR
Sbjct: 130 KPERSMAVLNAFANFIQVTMPKSNVVILTDPASDLSIQQSNVILQPVQGDYSRGNLMLQR 189

Query: 157 IRSYI 161
           IRSYI
Sbjct: 190 IRSYI 194


>gi|413942906|gb|AFW75555.1| hypothetical protein ZEAMMB73_119492 [Zea mays]
          Length = 177

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 61/84 (72%), Positives = 67/84 (79%), Gaps = 1/84 (1%)

Query: 256 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 315
           MLGDQLALAWVVK H      +F+K + F  ++ G SVLFLPCA YNWT PEGAGQFHG+
Sbjct: 1   MLGDQLALAWVVKFHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTSPEGAGQFHGI 60

Query: 316 PLDVKVVHFKGSRKRLMLES-WNF 338
           PLDVKVVHFKGSRKRLMLE  WNF
Sbjct: 61  PLDVKVVHFKGSRKRLMLERLWNF 84


>gi|356575160|ref|XP_003555710.1| PREDICTED: uncharacterized protein LOC100806135 [Glycine max]
          Length = 146

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 54/75 (72%), Positives = 60/75 (80%)

Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
           S+KY NASRMLGDQLALA V  S P FD  +F KA  F EDI G+S+LFLPC+ YNWT P
Sbjct: 23  STKYRNASRMLGDQLALASVEMSKPHFDTSKFAKALAFSEDIGGSSILFLPCSMYNWTLP 82

Query: 307 EGAGQFHGMPLDVKV 321
           EGAGQFHGMPLDVK+
Sbjct: 83  EGAGQFHGMPLDVKI 97


>gi|414867606|tpg|DAA46163.1| TPA: hypothetical protein ZEAMMB73_544883 [Zea mays]
          Length = 379

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/75 (64%), Positives = 62/75 (82%), Gaps = 1/75 (1%)

Query: 160 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 219
           +  FLE+++ E  + +  +NH+V TDSDIAVVDDLGHIF  Y +FHLA+TFRNNK+QPLN
Sbjct: 306 FTAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIFEKYPHFHLAVTFRNNKEQPLN 364

Query: 220 SGFIAVRGTPDGISR 234
           SGF+AVRGT DGI++
Sbjct: 365 SGFVAVRGTRDGITK 379


>gi|414886764|tpg|DAA62778.1| TPA: putative homeodomain-like transcription factor superfamily
           protein [Zea mays]
          Length = 2379

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 51/88 (57%), Positives = 63/88 (71%), Gaps = 1/88 (1%)

Query: 75  NRLQCQDSRAS-NMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSM 133
           N +   D   S + VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S  S+
Sbjct: 102 NSVSAGDGNVSPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSV 161

Query: 134 PRKGVTIYPIHGEYSRDKLMLQRIRSYI 161
            +   T+ PI G YSR  LMLQRI++YI
Sbjct: 162 NQGSATLLPIEGNYSRGNLMLQRIKTYI 189


>gi|255562818|ref|XP_002522414.1| conserved hypothetical protein [Ricinus communis]
 gi|223538299|gb|EEF39906.1| conserved hypothetical protein [Ricinus communis]
          Length = 205

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 5/97 (5%)

Query: 15  MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQR 72
           MR   G RRF+  FFL LV F  ++ SVLEL+  SV E L +   +K  HL+LGPAA Q 
Sbjct: 1   MRTWSGRRRFILCFFLLLVIF--HIFSVLELYSNSVTEALQKNRNKKSYHLVLGPAASQG 58

Query: 73  LPNRLQCQDSRASNMV-TVGNASYSKTERSMAILNVF 108
           LPNRLQC+ S+A N    + ++S S    ++A + VF
Sbjct: 59  LPNRLQCEGSKALNKTHLLDSSSDSNVRDNVAFVTVF 95


>gi|67920764|ref|ZP_00514283.1| hypothetical protein CwatDRAFT_5283 [Crocosphaera watsonii WH 8501]
 gi|67856881|gb|EAM52121.1| hypothetical protein CwatDRAFT_5283 [Crocosphaera watsonii WH 8501]
          Length = 316

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 111/257 (43%), Gaps = 24/257 (9%)

Query: 84  ASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPI 143
           A  +    N  +      + ++N+    + +  P     +LTD  + L+     + +Y  
Sbjct: 16  AKKIYNQDNKDFRNDYNYILLINLLFRSVSIFHPNCRKVVLTDMNTRLAGLEDDIEVY-- 73

Query: 144 HGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN 203
                 + +M  R+ +   +++ +     Q    I   +  DSD+ V  +L H+F   ++
Sbjct: 74  RTSLDPESIMFSRLVAQFNYVKTQ-----QIDSDI---ILIDSDMLVNANLEHLFE--ED 123

Query: 204 FHLALTFR---NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM-NASRMLGD 259
           F +ALT+R     KD P+N G I +  + D    A  FLE+V ++Y  KY+ +     GD
Sbjct: 124 FSVALTYRYLEAVKDMPINGGIIFL--SRDRKQEAIKFLEKVYQIYQEKYLKDYQSWSGD 181

Query: 260 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 319
           Q AL   +     FD   F   Q  V  +    +  L C  YN++P            D 
Sbjct: 182 QYALIDAI----GFD--NFNSRQSDVMLVDEQKIKLLDCEIYNFSPDRNPNSIVREHKDK 235

Query: 320 KVVHFKGSRKRLMLESW 336
            ++HFKGSRK++M   W
Sbjct: 236 VILHFKGSRKKIMPLYW 252


>gi|416379625|ref|ZP_11683920.1| hypothetical protein CWATWH0003_0757 [Crocosphaera watsonii WH
           0003]
 gi|357265857|gb|EHJ14567.1| hypothetical protein CWATWH0003_0757 [Crocosphaera watsonii WH
           0003]
          Length = 316

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 111/257 (43%), Gaps = 24/257 (9%)

Query: 84  ASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPI 143
           A  +    N  +      + ++N+    + +  P     +LTD  + L+     + +Y  
Sbjct: 16  AKKIYNQDNKDFRNDYNYILLINLLFRSVSIFHPNCRKVVLTDMNTRLAGLEDDIEVY-- 73

Query: 144 HGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN 203
                 + +M  R+ +   +++ +     Q    I   +  DSD+ V  +L H+F   ++
Sbjct: 74  RTSLDPESIMFSRLVAQFNYVKTQ-----QIDSDI---ILIDSDMLVNANLEHLFE--ED 123

Query: 204 FHLALTFR---NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM-NASRMLGD 259
           F +ALT+R     KD P+N G I +  + D    A  FLE+V ++Y  KY+ +     GD
Sbjct: 124 FSVALTYRYLEAVKDMPINGGIIFL--SRDRKQEAIKFLEKVYQIYQEKYLKDYQSWWGD 181

Query: 260 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 319
           Q AL   +     FD   F   Q  V  +    +  L C  YN++P            D 
Sbjct: 182 QYALIDAI----GFD--NFHSRQSDVMLVDEQKIKLLDCEIYNFSPGRNPNSIVREHKDK 235

Query: 320 KVVHFKGSRKRLMLESW 336
            ++HFKGSRK++M   W
Sbjct: 236 VILHFKGSRKKIMPLYW 252


>gi|297620616|ref|YP_003708753.1| hypothetical protein wcw_0375 [Waddlia chondrophila WSU 86-1044]
 gi|297375917|gb|ADI37747.1| hypothetical protein wcw_0375 [Waddlia chondrophila WSU 86-1044]
 gi|337292759|emb|CCB90764.1| putative uncharacterized protein [Waddlia chondrophila 2032/99]
          Length = 259

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 22/164 (13%)

Query: 182 VFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD-----QPLNSGFIAVRGTPDGISRAK 236
           VF D D+   + +  +F   +NF LAL +R + +      P+N GFI +   P+G ++A 
Sbjct: 106 VFCDYDLLFQESIELLFK--ENFDLALIYRKSFEGGLHPAPINGGFIGIH--PEGFTKAI 161

Query: 237 IFLEEVLRVYSSKYMNASRMLGDQLALAWVV---KSHPSFDARRFTKAQPFVEDIIGASV 293
            FLE V   Y   Y       G Q +L  ++   K H +F      +         GA +
Sbjct: 162 NFLETVHSCYLENYSEYKEWGGFQSSLNKLLVPKKVHNAFPNHLIYE---------GAEI 212

Query: 294 LFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWN 337
             LP + YN+   E  G++     D K++HFKG RK +M   WN
Sbjct: 213 ALLPSSEYNYAI-EAQGEWVDFKPDKKILHFKGPRKEVMANYWN 255


>gi|384252921|gb|EIE26396.1| hypothetical protein COCSUDRAFT_39504 [Coccomyxa subellipsoidea
           C-169]
          Length = 333

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 111/263 (42%), Gaps = 60/263 (22%)

Query: 108 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR--------IRS 159
           FI+ ++ + P   + +LTD  + + +P   V +Y     Y+ D+  L R          +
Sbjct: 50  FISALRRSNPGCTIVVLTDQGTQIELP-PDVRLY----RYAIDRSKLGRNPYANYYQYLA 104

Query: 160 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 219
            I+FL+  +   ++G       VF D DI V+D L  +F +   F  A+T  +  D P+N
Sbjct: 105 QISFLQHLM---AKGLAQSMDVVFLDMDILVIDSLAEVFKEGPGFDYAVTLSDAVDMPVN 161

Query: 220 SG--FIAVRGTPDGISRAKIFLEEVLRVY--SSKYMNASRMLGDQLALAWVVKSHPSFDA 275
            G  F+     P  ++    FLE+VL VY  +  +++    LG+ + L +        D 
Sbjct: 162 IGMQFVHHGRYPGAVA----FLEDVLAVYPFNETFVSGQVALGNLIGLRY-------NDE 210

Query: 276 RRFTKAQPFVEDIIGA----------SVLFLPCATYNWTPPEGAGQFHGMPLD------- 318
           +  T  +  V D   A          SV FLPC  YN+     AGQ      D       
Sbjct: 211 QLLTHYKSAVRDRSSAKQVRGRHSVHSVRFLPCMRYNYC---HAGQSCCTDPDRLPVSVT 267

Query: 319 ---------VKVVHFKGSRKRLM 332
                    VKV+HF G RK+ +
Sbjct: 268 TEEELTDTRVKVLHFVGHRKKAL 290


>gi|412992455|emb|CCO18435.1| unknown protein [Bathycoccus prasinos]
          Length = 401

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/238 (26%), Positives = 109/238 (45%), Gaps = 36/238 (15%)

Query: 117 PKSDVFILTDPASDLSMPRKGVTIYPIH---GEYSRDK-----LMLQRIRSYITFLERRI 168
           P + V ++TD  +++ M + G+    +H   G   R K     LML+R++ Y  F+ +R 
Sbjct: 156 PGTCVALITDEETEIDMSKPGMDKVQLHRFEGILDRTKIGTGALMLERMKLYNAFI-KRA 214

Query: 169 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 228
           R++          +  D+DI  V D+  +F   +NF   +T R+NK  P+  G   V+  
Sbjct: 215 RDNDWNA----DLLMVDTDIVFVGDVSDLFQ-TRNFDYGVTIRDNKAYPVQGG---VQFV 266

Query: 229 PDG--ISRAKIFLEEVLRVYSSKYMNASR---MLGDQLALAWVVKSHPSFDARRFTKAQ- 282
           P G  +  AK F +  L ++ S    + +     GDQ A    +   P+   +   K + 
Sbjct: 267 PKGKYVGAAK-FSDHTLDLWKSDLEKSGKEAGFTGDQAAYQRGLNV-PASKVQSLAKGKK 324

Query: 283 ----PFVEDIIGASVL---FLPCATYNWTPPEGAGQFHGM-PLDVKVVHFKGSRKRLM 332
               P V    GA V+    +P   YN+ P   +G   G+   D++++H+KG +K  M
Sbjct: 325 VIDLPVVCGSSGAEVVTVRMIPGDQYNFVP---SGNGQGLKKKDIRILHYKGGKKEGM 379


>gi|297606040|ref|NP_001057915.2| Os06g0571300 [Oryza sativa Japonica Group]
 gi|255677159|dbj|BAF19829.2| Os06g0571300, partial [Oryza sativa Japonica Group]
          Length = 73

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/58 (55%), Positives = 38/58 (65%), Gaps = 2/58 (3%)

Query: 107 VFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFL 164
           +FI  +QV+MP+S+V ILTDP S L+       I PI G YSR  LM QRIRSYI  L
Sbjct: 4   LFIEPLQVSMPRSNVIILTDPNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIVSL 59


>gi|384252787|gb|EIE26262.1| hypothetical protein COCSUDRAFT_64411 [Coccomyxa subellipsoidea
           C-169]
          Length = 477

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 96/247 (38%), Gaps = 32/247 (12%)

Query: 108 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERR 167
           FI+ ++ + P   V +LTD A+ + +P     +        R KL      +Y  +L + 
Sbjct: 190 FISTLRRSHPGCTVAVLTDQATQIDLP---ADVRLFRFTIDRSKLGRNPYANYYQYLAQI 246

Query: 168 I---REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIA 224
               +  ++G  H    VF D D  VVD +  +F     F   LT  +  D P+N   I 
Sbjct: 247 AFMKQLAAEGLEHSTDVVFLDMDALVVDSIAEVFGQGAQFDYGLTLSDATDMPVN---IG 303

Query: 225 VRGTPDG-ISRAKIFLEEVLRVY--SSKYMNASRMLGDQLALAWVVKSHPSFDARRFT-- 279
           ++  P G    A  FL++V+ +Y  +S +      L D L      K  P     R    
Sbjct: 304 IQFVPRGRYGSAIAFLQDVIAIYPFNSTFTAGQEALTDLLGF----KDDPEEVLSRVNIS 359

Query: 280 -KAQPFVEDIIGASVLFLPCATYNWTP------------PEGAGQFHGMPLD-VKVVHFK 325
            +     + + G +V    C  YN+              P     F  +    VKV+HF 
Sbjct: 360 VQEGRTCQQVGGRTVCLFTCMRYNYCHVDQSCCTDPARLPVSLTSFDDLAAARVKVLHFV 419

Query: 326 GSRKRLM 332
           G RK+ +
Sbjct: 420 GHRKKAL 426


>gi|381204432|ref|ZP_09911503.1| hypothetical protein SclubJA_02255, partial [SAR324 cluster
           bacterium JCVI-SC AAA005]
          Length = 176

 Score = 45.4 bits (106), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 17/122 (13%)

Query: 105 LNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTI-YPIHGEYSRDKLMLQRIRSYITF 163
           LN+  + ++   P++D+++LTD  S  S       I Y +   Y     +L R +++  F
Sbjct: 40  LNLMFSSVKRIYPEADLYVLTDTKSKFSENTISKLIRYDLDSRYP----ILARNKAWYKF 95

Query: 164 LERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFI 223
           LE+  +            +F DSDI + D+   +     +F +A TFR+ K  P+N G I
Sbjct: 96  LEKTDKST----------IFLDSDILINDNFDELMS--VDFDIAFTFRDWKKWPINLGII 143

Query: 224 AV 225
            V
Sbjct: 144 YV 145


>gi|182419850|ref|ZP_02951090.1| conserved hypothetical protein [Clostridium butyricum 5521]
 gi|237666660|ref|ZP_04526645.1| YfnD [Clostridium butyricum E4 str. BoNT E BL5262]
 gi|182376398|gb|EDT73980.1| conserved hypothetical protein [Clostridium butyricum 5521]
 gi|237657859|gb|EEP55414.1| YfnD [Clostridium butyricum E4 str. BoNT E BL5262]
          Length = 313

 Score = 41.2 bits (95), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 36/166 (21%), Positives = 70/166 (42%), Gaps = 25/166 (15%)

Query: 162 TFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ----- 216
            FLE  I ++S       +Y   D+D+    ++  IF++  N  + LT   N ++     
Sbjct: 89  VFLEYIINKYSDAV----YYAHVDADLFFFSNIDSIFNENSNASIFLTDHRNSEEFMHYY 144

Query: 217 ----PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA-WVVKSHP 271
                 N+GF+  + T +G +  K++ +  L+  +++Y   ++  GDQ  +  W+     
Sbjct: 145 ELSGQFNTGFVGFKNTDEGKAAIKLWGDRCLKRCTAEYDTINKTFGDQRYVEDWI----D 200

Query: 272 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPL 317
            F      K+       IGA+V F     Y ++  +     +  PL
Sbjct: 201 IFKDVHVVKS-------IGANVAFWNVKNYEFSKVDDLIYVNNKPL 239


>gi|440753032|ref|ZP_20932235.1| hypothetical protein O53_1407 [Microcystis aeruginosa TAIHU98]
 gi|440177525|gb|ELP56798.1| hypothetical protein O53_1407 [Microcystis aeruginosa TAIHU98]
          Length = 311

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 13/109 (11%)

Query: 156 RIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD 215
           ++R        R R  S   G + H++F DSDI V+D L  +F  Y N  L   +     
Sbjct: 75  KVRGIDDIHNHRFRIFSIFWGPLEHFIFLDSDIIVLDSLQELFRTYINSELEFMY----- 129

Query: 216 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA 264
                     RG  D + +   F ++++R Y +   NA   +  + A +
Sbjct: 130 --------YYRGIFDQVYKEGEFRDKMIREYRANGFNAGSFISSRGAFS 170


>gi|425436184|ref|ZP_18816622.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389679139|emb|CCH92045.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
          Length = 311

 Score = 37.7 bits (86), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 13/109 (11%)

Query: 156 RIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD 215
           ++R        R R  S   G + H++F DSDI V+D L  +F  Y N  L   +     
Sbjct: 75  KVRGIDDIHNHRFRIFSIFWGPLEHFIFLDSDIIVLDSLQELFRTYINSELEFMY----- 129

Query: 216 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA 264
                     RG  D + +   F ++++R + +   NA   L  + A +
Sbjct: 130 --------YYRGIFDQVYKEGEFRDKMIREHRANGFNAGSFLSSRGAFS 170


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.325    0.140    0.424 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,632,447,918
Number of Sequences: 23463169
Number of extensions: 230695629
Number of successful extensions: 577097
Number of sequences better than 100.0: 55
Number of HSP's better than 100.0 without gapping: 41
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 576977
Number of HSP's gapped (non-prelim): 75
length of query: 362
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 218
effective length of database: 8,980,499,031
effective search space: 1957748788758
effective search space used: 1957748788758
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)