BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 035921
         (305 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225444981|ref|XP_002282754.1| PREDICTED: uncharacterized protein LOC100249262 [Vitis vinifera]
          Length = 304

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 207/298 (69%), Positives = 243/298 (81%), Gaps = 5/298 (1%)

Query: 7   AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
            ++V V  IL++ST   +S+C+KKPV  AR++D+PYI CQVCEKLA+QLYHQV+KKQAQI
Sbjct: 8   TIIVLVFSILVISTWLQVSHCSKKPVGVARRDDIPYIKCQVCEKLAAQLYHQVEKKQAQI 67

Query: 67  SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
           SPKKISEFEII I+ENVCNLKKEEADWIL+IDIVEQGDKL LVEQ++EGQCNSECKTIER
Sbjct: 68  SPKKISEFEIIEISENVCNLKKEEADWILRIDIVEQGDKLKLVEQDSEGQCNSECKTIER 127

Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
           ACQE+MGYSDTD AEY+Y +KP I SLVNYLCKDL+KACS  PPPV K+R PGEPFVPK 
Sbjct: 128 ACQEVMGYSDTDAAEYVYKNKPQIDSLVNYLCKDLTKACSASPPPVPKDRTPGEPFVPKS 187

Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLG 246
            KEAEME+IM+SMEGMPGAPGM+MYS+EDLMNM+NF    GD++ D+DDDDE  FP  LG
Sbjct: 188 LKEAEMEKIMKSMEGMPGAPGMKMYSREDLMNMQNF----GDEDADDDDDDEPHFPSKLG 243

Query: 247 KVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTE 304
           K  R KES   DWK  +T GI  TG  LK+ A++VSN +++WW G K   SKK  K  
Sbjct: 244 KNLRAKESTKPDWKHTVTKGIKETGQALKRHASRVSNQIRQWWRG-KITSSKKNSKAH 300


>gi|297738696|emb|CBI27941.3| unnamed protein product [Vitis vinifera]
          Length = 301

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 207/298 (69%), Positives = 243/298 (81%), Gaps = 5/298 (1%)

Query: 7   AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
            ++V V  IL++ST   +S+C+KKPV  AR++D+PYI CQVCEKLA+QLYHQV+KKQAQI
Sbjct: 5   TIIVLVFSILVISTWLQVSHCSKKPVGVARRDDIPYIKCQVCEKLAAQLYHQVEKKQAQI 64

Query: 67  SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
           SPKKISEFEII I+ENVCNLKKEEADWIL+IDIVEQGDKL LVEQ++EGQCNSECKTIER
Sbjct: 65  SPKKISEFEIIEISENVCNLKKEEADWILRIDIVEQGDKLKLVEQDSEGQCNSECKTIER 124

Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
           ACQE+MGYSDTD AEY+Y +KP I SLVNYLCKDL+KACS  PPPV K+R PGEPFVPK 
Sbjct: 125 ACQEVMGYSDTDAAEYVYKNKPQIDSLVNYLCKDLTKACSASPPPVPKDRTPGEPFVPKS 184

Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLG 246
            KEAEME+IM+SMEGMPGAPGM+MYS+EDLMNM+NF    GD++ D+DDDDE  FP  LG
Sbjct: 185 LKEAEMEKIMKSMEGMPGAPGMKMYSREDLMNMQNF----GDEDADDDDDDEPHFPSKLG 240

Query: 247 KVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTE 304
           K  R KES   DWK  +T GI  TG  LK+ A++VSN +++WW G K   SKK  K  
Sbjct: 241 KNLRAKESTKPDWKHTVTKGIKETGQALKRHASRVSNQIRQWWRG-KITSSKKNSKAH 297


>gi|356498081|ref|XP_003517882.1| PREDICTED: uncharacterized protein LOC100778421 [Glycine max]
          Length = 303

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 205/283 (72%), Positives = 241/283 (85%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
           P+S+C+KKPV  ARKED+PYI CQVCEKLA +LY QVQ KQA+I+PKKISE++II IAEN
Sbjct: 21  PLSHCSKKPVGVARKEDIPYIKCQVCEKLAKELYQQVQNKQAEIAPKKISEYQIIEIAEN 80

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
           VCNLKK EADWIL+IDIVE+ D+L LVEQ++EGQCNSECKTIERACQE++GYSDTDVAEY
Sbjct: 81  VCNLKKVEADWILRIDIVEKEDRLELVEQDSEGQCNSECKTIERACQEVIGYSDTDVAEY 140

Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
           LY SKPDI SL NYLCKDLSKACS+KPPPV K+R PGEPFV K +KEAEME++++SMEGM
Sbjct: 141 LYNSKPDIDSLRNYLCKDLSKACSSKPPPVPKDRAPGEPFVAKSSKEAEMEKLLKSMEGM 200

Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
           PGAPGM+MYS++DLMNMKNFGGEDGD+ D++DDDDE+ FP  LG V R KES   DWKQ 
Sbjct: 201 PGAPGMKMYSRDDLMNMKNFGGEDGDEEDEDDDDDEASFPSKLGNVLRAKESGKRDWKQM 260

Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           +  GI +    LKK ANKVSNH+++WW G +   SKK GKTEL
Sbjct: 261 VKKGIEDASMTLKKHANKVSNHIRQWWRGKRRTNSKKGGKTEL 303


>gi|356520856|ref|XP_003529076.1| PREDICTED: uncharacterized protein LOC100782313 [Glycine max]
          Length = 300

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 204/283 (72%), Positives = 240/283 (84%), Gaps = 1/283 (0%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
           P S+C+KKPV  ARKED+ YI CQVCEKLA +LY QVQ KQA+I+PKKISE++II IAEN
Sbjct: 19  PFSHCSKKPVGVARKEDIQYIKCQVCEKLAKELYQQVQNKQAEIAPKKISEYQIIEIAEN 78

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
           VCNLKK EADWIL+IDIVE+ D+L LVEQ++EGQCNSECKTIERACQE++GYSDTDVAEY
Sbjct: 79  VCNLKKAEADWILRIDIVEKEDRLELVEQDSEGQCNSECKTIERACQEVIGYSDTDVAEY 138

Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
           LY+SKPDI SL NYLCKDL+ ACSTKPPPV K+R PGEPFV K +KEAEME++++SMEGM
Sbjct: 139 LYSSKPDIDSLRNYLCKDLTNACSTKPPPVPKDRAPGEPFVAKSSKEAEMEKLLKSMEGM 198

Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
           PGAPGM+MYS++DLMNMKNFG EDGDD D++D+++ S FP  LGKV R KES+  DWKQ 
Sbjct: 199 PGAPGMKMYSRDDLMNMKNFGDEDGDDEDEDDNEEAS-FPSKLGKVLRAKESEKRDWKQV 257

Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           I  GI +T   LKK ANKVSNH+++WW G K   SKK GKTEL
Sbjct: 258 IKKGIEDTSMTLKKHANKVSNHIRQWWRGKKITNSKKGGKTEL 300


>gi|449528651|ref|XP_004171317.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101206912
           [Cucumis sativus]
          Length = 300

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 201/286 (70%), Positives = 235/286 (82%), Gaps = 6/286 (2%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISE--FEIIGIA 80
           P   C+KKP A ARKED+PYI CQVCEKLA++LYHQV+KKQA+I+PKK S     II IA
Sbjct: 18  PTFECSKKPSAAARKEDIPYIKCQVCEKLAAELYHQVEKKQAEIAPKKYSSVVIXIIEIA 77

Query: 81  ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
           ENVCNLKK EADWIL+IDIVEQGDKL LV+Q  EGQCNSECKTIERACQE+MGYSDTDVA
Sbjct: 78  ENVCNLKKAEADWILQIDIVEQGDKLELVDQNTEGQCNSECKTIERACQEVMGYSDTDVA 137

Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
           EYLY+SKP I SLVNYLCKDL+K+C+TKPPPV K+R PGEPFV K +KEAEME++MRSME
Sbjct: 138 EYLYSSKPKIDSLVNYLCKDLTKSCTTKPPPVPKDRTPGEPFVAKSSKEAEMEKMMRSME 197

Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
           GMPGAPGM+MYS++DLMNMKNFGGE   D+D+E++++E +FP NLGKV REKE K+ DWK
Sbjct: 198 GMPGAPGMKMYSRDDLMNMKNFGGE---DDDEEEEEEEENFPSNLGKVLREKERKNNDWK 254

Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSG-IKAARSKKPGKTEL 305
            RIT G+   G+ LK+ A KVSN V+ WW       +S KP K EL
Sbjct: 255 NRITKGVSKAGEALKQHAYKVSNKVRHWWRAKTGGLKSSKPTKQEL 300


>gi|357501185|ref|XP_003620881.1| hypothetical protein MTR_6g091780 [Medicago truncatula]
 gi|355495896|gb|AES77099.1| hypothetical protein MTR_6g091780 [Medicago truncatula]
          Length = 301

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/300 (67%), Positives = 249/300 (83%), Gaps = 2/300 (0%)

Query: 7   AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
            V  ++ V +++++  P+S+CAKKPV  ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3   GVFWFLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62

Query: 67  SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
           SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E ++EGQCNSECKT+ER
Sbjct: 63  SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEHDSEGQCNSECKTVER 122

Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
           ACQE+MGYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV K 
Sbjct: 123 ACQEVMGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFVAKS 182

Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFPKNL 245
           +KEAEME++++SMEGMPGAPGM+MYS++DLM  KNFG E+  DD D+++++DE+DFP  L
Sbjct: 183 SKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFPSKL 241

Query: 246 GKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           GK+ + KE++ GDWKQ+I  GI++T   LKK A KVSNH+QRWW G K   SKK  K+EL
Sbjct: 242 GKILKSKENEKGDWKQKIRKGIVDTSTTLKKHATKVSNHIQRWWKGKKTTSSKKNSKSEL 301


>gi|449435830|ref|XP_004135697.1| PREDICTED: uncharacterized protein LOC101206912 [Cucumis sativus]
          Length = 294

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 199/284 (70%), Positives = 232/284 (81%), Gaps = 8/284 (2%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
           P   C+KKP A ARKED+PYI CQVCEKLA++LYHQV+KKQA+I+PKKISE++II IAEN
Sbjct: 18  PTFECSKKPSAAARKEDIPYIKCQVCEKLAAELYHQVEKKQAEIAPKKISEYQIIEIAEN 77

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
           VCNLKK EADWIL+IDIVEQGDKL LV+Q  EGQCNSECKTIERACQE+MGYSDTDVAEY
Sbjct: 78  VCNLKKAEADWILQIDIVEQGDKLELVDQNTEGQCNSECKTIERACQEVMGYSDTDVAEY 137

Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
           LY+SKP I SLVNYLCKDL+K+C+TKPPPV K+R PGEPFV K +KEAEME++MRSMEGM
Sbjct: 138 LYSSKPKIDSLVNYLCKDLTKSCTTKPPPVPKDRTPGEPFVAKSSKEAEMEKMMRSMEGM 197

Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
           PGAPGM+MYS++DLMNMKNFGGED D+ ++E+          LGKV REKE K+ DWK R
Sbjct: 198 PGAPGMKMYSRDDLMNMKNFGGEDDDEEEEENIF-------YLGKVLREKERKNNDWKNR 250

Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSG-IKAARSKKPGKTEL 305
           IT G+   G+ LK+ A KVSN V+ WW       +S KP K EL
Sbjct: 251 ITKGVSKAGEALKQHAYKVSNKVRHWWRAKTGGLKSSKPTKQEL 294


>gi|217071942|gb|ACJ84331.1| unknown [Medicago truncatula]
          Length = 301

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 201/300 (67%), Positives = 248/300 (82%), Gaps = 2/300 (0%)

Query: 7   AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
            V  ++ V +++++  P+S+CAKKPV  ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3   GVFWFLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62

Query: 67  SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
           SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E ++EGQCNSE KT+ER
Sbjct: 63  SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEHDSEGQCNSEYKTVER 122

Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
           ACQE+MGYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV K 
Sbjct: 123 ACQEVMGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFVAKS 182

Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFPKNL 245
           +KEAEME++++SMEGMPGAPGM+MYS++DLM  KNFG E+  DD D+++++DE+DFP  L
Sbjct: 183 SKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFPSKL 241

Query: 246 GKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           GK+ + KE++ GDWKQ+I  GI++T   LKK A KVSNH+QRWW G K   SKK  K+EL
Sbjct: 242 GKILKSKENEKGDWKQKIRKGIVDTSTTLKKHATKVSNHIQRWWKGKKTTSSKKNSKSEL 301


>gi|224122586|ref|XP_002330518.1| predicted protein [Populus trichocarpa]
 gi|222872452|gb|EEF09583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 184/283 (65%), Positives = 231/283 (81%), Gaps = 6/283 (2%)

Query: 23  PISYCAK--KPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIA 80
           P+S  AK  KP+  ARK+D+PYI CQVCEK+ASQLY Q QKKQ QISP+KISE+++I IA
Sbjct: 22  PVSESAKTDKPLIVARKDDIPYIKCQVCEKVASQLYRQAQKKQDQISPQKISEYQVIEIA 81

Query: 81  ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
           ENVCNLKKEEADWI+K+DI+EQGDKL L+EQ+ EGQCNSECKTIE+ CQ++M Y DTDVA
Sbjct: 82  ENVCNLKKEEADWIMKLDIMEQGDKLELIEQDTEGQCNSECKTIEQTCQQVMDYVDTDVA 141

Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
           EY+YT+KP I  LV +LCKDL+ +C+ KPPPV KNR+PGEPFVPKP+KE EME+IMRSME
Sbjct: 142 EYIYTTKPQIDLLVKHLCKDLTNSCTKKPPPVPKNRMPGEPFVPKPSKEVEMEKIMRSME 201

Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
           GMPGAPGM+MYS+EDLM M NF    G+++ +++DD++  FP N+GKV REK+    DWK
Sbjct: 202 GMPGAPGMKMYSREDLMGMNNF----GNEDAEDEDDEDEQFPSNMGKVLREKQDMKDDWK 257

Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKT 303
           Q+I +GI +T + LK+  N VS  +Q+WW GI+AA +KK  KT
Sbjct: 258 QKIFEGIKSTAEALKRHTNIVSKQIQKWWKGIRAAPAKKNSKT 300


>gi|357501049|ref|XP_003620813.1| hypothetical protein MTR_6g090810 [Medicago truncatula]
 gi|355495828|gb|AES77031.1| hypothetical protein MTR_6g090810 [Medicago truncatula]
          Length = 286

 Score =  366 bits (939), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 192/300 (64%), Positives = 237/300 (79%), Gaps = 17/300 (5%)

Query: 7   AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
            V   + V +++++  P+S+CAKKPV  ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3   GVFWLLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62

Query: 67  SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
           SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L               +T+ER
Sbjct: 63  SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRL---------------ETVER 107

Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
           ACQE++GYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV K 
Sbjct: 108 ACQEVLGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFVAKS 167

Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFPKNL 245
           +KEAEME++++SMEGMPGAPGM+MYS++DLM  KNFG E+  DD D+++++DE+DFP  L
Sbjct: 168 SKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFPSKL 226

Query: 246 GKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           GKV + KE++ GDWKQ+I  GI++TG  LKK A KVSNH+QRWW G K   SKK  K+EL
Sbjct: 227 GKVLKSKENEKGDWKQKIRKGIVDTGTTLKKHATKVSNHIQRWWKGKKTTSSKKNSKSEL 286


>gi|15233608|ref|NP_194681.1| uncharacterized protein [Arabidopsis thaliana]
 gi|5123546|emb|CAB45312.1| putative protein [Arabidopsis thaliana]
 gi|7269851|emb|CAB79710.1| putative protein [Arabidopsis thaliana]
 gi|332660239|gb|AEE85639.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 306

 Score =  361 bits (926), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 181/270 (67%), Positives = 225/270 (83%), Gaps = 1/270 (0%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
           P+S  AKKP +  RKEDVPYI CQVCEKL+S+L+  V++KQ QISPKKISE+EII IAEN
Sbjct: 23  PVSDAAKKPSSTPRKEDVPYIKCQVCEKLSSRLHQLVKEKQQQISPKKISEYEIIEIAEN 82

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
           VCNLKKEEADW+LKIDIVE+GD LVLVEQ+ EG CNS+CKTIE ACQ+++GYSDTDVAEY
Sbjct: 83  VCNLKKEEADWMLKIDIVEKGDNLVLVEQQEEGMCNSKCKTIENACQKVIGYSDTDVAEY 142

Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
           +Y SKPD+ SLVN+LCKDL+ ACS KPPPV K+R+PGEPFV KP+K+AEM++I+RSM+GM
Sbjct: 143 IYKSKPDLVSLVNHLCKDLTDACSKKPPPVPKDRVPGEPFVAKPSKDAEMDKILRSMQGM 202

Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
           PGAPGM++YS+ED+    N G ED D +DDED++++  FPKNLGKV +EKESK+ + K+ 
Sbjct: 203 PGAPGMKVYSREDI-EKGNIGNEDDDGDDDEDEEEDDKFPKNLGKVLKEKESKTEELKKT 261

Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
           IT      G+ LK+ A KVSN V+RWW G+
Sbjct: 262 ITKEFKKKGEALKRHAQKVSNRVRRWWKGL 291


>gi|21553422|gb|AAM62515.1| unknown [Arabidopsis thaliana]
          Length = 303

 Score =  360 bits (925), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 181/270 (67%), Positives = 225/270 (83%), Gaps = 1/270 (0%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
           P+S  AKKP +  RKEDVPYI CQVCEKL+S+L+  V++KQ QISPKKISE+EII IAEN
Sbjct: 20  PVSDAAKKPSSTPRKEDVPYIKCQVCEKLSSRLHQLVKEKQQQISPKKISEYEIIEIAEN 79

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
           VCNLKKEEADW+LKIDIVE+GD LVLVEQ+ EG CNS+CKTIE ACQ+++GYSDTDVAEY
Sbjct: 80  VCNLKKEEADWMLKIDIVEKGDNLVLVEQQEEGMCNSKCKTIENACQKVIGYSDTDVAEY 139

Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
           +Y SKPD+ SLVN+LCKDL+ ACS KPPPV K+R+PGEPFV KP+K+AEM++I+RSM+GM
Sbjct: 140 IYKSKPDLVSLVNHLCKDLTDACSKKPPPVPKDRVPGEPFVAKPSKDAEMDKILRSMQGM 199

Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
           PGAPGM++YS+ED+    N G ED D +DDED++++  FPKNLGKV +EKESK+ + K+ 
Sbjct: 200 PGAPGMKVYSREDI-EKGNIGNEDDDGDDDEDEEEDDKFPKNLGKVLKEKESKTEELKKT 258

Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
           IT      G+ LK+ A KVSN V+RWW G+
Sbjct: 259 ITKEFKKKGEALKRHAQKVSNRVRRWWKGL 288


>gi|297799038|ref|XP_002867403.1| hypothetical protein ARALYDRAFT_491806 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313239|gb|EFH43662.1| hypothetical protein ARALYDRAFT_491806 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 307

 Score =  357 bits (917), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 181/272 (66%), Positives = 224/272 (82%), Gaps = 1/272 (0%)

Query: 23  PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
           P+S  AKKP +  RKEDVPYI CQVCEKLAS+L+  V++KQ QISPKKISE+EII IAEN
Sbjct: 23  PVSDGAKKPSSAPRKEDVPYIKCQVCEKLASRLHQLVKEKQQQISPKKISEYEIIEIAEN 82

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
           VCNLKKEEADW+LKIDIVE+GDKLVLVEQ  EG CNS+CKTIE ACQ+++GYSDTDVAE+
Sbjct: 83  VCNLKKEEADWMLKIDIVEKGDKLVLVEQAEEGMCNSKCKTIENACQKVIGYSDTDVAEF 142

Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
           +Y SKPD+ SLVN+LC DL+ ACS KPPPV K+R+PGEPFV KP+K+AEM++I+RSM+GM
Sbjct: 143 IYKSKPDLVSLVNHLCNDLTDACSKKPPPVPKDRVPGEPFVAKPSKDAEMDKILRSMQGM 202

Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
           PGAPGM++YS+ED+    N G ED D +DDED++++  FPKNLGKV +EKESK+ + K+ 
Sbjct: 203 PGAPGMKVYSREDI-EKGNIGNEDEDGDDDEDEEEDEKFPKNLGKVLKEKESKTEELKKT 261

Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGIKA 294
           I       G+VLK  A KVSN V+RWW G++ 
Sbjct: 262 IAKEFKKKGEVLKIHAQKVSNRVRRWWKGLRT 293


>gi|357501201|ref|XP_003620889.1| hypothetical protein MTR_6g091890 [Medicago truncatula]
 gi|357501239|ref|XP_003620908.1| hypothetical protein MTR_6g092180 [Medicago truncatula]
 gi|355495904|gb|AES77107.1| hypothetical protein MTR_6g091890 [Medicago truncatula]
 gi|355495923|gb|AES77126.1| hypothetical protein MTR_6g092180 [Medicago truncatula]
          Length = 304

 Score =  351 bits (901), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 186/303 (61%), Positives = 239/303 (78%), Gaps = 5/303 (1%)

Query: 7   AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
            V   + V +++++  P+S+CAKKPV  ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3   GVFWLLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62

Query: 67  SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKL-VLVEQEAEGQCNSECKTIE 125
           SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L  ++ +++    + +   + 
Sbjct: 63  SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLENMILKDSAILNDRQLSVLV 122

Query: 126 RA--CQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFV 183
           R+    +++GYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV
Sbjct: 123 RSNLVLQVLGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFV 182

Query: 184 PKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFP 242
            K +KEAEME++++SMEGMPGAPGM+MYS++DLM  KNFG E+  DD D+++++DE+DFP
Sbjct: 183 AKSSKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFP 241

Query: 243 KNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGK 302
             LGKV + KE++ GDWKQ+I  GI++TG  LKK A KVSNH+QRWW G K   SKK  K
Sbjct: 242 SKLGKVLKSKENEKGDWKQKIRKGIVDTGTTLKKHATKVSNHIQRWWKGKKTTSSKKNSK 301

Query: 303 TEL 305
           +EL
Sbjct: 302 SEL 304


>gi|255546197|ref|XP_002514158.1| conserved hypothetical protein [Ricinus communis]
 gi|223546614|gb|EEF48112.1| conserved hypothetical protein [Ricinus communis]
          Length = 243

 Score =  333 bits (853), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 152/205 (74%), Positives = 180/205 (87%)

Query: 20  TCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGI 79
            C P+S  AKKPV  AR ED+PYI CQVCEKLA +LY QVQKKQA+I PKKI+E++II I
Sbjct: 19  ACVPVSLSAKKPVLSARTEDIPYIKCQVCEKLAKELYQQVQKKQAEILPKKITEYQIIEI 78

Query: 80  AENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDV 139
           AENVCNLKK+EADWI+KIDIVE+GDKL LVEQ+ EGQCNSECKT+E+AC+E+MGY+DTD+
Sbjct: 79  AENVCNLKKQEADWIMKIDIVEKGDKLELVEQDVEGQCNSECKTVEKACEEVMGYADTDI 138

Query: 140 AEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSM 199
           AEY+++SKP+I SLVNY CKDL+ AC  KPPPV K+R PGEPF PKP+K+AEME+IMRSM
Sbjct: 139 AEYIFSSKPNIDSLVNYFCKDLTNACVKKPPPVPKDRSPGEPFEPKPSKDAEMEKIMRSM 198

Query: 200 EGMPGAPGMQMYSKEDLMNMKNFGG 224
           + MPGAP MQMYS+EDLMNMKNFG 
Sbjct: 199 QDMPGAPNMQMYSREDLMNMKNFGN 223


>gi|212274637|ref|NP_001130151.1| uncharacterized protein LOC100191245 precursor [Zea mays]
 gi|194688408|gb|ACF78288.1| unknown [Zea mays]
 gi|413953188|gb|AFW85837.1| hypothetical protein ZEAMMB73_915175 [Zea mays]
          Length = 303

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 162/285 (56%), Positives = 213/285 (74%), Gaps = 4/285 (1%)

Query: 22  APISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIA 80
           A ++   KKP   AR+ED+PYI CQVCE++A ++  QV KK   + P KK+ E EII +A
Sbjct: 22  AEVAAGQKKPATAARREDIPYIRCQVCERIAREISAQVAKKHQALPPSKKVPEIEIIEVA 81

Query: 81  ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
           ENVCNLKK+EADW+L+IDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE++GY+DTDVA
Sbjct: 82  ENVCNLKKQEADWMLRIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVIGYADTDVA 141

Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
           E++Y + P +  L+ +LCKDLSKAC+  PPPV K+R+PGEPF  KP+K+AEME+I+RSME
Sbjct: 142 EFVYKNNPSVDQLMKFLCKDLSKACAKDPPPVPKDRVPGEPFARKPSKDAEMEKILRSME 201

Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
           GMPGAP M+MYS++DLM   NFG E  DD++++D+D++ +FPKNLGK  + K S+  D K
Sbjct: 202 GMPGAPSMKMYSRDDLMK-NNFGTE--DDDEEDDEDEDDNFPKNLGKALKNKGSQKKDLK 258

Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           Q++     +T   LK   NKVS  V++WW G K        KTEL
Sbjct: 259 QQVVQQFKDTSKKLKGHVNKVSGMVKKWWKGAKKPAKSSKSKTEL 303


>gi|326521176|dbj|BAJ96791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 156/262 (59%), Positives = 205/262 (78%), Gaps = 2/262 (0%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIAENVCNLK 87
           KKP   AR+EDVPYI CQVCE++A ++  QV  KQ  + P KK+ E EII IAENVCNLK
Sbjct: 32  KKPATAARREDVPYIRCQVCERIAREISAQVAAKQQALPPSKKVPEIEIIDIAENVCNLK 91

Query: 88  KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
           K+EADW+L+IDIVE+GD+L LV+Q+ EG CNSECKTIERACQE+MGY+DTDVAE++YT+K
Sbjct: 92  KQEADWMLRIDIVEKGDRLELVDQDEEGHCNSECKTIERACQEVMGYADTDVAEFVYTNK 151

Query: 148 PDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
           P +  L+ ++CKD++KAC+  PPPV K+R+PGEPF  KP+K+AEME+IMRSMEGMPGAP 
Sbjct: 152 PSVDQLIKFVCKDITKACAANPPPVPKDRVPGEPFATKPSKDAEMEKIMRSMEGMPGAPN 211

Query: 208 MQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGI 267
           M+MYS++DLM   NFG E  +D++DED+D++ + PKNLG + + K  +  D KQ++   I
Sbjct: 212 MKMYSRDDLMK-NNFGVEGDEDDEDEDEDEDYNLPKNLGNILKNKGPQKKDLKQQVVKQI 270

Query: 268 LNTGDVLKKQANKVSNHVQRWW 289
            +TG  LK   +KVS  V+ WW
Sbjct: 271 KDTGKKLKGHVSKVSKVVKNWW 292


>gi|195627550|gb|ACG35605.1| hypothetical protein [Zea mays]
          Length = 303

 Score =  317 bits (813), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 157/271 (57%), Positives = 208/271 (76%), Gaps = 4/271 (1%)

Query: 22  APISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIA 80
           A ++   KKP   AR+ED+PYI CQVCE++A ++  QV KK   + P KK+ E EII +A
Sbjct: 22  AEVAAGQKKPATAARREDIPYIRCQVCERIAREISAQVAKKHQALPPSKKVPEIEIIEVA 81

Query: 81  ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
           ENVCNLKK+EADW+L+IDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE++GY+DTDVA
Sbjct: 82  ENVCNLKKQEADWMLRIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVIGYADTDVA 141

Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
           E++Y + P +  L+ +LCKDLSKAC+  PPPV K+R+PGEPF  KP+K+AEME+I+RSME
Sbjct: 142 EFVYKNNPSVDQLMKFLCKDLSKACAKDPPPVPKDRVPGEPFARKPSKDAEMEKILRSME 201

Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
           GMPGAP M+MYS++DLM   NFG E  DD++++D+D++ +FPKNLGK  + K S+  D K
Sbjct: 202 GMPGAPSMKMYSRDDLMK-NNFGTE--DDDEEDDEDEDDNFPKNLGKALKNKGSQKKDLK 258

Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSG 291
           Q++     +T   LK   NKVS  V++WW G
Sbjct: 259 QQVVQQFKDTSKKLKGHVNKVSGMVKKWWKG 289


>gi|116794067|gb|ABK26994.1| unknown [Picea sitchensis]
          Length = 314

 Score =  300 bits (767), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 158/301 (52%), Positives = 210/301 (69%), Gaps = 18/301 (5%)

Query: 1   MARIKAAVLVYV-AVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQV 59
           MAR      ++V AVI   +  AP     KKP+A AR+ED+PYI C VCE++  QL  QV
Sbjct: 3   MARFLGLGFLFVFAVIAGQAMAAP-----KKPIAVARREDIPYIKCGVCEQIVKQLDRQV 57

Query: 60  QKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNS 119
              + QI+PK++SE++II +AEN+CNLKKEE DW+L +DIVE GDKL LVEQE EG CN+
Sbjct: 58  TSIRNQIAPKRLSEYQIIEVAENICNLKKEEGDWLLHLDIVEDGDKLKLVEQEQEGDCNA 117

Query: 120 ECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPG 179
           ECKTIER CQE++GY DTDVAE+L+ S+P   SL+ +LC DLS+ACS KPPPV K+R PG
Sbjct: 118 ECKTIERTCQEVIGYHDTDVAEFLFKSEPQSSSLLKFLCNDLSRACSVKPPPVPKDRTPG 177

Query: 180 EPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDES 239
           EPFVPK +KEAEME+IMRSM GMPGA  M+MYSK+DLMNM NFG  +G+D DDE+D+D+ 
Sbjct: 178 EPFVPKSSKEAEMEKIMRSMSGMPGASNMKMYSKDDLMNMPNFGS-NGEDEDDEEDEDDD 236

Query: 240 DFPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQA-----------NKVSNHVQRW 288
                +    R   S   DWK+ + D + +T   +K +            +++S+ +++W
Sbjct: 237 YTSSRMRNTKRPLVSVESDWKKSLLDRVQSTFSNVKDRTQHAVLGAKWSLDRISDKIRKW 296

Query: 289 W 289
           +
Sbjct: 297 Y 297


>gi|357501233|ref|XP_003620905.1| hypothetical protein MTR_6g092150 [Medicago truncatula]
 gi|355495920|gb|AES77123.1| hypothetical protein MTR_6g092150 [Medicago truncatula]
          Length = 298

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 164/295 (55%), Positives = 207/295 (70%), Gaps = 38/295 (12%)

Query: 12  VAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKI 71
           + V + +++  P+S+CAKKPV  ARKEDVPYI CQVCE LA QLY QVQ K+A+ISPKKI
Sbjct: 41  LVVPVAIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEISPKKI 100

Query: 72  SEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ-EAEGQCNSECKTIERACQE 130
           SE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E+ ++EGQCNSECKT+ERACQE
Sbjct: 101 SEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEEHDSEGQCNSECKTVERACQE 160

Query: 131 IMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEA 190
           +MGYSDTDVAEYLY+ KPDI SL NYLCKDLSK+C+T      K R PGEPFV K + EA
Sbjct: 161 VMGYSDTDVAEYLYSCKPDIDSLTNYLCKDLSKSCNT------KTRTPGEPFVAKSSNEA 214

Query: 191 EMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFR 250
           EME++++SME                    NFG E+ DD++DED+D++            
Sbjct: 215 EMEKLLKSME--------------------NFGAENEDDDEDEDEDEDE----------- 243

Query: 251 EKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
           +++    D+  ++    ++T   LKK A KVSNH+Q+WW G KA  SKK  K EL
Sbjct: 244 DEDEDEADFPSKLELWSVDTSTALKKHATKVSNHIQQWWKGKKAISSKKNPKPEL 298


>gi|357110756|ref|XP_003557182.1| PREDICTED: uncharacterized protein LOC100845828 [Brachypodium
           distachyon]
          Length = 304

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 160/274 (58%), Positives = 212/274 (77%), Gaps = 6/274 (2%)

Query: 33  AGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIAENVCNLKKEEA 91
             AR+EDVPYI CQVCE++A ++  QV +KQ  +SP KK+ E EII IAENVCNLKK+EA
Sbjct: 36  TAARREDVPYIRCQVCERIAREISAQVARKQQALSPSKKVPEIEIIEIAENVCNLKKQEA 95

Query: 92  DWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIK 151
           DW+L+IDIVE+GD+L LV+Q+ EG CNSECKTIERACQE+MGY+DTDVAE++YT+KP + 
Sbjct: 96  DWMLRIDIVEKGDRLELVDQDEEGHCNSECKTIERACQEVMGYADTDVAEFVYTNKPSLD 155

Query: 152 SLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMY 211
            LV ++CKD++KAC+  PP V K+R+PGEPF  KP+K+AEME+IMRSM  +PGAP M+MY
Sbjct: 156 QLVKFVCKDITKACAVDPPSVPKDRVPGEPFAAKPSKDAEMEKIMRSMGDIPGAPSMKMY 215

Query: 212 SKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGILNTG 271
           S++DLM   NFG   GDD++D+D+DDE +FPKNLGK  ++K  +  D K+++   I +TG
Sbjct: 216 SRDDLMQ-NNFGA-GGDDDEDDDEDDEDNFPKNLGKALKDKPPQKNDIKEQVVKQIKDTG 273

Query: 272 DVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
             LK   +KVS  V++WW   K  +S   GK+EL
Sbjct: 274 KKLKGHVSKVSKVVKKWWQAKKPTKS---GKSEL 304


>gi|115466448|ref|NP_001056823.1| Os06g0150400 [Oryza sativa Japonica Group]
 gi|55296609|dbj|BAD69311.1| unknown protein [Oryza sativa Japonica Group]
 gi|55297262|dbj|BAD69047.1| unknown protein [Oryza sativa Japonica Group]
 gi|113594863|dbj|BAF18737.1| Os06g0150400 [Oryza sativa Japonica Group]
          Length = 306

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 163/264 (61%), Positives = 206/264 (78%), Gaps = 4/264 (1%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEIIGIAENVCNLK 87
           KK    ARKED+PYI CQVCE++A ++  QV KKQ  + + KK+ E EII IAENVCNLK
Sbjct: 34  KKVATAARKEDIPYIRCQVCERIAREISAQVAKKQQALPATKKVPEIEIIEIAENVCNLK 93

Query: 88  KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
           K+EADW+LKIDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE+MGY+DTDVAE++Y  K
Sbjct: 94  KQEADWMLKIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVMGYADTDVAEFVYKKK 153

Query: 148 PDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
           P    LV +LCKDLS+AC   PPPV K+R+PGEPF  KP+K+AEM+RI++SMEG+PGAP 
Sbjct: 154 PSADQLVKFLCKDLSEACVVDPPPVPKDRVPGEPFAAKPSKDAEMDRILKSMEGIPGAPS 213

Query: 208 MQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGI 267
           M+MYS++DLM   NFG +   D+DD+D+D++ DFPKNLGKVF++K S   D KQ++   I
Sbjct: 214 MKMYSRDDLMK-NNFGVD--GDDDDDDEDEDDDFPKNLGKVFKDKGSPKKDLKQQVVKQI 270

Query: 268 LNTGDVLKKQANKVSNHVQRWWSG 291
            +TG  LK   NKVS  V++WW G
Sbjct: 271 KDTGKKLKGHVNKVSKVVKKWWQG 294


>gi|218197595|gb|EEC80022.1| hypothetical protein OsI_21695 [Oryza sativa Indica Group]
          Length = 306

 Score =  287 bits (735), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 162/264 (61%), Positives = 204/264 (77%), Gaps = 4/264 (1%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEIIGIAENVCNLK 87
           KK    ARKED+PYI CQVCE+ A ++  QV KKQ  + + KK+ E EII IAENVCNLK
Sbjct: 34  KKVATAARKEDIPYIRCQVCERTAREISAQVAKKQQALPATKKVPEIEIIEIAENVCNLK 93

Query: 88  KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
           K+EADW+LKIDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE+MGY+DTDVAE++Y  K
Sbjct: 94  KQEADWMLKIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVMGYADTDVAEFVYKKK 153

Query: 148 PDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
           P    LV +LCKDLS+AC   PPPV K+R+PGEPF  KP+K+AEM+RI++SMEG+PGAP 
Sbjct: 154 PSADQLVKFLCKDLSEACVVDPPPVPKDRVPGEPFAAKPSKDAEMDRILKSMEGIPGAPS 213

Query: 208 MQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGI 267
           M+MYS++DLM   NFG +    +DD+D+D++ DFPKNLGKVF++K S   D KQ++   I
Sbjct: 214 MKMYSRDDLMK-NNFGVDGD--DDDDDEDEDDDFPKNLGKVFKDKGSPKKDLKQQVVKQI 270

Query: 268 LNTGDVLKKQANKVSNHVQRWWSG 291
            +TG  LK   NKVS  V++WW G
Sbjct: 271 KDTGKKLKGHVNKVSKVVKKWWQG 294


>gi|357501043|ref|XP_003620810.1| hypothetical protein MTR_6g090780 [Medicago truncatula]
 gi|355495825|gb|AES77028.1| hypothetical protein MTR_6g090780 [Medicago truncatula]
          Length = 293

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 166/297 (55%), Positives = 198/297 (66%), Gaps = 57/297 (19%)

Query: 17  LLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEI 76
            LS  AP     KKPV  ARKEDVPYI CQVCE LA QLY QVQ K+A+ISPKKISE++I
Sbjct: 46  FLSRTAP-----KKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEISPKKISEYQI 100

Query: 77  IGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ-EAEGQCNSECKTIERACQEIMGYS 135
           I IAENVCNLKK EADWIL+IDIVE+ D+L L E+ ++EGQCNSECKT+ERACQE+MGYS
Sbjct: 101 IEIAENVCNLKKVEADWILRIDIVEKADRLELEEEHDSEGQCNSECKTVERACQEVMGYS 160

Query: 136 DTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERI 195
           DTDVAEYLY+ KPDI SL NYLCKDLSK+C+T      K R PGEPFV K + EAEME++
Sbjct: 161 DTDVAEYLYSCKPDIDSLTNYLCKDLSKSCNT------KTRTPGEPFVAKSSNEAEMEKL 214

Query: 196 MRSMEGMPGAPGMQMYSKEDLMNMKNFGG-------EDGDDNDDEDDDDESDFPKNLGKV 248
           ++SME                    NFG        ++ +D D+++D+DE+DFP      
Sbjct: 215 LKSME--------------------NFGAENEDDDEDEDEDEDEDEDEDEADFP------ 248

Query: 249 FREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
                SK   W        ++T   LKK A KVSNH+Q+WW G KA  SKK  K EL
Sbjct: 249 -----SKLELWS-------VDTSTALKKHATKVSNHIQQWWKGKKAISSKKNPKPEL 293


>gi|168003107|ref|XP_001754254.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694356|gb|EDQ80704.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 306

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 139/274 (50%), Positives = 179/274 (65%), Gaps = 5/274 (1%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKK 88
           KK VA ARKED+P I C VCE +A QL  QV+ K+ + +P+KI+E EII +AEN+CN+KK
Sbjct: 22  KKVVATARKEDIPLIKCSVCEVIAKQLVRQVKDKREKAAPRKITELEIIDVAENICNMKK 81

Query: 89  EEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKP 148
           EE+DWILK+DIVE+GDKL LVEQ+ EG CN+ECKTIERAC+E+MG  DTD+AEYLY +  
Sbjct: 82  EESDWILKLDIVEKGDKLQLVEQQEEGVCNTECKTIERACREVMGDHDTDLAEYLYKAGA 141

Query: 149 DIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGM 208
               L   LCKDLSKAC  K P V K+R+PGE F PK  KEAEME++MRS+  +PGAP M
Sbjct: 142 TRAGLTKLLCKDLSKACVGKTPRVPKDRVPGEAFTPKGGKEAEMEKLMRSLNDVPGAPSM 201

Query: 209 QMYSKEDLM-NMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKE-SKSGDWKQRITDG 266
           +M+S+E+L  N    GG  G D+ D +D++E +         +    SK  +     T  
Sbjct: 202 KMFSREELQKNPGLMGGGRGSDDPDAEDEEEDEDEDEGETDAKTNSVSKLSNEVGARTQK 261

Query: 267 ILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKP 300
            + T     K  NK    +  WW+G K AR   P
Sbjct: 262 AIRTAS---KSVNKAMRTISSWWNGKKPARKAAP 292


>gi|357452911|ref|XP_003596732.1| hypothetical protein MTR_2g084730 [Medicago truncatula]
 gi|355485780|gb|AES66983.1| hypothetical protein MTR_2g084730 [Medicago truncatula]
          Length = 223

 Score =  241 bits (614), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 139/253 (54%), Positives = 173/253 (68%), Gaps = 36/253 (14%)

Query: 54  QLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ-E 112
           + Y Q   K+  +    ISE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E+ +
Sbjct: 6   KFYFQ---KRTSVIVGDISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEEHD 62

Query: 113 AEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPV 172
           +EGQCNSECKT+ERACQE+MGYSDTDVAEYLY+ KPDI SL NYLCKDLSK+C+T     
Sbjct: 63  SEGQCNSECKTVERACQEVMGYSDTDVAEYLYSCKPDIDSLTNYLCKDLSKSCNT----- 117

Query: 173 LKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDD 232
            K R PGEPFV K + EAEME++++SMEGMPG P M+MYS        N   ++ +D D+
Sbjct: 118 -KTRTPGEPFVAKSSNEAEMEKLLKSMEGMPGVPDMKMYS--------NDDEDEDEDEDE 168

Query: 233 EDDDDESDFPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
           ++D+DE+DFP           SK   W        ++T   LKK A KVSNH+Q+WW G 
Sbjct: 169 DEDEDEADFP-----------SKLELWS-------VDTSTALKKHATKVSNHIQQWWKGK 210

Query: 293 KAARSKKPGKTEL 305
           KA  SKK  K EL
Sbjct: 211 KAISSKKNPKPEL 223


>gi|222634964|gb|EEE65096.1| hypothetical protein OsJ_20142 [Oryza sativa Japonica Group]
          Length = 290

 Score =  234 bits (596), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 139/271 (51%), Positives = 180/271 (66%), Gaps = 34/271 (12%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEI-------IGIA 80
           KK    ARKED+PYI CQVCE++A ++  QV KKQ  + + KK+  F           I 
Sbjct: 34  KKVATAARKEDIPYIRCQVCERIAREISAQVAKKQQALPATKKVLNFRFGSRKSGFTEIP 93

Query: 81  ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
           EN CNLK +EADW+LKIDIVE+GDKL                       E+MGY+DTDVA
Sbjct: 94  ENGCNLKNQEADWMLKIDIVEKGDKL-----------------------EVMGYADTDVA 130

Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
           E++Y  KP    LV +LCKDLS+AC   PPPV K+R+PGEPF  KP+K+AEM+RI++SME
Sbjct: 131 EFVYKKKPSADQLVKFLCKDLSEACVVDPPPVPKDRVPGEPFAAKPSKDAEMDRILKSME 190

Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
           G+PGAP M+MYS++DLM   NFG +   D+DD+D+D++ DFPKNLGKVF++K S   D K
Sbjct: 191 GIPGAPSMKMYSRDDLMK-NNFGVD--GDDDDDDEDEDDDFPKNLGKVFKDKGSPKKDLK 247

Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSG 291
           Q++   I +TG  LK   NKVS  V++WW G
Sbjct: 248 QQVVKQIKDTGKKLKGHVNKVSKVVKKWWQG 278


>gi|302759156|ref|XP_002963001.1| hypothetical protein SELMODRAFT_404499 [Selaginella moellendorffii]
 gi|300169862|gb|EFJ36464.1| hypothetical protein SELMODRAFT_404499 [Selaginella moellendorffii]
          Length = 312

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 125/289 (43%), Positives = 193/289 (66%), Gaps = 20/289 (6%)

Query: 35  ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
           ARKE++ Y+ C VCE+++ QL+ +V +K++    KK+SEFEII +AEN+CN+KK E++W+
Sbjct: 26  ARKENIKYLGCAVCEQISKQLFEKVSEKKS--IKKKLSEFEIIELAENICNVKKRESEWM 83

Query: 95  LKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KS 152
             +DIV +G+KL LVEQ  EG+CN++C+TIER CQE++G  DTD+AE+++T   D+  ++
Sbjct: 84  FFLDIVREGNKLKLVEQPEEGECNTKCRTIERTCQEVIGDHDTDIAEFIHTHLRDLSEEA 143

Query: 153 LVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYS 212
           +   LCK+++K+CS+K P + K    GEPF PKP K+A+M R+MRSM  MPG  GM+MYS
Sbjct: 144 IFKSLCKEVTKSCSSKLPALPKTLDLGEPFTPKPTKDADMARLMRSMGDMPGGQGMKMYS 203

Query: 213 KEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFRE------------KESKSGDWK 260
           K+DLMNM N+   D DD+DD+DD DE    ++L K+ +             K        
Sbjct: 204 KDDLMNMNNYPAGDPDDDDDDDDTDEGSKLQSLLKISKTPQKQNLKNPANPKNQPKRSML 263

Query: 261 QRITDGILNTG-DVL---KKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
            ++ D + + G D L   +K  +KV+ +V+RWW   K+++ ++  K +L
Sbjct: 264 TQVLDHVQSIGRDTLSSARKTKDKVTKYVKRWWRRAKSSKQRQAQKVDL 312


>gi|302797140|ref|XP_002980331.1| hypothetical protein SELMODRAFT_420060 [Selaginella moellendorffii]
 gi|300151947|gb|EFJ18591.1| hypothetical protein SELMODRAFT_420060 [Selaginella moellendorffii]
          Length = 197

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 86/175 (49%), Positives = 131/175 (74%), Gaps = 5/175 (2%)

Query: 35  ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
           ARKE++ Y+ C VCE+++ QL+ +V +K++    KK+SEFEII +AEN+CN+KK E++W+
Sbjct: 26  ARKENIKYLRCAVCEQISKQLFEKVSEKKS--IKKKLSEFEIIELAENICNVKKRESEWM 83

Query: 95  LKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KS 152
             +DIV +G+KL LVEQ  EG+CN++C+TIER CQE++G  DTD+AE+++T   D+  ++
Sbjct: 84  FFLDIVREGNKLKLVEQPEEGECNTKCRTIERTCQEVIGDHDTDIAEFIHTHLRDLSEEA 143

Query: 153 LVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
           +   LCK+++K+CS+K P + K    GEPF PKP K+A+M R+MRSM G+   P 
Sbjct: 144 IFKSLCKEVTKSCSSKLPALPKTLDLGEPFTPKPTKDADMARLMRSM-GVSFRPS 197


>gi|156382000|ref|XP_001632343.1| predicted protein [Nematostella vectensis]
 gi|156219397|gb|EDO40280.1| predicted protein [Nematostella vectensis]
          Length = 191

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 71/178 (39%), Positives = 109/178 (61%)

Query: 39  DVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKID 98
           +VP+I C+ C+K    LY +  + + +   KK+ E +II + E  CN  KEE +WI K D
Sbjct: 12  EVPHIRCETCQKAVKHLYRRTSEMREKSPRKKVEEDKIIELVEASCNPDKEEGEWISKFD 71

Query: 99  IVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLC 158
           IV++GD L L E+E  G+C  EC+TI +AC+E +   DTD+AE ++ +K  +  L N +C
Sbjct: 72  IVKKGDSLQLSEKEVFGRCKQECRTIAKACEETISEVDTDLAEMIWQNKLSLSKLTNKIC 131

Query: 159 KDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDL 216
             +SKAC+ K   +  ++   E F      E   ++IM+ M G+PG PGM+MYS+ED+
Sbjct: 132 HSMSKACTKKAQNLKIDKRFDEEFEAMSEDERNADKIMKQMRGVPGMPGMEMYSREDI 189


>gi|384249540|gb|EIE23021.1| hypothetical protein COCSUDRAFT_6458, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 188

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 73/171 (42%), Positives = 105/171 (61%), Gaps = 1/171 (0%)

Query: 30  KPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIAENVCNLKK 88
           +P A   K D+PYI C VCE +    Y QV+  +  + P KK+ E  I+ + E + +  K
Sbjct: 17  EPDALPVKSDLPYIRCGVCEAVGKNAYRQVKSARDALKPGKKLEELVILEMVEKMGDPAK 76

Query: 89  EEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKP 148
           +E +WI KID+VE G KL LVE E  G+C  ECKT++RA QE++   DTD+AE L+  K 
Sbjct: 77  DEGEWIAKIDLVEDGKKLKLVEHEELGKCREECKTVQRAAQEVLSDHDTDIAEKLWQGKM 136

Query: 149 DIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSM 199
                 N+LC D++ AC  KPPP+ K+R PG+ F     +EA+M+++M  M
Sbjct: 137 SRSQFSNWLCYDVTGACKQKPPPLPKDRAPGQAFQVIDPEEAQMQKMMAGM 187


>gi|308799811|ref|XP_003074687.1| unnamed protein product [Ostreococcus tauri]
 gi|116000857|emb|CAL50537.1| unnamed protein product [Ostreococcus tauri]
          Length = 312

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 86/225 (38%), Positives = 134/225 (59%), Gaps = 11/225 (4%)

Query: 4   IKAAVLVYVAVILLLSTCAPISYC-------AKKPVAGARKEDVPYITCQVCEKLASQLY 56
           IKA   V VA++ + +  A            A K +  A   DV YI C+ CE L +   
Sbjct: 3   IKATTKVAVALLAISAVSANGVEAKKGGKKDASKTIPRAVASDVKYIECETCELLVNAAV 62

Query: 57  HQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAE-- 114
            + + + A++  KK+ E ++I   E +C++  +E DWI+  D+ E+G KL LV+  AE  
Sbjct: 63  KEYKTQYAEVGSKKLRETDVIEKMEKMCDVASKEGDWIITKDLQEKGSKLRLVDMGAENY 122

Query: 115 GQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KSLVNYLCKDLSKACSTKPPPV 172
           G+C +ECKTI +AC++I+G  DTDV EYL+ S  D+  K+   +LC++ + +C+  PPP+
Sbjct: 123 GECRTECKTIVKACEKILGPRDTDVGEYLFVSDGDVDAKAFRKWLCEEETTSCAKAPPPL 182

Query: 173 LKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLM 217
            K+R  GE F  +  K+ EMER+M  M+GMPG  G QM+S++ +M
Sbjct: 183 PKDRPKGEAFKKRDKKDVEMERLMAGMKGMPGMGGAQMFSRDQMM 227


>gi|449671666|ref|XP_002170798.2| PREDICTED: uncharacterized protein LOC100207705 [Hydra
           magnipapillata]
          Length = 270

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 130/215 (60%), Gaps = 8/215 (3%)

Query: 38  EDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKI 97
           ED+P+I C VC+K A  L+  ++    Q S KK+ E +++ I E  C+      DWI ++
Sbjct: 33  EDLPFIKCDVCQKAAKVLFKTIEN---QRSEKKLDEDDVLSIVEKSCDTDVLYGDWISRL 89

Query: 98  DIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYL 157
           DIVE+ + L +VE   EG+C+ ECKTI R+C++I+G  DTD+ E L+ ++  + + +N++
Sbjct: 90  DIVEKENDLKVVEHPQEGKCDRECKTISRSCEDIIGDIDTDIGELLWKNEMKLATFINHV 149

Query: 158 CKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLM 217
           C  L+ +C  K   V K       FV    KE +   +MR+M+ +PG PGM+MYS++DL 
Sbjct: 150 CYKLTNSCKAKKKYV-KGSHKDYKFVEMSEKEKQARDLMRNMKSVPGMPGMEMYSQDDLQ 208

Query: 218 NMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREK 252
           NM+    + G  N+DE+  D S+  +N  +V+R K
Sbjct: 209 NMRE---QMGVRNEDENQQD-SEQSENYDEVYRTK 239


>gi|145341176|ref|XP_001415689.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575912|gb|ABO93981.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 233

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/214 (38%), Positives = 127/214 (59%), Gaps = 4/214 (1%)

Query: 5   KAAVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQA 64
           + AV    A I +    A       KP+A A K DV +I C+VCE LA  +  +    + 
Sbjct: 1   RRAVTTIAASIAIAGLVACADAKPSKPIARAVKSDVKFIKCEVCEHLADAVTQEYAALRE 60

Query: 65  QISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVE--QEAEGQCNSECK 122
           +   KK++E  +I   E +C+   +E DWI+  D+ E+G +L +V+  +E  G+C  ECK
Sbjct: 61  ETPEKKLTETMVIEKIEKMCDPSAKEGDWIITKDLQEKGSRLRVVDMGKENYGECGKECK 120

Query: 123 TIERACQEIMGYSDTDVAEYLYTSKP-DIK-SLVNYLCKDLSKACSTKPPPVLKNRIPGE 180
           TI +AC++I+G  DTDV E+LYTS   D++ +L  +LC++ + +C  K PP+  +R  GE
Sbjct: 121 TIVKACEDILGPRDTDVGEFLYTSNAGDLRGALREWLCEEETNSCGGKIPPLPSDRPKGE 180

Query: 181 PFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKE 214
            F  +  K+ EMER+M  M+GMPG  G QM+S++
Sbjct: 181 KFKKRDKKDVEMERLMAGMKGMPGMGGAQMFSRD 214


>gi|125554107|gb|EAY99712.1| hypothetical protein OsI_21693 [Oryza sativa Indica Group]
          Length = 163

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 75/138 (54%), Positives = 92/138 (66%), Gaps = 24/138 (17%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEIIGIAENVCNLK 87
           KK    ARKED+PYI CQVCE++A ++  QV KKQ  + + KK+ E EII IAENVCNLK
Sbjct: 34  KKVATAARKEDIPYIRCQVCERIAREISAQVAKKQQALPATKKVPEIEIIEIAENVCNLK 93

Query: 88  KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
           K+EADW+LKIDIVE+GDKL                       E+MGY+DTDVAE++Y  K
Sbjct: 94  KQEADWMLKIDIVEKGDKL-----------------------EVMGYADTDVAEFVYKKK 130

Query: 148 PDIKSLVNYLCKDLSKAC 165
           P    LV +LCKDLS+AC
Sbjct: 131 PSADQLVKFLCKDLSEAC 148


>gi|159475098|ref|XP_001695660.1| hypothetical protein CHLREDRAFT_205897 [Chlamydomonas reinhardtii]
 gi|158275671|gb|EDP01447.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 314

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/235 (34%), Positives = 120/235 (51%), Gaps = 8/235 (3%)

Query: 1   MARIKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQ 60
           MAR+    L  + + L +++         KP A   K D+ YI CQVCE +A Q     +
Sbjct: 1   MARLNLLQLCVLGLCLAVASAQFGGKRPPKPKAPVAKGDIKYIKCQVCEAIAKQSIKATK 60

Query: 61  KKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSE 120
               Q  PKK+ E +I+   E +C+   ++ +WI K DIVE+   LVL +  A G+C SE
Sbjct: 61  DLIEQAGPKKVPEADILERVEKMCDPDTDDGEWITKYDIVEEDSALVLKDTGAVGRCKSE 120

Query: 121 CKTIERACQEIMGYSD-TDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPG 179
           C+TI RAC+ I    D TD++  L+  K    ++ N++C D S ACS K P + K     
Sbjct: 121 CRTIARACEMISDDVDLTDLSAMLFKGKKRA-AISNWMCHDASDACSKKAPALTKREGAD 179

Query: 180 EPFVPKPAKEAEMERIMRSME--GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDD 232
           E   P    E   ER+MRSM+  G+ G     MY+KE +        +  +DN+D
Sbjct: 180 EAHAPMDEDELRTERMMRSMKAAGLSGT----MYNKETMQEELAEMADQYEDNED 230


>gi|388507226|gb|AFK41679.1| unknown [Lotus japonicus]
          Length = 116

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/98 (64%), Positives = 82/98 (83%), Gaps = 1/98 (1%)

Query: 192 MERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFRE 251
           ME++++SMEGMPGAPGM+MYS++DLMNMKNFGG++ D+++DEDDDD++ FP  LGKV R+
Sbjct: 1   MEKLLKSMEGMPGAPGMKMYSRDDLMNMKNFGGDE-DEDEDEDDDDQASFPSKLGKVLRD 59

Query: 252 KESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWW 289
           KES   DWKQ I  GI +T   LKK AN++SNHVQ+WW
Sbjct: 60  KESGKSDWKQLIKKGITDTSMTLKKHANRISNHVQKWW 97


>gi|255084251|ref|XP_002508700.1| predicted protein [Micromonas sp. RCC299]
 gi|226523977|gb|ACO69958.1| predicted protein [Micromonas sp. RCC299]
          Length = 343

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 69/184 (37%), Positives = 110/184 (59%), Gaps = 7/184 (3%)

Query: 35  ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
           A K DV YI CQVC+++A  L  +    +      K+ E +++   E VC+ +  E +W+
Sbjct: 74  AVKSDVKYIRCQVCQEVAKVLSREASTLR-DTKGAKLKESDVLEKVEKVCDPEAVEGEWL 132

Query: 95  LKIDIVEQGDKLVLV-EQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSL 153
           +  D+ E+G +L +V  ++  G C SECKT++RAC++I+   DTDVAE L+T  P   +L
Sbjct: 133 ITHDLQEKGSELKMVLMKDQYGACGSECKTMQRACEKIVADRDTDVAEILFTGSPQRAAL 192

Query: 154 VNYLC----KDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQ 209
           V ++C    +D   AC  K PP+ K+R  G  F P   K+ +M+R+M++M+ M G  GM 
Sbjct: 193 VQHMCHNDDEDGFGACVKKAPPLPKSRPKGPAFDPVDKKDLDMQRMMKNMKDM-GMGGMS 251

Query: 210 MYSK 213
           MY++
Sbjct: 252 MYNR 255


>gi|307105084|gb|EFN53335.1| hypothetical protein CHLNCDRAFT_137061 [Chlorella variabilis]
          Length = 293

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 33/199 (16%)

Query: 31  PVAGARKEDVPYITCQ--VCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKK 88
           P    R  D+PYI CQ  VCE L    + QV             E +II   E +    +
Sbjct: 29  PKEKPRLADIPYIQCQARVCELLTKNAWRQV------------DEMKIIERMEKITTAWR 76

Query: 89  EEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYT--- 145
            E +WI  +D+VE+GDKLV+      G+C  ECKTIE A + IMG  DTD+AE L+T   
Sbjct: 77  AEGEWIASLDLVEKGDKLVV------GKCGVECKTIEAAAERIMGEHDTDIAEMLFTVGL 130

Query: 146 ----SKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME- 200
                K       N+LC + ++ C  KPPP+ K R PG PF P    +  +ER+M  ME 
Sbjct: 131 VLGLGKKTRAQFNNWLCYEETQVCKYKPPPLPKERPPGPPFEPMAQGDQNLERMMGEMED 190

Query: 201 -GMPGAPGMQMYSKEDLMN 218
            G+ G+    +YS++DL++
Sbjct: 191 KGLKGS----LYSRDDLLS 205


>gi|412992360|emb|CCO20073.1| predicted protein [Bathycoccus prasinos]
          Length = 356

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/183 (32%), Positives = 107/183 (58%), Gaps = 5/183 (2%)

Query: 35  ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
           A K D   I C+VC+K+ S++  +   +  +   +K++E ++I   E +C+   ++  W 
Sbjct: 53  AVKSDSKLIHCEVCQKV-SKIAFETGVRLREEKGEKLTEVDVIEEMELMCDESDDKTMWA 111

Query: 95  LKIDI-VEQGDKLVL--VEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK-PDI 150
           ++ D+ V   +KLVL  + ++A   C + CKT+ +AC+++M   DTD+AE L+ ++    
Sbjct: 112 VQNDLKVNFRNKLVLKFMGEDAYSSCETTCKTMLKACEDVMEGRDTDLAEILFKNRIKTA 171

Query: 151 KSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQM 210
           K +  +LC + +++CS K   V K R P   F  K  K+ + ER+M  ++ MPG PGMQM
Sbjct: 172 KGIEEWLCTEETESCSKKAKSVSKTRKPLPEFKKKEKKDIDTERMMGKLKNMPGMPGMQM 231

Query: 211 YSK 213
           Y++
Sbjct: 232 YNR 234


>gi|303287402|ref|XP_003062990.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455626|gb|EEH52929.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 336

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 92/166 (55%), Gaps = 8/166 (4%)

Query: 37  KEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILK 96
           K DV Y+ C VC+++A  L H+V  K  +   KK++E +++   E VC+   EE +W++ 
Sbjct: 51  KSDVKYVKCAVCQEVAKTL-HRVAAKVREDKGKKLTEADVLEKVEKVCDPDAEEGEWLIT 109

Query: 97  IDIVEQGDKLVLVEQEAE-GQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KSL 153
            D+VE+G +L +     + G+C +ECKT++ AC+ I+   DTDVAE L+T    +    +
Sbjct: 110 HDLVERGKRLDMKHMGGKFGECGAECKTMQMACERIVSDRDTDVAEMLFTDGASLARAGI 169

Query: 154 VNYLCKDLSK----ACSTKPPPVLKNRIPGEPFVPKPAKEAEMERI 195
              LC D ++    AC    P + K R  G  F  K  K+ EM R+
Sbjct: 170 QAKLCNDANEKAYSACRKPAPALPKTRPKGPAFKEKDKKDVEMARL 215


>gi|156343882|ref|XP_001621149.1| hypothetical protein NEMVEDRAFT_v1g222317 [Nematostella vectensis]
 gi|156206818|gb|EDO29049.1| predicted protein [Nematostella vectensis]
          Length = 376

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 51/133 (38%), Positives = 77/133 (57%)

Query: 71  ISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQE 130
           + E +II + E  CN  KEE +WI K DIV++GD L L E+E  G+C  EC+TI +AC+E
Sbjct: 1   VEEDKIIELVEASCNPDKEEGEWISKFDIVKKGDSLQLSEKEVFGRCKQECRTIAKACEE 60

Query: 131 IMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEA 190
            +   DTD+AE ++ +K  +  L N +C  +SKAC+ K   +  ++   E F      E 
Sbjct: 61  TISEVDTDLAEMIWQNKLSLSKLTNKICHSMSKACTKKAQNLKIDKRFDEEFEAMSEDER 120

Query: 191 EMERIMRSMEGMP 203
             ++IM+ M   P
Sbjct: 121 NADKIMKQMSYGP 133


>gi|307105085|gb|EFN53336.1| hypothetical protein CHLNCDRAFT_137062 [Chlorella variabilis]
          Length = 167

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 82/141 (58%), Gaps = 4/141 (2%)

Query: 94  ILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSL 153
           + K+D+VE+GDKLV+ E    G C SECKT++ A + IMG  DTD+AE L+T K +    
Sbjct: 1   MAKLDLVEEGDKLVVKEMGQVGNCGSECKTVQLAAERIMGEHDTDIAEILFTGKMNRAQF 60

Query: 154 VNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSK 213
             +LC D S+AC    PP+   R PG PF P+   +  ++ ++  +E    A   ++  +
Sbjct: 61  NRWLCWDESEACKKDAPPLPNGRKPGPPFDPRQEGDQSIDAMLGGLEDKGLANNKKL--R 118

Query: 214 EDLMNMKNFGGEDGDDNDDED 234
           E+L  M+  G ++ D  DD+D
Sbjct: 119 EEL--MEKLGVKEADRVDDDD 137


>gi|302836977|ref|XP_002950048.1| hypothetical protein VOLCADRAFT_104574 [Volvox carteri f.
           nagariensis]
 gi|300264521|gb|EFJ48716.1| hypothetical protein VOLCADRAFT_104574 [Volvox carteri f.
           nagariensis]
          Length = 154

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 59/151 (39%), Positives = 81/151 (53%), Gaps = 8/151 (5%)

Query: 83  VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSD-TDVAE 141
           +CN  +   DWI + DIVEQG  L L + +  G+C SEC+TI RAC+ I    D TD++ 
Sbjct: 1   MCNPDQNAGDWITQYDIVEQGTSLELKDTDMIGRCKSECRTIARACELITEDIDLTDLSA 60

Query: 142 YLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEG 201
            LY  K    ++ N+LC D + ACS KPPP    +   E   P    E    R+MR ME 
Sbjct: 61  MLYKGKK-RAAITNWLCYDATDACSRKPPPFSAGQRVDEVHEPLDEDEVRNTRMMRDMEA 119

Query: 202 MPGAPGMQMYS----KEDLMNMKNFGGEDGD 228
           M G  G  +Y+     E+L  M++  G+D D
Sbjct: 120 M-GLSG-SLYNTDTLSEELEEMQDVYGDDPD 148


>gi|357501197|ref|XP_003620887.1| hypothetical protein MTR_6g091870 [Medicago truncatula]
 gi|355495902|gb|AES77105.1| hypothetical protein MTR_6g091870 [Medicago truncatula]
          Length = 125

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/123 (43%), Positives = 75/123 (60%), Gaps = 27/123 (21%)

Query: 174 KNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDE 233
           + R PGEPFV K + EAEME++++SME                    NFG E+ DD++DE
Sbjct: 17  RTRTPGEPFVAKSSNEAEMEKLLKSME--------------------NFGAENEDDDEDE 56

Query: 234 DDDDE-------SDFPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQ 286
           D+D++       +DFP  LGKV R KE++ GDWKQ+I  GI++T   LKK A KVSNH+ 
Sbjct: 57  DEDEDEDEDEDEADFPSKLGKVLRSKENEKGDWKQKIRKGIVDTSTALKKHATKVSNHIH 116

Query: 287 RWW 289
            ++
Sbjct: 117 LFF 119


>gi|299472707|emb|CBN80275.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 246

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 84/176 (47%), Gaps = 8/176 (4%)

Query: 30  KPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKE 89
           KP +   K D+  I C VC+ LA  LY +V + +      K+ E +I  + E  C+   +
Sbjct: 25  KPKSDPVKGDLAAIKCGVCQGLAEHLYEKVAEMREAAPFGKVHEDDISTVIEGACDPATK 84

Query: 90  EADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY---LYTS 146
           E  WI + DIVE+G  L L +Q++   C  EC+TI ++C  ++   + DV E    L+ +
Sbjct: 85  EGRWITEYDIVEEGGYLELKKQDSFSHCKEECETIAKSCT-VLVDDELDVEEVVVALWKN 143

Query: 147 KPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAE----MERIMRS 198
           K  +  L    C + S  CS K             F PK AK+ E    MER+  S
Sbjct: 144 KLGLSELQKAACGEWSDRCSGKKRVKAPKDRQDYEFEPKTAKDFEVAEAMERLQNS 199


>gi|357501193|ref|XP_003620885.1| hypothetical protein MTR_6g091840 [Medicago truncatula]
 gi|355495900|gb|AES77103.1| hypothetical protein MTR_6g091840 [Medicago truncatula]
          Length = 117

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 72/133 (54%), Gaps = 33/133 (24%)

Query: 174 KNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDE 233
           + R PGEPFV K + EAEME++++SME                    NFG E+ DDN+DE
Sbjct: 17  RTRTPGEPFVAKSSNEAEMEKLVKSME--------------------NFGAENEDDNEDE 56

Query: 234 DDDDESD-FPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
           D+D++   FP  LGKV R KE++ GDWKQ+I  GI+  G            H   WW G 
Sbjct: 57  DEDEDEADFPSKLGKVLRSKENEKGDWKQKIRKGIVECG------------HKHGWWKGK 104

Query: 293 KAARSKKPGKTEL 305
           KA  SKK  K EL
Sbjct: 105 KAISSKKNPKPEL 117


>gi|357499651|ref|XP_003620114.1| hypothetical protein MTR_6g077350 [Medicago truncatula]
 gi|355495129|gb|AES76332.1| hypothetical protein MTR_6g077350 [Medicago truncatula]
          Length = 158

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/69 (59%), Positives = 46/69 (66%), Gaps = 6/69 (8%)

Query: 4   IKAAVLVYVAVIL-LLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK 62
           IK  +  Y  + L  LS  AP     KKPV  ARKEDVPYI CQVCE LA QLY QVQ K
Sbjct: 84  IKERIFFYWRLHLGFLSRTAP-----KKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSK 138

Query: 63  QAQISPKKI 71
           +A+ISPKK+
Sbjct: 139 KAEISPKKV 147


>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 424

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 36/45 (80%)

Query: 109 VEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSL 153
           V +  EG CNSECKT+E ACQ+++GYSDTDVAEY++ SK  + SL
Sbjct: 375 VLEYEEGHCNSECKTVETACQKVIGYSDTDVAEYIHISKSYLTSL 419


>gi|407410087|gb|EKF32656.1| hypothetical protein MOQ_003488 [Trypanosoma cruzi marinkellei]
          Length = 323

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 59/230 (25%), Positives = 102/230 (44%), Gaps = 26/230 (11%)

Query: 10  VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
           V+VA +LL S  A  +   K P A   K+ V  I+C VC  +  + Y  VQ    A +  
Sbjct: 19  VFVAAMLLFSE-AGFAKKEKPPHAPFPKDVVDAISCGVCNFMVRKAYSDVQLLFNASVET 77

Query: 69  K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
           + +++E +++   E+VCN   E   WI +I I  + D     L +       +C   C T
Sbjct: 78  RVRVNEDDVLTAIEDVCNPFSEVGHWIRQITITHESDTAPFLLGVTRSSVYAKCKGICST 137

Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLS-----KACSTKPP 170
           +  AC+ +M + + D        + EY      D ++  N LC+  S     +  S    
Sbjct: 138 VVEACEAVMDHENMDQLSPRLLRLHEY-----ADAEAFANALCQSSSICTERQGLSASRY 192

Query: 171 PVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMK 220
             ++N I  +       KE E+ER+M  ME         ++S++++ +M+
Sbjct: 193 EEVQNMIDADTVEEIDPKEMEIERMMDHMERKENR-RQTIFSRDEITSMQ 241


>gi|323453478|gb|EGB09349.1| hypothetical protein AURANDRAFT_62913 [Aureococcus anophagefferens]
          Length = 1856

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 63/233 (27%), Positives = 100/233 (42%), Gaps = 51/233 (21%)

Query: 29  KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP------------KKISEFEI 76
           K+P+A   K D+P I C VCE +A++L+ +   KQ + +P            +K+S F  
Sbjct: 158 KEPLA--VKGDLPCIRCDVCEIVATELWREA-TKQREAAPLTAASGKPGEKKRKVSTFSE 214

Query: 77  IGIAE---NVCNLKKEEADWILKIDIVE-QG-------------------DKLVLVEQEA 113
             + E    VCN +KE  +W+  +D+VE QG                   +  + V +  
Sbjct: 215 ADVNELVVGVCNRRKEMGEWLWYVDLVESQGPPGEGSAWRGLTKKEKASNENFLQVYRAQ 274

Query: 114 EG----QCNSECKTIERACQEIMGYSDTDVAEY---LYTSKPDIKSLVNYLCKDLSKACS 166
            G    + + E  T +R+C  +      DV E    L+    D K+     CK+LS  C 
Sbjct: 275 RGGPIRKWDRESATAKRSCDLLFDDDIGDVEELVVPLWRGDVDEKAFKKLACKELSGRCG 334

Query: 167 TKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSM--EGMPGAPGMQMYSKEDLM 217
               PV         +  +     + ER+M +M  +GMP    M M S+ED+M
Sbjct: 335 KDRKPVNAQGREDFEYDAQDKSLLDTERMMENMAAQGMP----MVMQSREDMM 383


>gi|428168650|gb|EKX37592.1| hypothetical protein GUITHDRAFT_116234 [Guillardia theta CCMP2712]
          Length = 318

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 74/147 (50%), Gaps = 7/147 (4%)

Query: 42  YITCQVCEKLASQLY-HQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIV 100
           YI C  C+ +A  LY H +   + +   +K+ E  I+ + E +C+  + + +W+   D+V
Sbjct: 44  YIKCDGCKIMADNLYRHAMDLLKEKGGREKMGEEPILELLEQICDQGEAKGEWLNAYDVV 103

Query: 101 EQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYS---DTDVAEYLYTSK--PDIKSLVN 155
            +  K+ L +Q+    C  EC  +   C+++   S   ++D+AE +Y        ++   
Sbjct: 104 GKKGKISLQKQDGFQSCGPECHALVSVCEQVKVRSEAGESDIAEKIYRGHHIASAEAFSE 163

Query: 156 YLCKDLSKACSTKPPPVLKNRIPGEPF 182
            +CK ++ +C  +P P LK+    EPF
Sbjct: 164 AICKKMTTSCK-EPLPKLKDPRIDEPF 189


>gi|71408335|ref|XP_806579.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870363|gb|EAN84728.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 330

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 92/210 (43%), Gaps = 25/210 (11%)

Query: 10  VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
           ++ AV LL S  A ++   K P A   K+ V  I+C VC  +  + Y  VQ    A +  
Sbjct: 26  IFFAVTLLFSE-AGVAKKEKPPHAPLPKDVVDAISCGVCNFMVGKAYGDVQLLFNASVQT 84

Query: 69  K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
           + +++E +++   E+VCN   +   WI +I I  + D     L +    A  +C   C T
Sbjct: 85  RVRVNEDDVLTAIEDVCNPFSDVGQWIRQITITHESDTAPFLLGVSRLPAYAKCKRVCST 144

Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLS-----KACSTKPP 170
           +  AC+ +M + + D        + EY        ++  N LC+  S     +  S    
Sbjct: 145 VVEACEAVMDHKNMDQLSPRLLRLNEY-----AGAEAFANALCQSSSICTERQGLSAARY 199

Query: 171 PVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
             ++N I  +       KE E+ER+M  ME
Sbjct: 200 EEVRNMIDEDTVEEIDPKEMEIERMMDHME 229


>gi|71405983|ref|XP_805566.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70869024|gb|EAN83715.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 330

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 91/210 (43%), Gaps = 25/210 (11%)

Query: 10  VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
           ++ AV LL S  A ++   K P A   K+ V  I+C VC  +  + Y  VQ    A +  
Sbjct: 26  IFFAVTLLFSA-AGVAKKEKPPHAPFPKDVVNAISCGVCNFMVGKAYGDVQLLVNASVQT 84

Query: 69  K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
           + +++E +++   E++CN   +   WI +I I  + D     L +    A  +C   C T
Sbjct: 85  RVRVNEDDVLTAIEDLCNPFSDVGQWIRQITITHESDTAPFLLGVSRLPAYAKCKRVCST 144

Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLS-----KACSTKPP 170
           +  AC+ +M + + D        + EY        ++  N LC   S     +  S    
Sbjct: 145 VVEACEAVMDHENMDQLSPRLLRLNEY-----AGAEAFANALCLPSSICTERQGLSAARY 199

Query: 171 PVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
             ++N I  +       KE E+ER+M  ME
Sbjct: 200 EEVRNMIDEDTVEEIDPKEMEIERMMDHME 229


>gi|387198247|gb|AFJ68839.1| hypothetical protein NGATSA_3049300 [Nannochloropsis gaditana
           CCMP526]
          Length = 281

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 49/184 (26%), Positives = 81/184 (44%), Gaps = 14/184 (7%)

Query: 35  ARKEDVPYITCQVCEKLASQLYHQVQK---KQAQISPKKISEFEIIGIAENVCNLKKEEA 91
           A+K D+PYI C VC     QL +  +    ++  +   K  +  ++   + VC+    E 
Sbjct: 73  AQKTDLPYIACDVCMLTMEQLSNVTRNLALREENLHKGKPKKSLVLKATKKVCDYASYEG 132

Query: 92  DWILKIDIVEQ----GDKLVLVEQEAEGQCNSECKTIERACQEIM---GYSDTDVAEYLY 144
            WI   DI E+    G  L LV Q   G    E  TI ++C ++    G    +VA+ L 
Sbjct: 133 RWITYYDIQEKDRVGGTLLKLVAQPESGMPEVEADTIAKSCTKLFKSEGIDRDEVADDLL 192

Query: 145 TSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGE-PFVPKPAKEAEMERIMRSM--EG 201
            +   + +L    C D+S  CS     +++  +  +  F PK  +    +RIM +    G
Sbjct: 193 EAL-SVSALQKKYCTDISSRCSKTAQSLMEKGLRRDYAFKPKTKEAKSYDRIMDAFAEAG 251

Query: 202 MPGA 205
           + GA
Sbjct: 252 VLGA 255


>gi|388496004|gb|AFK36068.1| unknown [Lotus japonicus]
          Length = 62

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 20/23 (86%), Positives = 23/23 (100%)

Query: 202 MPGAPGMQMYSKEDLMNMKNFGG 224
           MPGAPGM+MYS++DLMNMKNFGG
Sbjct: 1   MPGAPGMKMYSRDDLMNMKNFGG 23


>gi|407849528|gb|EKG04244.1| hypothetical protein TCSYLVIO_004702 [Trypanosoma cruzi]
          Length = 330

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 29/212 (13%)

Query: 10  VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
           ++ AV LL S  A ++   K   A   K+ V  I+C VC  +  + Y  VQ    A +  
Sbjct: 26  IFFAVTLLFSE-AGVAKKEKPSHAPFPKDVVDAISCGVCNFMVGKAYGDVQLLFNASVQT 84

Query: 69  K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
           + +++E +++   E++CN   +   WI +I I  + D     L +    A  +C   C T
Sbjct: 85  RVRVNEDDVLTAIEDLCNPFSDVGQWIRQITITHESDTAPFLLGVSRLPAYAKCKRVCST 144

Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPV--- 172
           +  AC+ +M +   D        + EY        ++  N LC  LS +  T+   +   
Sbjct: 145 VVEACEAVMDHESMDQLSPRLLRLNEY-----AGAEAFANALC--LSSSICTERQGLSAA 197

Query: 173 ----LKNRIPGEPFVPKPAKEAEMERIMRSME 200
               ++N I  +       KE E+ER+M  ME
Sbjct: 198 KYEEVRNMIDEDTVEEIDPKEMEIERMMDHME 229


>gi|340057508|emb|CCC51854.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 323

 Score = 46.2 bits (108), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 104/249 (41%), Gaps = 45/249 (18%)

Query: 4   IKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKLAS 53
           +KA  LV +A + +L+           P A ARKE   ++P+       I C VC  +  
Sbjct: 23  VKAVSLVVLAALTVLTL---------PPTAVARKEKPLNIPFPQDVIDNIRCDVCNIMTR 73

Query: 54  QLYHQVQKKQAQISPKKI--SEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ 111
           + Y  V          ++  +E +++   E+VCN   E   WI    I +    L     
Sbjct: 74  KAYDLVHALFVLSDSTRVPVNEEDVMVAIEDVCNPFAESGQWIRLFSIGKTPSALTAAPA 133

Query: 112 EA---------EGQCNSECKTIERACQEIMGYSDTDVA--EYLYTSK-PDIKSLVNYLCK 159
                        +C   CKTI  AC+ ++ +   DV   + L   K  +  +    +C 
Sbjct: 134 PLHLTLLPLPNHTKCKRTCKTIWEACEAVIDHDSMDVLSPQLLRLRKYANADAFARSVC- 192

Query: 160 DLSKACSTK---PP---PVLKNRIPGEPFVPKPAKEAEM--ERIMRSMEGMPGAPGMQMY 211
           D++  C+ +   PP    VL   I  +  VP+  ++ EM  ER+M  ME        ++Y
Sbjct: 193 DVASFCTERGAIPPQRYEVLGGMIAED--VPEEIEQKEMDIERMMDFMER-KNNQRHEIY 249

Query: 212 SKEDLMNMK 220
           S+E++ +M+
Sbjct: 250 SREEVTSMQ 258


>gi|261333030|emb|CBH16025.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 300

 Score = 45.8 bits (107), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 102/247 (41%), Gaps = 42/247 (17%)

Query: 2   ARIKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKL 51
           A +K   LV VA++ +             P   A+KE   +VP+       + C VC  +
Sbjct: 4   APVKGVFLVLVALVPMF------------PAVDAKKEKPMNVPFPQDAIDDVRCDVCRLM 51

Query: 52  ASQLYHQVQKKQAQISPKKI--SEFEIIGIAENVCNLKKEEADWILKIDI-------VEQ 102
           A + Y  V +        +I  +E  ++   E++CN       WI ++ +        ++
Sbjct: 52  AKRAYGYVYELFVTSEHTRIPVNEENVLVAIEDICNPLAASGQWIRQVAVDTSSPAAAKE 111

Query: 103 GDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTD--VAEYLYTSK-PDIKSLVNYLCK 159
              + L E     +C   CKT+  AC+ ++   D D   ++ L   K  D  +L +  C 
Sbjct: 112 PFHITLTELSHHSKCKRTCKTLSDACEAVVDDDDMDQFSSKLLILKKYKDASALADATC- 170

Query: 160 DLSKAC------STKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSK 213
            +S  C      S+K    L   +  +       KE ++ER M  ME   G    ++YS+
Sbjct: 171 GVSAFCTSRTGFSSKRYKELLTLVASDVVEVIEQKEMDIERFMDEME-RKGNRRQEIYSR 229

Query: 214 EDLMNMK 220
           E+++ M+
Sbjct: 230 EEVIKMQ 236


>gi|71748168|ref|XP_823139.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832807|gb|EAN78311.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 300

 Score = 44.3 bits (103), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 100/247 (40%), Gaps = 42/247 (17%)

Query: 2   ARIKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKL 51
           A +K   LV VA++ +             P   A+KE   +VP+       + C VC  +
Sbjct: 4   APVKGVFLVLVALVPMF------------PAVDAKKEKPMNVPFPQDAIDDVRCDVCRLM 51

Query: 52  ASQLYHQVQKKQAQISPKKI--SEFEIIGIAENVCNLKKEEADWILKIDIVEQGD----- 104
           A + Y  V +        +I  +E  ++   E++CN       WI ++ +          
Sbjct: 52  AKRAYGYVYELFVTSEHTRIPVNEENVLVAIEDICNPLAASGQWIRQVAVDTSSPAAAKG 111

Query: 105 --KLVLVEQEAEGQCNSECKTIERACQEIMGYSDTD--VAEYLYTSK-PDIKSLVNYLCK 159
              + L E     +C   CKT+  AC+ ++   D D   ++ L   K  D  +L +  C 
Sbjct: 112 PFHITLTELPHHSKCKRTCKTLSDACEAVVDDDDMDQFSSKLLILKKYKDASALADATC- 170

Query: 160 DLSKAC------STKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSK 213
            +S  C      S+K    L   +  +       KE ++ER M  ME   G    ++YS+
Sbjct: 171 GVSAFCTSRTGFSSKRYKELLTLVASDVVEVIEQKEMDIERFMDEME-RKGNRRQEIYSR 229

Query: 214 EDLMNMK 220
           E+++ M+
Sbjct: 230 EEVIKMQ 236


>gi|342184535|emb|CCC94017.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 307

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 60/241 (24%), Positives = 97/241 (40%), Gaps = 45/241 (18%)

Query: 16  LLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKLASQLYHQVQKKQAQ 65
           L L   A ++      +  A+KE   +VPY       + C VC   A Q Y +V   +  
Sbjct: 3   LFLVKYAAVTLACMALLTSAKKEKPMNVPYPQDVVDDMRCDVCHTAAKQAYARVL--ELH 60

Query: 66  ISPKK----ISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQ----- 116
           ++ K+    ++E +++   E++CN    E  WI K  +    +   L    A  Q     
Sbjct: 61  VASKRTRVPVNEEDVLVAIEDLCNPLASEGQWIRKFGV----NTSYLTSAGAPAQPLFIS 116

Query: 117 ---------CNSECKTIERACQEIMGYSDTD-VAEYL--YTSKPDIKSL------VNYLC 158
                    C   CKT   AC+ +M   + D  + YL   T+  D ++       V   C
Sbjct: 117 VYTSRNYLKCKRTCKTASDACELVMDDEEMDQFSSYLLRLTTFKDAEAFAEKTCGVASFC 176

Query: 159 KDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMN 218
           K  S   + +   VL+  I  E       KE +MER+M  ME         +YS+E+++ 
Sbjct: 177 KKRSGFSAKRYKEVLEV-IALETPEEIDQKEMDMERLMDQMERRNNQR-QDIYSREEIIK 234

Query: 219 M 219
           M
Sbjct: 235 M 235


>gi|195487538|ref|XP_002091949.1| GE13920 [Drosophila yakuba]
 gi|194178050|gb|EDW91661.1| GE13920 [Drosophila yakuba]
          Length = 416

 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 47/100 (47%), Gaps = 14/100 (14%)

Query: 205 APGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREK-ESKSGDWKQR- 262
           AP +  YS       + +  E  DDN+ E+D  ES F +NL ++  E+   +S  + +R 
Sbjct: 294 APPVASYSSS---YKRTYANELSDDNEAEEDQYESTFARNLARLRAERIGDRSSPYSRRT 350

Query: 263 ITDGILNTGDV---------LKKQANKVSNHVQRWWSGIK 293
           +  G + +G +         L+   N VS    RWW+ ++
Sbjct: 351 LASGNVGSGSLGYEPLARRSLRPDDNSVSVAFNRWWNSLE 390


>gi|392959126|ref|ZP_10324611.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           DSM 17108]
 gi|421053010|ref|ZP_15515993.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           B4]
 gi|421062083|ref|ZP_15524295.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           B3]
 gi|421068445|ref|ZP_15529750.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           A12]
 gi|421070001|ref|ZP_15531139.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           A11]
 gi|392442484|gb|EIW20067.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           B4]
 gi|392442614|gb|EIW20193.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           A12]
 gi|392444763|gb|EIW22165.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           B3]
 gi|392448906|gb|EIW26078.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           A11]
 gi|392456776|gb|EIW33513.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 346

 Score = 40.4 bits (93), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 13/84 (15%)

Query: 63  QAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECK 122
           QA ++P +I+  E+ GI EN+     E  D+       + GDK+++V Q   G+C+   +
Sbjct: 49  QANVTPPRITGHELTGIIENI---GSEVEDY-------QVGDKVLVVPQMTCGKCHYCLQ 98

Query: 123 TIERACQEIM--GYS-DTDVAEYL 143
                C+E++  GYS D   AEY+
Sbjct: 99  GKSNLCEEVLLFGYSLDGGFAEYI 122


>gi|421078291|ref|ZP_15539248.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           JBW45]
 gi|392523608|gb|EIW46777.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
           JBW45]
          Length = 346

 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 13/84 (15%)

Query: 63  QAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECK 122
           QA ++P +I+  E+ GI EN+     E  D+       + GDK+++V Q   G+C+   +
Sbjct: 49  QANVTPPRITGHELTGIIENI---GSEVEDY-------QVGDKVLVVPQMTCGKCHYCLQ 98

Query: 123 TIERACQEIM--GYS-DTDVAEYL 143
                C+E++  GYS D   AEY+
Sbjct: 99  GKSNLCEEVLLFGYSLDGGFAEYI 122


>gi|413942884|gb|AFW75533.1| hypothetical protein ZEAMMB73_462245 [Zea mays]
          Length = 76

 Score = 40.0 bits (92), Expect = 1.2,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 45/64 (70%), Gaps = 4/64 (6%)

Query: 192 MERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFRE 251
           ME+++RSME M G  GM  YS++DLM  KN  G + DD+++ D D++ +F KN+GK  ++
Sbjct: 1   MEKLLRSMEVMAGYAGM--YSRDDLM--KNNFGTEDDDDEEGDGDEDDNFAKNMGKALKD 56

Query: 252 KESK 255
           K S+
Sbjct: 57  KGSQ 60


>gi|423720329|ref|ZP_17694511.1| hypothetical protein, DUF2515 family [Geobacillus thermoglucosidans
           TNO-09.020]
 gi|383366714|gb|EID44000.1| hypothetical protein, DUF2515 family [Geobacillus thermoglucosidans
           TNO-09.020]
          Length = 383

 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 50/110 (45%), Gaps = 15/110 (13%)

Query: 24  ISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP--KKISEFEIIGIAE 81
           + +    P  G RK+  P++   VCE    Q Y + + K  QI P   +I   +++   +
Sbjct: 267 VRWAFLHPHTGTRKDYWPHLFNDVCESAPRQPYRR-RIKDCQIQPGVPRIYSPQLLYAWK 325

Query: 82  NVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSE-----CKTIER 126
           NV + + E  DW    ++VE          E+ GQ N E     C+T+E+
Sbjct: 326 NVVHEEAERGDWFEDWNVVE-------YLAESNGQINGEITNAYCETLEK 368


>gi|341881244|gb|EGT37179.1| hypothetical protein CAEBREN_31119 [Caenorhabditis brenneri]
          Length = 380

 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 46  QVCEKL------ASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWIL-KID 98
           QVCE++      A++L H+ +      S   ++ F I+G AEN+  ++K+  ++I+ K+ 
Sbjct: 237 QVCEQMCRLNRNAAKLLHKYEAH----SSTDVTGFGILGHAENLARVQKQPVEFIIEKLP 292

Query: 99  IVEQGDKLV--LVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYL 143
           I+E  D++V  ++    EG    E ++ E +   ++  S+ +  EY+
Sbjct: 293 IIEYMDEIVDKMIATGGEGFRLFEGRSAETSGGLLIAMSEKNAKEYI 339


>gi|336235792|ref|YP_004588408.1| hypothetical protein Geoth_2406 [Geobacillus thermoglucosidasius
           C56-YS93]
 gi|335362647|gb|AEH48327.1| Protein of unknown function DUF2515 [Geobacillus
           thermoglucosidasius C56-YS93]
          Length = 383

 Score = 37.7 bits (86), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 7/106 (6%)

Query: 24  ISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP--KKISEFEIIGIAE 81
           + +    P  G RK+  P++   VCE    Q Y + + K  QI P   +I   +++   +
Sbjct: 267 VRWAFLHPHTGTRKDYWPHLFNDVCESAPRQPYRR-RIKDCQIQPGVPRIYSPQLLYAWK 325

Query: 82  NVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQ-CNSECKTIER 126
           NV + + E  DW    ++VE    L    ++  G+  N+ C+T+E+
Sbjct: 326 NVVHEEAERGDWFEDWNVVEY---LTESNEQINGEITNAYCETLEK 368


>gi|393239155|gb|EJD46688.1| hypothetical protein AURDEDRAFT_164143 [Auricularia delicata
           TFB-10046 SS5]
          Length = 1365

 Score = 37.7 bits (86), Expect = 6.4,   Method: Composition-based stats.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 5/75 (6%)

Query: 223 GGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDG---ILNTGDVLKKQAN 279
           G ED D NDD  + D+ DF    G    E E + G+W   + DG   ++  G + K+Q  
Sbjct: 380 GVEDDDTNDDALNPDKPDFAD--GAYIDEDEDEDGNWVTMLPDGKPYVIKKGPLTKEQRK 437

Query: 280 KVSNHVQRWWSGIKA 294
           KV+  V+++   + A
Sbjct: 438 KVAAVVKKFEDSLHA 452


>gi|312111344|ref|YP_003989660.1| hypothetical protein GY4MC1_2322 [Geobacillus sp. Y4.1MC1]
 gi|311216445|gb|ADP75049.1| Protein of unknown function DUF2515 [Geobacillus sp. Y4.1MC1]
          Length = 383

 Score = 37.7 bits (86), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 7/106 (6%)

Query: 24  ISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP--KKISEFEIIGIAE 81
           + +    P  G RK+  P++   VCE    Q Y + + K  QI P   +I   +++   +
Sbjct: 267 VRWAFLHPHTGTRKDYWPHLFNDVCESAPRQPYRR-RIKDCQIQPGVPRIYSPQLLYAWK 325

Query: 82  NVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQ-CNSECKTIER 126
           NV + + E  DW    ++VE    L    ++  G+  N+ C+T+E+
Sbjct: 326 NVVHEEAERGDWFEDWNVVEY---LAESNEQINGEITNAYCETLEK 368


>gi|261863831|gb|ACY01390.1| AT-less polyketide synthase [Streptomyces platensis subsp. rosaceus]
          Length = 3192

 Score = 37.4 bits (85), Expect = 7.7,   Method: Composition-based stats.
 Identities = 36/139 (25%), Positives = 60/139 (43%), Gaps = 13/139 (9%)

Query: 120  ECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCK----DLSKACSTK-PPPVLK 174
            E   + +   E+ G   TD+   ++    DI+SL  YL      +LS   S   P PV  
Sbjct: 2470 ESIALAKLASELNGVLGTDLTPAVFFECSDIRSLAEYLLDRYGPELSLPTSADAPAPVAA 2529

Query: 175  NRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGM-----QMYSKEDLMNMKNFGGEDGDD 229
             R P    +P P  + +   I+ +    PGA  +     Q+ + EDL  + ++ G+  D 
Sbjct: 2530 TR-PSPVPMPAPGPDDDAVAIVGAAGRFPGADDLDTFWQQLRAGEDL--IADYPGDRFDG 2586

Query: 230  NDDEDDDDESDFPKNLGKV 248
                +    +DFPK  G++
Sbjct: 2587 GPYAEVVARADFPKFAGRI 2605


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.132    0.387 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,923,564,249
Number of Sequences: 23463169
Number of extensions: 216491904
Number of successful extensions: 1371195
Number of sequences better than 100.0: 616
Number of HSP's better than 100.0 without gapping: 125
Number of HSP's successfully gapped in prelim test: 491
Number of HSP's that attempted gapping in prelim test: 1363822
Number of HSP's gapped (non-prelim): 4396
length of query: 305
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 163
effective length of database: 9,027,425,369
effective search space: 1471470335147
effective search space used: 1471470335147
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 76 (33.9 bits)