BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029195
         (197 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255555521|ref|XP_002518797.1| conserved hypothetical protein [Ricinus communis]
 gi|223542178|gb|EEF43722.1| conserved hypothetical protein [Ricinus communis]
          Length = 368

 Score =  315 bits (807), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 153/204 (75%), Positives = 169/204 (82%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+DLC F+KD F +KSPKKSPLVLRM+VL FVMVCGVYICS+C+KQIS  T     ++ 
Sbjct: 1   MADDLCFFSKDAFIIKSPKKSPLVLRMVVLAFVMVCGVYICSICLKQISISTNPGIFSLG 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VIE+PCP PNI+PWEIPYVHYPKPKTYSRAEC CNPVR FAILSMQRSGSGWFETLLNNH
Sbjct: 61  VIEKPCPEPNIQPWEIPYVHYPKPKTYSRAECMCNPVRNFAILSMQRSGSGWFETLLNNH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL- 179
           TNISSNGE+FSVKVRRSN S IVETLDKIYNLDW SSASKNECTAAVGLKWMLNQ  +  
Sbjct: 121 TNISSNGEIFSVKVRRSNVSMIVETLDKIYNLDWLSSASKNECTAAVGLKWMLNQGVMQH 180

Query: 180 ------YWKCSSIS--FSFQRPLV 195
                 Y+K   +S  F F+R L+
Sbjct: 181 HEEIVEYFKSRGVSAIFLFRRNLL 204


>gi|224123470|ref|XP_002330322.1| predicted protein [Populus trichocarpa]
 gi|222871357|gb|EEF08488.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score =  312 bits (799), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 150/195 (76%), Positives = 165/195 (84%), Gaps = 9/195 (4%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           +D+F +K+PKKSPL LRM+VLVF MVCGVYICS+C+KQI  RT   FLNV+VIERPCP P
Sbjct: 1   QDSFIIKAPKKSPLALRMVVLVFAMVCGVYICSICLKQIGIRTNPGFLNVEVIERPCPEP 60

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
           NIEPWEIPYVHYP+PKTYSRAEC CNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGE+
Sbjct: 61  NIEPWEIPYVHYPRPKTYSRAECKCNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEI 120

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL-------YWK 182
           FSVKVRRSN STI ETLDKIYNLDW SSASKNECTAAVGLKWMLNQ  +        Y+K
Sbjct: 121 FSVKVRRSNVSTITETLDKIYNLDWSSSASKNECTAAVGLKWMLNQGVMRHHEEIVEYFK 180

Query: 183 CSSIS--FSFQRPLV 195
              +S  F F+R L+
Sbjct: 181 TRGVSAIFLFRRNLL 195


>gi|224105519|ref|XP_002313841.1| predicted protein [Populus trichocarpa]
 gi|222850249|gb|EEE87796.1| predicted protein [Populus trichocarpa]
          Length = 345

 Score =  311 bits (796), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 153/205 (74%), Positives = 168/205 (81%), Gaps = 10/205 (4%)

Query: 1   MAEDLCSF-TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNV 59
           MA+D   F +KD+F +K+PKKSPL LRM+VLVF MVCGVYICS+C+KQI  RT   FLNV
Sbjct: 1   MADDPFFFCSKDSFLIKAPKKSPLALRMVVLVFAMVCGVYICSICLKQIGIRTNPGFLNV 60

Query: 60  QVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNN 119
           +VIERPCP PNIEPWEIPYVHYPKP TYSR EC CNPVRYFAILSMQRSGSGWFETLLNN
Sbjct: 61  EVIERPCPEPNIEPWEIPYVHYPKPITYSRVECKCNPVRYFAILSMQRSGSGWFETLLNN 120

Query: 120 HTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL 179
           HTNISSNGE+FSVKVRRSN STI ETLDKIYNLDW SSASKNEC AAVGLKWMLNQ  + 
Sbjct: 121 HTNISSNGEIFSVKVRRSNVSTITETLDKIYNLDWLSSASKNECAAAVGLKWMLNQGVMQ 180

Query: 180 -------YWKCSSIS--FSFQRPLV 195
                  Y+K   +S  F F+R L+
Sbjct: 181 HHEEIVEYFKTRGVSAIFLFRRNLL 205


>gi|225428745|ref|XP_002285039.1| PREDICTED: nodulation protein H [Vitis vinifera]
 gi|297741312|emb|CBI32443.3| unnamed protein product [Vitis vinifera]
          Length = 344

 Score =  304 bits (779), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 146/204 (71%), Positives = 165/204 (80%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+DLC FTKD+F  K+PKKSPLVLRM+VLVF MVCGVYICS+C+KQIS    + FLNV+
Sbjct: 1   MADDLCFFTKDSFVFKAPKKSPLVLRMIVLVFAMVCGVYICSICLKQISTTATAGFLNVE 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           + ERPC  PNIEP E+ YVH+PKP +YSRAECACNPVRYFAILS QRSGSGWFETLLN+H
Sbjct: 61  LTERPCEKPNIEPSEVRYVHFPKPISYSRAECACNPVRYFAILSTQRSGSGWFETLLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ----- 175
            NISSNGE+FSVKVRRSN S+I ETLDKIYNLDW SSASKNECTAAVGLKWMLNQ     
Sbjct: 121 INISSNGEIFSVKVRRSNISSITETLDKIYNLDWLSSASKNECTAAVGLKWMLNQGLMQN 180

Query: 176 ----VRLLYWKCSSISFSFQRPLV 195
               V     +  S+ F F+R L+
Sbjct: 181 HKEIVEYFNTRGVSLIFLFRRNLL 204


>gi|449529180|ref|XP_004171579.1| PREDICTED: nodulation protein H-like [Cucumis sativus]
          Length = 344

 Score =  300 bits (767), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 145/204 (71%), Positives = 168/204 (82%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+D  S TK+TF  K PKKSPL+LRM+VLVF MVCGV+ICSVC+KQIS R+K  F+ V+
Sbjct: 1   MADDPASLTKETFISKVPKKSPLMLRMIVLVFAMVCGVFICSVCLKQISTRSKVGFMKVE 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI+ PC  P I+P + P+VH+PKP TYSRAECAC+PVRYFAILSMQRSGSGWFETLLNNH
Sbjct: 61  VIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAILSMQRSGSGWFETLLNNH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL- 179
           TNISSNGE+FSVKVRRSN STIVETLDK+YNLDWF+SASKNECTAAVGLKWMLNQ  +  
Sbjct: 121 TNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNECTAAVGLKWMLNQGLMQH 180

Query: 180 ------YWKCSSIS--FSFQRPLV 195
                 Y+K   +S  F F+R L+
Sbjct: 181 HEEIVEYFKRRGVSAIFLFRRNLL 204


>gi|449438424|ref|XP_004136988.1| PREDICTED: nodulation protein H-like isoform 2 [Cucumis sativus]
          Length = 346

 Score =  297 bits (760), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 146/206 (70%), Positives = 168/206 (81%), Gaps = 11/206 (5%)

Query: 1   MAEDLCSFTKD--TFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLN 58
           MA+D  S TKD  TF  K PKKSPL+LRM+VLVF MVCGV+ICSVC+KQIS R+K  F+ 
Sbjct: 1   MADDPASLTKDGKTFISKVPKKSPLMLRMIVLVFAMVCGVFICSVCLKQISTRSKVGFMK 60

Query: 59  VQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLN 118
           V+VI+ PC  P I+P + P+VH+PKP TYSRAECAC+PVRYFAILSMQRSGSGWFETLLN
Sbjct: 61  VEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAILSMQRSGSGWFETLLN 120

Query: 119 NHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRL 178
           NHTNISSNGE+FSVKVRRSN STIVETLDK+YNLDWF+SASKNECTAAVGLKWMLNQ  +
Sbjct: 121 NHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNECTAAVGLKWMLNQGLM 180

Query: 179 L-------YWKCSSIS--FSFQRPLV 195
                   Y+K   +S  F F+R L+
Sbjct: 181 QHHEEIVEYFKRRGVSAIFLFRRNLL 206


>gi|79556423|ref|NP_179175.3| nodulation protein-like protein [Arabidopsis thaliana]
 gi|40823013|gb|AAR92253.1| At2g15730 [Arabidopsis thaliana]
 gi|45752690|gb|AAS76243.1| At2g15730 [Arabidopsis thaliana]
 gi|330251339|gb|AEC06433.1| nodulation protein-like protein [Arabidopsis thaliana]
          Length = 344

 Score =  295 bits (755), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 142/204 (69%), Positives = 163/204 (79%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M ++     KD+F  K PKKSPLVLR +VL+FVMVC VYICS+C+KQI     + FLNV+
Sbjct: 1   MGDEHSLIPKDSFVFKLPKKSPLVLRTVVLLFVMVCTVYICSICLKQIGVVPSAGFLNVE 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V ERPCP PNI+PW+IPYVHYPKPKTYSR EC+CNPVRYFAILSMQRSGSGWFETLLNNH
Sbjct: 61  VFERPCPEPNIQPWDIPYVHYPKPKTYSREECSCNPVRYFAILSMQRSGSGWFETLLNNH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL- 179
           TNISSNGE+FSVK RR+N STI ETLDK+YNLDW SSASKNECT+AVGLKWMLNQ  +  
Sbjct: 121 TNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNECTSAVGLKWMLNQGLMKN 180

Query: 180 ------YWKCSSIS--FSFQRPLV 195
                 Y+K   +S  F F+R L+
Sbjct: 181 HEEIVEYFKTRGVSAIFLFRRNLL 204


>gi|334187156|ref|NP_195168.3| P-loop containing nucleoside triphosphate hydrolase family
           protein-like protein [Arabidopsis thaliana]
 gi|332660972|gb|AEE86372.1| P-loop containing nucleoside triphosphate hydrolase family
           protein-like protein [Arabidopsis thaliana]
          Length = 342

 Score =  294 bits (753), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 136/175 (77%), Positives = 150/175 (85%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAED     KD F LK PKKS LVLRM+VL+FVMVC VYICS+C+KQI       FLNV+
Sbjct: 1   MAEDPSFLPKDGFVLKLPKKSSLVLRMVVLLFVMVCAVYICSICLKQIGVSPNYGFLNVE 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V ERPCP PNIEPW+IP+VHYPKPKTY+R EC+C+PVRYFAILSMQRSGSGWFETLLNNH
Sbjct: 61  VFERPCPEPNIEPWDIPFVHYPKPKTYNRDECSCHPVRYFAILSMQRSGSGWFETLLNNH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           TNISSNGE+FSVK RR+N STI ETLDK+YNLDW SSASKNECT+AVG KWMLNQ
Sbjct: 121 TNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNECTSAVGFKWMLNQ 175


>gi|297832068|ref|XP_002883916.1| hypothetical protein ARALYDRAFT_480426 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329756|gb|EFH60175.1| hypothetical protein ARALYDRAFT_480426 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 344

 Score =  292 bits (748), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 141/204 (69%), Positives = 162/204 (79%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M ++     KD+F  K PKKSPLVLR +VL+FVMVC VYICS+C+KQI     + FLNV+
Sbjct: 1   MGDEHSLIPKDSFVFKLPKKSPLVLRTVVLLFVMVCTVYICSICLKQIGVVPSAGFLNVE 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V ER CP PNIEPW+IPYVHYPKPKTYSR EC+CNPVRYFAILSMQRSGSGWFETLLNNH
Sbjct: 61  VFERQCPEPNIEPWDIPYVHYPKPKTYSREECSCNPVRYFAILSMQRSGSGWFETLLNNH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL- 179
           TNISSNGE+FSVK RR+N +TI ETLDK+YNLDW SSASKNECT+AVGLKWMLNQ  +  
Sbjct: 121 TNISSNGEIFSVKDRRANVTTIFETLDKVYNLDWLSSASKNECTSAVGLKWMLNQGLMKH 180

Query: 180 ------YWKCSSIS--FSFQRPLV 195
                 Y+K   +S  F F+R L+
Sbjct: 181 HEEIVEYFKTRGVSAIFLFRRNLL 204


>gi|297802526|ref|XP_002869147.1| hypothetical protein ARALYDRAFT_912948 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314983|gb|EFH45406.1| hypothetical protein ARALYDRAFT_912948 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 490

 Score =  292 bits (747), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 135/175 (77%), Positives = 149/175 (85%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAED     KD F LK PKKS LVLRM+VL+FVMVC VYICS+C+KQI       FLNV+
Sbjct: 149 MAEDPSFLPKDGFVLKLPKKSSLVLRMVVLLFVMVCAVYICSICLKQIGVSPNYGFLNVE 208

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V ERPCP PNIEPW+I +VHYPKPKTY+R EC+C+PVRYFAILSMQRSGSGWFETLLNNH
Sbjct: 209 VFERPCPEPNIEPWDILFVHYPKPKTYNRDECSCHPVRYFAILSMQRSGSGWFETLLNNH 268

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           TNISSNGE+FSVK RR+N STI ETLDK+YNLDW SSASKNECT+AVG KWMLNQ
Sbjct: 269 TNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNECTSAVGFKWMLNQ 323


>gi|357475463|ref|XP_003608017.1| Nodulation protein H [Medicago truncatula]
 gi|355509072|gb|AES90214.1| Nodulation protein H [Medicago truncatula]
          Length = 344

 Score =  290 bits (742), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 132/175 (75%), Positives = 149/175 (85%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+DL SFTKD   +K  K S LV R++VL F MVCGVYICS+C+KQIS  ++  FL++ 
Sbjct: 1   MADDLSSFTKDVLLVKGLKNSTLVWRLIVLAFAMVCGVYICSICLKQISTGSRIGFLDIN 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI++PCP PNIEPWEIPYVHYP PKTYSR EC C+PVRYF ILSMQRSGSGWFET LN+H
Sbjct: 61  VIQKPCPEPNIEPWEIPYVHYPNPKTYSREECRCHPVRYFTILSMQRSGSGWFETFLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
            NISSNGE+FSVKVRRSN +TI ETLD IYNLDWF+SASKNECTAAVGLKWMLNQ
Sbjct: 121 PNISSNGEIFSVKVRRSNITTITETLDTIYNLDWFNSASKNECTAAVGLKWMLNQ 175


>gi|356500053|ref|XP_003518849.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 344

 Score =  289 bits (740), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 131/175 (74%), Positives = 147/175 (84%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+DL S  KD   +K  K S LV R++VL   MVCGVYICS+C+KQI   +K  FL+++
Sbjct: 1   MADDLSSLAKDVLLVKGLKSSTLVWRLVVLTLAMVCGVYICSICLKQIGTSSKIGFLDIK 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V+++PCP PNIEPWEIPYVHYP PKTYSRAECAC+PVRYFAILSMQRSGSGWFET LN+H
Sbjct: 61  VVQKPCPEPNIEPWEIPYVHYPNPKTYSRAECACHPVRYFAILSMQRSGSGWFETFLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
            NISSNGE+FSVKVRRSN STI ETLD IYNLDW SSASKNECT AVGLKWMLNQ
Sbjct: 121 ANISSNGEIFSVKVRRSNMSTITETLDTIYNLDWLSSASKNECTTAVGLKWMLNQ 175


>gi|356521486|ref|XP_003529386.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 344

 Score =  288 bits (736), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 136/204 (66%), Positives = 159/204 (77%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+DL SF KD   +K  K S LV R++VL   MV GVYICS+C+KQI   +K  FL+++
Sbjct: 1   MADDLSSFAKDALLVKGLKSSTLVWRLIVLTLAMVSGVYICSICLKQIGTSSKIGFLDIK 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V+++PCP PNIEPWEIP+VHYP PKTYSRAECAC+PVRYFAILSMQRSGSGWFET LN+H
Sbjct: 61  VVQKPCPEPNIEPWEIPFVHYPHPKTYSRAECACHPVRYFAILSMQRSGSGWFETFLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL- 179
            NISSNGE+FSVKVRRSN STI ETLD IYNLDW SSASKNECT AVGLKWMLNQ  +  
Sbjct: 121 ANISSNGEIFSVKVRRSNMSTITETLDTIYNLDWLSSASKNECTTAVGLKWMLNQGLMQH 180

Query: 180 ------YWKCSSIS--FSFQRPLV 195
                 Y++   +S  F F+R L+
Sbjct: 181 HEEIAEYFRIHGVSLIFLFRRNLL 204


>gi|449438422|ref|XP_004136987.1| PREDICTED: nodulation protein H-like isoform 1 [Cucumis sativus]
          Length = 371

 Score =  287 bits (735), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 139/193 (72%), Positives = 160/193 (82%), Gaps = 9/193 (4%)

Query: 12  TFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVPNI 71
           TF  K PKKSPL+LRM+VLVF MVCGV+ICSVC+KQIS R+K  F+ V+VI+ PC  P I
Sbjct: 39  TFISKVPKKSPLMLRMIVLVFAMVCGVFICSVCLKQISTRSKVGFMKVEVIDMPCSKPTI 98

Query: 72  EPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           +P + P+VH+PKP TYSRAECAC+PVRYFAILSMQRSGSGWFETLLNNHTNISSNGE+FS
Sbjct: 99  DPSDAPFVHFPKPTTYSRAECACHPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEIFS 158

Query: 132 VKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL-------YWKCS 184
           VKVRRSN STIVETLDK+YNLDWF+SASKNECTAAVGLKWMLNQ  +        Y+K  
Sbjct: 159 VKVRRSNISTIVETLDKVYNLDWFTSASKNECTAAVGLKWMLNQGLMQHHEEIVEYFKRR 218

Query: 185 SIS--FSFQRPLV 195
            +S  F F+R L+
Sbjct: 219 GVSAIFLFRRNLL 231


>gi|388496630|gb|AFK36381.1| unknown [Lotus japonicus]
          Length = 251

 Score =  282 bits (722), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 130/175 (74%), Positives = 146/175 (83%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MA+DL SF KD   +K  K S LV R++VL F MVCGVYIC++C+KQI   +K   L+++
Sbjct: 1   MADDLSSFAKDVLLVKGLKNSTLVWRLVVLTFAMVCGVYICTICLKQIGTSSKIGLLDIK 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI+ PCP PNIEPWEIPYVHYP PKTYSR EC+C+PVRYFAILSMQRSGSGWFET LN+H
Sbjct: 61  VIQNPCPEPNIEPWEIPYVHYPNPKTYSREECSCHPVRYFAILSMQRSGSGWFETFLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
            NISSNGE+FSVKVRRSN STI ETLD IYNLDW SSASKNECT AVGLKWMLNQ
Sbjct: 121 PNISSNGEIFSVKVRRSNVSTITETLDTIYNLDWLSSASKNECTTAVGLKWMLNQ 175


>gi|224137390|ref|XP_002327114.1| predicted protein [Populus trichocarpa]
 gi|222835429|gb|EEE73864.1| predicted protein [Populus trichocarpa]
          Length = 344

 Score =  279 bits (713), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 119/175 (68%), Positives = 146/175 (83%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M EDLC F KD   +K PKKSPL+LR++V+ F MVCGVYICS+C+KQIS  T ++FLN++
Sbjct: 1   MTEDLCFFNKDALVIKGPKKSPLLLRLVVVAFAMVCGVYICSICIKQISPHTTAKFLNIR 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           + ++PC   N+E WE PYVHYPKP+T+SR ECACNPVR+FAI SMQRSGSGWFETLLN+H
Sbjct: 61  IFDQPCNSSNVEEWEKPYVHYPKPETFSREECACNPVRFFAIFSMQRSGSGWFETLLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
            N+SSNGE+F  + RR++ S I +TLD++YNLDWFSSASKNEC AAVG KWMLNQ
Sbjct: 121 INVSSNGEIFGKRARRASVSAITQTLDRVYNLDWFSSASKNECNAAVGFKWMLNQ 175


>gi|225458537|ref|XP_002284367.1| PREDICTED: uncharacterized protein LOC100252224 [Vitis vinifera]
          Length = 344

 Score =  275 bits (704), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 131/204 (64%), Positives = 158/204 (77%), Gaps = 9/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M + +C F KDT  +K+PKKSPL+L M+VLVF MVCGVYIC++C+KQ S  TK++FL  Q
Sbjct: 1   MVDYVCFFNKDTLVIKTPKKSPLLLTMVVLVFAMVCGVYICAICLKQTSNNTKAKFLKFQ 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI +  P   IEPWEIPYVHYPKPKT+SR EC CN VRYFAILSMQRSGSGWFETLLN+H
Sbjct: 61  VINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAILSMQRSGSGWFETLLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRL-- 178
            N+SSNGE+FS+  RR+N S+I+ T+DK+YNLDWFSSASKNEC AAVG KWMLNQ  +  
Sbjct: 121 INVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWFSSASKNECLAAVGFKWMLNQGLMEH 180

Query: 179 -----LYWKCSSIS--FSFQRPLV 195
                 Y+K   +S  F F+R L+
Sbjct: 181 HKDIVKYFKNKGVSAIFLFRRNLL 204


>gi|225438662|ref|XP_002281772.1| PREDICTED: uncharacterized protein LOC100254393 isoform 1 [Vitis
           vinifera]
          Length = 340

 Score =  266 bits (679), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 132/204 (64%), Positives = 159/204 (77%), Gaps = 11/204 (5%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAEDLC F KDT  +K PKKSP +LRM VLVF MVCGVYIC +C+KQIS  TK++F N  
Sbjct: 1   MAEDLCFFNKDTLIIKPPKKSPKLLRMTVLVFAMVCGVYICLICLKQISIHTKAKFENFW 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V+ERP    +++  EIP VHYPKP+T+SRAECA NPVR+FAI+S+QRSGSGWFETLLN+H
Sbjct: 61  VVERPSDDHSMK--EIPNVHYPKPRTFSRAECANNPVRFFAIVSIQRSGSGWFETLLNSH 118

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL- 179
            N+SSNGE+F+V  RR N S+IV+TLDKIYNLDWFSSASKNEC+AAVG KWMLNQ  +  
Sbjct: 119 INVSSNGEIFNVYDRRKNISSIVKTLDKIYNLDWFSSASKNECSAAVGFKWMLNQGLMKH 178

Query: 180 ------YWKCSSIS--FSFQRPLV 195
                 Y+K   +S  F F+R L+
Sbjct: 179 HKEIVDYFKNRDVSTIFLFRRNLL 202


>gi|147792848|emb|CAN68800.1| hypothetical protein VITISV_008807 [Vitis vinifera]
          Length = 647

 Score =  265 bits (678), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 128/202 (63%), Positives = 154/202 (76%), Gaps = 9/202 (4%)

Query: 3   EDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVI 62
           E L    +DT  +K+PKKSPL+L M+VLVF MVCGVYIC++C+KQ S  TK++FL  QVI
Sbjct: 302 EKLNMLGEDTLVIKTPKKSPLLLTMVVLVFAMVCGVYICAICLKQTSNNTKAKFLKFQVI 361

Query: 63  ERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTN 122
            +  P   IEPWEIPYVHYPKPKT+SR EC CN VRYFAILSMQRSGSGWFETLLN+H N
Sbjct: 362 NQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAILSMQRSGSGWFETLLNSHIN 421

Query: 123 ISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRL---- 178
           +SSNGE+FS+  RR+N S+I+ T+DK+YNLDW SSASKNEC AAVG KWMLNQ  +    
Sbjct: 422 VSSNGEIFSIGNRRNNISSILSTMDKVYNLDWXSSASKNECLAAVGFKWMLNQGLMEHHK 481

Query: 179 ---LYWKCSSIS--FSFQRPLV 195
               Y+K   +S  F F+R L+
Sbjct: 482 DIVKYFKNKGVSAIFLFRRNLL 503


>gi|224094390|ref|XP_002310152.1| predicted protein [Populus trichocarpa]
 gi|222853055|gb|EEE90602.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  264 bits (674), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 122/182 (67%), Positives = 147/182 (80%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAE +C   KDT  +K  KK+P++LRM+VLVF MVCGVYICSVC+KQIS  +K +  ++Q
Sbjct: 1   MAEYICVLNKDTIIIKPAKKTPILLRMIVLVFAMVCGVYICSVCLKQISTHSKIKIQDIQ 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VIERP P  + E  +I  VHYP P+T+SRAECA NPVRYFAILSMQRSGSGWFETLLN+H
Sbjct: 61  VIERPSPDVDHENLQISSVHYPNPETFSRAECAHNPVRYFAILSMQRSGSGWFETLLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLY 180
            N+SSNGE+FSV  RR N S+I +TLDK+YNLDWF+SASKNEC+AAVG KWMLNQ  + +
Sbjct: 121 VNVSSNGEIFSVLDRRINISSITQTLDKVYNLDWFTSASKNECSAAVGFKWMLNQGVMQH 180

Query: 181 WK 182
            K
Sbjct: 181 HK 182


>gi|359480689|ref|XP_003632512.1| PREDICTED: uncharacterized protein LOC100254393 isoform 2 [Vitis
           vinifera]
 gi|296082441|emb|CBI21446.3| unnamed protein product [Vitis vinifera]
          Length = 362

 Score =  260 bits (665), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 134/224 (59%), Positives = 160/224 (71%), Gaps = 29/224 (12%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAEDLC F KDT  +K PKKSP +LRM VLVF MVCGVYIC +C+KQIS  TK++F N  
Sbjct: 1   MAEDLCFFNKDTLIIKPPKKSPKLLRMTVLVFAMVCGVYICLICLKQISIHTKAKFENFW 60

Query: 61  VIERPCP-----VPNIEPW---------------EIPYVHYPKPKTYSRAECACNPVRYF 100
           V+ERP       +P I  +               EIP VHYPKP+T+SRAECA NPVR+F
Sbjct: 61  VVERPSDDHSIQIPQISTFIGDKNSCYVNGFNMKEIPNVHYPKPRTFSRAECANNPVRFF 120

Query: 101 AILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASK 160
           AI+S+QRSGSGWFETLLN+H N+SSNGE+F+V  RR N S+IV+TLDKIYNLDWFSSASK
Sbjct: 121 AIVSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISSIVKTLDKIYNLDWFSSASK 180

Query: 161 NECTAAVGLKWMLNQVRLL-------YWKCSSIS--FSFQRPLV 195
           NEC+AAVG KWMLNQ  +        Y+K   +S  F F+R L+
Sbjct: 181 NECSAAVGFKWMLNQGLMKHHKEIVDYFKNRDVSTIFLFRRNLL 224


>gi|224084324|ref|XP_002307259.1| predicted protein [Populus trichocarpa]
 gi|222856708|gb|EEE94255.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  256 bits (655), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 122/192 (63%), Positives = 148/192 (77%), Gaps = 2/192 (1%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAE +C   KDT  +K  KK+P++LRM+VL+F MVCGVYICSVC+KQ S  +K +F ++Q
Sbjct: 1   MAEYICLLNKDTIVIKPAKKTPILLRMIVLMFAMVCGVYICSVCLKQTSIHSKIKFQDIQ 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           V+ER  P  +    +I  VHYP P+T+SRAECA NPVRYFAILSMQRSGSGWFETLLN+H
Sbjct: 61  VVERLSPDDDHGNLQISSVHYPNPETFSRAECAHNPVRYFAILSMQRSGSGWFETLLNSH 120

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLY 180
            N+SSNGE+FSV  RR N S+I  TLDK+YNLDWF+SASKNEC+AAVG KWMLNQ  + +
Sbjct: 121 VNVSSNGEIFSVLDRRRNISSITRTLDKVYNLDWFTSASKNECSAAVGFKWMLNQGVMQH 180

Query: 181 WKCSSISFSFQR 192
            K   IS  F R
Sbjct: 181 HK--EISDYFNR 190


>gi|449530259|ref|XP_004172113.1| PREDICTED: uncharacterized LOC101217742 [Cucumis sativus]
          Length = 340

 Score =  252 bits (643), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 116/175 (66%), Positives = 144/175 (82%), Gaps = 2/175 (1%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAE++C F KDT  LK PKKSPL+LRM VL+F MVC VYICS+CVKQ++  T++ FL V+
Sbjct: 1   MAENICFFNKDTLILKPPKKSPLLLRMAVLMFAMVCSVYICSICVKQLNTHTRARFLRVR 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           + +  CP  +I   ++P  HYP+PKT+SRAEC  NPVR+FAI+SMQRSGSGWFE+LLN+H
Sbjct: 61  IAD--CPELSIGLTKVPREHYPRPKTFSRAECFNNPVRFFAIVSMQRSGSGWFESLLNSH 118

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
            N+SSNGEVFSV  RR N +TIV+TLD+IYNLDW +SASKN+C+AAVG KWMLNQ
Sbjct: 119 VNVSSNGEVFSVLDRRRNITTIVQTLDRIYNLDWLNSASKNQCSAAVGFKWMLNQ 173


>gi|357465451|ref|XP_003603010.1| hypothetical protein MTR_3g101380 [Medicago truncatula]
 gi|355492058|gb|AES73261.1| hypothetical protein MTR_3g101380 [Medicago truncatula]
          Length = 388

 Score =  251 bits (641), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/169 (65%), Positives = 140/169 (82%), Gaps = 1/169 (0%)

Query: 9   TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPV 68
           TKD+ ++KS KK PL LR++VL   +VCG+YICS+C +    RT S+FL V V  +PCP 
Sbjct: 52  TKDSLDIKSTKKFPLTLRLVVLAIAVVCGIYICSICFELKGVRTYSKFLGVSVFTQPCPH 111

Query: 69  P-NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNG 127
           P N++ WEIPY+HYP+PKTYSR EC+CNPVRYF I+SMQRSGSGWFET LN+H N+SSNG
Sbjct: 112 PSNVQEWEIPYLHYPEPKTYSREECSCNPVRYFCIVSMQRSGSGWFETFLNSHINVSSNG 171

Query: 128 EVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQV 176
           E+FSV+ RR N S I++T++++YNLDWF+SASKNEC+AA+G KWMLNQV
Sbjct: 172 EIFSVEKRRENVSLILKTINEVYNLDWFTSASKNECSAAIGYKWMLNQV 220


>gi|449448334|ref|XP_004141921.1| PREDICTED: uncharacterized protein LOC101217742 [Cucumis sativus]
          Length = 334

 Score =  239 bits (610), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 110/166 (66%), Positives = 137/166 (82%), Gaps = 2/166 (1%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           +DT  LK PKKSPL+LRM VL+F MVC VYICS+CVKQ++  T++ FL V++ +  CP  
Sbjct: 4   QDTLILKPPKKSPLLLRMAVLMFAMVCSVYICSICVKQLNTHTRARFLRVRIAD--CPEL 61

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
           +I   ++P  HYP+PKT+SRAEC  NPVR+FAI+SMQRSGSGWFE+LLN+H N+SSNGEV
Sbjct: 62  SIGLTKVPREHYPRPKTFSRAECFNNPVRFFAIVSMQRSGSGWFESLLNSHVNVSSNGEV 121

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FSV  RR N +TIV+TLD+IYNLDW +SASKN+C+AAVG KWMLNQ
Sbjct: 122 FSVLDRRRNITTIVQTLDRIYNLDWLNSASKNQCSAAVGFKWMLNQ 167


>gi|302142361|emb|CBI19564.3| unnamed protein product [Vitis vinifera]
          Length = 311

 Score =  238 bits (606), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 112/171 (65%), Positives = 133/171 (77%), Gaps = 9/171 (5%)

Query: 34  MVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECA 93
           MVCGVYIC++C+KQ S  TK++FL  QVI +  P   IEPWEIPYVHYPKPKT+SR EC 
Sbjct: 1   MVCGVYICAICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECV 60

Query: 94  CNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLD 153
           CN VRYFAILSMQRSGSGWFETLLN+H N+SSNGE+FS+  RR+N S+I+ T+DK+YNLD
Sbjct: 61  CNGVRYFAILSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLD 120

Query: 154 WFSSASKNECTAAVGLKWMLNQVRL-------LYWKCSSIS--FSFQRPLV 195
           WFSSASKNEC AAVG KWMLNQ  +        Y+K   +S  F F+R L+
Sbjct: 121 WFSSASKNECLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLL 171


>gi|242035555|ref|XP_002465172.1| hypothetical protein SORBIDRAFT_01g033350 [Sorghum bicolor]
 gi|241919026|gb|EER92170.1| hypothetical protein SORBIDRAFT_01g033350 [Sorghum bicolor]
          Length = 364

 Score =  236 bits (601), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 105/167 (62%), Positives = 133/167 (79%)

Query: 9   TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPV 68
            K+ F+LK+PKKSPL LRM+V    M+CG+ ICS+C+KQ+ +   S  + ++V+E+PC  
Sbjct: 28  NKNAFDLKAPKKSPLPLRMVVFAMTMLCGISICSMCMKQLGSDGWSRIVKIEVVEQPCNK 87

Query: 69  PNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGE 128
             + P E+ +VHYP+P TYSR EC CN VR+FAI+S QRSGSGWFETLLN+H N+SSNGE
Sbjct: 88  STVPPSEVQFVHYPQPITYSREECKCNAVRFFAIISSQRSGSGWFETLLNSHMNVSSNGE 147

Query: 129 VFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           +FS K RRSN S+I+ TLDK+YNLDW SSASKNECTAA+G KWMLNQ
Sbjct: 148 IFSRKERRSNISSIINTLDKVYNLDWNSSASKNECTAAIGFKWMLNQ 194


>gi|297819744|ref|XP_002877755.1| hypothetical protein ARALYDRAFT_485408 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323593|gb|EFH54014.1| hypothetical protein ARALYDRAFT_485408 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 340

 Score =  235 bits (600), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 111/177 (62%), Positives = 140/177 (79%), Gaps = 6/177 (3%)

Query: 1   MAEDLCSFTKDT--FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLN 58
           MAE +C F KD+    +K PKKSPL LRM+VLVF MVCG+YICSVC+KQ+S      F +
Sbjct: 1   MAEYICLFGKDSAAIVIKQPKKSPLFLRMIVLVFAMVCGLYICSVCLKQLS---NVSFQS 57

Query: 59  VQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLN 118
            Q+++   P  +     +  +HYPKP+T++RAEC  NPVRYFAILSMQRSGSGWFETLLN
Sbjct: 58  SQLVQ-TTPFDSHSLGFVTRIHYPKPQTFNRAECGHNPVRYFAILSMQRSGSGWFETLLN 116

Query: 119 NHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           +H+N+SSNGE+FSV  RR N S+I++TLD++YNLDWF+SASKNEC+AA+G KWMLNQ
Sbjct: 117 SHSNVSSNGEIFSVLDRRKNISSIIQTLDRVYNLDWFTSASKNECSAAIGFKWMLNQ 173


>gi|42565806|ref|NP_190631.2| nodulation-related protein [Arabidopsis thaliana]
 gi|45773806|gb|AAS76707.1| At3g50620 [Arabidopsis thaliana]
 gi|114050671|gb|ABI49485.1| At3g50620 [Arabidopsis thaliana]
 gi|332645165|gb|AEE78686.1| nodulation-related protein [Arabidopsis thaliana]
          Length = 340

 Score =  235 bits (599), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 110/177 (62%), Positives = 139/177 (78%), Gaps = 6/177 (3%)

Query: 1   MAEDLCSFTKDT--FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLN 58
           MAE +C F KD+    +K PKKSPL LRM+VLVF MVCG+YIC+VC+KQ+S      F  
Sbjct: 1   MAEYICLFGKDSAAIVIKQPKKSPLFLRMIVLVFAMVCGLYICAVCLKQLS---NVSFQT 57

Query: 59  VQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLN 118
            Q+++   P+ +     +  +HYPKP+T++RAEC  NPVRYFAILSMQRSGSGWFETLLN
Sbjct: 58  SQLVQ-TSPIDSHSLRFVTRIHYPKPQTFNRAECGHNPVRYFAILSMQRSGSGWFETLLN 116

Query: 119 NHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           +H N+SSNGE+FSV  RR N S+I++TLD++YNLDWF+SASKNEC+AA+G KWMLNQ
Sbjct: 117 SHNNVSSNGEIFSVLDRRKNISSIIQTLDRVYNLDWFTSASKNECSAAIGFKWMLNQ 173


>gi|356565626|ref|XP_003551040.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 350

 Score =  232 bits (592), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 118/207 (57%), Positives = 153/207 (73%), Gaps = 16/207 (7%)

Query: 3   EDLCSFT-KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQV 61
           +D+C F  KD   +K  KKSPL+LRM VL+F MVCGV+ICSVC+KQIS   ++  + +Q 
Sbjct: 6   KDMCFFNNKDILVIKPQKKSPLLLRMSVLIFSMVCGVFICSVCLKQISTHARTMLMELQ- 64

Query: 62  IERPCPVP----NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLL 117
           IE+P        N++  ++P +HYPKP +++R+ECA NPVR+FAILS QRSGSGWFETLL
Sbjct: 65  IEKPSRSSSSRLNLKN-DVPLLHYPKPVSFNRSECAGNPVRFFAILSNQRSGSGWFETLL 123

Query: 118 NNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVR 177
           N+H N+SSNGE+FSV+ RR N STIV+TLDK+YNLDW SSASKNEC+AA+G KWMLNQ  
Sbjct: 124 NSHINVSSNGEIFSVRERRVNVSTIVQTLDKVYNLDWLSSASKNECSAAIGFKWMLNQGL 183

Query: 178 LL-------YWKCSSIS--FSFQRPLV 195
           +        Y+   S+S  F F+R L+
Sbjct: 184 MEHPKEIADYFNSRSVSVIFLFRRNLL 210


>gi|218192981|gb|EEC75408.1| hypothetical protein OsI_11905 [Oryza sativa Indica Group]
          Length = 365

 Score =  231 bits (588), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 104/166 (62%), Positives = 131/166 (78%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           KD   +KS KKSPL LRM+V    M+CG++IC++C+KQ+ + + S  + ++V E+ C   
Sbjct: 31  KDALIVKSSKKSPLALRMVVFTMTMICGIFICTMCMKQLGSDSLSRIVKIEVAEQLCNKS 90

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            I   E+ +VHYP+P TYSR+EC C PVR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 91  AILHSEVHFVHYPQPITYSRSECKCTPVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 150

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+I +TLDK+YNLDW SSASKNECTAAVGLKWMLNQ
Sbjct: 151 FSSKERRSNISSITKTLDKVYNLDWNSSASKNECTAAVGLKWMLNQ 196


>gi|115453377|ref|NP_001050289.1| Os03g0394900 [Oryza sativa Japonica Group]
 gi|108708616|gb|ABF96411.1| nodulation protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113548760|dbj|BAF12203.1| Os03g0394900 [Oryza sativa Japonica Group]
 gi|215737279|dbj|BAG96208.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625064|gb|EEE59196.1| hypothetical protein OsJ_11141 [Oryza sativa Japonica Group]
          Length = 362

 Score =  230 bits (586), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 104/166 (62%), Positives = 131/166 (78%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           KD   +KS KKSPL LRM+V    M+CG++IC++C+KQ+ + + S  + ++V E+ C   
Sbjct: 28  KDALIVKSSKKSPLALRMVVFAMTMICGIFICTMCMKQLGSDSLSRIVKIEVAEQLCNKS 87

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            I   E+ +VHYP+P TYSR+EC C PVR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 88  AILHSEVHFVHYPQPITYSRSECKCTPVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 147

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+I +TLDK+YNLDW SSASKNECTAAVGLKWMLNQ
Sbjct: 148 FSSKERRSNISSITKTLDKVYNLDWNSSASKNECTAAVGLKWMLNQ 193


>gi|226500168|ref|NP_001142620.1| uncharacterized protein LOC100274889 [Zea mays]
 gi|195607388|gb|ACG25524.1| hypothetical protein [Zea mays]
          Length = 360

 Score =  229 bits (584), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 101/166 (60%), Positives = 132/166 (79%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           K+ F+LK+PKKSPL LR++V    ++CG+ I S+C+KQ+ +   S  + ++V+E+PC   
Sbjct: 26  KNAFDLKAPKKSPLALRVVVFAMTVLCGISIWSICMKQLGSNGWSRIVKIEVVEQPCNKS 85

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            + P E+ +VHYP+P TYSR EC CN VR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 86  TVPPSEVQFVHYPQPTTYSREECKCNAVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 145

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+IV TLD++Y+LDW SSASKNECTAA+G KWMLNQ
Sbjct: 146 FSRKERRSNISSIVNTLDRVYSLDWNSSASKNECTAAIGFKWMLNQ 191


>gi|29824462|gb|AAP04177.1| unknown protein [Oryza sativa Japonica Group]
          Length = 384

 Score =  228 bits (582), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 103/166 (62%), Positives = 131/166 (78%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           +D   +KS KKSPL LRM+V    M+CG++IC++C+KQ+ + + S  + ++V E+ C   
Sbjct: 50  QDALIVKSSKKSPLALRMVVFAMTMICGIFICTMCMKQLGSDSLSRIVKIEVAEQLCNKS 109

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            I   E+ +VHYP+P TYSR+EC C PVR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 110 AILHSEVHFVHYPQPITYSRSECKCTPVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 169

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+I +TLDK+YNLDW SSASKNECTAAVGLKWMLNQ
Sbjct: 170 FSSKERRSNISSITKTLDKVYNLDWNSSASKNECTAAVGLKWMLNQ 215


>gi|356514160|ref|XP_003525774.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 361

 Score =  228 bits (582), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 110/191 (57%), Positives = 139/191 (72%), Gaps = 17/191 (8%)

Query: 1   MAEDLCSFTK----------DTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISA 50
           + ED+C F K          D   +K  KKSPL+LRM VL+F MVCGV+ICSVC+KQIS 
Sbjct: 3   LKEDICFFNKVARTSASKVSDILVMKPQKKSPLLLRMSVLIFSMVCGVFICSVCLKQIST 62

Query: 51  RTKSEFLNVQVIERPCPVPNIE------PWEIPYVHYPKPKTYSRAECACNPVRYFAILS 104
             ++  + +Q IE+P               ++P +HYP+P +++R+ECA NPVR+FAILS
Sbjct: 63  HARTMLMELQ-IEKPSRSSISSINRLNLTHDVPLLHYPEPVSFNRSECAGNPVRFFAILS 121

Query: 105 MQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECT 164
            QRSGSGWFETLLN+H N+SSNGE+FSV+ RR N STIV+TLDK+YNLDW SSASKNEC+
Sbjct: 122 NQRSGSGWFETLLNSHINVSSNGEIFSVRERRVNVSTIVQTLDKVYNLDWLSSASKNECS 181

Query: 165 AAVGLKWMLNQ 175
           AA+G KWMLNQ
Sbjct: 182 AAIGSKWMLNQ 192


>gi|14029013|gb|AAK52554.1|AC079853_7 Unknown protein [Oryza sativa Japonica Group]
          Length = 332

 Score =  228 bits (581), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 103/166 (62%), Positives = 131/166 (78%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           +D   +KS KKSPL LRM+V    M+CG++IC++C+KQ+ + + S  + ++V E+ C   
Sbjct: 50  QDALIVKSSKKSPLALRMVVFAMTMICGIFICTMCMKQLGSDSLSRIVKIEVAEQLCNKS 109

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            I   E+ +VHYP+P TYSR+EC C PVR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 110 AILHSEVHFVHYPQPITYSRSECKCTPVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 169

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+I +TLDK+YNLDW SSASKNECTAAVGLKWMLNQ
Sbjct: 170 FSSKERRSNISSITKTLDKVYNLDWNSSASKNECTAAVGLKWMLNQ 215


>gi|147833389|emb|CAN66236.1| hypothetical protein VITISV_022815 [Vitis vinifera]
          Length = 352

 Score =  226 bits (577), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 117/195 (60%), Positives = 135/195 (69%), Gaps = 33/195 (16%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAEDLC F KDT  +K PKKSP +LRM VLVF M             IS  TK++F N  
Sbjct: 1   MAEDLCFFNKDTLIIKPPKKSPKLLRMTVLVFAM-------------ISIHTKAKFENFW 47

Query: 61  VIERPCP-----VPNIEPW---------------EIPYVHYPKPKTYSRAECACNPVRYF 100
           V+ERP       +P I  +               EIP VHYPKP+T+SRAECA NPVR+F
Sbjct: 48  VVERPSDDHSIQIPQISTFIGDKNSCYVNGFNMKEIPNVHYPKPRTFSRAECANNPVRFF 107

Query: 101 AILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASK 160
           AI+S+QRSGSGWFETLLN+H N+SSNGE+F+V  RR N S IV+TLDKIYNLDWFSSASK
Sbjct: 108 AIVSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISXIVKTLDKIYNLDWFSSASK 167

Query: 161 NECTAAVGLKWMLNQ 175
           NEC+AAVG KWMLNQ
Sbjct: 168 NECSAAVGFKWMLNQ 182


>gi|356537631|ref|XP_003537329.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 341

 Score =  225 bits (573), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 109/204 (53%), Positives = 145/204 (71%), Gaps = 10/204 (4%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M ED+C   K+   +K PK+SP++LRM V++F MVCGV+I SVC+ QIS + ++ F++ +
Sbjct: 1   MVEDVCFLYKEILVMKPPKRSPMLLRMAVVIFSMVCGVFIFSVCLMQISTQARTTFMDFK 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI+       ++      +HYPKP ++SR EC  NPV +FAILS QRSGSGWFETLLN+H
Sbjct: 61  VIDNHSQSI-LKLMNTHLLHYPKPASFSRNECVHNPVLFFAILSNQRSGSGWFETLLNSH 119

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLY 180
            N+SSNGE+FSV+ RR N S+I+ TLDK+YNLDWF+SASKNEC+AA GLKWMLNQ  + +
Sbjct: 120 INVSSNGEIFSVRERRQNVSSILHTLDKVYNLDWFNSASKNECSAATGLKWMLNQGLMEH 179

Query: 181 WK---------CSSISFSFQRPLV 195
            K           SI F F+R L+
Sbjct: 180 HKEIAEYFNRRRVSIIFLFRRNLL 203


>gi|357111946|ref|XP_003557771.1| PREDICTED: uncharacterized protein LOC100845427 [Brachypodium
           distachyon]
          Length = 357

 Score =  224 bits (572), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 105/176 (59%), Positives = 132/176 (75%), Gaps = 2/176 (1%)

Query: 2   AEDLCS--FTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNV 59
            ED C     KDT  LK+ KKSP  LR++V    M+ GV+ICS+C+KQ+   + S  + +
Sbjct: 13  GEDCCYQLGNKDTLVLKTQKKSPPALRIVVFAMTMISGVFICSICMKQLGTDSWSRMVKI 72

Query: 60  QVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNN 119
           +V E+PC    I   E+ +VHYP+P+ YSR EC C PVR+FAI+S QRSGSGWFETLLN+
Sbjct: 73  EVAEQPCNKSLIPLSEVQFVHYPQPQNYSRKECMCTPVRFFAIISSQRSGSGWFETLLNS 132

Query: 120 HTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           H N+SSNGE+FS K RR+N S+IV+T+DK+YNLDW SSASKNECTAAVG KWMLNQ
Sbjct: 133 HINVSSNGEIFSNKERRNNISSIVKTMDKVYNLDWNSSASKNECTAAVGFKWMLNQ 188


>gi|356564206|ref|XP_003550347.1| PREDICTED: uncharacterized protein LOC100819406 [Glycine max]
          Length = 330

 Score =  224 bits (570), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 105/185 (56%), Positives = 138/185 (74%), Gaps = 3/185 (1%)

Query: 14  NLKSPKKSPLVLRMLVLVFVMVCGVYICSVCV-KQISARTKSEFLNVQVIERPCPVPNIE 72
           ++KS KK P  LR++VL     CG+YICSV + K    RT S+ L ++VI + C   ++E
Sbjct: 2   DIKSIKKFPSTLRLVVLAMAATCGLYICSVNLEKPTRIRTNSKLLELRVINQSCHPSSVE 61

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
            WE+P++HYP+PKTY+R ECACNPVR+F IL+MQRSGSGWFETLLN+H N+SSNGE+FSV
Sbjct: 62  EWEVPFLHYPQPKTYNREECACNPVRFFVILTMQRSGSGWFETLLNSHMNVSSNGEIFSV 121

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWKCSSISFSFQR 192
             RR N S+I+ T+D+++NLDWFS ASKNEC+AAVG KWMLNQ  + + K   I   F+R
Sbjct: 122 AKRRENVSSILMTMDEVFNLDWFSGASKNECSAAVGYKWMLNQGLMEHHK--EIGEYFER 179

Query: 193 PLVGS 197
             V +
Sbjct: 180 RRVST 184


>gi|357464733|ref|XP_003602648.1| Kinase-like protein [Medicago truncatula]
 gi|355491696|gb|AES72899.1| Kinase-like protein [Medicago truncatula]
          Length = 403

 Score =  223 bits (569), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 106/165 (64%), Positives = 124/165 (75%), Gaps = 10/165 (6%)

Query: 8   FTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCP 67
           +  D   +K  K S LV R++VL F MVCGV ICS+C+KQIS  ++  FL++ VI+ PCP
Sbjct: 127 YIMDVLLVKGLKNSTLVWRLIVLAFAMVCGVCICSICLKQISTGSRIGFLDINVIQMPCP 186

Query: 68  VPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNG 127
            PNIEPWEIPYV           EC C+P+RYF ILSMQR GSGW ET LN+H NISSNG
Sbjct: 187 GPNIEPWEIPYVE----------ECRCHPLRYFTILSMQRFGSGWHETFLNSHPNISSNG 236

Query: 128 EVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWM 172
           E+FSVKVRRSN +TI ETLD IYNLDWF+SASKNECTAAVGL W+
Sbjct: 237 EIFSVKVRRSNITTITETLDTIYNLDWFNSASKNECTAAVGLAWL 281


>gi|356496876|ref|XP_003517291.1| PREDICTED: uncharacterized protein LOC100789437 [Glycine max]
          Length = 341

 Score =  223 bits (569), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 105/192 (54%), Positives = 142/192 (73%), Gaps = 1/192 (0%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M ED+C   K+   +K PK+SP++LR  V++F MVCGV+I SVC+KQIS + ++ F++ +
Sbjct: 1   MVEDMCFLYKEILVMKPPKRSPMLLRTAVIIFSMVCGVFIFSVCLKQISTQARTTFMDFK 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI+       ++      +HYPKP ++SR ECA NPV +FAILS QRSGSGWFETLLN+H
Sbjct: 61  VIDNHSQSI-LKLMNTHLLHYPKPSSFSRNECAHNPVLFFAILSNQRSGSGWFETLLNSH 119

Query: 121 TNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLY 180
            N+SS GE+FSV+ RR N S+I+ TLDK+YNLDWF+SASKNEC+AA GLKWMLNQ  + +
Sbjct: 120 VNVSSYGEIFSVRERRQNVSSILLTLDKVYNLDWFNSASKNECSAATGLKWMLNQGLVEH 179

Query: 181 WKCSSISFSFQR 192
            K  +  F+ +R
Sbjct: 180 HKEIAEYFNHRR 191


>gi|226496856|ref|NP_001145331.1| hypothetical protein [Zea mays]
 gi|195654733|gb|ACG46834.1| hypothetical protein [Zea mays]
 gi|414867126|tpg|DAA45683.1| TPA: hypothetical protein ZEAMMB73_734756 [Zea mays]
          Length = 360

 Score =  223 bits (567), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 104/168 (61%), Positives = 130/168 (77%), Gaps = 2/168 (1%)

Query: 9   TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPV 68
            KD F+LK PKKSP+ LR +V    M+CG+ ICS+C+KQ+ +   S  + ++V+E+PC  
Sbjct: 24  NKDAFDLKPPKKSPIALRTVVFAMTMLCGISICSMCMKQLGSDGWSRVVKIEVVEQPCN- 82

Query: 69  PNIEPW-EIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNG 127
            +I P  E  +V YP+P TYSR EC CN VR FAI+S QRSGSGWFETLLN+H N+SSNG
Sbjct: 83  KSIAPLSEAQFVRYPQPITYSREECKCNAVRSFAIISSQRSGSGWFETLLNSHMNVSSNG 142

Query: 128 EVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           E+FS K RRSN S+I++TLDK+YNLDW SSASKNECTAA+G KWMLNQ
Sbjct: 143 EIFSRKERRSNISSIIDTLDKVYNLDWNSSASKNECTAAIGFKWMLNQ 190


>gi|293332934|ref|NP_001167842.1| hypothetical protein [Zea mays]
 gi|223944369|gb|ACN26268.1| unknown [Zea mays]
 gi|413955392|gb|AFW88041.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 360

 Score =  221 bits (564), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 97/166 (58%), Positives = 129/166 (77%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           K+ F+ K+PKKSPL LR++V    ++CG+ I S+C+KQ+ +   S  + ++V+E+PC   
Sbjct: 26  KNAFDSKAPKKSPLALRVVVFAMTVLCGISIWSMCMKQLGSDGWSRIVKIEVVEQPCNKS 85

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            + P E+ + HYP+P TYSR EC CN VR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 86  TVPPSEVQFAHYPQPTTYSREECKCNAVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 145

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+IV  +D++Y+LDW SSASKNECTAA+G KWMLNQ
Sbjct: 146 FSRKERRSNISSIVNAMDRVYSLDWNSSASKNECTAAIGFKWMLNQ 191


>gi|413955391|gb|AFW88040.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 307

 Score =  221 bits (564), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 97/166 (58%), Positives = 129/166 (77%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           K+ F+ K+PKKSPL LR++V    ++CG+ I S+C+KQ+ +   S  + ++V+E+PC   
Sbjct: 26  KNAFDSKAPKKSPLALRVVVFAMTVLCGISIWSMCMKQLGSDGWSRIVKIEVVEQPCNKS 85

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
            + P E+ + HYP+P TYSR EC CN VR+FAI+S QRSGSGWFETLLN+H N+SSNGE+
Sbjct: 86  TVPPSEVQFAHYPQPTTYSREECKCNAVRFFAIISSQRSGSGWFETLLNSHMNVSSNGEI 145

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FS K RRSN S+IV  +D++Y+LDW SSASKNECTAA+G KWMLNQ
Sbjct: 146 FSRKERRSNISSIVNAMDRVYSLDWNSSASKNECTAAIGFKWMLNQ 191


>gi|356553925|ref|XP_003545301.1| PREDICTED: uncharacterized protein LOC100820591 [Glycine max]
          Length = 1057

 Score =  220 bits (560), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 100/183 (54%), Positives = 135/183 (73%), Gaps = 1/183 (0%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCV-KQISARTKSEFLNV 59
           +++ +     D+  +KS KK P  L+++VL  V  C +YICSV   K    R  S+ L +
Sbjct: 716 VSQGVLKLMHDSMYIKSIKKFPSTLKLVVLAMVATCVLYICSVNFEKPARIRANSKLLEL 775

Query: 60  QVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNN 119
           +VI + C + N+E WE+P++HYP+PKTY+R ECACNPVR+F IL+MQRSGSGWFETLLN+
Sbjct: 776 RVINQSCQLSNVEEWEVPFLHYPQPKTYNREECACNPVRFFTILTMQRSGSGWFETLLNS 835

Query: 120 HTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL 179
           H N+SSNGE+F V  RR N S+I++T+D ++NLDWFS ASKNEC+AAVG KWMLNQ  + 
Sbjct: 836 HMNVSSNGEIFGVVKRRENVSSILKTMDVVFNLDWFSGASKNECSAAVGYKWMLNQGLME 895

Query: 180 YWK 182
           + K
Sbjct: 896 HHK 898


>gi|125527968|gb|EAY76082.1| hypothetical protein OsI_04009 [Oryza sativa Indica Group]
          Length = 358

 Score =  218 bits (556), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 107/180 (59%), Positives = 133/180 (73%), Gaps = 6/180 (3%)

Query: 1   MAEDLCSFTKDTFN--LKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSE 55
           + EDL + TKDT N   KS ++ PL+    +L  + + GVYI SV +KQ   +    +++
Sbjct: 15  VVEDLGTATKDTVNPSAKSTRRYPLLSWTTILALIALVGVYIFSVSLKQNGMLLGLKQTD 74

Query: 56  FLNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFET 115
            +  +  E+ C  P+I   EIPYVHYP P TYSR ECAC PVR+FAILSMQRSGSGW ET
Sbjct: 75  MIEKER-EKLCQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIET 133

Query: 116 LLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           LLN+H NISSNGE+FS+K RRSN ++I +TLDK+YNLDW SSA+KNECTAAVGLKWMLNQ
Sbjct: 134 LLNSHENISSNGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQ 193


>gi|125572263|gb|EAZ13778.1| hypothetical protein OsJ_03703 [Oryza sativa Japonica Group]
          Length = 359

 Score =  218 bits (555), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 107/180 (59%), Positives = 133/180 (73%), Gaps = 6/180 (3%)

Query: 1   MAEDLCSFTKDTFN--LKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSE 55
           + EDL + TKDT N   KS ++ PL+    +L  + + GVYI SV +KQ   +    +++
Sbjct: 15  VVEDLGTATKDTVNPSAKSTRRYPLLSWTTILALIALVGVYIFSVSLKQNGMLLGLKQTD 74

Query: 56  FLNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFET 115
            +  +  E+ C  P+I   EIPYVHYP P TYSR ECAC PVR+FAILSMQRSGSGW ET
Sbjct: 75  MIEKER-EKLCQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIET 133

Query: 116 LLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           LLN+H NISSNGE+FS+K RRSN ++I +TLDK+YNLDW SSA+KNECTAAVGLKWMLNQ
Sbjct: 134 LLNSHENISSNGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQ 193


>gi|115440365|ref|NP_001044462.1| Os01g0784600 [Oryza sativa Japonica Group]
 gi|53792452|dbj|BAD53360.1| unknown protein [Oryza sativa Japonica Group]
 gi|113533993|dbj|BAF06376.1| Os01g0784600 [Oryza sativa Japonica Group]
 gi|215695270|dbj|BAG90461.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 346

 Score =  217 bits (553), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 107/180 (59%), Positives = 133/180 (73%), Gaps = 6/180 (3%)

Query: 1   MAEDLCSFTKDTFN--LKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSE 55
           + EDL + TKDT N   KS ++ PL+    +L  + + GVYI SV +KQ   +    +++
Sbjct: 15  VVEDLGTATKDTVNPSAKSTRRYPLLSWTTILALIALVGVYIFSVSLKQNGMLLGLKQTD 74

Query: 56  FLNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFET 115
            +  +  E+ C  P+I   EIPYVHYP P TYSR ECAC PVR+FAILSMQRSGSGW ET
Sbjct: 75  MIEKER-EKLCQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIET 133

Query: 116 LLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           LLN+H NISSNGE+FS+K RRSN ++I +TLDK+YNLDW SSA+KNECTAAVGLKWMLNQ
Sbjct: 134 LLNSHENISSNGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQ 193


>gi|242082293|ref|XP_002445915.1| hypothetical protein SORBIDRAFT_07g027970 [Sorghum bicolor]
 gi|241942265|gb|EES15410.1| hypothetical protein SORBIDRAFT_07g027970 [Sorghum bicolor]
          Length = 357

 Score =  217 bits (552), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 108/179 (60%), Positives = 131/179 (73%), Gaps = 6/179 (3%)

Query: 2   AEDLCSFTKDT--FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSEF 56
           AEDL   TKDT   N K  K+ PL L + +L  +M+ GVYI S+ +KQ   +    ++  
Sbjct: 12  AEDLGVATKDTAVVNTKPAKRYPLALWIAILGLIMLVGVYIFSLSLKQNGMLFGLMQTNL 71

Query: 57  LNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETL 116
           +  +  E+PC  P+I   EIPY+HYP P TY R ECAC  VR+FAILSMQRSGSGW ETL
Sbjct: 72  IEKER-EKPCHDPSIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETL 130

Query: 117 LNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           LN+H NISSNGE+FSVK RRSN + I +TLDK+YNLDW+SSA+KNECTAAVGLKWMLNQ
Sbjct: 131 LNSHPNISSNGEIFSVKDRRSNITAITKTLDKLYNLDWYSSAAKNECTAAVGLKWMLNQ 189


>gi|356509720|ref|XP_003523594.1| PREDICTED: uncharacterized protein LOC100808615 [Glycine max]
          Length = 336

 Score =  217 bits (552), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 109/197 (55%), Positives = 147/197 (74%), Gaps = 10/197 (5%)

Query: 9   TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPC-P 67
           TK++ ++KS KK PL LR++VLV V++CG+YIC++C++Q    T ++  ++ V ++PC  
Sbjct: 3   TKESPDIKSFKKFPLTLRLVVLVVVVLCGIYICTICLEQKGVHTSAKLFDITVFKQPCFN 62

Query: 68  VPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNG 127
              +E WE+ Y+HYP+PK YSR ECACNPV +F I+SMQRSGSGWFET LN+H N+SSNG
Sbjct: 63  HSGVEEWELSYLHYPEPKIYSREECACNPVLFFCIVSMQRSGSGWFETFLNSHINVSSNG 122

Query: 128 EVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------VRL 178
           E+FSV  RR+N S+I+ET+D++YNLDWFSSASKNECTAAVG KWMLNQ         V  
Sbjct: 123 EIFSVGKRRANVSSILETMDRVYNLDWFSSASKNECTAAVGFKWMLNQGLVEHHEEIVEY 182

Query: 179 LYWKCSSISFSFQRPLV 195
              +  S+ F F+R L+
Sbjct: 183 FERRRVSVIFLFRRNLL 199


>gi|357482975|ref|XP_003611774.1| hypothetical protein MTR_5g017700 [Medicago truncatula]
 gi|355513109|gb|AES94732.1| hypothetical protein MTR_5g017700 [Medicago truncatula]
          Length = 342

 Score =  215 bits (547), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 107/211 (50%), Positives = 146/211 (69%), Gaps = 23/211 (10%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M ED+C   KD   +K PKKSP++LR  V +F MV  ++I  VC KQI    +++F+++ 
Sbjct: 1   MVEDMCFLYKDILVIKPPKKSPMLLRTAVFMFSMVSVIFIFYVCRKQIGTEVRTKFMDLN 60

Query: 61  VIE-------RPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWF 113
           VI+       +   +P+I       +HYP+P +++R ECA NPV +FAILS QRSGSGWF
Sbjct: 61  VIDNLTRSIVKQTHIPDI-------LHYPEPLSFNRNECAPNPVMFFAILSNQRSGSGWF 113

Query: 114 ETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWML 173
           ETLLN+H N+SSNGE+FSV+ RR NAS+I++TLD++YNLDW SSASKNEC+AA+G KWML
Sbjct: 114 ETLLNSHINVSSNGEIFSVRERRQNASSILQTLDRVYNLDWLSSASKNECSAAIGFKWML 173

Query: 174 NQVRL-------LYWKCSSIS--FSFQRPLV 195
           NQ  +        Y+K  S+S  F F+R L+
Sbjct: 174 NQGLMEHHNEIEEYFKRRSVSVIFLFRRNLL 204


>gi|226531263|ref|NP_001142398.1| uncharacterized protein LOC100274572 [Zea mays]
 gi|194708632|gb|ACF88400.1| unknown [Zea mays]
 gi|195625724|gb|ACG34692.1| hypothetical protein [Zea mays]
 gi|413925273|gb|AFW65205.1| hypothetical protein ZEAMMB73_585643 [Zea mays]
          Length = 348

 Score =  214 bits (545), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 107/179 (59%), Positives = 130/179 (72%), Gaps = 6/179 (3%)

Query: 2   AEDLCSFTKDT--FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSEF 56
           +EDL   TKDT   N K  K+ PL L + +L  +M+ GVYI S+ +KQ   +    ++  
Sbjct: 3   SEDLGVATKDTEVVNTKPAKRYPLALWIAILGLIMLVGVYIFSLSLKQNGMLFGLMQTNL 62

Query: 57  LNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETL 116
           +  +  E+PC  P I   EIPY+HYP P TY R ECAC  VR+FAILSMQRSGSGW ETL
Sbjct: 63  IEKER-EKPCHDPGIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETL 121

Query: 117 LNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           LN+H NISSNGE+FSVK RRSN + I +TLDK+YNLDW+SSA+KNECTAAVGLKWMLNQ
Sbjct: 122 LNSHPNISSNGEIFSVKERRSNFTAIKKTLDKLYNLDWYSSAAKNECTAAVGLKWMLNQ 180


>gi|357125400|ref|XP_003564382.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
          Length = 359

 Score =  212 bits (539), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 107/177 (60%), Positives = 127/177 (71%), Gaps = 5/177 (2%)

Query: 3   EDLCSFTKDTFNL--KSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISA--RTKSEFLN 58
           ED  S  KD  N+  K  ++ PL   + +L    + GVYI S+ +KQ       K   + 
Sbjct: 17  EDF-SAAKDAANVTTKPTRRYPLASWIAILALATLVGVYIFSLSLKQNGMLFGLKQTSMI 75

Query: 59  VQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLN 118
            +  E+PC  P+I   EIPYVHYP P TYSR ECAC PVR+FAILSMQRSGSGWFETLLN
Sbjct: 76  EKEREKPCHRPSIPDTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWFETLLN 135

Query: 119 NHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           +H NISSNGE+FSVK RRSN ++I +TLDK+YNLDW SSA+KNECTAAVGLKWMLNQ
Sbjct: 136 SHENISSNGEIFSVKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQ 192


>gi|296087807|emb|CBI35063.3| unnamed protein product [Vitis vinifera]
          Length = 344

 Score =  210 bits (534), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 96/170 (56%), Positives = 125/170 (73%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVPNIE 72
           F+ ++ KKSP V  +++L  +M  GVYIC V + +++  +K   ++   +ER C +  I 
Sbjct: 12  FSSRASKKSPYVFWLVLLFLIMAFGVYICGVSLTRVTVPSKPTKISFTRVERTCSINGIP 71

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P E+ YVH+P+PKTYSR EC C PVR+F ILSMQRSGSGWFETLLN+H NISSNGEVFS+
Sbjct: 72  PEELVYVHFPQPKTYSREECKCTPVRFFVILSMQRSGSGWFETLLNSHPNISSNGEVFSL 131

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWK 182
           K RR +  TI  TLD IYNL+W SSA+KN+C AAVG KWMLNQ  ++Y +
Sbjct: 132 KQRRQSMETIQRTLDTIYNLEWISSAAKNDCVAAVGFKWMLNQAVMVYHR 181


>gi|225464035|ref|XP_002265782.1| PREDICTED: nodulation protein H-like [Vitis vinifera]
          Length = 339

 Score =  210 bits (534), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 96/170 (56%), Positives = 125/170 (73%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVPNIE 72
           F+ ++ KKSP V  +++L  +M  GVYIC V + +++  +K   ++   +ER C +  I 
Sbjct: 7   FSSRASKKSPYVFWLVLLFLIMAFGVYICGVSLTRVTVPSKPTKISFTRVERTCSINGIP 66

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P E+ YVH+P+PKTYSR EC C PVR+F ILSMQRSGSGWFETLLN+H NISSNGEVFS+
Sbjct: 67  PEELVYVHFPQPKTYSREECKCTPVRFFVILSMQRSGSGWFETLLNSHPNISSNGEVFSL 126

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWK 182
           K RR +  TI  TLD IYNL+W SSA+KN+C AAVG KWMLNQ  ++Y +
Sbjct: 127 KQRRQSMETIQRTLDTIYNLEWISSAAKNDCVAAVGFKWMLNQAVMVYHR 176


>gi|223942635|gb|ACN25401.1| unknown [Zea mays]
 gi|414869370|tpg|DAA47927.1| TPA: thylakoid lumenal protein, mRNA [Zea mays]
          Length = 364

 Score =  202 bits (514), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 97/163 (59%), Positives = 121/163 (74%), Gaps = 4/163 (2%)

Query: 16  KSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSEFLNVQVIERPCPVPNIE 72
           K  K+ PL L + +L  +M+  VYI S+ +KQ   +    ++  +  +  E+PC  P+I 
Sbjct: 35  KPAKRYPLALWIGILGLIMLVAVYIFSLSLKQNGMLFGLMQTNLIEKER-EKPCHDPSIP 93

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
             EIPY+HYP P TY R ECAC  VR+FAILSMQRSGSGW ETLLN+H NISSNGE+FSV
Sbjct: 94  DTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSV 153

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           K RRSN + I +TLD++YNLDW+SSA+KNECTAAVGLKWMLNQ
Sbjct: 154 KERRSNITAITKTLDRLYNLDWYSSAAKNECTAAVGLKWMLNQ 196


>gi|255569155|ref|XP_002525546.1| conserved hypothetical protein [Ricinus communis]
 gi|223535125|gb|EEF36805.1| conserved hypothetical protein [Ricinus communis]
          Length = 333

 Score =  194 bits (492), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 92/163 (56%), Positives = 123/163 (75%), Gaps = 1/163 (0%)

Query: 21  SPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ-VIERPCPVPNIEPWEIPYV 79
           S L  + + L  V+  GVYIC +C++QIS    S+ ++++ +++  C + +I P EI YV
Sbjct: 8   SKLSKKSIFLFTVIGFGVYICYLCLQQISFPLISKEVSIKDIMKSSCYIHDIPPQEIHYV 67

Query: 80  HYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNA 139
           H+P PKTYSR ECAC PVR+F I+SMQRSGSGWFETLLN+H N+SSNGE+FSVK RRS+ 
Sbjct: 68  HFPYPKTYSREECACTPVRFFVIMSMQRSGSGWFETLLNSHPNVSSNGEIFSVKQRRSSI 127

Query: 140 STIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWK 182
           STI  TLD ++ L+W SSA+KN+C AAVG KWMLNQ  ++Y +
Sbjct: 128 STIQATLDTVFGLEWVSSAAKNDCIAAVGFKWMLNQGVMVYHR 170


>gi|242046744|ref|XP_002461118.1| hypothetical protein SORBIDRAFT_02g041100 [Sorghum bicolor]
 gi|241924495|gb|EER97639.1| hypothetical protein SORBIDRAFT_02g041100 [Sorghum bicolor]
          Length = 335

 Score =  194 bits (492), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 96/189 (50%), Positives = 128/189 (67%), Gaps = 10/189 (5%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVPNIE 72
           ++LKS K +P++ R + + F+   G Y+C +   QI+  +  E  N +  +R C  P + 
Sbjct: 4   YSLKSSKGAPILPRPIFVFFIASFGFYVCYLSFNQITLESGREG-NSREEQRDCRKPYVP 62

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
             E+ YVH+PKP +YSR EC+C PVR+F I+SMQRSGSGWFETLLN+H NISSNGE+F+ 
Sbjct: 63  HEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLLNSHPNISSNGEIFNR 122

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------VRLLYWKC 183
             RR N S+I++TLD +YNLDW +SA+KNECTAA GLKWMLNQ         V  L  K 
Sbjct: 123 VDRRENISSILQTLDTLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPEDIVSYLKKKG 182

Query: 184 SSISFSFQR 192
            S+ F F+R
Sbjct: 183 VSVIFLFRR 191


>gi|242035665|ref|XP_002465227.1| hypothetical protein SORBIDRAFT_01g034575 [Sorghum bicolor]
 gi|241919081|gb|EER92225.1| hypothetical protein SORBIDRAFT_01g034575 [Sorghum bicolor]
          Length = 366

 Score =  193 bits (491), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 88/166 (53%), Positives = 118/166 (71%), Gaps = 3/166 (1%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERP---CPVP 69
           ++LKSPK   L LR +++ F+ +   Y+C     QIS   + +  +    E+    C  P
Sbjct: 3   YSLKSPKGPVLPLRSVLVFFIAMFSFYVCYFSFHQISLENEEKMTSADRAEQTEIHCTRP 62

Query: 70  NIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEV 129
                ++ YVH+P+P TY R ECACNPVR+F I+SMQRSGSGWFETLLN+H N+SSNGE+
Sbjct: 63  ATPHEQMRYVHFPRPATYDRGECACNPVRFFVIISMQRSGSGWFETLLNSHPNVSSNGEI 122

Query: 130 FSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           FSV+ RR N S+I+ TLD++Y++DW +SA+KNECTAA GLKWMLNQ
Sbjct: 123 FSVRDRRENISSILATLDRLYDMDWITSAAKNECTAAFGLKWMLNQ 168


>gi|115473697|ref|NP_001060447.1| Os07g0644200 [Oryza sativa Japonica Group]
 gi|33146729|dbj|BAC79620.1| unknown protein [Oryza sativa Japonica Group]
 gi|50509939|dbj|BAD30260.1| unknown protein [Oryza sativa Japonica Group]
 gi|113611983|dbj|BAF22361.1| Os07g0644200 [Oryza sativa Japonica Group]
 gi|215694729|dbj|BAG89920.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 337

 Score =  192 bits (488), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 90/164 (54%), Positives = 122/164 (74%), Gaps = 1/164 (0%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQV-IERPCPVPNI 71
           ++LKS K +P   R +++  + + G Y+C +   QI+   +SE  + +V  E  C  P++
Sbjct: 4   YSLKSSKGAPFPPRPILVFLIAIFGFYVCYISFNQITLENRSEENSGEVQAEIHCRKPHL 63

Query: 72  EPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
              E+ YVH+PKP++YSR EC+CN VR F ++SMQRSGSGWFETLLN+H NISSNGE+F+
Sbjct: 64  PHEELRYVHFPKPESYSRGECSCNLVRSFVLVSMQRSGSGWFETLLNSHPNISSNGEIFN 123

Query: 132 VKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
              RR N S+I++TLDK+YNLDWF+SA+KNECTAA GLKWMLNQ
Sbjct: 124 RVDRRENISSILQTLDKLYNLDWFTSAAKNECTAAFGLKWMLNQ 167


>gi|357121725|ref|XP_003562568.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
          Length = 337

 Score =  191 bits (484), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 91/164 (55%), Positives = 117/164 (71%), Gaps = 1/164 (0%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKS-EFLNVQVIERPCPVPNI 71
           ++LKS K +PL  R  ++  + V G+Y+C +   QI    +  E    +  E  C  P++
Sbjct: 4   YSLKSSKGAPLPPRPTLVFLIAVFGLYVCYLSFNQIRLENEGGENSAEEHTEHVCTKPSV 63

Query: 72  EPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
              E+ YVH PKPK Y+R EC+C PVR+F I+SMQRSGSGWFETLLN+H NISSNGE+F+
Sbjct: 64  PSEELRYVHLPKPKGYNRGECSCTPVRFFVIVSMQRSGSGWFETLLNSHPNISSNGEIFN 123

Query: 132 VKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
              RR N S+IV+TLDK+YNLDW +SA+KNECTAA GLKWMLNQ
Sbjct: 124 RIDRRENLSSIVQTLDKLYNLDWLTSAAKNECTAAFGLKWMLNQ 167


>gi|326523931|dbj|BAJ96976.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 339

 Score =  190 bits (483), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 93/163 (57%), Positives = 115/163 (70%), Gaps = 3/163 (1%)

Query: 16  KSPKKSPLVLRMLVLVFVMVC-GVYICSVCVKQISARTKSEFLN--VQVIERPCPVPNIE 72
           K  K +P +     LVF++   G+Y+C +   QI   +K    N      E  C  P + 
Sbjct: 7   KGSKGAPSLPPRPTLVFLIALFGLYVCYLSFNQIRMESKHAEENGAQDQNEHVCAKPYVP 66

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
             E+PYVH+PKPK YSRAEC+CNPVR+F I+SMQRSGSGWFETLLN+H NISSNGE+F+ 
Sbjct: 67  SEELPYVHFPKPKGYSRAECSCNPVRFFVIMSMQRSGSGWFETLLNSHPNISSNGEIFNR 126

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
             RR N S+IV+TLDK+YNLDW +SA+KNECTAA G KWMLNQ
Sbjct: 127 VDRRENISSIVQTLDKLYNLDWLTSAAKNECTAAFGFKWMLNQ 169


>gi|255562595|ref|XP_002522303.1| conserved hypothetical protein [Ricinus communis]
 gi|223538381|gb|EEF39987.1| conserved hypothetical protein [Ricinus communis]
          Length = 329

 Score =  190 bits (482), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 91/154 (59%), Positives = 118/154 (76%), Gaps = 3/154 (1%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           MAE +C F KDT  +K+PKKSP++LRM+VL+  MVCGVYICSVC+KQIS  +K +  ++Q
Sbjct: 1   MAEYICLFNKDTVIIKAPKKSPILLRMIVLLIAMVCGVYICSVCLKQISTNSKIKIEDIQ 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VIERP    + +  +IP VHYP P+T++R+EC  NPVRYFAILSMQRSGSGWFETLLN+H
Sbjct: 61  VIERPSDNESTQ-MQIPIVHYPHPETFNRSECVHNPVRYFAILSMQRSGSGWFETLLNSH 119

Query: 121 TNISSNGEVFS--VKVRRSNASTIVETLDKIYNL 152
            N+SSNGE+FS  +  RR+  + +  TL +I  L
Sbjct: 120 VNVSSNGEIFSFIIGFRRTLLTILYYTLPRIKRL 153


>gi|222624965|gb|EEE59097.1| hypothetical protein OsJ_10950 [Oryza sativa Japonica Group]
          Length = 339

 Score =  189 bits (481), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 88/168 (52%), Positives = 115/168 (68%), Gaps = 1/168 (0%)

Query: 9   TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFL-NVQVIERPCP 67
           ++  + LKSPK     LR +++  + + G Y+C     QI    K   +   + I   C 
Sbjct: 2   SQSVYALKSPKGPLFPLRSILVFLIALFGFYVCYFSFNQIDLENKENLISGEEQIRTLCR 61

Query: 68  VPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNG 127
              I    + YVH+PKP +YSR ECAC PVR+F I+SMQRSGSGWFETLLN+H N+SSNG
Sbjct: 62  RHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSSNG 121

Query: 128 EVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           E+FS++ RR + S+I+ TLDK+YNLDW +SA+KNECTAA GLKWMLNQ
Sbjct: 122 EIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQ 169


>gi|218192885|gb|EEC75312.1| hypothetical protein OsI_11681 [Oryza sativa Indica Group]
          Length = 339

 Score =  189 bits (481), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 88/168 (52%), Positives = 115/168 (68%), Gaps = 1/168 (0%)

Query: 9   TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFL-NVQVIERPCP 67
           ++  + LKSPK     LR +++  + + G Y+C     QI    K   +   + I   C 
Sbjct: 2   SQSVYALKSPKGPLFPLRSILVFLIALFGFYVCYFSFNQIDLENKENLISGEEQIRTLCR 61

Query: 68  VPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNG 127
              I    + YVH+PKP +YSR ECAC PVR+F I+SMQRSGSGWFETLLN+H N+SSNG
Sbjct: 62  RHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSSNG 121

Query: 128 EVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           E+FS++ RR + S+I+ TLDK+YNLDW +SA+KNECTAA GLKWMLNQ
Sbjct: 122 EIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQ 169


>gi|357119777|ref|XP_003561610.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
          Length = 343

 Score =  189 bits (480), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 85/165 (51%), Positives = 119/165 (72%), Gaps = 2/165 (1%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERP-CPVPNI 71
           ++ KS K +   LR +++ F+ + G Y+C     Q+    + E  + +   +  C  P++
Sbjct: 9   YSSKSSKGTLFPLRSMLVFFIALFGFYVCYFSFTQVDFENEEEMTDAEEQTKVVCRRPSV 68

Query: 72  EPWE-IPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVF 130
            P+E + YVH+P+P TY R ECAC PVR+F I+SMQRSGSGWFETLLN+H N+SSNGE+F
Sbjct: 69  IPYEQMQYVHFPRPMTYDRGECACTPVRFFVIVSMQRSGSGWFETLLNSHPNVSSNGEIF 128

Query: 131 SVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           SV+ RR + ++I+ TLDK+Y+LDW +SA+KNECTAA GLKWMLNQ
Sbjct: 129 SVRERREDIASILRTLDKLYDLDWRTSAAKNECTAAFGLKWMLNQ 173


>gi|108708316|gb|ABF96111.1| nodulation protein, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 337

 Score =  189 bits (480), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 88/167 (52%), Positives = 117/167 (70%), Gaps = 7/167 (4%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLN----VQVIERPCPV 68
           + LKSPK     LR +++  + + G Y+C     QI    K   ++    ++ + R   +
Sbjct: 4   YALKSPKGPLFPLRSILVFLIALFGFYVCYFSFNQIDLENKENLISGEEQIRTLCRRHTI 63

Query: 69  PNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGE 128
           PN     + YVH+PKP +YSR ECAC PVR+F I+SMQRSGSGWFETLLN+H N+SSNGE
Sbjct: 64  PN---ELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSSNGE 120

Query: 129 VFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           +FS++ RR + S+I+ TLDK+YNLDW +SA+KNECTAA GLKWMLNQ
Sbjct: 121 IFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQ 167


>gi|297722141|ref|NP_001173434.1| Os03g0363950 [Oryza sativa Japonica Group]
 gi|255674526|dbj|BAH92162.1| Os03g0363950 [Oryza sativa Japonica Group]
          Length = 366

 Score =  189 bits (479), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 88/167 (52%), Positives = 117/167 (70%), Gaps = 7/167 (4%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLN----VQVIERPCPV 68
           + LKSPK     LR +++  + + G Y+C     QI    K   ++    ++ + R   +
Sbjct: 33  YALKSPKGPLFPLRSILVFLIALFGFYVCYFSFNQIDLENKENLISGEEQIRTLCRRHTI 92

Query: 69  PNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGE 128
           PN     + YVH+PKP +YSR ECAC PVR+F I+SMQRSGSGWFETLLN+H N+SSNGE
Sbjct: 93  PN---ELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSSNGE 149

Query: 129 VFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           +FS++ RR + S+I+ TLDK+YNLDW +SA+KNECTAA GLKWMLNQ
Sbjct: 150 IFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQ 196


>gi|194696388|gb|ACF82278.1| unknown [Zea mays]
          Length = 337

 Score =  189 bits (479), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 95/191 (49%), Positives = 128/191 (67%), Gaps = 12/191 (6%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERP--CPVPN 70
           ++LKS K +P++ R + + F+   G Y+C +   QI+     E  N +  +R   C  P 
Sbjct: 4   YSLKSSKGAPILPRPVFVFFIATFGFYVCYLSFNQITLENNREG-NSREEQRDDICRKPY 62

Query: 71  IEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVF 130
           +   E+ YVH+PKP +YSR EC+C PVR+F I+SMQRSGSGWFETL+N+H NISSNGE+F
Sbjct: 63  VPYEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLMNSHPNISSNGEIF 122

Query: 131 SVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------VRLLYW 181
           +   RR N S+I++TLD++YNLDW +SA+KNECTAA GLKWMLNQ         V  L  
Sbjct: 123 NRVDRRENISSILQTLDRLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPEDIVSYLNK 182

Query: 182 KCSSISFSFQR 192
           K  S+ F F+R
Sbjct: 183 KGVSVIFLFRR 193


>gi|212723924|ref|NP_001131369.1| uncharacterized protein LOC100192694 [Zea mays]
 gi|194691340|gb|ACF79754.1| unknown [Zea mays]
 gi|224028935|gb|ACN33543.1| unknown [Zea mays]
 gi|414887880|tpg|DAA63894.1| TPA: hypothetical protein ZEAMMB73_966617 [Zea mays]
 gi|414887881|tpg|DAA63895.1| TPA: hypothetical protein ZEAMMB73_966617 [Zea mays]
          Length = 337

 Score =  188 bits (477), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 95/191 (49%), Positives = 127/191 (66%), Gaps = 12/191 (6%)

Query: 13  FNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERP--CPVPN 70
           ++LKS K +P++ R + + F+   G Y+C +   QI+     E  N +  +R   C  P 
Sbjct: 4   YSLKSSKGAPILPRPVFVFFIATFGFYVCYLSFNQITLENNREG-NSREEQRDDICRKPY 62

Query: 71  IEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVF 130
           +   E+ YVH+PKP +YSR EC+C PVR+F I+SMQRSGSGWFETL+N+H NISSNGE+F
Sbjct: 63  VPYEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLMNSHPNISSNGEIF 122

Query: 131 SVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------VRLLYW 181
           +   RR N S+I++TLD +YNLDW +SA+KNECTAA GLKWMLNQ         V  L  
Sbjct: 123 NRMDRRENISSILQTLDTLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPEDIVSYLNK 182

Query: 182 KCSSISFSFQR 192
           K  S+ F F+R
Sbjct: 183 KGVSVIFLFRR 193


>gi|168059476|ref|XP_001781728.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666812|gb|EDQ53457.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 378

 Score =  180 bits (456), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 95/181 (52%), Positives = 113/181 (62%), Gaps = 6/181 (3%)

Query: 6   CSF----TKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCV--KQISARTKSEFLNV 59
           C+F     KD  +LK  +K     R+LVLV +   G+Y C V V  +Q S  +  E +  
Sbjct: 34  CNFFAGNMKDLSSLKEWRKLSTAWRLLVLVVIGAIGLYTCMVGVGTRQFSYESVQEVIVK 93

Query: 60  QVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNN 119
                        P    Y HYP P  Y R EC C PV YF ILSMQRSGSGWFETLLNN
Sbjct: 94  DWRREDDSCSRRGPSVDFYQHYPLPHAYERQECTCTPVHYFVILSMQRSGSGWFETLLNN 153

Query: 120 HTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLL 179
           H NISS+GE+FSVK RR N S+I +T+DK+YNLDW +SA+KNECTAAVG KWMLNQ  + 
Sbjct: 154 HPNISSHGEIFSVKDRRENFSSIAKTMDKVYNLDWLNSAAKNECTAAVGFKWMLNQGPME 213

Query: 180 Y 180
           Y
Sbjct: 214 Y 214


>gi|302770487|ref|XP_002968662.1| hypothetical protein SELMODRAFT_90926 [Selaginella moellendorffii]
 gi|302816465|ref|XP_002989911.1| hypothetical protein SELMODRAFT_130724 [Selaginella moellendorffii]
 gi|300142222|gb|EFJ08924.1| hypothetical protein SELMODRAFT_130724 [Selaginella moellendorffii]
 gi|300163167|gb|EFJ29778.1| hypothetical protein SELMODRAFT_90926 [Selaginella moellendorffii]
          Length = 329

 Score =  172 bits (436), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 84/149 (56%), Positives = 109/149 (73%), Gaps = 8/149 (5%)

Query: 36  CGVYICSVCVKQISARTKSEFLNV----QVIERPCPVPNIEPWEIPYVHYPKPKTYSRAE 91
           CGVY+C + V +   RT  + L++    QV  R    P +   + P  HYP+P+TYSR E
Sbjct: 19  CGVYMCLIGVDR---RTTYDRLSLDPIQQVWRRENLCPRLSRAKFPQ-HYPQPRTYSRKE 74

Query: 92  CACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYN 151
           C C PV +F +LSMQRSGSGWFETLLN+H N+SS+GE+FS+  RR+N STI +TLD++YN
Sbjct: 75  CRCVPVHFFVLLSMQRSGSGWFETLLNSHPNVSSHGEIFSIGRRRANFSTIKQTLDEVYN 134

Query: 152 LDWFSSASKNECTAAVGLKWMLNQVRLLY 180
           L+W+SSA+KNECTAAVG KWMLNQ  L +
Sbjct: 135 LEWYSSAAKNECTAAVGFKWMLNQGALAH 163


>gi|222637549|gb|EEE67681.1| hypothetical protein OsJ_25329 [Oryza sativa Japonica Group]
          Length = 923

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 86/157 (54%), Positives = 112/157 (71%), Gaps = 10/157 (6%)

Query: 46  KQISARTKSEFLNVQV-IERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILS 104
           ++I+   +SE  + +V  E  C  P++   E+ YVH+PKP++YSR EC+CN VR F ++S
Sbjct: 623 EKITLENRSEENSGEVQAEIHCRKPHLPHEELRYVHFPKPESYSRGECSCNLVRSFVLVS 682

Query: 105 MQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECT 164
           MQRSGSGWFETLLN+H NISSNGE+F+   RR N S+I++TLDK+YNLDWF+SA+KNECT
Sbjct: 683 MQRSGSGWFETLLNSHPNISSNGEIFNRVDRRENISSILQTLDKLYNLDWFTSAAKNECT 742

Query: 165 AAVGLKWMLNQ---------VRLLYWKCSSISFSFQR 192
           AA GLKWMLNQ            L  K  S+ F F+R
Sbjct: 743 AAFGLKWMLNQGFMDHHDDIASYLNKKGVSVIFLFRR 779


>gi|413955663|gb|AFW88312.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 374

 Score =  170 bits (430), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 91/196 (46%), Positives = 118/196 (60%), Gaps = 19/196 (9%)

Query: 16  KSPKK--SPLVLRMLVLVFVMVCGVYICSVCVKQI-----SARTKSEFLNVQVIERPCP- 67
           KSPK   +   LR  ++ FV   G Y+C    +QI      A T +     +  E  C  
Sbjct: 8   KSPKGPVAAFPLRSALVFFVTSFGFYVCYFSSRQIIALETEAETTASGGGAEPTETRCTN 67

Query: 68  VPNIEPW--EIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISS 125
            P I P   +  YVH+P+P TY R EC CNPVR+F I+S QRSGSGW E LLN+H N+SS
Sbjct: 68  RPAIIPHGPQTRYVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSS 127

Query: 126 NGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------V 176
           NGEVFS++ RR N S+++ TLD++Y +DW +SA+KNECTAA G KWMLNQ         V
Sbjct: 128 NGEVFSMRERRQNLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIV 187

Query: 177 RLLYWKCSSISFSFQR 192
             L  K + + F F+R
Sbjct: 188 NYLNRKGAMVVFLFRR 203


>gi|413955662|gb|AFW88311.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 346

 Score =  170 bits (430), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 91/196 (46%), Positives = 118/196 (60%), Gaps = 19/196 (9%)

Query: 16  KSPKK--SPLVLRMLVLVFVMVCGVYICSVCVKQI-----SARTKSEFLNVQVIERPCP- 67
           KSPK   +   LR  ++ FV   G Y+C    +QI      A T +     +  E  C  
Sbjct: 8   KSPKGPVAAFPLRSALVFFVTSFGFYVCYFSSRQIIALETEAETTASGGGAEPTETRCTN 67

Query: 68  VPNIEPW--EIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISS 125
            P I P   +  YVH+P+P TY R EC CNPVR+F I+S QRSGSGW E LLN+H N+SS
Sbjct: 68  RPAIIPHGPQTRYVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSS 127

Query: 126 NGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------V 176
           NGEVFS++ RR N S+++ TLD++Y +DW +SA+KNECTAA G KWMLNQ         V
Sbjct: 128 NGEVFSMRERRQNLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIV 187

Query: 177 RLLYWKCSSISFSFQR 192
             L  K + + F F+R
Sbjct: 188 NYLNRKGAMVVFLFRR 203


>gi|226502206|ref|NP_001144194.1| uncharacterized protein LOC100277054 [Zea mays]
 gi|195638236|gb|ACG38586.1| hypothetical protein [Zea mays]
          Length = 340

 Score =  167 bits (423), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 87/198 (43%), Positives = 118/198 (59%), Gaps = 23/198 (11%)

Query: 16  KSPKK--SPLVLRMLVLVFVMVCGVYICSVCVKQI----------SARTKSEFLNVQVIE 63
           KSPK   +   LR  ++ FV   G Y+C    +QI          ++   +E    +   
Sbjct: 8   KSPKGPVAAFPLRSALVFFVTSFGFYVCYFSSRQIIALETEAETTASGGGAEPTETRCTN 67

Query: 64  RPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNI 123
           RP  +P+    +  YVH+P+P TY R EC CNPVR+F I+S QRSGSGW   LLN+H N+
Sbjct: 68  RPAIIPHGP--QTRYVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVXALLNSHPNV 125

Query: 124 SSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ-------- 175
           SSNGEVFS++ RR N S+++ TLD++Y +DW +SA+KNECTAA G KWMLNQ        
Sbjct: 126 SSNGEVFSMRERRQNLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRD 185

Query: 176 -VRLLYWKCSSISFSFQR 192
            V  L  K + + F F+R
Sbjct: 186 IVNYLNRKGAMVVFLFRR 203


>gi|168020679|ref|XP_001762870.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685979|gb|EDQ72371.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 306

 Score =  165 bits (417), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 72/98 (73%), Positives = 80/98 (81%)

Query: 78  YVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRS 137
           Y HYP P TY R EC C PV YF ILSMQRSGSGWFETLLNNH NISS+GE+FSV+ RR 
Sbjct: 40  YQHYPLPHTYERNECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVRERRD 99

Query: 138 NASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
           N S+I   +DK++NLDW +SASKNECTAAVG KWMLNQ
Sbjct: 100 NFSSIARNMDKVFNLDWLNSASKNECTAAVGFKWMLNQ 137


>gi|255538396|ref|XP_002510263.1| conserved hypothetical protein [Ricinus communis]
 gi|223550964|gb|EEF52450.1| conserved hypothetical protein [Ricinus communis]
          Length = 278

 Score =  159 bits (401), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 70/92 (76%), Positives = 80/92 (86%)

Query: 91  ECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIY 150
           ECACNPVRYFAIL+MQRSGSGWFETLLN+H N+SSNGE+F  K RR+N S I++TLDK+Y
Sbjct: 25  ECACNPVRYFAILTMQRSGSGWFETLLNSHMNVSSNGEIFGKKERRANVSAIMKTLDKVY 84

Query: 151 NLDWFSSASKNECTAAVGLKWMLNQVRLLYWK 182
           NLDWFSSA+KNEC AAVG KWMLNQ  L + K
Sbjct: 85  NLDWFSSAAKNECNAAVGFKWMLNQGVLEHHK 116


>gi|302799659|ref|XP_002981588.1| hypothetical protein SELMODRAFT_114849 [Selaginella moellendorffii]
 gi|300150754|gb|EFJ17403.1| hypothetical protein SELMODRAFT_114849 [Selaginella moellendorffii]
          Length = 336

 Score =  156 bits (394), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 73/113 (64%), Positives = 86/113 (76%), Gaps = 2/113 (1%)

Query: 80  HYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNA 139
           HYP+PKTYSR ECAC PV  F ILS QRSGSGWFETLLN+H NI+S+GE+FSV+ RR N 
Sbjct: 73  HYPRPKTYSRGECACTPVHNFVILSTQRSGSGWFETLLNSHPNITSHGEIFSVRPRRHNF 132

Query: 140 STIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWKCSSISFSFQR 192
           S +  TLD +YNLDW +SASKN CT+ VG KWMLNQ  + Y   S +S  F+R
Sbjct: 133 SIVSHTLDALYNLDWHNSASKNSCTSVVGFKWMLNQGAMEY--RSEVSSYFKR 183


>gi|302759563|ref|XP_002963204.1| hypothetical protein SELMODRAFT_80417 [Selaginella moellendorffii]
 gi|300168472|gb|EFJ35075.1| hypothetical protein SELMODRAFT_80417 [Selaginella moellendorffii]
          Length = 336

 Score =  155 bits (393), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 89/180 (49%), Positives = 110/180 (61%), Gaps = 7/180 (3%)

Query: 16  KSPKKSPLVLRMLVLVFVMVCGVYI-CSVCVKQISARTK--SEFLNVQVIERPCPVPNIE 72
           KS  +  LV ++++L    V GVY+  S     +S  +      L     E  CP     
Sbjct: 8   KSFLRVNLVWKLVILFVAGVLGVYLFLSRADHGLSYHSLRLDPLLQAWRKEDFCP-QRFA 66

Query: 73  PWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
             E    HYP+PKTYSR ECAC PV  F ILS QRSGSGWFETLLN+H NI+S+GE+FSV
Sbjct: 67  KLEFAQ-HYPRPKTYSRGECACTPVHNFVILSTQRSGSGWFETLLNSHPNITSHGEIFSV 125

Query: 133 KVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWKCSSISFSFQR 192
           + RR N S +  TLD +YNLDW +SASKN CT+ VG KWMLNQ  + Y   S +S  F+R
Sbjct: 126 RPRRHNFSIVSHTLDALYNLDWHNSASKNSCTSVVGFKWMLNQGAMEY--RSEVSSYFKR 183


>gi|6562002|emb|CAB62491.1| hypothetical protein [Arabidopsis thaliana]
          Length = 263

 Score =  155 bits (392), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 67/86 (77%), Positives = 78/86 (90%)

Query: 90  AECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKI 149
           AEC  NPVRYFAILSMQRSGSGWFETLLN+H N+SSNGE+FSV  RR N S+I++TLD++
Sbjct: 36  AECGHNPVRYFAILSMQRSGSGWFETLLNSHNNVSSNGEIFSVLDRRKNISSIIQTLDRV 95

Query: 150 YNLDWFSSASKNECTAAVGLKWMLNQ 175
           YNLDWF+SASKNEC+AA+G KWMLNQ
Sbjct: 96  YNLDWFTSASKNECSAAIGFKWMLNQ 121


>gi|168007783|ref|XP_001756587.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692183|gb|EDQ78541.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 256

 Score =  153 bits (386), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 71/101 (70%), Positives = 80/101 (79%)

Query: 89  RAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDK 148
           R EC C PV +F ILSMQRSGSGWFETLLNNH NISS+GE+FSVK RR+N STI  T+DK
Sbjct: 1   RQECQCTPVHFFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVKPRRANFSTIARTMDK 60

Query: 149 IYNLDWFSSASKNECTAAVGLKWMLNQVRLLYWKCSSISFS 189
           IYNLDW +SA+KNECTAAVG KWMLNQ  + Y +  S  F 
Sbjct: 61  IYNLDWLNSAAKNECTAAVGFKWMLNQGPMEYSREVSDYFE 101


>gi|255641735|gb|ACU21138.1| unknown [Glycine max]
          Length = 144

 Score =  152 bits (385), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 71/135 (52%), Positives = 98/135 (72%), Gaps = 1/135 (0%)

Query: 1   MAEDLCSFTKDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQ 60
           M ED+C   K+   +K PK+SP++LRM V++F MVCGV+I SVC+ QIS + ++ F++ +
Sbjct: 1   MVEDVCFLYKEILVMKPPKRSPMLLRMAVVIFSMVCGVFIFSVCLMQISTQARTTFMDFK 60

Query: 61  VIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNH 120
           VI+       ++      +HYPKP ++SR EC  NPV +FAILS QRSGSGWFETLLN+H
Sbjct: 61  VIDNHSQ-SILKLMNTHLLHYPKPASFSRNECVHNPVLFFAILSNQRSGSGWFETLLNSH 119

Query: 121 TNISSNGEVFSVKVR 135
            N+SSNGE+FSV+ R
Sbjct: 120 INVSSNGEIFSVRER 134


>gi|293334015|ref|NP_001170415.1| uncharacterized protein LOC100384402 [Zea mays]
 gi|224035729|gb|ACN36940.1| unknown [Zea mays]
          Length = 275

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 68/85 (80%), Positives = 77/85 (90%)

Query: 91  ECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIY 150
           ECAC  VR+FAILSMQRSGSGW ETLLN+H NISSNGE+FSVK RRSN + I +TLD++Y
Sbjct: 23  ECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNITAITKTLDRLY 82

Query: 151 NLDWFSSASKNECTAAVGLKWMLNQ 175
           NLDW+SSA+KNECTAAVGLKWMLNQ
Sbjct: 83  NLDWYSSAAKNECTAAVGLKWMLNQ 107


>gi|413925274|gb|AFW65206.1| hypothetical protein ZEAMMB73_585643 [Zea mays]
          Length = 275

 Score =  151 bits (382), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 69/85 (81%), Positives = 77/85 (90%)

Query: 91  ECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIY 150
           ECAC  VR+FAILSMQRSGSGW ETLLN+H NISSNGE+FSVK RRSN + I +TLDK+Y
Sbjct: 23  ECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNFTAIKKTLDKLY 82

Query: 151 NLDWFSSASKNECTAAVGLKWMLNQ 175
           NLDW+SSA+KNECTAAVGLKWMLNQ
Sbjct: 83  NLDWYSSAAKNECTAAVGLKWMLNQ 107


>gi|4335739|gb|AAD17417.1| hypothetical protein [Arabidopsis thaliana]
          Length = 213

 Score =  151 bits (382), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 77/138 (55%), Positives = 87/138 (63%), Gaps = 35/138 (25%)

Query: 42  SVCVKQISARTKSEFLNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYFA 101
           S+  KQI     + FLNV+V ERPCP PNI+PW+IPYVHYPKPKTYS             
Sbjct: 6   SLIPKQIGVVPSAGFLNVEVFERPCPEPNIQPWDIPYVHYPKPKTYS------------- 52

Query: 102 ILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKN 161
                                 SSNGE+FSVK RR+N STI ETLDK+YNLDW SSASKN
Sbjct: 53  ----------------------SSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKN 90

Query: 162 ECTAAVGLKWMLNQVRLL 179
           ECT+AVGLKWMLNQ  +L
Sbjct: 91  ECTSAVGLKWMLNQAEIL 108


>gi|413955661|gb|AFW88310.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 362

 Score =  151 bits (381), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/193 (44%), Positives = 112/193 (58%), Gaps = 25/193 (12%)

Query: 16  KSPKK--SPLVLRMLVLVFVMVCGVYICSVCVKQI-----SARTKSEFLNVQVIERPCPV 68
           KSPK   +   LR  ++ FV   G Y+C    +QI      A T +     +  E  C  
Sbjct: 8   KSPKGPVAAFPLRSALVFFVTSFGFYVCYFSSRQIIALETEAETTASGGGAEPTETRC-- 65

Query: 69  PNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGE 128
               P  IP  H P+    +R EC CNPVR+F I+S QRSGSGW E LLN+H N+SSNGE
Sbjct: 66  -TNRPAIIP--HGPQ----TRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGE 118

Query: 129 VFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------VRLL 179
           VFS++ RR N S+++ TLD++Y +DW +SA+KNECTAA G KWMLNQ         V  L
Sbjct: 119 VFSMRERRQNLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYL 178

Query: 180 YWKCSSISFSFQR 192
             K + + F F+R
Sbjct: 179 NRKGAMVVFLFRR 191


>gi|194702290|gb|ACF85229.1| unknown [Zea mays]
 gi|413955660|gb|AFW88309.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 334

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/193 (44%), Positives = 112/193 (58%), Gaps = 25/193 (12%)

Query: 16  KSPKK--SPLVLRMLVLVFVMVCGVYICSVCVKQI-----SARTKSEFLNVQVIERPCPV 68
           KSPK   +   LR  ++ FV   G Y+C    +QI      A T +     +  E  C  
Sbjct: 8   KSPKGPVAAFPLRSALVFFVTSFGFYVCYFSSRQIIALETEAETTASGGGAEPTETRC-- 65

Query: 69  PNIEPWEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGE 128
               P  IP  H P+    +R EC CNPVR+F I+S QRSGSGW E LLN+H N+SSNGE
Sbjct: 66  -TNRPAIIP--HGPQ----TRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGE 118

Query: 129 VFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ---------VRLL 179
           VFS++ RR N S+++ TLD++Y +DW +SA+KNECTAA G KWMLNQ         V  L
Sbjct: 119 VFSMRERRQNLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYL 178

Query: 180 YWKCSSISFSFQR 192
             K + + F F+R
Sbjct: 179 NRKGAMVVFLFRR 191


>gi|167999402|ref|XP_001752406.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696306|gb|EDQ82645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 256

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 67/92 (72%), Positives = 74/92 (80%)

Query: 89  RAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDK 148
           R EC C PV YF ILSMQRSGSGWFETLLNNH NISS+GEVFSV  RR N S+I   +DK
Sbjct: 1   RNECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEVFSVGERRDNFSSIATNMDK 60

Query: 149 IYNLDWFSSASKNECTAAVGLKWMLNQVRLLY 180
           ++NLDW +SASKNECTAAVG KWMLNQ  + Y
Sbjct: 61  VFNLDWLNSASKNECTAAVGFKWMLNQGPMEY 92


>gi|238013712|gb|ACR37891.1| unknown [Zea mays]
          Length = 239

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 57/71 (80%), Positives = 65/71 (91%)

Query: 105 MQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECT 164
           MQRSGSGW ETLLN+H NISSNGE+FSVK RRSN + I +TLD++YNLDW+SSA+KNECT
Sbjct: 1   MQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNITAITKTLDRLYNLDWYSSAAKNECT 60

Query: 165 AAVGLKWMLNQ 175
           AAVGLKWMLNQ
Sbjct: 61  AAVGLKWMLNQ 71


>gi|218200116|gb|EEC82543.1| hypothetical protein OsI_27076 [Oryza sativa Indica Group]
          Length = 866

 Score = 99.8 bits (247), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/174 (37%), Positives = 93/174 (53%), Gaps = 37/174 (21%)

Query: 11  DTFNLKSPKKSPLVLRMLVLVFVM----VCGVYICSVCVKQISARTKSEFLNVQV-IERP 65
           D+ N+ +   +PL  RM ++ +      V  +++       I+   +SE  + +V  E  
Sbjct: 551 DSVNVFAGWNTPLSARMQIVRWTAPAEGVTDMWVPPTSGPHITLEYRSEENSGEVQAEIH 610

Query: 66  CPVPNIEPWEIPYVHYPKPKTYSRAECACNPVRYF-AILSMQR---SGSGWFETLLNNHT 121
           C  P +   E+ YVH+PKP++YSR EC+CNPVR+  A+  +QR   SG G  E++     
Sbjct: 611 CRKPRLPHEELRYVHFPKPESYSRGECSCNPVRFLCALYLLQRVAESGMG-VESIC---- 665

Query: 122 NISSNGEVFSVKVRRSNASTIVETLDKIYNLDWFSSASKNECTAAVGLKWMLNQ 175
                                  TLDK+YNLDWF+SA+KNECTAA GLKWMLNQ
Sbjct: 666 -----------------------TLDKLYNLDWFTSAAKNECTAAFGLKWMLNQ 696


>gi|413955389|gb|AFW88038.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 105

 Score = 82.4 bits (202), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 55/80 (68%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           K+ F+ K+PKKSPL LR++V    ++CG+ I S+C+KQ+ +   S  + ++V+E+PC   
Sbjct: 26  KNAFDSKAPKKSPLALRVVVFAMTVLCGISIWSMCMKQLGSDGWSRIVKIEVVEQPCNKS 85

Query: 70  NIEPWEIPYVHYPKPKTYSR 89
            + P E+ + HYP+P TYSR
Sbjct: 86  TVPPSEVQFAHYPQPTTYSR 105


>gi|413955390|gb|AFW88039.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 113

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 55/80 (68%)

Query: 10  KDTFNLKSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQISARTKSEFLNVQVIERPCPVP 69
           K+ F+ K+PKKSPL LR++V    ++CG+ I S+C+KQ+ +   S  + ++V+E+PC   
Sbjct: 26  KNAFDSKAPKKSPLALRVVVFAMTVLCGISIWSMCMKQLGSDGWSRIVKIEVVEQPCNKS 85

Query: 70  NIEPWEIPYVHYPKPKTYSR 89
            + P E+ + HYP+P TYSR
Sbjct: 86  TVPPSEVQFAHYPQPTTYSR 105


>gi|4455187|emb|CAB36719.1| hypothetical protein [Arabidopsis thaliana]
 gi|7270392|emb|CAB80159.1| hypothetical protein [Arabidopsis thaliana]
          Length = 403

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 30/43 (69%), Positives = 34/43 (79%)

Query: 47  QISARTKSEFLNVQVIERPCPVPNIEPWEIPYVHYPKPKTYSR 89
           QI       FLNV+V ERPCP PNIEPW+IP+VHYPKPKTY+R
Sbjct: 110 QIGVSPNYGFLNVEVFERPCPEPNIEPWDIPFVHYPKPKTYNR 152


>gi|414869371|tpg|DAA47928.1| TPA: hypothetical protein ZEAMMB73_204795 [Zea mays]
          Length = 110

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 4/77 (5%)

Query: 16  KSPKKSPLVLRMLVLVFVMVCGVYICSVCVKQ---ISARTKSEFLNVQVIERPCPVPNIE 72
           K  K+ PL L + +L  +M+  VYI S+ +KQ   +    ++  +  +  E+PC  P+I 
Sbjct: 35  KPAKRYPLALWIGILGLIMLVAVYIFSLSLKQNGMLFGLMQTNLIEKER-EKPCHDPSIP 93

Query: 73  PWEIPYVHYPKPKTYSR 89
             EIPY+HYP P TY R
Sbjct: 94  DTEIPYLHYPMPNTYDR 110


>gi|128469|sp|P06237.1|NOH4_RHIME RecName: Full=Nodulation protein H; AltName: Full=Host-specificity
           of nodulation protein D
 gi|152243|gb|AAA26291.1| host-specificity of modulation protein D [Sinorhizobium meliloti]
          Length = 247

 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 27/37 (72%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P R FAIL+M+R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPRPFAILAMRRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|186474683|ref|YP_001863654.1| NodH sulfotransferase [Burkholderia phymatum STM815]
 gi|184198642|gb|ACC76604.1| NodH sulfotransferase [Burkholderia phymatum STM815]
          Length = 251

 Score = 43.5 bits (101), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 45/84 (53%), Gaps = 9/84 (10%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFSV-----KVRRSNASTIVETLDKIYNLDW 154
           F IL M R+G+ + E LLN H N+SSNGE+ +        R+    T  E L+  Y L +
Sbjct: 11  FVILGMPRTGTHYLEELLNEHPNVSSNGELLNPYDAIWHNRKRLLLTDHELLELAY-LHY 69

Query: 155 FSSASKNECTAAVGLKWMLNQVRL 178
            + A KN  T  VG K  +NQ +L
Sbjct: 70  PARAGKNAIT-HVGCK--INQPQL 90


>gi|1531623|gb|AAB16900.1| NodH [Rhizobium sp. N33]
          Length = 248

 Score = 42.7 bits (99), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           FAIL+M R+G+ + E LLN H NI SNGE+ +
Sbjct: 11  FAILAMPRTGTHYLEVLLNEHPNILSNGELLN 42


>gi|384540890|ref|YP_005724973.1| NodH sulfotransferase [Sinorhizobium meliloti SM11]
 gi|336036233|gb|AEH82164.1| NodH sulfotransferase [Sinorhizobium meliloti SM11]
          Length = 245

 Score = 42.7 bits (99), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P + FAIL+M R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|407690764|ref|YP_006814348.1| Nodulation protein H [Sinorhizobium meliloti Rm41]
 gi|433616513|ref|YP_007193308.1| Sulfotransferase domain protein [Sinorhizobium meliloti GR4]
 gi|407321939|emb|CCM70541.1| Nodulation protein H [Sinorhizobium meliloti Rm41]
 gi|429554760|gb|AGA09709.1| Sulfotransferase domain protein [Sinorhizobium meliloti GR4]
          Length = 247

 Score = 42.7 bits (99), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P + FAIL+M R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|334319373|ref|YP_004551932.1| NodH sulfotransferase [Sinorhizobium meliloti AK83]
 gi|384532354|ref|YP_005717958.1| NodH sulfotransferase [Sinorhizobium meliloti BL225C]
 gi|333814530|gb|AEG07198.1| NodH sulfotransferase [Sinorhizobium meliloti BL225C]
 gi|334099800|gb|AEG57809.1| NodH sulfotransferase [Sinorhizobium meliloti AK83]
          Length = 247

 Score = 42.7 bits (99), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P + FAIL+M R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|16262917|ref|NP_435710.1| NodH sulfotransferase [Sinorhizobium meliloti 1021]
 gi|128468|sp|P06236.1|NODH_RHIME RecName: Full=Nodulation protein H; AltName: Full=Host-specificity
           of nodulation protein D
 gi|46310|emb|CAA27963.1| unnamed protein product [Sinorhizobium meliloti]
 gi|152368|gb|AAA26339.1| nodulation protein [Sinorhizobium meliloti]
 gi|14523561|gb|AAK65122.1| NodH sulfotransferase [Sinorhizobium meliloti 1021]
 gi|226080|prf||1410222D nodH gene
          Length = 247

 Score = 42.4 bits (98), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P + FAIL+M R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|418403142|ref|ZP_12976639.1| NodH sulfotransferase [Sinorhizobium meliloti CCNWSX0020]
 gi|359502916|gb|EHK75481.1| NodH sulfotransferase [Sinorhizobium meliloti CCNWSX0020]
          Length = 247

 Score = 42.4 bits (98), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P + FAIL+M R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|440222681|ref|YP_007336086.1| sulfotransferase NodH [Rhizobium tropici CIAT 899]
 gi|1709321|sp|P52994.1|NODH_RHITR RecName: Full=Nodulation protein H
 gi|1107851|emb|CAA60912.1| NodH protein [Rhizobium tropici CIAT 899]
 gi|1280527|gb|AAB08982.1| NodH [Rhizobium leucaenae]
 gi|386870456|gb|AFJ42556.1| NodH [Rhizobium tropici]
 gi|440040828|gb|AGB73540.1| sulfotransferase NodH [Rhizobium tropici CIAT 899]
          Length = 249

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           F IL+M R+G+ + E LLN+H NI SNGE+ +
Sbjct: 11  FVILAMPRTGTHYLEALLNDHPNILSNGELLN 42


>gi|150378149|ref|YP_001314744.1| NodH sulfotransferase [Sinorhizobium medicae WSM419]
 gi|150032696|gb|ABR64811.1| NodH sulfotransferase [Sinorhizobium medicae WSM419]
          Length = 247

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           P + FAIL+M R+G+ + E L+N H N+ SNGE+ + 
Sbjct: 7   PPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNT 43


>gi|389688854|ref|ZP_10178419.1| hypothetical protein MicloDRAFT_00005250 [Microvirga sp. WSM3557]
 gi|388590338|gb|EIM30622.1| hypothetical protein MicloDRAFT_00005250 [Microvirga sp. WSM3557]
          Length = 248

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 24/33 (72%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           FA+L+M R+G+ + E LLN H N+ SNGE+ + 
Sbjct: 11  FAVLAMPRTGTHYLEALLNEHPNVLSNGELLNT 43


>gi|407711678|ref|YP_006836451.1| Nodulation protein H [Burkholderia phenoliruptrix BR3459a]
 gi|407240361|gb|AFT90558.1| Nodulation protein H [Burkholderia phenoliruptrix BR3459a]
          Length = 261

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 97  VRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           +  F IL M R+G+ + E LLN H N+SSNGE+ +
Sbjct: 8   IEPFVILGMPRTGTHYLEELLNEHPNVSSNGELLN 42


>gi|336314455|ref|ZP_08569373.1| Flp pilus assembly protein TadD [Rheinheimera sp. A13L]
 gi|335881236|gb|EGM79117.1| Flp pilus assembly protein TadD [Rheinheimera sp. A13L]
          Length = 526

 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 7/60 (11%)

Query: 102 ILSMQRSGSGWFETLLNNHTNISSNGEV--FSVKVRRSNAST-----IVETLDKIYNLDW 154
           IL M RSG+   E +L++H+++ S GE+  F + V+   A+T      VETL+  Y LD+
Sbjct: 284 ILGMPRSGTTLVERILSSHSDVLSAGELQDFGIAVKELTATTSAKVVDVETLNAAYALDF 343


>gi|15667163|gb|AAL02419.1| NodH [Rhizobium tropici CIAT 899]
          Length = 55

 Score = 41.2 bits (95), Expect = 0.26,   Method: Composition-based stats.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           F IL+M R+G+ + E LLN+H NI SNGE+ +
Sbjct: 11  FVILAMPRTGTHYLEALLNDHPNILSNGELLN 42


>gi|220923914|ref|YP_002499216.1| NodH sulfotransferase [Methylobacterium nodulans ORS 2060]
 gi|189231158|emb|CAN84687.1| NodH protein [Methylobacterium nodulans ORS 2060]
 gi|219948521|gb|ACL58913.1| NodH sulfotransferase [Methylobacterium nodulans ORS 2060]
          Length = 249

 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 24/36 (66%)

Query: 96  PVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           P   F IL+M R+G+ + E LLN H N+ SNGE+ +
Sbjct: 7   PPEPFVILAMPRTGTHYLEELLNEHPNVLSNGELLN 42


>gi|188591606|ref|YP_001796205.1| NodH sulfotransferase [Cupriavidus taiwanensis LMG 19424]
 gi|170939001|emb|CAP64014.1| NodH sulfotransferase [Cupriavidus taiwanensis LMG 19424]
          Length = 248

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           F IL M R+G+ + E LLN H N+SSNGE+ +
Sbjct: 14  FVILGMPRTGTHYLEELLNLHPNVSSNGELLN 45


>gi|119468600|ref|ZP_01611652.1| putative orphan protein ; putative tetratricopeptide repeat domain
           [Alteromonadales bacterium TW-7]
 gi|119447656|gb|EAW28922.1| putative orphan protein ; putative tetratricopeptide repeat domain
           [Alteromonadales bacterium TW-7]
          Length = 527

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 7/65 (10%)

Query: 102 ILSMQRSGSGWFETLLNNHTNISSNGEV--FSVKVRR-SNAST----IVETLDKIYNLDW 154
           +L M RSG+   E +L++H+++ S GE+  F + V+R SN  T     ++TL K Y LD+
Sbjct: 285 VLGMPRSGTTLVERILSSHSDVQSAGELQDFGLSVKRLSNTPTPNVLDIQTLTKAYELDF 344

Query: 155 FSSAS 159
               S
Sbjct: 345 HQLGS 349


>gi|386870435|gb|AFJ42539.1| NodH [Mesorhizobium plurifarium]
          Length = 327

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           F IL+M R+G+ + E LLN H N+ SNGE+ + 
Sbjct: 7   FVILAMPRTGTHYLEELLNEHPNVLSNGELLNA 39


>gi|359449695|ref|ZP_09239179.1| hypothetical protein P20480_1896 [Pseudoalteromonas sp. BSi20480]
 gi|358044491|dbj|GAA75428.1| hypothetical protein P20480_1896 [Pseudoalteromonas sp. BSi20480]
          Length = 527

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 37/60 (61%), Gaps = 7/60 (11%)

Query: 102 ILSMQRSGSGWFETLLNNHTNISSNGEV--FSVKVRR-SNAST----IVETLDKIYNLDW 154
           +L M RSG+   E +L++H+++ S GE+  F + V+R SN  T     ++TL K Y LD+
Sbjct: 285 VLGMPRSGTTLVERILSSHSDVQSAGELQDFGLSVKRLSNTPTPNVLDIQTLTKAYELDF 344


>gi|295701194|ref|YP_003610195.1| NodH sulfotransferase [Burkholderia sp. CCGE1002]
 gi|295441517|gb|ADG20684.1| NodH sulfotransferase [Burkholderia sp. CCGE1002]
          Length = 250

 Score = 40.0 bits (92), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           F I+ M R+G+ + E LLN H N+SSNGE+ +
Sbjct: 11  FVIVGMPRTGTHYLEELLNAHPNVSSNGELLN 42


>gi|392538244|ref|ZP_10285381.1| putative tetratricopeptide repeat domain protein [Pseudoalteromonas
           marina mano4]
          Length = 527

 Score = 40.0 bits (92), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 37/60 (61%), Gaps = 7/60 (11%)

Query: 102 ILSMQRSGSGWFETLLNNHTNISSNGEV--FSVKVRR-SNAST----IVETLDKIYNLDW 154
           +L M RSG+   E +L++H+++ S GE+  F + V+R SN  T     ++TL K Y LD+
Sbjct: 285 VLGMPRSGTTLVERILSSHSDVQSAGELQDFGLSVKRLSNTPTPNVLDIQTLTKAYELDF 344


>gi|77021910|gb|ABA60810.1| NodH [Rhizobium sp. LPU83]
 gi|283463704|gb|ADB22542.1| NodH [Rhizobium sp. LPU83]
          Length = 252

 Score = 40.0 bits (92), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           FAIL+M R+G+ + E  LN H N+ SNGE+ + 
Sbjct: 11  FAILAMPRTGTHYLEECLNEHPNVLSNGELLNT 43


>gi|27125924|emb|CAD57726.1| NodH protein [Sinorhizobium sp. BR816]
          Length = 250

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFSV 132
           FAIL M R+G+ + E LLN H  + SNGE+ + 
Sbjct: 9   FAILGMPRTGTHYLEELLNEHPKVWSNGELLNT 41


>gi|392536107|ref|ZP_10283244.1| putative tetratricopeptide repeat domain protein [Pseudoalteromonas
           arctica A 37-1-2]
          Length = 542

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 35/60 (58%), Gaps = 7/60 (11%)

Query: 102 ILSMQRSGSGWFETLLNNHTNISSNGEV--FSVKVRRSNASTI-----VETLDKIYNLDW 154
           +L M RSG+   E +L++H+N+ S GE+  F + V++ + +        +TL K Y LD+
Sbjct: 300 VLGMPRSGTTLVERILSSHSNVQSAGELQDFGLSVKKLSQTQTPHVLDTQTLSKAYELDF 359


>gi|433777055|ref|YP_007307522.1| sulfotransferase family protein [Mesorhizobium australicum WSM2073]
 gi|433669070|gb|AGB48146.1| sulfotransferase family protein [Mesorhizobium australicum WSM2073]
          Length = 245

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 22/32 (68%)

Query: 100 FAILSMQRSGSGWFETLLNNHTNISSNGEVFS 131
           F IL+M R+G+ + E LLN H  + SNGE+ +
Sbjct: 11  FVILAMPRTGTHYLEELLNEHPTVLSNGELLN 42


>gi|422345164|ref|ZP_16426078.1| hypothetical protein HMPREF9476_00151 [Clostridium perfringens
           WAL-14572]
 gi|373227889|gb|EHP50199.1| hypothetical protein HMPREF9476_00151 [Clostridium perfringens
           WAL-14572]
          Length = 694

 Score = 36.6 bits (83), Expect = 5.1,   Method: Composition-based stats.
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 101 AILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLD 153
           + L   ++ SG +E  +NN  N+SS GE   + +  SNAS I   L+K  NLD
Sbjct: 27  STLDSGKNNSG-YEIKVNNSENLSSLGEYKDINLESSNASNITYDLEKYKNLD 78


>gi|110799574|ref|YP_695173.1| exo-alpha-sialidase [Clostridium perfringens ATCC 13124]
 gi|110674221|gb|ABG83208.1| sialidase [Clostridium perfringens ATCC 13124]
          Length = 694

 Score = 36.6 bits (83), Expect = 5.1,   Method: Composition-based stats.
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 101 AILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLD 153
           + L   ++ SG +E  +NN  N+SS GE   + +  SNAS I   L+K  NLD
Sbjct: 27  STLDSGKNNSG-YEIKVNNSENLSSLGEYKDINLESSNASNITYDLEKYKNLD 78


>gi|853807|emb|CAA60796.1| exo-alpha-sialidase [Clostridium perfringens]
          Length = 694

 Score = 36.6 bits (83), Expect = 5.1,   Method: Composition-based stats.
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 101 AILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTIVETLDKIYNLD 153
           + L   ++ SG +E  +NN  N+SS GE   + +  SNAS I   L+K  NLD
Sbjct: 27  STLDSGKNNSG-YEIKVNNSENLSSLGEYKDINLESSNASNITYDLEKYKNLD 78


>gi|301628713|ref|XP_002943493.1| PREDICTED: hepatocyte cell adhesion molecule-like [Xenopus
           (Silurana) tropicalis]
          Length = 229

 Score = 36.6 bits (83), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 13/83 (15%)

Query: 74  WEIPYVHYPKPKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVK 133
           W++    +   +T +RA+  CNPV Y       RS             N+S NG +    
Sbjct: 20  WQVKKSQHQYRETIARADALCNPVLYEGNYPPPRS-------------NVSHNGSLLLYN 66

Query: 134 VRRSNASTIVETLDKIYNLDWFS 156
           + R +A     T+D ++   WF+
Sbjct: 67  LTREDAGKYTVTVDTLWREQWFN 89


>gi|227822469|ref|YP_002826441.1| hypothetical protein NGR_c19240 [Sinorhizobium fredii NGR234]
 gi|227341470|gb|ACP25688.1| conserved hypothetical protein [Sinorhizobium fredii NGR234]
          Length = 256

 Score = 36.2 bits (82), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 28/46 (60%)

Query: 97  VRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTI 142
           +R + +L+  RSGS W  +L+NN  N+  + E  S K+ R + S++
Sbjct: 1   MRGYLLLTEARSGSNWLGSLINNSGNMGQSSEWLSPKIHRLDTSSL 46


>gi|378826444|ref|YP_005189176.1| Nodulation protein H Host-specificity of nodulation protein D
           [Sinorhizobium fredii HH103]
 gi|365179496|emb|CCE96351.1| Nodulation protein H Host-specificity of nodulation protein D
           [Sinorhizobium fredii HH103]
          Length = 256

 Score = 36.2 bits (82), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 28/46 (60%)

Query: 97  VRYFAILSMQRSGSGWFETLLNNHTNISSNGEVFSVKVRRSNASTI 142
           +R + +L+  RSGS W  +L+NN  N+  + E  S K+ R + S++
Sbjct: 1   MRGYLLLTEARSGSNWLGSLINNSGNMGRSSEWLSPKIHRLDTSSL 46


>gi|224015483|ref|XP_002297395.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220967944|gb|EED86309.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1222

 Score = 35.8 bits (81), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 14/69 (20%)

Query: 69  PNIEPW----EIPYVHYPK--PKTYSRAECACNPVRYFAILSMQRSGSGWFETLLNNHTN 122
           P ++PW    E+  +  PK  PK  S+          F +L+  RSGS W  T+L++H  
Sbjct: 333 PAVQPWLDMMEMEGLEIPKQLPKDPSK--------NLFMLLTSHRSGSEWITTMLDSHPQ 384

Query: 123 ISSNGEVFS 131
           + ++GE  S
Sbjct: 385 VCASGETQS 393


>gi|373501485|ref|ZP_09591838.1| 4-hydroxy-3-methylbut-2-enyl diphosphate reductase [Prevotella
           micans F0438]
 gi|371948246|gb|EHO66129.1| 4-hydroxy-3-methylbut-2-enyl diphosphate reductase [Prevotella
           micans F0438]
          Length = 292

 Score = 35.8 bits (81), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 3/66 (4%)

Query: 124 SSNGEVFSVKVRRSNAST-IVETLDKIYNLDWFSSASK-NECTAAVGLKWMLNQVRLLYW 181
           SSNG+V   + +  NA+   VE+ D+I N+DWFS AS    C A    KW++ + R    
Sbjct: 228 SSNGKVLFQQCQSVNANAHQVESADEI-NMDWFSGASTVGICGATSTPKWLMEECRDAIK 286

Query: 182 KCSSIS 187
           K SS S
Sbjct: 287 KYSSNS 292


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.134    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,924,283,227
Number of Sequences: 23463169
Number of extensions: 107684286
Number of successful extensions: 268966
Number of sequences better than 100.0: 162
Number of HSP's better than 100.0 without gapping: 121
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 268604
Number of HSP's gapped (non-prelim): 355
length of query: 197
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 62
effective length of database: 9,191,667,552
effective search space: 569883388224
effective search space used: 569883388224
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)