BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 035616
         (289 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|302142361|emb|CBI19564.3| unnamed protein product [Vitis vinifera]
          Length = 311

 Score =  457 bits (1176), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 220/302 (72%), Positives = 249/302 (82%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQTS +TK K L  +VI+Q   S  I+  EIP++HYP P T +REEC CN VRYFAI
Sbjct: 10  ICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAI 69

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             I +RR NISSI+STMD+VYNLDWF+SASKNE
Sbjct: 70  LSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWFSSASKNE 129

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C AAVGFKWM+NQGLMEHH DI +YFK KGVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 130 CLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 189

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA+ILAKYKPT+NATLL+P+LK+ E+T  KA+EYFKSTRHIV YYEDL
Sbjct: 190 NGTHKSHVHSTEEAQILAKYKPTVNATLLLPDLKEAEETITKALEYFKSTRHIVLYYEDL 249

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  LK+VQEFLRL YR+L+SRQVKIHSG L KQV NWDDVQKALKGTSYE  LH+DY
Sbjct: 250 INNRTKLKDVQEFLRLPYRDLSSRQVKIHSGNLPKQVGNWDDVQKALKGTSYEGMLHADY 309

Query: 288 RV 289
           ++
Sbjct: 310 QM 311


>gi|225458537|ref|XP_002284367.1| PREDICTED: uncharacterized protein LOC100252224 [Vitis vinifera]
          Length = 344

 Score =  457 bits (1175), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 220/302 (72%), Positives = 249/302 (82%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQTS +TK K L  +VI+Q   S  I+  EIP++HYP P T +REEC CN VRYFAI
Sbjct: 43  ICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             I +RR NISSI+STMD+VYNLDWF+SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWFSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C AAVGFKWM+NQGLMEHH DI +YFK KGVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 163 CLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA+ILAKYKPT+NATLL+P+LK+ E+T  KA+EYFKSTRHIV YYEDL
Sbjct: 223 NGTHKSHVHSTEEAQILAKYKPTVNATLLLPDLKEAEETITKALEYFKSTRHIVLYYEDL 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  LK+VQEFLRL YR+L+SRQVKIHSG L KQV NWDDVQKALKGTSYE  LH+DY
Sbjct: 283 INNRTKLKDVQEFLRLPYRDLSSRQVKIHSGNLPKQVGNWDDVQKALKGTSYEGMLHADY 342

Query: 288 RV 289
           ++
Sbjct: 343 QM 344


>gi|147792848|emb|CAN68800.1| hypothetical protein VITISV_008807 [Vitis vinifera]
          Length = 647

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 219/306 (71%), Positives = 248/306 (81%), Gaps = 17/306 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQTS +TK K L  +VI+Q   S  I+  EIP++HYP P T +REEC CN VRYFAI
Sbjct: 342 ICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAI 401

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             I +RR NISSI+STMD+VYNLDW +SASKNE
Sbjct: 402 LSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWXSSASKNE 461

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C AAVGFKWM+NQGLMEHH DI +YFK KGVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 462 CLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 521

Query: 168 NGTHKSHVHSPAE----AKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFY 223
           NGTHKSHVHS  E    A+ILAKYKPT+NATLL+P+LK+ E+T  KA+EYFKSTRHIV Y
Sbjct: 522 NGTHKSHVHSTEEILFQAQILAKYKPTVNATLLLPDLKEAEETITKALEYFKSTRHIVLY 581

Query: 224 YEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
           YEDLI NR  LK+VQEFLRL YR+L+SRQVKIHSG L KQV NWDDVQKALKGTSYE  L
Sbjct: 582 YEDLINNRTKLKDVQEFLRLPYRDLSSRQVKIHSGNLPKQVGNWDDVQKALKGTSYEGML 641

Query: 284 HSDYRV 289
           H+DY++
Sbjct: 642 HADYQM 647


>gi|224137390|ref|XP_002327114.1| predicted protein [Populus trichocarpa]
 gi|222835429|gb|EEE73864.1| predicted protein [Populus trichocarpa]
          Length = 344

 Score =  446 bits (1148), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 205/301 (68%), Positives = 243/301 (80%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ SPHT  K LNI++ DQPC S+N+++ E P++HYP P TF+REECACNPVR+FAI
Sbjct: 43  ICIKQISPHTTAKFLNIRIFDQPCNSSNVEEWEKPYVHYPKPETFSREECACNPVRFFAI 102

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
            SMQRSGSGWFETLLN              + RR ++S+I  T+DRVYNLDWF+SASKNE
Sbjct: 103 FSMQRSGSGWFETLLNSHINVSSNGEIFGKRARRASVSAITQTLDRVYNLDWFSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVGFKWM+NQG+MEHH  IAEYFK+ GV AIFL RRNLLRRMIS+LANSYD++ K L
Sbjct: 163 CNAAVGFKWMLNQGVMEHHEGIAEYFKQNGVHAIFLFRRNLLRRMISVLANSYDKSNKPL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA++LAKY+PTINAT LI ELK ++    +AI+YFKSTRH V YYED+
Sbjct: 223 NGTHKSHVHSSMEAEVLAKYRPTINATTLIAELKHVDDRATRAIDYFKSTRHTVVYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  LKEVQ+FLRL YRELTSRQVKIHSG LSKQV+NWD++ K LKGT YE FLH DY
Sbjct: 283 VGNRTKLKEVQDFLRLPYRELTSRQVKIHSGHLSKQVQNWDEIHKVLKGTQYESFLHLDY 342

Query: 288 R 288
           +
Sbjct: 343 Q 343


>gi|449438424|ref|XP_004136988.1| PREDICTED: nodulation protein H-like isoform 2 [Cucumis sativus]
          Length = 346

 Score =  442 bits (1138), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 203/301 (67%), Positives = 243/301 (80%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S  +K   + ++VID PC    I  S+ PF+H+P P T++R ECAC+PVRYFAI
Sbjct: 45  VCLKQISTRSKVGFMKVEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAI 104

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR NIS+IV T+D+VYNLDWFTSASKNE
Sbjct: 105 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNE 164

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGLM+HH +I EYFK++GVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 165 CTAAVGLKWMLNQGLMQHHEEIVEYFKRRGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 224

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA ILAKYKP INATLLIP LKQ+E TT KA+EYF STRHIV YYED+
Sbjct: 225 NGTHKSHVHSHHEADILAKYKPVINATLLIPNLKQVEDTTTKALEYFNSTRHIVMYYEDV 284

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L++VQ+FL++  R+L SRQVKIH GPLS Q++NW+DVQKAL+G+ YE FL++DY
Sbjct: 285 VKNRTKLRDVQDFLKVPQRKLKSRQVKIHRGPLSNQIDNWEDVQKALQGSRYETFLNADY 344

Query: 288 R 288
           R
Sbjct: 345 R 345


>gi|449529180|ref|XP_004171579.1| PREDICTED: nodulation protein H-like [Cucumis sativus]
          Length = 344

 Score =  442 bits (1138), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 203/301 (67%), Positives = 243/301 (80%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S  +K   + ++VID PC    I  S+ PF+H+P P T++R ECAC+PVRYFAI
Sbjct: 43  VCLKQISTRSKVGFMKVEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR NIS+IV T+D+VYNLDWFTSASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGLM+HH +I EYFK++GVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 163 CTAAVGLKWMLNQGLMQHHEEIVEYFKRRGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA ILAKYKP INATLLIP LKQ+E TT KA+EYF STRHIV YYED+
Sbjct: 223 NGTHKSHVHSHHEADILAKYKPVINATLLIPNLKQVEDTTTKALEYFNSTRHIVMYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L++VQ+FL++  R+L SRQVKIH GPLS Q++NW+DVQKAL+G+ YE FL++DY
Sbjct: 283 VKNRTKLRDVQDFLKVPQRKLKSRQVKIHRGPLSNQIDNWEDVQKALQGSRYETFLNADY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|449438422|ref|XP_004136987.1| PREDICTED: nodulation protein H-like isoform 1 [Cucumis sativus]
          Length = 371

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 203/301 (67%), Positives = 243/301 (80%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S  +K   + ++VID PC    I  S+ PF+H+P P T++R ECAC+PVRYFAI
Sbjct: 70  VCLKQISTRSKVGFMKVEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAI 129

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR NIS+IV T+D+VYNLDWFTSASKNE
Sbjct: 130 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNE 189

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGLM+HH +I EYFK++GVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 190 CTAAVGLKWMLNQGLMQHHEEIVEYFKRRGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 249

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA ILAKYKP INATLLIP LKQ+E TT KA+EYF STRHIV YYED+
Sbjct: 250 NGTHKSHVHSHHEADILAKYKPVINATLLIPNLKQVEDTTTKALEYFNSTRHIVMYYEDV 309

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L++VQ+FL++  R+L SRQVKIH GPLS Q++NW+DVQKAL+G+ YE FL++DY
Sbjct: 310 VKNRTKLRDVQDFLKVPQRKLKSRQVKIHRGPLSNQIDNWEDVQKALQGSRYETFLNADY 369

Query: 288 R 288
           R
Sbjct: 370 R 370


>gi|356521486|ref|XP_003529386.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 344

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 195/301 (64%), Positives = 237/301 (78%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ    +K   L+IKV+ +PC   NI+  EIPF+HYP+P T++R ECAC+PVRYFAI
Sbjct: 43  ICLKQIGTSSKIGFLDIKVVQKPCPEPNIEPWEIPFVHYPHPKTYSRAECACHPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFET LN             +K RR N+S+I  T+D +YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHANISSNGEIFSVKVRRSNMSTITETLDTIYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+ AVG KWM+NQGLM+HH +IAEYF+  GVS IFL RRNLLRRM+S+LAN YDRNAKLL
Sbjct: 163 CTTAVGLKWMLNQGLMQHHEEIAEYFRIHGVSLIFLFRRNLLRRMVSVLANEYDRNAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILAKYKPTIN+TLLI +LKQ+ +TT KA+EYFKSTRHI+ YYED+
Sbjct: 223 NGTHKSHVHSPKEAEILAKYKPTINSTLLIAQLKQVNETTTKALEYFKSTRHIILYYEDI 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L++VQ+FL++   +L SRQVKIH G LS QVENW+D+ KAL GT YE F+H DY
Sbjct: 283 VKNRTKLRDVQDFLKVPQMDLKSRQVKIHKGSLSSQVENWNDISKALTGTPYESFIHEDY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|356500053|ref|XP_003518849.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 344

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 195/301 (64%), Positives = 236/301 (78%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ    +K   L+IKV+ +PC   NI+  EIP++HYPNP T++R ECAC+PVRYFAI
Sbjct: 43  ICLKQIGTSSKIGFLDIKVVQKPCPEPNIEPWEIPYVHYPNPKTYSRAECACHPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFET LN             +K RR N+S+I  T+D +YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHANISSNGEIFSVKVRRSNMSTITETLDTIYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+ AVG KWM+NQGLM+HH  IAEYF+  GVS IFL RRNLLRRM+S+LAN YDRNAK+L
Sbjct: 163 CTTAVGLKWMLNQGLMQHHEQIAEYFRIHGVSLIFLFRRNLLRRMVSVLANEYDRNAKIL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILAKYKPTIN+TLLI +LKQ+ +TT KA+EYFKSTRHI+ YYED+
Sbjct: 223 NGTHKSHVHSPQEAEILAKYKPTINSTLLIAQLKQVNETTTKALEYFKSTRHILLYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L++VQ+FLR+   +L SRQVKIH G LS QVENW+D+ KAL GT YE F+H DY
Sbjct: 283 VKNRTKLRDVQDFLRVPQMKLKSRQVKIHKGSLSSQVENWNDISKALTGTPYESFIHEDY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|356553925|ref|XP_003545301.1| PREDICTED: uncharacterized protein LOC100820591 [Glycine max]
          Length = 1057

 Score =  423 bits (1088), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 196/288 (68%), Positives = 234/288 (81%), Gaps = 13/288 (4%)

Query: 12   TKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWF 71
            +KLL ++VI+Q CQ +N+++ E+PFLHYP P T+NREECACNPVR+F IL+MQRSGSGWF
Sbjct: 770  SKLLELRVINQSCQLSNVEEWEVPFLHYPQPKTYNREECACNPVRFFTILTMQRSGSGWF 829

Query: 72   ETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMM 118
            ETLLN             +  RR N+SSI+ TMD V+NLDWF+ ASKNECSAAVG+KWM+
Sbjct: 830  ETLLNSHMNVSSNGEIFGVVKRRENVSSILKTMDVVFNLDWFSGASKNECSAAVGYKWML 889

Query: 119  NQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSP 178
            NQGLMEHH +I EYF+++ VS IFL RRNLLRRM+S+LANSYD+NAK LNGTHKSHVHS 
Sbjct: 890  NQGLMEHHKEIVEYFERRKVSTIFLFRRNLLRRMVSVLANSYDKNAKPLNGTHKSHVHST 949

Query: 179  AEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQ 238
             EA ILAKY+P IN TLL+ ELKQ E T AKAIEYF +TRHIV YYEDLI N   LK+VQ
Sbjct: 950  LEAGILAKYRPWINTTLLMTELKQTEDTAAKAIEYFNNTRHIVLYYEDLIKNSTKLKDVQ 1009

Query: 239  EFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
            EFLRL YR++ SRQVKIH+ PL KQ+ENWD+VQK L+GT+Y+ FL SD
Sbjct: 1010 EFLRLPYRDMHSRQVKIHTAPLWKQIENWDEVQKTLRGTAYQNFLFSD 1057


>gi|224105519|ref|XP_002313841.1| predicted protein [Populus trichocarpa]
 gi|222850249|gb|EEE87796.1| predicted protein [Populus trichocarpa]
          Length = 345

 Score =  423 bits (1088), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 197/301 (65%), Positives = 234/301 (77%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ    T    LN++VI++PC   NI+  EIP++HYP P+T++R EC CNPVRYFAI
Sbjct: 44  ICLKQIGIRTNPGFLNVEVIERPCPEPNIEPWEIPYVHYPKPITYSRVECKCNPVRYFAI 103

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+S+I  T+D++YNLDW +SASKNE
Sbjct: 104 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNVSTITETLDKIYNLDWLSSASKNE 163

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQG+M+HH +I EYFK +GVSAIFL RRNLLRRM+SILANSYDR  K L
Sbjct: 164 CAAAVGLKWMLNQGVMQHHEEIVEYFKTRGVSAIFLFRRNLLRRMVSILANSYDREVKPL 223

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILAKYKP IN TLLI  LKQ+E TTAKA+EYFKSTRHI+ YYED+
Sbjct: 224 NGTHKSHVHSPREAEILAKYKPLINTTLLISNLKQVEDTTAKALEYFKSTRHIILYYEDV 283

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + N   L +VQ+FL++  REL SRQVKIH G LS  VENWD+VQK+LKGT YE  L  DY
Sbjct: 284 VKNHTKLLDVQDFLKVPQRELKSRQVKIHKGSLSNYVENWDEVQKSLKGTHYENLLTGDY 343

Query: 288 R 288
           R
Sbjct: 344 R 344


>gi|255555521|ref|XP_002518797.1| conserved hypothetical protein [Ricinus communis]
 gi|223542178|gb|EEF43722.1| conserved hypothetical protein [Ricinus communis]
          Length = 368

 Score =  421 bits (1081), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 194/301 (64%), Positives = 235/301 (78%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S  T   + ++ VI++PC   NIQ  EIP++HYP P T++R EC CNPVR FAI
Sbjct: 43  ICLKQISISTNPGIFSLGVIEKPCPEPNIQPWEIPYVHYPKPKTYSRAECMCNPVRNFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+S IV T+D++YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNVSMIVETLDKIYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQG+M+HH +I EYFK +GVSAIFL RRNLLRRMIS+LANSYDR AKLL
Sbjct: 163 CTAAVGLKWMLNQGVMQHHEEIVEYFKSRGVSAIFLFRRNLLRRMISVLANSYDREAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILA+YKP++N +LL+  LKQ+EQTT KA+EYFKS RHI+ YYED+
Sbjct: 223 NGTHKSHVHSPDEAEILARYKPSVNTSLLLSSLKQVEQTTIKALEYFKSNRHIIVYYEDI 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + N   L EVQ+FLR+  +EL SRQVKIH G +S Q ENW ++QKALKGT YE FL+ DY
Sbjct: 283 VKNHTKLLEVQDFLRVPRKELKSRQVKIHKGSISSQAENWGEIQKALKGTHYESFLYGDY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|356564206|ref|XP_003550347.1| PREDICTED: uncharacterized protein LOC100819406 [Glycine max]
          Length = 330

 Score =  419 bits (1078), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 195/296 (65%), Positives = 234/296 (79%), Gaps = 13/296 (4%)

Query: 4   KQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSM 63
           K T   T +KLL ++VI+Q C  +++++ E+PFLHYP P T+NREECACNPVR+F IL+M
Sbjct: 35  KPTRIRTNSKLLELRVINQSCHPSSVEEWEVPFLHYPQPKTYNREECACNPVRFFVILTM 94

Query: 64  QRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSA 110
           QRSGSGWFETLLN             +  RR N+SSI+ TMD V+NLDWF+ ASKNECSA
Sbjct: 95  QRSGSGWFETLLNSHMNVSSNGEIFSVAKRRENVSSILMTMDEVFNLDWFSGASKNECSA 154

Query: 111 AVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGT 170
           AVG+KWM+NQGLMEHH +I EYF+++ VS IFL RRNLLRRM+S+L NSYD+ AK LNGT
Sbjct: 155 AVGYKWMLNQGLMEHHKEIGEYFERRRVSTIFLFRRNLLRRMVSVLENSYDKKAKPLNGT 214

Query: 171 HKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN 230
           HKSHVHS  EA ILAKY+P IN TLL+ EL Q E+T AKAIEYFK+TRHIV YYEDLI N
Sbjct: 215 HKSHVHSTLEAGILAKYRPWINTTLLMTELNQTEETAAKAIEYFKNTRHIVLYYEDLIKN 274

Query: 231 RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
              LK+VQEFLRL +R++ SRQVKIH+ PL KQ+ENWDDV K L+GTSY+ FL SD
Sbjct: 275 ATKLKDVQEFLRLPFRDMHSRQVKIHTAPLLKQIENWDDVYKTLRGTSYQNFLFSD 330


>gi|225428745|ref|XP_002285039.1| PREDICTED: nodulation protein H [Vitis vinifera]
 gi|297741312|emb|CBI32443.3| unnamed protein product [Vitis vinifera]
          Length = 344

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 191/302 (63%), Positives = 237/302 (78%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S       LN+++ ++PC+  NI+ SE+ ++H+P P++++R ECACNPVRYFAI
Sbjct: 43  ICLKQISTTATAGFLNVELTERPCEKPNIEPSEVRYVHFPKPISYSRAECACNPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LS QRSGSGWFETLLN             +K RR NISSI  T+D++YNLDW +SASKNE
Sbjct: 103 LSTQRSGSGWFETLLNSHINISSNGEIFSVKVRRSNISSITETLDKIYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGLM++H +I EYF  +GVS IFL RRNLLRRMIS+LANSYDR AK L
Sbjct: 163 CTAAVGLKWMLNQGLMQNHKEIVEYFNTRGVSLIFLFRRNLLRRMISMLANSYDREAKQL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILA+YKP+INATLLI +LKQ+E T AKA++YF STRH +FYYED+
Sbjct: 223 NGTHKSHVHSPREAEILAQYKPSINATLLIRDLKQVEDTVAKALQYFNSTRHTIFYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  L +V  FL++  R+L SRQVKIH G LS+QVENWD+V K LKGT YE FLH+DY
Sbjct: 283 IKNRTKLVDVLNFLKVPQRDLRSRQVKIHKGSLSEQVENWDEVLKTLKGTQYESFLHADY 342

Query: 288 RV 289
           ++
Sbjct: 343 QI 344


>gi|224094390|ref|XP_002310152.1| predicted protein [Populus trichocarpa]
 gi|222853055|gb|EEE90602.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  416 bits (1068), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 197/300 (65%), Positives = 234/300 (78%), Gaps = 13/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S H+K K+ +I+VI++P    + +  +I  +HYPNP TF+R ECA NPVRYFAI
Sbjct: 43  VCLKQISTHSKIKIQDIQVIERPSPDVDHENLQISSVHYPNPETFSRAECAHNPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +  RRINISSI  T+D+VYNLDWFTSASKNE
Sbjct: 103 LSMQRSGSGWFETLLNSHVNVSSNGEIFSVLDRRINISSITQTLDKVYNLDWFTSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAAVGFKWM+NQG+M+HH +IA+YF ++GVSAIFL RRNLLRRM+S+LANSYDR+AKLL
Sbjct: 163 CSAAVGFKWMLNQGVMQHHKEIADYFNRRGVSAIFLFRRNLLRRMVSVLANSYDRHAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA+ LAKYKP IN+TLLI +LK++E T  KA+EYF STRH V YYEDL
Sbjct: 223 NGTHKSHVHSTEEAETLAKYKPMINSTLLISDLKEVEITATKALEYFNSTRHTVLYYEDL 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I N   LK+VQ FL L   EL SRQVKIH GPLS  V+NW+D+ K L GT+YE FL +DY
Sbjct: 283 IKNPTKLKDVQAFLGLPVMELMSRQVKIHKGPLSDHVKNWEDINKTLNGTAYESFLQADY 342


>gi|224123470|ref|XP_002330322.1| predicted protein [Populus trichocarpa]
 gi|222871357|gb|EEF08488.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score =  416 bits (1068), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 195/300 (65%), Positives = 230/300 (76%), Gaps = 13/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ    T    LN++VI++PC   NI+  EIP++HYP P T++R EC CNPVRYFAI
Sbjct: 34  ICLKQIGIRTNPGFLNVEVIERPCPEPNIEPWEIPYVHYPRPKTYSRAECKCNPVRYFAI 93

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+S+I  T+D++YNLDW +SASKNE
Sbjct: 94  LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNVSTITETLDKIYNLDWSSSASKNE 153

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQG+M HH +I EYFK +GVSAIFL RRNLLRRMISILANSYDR+ K L
Sbjct: 154 CTAAVGLKWMLNQGVMRHHEEIVEYFKTRGVSAIFLFRRNLLRRMISILANSYDRDVKPL 213

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILAKYK  IN TLLIP LKQ+E TTAKA+EYFKS RHI+ YYED+
Sbjct: 214 NGTHKSHVHSPREAEILAKYKSLINTTLLIPSLKQVEDTTAKALEYFKSARHIILYYEDV 273

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L +VQ+FL++  REL S QVKIH G L   VENWD+VQK+LKGT YE  L  DY
Sbjct: 274 VKNRTKLLDVQDFLKVPRRELKSHQVKIHKGSLLNYVENWDEVQKSLKGTHYESLLSGDY 333


>gi|79556423|ref|NP_179175.3| nodulation protein-like protein [Arabidopsis thaliana]
 gi|40823013|gb|AAR92253.1| At2g15730 [Arabidopsis thaliana]
 gi|45752690|gb|AAS76243.1| At2g15730 [Arabidopsis thaliana]
 gi|330251339|gb|AEC06433.1| nodulation protein-like protein [Arabidopsis thaliana]
          Length = 344

 Score =  416 bits (1068), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 189/301 (62%), Positives = 236/301 (78%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ         LN++V ++PC   NIQ  +IP++HYP P T++REEC+CNPVRYFAI
Sbjct: 43  ICLKQIGVVPSAGFLNVEVFERPCPEPNIQPWDIPYVHYPKPKTYSREECSCNPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+S+I  T+D+VYNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C++AVG KWM+NQGLM++H +I EYFK +GVSAIFL RRNLLRRMIS+LANSYDR+AK L
Sbjct: 163 CTSAVGLKWMLNQGLMKNHEEIVEYFKTRGVSAIFLFRRNLLRRMISVLANSYDRDAKPL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILA+YKP IN +LLIP+LKQ+++ T+KA+ YF +TRHI  YYED+
Sbjct: 223 NGTHKSHVHSPKEAEILARYKPLINTSLLIPDLKQVQEMTSKALAYFNTTRHIFLYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L +VQEFL++   +L SRQVKIH GPLS+ V+NW++VQK LKGT +E FL  DY
Sbjct: 283 VKNRTKLDDVQEFLKVPKLDLKSRQVKIHHGPLSQHVQNWEEVQKTLKGTGFENFLLEDY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|356509720|ref|XP_003523594.1| PREDICTED: uncharacterized protein LOC100808615 [Glycine max]
          Length = 336

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 192/300 (64%), Positives = 234/300 (78%), Gaps = 14/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPC-QSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFA 59
           +C +Q   HT  KL +I V  QPC   + +++ E+ +LHYP P  ++REECACNPV +F 
Sbjct: 37  ICLEQKGVHTSAKLFDITVFKQPCFNHSGVEEWELSYLHYPEPKIYSREECACNPVLFFC 96

Query: 60  ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
           I+SMQRSGSGWFET LN             +  RR N+SSI+ TMDRVYNLDWF+SASKN
Sbjct: 97  IVSMQRSGSGWFETFLNSHINVSSNGEIFSVGKRRANVSSILETMDRVYNLDWFSSASKN 156

Query: 107 ECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKL 166
           EC+AAVGFKWM+NQGL+EHH +I EYF+++ VS IFL RRNLLRRMIS+LANSYD++AK 
Sbjct: 157 ECTAAVGFKWMLNQGLVEHHEEIVEYFERRRVSVIFLFRRNLLRRMISVLANSYDKDAKP 216

Query: 167 LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED 226
           LNGTHKSHVHS  EA ILAKY+P +N TLLIPEL Q ++T AKAI YFK+TRHIV YYED
Sbjct: 217 LNGTHKSHVHSAVEANILAKYRPRLNITLLIPELNQTKETAAKAIAYFKNTRHIVLYYED 276

Query: 227 LITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
           L+ NR  LK+VQEFLR+ YR+L SRQVKIH+  LSKQ+ENW++VQ+AL GT Y+RFL  D
Sbjct: 277 LVNNRTKLKDVQEFLRVPYRDLESRQVKIHTAALSKQIENWNEVQEALTGTPYQRFLFKD 336


>gi|357465451|ref|XP_003603010.1| hypothetical protein MTR_3g101380 [Medicago truncatula]
 gi|355492058|gb|AES73261.1| hypothetical protein MTR_3g101380 [Medicago truncatula]
          Length = 388

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 194/303 (64%), Positives = 233/303 (76%), Gaps = 17/303 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPC-QSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFA 59
           +C +     T +K L + V  QPC   +N+Q+ EIP+LHYP P T++REEC+CNPVRYF 
Sbjct: 86  ICFELKGVRTYSKFLGVSVFTQPCPHPSNVQEWEIPYLHYPEPKTYSREECSCNPVRYFC 145

Query: 60  ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
           I+SMQRSGSGWFET LN             ++ RR N+S I+ T++ VYNLDWFTSASKN
Sbjct: 146 IVSMQRSGSGWFETFLNSHINVSSNGEIFSVEKRRENVSLILKTINEVYNLDWFTSASKN 205

Query: 107 ECSAAVGFKWMMNQ---GLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRN 163
           ECSAA+G+KWM+NQ   GLM+HH +I EY ++K VS IFL RRNLLRRM+S+LANSYD++
Sbjct: 206 ECSAAIGYKWMLNQVGRGLMDHHKEIVEYLEQKQVSTIFLFRRNLLRRMVSVLANSYDKD 265

Query: 164 AKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFY 223
            KLLNGTHKSHVHS  EA ILAKYKP IN TLLIPELK+  +T AKAI YFKS RHIV Y
Sbjct: 266 VKLLNGTHKSHVHSTMEATILAKYKPKINVTLLIPELKKTNETIAKAISYFKSIRHIVLY 325

Query: 224 YEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
           YEDL+ NR  LK+VQEFL++ YREL SRQVKIH+ PLS Q+ENWD+VQK L GTSYE FL
Sbjct: 326 YEDLVNNRTKLKDVQEFLKVPYRELESRQVKIHTAPLSMQIENWDEVQKTLTGTSYESFL 385

Query: 284 HSD 286
           + D
Sbjct: 386 YKD 388


>gi|357475463|ref|XP_003608017.1| Nodulation protein H [Medicago truncatula]
 gi|355509072|gb|AES90214.1| Nodulation protein H [Medicago truncatula]
          Length = 344

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 193/301 (64%), Positives = 229/301 (76%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S  ++   L+I VI +PC   NI+  EIP++HYPNP T++REEC C+PVRYF I
Sbjct: 43  ICLKQISTGSRIGFLDINVIQKPCPEPNIEPWEIPYVHYPNPKTYSREECRCHPVRYFTI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFET LN             +K RR NI++I  T+D +YNLDWF SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHPNISSNGEIFSVKVRRSNITTITETLDTIYNLDWFNSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGLM+HH  IAEYF+  GVS IFL R+NLLRRMIS+LAN YD+NAK+L
Sbjct: 163 CTAAVGLKWMLNQGLMQHHEQIAEYFRIHGVSVIFLFRKNLLRRMISVLANEYDKNAKIL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHSP EA+ILAKYKPT+N+TLLI  LKQ+  TT KA+E FKSTRHI+ YYED+
Sbjct: 223 NGTHKSHVHSPKEAEILAKYKPTLNSTLLIKNLKQVNDTTTKALESFKSTRHIMLYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L +V EFL++    L SRQVKIH G LS QVENW+DV K L GT YE FLH DY
Sbjct: 283 VKNRTKLMDVLEFLKVPQVNLKSRQVKIHKGSLSSQVENWNDVTKTLNGTQYESFLHEDY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|224084324|ref|XP_002307259.1| predicted protein [Populus trichocarpa]
 gi|222856708|gb|EEE94255.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 198/300 (66%), Positives = 233/300 (77%), Gaps = 13/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQTS H+K K  +I+V+++     +    +I  +HYPNP TF+R ECA NPVRYFAI
Sbjct: 43  VCLKQTSIHSKIKFQDIQVVERLSPDDDHGNLQISSVHYPNPETFSRAECAHNPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +  RR NISSI  T+D+VYNLDWFTSASKNE
Sbjct: 103 LSMQRSGSGWFETLLNSHVNVSSNGEIFSVLDRRRNISSITRTLDKVYNLDWFTSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAAVGFKWM+NQG+M+HH +I++YF ++GV AIFL RRNLLRRM+S+LANSYDR+AKLL
Sbjct: 163 CSAAVGFKWMLNQGVMQHHKEISDYFNRRGVFAIFLFRRNLLRRMVSVLANSYDRHAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EAK LAKYKP IN+TLLI +LK++E T AKA+EYF STRHIV YYEDL
Sbjct: 223 NGTHKSHVHSTEEAKTLAKYKPMINSTLLISDLKEVEITAAKALEYFNSTRHIVLYYEDL 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I N A LK+VQ FL L   EL SRQVKIH GPLS  V+NW+DV K L GT+YE FL +DY
Sbjct: 283 IKNPAKLKDVQAFLGLPLMELMSRQVKIHKGPLSDHVKNWEDVNKTLNGTAYESFLQADY 342


>gi|297832068|ref|XP_002883916.1| hypothetical protein ARALYDRAFT_480426 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329756|gb|EFH60175.1| hypothetical protein ARALYDRAFT_480426 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 344

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 186/301 (61%), Positives = 234/301 (77%), Gaps = 13/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ         LN++V ++ C   NI+  +IP++HYP P T++REEC+CNPVRYFAI
Sbjct: 43  ICLKQIGVVPSAGFLNVEVFERQCPEPNIEPWDIPYVHYPKPKTYSREECSCNPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+++I  T+D+VYNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVTTIFETLDKVYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C++AVG KWM+NQGLM+HH +I EYFK +GVSAIFL RRNLLRRMIS+LANSYDR+AK L
Sbjct: 163 CTSAVGLKWMLNQGLMKHHEEIVEYFKTRGVSAIFLFRRNLLRRMISVLANSYDRDAKPL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           +GTHKSHVHSP EA+ILA+YKP IN TLLIP+LKQ+++ T+KA+ YF +TRHI  YYED+
Sbjct: 223 HGTHKSHVHSPKEAEILARYKPLINTTLLIPDLKQVQEMTSKALAYFNTTRHIFLYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + NR  L +VQEFL++   +L SRQVKIH GPLSK V+NW++VQ  LKGT ++ FL  DY
Sbjct: 283 VKNRTKLDDVQEFLKVPKLDLKSRQVKIHHGPLSKHVQNWEEVQTTLKGTGFKNFLLEDY 342

Query: 288 R 288
           R
Sbjct: 343 R 343


>gi|334187156|ref|NP_195168.3| P-loop containing nucleoside triphosphate hydrolase family
           protein-like protein [Arabidopsis thaliana]
 gi|332660972|gb|AEE86372.1| P-loop containing nucleoside triphosphate hydrolase family
           protein-like protein [Arabidopsis thaliana]
          Length = 342

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 185/300 (61%), Positives = 232/300 (77%), Gaps = 13/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ         LN++V ++PC   NI+  +IPF+HYP P T+NR+EC+C+PVRYFAI
Sbjct: 43  ICLKQIGVSPNYGFLNVEVFERPCPEPNIEPWDIPFVHYPKPKTYNRDECSCHPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+S+I  T+D+VYNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C++AVGFKWM+NQGLM+HH +I EYFK +GVSAIFL R+NLLRRMIS+LANSYDR+AKLL
Sbjct: 163 CTSAVGFKWMLNQGLMKHHEEIVEYFKTRGVSAIFLFRKNLLRRMISVLANSYDRDAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSH HS  EA+ILA YKP IN TLLI EL+QI++ T KA+ YF +TRHI+ YYED+
Sbjct: 223 NGTHKSHTHSAKEAEILAGYKPMINTTLLINELRQIQEMTLKALTYFNTTRHILVYYEDV 282

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + N   L +VQEFL++  R+L SRQVKIH G LSK V+NW++V   LKGT++E FLH ++
Sbjct: 283 VKNLTRLDDVQEFLKVPKRKLKSRQVKIHGGSLSKHVQNWEEVMTTLKGTNFENFLHQEF 342


>gi|225438662|ref|XP_002281772.1| PREDICTED: uncharacterized protein LOC100254393 isoform 1 [Vitis
           vinifera]
          Length = 340

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 189/300 (63%), Positives = 232/300 (77%), Gaps = 15/300 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S HTK K  N  V+++P    +++  EIP +HYP P TF+R ECA NPVR+FAI
Sbjct: 43  ICLKQISIHTKAKFENFWVVERPSDDHSMK--EIPNVHYPKPRTFSRAECANNPVRFFAI 100

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S+QRSGSGWFETLLN             +  RR NISSIV T+D++YNLDWF+SASKNE
Sbjct: 101 VSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISSIVKTLDKIYNLDWFSSASKNE 160

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAAVGFKWM+NQGLM+HH +I +YFK + VS IFL RRNLLRRM+S+LANSYDR A++L
Sbjct: 161 CSAAVGFKWMLNQGLMKHHKEIVDYFKNRDVSTIFLFRRNLLRRMVSVLANSYDREARVL 220

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA +L+ YKPTINAT LI +L+Q+E+T AKA+EYF STRHI+ YYEDL
Sbjct: 221 NGTHKSHVHSTDEADMLSNYKPTINATTLITDLQQMEKTVAKALEYFNSTRHIILYYEDL 280

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR   +++ EFL+L + +L SRQVKIH G LS  ++NWDDV K LKGTSYE FL +DY
Sbjct: 281 IKNRTKTEDILEFLKLPHMDLASRQVKIHKGELSIHIKNWDDVNKTLKGTSYEHFLQADY 340


>gi|297802526|ref|XP_002869147.1| hypothetical protein ARALYDRAFT_912948 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314983|gb|EFH45406.1| hypothetical protein ARALYDRAFT_912948 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 490

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 183/300 (61%), Positives = 232/300 (77%), Gaps = 13/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ         LN++V ++PC   NI+  +I F+HYP P T+NR+EC+C+PVRYFAI
Sbjct: 191 ICLKQIGVSPNYGFLNVEVFERPCPEPNIEPWDILFVHYPKPKTYNRDECSCHPVRYFAI 250

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFETLLN             +K RR N+S+I  T+D+VYNLDW +SASKNE
Sbjct: 251 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNE 310

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C++AVGFKWM+NQGLM+HH +I EYFK +GVSAIFL R+NLLRRMIS+LANSYDR+AKLL
Sbjct: 311 CTSAVGFKWMLNQGLMKHHEEIVEYFKTRGVSAIFLFRKNLLRRMISVLANSYDRDAKLL 370

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSH HS  EA+ILA YKP IN TLLI ELKQI++ T KA++YF +TRHI+ YYED+
Sbjct: 371 NGTHKSHTHSAKEAEILAGYKPMINTTLLINELKQIQEMTLKALDYFNTTRHILVYYEDV 430

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + N   L +VQ+FL++  R+L SRQVKIH G LS+ V+NW++VQ  LKGT++E FL  ++
Sbjct: 431 VKNLTRLDDVQKFLKVPKRKLKSRQVKIHGGSLSQHVQNWEEVQTTLKGTNFENFLRQEF 490


>gi|359480689|ref|XP_003632512.1| PREDICTED: uncharacterized protein LOC100254393 isoform 2 [Vitis
           vinifera]
 gi|296082441|emb|CBI21446.3| unnamed protein product [Vitis vinifera]
          Length = 362

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 191/320 (59%), Positives = 232/320 (72%), Gaps = 33/320 (10%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQ--------------------QSEIPFLHYP 40
           +C KQ S HTK K  N  V+++P    +IQ                      EIP +HYP
Sbjct: 43  ICLKQISIHTKAKFENFWVVERPSDDHSIQIPQISTFIGDKNSCYVNGFNMKEIPNVHYP 102

Query: 41  NPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSI 87
            P TF+R ECA NPVR+FAI+S+QRSGSGWFETLLN             +  RR NISSI
Sbjct: 103 KPRTFSRAECANNPVRFFAIVSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISSI 162

Query: 88  VSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRN 147
           V T+D++YNLDWF+SASKNECSAAVGFKWM+NQGLM+HH +I +YFK + VS IFL RRN
Sbjct: 163 VKTLDKIYNLDWFSSASKNECSAAVGFKWMLNQGLMKHHKEIVDYFKNRDVSTIFLFRRN 222

Query: 148 LLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTT 207
           LLRRM+S+LANSYDR A++LNGTHKSHVHS  EA +L+ YKPTINAT LI +L+Q+E+T 
Sbjct: 223 LLRRMVSVLANSYDREARVLNGTHKSHVHSTDEADMLSNYKPTINATTLITDLQQMEKTV 282

Query: 208 AKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENW 267
           AKA+EYF STRHI+ YYEDLI NR   +++ EFL+L + +L SRQVKIH G LS  ++NW
Sbjct: 283 AKALEYFNSTRHIILYYEDLIKNRTKTEDILEFLKLPHMDLASRQVKIHKGELSIHIKNW 342

Query: 268 DDVQKALKGTSYERFLHSDY 287
           DDV K LKGTSYE FL +DY
Sbjct: 343 DDVNKTLKGTSYEHFLQADY 362


>gi|356565626|ref|XP_003551040.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 350

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 187/303 (61%), Positives = 234/303 (77%), Gaps = 17/303 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQ---QSEIPFLHYPNPVTFNREECACNPVRY 57
           +C KQ S H +T L+ ++ I++P +S++ +   ++++P LHYP PV+FNR ECA NPVR+
Sbjct: 47  VCLKQISTHARTMLMELQ-IEKPSRSSSSRLNLKNDVPLLHYPKPVSFNRSECAGNPVRF 105

Query: 58  FAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSAS 104
           FAILS QRSGSGWFETLLN             ++ RR+N+S+IV T+D+VYNLDW +SAS
Sbjct: 106 FAILSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRVNVSTIVQTLDKVYNLDWLSSAS 165

Query: 105 KNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNA 164
           KNECSAA+GFKWM+NQGLMEH  +IA+YF  + VS IFL RRNLLRRM+S+LANSYDR A
Sbjct: 166 KNECSAAIGFKWMLNQGLMEHPKEIADYFNSRSVSVIFLFRRNLLRRMVSMLANSYDRYA 225

Query: 165 KLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYY 224
           KLL+GTHKSHVHS  EA  L+KYKPTIN+T L+ +LK IE  TAK +EYF +TRH++ YY
Sbjct: 226 KLLSGTHKSHVHSTEEADTLSKYKPTINSTSLLADLKDIEMRTAKVLEYFNTTRHMILYY 285

Query: 225 EDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH 284
           EDL+ N   LK+VQEFL L   ELTSRQVKIH GPLS  ++NWDDV + L GT YE FLH
Sbjct: 286 EDLMRNCTKLKDVQEFLGLPQMELTSRQVKIHRGPLSDHIKNWDDVNRTLTGTVYESFLH 345

Query: 285 SDY 287
           +DY
Sbjct: 346 ADY 348


>gi|357482975|ref|XP_003611774.1| hypothetical protein MTR_5g017700 [Medicago truncatula]
 gi|355513109|gb|AES94732.1| hypothetical protein MTR_5g017700 [Medicago truncatula]
          Length = 342

 Score =  393 bits (1010), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 187/301 (62%), Positives = 226/301 (75%), Gaps = 15/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIP-FLHYPNPVTFNREECACNPVRYFA 59
           +C KQ     +TK +++ VID   +S  ++Q+ IP  LHYP P++FNR ECA NPV +FA
Sbjct: 43  VCRKQIGTEVRTKFMDLNVIDNLTRSI-VKQTHIPDILHYPEPLSFNRNECAPNPVMFFA 101

Query: 60  ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
           ILS QRSGSGWFETLLN             ++ RR N SSI+ T+DRVYNLDW +SASKN
Sbjct: 102 ILSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRQNASSILQTLDRVYNLDWLSSASKN 161

Query: 107 ECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKL 166
           ECSAA+GFKWM+NQGLMEHHN+I EYFK++ VS IFL RRNLLRRM+S+LANSYDR AKL
Sbjct: 162 ECSAAIGFKWMLNQGLMEHHNEIEEYFKRRSVSVIFLFRRNLLRRMVSMLANSYDRYAKL 221

Query: 167 LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED 226
           LNGTHKSHVHS  EA IL+KYKP IN+T L+ +LK +E    KA++YF S+RH++ YYED
Sbjct: 222 LNGTHKSHVHSAEEADILSKYKPVINSTSLLVDLKDMETKAEKALKYFNSSRHMILYYED 281

Query: 227 LITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
           L+ N   L +VQEFL L   ELTSRQVKIH GPLS  ++NWDDV K L GT YE FL +D
Sbjct: 282 LMRNHTKLNDVQEFLGLPQMELTSRQVKIHKGPLSDHIQNWDDVNKTLTGTVYESFLEAD 341

Query: 287 Y 287
           Y
Sbjct: 342 Y 342


>gi|356514160|ref|XP_003525774.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 361

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 186/306 (60%), Positives = 231/306 (75%), Gaps = 20/306 (6%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQS------EIPFLHYPNPVTFNREECACNP 54
           +C KQ S H +T L+ ++ I++P +S+    +      ++P LHYP PV+FNR ECA NP
Sbjct: 55  VCLKQISTHARTMLMELQ-IEKPSRSSISSINRLNLTHDVPLLHYPEPVSFNRSECAGNP 113

Query: 55  VRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFT 101
           VR+FAILS QRSGSGWFETLLN             ++ RR+N+S+IV T+D+VYNLDW +
Sbjct: 114 VRFFAILSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRVNVSTIVQTLDKVYNLDWLS 173

Query: 102 SASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYD 161
           SASKNECSAA+G KWM+NQGLM+H  +IA+YF  + VS IFL RRNLLRRM+S+LANSYD
Sbjct: 174 SASKNECSAAIGSKWMLNQGLMDHPKEIADYFNSRSVSVIFLFRRNLLRRMVSMLANSYD 233

Query: 162 RNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIV 221
           R AKL +GTHKSHVHS  EA  L+KYKPTIN+T L+ +LK +E  TAKA+EYF +TRH++
Sbjct: 234 RYAKLFSGTHKSHVHSKEEADTLSKYKPTINSTSLLADLKDMEMRTAKALEYFNTTRHMI 293

Query: 222 FYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYER 281
            YYEDLI NR  LK+VQEFL L   ELTSRQVKIH GPLS  ++NWDDV + L GT YE 
Sbjct: 294 LYYEDLIQNRTKLKDVQEFLGLPQMELTSRQVKIHRGPLSDHIKNWDDVNRTLAGTIYES 353

Query: 282 FLHSDY 287
           FLH+DY
Sbjct: 354 FLHADY 359


>gi|42565806|ref|NP_190631.2| nodulation-related protein [Arabidopsis thaliana]
 gi|45773806|gb|AAS76707.1| At3g50620 [Arabidopsis thaliana]
 gi|114050671|gb|ABI49485.1| At3g50620 [Arabidopsis thaliana]
 gi|332645165|gb|AEE78686.1| nodulation-related protein [Arabidopsis thaliana]
          Length = 340

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 183/282 (64%), Positives = 220/282 (78%), Gaps = 16/282 (5%)

Query: 22  QPCQSTNIQQSEIPF---LHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-- 76
           Q  Q++ I    + F   +HYP P TFNR EC  NPVRYFAILSMQRSGSGWFETLLN  
Sbjct: 59  QLVQTSPIDSHSLRFVTRIHYPKPQTFNRAECGHNPVRYFAILSMQRSGSGWFETLLNSH 118

Query: 77  -----------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEH 125
                      +  RR NISSI+ T+DRVYNLDWFTSASKNECSAA+GFKWM+NQGL+E+
Sbjct: 119 NNVSSNGEIFSVLDRRKNISSIIQTLDRVYNLDWFTSASKNECSAAIGFKWMLNQGLLEN 178

Query: 126 HNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILA 185
           H DI EYF ++GVSAIFL RRN LRRM+S+LANSYDR AKLLNGTHKSHVHSPAEA  L+
Sbjct: 179 HKDIVEYFNRRGVSAIFLFRRNPLRRMVSVLANSYDRYAKLLNGTHKSHVHSPAEADALS 238

Query: 186 KYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAY 245
           +YKP IN+T LI +L++ E + AKA+EYF +TRHIV +YEDLITN+ TLK+VQEFL +  
Sbjct: 239 RYKPVINSTSLIHDLQETENSAAKALEYFNTTRHIVVFYEDLITNQTTLKQVQEFLNIPV 298

Query: 246 RELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           ++L+SRQVKIH G LS  ++NW+D+ K L GT YE+FL +DY
Sbjct: 299 KDLSSRQVKIHRGDLSDHIKNWEDINKTLNGTEYEKFLRADY 340


>gi|356496876|ref|XP_003517291.1| PREDICTED: uncharacterized protein LOC100789437 [Glycine max]
          Length = 341

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 186/300 (62%), Positives = 222/300 (74%), Gaps = 14/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S   +T  ++ KVID   QS  ++      LHYP P +F+R ECA NPV +FAI
Sbjct: 43  VCLKQISTQARTTFMDFKVIDNHSQSI-LKLMNTHLLHYPKPSSFSRNECAHNPVLFFAI 101

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LS QRSGSGWFETLLN             ++ RR N+SSI+ T+D+VYNLDWF SASKNE
Sbjct: 102 LSNQRSGSGWFETLLNSHVNVSSYGEIFSVRERRQNVSSILLTLDKVYNLDWFNSASKNE 161

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAA G KWM+NQGL+EHH +IAEYF  + VS IFL RRNLLRRM+S+LANSYDR AKLL
Sbjct: 162 CSAATGLKWMLNQGLVEHHKEIAEYFNHRRVSIIFLFRRNLLRRMVSMLANSYDRYAKLL 221

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVHS  EA+IL+KYKP IN+T L+ +LK +E  +AKA+EYF STRHI+ YYEDL
Sbjct: 222 NGTHKAHVHSAEEAEILSKYKPIINSTSLLDDLKDMEMRSAKALEYFNSTRHIIVYYEDL 281

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + N   LK+VQEFL L   ELTSR VKIH GPLS  ++NWDDV K LKGT YE FL +DY
Sbjct: 282 MRNHTKLKDVQEFLGLPQMELTSRHVKIHRGPLSDHIQNWDDVNKTLKGTIYESFLEADY 341


>gi|297819744|ref|XP_002877755.1| hypothetical protein ARALYDRAFT_485408 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323593|gb|EFH54014.1| hypothetical protein ARALYDRAFT_485408 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 340

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 179/282 (63%), Positives = 218/282 (77%), Gaps = 16/282 (5%)

Query: 22  QPCQSTNIQQSEIPF---LHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-- 76
           Q  Q+T      + F   +HYP P TFNR EC  NPVRYFAILSMQRSGSGWFETLLN  
Sbjct: 59  QLVQTTPFDSHSLGFVTRIHYPKPQTFNRAECGHNPVRYFAILSMQRSGSGWFETLLNSH 118

Query: 77  -----------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEH 125
                      +  RR NISSI+ T+DRVYNLDWFTSASKNECSAA+GFKWM+NQGL+++
Sbjct: 119 SNVSSNGEIFSVLDRRKNISSIIQTLDRVYNLDWFTSASKNECSAAIGFKWMLNQGLLDN 178

Query: 126 HNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILA 185
           H +I +YF ++GVSAIFL RRN LRRM+S+LANSYDR AKLLNGTHKSHVHSP EA  L+
Sbjct: 179 HKEIVDYFNRRGVSAIFLFRRNPLRRMVSVLANSYDRYAKLLNGTHKSHVHSPEEADALS 238

Query: 186 KYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAY 245
           +YKP IN+T LI +L++ E + AKA+EYF +TRHIV +YEDLITN+ TLK+VQEFL +  
Sbjct: 239 RYKPVINSTSLIHDLQETENSAAKALEYFNTTRHIVVFYEDLITNQTTLKQVQEFLNIPV 298

Query: 246 RELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           ++L+SRQVKIH G LS  ++NW+D+ K L GT YE+FL +DY
Sbjct: 299 KDLSSRQVKIHRGDLSDHIKNWEDINKTLNGTEYEKFLRADY 340


>gi|449448334|ref|XP_004141921.1| PREDICTED: uncharacterized protein LOC101217742 [Cucumis sativus]
          Length = 334

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 178/300 (59%), Positives = 225/300 (75%), Gaps = 15/300 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ + HT+ + L +++ D  C   +I  +++P  HYP P TF+R EC  NPVR+FAI
Sbjct: 37  ICVKQLNTHTRARFLRVRIAD--CPELSIGLTKVPREHYPRPKTFSRAECFNNPVRFFAI 94

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +SMQRSGSGWFE+LLN             +  RR NI++IV T+DR+YNLDW  SASKN+
Sbjct: 95  VSMQRSGSGWFESLLNSHVNVSSNGEVFSVLDRRRNITTIVQTLDRIYNLDWLNSASKNQ 154

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAAVGFKWM+NQGLM+HH +IAEYF ++GVS IFL RRNLLRR++S+LANSYDR AK+L
Sbjct: 155 CSAAVGFKWMLNQGLMQHHEEIAEYFNRRGVSTIFLFRRNLLRRVVSVLANSYDRYAKML 214

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA  L+KYKP IN+T LI EL+ +E T +K++EYF +TRH++ YYED+
Sbjct: 215 NGTHKSHVHSLEEANALSKYKPVINSTSLISELEGMEATISKSLEYFGNTRHLILYYEDI 274

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  LK+VQEFL L   EL SRQVKIH G LS  ++NW+DV+  L GT YE  LH+DY
Sbjct: 275 INNRTKLKDVQEFLNLPPMELKSRQVKIHKGHLSDHIKNWEDVKATLNGTVYEHLLHADY 334


>gi|449530259|ref|XP_004172113.1| PREDICTED: uncharacterized LOC101217742 [Cucumis sativus]
          Length = 340

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 178/300 (59%), Positives = 225/300 (75%), Gaps = 15/300 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ + HT+ + L +++ D  C   +I  +++P  HYP P TF+R EC  NPVR+FAI
Sbjct: 43  ICVKQLNTHTRARFLRVRIAD--CPELSIGLTKVPREHYPRPKTFSRAECFNNPVRFFAI 100

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +SMQRSGSGWFE+LLN             +  RR NI++IV T+DR+YNLDW  SASKN+
Sbjct: 101 VSMQRSGSGWFESLLNSHVNVSSNGEVFSVLDRRRNITTIVQTLDRIYNLDWLNSASKNQ 160

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAAVGFKWM+NQGLM+HH +IAEYF ++GVS IFL RRNLLRR++S+LANSYDR AK+L
Sbjct: 161 CSAAVGFKWMLNQGLMQHHEEIAEYFNRRGVSTIFLFRRNLLRRVVSVLANSYDRYAKML 220

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHKSHVHS  EA  L+KYKP IN+T LI EL+ +E T +K++EYF +TRH++ YYED+
Sbjct: 221 NGTHKSHVHSLEEANALSKYKPVINSTSLISELEGMEATISKSLEYFGNTRHLILYYEDI 280

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  LK+VQEFL L   EL SRQVKIH G LS  ++NW+DV+  L GT YE  LH+DY
Sbjct: 281 INNRTKLKDVQEFLNLPPMELKSRQVKIHKGHLSDHIKNWEDVKATLNGTVYEHLLHADY 340


>gi|356537631|ref|XP_003537329.1| PREDICTED: nodulation protein H-like [Glycine max]
          Length = 341

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 182/300 (60%), Positives = 221/300 (73%), Gaps = 14/300 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C  Q S   +T  ++ KVID   QS  ++      LHYP P +F+R EC  NPV +FAI
Sbjct: 43  VCLMQISTQARTTFMDFKVIDNHSQSI-LKLMNTHLLHYPKPASFSRNECVHNPVLFFAI 101

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LS QRSGSGWFETLLN             ++ RR N+SSI+ T+D+VYNLDWF SASKNE
Sbjct: 102 LSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRQNVSSILHTLDKVYNLDWFNSASKNE 161

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           CSAA G KWM+NQGLMEHH +IAEYF ++ VS IFL RRNLLRRM+S+LANSYDR AKLL
Sbjct: 162 CSAATGLKWMLNQGLMEHHKEIAEYFNRRRVSIIFLFRRNLLRRMVSMLANSYDRYAKLL 221

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVHS  EA+IL+KYKP IN+T L+ +L+ +E  +AKA+EYF STRH++ YYEDL
Sbjct: 222 NGTHKAHVHSAEEAEILSKYKPIINSTSLLDDLQDMEMRSAKALEYFYSTRHMIVYYEDL 281

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           + N   LK+VQ+FL L   ELTSR VKIH GPLS  ++NWDDV K LKGT YE FL +DY
Sbjct: 282 MRNHTKLKDVQDFLGLPQMELTSRHVKIHRGPLSDHIQNWDDVNKTLKGTIYESFLEADY 341


>gi|255538396|ref|XP_002510263.1| conserved hypothetical protein [Ricinus communis]
 gi|223550964|gb|EEF52450.1| conserved hypothetical protein [Ricinus communis]
          Length = 278

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 174/255 (68%), Positives = 206/255 (80%), Gaps = 13/255 (5%)

Query: 48  EECACNPVRYFAILSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRV 94
           EECACNPVRYFAIL+MQRSGSGWFETLLN              K RR N+S+I+ T+D+V
Sbjct: 24  EECACNPVRYFAILTMQRSGSGWFETLLNSHMNVSSNGEIFGKKERRANVSAIMKTLDKV 83

Query: 95  YNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMIS 154
           YNLDWF+SA+KNEC+AAVGFKWM+NQG++EHH  I EYF +KGV AIFL RRNLLRRM+S
Sbjct: 84  YNLDWFSSAAKNECNAAVGFKWMLNQGVLEHHKGIVEYFNEKGVHAIFLFRRNLLRRMVS 143

Query: 155 ILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYF 214
           ++ANSYD++ K LNGTHKSHVHS AEAK+LAKYKP IN T LI +LK ++   A+AI YF
Sbjct: 144 VIANSYDKSNKPLNGTHKSHVHSAAEAKVLAKYKPKINTTTLIADLKHVDDRAARAIGYF 203

Query: 215 KSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKAL 274
           KSTRHIV YYED++ N   LKEVQEF+RL YRELTSRQVKIHSG L  Q++N  +VQ+ L
Sbjct: 204 KSTRHIVVYYEDVVGNSTKLKEVQEFMRLPYRELTSRQVKIHSGNLLDQIQNSGEVQQVL 263

Query: 275 KGTSYERFLHSDYRV 289
           KGTSYE FL SDY++
Sbjct: 264 KGTSYESFLQSDYQM 278


>gi|218192981|gb|EEC75408.1| hypothetical protein OsI_11905 [Oryza sativa Indica Group]
          Length = 365

 Score =  363 bits (931), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 171/302 (56%), Positives = 220/302 (72%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ    + ++++ I+V +Q C  + I  SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 64  MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 123

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSI  T+D+VYNLDW +SASKNE
Sbjct: 124 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 183

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL RRNLLR+++S LAN++DR  K L
Sbjct: 184 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRRNLLRQLVSQLANNHDRYLKQL 243

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N T LI  LKQ++  T  A+E  KS  HI  YYEDL
Sbjct: 244 NGTHKAHVHTAYEASILARYKPRLNTTSLIRSLKQVDDYTRDALENLKSINHITIYYEDL 303

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  L +V +FL++   +L SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL++DY
Sbjct: 304 IRNRTKLLDVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 363

Query: 288 RV 289
           R+
Sbjct: 364 RI 365


>gi|242035555|ref|XP_002465172.1| hypothetical protein SORBIDRAFT_01g033350 [Sorghum bicolor]
 gi|241919026|gb|EER92170.1| hypothetical protein SORBIDRAFT_01g033350 [Sorghum bicolor]
          Length = 364

 Score =  360 bits (923), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 169/303 (55%), Positives = 225/303 (74%), Gaps = 14/303 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ      ++++ I+V++QPC  + +  SE+ F+HYP P+T++REEC CN VR+FAI
Sbjct: 62  MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFVHYPQPITYSREECKCNAVRFFAI 121

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSI++T+D+VYNLDW +SASKNE
Sbjct: 122 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIINTLDKVYNLDWNSSASKNE 181

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AA+GFKWM+NQGL+ +H D+ +YF ++GVSAIFL RRNLLR+++S LAN++DR  K L
Sbjct: 182 CTAAIGFKWMLNQGLVANHADVVDYFNQRGVSAIFLFRRNLLRQLVSQLANNHDRFLKQL 241

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N T LI +LKQ ++ T  AI+   +TRHI  YYED+
Sbjct: 242 NGTHKAHVHTKREAHILARYKPKLNTTSLIWQLKQADEYTRDAIQNLNNTRHITVYYEDI 301

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
           + NR  L +V +FL++  R L SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL+ +D
Sbjct: 302 VRNRTKLFDVLDFLKVPRRNLVSRHVKIHTRPLSEQIENWDEVYSALNGTQYEGFLNAAD 361

Query: 287 YRV 289
           Y V
Sbjct: 362 YVV 364


>gi|115453377|ref|NP_001050289.1| Os03g0394900 [Oryza sativa Japonica Group]
 gi|108708616|gb|ABF96411.1| nodulation protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113548760|dbj|BAF12203.1| Os03g0394900 [Oryza sativa Japonica Group]
 gi|215737279|dbj|BAG96208.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625064|gb|EEE59196.1| hypothetical protein OsJ_11141 [Oryza sativa Japonica Group]
          Length = 362

 Score =  359 bits (922), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 170/302 (56%), Positives = 219/302 (72%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ    + ++++ I+V +Q C  + I  SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 61  MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 120

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSI  T+D+VYNLDW +SASKNE
Sbjct: 121 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 180

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL R NLLR+++S LAN++DR  K L
Sbjct: 181 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRWNLLRQLVSQLANNHDRYLKQL 240

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N T LI  LKQ++  T  A+E  KS  HI  YYEDL
Sbjct: 241 NGTHKAHVHTAYEASILARYKPRLNTTSLIRSLKQVDDYTRDALENLKSINHITIYYEDL 300

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  L +V +FL++   +L SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL++DY
Sbjct: 301 IRNRTKLLDVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 360

Query: 288 RV 289
           R+
Sbjct: 361 RI 362


>gi|29824462|gb|AAP04177.1| unknown protein [Oryza sativa Japonica Group]
          Length = 384

 Score =  359 bits (921), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 170/302 (56%), Positives = 219/302 (72%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ    + ++++ I+V +Q C  + I  SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 83  MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 142

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSI  T+D+VYNLDW +SASKNE
Sbjct: 143 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 202

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL R NLLR+++S LAN++DR  K L
Sbjct: 203 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRWNLLRQLVSQLANNHDRYLKQL 262

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N T LI  LKQ++  T  A+E  KS  HI  YYEDL
Sbjct: 263 NGTHKAHVHTAYEASILARYKPRLNTTSLIRSLKQVDDYTRDALENLKSINHITIYYEDL 322

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  L +V +FL++   +L SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL++DY
Sbjct: 323 IRNRTKLLDVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 382

Query: 288 RV 289
           R+
Sbjct: 383 RI 384


>gi|357111946|ref|XP_003557771.1| PREDICTED: uncharacterized protein LOC100845427 [Brachypodium
           distachyon]
          Length = 357

 Score =  357 bits (915), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 168/302 (55%), Positives = 218/302 (72%), Gaps = 13/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ    + ++++ I+V +QPC  + I  SE+ F+HYP P  ++R+EC C PVR+FAI
Sbjct: 56  ICMKQLGTDSWSRMVKIEVAEQPCNKSLIPLSEVQFVHYPQPQNYSRKECMCTPVRFFAI 115

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSIV TMD+VYNLDW +SASKNE
Sbjct: 116 ISSQRSGSGWFETLLNSHINVSSNGEIFSNKERRNNISSIVKTMDKVYNLDWNSSASKNE 175

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVGFKWM+NQGL+ +H  + +YF ++GVS +FL+RRNLLR+M+S LAN++DR  K L
Sbjct: 176 CTAAVGFKWMLNQGLVANHAGVVDYFNQRGVSTVFLMRRNLLRQMVSQLANNHDRYIKQL 235

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NG HK+HVH+  EA ILA+YKP +N T LI  LKQ +  T  A+E  KS RH+  YYEDL
Sbjct: 236 NGKHKAHVHTRDEANILARYKPRLNTTTLIWSLKQADDYTRDALENLKSVRHMTMYYEDL 295

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           I NR  L +V +FL++  R+L SR VKIH+ PLS Q+ENWD+V  AL GT +E FL +DY
Sbjct: 296 IQNRTKLVDVLDFLKVPRRKLVSRHVKIHTKPLSDQIENWDEVYNALNGTQFESFLTADY 355

Query: 288 RV 289
           R+
Sbjct: 356 RI 357


>gi|293332934|ref|NP_001167842.1| hypothetical protein [Zea mays]
 gi|223944369|gb|ACN26268.1| unknown [Zea mays]
 gi|413955392|gb|AFW88041.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 360

 Score =  353 bits (905), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 219/301 (72%), Gaps = 14/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ      ++++ I+V++QPC  + +  SE+ F HYP P T++REEC CN VR+FAI
Sbjct: 59  MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSREECKCNAVRFFAI 118

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSIV+ MDRVY+LDW +SASKNE
Sbjct: 119 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIVNAMDRVYSLDWNSSASKNE 178

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AA+GFKWM+NQGL+ +H  + +YF ++GVSAIFL RRNLLR+++S LAN++DR  K L
Sbjct: 179 CTAAIGFKWMLNQGLVANHVAVVDYFNRRGVSAIFLFRRNLLRQLVSQLANNHDRFLKQL 238

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N + LI +LKQ ++ T  A+E   +TRHI  YYED+
Sbjct: 239 NGTHKAHVHTEHEAHILARYKPRLNTSSLIRKLKQADEYTRDALENLNNTRHITVYYEDI 298

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
           + NR  L +  +FL +  REL SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL+ +D
Sbjct: 299 VRNRTRLLDALDFLGVPRRELASRHVKIHTKPLSEQIENWDEVYSALNGTQYEGFLNAAD 358

Query: 287 Y 287
           Y
Sbjct: 359 Y 359


>gi|226500168|ref|NP_001142620.1| uncharacterized protein LOC100274889 [Zea mays]
 gi|195607388|gb|ACG25524.1| hypothetical protein [Zea mays]
          Length = 360

 Score =  352 bits (904), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 165/301 (54%), Positives = 222/301 (73%), Gaps = 14/301 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ   +  ++++ I+V++QPC  + +  SE+ F+HYP P T++REEC CN VR+FAI
Sbjct: 59  ICMKQLGSNGWSRIVKIEVVEQPCNKSTVPPSEVQFVHYPQPTTYSREECKCNAVRFFAI 118

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSIV+T+DRVY+LDW +SASKNE
Sbjct: 119 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIVNTLDRVYSLDWNSSASKNE 178

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AA+GFKWM+NQGL+ +H  + +YF ++GVSA+FL RRNLLR+++S LAN++DR  K L
Sbjct: 179 CTAAIGFKWMLNQGLVANHVAVVDYFNRRGVSAVFLFRRNLLRQLVSQLANNHDRFLKQL 238

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N + LI +LKQ ++ T  A+E   +TRHI  YYED+
Sbjct: 239 NGTHKAHVHTEHEAHILARYKPRLNTSSLIRQLKQADEYTRDALENLNNTRHITVYYEDI 298

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
           + NR  L +  +FL +  REL SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL+ +D
Sbjct: 299 VRNRTRLLDALDFLGVPRRELASRHVKIHTKPLSEQIENWDEVYSALNGTQYEGFLNAAD 358

Query: 287 Y 287
           Y
Sbjct: 359 Y 359


>gi|226496856|ref|NP_001145331.1| hypothetical protein [Zea mays]
 gi|195654733|gb|ACG46834.1| hypothetical protein [Zea mays]
 gi|414867126|tpg|DAA45683.1| TPA: hypothetical protein ZEAMMB73_734756 [Zea mays]
          Length = 360

 Score =  344 bits (883), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 164/303 (54%), Positives = 220/303 (72%), Gaps = 14/303 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ      ++++ I+V++QPC  +    SE  F+ YP P+T++REEC CN VR FAI
Sbjct: 58  MCMKQLGSDGWSRVVKIEVVEQPCNKSIAPLSEAQFVRYPQPITYSREECKCNAVRSFAI 117

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSI+ T+D+VYNLDW +SASKNE
Sbjct: 118 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIIDTLDKVYNLDWNSSASKNE 177

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AA+GFKWM+NQGL+ +H D+ +YF ++GVSAIFL RRNLLR+++S +AN++DR  K L
Sbjct: 178 CTAAIGFKWMLNQGLVANHADVVDYFNRRGVSAIFLFRRNLLRQLVSQVANNHDRLLKQL 237

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+Y+P +N T LI +LK+ ++ T  A+E   +TRH+  YYED+
Sbjct: 238 NGTHKAHVHTKREAHILARYRPRLNTTSLIWQLKRADEYTRDALENLNNTRHMSVYYEDV 297

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
           + NR  L +V +FL +  R+L SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL+ +D
Sbjct: 298 VRNRTKLLDVLDFLGVPRRKLVSRHVKIHTKPLSEQIENWDEVYSALNGTQYEGFLNAAD 357

Query: 287 YRV 289
           Y V
Sbjct: 358 YLV 360


>gi|255562595|ref|XP_002522303.1| conserved hypothetical protein [Ricinus communis]
 gi|223538381|gb|EEF39987.1| conserved hypothetical protein [Ricinus communis]
          Length = 329

 Score =  343 bits (879), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 173/294 (58%), Positives = 214/294 (72%), Gaps = 14/294 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S ++K K+ +I+VI++P  + + Q  +IP +HYP+P TFNR EC  NPVRYFAI
Sbjct: 43  VCLKQISTNSKIKIEDIQVIERPSDNESTQM-QIPIVHYPHPETFNRSECVHNPVRYFAI 101

Query: 61  LSMQRSGSGWFETLLNIKHRRINISS-------IVSTMDRVYNLDWFTSASKNECSAAVG 113
           LSMQRSGSGWFETLLN     +N+SS       I+     +  + ++T        A+  
Sbjct: 102 LSMQRSGSGWFETLLN---SHVNVSSNGEIFSFIIGFRRTLLTILYYTLPRIKRLMAS-- 156

Query: 114 FKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKS 173
           F  +M QGLMEHH  I EYF  +GVSAIFL RRNLLRRM+S+LANSYDR AKLLNGTHKS
Sbjct: 157 FLCLM-QGLMEHHEAIVEYFNSRGVSAIFLFRRNLLRRMVSVLANSYDRYAKLLNGTHKS 215

Query: 174 HVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRAT 233
           HVHS  EA+ L++YKP IN+TLLI +LK++E T A+A+EYF STRHI+ YYEDLI N   
Sbjct: 216 HVHSTEEAETLSRYKPIINSTLLISDLKEVEITAARALEYFNSTRHIILYYEDLIKNNTK 275

Query: 234 LKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           LK+VQEFL L   ELTSRQVKIH GPLS  ++NW+DV KAL GT+YE FL +DY
Sbjct: 276 LKDVQEFLGLPLMELTSRQVKIHKGPLSDHIKNWEDVNKALTGTAYESFLEADY 329


>gi|147833389|emb|CAN66236.1| hypothetical protein VITISV_022815 [Vitis vinifera]
          Length = 352

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 175/332 (52%), Positives = 214/332 (64%), Gaps = 66/332 (19%)

Query: 7   SPHTKTKLLNIKVIDQPCQSTNIQ--------------------QSEIPFLHYPNPVTFN 46
           S HTK K  N  V+++P    +IQ                      EIP +HYP P TF+
Sbjct: 36  SIHTKAKFENFWVVERPSDDHSIQIPQISTFIGDKNSCYVNGFNMKEIPNVHYPKPRTFS 95

Query: 47  REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
           R ECA NPVR+FAI+S+QRSGSGWFETLLN             +  RR NIS IV T+D+
Sbjct: 96  RAECANNPVRFFAIVSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISXIVKTLDK 155

Query: 94  VYNLDWFTSASKNECSAAVGFKWMMNQ------------------GLMEHHNDIAEYFKK 135
           +YNLDWF+SASKNECSAAVGFKWM+NQ                  GLM+HH +I +YFK 
Sbjct: 156 IYNLDWFSSASKNECSAAVGFKWMLNQKLGEVVFMLXVEISHKMMGLMKHHKEIVDYFKN 215

Query: 136 KGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATL 195
           + VS IFL RRNLLRRM+S+LANSYDR A+               A +L+ YKPTINAT 
Sbjct: 216 RDVSTIFLFRRNLLRRMVSVLANSYDREAR---------------ADMLSXYKPTINATT 260

Query: 196 LIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKI 255
           LI +L+Q+E+T AKA+EYF STRHI+ YYEDLI NR   +++ EFL+L + +L SRQVKI
Sbjct: 261 LITDLQQVEKTVAKALEYFNSTRHIILYYEDLIKNRTKTEDILEFLKLPHMDLASRQVKI 320

Query: 256 HSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           H G LS  ++NWDDV K LKGTSYE FL +DY
Sbjct: 321 HKGELSIHIKNWDDVNKTLKGTSYEHFLQADY 352


>gi|357125400|ref|XP_003564382.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
          Length = 359

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 155/278 (55%), Positives = 207/278 (74%), Gaps = 13/278 (4%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
           ++PC   +I  +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGWFETLLN    
Sbjct: 80  EKPCHRPSIPDTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWFETLLNSHEN 139

Query: 77  ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
                    +K RR NI+SI  T+D++YNLDW +SA+KNEC+AAVG KWM+NQGLM+HH 
Sbjct: 140 ISSNGEIFSVKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLMKHHQ 199

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
           ++ EYF ++GVSAIFL+RRNLL+R +SILAN +DR+ K LNGTHK+HVH   +A +LA+Y
Sbjct: 200 EVVEYFNRRGVSAIFLLRRNLLQRYVSILANDHDRSTKQLNGTHKAHVHHRGQADVLAQY 259

Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
           KPTI+   LI ELK+ ++  A A+  FK+ R+IV YYED+++NR  L +V +FL+L   +
Sbjct: 260 KPTISTKSLIAELKRSDKLAADALVNFKTIRNIVLYYEDVVSNRTKLTDVLDFLKLPNIK 319

Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
           L+SR VKIH+  L   ++NW DV   L GT YE FL+ 
Sbjct: 320 LSSRHVKIHTKRLRDHIDNWIDVYNTLNGTQYESFLNG 357


>gi|125527968|gb|EAY76082.1| hypothetical protein OsI_04009 [Oryza sativa Indica Group]
          Length = 358

 Score =  341 bits (874), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 154/275 (56%), Positives = 205/275 (74%), Gaps = 13/275 (4%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           CQ  +I  +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGW ETLLN       
Sbjct: 84  CQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIETLLNSHENISS 143

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 IK RR NI+SI  T+D++YNLDW +SA+KNEC+AAVG KWM+NQGL++HH ++ 
Sbjct: 144 NGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLVQHHEEVV 203

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
           EYF ++GVSAIFL+RRN L+R +S+LAN++D  AK +NGTHKSHVHS  EA+ILA++KP 
Sbjct: 204 EYFNRRGVSAIFLLRRNTLQRYVSVLANAHDSKAKQINGTHKSHVHSEREAQILAQFKPE 263

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           I+   LI +LK+ ++  A A+ YFK TRHI+ YYED+++N   L +V +FLRL  R+L+S
Sbjct: 264 IDTKKLIADLKKSDKLAADALLYFKKTRHIILYYEDVVSNDTKLMDVLDFLRLPKRKLSS 323

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
           R VKIH+  L   ++NW +V   L GT YE FL+ 
Sbjct: 324 RHVKIHTKLLRDHIDNWAEVNSTLMGTQYESFLNG 358


>gi|125572263|gb|EAZ13778.1| hypothetical protein OsJ_03703 [Oryza sativa Japonica Group]
          Length = 359

 Score =  333 bits (855), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 154/276 (55%), Positives = 203/276 (73%), Gaps = 14/276 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           CQ  +I  +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGW ETLLN       
Sbjct: 84  CQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIETLLNSHENISS 143

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 IK RR NI+SI  T+D++YNLDW +SA+KNEC+AAVG KWM+NQGL++HH ++ 
Sbjct: 144 NGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLVQHHEEVV 203

Query: 131 EYFK-KKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKP 189
           EYF  KK  SAIFL+RRN L+R +S+LAN++D  AK +NGTHKSHVHS  EA+ILA++KP
Sbjct: 204 EYFNPKKAFSAIFLLRRNTLQRYVSVLANAHDSKAKQINGTHKSHVHSEREAQILAQFKP 263

Query: 190 TINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELT 249
            I+   LI +LK+ ++  A A+ YFK TRHI+ YYED+++N   L +V +FLRL  R+L+
Sbjct: 264 EIDTKKLIADLKKSDKLAADALLYFKKTRHIILYYEDVVSNDTKLMDVLDFLRLPKRKLS 323

Query: 250 SRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
           SR VKIH+  L   ++NW +V   L GT YE FL+ 
Sbjct: 324 SRHVKIHTKLLRDHIDNWAEVNSTLMGTQYESFLNG 359


>gi|222637549|gb|EEE67681.1| hypothetical protein OsJ_25329 [Oryza sativa Japonica Group]
          Length = 923

 Score =  333 bits (854), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 154/278 (55%), Positives = 199/278 (71%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
           C+  ++   E+ ++H+P P +++R EC+CN VR F ++SMQRSGSGWFETLLN       
Sbjct: 644 CRKPHLPHEELRYVHFPKPESYSRGECSCNLVRSFVLVSMQRSGSGWFETLLNSHPNISS 703

Query: 79  --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                    RR NISSI+ T+D++YNLDWFTSA+KNEC+AA G KWM+NQG M+HH+DIA
Sbjct: 704 NGEIFNRVDRRENISSILQTLDKLYNLDWFTSAAKNECTAAFGLKWMLNQGFMDHHDDIA 763

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKGVS IFL RRN LRR+IS+LAN YDR+AK LNGTHKSHVHS  EA+ILAK+KP 
Sbjct: 764 SYLNKKGVSVIFLFRRNTLRRLISVLANDYDRDAKQLNGTHKSHVHSKEEAEILAKFKPV 823

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
            + + LIP ++  E+     +++F +TRH++ YYED++ NR  L +VQEFL +  R+L S
Sbjct: 824 QDVSNLIPNIRNSEKYIRDCLDHFNTTRHMILYYEDIVRNRNALFQVQEFLGVPVRKLVS 883

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           RQVKIH+ PL   V NWDDV   L GT Y  FL  +DY
Sbjct: 884 RQVKIHTSPLPGLVRNWDDVSNKLNGTQYAHFLDGADY 921


>gi|326523931|dbj|BAJ96976.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 339

 Score =  332 bits (852), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 153/278 (55%), Positives = 198/278 (71%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
           C    +   E+P++H+P P  ++R EC+CNPVR+F I+SMQRSGSGWFETLLN       
Sbjct: 60  CAKPYVPSEELPYVHFPKPKGYSRAECSCNPVRFFVIMSMQRSGSGWFETLLNSHPNISS 119

Query: 79  --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                    RR NISSIV T+D++YNLDW TSA+KNEC+AA GFKWM+NQG M+H +DI 
Sbjct: 120 NGEIFNRVDRRENISSIVQTLDKLYNLDWLTSAAKNECTAAFGFKWMLNQGFMDHRDDIL 179

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKG+S IFL RRN LRR+IS+LAN+YDR+AK LNGTHKSHVHS  EA+ILAK+KP 
Sbjct: 180 SYLNKKGISVIFLFRRNTLRRLISVLANNYDRDAKQLNGTHKSHVHSEEEAEILAKFKPE 239

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++ + L+  ++ +E+     ++ F +TR +V YYED+I NR  L +VQEF+ +  R+L S
Sbjct: 240 LDVSTLVSNIRDVEKYMGDCLDSFNTTRRMVLYYEDIIRNRNALFQVQEFIGVPVRKLVS 299

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           RQVKIH+ PL   V NW+DV   L GT Y RFL  +DY
Sbjct: 300 RQVKIHTRPLPDLVRNWEDVNSRLNGTEYARFLDGADY 337


>gi|255569155|ref|XP_002525546.1| conserved hypothetical protein [Ricinus communis]
 gi|223535125|gb|EEF36805.1| conserved hypothetical protein [Ricinus communis]
          Length = 333

 Score =  332 bits (852), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 157/302 (51%), Positives = 217/302 (71%), Gaps = 15/302 (4%)

Query: 1   MCSKQTSPHTKTKLLNIK-VIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFA 59
           +C +Q S    +K ++IK ++   C   +I   EI ++H+P P T++REECAC PVR+F 
Sbjct: 30  LCLQQISFPLISKEVSIKDIMKSSCYIHDIPPQEIHYVHFPYPKTYSREECACTPVRFFV 89

Query: 60  ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
           I+SMQRSGSGWFETLLN             +K RR +IS+I +T+D V+ L+W +SA+KN
Sbjct: 90  IMSMQRSGSGWFETLLNSHPNVSSNGEIFSVKQRRSSISTIQATLDTVFGLEWVSSAAKN 149

Query: 107 ECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKL 166
           +C AAVGFKWM+NQG+M +H +I +YF +KGVS IFL R+NLLRR++S+LAN+YDR+AK 
Sbjct: 150 DCIAAVGFKWMLNQGVMVYHREIMDYFNQKGVSVIFLFRKNLLRRLVSVLANAYDRDAKQ 209

Query: 167 LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED 226
           +NGTHKSHVH+  EA +LAK+KP +N + LI +L   E T  + +E+FKS RH+V YYED
Sbjct: 210 INGTHKSHVHTREEANLLAKFKPVLNISALISDLSSAEHTMTETLEFFKSCRHLVLYYED 269

Query: 227 LITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL-HS 285
           L+ N   L E QEFLR+  R L S QVKIH   LS+Q+ NW++V + L+G+ Y+ FL ++
Sbjct: 270 LMKNPKALSEAQEFLRVPIRNLVSHQVKIHVKLLSEQIYNWEEVYRTLEGSEYKYFLNYA 329

Query: 286 DY 287
           DY
Sbjct: 330 DY 331


>gi|115473697|ref|NP_001060447.1| Os07g0644200 [Oryza sativa Japonica Group]
 gi|33146729|dbj|BAC79620.1| unknown protein [Oryza sativa Japonica Group]
 gi|50509939|dbj|BAD30260.1| unknown protein [Oryza sativa Japonica Group]
 gi|113611983|dbj|BAF22361.1| Os07g0644200 [Oryza sativa Japonica Group]
 gi|215694729|dbj|BAG89920.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 337

 Score =  332 bits (850), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 154/278 (55%), Positives = 200/278 (71%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
           C+  ++   E+ ++H+P P +++R EC+CN VR F ++SMQRSGSGWFETLLN       
Sbjct: 58  CRKPHLPHEELRYVHFPKPESYSRGECSCNLVRSFVLVSMQRSGSGWFETLLNSHPNISS 117

Query: 79  --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                    RR NISSI+ T+D++YNLDWFTSA+KNEC+AA G KWM+NQG M+HH+DIA
Sbjct: 118 NGEIFNRVDRRENISSILQTLDKLYNLDWFTSAAKNECTAAFGLKWMLNQGFMDHHDDIA 177

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKGVS IFL RRN LRR+IS+LAN YDR+AK LNGTHKSHVHS  EA+ILAK+KP 
Sbjct: 178 SYLNKKGVSVIFLFRRNTLRRLISVLANDYDRDAKQLNGTHKSHVHSKEEAEILAKFKPV 237

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++ + LIP ++  E+     +++F +TRH++ YYED++ NR  L +VQEFL +  R+L S
Sbjct: 238 LDVSNLIPNIRNAEKYIRDCLDHFNTTRHMILYYEDIVRNRNALFQVQEFLGVPVRKLVS 297

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           RQVKIH+ PL   V NWDDV   L GT Y  FL  +DY
Sbjct: 298 RQVKIHTSPLPGLVRNWDDVSNKLNGTQYAHFLDGADY 335


>gi|357121725|ref|XP_003562568.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
          Length = 337

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 154/278 (55%), Positives = 200/278 (71%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
           C   ++   E+ ++H P P  +NR EC+C PVR+F I+SMQRSGSGWFETLLN       
Sbjct: 58  CTKPSVPSEELRYVHLPKPKGYNRGECSCTPVRFFVIVSMQRSGSGWFETLLNSHPNISS 117

Query: 79  --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                    RR N+SSIV T+D++YNLDW TSA+KNEC+AA G KWM+NQG M+HH+DI 
Sbjct: 118 NGEIFNRIDRRENLSSIVQTLDKLYNLDWLTSAAKNECTAAFGLKWMLNQGFMDHHDDIV 177

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            YF +KGVS IFL RRN LRR+IS+LAN+YDR+AK LNGTHKSHVHS  EA+ILAK+KP 
Sbjct: 178 SYFNQKGVSLIFLFRRNTLRRLISVLANNYDRDAKQLNGTHKSHVHSEEEAEILAKFKPE 237

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++ + LI +++ IE+     ++ F +TRH++ YYED+I+NR  L  VQEFL +  R+L S
Sbjct: 238 LDVSTLILDIRDIEKYIRDCLDRFNTTRHMILYYEDIISNRNALFRVQEFLGVPARKLVS 297

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           +QVKIH+ PL   V+NW+DV   L GT Y RFL  +DY
Sbjct: 298 KQVKIHTRPLPDLVKNWEDVNSKLNGTEYARFLDGADY 335


>gi|223942635|gb|ACN25401.1| unknown [Zea mays]
 gi|414869370|tpg|DAA47927.1| TPA: thylakoid lumenal protein, mRNA [Zea mays]
          Length = 364

 Score =  328 bits (842), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 162/281 (57%), Positives = 210/281 (74%), Gaps = 13/281 (4%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
           ++PC   +I  +EIP+LHYP P T++R+ECAC  VR+FAILSMQRSGSGW ETLLN    
Sbjct: 84  EKPCHDPSIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPN 143

Query: 77  ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
                    +K RR NI++I  T+DR+YNLDW++SA+KNEC+AAVG KWM+NQGLM+HH 
Sbjct: 144 ISSNGEIFSVKERRSNITAITKTLDRLYNLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQ 203

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
           +I EYF ++GVSAIFL+RRNLLRR +SILAN++D   K LNGTHK+HVHS  EA+ILA+Y
Sbjct: 204 EIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAEILAQY 263

Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
           KPTI+   LI ELK+ ++  A A+  FK+TRH+V YYED+++NR  L +V +FLRL  R+
Sbjct: 264 KPTIDKKTLITELKRSDKLAADALVNFKNTRHVVLYYEDVVSNRTMLMDVLDFLRLPKRK 323

Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
           L SR VKIH+  L   ++NW DV   LKGT +E FL+   R
Sbjct: 324 LFSRHVKIHTKRLCDHIDNWADVTDFLKGTPFESFLNGSRR 364


>gi|242082293|ref|XP_002445915.1| hypothetical protein SORBIDRAFT_07g027970 [Sorghum bicolor]
 gi|241942265|gb|EES15410.1| hypothetical protein SORBIDRAFT_07g027970 [Sorghum bicolor]
          Length = 357

 Score =  326 bits (835), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 159/281 (56%), Positives = 210/281 (74%), Gaps = 13/281 (4%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
           ++PC   +I  +EIP+LHYP P T++R+ECAC  VR+FAILSMQRSGSGW ETLLN    
Sbjct: 77  EKPCHDPSIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPN 136

Query: 77  ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
                    +K RR NI++I  T+D++YNLDW++SA+KNEC+AAVG KWM+NQGLM+HH 
Sbjct: 137 ISSNGEIFSVKDRRSNITAITKTLDKLYNLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQ 196

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
           +I EYF ++GVSAIFL+RRNLLRR +SILAN++D   K LNGTHK+HVHS  EA+ILA+Y
Sbjct: 197 EIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAEILAQY 256

Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
           KPTI+   LI ELK+ ++  + A+  FK+TRH+V YYED++ NR  L +V +FLR+  R+
Sbjct: 257 KPTIDKKTLITELKRSDKLASDALVNFKNTRHVVLYYEDVVRNRTMLMDVLDFLRVPKRK 316

Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
           L+SR VKIH+  L   ++NW DV   LKGT +E FL+   R
Sbjct: 317 LSSRHVKIHTKRLCDHIDNWADVNNFLKGTRFESFLNGSRR 357


>gi|242046744|ref|XP_002461118.1| hypothetical protein SORBIDRAFT_02g041100 [Sorghum bicolor]
 gi|241924495|gb|EER97639.1| hypothetical protein SORBIDRAFT_02g041100 [Sorghum bicolor]
          Length = 335

 Score =  324 bits (830), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 151/278 (54%), Positives = 199/278 (71%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
           C+   +   E+ ++H+P P +++R EC+C PVR+F I+SMQRSGSGWFETLLN       
Sbjct: 56  CRKPYVPHEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLLNSHPNISS 115

Query: 79  --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                    RR NISSI+ T+D +YNLDW TSA+KNEC+AA G KWM+NQG++E+  DI 
Sbjct: 116 NGEIFNRVDRRENISSILQTLDTLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPEDIV 175

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y KKKGVS IFL RRN LRR+IS+LAN YD++AK LNGTHKSHVHS  EA+ILAK+KP 
Sbjct: 176 SYLKKKGVSVIFLFRRNTLRRLISVLANDYDKDAKQLNGTHKSHVHSKEEAEILAKFKPH 235

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++ + LI  ++ IE+     +++FKSTRH++ YYED+I+N   L +VQEFLR+  R L S
Sbjct: 236 LDPSTLITNIRNIEKAIRDCLDHFKSTRHMILYYEDIISNSNALFQVQEFLRVPVRRLMS 295

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           RQVKIH+ P+   V+NW++V   L GT +  FL  SDY
Sbjct: 296 RQVKIHTRPIPDLVKNWEEVSSKLNGTEFAHFLDGSDY 333


>gi|212723924|ref|NP_001131369.1| uncharacterized protein LOC100192694 [Zea mays]
 gi|194691340|gb|ACF79754.1| unknown [Zea mays]
 gi|224028935|gb|ACN33543.1| unknown [Zea mays]
 gi|414887880|tpg|DAA63894.1| TPA: hypothetical protein ZEAMMB73_966617 [Zea mays]
 gi|414887881|tpg|DAA63895.1| TPA: hypothetical protein ZEAMMB73_966617 [Zea mays]
          Length = 337

 Score =  323 bits (828), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 150/281 (53%), Positives = 198/281 (70%), Gaps = 14/281 (4%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK-- 78
           D  C+   +   E+ ++H+P P +++R EC+C PVR+F I+SMQRSGSGWFETL+N    
Sbjct: 55  DDICRKPYVPYEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLMNSHPN 114

Query: 79  -----------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
                       RR NISSI+ T+D +YNLDW TSA+KNEC+AA G KWM+NQG++E+  
Sbjct: 115 ISSNGEIFNRMDRRENISSILQTLDTLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPE 174

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
           DI  Y  KKGVS IFL RRN LRR++S+LAN YD++AK LNGTHKSHVHS  EA+ILAK+
Sbjct: 175 DIVSYLNKKGVSVIFLFRRNTLRRLVSVLANDYDKDAKQLNGTHKSHVHSKEEAEILAKF 234

Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
           KP ++ + LI  ++ IE+     +++FKSTRH++ YYED+I+N   L  VQEFLR+  R 
Sbjct: 235 KPHLDTSTLITNIRNIEKAIRDCLDHFKSTRHMILYYEDIISNSNALSRVQEFLRVPVRR 294

Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           L SRQVKIH+ PL   V+NW++V   L GT +  FL  SDY
Sbjct: 295 LMSRQVKIHTRPLPDLVKNWEEVSSKLNGTEFAHFLDGSDY 335


>gi|194696388|gb|ACF82278.1| unknown [Zea mays]
          Length = 337

 Score =  322 bits (825), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 150/281 (53%), Positives = 197/281 (70%), Gaps = 14/281 (4%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK-- 78
           D  C+   +   E+ ++H+P P +++R EC+C PVR+F I+SMQRSGSGWFETL+N    
Sbjct: 55  DDICRKPYVPYEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLMNSHPN 114

Query: 79  -----------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
                       RR NISSI+ T+DR+YNLDW TSA+KNEC+AA G KWM+NQG++E+  
Sbjct: 115 ISSNGEIFNRVDRRENISSILQTLDRLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPE 174

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
           DI  Y  KKGVS IFL RRN LRR++S+LAN YD+ AK LNGTHKSHVHS  EA+ILAK+
Sbjct: 175 DIVSYLNKKGVSVIFLFRRNTLRRLVSVLANDYDKAAKQLNGTHKSHVHSKEEAEILAKF 234

Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
           KP ++ + LI  ++ IE+     +++FKSTRH++ YYED+I+N   L  VQEFL +  R 
Sbjct: 235 KPHLDTSTLITNIRNIEKAIRDCLDHFKSTRHMILYYEDIISNSNALSRVQEFLGVPVRR 294

Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           L SRQVKIH+ PL   V+NW++V   L GT +  FL  SDY
Sbjct: 295 LMSRQVKIHTRPLPDLVKNWEEVSSKLNGTEFAHFLDGSDY 335


>gi|226531263|ref|NP_001142398.1| uncharacterized protein LOC100274572 [Zea mays]
 gi|194708632|gb|ACF88400.1| unknown [Zea mays]
 gi|195625724|gb|ACG34692.1| hypothetical protein [Zea mays]
 gi|413925273|gb|AFW65205.1| hypothetical protein ZEAMMB73_585643 [Zea mays]
          Length = 348

 Score =  322 bits (825), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 159/281 (56%), Positives = 207/281 (73%), Gaps = 13/281 (4%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
           ++PC    I  +EIP+LHYP P T++R+ECAC  VR+FAILSMQRSGSGW ETLLN    
Sbjct: 68  EKPCHDPGIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPN 127

Query: 77  ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
                    +K RR N ++I  T+D++YNLDW++SA+KNEC+AAVG KWM+NQGLM+HH 
Sbjct: 128 ISSNGEIFSVKERRSNFTAIKKTLDKLYNLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQ 187

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
           +I EYF ++GVSAIFL+RRNLLRR +SILAN++D   K LNGTHK+HVHS  EA+ILA+Y
Sbjct: 188 EIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAEILAQY 247

Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
           KPTI+   LI ELK+ ++  A A+  FK+TRH+V YYED++ +R  L +V +FLRL  R+
Sbjct: 248 KPTIDKKTLIAELKRSDKFAADALVNFKNTRHVVLYYEDVVRSRTMLMDVLDFLRLPKRK 307

Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
           L SR VKIH+  L   ++NW DV   LKGT +E FL+   R
Sbjct: 308 LLSRHVKIHTKRLRDHIDNWADVNNFLKGTPFESFLNGSRR 348


>gi|6562002|emb|CAB62491.1| hypothetical protein [Arabidopsis thaliana]
          Length = 263

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 151/217 (69%), Positives = 178/217 (82%), Gaps = 13/217 (5%)

Query: 49  ECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVY 95
           EC  NPVRYFAILSMQRSGSGWFETLLN             +  RR NISSI+ T+DRVY
Sbjct: 37  ECGHNPVRYFAILSMQRSGSGWFETLLNSHNNVSSNGEIFSVLDRRKNISSIIQTLDRVY 96

Query: 96  NLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISI 155
           NLDWFTSASKNECSAA+GFKWM+NQGL+E+H DI EYF ++GVSAIFL RRN LRRM+S+
Sbjct: 97  NLDWFTSASKNECSAAIGFKWMLNQGLLENHKDIVEYFNRRGVSAIFLFRRNPLRRMVSV 156

Query: 156 LANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
           LANSYDR AKLLNGTHKSHVHSPAEA  L++YKP IN+T LI +L++ E + AKA+EYF 
Sbjct: 157 LANSYDRYAKLLNGTHKSHVHSPAEADALSRYKPVINSTSLIHDLQETENSAAKALEYFN 216

Query: 216 STRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQ 252
           +TRHIV +YEDLITN+ TLK+VQEFL +  ++L+SRQ
Sbjct: 217 TTRHIVVFYEDLITNQTTLKQVQEFLNIPVKDLSSRQ 253


>gi|297722141|ref|NP_001173434.1| Os03g0363950 [Oryza sativa Japonica Group]
 gi|255674526|dbj|BAH92162.1| Os03g0363950 [Oryza sativa Japonica Group]
          Length = 366

 Score =  321 bits (822), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           C+   I    + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN       
Sbjct: 87  CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 146

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI 
Sbjct: 147 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 206

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKGV  IFL RRN LRR+IS+LAN YDR  K LNGTHK+HVHS  EA ILA++KP 
Sbjct: 207 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 266

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++   LIP ++  EQ+    +++F STRH++ YYED+I ++  L  VQEFL +    L+S
Sbjct: 267 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 326

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           R VKIH+ PL   V+NW++V + L GT Y RF+  +DY
Sbjct: 327 RHVKIHTSPLPDLVDNWEEVSEKLNGTEYARFVDGADY 364


>gi|108708316|gb|ABF96111.1| nodulation protein, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 337

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           C+   I    + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN       
Sbjct: 58  CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 117

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI 
Sbjct: 118 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 177

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKGV  IFL RRN LRR+IS+LAN YDR  K LNGTHK+HVHS  EA ILA++KP 
Sbjct: 178 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 237

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++   LIP ++  EQ+    +++F STRH++ YYED+I ++  L  VQEFL +    L+S
Sbjct: 238 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 297

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           R VKIH+ PL   V+NW++V + L GT Y RF+  +DY
Sbjct: 298 RHVKIHTSPLPDLVDNWEEVSEKLNGTEYARFVDGADY 335


>gi|222624965|gb|EEE59097.1| hypothetical protein OsJ_10950 [Oryza sativa Japonica Group]
          Length = 339

 Score =  320 bits (820), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           C+   I    + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN       
Sbjct: 60  CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 119

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI 
Sbjct: 120 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 179

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKGV  IFL RRN LRR+IS+LAN YDR  K LNGTHK+HVHS  EA ILA++KP 
Sbjct: 180 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 239

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++   LIP ++  EQ+    +++F STRH++ YYED+I ++  L  VQEFL +    L+S
Sbjct: 240 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 299

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           R VKIH+ PL   V+NW++V + L GT Y RF+  +DY
Sbjct: 300 RHVKIHTSPLPDLVDNWEEVSEKLNGTEYARFVDGADY 337


>gi|218192885|gb|EEC75312.1| hypothetical protein OsI_11681 [Oryza sativa Indica Group]
          Length = 339

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           C+   I    + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN       
Sbjct: 60  CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 119

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI 
Sbjct: 120 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 179

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  KKGV  IFL RRN LRR+IS+LAN YDR  K LNGTHK+HVHS  EA ILA++KP 
Sbjct: 180 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 239

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
           ++   LIP ++  EQ+    +++F STRH++ YYED+I ++  L  VQEFL +    L+S
Sbjct: 240 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 299

Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           R VKIH+ PL   V+NW++V + L GT Y RF+  +DY
Sbjct: 300 RHVKIHTSPLPDLVDNWEEVSENLNGTEYARFVDGADY 337


>gi|357119777|ref|XP_003561610.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
          Length = 343

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 139/271 (51%), Positives = 194/271 (71%), Gaps = 13/271 (4%)

Query: 29  IQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------------ 76
           I   ++ ++H+P P+T++R ECAC PVR+F I+SMQRSGSGWFETLLN            
Sbjct: 69  IPYEQMQYVHFPRPMTYDRGECACTPVRFFVIVSMQRSGSGWFETLLNSHPNVSSNGEIF 128

Query: 77  -IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKK 135
            ++ RR +I+SI+ T+D++Y+LDW TSA+KNEC+AA G KWM+NQGL ++H DIA Y  +
Sbjct: 129 SVRERREDIASILRTLDKLYDLDWRTSAAKNECTAAFGLKWMLNQGLTDYHQDIANYLNE 188

Query: 136 KGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATL 195
           KGV  IFL R+N LRR++S+LAN YDRN K LNGTHK+HVHS  EA+ILA+++P ++ + 
Sbjct: 189 KGVMVIFLFRKNTLRRLVSVLANDYDRNVKQLNGTHKAHVHSHEEAEILARFRPELDVSS 248

Query: 196 LIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKI 255
           L+  ++  EQ+    +  F+STRH++ YYED++ +   L  VQEFL +  R L+SR VKI
Sbjct: 249 LVLSIRDAEQSMESCLVRFRSTRHMILYYEDIVRDDNALSRVQEFLGVPVRRLSSRHVKI 308

Query: 256 HSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
           H+ PL   VENW+DV + LKGT +  FL  +
Sbjct: 309 HTRPLPDLVENWEDVSETLKGTEFAHFLDGE 339


>gi|296087807|emb|CBI35063.3| unnamed protein product [Vitis vinifera]
          Length = 344

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 158/299 (52%), Positives = 216/299 (72%), Gaps = 15/299 (5%)

Query: 3   SKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILS 62
           ++ T P   TK+   +V ++ C    I   E+ ++H+P P T++REEC C PVR+F ILS
Sbjct: 45  TRVTVPSKPTKISFTRV-ERTCSINGIPPEELVYVHFPQPKTYSREECKCTPVRFFVILS 103

Query: 63  MQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECS 109
           MQRSGSGWFETLLN             +K RR ++ +I  T+D +YNL+W +SA+KN+C 
Sbjct: 104 MQRSGSGWFETLLNSHPNISSNGEVFSLKQRRQSMETIQRTLDTIYNLEWISSAAKNDCV 163

Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
           AAVGFKWM+NQ +M +H  IA YF +KGVS IFL RRNLLRR++S+LAN+YD++AKL+NG
Sbjct: 164 AAVGFKWMLNQAVMVYHRQIANYFNQKGVSVIFLFRRNLLRRLVSLLANAYDKDAKLING 223

Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
           TH SHVH+  EA +LA++KPT+N + LI +L   E+T  +++E+FKS RH++ YYED+I 
Sbjct: 224 THMSHVHTREEADLLAQFKPTLNVSSLIVDLSTAERTMTESLEFFKSNRHMILYYEDIIN 283

Query: 230 NRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           N   L +VQEFLR+ +R+L SRQVKIH  PL +QV+NW DV   ++G+ YE FLH SDY
Sbjct: 284 NPKALSDVQEFLRVPFRKLKSRQVKIHIKPLHEQVDNWYDVYSTVQGSQYEHFLHSSDY 342


>gi|413955662|gb|AFW88311.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 346

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 137/265 (51%), Positives = 188/265 (70%), Gaps = 14/265 (5%)

Query: 36  FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
           ++H+P P T++R EC CNPVR+F I+S QRSGSGW E LLN             ++ RR 
Sbjct: 80  YVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQ 139

Query: 83  NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
           N+SS++ T+DR+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI  Y  +KG   +F
Sbjct: 140 NLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVF 199

Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
           L RRN LRR+IS++AN+YDR A+ LNG HKSHVHS  EA++LA+++P ++A  LIP ++ 
Sbjct: 200 LFRRNTLRRLISVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRT 259

Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLS 261
            ++     +  F STRH+V YYED++ + R  L  VQEFL +  REL+S+ VKIH+ PL 
Sbjct: 260 AQRAVRACLRRFGSTRHMVLYYEDVVRDSRKALSRVQEFLGVPARELSSKHVKIHTRPLP 319

Query: 262 KQVENWDDVQKALKGTSYERFLHSD 286
             V NW++V++ L+GT Y RFL  D
Sbjct: 320 DLVGNWEEVRRTLRGTEYSRFLDDD 344


>gi|225464035|ref|XP_002265782.1| PREDICTED: nodulation protein H-like [Vitis vinifera]
          Length = 339

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 158/299 (52%), Positives = 216/299 (72%), Gaps = 15/299 (5%)

Query: 3   SKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILS 62
           ++ T P   TK+   +V ++ C    I   E+ ++H+P P T++REEC C PVR+F ILS
Sbjct: 40  TRVTVPSKPTKISFTRV-ERTCSINGIPPEELVYVHFPQPKTYSREECKCTPVRFFVILS 98

Query: 63  MQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECS 109
           MQRSGSGWFETLLN             +K RR ++ +I  T+D +YNL+W +SA+KN+C 
Sbjct: 99  MQRSGSGWFETLLNSHPNISSNGEVFSLKQRRQSMETIQRTLDTIYNLEWISSAAKNDCV 158

Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
           AAVGFKWM+NQ +M +H  IA YF +KGVS IFL RRNLLRR++S+LAN+YD++AKL+NG
Sbjct: 159 AAVGFKWMLNQAVMVYHRQIANYFNQKGVSVIFLFRRNLLRRLVSLLANAYDKDAKLING 218

Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
           TH SHVH+  EA +LA++KPT+N + LI +L   E+T  +++E+FKS RH++ YYED+I 
Sbjct: 219 THMSHVHTREEADLLAQFKPTLNVSSLIVDLSTAERTMTESLEFFKSNRHMILYYEDIIN 278

Query: 230 NRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
           N   L +VQEFLR+ +R+L SRQVKIH  PL +QV+NW DV   ++G+ YE FLH SDY
Sbjct: 279 NPKALSDVQEFLRVPFRKLKSRQVKIHIKPLHEQVDNWYDVYSTVQGSQYEHFLHSSDY 337


>gi|168059476|ref|XP_001781728.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666812|gb|EDQ53457.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 378

 Score =  306 bits (783), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 147/264 (55%), Positives = 186/264 (70%), Gaps = 13/264 (4%)

Query: 36  FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
           + HYP P  + R+EC C PV YF ILSMQRSGSGWFETLLN             +K RR 
Sbjct: 112 YQHYPLPHAYERQECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVKDRRE 171

Query: 83  NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
           N SSI  TMD+VYNLDW  SA+KNEC+AAVGFKWM+NQG ME++ ++ EYF++ GVS I 
Sbjct: 172 NFSSIAKTMDKVYNLDWLNSAAKNECTAAVGFKWMLNQGPMEYNREVREYFERMGVSVIL 231

Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
           L+RRN+L+R+ISILAN+YDR  K LNGTHKSHVHS  EA  LA+Y+P ++A  L   L +
Sbjct: 232 LLRRNVLKRLISILANAYDRVVKPLNGTHKSHVHSVEEAMKLAEYRPVVDAKHLFENLAR 291

Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSK 262
           +EQ T  A  +F +TR  V +YEDL+ +   L E+Q FL +  R+L S+QVKIH+ PL +
Sbjct: 292 VEQITDDAQRFFNNTRIRVVHYEDLVVDPKYLSEIQHFLGVQPRKLESQQVKIHTRPLRE 351

Query: 263 QVENWDDVQKALKGTSYERFLHSD 286
           Q++NWD+V   LKGT YE  L  D
Sbjct: 352 QIQNWDEVLIHLKGTKYESLLQDD 375


>gi|242035665|ref|XP_002465227.1| hypothetical protein SORBIDRAFT_01g034575 [Sorghum bicolor]
 gi|241919081|gb|EER92225.1| hypothetical protein SORBIDRAFT_01g034575 [Sorghum bicolor]
          Length = 366

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 195/306 (63%), Gaps = 42/306 (13%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           C        ++ ++H+P P T++R ECACNPVR+F I+SMQRSGSGWFETLLN       
Sbjct: 59  CTRPATPHEQMRYVHFPRPATYDRGECACNPVRFFVIISMQRSGSGWFETLLNSHPNVSS 118

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 ++ RR NISSI++T+DR+Y++DW TSA+KNEC+AA G KWM+NQGLME+H DI 
Sbjct: 119 NGEIFSVRDRRENISSILATLDRLYDMDWITSAAKNECTAAFGLKWMLNQGLMENHRDIV 178

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
            Y  ++G   IFL RRN LRR+IS+LAN+YDR  K LNG HKSHVHS  EA+ILA++KP 
Sbjct: 179 NYLNRRGAIVIFLFRRNTLRRLISVLANNYDRRTKQLNGVHKSHVHSKEEAEILARFKPE 238

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNR------------------- 231
           ++ + LIP ++  ++     +  F STRH+V YYED+I +R                   
Sbjct: 239 LDISTLIPSIRDAQRAMRTCLGRFSSTRHMVLYYEDVIRDRNVGITSKQAFPASEDYPII 298

Query: 232 ---------ATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERF 282
                      L  VQEFL +  R+L+S+ VKIH+ PL   V+NW+ V + L GT Y RF
Sbjct: 299 MLFLMPCAVQALSRVQEFLGVPVRKLSSKHVKIHTRPLPDLVDNWEQVSQVLNGTRYGRF 358

Query: 283 L-HSDY 287
           L  +DY
Sbjct: 359 LADADY 364


>gi|168020679|ref|XP_001762870.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685979|gb|EDQ72371.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 306

 Score =  302 bits (774), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 142/264 (53%), Positives = 185/264 (70%), Gaps = 13/264 (4%)

Query: 36  FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
           + HYP P T+ R EC C PV YF ILSMQRSGSGWFETLLN             ++ RR 
Sbjct: 40  YQHYPLPHTYERNECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVRERRD 99

Query: 83  NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
           N SSI   MD+V+NLDW  SASKNEC+AAVGFKWM+NQG ME++ ++ +YF++ GVS I 
Sbjct: 100 NFSSIARNMDKVFNLDWLNSASKNECTAAVGFKWMLNQGPMEYNREVLDYFERMGVSVIL 159

Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
           L+RRN+L+R+ISI+ANSYD+ AK+LNGTHKSHVHS  EA  LA+Y+P I+   L   L++
Sbjct: 160 LLRRNVLKRLISIMANSYDQRAKILNGTHKSHVHSVEEALKLAEYRPVIDVNHLPENLQR 219

Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSK 262
           +E+  + A   F  TR  + YYEDL+ +   L E+Q FL +  R+L S+QVKIH+ PL +
Sbjct: 220 VEKMASDAQRLFNKTRSRLVYYEDLVMDSQRLTEIQTFLGVPPRKLESQQVKIHTRPLRE 279

Query: 263 QVENWDDVQKALKGTSYERFLHSD 286
           Q++NWD+V   L GT YE  +H D
Sbjct: 280 QIQNWDEVLARLNGTRYELLMHDD 303


>gi|218200116|gb|EEC82543.1| hypothetical protein OsI_27076 [Oryza sativa Indica Group]
          Length = 866

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 138/265 (52%), Positives = 184/265 (69%), Gaps = 12/265 (4%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIKHRRIN 83
           C+   +   E+ ++H+P P +++R EC+CNPVR+   L +          L  +    + 
Sbjct: 611 CRKPRLPHEELRYVHFPKPESYSRGECSCNPVRFLCALYL----------LQRVAESGMG 660

Query: 84  ISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFL 143
           + SI  T+D++YNLDWFTSA+KNEC+AA G KWM+NQG M+HH+DIA YF KKGVS IFL
Sbjct: 661 VESIC-TLDKLYNLDWFTSAAKNECTAAFGLKWMLNQGFMDHHDDIASYFNKKGVSVIFL 719

Query: 144 IRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQI 203
            RRN LRR+IS+LAN YDRNAK LNGTHKSHVHS  EA+ILAK+KP ++ + LIP ++  
Sbjct: 720 FRRNTLRRLISVLANDYDRNAKQLNGTHKSHVHSKEEAEILAKFKPVLDVSNLIPNIRNA 779

Query: 204 EQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQ 263
           E+     +++F +TRH++ YYED++ NR  L +VQEFL +  R+L SRQVKIH+ PL   
Sbjct: 780 EKYIRDCLDHFNTTRHMILYYEDIVRNRNALFQVQEFLGVPVRKLVSRQVKIHTSPLPGL 839

Query: 264 VENWDDVQKALKGTSYERFLH-SDY 287
           V NWDDV   L GT Y  FL  +DY
Sbjct: 840 VSNWDDVSNKLNGTQYAHFLDGADY 864


>gi|413955663|gb|AFW88312.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 374

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 138/293 (47%), Positives = 190/293 (64%), Gaps = 42/293 (14%)

Query: 36  FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
           ++H+P P T++R EC CNPVR+F I+S QRSGSGW E LLN             ++ RR 
Sbjct: 80  YVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQ 139

Query: 83  NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
           N+SS++ T+DR+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI  Y  +KG   +F
Sbjct: 140 NLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVF 199

Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
           L RRN LRR+IS++AN+YDR A+ LNG HKSHVHS  EA++LA+++P ++A  LIP ++ 
Sbjct: 200 LFRRNTLRRLISVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRT 259

Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLI---------TNRATLK------------------ 235
            ++     +  F STRH+V YYED++         T+R  L+                  
Sbjct: 260 AQRAVRACLRRFGSTRHMVLYYEDVVRDSRKVGGGTSRMALEFKRGYPITGACSSLCMQA 319

Query: 236 --EVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
              VQEFL +  REL+S+ VKIH+ PL   V NW++V++ L+GT Y RFL  D
Sbjct: 320 LSRVQEFLGVPARELSSKHVKIHTRPLPDLVGNWEEVRRTLRGTEYSRFLDDD 372


>gi|226502206|ref|NP_001144194.1| uncharacterized protein LOC100277054 [Zea mays]
 gi|195638236|gb|ACG38586.1| hypothetical protein [Zea mays]
          Length = 340

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 131/257 (50%), Positives = 181/257 (70%), Gaps = 14/257 (5%)

Query: 36  FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
           ++H+P P T++R EC CNPVR+F I+S QRSGSGW   LLN             ++ RR 
Sbjct: 80  YVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVXALLNSHPNVSSNGEVFSMRERRQ 139

Query: 83  NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
           N+SS++ T+DR+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI  Y  +KG   +F
Sbjct: 140 NLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVF 199

Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
           L RRN LRR+IS++AN+YDR A+ LNG HKSHVHS  EA++LA+++P ++A  LIP ++ 
Sbjct: 200 LFRRNTLRRLISVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRT 259

Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLS 261
            ++     +  F STRH+V YYED++ + R  L  VQEFL +  REL+S+ VKIH+ PL 
Sbjct: 260 AQRAVRACLRRFGSTRHMVLYYEDVVRDSRKALSRVQEFLGVPARELSSKHVKIHTRPLP 319

Query: 262 KQVENWDDVQKALKGTS 278
             V NW++V++ L G S
Sbjct: 320 DLVGNWEEVRRTLAGQS 336


>gi|167999402|ref|XP_001752406.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696306|gb|EDQ82645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 256

 Score =  296 bits (759), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 139/253 (54%), Positives = 180/253 (71%), Gaps = 13/253 (5%)

Query: 47  REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
           R EC C PV YF ILSMQRSGSGWFETLLN             +  RR N SSI + MD+
Sbjct: 1   RNECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEVFSVGERRDNFSSIATNMDK 60

Query: 94  VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
           V+NLDW  SASKNEC+AAVGFKWM+NQG ME++ ++ +YF+K GVS I L+RRN+L+R+I
Sbjct: 61  VFNLDWLNSASKNECTAAVGFKWMLNQGPMEYNGEVLDYFQKMGVSVILLLRRNVLKRLI 120

Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
           SI+AN+YD+ A++LNGTHKSHVHS  EA  LA+YKP I+   L   L+++EQ  + A  +
Sbjct: 121 SIMANTYDQRARILNGTHKSHVHSVEEALKLAEYKPVIDVKHLPERLQRVEQIASDAQRF 180

Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
           F  TR    YYEDL+T+   L E+QEFLR+  + L S+QVKIH+ P+ +Q++NWD+V   
Sbjct: 181 FNKTRLRPVYYEDLVTDPKQLTEIQEFLRVPPQNLESQQVKIHTRPMREQIQNWDEVLAR 240

Query: 274 LKGTSYERFLHSD 286
           L GT YE  +H D
Sbjct: 241 LNGTKYELLMHDD 253


>gi|194702290|gb|ACF85229.1| unknown [Zea mays]
 gi|413955660|gb|AFW88309.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 334

 Score =  296 bits (759), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 133/254 (52%), Positives = 179/254 (70%), Gaps = 14/254 (5%)

Query: 47  REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
           R EC CNPVR+F I+S QRSGSGW E LLN             ++ RR N+SS++ T+DR
Sbjct: 79  RGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQNLSSVLGTLDR 138

Query: 94  VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
           +Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI  Y  +KG   +FL RRN LRR+I
Sbjct: 139 LYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVFLFRRNTLRRLI 198

Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
           S++AN+YDR A+ LNG HKSHVHS  EA++LA+++P ++A  LIP ++  ++     +  
Sbjct: 199 SVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRTAQRAVRACLRR 258

Query: 214 FKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQK 272
           F STRH+V YYED++ + R  L  VQEFL +  REL+S+ VKIH+ PL   V NW++V++
Sbjct: 259 FGSTRHMVLYYEDVVRDSRKALSRVQEFLGVPARELSSKHVKIHTRPLPDLVGNWEEVRR 318

Query: 273 ALKGTSYERFLHSD 286
            L+GT Y RFL  D
Sbjct: 319 TLRGTEYSRFLDDD 332


>gi|293334015|ref|NP_001170415.1| uncharacterized protein LOC100384402 [Zea mays]
 gi|224035729|gb|ACN36940.1| unknown [Zea mays]
          Length = 275

 Score =  295 bits (755), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 149/253 (58%), Positives = 189/253 (74%), Gaps = 13/253 (5%)

Query: 49  ECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVY 95
           ECAC  VR+FAILSMQRSGSGW ETLLN             +K RR NI++I  T+DR+Y
Sbjct: 23  ECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNITAITKTLDRLY 82

Query: 96  NLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISI 155
           NLDW++SA+KNEC+AAVG KWM+NQGLM+HH +I EYF ++GVSAIFL+RRNLLRR +SI
Sbjct: 83  NLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSI 142

Query: 156 LANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
           LAN++D   K LNGTHK+HVHS  EA+ILA+YKPTI+   LI ELK+ ++  A A+  FK
Sbjct: 143 LANAHDSAMKQLNGTHKAHVHSKHEAEILAQYKPTIDKKTLITELKRSDKLAADALVNFK 202

Query: 216 STRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
           +TRH+V YYED+++NR  L +V +FLRL  R+L SR VKIH+  L   ++NW DV   LK
Sbjct: 203 NTRHVVLYYEDVVSNRTMLMDVLDFLRLPKRKLFSRHVKIHTKRLCDHIDNWADVTDFLK 262

Query: 276 GTSYERFLHSDYR 288
           GT +E FL+   R
Sbjct: 263 GTPFESFLNGSRR 275


>gi|413955391|gb|AFW88040.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 307

 Score =  293 bits (751), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 136/244 (55%), Positives = 179/244 (73%), Gaps = 13/244 (5%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ      ++++ I+V++QPC  + +  SE+ F HYP P T++REEC CN VR+FAI
Sbjct: 59  MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSREECKCNAVRFFAI 118

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSIV+ MDRVY+LDW +SASKNE
Sbjct: 119 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIVNAMDRVYSLDWNSSASKNE 178

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AA+GFKWM+NQGL+ +H  + +YF ++GVSAIFL RRNLLR+++S LAN++DR  K L
Sbjct: 179 CTAAIGFKWMLNQGLVANHVAVVDYFNRRGVSAIFLFRRNLLRQLVSQLANNHDRFLKQL 238

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+  EA ILA+YKP +N + LI +LKQ ++ T  A+E   +TRHI  YYED+
Sbjct: 239 NGTHKAHVHTEHEAHILARYKPRLNTSSLIRKLKQADEYTRDALENLNNTRHITVYYEDI 298

Query: 228 ITNR 231
           + NR
Sbjct: 299 VRNR 302


>gi|115440365|ref|NP_001044462.1| Os01g0784600 [Oryza sativa Japonica Group]
 gi|53792452|dbj|BAD53360.1| unknown protein [Oryza sativa Japonica Group]
 gi|113533993|dbj|BAF06376.1| Os01g0784600 [Oryza sativa Japonica Group]
 gi|215695270|dbj|BAG90461.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 346

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 129/220 (58%), Positives = 171/220 (77%), Gaps = 13/220 (5%)

Query: 24  CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
           CQ  +I  +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGW ETLLN       
Sbjct: 84  CQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIETLLNSHENISS 143

Query: 77  ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
                 IK RR NI+SI  T+D++YNLDW +SA+KNEC+AAVG KWM+NQGL++HH ++ 
Sbjct: 144 NGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLVQHHEEVV 203

Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
           EYF ++GVSAIFL+RRN L+R +S+LAN++D  AK +NGTHKSHVHS  EA+ILA++KP 
Sbjct: 204 EYFNRRGVSAIFLLRRNTLQRYVSVLANAHDSKAKQINGTHKSHVHSEREAQILAQFKPE 263

Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN 230
           I+   LI +LK+ ++  A A+ YFK TRHI+ YYED+++N
Sbjct: 264 IDTKKLIADLKKSDKLAADALLYFKKTRHIILYYEDVVSN 303


>gi|168007783|ref|XP_001756587.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692183|gb|EDQ78541.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 256

 Score =  290 bits (741), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 140/252 (55%), Positives = 177/252 (70%), Gaps = 13/252 (5%)

Query: 47  REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
           R+EC C PV +F ILSMQRSGSGWFETLLN             +K RR N S+I  TMD+
Sbjct: 1   RQECQCTPVHFFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVKPRRANFSTIARTMDK 60

Query: 94  VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
           +YNLDW  SA+KNEC+AAVGFKWM+NQG ME+  ++++YF+K GVS I L+RRN+L+R+I
Sbjct: 61  IYNLDWLNSAAKNECTAAVGFKWMLNQGPMEYSREVSDYFEKMGVSVILLLRRNVLKRLI 120

Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
           SILAN+YDR AK LNG HKSHVHS  EA  LA+Y+P I+   L   L ++EQ T  A  +
Sbjct: 121 SILANAYDRKAKPLNGIHKSHVHSVEEAMKLAEYRPVIDVNHLTDNLHRVEQITDDAQRF 180

Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
           F +TR  V YYEDL+ +   L +VQEFL +  R+L S QVKIH+ PL +Q++NWD V   
Sbjct: 181 FNNTRLRVVYYEDLVMDPKHLMQVQEFLGVQPRKLESLQVKIHTRPLREQIQNWDAVLAR 240

Query: 274 LKGTSYERFLHS 285
           LK T YE  L+ 
Sbjct: 241 LKATQYETLLND 252


>gi|413925274|gb|AFW65206.1| hypothetical protein ZEAMMB73_585643 [Zea mays]
          Length = 275

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 146/253 (57%), Positives = 187/253 (73%), Gaps = 13/253 (5%)

Query: 49  ECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVY 95
           ECAC  VR+FAILSMQRSGSGW ETLLN             +K RR N ++I  T+D++Y
Sbjct: 23  ECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNFTAIKKTLDKLY 82

Query: 96  NLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISI 155
           NLDW++SA+KNEC+AAVG KWM+NQGLM+HH +I EYF ++GVSAIFL+RRNLLRR +SI
Sbjct: 83  NLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSI 142

Query: 156 LANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
           LAN++D   K LNGTHK+HVHS  EA+ILA+YKPTI+   LI ELK+ ++  A A+  FK
Sbjct: 143 LANAHDSAMKQLNGTHKAHVHSKHEAEILAQYKPTIDKKTLIAELKRSDKFAADALVNFK 202

Query: 216 STRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
           +TRH+V YYED++ +R  L +V +FLRL  R+L SR VKIH+  L   ++NW DV   LK
Sbjct: 203 NTRHVVLYYEDVVRSRTMLMDVLDFLRLPKRKLLSRHVKIHTKRLRDHIDNWADVNNFLK 262

Query: 276 GTSYERFLHSDYR 288
           GT +E FL+   R
Sbjct: 263 GTPFESFLNGSRR 275


>gi|413955661|gb|AFW88310.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
          Length = 362

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 134/282 (47%), Positives = 181/282 (64%), Gaps = 42/282 (14%)

Query: 47  REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
           R EC CNPVR+F I+S QRSGSGW E LLN             ++ RR N+SS++ T+DR
Sbjct: 79  RGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQNLSSVLGTLDR 138

Query: 94  VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
           +Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI  Y  +KG   +FL RRN LRR+I
Sbjct: 139 LYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVFLFRRNTLRRLI 198

Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
           S++AN+YDR A+ LNG HKSHVHS  EA++LA+++P ++A  LIP ++  ++     +  
Sbjct: 199 SVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRTAQRAVRACLRR 258

Query: 214 FKSTRHIVFYYEDLI---------TNRATLK--------------------EVQEFLRLA 244
           F STRH+V YYED++         T+R  L+                     VQEFL + 
Sbjct: 259 FGSTRHMVLYYEDVVRDSRKVGGGTSRMALEFKRGYPITGACSSLCMQALSRVQEFLGVP 318

Query: 245 YRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
            REL+S+ VKIH+ PL   V NW++V++ L+GT Y RFL  D
Sbjct: 319 ARELSSKHVKIHTRPLPDLVGNWEEVRRTLRGTEYSRFLDDD 360


>gi|302770487|ref|XP_002968662.1| hypothetical protein SELMODRAFT_90926 [Selaginella moellendorffii]
 gi|302816465|ref|XP_002989911.1| hypothetical protein SELMODRAFT_130724 [Selaginella moellendorffii]
 gi|300142222|gb|EFJ08924.1| hypothetical protein SELMODRAFT_130724 [Selaginella moellendorffii]
 gi|300163167|gb|EFJ29778.1| hypothetical protein SELMODRAFT_90926 [Selaginella moellendorffii]
          Length = 329

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 197/308 (63%), Gaps = 23/308 (7%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNI----QQSEIPFLHYPNPVTFNREECACNPVR 56
           MC       T    L++  I Q  +  N+     +++ P  HYP P T++R+EC C PV 
Sbjct: 23  MCLIGVDRRTTYDRLSLDPIQQVWRRENLCPRLSRAKFP-QHYPQPRTYSRKECRCVPVH 81

Query: 57  YFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSA 103
           +F +LSMQRSGSGWFETLLN             I  RR N S+I  T+D VYNL+W++SA
Sbjct: 82  FFVLLSMQRSGSGWFETLLNSHPNVSSHGEIFSIGRRRANFSTIKQTLDEVYNLEWYSSA 141

Query: 104 SKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRN 163
           +KNEC+AAVGFKWM+NQG + H + +A+YF+ +GVS +FL RRN L+R+IS++ N+YDR 
Sbjct: 142 AKNECTAAVGFKWMLNQGALAHGDQVADYFRSRGVSVVFLQRRNYLKRLISVMGNAYDR- 200

Query: 164 AKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFY 223
           AK LNGTH +HVHS  EA++LA +KPTI+   L+  L++++  + + +  F +TR    Y
Sbjct: 201 AKPLNGTHVAHVHSREEAELLASFKPTIDVANLLQNLRRVQNMSDETLRIFGTTRLTALY 260

Query: 224 YEDLITNRATLKEVQ---EFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYE 280
           YEDL+ N   +  V     FL +  ++L S+ VKIH+ PL   + NWDDV +AL GT +E
Sbjct: 261 YEDLVKNPRKVMRVDVVLRFLGVPRQDLRSKHVKIHTRPLRDSITNWDDVYRALNGTEFE 320

Query: 281 RFLHS-DY 287
             LH  DY
Sbjct: 321 SLLHDRDY 328


>gi|302799659|ref|XP_002981588.1| hypothetical protein SELMODRAFT_114849 [Selaginella moellendorffii]
 gi|300150754|gb|EFJ17403.1| hypothetical protein SELMODRAFT_114849 [Selaginella moellendorffii]
          Length = 336

 Score =  280 bits (717), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 136/262 (51%), Positives = 174/262 (66%), Gaps = 14/262 (5%)

Query: 38  HYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINI 84
           HYP P T++R ECAC PV  F ILS QRSGSGWFETLLN             ++ RR N 
Sbjct: 73  HYPRPKTYSRGECACTPVHNFVILSTQRSGSGWFETLLNSHPNITSHGEIFSVRPRRHNF 132

Query: 85  SSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLI 144
           S +  T+D +YNLDW  SASKN C++ VGFKWM+NQG ME+ ++++ YFK++GVS I LI
Sbjct: 133 SIVSHTLDALYNLDWHNSASKNSCTSVVGFKWMLNQGAMEYRSEVSSYFKRRGVSVILLI 192

Query: 145 RRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIE 204
           RRN L+R+ISILAN+YDR  + +NGTH SHVHS   A +LA YKPT++   L   L + +
Sbjct: 193 RRNGLKRLISILANAYDRR-QPMNGTHVSHVHSEEAADLLASYKPTVDVAFLKNNLDRYD 251

Query: 205 QTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQV 264
                A+  F+ TR +V YYED + NR   + VQEFL +    L SRQ KIH GPLS+ +
Sbjct: 252 LMIKNALRAFEGTRLLVIYYEDFVKNRKMQRTVQEFLSVPVMRLRSRQWKIHRGPLSQSI 311

Query: 265 ENWDDVQKALKGTSYERFLHSD 286
            NW +V + L GTSY+ FL  D
Sbjct: 312 TNWKEVFRELNGTSYQGFLADD 333


>gi|14029013|gb|AAK52554.1|AC079853_7 Unknown protein [Oryza sativa Japonica Group]
          Length = 332

 Score =  279 bits (714), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 142/302 (47%), Positives = 187/302 (61%), Gaps = 65/302 (21%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           MC KQ    + ++++ I+V +Q C  + I  SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 83  MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 142

Query: 61  LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           +S QRSGSGWFETLLN              K RR NISSI  T+D+VYNLDW +SASKNE
Sbjct: 143 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 202

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL R NLLR+++S LAN++DR  K L
Sbjct: 203 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRWNLLRQLVSQLANNHDRYLKQL 262

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
           NGTHK+HVH+                                              YE L
Sbjct: 263 NGTHKAHVHTA---------------------------------------------YEKL 277

Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
           +       +V +FL++   +L SR VKIH+ PLS+Q+ENWD+V  AL GT YE FL++DY
Sbjct: 278 L-------DVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 330

Query: 288 RV 289
           R+
Sbjct: 331 RI 332


>gi|302759563|ref|XP_002963204.1| hypothetical protein SELMODRAFT_80417 [Selaginella moellendorffii]
 gi|300168472|gb|EFJ35075.1| hypothetical protein SELMODRAFT_80417 [Selaginella moellendorffii]
          Length = 336

 Score =  279 bits (714), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 137/262 (52%), Positives = 173/262 (66%), Gaps = 14/262 (5%)

Query: 38  HYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINI 84
           HYP P T++R ECAC PV  F ILS QRSGSGWFETLLN             ++ RR N 
Sbjct: 73  HYPRPKTYSRGECACTPVHNFVILSTQRSGSGWFETLLNSHPNITSHGEIFSVRPRRHNF 132

Query: 85  SSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLI 144
           S +  T+D +YNLDW  SASKN C++ VGFKWM+NQG ME+ ++++ YFK++GVS I LI
Sbjct: 133 SIVSHTLDALYNLDWHNSASKNSCTSVVGFKWMLNQGAMEYRSEVSSYFKRRGVSVILLI 192

Query: 145 RRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIE 204
           RRN L+R+ISILAN+YDR    +NGTH SHVHS   A +LA YKPT++   L   L + +
Sbjct: 193 RRNGLKRLISILANAYDRRQP-MNGTHVSHVHSEEAADLLASYKPTVDVAFLKNNLDRYD 251

Query: 205 QTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQV 264
                A+  F+ TR +V YYED + NR   + VQEFL +    L SRQ KIH GPLS+ V
Sbjct: 252 LMIKNALCAFEGTRLLVIYYEDFVKNRKMQRTVQEFLSVPVMRLRSRQWKIHRGPLSQSV 311

Query: 265 ENWDDVQKALKGTSYERFLHSD 286
            NW +V + L GTSY+ FL  D
Sbjct: 312 TNWKEVFRELNGTSYQGFLADD 333


>gi|388496630|gb|AFK36381.1| unknown [Lotus japonicus]
          Length = 251

 Score =  278 bits (710), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 128/209 (61%), Positives = 154/209 (73%), Gaps = 13/209 (6%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ    +K  LL+IKVI  PC   NI+  EIP++HYPNP T++REEC+C+PVRYFAI
Sbjct: 43  ICLKQIGTSSKIGLLDIKVIQNPCPEPNIEPWEIPYVHYPNPKTYSREECSCHPVRYFAI 102

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQRSGSGWFET LN             +K RR N+S+I  T+D +YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHPNISSNGEIFSVKVRRSNVSTITETLDTIYNLDWLSSASKNE 162

Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
           C+ AVG KWM+NQGLM+HH  IAEYF+  GVS IFL RRNLLRRM+S+LAN YDRNAKLL
Sbjct: 163 CTTAVGLKWMLNQGLMQHHEQIAEYFRIHGVSVIFLFRRNLLRRMVSLLANEYDRNAKLL 222

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLL 196
           NGTHKSHVHSP E  +   +   + AT +
Sbjct: 223 NGTHKSHVHSPKEVSVCTLFCACMCATCV 251


>gi|238013712|gb|ACR37891.1| unknown [Zea mays]
          Length = 239

 Score =  269 bits (688), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 138/239 (57%), Positives = 177/239 (74%), Gaps = 13/239 (5%)

Query: 63  MQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECS 109
           MQRSGSGW ETLLN             +K RR NI++I  T+DR+YNLDW++SA+KNEC+
Sbjct: 1   MQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNITAITKTLDRLYNLDWYSSAAKNECT 60

Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
           AAVG KWM+NQGLM+HH +I EYF ++GVSAIFL+RRNLLRR +SILAN++D   K LNG
Sbjct: 61  AAVGLKWMLNQGLMKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNG 120

Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
           THK+HVHS  EA+ILA+YKPTI+   LI ELK+ ++  A A+  FK+TRH+V YYED+++
Sbjct: 121 THKAHVHSKHEAEILAQYKPTIDKKTLITELKRSDKLAADALVNFKNTRHVVLYYEDVVS 180

Query: 230 NRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
           NR  L +V +FLRL  R+L SR VKIH+  L   ++NW DV   LKGT +E FL+   R
Sbjct: 181 NRTMLMDVLDFLRLPKRKLFSRHVKIHTKRLCDHIDNWADVTDFLKGTPFESFLNGSRR 239


>gi|414869372|tpg|DAA47929.1| TPA: thylakoid lumenal protein, mRNA [Zea mays]
          Length = 166

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 94/166 (56%), Positives = 123/166 (74%)

Query: 123 MEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAK 182
           M+HH +I EYF ++GVSAIFL+RRNLLRR +SILAN++D   K LNGTHK+HVHS  EA+
Sbjct: 1   MKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAE 60

Query: 183 ILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLR 242
           ILA+YKPTI+   LI ELK+ ++  A A+  FK+TRH+V YYED+++NR  L +V +FLR
Sbjct: 61  ILAQYKPTIDKKTLITELKRSDKLAADALVNFKNTRHVVLYYEDVVSNRTMLMDVLDFLR 120

Query: 243 LAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
           L  R+L SR VKIH+  L   ++NW DV   LKGT +E FL+   R
Sbjct: 121 LPKRKLFSRHVKIHTKRLCDHIDNWADVTDFLKGTPFESFLNGSRR 166


>gi|4335739|gb|AAD17417.1| hypothetical protein [Arabidopsis thaliana]
          Length = 213

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 65/109 (59%), Positives = 87/109 (79%)

Query: 180 EAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQE 239
           +A+ILA+YKP IN +LLIP+LKQ+++ T+KA+ YF +TRHI  YYED++ NR  L +VQE
Sbjct: 104 QAEILARYKPLINTSLLIPDLKQVQEMTSKALAYFNTTRHIFLYYEDVVKNRTKLDDVQE 163

Query: 240 FLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
           FL++   +L SRQVKIH GPLS+ V+NW++VQK LKGT +E FL  DYR
Sbjct: 164 FLKVPKLDLKSRQVKIHHGPLSQHVQNWEEVQKTLKGTGFENFLLEDYR 212



 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 70/121 (57%), Gaps = 22/121 (18%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +  KQ         LN++V ++PC   NIQ  +IP++HYP P T++              
Sbjct: 7   LIPKQIGVVPSAGFLNVEVFERPCPEPNIQPWDIPYVHYPKPKTYS-------------- 52

Query: 61  LSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQ 120
                S +G    + ++K RR N+S+I  T+D+VYNLDW +SASKNEC++AVG KWM+NQ
Sbjct: 53  -----SSNG---EIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNECTSAVGLKWMLNQ 104

Query: 121 G 121
            
Sbjct: 105 A 105


>gi|4455187|emb|CAB36719.1| hypothetical protein [Arabidopsis thaliana]
 gi|7270392|emb|CAB80159.1| hypothetical protein [Arabidopsis thaliana]
          Length = 403

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 67/99 (67%), Positives = 82/99 (82%)

Query: 145 RRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIE 204
           R+NLLRRMIS+LANSYDR+AKLLNGTHKSH HS  EA+ILA YKP IN TLLI EL+QI+
Sbjct: 152 RKNLLRRMISVLANSYDRDAKLLNGTHKSHTHSAKEAEILAGYKPMINTTLLINELRQIQ 211

Query: 205 QTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRL 243
           + T KA+ YF +TRHI+ YYED++ N   L +VQEFL++
Sbjct: 212 EMTLKALTYFNTTRHILVYYEDVVKNLTRLDDVQEFLKV 250



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 14  LLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNRE 48
            LN++V ++PC   NI+  +IPF+HYP P T+NR+
Sbjct: 119 FLNVEVFERPCPEPNIEPWDIPFVHYPKPKTYNRK 153


>gi|357464733|ref|XP_003602648.1| Kinase-like protein [Medicago truncatula]
 gi|355491696|gb|AES72899.1| Kinase-like protein [Medicago truncatula]
          Length = 403

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 63/130 (48%), Positives = 79/130 (60%), Gaps = 23/130 (17%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C KQ S  ++   L+I VI  PC   NI+  EIP++          EEC C+P+RYF I
Sbjct: 162 ICLKQISTGSRIGFLDINVIQMPCPGPNIEPWEIPYV----------EECRCHPLRYFTI 211

Query: 61  LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
           LSMQR GSGW ET LN             +K RR NI++I  T+D +YNLDWF SASKNE
Sbjct: 212 LSMQRFGSGWHETFLNSHPNISSNGEIFSVKVRRSNITTITETLDTIYNLDWFNSASKNE 271

Query: 108 CSAAVGFKWM 117
           C+AAVG  W+
Sbjct: 272 CTAAVGLAWL 281


>gi|242034033|ref|XP_002464411.1| hypothetical protein SORBIDRAFT_01g017756 [Sorghum bicolor]
 gi|241918265|gb|EER91409.1| hypothetical protein SORBIDRAFT_01g017756 [Sorghum bicolor]
          Length = 146

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 88/142 (61%), Gaps = 3/142 (2%)

Query: 142 FLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELK 201
           F+ RRN LR++IS  AN YD +AK LN TH+S VHS  + KI+ K KP ++++ +I    
Sbjct: 1   FVFRRNTLRKLISFTANDYDNDAKQLNATHRSRVHSKEKPKIITKLKPHLDSSTMIT--- 57

Query: 202 QIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLS 261
            IE+     +++  STRH++ YYED+I N   L  VQEFLR+  R L S+QVK+H  PL 
Sbjct: 58  NIEKAIRDCLDHLNSTRHMILYYEDIINNSNALSWVQEFLRVPVRRLMSKQVKVHMRPLP 117

Query: 262 KQVENWDDVQKALKGTSYERFL 283
             ++N + V   L  T +  FL
Sbjct: 118 DLIKNREKVSIKLNRTEFAHFL 139


>gi|255641735|gb|ACU21138.1| unknown [Glycine max]
          Length = 144

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 54/86 (62%), Gaps = 4/86 (4%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
           +C  Q S   +T  ++ KVID   QS  ++      LHYP P +F+R EC  NPV +FAI
Sbjct: 43  VCLMQISTQARTTFMDFKVIDNHSQSI-LKLMNTHLLHYPKPASFSRNECVHNPVLFFAI 101

Query: 61  LSMQRSGSGWFETLLNIKHRRINISS 86
           LS QRSGSGWFETLLN     IN+SS
Sbjct: 102 LSNQRSGSGWFETLLN---SHINVSS 124


>gi|367060058|gb|AEX10990.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060060|gb|AEX10991.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060064|gb|AEX10993.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060066|gb|AEX10994.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060068|gb|AEX10995.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060070|gb|AEX10996.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060072|gb|AEX10997.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060074|gb|AEX10998.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060076|gb|AEX10999.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060078|gb|AEX11000.1| hypothetical protein 0_11079_01 [Pinus taeda]
 gi|367060080|gb|AEX11001.1| hypothetical protein 0_11079_01 [Pinus taeda]
          Length = 72

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/72 (55%), Positives = 49/72 (68%), Gaps = 1/72 (1%)

Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
           FKSTRH V YYE+ I+    + E+ +FL L  RELTSR VKIH+ PLS+ V NW +V   
Sbjct: 1   FKSTRHTVIYYEE-ISKPKKIMEILKFLGLKPRELTSRHVKIHTKPLSEHVHNWQEVNNR 59

Query: 274 LKGTSYERFLHS 285
           LKGT +E FLH 
Sbjct: 60  LKGTEFEVFLHD 71


>gi|367060062|gb|AEX10992.1| hypothetical protein 0_11079_01 [Pinus taeda]
          Length = 72

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 39/72 (54%), Positives = 48/72 (66%), Gaps = 1/72 (1%)

Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
           FKSTRH V YYE+ I+    + E+ +FL L  RELTSR VKIH+ PLS+ V NW +V   
Sbjct: 1   FKSTRHTVIYYEE-ISKPKKIMEILKFLGLKPRELTSRHVKIHTKPLSEHVHNWQEVNNR 59

Query: 274 LKGTSYERFLHS 285
           LKGT +E  LH 
Sbjct: 60  LKGTEFEVLLHD 71


>gi|323449148|gb|EGB05038.1| hypothetical protein AURANDRAFT_66664 [Aureococcus anophagefferens]
          Length = 837

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 82/180 (45%), Gaps = 21/180 (11%)

Query: 68  SGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
           +G    LLN   R  ++ +    M+ V   D + S  +   +AAVGFKWM NQG   HH 
Sbjct: 406 AGEMRDLLN---RVGDVRASWEHMEAVIEAD-YRSLCEGRQAAAVGFKWMTNQGHSIHHA 461

Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKIL-AK 186
            I E  ++ G S I+L RRNLLRR IS  AN         N    +  H   EA++  A+
Sbjct: 462 RIVE--RRTGTSIIYLFRRNLLRREISNAAN---------NQLGDAQAHPKTEAELASAR 510

Query: 187 YKPTI-NATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN----RATLKEVQEFL 241
              T+     L+  + +  +  A  + Y+     +   YEDL+       A   EV EFL
Sbjct: 511 GNVTLFEGVKLVSVIVKRLEARATVVGYYADIPSLFLAYEDLVAGSSDAEARWAEVFEFL 570


>gi|323453203|gb|EGB09075.1| hypothetical protein AURANDRAFT_63694 [Aureococcus anophagefferens]
          Length = 884

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 53/173 (30%), Positives = 80/173 (46%), Gaps = 17/173 (9%)

Query: 113 GFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHK 172
           GFKWM NQG  E  + I +Y   +GV  ++L RRN+LR++IS LAN      + +   H 
Sbjct: 676 GFKWMTNQGHDERRHAIKKYMLNRGVKLVYLWRRNVLRQLISNLANR-----RTVGLAHP 730

Query: 173 SHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK-STRHIVFYYEDLITNR 231
                 A+A +  +     +   LI  LK+IE+   +   Y+    +    +YEDLI   
Sbjct: 731 ESREDAAKADVDLELPAGGD---LIHALKKIERQRRRVRSYYHPEVKETAVFYEDLIAGS 787

Query: 232 ATLKE----VQEFLRL---AYRELTSRQVKIH-SGPLSKQVENWDDVQKALKG 276
               +    V +FL     A+    +  V IH S P    V N D+V+  L+ 
Sbjct: 788 PRYNDSWARVVKFLGAAPHAFEPSAADTVIIHQSRPTLASVRNADEVRATLEA 840


>gi|323451151|gb|EGB07029.1| hypothetical protein AURANDRAFT_71905 [Aureococcus anophagefferens]
          Length = 1695

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 48/203 (23%), Positives = 86/203 (42%), Gaps = 24/203 (11%)

Query: 103  ASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDR 162
            A++   + A GFKWM++Q    H     E  +  GV  +FL+RRN  R+++S L N+ D+
Sbjct: 1214 AARGGYAVATGFKWMLSQRAGRHWPWFVELCRDFGVRLVFLLRRNAARQLVSRLLNAEDK 1273

Query: 163  NAKLLNG-THKSHVHSPAEAKILAKYKPTINATLLIPELK-------QIEQTTAKAIEYF 214
                  G  H +H +S +    L   + T     ++ +L+       ++E+    A+   
Sbjct: 1274 ARAQAGGLRHSAHPNSESRLDELRSRRVTFRRDQVLEQLQDMREKWDELERLRLYALARG 1333

Query: 215  KSTRHIVFYYEDLITNRATLKEVQEFL--------------RLAYRELTSRQVKIHSGPL 260
              +  +V  YED+  + + +  + +FL                A R       KIH+  L
Sbjct: 1334 VPSARVV--YEDVDADHSLVGNLSDFLLADVGAEERANCAAAFASRPPPQGHEKIHTSSL 1391

Query: 261  SKQVENWDDVQKALKGTSYERFL 283
               + N+ +V   L  T +   L
Sbjct: 1392 EDLIVNFKEVWHLLHNTPWRHCL 1414


>gi|440714456|ref|ZP_20895035.1| hypothetical protein RBSWK_02092 [Rhodopirellula baltica SWK14]
 gi|436440652|gb|ELP33956.1| hypothetical protein RBSWK_02092 [Rhodopirellula baltica SWK14]
          Length = 273

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 87/181 (48%), Gaps = 16/181 (8%)

Query: 109 SAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLN 168
           S AV F+ M NQ  ++  + +    + +G+  I L+R +LLR+ +S     +  N   L+
Sbjct: 95  SGAVAFRGMYNQ--LQRKSIVNHLLQVQGLKIIHLMRDDLLRQYVSRKQMHHRYN---LH 149

Query: 169 GTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRH--IVFYYED 226
           G   +H   P +   +      I+    + E+  + ++  K  + F+   +  +  +YE 
Sbjct: 150 GKGSAHTSKPIKLNAIP-----ISPAAALAEMTFMARSREKLHKAFEDNGNEILTLHYER 204

Query: 227 LI----TNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERF 282
           L+     + A  K + EFL +   E+TS  VK+    L   VEN+DD++ A++ + +  F
Sbjct: 205 LLGGGMVDEADRKTLCEFLSVRDVEMTSDLVKMSQARLEDMVENFDDLRTAVQASEFADF 264

Query: 283 L 283
           L
Sbjct: 265 L 265


>gi|413955390|gb|AFW88039.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 113

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 30/47 (63%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNR 47
           MC KQ      ++++ I+V++QPC  + +  SE+ F HYP P T++R
Sbjct: 59  MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSR 105


>gi|413955389|gb|AFW88038.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
          Length = 105

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 30/47 (63%)

Query: 1   MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNR 47
           MC KQ      ++++ I+V++QPC  + +  SE+ F HYP P T++R
Sbjct: 59  MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSR 105


>gi|323452062|gb|EGB07937.1| hypothetical protein AURANDRAFT_71728 [Aureococcus anophagefferens]
          Length = 1699

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 40/123 (32%), Positives = 60/123 (48%), Gaps = 19/123 (15%)

Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
           A+VGFK   NQG   HH  I E  ++ G + ++L RRNLLR  IS  AN+  ++A     
Sbjct: 18  ASVGFK--TNQGHSVHHARIIE--RRTGTTILYLFRRNLLRHEISKNANTQLKDA----- 68

Query: 170 THKSHVHSPAEAKILAKYKPTIN----ATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYE 225
                +  P  A  LA  +  +     A L+   +K++E   AK + Y+     +   YE
Sbjct: 69  -----LAHPKTAAELASARGNVTLFEGAELVSDMVKRLE-ARAKVVGYYAGIPSLFLAYE 122

Query: 226 DLI 228
           DL+
Sbjct: 123 DLV 125


>gi|254422696|ref|ZP_05036414.1| hypothetical protein S7335_2848 [Synechococcus sp. PCC 7335]
 gi|196190185|gb|EDX85149.1| hypothetical protein S7335_2848 [Synechococcus sp. PCC 7335]
          Length = 245

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 71/161 (44%), Gaps = 7/161 (4%)

Query: 127 NDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAK 186
           N   +   + G   I+L RR+LLR  I+ L      + +  +  +K++        + + 
Sbjct: 82  NQFLQDLHRGGCRVIYLQRRDLLRHAIATLKAD---DIQFHSSLNKTNAFGSPSNWVHSN 138

Query: 187 YKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNR----ATLKEVQEFLR 242
            + TI    L+  L  IE     A        H+   YED + +     AT + + EFL 
Sbjct: 139 RRVTIPVQALLDCLSYIEAQRVDAQAILHGIPHLSLTYEDDLIDPNVYPATAQRLSEFLE 198

Query: 243 LAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
           +   +LT R VK+    ++  V N+D+V++AL+ +     L
Sbjct: 199 IRSLKLTGRLVKLVHQDIADLVANYDEVRQALESSENAHLL 239


>gi|323454077|gb|EGB09947.1| hypothetical protein AURANDRAFT_62424 [Aureococcus anophagefferens]
          Length = 348

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 80/199 (40%), Gaps = 36/199 (18%)

Query: 111 AVGFKWMMNQGLMEHHNDIA-------EYFKKKGVSAIFLIRRNLLRRMISILANSYDRN 163
           A GFKWMM+Q     H D A       E    + +  +FL R NLLR   S +A +    
Sbjct: 106 ACGFKWMMSQ-----HVDRAWDAGWLPELCAARKLRVVFLERLNLLRVHASTVAKNL--- 157

Query: 164 AKLLNGTHKSHV-HSPAEAKILAKYKPTINATLLIPELKQIEQ-----TTAKAIEYFKST 217
           A    G H++    +P EA     Y    +   L+ +L+ I          +A    +  
Sbjct: 158 ASTSAGGHRNPTGEAPREA-----YIELASGDALVAKLEGIAAQYDGLRRVRAEAARRGV 212

Query: 218 RHIVFYYEDLITNRAT-LKEVQEFLR---------LAYRELTSRQVKIHSGPLSKQVENW 267
             +   YE L  +R   L  V  F+           A+    SRQ +IH+G LS  V NW
Sbjct: 213 ATLAVTYEALQADRGGGLDRVARFVAGGGACDVGAYAFAGNASRQTQIHAGKLSSYVSNW 272

Query: 268 DDVQKALKGTSYERFLHSD 286
             V   L+ T +  +L  D
Sbjct: 273 AAVVDTLRPTRFACYLDED 291


>gi|1531623|gb|AAB16900.1| NodH [Rhizobium sp. N33]
          Length = 248

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/244 (22%), Positives = 102/244 (41%), Gaps = 29/244 (11%)

Query: 58  FAILSMQRSGSGWFETLLNIKHRRINISSIVSTMD--------------RVYNLDWFTSA 103
           FAIL+M R+G+ + E LLN     ++   +++  D               +  L +    
Sbjct: 11  FAILAMPRTGTHYLEVLLNEHPNILSNGELLNEYDTNWPDKDRLLLGDRELLELAYVRCP 70

Query: 104 SKNECSAA-VGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDR 162
           + ++ +   VG K    Q   +H    AE  +  G+  I ++RRN+L  + S++     +
Sbjct: 71  TWSDKTVTHVGCKVNEPQ-FHDHPGFFAELARWPGLKVILVVRRNILESLRSLVQAR--Q 127

Query: 163 NAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVF 222
           + + L  +       P   ++     P  N        K  +   A+    F  +  +V 
Sbjct: 128 SGQWLKFSSDKVGAPPPRVRL-----PIANCEAY---FKTADAFYARVAHAFAPSNVLVI 179

Query: 223 YYEDLITNRAT-LKEVQEFLRLAYRELTSRQV--KIHSGPLSKQVENWDDVQKALKGTSY 279
            YE L+   AT L  V +FL +   +L+ R +  +  + PL + VEN+D+++       Y
Sbjct: 180 EYESLLQEPATCLGAVWDFLGVPGLQLSGRAILQRQEARPLDETVENFDELRLHFANGPY 239

Query: 280 ERFL 283
            RF 
Sbjct: 240 ARFF 243


>gi|292493296|ref|YP_003528735.1| hypothetical protein Nhal_3305 [Nitrosococcus halophilus Nc4]
 gi|291581891|gb|ADE16348.1| hypothetical protein Nhal_3305 [Nitrosococcus halophilus Nc4]
          Length = 243

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 79/191 (41%), Gaps = 30/191 (15%)

Query: 103 ASKNECSAAVGFKWMMNQ------GLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISIL 156
            ++N+    V FK M NQ        + HH DI           I L R NLL++ +S  
Sbjct: 72  CARNDAPVLV-FKSMYNQLNNKVCNFLTHHTDI---------RVIHLRRENLLKQYVS-- 119

Query: 157 ANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKS 216
                   K+L G  +     P   K +      I+ T  I E++++     +  +    
Sbjct: 120 --------KMLLGAKRERRWQPHTTKKVPGVSIHISPTAAIKEMQRVRNQFLEFEQLLSH 171

Query: 217 TRHIVFYYEDLITNRATLKE----VQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQK 272
              I   YE +I  ++   E    + E  +L    +    VKI+   L   V+N+D++  
Sbjct: 172 HHRIELVYETMINGQSLSNEAAEKICELFQLNPAPMHCDFVKINPNELELMVKNYDELAS 231

Query: 273 ALKGTSYERFL 283
           AL+GT +E+FL
Sbjct: 232 ALRGTEFEQFL 242


>gi|218441161|ref|YP_002379490.1| sulfotransferase [Cyanothece sp. PCC 7424]
 gi|218173889|gb|ACK72622.1| sulfotransferase, putative [Cyanothece sp. PCC 7424]
          Length = 278

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 79/175 (45%), Gaps = 14/175 (8%)

Query: 111 AVGFKWMMNQGLMEHHNDIAEYFK-KKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
           AVGFK        +  + I  Y K +K +  I L R N LR ++S+      R A   N 
Sbjct: 114 AVGFKLFYYHAQNDSRHLIWSYLKEQKTLKIIHLQRDNTLRELLSL------RKAFKTNK 167

Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
              +      E  I  +Y+  +       E    ++  AK  ++F+  + I   YE+L  
Sbjct: 168 WTNTDGMEEQEFSIKLEYEDCLQ------EFTHSQEIKAKYNKFFQDHQVINMIYENLSN 221

Query: 230 NRAT-LKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
           +  T LK++Q+FL + Y+ +     K    PL + + N+ ++++  +GT +E F 
Sbjct: 222 DYETELKKLQDFLEVDYKPVKPLTYKQSKQPLKEAISNYYELKQKFQGTPWEAFF 276


>gi|344339260|ref|ZP_08770189.1| Stf0 sulfotransferase [Thiocapsa marina 5811]
 gi|343800564|gb|EGV18509.1| Stf0 sulfotransferase [Thiocapsa marina 5811]
          Length = 248

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 63/269 (23%), Positives = 109/269 (40%), Gaps = 72/269 (26%)

Query: 58  FAILSMQRSGSGWFETLLN------------------------------------IKHRR 81
           F +LS QRSGS    T L+                                    + HRR
Sbjct: 6   FVVLSTQRSGSTLLRTALDSHPEISCRGELFLPSYSQGDSFRDFVSEKKLSRISELLHRR 65

Query: 82  INISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQ-GLMEHHNDIA-EYFKKKGVS 139
                + S +DR+Y+         +  + A GFK M  Q G   +   +  +Y +  GV 
Sbjct: 66  ---RLVYSYLDRLYD---------DTLAGARGFKIMYGQMGYRPYKFPMVFDYIRDYGVR 113

Query: 140 AIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKP---TINATLL 196
            I L+R N L   IS     + R +K  +   +              Y+P   T+N   L
Sbjct: 114 VIHLVRENTLDICIS---RQFARASKTYHAVEQ--------------YQPRSQTVNIPTL 156

Query: 197 IPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRA-TLKEVQEFLRL-AYRELTSRQVK 254
           I E++++E+   +     K  R +   YED + +R+ + + + EFL + +  EL S   +
Sbjct: 157 ILEMRKVEREKIRWRSAVKGLRCLDQTYEDFVNSRSESSRRILEFLGVESAIELVSPLKR 216

Query: 255 IHSGPLSKQVENWDDVQKALKGTSYERFL 283
           +    L+++V N++++ + +    Y RFL
Sbjct: 217 LSGAHLNERVLNYEEMVQEVVSHGYGRFL 245


>gi|323448004|gb|EGB03908.1| expressed protein [Aureococcus anophagefferens]
          Length = 359

 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 30/48 (62%)

Query: 111 AVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILAN 158
           A GFKWM+NQG+ E    +    + + +  +FL RR+LLR ++SI  N
Sbjct: 132 AYGFKWMVNQGMDELWGPLLATARARRLRVVFLYRRDLLRMLVSIAHN 179


>gi|307154289|ref|YP_003889673.1| hypothetical protein Cyan7822_4488 [Cyanothece sp. PCC 7822]
 gi|306984517|gb|ADN16398.1| hypothetical protein Cyan7822_4488 [Cyanothece sp. PCC 7822]
          Length = 261

 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 63/268 (23%), Positives = 106/268 (39%), Gaps = 58/268 (21%)

Query: 60  ILSMQRSGSGWFETLLNIKHRRINISSIV---------------------STMDRVYNLD 98
           IL+  RSGS +   +LN      N   ++                     S +D +YN  
Sbjct: 6   ILTNGRSGSNYISGVLNSHPHITNYGEVLGEWTVPYMIYDKFFKYRMSSESYLDYMYNSK 65

Query: 99  WFTSAS----------KNECSAAVGFKWMMNQGLMEH-----HNDIAEYF-KKKGVSAIF 142
           +F +A+          + E      +K + + G+ E        +I  Y   KK +  I 
Sbjct: 66  FFFNAAQIYSAYSKIKRKEKINFKSYKQIKSLGIKEFSINFFRRNIDNYLASKKNIFVIN 125

Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
           L R N L R++SI A               +     +  K L K K  +N   ++  L  
Sbjct: 126 LYRENSLARLVSIEA--------------MTTTGIVSTQKKLGKIKIYLNPEGILERLNI 171

Query: 203 IEQTTAKAIEYFKSTRHIVFY---YEDLI----TNRATLKEVQEFLRLAYRELTSRQVKI 255
            E+      +  ++      +   YE+      + R   K++ EFL++   ++T  Q KI
Sbjct: 172 FEKEKKDQFDLIENLEQKFIFNISYEEYFASPESQRELTKKIFEFLKVEPIDVTLNQKKI 231

Query: 256 HSGPLSKQVENWDDVQKALKGTSYERFL 283
            S  LS  VEN++++   LKGT YE++L
Sbjct: 232 LSRKLSNTVENYEEIVSILKGTQYEQYL 259


>gi|434399105|ref|YP_007133109.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
 gi|428270202|gb|AFZ36143.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
          Length = 286

 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 1/85 (1%)

Query: 200 LKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATL-KEVQEFLRLAYRELTSRQVKIHSG 258
            ++I+    K    F     I  YYEDL  N   + K++Q FL +  +EL +   KI   
Sbjct: 200 FEEIQSHQNKYTNLFNGYPMIDIYYEDLCINSNDVTKQIQNFLGIEVQELKTATKKIIQS 259

Query: 259 PLSKQVENWDDVQKALKGTSYERFL 283
           PLS Q+ N+ ++++    T YE + 
Sbjct: 260 PLSSQINNYYELKEKFINTPYEEYF 284


>gi|153875891|ref|ZP_02003487.1| conserved hypothetical protein [Beggiatoa sp. PS]
 gi|152067645|gb|EDN66513.1| conserved hypothetical protein [Beggiatoa sp. PS]
          Length = 243

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 55/244 (22%), Positives = 99/244 (40%), Gaps = 29/244 (11%)

Query: 54  PVRYFAILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDW--------FTSASK 105
           P + F +    R+GS     L+N  H  +     +    RV  + +        +  A K
Sbjct: 14  PNKKFVMFFRPRAGSTLLCDLMN-SHPDVCCDWEIFGAGRVGKVSFPKLYLKGRYAIAEK 72

Query: 106 NECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAK 165
           N     V  K + +Q   E     ++ F ++G   I++ R N LR+ +S L         
Sbjct: 73  NVYGFKVNIKQIRDQKF-EAQTFFSD-FDQQGWKIIYIRRENSLRQAVSFLI-------- 122

Query: 166 LLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYE 225
                H+S         +  K    +N   L+ ++ +IEQ   K +E       +   YE
Sbjct: 123 ---AEHRSEWIGKPNNTLTGKV--PVNPEELVSKIGRIEQAVQKEVELLAPYSTLNVIYE 177

Query: 226 DLITN----RATLKEVQEFLRLAYRELTSRQVKIH-SGPLSKQVENWDDVQKALKGTSYE 280
           + + N    + TL +V +FL +    + ++  K   S  LS  +EN+++V++ +  T Y 
Sbjct: 178 NDLLNAVQQQITLDKVFDFLGIDSVPIKTKMAKTAASSNLSDIIENYEEVERVISQTPYA 237

Query: 281 RFLH 284
            FL 
Sbjct: 238 HFLE 241


>gi|128469|sp|P06237.1|NOH4_RHIME RecName: Full=Nodulation protein H; AltName: Full=Host-specificity
           of nodulation protein D
 gi|152243|gb|AAA26291.1| host-specificity of modulation protein D [Sinorhizobium meliloti]
          Length = 247

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 58/248 (23%), Positives = 98/248 (39%), Gaps = 30/248 (12%)

Query: 54  PVRYFAILSMQRSGSGWFETLLNIKHRRINISSIVSTMD-------RVYNLD-------- 98
           P R FAIL+M+R+G+ + E L+N     ++   +++T D       R+   D        
Sbjct: 7   PPRPFAILAMRRTGTHYLEELVNEHPNVLSNGELLNTYDTNWPDKERLLLSDRELLERAC 66

Query: 99  WFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILAN 158
           W      ++    VG K +      E  +  AE     G+  I +IRRN L  + S +  
Sbjct: 67  WRYPPHSDKKVTHVGCK-INEPQFQERPSFFAELTAWPGLKVILVIRRNTLESLRSFVQA 125

Query: 159 SYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTR 218
              R         KS   +P    +L     T  A       K  +   A+ +  F S+R
Sbjct: 126 RQTRQWLQF----KSDSSAPPPPVMLPFA--TCEAY-----FKAADDFHARVVNAFDSSR 174

Query: 219 HIVFYYEDLITNRAT-LKEVQEFLRLAYRELTSRQV--KIHSGPLSKQVENWDDVQKALK 275
             +  YE L+ +    +  V +FL     +L  R +  +  + PL + V N+ +++    
Sbjct: 175 IRLIEYERLLRDPVPCVATVLDFLGAPALQLADRGILRRQETRPLDQTVRNFHELRVHFA 234

Query: 276 GTSYERFL 283
              Y RF 
Sbjct: 235 NGPYARFF 242


>gi|410861292|ref|YP_006976526.1| hypothetical protein amad1_08300 [Alteromonas macleodii AltDE1]
 gi|410818554|gb|AFV85171.1| hypothetical protein amad1_08300 [Alteromonas macleodii AltDE1]
          Length = 264

 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 82/167 (49%), Gaps = 22/167 (13%)

Query: 127 NDIAEYF-KKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILA 185
           +++  YF   K +S I L R N+L+R+IS +              H++ V S  +     
Sbjct: 111 DNLYNYFLDSKEISVIHLKRSNILKRLISAML------------MHQAGVVSTTDNSTEN 158

Query: 186 KYKPTINATLLIPELKQI----EQTTAKAIEYFKSTRHIVFYYEDLITN-RATLK---EV 237
           K    I+   L+ +LK+     E+  A A +  K        YED  ++  +T K   ++
Sbjct: 159 K-SFEIDIEQLLAKLKRDYELNEEENAFAAKLSKKQPFYQLSYEDYFSSPESTSKHNSQI 217

Query: 238 QEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH 284
            +FL +   ++ S+  K+ S  L+  + N+DD+++ALK T +E++L+
Sbjct: 218 FDFLGVESTQVKSKHRKVLSQNLADIITNYDDIERALKTTKFEQYLY 264


>gi|113477543|ref|YP_723604.1| hypothetical protein Tery_4123 [Trichodesmium erythraeum IMS101]
 gi|110168591|gb|ABG53131.1| hypothetical protein Tery_4123 [Trichodesmium erythraeum IMS101]
          Length = 184

 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 72/155 (46%), Gaps = 23/155 (14%)

Query: 137 GVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLL 196
           G   I+L R NL+   +S      + NA L    H    H   E ++  +YKP +   + 
Sbjct: 37  GYKFIYLTRSNLVAHALS------NINASLKQKFH----HRSCEGEL--EYKPIL---VE 81

Query: 197 IPELKQIEQTTAKAIEY----FKSTRHIVFYYEDLITNRA----TLKEVQEFLRLAYREL 248
           I E+ Q  Q + +  EY     K   HI   YE+ + +R     T+ +V E   L    +
Sbjct: 82  IEEVFQRIQYSEELGEYEKKLLKKIPHISITYEENLLDRECHQKTVDKVLELFNLPSNPV 141

Query: 249 TSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
            +  VK+    LSK VEN++++ +A+K + Y  F+
Sbjct: 142 KTNLVKLMPLDLSKMVENYEELIQAIKKSKYAHFI 176


>gi|434400774|ref|YP_007134778.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
 gi|428271871|gb|AFZ37812.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
          Length = 294

 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 23/182 (12%)

Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMIS-ILANSYDRNAKLL 167
           +AVGFK   +Q       +I  Y +K   V  I L R+N+L   +S  LA     N  +L
Sbjct: 127 SAVGFKLFYDQAKRGQQKNIWGYLQKMSDVKIIHLKRKNILEAHVSHKLAER--NNQWIL 184

Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY---FKSTRHIV--F 222
               K  +  P    I   Y   + A          EQT     EY   F   +H +   
Sbjct: 185 LDNQKELMLEP----IALDYDECLQA---------FEQTKIWETEYEQFFAKYKHPIQTI 231

Query: 223 YYEDLITNRATL-KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYER 281
           YYE+L++N   + +E+Q FL + Y  L +   K +   L  ++ N+D+++     T + +
Sbjct: 232 YYEELVSNTLLITRELQNFLEVNYWFLNTYTKKQNQVSLRDKIINYDELKDKFSKTIWSK 291

Query: 282 FL 283
           F 
Sbjct: 292 FF 293


>gi|359459323|ref|ZP_09247886.1| sulfotransferase [Acaryochloris sp. CCMEE 5410]
          Length = 256

 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 44/186 (23%), Positives = 77/186 (41%), Gaps = 43/186 (23%)

Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMISILANSYDRNAKLLN 168
           + VGFK   N     H N + +Y K+ K ++ I L R+NLL+                  
Sbjct: 91  SGVGFKLFYNHARSSHENQVWDYLKESKDLNVIHLKRKNLLK------------------ 132

Query: 169 GTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYF-----KSTRHIVFY 223
            TH S   +      + K  P+        E+  IE      +E+F     +   H  F+
Sbjct: 133 -THLSEQVALKTQIWVGKKNPS--------EIYPIELDYNNTLEFFNHMKEQEEEHHQFF 183

Query: 224 ---------YEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
                    YEDL+ +     +++Q FL++ Y+ L     K  S PLS+ + N+ ++++ 
Sbjct: 184 SNHNIFEVNYEDLVKDIEFYSRKLQAFLKIPYKSLYVSTPKQSSIPLSQAISNFRELKER 243

Query: 274 LKGTSY 279
             G+ +
Sbjct: 244 FNGSPW 249


>gi|399125168|pdb|4FLC|A Chain A, Structural And Biochemical Characterization Of Human
           Adenylosuccinate Lyase (Adsl) And The R303c Adsl
           Deficiency Associated Mutation
 gi|399125169|pdb|4FLC|B Chain B, Structural And Biochemical Characterization Of Human
           Adenylosuccinate Lyase (Adsl) And The R303c Adsl
           Deficiency Associated Mutation
 gi|399125170|pdb|4FLC|C Chain C, Structural And Biochemical Characterization Of Human
           Adenylosuccinate Lyase (Adsl) And The R303c Adsl
           Deficiency Associated Mutation
 gi|399125171|pdb|4FLC|D Chain D, Structural And Biochemical Characterization Of Human
           Adenylosuccinate Lyase (Adsl) And The R303c Adsl
           Deficiency Associated Mutation
          Length = 487

 Score = 42.0 bits (97), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 9/87 (10%)

Query: 14  LLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSM---QRSGSGW 70
           L N+K +++P +   I  S +P+    NP+   R EC C+  R+   L M   Q +   W
Sbjct: 275 LANLKEMEEPFEKQQIGSSAMPYKR--NPM---RSECCCSLARHLMTLVMDPLQTASVQW 329

Query: 71  FE-TLLNIKHRRINISSIVSTMDRVYN 96
           FE TL +  +RRI ++    T D + N
Sbjct: 330 FERTLDDSANRRICLAEAFLTADTILN 356


>gi|323452209|gb|EGB08084.1| hypothetical protein AURANDRAFT_64396 [Aureococcus anophagefferens]
          Length = 1060

 Score = 41.6 bits (96), Expect = 0.43,   Method: Composition-based stats.
 Identities = 58/255 (22%), Positives = 100/255 (39%), Gaps = 41/255 (16%)

Query: 65  RSGSGWFETLLN------IKHRRINISS----------------------IVSTMDRVYN 96
           RSGS W  +LLN      +   R+++ +                      + +  +   N
Sbjct: 452 RSGSDWLMSLLNAHPAICMPAGRVDVPAGHDPTVLVSKVRAREAARPGRNVTAAFEASAN 511

Query: 97  LDWFTSASKNECSAAVGFK---WMMN-QGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRM 152
                +A  N   AA G+K   W+        H  + A +  ++GV  + L+RR  + R+
Sbjct: 512 ARVAIAARDNGRCAAFGWKQGLWLAKGYATAAHRAEFAAWVARRGVKLV-LLRRVGVARV 570

Query: 153 ISILANSYDRNAKLLNGTHKSHVH------SPAEAKILAKYKPTINATLLIPELKQIEQT 206
           +S   N    NA LL      +VH      + A AK+  +      A  L  + ++ +  
Sbjct: 571 VSGAKNRLTPNATLLGAEASRNVHCTSTRCAAAVAKLRVRLDAETLAGTLASQKREWDAV 630

Query: 207 TAKAIEYFKSTRHIVFYYEDLITNRAT-LKEVQEFLRLAYRELTSRQVKIHSG-PLSKQV 264
            A A       R     Y++L+ +    L E+ +FL +       R   + SG   +  +
Sbjct: 631 EAWAAAAAPPPRAFRVTYDELVADTPRWLGELYDFLGVGPPPKPPRTAYVKSGGRAADSI 690

Query: 265 ENWDDVQKALKGTSY 279
           EN D+V+ AL GT +
Sbjct: 691 ENLDEVRAALAGTEW 705


>gi|434399106|ref|YP_007133110.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
 gi|428270203|gb|AFZ36144.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
          Length = 310

 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 42/189 (22%), Positives = 78/189 (41%), Gaps = 27/189 (14%)

Query: 99  WFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMISILA 157
           W+    K  C       W  NQ       ++ +Y K+ K ++ I L R NLL   +S + 
Sbjct: 145 WWKDVGKEPC-------WNSNQS------NLWQYLKENKDIAIIHLKRNNLLEAKVSGIT 191

Query: 158 NSYDRNAKL--LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
                N  +    G +K++V    E          +N    + + + + +   +A E+F 
Sbjct: 192 AQTTGNWGIGATGGINKNNVLVKCE----------LNFEECMQDFEALRRMEDEADEFFA 241

Query: 216 STRHIVFYYEDLITNRATL-KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKAL 274
             R +V  YE+L+ N A +  ++Q FL L    L ++  K  +  L   ++N+  ++   
Sbjct: 242 EHRKLVITYEELVENTANITNKIQVFLGLKTENLVTQSKKQATCSLLDVIDNYYQLKSQF 301

Query: 275 KGTSYERFL 283
             T +  F 
Sbjct: 302 TNTRWSNFF 310


>gi|443320647|ref|ZP_21049735.1| hypothetical protein GLO73106DRAFT_00031410 [Gloeocapsa sp. PCC
           73106]
 gi|442789646|gb|ELR99291.1| hypothetical protein GLO73106DRAFT_00031410 [Gloeocapsa sp. PCC
           73106]
          Length = 264

 Score = 40.4 bits (93), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 72/172 (41%), Gaps = 36/172 (20%)

Query: 129 IAEYFKKKGVSAIFLIRRNLLRRMIS--------ILANSYDRNAKLLNGTHKSHVHSPAE 180
           I +Y K + +  IFL R NLL+R++S        ++A + DR   +    +   +H P +
Sbjct: 110 IPDYLKARDILIIFLYRENLLKRVLSRLNLRKTRVIAITDDR---IQEKNNIEKIHVPID 166

Query: 181 AKILAKYKPTINATLLIPEL-----KQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLK 235
                          LIPEL     +Q EQ     IE     R +   +E    +     
Sbjct: 167 E--------------LIPELDVFNEEQKEQLMM--IEGIPEERILRITFEQYFASSDNQN 210

Query: 236 E----VQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
           E    + +FL +    L S   KI    L   +EN+++V + LK T Y +FL
Sbjct: 211 EYNNRIFDFLNVDKLTLKSSHKKILPNKLVDIIENYEEVFEVLKNTKYNKFL 262


>gi|254416257|ref|ZP_05030011.1| hypothetical protein MC7420_5093 [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196176939|gb|EDX71949.1| hypothetical protein MC7420_5093 [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 272

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 69/146 (47%), Gaps = 17/146 (11%)

Query: 141 IFLIRRNLLRRMISILANSYDRNAKLLNGTHKS-------HVHSPAEAKILAKYKPTINA 193
           I L R+N LRR+IS +  S  +  + LN   K+        +H  A   I+   KP IN 
Sbjct: 96  IVLERKNYLRRIISAILGSQTK-IRHLNLKSKAKLTRTRVDLHQVAFG-IVYNPKPLINC 153

Query: 194 TLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED--LITNRATLKEVQEFLRLAYRELTSR 251
                 L++I+QT  +     +    +   YED  L   R   +++ EF+++    +   
Sbjct: 154 ------LEEIDQTYRQLESLLEDKSSLWLTYEDDILPDPRIAYRKICEFIQVEPMAVDIL 207

Query: 252 QVKIHSGPLSKQVENWDDVQKALKGT 277
             + +  P+ + VEN+D+++  LKGT
Sbjct: 208 NQRTNPFPICEIVENYDELEAMLKGT 233


>gi|333892195|ref|YP_004466070.1| hypothetical protein ambt_03590 [Alteromonas sp. SN2]
 gi|332992213|gb|AEF02268.1| hypothetical protein ambt_03590 [Alteromonas sp. SN2]
          Length = 207

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 16/183 (8%)

Query: 95  YNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMIS 154
           Y  + F+ A  +    + GFK M +Q   + +  +  Y  +     I L R+NLL R IS
Sbjct: 27  YFTELFSMARSSPKCRSAGFKLMYSQE--KRNRGLWGYLSQNTDKIIHLKRKNLLNRAIS 84

Query: 155 ILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ-IEQTTA--KAI 211
            +          +  T  +H +  A     +K K +++   ++ ELK+ I ++++  K I
Sbjct: 85  KV---------YMESTGIAHANEVASGDT-SKPKGSLDIEKVLDELKKDISESSSFSKKI 134

Query: 212 EYFKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDV 270
                 R I  YYEDLI N   TLK++  FL + +   +S   K+ +      V N+  +
Sbjct: 135 TALPPERVITIYYEDLIENLDITLKKLYNFLEVPFHHYSSNFKKLSNEQPEDYVLNYHVL 194

Query: 271 QKA 273
           + A
Sbjct: 195 KAA 197


>gi|168187879|ref|ZP_02622514.1| GTP-binding protein LepA [Clostridium botulinum C str. Eklund]
 gi|169294262|gb|EDS76395.1| GTP-binding protein LepA [Clostridium botulinum C str. Eklund]
          Length = 601

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/130 (23%), Positives = 64/130 (49%), Gaps = 17/130 (13%)

Query: 103 ASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDR 162
           A + E S A+GF +      + H   I E  +++          +++    S++ N Y R
Sbjct: 330 AYEPETSIALGFGFRCGFLGLLHMEIIQERIERE-------FNLDIITTAPSVIYNIYKR 382

Query: 163 NAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIP--------ELKQIEQTTAKAIEYF 214
           + +++  T+ +++  P E +I+   +P + A+++ P        EL Q ++ T   +EY 
Sbjct: 383 DGEMMKITNPTNMPDPTEIEIME--EPVVKASIITPSDFVGAVMELCQNKRGTFIDMEYI 440

Query: 215 KSTRHIVFYY 224
           ++TR +V YY
Sbjct: 441 ETTRVVVNYY 450


>gi|74141487|dbj|BAE38524.1| unnamed protein product [Mus musculus]
          Length = 484

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 9/87 (10%)

Query: 14  LLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSM---QRSGSGW 70
           L N+K +++P +   I  S +P+    NP+   R E  C+  R+   L+M   Q +   W
Sbjct: 272 LANLKEMEEPFEKQQIGSSAMPYKR--NPM---RSERCCSLARHLMALTMDPLQTASVQW 326

Query: 71  FE-TLLNIKHRRINISSIVSTMDRVYN 96
           FE TL N  +RRI ++    T D + N
Sbjct: 327 FERTLDNSANRRICLAEAFLTADTILN 353


>gi|427416768|ref|ZP_18906951.1| Stf0 sulfotransferase [Leptolyngbya sp. PCC 7375]
 gi|425759481|gb|EKV00334.1| Stf0 sulfotransferase [Leptolyngbya sp. PCC 7375]
          Length = 256

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 26/237 (10%)

Query: 58  FAILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWM 117
           F I    RSGS    +LLN          I++    ++   +  + + N  ++  G K +
Sbjct: 26  FIIFGRGRSGSTALVSLLNCLPDVCCDGEILA-QPVLWPQLYLKAKAANAQASVYGCKVL 84

Query: 118 MNQGLMEHHNDIAEYF----KKKGVSAIFLIRRNLLRRMIS-ILANSYD--RNAKLLNGT 170
             Q    H      +F     + G   ++L R NLL   IS I A ++     ++L+ G 
Sbjct: 85  SYQLRNIHSIQQGSHFLYELSQAGFHILYLRRENLLDHAISNIRARNFGFHHKSRLVQGF 144

Query: 171 HKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN 230
            K+ +H              +N   ++  +++ +Q     +   +   ++   YE  + N
Sbjct: 145 QKNKIH--------------VNPVTVVDWIEKSQQLWQYELSLLRDIPYLELTYEKNLAN 190

Query: 231 RA----TLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
            +    T+  + ++L +  R   S+  K+    L   VEN+D++   L+ T Y  +L
Sbjct: 191 ESQHQFTIDTICQYLGIQSRAAVSQYQKVSPQTLQASVENYDELISYLRATPYHHYL 247


>gi|414869371|tpg|DAA47928.1| TPA: hypothetical protein ZEAMMB73_204795 [Zea mays]
          Length = 110

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 21  DQPCQSTNIQQSEIPFLHYPNPVTFNR 47
           ++PC   +I  +EIP+LHYP P T++R
Sbjct: 84  EKPCHDPSIPDTEIPYLHYPMPNTYDR 110


>gi|312897719|ref|ZP_07757135.1| virulence-associated protein E [Megasphaera micronuciformis F0359]
 gi|310621103|gb|EFQ04647.1| virulence-associated protein E [Megasphaera micronuciformis F0359]
          Length = 826

 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 44/200 (22%), Positives = 86/200 (43%), Gaps = 20/200 (10%)

Query: 85  SSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLI 144
           S+++ TM + +  D  ++    E +  +  KW++  G M  +    E   K+     FL 
Sbjct: 557 STLIRTMGKKWYADGLSTFEGKEAAENIQGKWIIEAGEMAGYTRAEENASKQ-----FLS 611

Query: 145 RR-NLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAK------ILAKYKPTINATLLI 197
           R+ ++ R+        Y R       +++                 +   KPT N  + +
Sbjct: 612 RQVDVFRQAYGRRTQEYPRRCVFFGSSNQYEFLKDITGNRRFWPVDIEAQKPTKNVYVNL 671

Query: 198 PELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHS 257
           P   +++Q  A+A+  +K+   ++   ED   N A LK + E  R A+ E  S+Q  I+ 
Sbjct: 672 P--GEVDQIWAEAVVRYKNGESLII--ED---NEAALK-IAETAREAHMESNSKQGLINE 723

Query: 258 GPLSKQVENWDDVQKALKGT 277
             L K  +NW+ + ++ + T
Sbjct: 724 FLLQKVPKNWNTMSRSARRT 743


>gi|354556386|ref|ZP_08975681.1| hypothetical protein Cy51472DRAFT_4478 [Cyanothece sp. ATCC 51472]
 gi|353551598|gb|EHC20999.1| hypothetical protein Cy51472DRAFT_4478 [Cyanothece sp. ATCC 51472]
          Length = 273

 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 33/50 (66%)

Query: 235 KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH 284
           +++ +FL ++   L S+  KI S  L   ++N+D+V + ++GT YE+FLH
Sbjct: 223 QKIFDFLGVSKLNLVSQHKKILSQNLVDILKNYDEVVEIIQGTKYEKFLH 272


>gi|254444117|ref|ZP_05057593.1| hypothetical protein VDG1235_2356 [Verrucomicrobiae bacterium
           DG1235]
 gi|198258425|gb|EDY82733.1| hypothetical protein VDG1235_2356 [Verrucomicrobiae bacterium
           DG1235]
          Length = 192

 Score = 38.1 bits (87), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 4/68 (5%)

Query: 220 IVFYYEDLITN----RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
           +   YED I +    R  +  V +F+ + Y E      KI    L   +EN+++++ AL 
Sbjct: 111 LTLTYEDHIASEDKQRMMMPLVCDFIGIPYEEGNCAYKKISPRSLRDSIENYEEIETALA 170

Query: 276 GTSYERFL 283
           GT YE FL
Sbjct: 171 GTKYEVFL 178


>gi|172038036|ref|YP_001804537.1| hypothetical protein cce_3123 [Cyanothece sp. ATCC 51142]
 gi|171699490|gb|ACB52471.1| unknown [Cyanothece sp. ATCC 51142]
          Length = 282

 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 33/51 (64%)

Query: 235 KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
           +++ +FL ++   L S+  KI S  L   ++N+D+V + ++GT YE+FLH 
Sbjct: 232 QKIFDFLGVSKLNLVSQHKKILSQNLVDILKNYDEVVEIIQGTKYEKFLHD 282


>gi|254425995|ref|ZP_05039712.1| hypothetical protein S7335_563 [Synechococcus sp. PCC 7335]
 gi|196188418|gb|EDX83383.1| hypothetical protein S7335_563 [Synechococcus sp. PCC 7335]
          Length = 266

 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 219 HIVFYYEDL---ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
           H V Y  DL    +++ T+  + E+L LA R +T++  K++S PL + + N+D+    L 
Sbjct: 199 HEVIYERDLENAQSHQQTVDRILEYLSLAPRVVTTKHKKVNSRPLEETISNYDEFIDCLD 258

Query: 276 GTSYERFL 283
              ++ FL
Sbjct: 259 KHGWQSFL 266


>gi|93115452|gb|ABE98414.1| glycosyltransferase family 2 [Escherichia coli]
 gi|203285026|gb|ACH97144.1| WclW [Escherichia coli]
          Length = 266

 Score = 37.4 bits (85), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 19/137 (13%)

Query: 59  AILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFK--- 115
           A+L   R+G  W   L +   RR+  S+I+      Y  DWF     +E S   G +   
Sbjct: 118 ALLFEYRTGM-WM--LPSFTMRRMLKSNIIFCSAMYYREDWFNVGGYDE-SLRTGLEDWD 173

Query: 116 -WMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSH 174
            W+   GL  +HNDI     K      + IRRN + R I+     Y    K++N  HK H
Sbjct: 174 FWLSLIGLYNNHNDIVARVNKP--LFFYRIRRNSMLRSIN-----YTEKEKIVNYIHKKH 226

Query: 175 VHSPAEAKILAKYKPTI 191
           +        L K KP I
Sbjct: 227 IQLFE----LYKVKPLI 239


>gi|203285014|gb|ACH97133.1| WclW [Escherichia coli]
          Length = 266

 Score = 37.4 bits (85), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 19/137 (13%)

Query: 59  AILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFK--- 115
           A+L   R+G  W   L +   RR+  S+I+      Y  DWF     +E S   G +   
Sbjct: 118 ALLFEYRTGM-WM--LPSFTMRRMLKSNIIFCSAMYYREDWFNVGGYDE-SLRTGLEDWD 173

Query: 116 -WMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSH 174
            W+   GL  +HNDI     K      + IRRN + R I+     Y    K++N  HK H
Sbjct: 174 FWLSLIGLYNNHNDIVARVNKP--LFFYRIRRNSMLRSIN-----YTEKEKIVNYIHKKH 226

Query: 175 VHSPAEAKILAKYKPTI 191
           +        L K KP I
Sbjct: 227 IQLFE----LYKVKPLI 239


>gi|443327654|ref|ZP_21056274.1| tetratricopeptide repeat protein [Xenococcus sp. PCC 7305]
 gi|442792746|gb|ELS02213.1| tetratricopeptide repeat protein [Xenococcus sp. PCC 7305]
          Length = 511

 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 45/176 (25%), Positives = 79/176 (44%), Gaps = 10/176 (5%)

Query: 111 AVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
           AVGFK    Q L    + I +Y KK K +  I L R+N+L   +S    +     KL+  
Sbjct: 341 AVGFKLFYFQ-LKNRQHVIWQYLKKLKNLKIIHLTRKNMLHTYVSHKLATISNQWKLIKE 399

Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLI- 228
              S   S +E  I+  Y+  +    L+    +  +T         + ++    YEDL+ 
Sbjct: 400 EKNSSRFSLSEP-IVIDYEECLATFKLV----RYWETQYGNFFDLSTQQYCNVNYEDLVG 454

Query: 229 -TNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
            T + T K +QEF+ +  + LT   +K    P+ + V N+D +++  + + +  F 
Sbjct: 455 KTEQVTNK-LQEFIGVDRQPLTFSTIKQGLTPMKEMVANYDQLKRKFQNSCWVEFF 509


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.132    0.391 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,147,468,170
Number of Sequences: 23463169
Number of extensions: 154873983
Number of successful extensions: 379381
Number of sequences better than 100.0: 146
Number of HSP's better than 100.0 without gapping: 108
Number of HSP's successfully gapped in prelim test: 38
Number of HSP's that attempted gapping in prelim test: 379125
Number of HSP's gapped (non-prelim): 160
length of query: 289
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 148
effective length of database: 9,050,888,538
effective search space: 1339531503624
effective search space used: 1339531503624
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)