BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 035616
(289 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|302142361|emb|CBI19564.3| unnamed protein product [Vitis vinifera]
Length = 311
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 220/302 (72%), Positives = 249/302 (82%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQTS +TK K L +VI+Q S I+ EIP++HYP P T +REEC CN VRYFAI
Sbjct: 10 ICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAI 69
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN I +RR NISSI+STMD+VYNLDWF+SASKNE
Sbjct: 70 LSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWFSSASKNE 129
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C AAVGFKWM+NQGLMEHH DI +YFK KGVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 130 CLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 189
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA+ILAKYKPT+NATLL+P+LK+ E+T KA+EYFKSTRHIV YYEDL
Sbjct: 190 NGTHKSHVHSTEEAQILAKYKPTVNATLLLPDLKEAEETITKALEYFKSTRHIVLYYEDL 249
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR LK+VQEFLRL YR+L+SRQVKIHSG L KQV NWDDVQKALKGTSYE LH+DY
Sbjct: 250 INNRTKLKDVQEFLRLPYRDLSSRQVKIHSGNLPKQVGNWDDVQKALKGTSYEGMLHADY 309
Query: 288 RV 289
++
Sbjct: 310 QM 311
>gi|225458537|ref|XP_002284367.1| PREDICTED: uncharacterized protein LOC100252224 [Vitis vinifera]
Length = 344
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 220/302 (72%), Positives = 249/302 (82%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQTS +TK K L +VI+Q S I+ EIP++HYP P T +REEC CN VRYFAI
Sbjct: 43 ICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN I +RR NISSI+STMD+VYNLDWF+SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWFSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C AAVGFKWM+NQGLMEHH DI +YFK KGVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 163 CLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA+ILAKYKPT+NATLL+P+LK+ E+T KA+EYFKSTRHIV YYEDL
Sbjct: 223 NGTHKSHVHSTEEAQILAKYKPTVNATLLLPDLKEAEETITKALEYFKSTRHIVLYYEDL 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR LK+VQEFLRL YR+L+SRQVKIHSG L KQV NWDDVQKALKGTSYE LH+DY
Sbjct: 283 INNRTKLKDVQEFLRLPYRDLSSRQVKIHSGNLPKQVGNWDDVQKALKGTSYEGMLHADY 342
Query: 288 RV 289
++
Sbjct: 343 QM 344
>gi|147792848|emb|CAN68800.1| hypothetical protein VITISV_008807 [Vitis vinifera]
Length = 647
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 219/306 (71%), Positives = 248/306 (81%), Gaps = 17/306 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQTS +TK K L +VI+Q S I+ EIP++HYP P T +REEC CN VRYFAI
Sbjct: 342 ICLKQTSNNTKAKFLKFQVINQASPSFGIEPWEIPYVHYPKPKTHSREECVCNGVRYFAI 401
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN I +RR NISSI+STMD+VYNLDW +SASKNE
Sbjct: 402 LSMQRSGSGWFETLLNSHINVSSNGEIFSIGNRRNNISSILSTMDKVYNLDWXSSASKNE 461
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C AAVGFKWM+NQGLMEHH DI +YFK KGVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 462 CLAAVGFKWMLNQGLMEHHKDIVKYFKNKGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 521
Query: 168 NGTHKSHVHSPAE----AKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFY 223
NGTHKSHVHS E A+ILAKYKPT+NATLL+P+LK+ E+T KA+EYFKSTRHIV Y
Sbjct: 522 NGTHKSHVHSTEEILFQAQILAKYKPTVNATLLLPDLKEAEETITKALEYFKSTRHIVLY 581
Query: 224 YEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
YEDLI NR LK+VQEFLRL YR+L+SRQVKIHSG L KQV NWDDVQKALKGTSYE L
Sbjct: 582 YEDLINNRTKLKDVQEFLRLPYRDLSSRQVKIHSGNLPKQVGNWDDVQKALKGTSYEGML 641
Query: 284 HSDYRV 289
H+DY++
Sbjct: 642 HADYQM 647
>gi|224137390|ref|XP_002327114.1| predicted protein [Populus trichocarpa]
gi|222835429|gb|EEE73864.1| predicted protein [Populus trichocarpa]
Length = 344
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 205/301 (68%), Positives = 243/301 (80%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ SPHT K LNI++ DQPC S+N+++ E P++HYP P TF+REECACNPVR+FAI
Sbjct: 43 ICIKQISPHTTAKFLNIRIFDQPCNSSNVEEWEKPYVHYPKPETFSREECACNPVRFFAI 102
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
SMQRSGSGWFETLLN + RR ++S+I T+DRVYNLDWF+SASKNE
Sbjct: 103 FSMQRSGSGWFETLLNSHINVSSNGEIFGKRARRASVSAITQTLDRVYNLDWFSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVGFKWM+NQG+MEHH IAEYFK+ GV AIFL RRNLLRRMIS+LANSYD++ K L
Sbjct: 163 CNAAVGFKWMLNQGVMEHHEGIAEYFKQNGVHAIFLFRRNLLRRMISVLANSYDKSNKPL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA++LAKY+PTINAT LI ELK ++ +AI+YFKSTRH V YYED+
Sbjct: 223 NGTHKSHVHSSMEAEVLAKYRPTINATTLIAELKHVDDRATRAIDYFKSTRHTVVYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR LKEVQ+FLRL YRELTSRQVKIHSG LSKQV+NWD++ K LKGT YE FLH DY
Sbjct: 283 VGNRTKLKEVQDFLRLPYRELTSRQVKIHSGHLSKQVQNWDEIHKVLKGTQYESFLHLDY 342
Query: 288 R 288
+
Sbjct: 343 Q 343
>gi|449438424|ref|XP_004136988.1| PREDICTED: nodulation protein H-like isoform 2 [Cucumis sativus]
Length = 346
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 203/301 (67%), Positives = 243/301 (80%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S +K + ++VID PC I S+ PF+H+P P T++R ECAC+PVRYFAI
Sbjct: 45 VCLKQISTRSKVGFMKVEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAI 104
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR NIS+IV T+D+VYNLDWFTSASKNE
Sbjct: 105 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNE 164
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGLM+HH +I EYFK++GVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 165 CTAAVGLKWMLNQGLMQHHEEIVEYFKRRGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 224
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA ILAKYKP INATLLIP LKQ+E TT KA+EYF STRHIV YYED+
Sbjct: 225 NGTHKSHVHSHHEADILAKYKPVINATLLIPNLKQVEDTTTKALEYFNSTRHIVMYYEDV 284
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L++VQ+FL++ R+L SRQVKIH GPLS Q++NW+DVQKAL+G+ YE FL++DY
Sbjct: 285 VKNRTKLRDVQDFLKVPQRKLKSRQVKIHRGPLSNQIDNWEDVQKALQGSRYETFLNADY 344
Query: 288 R 288
R
Sbjct: 345 R 345
>gi|449529180|ref|XP_004171579.1| PREDICTED: nodulation protein H-like [Cucumis sativus]
Length = 344
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 203/301 (67%), Positives = 243/301 (80%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S +K + ++VID PC I S+ PF+H+P P T++R ECAC+PVRYFAI
Sbjct: 43 VCLKQISTRSKVGFMKVEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR NIS+IV T+D+VYNLDWFTSASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGLM+HH +I EYFK++GVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 163 CTAAVGLKWMLNQGLMQHHEEIVEYFKRRGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA ILAKYKP INATLLIP LKQ+E TT KA+EYF STRHIV YYED+
Sbjct: 223 NGTHKSHVHSHHEADILAKYKPVINATLLIPNLKQVEDTTTKALEYFNSTRHIVMYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L++VQ+FL++ R+L SRQVKIH GPLS Q++NW+DVQKAL+G+ YE FL++DY
Sbjct: 283 VKNRTKLRDVQDFLKVPQRKLKSRQVKIHRGPLSNQIDNWEDVQKALQGSRYETFLNADY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|449438422|ref|XP_004136987.1| PREDICTED: nodulation protein H-like isoform 1 [Cucumis sativus]
Length = 371
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 203/301 (67%), Positives = 243/301 (80%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S +K + ++VID PC I S+ PF+H+P P T++R ECAC+PVRYFAI
Sbjct: 70 VCLKQISTRSKVGFMKVEVIDMPCSKPTIDPSDAPFVHFPKPTTYSRAECACHPVRYFAI 129
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR NIS+IV T+D+VYNLDWFTSASKNE
Sbjct: 130 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNISTIVETLDKVYNLDWFTSASKNE 189
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGLM+HH +I EYFK++GVSAIFL RRNLLRRMIS+LANSYDR+AKLL
Sbjct: 190 CTAAVGLKWMLNQGLMQHHEEIVEYFKRRGVSAIFLFRRNLLRRMISVLANSYDRDAKLL 249
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA ILAKYKP INATLLIP LKQ+E TT KA+EYF STRHIV YYED+
Sbjct: 250 NGTHKSHVHSHHEADILAKYKPVINATLLIPNLKQVEDTTTKALEYFNSTRHIVMYYEDV 309
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L++VQ+FL++ R+L SRQVKIH GPLS Q++NW+DVQKAL+G+ YE FL++DY
Sbjct: 310 VKNRTKLRDVQDFLKVPQRKLKSRQVKIHRGPLSNQIDNWEDVQKALQGSRYETFLNADY 369
Query: 288 R 288
R
Sbjct: 370 R 370
>gi|356521486|ref|XP_003529386.1| PREDICTED: nodulation protein H-like [Glycine max]
Length = 344
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 195/301 (64%), Positives = 237/301 (78%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ +K L+IKV+ +PC NI+ EIPF+HYP+P T++R ECAC+PVRYFAI
Sbjct: 43 ICLKQIGTSSKIGFLDIKVVQKPCPEPNIEPWEIPFVHYPHPKTYSRAECACHPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFET LN +K RR N+S+I T+D +YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHANISSNGEIFSVKVRRSNMSTITETLDTIYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+ AVG KWM+NQGLM+HH +IAEYF+ GVS IFL RRNLLRRM+S+LAN YDRNAKLL
Sbjct: 163 CTTAVGLKWMLNQGLMQHHEEIAEYFRIHGVSLIFLFRRNLLRRMVSVLANEYDRNAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILAKYKPTIN+TLLI +LKQ+ +TT KA+EYFKSTRHI+ YYED+
Sbjct: 223 NGTHKSHVHSPKEAEILAKYKPTINSTLLIAQLKQVNETTTKALEYFKSTRHIILYYEDI 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L++VQ+FL++ +L SRQVKIH G LS QVENW+D+ KAL GT YE F+H DY
Sbjct: 283 VKNRTKLRDVQDFLKVPQMDLKSRQVKIHKGSLSSQVENWNDISKALTGTPYESFIHEDY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|356500053|ref|XP_003518849.1| PREDICTED: nodulation protein H-like [Glycine max]
Length = 344
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 195/301 (64%), Positives = 236/301 (78%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ +K L+IKV+ +PC NI+ EIP++HYPNP T++R ECAC+PVRYFAI
Sbjct: 43 ICLKQIGTSSKIGFLDIKVVQKPCPEPNIEPWEIPYVHYPNPKTYSRAECACHPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFET LN +K RR N+S+I T+D +YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHANISSNGEIFSVKVRRSNMSTITETLDTIYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+ AVG KWM+NQGLM+HH IAEYF+ GVS IFL RRNLLRRM+S+LAN YDRNAK+L
Sbjct: 163 CTTAVGLKWMLNQGLMQHHEQIAEYFRIHGVSLIFLFRRNLLRRMVSVLANEYDRNAKIL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILAKYKPTIN+TLLI +LKQ+ +TT KA+EYFKSTRHI+ YYED+
Sbjct: 223 NGTHKSHVHSPQEAEILAKYKPTINSTLLIAQLKQVNETTTKALEYFKSTRHILLYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L++VQ+FLR+ +L SRQVKIH G LS QVENW+D+ KAL GT YE F+H DY
Sbjct: 283 VKNRTKLRDVQDFLRVPQMKLKSRQVKIHKGSLSSQVENWNDISKALTGTPYESFIHEDY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|356553925|ref|XP_003545301.1| PREDICTED: uncharacterized protein LOC100820591 [Glycine max]
Length = 1057
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 196/288 (68%), Positives = 234/288 (81%), Gaps = 13/288 (4%)
Query: 12 TKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWF 71
+KLL ++VI+Q CQ +N+++ E+PFLHYP P T+NREECACNPVR+F IL+MQRSGSGWF
Sbjct: 770 SKLLELRVINQSCQLSNVEEWEVPFLHYPQPKTYNREECACNPVRFFTILTMQRSGSGWF 829
Query: 72 ETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMM 118
ETLLN + RR N+SSI+ TMD V+NLDWF+ ASKNECSAAVG+KWM+
Sbjct: 830 ETLLNSHMNVSSNGEIFGVVKRRENVSSILKTMDVVFNLDWFSGASKNECSAAVGYKWML 889
Query: 119 NQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSP 178
NQGLMEHH +I EYF+++ VS IFL RRNLLRRM+S+LANSYD+NAK LNGTHKSHVHS
Sbjct: 890 NQGLMEHHKEIVEYFERRKVSTIFLFRRNLLRRMVSVLANSYDKNAKPLNGTHKSHVHST 949
Query: 179 AEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQ 238
EA ILAKY+P IN TLL+ ELKQ E T AKAIEYF +TRHIV YYEDLI N LK+VQ
Sbjct: 950 LEAGILAKYRPWINTTLLMTELKQTEDTAAKAIEYFNNTRHIVLYYEDLIKNSTKLKDVQ 1009
Query: 239 EFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
EFLRL YR++ SRQVKIH+ PL KQ+ENWD+VQK L+GT+Y+ FL SD
Sbjct: 1010 EFLRLPYRDMHSRQVKIHTAPLWKQIENWDEVQKTLRGTAYQNFLFSD 1057
>gi|224105519|ref|XP_002313841.1| predicted protein [Populus trichocarpa]
gi|222850249|gb|EEE87796.1| predicted protein [Populus trichocarpa]
Length = 345
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 197/301 (65%), Positives = 234/301 (77%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ T LN++VI++PC NI+ EIP++HYP P+T++R EC CNPVRYFAI
Sbjct: 44 ICLKQIGIRTNPGFLNVEVIERPCPEPNIEPWEIPYVHYPKPITYSRVECKCNPVRYFAI 103
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+S+I T+D++YNLDW +SASKNE
Sbjct: 104 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNVSTITETLDKIYNLDWLSSASKNE 163
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQG+M+HH +I EYFK +GVSAIFL RRNLLRRM+SILANSYDR K L
Sbjct: 164 CAAAVGLKWMLNQGVMQHHEEIVEYFKTRGVSAIFLFRRNLLRRMVSILANSYDREVKPL 223
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILAKYKP IN TLLI LKQ+E TTAKA+EYFKSTRHI+ YYED+
Sbjct: 224 NGTHKSHVHSPREAEILAKYKPLINTTLLISNLKQVEDTTAKALEYFKSTRHIILYYEDV 283
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ N L +VQ+FL++ REL SRQVKIH G LS VENWD+VQK+LKGT YE L DY
Sbjct: 284 VKNHTKLLDVQDFLKVPQRELKSRQVKIHKGSLSNYVENWDEVQKSLKGTHYENLLTGDY 343
Query: 288 R 288
R
Sbjct: 344 R 344
>gi|255555521|ref|XP_002518797.1| conserved hypothetical protein [Ricinus communis]
gi|223542178|gb|EEF43722.1| conserved hypothetical protein [Ricinus communis]
Length = 368
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 194/301 (64%), Positives = 235/301 (78%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S T + ++ VI++PC NIQ EIP++HYP P T++R EC CNPVR FAI
Sbjct: 43 ICLKQISISTNPGIFSLGVIEKPCPEPNIQPWEIPYVHYPKPKTYSRAECMCNPVRNFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+S IV T+D++YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNVSMIVETLDKIYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQG+M+HH +I EYFK +GVSAIFL RRNLLRRMIS+LANSYDR AKLL
Sbjct: 163 CTAAVGLKWMLNQGVMQHHEEIVEYFKSRGVSAIFLFRRNLLRRMISVLANSYDREAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILA+YKP++N +LL+ LKQ+EQTT KA+EYFKS RHI+ YYED+
Sbjct: 223 NGTHKSHVHSPDEAEILARYKPSVNTSLLLSSLKQVEQTTIKALEYFKSNRHIIVYYEDI 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ N L EVQ+FLR+ +EL SRQVKIH G +S Q ENW ++QKALKGT YE FL+ DY
Sbjct: 283 VKNHTKLLEVQDFLRVPRKELKSRQVKIHKGSISSQAENWGEIQKALKGTHYESFLYGDY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|356564206|ref|XP_003550347.1| PREDICTED: uncharacterized protein LOC100819406 [Glycine max]
Length = 330
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 195/296 (65%), Positives = 234/296 (79%), Gaps = 13/296 (4%)
Query: 4 KQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSM 63
K T T +KLL ++VI+Q C +++++ E+PFLHYP P T+NREECACNPVR+F IL+M
Sbjct: 35 KPTRIRTNSKLLELRVINQSCHPSSVEEWEVPFLHYPQPKTYNREECACNPVRFFVILTM 94
Query: 64 QRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSA 110
QRSGSGWFETLLN + RR N+SSI+ TMD V+NLDWF+ ASKNECSA
Sbjct: 95 QRSGSGWFETLLNSHMNVSSNGEIFSVAKRRENVSSILMTMDEVFNLDWFSGASKNECSA 154
Query: 111 AVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGT 170
AVG+KWM+NQGLMEHH +I EYF+++ VS IFL RRNLLRRM+S+L NSYD+ AK LNGT
Sbjct: 155 AVGYKWMLNQGLMEHHKEIGEYFERRRVSTIFLFRRNLLRRMVSVLENSYDKKAKPLNGT 214
Query: 171 HKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN 230
HKSHVHS EA ILAKY+P IN TLL+ EL Q E+T AKAIEYFK+TRHIV YYEDLI N
Sbjct: 215 HKSHVHSTLEAGILAKYRPWINTTLLMTELNQTEETAAKAIEYFKNTRHIVLYYEDLIKN 274
Query: 231 RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
LK+VQEFLRL +R++ SRQVKIH+ PL KQ+ENWDDV K L+GTSY+ FL SD
Sbjct: 275 ATKLKDVQEFLRLPFRDMHSRQVKIHTAPLLKQIENWDDVYKTLRGTSYQNFLFSD 330
>gi|225428745|ref|XP_002285039.1| PREDICTED: nodulation protein H [Vitis vinifera]
gi|297741312|emb|CBI32443.3| unnamed protein product [Vitis vinifera]
Length = 344
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 191/302 (63%), Positives = 237/302 (78%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S LN+++ ++PC+ NI+ SE+ ++H+P P++++R ECACNPVRYFAI
Sbjct: 43 ICLKQISTTATAGFLNVELTERPCEKPNIEPSEVRYVHFPKPISYSRAECACNPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LS QRSGSGWFETLLN +K RR NISSI T+D++YNLDW +SASKNE
Sbjct: 103 LSTQRSGSGWFETLLNSHINISSNGEIFSVKVRRSNISSITETLDKIYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGLM++H +I EYF +GVS IFL RRNLLRRMIS+LANSYDR AK L
Sbjct: 163 CTAAVGLKWMLNQGLMQNHKEIVEYFNTRGVSLIFLFRRNLLRRMISMLANSYDREAKQL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILA+YKP+INATLLI +LKQ+E T AKA++YF STRH +FYYED+
Sbjct: 223 NGTHKSHVHSPREAEILAQYKPSINATLLIRDLKQVEDTVAKALQYFNSTRHTIFYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR L +V FL++ R+L SRQVKIH G LS+QVENWD+V K LKGT YE FLH+DY
Sbjct: 283 IKNRTKLVDVLNFLKVPQRDLRSRQVKIHKGSLSEQVENWDEVLKTLKGTQYESFLHADY 342
Query: 288 RV 289
++
Sbjct: 343 QI 344
>gi|224094390|ref|XP_002310152.1| predicted protein [Populus trichocarpa]
gi|222853055|gb|EEE90602.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 416 bits (1068), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/300 (65%), Positives = 234/300 (78%), Gaps = 13/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S H+K K+ +I+VI++P + + +I +HYPNP TF+R ECA NPVRYFAI
Sbjct: 43 VCLKQISTHSKIKIQDIQVIERPSPDVDHENLQISSVHYPNPETFSRAECAHNPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN + RRINISSI T+D+VYNLDWFTSASKNE
Sbjct: 103 LSMQRSGSGWFETLLNSHVNVSSNGEIFSVLDRRINISSITQTLDKVYNLDWFTSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAAVGFKWM+NQG+M+HH +IA+YF ++GVSAIFL RRNLLRRM+S+LANSYDR+AKLL
Sbjct: 163 CSAAVGFKWMLNQGVMQHHKEIADYFNRRGVSAIFLFRRNLLRRMVSVLANSYDRHAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA+ LAKYKP IN+TLLI +LK++E T KA+EYF STRH V YYEDL
Sbjct: 223 NGTHKSHVHSTEEAETLAKYKPMINSTLLISDLKEVEITATKALEYFNSTRHTVLYYEDL 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I N LK+VQ FL L EL SRQVKIH GPLS V+NW+D+ K L GT+YE FL +DY
Sbjct: 283 IKNPTKLKDVQAFLGLPVMELMSRQVKIHKGPLSDHVKNWEDINKTLNGTAYESFLQADY 342
>gi|224123470|ref|XP_002330322.1| predicted protein [Populus trichocarpa]
gi|222871357|gb|EEF08488.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 416 bits (1068), Expect = e-114, Method: Compositional matrix adjust.
Identities = 195/300 (65%), Positives = 230/300 (76%), Gaps = 13/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ T LN++VI++PC NI+ EIP++HYP P T++R EC CNPVRYFAI
Sbjct: 34 ICLKQIGIRTNPGFLNVEVIERPCPEPNIEPWEIPYVHYPRPKTYSRAECKCNPVRYFAI 93
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+S+I T+D++YNLDW +SASKNE
Sbjct: 94 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKVRRSNVSTITETLDKIYNLDWSSSASKNE 153
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQG+M HH +I EYFK +GVSAIFL RRNLLRRMISILANSYDR+ K L
Sbjct: 154 CTAAVGLKWMLNQGVMRHHEEIVEYFKTRGVSAIFLFRRNLLRRMISILANSYDRDVKPL 213
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILAKYK IN TLLIP LKQ+E TTAKA+EYFKS RHI+ YYED+
Sbjct: 214 NGTHKSHVHSPREAEILAKYKSLINTTLLIPSLKQVEDTTAKALEYFKSARHIILYYEDV 273
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L +VQ+FL++ REL S QVKIH G L VENWD+VQK+LKGT YE L DY
Sbjct: 274 VKNRTKLLDVQDFLKVPRRELKSHQVKIHKGSLLNYVENWDEVQKSLKGTHYESLLSGDY 333
>gi|79556423|ref|NP_179175.3| nodulation protein-like protein [Arabidopsis thaliana]
gi|40823013|gb|AAR92253.1| At2g15730 [Arabidopsis thaliana]
gi|45752690|gb|AAS76243.1| At2g15730 [Arabidopsis thaliana]
gi|330251339|gb|AEC06433.1| nodulation protein-like protein [Arabidopsis thaliana]
Length = 344
Score = 416 bits (1068), Expect = e-114, Method: Compositional matrix adjust.
Identities = 189/301 (62%), Positives = 236/301 (78%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ LN++V ++PC NIQ +IP++HYP P T++REEC+CNPVRYFAI
Sbjct: 43 ICLKQIGVVPSAGFLNVEVFERPCPEPNIQPWDIPYVHYPKPKTYSREECSCNPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+S+I T+D+VYNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C++AVG KWM+NQGLM++H +I EYFK +GVSAIFL RRNLLRRMIS+LANSYDR+AK L
Sbjct: 163 CTSAVGLKWMLNQGLMKNHEEIVEYFKTRGVSAIFLFRRNLLRRMISVLANSYDRDAKPL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILA+YKP IN +LLIP+LKQ+++ T+KA+ YF +TRHI YYED+
Sbjct: 223 NGTHKSHVHSPKEAEILARYKPLINTSLLIPDLKQVQEMTSKALAYFNTTRHIFLYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L +VQEFL++ +L SRQVKIH GPLS+ V+NW++VQK LKGT +E FL DY
Sbjct: 283 VKNRTKLDDVQEFLKVPKLDLKSRQVKIHHGPLSQHVQNWEEVQKTLKGTGFENFLLEDY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|356509720|ref|XP_003523594.1| PREDICTED: uncharacterized protein LOC100808615 [Glycine max]
Length = 336
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/300 (64%), Positives = 234/300 (78%), Gaps = 14/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPC-QSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFA 59
+C +Q HT KL +I V QPC + +++ E+ +LHYP P ++REECACNPV +F
Sbjct: 37 ICLEQKGVHTSAKLFDITVFKQPCFNHSGVEEWELSYLHYPEPKIYSREECACNPVLFFC 96
Query: 60 ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
I+SMQRSGSGWFET LN + RR N+SSI+ TMDRVYNLDWF+SASKN
Sbjct: 97 IVSMQRSGSGWFETFLNSHINVSSNGEIFSVGKRRANVSSILETMDRVYNLDWFSSASKN 156
Query: 107 ECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKL 166
EC+AAVGFKWM+NQGL+EHH +I EYF+++ VS IFL RRNLLRRMIS+LANSYD++AK
Sbjct: 157 ECTAAVGFKWMLNQGLVEHHEEIVEYFERRRVSVIFLFRRNLLRRMISVLANSYDKDAKP 216
Query: 167 LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED 226
LNGTHKSHVHS EA ILAKY+P +N TLLIPEL Q ++T AKAI YFK+TRHIV YYED
Sbjct: 217 LNGTHKSHVHSAVEANILAKYRPRLNITLLIPELNQTKETAAKAIAYFKNTRHIVLYYED 276
Query: 227 LITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
L+ NR LK+VQEFLR+ YR+L SRQVKIH+ LSKQ+ENW++VQ+AL GT Y+RFL D
Sbjct: 277 LVNNRTKLKDVQEFLRVPYRDLESRQVKIHTAALSKQIENWNEVQEALTGTPYQRFLFKD 336
>gi|357465451|ref|XP_003603010.1| hypothetical protein MTR_3g101380 [Medicago truncatula]
gi|355492058|gb|AES73261.1| hypothetical protein MTR_3g101380 [Medicago truncatula]
Length = 388
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/303 (64%), Positives = 233/303 (76%), Gaps = 17/303 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPC-QSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFA 59
+C + T +K L + V QPC +N+Q+ EIP+LHYP P T++REEC+CNPVRYF
Sbjct: 86 ICFELKGVRTYSKFLGVSVFTQPCPHPSNVQEWEIPYLHYPEPKTYSREECSCNPVRYFC 145
Query: 60 ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
I+SMQRSGSGWFET LN ++ RR N+S I+ T++ VYNLDWFTSASKN
Sbjct: 146 IVSMQRSGSGWFETFLNSHINVSSNGEIFSVEKRRENVSLILKTINEVYNLDWFTSASKN 205
Query: 107 ECSAAVGFKWMMNQ---GLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRN 163
ECSAA+G+KWM+NQ GLM+HH +I EY ++K VS IFL RRNLLRRM+S+LANSYD++
Sbjct: 206 ECSAAIGYKWMLNQVGRGLMDHHKEIVEYLEQKQVSTIFLFRRNLLRRMVSVLANSYDKD 265
Query: 164 AKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFY 223
KLLNGTHKSHVHS EA ILAKYKP IN TLLIPELK+ +T AKAI YFKS RHIV Y
Sbjct: 266 VKLLNGTHKSHVHSTMEATILAKYKPKINVTLLIPELKKTNETIAKAISYFKSIRHIVLY 325
Query: 224 YEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
YEDL+ NR LK+VQEFL++ YREL SRQVKIH+ PLS Q+ENWD+VQK L GTSYE FL
Sbjct: 326 YEDLVNNRTKLKDVQEFLKVPYRELESRQVKIHTAPLSMQIENWDEVQKTLTGTSYESFL 385
Query: 284 HSD 286
+ D
Sbjct: 386 YKD 388
>gi|357475463|ref|XP_003608017.1| Nodulation protein H [Medicago truncatula]
gi|355509072|gb|AES90214.1| Nodulation protein H [Medicago truncatula]
Length = 344
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 193/301 (64%), Positives = 229/301 (76%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S ++ L+I VI +PC NI+ EIP++HYPNP T++REEC C+PVRYF I
Sbjct: 43 ICLKQISTGSRIGFLDINVIQKPCPEPNIEPWEIPYVHYPNPKTYSREECRCHPVRYFTI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFET LN +K RR NI++I T+D +YNLDWF SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHPNISSNGEIFSVKVRRSNITTITETLDTIYNLDWFNSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGLM+HH IAEYF+ GVS IFL R+NLLRRMIS+LAN YD+NAK+L
Sbjct: 163 CTAAVGLKWMLNQGLMQHHEQIAEYFRIHGVSVIFLFRKNLLRRMISVLANEYDKNAKIL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHSP EA+ILAKYKPT+N+TLLI LKQ+ TT KA+E FKSTRHI+ YYED+
Sbjct: 223 NGTHKSHVHSPKEAEILAKYKPTLNSTLLIKNLKQVNDTTTKALESFKSTRHIMLYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L +V EFL++ L SRQVKIH G LS QVENW+DV K L GT YE FLH DY
Sbjct: 283 VKNRTKLMDVLEFLKVPQVNLKSRQVKIHKGSLSSQVENWNDVTKTLNGTQYESFLHEDY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|224084324|ref|XP_002307259.1| predicted protein [Populus trichocarpa]
gi|222856708|gb|EEE94255.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 198/300 (66%), Positives = 233/300 (77%), Gaps = 13/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQTS H+K K +I+V+++ + +I +HYPNP TF+R ECA NPVRYFAI
Sbjct: 43 VCLKQTSIHSKIKFQDIQVVERLSPDDDHGNLQISSVHYPNPETFSRAECAHNPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN + RR NISSI T+D+VYNLDWFTSASKNE
Sbjct: 103 LSMQRSGSGWFETLLNSHVNVSSNGEIFSVLDRRRNISSITRTLDKVYNLDWFTSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAAVGFKWM+NQG+M+HH +I++YF ++GV AIFL RRNLLRRM+S+LANSYDR+AKLL
Sbjct: 163 CSAAVGFKWMLNQGVMQHHKEISDYFNRRGVFAIFLFRRNLLRRMVSVLANSYDRHAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EAK LAKYKP IN+TLLI +LK++E T AKA+EYF STRHIV YYEDL
Sbjct: 223 NGTHKSHVHSTEEAKTLAKYKPMINSTLLISDLKEVEITAAKALEYFNSTRHIVLYYEDL 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I N A LK+VQ FL L EL SRQVKIH GPLS V+NW+DV K L GT+YE FL +DY
Sbjct: 283 IKNPAKLKDVQAFLGLPLMELMSRQVKIHKGPLSDHVKNWEDVNKTLNGTAYESFLQADY 342
>gi|297832068|ref|XP_002883916.1| hypothetical protein ARALYDRAFT_480426 [Arabidopsis lyrata subsp.
lyrata]
gi|297329756|gb|EFH60175.1| hypothetical protein ARALYDRAFT_480426 [Arabidopsis lyrata subsp.
lyrata]
Length = 344
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 186/301 (61%), Positives = 234/301 (77%), Gaps = 13/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ LN++V ++ C NI+ +IP++HYP P T++REEC+CNPVRYFAI
Sbjct: 43 ICLKQIGVVPSAGFLNVEVFERQCPEPNIEPWDIPYVHYPKPKTYSREECSCNPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+++I T+D+VYNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVTTIFETLDKVYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C++AVG KWM+NQGLM+HH +I EYFK +GVSAIFL RRNLLRRMIS+LANSYDR+AK L
Sbjct: 163 CTSAVGLKWMLNQGLMKHHEEIVEYFKTRGVSAIFLFRRNLLRRMISVLANSYDRDAKPL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
+GTHKSHVHSP EA+ILA+YKP IN TLLIP+LKQ+++ T+KA+ YF +TRHI YYED+
Sbjct: 223 HGTHKSHVHSPKEAEILARYKPLINTTLLIPDLKQVQEMTSKALAYFNTTRHIFLYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ NR L +VQEFL++ +L SRQVKIH GPLSK V+NW++VQ LKGT ++ FL DY
Sbjct: 283 VKNRTKLDDVQEFLKVPKLDLKSRQVKIHHGPLSKHVQNWEEVQTTLKGTGFKNFLLEDY 342
Query: 288 R 288
R
Sbjct: 343 R 343
>gi|334187156|ref|NP_195168.3| P-loop containing nucleoside triphosphate hydrolase family
protein-like protein [Arabidopsis thaliana]
gi|332660972|gb|AEE86372.1| P-loop containing nucleoside triphosphate hydrolase family
protein-like protein [Arabidopsis thaliana]
Length = 342
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 185/300 (61%), Positives = 232/300 (77%), Gaps = 13/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ LN++V ++PC NI+ +IPF+HYP P T+NR+EC+C+PVRYFAI
Sbjct: 43 ICLKQIGVSPNYGFLNVEVFERPCPEPNIEPWDIPFVHYPKPKTYNRDECSCHPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+S+I T+D+VYNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C++AVGFKWM+NQGLM+HH +I EYFK +GVSAIFL R+NLLRRMIS+LANSYDR+AKLL
Sbjct: 163 CTSAVGFKWMLNQGLMKHHEEIVEYFKTRGVSAIFLFRKNLLRRMISVLANSYDRDAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSH HS EA+ILA YKP IN TLLI EL+QI++ T KA+ YF +TRHI+ YYED+
Sbjct: 223 NGTHKSHTHSAKEAEILAGYKPMINTTLLINELRQIQEMTLKALTYFNTTRHILVYYEDV 282
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ N L +VQEFL++ R+L SRQVKIH G LSK V+NW++V LKGT++E FLH ++
Sbjct: 283 VKNLTRLDDVQEFLKVPKRKLKSRQVKIHGGSLSKHVQNWEEVMTTLKGTNFENFLHQEF 342
>gi|225438662|ref|XP_002281772.1| PREDICTED: uncharacterized protein LOC100254393 isoform 1 [Vitis
vinifera]
Length = 340
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 189/300 (63%), Positives = 232/300 (77%), Gaps = 15/300 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S HTK K N V+++P +++ EIP +HYP P TF+R ECA NPVR+FAI
Sbjct: 43 ICLKQISIHTKAKFENFWVVERPSDDHSMK--EIPNVHYPKPRTFSRAECANNPVRFFAI 100
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S+QRSGSGWFETLLN + RR NISSIV T+D++YNLDWF+SASKNE
Sbjct: 101 VSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISSIVKTLDKIYNLDWFSSASKNE 160
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAAVGFKWM+NQGLM+HH +I +YFK + VS IFL RRNLLRRM+S+LANSYDR A++L
Sbjct: 161 CSAAVGFKWMLNQGLMKHHKEIVDYFKNRDVSTIFLFRRNLLRRMVSVLANSYDREARVL 220
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA +L+ YKPTINAT LI +L+Q+E+T AKA+EYF STRHI+ YYEDL
Sbjct: 221 NGTHKSHVHSTDEADMLSNYKPTINATTLITDLQQMEKTVAKALEYFNSTRHIILYYEDL 280
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR +++ EFL+L + +L SRQVKIH G LS ++NWDDV K LKGTSYE FL +DY
Sbjct: 281 IKNRTKTEDILEFLKLPHMDLASRQVKIHKGELSIHIKNWDDVNKTLKGTSYEHFLQADY 340
>gi|297802526|ref|XP_002869147.1| hypothetical protein ARALYDRAFT_912948 [Arabidopsis lyrata subsp.
lyrata]
gi|297314983|gb|EFH45406.1| hypothetical protein ARALYDRAFT_912948 [Arabidopsis lyrata subsp.
lyrata]
Length = 490
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 183/300 (61%), Positives = 232/300 (77%), Gaps = 13/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ LN++V ++PC NI+ +I F+HYP P T+NR+EC+C+PVRYFAI
Sbjct: 191 ICLKQIGVSPNYGFLNVEVFERPCPEPNIEPWDILFVHYPKPKTYNRDECSCHPVRYFAI 250
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFETLLN +K RR N+S+I T+D+VYNLDW +SASKNE
Sbjct: 251 LSMQRSGSGWFETLLNNHTNISSNGEIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNE 310
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C++AVGFKWM+NQGLM+HH +I EYFK +GVSAIFL R+NLLRRMIS+LANSYDR+AKLL
Sbjct: 311 CTSAVGFKWMLNQGLMKHHEEIVEYFKTRGVSAIFLFRKNLLRRMISVLANSYDRDAKLL 370
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSH HS EA+ILA YKP IN TLLI ELKQI++ T KA++YF +TRHI+ YYED+
Sbjct: 371 NGTHKSHTHSAKEAEILAGYKPMINTTLLINELKQIQEMTLKALDYFNTTRHILVYYEDV 430
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ N L +VQ+FL++ R+L SRQVKIH G LS+ V+NW++VQ LKGT++E FL ++
Sbjct: 431 VKNLTRLDDVQKFLKVPKRKLKSRQVKIHGGSLSQHVQNWEEVQTTLKGTNFENFLRQEF 490
>gi|359480689|ref|XP_003632512.1| PREDICTED: uncharacterized protein LOC100254393 isoform 2 [Vitis
vinifera]
gi|296082441|emb|CBI21446.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 191/320 (59%), Positives = 232/320 (72%), Gaps = 33/320 (10%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQ--------------------QSEIPFLHYP 40
+C KQ S HTK K N V+++P +IQ EIP +HYP
Sbjct: 43 ICLKQISIHTKAKFENFWVVERPSDDHSIQIPQISTFIGDKNSCYVNGFNMKEIPNVHYP 102
Query: 41 NPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSI 87
P TF+R ECA NPVR+FAI+S+QRSGSGWFETLLN + RR NISSI
Sbjct: 103 KPRTFSRAECANNPVRFFAIVSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISSI 162
Query: 88 VSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRN 147
V T+D++YNLDWF+SASKNECSAAVGFKWM+NQGLM+HH +I +YFK + VS IFL RRN
Sbjct: 163 VKTLDKIYNLDWFSSASKNECSAAVGFKWMLNQGLMKHHKEIVDYFKNRDVSTIFLFRRN 222
Query: 148 LLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTT 207
LLRRM+S+LANSYDR A++LNGTHKSHVHS EA +L+ YKPTINAT LI +L+Q+E+T
Sbjct: 223 LLRRMVSVLANSYDREARVLNGTHKSHVHSTDEADMLSNYKPTINATTLITDLQQMEKTV 282
Query: 208 AKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENW 267
AKA+EYF STRHI+ YYEDLI NR +++ EFL+L + +L SRQVKIH G LS ++NW
Sbjct: 283 AKALEYFNSTRHIILYYEDLIKNRTKTEDILEFLKLPHMDLASRQVKIHKGELSIHIKNW 342
Query: 268 DDVQKALKGTSYERFLHSDY 287
DDV K LKGTSYE FL +DY
Sbjct: 343 DDVNKTLKGTSYEHFLQADY 362
>gi|356565626|ref|XP_003551040.1| PREDICTED: nodulation protein H-like [Glycine max]
Length = 350
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/303 (61%), Positives = 234/303 (77%), Gaps = 17/303 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQ---QSEIPFLHYPNPVTFNREECACNPVRY 57
+C KQ S H +T L+ ++ I++P +S++ + ++++P LHYP PV+FNR ECA NPVR+
Sbjct: 47 VCLKQISTHARTMLMELQ-IEKPSRSSSSRLNLKNDVPLLHYPKPVSFNRSECAGNPVRF 105
Query: 58 FAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSAS 104
FAILS QRSGSGWFETLLN ++ RR+N+S+IV T+D+VYNLDW +SAS
Sbjct: 106 FAILSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRVNVSTIVQTLDKVYNLDWLSSAS 165
Query: 105 KNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNA 164
KNECSAA+GFKWM+NQGLMEH +IA+YF + VS IFL RRNLLRRM+S+LANSYDR A
Sbjct: 166 KNECSAAIGFKWMLNQGLMEHPKEIADYFNSRSVSVIFLFRRNLLRRMVSMLANSYDRYA 225
Query: 165 KLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYY 224
KLL+GTHKSHVHS EA L+KYKPTIN+T L+ +LK IE TAK +EYF +TRH++ YY
Sbjct: 226 KLLSGTHKSHVHSTEEADTLSKYKPTINSTSLLADLKDIEMRTAKVLEYFNTTRHMILYY 285
Query: 225 EDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH 284
EDL+ N LK+VQEFL L ELTSRQVKIH GPLS ++NWDDV + L GT YE FLH
Sbjct: 286 EDLMRNCTKLKDVQEFLGLPQMELTSRQVKIHRGPLSDHIKNWDDVNRTLTGTVYESFLH 345
Query: 285 SDY 287
+DY
Sbjct: 346 ADY 348
>gi|357482975|ref|XP_003611774.1| hypothetical protein MTR_5g017700 [Medicago truncatula]
gi|355513109|gb|AES94732.1| hypothetical protein MTR_5g017700 [Medicago truncatula]
Length = 342
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/301 (62%), Positives = 226/301 (75%), Gaps = 15/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIP-FLHYPNPVTFNREECACNPVRYFA 59
+C KQ +TK +++ VID +S ++Q+ IP LHYP P++FNR ECA NPV +FA
Sbjct: 43 VCRKQIGTEVRTKFMDLNVIDNLTRSI-VKQTHIPDILHYPEPLSFNRNECAPNPVMFFA 101
Query: 60 ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
ILS QRSGSGWFETLLN ++ RR N SSI+ T+DRVYNLDW +SASKN
Sbjct: 102 ILSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRQNASSILQTLDRVYNLDWLSSASKN 161
Query: 107 ECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKL 166
ECSAA+GFKWM+NQGLMEHHN+I EYFK++ VS IFL RRNLLRRM+S+LANSYDR AKL
Sbjct: 162 ECSAAIGFKWMLNQGLMEHHNEIEEYFKRRSVSVIFLFRRNLLRRMVSMLANSYDRYAKL 221
Query: 167 LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED 226
LNGTHKSHVHS EA IL+KYKP IN+T L+ +LK +E KA++YF S+RH++ YYED
Sbjct: 222 LNGTHKSHVHSAEEADILSKYKPVINSTSLLVDLKDMETKAEKALKYFNSSRHMILYYED 281
Query: 227 LITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
L+ N L +VQEFL L ELTSRQVKIH GPLS ++NWDDV K L GT YE FL +D
Sbjct: 282 LMRNHTKLNDVQEFLGLPQMELTSRQVKIHKGPLSDHIQNWDDVNKTLTGTVYESFLEAD 341
Query: 287 Y 287
Y
Sbjct: 342 Y 342
>gi|356514160|ref|XP_003525774.1| PREDICTED: nodulation protein H-like [Glycine max]
Length = 361
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/306 (60%), Positives = 231/306 (75%), Gaps = 20/306 (6%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQS------EIPFLHYPNPVTFNREECACNP 54
+C KQ S H +T L+ ++ I++P +S+ + ++P LHYP PV+FNR ECA NP
Sbjct: 55 VCLKQISTHARTMLMELQ-IEKPSRSSISSINRLNLTHDVPLLHYPEPVSFNRSECAGNP 113
Query: 55 VRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFT 101
VR+FAILS QRSGSGWFETLLN ++ RR+N+S+IV T+D+VYNLDW +
Sbjct: 114 VRFFAILSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRVNVSTIVQTLDKVYNLDWLS 173
Query: 102 SASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYD 161
SASKNECSAA+G KWM+NQGLM+H +IA+YF + VS IFL RRNLLRRM+S+LANSYD
Sbjct: 174 SASKNECSAAIGSKWMLNQGLMDHPKEIADYFNSRSVSVIFLFRRNLLRRMVSMLANSYD 233
Query: 162 RNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIV 221
R AKL +GTHKSHVHS EA L+KYKPTIN+T L+ +LK +E TAKA+EYF +TRH++
Sbjct: 234 RYAKLFSGTHKSHVHSKEEADTLSKYKPTINSTSLLADLKDMEMRTAKALEYFNTTRHMI 293
Query: 222 FYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYER 281
YYEDLI NR LK+VQEFL L ELTSRQVKIH GPLS ++NWDDV + L GT YE
Sbjct: 294 LYYEDLIQNRTKLKDVQEFLGLPQMELTSRQVKIHRGPLSDHIKNWDDVNRTLAGTIYES 353
Query: 282 FLHSDY 287
FLH+DY
Sbjct: 354 FLHADY 359
>gi|42565806|ref|NP_190631.2| nodulation-related protein [Arabidopsis thaliana]
gi|45773806|gb|AAS76707.1| At3g50620 [Arabidopsis thaliana]
gi|114050671|gb|ABI49485.1| At3g50620 [Arabidopsis thaliana]
gi|332645165|gb|AEE78686.1| nodulation-related protein [Arabidopsis thaliana]
Length = 340
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 183/282 (64%), Positives = 220/282 (78%), Gaps = 16/282 (5%)
Query: 22 QPCQSTNIQQSEIPF---LHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-- 76
Q Q++ I + F +HYP P TFNR EC NPVRYFAILSMQRSGSGWFETLLN
Sbjct: 59 QLVQTSPIDSHSLRFVTRIHYPKPQTFNRAECGHNPVRYFAILSMQRSGSGWFETLLNSH 118
Query: 77 -----------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEH 125
+ RR NISSI+ T+DRVYNLDWFTSASKNECSAA+GFKWM+NQGL+E+
Sbjct: 119 NNVSSNGEIFSVLDRRKNISSIIQTLDRVYNLDWFTSASKNECSAAIGFKWMLNQGLLEN 178
Query: 126 HNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILA 185
H DI EYF ++GVSAIFL RRN LRRM+S+LANSYDR AKLLNGTHKSHVHSPAEA L+
Sbjct: 179 HKDIVEYFNRRGVSAIFLFRRNPLRRMVSVLANSYDRYAKLLNGTHKSHVHSPAEADALS 238
Query: 186 KYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAY 245
+YKP IN+T LI +L++ E + AKA+EYF +TRHIV +YEDLITN+ TLK+VQEFL +
Sbjct: 239 RYKPVINSTSLIHDLQETENSAAKALEYFNTTRHIVVFYEDLITNQTTLKQVQEFLNIPV 298
Query: 246 RELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
++L+SRQVKIH G LS ++NW+D+ K L GT YE+FL +DY
Sbjct: 299 KDLSSRQVKIHRGDLSDHIKNWEDINKTLNGTEYEKFLRADY 340
>gi|356496876|ref|XP_003517291.1| PREDICTED: uncharacterized protein LOC100789437 [Glycine max]
Length = 341
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 186/300 (62%), Positives = 222/300 (74%), Gaps = 14/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S +T ++ KVID QS ++ LHYP P +F+R ECA NPV +FAI
Sbjct: 43 VCLKQISTQARTTFMDFKVIDNHSQSI-LKLMNTHLLHYPKPSSFSRNECAHNPVLFFAI 101
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LS QRSGSGWFETLLN ++ RR N+SSI+ T+D+VYNLDWF SASKNE
Sbjct: 102 LSNQRSGSGWFETLLNSHVNVSSYGEIFSVRERRQNVSSILLTLDKVYNLDWFNSASKNE 161
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAA G KWM+NQGL+EHH +IAEYF + VS IFL RRNLLRRM+S+LANSYDR AKLL
Sbjct: 162 CSAATGLKWMLNQGLVEHHKEIAEYFNHRRVSIIFLFRRNLLRRMVSMLANSYDRYAKLL 221
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVHS EA+IL+KYKP IN+T L+ +LK +E +AKA+EYF STRHI+ YYEDL
Sbjct: 222 NGTHKAHVHSAEEAEILSKYKPIINSTSLLDDLKDMEMRSAKALEYFNSTRHIIVYYEDL 281
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ N LK+VQEFL L ELTSR VKIH GPLS ++NWDDV K LKGT YE FL +DY
Sbjct: 282 MRNHTKLKDVQEFLGLPQMELTSRHVKIHRGPLSDHIQNWDDVNKTLKGTIYESFLEADY 341
>gi|297819744|ref|XP_002877755.1| hypothetical protein ARALYDRAFT_485408 [Arabidopsis lyrata subsp.
lyrata]
gi|297323593|gb|EFH54014.1| hypothetical protein ARALYDRAFT_485408 [Arabidopsis lyrata subsp.
lyrata]
Length = 340
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/282 (63%), Positives = 218/282 (77%), Gaps = 16/282 (5%)
Query: 22 QPCQSTNIQQSEIPF---LHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-- 76
Q Q+T + F +HYP P TFNR EC NPVRYFAILSMQRSGSGWFETLLN
Sbjct: 59 QLVQTTPFDSHSLGFVTRIHYPKPQTFNRAECGHNPVRYFAILSMQRSGSGWFETLLNSH 118
Query: 77 -----------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEH 125
+ RR NISSI+ T+DRVYNLDWFTSASKNECSAA+GFKWM+NQGL+++
Sbjct: 119 SNVSSNGEIFSVLDRRKNISSIIQTLDRVYNLDWFTSASKNECSAAIGFKWMLNQGLLDN 178
Query: 126 HNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILA 185
H +I +YF ++GVSAIFL RRN LRRM+S+LANSYDR AKLLNGTHKSHVHSP EA L+
Sbjct: 179 HKEIVDYFNRRGVSAIFLFRRNPLRRMVSVLANSYDRYAKLLNGTHKSHVHSPEEADALS 238
Query: 186 KYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAY 245
+YKP IN+T LI +L++ E + AKA+EYF +TRHIV +YEDLITN+ TLK+VQEFL +
Sbjct: 239 RYKPVINSTSLIHDLQETENSAAKALEYFNTTRHIVVFYEDLITNQTTLKQVQEFLNIPV 298
Query: 246 RELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
++L+SRQVKIH G LS ++NW+D+ K L GT YE+FL +DY
Sbjct: 299 KDLSSRQVKIHRGDLSDHIKNWEDINKTLNGTEYEKFLRADY 340
>gi|449448334|ref|XP_004141921.1| PREDICTED: uncharacterized protein LOC101217742 [Cucumis sativus]
Length = 334
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 178/300 (59%), Positives = 225/300 (75%), Gaps = 15/300 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ + HT+ + L +++ D C +I +++P HYP P TF+R EC NPVR+FAI
Sbjct: 37 ICVKQLNTHTRARFLRVRIAD--CPELSIGLTKVPREHYPRPKTFSRAECFNNPVRFFAI 94
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+SMQRSGSGWFE+LLN + RR NI++IV T+DR+YNLDW SASKN+
Sbjct: 95 VSMQRSGSGWFESLLNSHVNVSSNGEVFSVLDRRRNITTIVQTLDRIYNLDWLNSASKNQ 154
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAAVGFKWM+NQGLM+HH +IAEYF ++GVS IFL RRNLLRR++S+LANSYDR AK+L
Sbjct: 155 CSAAVGFKWMLNQGLMQHHEEIAEYFNRRGVSTIFLFRRNLLRRVVSVLANSYDRYAKML 214
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA L+KYKP IN+T LI EL+ +E T +K++EYF +TRH++ YYED+
Sbjct: 215 NGTHKSHVHSLEEANALSKYKPVINSTSLISELEGMEATISKSLEYFGNTRHLILYYEDI 274
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR LK+VQEFL L EL SRQVKIH G LS ++NW+DV+ L GT YE LH+DY
Sbjct: 275 INNRTKLKDVQEFLNLPPMELKSRQVKIHKGHLSDHIKNWEDVKATLNGTVYEHLLHADY 334
>gi|449530259|ref|XP_004172113.1| PREDICTED: uncharacterized LOC101217742 [Cucumis sativus]
Length = 340
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 178/300 (59%), Positives = 225/300 (75%), Gaps = 15/300 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ + HT+ + L +++ D C +I +++P HYP P TF+R EC NPVR+FAI
Sbjct: 43 ICVKQLNTHTRARFLRVRIAD--CPELSIGLTKVPREHYPRPKTFSRAECFNNPVRFFAI 100
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+SMQRSGSGWFE+LLN + RR NI++IV T+DR+YNLDW SASKN+
Sbjct: 101 VSMQRSGSGWFESLLNSHVNVSSNGEVFSVLDRRRNITTIVQTLDRIYNLDWLNSASKNQ 160
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAAVGFKWM+NQGLM+HH +IAEYF ++GVS IFL RRNLLRR++S+LANSYDR AK+L
Sbjct: 161 CSAAVGFKWMLNQGLMQHHEEIAEYFNRRGVSTIFLFRRNLLRRVVSVLANSYDRYAKML 220
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHKSHVHS EA L+KYKP IN+T LI EL+ +E T +K++EYF +TRH++ YYED+
Sbjct: 221 NGTHKSHVHSLEEANALSKYKPVINSTSLISELEGMEATISKSLEYFGNTRHLILYYEDI 280
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR LK+VQEFL L EL SRQVKIH G LS ++NW+DV+ L GT YE LH+DY
Sbjct: 281 INNRTKLKDVQEFLNLPPMELKSRQVKIHKGHLSDHIKNWEDVKATLNGTVYEHLLHADY 340
>gi|356537631|ref|XP_003537329.1| PREDICTED: nodulation protein H-like [Glycine max]
Length = 341
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/300 (60%), Positives = 221/300 (73%), Gaps = 14/300 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C Q S +T ++ KVID QS ++ LHYP P +F+R EC NPV +FAI
Sbjct: 43 VCLMQISTQARTTFMDFKVIDNHSQSI-LKLMNTHLLHYPKPASFSRNECVHNPVLFFAI 101
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LS QRSGSGWFETLLN ++ RR N+SSI+ T+D+VYNLDWF SASKNE
Sbjct: 102 LSNQRSGSGWFETLLNSHINVSSNGEIFSVRERRQNVSSILHTLDKVYNLDWFNSASKNE 161
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
CSAA G KWM+NQGLMEHH +IAEYF ++ VS IFL RRNLLRRM+S+LANSYDR AKLL
Sbjct: 162 CSAATGLKWMLNQGLMEHHKEIAEYFNRRRVSIIFLFRRNLLRRMVSMLANSYDRYAKLL 221
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVHS EA+IL+KYKP IN+T L+ +L+ +E +AKA+EYF STRH++ YYEDL
Sbjct: 222 NGTHKAHVHSAEEAEILSKYKPIINSTSLLDDLQDMEMRSAKALEYFYSTRHMIVYYEDL 281
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ N LK+VQ+FL L ELTSR VKIH GPLS ++NWDDV K LKGT YE FL +DY
Sbjct: 282 MRNHTKLKDVQDFLGLPQMELTSRHVKIHRGPLSDHIQNWDDVNKTLKGTIYESFLEADY 341
>gi|255538396|ref|XP_002510263.1| conserved hypothetical protein [Ricinus communis]
gi|223550964|gb|EEF52450.1| conserved hypothetical protein [Ricinus communis]
Length = 278
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 174/255 (68%), Positives = 206/255 (80%), Gaps = 13/255 (5%)
Query: 48 EECACNPVRYFAILSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRV 94
EECACNPVRYFAIL+MQRSGSGWFETLLN K RR N+S+I+ T+D+V
Sbjct: 24 EECACNPVRYFAILTMQRSGSGWFETLLNSHMNVSSNGEIFGKKERRANVSAIMKTLDKV 83
Query: 95 YNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMIS 154
YNLDWF+SA+KNEC+AAVGFKWM+NQG++EHH I EYF +KGV AIFL RRNLLRRM+S
Sbjct: 84 YNLDWFSSAAKNECNAAVGFKWMLNQGVLEHHKGIVEYFNEKGVHAIFLFRRNLLRRMVS 143
Query: 155 ILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYF 214
++ANSYD++ K LNGTHKSHVHS AEAK+LAKYKP IN T LI +LK ++ A+AI YF
Sbjct: 144 VIANSYDKSNKPLNGTHKSHVHSAAEAKVLAKYKPKINTTTLIADLKHVDDRAARAIGYF 203
Query: 215 KSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKAL 274
KSTRHIV YYED++ N LKEVQEF+RL YRELTSRQVKIHSG L Q++N +VQ+ L
Sbjct: 204 KSTRHIVVYYEDVVGNSTKLKEVQEFMRLPYRELTSRQVKIHSGNLLDQIQNSGEVQQVL 263
Query: 275 KGTSYERFLHSDYRV 289
KGTSYE FL SDY++
Sbjct: 264 KGTSYESFLQSDYQM 278
>gi|218192981|gb|EEC75408.1| hypothetical protein OsI_11905 [Oryza sativa Indica Group]
Length = 365
Score = 363 bits (931), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 171/302 (56%), Positives = 220/302 (72%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ + ++++ I+V +Q C + I SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 64 MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 123
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSI T+D+VYNLDW +SASKNE
Sbjct: 124 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 183
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL RRNLLR+++S LAN++DR K L
Sbjct: 184 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRRNLLRQLVSQLANNHDRYLKQL 243
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N T LI LKQ++ T A+E KS HI YYEDL
Sbjct: 244 NGTHKAHVHTAYEASILARYKPRLNTTSLIRSLKQVDDYTRDALENLKSINHITIYYEDL 303
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR L +V +FL++ +L SR VKIH+ PLS+Q+ENWD+V AL GT YE FL++DY
Sbjct: 304 IRNRTKLLDVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 363
Query: 288 RV 289
R+
Sbjct: 364 RI 365
>gi|242035555|ref|XP_002465172.1| hypothetical protein SORBIDRAFT_01g033350 [Sorghum bicolor]
gi|241919026|gb|EER92170.1| hypothetical protein SORBIDRAFT_01g033350 [Sorghum bicolor]
Length = 364
Score = 360 bits (923), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 169/303 (55%), Positives = 225/303 (74%), Gaps = 14/303 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ ++++ I+V++QPC + + SE+ F+HYP P+T++REEC CN VR+FAI
Sbjct: 62 MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFVHYPQPITYSREECKCNAVRFFAI 121
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSI++T+D+VYNLDW +SASKNE
Sbjct: 122 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIINTLDKVYNLDWNSSASKNE 181
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AA+GFKWM+NQGL+ +H D+ +YF ++GVSAIFL RRNLLR+++S LAN++DR K L
Sbjct: 182 CTAAIGFKWMLNQGLVANHADVVDYFNQRGVSAIFLFRRNLLRQLVSQLANNHDRFLKQL 241
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N T LI +LKQ ++ T AI+ +TRHI YYED+
Sbjct: 242 NGTHKAHVHTKREAHILARYKPKLNTTSLIWQLKQADEYTRDAIQNLNNTRHITVYYEDI 301
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
+ NR L +V +FL++ R L SR VKIH+ PLS+Q+ENWD+V AL GT YE FL+ +D
Sbjct: 302 VRNRTKLFDVLDFLKVPRRNLVSRHVKIHTRPLSEQIENWDEVYSALNGTQYEGFLNAAD 361
Query: 287 YRV 289
Y V
Sbjct: 362 YVV 364
>gi|115453377|ref|NP_001050289.1| Os03g0394900 [Oryza sativa Japonica Group]
gi|108708616|gb|ABF96411.1| nodulation protein, putative, expressed [Oryza sativa Japonica
Group]
gi|113548760|dbj|BAF12203.1| Os03g0394900 [Oryza sativa Japonica Group]
gi|215737279|dbj|BAG96208.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625064|gb|EEE59196.1| hypothetical protein OsJ_11141 [Oryza sativa Japonica Group]
Length = 362
Score = 359 bits (922), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 170/302 (56%), Positives = 219/302 (72%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ + ++++ I+V +Q C + I SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 61 MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 120
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSI T+D+VYNLDW +SASKNE
Sbjct: 121 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 180
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL R NLLR+++S LAN++DR K L
Sbjct: 181 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRWNLLRQLVSQLANNHDRYLKQL 240
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N T LI LKQ++ T A+E KS HI YYEDL
Sbjct: 241 NGTHKAHVHTAYEASILARYKPRLNTTSLIRSLKQVDDYTRDALENLKSINHITIYYEDL 300
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR L +V +FL++ +L SR VKIH+ PLS+Q+ENWD+V AL GT YE FL++DY
Sbjct: 301 IRNRTKLLDVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 360
Query: 288 RV 289
R+
Sbjct: 361 RI 362
>gi|29824462|gb|AAP04177.1| unknown protein [Oryza sativa Japonica Group]
Length = 384
Score = 359 bits (921), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 170/302 (56%), Positives = 219/302 (72%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ + ++++ I+V +Q C + I SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 83 MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 142
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSI T+D+VYNLDW +SASKNE
Sbjct: 143 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 202
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL R NLLR+++S LAN++DR K L
Sbjct: 203 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRWNLLRQLVSQLANNHDRYLKQL 262
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N T LI LKQ++ T A+E KS HI YYEDL
Sbjct: 263 NGTHKAHVHTAYEASILARYKPRLNTTSLIRSLKQVDDYTRDALENLKSINHITIYYEDL 322
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR L +V +FL++ +L SR VKIH+ PLS+Q+ENWD+V AL GT YE FL++DY
Sbjct: 323 IRNRTKLLDVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 382
Query: 288 RV 289
R+
Sbjct: 383 RI 384
>gi|357111946|ref|XP_003557771.1| PREDICTED: uncharacterized protein LOC100845427 [Brachypodium
distachyon]
Length = 357
Score = 357 bits (915), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 168/302 (55%), Positives = 218/302 (72%), Gaps = 13/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ + ++++ I+V +QPC + I SE+ F+HYP P ++R+EC C PVR+FAI
Sbjct: 56 ICMKQLGTDSWSRMVKIEVAEQPCNKSLIPLSEVQFVHYPQPQNYSRKECMCTPVRFFAI 115
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSIV TMD+VYNLDW +SASKNE
Sbjct: 116 ISSQRSGSGWFETLLNSHINVSSNGEIFSNKERRNNISSIVKTMDKVYNLDWNSSASKNE 175
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVGFKWM+NQGL+ +H + +YF ++GVS +FL+RRNLLR+M+S LAN++DR K L
Sbjct: 176 CTAAVGFKWMLNQGLVANHAGVVDYFNQRGVSTVFLMRRNLLRQMVSQLANNHDRYIKQL 235
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NG HK+HVH+ EA ILA+YKP +N T LI LKQ + T A+E KS RH+ YYEDL
Sbjct: 236 NGKHKAHVHTRDEANILARYKPRLNTTTLIWSLKQADDYTRDALENLKSVRHMTMYYEDL 295
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
I NR L +V +FL++ R+L SR VKIH+ PLS Q+ENWD+V AL GT +E FL +DY
Sbjct: 296 IQNRTKLVDVLDFLKVPRRKLVSRHVKIHTKPLSDQIENWDEVYNALNGTQFESFLTADY 355
Query: 288 RV 289
R+
Sbjct: 356 RI 357
>gi|293332934|ref|NP_001167842.1| hypothetical protein [Zea mays]
gi|223944369|gb|ACN26268.1| unknown [Zea mays]
gi|413955392|gb|AFW88041.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
Length = 360
Score = 353 bits (905), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 219/301 (72%), Gaps = 14/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ ++++ I+V++QPC + + SE+ F HYP P T++REEC CN VR+FAI
Sbjct: 59 MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSREECKCNAVRFFAI 118
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSIV+ MDRVY+LDW +SASKNE
Sbjct: 119 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIVNAMDRVYSLDWNSSASKNE 178
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AA+GFKWM+NQGL+ +H + +YF ++GVSAIFL RRNLLR+++S LAN++DR K L
Sbjct: 179 CTAAIGFKWMLNQGLVANHVAVVDYFNRRGVSAIFLFRRNLLRQLVSQLANNHDRFLKQL 238
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N + LI +LKQ ++ T A+E +TRHI YYED+
Sbjct: 239 NGTHKAHVHTEHEAHILARYKPRLNTSSLIRKLKQADEYTRDALENLNNTRHITVYYEDI 298
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
+ NR L + +FL + REL SR VKIH+ PLS+Q+ENWD+V AL GT YE FL+ +D
Sbjct: 299 VRNRTRLLDALDFLGVPRRELASRHVKIHTKPLSEQIENWDEVYSALNGTQYEGFLNAAD 358
Query: 287 Y 287
Y
Sbjct: 359 Y 359
>gi|226500168|ref|NP_001142620.1| uncharacterized protein LOC100274889 [Zea mays]
gi|195607388|gb|ACG25524.1| hypothetical protein [Zea mays]
Length = 360
Score = 352 bits (904), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 165/301 (54%), Positives = 222/301 (73%), Gaps = 14/301 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ + ++++ I+V++QPC + + SE+ F+HYP P T++REEC CN VR+FAI
Sbjct: 59 ICMKQLGSNGWSRIVKIEVVEQPCNKSTVPPSEVQFVHYPQPTTYSREECKCNAVRFFAI 118
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSIV+T+DRVY+LDW +SASKNE
Sbjct: 119 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIVNTLDRVYSLDWNSSASKNE 178
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AA+GFKWM+NQGL+ +H + +YF ++GVSA+FL RRNLLR+++S LAN++DR K L
Sbjct: 179 CTAAIGFKWMLNQGLVANHVAVVDYFNRRGVSAVFLFRRNLLRQLVSQLANNHDRFLKQL 238
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N + LI +LKQ ++ T A+E +TRHI YYED+
Sbjct: 239 NGTHKAHVHTEHEAHILARYKPRLNTSSLIRQLKQADEYTRDALENLNNTRHITVYYEDI 298
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
+ NR L + +FL + REL SR VKIH+ PLS+Q+ENWD+V AL GT YE FL+ +D
Sbjct: 299 VRNRTRLLDALDFLGVPRRELASRHVKIHTKPLSEQIENWDEVYSALNGTQYEGFLNAAD 358
Query: 287 Y 287
Y
Sbjct: 359 Y 359
>gi|226496856|ref|NP_001145331.1| hypothetical protein [Zea mays]
gi|195654733|gb|ACG46834.1| hypothetical protein [Zea mays]
gi|414867126|tpg|DAA45683.1| TPA: hypothetical protein ZEAMMB73_734756 [Zea mays]
Length = 360
Score = 344 bits (883), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 164/303 (54%), Positives = 220/303 (72%), Gaps = 14/303 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ ++++ I+V++QPC + SE F+ YP P+T++REEC CN VR FAI
Sbjct: 58 MCMKQLGSDGWSRVVKIEVVEQPCNKSIAPLSEAQFVRYPQPITYSREECKCNAVRSFAI 117
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSI+ T+D+VYNLDW +SASKNE
Sbjct: 118 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIIDTLDKVYNLDWNSSASKNE 177
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AA+GFKWM+NQGL+ +H D+ +YF ++GVSAIFL RRNLLR+++S +AN++DR K L
Sbjct: 178 CTAAIGFKWMLNQGLVANHADVVDYFNRRGVSAIFLFRRNLLRQLVSQVANNHDRLLKQL 237
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+Y+P +N T LI +LK+ ++ T A+E +TRH+ YYED+
Sbjct: 238 NGTHKAHVHTKREAHILARYRPRLNTTSLIWQLKRADEYTRDALENLNNTRHMSVYYEDV 297
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SD 286
+ NR L +V +FL + R+L SR VKIH+ PLS+Q+ENWD+V AL GT YE FL+ +D
Sbjct: 298 VRNRTKLLDVLDFLGVPRRKLVSRHVKIHTKPLSEQIENWDEVYSALNGTQYEGFLNAAD 357
Query: 287 YRV 289
Y V
Sbjct: 358 YLV 360
>gi|255562595|ref|XP_002522303.1| conserved hypothetical protein [Ricinus communis]
gi|223538381|gb|EEF39987.1| conserved hypothetical protein [Ricinus communis]
Length = 329
Score = 343 bits (879), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 173/294 (58%), Positives = 214/294 (72%), Gaps = 14/294 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S ++K K+ +I+VI++P + + Q +IP +HYP+P TFNR EC NPVRYFAI
Sbjct: 43 VCLKQISTNSKIKIEDIQVIERPSDNESTQM-QIPIVHYPHPETFNRSECVHNPVRYFAI 101
Query: 61 LSMQRSGSGWFETLLNIKHRRINISS-------IVSTMDRVYNLDWFTSASKNECSAAVG 113
LSMQRSGSGWFETLLN +N+SS I+ + + ++T A+
Sbjct: 102 LSMQRSGSGWFETLLN---SHVNVSSNGEIFSFIIGFRRTLLTILYYTLPRIKRLMAS-- 156
Query: 114 FKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKS 173
F +M QGLMEHH I EYF +GVSAIFL RRNLLRRM+S+LANSYDR AKLLNGTHKS
Sbjct: 157 FLCLM-QGLMEHHEAIVEYFNSRGVSAIFLFRRNLLRRMVSVLANSYDRYAKLLNGTHKS 215
Query: 174 HVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRAT 233
HVHS EA+ L++YKP IN+TLLI +LK++E T A+A+EYF STRHI+ YYEDLI N
Sbjct: 216 HVHSTEEAETLSRYKPIINSTLLISDLKEVEITAARALEYFNSTRHIILYYEDLIKNNTK 275
Query: 234 LKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
LK+VQEFL L ELTSRQVKIH GPLS ++NW+DV KAL GT+YE FL +DY
Sbjct: 276 LKDVQEFLGLPLMELTSRQVKIHKGPLSDHIKNWEDVNKALTGTAYESFLEADY 329
>gi|147833389|emb|CAN66236.1| hypothetical protein VITISV_022815 [Vitis vinifera]
Length = 352
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 214/332 (64%), Gaps = 66/332 (19%)
Query: 7 SPHTKTKLLNIKVIDQPCQSTNIQ--------------------QSEIPFLHYPNPVTFN 46
S HTK K N V+++P +IQ EIP +HYP P TF+
Sbjct: 36 SIHTKAKFENFWVVERPSDDHSIQIPQISTFIGDKNSCYVNGFNMKEIPNVHYPKPRTFS 95
Query: 47 REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
R ECA NPVR+FAI+S+QRSGSGWFETLLN + RR NIS IV T+D+
Sbjct: 96 RAECANNPVRFFAIVSIQRSGSGWFETLLNSHINVSSNGEIFNVYDRRKNISXIVKTLDK 155
Query: 94 VYNLDWFTSASKNECSAAVGFKWMMNQ------------------GLMEHHNDIAEYFKK 135
+YNLDWF+SASKNECSAAVGFKWM+NQ GLM+HH +I +YFK
Sbjct: 156 IYNLDWFSSASKNECSAAVGFKWMLNQKLGEVVFMLXVEISHKMMGLMKHHKEIVDYFKN 215
Query: 136 KGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATL 195
+ VS IFL RRNLLRRM+S+LANSYDR A+ A +L+ YKPTINAT
Sbjct: 216 RDVSTIFLFRRNLLRRMVSVLANSYDREAR---------------ADMLSXYKPTINATT 260
Query: 196 LIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKI 255
LI +L+Q+E+T AKA+EYF STRHI+ YYEDLI NR +++ EFL+L + +L SRQVKI
Sbjct: 261 LITDLQQVEKTVAKALEYFNSTRHIILYYEDLIKNRTKTEDILEFLKLPHMDLASRQVKI 320
Query: 256 HSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
H G LS ++NWDDV K LKGTSYE FL +DY
Sbjct: 321 HKGELSIHIKNWDDVNKTLKGTSYEHFLQADY 352
>gi|357125400|ref|XP_003564382.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
Length = 359
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 155/278 (55%), Positives = 207/278 (74%), Gaps = 13/278 (4%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
++PC +I +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGWFETLLN
Sbjct: 80 EKPCHRPSIPDTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWFETLLNSHEN 139
Query: 77 ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
+K RR NI+SI T+D++YNLDW +SA+KNEC+AAVG KWM+NQGLM+HH
Sbjct: 140 ISSNGEIFSVKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLMKHHQ 199
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
++ EYF ++GVSAIFL+RRNLL+R +SILAN +DR+ K LNGTHK+HVH +A +LA+Y
Sbjct: 200 EVVEYFNRRGVSAIFLLRRNLLQRYVSILANDHDRSTKQLNGTHKAHVHHRGQADVLAQY 259
Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
KPTI+ LI ELK+ ++ A A+ FK+ R+IV YYED+++NR L +V +FL+L +
Sbjct: 260 KPTISTKSLIAELKRSDKLAADALVNFKTIRNIVLYYEDVVSNRTKLTDVLDFLKLPNIK 319
Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
L+SR VKIH+ L ++NW DV L GT YE FL+
Sbjct: 320 LSSRHVKIHTKRLRDHIDNWIDVYNTLNGTQYESFLNG 357
>gi|125527968|gb|EAY76082.1| hypothetical protein OsI_04009 [Oryza sativa Indica Group]
Length = 358
Score = 341 bits (874), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 154/275 (56%), Positives = 205/275 (74%), Gaps = 13/275 (4%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
CQ +I +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGW ETLLN
Sbjct: 84 CQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIETLLNSHENISS 143
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
IK RR NI+SI T+D++YNLDW +SA+KNEC+AAVG KWM+NQGL++HH ++
Sbjct: 144 NGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLVQHHEEVV 203
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
EYF ++GVSAIFL+RRN L+R +S+LAN++D AK +NGTHKSHVHS EA+ILA++KP
Sbjct: 204 EYFNRRGVSAIFLLRRNTLQRYVSVLANAHDSKAKQINGTHKSHVHSEREAQILAQFKPE 263
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
I+ LI +LK+ ++ A A+ YFK TRHI+ YYED+++N L +V +FLRL R+L+S
Sbjct: 264 IDTKKLIADLKKSDKLAADALLYFKKTRHIILYYEDVVSNDTKLMDVLDFLRLPKRKLSS 323
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
R VKIH+ L ++NW +V L GT YE FL+
Sbjct: 324 RHVKIHTKLLRDHIDNWAEVNSTLMGTQYESFLNG 358
>gi|125572263|gb|EAZ13778.1| hypothetical protein OsJ_03703 [Oryza sativa Japonica Group]
Length = 359
Score = 333 bits (855), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 154/276 (55%), Positives = 203/276 (73%), Gaps = 14/276 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
CQ +I +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGW ETLLN
Sbjct: 84 CQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIETLLNSHENISS 143
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
IK RR NI+SI T+D++YNLDW +SA+KNEC+AAVG KWM+NQGL++HH ++
Sbjct: 144 NGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLVQHHEEVV 203
Query: 131 EYFK-KKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKP 189
EYF KK SAIFL+RRN L+R +S+LAN++D AK +NGTHKSHVHS EA+ILA++KP
Sbjct: 204 EYFNPKKAFSAIFLLRRNTLQRYVSVLANAHDSKAKQINGTHKSHVHSEREAQILAQFKP 263
Query: 190 TINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELT 249
I+ LI +LK+ ++ A A+ YFK TRHI+ YYED+++N L +V +FLRL R+L+
Sbjct: 264 EIDTKKLIADLKKSDKLAADALLYFKKTRHIILYYEDVVSNDTKLMDVLDFLRLPKRKLS 323
Query: 250 SRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
SR VKIH+ L ++NW +V L GT YE FL+
Sbjct: 324 SRHVKIHTKLLRDHIDNWAEVNSTLMGTQYESFLNG 359
>gi|222637549|gb|EEE67681.1| hypothetical protein OsJ_25329 [Oryza sativa Japonica Group]
Length = 923
Score = 333 bits (854), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 154/278 (55%), Positives = 199/278 (71%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
C+ ++ E+ ++H+P P +++R EC+CN VR F ++SMQRSGSGWFETLLN
Sbjct: 644 CRKPHLPHEELRYVHFPKPESYSRGECSCNLVRSFVLVSMQRSGSGWFETLLNSHPNISS 703
Query: 79 --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
RR NISSI+ T+D++YNLDWFTSA+KNEC+AA G KWM+NQG M+HH+DIA
Sbjct: 704 NGEIFNRVDRRENISSILQTLDKLYNLDWFTSAAKNECTAAFGLKWMLNQGFMDHHDDIA 763
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKGVS IFL RRN LRR+IS+LAN YDR+AK LNGTHKSHVHS EA+ILAK+KP
Sbjct: 764 SYLNKKGVSVIFLFRRNTLRRLISVLANDYDRDAKQLNGTHKSHVHSKEEAEILAKFKPV 823
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
+ + LIP ++ E+ +++F +TRH++ YYED++ NR L +VQEFL + R+L S
Sbjct: 824 QDVSNLIPNIRNSEKYIRDCLDHFNTTRHMILYYEDIVRNRNALFQVQEFLGVPVRKLVS 883
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
RQVKIH+ PL V NWDDV L GT Y FL +DY
Sbjct: 884 RQVKIHTSPLPGLVRNWDDVSNKLNGTQYAHFLDGADY 921
>gi|326523931|dbj|BAJ96976.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 339
Score = 332 bits (852), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 153/278 (55%), Positives = 198/278 (71%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
C + E+P++H+P P ++R EC+CNPVR+F I+SMQRSGSGWFETLLN
Sbjct: 60 CAKPYVPSEELPYVHFPKPKGYSRAECSCNPVRFFVIMSMQRSGSGWFETLLNSHPNISS 119
Query: 79 --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
RR NISSIV T+D++YNLDW TSA+KNEC+AA GFKWM+NQG M+H +DI
Sbjct: 120 NGEIFNRVDRRENISSIVQTLDKLYNLDWLTSAAKNECTAAFGFKWMLNQGFMDHRDDIL 179
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKG+S IFL RRN LRR+IS+LAN+YDR+AK LNGTHKSHVHS EA+ILAK+KP
Sbjct: 180 SYLNKKGISVIFLFRRNTLRRLISVLANNYDRDAKQLNGTHKSHVHSEEEAEILAKFKPE 239
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ + L+ ++ +E+ ++ F +TR +V YYED+I NR L +VQEF+ + R+L S
Sbjct: 240 LDVSTLVSNIRDVEKYMGDCLDSFNTTRRMVLYYEDIIRNRNALFQVQEFIGVPVRKLVS 299
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
RQVKIH+ PL V NW+DV L GT Y RFL +DY
Sbjct: 300 RQVKIHTRPLPDLVRNWEDVNSRLNGTEYARFLDGADY 337
>gi|255569155|ref|XP_002525546.1| conserved hypothetical protein [Ricinus communis]
gi|223535125|gb|EEF36805.1| conserved hypothetical protein [Ricinus communis]
Length = 333
Score = 332 bits (852), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 157/302 (51%), Positives = 217/302 (71%), Gaps = 15/302 (4%)
Query: 1 MCSKQTSPHTKTKLLNIK-VIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFA 59
+C +Q S +K ++IK ++ C +I EI ++H+P P T++REECAC PVR+F
Sbjct: 30 LCLQQISFPLISKEVSIKDIMKSSCYIHDIPPQEIHYVHFPYPKTYSREECACTPVRFFV 89
Query: 60 ILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKN 106
I+SMQRSGSGWFETLLN +K RR +IS+I +T+D V+ L+W +SA+KN
Sbjct: 90 IMSMQRSGSGWFETLLNSHPNVSSNGEIFSVKQRRSSISTIQATLDTVFGLEWVSSAAKN 149
Query: 107 ECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKL 166
+C AAVGFKWM+NQG+M +H +I +YF +KGVS IFL R+NLLRR++S+LAN+YDR+AK
Sbjct: 150 DCIAAVGFKWMLNQGVMVYHREIMDYFNQKGVSVIFLFRKNLLRRLVSVLANAYDRDAKQ 209
Query: 167 LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED 226
+NGTHKSHVH+ EA +LAK+KP +N + LI +L E T + +E+FKS RH+V YYED
Sbjct: 210 INGTHKSHVHTREEANLLAKFKPVLNISALISDLSSAEHTMTETLEFFKSCRHLVLYYED 269
Query: 227 LITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL-HS 285
L+ N L E QEFLR+ R L S QVKIH LS+Q+ NW++V + L+G+ Y+ FL ++
Sbjct: 270 LMKNPKALSEAQEFLRVPIRNLVSHQVKIHVKLLSEQIYNWEEVYRTLEGSEYKYFLNYA 329
Query: 286 DY 287
DY
Sbjct: 330 DY 331
>gi|115473697|ref|NP_001060447.1| Os07g0644200 [Oryza sativa Japonica Group]
gi|33146729|dbj|BAC79620.1| unknown protein [Oryza sativa Japonica Group]
gi|50509939|dbj|BAD30260.1| unknown protein [Oryza sativa Japonica Group]
gi|113611983|dbj|BAF22361.1| Os07g0644200 [Oryza sativa Japonica Group]
gi|215694729|dbj|BAG89920.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 337
Score = 332 bits (850), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 154/278 (55%), Positives = 200/278 (71%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
C+ ++ E+ ++H+P P +++R EC+CN VR F ++SMQRSGSGWFETLLN
Sbjct: 58 CRKPHLPHEELRYVHFPKPESYSRGECSCNLVRSFVLVSMQRSGSGWFETLLNSHPNISS 117
Query: 79 --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
RR NISSI+ T+D++YNLDWFTSA+KNEC+AA G KWM+NQG M+HH+DIA
Sbjct: 118 NGEIFNRVDRRENISSILQTLDKLYNLDWFTSAAKNECTAAFGLKWMLNQGFMDHHDDIA 177
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKGVS IFL RRN LRR+IS+LAN YDR+AK LNGTHKSHVHS EA+ILAK+KP
Sbjct: 178 SYLNKKGVSVIFLFRRNTLRRLISVLANDYDRDAKQLNGTHKSHVHSKEEAEILAKFKPV 237
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ + LIP ++ E+ +++F +TRH++ YYED++ NR L +VQEFL + R+L S
Sbjct: 238 LDVSNLIPNIRNAEKYIRDCLDHFNTTRHMILYYEDIVRNRNALFQVQEFLGVPVRKLVS 297
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
RQVKIH+ PL V NWDDV L GT Y FL +DY
Sbjct: 298 RQVKIHTSPLPGLVRNWDDVSNKLNGTQYAHFLDGADY 335
>gi|357121725|ref|XP_003562568.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
Length = 337
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 154/278 (55%), Positives = 200/278 (71%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
C ++ E+ ++H P P +NR EC+C PVR+F I+SMQRSGSGWFETLLN
Sbjct: 58 CTKPSVPSEELRYVHLPKPKGYNRGECSCTPVRFFVIVSMQRSGSGWFETLLNSHPNISS 117
Query: 79 --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
RR N+SSIV T+D++YNLDW TSA+KNEC+AA G KWM+NQG M+HH+DI
Sbjct: 118 NGEIFNRIDRRENLSSIVQTLDKLYNLDWLTSAAKNECTAAFGLKWMLNQGFMDHHDDIV 177
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
YF +KGVS IFL RRN LRR+IS+LAN+YDR+AK LNGTHKSHVHS EA+ILAK+KP
Sbjct: 178 SYFNQKGVSLIFLFRRNTLRRLISVLANNYDRDAKQLNGTHKSHVHSEEEAEILAKFKPE 237
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ + LI +++ IE+ ++ F +TRH++ YYED+I+NR L VQEFL + R+L S
Sbjct: 238 LDVSTLILDIRDIEKYIRDCLDRFNTTRHMILYYEDIISNRNALFRVQEFLGVPARKLVS 297
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
+QVKIH+ PL V+NW+DV L GT Y RFL +DY
Sbjct: 298 KQVKIHTRPLPDLVKNWEDVNSKLNGTEYARFLDGADY 335
>gi|223942635|gb|ACN25401.1| unknown [Zea mays]
gi|414869370|tpg|DAA47927.1| TPA: thylakoid lumenal protein, mRNA [Zea mays]
Length = 364
Score = 328 bits (842), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 162/281 (57%), Positives = 210/281 (74%), Gaps = 13/281 (4%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
++PC +I +EIP+LHYP P T++R+ECAC VR+FAILSMQRSGSGW ETLLN
Sbjct: 84 EKPCHDPSIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPN 143
Query: 77 ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
+K RR NI++I T+DR+YNLDW++SA+KNEC+AAVG KWM+NQGLM+HH
Sbjct: 144 ISSNGEIFSVKERRSNITAITKTLDRLYNLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQ 203
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
+I EYF ++GVSAIFL+RRNLLRR +SILAN++D K LNGTHK+HVHS EA+ILA+Y
Sbjct: 204 EIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAEILAQY 263
Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
KPTI+ LI ELK+ ++ A A+ FK+TRH+V YYED+++NR L +V +FLRL R+
Sbjct: 264 KPTIDKKTLITELKRSDKLAADALVNFKNTRHVVLYYEDVVSNRTMLMDVLDFLRLPKRK 323
Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
L SR VKIH+ L ++NW DV LKGT +E FL+ R
Sbjct: 324 LFSRHVKIHTKRLCDHIDNWADVTDFLKGTPFESFLNGSRR 364
>gi|242082293|ref|XP_002445915.1| hypothetical protein SORBIDRAFT_07g027970 [Sorghum bicolor]
gi|241942265|gb|EES15410.1| hypothetical protein SORBIDRAFT_07g027970 [Sorghum bicolor]
Length = 357
Score = 326 bits (835), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 159/281 (56%), Positives = 210/281 (74%), Gaps = 13/281 (4%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
++PC +I +EIP+LHYP P T++R+ECAC VR+FAILSMQRSGSGW ETLLN
Sbjct: 77 EKPCHDPSIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPN 136
Query: 77 ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
+K RR NI++I T+D++YNLDW++SA+KNEC+AAVG KWM+NQGLM+HH
Sbjct: 137 ISSNGEIFSVKDRRSNITAITKTLDKLYNLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQ 196
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
+I EYF ++GVSAIFL+RRNLLRR +SILAN++D K LNGTHK+HVHS EA+ILA+Y
Sbjct: 197 EIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAEILAQY 256
Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
KPTI+ LI ELK+ ++ + A+ FK+TRH+V YYED++ NR L +V +FLR+ R+
Sbjct: 257 KPTIDKKTLITELKRSDKLASDALVNFKNTRHVVLYYEDVVRNRTMLMDVLDFLRVPKRK 316
Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
L+SR VKIH+ L ++NW DV LKGT +E FL+ R
Sbjct: 317 LSSRHVKIHTKRLCDHIDNWADVNNFLKGTRFESFLNGSRR 357
>gi|242046744|ref|XP_002461118.1| hypothetical protein SORBIDRAFT_02g041100 [Sorghum bicolor]
gi|241924495|gb|EER97639.1| hypothetical protein SORBIDRAFT_02g041100 [Sorghum bicolor]
Length = 335
Score = 324 bits (830), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 151/278 (54%), Positives = 199/278 (71%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK----- 78
C+ + E+ ++H+P P +++R EC+C PVR+F I+SMQRSGSGWFETLLN
Sbjct: 56 CRKPYVPHEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLLNSHPNISS 115
Query: 79 --------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
RR NISSI+ T+D +YNLDW TSA+KNEC+AA G KWM+NQG++E+ DI
Sbjct: 116 NGEIFNRVDRRENISSILQTLDTLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPEDIV 175
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKKGVS IFL RRN LRR+IS+LAN YD++AK LNGTHKSHVHS EA+ILAK+KP
Sbjct: 176 SYLKKKGVSVIFLFRRNTLRRLISVLANDYDKDAKQLNGTHKSHVHSKEEAEILAKFKPH 235
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ + LI ++ IE+ +++FKSTRH++ YYED+I+N L +VQEFLR+ R L S
Sbjct: 236 LDPSTLITNIRNIEKAIRDCLDHFKSTRHMILYYEDIISNSNALFQVQEFLRVPVRRLMS 295
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
RQVKIH+ P+ V+NW++V L GT + FL SDY
Sbjct: 296 RQVKIHTRPIPDLVKNWEEVSSKLNGTEFAHFLDGSDY 333
>gi|212723924|ref|NP_001131369.1| uncharacterized protein LOC100192694 [Zea mays]
gi|194691340|gb|ACF79754.1| unknown [Zea mays]
gi|224028935|gb|ACN33543.1| unknown [Zea mays]
gi|414887880|tpg|DAA63894.1| TPA: hypothetical protein ZEAMMB73_966617 [Zea mays]
gi|414887881|tpg|DAA63895.1| TPA: hypothetical protein ZEAMMB73_966617 [Zea mays]
Length = 337
Score = 323 bits (828), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 150/281 (53%), Positives = 198/281 (70%), Gaps = 14/281 (4%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK-- 78
D C+ + E+ ++H+P P +++R EC+C PVR+F I+SMQRSGSGWFETL+N
Sbjct: 55 DDICRKPYVPYEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLMNSHPN 114
Query: 79 -----------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
RR NISSI+ T+D +YNLDW TSA+KNEC+AA G KWM+NQG++E+
Sbjct: 115 ISSNGEIFNRMDRRENISSILQTLDTLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPE 174
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
DI Y KKGVS IFL RRN LRR++S+LAN YD++AK LNGTHKSHVHS EA+ILAK+
Sbjct: 175 DIVSYLNKKGVSVIFLFRRNTLRRLVSVLANDYDKDAKQLNGTHKSHVHSKEEAEILAKF 234
Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
KP ++ + LI ++ IE+ +++FKSTRH++ YYED+I+N L VQEFLR+ R
Sbjct: 235 KPHLDTSTLITNIRNIEKAIRDCLDHFKSTRHMILYYEDIISNSNALSRVQEFLRVPVRR 294
Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
L SRQVKIH+ PL V+NW++V L GT + FL SDY
Sbjct: 295 LMSRQVKIHTRPLPDLVKNWEEVSSKLNGTEFAHFLDGSDY 335
>gi|194696388|gb|ACF82278.1| unknown [Zea mays]
Length = 337
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 150/281 (53%), Positives = 197/281 (70%), Gaps = 14/281 (4%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIK-- 78
D C+ + E+ ++H+P P +++R EC+C PVR+F I+SMQRSGSGWFETL+N
Sbjct: 55 DDICRKPYVPYEELSYVHFPKPTSYSRGECSCTPVRFFVIVSMQRSGSGWFETLMNSHPN 114
Query: 79 -----------HRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
RR NISSI+ T+DR+YNLDW TSA+KNEC+AA G KWM+NQG++E+
Sbjct: 115 ISSNGEIFNRVDRRENISSILQTLDRLYNLDWLTSAAKNECTAAFGLKWMLNQGILENPE 174
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
DI Y KKGVS IFL RRN LRR++S+LAN YD+ AK LNGTHKSHVHS EA+ILAK+
Sbjct: 175 DIVSYLNKKGVSVIFLFRRNTLRRLVSVLANDYDKAAKQLNGTHKSHVHSKEEAEILAKF 234
Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
KP ++ + LI ++ IE+ +++FKSTRH++ YYED+I+N L VQEFL + R
Sbjct: 235 KPHLDTSTLITNIRNIEKAIRDCLDHFKSTRHMILYYEDIISNSNALSRVQEFLGVPVRR 294
Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
L SRQVKIH+ PL V+NW++V L GT + FL SDY
Sbjct: 295 LMSRQVKIHTRPLPDLVKNWEEVSSKLNGTEFAHFLDGSDY 335
>gi|226531263|ref|NP_001142398.1| uncharacterized protein LOC100274572 [Zea mays]
gi|194708632|gb|ACF88400.1| unknown [Zea mays]
gi|195625724|gb|ACG34692.1| hypothetical protein [Zea mays]
gi|413925273|gb|AFW65205.1| hypothetical protein ZEAMMB73_585643 [Zea mays]
Length = 348
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 159/281 (56%), Positives = 207/281 (73%), Gaps = 13/281 (4%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN---- 76
++PC I +EIP+LHYP P T++R+ECAC VR+FAILSMQRSGSGW ETLLN
Sbjct: 68 EKPCHDPGIPDTEIPYLHYPMPNTYDRKECACTGVRFFAILSMQRSGSGWVETLLNSHPN 127
Query: 77 ---------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
+K RR N ++I T+D++YNLDW++SA+KNEC+AAVG KWM+NQGLM+HH
Sbjct: 128 ISSNGEIFSVKERRSNFTAIKKTLDKLYNLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQ 187
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKY 187
+I EYF ++GVSAIFL+RRNLLRR +SILAN++D K LNGTHK+HVHS EA+ILA+Y
Sbjct: 188 EIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAEILAQY 247
Query: 188 KPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRE 247
KPTI+ LI ELK+ ++ A A+ FK+TRH+V YYED++ +R L +V +FLRL R+
Sbjct: 248 KPTIDKKTLIAELKRSDKFAADALVNFKNTRHVVLYYEDVVRSRTMLMDVLDFLRLPKRK 307
Query: 248 LTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
L SR VKIH+ L ++NW DV LKGT +E FL+ R
Sbjct: 308 LLSRHVKIHTKRLRDHIDNWADVNNFLKGTPFESFLNGSRR 348
>gi|6562002|emb|CAB62491.1| hypothetical protein [Arabidopsis thaliana]
Length = 263
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 151/217 (69%), Positives = 178/217 (82%), Gaps = 13/217 (5%)
Query: 49 ECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVY 95
EC NPVRYFAILSMQRSGSGWFETLLN + RR NISSI+ T+DRVY
Sbjct: 37 ECGHNPVRYFAILSMQRSGSGWFETLLNSHNNVSSNGEIFSVLDRRKNISSIIQTLDRVY 96
Query: 96 NLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISI 155
NLDWFTSASKNECSAA+GFKWM+NQGL+E+H DI EYF ++GVSAIFL RRN LRRM+S+
Sbjct: 97 NLDWFTSASKNECSAAIGFKWMLNQGLLENHKDIVEYFNRRGVSAIFLFRRNPLRRMVSV 156
Query: 156 LANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
LANSYDR AKLLNGTHKSHVHSPAEA L++YKP IN+T LI +L++ E + AKA+EYF
Sbjct: 157 LANSYDRYAKLLNGTHKSHVHSPAEADALSRYKPVINSTSLIHDLQETENSAAKALEYFN 216
Query: 216 STRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQ 252
+TRHIV +YEDLITN+ TLK+VQEFL + ++L+SRQ
Sbjct: 217 TTRHIVVFYEDLITNQTTLKQVQEFLNIPVKDLSSRQ 253
>gi|297722141|ref|NP_001173434.1| Os03g0363950 [Oryza sativa Japonica Group]
gi|255674526|dbj|BAH92162.1| Os03g0363950 [Oryza sativa Japonica Group]
Length = 366
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
C+ I + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN
Sbjct: 87 CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 146
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI
Sbjct: 147 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 206
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKGV IFL RRN LRR+IS+LAN YDR K LNGTHK+HVHS EA ILA++KP
Sbjct: 207 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 266
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ LIP ++ EQ+ +++F STRH++ YYED+I ++ L VQEFL + L+S
Sbjct: 267 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 326
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
R VKIH+ PL V+NW++V + L GT Y RF+ +DY
Sbjct: 327 RHVKIHTSPLPDLVDNWEEVSEKLNGTEYARFVDGADY 364
>gi|108708316|gb|ABF96111.1| nodulation protein, putative, expressed [Oryza sativa Japonica
Group]
Length = 337
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
C+ I + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN
Sbjct: 58 CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 117
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI
Sbjct: 118 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 177
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKGV IFL RRN LRR+IS+LAN YDR K LNGTHK+HVHS EA ILA++KP
Sbjct: 178 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 237
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ LIP ++ EQ+ +++F STRH++ YYED+I ++ L VQEFL + L+S
Sbjct: 238 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 297
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
R VKIH+ PL V+NW++V + L GT Y RF+ +DY
Sbjct: 298 RHVKIHTSPLPDLVDNWEEVSEKLNGTEYARFVDGADY 335
>gi|222624965|gb|EEE59097.1| hypothetical protein OsJ_10950 [Oryza sativa Japonica Group]
Length = 339
Score = 320 bits (820), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
C+ I + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN
Sbjct: 60 CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 119
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI
Sbjct: 120 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 179
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKGV IFL RRN LRR+IS+LAN YDR K LNGTHK+HVHS EA ILA++KP
Sbjct: 180 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 239
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ LIP ++ EQ+ +++F STRH++ YYED+I ++ L VQEFL + L+S
Sbjct: 240 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 299
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
R VKIH+ PL V+NW++V + L GT Y RF+ +DY
Sbjct: 300 RHVKIHTSPLPDLVDNWEEVSEKLNGTEYARFVDGADY 337
>gi|218192885|gb|EEC75312.1| hypothetical protein OsI_11681 [Oryza sativa Indica Group]
Length = 339
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 147/278 (52%), Positives = 196/278 (70%), Gaps = 14/278 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
C+ I + ++H+P P +++R ECAC PVR+F I+SMQRSGSGWFETLLN
Sbjct: 60 CRRHTIPNELMQYVHFPKPTSYSRGECACTPVRFFVIISMQRSGSGWFETLLNSHPNVSS 119
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
I+ RR +ISSI+ T+D++YNLDW TSA+KNEC+AA G KWM+NQG+MEH++DI
Sbjct: 120 NGEIFSIRERREDISSILRTLDKLYNLDWHTSAAKNECTAAFGLKWMLNQGIMEHYHDIV 179
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y KKGV IFL RRN LRR+IS+LAN YDR K LNGTHK+HVHS EA ILA++KP
Sbjct: 180 NYLNKKGVMVIFLFRRNTLRRIISVLANDYDRKTKQLNGTHKAHVHSREEADILARFKPK 239
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTS 250
++ LIP ++ EQ+ +++F STRH++ YYED+I ++ L VQEFL + L+S
Sbjct: 240 LDVPTLIPNIRSAEQSITTCLDHFSSTRHMILYYEDVIRDQNALSRVQEFLGVPAMRLSS 299
Query: 251 RQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
R VKIH+ PL V+NW++V + L GT Y RF+ +DY
Sbjct: 300 RHVKIHTSPLPDLVDNWEEVSENLNGTEYARFVDGADY 337
>gi|357119777|ref|XP_003561610.1| PREDICTED: nodulation protein H-like [Brachypodium distachyon]
Length = 343
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 139/271 (51%), Positives = 194/271 (71%), Gaps = 13/271 (4%)
Query: 29 IQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------------ 76
I ++ ++H+P P+T++R ECAC PVR+F I+SMQRSGSGWFETLLN
Sbjct: 69 IPYEQMQYVHFPRPMTYDRGECACTPVRFFVIVSMQRSGSGWFETLLNSHPNVSSNGEIF 128
Query: 77 -IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKK 135
++ RR +I+SI+ T+D++Y+LDW TSA+KNEC+AA G KWM+NQGL ++H DIA Y +
Sbjct: 129 SVRERREDIASILRTLDKLYDLDWRTSAAKNECTAAFGLKWMLNQGLTDYHQDIANYLNE 188
Query: 136 KGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATL 195
KGV IFL R+N LRR++S+LAN YDRN K LNGTHK+HVHS EA+ILA+++P ++ +
Sbjct: 189 KGVMVIFLFRKNTLRRLVSVLANDYDRNVKQLNGTHKAHVHSHEEAEILARFRPELDVSS 248
Query: 196 LIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKI 255
L+ ++ EQ+ + F+STRH++ YYED++ + L VQEFL + R L+SR VKI
Sbjct: 249 LVLSIRDAEQSMESCLVRFRSTRHMILYYEDIVRDDNALSRVQEFLGVPVRRLSSRHVKI 308
Query: 256 HSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
H+ PL VENW+DV + LKGT + FL +
Sbjct: 309 HTRPLPDLVENWEDVSETLKGTEFAHFLDGE 339
>gi|296087807|emb|CBI35063.3| unnamed protein product [Vitis vinifera]
Length = 344
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/299 (52%), Positives = 216/299 (72%), Gaps = 15/299 (5%)
Query: 3 SKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILS 62
++ T P TK+ +V ++ C I E+ ++H+P P T++REEC C PVR+F ILS
Sbjct: 45 TRVTVPSKPTKISFTRV-ERTCSINGIPPEELVYVHFPQPKTYSREECKCTPVRFFVILS 103
Query: 63 MQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECS 109
MQRSGSGWFETLLN +K RR ++ +I T+D +YNL+W +SA+KN+C
Sbjct: 104 MQRSGSGWFETLLNSHPNISSNGEVFSLKQRRQSMETIQRTLDTIYNLEWISSAAKNDCV 163
Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
AAVGFKWM+NQ +M +H IA YF +KGVS IFL RRNLLRR++S+LAN+YD++AKL+NG
Sbjct: 164 AAVGFKWMLNQAVMVYHRQIANYFNQKGVSVIFLFRRNLLRRLVSLLANAYDKDAKLING 223
Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
TH SHVH+ EA +LA++KPT+N + LI +L E+T +++E+FKS RH++ YYED+I
Sbjct: 224 THMSHVHTREEADLLAQFKPTLNVSSLIVDLSTAERTMTESLEFFKSNRHMILYYEDIIN 283
Query: 230 NRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
N L +VQEFLR+ +R+L SRQVKIH PL +QV+NW DV ++G+ YE FLH SDY
Sbjct: 284 NPKALSDVQEFLRVPFRKLKSRQVKIHIKPLHEQVDNWYDVYSTVQGSQYEHFLHSSDY 342
>gi|413955662|gb|AFW88311.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
Length = 346
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 137/265 (51%), Positives = 188/265 (70%), Gaps = 14/265 (5%)
Query: 36 FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
++H+P P T++R EC CNPVR+F I+S QRSGSGW E LLN ++ RR
Sbjct: 80 YVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQ 139
Query: 83 NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
N+SS++ T+DR+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI Y +KG +F
Sbjct: 140 NLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVF 199
Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
L RRN LRR+IS++AN+YDR A+ LNG HKSHVHS EA++LA+++P ++A LIP ++
Sbjct: 200 LFRRNTLRRLISVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRT 259
Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLS 261
++ + F STRH+V YYED++ + R L VQEFL + REL+S+ VKIH+ PL
Sbjct: 260 AQRAVRACLRRFGSTRHMVLYYEDVVRDSRKALSRVQEFLGVPARELSSKHVKIHTRPLP 319
Query: 262 KQVENWDDVQKALKGTSYERFLHSD 286
V NW++V++ L+GT Y RFL D
Sbjct: 320 DLVGNWEEVRRTLRGTEYSRFLDDD 344
>gi|225464035|ref|XP_002265782.1| PREDICTED: nodulation protein H-like [Vitis vinifera]
Length = 339
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/299 (52%), Positives = 216/299 (72%), Gaps = 15/299 (5%)
Query: 3 SKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILS 62
++ T P TK+ +V ++ C I E+ ++H+P P T++REEC C PVR+F ILS
Sbjct: 40 TRVTVPSKPTKISFTRV-ERTCSINGIPPEELVYVHFPQPKTYSREECKCTPVRFFVILS 98
Query: 63 MQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECS 109
MQRSGSGWFETLLN +K RR ++ +I T+D +YNL+W +SA+KN+C
Sbjct: 99 MQRSGSGWFETLLNSHPNISSNGEVFSLKQRRQSMETIQRTLDTIYNLEWISSAAKNDCV 158
Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
AAVGFKWM+NQ +M +H IA YF +KGVS IFL RRNLLRR++S+LAN+YD++AKL+NG
Sbjct: 159 AAVGFKWMLNQAVMVYHRQIANYFNQKGVSVIFLFRRNLLRRLVSLLANAYDKDAKLING 218
Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
TH SHVH+ EA +LA++KPT+N + LI +L E+T +++E+FKS RH++ YYED+I
Sbjct: 219 THMSHVHTREEADLLAQFKPTLNVSSLIVDLSTAERTMTESLEFFKSNRHMILYYEDIIN 278
Query: 230 NRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH-SDY 287
N L +VQEFLR+ +R+L SRQVKIH PL +QV+NW DV ++G+ YE FLH SDY
Sbjct: 279 NPKALSDVQEFLRVPFRKLKSRQVKIHIKPLHEQVDNWYDVYSTVQGSQYEHFLHSSDY 337
>gi|168059476|ref|XP_001781728.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666812|gb|EDQ53457.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 306 bits (783), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 147/264 (55%), Positives = 186/264 (70%), Gaps = 13/264 (4%)
Query: 36 FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
+ HYP P + R+EC C PV YF ILSMQRSGSGWFETLLN +K RR
Sbjct: 112 YQHYPLPHAYERQECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVKDRRE 171
Query: 83 NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
N SSI TMD+VYNLDW SA+KNEC+AAVGFKWM+NQG ME++ ++ EYF++ GVS I
Sbjct: 172 NFSSIAKTMDKVYNLDWLNSAAKNECTAAVGFKWMLNQGPMEYNREVREYFERMGVSVIL 231
Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
L+RRN+L+R+ISILAN+YDR K LNGTHKSHVHS EA LA+Y+P ++A L L +
Sbjct: 232 LLRRNVLKRLISILANAYDRVVKPLNGTHKSHVHSVEEAMKLAEYRPVVDAKHLFENLAR 291
Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSK 262
+EQ T A +F +TR V +YEDL+ + L E+Q FL + R+L S+QVKIH+ PL +
Sbjct: 292 VEQITDDAQRFFNNTRIRVVHYEDLVVDPKYLSEIQHFLGVQPRKLESQQVKIHTRPLRE 351
Query: 263 QVENWDDVQKALKGTSYERFLHSD 286
Q++NWD+V LKGT YE L D
Sbjct: 352 QIQNWDEVLIHLKGTKYESLLQDD 375
>gi|242035665|ref|XP_002465227.1| hypothetical protein SORBIDRAFT_01g034575 [Sorghum bicolor]
gi|241919081|gb|EER92225.1| hypothetical protein SORBIDRAFT_01g034575 [Sorghum bicolor]
Length = 366
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 146/306 (47%), Positives = 195/306 (63%), Gaps = 42/306 (13%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
C ++ ++H+P P T++R ECACNPVR+F I+SMQRSGSGWFETLLN
Sbjct: 59 CTRPATPHEQMRYVHFPRPATYDRGECACNPVRFFVIISMQRSGSGWFETLLNSHPNVSS 118
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
++ RR NISSI++T+DR+Y++DW TSA+KNEC+AA G KWM+NQGLME+H DI
Sbjct: 119 NGEIFSVRDRRENISSILATLDRLYDMDWITSAAKNECTAAFGLKWMLNQGLMENHRDIV 178
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
Y ++G IFL RRN LRR+IS+LAN+YDR K LNG HKSHVHS EA+ILA++KP
Sbjct: 179 NYLNRRGAIVIFLFRRNTLRRLISVLANNYDRRTKQLNGVHKSHVHSKEEAEILARFKPE 238
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNR------------------- 231
++ + LIP ++ ++ + F STRH+V YYED+I +R
Sbjct: 239 LDISTLIPSIRDAQRAMRTCLGRFSSTRHMVLYYEDVIRDRNVGITSKQAFPASEDYPII 298
Query: 232 ---------ATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERF 282
L VQEFL + R+L+S+ VKIH+ PL V+NW+ V + L GT Y RF
Sbjct: 299 MLFLMPCAVQALSRVQEFLGVPVRKLSSKHVKIHTRPLPDLVDNWEQVSQVLNGTRYGRF 358
Query: 283 L-HSDY 287
L +DY
Sbjct: 359 LADADY 364
>gi|168020679|ref|XP_001762870.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685979|gb|EDQ72371.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 306
Score = 302 bits (774), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 142/264 (53%), Positives = 185/264 (70%), Gaps = 13/264 (4%)
Query: 36 FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
+ HYP P T+ R EC C PV YF ILSMQRSGSGWFETLLN ++ RR
Sbjct: 40 YQHYPLPHTYERNECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVRERRD 99
Query: 83 NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
N SSI MD+V+NLDW SASKNEC+AAVGFKWM+NQG ME++ ++ +YF++ GVS I
Sbjct: 100 NFSSIARNMDKVFNLDWLNSASKNECTAAVGFKWMLNQGPMEYNREVLDYFERMGVSVIL 159
Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
L+RRN+L+R+ISI+ANSYD+ AK+LNGTHKSHVHS EA LA+Y+P I+ L L++
Sbjct: 160 LLRRNVLKRLISIMANSYDQRAKILNGTHKSHVHSVEEALKLAEYRPVIDVNHLPENLQR 219
Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSK 262
+E+ + A F TR + YYEDL+ + L E+Q FL + R+L S+QVKIH+ PL +
Sbjct: 220 VEKMASDAQRLFNKTRSRLVYYEDLVMDSQRLTEIQTFLGVPPRKLESQQVKIHTRPLRE 279
Query: 263 QVENWDDVQKALKGTSYERFLHSD 286
Q++NWD+V L GT YE +H D
Sbjct: 280 QIQNWDEVLARLNGTRYELLMHDD 303
>gi|218200116|gb|EEC82543.1| hypothetical protein OsI_27076 [Oryza sativa Indica Group]
Length = 866
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 138/265 (52%), Positives = 184/265 (69%), Gaps = 12/265 (4%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLNIKHRRIN 83
C+ + E+ ++H+P P +++R EC+CNPVR+ L + L + +
Sbjct: 611 CRKPRLPHEELRYVHFPKPESYSRGECSCNPVRFLCALYL----------LQRVAESGMG 660
Query: 84 ISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFL 143
+ SI T+D++YNLDWFTSA+KNEC+AA G KWM+NQG M+HH+DIA YF KKGVS IFL
Sbjct: 661 VESIC-TLDKLYNLDWFTSAAKNECTAAFGLKWMLNQGFMDHHDDIASYFNKKGVSVIFL 719
Query: 144 IRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQI 203
RRN LRR+IS+LAN YDRNAK LNGTHKSHVHS EA+ILAK+KP ++ + LIP ++
Sbjct: 720 FRRNTLRRLISVLANDYDRNAKQLNGTHKSHVHSKEEAEILAKFKPVLDVSNLIPNIRNA 779
Query: 204 EQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQ 263
E+ +++F +TRH++ YYED++ NR L +VQEFL + R+L SRQVKIH+ PL
Sbjct: 780 EKYIRDCLDHFNTTRHMILYYEDIVRNRNALFQVQEFLGVPVRKLVSRQVKIHTSPLPGL 839
Query: 264 VENWDDVQKALKGTSYERFLH-SDY 287
V NWDDV L GT Y FL +DY
Sbjct: 840 VSNWDDVSNKLNGTQYAHFLDGADY 864
>gi|413955663|gb|AFW88312.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
Length = 374
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 138/293 (47%), Positives = 190/293 (64%), Gaps = 42/293 (14%)
Query: 36 FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
++H+P P T++R EC CNPVR+F I+S QRSGSGW E LLN ++ RR
Sbjct: 80 YVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQ 139
Query: 83 NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
N+SS++ T+DR+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI Y +KG +F
Sbjct: 140 NLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVF 199
Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
L RRN LRR+IS++AN+YDR A+ LNG HKSHVHS EA++LA+++P ++A LIP ++
Sbjct: 200 LFRRNTLRRLISVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRT 259
Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLI---------TNRATLK------------------ 235
++ + F STRH+V YYED++ T+R L+
Sbjct: 260 AQRAVRACLRRFGSTRHMVLYYEDVVRDSRKVGGGTSRMALEFKRGYPITGACSSLCMQA 319
Query: 236 --EVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
VQEFL + REL+S+ VKIH+ PL V NW++V++ L+GT Y RFL D
Sbjct: 320 LSRVQEFLGVPARELSSKHVKIHTRPLPDLVGNWEEVRRTLRGTEYSRFLDDD 372
>gi|226502206|ref|NP_001144194.1| uncharacterized protein LOC100277054 [Zea mays]
gi|195638236|gb|ACG38586.1| hypothetical protein [Zea mays]
Length = 340
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 131/257 (50%), Positives = 181/257 (70%), Gaps = 14/257 (5%)
Query: 36 FLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRI 82
++H+P P T++R EC CNPVR+F I+S QRSGSGW LLN ++ RR
Sbjct: 80 YVHFPRPATYDRGECVCNPVRFFVIVSTQRSGSGWVXALLNSHPNVSSNGEVFSMRERRQ 139
Query: 83 NISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIF 142
N+SS++ T+DR+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI Y +KG +F
Sbjct: 140 NLSSVLGTLDRLYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVF 199
Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
L RRN LRR+IS++AN+YDR A+ LNG HKSHVHS EA++LA+++P ++A LIP ++
Sbjct: 200 LFRRNTLRRLISVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRT 259
Query: 203 IEQTTAKAIEYFKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLS 261
++ + F STRH+V YYED++ + R L VQEFL + REL+S+ VKIH+ PL
Sbjct: 260 AQRAVRACLRRFGSTRHMVLYYEDVVRDSRKALSRVQEFLGVPARELSSKHVKIHTRPLP 319
Query: 262 KQVENWDDVQKALKGTS 278
V NW++V++ L G S
Sbjct: 320 DLVGNWEEVRRTLAGQS 336
>gi|167999402|ref|XP_001752406.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696306|gb|EDQ82645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 256
Score = 296 bits (759), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 139/253 (54%), Positives = 180/253 (71%), Gaps = 13/253 (5%)
Query: 47 REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
R EC C PV YF ILSMQRSGSGWFETLLN + RR N SSI + MD+
Sbjct: 1 RNECTCTPVHYFVILSMQRSGSGWFETLLNNHPNISSHGEVFSVGERRDNFSSIATNMDK 60
Query: 94 VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
V+NLDW SASKNEC+AAVGFKWM+NQG ME++ ++ +YF+K GVS I L+RRN+L+R+I
Sbjct: 61 VFNLDWLNSASKNECTAAVGFKWMLNQGPMEYNGEVLDYFQKMGVSVILLLRRNVLKRLI 120
Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
SI+AN+YD+ A++LNGTHKSHVHS EA LA+YKP I+ L L+++EQ + A +
Sbjct: 121 SIMANTYDQRARILNGTHKSHVHSVEEALKLAEYKPVIDVKHLPERLQRVEQIASDAQRF 180
Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
F TR YYEDL+T+ L E+QEFLR+ + L S+QVKIH+ P+ +Q++NWD+V
Sbjct: 181 FNKTRLRPVYYEDLVTDPKQLTEIQEFLRVPPQNLESQQVKIHTRPMREQIQNWDEVLAR 240
Query: 274 LKGTSYERFLHSD 286
L GT YE +H D
Sbjct: 241 LNGTKYELLMHDD 253
>gi|194702290|gb|ACF85229.1| unknown [Zea mays]
gi|413955660|gb|AFW88309.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
Length = 334
Score = 296 bits (759), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 133/254 (52%), Positives = 179/254 (70%), Gaps = 14/254 (5%)
Query: 47 REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
R EC CNPVR+F I+S QRSGSGW E LLN ++ RR N+SS++ T+DR
Sbjct: 79 RGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQNLSSVLGTLDR 138
Query: 94 VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI Y +KG +FL RRN LRR+I
Sbjct: 139 LYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVFLFRRNTLRRLI 198
Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
S++AN+YDR A+ LNG HKSHVHS EA++LA+++P ++A LIP ++ ++ +
Sbjct: 199 SVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRTAQRAVRACLRR 258
Query: 214 FKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQK 272
F STRH+V YYED++ + R L VQEFL + REL+S+ VKIH+ PL V NW++V++
Sbjct: 259 FGSTRHMVLYYEDVVRDSRKALSRVQEFLGVPARELSSKHVKIHTRPLPDLVGNWEEVRR 318
Query: 273 ALKGTSYERFLHSD 286
L+GT Y RFL D
Sbjct: 319 TLRGTEYSRFLDDD 332
>gi|293334015|ref|NP_001170415.1| uncharacterized protein LOC100384402 [Zea mays]
gi|224035729|gb|ACN36940.1| unknown [Zea mays]
Length = 275
Score = 295 bits (755), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 149/253 (58%), Positives = 189/253 (74%), Gaps = 13/253 (5%)
Query: 49 ECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVY 95
ECAC VR+FAILSMQRSGSGW ETLLN +K RR NI++I T+DR+Y
Sbjct: 23 ECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNITAITKTLDRLY 82
Query: 96 NLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISI 155
NLDW++SA+KNEC+AAVG KWM+NQGLM+HH +I EYF ++GVSAIFL+RRNLLRR +SI
Sbjct: 83 NLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSI 142
Query: 156 LANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
LAN++D K LNGTHK+HVHS EA+ILA+YKPTI+ LI ELK+ ++ A A+ FK
Sbjct: 143 LANAHDSAMKQLNGTHKAHVHSKHEAEILAQYKPTIDKKTLITELKRSDKLAADALVNFK 202
Query: 216 STRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
+TRH+V YYED+++NR L +V +FLRL R+L SR VKIH+ L ++NW DV LK
Sbjct: 203 NTRHVVLYYEDVVSNRTMLMDVLDFLRLPKRKLFSRHVKIHTKRLCDHIDNWADVTDFLK 262
Query: 276 GTSYERFLHSDYR 288
GT +E FL+ R
Sbjct: 263 GTPFESFLNGSRR 275
>gi|413955391|gb|AFW88040.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
Length = 307
Score = 293 bits (751), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 136/244 (55%), Positives = 179/244 (73%), Gaps = 13/244 (5%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ ++++ I+V++QPC + + SE+ F HYP P T++REEC CN VR+FAI
Sbjct: 59 MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSREECKCNAVRFFAI 118
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSIV+ MDRVY+LDW +SASKNE
Sbjct: 119 ISSQRSGSGWFETLLNSHMNVSSNGEIFSRKERRSNISSIVNAMDRVYSLDWNSSASKNE 178
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AA+GFKWM+NQGL+ +H + +YF ++GVSAIFL RRNLLR+++S LAN++DR K L
Sbjct: 179 CTAAIGFKWMLNQGLVANHVAVVDYFNRRGVSAIFLFRRNLLRQLVSQLANNHDRFLKQL 238
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ EA ILA+YKP +N + LI +LKQ ++ T A+E +TRHI YYED+
Sbjct: 239 NGTHKAHVHTEHEAHILARYKPRLNTSSLIRKLKQADEYTRDALENLNNTRHITVYYEDI 298
Query: 228 ITNR 231
+ NR
Sbjct: 299 VRNR 302
>gi|115440365|ref|NP_001044462.1| Os01g0784600 [Oryza sativa Japonica Group]
gi|53792452|dbj|BAD53360.1| unknown protein [Oryza sativa Japonica Group]
gi|113533993|dbj|BAF06376.1| Os01g0784600 [Oryza sativa Japonica Group]
gi|215695270|dbj|BAG90461.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 346
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 129/220 (58%), Positives = 171/220 (77%), Gaps = 13/220 (5%)
Query: 24 CQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN------- 76
CQ +I +EIP++HYP P T++R+ECAC PVR+FAILSMQRSGSGW ETLLN
Sbjct: 84 CQDPSIPVTEIPYVHYPTPDTYSRKECACTPVRFFAILSMQRSGSGWIETLLNSHENISS 143
Query: 77 ------IKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIA 130
IK RR NI+SI T+D++YNLDW +SA+KNEC+AAVG KWM+NQGL++HH ++
Sbjct: 144 NGEIFSIKERRSNITSITKTLDKLYNLDWLSSAAKNECTAAVGLKWMLNQGLVQHHEEVV 203
Query: 131 EYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPT 190
EYF ++GVSAIFL+RRN L+R +S+LAN++D AK +NGTHKSHVHS EA+ILA++KP
Sbjct: 204 EYFNRRGVSAIFLLRRNTLQRYVSVLANAHDSKAKQINGTHKSHVHSEREAQILAQFKPE 263
Query: 191 INATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN 230
I+ LI +LK+ ++ A A+ YFK TRHI+ YYED+++N
Sbjct: 264 IDTKKLIADLKKSDKLAADALLYFKKTRHIILYYEDVVSN 303
>gi|168007783|ref|XP_001756587.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692183|gb|EDQ78541.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 256
Score = 290 bits (741), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 140/252 (55%), Positives = 177/252 (70%), Gaps = 13/252 (5%)
Query: 47 REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
R+EC C PV +F ILSMQRSGSGWFETLLN +K RR N S+I TMD+
Sbjct: 1 RQECQCTPVHFFVILSMQRSGSGWFETLLNNHPNISSHGEIFSVKPRRANFSTIARTMDK 60
Query: 94 VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
+YNLDW SA+KNEC+AAVGFKWM+NQG ME+ ++++YF+K GVS I L+RRN+L+R+I
Sbjct: 61 IYNLDWLNSAAKNECTAAVGFKWMLNQGPMEYSREVSDYFEKMGVSVILLLRRNVLKRLI 120
Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
SILAN+YDR AK LNG HKSHVHS EA LA+Y+P I+ L L ++EQ T A +
Sbjct: 121 SILANAYDRKAKPLNGIHKSHVHSVEEAMKLAEYRPVIDVNHLTDNLHRVEQITDDAQRF 180
Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
F +TR V YYEDL+ + L +VQEFL + R+L S QVKIH+ PL +Q++NWD V
Sbjct: 181 FNNTRLRVVYYEDLVMDPKHLMQVQEFLGVQPRKLESLQVKIHTRPLREQIQNWDAVLAR 240
Query: 274 LKGTSYERFLHS 285
LK T YE L+
Sbjct: 241 LKATQYETLLND 252
>gi|413925274|gb|AFW65206.1| hypothetical protein ZEAMMB73_585643 [Zea mays]
Length = 275
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 146/253 (57%), Positives = 187/253 (73%), Gaps = 13/253 (5%)
Query: 49 ECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVY 95
ECAC VR+FAILSMQRSGSGW ETLLN +K RR N ++I T+D++Y
Sbjct: 23 ECACTGVRFFAILSMQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNFTAIKKTLDKLY 82
Query: 96 NLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISI 155
NLDW++SA+KNEC+AAVG KWM+NQGLM+HH +I EYF ++GVSAIFL+RRNLLRR +SI
Sbjct: 83 NLDWYSSAAKNECTAAVGLKWMLNQGLMKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSI 142
Query: 156 LANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
LAN++D K LNGTHK+HVHS EA+ILA+YKPTI+ LI ELK+ ++ A A+ FK
Sbjct: 143 LANAHDSAMKQLNGTHKAHVHSKHEAEILAQYKPTIDKKTLIAELKRSDKFAADALVNFK 202
Query: 216 STRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
+TRH+V YYED++ +R L +V +FLRL R+L SR VKIH+ L ++NW DV LK
Sbjct: 203 NTRHVVLYYEDVVRSRTMLMDVLDFLRLPKRKLLSRHVKIHTKRLRDHIDNWADVNNFLK 262
Query: 276 GTSYERFLHSDYR 288
GT +E FL+ R
Sbjct: 263 GTPFESFLNGSRR 275
>gi|413955661|gb|AFW88310.1| hypothetical protein ZEAMMB73_433211 [Zea mays]
Length = 362
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 134/282 (47%), Positives = 181/282 (64%), Gaps = 42/282 (14%)
Query: 47 REECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDR 93
R EC CNPVR+F I+S QRSGSGW E LLN ++ RR N+SS++ T+DR
Sbjct: 79 RGECVCNPVRFFVIVSTQRSGSGWVEALLNSHPNVSSNGEVFSMRERRQNLSSVLGTLDR 138
Query: 94 VYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMI 153
+Y +DW TSA+KNEC+AA GFKWM+NQGLME+H DI Y +KG +FL RRN LRR+I
Sbjct: 139 LYGMDWLTSAAKNECTAAFGFKWMLNQGLMENHRDIVNYLNRKGAMVVFLFRRNTLRRLI 198
Query: 154 SILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY 213
S++AN+YDR A+ LNG HKSHVHS EA++LA+++P ++A LIP ++ ++ +
Sbjct: 199 SVVANNYDRRARQLNGVHKSHVHSKEEAEVLARFRPELDAPALIPSIRTAQRAVRACLRR 258
Query: 214 FKSTRHIVFYYEDLI---------TNRATLK--------------------EVQEFLRLA 244
F STRH+V YYED++ T+R L+ VQEFL +
Sbjct: 259 FGSTRHMVLYYEDVVRDSRKVGGGTSRMALEFKRGYPITGACSSLCMQALSRVQEFLGVP 318
Query: 245 YRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSD 286
REL+S+ VKIH+ PL V NW++V++ L+GT Y RFL D
Sbjct: 319 ARELSSKHVKIHTRPLPDLVGNWEEVRRTLRGTEYSRFLDDD 360
>gi|302770487|ref|XP_002968662.1| hypothetical protein SELMODRAFT_90926 [Selaginella moellendorffii]
gi|302816465|ref|XP_002989911.1| hypothetical protein SELMODRAFT_130724 [Selaginella moellendorffii]
gi|300142222|gb|EFJ08924.1| hypothetical protein SELMODRAFT_130724 [Selaginella moellendorffii]
gi|300163167|gb|EFJ29778.1| hypothetical protein SELMODRAFT_90926 [Selaginella moellendorffii]
Length = 329
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 197/308 (63%), Gaps = 23/308 (7%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNI----QQSEIPFLHYPNPVTFNREECACNPVR 56
MC T L++ I Q + N+ +++ P HYP P T++R+EC C PV
Sbjct: 23 MCLIGVDRRTTYDRLSLDPIQQVWRRENLCPRLSRAKFP-QHYPQPRTYSRKECRCVPVH 81
Query: 57 YFAILSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSA 103
+F +LSMQRSGSGWFETLLN I RR N S+I T+D VYNL+W++SA
Sbjct: 82 FFVLLSMQRSGSGWFETLLNSHPNVSSHGEIFSIGRRRANFSTIKQTLDEVYNLEWYSSA 141
Query: 104 SKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRN 163
+KNEC+AAVGFKWM+NQG + H + +A+YF+ +GVS +FL RRN L+R+IS++ N+YDR
Sbjct: 142 AKNECTAAVGFKWMLNQGALAHGDQVADYFRSRGVSVVFLQRRNYLKRLISVMGNAYDR- 200
Query: 164 AKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFY 223
AK LNGTH +HVHS EA++LA +KPTI+ L+ L++++ + + + F +TR Y
Sbjct: 201 AKPLNGTHVAHVHSREEAELLASFKPTIDVANLLQNLRRVQNMSDETLRIFGTTRLTALY 260
Query: 224 YEDLITNRATLKEVQ---EFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYE 280
YEDL+ N + V FL + ++L S+ VKIH+ PL + NWDDV +AL GT +E
Sbjct: 261 YEDLVKNPRKVMRVDVVLRFLGVPRQDLRSKHVKIHTRPLRDSITNWDDVYRALNGTEFE 320
Query: 281 RFLHS-DY 287
LH DY
Sbjct: 321 SLLHDRDY 328
>gi|302799659|ref|XP_002981588.1| hypothetical protein SELMODRAFT_114849 [Selaginella moellendorffii]
gi|300150754|gb|EFJ17403.1| hypothetical protein SELMODRAFT_114849 [Selaginella moellendorffii]
Length = 336
Score = 280 bits (717), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 136/262 (51%), Positives = 174/262 (66%), Gaps = 14/262 (5%)
Query: 38 HYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINI 84
HYP P T++R ECAC PV F ILS QRSGSGWFETLLN ++ RR N
Sbjct: 73 HYPRPKTYSRGECACTPVHNFVILSTQRSGSGWFETLLNSHPNITSHGEIFSVRPRRHNF 132
Query: 85 SSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLI 144
S + T+D +YNLDW SASKN C++ VGFKWM+NQG ME+ ++++ YFK++GVS I LI
Sbjct: 133 SIVSHTLDALYNLDWHNSASKNSCTSVVGFKWMLNQGAMEYRSEVSSYFKRRGVSVILLI 192
Query: 145 RRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIE 204
RRN L+R+ISILAN+YDR + +NGTH SHVHS A +LA YKPT++ L L + +
Sbjct: 193 RRNGLKRLISILANAYDRR-QPMNGTHVSHVHSEEAADLLASYKPTVDVAFLKNNLDRYD 251
Query: 205 QTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQV 264
A+ F+ TR +V YYED + NR + VQEFL + L SRQ KIH GPLS+ +
Sbjct: 252 LMIKNALRAFEGTRLLVIYYEDFVKNRKMQRTVQEFLSVPVMRLRSRQWKIHRGPLSQSI 311
Query: 265 ENWDDVQKALKGTSYERFLHSD 286
NW +V + L GTSY+ FL D
Sbjct: 312 TNWKEVFRELNGTSYQGFLADD 333
>gi|14029013|gb|AAK52554.1|AC079853_7 Unknown protein [Oryza sativa Japonica Group]
Length = 332
Score = 279 bits (714), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 142/302 (47%), Positives = 187/302 (61%), Gaps = 65/302 (21%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
MC KQ + ++++ I+V +Q C + I SE+ F+HYP P+T++R EC C PVR+FAI
Sbjct: 83 MCMKQLGSDSLSRIVKIEVAEQLCNKSAILHSEVHFVHYPQPITYSRSECKCTPVRFFAI 142
Query: 61 LSMQRSGSGWFETLLNI-------------KHRRINISSIVSTMDRVYNLDWFTSASKNE 107
+S QRSGSGWFETLLN K RR NISSI T+D+VYNLDW +SASKNE
Sbjct: 143 ISSQRSGSGWFETLLNSHMNVSSNGEIFSSKERRSNISSITKTLDKVYNLDWNSSASKNE 202
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+AAVG KWM+NQGL+ +H DIA+YF ++GVSAIFL R NLLR+++S LAN++DR K L
Sbjct: 203 CTAAVGLKWMLNQGLVANHADIADYFNRRGVSAIFLFRWNLLRQLVSQLANNHDRYLKQL 262
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDL 227
NGTHK+HVH+ YE L
Sbjct: 263 NGTHKAHVHTA---------------------------------------------YEKL 277
Query: 228 ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDY 287
+ +V +FL++ +L SR VKIH+ PLS+Q+ENWD+V AL GT YE FL++DY
Sbjct: 278 L-------DVLDFLKVPRSKLVSRHVKIHTKPLSEQIENWDEVYNALNGTQYESFLNADY 330
Query: 288 RV 289
R+
Sbjct: 331 RI 332
>gi|302759563|ref|XP_002963204.1| hypothetical protein SELMODRAFT_80417 [Selaginella moellendorffii]
gi|300168472|gb|EFJ35075.1| hypothetical protein SELMODRAFT_80417 [Selaginella moellendorffii]
Length = 336
Score = 279 bits (714), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 137/262 (52%), Positives = 173/262 (66%), Gaps = 14/262 (5%)
Query: 38 HYPNPVTFNREECACNPVRYFAILSMQRSGSGWFETLLN-------------IKHRRINI 84
HYP P T++R ECAC PV F ILS QRSGSGWFETLLN ++ RR N
Sbjct: 73 HYPRPKTYSRGECACTPVHNFVILSTQRSGSGWFETLLNSHPNITSHGEIFSVRPRRHNF 132
Query: 85 SSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLI 144
S + T+D +YNLDW SASKN C++ VGFKWM+NQG ME+ ++++ YFK++GVS I LI
Sbjct: 133 SIVSHTLDALYNLDWHNSASKNSCTSVVGFKWMLNQGAMEYRSEVSSYFKRRGVSVILLI 192
Query: 145 RRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIE 204
RRN L+R+ISILAN+YDR +NGTH SHVHS A +LA YKPT++ L L + +
Sbjct: 193 RRNGLKRLISILANAYDRRQP-MNGTHVSHVHSEEAADLLASYKPTVDVAFLKNNLDRYD 251
Query: 205 QTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQV 264
A+ F+ TR +V YYED + NR + VQEFL + L SRQ KIH GPLS+ V
Sbjct: 252 LMIKNALCAFEGTRLLVIYYEDFVKNRKMQRTVQEFLSVPVMRLRSRQWKIHRGPLSQSV 311
Query: 265 ENWDDVQKALKGTSYERFLHSD 286
NW +V + L GTSY+ FL D
Sbjct: 312 TNWKEVFRELNGTSYQGFLADD 333
>gi|388496630|gb|AFK36381.1| unknown [Lotus japonicus]
Length = 251
Score = 278 bits (710), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 128/209 (61%), Positives = 154/209 (73%), Gaps = 13/209 (6%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ +K LL+IKVI PC NI+ EIP++HYPNP T++REEC+C+PVRYFAI
Sbjct: 43 ICLKQIGTSSKIGLLDIKVIQNPCPEPNIEPWEIPYVHYPNPKTYSREECSCHPVRYFAI 102
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQRSGSGWFET LN +K RR N+S+I T+D +YNLDW +SASKNE
Sbjct: 103 LSMQRSGSGWFETFLNSHPNISSNGEIFSVKVRRSNVSTITETLDTIYNLDWLSSASKNE 162
Query: 108 CSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLL 167
C+ AVG KWM+NQGLM+HH IAEYF+ GVS IFL RRNLLRRM+S+LAN YDRNAKLL
Sbjct: 163 CTTAVGLKWMLNQGLMQHHEQIAEYFRIHGVSVIFLFRRNLLRRMVSLLANEYDRNAKLL 222
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLL 196
NGTHKSHVHSP E + + + AT +
Sbjct: 223 NGTHKSHVHSPKEVSVCTLFCACMCATCV 251
>gi|238013712|gb|ACR37891.1| unknown [Zea mays]
Length = 239
Score = 269 bits (688), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 138/239 (57%), Positives = 177/239 (74%), Gaps = 13/239 (5%)
Query: 63 MQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNECS 109
MQRSGSGW ETLLN +K RR NI++I T+DR+YNLDW++SA+KNEC+
Sbjct: 1 MQRSGSGWVETLLNSHPNISSNGEIFSVKERRSNITAITKTLDRLYNLDWYSSAAKNECT 60
Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
AAVG KWM+NQGLM+HH +I EYF ++GVSAIFL+RRNLLRR +SILAN++D K LNG
Sbjct: 61 AAVGLKWMLNQGLMKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNG 120
Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
THK+HVHS EA+ILA+YKPTI+ LI ELK+ ++ A A+ FK+TRH+V YYED+++
Sbjct: 121 THKAHVHSKHEAEILAQYKPTIDKKTLITELKRSDKLAADALVNFKNTRHVVLYYEDVVS 180
Query: 230 NRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
NR L +V +FLRL R+L SR VKIH+ L ++NW DV LKGT +E FL+ R
Sbjct: 181 NRTMLMDVLDFLRLPKRKLFSRHVKIHTKRLCDHIDNWADVTDFLKGTPFESFLNGSRR 239
>gi|414869372|tpg|DAA47929.1| TPA: thylakoid lumenal protein, mRNA [Zea mays]
Length = 166
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/166 (56%), Positives = 123/166 (74%)
Query: 123 MEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAK 182
M+HH +I EYF ++GVSAIFL+RRNLLRR +SILAN++D K LNGTHK+HVHS EA+
Sbjct: 1 MKHHQEIVEYFNRRGVSAIFLLRRNLLRRYVSILANAHDSAMKQLNGTHKAHVHSKHEAE 60
Query: 183 ILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLR 242
ILA+YKPTI+ LI ELK+ ++ A A+ FK+TRH+V YYED+++NR L +V +FLR
Sbjct: 61 ILAQYKPTIDKKTLITELKRSDKLAADALVNFKNTRHVVLYYEDVVSNRTMLMDVLDFLR 120
Query: 243 LAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
L R+L SR VKIH+ L ++NW DV LKGT +E FL+ R
Sbjct: 121 LPKRKLFSRHVKIHTKRLCDHIDNWADVTDFLKGTPFESFLNGSRR 166
>gi|4335739|gb|AAD17417.1| hypothetical protein [Arabidopsis thaliana]
Length = 213
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 65/109 (59%), Positives = 87/109 (79%)
Query: 180 EAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQE 239
+A+ILA+YKP IN +LLIP+LKQ+++ T+KA+ YF +TRHI YYED++ NR L +VQE
Sbjct: 104 QAEILARYKPLINTSLLIPDLKQVQEMTSKALAYFNTTRHIFLYYEDVVKNRTKLDDVQE 163
Query: 240 FLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHSDYR 288
FL++ +L SRQVKIH GPLS+ V+NW++VQK LKGT +E FL DYR
Sbjct: 164 FLKVPKLDLKSRQVKIHHGPLSQHVQNWEEVQKTLKGTGFENFLLEDYR 212
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 70/121 (57%), Gaps = 22/121 (18%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+ KQ LN++V ++PC NIQ +IP++HYP P T++
Sbjct: 7 LIPKQIGVVPSAGFLNVEVFERPCPEPNIQPWDIPYVHYPKPKTYS-------------- 52
Query: 61 LSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQ 120
S +G + ++K RR N+S+I T+D+VYNLDW +SASKNEC++AVG KWM+NQ
Sbjct: 53 -----SSNG---EIFSVKDRRANVSTIFETLDKVYNLDWLSSASKNECTSAVGLKWMLNQ 104
Query: 121 G 121
Sbjct: 105 A 105
>gi|4455187|emb|CAB36719.1| hypothetical protein [Arabidopsis thaliana]
gi|7270392|emb|CAB80159.1| hypothetical protein [Arabidopsis thaliana]
Length = 403
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 67/99 (67%), Positives = 82/99 (82%)
Query: 145 RRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIE 204
R+NLLRRMIS+LANSYDR+AKLLNGTHKSH HS EA+ILA YKP IN TLLI EL+QI+
Sbjct: 152 RKNLLRRMISVLANSYDRDAKLLNGTHKSHTHSAKEAEILAGYKPMINTTLLINELRQIQ 211
Query: 205 QTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRL 243
+ T KA+ YF +TRHI+ YYED++ N L +VQEFL++
Sbjct: 212 EMTLKALTYFNTTRHILVYYEDVVKNLTRLDDVQEFLKV 250
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 26/35 (74%)
Query: 14 LLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNRE 48
LN++V ++PC NI+ +IPF+HYP P T+NR+
Sbjct: 119 FLNVEVFERPCPEPNIEPWDIPFVHYPKPKTYNRK 153
>gi|357464733|ref|XP_003602648.1| Kinase-like protein [Medicago truncatula]
gi|355491696|gb|AES72899.1| Kinase-like protein [Medicago truncatula]
Length = 403
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/130 (48%), Positives = 79/130 (60%), Gaps = 23/130 (17%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C KQ S ++ L+I VI PC NI+ EIP++ EEC C+P+RYF I
Sbjct: 162 ICLKQISTGSRIGFLDINVIQMPCPGPNIEPWEIPYV----------EECRCHPLRYFTI 211
Query: 61 LSMQRSGSGWFETLLN-------------IKHRRINISSIVSTMDRVYNLDWFTSASKNE 107
LSMQR GSGW ET LN +K RR NI++I T+D +YNLDWF SASKNE
Sbjct: 212 LSMQRFGSGWHETFLNSHPNISSNGEIFSVKVRRSNITTITETLDTIYNLDWFNSASKNE 271
Query: 108 CSAAVGFKWM 117
C+AAVG W+
Sbjct: 272 CTAAVGLAWL 281
>gi|242034033|ref|XP_002464411.1| hypothetical protein SORBIDRAFT_01g017756 [Sorghum bicolor]
gi|241918265|gb|EER91409.1| hypothetical protein SORBIDRAFT_01g017756 [Sorghum bicolor]
Length = 146
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 88/142 (61%), Gaps = 3/142 (2%)
Query: 142 FLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELK 201
F+ RRN LR++IS AN YD +AK LN TH+S VHS + KI+ K KP ++++ +I
Sbjct: 1 FVFRRNTLRKLISFTANDYDNDAKQLNATHRSRVHSKEKPKIITKLKPHLDSSTMIT--- 57
Query: 202 QIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLS 261
IE+ +++ STRH++ YYED+I N L VQEFLR+ R L S+QVK+H PL
Sbjct: 58 NIEKAIRDCLDHLNSTRHMILYYEDIINNSNALSWVQEFLRVPVRRLMSKQVKVHMRPLP 117
Query: 262 KQVENWDDVQKALKGTSYERFL 283
++N + V L T + FL
Sbjct: 118 DLIKNREKVSIKLNRTEFAHFL 139
>gi|255641735|gb|ACU21138.1| unknown [Glycine max]
Length = 144
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 54/86 (62%), Gaps = 4/86 (4%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAI 60
+C Q S +T ++ KVID QS ++ LHYP P +F+R EC NPV +FAI
Sbjct: 43 VCLMQISTQARTTFMDFKVIDNHSQSI-LKLMNTHLLHYPKPASFSRNECVHNPVLFFAI 101
Query: 61 LSMQRSGSGWFETLLNIKHRRINISS 86
LS QRSGSGWFETLLN IN+SS
Sbjct: 102 LSNQRSGSGWFETLLN---SHINVSS 124
>gi|367060058|gb|AEX10990.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060060|gb|AEX10991.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060064|gb|AEX10993.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060066|gb|AEX10994.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060068|gb|AEX10995.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060070|gb|AEX10996.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060072|gb|AEX10997.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060074|gb|AEX10998.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060076|gb|AEX10999.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060078|gb|AEX11000.1| hypothetical protein 0_11079_01 [Pinus taeda]
gi|367060080|gb|AEX11001.1| hypothetical protein 0_11079_01 [Pinus taeda]
Length = 72
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/72 (55%), Positives = 49/72 (68%), Gaps = 1/72 (1%)
Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
FKSTRH V YYE+ I+ + E+ +FL L RELTSR VKIH+ PLS+ V NW +V
Sbjct: 1 FKSTRHTVIYYEE-ISKPKKIMEILKFLGLKPRELTSRHVKIHTKPLSEHVHNWQEVNNR 59
Query: 274 LKGTSYERFLHS 285
LKGT +E FLH
Sbjct: 60 LKGTEFEVFLHD 71
>gi|367060062|gb|AEX10992.1| hypothetical protein 0_11079_01 [Pinus taeda]
Length = 72
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 39/72 (54%), Positives = 48/72 (66%), Gaps = 1/72 (1%)
Query: 214 FKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
FKSTRH V YYE+ I+ + E+ +FL L RELTSR VKIH+ PLS+ V NW +V
Sbjct: 1 FKSTRHTVIYYEE-ISKPKKIMEILKFLGLKPRELTSRHVKIHTKPLSEHVHNWQEVNNR 59
Query: 274 LKGTSYERFLHS 285
LKGT +E LH
Sbjct: 60 LKGTEFEVLLHD 71
>gi|323449148|gb|EGB05038.1| hypothetical protein AURANDRAFT_66664 [Aureococcus anophagefferens]
Length = 837
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 82/180 (45%), Gaps = 21/180 (11%)
Query: 68 SGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHN 127
+G LLN R ++ + M+ V D + S + +AAVGFKWM NQG HH
Sbjct: 406 AGEMRDLLN---RVGDVRASWEHMEAVIEAD-YRSLCEGRQAAAVGFKWMTNQGHSIHHA 461
Query: 128 DIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKIL-AK 186
I E ++ G S I+L RRNLLRR IS AN N + H EA++ A+
Sbjct: 462 RIVE--RRTGTSIIYLFRRNLLRREISNAAN---------NQLGDAQAHPKTEAELASAR 510
Query: 187 YKPTI-NATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN----RATLKEVQEFL 241
T+ L+ + + + A + Y+ + YEDL+ A EV EFL
Sbjct: 511 GNVTLFEGVKLVSVIVKRLEARATVVGYYADIPSLFLAYEDLVAGSSDAEARWAEVFEFL 570
>gi|323453203|gb|EGB09075.1| hypothetical protein AURANDRAFT_63694 [Aureococcus anophagefferens]
Length = 884
Score = 63.9 bits (154), Expect = 8e-08, Method: Composition-based stats.
Identities = 53/173 (30%), Positives = 80/173 (46%), Gaps = 17/173 (9%)
Query: 113 GFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHK 172
GFKWM NQG E + I +Y +GV ++L RRN+LR++IS LAN + + H
Sbjct: 676 GFKWMTNQGHDERRHAIKKYMLNRGVKLVYLWRRNVLRQLISNLANR-----RTVGLAHP 730
Query: 173 SHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK-STRHIVFYYEDLITNR 231
A+A + + + LI LK+IE+ + Y+ + +YEDLI
Sbjct: 731 ESREDAAKADVDLELPAGGD---LIHALKKIERQRRRVRSYYHPEVKETAVFYEDLIAGS 787
Query: 232 ATLKE----VQEFLRL---AYRELTSRQVKIH-SGPLSKQVENWDDVQKALKG 276
+ V +FL A+ + V IH S P V N D+V+ L+
Sbjct: 788 PRYNDSWARVVKFLGAAPHAFEPSAADTVIIHQSRPTLASVRNADEVRATLEA 840
>gi|323451151|gb|EGB07029.1| hypothetical protein AURANDRAFT_71905 [Aureococcus anophagefferens]
Length = 1695
Score = 55.1 bits (131), Expect = 4e-05, Method: Composition-based stats.
Identities = 48/203 (23%), Positives = 86/203 (42%), Gaps = 24/203 (11%)
Query: 103 ASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDR 162
A++ + A GFKWM++Q H E + GV +FL+RRN R+++S L N+ D+
Sbjct: 1214 AARGGYAVATGFKWMLSQRAGRHWPWFVELCRDFGVRLVFLLRRNAARQLVSRLLNAEDK 1273
Query: 163 NAKLLNG-THKSHVHSPAEAKILAKYKPTINATLLIPELK-------QIEQTTAKAIEYF 214
G H +H +S + L + T ++ +L+ ++E+ A+
Sbjct: 1274 ARAQAGGLRHSAHPNSESRLDELRSRRVTFRRDQVLEQLQDMREKWDELERLRLYALARG 1333
Query: 215 KSTRHIVFYYEDLITNRATLKEVQEFL--------------RLAYRELTSRQVKIHSGPL 260
+ +V YED+ + + + + +FL A R KIH+ L
Sbjct: 1334 VPSARVV--YEDVDADHSLVGNLSDFLLADVGAEERANCAAAFASRPPPQGHEKIHTSSL 1391
Query: 261 SKQVENWDDVQKALKGTSYERFL 283
+ N+ +V L T + L
Sbjct: 1392 EDLIVNFKEVWHLLHNTPWRHCL 1414
>gi|440714456|ref|ZP_20895035.1| hypothetical protein RBSWK_02092 [Rhodopirellula baltica SWK14]
gi|436440652|gb|ELP33956.1| hypothetical protein RBSWK_02092 [Rhodopirellula baltica SWK14]
Length = 273
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 87/181 (48%), Gaps = 16/181 (8%)
Query: 109 SAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLN 168
S AV F+ M NQ ++ + + + +G+ I L+R +LLR+ +S + N L+
Sbjct: 95 SGAVAFRGMYNQ--LQRKSIVNHLLQVQGLKIIHLMRDDLLRQYVSRKQMHHRYN---LH 149
Query: 169 GTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRH--IVFYYED 226
G +H P + + I+ + E+ + ++ K + F+ + + +YE
Sbjct: 150 GKGSAHTSKPIKLNAIP-----ISPAAALAEMTFMARSREKLHKAFEDNGNEILTLHYER 204
Query: 227 LI----TNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERF 282
L+ + A K + EFL + E+TS VK+ L VEN+DD++ A++ + + F
Sbjct: 205 LLGGGMVDEADRKTLCEFLSVRDVEMTSDLVKMSQARLEDMVENFDDLRTAVQASEFADF 264
Query: 283 L 283
L
Sbjct: 265 L 265
>gi|413955390|gb|AFW88039.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
Length = 113
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 30/47 (63%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNR 47
MC KQ ++++ I+V++QPC + + SE+ F HYP P T++R
Sbjct: 59 MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSR 105
>gi|413955389|gb|AFW88038.1| hypothetical protein ZEAMMB73_299317 [Zea mays]
Length = 105
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 30/47 (63%)
Query: 1 MCSKQTSPHTKTKLLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNR 47
MC KQ ++++ I+V++QPC + + SE+ F HYP P T++R
Sbjct: 59 MCMKQLGSDGWSRIVKIEVVEQPCNKSTVPPSEVQFAHYPQPTTYSR 105
>gi|323452062|gb|EGB07937.1| hypothetical protein AURANDRAFT_71728 [Aureococcus anophagefferens]
Length = 1699
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 40/123 (32%), Positives = 60/123 (48%), Gaps = 19/123 (15%)
Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
A+VGFK NQG HH I E ++ G + ++L RRNLLR IS AN+ ++A
Sbjct: 18 ASVGFK--TNQGHSVHHARIIE--RRTGTTILYLFRRNLLRHEISKNANTQLKDA----- 68
Query: 170 THKSHVHSPAEAKILAKYKPTIN----ATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYE 225
+ P A LA + + A L+ +K++E AK + Y+ + YE
Sbjct: 69 -----LAHPKTAAELASARGNVTLFEGAELVSDMVKRLE-ARAKVVGYYAGIPSLFLAYE 122
Query: 226 DLI 228
DL+
Sbjct: 123 DLV 125
>gi|254422696|ref|ZP_05036414.1| hypothetical protein S7335_2848 [Synechococcus sp. PCC 7335]
gi|196190185|gb|EDX85149.1| hypothetical protein S7335_2848 [Synechococcus sp. PCC 7335]
Length = 245
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 71/161 (44%), Gaps = 7/161 (4%)
Query: 127 NDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAK 186
N + + G I+L RR+LLR I+ L + + + +K++ + +
Sbjct: 82 NQFLQDLHRGGCRVIYLQRRDLLRHAIATLKAD---DIQFHSSLNKTNAFGSPSNWVHSN 138
Query: 187 YKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNR----ATLKEVQEFLR 242
+ TI L+ L IE A H+ YED + + AT + + EFL
Sbjct: 139 RRVTIPVQALLDCLSYIEAQRVDAQAILHGIPHLSLTYEDDLIDPNVYPATAQRLSEFLE 198
Query: 243 LAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
+ +LT R VK+ ++ V N+D+V++AL+ + L
Sbjct: 199 IRSLKLTGRLVKLVHQDIADLVANYDEVRQALESSENAHLL 239
>gi|323454077|gb|EGB09947.1| hypothetical protein AURANDRAFT_62424 [Aureococcus anophagefferens]
Length = 348
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 80/199 (40%), Gaps = 36/199 (18%)
Query: 111 AVGFKWMMNQGLMEHHNDIA-------EYFKKKGVSAIFLIRRNLLRRMISILANSYDRN 163
A GFKWMM+Q H D A E + + +FL R NLLR S +A +
Sbjct: 106 ACGFKWMMSQ-----HVDRAWDAGWLPELCAARKLRVVFLERLNLLRVHASTVAKNL--- 157
Query: 164 AKLLNGTHKSHV-HSPAEAKILAKYKPTINATLLIPELKQIEQ-----TTAKAIEYFKST 217
A G H++ +P EA Y + L+ +L+ I +A +
Sbjct: 158 ASTSAGGHRNPTGEAPREA-----YIELASGDALVAKLEGIAAQYDGLRRVRAEAARRGV 212
Query: 218 RHIVFYYEDLITNRAT-LKEVQEFLR---------LAYRELTSRQVKIHSGPLSKQVENW 267
+ YE L +R L V F+ A+ SRQ +IH+G LS V NW
Sbjct: 213 ATLAVTYEALQADRGGGLDRVARFVAGGGACDVGAYAFAGNASRQTQIHAGKLSSYVSNW 272
Query: 268 DDVQKALKGTSYERFLHSD 286
V L+ T + +L D
Sbjct: 273 AAVVDTLRPTRFACYLDED 291
>gi|1531623|gb|AAB16900.1| NodH [Rhizobium sp. N33]
Length = 248
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 102/244 (41%), Gaps = 29/244 (11%)
Query: 58 FAILSMQRSGSGWFETLLNIKHRRINISSIVSTMD--------------RVYNLDWFTSA 103
FAIL+M R+G+ + E LLN ++ +++ D + L +
Sbjct: 11 FAILAMPRTGTHYLEVLLNEHPNILSNGELLNEYDTNWPDKDRLLLGDRELLELAYVRCP 70
Query: 104 SKNECSAA-VGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDR 162
+ ++ + VG K Q +H AE + G+ I ++RRN+L + S++ +
Sbjct: 71 TWSDKTVTHVGCKVNEPQ-FHDHPGFFAELARWPGLKVILVVRRNILESLRSLVQAR--Q 127
Query: 163 NAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVF 222
+ + L + P ++ P N K + A+ F + +V
Sbjct: 128 SGQWLKFSSDKVGAPPPRVRL-----PIANCEAY---FKTADAFYARVAHAFAPSNVLVI 179
Query: 223 YYEDLITNRAT-LKEVQEFLRLAYRELTSRQV--KIHSGPLSKQVENWDDVQKALKGTSY 279
YE L+ AT L V +FL + +L+ R + + + PL + VEN+D+++ Y
Sbjct: 180 EYESLLQEPATCLGAVWDFLGVPGLQLSGRAILQRQEARPLDETVENFDELRLHFANGPY 239
Query: 280 ERFL 283
RF
Sbjct: 240 ARFF 243
>gi|292493296|ref|YP_003528735.1| hypothetical protein Nhal_3305 [Nitrosococcus halophilus Nc4]
gi|291581891|gb|ADE16348.1| hypothetical protein Nhal_3305 [Nitrosococcus halophilus Nc4]
Length = 243
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 79/191 (41%), Gaps = 30/191 (15%)
Query: 103 ASKNECSAAVGFKWMMNQ------GLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISIL 156
++N+ V FK M NQ + HH DI I L R NLL++ +S
Sbjct: 72 CARNDAPVLV-FKSMYNQLNNKVCNFLTHHTDI---------RVIHLRRENLLKQYVS-- 119
Query: 157 ANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKS 216
K+L G + P K + I+ T I E++++ + +
Sbjct: 120 --------KMLLGAKRERRWQPHTTKKVPGVSIHISPTAAIKEMQRVRNQFLEFEQLLSH 171
Query: 217 TRHIVFYYEDLITNRATLKE----VQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQK 272
I YE +I ++ E + E +L + VKI+ L V+N+D++
Sbjct: 172 HHRIELVYETMINGQSLSNEAAEKICELFQLNPAPMHCDFVKINPNELELMVKNYDELAS 231
Query: 273 ALKGTSYERFL 283
AL+GT +E+FL
Sbjct: 232 ALRGTEFEQFL 242
>gi|218441161|ref|YP_002379490.1| sulfotransferase [Cyanothece sp. PCC 7424]
gi|218173889|gb|ACK72622.1| sulfotransferase, putative [Cyanothece sp. PCC 7424]
Length = 278
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 79/175 (45%), Gaps = 14/175 (8%)
Query: 111 AVGFKWMMNQGLMEHHNDIAEYFK-KKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
AVGFK + + I Y K +K + I L R N LR ++S+ R A N
Sbjct: 114 AVGFKLFYYHAQNDSRHLIWSYLKEQKTLKIIHLQRDNTLRELLSL------RKAFKTNK 167
Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLIT 229
+ E I +Y+ + E ++ AK ++F+ + I YE+L
Sbjct: 168 WTNTDGMEEQEFSIKLEYEDCLQ------EFTHSQEIKAKYNKFFQDHQVINMIYENLSN 221
Query: 230 NRAT-LKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
+ T LK++Q+FL + Y+ + K PL + + N+ ++++ +GT +E F
Sbjct: 222 DYETELKKLQDFLEVDYKPVKPLTYKQSKQPLKEAISNYYELKQKFQGTPWEAFF 276
>gi|344339260|ref|ZP_08770189.1| Stf0 sulfotransferase [Thiocapsa marina 5811]
gi|343800564|gb|EGV18509.1| Stf0 sulfotransferase [Thiocapsa marina 5811]
Length = 248
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 63/269 (23%), Positives = 109/269 (40%), Gaps = 72/269 (26%)
Query: 58 FAILSMQRSGSGWFETLLN------------------------------------IKHRR 81
F +LS QRSGS T L+ + HRR
Sbjct: 6 FVVLSTQRSGSTLLRTALDSHPEISCRGELFLPSYSQGDSFRDFVSEKKLSRISELLHRR 65
Query: 82 INISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQ-GLMEHHNDIA-EYFKKKGVS 139
+ S +DR+Y+ + + A GFK M Q G + + +Y + GV
Sbjct: 66 ---RLVYSYLDRLYD---------DTLAGARGFKIMYGQMGYRPYKFPMVFDYIRDYGVR 113
Query: 140 AIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKP---TINATLL 196
I L+R N L IS + R +K + + Y+P T+N L
Sbjct: 114 VIHLVRENTLDICIS---RQFARASKTYHAVEQ--------------YQPRSQTVNIPTL 156
Query: 197 IPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRA-TLKEVQEFLRL-AYRELTSRQVK 254
I E++++E+ + K R + YED + +R+ + + + EFL + + EL S +
Sbjct: 157 ILEMRKVEREKIRWRSAVKGLRCLDQTYEDFVNSRSESSRRILEFLGVESAIELVSPLKR 216
Query: 255 IHSGPLSKQVENWDDVQKALKGTSYERFL 283
+ L+++V N++++ + + Y RFL
Sbjct: 217 LSGAHLNERVLNYEEMVQEVVSHGYGRFL 245
>gi|323448004|gb|EGB03908.1| expressed protein [Aureococcus anophagefferens]
Length = 359
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 30/48 (62%)
Query: 111 AVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILAN 158
A GFKWM+NQG+ E + + + + +FL RR+LLR ++SI N
Sbjct: 132 AYGFKWMVNQGMDELWGPLLATARARRLRVVFLYRRDLLRMLVSIAHN 179
>gi|307154289|ref|YP_003889673.1| hypothetical protein Cyan7822_4488 [Cyanothece sp. PCC 7822]
gi|306984517|gb|ADN16398.1| hypothetical protein Cyan7822_4488 [Cyanothece sp. PCC 7822]
Length = 261
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 63/268 (23%), Positives = 106/268 (39%), Gaps = 58/268 (21%)
Query: 60 ILSMQRSGSGWFETLLNIKHRRINISSIV---------------------STMDRVYNLD 98
IL+ RSGS + +LN N ++ S +D +YN
Sbjct: 6 ILTNGRSGSNYISGVLNSHPHITNYGEVLGEWTVPYMIYDKFFKYRMSSESYLDYMYNSK 65
Query: 99 WFTSAS----------KNECSAAVGFKWMMNQGLMEH-----HNDIAEYF-KKKGVSAIF 142
+F +A+ + E +K + + G+ E +I Y KK + I
Sbjct: 66 FFFNAAQIYSAYSKIKRKEKINFKSYKQIKSLGIKEFSINFFRRNIDNYLASKKNIFVIN 125
Query: 143 LIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ 202
L R N L R++SI A + + K L K K +N ++ L
Sbjct: 126 LYRENSLARLVSIEA--------------MTTTGIVSTQKKLGKIKIYLNPEGILERLNI 171
Query: 203 IEQTTAKAIEYFKSTRHIVFY---YEDLI----TNRATLKEVQEFLRLAYRELTSRQVKI 255
E+ + ++ + YE+ + R K++ EFL++ ++T Q KI
Sbjct: 172 FEKEKKDQFDLIENLEQKFIFNISYEEYFASPESQRELTKKIFEFLKVEPIDVTLNQKKI 231
Query: 256 HSGPLSKQVENWDDVQKALKGTSYERFL 283
S LS VEN++++ LKGT YE++L
Sbjct: 232 LSRKLSNTVENYEEIVSILKGTQYEQYL 259
>gi|434399105|ref|YP_007133109.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
gi|428270202|gb|AFZ36143.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
Length = 286
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 1/85 (1%)
Query: 200 LKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATL-KEVQEFLRLAYRELTSRQVKIHSG 258
++I+ K F I YYEDL N + K++Q FL + +EL + KI
Sbjct: 200 FEEIQSHQNKYTNLFNGYPMIDIYYEDLCINSNDVTKQIQNFLGIEVQELKTATKKIIQS 259
Query: 259 PLSKQVENWDDVQKALKGTSYERFL 283
PLS Q+ N+ ++++ T YE +
Sbjct: 260 PLSSQINNYYELKEKFINTPYEEYF 284
>gi|153875891|ref|ZP_02003487.1| conserved hypothetical protein [Beggiatoa sp. PS]
gi|152067645|gb|EDN66513.1| conserved hypothetical protein [Beggiatoa sp. PS]
Length = 243
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 99/244 (40%), Gaps = 29/244 (11%)
Query: 54 PVRYFAILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDW--------FTSASK 105
P + F + R+GS L+N H + + RV + + + A K
Sbjct: 14 PNKKFVMFFRPRAGSTLLCDLMN-SHPDVCCDWEIFGAGRVGKVSFPKLYLKGRYAIAEK 72
Query: 106 NECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAK 165
N V K + +Q E ++ F ++G I++ R N LR+ +S L
Sbjct: 73 NVYGFKVNIKQIRDQKF-EAQTFFSD-FDQQGWKIIYIRRENSLRQAVSFLI-------- 122
Query: 166 LLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYE 225
H+S + K +N L+ ++ +IEQ K +E + YE
Sbjct: 123 ---AEHRSEWIGKPNNTLTGKV--PVNPEELVSKIGRIEQAVQKEVELLAPYSTLNVIYE 177
Query: 226 DLITN----RATLKEVQEFLRLAYRELTSRQVKIH-SGPLSKQVENWDDVQKALKGTSYE 280
+ + N + TL +V +FL + + ++ K S LS +EN+++V++ + T Y
Sbjct: 178 NDLLNAVQQQITLDKVFDFLGIDSVPIKTKMAKTAASSNLSDIIENYEEVERVISQTPYA 237
Query: 281 RFLH 284
FL
Sbjct: 238 HFLE 241
>gi|128469|sp|P06237.1|NOH4_RHIME RecName: Full=Nodulation protein H; AltName: Full=Host-specificity
of nodulation protein D
gi|152243|gb|AAA26291.1| host-specificity of modulation protein D [Sinorhizobium meliloti]
Length = 247
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 98/248 (39%), Gaps = 30/248 (12%)
Query: 54 PVRYFAILSMQRSGSGWFETLLNIKHRRINISSIVSTMD-------RVYNLD-------- 98
P R FAIL+M+R+G+ + E L+N ++ +++T D R+ D
Sbjct: 7 PPRPFAILAMRRTGTHYLEELVNEHPNVLSNGELLNTYDTNWPDKERLLLSDRELLERAC 66
Query: 99 WFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILAN 158
W ++ VG K + E + AE G+ I +IRRN L + S +
Sbjct: 67 WRYPPHSDKKVTHVGCK-INEPQFQERPSFFAELTAWPGLKVILVIRRNTLESLRSFVQA 125
Query: 159 SYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTR 218
R KS +P +L T A K + A+ + F S+R
Sbjct: 126 RQTRQWLQF----KSDSSAPPPPVMLPFA--TCEAY-----FKAADDFHARVVNAFDSSR 174
Query: 219 HIVFYYEDLITNRAT-LKEVQEFLRLAYRELTSRQV--KIHSGPLSKQVENWDDVQKALK 275
+ YE L+ + + V +FL +L R + + + PL + V N+ +++
Sbjct: 175 IRLIEYERLLRDPVPCVATVLDFLGAPALQLADRGILRRQETRPLDQTVRNFHELRVHFA 234
Query: 276 GTSYERFL 283
Y RF
Sbjct: 235 NGPYARFF 242
>gi|410861292|ref|YP_006976526.1| hypothetical protein amad1_08300 [Alteromonas macleodii AltDE1]
gi|410818554|gb|AFV85171.1| hypothetical protein amad1_08300 [Alteromonas macleodii AltDE1]
Length = 264
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 82/167 (49%), Gaps = 22/167 (13%)
Query: 127 NDIAEYF-KKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILA 185
+++ YF K +S I L R N+L+R+IS + H++ V S +
Sbjct: 111 DNLYNYFLDSKEISVIHLKRSNILKRLISAML------------MHQAGVVSTTDNSTEN 158
Query: 186 KYKPTINATLLIPELKQI----EQTTAKAIEYFKSTRHIVFYYEDLITN-RATLK---EV 237
K I+ L+ +LK+ E+ A A + K YED ++ +T K ++
Sbjct: 159 K-SFEIDIEQLLAKLKRDYELNEEENAFAAKLSKKQPFYQLSYEDYFSSPESTSKHNSQI 217
Query: 238 QEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH 284
+FL + ++ S+ K+ S L+ + N+DD+++ALK T +E++L+
Sbjct: 218 FDFLGVESTQVKSKHRKVLSQNLADIITNYDDIERALKTTKFEQYLY 264
>gi|113477543|ref|YP_723604.1| hypothetical protein Tery_4123 [Trichodesmium erythraeum IMS101]
gi|110168591|gb|ABG53131.1| hypothetical protein Tery_4123 [Trichodesmium erythraeum IMS101]
Length = 184
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 72/155 (46%), Gaps = 23/155 (14%)
Query: 137 GVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLL 196
G I+L R NL+ +S + NA L H H E ++ +YKP + +
Sbjct: 37 GYKFIYLTRSNLVAHALS------NINASLKQKFH----HRSCEGEL--EYKPIL---VE 81
Query: 197 IPELKQIEQTTAKAIEY----FKSTRHIVFYYEDLITNRA----TLKEVQEFLRLAYREL 248
I E+ Q Q + + EY K HI YE+ + +R T+ +V E L +
Sbjct: 82 IEEVFQRIQYSEELGEYEKKLLKKIPHISITYEENLLDRECHQKTVDKVLELFNLPSNPV 141
Query: 249 TSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
+ VK+ LSK VEN++++ +A+K + Y F+
Sbjct: 142 KTNLVKLMPLDLSKMVENYEELIQAIKKSKYAHFI 176
>gi|434400774|ref|YP_007134778.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
gi|428271871|gb|AFZ37812.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
Length = 294
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 23/182 (12%)
Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMIS-ILANSYDRNAKLL 167
+AVGFK +Q +I Y +K V I L R+N+L +S LA N +L
Sbjct: 127 SAVGFKLFYDQAKRGQQKNIWGYLQKMSDVKIIHLKRKNILEAHVSHKLAER--NNQWIL 184
Query: 168 NGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEY---FKSTRHIV--F 222
K + P I Y + A EQT EY F +H +
Sbjct: 185 LDNQKELMLEP----IALDYDECLQA---------FEQTKIWETEYEQFFAKYKHPIQTI 231
Query: 223 YYEDLITNRATL-KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYER 281
YYE+L++N + +E+Q FL + Y L + K + L ++ N+D+++ T + +
Sbjct: 232 YYEELVSNTLLITRELQNFLEVNYWFLNTYTKKQNQVSLRDKIINYDELKDKFSKTIWSK 291
Query: 282 FL 283
F
Sbjct: 292 FF 293
>gi|359459323|ref|ZP_09247886.1| sulfotransferase [Acaryochloris sp. CCMEE 5410]
Length = 256
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 77/186 (41%), Gaps = 43/186 (23%)
Query: 110 AAVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMISILANSYDRNAKLLN 168
+ VGFK N H N + +Y K+ K ++ I L R+NLL+
Sbjct: 91 SGVGFKLFYNHARSSHENQVWDYLKESKDLNVIHLKRKNLLK------------------ 132
Query: 169 GTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYF-----KSTRHIVFY 223
TH S + + K P+ E+ IE +E+F + H F+
Sbjct: 133 -THLSEQVALKTQIWVGKKNPS--------EIYPIELDYNNTLEFFNHMKEQEEEHHQFF 183
Query: 224 ---------YEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKA 273
YEDL+ + +++Q FL++ Y+ L K S PLS+ + N+ ++++
Sbjct: 184 SNHNIFEVNYEDLVKDIEFYSRKLQAFLKIPYKSLYVSTPKQSSIPLSQAISNFRELKER 243
Query: 274 LKGTSY 279
G+ +
Sbjct: 244 FNGSPW 249
>gi|399125168|pdb|4FLC|A Chain A, Structural And Biochemical Characterization Of Human
Adenylosuccinate Lyase (Adsl) And The R303c Adsl
Deficiency Associated Mutation
gi|399125169|pdb|4FLC|B Chain B, Structural And Biochemical Characterization Of Human
Adenylosuccinate Lyase (Adsl) And The R303c Adsl
Deficiency Associated Mutation
gi|399125170|pdb|4FLC|C Chain C, Structural And Biochemical Characterization Of Human
Adenylosuccinate Lyase (Adsl) And The R303c Adsl
Deficiency Associated Mutation
gi|399125171|pdb|4FLC|D Chain D, Structural And Biochemical Characterization Of Human
Adenylosuccinate Lyase (Adsl) And The R303c Adsl
Deficiency Associated Mutation
Length = 487
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 9/87 (10%)
Query: 14 LLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSM---QRSGSGW 70
L N+K +++P + I S +P+ NP+ R EC C+ R+ L M Q + W
Sbjct: 275 LANLKEMEEPFEKQQIGSSAMPYKR--NPM---RSECCCSLARHLMTLVMDPLQTASVQW 329
Query: 71 FE-TLLNIKHRRINISSIVSTMDRVYN 96
FE TL + +RRI ++ T D + N
Sbjct: 330 FERTLDDSANRRICLAEAFLTADTILN 356
>gi|323452209|gb|EGB08084.1| hypothetical protein AURANDRAFT_64396 [Aureococcus anophagefferens]
Length = 1060
Score = 41.6 bits (96), Expect = 0.43, Method: Composition-based stats.
Identities = 58/255 (22%), Positives = 100/255 (39%), Gaps = 41/255 (16%)
Query: 65 RSGSGWFETLLN------IKHRRINISS----------------------IVSTMDRVYN 96
RSGS W +LLN + R+++ + + + + N
Sbjct: 452 RSGSDWLMSLLNAHPAICMPAGRVDVPAGHDPTVLVSKVRAREAARPGRNVTAAFEASAN 511
Query: 97 LDWFTSASKNECSAAVGFK---WMMN-QGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRM 152
+A N AA G+K W+ H + A + ++GV + L+RR + R+
Sbjct: 512 ARVAIAARDNGRCAAFGWKQGLWLAKGYATAAHRAEFAAWVARRGVKLV-LLRRVGVARV 570
Query: 153 ISILANSYDRNAKLLNGTHKSHVH------SPAEAKILAKYKPTINATLLIPELKQIEQT 206
+S N NA LL +VH + A AK+ + A L + ++ +
Sbjct: 571 VSGAKNRLTPNATLLGAEASRNVHCTSTRCAAAVAKLRVRLDAETLAGTLASQKREWDAV 630
Query: 207 TAKAIEYFKSTRHIVFYYEDLITNRAT-LKEVQEFLRLAYRELTSRQVKIHSG-PLSKQV 264
A A R Y++L+ + L E+ +FL + R + SG + +
Sbjct: 631 EAWAAAAAPPPRAFRVTYDELVADTPRWLGELYDFLGVGPPPKPPRTAYVKSGGRAADSI 690
Query: 265 ENWDDVQKALKGTSY 279
EN D+V+ AL GT +
Sbjct: 691 ENLDEVRAALAGTEW 705
>gi|434399106|ref|YP_007133110.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
gi|428270203|gb|AFZ36144.1| sulfotransferase, putative [Stanieria cyanosphaera PCC 7437]
Length = 310
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 42/189 (22%), Positives = 78/189 (41%), Gaps = 27/189 (14%)
Query: 99 WFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMISILA 157
W+ K C W NQ ++ +Y K+ K ++ I L R NLL +S +
Sbjct: 145 WWKDVGKEPC-------WNSNQS------NLWQYLKENKDIAIIHLKRNNLLEAKVSGIT 191
Query: 158 NSYDRNAKL--LNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFK 215
N + G +K++V E +N + + + + + +A E+F
Sbjct: 192 AQTTGNWGIGATGGINKNNVLVKCE----------LNFEECMQDFEALRRMEDEADEFFA 241
Query: 216 STRHIVFYYEDLITNRATL-KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKAL 274
R +V YE+L+ N A + ++Q FL L L ++ K + L ++N+ ++
Sbjct: 242 EHRKLVITYEELVENTANITNKIQVFLGLKTENLVTQSKKQATCSLLDVIDNYYQLKSQF 301
Query: 275 KGTSYERFL 283
T + F
Sbjct: 302 TNTRWSNFF 310
>gi|443320647|ref|ZP_21049735.1| hypothetical protein GLO73106DRAFT_00031410 [Gloeocapsa sp. PCC
73106]
gi|442789646|gb|ELR99291.1| hypothetical protein GLO73106DRAFT_00031410 [Gloeocapsa sp. PCC
73106]
Length = 264
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 72/172 (41%), Gaps = 36/172 (20%)
Query: 129 IAEYFKKKGVSAIFLIRRNLLRRMIS--------ILANSYDRNAKLLNGTHKSHVHSPAE 180
I +Y K + + IFL R NLL+R++S ++A + DR + + +H P +
Sbjct: 110 IPDYLKARDILIIFLYRENLLKRVLSRLNLRKTRVIAITDDR---IQEKNNIEKIHVPID 166
Query: 181 AKILAKYKPTINATLLIPEL-----KQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLK 235
LIPEL +Q EQ IE R + +E +
Sbjct: 167 E--------------LIPELDVFNEEQKEQLMM--IEGIPEERILRITFEQYFASSDNQN 210
Query: 236 E----VQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
E + +FL + L S KI L +EN+++V + LK T Y +FL
Sbjct: 211 EYNNRIFDFLNVDKLTLKSSHKKILPNKLVDIIENYEEVFEVLKNTKYNKFL 262
>gi|254416257|ref|ZP_05030011.1| hypothetical protein MC7420_5093 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196176939|gb|EDX71949.1| hypothetical protein MC7420_5093 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 272
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 69/146 (47%), Gaps = 17/146 (11%)
Query: 141 IFLIRRNLLRRMISILANSYDRNAKLLNGTHKS-------HVHSPAEAKILAKYKPTINA 193
I L R+N LRR+IS + S + + LN K+ +H A I+ KP IN
Sbjct: 96 IVLERKNYLRRIISAILGSQTK-IRHLNLKSKAKLTRTRVDLHQVAFG-IVYNPKPLINC 153
Query: 194 TLLIPELKQIEQTTAKAIEYFKSTRHIVFYYED--LITNRATLKEVQEFLRLAYRELTSR 251
L++I+QT + + + YED L R +++ EF+++ +
Sbjct: 154 ------LEEIDQTYRQLESLLEDKSSLWLTYEDDILPDPRIAYRKICEFIQVEPMAVDIL 207
Query: 252 QVKIHSGPLSKQVENWDDVQKALKGT 277
+ + P+ + VEN+D+++ LKGT
Sbjct: 208 NQRTNPFPICEIVENYDELEAMLKGT 233
>gi|333892195|ref|YP_004466070.1| hypothetical protein ambt_03590 [Alteromonas sp. SN2]
gi|332992213|gb|AEF02268.1| hypothetical protein ambt_03590 [Alteromonas sp. SN2]
Length = 207
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 16/183 (8%)
Query: 95 YNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMIS 154
Y + F+ A + + GFK M +Q + + + Y + I L R+NLL R IS
Sbjct: 27 YFTELFSMARSSPKCRSAGFKLMYSQE--KRNRGLWGYLSQNTDKIIHLKRKNLLNRAIS 84
Query: 155 ILANSYDRNAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIPELKQ-IEQTTA--KAI 211
+ + T +H + A +K K +++ ++ ELK+ I ++++ K I
Sbjct: 85 KV---------YMESTGIAHANEVASGDT-SKPKGSLDIEKVLDELKKDISESSSFSKKI 134
Query: 212 EYFKSTRHIVFYYEDLITN-RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDV 270
R I YYEDLI N TLK++ FL + + +S K+ + V N+ +
Sbjct: 135 TALPPERVITIYYEDLIENLDITLKKLYNFLEVPFHHYSSNFKKLSNEQPEDYVLNYHVL 194
Query: 271 QKA 273
+ A
Sbjct: 195 KAA 197
>gi|168187879|ref|ZP_02622514.1| GTP-binding protein LepA [Clostridium botulinum C str. Eklund]
gi|169294262|gb|EDS76395.1| GTP-binding protein LepA [Clostridium botulinum C str. Eklund]
Length = 601
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/130 (23%), Positives = 64/130 (49%), Gaps = 17/130 (13%)
Query: 103 ASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDR 162
A + E S A+GF + + H I E +++ +++ S++ N Y R
Sbjct: 330 AYEPETSIALGFGFRCGFLGLLHMEIIQERIERE-------FNLDIITTAPSVIYNIYKR 382
Query: 163 NAKLLNGTHKSHVHSPAEAKILAKYKPTINATLLIP--------ELKQIEQTTAKAIEYF 214
+ +++ T+ +++ P E +I+ +P + A+++ P EL Q ++ T +EY
Sbjct: 383 DGEMMKITNPTNMPDPTEIEIME--EPVVKASIITPSDFVGAVMELCQNKRGTFIDMEYI 440
Query: 215 KSTRHIVFYY 224
++TR +V YY
Sbjct: 441 ETTRVVVNYY 450
>gi|74141487|dbj|BAE38524.1| unnamed protein product [Mus musculus]
Length = 484
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 9/87 (10%)
Query: 14 LLNIKVIDQPCQSTNIQQSEIPFLHYPNPVTFNREECACNPVRYFAILSM---QRSGSGW 70
L N+K +++P + I S +P+ NP+ R E C+ R+ L+M Q + W
Sbjct: 272 LANLKEMEEPFEKQQIGSSAMPYKR--NPM---RSERCCSLARHLMALTMDPLQTASVQW 326
Query: 71 FE-TLLNIKHRRINISSIVSTMDRVYN 96
FE TL N +RRI ++ T D + N
Sbjct: 327 FERTLDNSANRRICLAEAFLTADTILN 353
>gi|427416768|ref|ZP_18906951.1| Stf0 sulfotransferase [Leptolyngbya sp. PCC 7375]
gi|425759481|gb|EKV00334.1| Stf0 sulfotransferase [Leptolyngbya sp. PCC 7375]
Length = 256
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 26/237 (10%)
Query: 58 FAILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFKWM 117
F I RSGS +LLN I++ ++ + + + N ++ G K +
Sbjct: 26 FIIFGRGRSGSTALVSLLNCLPDVCCDGEILA-QPVLWPQLYLKAKAANAQASVYGCKVL 84
Query: 118 MNQGLMEHHNDIAEYF----KKKGVSAIFLIRRNLLRRMIS-ILANSYD--RNAKLLNGT 170
Q H +F + G ++L R NLL IS I A ++ ++L+ G
Sbjct: 85 SYQLRNIHSIQQGSHFLYELSQAGFHILYLRRENLLDHAISNIRARNFGFHHKSRLVQGF 144
Query: 171 HKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLITN 230
K+ +H +N ++ +++ +Q + + ++ YE + N
Sbjct: 145 QKNKIH--------------VNPVTVVDWIEKSQQLWQYELSLLRDIPYLELTYEKNLAN 190
Query: 231 RA----TLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
+ T+ + ++L + R S+ K+ L VEN+D++ L+ T Y +L
Sbjct: 191 ESQHQFTIDTICQYLGIQSRAAVSQYQKVSPQTLQASVENYDELISYLRATPYHHYL 247
>gi|414869371|tpg|DAA47928.1| TPA: hypothetical protein ZEAMMB73_204795 [Zea mays]
Length = 110
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 21 DQPCQSTNIQQSEIPFLHYPNPVTFNR 47
++PC +I +EIP+LHYP P T++R
Sbjct: 84 EKPCHDPSIPDTEIPYLHYPMPNTYDR 110
>gi|312897719|ref|ZP_07757135.1| virulence-associated protein E [Megasphaera micronuciformis F0359]
gi|310621103|gb|EFQ04647.1| virulence-associated protein E [Megasphaera micronuciformis F0359]
Length = 826
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 44/200 (22%), Positives = 86/200 (43%), Gaps = 20/200 (10%)
Query: 85 SSIVSTMDRVYNLDWFTSASKNECSAAVGFKWMMNQGLMEHHNDIAEYFKKKGVSAIFLI 144
S+++ TM + + D ++ E + + KW++ G M + E K+ FL
Sbjct: 557 STLIRTMGKKWYADGLSTFEGKEAAENIQGKWIIEAGEMAGYTRAEENASKQ-----FLS 611
Query: 145 RR-NLLRRMISILANSYDRNAKLLNGTHKSHVHSPAEAK------ILAKYKPTINATLLI 197
R+ ++ R+ Y R +++ + KPT N + +
Sbjct: 612 RQVDVFRQAYGRRTQEYPRRCVFFGSSNQYEFLKDITGNRRFWPVDIEAQKPTKNVYVNL 671
Query: 198 PELKQIEQTTAKAIEYFKSTRHIVFYYEDLITNRATLKEVQEFLRLAYRELTSRQVKIHS 257
P +++Q A+A+ +K+ ++ ED N A LK + E R A+ E S+Q I+
Sbjct: 672 P--GEVDQIWAEAVVRYKNGESLII--ED---NEAALK-IAETAREAHMESNSKQGLINE 723
Query: 258 GPLSKQVENWDDVQKALKGT 277
L K +NW+ + ++ + T
Sbjct: 724 FLLQKVPKNWNTMSRSARRT 743
>gi|354556386|ref|ZP_08975681.1| hypothetical protein Cy51472DRAFT_4478 [Cyanothece sp. ATCC 51472]
gi|353551598|gb|EHC20999.1| hypothetical protein Cy51472DRAFT_4478 [Cyanothece sp. ATCC 51472]
Length = 273
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 33/50 (66%)
Query: 235 KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLH 284
+++ +FL ++ L S+ KI S L ++N+D+V + ++GT YE+FLH
Sbjct: 223 QKIFDFLGVSKLNLVSQHKKILSQNLVDILKNYDEVVEIIQGTKYEKFLH 272
>gi|254444117|ref|ZP_05057593.1| hypothetical protein VDG1235_2356 [Verrucomicrobiae bacterium
DG1235]
gi|198258425|gb|EDY82733.1| hypothetical protein VDG1235_2356 [Verrucomicrobiae bacterium
DG1235]
Length = 192
Score = 38.1 bits (87), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 4/68 (5%)
Query: 220 IVFYYEDLITN----RATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
+ YED I + R + V +F+ + Y E KI L +EN+++++ AL
Sbjct: 111 LTLTYEDHIASEDKQRMMMPLVCDFIGIPYEEGNCAYKKISPRSLRDSIENYEEIETALA 170
Query: 276 GTSYERFL 283
GT YE FL
Sbjct: 171 GTKYEVFL 178
>gi|172038036|ref|YP_001804537.1| hypothetical protein cce_3123 [Cyanothece sp. ATCC 51142]
gi|171699490|gb|ACB52471.1| unknown [Cyanothece sp. ATCC 51142]
Length = 282
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 33/51 (64%)
Query: 235 KEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFLHS 285
+++ +FL ++ L S+ KI S L ++N+D+V + ++GT YE+FLH
Sbjct: 232 QKIFDFLGVSKLNLVSQHKKILSQNLVDILKNYDEVVEIIQGTKYEKFLHD 282
>gi|254425995|ref|ZP_05039712.1| hypothetical protein S7335_563 [Synechococcus sp. PCC 7335]
gi|196188418|gb|EDX83383.1| hypothetical protein S7335_563 [Synechococcus sp. PCC 7335]
Length = 266
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 219 HIVFYYEDL---ITNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALK 275
H V Y DL +++ T+ + E+L LA R +T++ K++S PL + + N+D+ L
Sbjct: 199 HEVIYERDLENAQSHQQTVDRILEYLSLAPRVVTTKHKKVNSRPLEETISNYDEFIDCLD 258
Query: 276 GTSYERFL 283
++ FL
Sbjct: 259 KHGWQSFL 266
>gi|93115452|gb|ABE98414.1| glycosyltransferase family 2 [Escherichia coli]
gi|203285026|gb|ACH97144.1| WclW [Escherichia coli]
Length = 266
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 19/137 (13%)
Query: 59 AILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFK--- 115
A+L R+G W L + RR+ S+I+ Y DWF +E S G +
Sbjct: 118 ALLFEYRTGM-WM--LPSFTMRRMLKSNIIFCSAMYYREDWFNVGGYDE-SLRTGLEDWD 173
Query: 116 -WMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSH 174
W+ GL +HNDI K + IRRN + R I+ Y K++N HK H
Sbjct: 174 FWLSLIGLYNNHNDIVARVNKP--LFFYRIRRNSMLRSIN-----YTEKEKIVNYIHKKH 226
Query: 175 VHSPAEAKILAKYKPTI 191
+ L K KP I
Sbjct: 227 IQLFE----LYKVKPLI 239
>gi|203285014|gb|ACH97133.1| WclW [Escherichia coli]
Length = 266
Score = 37.4 bits (85), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 19/137 (13%)
Query: 59 AILSMQRSGSGWFETLLNIKHRRINISSIVSTMDRVYNLDWFTSASKNECSAAVGFK--- 115
A+L R+G W L + RR+ S+I+ Y DWF +E S G +
Sbjct: 118 ALLFEYRTGM-WM--LPSFTMRRMLKSNIIFCSAMYYREDWFNVGGYDE-SLRTGLEDWD 173
Query: 116 -WMMNQGLMEHHNDIAEYFKKKGVSAIFLIRRNLLRRMISILANSYDRNAKLLNGTHKSH 174
W+ GL +HNDI K + IRRN + R I+ Y K++N HK H
Sbjct: 174 FWLSLIGLYNNHNDIVARVNKP--LFFYRIRRNSMLRSIN-----YTEKEKIVNYIHKKH 226
Query: 175 VHSPAEAKILAKYKPTI 191
+ L K KP I
Sbjct: 227 IQLFE----LYKVKPLI 239
>gi|443327654|ref|ZP_21056274.1| tetratricopeptide repeat protein [Xenococcus sp. PCC 7305]
gi|442792746|gb|ELS02213.1| tetratricopeptide repeat protein [Xenococcus sp. PCC 7305]
Length = 511
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 79/176 (44%), Gaps = 10/176 (5%)
Query: 111 AVGFKWMMNQGLMEHHNDIAEYFKK-KGVSAIFLIRRNLLRRMISILANSYDRNAKLLNG 169
AVGFK Q L + I +Y KK K + I L R+N+L +S + KL+
Sbjct: 341 AVGFKLFYFQ-LKNRQHVIWQYLKKLKNLKIIHLTRKNMLHTYVSHKLATISNQWKLIKE 399
Query: 170 THKSHVHSPAEAKILAKYKPTINATLLIPELKQIEQTTAKAIEYFKSTRHIVFYYEDLI- 228
S S +E I+ Y+ + L+ + +T + ++ YEDL+
Sbjct: 400 EKNSSRFSLSEP-IVIDYEECLATFKLV----RYWETQYGNFFDLSTQQYCNVNYEDLVG 454
Query: 229 -TNRATLKEVQEFLRLAYRELTSRQVKIHSGPLSKQVENWDDVQKALKGTSYERFL 283
T + T K +QEF+ + + LT +K P+ + V N+D +++ + + + F
Sbjct: 455 KTEQVTNK-LQEFIGVDRQPLTFSTIKQGLTPMKEMVANYDQLKRKFQNSCWVEFF 509
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.132 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,147,468,170
Number of Sequences: 23463169
Number of extensions: 154873983
Number of successful extensions: 379381
Number of sequences better than 100.0: 146
Number of HSP's better than 100.0 without gapping: 108
Number of HSP's successfully gapped in prelim test: 38
Number of HSP's that attempted gapping in prelim test: 379125
Number of HSP's gapped (non-prelim): 160
length of query: 289
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 148
effective length of database: 9,050,888,538
effective search space: 1339531503624
effective search space used: 1339531503624
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)