BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040989
(700 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224067960|ref|XP_002302619.1| predicted protein [Populus trichocarpa]
gi|222844345|gb|EEE81892.1| predicted protein [Populus trichocarpa]
Length = 321
Score = 490 bits (1262), Expect = e-136, Method: Compositional matrix adjust.
Identities = 227/301 (75%), Positives = 257/301 (85%), Gaps = 1/301 (0%)
Query: 387 EKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHF 446
+ P+SS+G +KWNSF+S VQ NPT+ELRNGTDVIWQIP PKAVLF+AHGC+GRA +F
Sbjct: 17 KSPDSSFGYPIKKWNSFES-VQFNPTIELRNGTDVIWQIPSLPKAVLFVAHGCDGRAANF 75
Query: 447 WDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERH 506
WDRS +CPNCIGLPEERLLVLHAL+R FA+ITISSAGRCWT GEERL+V+ II+WW + H
Sbjct: 76 WDRSSSCPNCIGLPEERLLVLHALARKFAVITISSAGRCWTFGEERLIVKNIIQWWAQTH 135
Query: 507 KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKD 566
LE LPLVALGASSGGYFVSA+A LRFSSI LMIAEG FDQMDI +YPPTLFVHMPKD
Sbjct: 136 NLESLPLVALGASSGGYFVSAVATNLRFSSITLMIAEGKFDQMDITVNYPPTLFVHMPKD 195
Query: 567 SYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGF 626
SYRQ+KI EF+ VL+NKGID AE+ECMEFPLS +F DRIPG+ Q S KLFELFQ K F
Sbjct: 196 SYRQQKISEFIEVLKNKGIDAAEVECMEFPLSPSFLADRIPGMNQTVSAKLFELFQVKSF 255
Query: 627 IDENGYMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+D NGYM+ DGR TRWKEALRE + L+K++V HVQEELNLAFAYHEMTSL SEQIFKW
Sbjct: 256 VDGNGYMKKDGRATRWKEALRECKNLVLDKSMVQHVQEELNLAFAYHEMTSLQSEQIFKW 315
Query: 687 F 687
F
Sbjct: 316 F 316
>gi|356501481|ref|XP_003519553.1| PREDICTED: uncharacterized protein LOC100782197 [Glycine max]
Length = 330
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 212/295 (71%), Positives = 251/295 (85%)
Query: 393 YGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPN 452
Y L KQKWNS DS V L+P E RNGTD+IWQ+P+SPK VLFLAHGCNGRA++FWD+SP
Sbjct: 31 YELPKQKWNSSDSLVSLHPKKEFRNGTDLIWQVPESPKGVLFLAHGCNGRAINFWDKSPE 90
Query: 453 CPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLP 512
CP+CIGLPEERLLVLH L++GFA+ITISSA RCWTLG+E +VV++II WW+ R KLEKLP
Sbjct: 91 CPDCIGLPEERLLVLHGLAQGFAVITISSAQRCWTLGKEVMVVKDIIEWWIGRRKLEKLP 150
Query: 513 LVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRK 572
LVALGASSGGYFVS LA ++FSS LMIAEG+F+Q+D+ DYPPTLFVHMPKD YRQ+K
Sbjct: 151 LVALGASSGGYFVSVLAAAMKFSSTVLMIAEGMFEQIDVKGDYPPTLFVHMPKDLYRQQK 210
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
I E++ VL++KGIDV +ECMEFPL N DRIPGL+QA S KLFE FQEKGFID+NGY
Sbjct: 211 IDEYVKVLKDKGIDVGVVECMEFPLLPNTLADRIPGLDQALSRKLFEFFQEKGFIDKNGY 270
Query: 633 MRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
M+ DGR+ +WK+AL E A+ L+KNLV H+QEELNLAFAYHEMTS+ S+QIFKWF
Sbjct: 271 MKKDGRQIKWKKALEEKKALLLDKNLVPHIQEELNLAFAYHEMTSVHSDQIFKWF 325
>gi|255558716|ref|XP_002520382.1| conserved hypothetical protein [Ricinus communis]
gi|223540429|gb|EEF41998.1| conserved hypothetical protein [Ricinus communis]
Length = 347
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 221/297 (74%), Positives = 253/297 (85%), Gaps = 1/297 (0%)
Query: 391 SSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRS 450
SS G Q+WN+F+S V+ +PT+E RNGTDVIWQIPDSPKAVLFLAHGCNGRAV+FWDRS
Sbjct: 47 SSPGSTMQRWNTFES-VEFHPTIEFRNGTDVIWQIPDSPKAVLFLAHGCNGRAVNFWDRS 105
Query: 451 PNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEK 510
CPNCIGLPEERLLVLHAL+ FA++TISSAGRCWT GEE LVV+ II+ WV RHKLE
Sbjct: 106 HYCPNCIGLPEERLLVLHALANKFAVLTISSAGRCWTFGEEMLVVKNIIKSWVRRHKLEN 165
Query: 511 LPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQ 570
+PLVALGASSGGYFVSALA LRFSSI +MIAEG+F+Q+D+ ++YPPTLFVHMPKD RQ
Sbjct: 166 IPLVALGASSGGYFVSALATILRFSSITVMIAEGVFNQIDVAQNYPPTLFVHMPKDLPRQ 225
Query: 571 RKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDEN 630
+KI E + L+NKGIDVAE+EC+EFPLS +F DRIPGL+Q S KLFELFQ +GFID+N
Sbjct: 226 QKISESVKFLKNKGIDVAEVECIEFPLSPSFLGDRIPGLDQTISAKLFELFQNRGFIDKN 285
Query: 631 GYMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
YM+ DGR TRWK ALRE+ L+KNLV HVQEELNLAFAYHEMTSL SEQIFKWF
Sbjct: 286 RYMKKDGRATRWKAALRETKDFVLDKNLVQHVQEELNLAFAYHEMTSLQSEQIFKWF 342
>gi|356553585|ref|XP_003545135.1| PREDICTED: uncharacterized protein LOC100799601 [Glycine max]
Length = 351
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 211/299 (70%), Positives = 251/299 (83%)
Query: 389 PNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWD 448
P +Y L KQKWNSFDS V L+PT E RN TD+IWQ+P+SPK VLFLAHGCNGRA++FWD
Sbjct: 48 PILAYELPKQKWNSFDSLVHLHPTKEFRNETDLIWQVPESPKGVLFLAHGCNGRAINFWD 107
Query: 449 RSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKL 508
+SP CP+CIGLPEERLLVLH L++GFA+ITISSA RCWT G+E LVV++II WW+ R KL
Sbjct: 108 KSPKCPDCIGLPEERLLVLHGLAQGFAVITISSAQRCWTFGKEVLVVKDIIEWWIGRKKL 167
Query: 509 EKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSY 568
EKLPLVALGASSGGYFVS LA ++FSS LMIAEG+F+Q+D+ DYPPTLFVHMPKD Y
Sbjct: 168 EKLPLVALGASSGGYFVSVLATAMKFSSTVLMIAEGMFEQIDVKGDYPPTLFVHMPKDLY 227
Query: 569 RQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFID 628
RQ+KI E++ VL++KGIDV +EC+EFPLS + DRIPGL+QA S LFE FQEKGFID
Sbjct: 228 RQQKIDEYVEVLKDKGIDVGVVECLEFPLSPSTLADRIPGLDQALSRNLFEFFQEKGFID 287
Query: 629 ENGYMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+NGYMR DGR+ +WK+A E A+ L+KNLV H+QEELNLAFAYHEMTS+ S+QIFKW
Sbjct: 288 KNGYMRKDGRQIKWKKAFEEKKALLLDKNLVPHIQEELNLAFAYHEMTSVHSDQIFKWL 346
>gi|388503550|gb|AFK39841.1| unknown [Lotus japonicus]
Length = 347
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 203/302 (67%), Positives = 253/302 (83%)
Query: 386 NEKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVH 445
EKP+ +Y + +QKW+SFDS V+L+PT E RNGTD+IWQ+P+SPK V+FLAHGCNGR+++
Sbjct: 41 GEKPSLAYEVSEQKWSSFDSLVKLHPTREFRNGTDLIWQVPESPKGVMFLAHGCNGRSIN 100
Query: 446 FWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVER 505
FWD+SP CPNCIGLPEERLLVLH L++GFA+ITISSA CW+ G+E LVV++I+ WW+ R
Sbjct: 101 FWDKSPECPNCIGLPEERLLVLHGLAQGFAVITISSARTCWSFGKEVLVVKDILEWWIGR 160
Query: 506 HKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPK 565
KLEKLPLVALGASSGGYFVS LA ++F+SI +MIAEG+F++MD+ EDYPPTLFVHMPK
Sbjct: 161 RKLEKLPLVALGASSGGYFVSVLATAVKFNSIVIMIAEGMFEEMDVKEDYPPTLFVHMPK 220
Query: 566 DSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKG 625
D YRQ+KI E++ VL+ KGIDV +ECMEFPLS N DRIPGL+ S LFE+FQ KG
Sbjct: 221 DLYRQQKIDEYVEVLKGKGIDVGVVECMEFPLSPNTLADRIPGLDLTLSRSLFEIFQGKG 280
Query: 626 FIDENGYMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
F+D+NGYMR DGRR +W++A+ E + L+K LV H+QEELNLAFA HEMTS+ S+QIFK
Sbjct: 281 FVDQNGYMRKDGRRIKWEKAVEEKKTLSLDKRLVPHIQEELNLAFASHEMTSIHSDQIFK 340
Query: 686 WF 687
WF
Sbjct: 341 WF 342
>gi|296086320|emb|CBI31761.3| unnamed protein product [Vitis vinifera]
Length = 312
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 208/302 (68%), Positives = 257/302 (85%), Gaps = 4/302 (1%)
Query: 387 EKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHF 446
++P+SS G++ Q S D VQ +PT+E RNGTDVIWQIPDSPKAVLFLAHGCNG+AV+F
Sbjct: 9 KQPSSSIGVMTQ---SSDPLVQFHPTIEFRNGTDVIWQIPDSPKAVLFLAHGCNGQAVNF 65
Query: 447 WDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERH 506
WDRSP+CP+C+GLPEERL+VLH+L+R FA++TISSAG+CW+ G+E L+V+ II+WW+E++
Sbjct: 66 WDRSPSCPDCVGLPEERLIVLHSLNRKFAVLTISSAGKCWSFGKESLIVKGIIQWWIEKN 125
Query: 507 KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKD 566
KLEKLPLVALGASSGGYFVS LA LRFSS+ LMIAEGLF ++DI +DYPPTLFVHMPKD
Sbjct: 126 KLEKLPLVALGASSGGYFVSVLATNLRFSSVTLMIAEGLFHKIDIMKDYPPTLFVHMPKD 185
Query: 567 SYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGF 626
R+++I E + +LRNKGIDVAE++CMEFPLS F DRIPG+++ S KLF L ++KGF
Sbjct: 186 QARKQRIAENMEILRNKGIDVAEVKCMEFPLSPYFLADRIPGVDKNISAKLFGLLRDKGF 245
Query: 627 IDENGYMRSDGRRTRWKEALRESNAIFLNKNL-VHHVQEELNLAFAYHEMTSLPSEQIFK 685
I+ NGYMR+DGR T WK+ L ESN IF + ++ VHH+QEELNLAFAYHEMTSL SEQIF
Sbjct: 246 INVNGYMRNDGRATNWKKVLSESNIIFPDWDIFVHHIQEELNLAFAYHEMTSLQSEQIFN 305
Query: 686 WF 687
WF
Sbjct: 306 WF 307
>gi|357494337|ref|XP_003617457.1| hypothetical protein MTR_5g091790 [Medicago truncatula]
gi|355518792|gb|AET00416.1| hypothetical protein MTR_5g091790 [Medicago truncatula]
Length = 358
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 203/309 (65%), Positives = 255/309 (82%), Gaps = 1/309 (0%)
Query: 379 DGERGIRNEKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHG 438
DG+ ++ KP+ +Y KQ WNSFDS V+ +PT E RNGTD+IWQIP+SPK VLFLAHG
Sbjct: 46 DGDVKVK-VKPSLAYESHKQIWNSFDSLVKFHPTREFRNGTDLIWQIPESPKGVLFLAHG 104
Query: 439 CNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREI 498
CNG+A++FWD+S CPNCIGLPEERLLVLH L++GFA+ITISSA RCW+ G+E LVV++I
Sbjct: 105 CNGKAINFWDKSIECPNCIGLPEERLLVLHGLAQGFAVITISSANRCWSFGKELLVVKDI 164
Query: 499 IRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPT 558
+ WW+ + KLEKLPLVALGASSGGYFVS LA +F+SI LMIAEG+FD++DI EDYPPT
Sbjct: 165 LEWWIGKRKLEKLPLVALGASSGGYFVSLLASSKKFNSIVLMIAEGMFDEIDINEDYPPT 224
Query: 559 LFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLF 618
LFVHMPKD YRQ+KI E+++VL++KGID +ECMEFPLS + DRIPG++Q S LF
Sbjct: 225 LFVHMPKDLYRQQKIDEYVMVLKDKGIDAGVVECMEFPLSPDTLADRIPGIDQTLSRNLF 284
Query: 619 ELFQEKGFIDENGYMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSL 678
E+F+EKGFIDE GYMR DGR+ +W+ L E + ++K+L+ H+QEELNLAF+YHEMTS+
Sbjct: 285 EIFKEKGFIDEKGYMRKDGRKMKWQNILDERKNLLIDKHLIPHIQEELNLAFSYHEMTSV 344
Query: 679 PSEQIFKWF 687
S+QIFKWF
Sbjct: 345 HSDQIFKWF 353
>gi|225446873|ref|XP_002279871.1| PREDICTED: uncharacterized protein LOC100242074 [Vitis vinifera]
Length = 346
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 208/302 (68%), Positives = 257/302 (85%), Gaps = 4/302 (1%)
Query: 387 EKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHF 446
++P+SS G++ Q S D VQ +PT+E RNGTDVIWQIPDSPKAVLFLAHGCNG+AV+F
Sbjct: 43 KQPSSSIGVMTQ---SSDPLVQFHPTIEFRNGTDVIWQIPDSPKAVLFLAHGCNGQAVNF 99
Query: 447 WDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERH 506
WDRSP+CP+C+GLPEERL+VLH+L+R FA++TISSAG+CW+ G+E L+V+ II+WW+E++
Sbjct: 100 WDRSPSCPDCVGLPEERLIVLHSLNRKFAVLTISSAGKCWSFGKESLIVKGIIQWWIEKN 159
Query: 507 KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKD 566
KLEKLPLVALGASSGGYFVS LA LRFSS+ LMIAEGLF ++DI +DYPPTLFVHMPKD
Sbjct: 160 KLEKLPLVALGASSGGYFVSVLATNLRFSSVTLMIAEGLFHKIDIMKDYPPTLFVHMPKD 219
Query: 567 SYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGF 626
R+++I E + +LRNKGIDVAE++CMEFPLS F DRIPG+++ S KLF L ++KGF
Sbjct: 220 QARKQRIAENMEILRNKGIDVAEVKCMEFPLSPYFLADRIPGVDKNISAKLFGLLRDKGF 279
Query: 627 IDENGYMRSDGRRTRWKEALRESNAIFLNKNL-VHHVQEELNLAFAYHEMTSLPSEQIFK 685
I+ NGYMR+DGR T WK+ L ESN IF + ++ VHH+QEELNLAFAYHEMTSL SEQIF
Sbjct: 280 INVNGYMRNDGRATNWKKVLSESNIIFPDWDIFVHHIQEELNLAFAYHEMTSLQSEQIFN 339
Query: 686 WF 687
WF
Sbjct: 340 WF 341
>gi|147790532|emb|CAN65314.1| hypothetical protein VITISV_041340 [Vitis vinifera]
Length = 346
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 208/302 (68%), Positives = 256/302 (84%), Gaps = 4/302 (1%)
Query: 387 EKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHF 446
++P+SS G++ Q S D VQ +PT+E RNGTDVIWQIPDSPKAVLFLAHGCNG+AV+F
Sbjct: 43 KQPSSSIGVMTQ---SSDPLVQFHPTIEFRNGTDVIWQIPDSPKAVLFLAHGCNGQAVNF 99
Query: 447 WDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERH 506
WDRSP+C +C+GLPEERL+VLH+L+R FA++TISSAG+CW+ G+E L+V+ II+WW+E++
Sbjct: 100 WDRSPSCLDCVGLPEERLIVLHSLNRKFAVLTISSAGKCWSFGKESLIVKGIIQWWIEKN 159
Query: 507 KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKD 566
KLEKLPLVALGASSGGYFVS LA LRFSSI LMIAEGLF ++DI +DYPPTLFVHMPKD
Sbjct: 160 KLEKLPLVALGASSGGYFVSVLATNLRFSSITLMIAEGLFHKIDIMKDYPPTLFVHMPKD 219
Query: 567 SYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGF 626
R+++I E + +LRNKGIDVAE++CMEFPLS F DRIPG+++ S KLF L ++KGF
Sbjct: 220 QARKQRIAENMEILRNKGIDVAEVKCMEFPLSPYFLADRIPGVDKNISAKLFGLLRDKGF 279
Query: 627 IDENGYMRSDGRRTRWKEALRESNAIFLNKNL-VHHVQEELNLAFAYHEMTSLPSEQIFK 685
I+ NGYMR+DGR T WK+ L ESN IF + ++ VHH+QEELNLAFAYHEMTSL SEQIF
Sbjct: 280 INVNGYMRNDGRATNWKKVLSESNIIFPDWDIFVHHIQEELNLAFAYHEMTSLQSEQIFN 339
Query: 686 WF 687
WF
Sbjct: 340 WF 341
>gi|449450852|ref|XP_004143176.1| PREDICTED: uncharacterized protein LOC101217366 isoform 1 [Cucumis
sativus]
gi|449525333|ref|XP_004169672.1| PREDICTED: uncharacterized LOC101217366 isoform 1 [Cucumis sativus]
Length = 346
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 199/291 (68%), Positives = 242/291 (83%)
Query: 397 KQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNC 456
KQKW+SF+S VQ NP E RNGTD+I QIPDSPK+VLF AHGCNG+AV+FWDRSP CP+C
Sbjct: 51 KQKWDSFNSLVQFNPNREFRNGTDIISQIPDSPKSVLFFAHGCNGKAVNFWDRSPECPSC 110
Query: 457 IGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVAL 516
+GLPEERL+VLHALS+ FA++TISS+ +CWTLG+E VV++IIRWWVE++KLEKLPLV L
Sbjct: 111 VGLPEERLIVLHALSKHFAVLTISSSQKCWTLGKELAVVKDIIRWWVEKNKLEKLPLVGL 170
Query: 517 GASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEF 576
GASSGGYF+S LA +F SI LMIAEGL++QM++ E YPPTLFVHMPKD +RQ+KI E
Sbjct: 171 GASSGGYFLSVLATKFKFDSITLMIAEGLYEQMEVTERYPPTLFVHMPKDHFRQQKINEN 230
Query: 577 LVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
+ L+ KGIDVAEIEC+EF LS +F DRIPGL+++ S KLFELFQ KGF+D NG M D
Sbjct: 231 MEFLKGKGIDVAEIECLEFALSPHFLADRIPGLDKSVSAKLFELFQAKGFVDGNGKMLKD 290
Query: 637 GRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
GR TRWKEAL+ S + +L +H+QEELNLAFAYHEMTSL +++IF WF
Sbjct: 291 GRATRWKEALKASGISLPDNSLTNHIQEELNLAFAYHEMTSLEADKIFTWF 341
>gi|449450854|ref|XP_004143177.1| PREDICTED: uncharacterized protein LOC101217366 isoform 2 [Cucumis
sativus]
gi|449525335|ref|XP_004169673.1| PREDICTED: uncharacterized LOC101217366 isoform 2 [Cucumis sativus]
Length = 340
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 198/290 (68%), Positives = 241/290 (83%)
Query: 397 KQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNC 456
KQKW+SF+S VQ NP E RNGTD+I QIPDSPK+VLF AHGCNG+AV+FWDRSP CP+C
Sbjct: 51 KQKWDSFNSLVQFNPNREFRNGTDIISQIPDSPKSVLFFAHGCNGKAVNFWDRSPECPSC 110
Query: 457 IGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVAL 516
+GLPEERL+VLHALS+ FA++TISS+ +CWTLG+E VV++IIRWWVE++KLEKLPLV L
Sbjct: 111 VGLPEERLIVLHALSKHFAVLTISSSQKCWTLGKELAVVKDIIRWWVEKNKLEKLPLVGL 170
Query: 517 GASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEF 576
GASSGGYF+S LA +F SI LMIAEGL++QM++ E YPPTLFVHMPKD +RQ+KI E
Sbjct: 171 GASSGGYFLSVLATKFKFDSITLMIAEGLYEQMEVTERYPPTLFVHMPKDHFRQQKINEN 230
Query: 577 LVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
+ L+ KGIDVAEIEC+EF LS +F DRIPGL+++ S KLFELFQ KGF+D NG M D
Sbjct: 231 MEFLKGKGIDVAEIECLEFALSPHFLADRIPGLDKSVSAKLFELFQAKGFVDGNGKMLKD 290
Query: 637 GRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
GR TRWKEAL+ S + +L +H+QEELNLAFAYHEMTSL +++IF W
Sbjct: 291 GRATRWKEALKASGISLPDNSLTNHIQEELNLAFAYHEMTSLEADKIFTW 340
>gi|255639007|gb|ACU19804.1| unknown [Glycine max]
Length = 277
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 179/253 (70%), Positives = 213/253 (84%)
Query: 389 PNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWD 448
P +Y L KQKWNSFDS V L+PT E RN TD+IWQ+P+SPK VLFLAHGCNGRA++FWD
Sbjct: 25 PILAYELPKQKWNSFDSLVHLHPTREFRNETDLIWQVPESPKGVLFLAHGCNGRAINFWD 84
Query: 449 RSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKL 508
+SP CP+CIGLPEERLLVLH L++GFA+ITISSA RCWT G+E LVV++II WW+ R KL
Sbjct: 85 KSPKCPDCIGLPEERLLVLHGLAQGFAVITISSAQRCWTFGKEVLVVKDIIEWWIGRKKL 144
Query: 509 EKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSY 568
EKLPLVALGASSGGYFVS LA ++FSS LMIAEG+F+Q+D+ DYPPTLFVHMPKD Y
Sbjct: 145 EKLPLVALGASSGGYFVSVLATAMKFSSTVLMIAEGMFEQIDVKGDYPPTLFVHMPKDLY 204
Query: 569 RQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFID 628
RQ+KI E++ VL++KGIDV +EC+EFPLS + DRIPGL+QA S LFE FQEKGFID
Sbjct: 205 RQQKIDEYVEVLKDKGIDVGVVECLEFPLSPSTLADRIPGLDQALSRNLFEFFQEKGFID 264
Query: 629 ENGYMRSDGRRTR 641
+NGYMR DGR+ +
Sbjct: 265 KNGYMRKDGRQIK 277
>gi|21555258|gb|AAM63817.1| unknown [Arabidopsis thaliana]
Length = 344
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/293 (62%), Positives = 231/293 (78%), Gaps = 1/293 (0%)
Query: 395 LVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCP 454
L KQ N+ +Q PT+E++N T +IW IP+SPKAVLF+AHGC+ +A FWD+S +CP
Sbjct: 48 LPKQISNNVKPELQSQPTLEIQNETQLIWTIPNSPKAVLFIAHGCHRKASDFWDKSSDCP 107
Query: 455 NCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLV 514
C GLPEER+L+ AL+R FA++T+SS G CWT G+E+ +V II+ WVE+HKLE+LPLV
Sbjct: 108 QCTGLPEERILMRFALARKFAVLTVSSTGTCWTFGKEKTIVGSIIKSWVEKHKLERLPLV 167
Query: 515 ALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIG 574
ALGASSGGYFVSALA ++FSSI LMIAEG+FDQ+ I + YPPTLFVHMPKD YRQ+KI
Sbjct: 168 ALGASSGGYFVSALATEMQFSSIVLMIAEGVFDQISISKQYPPTLFVHMPKDVYRQQKIR 227
Query: 575 EFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMR 634
EFL LR +GID AEIEC++ P+S +F DRIPGL+ S KLF+LF++KGF+DE GYM+
Sbjct: 228 EFLEGLRMEGIDAAEIECLDLPISPDFLADRIPGLDSDVSAKLFKLFRDKGFVDEKGYMK 287
Query: 635 SDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
DGRRT WK+AL I L ++L+ V+EELNLA+AYHEMTSL S+QIF WF
Sbjct: 288 RDGRRTPWKQAL-SGYKISLEQSLITPVEEELNLAYAYHEMTSLQSDQIFSWF 339
>gi|15231138|ref|NP_191439.1| uncharacterized protein [Arabidopsis thaliana]
gi|7630075|emb|CAB88297.1| putative protein [Arabidopsis thaliana]
gi|26450849|dbj|BAC42532.1| unknown protein [Arabidopsis thaliana]
gi|332646312|gb|AEE79833.1| uncharacterized protein [Arabidopsis thaliana]
Length = 344
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/293 (62%), Positives = 231/293 (78%), Gaps = 1/293 (0%)
Query: 395 LVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCP 454
L KQ N+ +Q PT+E++N T +IW IP+SPKAVLF+AHGC+ +A FWD+S +CP
Sbjct: 48 LPKQISNNVKPELQSQPTLEIQNETQLIWTIPNSPKAVLFIAHGCHRKASDFWDKSSDCP 107
Query: 455 NCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLV 514
C GLPEER+L+ AL+R FA++T+SS G CWT G+E+ +V II+ WVE+HKLE+LPLV
Sbjct: 108 QCTGLPEERILMRFALARKFAVLTVSSTGTCWTFGKEKTIVGSIIKSWVEKHKLERLPLV 167
Query: 515 ALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIG 574
ALGASSGGYFVSALA ++FSSI LMIAEG+FDQ+ I + YPPTLFVHMPKD YRQ+KI
Sbjct: 168 ALGASSGGYFVSALATEMQFSSIVLMIAEGVFDQISISKQYPPTLFVHMPKDVYRQQKIR 227
Query: 575 EFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMR 634
EFL LR +GID AEIEC++ P+S +F DRIPGL+ S KLF+LF++KGF+DE GYM+
Sbjct: 228 EFLEGLRMEGIDAAEIECLDLPISPDFLADRIPGLDSDVSAKLFKLFRDKGFVDEKGYMK 287
Query: 635 SDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
DGRRT WK+AL I L ++L+ V+EELNLA+AYHEMTSL S+QIF WF
Sbjct: 288 RDGRRTPWKQAL-SGYKISLEQSLITPVEEELNLAYAYHEMTSLQSDQIFSWF 339
>gi|297817194|ref|XP_002876480.1| hypothetical protein ARALYDRAFT_348948 [Arabidopsis lyrata subsp.
lyrata]
gi|297322318|gb|EFH52739.1| hypothetical protein ARALYDRAFT_348948 [Arabidopsis lyrata subsp.
lyrata]
Length = 369
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 178/278 (64%), Positives = 223/278 (80%), Gaps = 1/278 (0%)
Query: 410 NPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHA 469
P VE++NGT +IW IP+SPKAVLF+AHGC+ +A FWD+S +CP C GLPEER+L+ A
Sbjct: 88 KPAVEIQNGTQLIWTIPNSPKAVLFIAHGCHRKASDFWDKSSDCPECTGLPEERILIRFA 147
Query: 470 LSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALA 529
L+R FA++T+SSAG CWT G+E+ +V ++I+ WVE+H LE+LPLV LGASSGGYFVSALA
Sbjct: 148 LARKFAVLTVSSAGTCWTFGKEKSIVGKMIKSWVEKHNLERLPLVGLGASSGGYFVSALA 207
Query: 530 KGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAE 589
++FSSI LMIAEG+FD++ I + YPPTLFVHMPKD YRQ+KI EFL LR +GID AE
Sbjct: 208 TDMQFSSIVLMIAEGVFDKISISKQYPPTLFVHMPKDVYRQQKIREFLEGLRMEGIDAAE 267
Query: 590 IECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALRES 649
IEC++ P+S F DRIPGL STKLF+LFQ+KGF+DE GYM+ DGRRT WK+AL
Sbjct: 268 IECLDLPISPEFLADRIPGLRPDVSTKLFKLFQDKGFVDEKGYMKRDGRRTPWKQAL-SG 326
Query: 650 NAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
I L ++L+ V+EELNLA+AYHEMTSL S+QIF WF
Sbjct: 327 YKISLEESLITPVEEELNLAYAYHEMTSLQSDQIFSWF 364
>gi|302807975|ref|XP_002985682.1| hypothetical protein SELMODRAFT_122862 [Selaginella moellendorffii]
gi|300146591|gb|EFJ13260.1| hypothetical protein SELMODRAFT_122862 [Selaginella moellendorffii]
Length = 331
Score = 295 bits (754), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 138/283 (48%), Positives = 189/283 (66%), Gaps = 2/283 (0%)
Query: 407 VQLNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLL 465
V+L P + ++G++ +WQ+P P +A+LFLAHGC+ RA FWD SPNCP+CIGLPE+RL
Sbjct: 44 VELLPILGSKHGSEFLWQMPPRPARAILFLAHGCHCRATFFWDTSPNCPHCIGLPEDRLY 103
Query: 466 VLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
VL AL +A+I ISS CW + V I++ W+ ++L+ LP+ ALGASSGGYFV
Sbjct: 104 VLKALESRYAVIAISSTNTCWDRQADTGKVVSILQGWIRENRLDGLPIFALGASSGGYFV 163
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
S LA LRF++ LMI+EG F+ MD DYP TLF HMP+D R + + +L+ KG+
Sbjct: 164 STLALQLRFNATVLMISEGHFEVMDFDRDYPATLFAHMPRDERRAIGVARSMKLLKEKGV 223
Query: 586 DVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEA 645
EI+C+ P++ +DRIPG+ S K+++ +E G +DEN +MRSDGRR W+
Sbjct: 224 GTEEIQCLALPVTPQLLSDRIPGVGDEASRKIYQALKEAGILDENDFMRSDGRRVGWRAV 283
Query: 646 LRESNAIFLN-KNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
L NA ++ K HH+QEELN A+AYHEMTS P E+ F WF
Sbjct: 284 LERRNAFPVSLKKWEHHIQEELNAAYAYHEMTSAPLERTFDWF 326
>gi|413926024|gb|AFW65956.1| hypothetical protein ZEAMMB73_425062 [Zea mays]
Length = 358
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 144/296 (48%), Positives = 197/296 (66%), Gaps = 14/296 (4%)
Query: 406 SVQLNPTVELRNGTDVIWQIP--DSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEE 462
+V+L+ +V RNG +V+WQ+P SP +A LF A GC RA F+D SP CP C GLPEE
Sbjct: 57 AVRLDASVTRRNGNEVLWQLPPPSSPLRAALFAAPGCTIRATDFFDASPGCPRCTGLPEE 116
Query: 463 RLLVLHALSRGFAIITISSAGRCWTL--------GEERLVVREIIRWWV-ERH-KLEKLP 512
R ALSRG+A++ +SS +CW+L G E V+ II+WW E++ +L LP
Sbjct: 117 RRFTRAALSRGYAVLAVSSRAKCWSLDDAGGGEDGSELAAVQSIIKWWTTEKYPQLAGLP 176
Query: 513 LVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQM-DIPEDYPPTLFVHMPKDSYRQR 571
L +GASSGGYF+SALA ++FSS+ +MIAEG++ M +IP YPP LFVHMPKD+ R R
Sbjct: 177 LFGIGASSGGYFLSALAARVKFSSVTVMIAEGMYGTMAEIPTGYPPALFVHMPKDTERAR 236
Query: 572 KIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENG 631
+ E + LR K +DV EI+C +F +S F +R+PGL A + L ++ + KGF+DE G
Sbjct: 237 LVAESMDKLREKHVDVREIQCDDFAVSAEFLAERVPGLTWAFANALVDMLRRKGFVDEKG 296
Query: 632 YMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
++ DGRRT WKEA E+ + L HV EELN+A+AYHE TSL + ++F+WF
Sbjct: 297 MLKKDGRRTPWKEAAEEAKVLPERFGLERHVTEELNVAYAYHEFTSLKNTEVFQWF 352
>gi|302785227|ref|XP_002974385.1| hypothetical protein SELMODRAFT_442394 [Selaginella moellendorffii]
gi|300157983|gb|EFJ24607.1| hypothetical protein SELMODRAFT_442394 [Selaginella moellendorffii]
Length = 331
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 137/283 (48%), Positives = 190/283 (67%), Gaps = 2/283 (0%)
Query: 407 VQLNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLL 465
V+L P + ++G++ +WQ+P P +A+LFLAHGC+ RA FWD+SPNCP+CIGLPE+RL
Sbjct: 44 VELLPILGSKHGSEFLWQMPPRPARAILFLAHGCHCRATFFWDKSPNCPHCIGLPEDRLY 103
Query: 466 VLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
VL AL +A+I ISS CW + V I++ W+ ++L+ LP+ ALGASSGGYFV
Sbjct: 104 VLKALESRYAVIAISSTNTCWDRQADTGKVVSILQGWIRENRLDGLPIFALGASSGGYFV 163
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
S LA LRF++ LMI+EG F+ MD DYP TLF HMP+D R + + +L+ KG+
Sbjct: 164 STLALRLRFNATVLMISEGHFEVMDFDRDYPATLFAHMPRDERRAIGVARSMKLLKEKGV 223
Query: 586 DVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEA 645
EI+C+ P++ +DRIPG+ S K+++ +E G +DEN +MRSDGRR +
Sbjct: 224 GTEEIQCLALPVTPQLLSDRIPGVGDEASRKIYQALKEAGILDENDFMRSDGRRLVCRAV 283
Query: 646 LRESNAIFLN-KNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
L NA ++ K HH+QEELN A+AYHEMTS+P E+ F WF
Sbjct: 284 LDRRNAFPVSLKKWEHHIQEELNAAYAYHEMTSVPLERTFDWF 326
>gi|326515444|dbj|BAK03635.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 288 bits (737), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 190/292 (65%), Gaps = 10/292 (3%)
Query: 406 SVQLNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
+++L+P V R+G +V+WQ+P P +A +F+A GC RA F+D SP CP C GLPEER
Sbjct: 41 AIRLDPRVVRRSGNEVLWQLPPGPPRAAVFVAPGCTIRATDFFDASPGCPRCAGLPEERR 100
Query: 465 LVLHALSRGFAIITISSAGRCWTL------GEERLVVREIIRWWV--ERHKLEKLPLVAL 516
AL RG+A++ +SS CW+L G E V II WW E +L LPLV +
Sbjct: 101 FTREALRRGYAVLAVSSRAECWSLDAASGDGSELAAVGSIIEWWAKEEHPELSGLPLVGI 160
Query: 517 GASSGGYFVSALAKGLRFSSIALMIAEGLFDQMD-IPEDYPPTLFVHMPKDSYRQRKIGE 575
GASSGGYFVSALA +RFSS+A+MIAEG+F M+ IP YPP LFVHMPKD+ R + +
Sbjct: 161 GASSGGYFVSALAARVRFSSVAIMIAEGVFAAMEEIPPGYPPALFVHMPKDAERAQMVAA 220
Query: 576 FLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRS 635
+ L++K +DV EI+C F +S F RIPGL + L ++ ++K F+DE G++++
Sbjct: 221 SIDRLKSKHVDVREIQCNAFAVSAEFLAARIPGLTPTVADGLVDMLRQKSFLDEKGFLKN 280
Query: 636 DGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
DGR T WK A E+ + L HV EELN+A+AYHE TSL + +IFKWF
Sbjct: 281 DGRSTPWKAAAEEAKVLPEGFRLERHVTEELNIAYAYHEFTSLENGEIFKWF 332
>gi|242064562|ref|XP_002453570.1| hypothetical protein SORBIDRAFT_04g008280 [Sorghum bicolor]
gi|241933401|gb|EES06546.1| hypothetical protein SORBIDRAFT_04g008280 [Sorghum bicolor]
Length = 369
Score = 285 bits (729), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/303 (47%), Positives = 194/303 (64%), Gaps = 22/303 (7%)
Query: 407 VQLNPTVELRNGTDVIWQIP--DSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEER 463
V+L+ V RNG +V+WQ+P SP +A LF A GC RA F+D SP CP C GLPEER
Sbjct: 61 VRLDARVARRNGNEVLWQLPRPSSPLRAALFAAPGCTIRATDFFDASPGCPRCTGLPEER 120
Query: 464 LLVLHALSRGFAIITISSAGRCWTLG----------------EERLVVREIIRWWV-ERH 506
ALSRG+A++ +SS +CW+L E V+ II WW E++
Sbjct: 121 RFTREALSRGYAVLAVSSRAKCWSLDDDDAAAGGGGGDGEDGSELAAVQSIIEWWTTEKY 180
Query: 507 -KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQM-DIPEDYPPTLFVHMP 564
+L LPLV +GASSGGYF+SALA ++FSS+A+MIAEG++ M DIP YPP LFVHMP
Sbjct: 181 PQLAGLPLVGIGASSGGYFLSALAARVKFSSVAVMIAEGVYGAMADIPTGYPPALFVHMP 240
Query: 565 KDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEK 624
KD+ R R + + + LR K +DV EI C +F +S R+PGL +A + L ++ + K
Sbjct: 241 KDTERARLVADSMGKLREKHVDVREIRCDDFAVSAELLAGRVPGLTRAFADALVDVLRRK 300
Query: 625 GFIDENGYMRSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIF 684
GF+DE G+++ DGR+T WKEA +E+ + L HV EE N+A+AYHE TSL + +IF
Sbjct: 301 GFVDEKGFLKKDGRKTLWKEAAKEAKVLPEGFGLERHVTEEFNVAYAYHEFTSLKNTEIF 360
Query: 685 KWF 687
+WF
Sbjct: 361 EWF 363
>gi|125538712|gb|EAY85107.1| hypothetical protein OsI_06459 [Oryza sativa Indica Group]
Length = 442
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 145/293 (49%), Positives = 191/293 (65%), Gaps = 11/293 (3%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSP---KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPE 461
++V+L+ VE R+G +V+WQ+P P +A +F+A GC RA F+D SP CP C GLPE
Sbjct: 47 AAVRLDARVERRSGNEVLWQLPPPPTPPRAAVFVAPGCTIRATDFFDASPGCPRCAGLPE 106
Query: 462 ERLLVLHALSRGFAIITISSAGRCWTL----GEERLVVREIIRWWV-ERH--KLEKLPLV 514
ER AL RG+A++ +SS CW+L G E V II WWV ERH +L LPLV
Sbjct: 107 ERRFTREALRRGYAVLAVSSRAECWSLDTGDGGELAAVESIIEWWVKERHPNQLAGLPLV 166
Query: 515 ALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMD-IPEDYPPTLFVHMPKDSYRQRKI 573
+GASSGGYF+SALA +RFSS+A+MIAEG+F M+ IP YPP LFV MPKD R R++
Sbjct: 167 GIGASSGGYFLSALAARVRFSSVAIMIAEGVFATMEEIPARYPPALFVQMPKDGERAREV 226
Query: 574 GEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYM 633
+ LR + V EI+C EF +S F RIPGL A + L ++ + KGF+DE G++
Sbjct: 227 AASMGKLRGSRVSVREIQCGEFAVSAQFLAARIPGLTLAVADGLVDVLRRKGFVDEKGFL 286
Query: 634 RSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
++DGR T WK+A E+ + L HV EELNLA+AYHE TSL + +IF W
Sbjct: 287 KNDGRSTPWKKAAEEAKILPEEFRLERHVTEELNLAYAYHEFTSLKNGEIFDW 339
>gi|115445171|ref|NP_001046365.1| Os02g0230200 [Oryza sativa Japonica Group]
gi|49388953|dbj|BAD26173.1| unknown protein [Oryza sativa Japonica Group]
gi|113535896|dbj|BAF08279.1| Os02g0230200 [Oryza sativa Japonica Group]
gi|125581393|gb|EAZ22324.1| hypothetical protein OsJ_05978 [Oryza sativa Japonica Group]
gi|215740609|dbj|BAG97265.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 352
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 145/292 (49%), Positives = 189/292 (64%), Gaps = 11/292 (3%)
Query: 407 VQLNPTVELRNGTDVIWQIPDSP---KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEER 463
V+L+ VE R+G +V+WQ+P +A +F+A GC RA F+D SP CP C GLPEER
Sbjct: 53 VRLDARVERRSGNEVLWQLPPPTTPPRAAVFVAPGCTIRATDFFDASPGCPRCAGLPEER 112
Query: 464 LLVLHALSRGFAIITISSAGRCWTL----GEERLVVREIIRWWV-ERH--KLEKLPLVAL 516
AL RG+A++ +SS CW+L G E V II WWV ERH +L LPLV +
Sbjct: 113 RFTREALRRGYAVLAVSSRAECWSLDTGDGGELAAVESIIEWWVKERHPNQLAGLPLVGI 172
Query: 517 GASSGGYFVSALAKGLRFSSIALMIAEGLFDQMD-IPEDYPPTLFVHMPKDSYRQRKIGE 575
GASSGGYF+SALA +RFSS+A+MIAEG+F M+ IP YPP LFV MPKD R R++
Sbjct: 173 GASSGGYFLSALAARVRFSSVAIMIAEGVFATMEEIPARYPPALFVQMPKDGERAREVAA 232
Query: 576 FLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRS 635
+ LR + V EI+C EF +S F RIPGL A + L ++ + KGF+DE G++++
Sbjct: 233 SMGKLRGSRVSVREIQCGEFAVSAQFLAARIPGLTLAVADGLVDVLRRKGFVDEKGFLKN 292
Query: 636 DGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
DGR T WK+A E+ + L HV EELNLA+AYHE TSL + +IF WF
Sbjct: 293 DGRSTPWKKAAEEAKILPEEFRLERHVTEELNLAYAYHEFTSLKNGEIFDWF 344
>gi|125538709|gb|EAY85104.1| hypothetical protein OsI_06456 [Oryza sativa Indica Group]
Length = 352
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 146/292 (50%), Positives = 190/292 (65%), Gaps = 11/292 (3%)
Query: 407 VQLNPTVELRNGTDVIWQIPDSP---KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEER 463
V+L+ VE R+G +V+WQ+P P +A +F+A GC RA F+D SP CP C GLPEER
Sbjct: 53 VRLDARVERRSGNEVLWQLPPPPTPPRAAVFVAPGCTIRATDFFDASPGCPRCAGLPEER 112
Query: 464 LLVLHALSRGFAIITISSAGRCWTL----GEERLVVREIIRWWV-ERH--KLEKLPLVAL 516
AL RG+A++ +SS CW+L G E V II WWV ERH +L LPLV +
Sbjct: 113 RFTREALRRGYAVLAVSSRAECWSLDTGDGGELAAVESIIEWWVKERHPNQLAGLPLVGI 172
Query: 517 GASSGGYFVSALAKGLRFSSIALMIAEGLFDQMD-IPEDYPPTLFVHMPKDSYRQRKIGE 575
GASSGGYF+SALA +RFSS+A+MIAEG+F M+ IP YPP LFV MPKD R R++
Sbjct: 173 GASSGGYFLSALAARVRFSSVAIMIAEGVFATMEEIPARYPPALFVQMPKDGERAREVAA 232
Query: 576 FLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRS 635
+ LR + V EI+C EF +S F RIPGL A + L ++ + KGF+DE G++++
Sbjct: 233 SMGKLRGSRVSVREIQCGEFAVSAQFLAARIPGLTLAVADGLVDVLRRKGFVDEKGFLKN 292
Query: 636 DGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
DGR T WK+A E+ + L HV EELNLA+AYHE TSL + +IF WF
Sbjct: 293 DGRSTPWKKAAEEAKILPEEFRLERHVTEELNLAYAYHEFTSLKNGEIFDWF 344
>gi|297720411|ref|NP_001172567.1| Os01g0748200 [Oryza sativa Japonica Group]
gi|255673685|dbj|BAH91297.1| Os01g0748200 [Oryza sativa Japonica Group]
Length = 356
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 126/280 (45%), Positives = 179/280 (63%), Gaps = 1/280 (0%)
Query: 409 LNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
L+PTV G IW++P +P +AVLF+AHGC+ R +FW SP CP C+GLPE+ +
Sbjct: 72 LSPTVGSIGGARAIWELPAAPARAVLFVAHGCHCRPENFWPPSPRCPGCVGLPEDVAITE 131
Query: 468 HALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
AL R FA++ ++SA CW++G+E + IR W + L LP+ ALGASSGGYFVS
Sbjct: 132 RALRRRFAVLALASARECWSMGKEVSAAKRGIRSWATENGLADLPVAALGASSGGYFVSR 191
Query: 528 LAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDV 587
LA ++ +++ LMIAEG F P YPP +F+HMPKD R + +LR+ G++V
Sbjct: 192 LAAEMKLAAVVLMIAEGSFGPGGAPAGYPPAMFLHMPKDQRRAALVERNSKMLRSNGVEV 251
Query: 588 AEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALR 647
E+ +E PL+ +DRIPGL + S +++ +F E GF+DE G+MR DGR T WK+A+
Sbjct: 252 NELRSLELPLTPTMLSDRIPGLNRGLSERIWRVFTEDGFVDERGFMRKDGRATPWKDAVV 311
Query: 648 ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+ H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 312 KRGFWEEVAGWADHIQEELNLAYGYHEMTSLHTDEMFDWI 351
>gi|57899506|dbj|BAD86968.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125572016|gb|EAZ13531.1| hypothetical protein OsJ_03447 [Oryza sativa Japonica Group]
Length = 379
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 126/280 (45%), Positives = 179/280 (63%), Gaps = 1/280 (0%)
Query: 409 LNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
L+PTV G IW++P +P +AVLF+AHGC+ R +FW SP CP C+GLPE+ +
Sbjct: 95 LSPTVGSIGGARAIWELPAAPARAVLFVAHGCHCRPENFWPPSPRCPGCVGLPEDVAITE 154
Query: 468 HALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
AL R FA++ ++SA CW++G+E + IR W + L LP+ ALGASSGGYFVS
Sbjct: 155 RALRRRFAVLALASARECWSMGKEVSAAKRGIRSWATENGLADLPVAALGASSGGYFVSR 214
Query: 528 LAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDV 587
LA ++ +++ LMIAEG F P YPP +F+HMPKD R + +LR+ G++V
Sbjct: 215 LAAEMKLAAVVLMIAEGSFGPGGAPAGYPPAMFLHMPKDQRRAALVERNSKMLRSNGVEV 274
Query: 588 AEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALR 647
E+ +E PL+ +DRIPGL + S +++ +F E GF+DE G+MR DGR T WK+A+
Sbjct: 275 NELRSLELPLTPTMLSDRIPGLNRGLSERIWRVFTEDGFVDERGFMRKDGRATPWKDAVV 334
Query: 648 ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+ H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 335 KRGFWEEVAGWADHIQEELNLAYGYHEMTSLHTDEMFDWI 374
>gi|125527698|gb|EAY75812.1| hypothetical protein OsI_03726 [Oryza sativa Indica Group]
Length = 379
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 126/280 (45%), Positives = 179/280 (63%), Gaps = 1/280 (0%)
Query: 409 LNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
L+PTV G IW++P +P +AVLF+AHGC+ R +FW SP CP C+GLPE+ +
Sbjct: 95 LSPTVGSIGGARAIWELPAAPARAVLFVAHGCHCRPENFWPPSPRCPGCVGLPEDVAITE 154
Query: 468 HALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
AL R FA++ ++SA CW++G+E + IR W + L LP+ ALGASSGGYFVS
Sbjct: 155 RALRRRFAVLALASARECWSMGKEVSAAKRGIRSWATENGLADLPVAALGASSGGYFVSR 214
Query: 528 LAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDV 587
LA ++ +++ LMIAEG F P YPP +F+HMPKD R + +LR+ G++V
Sbjct: 215 LAAEMKLAAVVLMIAEGSFGPGGAPAGYPPAMFLHMPKDQRRAALVERNSKMLRSNGVEV 274
Query: 588 AEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALR 647
E+ +E PL+ +DRIPGL + S +++ +F E GF+DE G+MR DGR T WK+A+
Sbjct: 275 NELRSLELPLTPTMLSDRIPGLNRGLSERIWRVFTEDGFVDERGFMRKDGRATPWKDAVV 334
Query: 648 ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+ H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 335 KRGFWEEVAGWADHIQEELNLAYGYHEMTSLHTDEMFDWI 374
>gi|357130965|ref|XP_003567114.1| PREDICTED: uncharacterized protein LOC100822531 [Brachypodium
distachyon]
Length = 509
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 181/287 (63%), Gaps = 1/287 (0%)
Query: 402 SFDSSVQLNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLP 460
S +S +PTVE +G IW++P +P +AVLF+AHGC R +FW S +CP C+GLP
Sbjct: 218 SSESPALSSPTVESIDGARAIWELPAAPARAVLFVAHGCRCRPENFWPPSQSCPGCVGLP 277
Query: 461 EERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASS 520
E+ + AL R FA++ ++S CW+L +E + I+ W ++ L + P+ ALGASS
Sbjct: 278 EDVAITARALRRRFAVLAVASNEECWSLEQEVGATKRAIQSWTAKNGLGERPVFALGASS 337
Query: 521 GGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVL 580
GGYFVS LA + +++ +MI+EG F P YPP +F+HMPKD R + + +L
Sbjct: 338 GGYFVSRLATEMNLAAVVIMISEGAFGPAGAPAGYPPAMFLHMPKDHRRAASVERNMKML 397
Query: 581 RNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRT 640
R G++V E+ +E PL+ +DRIPGL+ S ++ ++F E+GF+D+ GYMR DGR T
Sbjct: 398 RGNGVEVKEVRSVELPLTPTLLSDRIPGLDHGLSERILKMFSEEGFVDDRGYMRKDGRAT 457
Query: 641 RWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
WK+A+ + H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 458 PWKDAVVKRGFWEEVSRWAEHIQEELNLAYGYHEMTSLQADEMFDWI 504
>gi|168047752|ref|XP_001776333.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672293|gb|EDQ58832.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 359
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 129/297 (43%), Positives = 186/297 (62%), Gaps = 11/297 (3%)
Query: 399 KWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIG 458
+W+ S L+P ++ NGT+ ++Q+P SP+AV+FLAHGCN +A FWD+ P C C G
Sbjct: 50 RWDFPVYSKTLSPMRKVLNGTEYVYQVPPSPRAVIFLAHGCNCKATFFWDKHPECQTCCG 109
Query: 459 LPEERLLVLHALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGA 518
PEER V+ AL + +A+I ISS CW + R + ++ W++ L +LPL+ LGA
Sbjct: 110 APEERAFVIEALRKSYALIAISSQNECWQKEDSRR-AKLVLESWIDEFTLTELPLLGLGA 168
Query: 519 SSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPED-YPPTLFVHMPKDSYRQRKIGEFL 577
SSGGYF+S+ AK +RF +I +MIAEG F +P+ YPP LFVHM KD R +KI E +
Sbjct: 169 SSGGYFISSFAKVVRFDAIVVMIAEGRFQ--SVPDSLYPPVLFVHMIKDEMRAKKIRERI 226
Query: 578 VVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDG 637
VLR GI+ EI+C E +S +FF RIP ++ ST+L E F + ++D+ G+++ DG
Sbjct: 227 PVLRKAGIEADEIQCKEIKISDDFFAKRIPYIDVPTSTRLREFFLQSEYLDKRGHLKMDG 286
Query: 638 RRTRWKEALRESNAIFL-------NKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
R +WK +R L + N HH+ EELN+AF++HE TS P+ +I W
Sbjct: 287 RLMKWKAEMRSKGGQELLETLSRKSGNWEHHICEELNVAFSFHEFTSRPTGKIIAWL 343
>gi|242058681|ref|XP_002458486.1| hypothetical protein SORBIDRAFT_03g034540 [Sorghum bicolor]
gi|241930461|gb|EES03606.1| hypothetical protein SORBIDRAFT_03g034540 [Sorghum bicolor]
Length = 373
Score = 265 bits (676), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 128/281 (45%), Positives = 185/281 (65%), Gaps = 4/281 (1%)
Query: 410 NPTVELRNGTDVIWQIPDSP--KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
+PTVE G IW++P + +AVLF+AHGC R +FW RSP CP C+GLPE+ +
Sbjct: 87 SPTVETIEGARAIWELPAAAPARAVLFVAHGCRCRPENFWPRSPRCPGCVGLPEDVAITD 146
Query: 468 HALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
AL R FA++ ++SA CW+LG+E + +IR W ++ LE LP+VALGASSGGYFVS
Sbjct: 147 LALRRRFAVLAVTSARECWSLGKEVSAAKRVIRSWAAKNGLEGLPVVALGASSGGYFVSR 206
Query: 528 LAKGLRFSSIALMIAEGLF--DQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
LA + +++ +MIAEG F +P YPP +F+HMPKD R + + +L G+
Sbjct: 207 LAGKMSLAAVVIMIAEGAFGGSAGALPAVYPPAMFIHMPKDKRRAALVDKNSKMLMKNGV 266
Query: 586 DVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEA 645
+V E++ +E PL+ ++RIPGL + S +++ +F+E+GF+DE GYM+ DGR T WK+A
Sbjct: 267 EVKELQSLELPLTPTLLSERIPGLNRGLSERIWTVFREEGFVDEKGYMKEDGRGTPWKDA 326
Query: 646 LRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
L + L H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 327 LVKRGFWKEVSALADHIQEELNLAYGYHEMTSLQADEMFDW 367
>gi|326515374|dbj|BAK03600.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 370
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 123/271 (45%), Positives = 182/271 (67%), Gaps = 1/271 (0%)
Query: 418 GTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAI 476
G IW++P +P + VLF+AHGC+ R +FW S +CP C+GLPE+ + AL R FA+
Sbjct: 95 GACAIWELPAAPARGVLFVAHGCHCRPENFWPPSQHCPGCVGLPEDVAITARALRRRFAV 154
Query: 477 ITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSS 536
+ ++SAG CW+LG+E + I+ W ++ L LP+VALGASSGGYFVS LA + ++
Sbjct: 155 LAVASAGDCWSLGKEVGAAKRAIQSWTAKNGLGDLPVVALGASSGGYFVSRLAAKMSLTA 214
Query: 537 IALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFP 596
+ +MI+EG FD P YPP++F+HMPKD R +G + +LR+ G++V E++ +E P
Sbjct: 215 VVMMISEGAFDLAGAPPGYPPSMFLHMPKDRKRAALVGRNVKMLRSNGVEVMELQSLELP 274
Query: 597 LSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALRESNAIFLNK 656
L+ +DRI GL+ S +++++F E+GF+DE GYMR DGR T WK+A+ +S
Sbjct: 275 LTPTLLSDRIAGLDHGLSERIWKVFSEEGFVDERGYMRKDGRTTPWKDAVVKSGFWEEVS 334
Query: 657 NLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 335 QWADHIQEELNLAYGYHEMTSLQTDEMFNWI 365
>gi|357140524|ref|XP_003571816.1| PREDICTED: uncharacterized protein LOC100824302 [Brachypodium
distachyon]
Length = 349
Score = 261 bits (668), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 140/293 (47%), Positives = 192/293 (65%), Gaps = 11/293 (3%)
Query: 406 SVQLNPTVELRNGTDVIWQIPDSPKAV---LFLAHGCNGRAVHFWDRSPNCPNCIGLPEE 462
+V+L+P V R+G +V+WQ+P P+ +F+A GC RA F+D SP CP+C GLPEE
Sbjct: 49 AVRLDPRVARRSGNEVLWQLPPGPRPPRAAVFVAPGCTIRATDFFDASPGCPHCAGLPEE 108
Query: 463 RLLVLHALSRGFAIITISSAGRCWTL---GE--ERLVVREIIRWWV--ERHKLEKLPLVA 515
R AL R +A++ +SS +CW+L GE E V II+W V E +L LPL+
Sbjct: 109 RRFTSAALRRSYAVLAVSSRAQCWSLDAAGEDTELAAVESIIKWLVNEEHPELRGLPLIG 168
Query: 516 LGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMD-IPEDYPPTLFVHMPKDSYRQRKIG 574
+GASSGGYFVSALA +RFSS+ +MIAEG+F M+ +P YPP LFVHMPKDS R +
Sbjct: 169 IGASSGGYFVSALAARVRFSSVTIMIAEGVFAAMEEVPAGYPPALFVHMPKDSERATLVV 228
Query: 575 EFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMR 634
+ L++K ++V EI+C F +S F RIPGL +A + L ++ ++K F+DE G+++
Sbjct: 229 ASISKLKSKHVEVREIQCDAFAVSAEFLAARIPGLTRAVADGLVDVLRQKSFLDEKGFLK 288
Query: 635 SDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+DGR T WKEA E+ A+ HV EELNLA+AYHE TS E+IF+WF
Sbjct: 289 NDGRSTPWKEAAEEAEALPKQFGFDRHVTEELNLAYAYHEFTSQKDEEIFEWF 341
>gi|219362425|ref|NP_001136922.1| uncharacterized protein LOC100217080 [Zea mays]
gi|194697632|gb|ACF82900.1| unknown [Zea mays]
gi|414880505|tpg|DAA57636.1| TPA: hypothetical protein ZEAMMB73_622857 [Zea mays]
Length = 372
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 184/281 (65%), Gaps = 4/281 (1%)
Query: 410 NPTVELRNGTDVIWQIPDS--PKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
+PTVE IW++P + +AVLF+AHGC R +FW SP CP C+GLPE+ +
Sbjct: 86 SPTVETMERARAIWELPAAGPARAVLFVAHGCRCRPENFWPPSPRCPGCVGLPEDVAITD 145
Query: 468 HALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
AL R FA++ ++SAG CW+ G+E V + +IR W ++ LE LP+VALGASSGGYFVS
Sbjct: 146 RALRRRFAVLAVASAGECWSPGKEVDVAKRVIRSWAAKNGLEGLPVVALGASSGGYFVSR 205
Query: 528 LAKGLRFSSIALMIAEGLFDQM--DIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
LA + +++ +MIAEG F +P YPP +F+HMPKD R + + +L G+
Sbjct: 206 LAAKMSLAAVVIMIAEGAFGSSAGALPAVYPPAMFIHMPKDKRRAVLVDKNSKMLMKNGV 265
Query: 586 DVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEA 645
+V E++ +E PL+ ++RIPGL++ S +++ +F+E+ F+D+ GYM+ DGR T WK+A
Sbjct: 266 EVKELQSLELPLTPTLLSERIPGLDRGLSERIWTVFREEAFVDDKGYMKEDGRETPWKDA 325
Query: 646 LRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
L + L H+QEELNLA+ YHEMTSL ++++F W
Sbjct: 326 LVKRGLWEEVSALADHIQEELNLAYGYHEMTSLHADEMFDW 366
>gi|168014170|ref|XP_001759626.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689165|gb|EDQ75538.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 196 bits (497), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 163/289 (56%), Gaps = 9/289 (3%)
Query: 408 QLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
+ NPT E+ TDV+WQIP+SP+ V++LAHG F+DRSP C C GLPE+R +VL
Sbjct: 47 RFNPTYEMHYNTDVLWQIPESPRGVMYLAHGRFLDIFSFFDRSPGCERCYGLPEDRAIVL 106
Query: 468 HALSRGFAIITISSAGRCW-----TLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
AL R +A+I + S G W ++++VV I+ WV H L++LP+V G SSGG
Sbjct: 107 DALDRKYAVIVVKSLGDEWDSWPIETSKDKVVVTSIVTDWVMEHGLDELPMVGFGYSSGG 166
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPE-DYPPTLFVHMPKDSYRQRKIGEFLVVLR 581
F S L+ L SI ++ + G+++ ++ + YPPTL+ +MPKD + +I + +++
Sbjct: 167 NFTSMLSPILGMKSIVILDSPGVYEILENADASYPPTLYFYMPKDPFFTSQILRNVELMK 226
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
++G+ V + +EFP+ FF R P + S +F +DEN +
Sbjct: 227 SRGVKVEHVPMLEFPIILEFFALRNPCISNETSRSIFRALLNAKLLDENNLVTISPWEFD 286
Query: 642 WKEALRESNAI---FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+ + E++ + + L V +ELN+AFA+H M+S ++ +F WF
Sbjct: 287 YNSWIEENDLLSGCIPHTCLSEQVVQELNIAFAFHSMSSTKNDIMFDWF 335
>gi|302761002|ref|XP_002963923.1| hypothetical protein SELMODRAFT_34571 [Selaginella moellendorffii]
gi|302769157|ref|XP_002967998.1| hypothetical protein SELMODRAFT_34569 [Selaginella moellendorffii]
gi|300164736|gb|EFJ31345.1| hypothetical protein SELMODRAFT_34569 [Selaginella moellendorffii]
gi|300167652|gb|EFJ34256.1| hypothetical protein SELMODRAFT_34571 [Selaginella moellendorffii]
Length = 284
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 103/283 (36%), Positives = 147/283 (51%), Gaps = 17/283 (6%)
Query: 422 IWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISS 481
I QIP +P+AVLF+AH NG + +W+ P PNC G+P+ +V HALSRG+A+I I S
Sbjct: 1 ISQIPPNPRAVLFIAHEGNGSPLTYWEPQPQQPNCTGVPDHVAIVKHALSRGYAVIAIKS 60
Query: 482 AGRCW-----TLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSS 536
W ++ V ++ W +H LE+ P+V LG+ SGG F+S LA ++FS+
Sbjct: 61 VKSFWQDWPPQASQDAHNVVSVLVSWTAQHGLERAPIVGLGSGSGGNFLSMLALTIKFSA 120
Query: 537 IALMIAEGLFD-----QMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIE 591
+ LM++ GL +M YPP LFVHMPKD KI +LR K I VA++
Sbjct: 121 LVLMVSPGLHKALERAKMAPASTYPPVLFVHMPKDQTTASKIHSHEQLLRAKAIPVAQVC 180
Query: 592 CMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALRESNA 651
C EFP++ +F IPG STKL Q +G ID G + S R L + +
Sbjct: 181 CEEFPIAPFYFAAHIPGFGSDASTKLHAAMQAEGVIDGLGRVLSAPRPPDVCLMLEKFDV 240
Query: 652 IFLNKN-------LVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
+ L ++ L H+ + +A H T +IF W
Sbjct: 241 LPLQRDDHSYARLLAEHIHSLMRVAQGRHATTGRKCPEIFNWL 283
>gi|307107040|gb|EFN55284.1| hypothetical protein CHLNCDRAFT_48221 [Chlorella variabilis]
Length = 329
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/288 (36%), Positives = 151/288 (52%), Gaps = 17/288 (5%)
Query: 411 PTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHAL 470
P V+ + +W++P SPK VL L HGCN + FW S C +C+GLP E L A+
Sbjct: 34 PKVKTISERKTLWEVPASPKGVLLLCHGCNHQGSDFWPASDRCQDCLGLPGEVTLRTAAI 93
Query: 471 SRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
G+A+I ++S R CW+ GE+ V EI++ VE KL LPL G SSGG FV
Sbjct: 94 KHGYAVIAVTSIDRSSKCWS-GEDLKPVTEIVKAVVEEEKLGSLPLFLWGGSSGGSFVQM 152
Query: 528 LAKGL--RFSSIALMIAEGLFDQMDIPED--YPPTLFVHMPK-DSYRQRKIGEFLVVLRN 582
L GL + IA + D+ + YPP L+VHM K D + + L + ++
Sbjct: 153 L-PGLMPEVKGVIAQIAPASPQEFDVGKGRTYPPILWVHMAKRDQATAGSVAQALKICKD 211
Query: 583 KGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRW 642
KGI EI+ P++ F + P + + + + +E+ FID++G + D RR+ W
Sbjct: 212 KGIPADEIKVDPQPVTPKFL-QQSPQISAKLAASIVDTLKERSFIDQDGLLVGDPRRSAW 270
Query: 643 KEALRESNAIFLNKNL---VHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
++ L+ I N NL V H+ E LN AFA HE+ S +E FKW
Sbjct: 271 RDFLKP---IVGNANLQPDVGHLPELLNTAFAAHEIISDNAEAWFKWL 315
>gi|302809236|ref|XP_002986311.1| hypothetical protein SELMODRAFT_46563 [Selaginella moellendorffii]
gi|300145847|gb|EFJ12520.1| hypothetical protein SELMODRAFT_46563 [Selaginella moellendorffii]
Length = 292
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/290 (34%), Positives = 153/290 (52%), Gaps = 17/290 (5%)
Query: 411 PTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHA 469
PT EL G +++WQIP SP +A+LF+AH G F+ +C C GLPE + A
Sbjct: 3 PTHELLEGVELVWQIPSSPPRAILFIAHPGCGSPGDFFPPHRDCARCKGLPEHSAITQRA 62
Query: 470 LSRGFAIITISSAGRCWTL-----GEERLVVREIIRWWVERHKLEKLPLVALGASSGGYF 524
++ +A++ I SA W ++ V I+ W RH + LPLVALG +SGG F
Sbjct: 63 IAARYAVVAIKSAKDYWQTWPLEASQDAFNVERIVDSWKNRHGMANLPLVALGCASGGSF 122
Query: 525 VSALAKGLRFSSIALMIAEGLFDQMDIPED-YPPTLFVHMPKDSYRQRKIGEFLVVLRNK 583
VSALA + F+++ +MI+ G+ + YPPTLFVHMPKDS KI + L++K
Sbjct: 123 VSALALRIEFAALVIMISHGVTRAFQRATNRYPPTLFVHMPKDSLTAAKIEREIRALQSK 182
Query: 584 GIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWK 643
GI EI C++FP++ F L ++KLF G I+ G++ S + + +
Sbjct: 183 GIPTEEIACLDFPVTSEIFRGL---LSSHGASKLFAALVIHGVINGEGFVTSAMQTSEIR 239
Query: 644 EALRESNAIFLNKN------LVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
E L+ ++ IF + +V V + +A A H +T+ +IF W
Sbjct: 240 ELLQRAD-IFGEADEDEREMVVESVNCLIRVAEARHAVTARKCSEIFDWL 288
>gi|302831137|ref|XP_002947134.1| hypothetical protein VOLCADRAFT_103362 [Volvox carteri f.
nagariensis]
gi|300267541|gb|EFJ51724.1| hypothetical protein VOLCADRAFT_103362 [Volvox carteri f.
nagariensis]
Length = 431
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 155/313 (49%), Gaps = 43/313 (13%)
Query: 418 GTDVIWQIP---DSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGF 474
G + +WQ P + + V+F+AHGCN A+ +W +SP+CP CIGLPEE + LH L RG+
Sbjct: 102 GIEAVWQGPPLGTARRGVMFIAHGCNHGAIDWWPQSPSCPQCIGLPEELNVTLHCLVRGY 161
Query: 475 AIITISSAG----RCWT--------------------LGEERLVVREIIRWWVERHKLEK 510
A++ +SSA RCW R VR ++ + R L
Sbjct: 162 AVVAVSSAARRGSRCWQSVLRGGGDPWDYSQGPSPEDWMRTRRQVRRVLDVLLSREGLSS 221
Query: 511 LPLVALGASSGGYFVSALAKGL--RFSSIALMIAEG------LFDQMDIPEDYPPTLFVH 562
LPL+ALGASSGG FV L + RFS +A+ I G + + +PPTLFVH
Sbjct: 222 LPLLALGASSGGAFVLGLPLVMPGRFSGLAVQIMGGPPGLLQKYRYFNQSAAWPPTLFVH 281
Query: 563 MPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRI-PGLEQANSTKLFELF 621
MP+D+ + L LR G+ EI+ PL+ + R P +++ S ++E
Sbjct: 282 MPRDAALAELVARNLEELREGGVRTREIQVRPQPLTPLYLVQRCAPEVDEQTSRAVYEAL 341
Query: 622 QEKGFIDENGYMRSDGRRTR---WKEALRESNAIFLNKNLVH----HVQEELNLAFAYHE 674
+ F+D GY+R + R + W+ L+ + L K + + EELNL +A HE
Sbjct: 342 KFADFLDPRGYLRLNPRSQQGGGWRAVLKAAGVPELQKLRLQPEASPIAEELNLLYAGHE 401
Query: 675 MTSLPSEQIFKWF 687
+ S + + WF
Sbjct: 402 LCSETTTDMLDWF 414
>gi|302814113|ref|XP_002988741.1| hypothetical protein SELMODRAFT_128651 [Selaginella moellendorffii]
gi|300143562|gb|EFJ10252.1| hypothetical protein SELMODRAFT_128651 [Selaginella moellendorffii]
Length = 334
Score = 165 bits (418), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 103/300 (34%), Positives = 155/300 (51%), Gaps = 17/300 (5%)
Query: 401 NSFDSSVQLNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGL 459
+S D PT EL G +++WQIP SP +A+LF+AH G F+ +C C GL
Sbjct: 30 SSQDHWASSAPTHELLEGVELVWQIPSSPPRAILFIAHPGCGSPGDFFPPHRDCARCKGL 89
Query: 460 PEERLLVLHALSRGFAIITISSAGRCWTL-----GEERLVVREIIRWWVERHKLEKLPLV 514
PE + A+ +A++ I SA W ++ V I+ W RH + LPLV
Sbjct: 90 PEHSAITQRAIGARYAVVAIKSAKDYWQTWPLEASQDAFNVERILDSWKNRHGMANLPLV 149
Query: 515 ALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPED-YPPTLFVHMPKDSYRQRKI 573
ALG +SGG FVSALA + F+++ +MI+ G+ + YPPTLFVHMPKDS KI
Sbjct: 150 ALGCASGGSFVSALALRIEFAALVIMISHGVTRAFQRATNRYPPTLFVHMPKDSLTAAKI 209
Query: 574 GEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYM 633
+ L++KGI EI C++FP++ F L ++KLF G I+ G++
Sbjct: 210 EREIRALQSKGIPTEEIACLDFPVTSEIFRSL---LSSHGASKLFAALVIHGVINGEGFV 266
Query: 634 RSDGRRTRWKEALRESNAIFLNKN------LVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
S + + +E L+ ++ IF + +V V + +A A H +T+ +IF W
Sbjct: 267 TSAMQTSEIRELLQRAD-IFGEADEDEREMVVESVNCLIRVAEARHAVTARKCSEIFDWL 325
>gi|307108594|gb|EFN56834.1| hypothetical protein CHLNCDRAFT_144402 [Chlorella variabilis]
Length = 601
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 144/291 (49%), Gaps = 18/291 (6%)
Query: 403 FDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEE 462
+ S L PT G ++IP +P VLF+AHGC A FW S CP C GLPEE
Sbjct: 160 MEGSSLLTPTRGEVMGRTFYFEIPHNPVGVLFMAHGCVHDAADFWPPSKACPECSGLPEE 219
Query: 463 RLLVLHALSRGFAIITISS-----AGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALG 517
AL+RG+A++ + S RC++ GE++ V+ ++ W+ ++ L+ LP+ LG
Sbjct: 220 VSHTKQALARGYAVLAVDSKDRDPMSRCYSYGEDKWGVKYVLENWMRQYSLQALPVFGLG 279
Query: 518 ASSGGYFVSALAKGLRFSSIALMIAEGLFDQM---DIPEDYPPTLFVHMPKDSYRQRKIG 574
S+G FV L K R + I + ++ ++ YPPT+F+ M +D +I
Sbjct: 280 ISAGASFVLRLPKLTRVNGIISEVLGATYETFPVDNLGATYPPTIFIDMQRDRTMTARIA 339
Query: 575 EFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMR 634
+ + L + A+I P++R+FF+DR + S ++ + G ID+ G +
Sbjct: 340 QSMQKLMENSVPTAKIAVDPRPVTRSFFSDRSEFISPELSRRIVSGLHQIGMIDDAGMVT 399
Query: 635 SDGRRTR--WKEALRE--------SNAIFLNKNLVHHVQEELNLAFAYHEM 675
SD R T W++ L E + + + E++NLA+A HE+
Sbjct: 400 SDPRYTTQPWRDQLAEVLPELEPDTGETAGPASDWALISEQMNLAWATHEI 450
>gi|307109927|gb|EFN58164.1| hypothetical protein CHLNCDRAFT_141975 [Chlorella variabilis]
Length = 527
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/317 (32%), Positives = 156/317 (49%), Gaps = 39/317 (12%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P + ++ + +Q+PDSP+ V+ H C G H+W +S CP+C G+PE+ L
Sbjct: 13 LVPQSDYKHNHEYFYQVPDSPRGVVLFFHSCVGAGSHYWPQSQACPDCRGMPEQMSHTLQ 72
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL RGFA++ ISSA R CW+ E+ V I+ W+ L+ LPL +G G F
Sbjct: 73 ALRRGFAVMAISSADRDIGCWSWNEDSDAVAAIVPAWINEWGLQALPLYGMGFPPGASFA 132
Query: 526 SALAKGLRFSSI---ALMI---AEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVV 579
L + L+F+ I AL I A GL D++D YPP +F++M KD +Q + E + +
Sbjct: 133 LKLPRYLKFNGIMSEALAIDPAAWGL-DEID---KYPPAIFINMVKDENQQPRFREDMRL 188
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIP-GLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
L+++ V + + +P +F+DR P + S K+ QE IDE GY+ D R
Sbjct: 189 LQDRRRPVWDRKV--YP---TYFSDRSPYYITPELSEKIAFALQEMWMIDEQGYVTEDPR 243
Query: 639 RTR--WKEALR---------------ESNAIFLNKNL---VHHVQEELNLAFAYHEMTSL 678
T+ W E L+ + F + ++ V +NLA+A HE+ S
Sbjct: 244 DTQRPWLEELKGLVPELGGSTQDFGGRTVPKFFDSSVDVNAAGVYSLMNLAWAKHEIISD 303
Query: 679 PSEQIFKWFLPIGSLDL 695
WF GS D+
Sbjct: 304 YVTVALLWFESGGSADI 320
>gi|159482484|ref|XP_001699299.1| hypothetical protein CHLREDRAFT_205579 [Chlamydomonas reinhardtii]
gi|158272935|gb|EDO98729.1| predicted protein [Chlamydomonas reinhardtii]
Length = 433
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 148/312 (47%), Gaps = 44/312 (14%)
Query: 419 TDVIWQIPDSP----KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGF 474
+V+WQ+P + K V+FLAHGC+ AV FW +S CP C GLPEE + LH L+RG+
Sbjct: 97 VEVVWQLPAAGAAAAKGVVFLAHGCSHGAVDFWPKSAGCPQCTGLPEEMNITLHVLTRGY 156
Query: 475 AIITISS----AGRCW--TLGEERLV------------VREIIRWWVERHKLEKLPLVAL 516
A + ISS RCW G + V ++ + R +LE PL A
Sbjct: 157 AAVAISSYDRRMSRCWQNMWGSRDEILNSASDQADFTRVETALKALLARERLEAAPLFAF 216
Query: 517 GASSGGYFVSALAKGLR--FSSIALMIAEGLFDQMDIPE----------DYPPTLFVHMP 564
GASSGG FV L + FS +A I G + E +PPT VHMP
Sbjct: 217 GASSGGGFVLGLPPVMPGVFSGVAAQIMGGAPKHFEAYERARETDSVVSHWPPTALVHMP 276
Query: 565 KDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDR-IPGLEQANSTKLFELFQE 623
+D + +G L L+ I EI+ PL+ + R P +++A S ++E +
Sbjct: 277 RDGHIGTLVGMNLQELKALKIPHLEIQIPPQPLTPLYMVQRCAPEVDEATSKAIYEALKA 336
Query: 624 KGFIDENGYMRSDGRRT-RWKEALRESNA-------IFLNKNLVHHVQEELNLAFAYHEM 675
++D G++R + R W+ L+ +N + L +L + EELN + HE+
Sbjct: 337 ADYLDAEGFLRHNPRLAPDWRLILQRANVPGVTSGKLRLEPDL-SPLAEELNHLYGAHEI 395
Query: 676 TSLPSEQIFKWF 687
+S + + W+
Sbjct: 396 SSESTTDLLDWW 407
>gi|307107112|gb|EFN55356.1| hypothetical protein CHLNCDRAFT_134385 [Chlorella variabilis]
Length = 513
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 88/258 (34%), Positives = 136/258 (52%), Gaps = 10/258 (3%)
Query: 400 WNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGL 459
W + D S L P + ++ + +Q+P++P+AV+FLAHGC A ++W +S CP C GL
Sbjct: 37 WPNVDWS-DLKPYQDFKHKHEFFYQVPENPRAVVFLAHGCVHSAHNYWPQSEECPECRGL 95
Query: 460 PEERLLVLHALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVAL 516
PEE L AL G+A+I ISS R CW+ ++ V +II+ W E L+ LP+
Sbjct: 96 PEELSHTLQALKLGYAVIAISSLDRETGCWSFWDDCFDVIDIIQEWREDWGLDHLPIYGC 155
Query: 517 GASSGGYFVSALAKGLRFSSI---ALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKI 573
G SSGG FV L + ++F + AL I D+P YPPT+ V M +D ++ I
Sbjct: 156 GISSGGSFVLKLPRYMKFDGLMSEALGIDPLSGGFRDVPGVYPPTVIVSMQRDKKQRELI 215
Query: 574 GEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIP-GLEQANSTKLFELFQEKGFIDENGY 632
E +L + V I+ + +F++R P + + S K+ + +E ID +G
Sbjct: 216 NENREILEGRNTPVEIIKSYARQVYPTYFSERFPLYISEELSIKIRDGLEEIAMIDGDGN 275
Query: 633 MRSDGRRTR--WKEALRE 648
+ D R T W + L+E
Sbjct: 276 VLEDPRYTSRPWLKELQE 293
>gi|307109675|gb|EFN57912.1| hypothetical protein CHLNCDRAFT_141974 [Chlorella variabilis]
Length = 640
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 141/313 (45%), Gaps = 45/313 (14%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L PT+E++ +Q+P++PK + L HGC +++ RS C +C GLPEE L
Sbjct: 69 LKPTIEVKYNHTYYYQVPEAPKGAVVLFHGCVHSGYNYFPRSAACLDCRGLPEEMSHTLQ 128
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL+RG+A+I ISSA R CW+ + V+ ++R W H+L LPL +G SSG F
Sbjct: 129 ALNRGYAVIAISSADRDTGCWSWDSDSDGVKALVREWQREHRLRDLPLYGMGISSGASFA 188
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
+ + L S YPPTLFV + +D KI VL
Sbjct: 189 LKIPRWLNMS-------------------YPPTLFVALMRDKTTVAKILRNQNVLTALHR 229
Query: 586 DVAEIECMEFPLSRNFFTDRIP-GLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR--W 642
V+ I+ + +F+DR P +A S K+ + G ID +G + D R T W
Sbjct: 230 PVSVIKVYPRLVYPTYFSDRDPHSFSKALSVKIANGLKAIGMIDGDGAVLLDPRYTTKPW 289
Query: 643 KEALR------ESNAIFLNKNL--------------VHHVQEELNLAFAYHEMTSLPSEQ 682
L+ ++ + KNL V V +NLA+AYHE+ S
Sbjct: 290 LRQLKQEVPELQTGPLTTFKNLYVPDWNDTDSTNTKVAGVYSLMNLAYAYHEIISDYVTV 349
Query: 683 IFKWFLPIGSLDL 695
WF G D+
Sbjct: 350 ALMWFESGGKADI 362
>gi|307103063|gb|EFN51327.1| hypothetical protein CHLNCDRAFT_141094 [Chlorella variabilis]
Length = 530
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/242 (32%), Positives = 119/242 (49%), Gaps = 16/242 (6%)
Query: 423 WQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAII----- 477
++IP P +L + H C A FW S CP CIG+PE + AL+RG+A++
Sbjct: 68 FEIPRDPVGLLVMLHRCGRNAEDFWPPSSVCPQCIGMPEAVSISRQALARGYALLAINSF 127
Query: 478 --TISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFS 535
T+ S GRCW+ G + + VR+I+ + +RH L LPL G SSGG L +
Sbjct: 128 NRTVGSLGRCWSWGTDAVAVRDIVAAFRKRHALADLPLFFTGCSSGGSLALRLPGIMAMD 187
Query: 536 SI---ALMIAEGLFDQMD--IPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEI 590
I A+ + E +F D + YPP +VH D RK+ L + G+ AE+
Sbjct: 188 GIMPVAIGLEEDVFPATDPSLKFPYPPVAYVHFADDDNTDRKVAGMLRISEEAGLAAAEV 247
Query: 591 ECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD----GRRTRWKEAL 646
+ +F + R+FF+ R P + A S + + +E +DE+ + D G+ W L
Sbjct: 248 KVGKFAIGRDFFSQRDPMVSPALSELVHQALRELNVLDEHKILVDDWERQGKPAPWDAML 307
Query: 647 RE 648
RE
Sbjct: 308 RE 309
>gi|147839067|emb|CAN59772.1| hypothetical protein VITISV_029147 [Vitis vinifera]
Length = 832
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 158/313 (50%), Gaps = 24/313 (7%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++Y G+W + +Y + G + KGI + I+YKEL+ ++ I+K+D +
Sbjct: 8 VDEVGIMLLYKGEWVRDGNVFY-FEGSKGKGIEIPKTISYKELLGVVHHILKLDPTNCFL 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHA 182
+MK ++ ANIPT P+++ DD DV+FFI N + +L P+CIT+E++ + ++
Sbjct: 67 SMKYVFNANIPTSPIQLTDDGDVKFFIGLNCTNG-KLLVPLCITVEKRIDNHNQKSICNS 125
Query: 183 ALDHQTPAASRSESNQISVVVPG---MQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG 239
+ + E N S+++ + VE ++ NG F E+P +G
Sbjct: 126 YFECHMSSEIDKELNGDSMLMHKSRHIHCDSVETLTVDGENGQ----RFQNESP----EG 177
Query: 240 ILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSY--EDIVCG----- 292
+ ++E + + ++ + DK + + R+ N + + IV G
Sbjct: 178 YKVHDWNMNETTINEEDYRMNTNPTSDKQVTQIGSFRTDVANERQNMMQQPIVSGISDDH 237
Query: 293 ----KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ ++SKK+L +KL + A K+ F FK +KST V C D CKW++RATK
Sbjct: 238 LEEHQIYSSKKELQRKLHMMALKRKFEFKTTKSTTKLLLVECFDKECKWQVRATKLGISN 297
Query: 349 YFEIRRFSNIHIC 361
F+I +F + H C
Sbjct: 298 MFQIMKFYSTHTC 310
>gi|147841860|emb|CAN69231.1| hypothetical protein VITISV_008803 [Vitis vinifera]
Length = 751
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 154/337 (45%), Gaps = 71/337 (21%)
Query: 63 MSQVRVLVMYDGKWFCSPSG-YYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYV 121
+ +V ++++Y G W P G Y + G + KGI V +I+Y EL+ +Y I+++D E
Sbjct: 8 IDEVGIIILYKGNWV--PDGNTYHFKGSEGKGITVKKNISYHELMRVMYHILQLDPIECS 65
Query: 122 ITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYH 181
I+MK + NIPT P+++ DD DV FFI+ N + +L +P+CIT++R
Sbjct: 66 ISMKYAFSGNIPTSPIQLRDDGDVNFFIRLNCIN--KLLAPLCITVDR------------ 111
Query: 182 AALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFN--GDPVAANFGTEAPLGCGDG 239
RS++N S+ + G V + ++ +FN GD F E PL +
Sbjct: 112 -----------RSKNNAKSMFMHG--NGHVNDGSIESFNVVGDESITKFNYE-PLERSN- 156
Query: 240 ILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIV--------- 290
++ N+ DD LD + + R S+ ++A I HS +I
Sbjct: 157 VVEWNMNGYAIYDDYHVLDTNLTSNVQVIENRDSSNKAAQIMEIHSIMNIKDGLMNDVPT 216
Query: 291 -------------------CG---------KTFTSKKDLIKKLKLGAAKKNFAFKVSKST 322
CG + ++SKK+L KKL + A K+ F F+ KST
Sbjct: 217 MIEEVSNNDQDMSRIGTSDCGTNDDHIEEKQIYSSKKELQKKLYIIALKEKFEFRTIKST 276
Query: 323 KDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIH 359
+ C D CKWR RATK +F++ ++ H
Sbjct: 277 TKLLVLQCVDNECKWRFRATKLGSSNFFQVMKYHPTH 313
>gi|307108595|gb|EFN56835.1| hypothetical protein CHLNCDRAFT_51615 [Chlorella variabilis]
Length = 374
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 141/310 (45%), Gaps = 21/310 (6%)
Query: 409 LNPTVELRNGTDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPN-CPNCIGLPEERLLV 466
L P+ G V ++IP +P VL + HGC A FW R + CP C GLPEE
Sbjct: 39 LTPSTFTMRGRQVFYEIPAAPILGVLAMMHGCAHDASDFWPRDASACPECDGLPEEVAHT 98
Query: 467 LHALSRGFAIITI--SSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYF 524
AL+RG+A I + S A RC+ + V I++ W++++ PL +G S+G
Sbjct: 99 KQALARGYAAIAVDASDATRCFGWHADGEDVAAILQHWIKQNNFTAKPLYLVGVSAGASL 158
Query: 525 VSALAKGLRFSSIALMIAEGLFDQMD------IPEDYPPTLFVHMPKDSYRQRKIGEFLV 578
V L + LR + I+E L Q+D + +PPT +V MP+D+ +I + +
Sbjct: 159 VLKLPRLLRVHGV---ISEVLGVQLDAWGMEELGAAFPPTAYVSMPRDARTAARIADNQL 215
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
+LR G+ + P + +F ++R + K+ + G +D G + D R
Sbjct: 216 LLRRYGVPTDVLLVQPQPFTSDFLSNRSALFSPELAAKVVAALRATGLLDGAGVLLQDPR 275
Query: 639 RTRWKEALRESNAI-----FLNKNLV---HHVQEELNLAFAYHEMTSLPSEQIFKWFLPI 690
T R + A+ +LV HV EELN A+A HE+ W
Sbjct: 276 YTALPWRARVAAAVPELAAGAGDSLVADASHVGEELNRAYAMHEIVGDHLTACLAWLEAR 335
Query: 691 GSLDLPSEAK 700
G+L L A+
Sbjct: 336 GTLPLAGLAQ 345
>gi|307110506|gb|EFN58742.1| hypothetical protein CHLNCDRAFT_140438 [Chlorella variabilis]
Length = 296
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 141/293 (48%), Gaps = 35/293 (11%)
Query: 420 DVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITI 479
+ +W++P PK VLF+AHGC+ + FW S C +C+GLPEE + AL RG+A++ +
Sbjct: 5 ETLWEVPRKPKGVLFVAHGCSHQGSDFWPASQRCKHCLGLPEEMGVRAAALKRGYAVVAV 64
Query: 480 SSA---GRCW-----TLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKG 531
+S RCW E+ + I+R ++ L LPL A GASSGG
Sbjct: 65 TSYNRDSRCWHNTQAKRSEDLKRLPSILRDVIQEEGLAGLPLFAFGASSGGSLA------ 118
Query: 532 LRFSSIALMIAEGLFDQM------------DIPEDYPPTLFVHMP-KDSYRQRKIGEFLV 578
LR +++ + +G+ Q+ +PPT+FVHM +D + + E +
Sbjct: 119 LRLAAVMPEV-QGVICQITPVNPSALEVDGGGSRRFPPTVFVHMAQRDPEKAETVAEAIK 177
Query: 579 VLRNKGIDVAEIECMEFPLSRNFF--TDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
+L+ G EI+ ++ +D+I G +QA + + ++ G +D+ G ++ D
Sbjct: 178 ILKKAGTPATEIKVAPQAVTAALLQRSDQIGG-KQAKA--IVAALKKGGALDKKGLLKLD 234
Query: 637 GRR--TRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
R + W++ + + V + E LN+A+A HE+ S + W
Sbjct: 235 PRSSASTWRQMVEPVAGGLSLEADVSQIGELLNVAYARHEIVSEHTAASLAWL 287
>gi|384254369|gb|EIE27843.1| hypothetical protein COCSUDRAFT_83463 [Coccomyxa subellipsoidea
C-169]
Length = 463
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 151/316 (47%), Gaps = 43/316 (13%)
Query: 404 DSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEER 463
+S+V + +E +GT+V++ +PKAVL L HGC+ A+ + +S CP C+GLPEE
Sbjct: 127 NSTVTGDAVLEEVDGTEVVYTRVAAPKAVLLLFHGCSHSALDWGRKSVTCPGCLGLPEEL 186
Query: 464 LLVLHALSRGFAIITISS---AGRCWT---LGEERLV--VREIIRWWVERHKLEKLPLVA 515
+V A++R + + SS A RCW L + + V++ + R + EKLP+ A
Sbjct: 187 AIVRRAVARSYVPVAFSSSDKANRCWDAQWLDKSLDIPNVKQTFQKLAARERWEKLPVYA 246
Query: 516 LGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDI-PED--------YPPTLFVHMPKD 566
LGASSG V LA + I I ++ P D YPP FVHM D
Sbjct: 247 LGASSGASMVLFLAMRMPLDGIVPQIMSLPPHMLEAKPTDPAAGKSWTYPPVYFVHMELD 306
Query: 567 SYRQRKIGEFLVVLRNK-GIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKG 625
+++ + VL+ K G V E P+S F RI G++ + + + +
Sbjct: 307 RATAQRVTADINVLKTKQGAWVTEKLIRPRPVSAELFL-RIEGIDVKTAEAVHQTLKAAE 365
Query: 626 FIDENGYMRSDGRR--TRWKEALR------------ESNAIFLNKNLVHHVQEELNLAFA 671
++ +++ D R+ T+W AL+ ES+ +F EELNLA+A
Sbjct: 366 LLNATDFLKEDPRQTSTKWMPALKAIPGLAGISLEPESSPMF----------EELNLAWA 415
Query: 672 YHEMTSLPSEQIFKWF 687
HE+ S +++ W
Sbjct: 416 RHELISDHLDEVLDWL 431
>gi|384250150|gb|EIE23630.1| hypothetical protein COCSUDRAFT_41813 [Coccomyxa subellipsoidea
C-169]
Length = 429
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 147/322 (45%), Gaps = 42/322 (13%)
Query: 418 GTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAII 477
G +V+ P+ VL + HGC R +W P C CIGLPEE + AL+R F ++
Sbjct: 74 GMEVVLVRSKDPRGVLIVMHGCMRRPTDWWHYDPACVTCIGLPEEMAIARAALARSFDVL 133
Query: 478 TIS---SAGRCWTL-----GEERLVVREIIRWWVER-HKLEKLPLVALGASSGGYFVSAL 528
++ A RCW E+ V ++ R + LP ALG SSGG L
Sbjct: 134 ALTPKEQAYRCWDTEWPDKSEDVPRVAAVLEELEARDYGYRGLPRYALGVSSGGAMAMVL 193
Query: 529 AK------------GLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEF 576
A+ G+R S +AL + + + P YPPT F++M +D I
Sbjct: 194 AQFFPLQGICSQIMGIRPSLLALPKNKTMAGR---PWRYPPTFFLYMARDVAMSEWISLD 250
Query: 577 LVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
L+++G+ V ++ + P++ +F+DRI L+ A S ++ Q +D + D
Sbjct: 251 ADKLKSQGVYVKKVSVLPRPITPTYFSDRISSLDNATSAAIYHALQGAHMLDLQDMLVQD 310
Query: 637 GRRT-----RWKEALRESNAIFLNKNL---VHHVQEELNLAFAYHEMTSLPSEQIFKWF- 687
R+ RW++ L+E+ +L + E LNLA+A HE+ S ++ +F ++
Sbjct: 311 PRQVMPSAPRWRQVLQENVPGLRGVSLESETSPIHELLNLAWAGHEIVSDYADDMFAFWE 370
Query: 688 ---------LPIGSLDLPSEAK 700
+PIG + S+ +
Sbjct: 371 RPGPDEAELMPIGGIGASSKMR 392
>gi|145346415|ref|XP_001417683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577911|gb|ABO95976.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 439
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 133/295 (45%), Gaps = 31/295 (10%)
Query: 420 DVIWQIPDS-----------PKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
+ IW+ P + PK L HGC+ A F+ + C C GL +E +
Sbjct: 139 EAIWEGPAAENAAGEFEATKPKGALLALHGCHHSAKDFFATNTKCSQCRGLAQEMAITRT 198
Query: 469 ALSRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSAL 528
AL+RG+ ++ ISS G CW+ + V+ ++ + L LPL A GASSGG FV A+
Sbjct: 199 ALNRGYVVLAISSLGTCWSKDIDAPRVQAVLDVFYNDTALSDLPLFAFGASSGGSFVGAM 258
Query: 529 --------AKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVL 580
A GL A G+ + + DYPPT+F HMP+D+ I +
Sbjct: 259 PQMQLTPKAAGLIIQIAPGPTASGVVGE--VLADYPPTVFSHMPRDARTAEFIAASIADF 316
Query: 581 RNKGIDVAEIECMEFPLSRNFFTD----RIPGLEQA---NSTKLFELFQEKGFIDENGYM 633
RN G+ + E + +F + R+ E A S + F L +G + E+ +
Sbjct: 317 RNAGVRIIESRLEPRAIDDAYFYEQSFGRVSEEESAAMVASLRSFNLIDAQGLLTEDPRL 376
Query: 634 RSDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWFL 688
D K E +++ +++ +E LN+A+A HE+++ + F W +
Sbjct: 377 FDDDHIIMIKRHAPEWDSLLADQS---DTRELLNVAYAMHELSAQDFSKNFAWLV 428
>gi|307110505|gb|EFN58741.1| hypothetical protein CHLNCDRAFT_140437 [Chlorella variabilis]
Length = 335
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 114/227 (50%), Gaps = 22/227 (9%)
Query: 420 DVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITI 479
+ +W++PD VL +AHGC+ +A FW S CP C+GLPEE + AL RG+A+I +
Sbjct: 56 ETLWEVPDQVGGVLLVAHGCSHQAGDFWPASQRCPGCLGLPEEMAIRATALRRGYAVIAV 115
Query: 480 SSAGR---CWTLGE-------ERL--VVREIIRWWVERHKLEKLPLVALGASSGGYFVSA 527
SS R CW + +RL +VRE++R LE LPL GASSGG V
Sbjct: 116 SSYNRSSGCWHNTQASRSEDLQRLPDIVREVVR----EEGLEGLPLFLFGASSGGGIVLR 171
Query: 528 LAKGL-RFSSIALMIAEGLFDQMDIPED---YPPTLFVHM-PKDSYRQRKIGEFLVVLRN 582
LA + +A + D M+ + +PP + HM +D ++ E + R+
Sbjct: 172 LAAIMAEVQGVAAQVVPFRPDLMEGGQGWRRFPPVVVAHMAARDPVWAAQVAETMGYCRS 231
Query: 583 KGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDE 629
GI AEI P++ F R P +++A + + E +G +DE
Sbjct: 232 VGIPAAEIRIDPQPITPQFL-QRAPQIDRATAEAMVEALDGEGLLDE 277
>gi|147855764|emb|CAN83444.1| hypothetical protein VITISV_005823 [Vitis vinifera]
Length = 534
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 141/302 (46%), Gaps = 44/302 (14%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++Y+ KW + +Y + + K I VS I+YKEL+ +Y I+K+D +
Sbjct: 2 VDEVGIMLLYEEKWVQDGNAFY-FERSKGKDIEVSKTISYKELLGVVYHILKLDPTNCFL 60
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHA 182
+M ++ ANI T P+++ DD DV+FFI N + EL P+CI +E++ + ++
Sbjct: 61 SMNYVFNANISTSPIQLTDDGDVKFFIGLNCTNG-ELHVPLCIIVEKRIDNHNQKSICNS 119
Query: 183 ALDHQTPAASRSESNQISVVV---PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG 239
++ + E N+ S+++ + VE +++ NG F E+ G
Sbjct: 120 YSEYHMSSEIDKELNKDSMLMHKSTHVHCESVETLTVDSENG----LRFQNESLEG---- 171
Query: 240 ILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKK 299
+ N +I+ E+ + ++SKK
Sbjct: 172 ----------------------------YKVHDWNMNETSIDGDDHLEE---HQIYSSKK 200
Query: 300 DLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIH 359
+L +KL + A K+ F FK +KST V C D CKWR+RATK F+I ++ + H
Sbjct: 201 ELQRKLYMMALKRKFEFKTTKSTTKLLLVECFDKECKWRVRATKLGISNMFQIMKYYSTH 260
Query: 360 IC 361
C
Sbjct: 261 TC 262
>gi|307103064|gb|EFN51328.1| hypothetical protein CHLNCDRAFT_55184 [Chlorella variabilis]
Length = 345
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 129/291 (44%), Gaps = 29/291 (9%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
+ PT ++IP P +L + H C A FW S CP CIG+PE +
Sbjct: 37 MTPTSASAANHTYYFEIPIDPVGLLVMLHRCGRSAEDFWPPSGACPECIGMPEAVSISRQ 96
Query: 469 ALSRGFAII-------TISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSG 521
AL+RG+A++ T+ S GRCW+L + L VRE++ + + H L LPL G SSG
Sbjct: 97 ALARGYALLAINSFNRTVGSLGRCWSLATDALPVRELVTAFRQLHGLADLPLFVTGCSSG 156
Query: 522 GYFVSALAKGLRFSSIALMIAEGLFDQMDIPED-------YPPTLFVHMPKDSYRQRKIG 574
G L + I + +A GL D P + YPP + H +D+ ++
Sbjct: 157 GLLALRLPSFMALDGI-MPVAIGL-DLDSFPPNLPRQNHTYPPVAYFHFSRDNTTTTQVE 214
Query: 575 EFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMR 634
+ + G+ +++ + +FF+ R P + A +T+ E G+ +E +
Sbjct: 215 AMVEFSQAAGLPASQVTVQPSQFTPDFFSQRDPLIRHAGATRRMPRAGE-GWTEEVSTVE 273
Query: 635 SDGRRTRWKEALRESNAIFLNKN------------LVHHVQEELNLAFAYH 673
R EA ++ + ++ L HHV EEL +A+A H
Sbjct: 274 LRQEVERVAEAAGAVSSYCVAESGDGTAADLTPDLLYHHVLEELGVAWAEH 324
>gi|308803725|ref|XP_003079175.1| unnamed protein product [Ostreococcus tauri]
gi|116057630|emb|CAL53833.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 348
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 152/349 (43%), Gaps = 42/349 (12%)
Query: 367 KNRKKQTALQRPDGERGIRNEKPNSSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIP 426
K+ K A + GER + +SS V WN+ V +G + IW+ P
Sbjct: 3 KSEKGGGASEEESGERELL--IGDSSVAYVSMGWNAERRLVG-------DDGIETIWEGP 53
Query: 427 -----------DSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFA 475
+P L + HGC+ A ++ + C C GL +E + AL+RGF
Sbjct: 54 AAENEKGELEATAPVGALLVLHGCHHSAKDWFANNTKCETCRGLAQEMSVTRTALNRGFV 113
Query: 476 IITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKG---L 532
+I +S+ G CW+ + + +++ + L LPL A GASSGG FV L + +
Sbjct: 114 VIAVSAYGTCWSKEIDAPRMEQVLNTFFNDTALSDLPLYAFGASSGGSFVGMLPQMKLPV 173
Query: 533 RFSSIALMIAEGLFDQMDIPED---YPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAE 589
+ S + + IA G + D YPPT+F +MPKD I + + + G+ V E
Sbjct: 174 QPSGLIIQIAPGPTLENTPRADLSTYPPTVFSYMPKDERTSELIAQSIADFKRAGVQVKE 233
Query: 590 IECMEFPLSRNFFTDRIPG-LEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALRE 648
+ + +S FF + G + + S L +D +G + +D R A +
Sbjct: 234 SKLNDHVVSDTFFYEESFGKITKEESAGLTNTLNSFNLLDADGKLTNDPR------AFDD 287
Query: 649 SNAIFLNKNL---------VHHVQEELNLAFAYHEMTSLPSEQIFKWFL 688
+ L K+ V+E LN+A+A HE+++ Q KW +
Sbjct: 288 DHVEVLKKHAPVWDSMLADQSDVRELLNVAYATHELSAQEFGQNLKWLM 336
>gi|46398239|gb|AAS91797.1| MuDRA-like transposase [Cucumis melo]
gi|51477400|gb|AAU04773.1| MuDRA transposase-like [Cucumis melo]
Length = 807
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 152/323 (47%), Gaps = 38/323 (11%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
M+ VR+LV + G+W G Y GG KGI+V +IT+K+L LY + +VD ++ I
Sbjct: 1 MAHVRILVRHGGEW---DEGRRKYEGGVLKGIVVPKEITHKDLQSELYDLAEVDPTKFDI 57
Query: 123 TMKSLY--KANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGY 180
++ +Y K P E+ +D D++F+I + +P E+ P+ ++ E ++ +
Sbjct: 58 KIRCIYEIKGEKEAPPFELSNDRDLKFYIL--SENPLEV--PLYLSFEPTSNRSMKV--- 110
Query: 181 HAALDHQTPAASRSESNQISVVVP-----GMQATEVEEQNLNAFNGDPVAANFGTEAPL- 234
S S SNQ+ + P GM + E ++ G GT + +
Sbjct: 111 -----LNKDYNSVSGSNQVQNLNPHPPPIGMDRLDENEVDIGEVEGGLCHNMIGTNSAIW 165
Query: 235 -------GCGDGILAANIEVSEQQDDI-QRLDLSQDDCGDKLPI-RRSNTR------SAA 279
D ++E+ + DI ++ D DC K + RS++R S
Sbjct: 166 ESYESYHSIDDTFTLESVEMYNELFDIPEQRDAPTKDCKGKGKVDYRSSSRKLKTKGSGW 225
Query: 280 INPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRL 339
+ E++ G+ F K+DL +L + A KKNF F V KSTK+ V C D C WRL
Sbjct: 226 SEESSTSEELDVGQIFFCKRDLSMRLSVLAMKKNFQFVVKKSTKEVLFVRCIDNKCGWRL 285
Query: 340 RATKTAEDEYFEIRRFSNIHICT 362
RA + + F+I+++ +H C+
Sbjct: 286 RAVRLKDSNIFKIKKYVKVHSCS 308
>gi|412989160|emb|CCO15751.1| predicted protein [Bathycoccus prasinos]
Length = 561
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 139/291 (47%), Gaps = 34/291 (11%)
Query: 417 NGTDV--IWQIP---DSPKAVLFLAHGCNGRAVHFWDRSPN-CPNCIGLPEERLLVLHAL 470
NG D+ ++++P P+A+ L HGC A ++++ N CP C GLP E + A
Sbjct: 40 NGEDIQYLYKVPKLSSGPRALFLLFHGCAHDAFDWFEKDENNCPKCRGLPIEVSIAHFAY 99
Query: 471 SRGFAIITISSAGRCWTLGEERLVVREIIRWWVER---------HKLEKLPLVALGASSG 521
+ A++ ISS CW + V ++++ + E+ HK E + + A GASSG
Sbjct: 100 EQNMAVLAISSNKGCWQRNPDASRVYKVLKNFSEKDGTDIGDLAHK-EHVDVYAFGASSG 158
Query: 522 GYFVSALAKGL---RFSSIALMIAEGLFDQMDIPEDYP----PTLFVHMPKDSYRQRKIG 574
G FVS + + F + + + +G Q+ P + P +F HMPKD R +
Sbjct: 159 GTFVSHVLPQIFEDHFGHLKMRL-KGAISQIAAPPEVEKITLPMVFSHMPKDERIARAVR 217
Query: 575 EFLVVL-RNKGIDVAEIECMEFPLSRNFFTDRIPG--LEQANSTKLFELFQEKGFIDENG 631
+ + ++ + V E E L FF D P E S K+FE + G++D+ G
Sbjct: 218 NTVDQMQKSSDVPVKESELHPMKLHETFFRDNAPPGYYETYESEKMFEALKTNGYLDDEG 277
Query: 632 YMRSDGRRTRWKE----ALRESNAIFLN---KNLVHHVQEELNLAFAYHEM 675
Y++ D RR+ W++ +L N + +N E +N+A+A HEM
Sbjct: 278 YLKEDPRRSDWRDVVCGSLEGENRFAPDSPCENNKSWTSELMNVAWASHEM 328
>gi|449443702|ref|XP_004139616.1| PREDICTED: uncharacterized protein LOC101218844 [Cucumis sativus]
Length = 806
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 153/321 (47%), Gaps = 35/321 (10%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
MS + +LV Y G W Y GG KGI+VS +IT+K+L LY + +VD +++ +
Sbjct: 1 MSHIPMLVRYGGMW---DERRRKYEGGMLKGIVVSKEITHKDLQAELYDLAEVDPSKFDV 57
Query: 123 TMKSLYKANI----PTMPVEIVDDDDVRFFI-QENAAHPTELRSPICITIERKGSQCQET 177
++ +Y+ + PT E+ +D D++F++ EN L+ P+ ++ E K +Q ++
Sbjct: 58 MIRCIYEIKVEHEAPTF--ELSNDRDLKFYLLSENP-----LKVPLYVSFEPKSNQSKKV 110
Query: 178 EGYHAALDHQTPAASRSESN---QISVVVPGMQATEVEEQNLNAFNGDPV----AANFGT 230
+ D+ + + S N +V+ + EV + + D V +A + +
Sbjct: 111 ----LSKDYNSVSGSNQAHNLNPHPPIVMDTLNENEVHVREVEVGLCDNVIGTTSAIWES 166
Query: 231 EAPLGCGDGILAAN-IEVSEQQDDI-QRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYED 288
D +E++ + DI Q D DC K +R S++ + + +
Sbjct: 167 YESYDSKDETFTWEPVEMNSESFDIPQHRDGPTKDCKGKSKVRYSSSSQKLKTDMNDWSE 226
Query: 289 IV-------CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRA 341
G+ F SKKDL +L + A KKNF F V KSTK+ V C D C WRLRA
Sbjct: 227 ESSTSEEFDVGQIFFSKKDLSMRLSVLAMKKNFQFVVKKSTKEVLFVRCIDNKCGWRLRA 286
Query: 342 TKTAEDEYFEIRRFSNIHICT 362
+ + F+I+++ +H C+
Sbjct: 287 MRLKDSNIFKIKKYVKVHSCS 307
>gi|255078562|ref|XP_002502861.1| predicted protein [Micromonas sp. RCC299]
gi|226518127|gb|ACO64119.1| predicted protein [Micromonas sp. RCC299]
Length = 365
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 128/286 (44%), Gaps = 14/286 (4%)
Query: 419 TDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPN-CPNCIGLPEERLLVLHALSRGFAII 477
+ IW +P S K + HGC+ A+ ++ R + CP C GL +E+ + AL G+A++
Sbjct: 81 VEAIWNVPASAKGTVLALHGCSHSALDWFPRDESRCPECRGLAQEQKITNAALDAGYALL 140
Query: 478 TISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAK----GLR 533
+SSAG CW+ + + + + E + P+ A GASSGG F L R
Sbjct: 141 AVSSAGHCWSRDDVPRLALALDAYGAEHGASRRRPMYAFGASSGGSFAGVLPSLSDLPRR 200
Query: 534 FSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECM 593
+ + IA G YPPT+ HMP+D + + L G V E
Sbjct: 201 PDGLIIQIAGGPGGTGLGLAHYPPTVLSHMPRDESTAAAVRRSIDELTAAGARVLEHRLE 260
Query: 594 EFPLSRNFFTDRIPGLEQA-NSTKLFE-LFQEKGFIDENGYMRSDGRRTRWKE--ALRES 649
L+ +FF + G A S LF L E G +D + + D R + LRE
Sbjct: 261 RRALTPSFFEEESAGKVTARESAGLFHALRDELGVLDAHDLLIRDPRSFDVGQIAKLRER 320
Query: 650 NAIFLNKNLV---HHVQEELNLAFAYHEMTSLPSEQIFKWFLPIGS 692
+ +LV V+E LN+A+A HE+++ + +W + S
Sbjct: 321 TPAW--DSLVPDESDVREILNVAYAAHELSAQDFMKNLRWIVDASS 364
>gi|147841847|emb|CAN60749.1| hypothetical protein VITISV_043463 [Vitis vinifera]
Length = 483
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/327 (25%), Positives = 152/327 (46%), Gaps = 46/327 (14%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ ++ ++++Y + + Y+ + G + KGI V +I+Y EL+ +Y I+++D E I
Sbjct: 8 IDEIGIMLLYKRNYVQDGNTYH-FKGSEGKGITVKKNISYHELMRIVYRILQLDPVECSI 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHA 182
+MK + +NIPT P+++ DD DV+FFI+ N + +L +P+CIT++R+ ++ H
Sbjct: 67 SMKYAFNSNIPTSPIQLRDDGDVKFFIRLNCTN--KLPAPLCITVDRRSKNNAKSMFMHG 124
Query: 183 ALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPV----AANFGTEAPLGCGD 238
H + S + ++++ FN +P+ + + D
Sbjct: 125 N-GHVNDGSIES-------------LNVIGDESITKFNYEPLERSNVVEWNMNGIMNIKD 170
Query: 239 GIL--AANI--EVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKT 294
G++ A + EVS D+ R+ S DCG +Y+ I
Sbjct: 171 GLMNDAPTMIEEVSNXDQDMSRIGTS--DCGT------------------NYDHIEEKXI 210
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRR 354
++ KK+L KKL + K+ F FK KST + C D CK R AT +F++ +
Sbjct: 211 YSXKKELXKKLYIIXLKEKFKFKTIKSTTKLLVLQCVDNECKCRFXATXLGSSNFFQLMK 270
Query: 355 FSNIHICTQPPAKNRKKQTALQRPDGE 381
+ H C + +R + A R GE
Sbjct: 271 YHPTHTC-RLEIMSRDNRHASSRLVGE 296
>gi|448924743|gb|AGE48324.1| hypothetical protein PBCVAN69C_040R [Paramecium bursaria Chlorella
virus AN69C]
Length = 412
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/289 (25%), Positives = 131/289 (45%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V + +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 107 LKPIVIIVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 165
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 166 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 222
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 223 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 278
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 279 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSNKIVQYLQKIGFIDASGNIRKDPKNKS 338
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 339 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 385
>gi|448930175|gb|AGE53740.1| hypothetical protein PBCVIL3A_036R [Paramecium bursaria Chlorella
virus IL-3A]
Length = 436
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/289 (25%), Positives = 131/289 (45%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V + +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 131 LKPIVIIVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 189
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 190 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 246
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 247 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 302
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 303 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSNKIVQYLQKIGFIDASGNIRKDPKNKS 362
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 363 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 409
>gi|307108283|gb|EFN56523.1| hypothetical protein CHLNCDRAFT_144139 [Chlorella variabilis]
Length = 528
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 140/342 (40%), Gaps = 71/342 (20%)
Query: 391 SSYGLVKQKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRS 450
++ GL Q+W F G +QIP P L + HGC + F+ +
Sbjct: 48 TNRGLKPQRWPKF--------------GRMHYYQIPPDPIGTLVIFHGCGRASKAFFPWN 93
Query: 451 PN-CPNCIGLPEERLLVLHALSRGFAIITIS--------------SAGRCWTLGEERLVV 495
P+ C C+G PE AL+ G+A++ + +GR G ++ V
Sbjct: 94 PDTCEECLGFPEHVSQTKQALALGYAVLALDPLDFRHLCWSSSDRGSGRLKHAGNDQPTV 153
Query: 496 REIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIA------LMIAEGLFDQM 549
R ++R ++ L++ PL+ LG SSGG LA L + A + A G+ +
Sbjct: 154 RLLLREFLTAQGLKEKPLLLLGVSSGGTLALKLAVALEKEAEAAGEGEWVPRASGVISEE 213
Query: 550 DIPEDY--------------PPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEF 595
+P D+ PPT+FV M + R G + LR+ G+ I
Sbjct: 214 SVPTDFGAADDDGKLRYRGFPPTIFVVMTGGNSRLDAPG-VVEFLRDGGVPADVITSGPR 272
Query: 596 PLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR------------WK 643
P+ FF+DRIP + Q S + E G +D +G++ D R R W
Sbjct: 273 PVVPTFFSDRIPAITQQQSRWIVEGMTYIGMLDADGWLLGDPERNRNVRKNTTSPAFNWD 332
Query: 644 EALR--------ESNAIFLNKNLVHHVQEELNLAFAYHEMTS 677
L+ +S + L+ +Q+ LN+A+A H+ S
Sbjct: 333 TKLKALLPWLADDSPTLSLSYRQC-GIQQALNVAYARHDAVS 373
>gi|157952374|ref|YP_001497266.1| hypothetical protein NY2A_B070R [Paramecium bursaria Chlorella
virus NY2A]
gi|155122601|gb|ABT14469.1| hypothetical protein NY2A_B070R [Paramecium bursaria Chlorella
virus NY2A]
Length = 433
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/289 (25%), Positives = 131/289 (45%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V + +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 128 LKPVVIMVDGLSHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 186
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 187 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 243
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 244 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 299
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 300 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSIKIVQYLQKIGFIDASGNIRKDPKDKA 359
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 360 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 406
>gi|307109995|gb|EFN58232.1| hypothetical protein CHLNCDRAFT_57065 [Chlorella variabilis]
Length = 496
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/330 (25%), Positives = 140/330 (42%), Gaps = 62/330 (18%)
Query: 423 WQIPDSPKAVLFLAHGCNGRAVHFWDRSP-NCPNCIGLPEERLLVLHALSRGFAIITISS 481
+Q+P+ P L + HGC A W P +C C+GLPEE AL+ GFA++ + S
Sbjct: 117 YQVPEKPIGTLVMFHGCYHDASGSWPYHPKHCRECLGLPEE------ALAHGFAVLAVES 170
Query: 482 -----AGRCWTLGEERLVVRE-----IIRWWVERHKLEKLPLVALGASSGGYFVSALAKG 531
GRC++ G + + + I+ ++ L+ LP+ LG S+G F + + K
Sbjct: 171 RNRSRKGRCFSSGTDPMTSDQWTAPYTIQNFLYETGLQNLPVYTLGISAGAAFATKVIKN 230
Query: 532 LRFSSIALMIA-EGLFDQMDIPE-----------------DYPPTLFVHMPKDSYRQRKI 573
+ +I G+ +++ P D+PP F+ M +D+ +I
Sbjct: 231 FWDPNFGGIIKPAGIISEVNAPSSWRSWGLEGKDGKLKFPDFPPVAFIAMERDNRTFNRI 290
Query: 574 GEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYM 633
+ + LR + I P+ R +F +R P + S ++ + GF+D +G +
Sbjct: 291 LDRIQDLRRFNVPADYIMVKSRPVDRLWFYNRSPVITARQSAEIVRAMKRLGFLDADGNL 350
Query: 634 RSDGRRTRWKEALRESNAIF-LNKNLVHHVQ----------------------EELNLAF 670
+ D R AL ++N + N+ LV V EE+N+A+
Sbjct: 351 KYDPRIG----ALTDNNPLAKWNRKLVGRVDWLHMSPSKAPMLSVLSDRSTIFEEMNVAW 406
Query: 671 AYHEMTSLPSEQIFKWFLPIGSLDLPSEAK 700
AYHE + W G LDL A+
Sbjct: 407 AYHEGVADHMVPCLMWLRSGGELDLKYLAR 436
>gi|448934664|gb|AGE58216.1| hypothetical protein PBCVNY2B_082R [Paramecium bursaria Chlorella
virus NY-2B]
Length = 415
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 130/289 (44%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 110 LKPVVITVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 168
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + V+++I +L PL G S+GG
Sbjct: 169 ALKAGYAILYISPADQKNKCFSAKTDSNTVKKVINQVRSSLRLNNKPLYIGGCSAGG--- 225
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 226 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 281
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 282 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSIKIVQYLQKIGFIDASGNIRKDPKDKA 341
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 342 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 388
>gi|448930575|gb|AGE54139.1| hypothetical protein PBCVIL52s1_206L [Paramecium bursaria Chlorella
virus IL-5-2s1]
Length = 411
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 130/289 (44%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 106 LKPVVITVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 164
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + V+++I +L PL G S+GG
Sbjct: 165 ALKAGYAILYISPADQKNKCFSAKTDSNTVKKVINQVRSSLRLNNKPLYIGGCSAGG--- 221
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 222 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 277
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 278 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSIKIVQYLQKIGFIDASGNIRKDPKDKA 337
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 338 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 384
>gi|157953248|ref|YP_001498139.1| hypothetical protein AR158_C057R [Paramecium bursaria Chlorella
virus AR158]
gi|156067896|gb|ABU43603.1| hypothetical protein AR158_C057R [Paramecium bursaria Chlorella
virus AR158]
Length = 415
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 130/289 (44%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 110 LKPVVITVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 168
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + V+++I +L PL G S+GG
Sbjct: 169 ALKAGYAILYISPADQKNKCFSAKTDSNTVKKVINQVRSSLRLNNKPLYIGGCSAGG--- 225
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 226 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 281
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 282 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSIKIVQYLQKIGFIDASGNIRKDPKDKG 341
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 342 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 388
>gi|448933614|gb|AGE57169.1| hypothetical protein PBCVNEJV4_051R [Paramecium bursaria Chlorella
virus NE-JV-4]
Length = 448
Score = 92.0 bits (227), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 131/289 (45%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 143 LKPIVITVDGRQHVYQSPSNPKGLVVFLHGCFRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 201
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 202 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 258
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G + VLR
Sbjct: 259 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-VNVLR 314
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + ++F+D+I + NS+K+ + Q+ FID +G + D +
Sbjct: 315 RFGKPAAVLVSGKRKIYPDYFSDQIASISVQNSSKIVQYLQKIRFIDASGNILKDPKDKS 374
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 375 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 421
>gi|307109925|gb|EFN58162.1| hypothetical protein CHLNCDRAFT_141972 [Chlorella variabilis]
Length = 366
Score = 92.0 bits (227), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 125/312 (40%), Gaps = 44/312 (14%)
Query: 422 IWQIPDSPKAVLFLAHGCNGRAVHFWDRSPN-CPNCIGLPEERLLVLHALSRGFAIITIS 480
+Q+P+ P HGC A W P+ CP C+GLPEE AL+RG+A++ +
Sbjct: 7 FYQVPEDPIGTFIWFHGCVHDASAGWPYDPDECPECLGLPEE------ALARGYAVLAVE 60
Query: 481 SAGR-----CWTLGEERLV-----VREIIRWWVERHKLEKLPLVALGASSGGYFVSALAK 530
S R C+ + + V +II +V +L+ P+ G SSG F + K
Sbjct: 61 SKNRERKERCFNYSPDETISDAYQVPKIIERFVRVKRLQDKPIYTAGVSSGASFAVKIPK 120
Query: 531 GLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHM--------PKDSYRQRKIGEFLVVLRN 582
G S + I + + IP + L P D + + ++ E + V R+
Sbjct: 121 GFYSSENEVKIHGAISEANAIPIESWGLLDTRGQLRYPGFPPNDPFSRVQMLENIDVFRS 180
Query: 583 KGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR---- 638
GI + + ++ +FF +R P + S + + + G +DE+G + D R
Sbjct: 181 VGIPSDWVSVWQRSINESFFFERSPTITMPQSAAIVKALKAMGVVDEDGNLMGDPRIDAP 240
Query: 639 ---------------RTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQI 683
R W R + + + EE+N+AFA HE+ +
Sbjct: 241 NRTSVMHKWNFKLVQRVPWLNLNRNKPPMLTALSDRGAIFEEMNVAFARHEIIADYLVPC 300
Query: 684 FKWFLPIGSLDL 695
W G DL
Sbjct: 301 LAWLEKEGKADL 312
>gi|448927800|gb|AGE51372.1| hypothetical protein PBCVCviKI_040R [Paramecium bursaria Chlorella
virus CviKI]
Length = 416
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 129/289 (44%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 111 LKPIVITVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 169
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 170 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 226
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 227 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 282
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS+K+ + Q+ FID +G + D +
Sbjct: 283 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSSKIVQYLQKIRFIDASGNILKDPKDKS 342
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 343 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 389
>gi|9631610|ref|NP_048389.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|624069|gb|AAC96409.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
Length = 412
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 129/289 (44%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P +PK ++ HGC W RS N P GL E+
Sbjct: 107 LKPIVITVDGRQHVYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 165
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 166 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 222
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 223 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 278
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G A + + + +F+D+I + NS+K+ + Q+ FID +G + D +
Sbjct: 279 RFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSSKIVQYLQKIRFIDASGNILKDPKDKS 338
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 339 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 385
>gi|448928848|gb|AGE52417.1| hypothetical protein PBCVCvsA1_118R [Paramecium bursaria Chlorella
virus CvsA1]
gi|448931576|gb|AGE55137.1| hypothetical protein PBCVMA1E_031R [Paramecium bursaria Chlorella
virus MA-1E]
Length = 412
Score = 89.0 bits (219), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 127/289 (43%), Gaps = 21/289 (7%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G ++Q P PK ++ HGC W RS N P GL E+
Sbjct: 107 LKPIVITVDGRQHVYQSPSDPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 165
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A +C++ + ++++I +L PL G S+GG
Sbjct: 166 ALKAGYAILYISPADQKNKCFSAKTDPNTIKKVINQVRSSLRLNNKPLYIGGCSAGG--- 222
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ + P +L+ + Q G VLR
Sbjct: 223 -GLAQ--RLVASGFIQCNGMFNESATSAEPSNKTPASLWTVLSTPKELQVATGR-ANVLR 278
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
G + + + +F+D+I + NS+K+ + Q+ FID +G + D +
Sbjct: 279 RFGKPAGVLVSGKRKIYPEYFSDQIASISVQNSSKIVQYLQKIRFIDASGNILKDPKDKS 338
Query: 642 WKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 339 WYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 385
>gi|147790169|emb|CAN70123.1| hypothetical protein VITISV_033217 [Vitis vinifera]
Length = 500
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 71/106 (66%), Gaps = 2/106 (1%)
Query: 65 QVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITM 124
+V ++++Y+G+W + +Y + G + KGI + I+YKEL+ ++ I+K+D ++M
Sbjct: 10 EVGIMLLYEGEWVRDRNVFY-FEGSKGKGIEIPKTISYKELLGVVHHILKLDPTNCFLSM 68
Query: 125 KSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERK 170
K ++ ANIPT P+++ DD DV+FFI N + +L P+CIT+E++
Sbjct: 69 KCVFNANIPTSPIQLTDDGDVKFFIGLNCTN-GKLPVPLCITVEKR 113
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 37/62 (59%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ ++SKK+L +KL + A K+ F FK +KST V C D CKWR+R + ++F
Sbjct: 269 QIYSSKKELQRKLYMMALKRKFEFKTTKSTTKLLLVECFDKECKWRVRELRNLLQKWFVT 328
Query: 353 RR 354
R+
Sbjct: 329 RQ 330
>gi|359488303|ref|XP_002275053.2| PREDICTED: uncharacterized protein LOC100256986 [Vitis vinifera]
Length = 1111
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++Y+G+W + +Y + G + KGI + I+YKEL+ ++ I+K+D +
Sbjct: 8 VDEVGIMLLYEGEWVRDGNVFY-FEGSKGKGIEIPKTISYKELLGVVHHILKLDPTNCFL 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERK 170
+MK ++ ANIPT P+++ DD DV+FFI N + +L P+CIT+E++
Sbjct: 67 SMKYVFNANIPTSPIQLTDDGDVKFFIGLNCTNG-KLPVPLCITVEKR 113
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 42/69 (60%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ ++SKK+L +KL + A K+ F FK +KST V C D CKWR+RATK F+I
Sbjct: 269 QIYSSKKELQRKLYMMALKRKFEFKTTKSTTKLLLVECFDKECKWRVRATKLGISNMFQI 328
Query: 353 RRFSNIHIC 361
+F + H C
Sbjct: 329 MKFYSTHTC 337
>gi|448930937|gb|AGE54500.1| hypothetical protein PBCVKS1B_121R [Paramecium bursaria Chlorella
virus KS1B]
Length = 417
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/294 (25%), Positives = 132/294 (44%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
LNP V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 112 LNPVVITVDGRQHAYQSPANPKGLVVFLHGCSRSIYGAWPRSSN-PKFFGYSEDVSRTKQ 170
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS A + C++ + V+++I +L PL G S+GG
Sbjct: 171 ALKAGYAILYISPADQKNNCFSAKTDSETVKKVINQVRSSLRLNDKPLFIGGCSAGG--- 227
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHM--PKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ M PK+ + E + V
Sbjct: 228 -GLAQ--RLVASGFITCNGMFNESATSADPSNKTPASLWTVMSTPKE---LQVANERVNV 281
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFI+ +G +++D +
Sbjct: 282 LRRFGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFINASGNIKTDPKD 341
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + ++V +A+A H+ ++ KW
Sbjct: 342 DKSWYFTLGKTVPIPETTIGFWDSSVVQATM----VAYAVHDAVAVYMTTFLKW 391
>gi|307104865|gb|EFN53117.1| hypothetical protein CHLNCDRAFT_137468 [Chlorella variabilis]
Length = 497
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/264 (24%), Positives = 118/264 (44%), Gaps = 43/264 (16%)
Query: 459 LPEERLLVLHALSRGFAIITISSAGR----CWTLGEERLV-----VREIIRWWVERHKLE 509
LPEE L+RG+A++ ++S R C++ ++ + + I+ + ++ +++
Sbjct: 137 LPEEVAHTKQCLARGYAVLALASKDRTRMRCFSAAGDQTLSDMVDAMDAIKSFTKKLRIQ 196
Query: 510 KLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPED--------------- 554
K P+ G S+G F ++F M +GL ++++P +
Sbjct: 197 KKPIYMWGISAGASFA------VKFP--LTMPIDGLVSEVNMPWEKKWHATDGAGTLKTR 248
Query: 555 YPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANS 614
+PPT F M +D +R+I + + +LRN G++ + FPL +FF R L + S
Sbjct: 249 FPPTAFYQMERDDQTRRQIAKAVEILRNNGVNAESVFIPSFPLYSSFFWRRSIYLTRQQS 308
Query: 615 TKLFELFQEKGFIDENGYMRSDGRRT-RWKEALRESNAIFLNKNLVHH----------VQ 663
+ + G ID+ G++ D R +WK LRE N N +++ +
Sbjct: 309 AAIVNGLKAIGAIDKQGWLNYDPRTVQKWKGKLREVLPWLGNINDINNPYNLNSDESQIT 368
Query: 664 EELNLAFAYHEMTSLPSEQIFKWF 687
E +NLA+A HE+ W
Sbjct: 369 EAMNLAWAQHEIVGEHVRPTLMWL 392
>gi|452821320|gb|EME28352.1| hypothetical protein Gasu_41910 [Galdieria sulphuraria]
Length = 540
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 108/232 (46%), Gaps = 43/232 (18%)
Query: 419 TDVIWQIPDSP-KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAII 477
+ ++ P+S +VL L HGC A ++ LPEE +V AL R F+++
Sbjct: 316 SSIVQHSPESDIHSVLLLFHGCKHSARDWF----------VLPEEVSVVCEALRRNFSVV 365
Query: 478 TISSAGR---CW-----TLGEERLV-VREIIRWWV----------ERHKLEKLPLVALGA 518
SS R CW LG + ++ V + + W+ + ++ L L ALG
Sbjct: 366 AFSSVDRWSGCWDSFYPALGNKDVIRVHQSFQEWIFENFVNNQTNSQQAIQSLKLYALGV 425
Query: 519 SSGGYFVSALAKGLR-FSSIALMIAEGLFDQMDIPE--DYPPTLFVHMPKDSYRQRKIGE 575
SSGG FVS L+ LR S+ A+ I+ G F + + YPPT FVHM KD+ G
Sbjct: 426 SSGGSFVSILSTFLRNISAQAIYISPGSFQSFTLFDWNPYPPTFFVHMLKDT----TFGS 481
Query: 576 FLVV------LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELF 621
F V L I + P++ ++F ++IP + + S +LF+L
Sbjct: 482 FNHVNSSCNLLTQHKIACRILSLKPVPITADWFHEKIPIISLSLSRQLFDLL 533
>gi|448934727|gb|AGE58279.1| hypothetical protein PBCVNY2B_254R [Paramecium bursaria Chlorella
virus NY-2B]
Length = 368
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 139/325 (42%), Gaps = 40/325 (12%)
Query: 390 NSSYGLVKQKWNSF--------------DSSVQLNPTVELRNGTDVIWQIPDSPKAVLFL 435
+ S +++QKW + L P V +G +Q P +PK ++
Sbjct: 29 DESISILRQKWTDIGPINAKRLQDGYKRQTEAGLKPVVITVDGRQHAYQSPSNPKGLVVF 88
Query: 436 AHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGR---CWTLGEER 492
HGC W RS N P GL E+ AL G+AI+ IS + C++ +
Sbjct: 89 LHGCARSVFGGWPRSSN-PKFFGLSEDVSRTKQALKAGYAILYISPEDQKTNCFSAKTDP 147
Query: 493 LVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIP 552
++++I +L + PL G S+GG LA+ R + + G+F++
Sbjct: 148 DTIKKVINQMRSSLRLNEKPLYIGGCSAGG----GLAQ--RLIASGFLSCNGMFNESATT 201
Query: 553 ED----YPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPG 608
D P +L+ + Q I E + L + G A + + + +F+D+I
Sbjct: 202 ADPTNKTPASLWTVLSTPKELQVAI-ERVNALHHFGKPAAVLVSGKRKIYPEYFSDQIAS 260
Query: 609 LEQANSTKLFELFQEKGFIDENGYMRSDGRRTR-WKEALRESNAI------FLNKNLVHH 661
+ NS K+ ++ ++ GF+D +G ++S+ + + W L +S I F + ++V
Sbjct: 261 VSVQNSIKMVDVLKKIGFVDSSGNIKSEPKDDKSWYVTLGKSVPIPETTISFWDSSVVQA 320
Query: 662 VQEELNLAFAYHEMTSLPSEQIFKW 686
+A+A+H+ ++ KW
Sbjct: 321 TM----VAYAHHDAVAVYMTTFLKW 341
>gi|307106123|gb|EFN54370.1| hypothetical protein CHLNCDRAFT_135657 [Chlorella variabilis]
Length = 463
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/291 (21%), Positives = 114/291 (39%), Gaps = 41/291 (14%)
Query: 430 KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTL- 488
+ HGC ++ +S CP C GLPEE L+RG+A++ + S R +
Sbjct: 146 RGTCIFVHGCKHDPHSWFYKSKKCPRCTGLPEEVSHSKQCLARGYAVLALMSLDRRYRFR 205
Query: 489 ---------GEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIAL 539
+ R+++ W+ + G SSG F ++F
Sbjct: 206 CFSSSGGPGSNDHPSARDLVTNWMNTFNMMGKAKFIFGVSSGASF------AVKFPMTMW 259
Query: 540 MIAEGLFDQMDIPED---------------YPPTLFVHMPKDSYRQRKIGEFLVVLRNKG 584
M +G+ ++++P + +PPT F M +D +R+I + +L G
Sbjct: 260 M--DGVVSEVNMPWEKAWGATDGRGNLKVRFPPTAFYMMQRDDQTRRQIEQATYILNRNG 317
Query: 585 IDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR-WK 643
+ + P+ R + R + + S ++++ + ID G + D R R W
Sbjct: 318 VPAEYVYVPARPVHRTWLAQRSVYINKQQSAQIWQALVKARVIDRRGNVLYDVRTNRKWV 377
Query: 644 EALRESNAIFLNK----NLV---HHVQEELNLAFAYHEMTSLPSEQIFKWF 687
L ++ NLV + +ELNLA+A+HE+ S W
Sbjct: 378 NQLLKAVPWLKQNDPFYNLVSDESQIWQELNLAWAWHEIVSDYVRATLAWL 428
>gi|448936537|gb|AGE60084.1| hypothetical protein ATCVWI0606_411L [Acanthocystis turfacea
Chlorella virus WI0606]
Length = 446
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 124/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S V L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 138 SDVGLKPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 196
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 197 RTKQCLVNGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 256
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D PP+L+ + + R G +
Sbjct: 257 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPPSLWTVLSTPGEKARAEGH-VK 309
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 310 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 369
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 370 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 420
>gi|359488047|ref|XP_003633694.1| PREDICTED: uncharacterized protein LOC100241533 [Vitis vinifera]
Length = 734
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 123/299 (41%), Gaps = 63/299 (21%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++Y+G+W + +Y + G + KGI + I+YKEL+ ++ I+K+D +
Sbjct: 8 VDEVGIMLLYEGEWVRDGNVFY-FEGSKGKGIEIPKTISYKELLRVVHHILKLDPTNCFL 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHA 182
+MK ++ NA PT SPI +T
Sbjct: 67 SMKYVF-----------------------NANIPT---SPIQLT---------------- 84
Query: 183 ALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILA 242
+ + + G+ T VE ++ NG F E+ G G A
Sbjct: 85 ------------DDGDVKFFI-GLNCTNVETLTVDGENG----PRFQNESLEGTGS---A 124
Query: 243 ANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLI 302
+ E+ D + + R N I S + + + ++SKK+L
Sbjct: 125 QSAEILTMIDTSDGFIHDNPTIIEDVANERQNMMQQPIVSGISDDHLEEHQIYSSKKELQ 184
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
+KL + A K+ F FK +KST V C D CKWR+RATK F+I +F + H C
Sbjct: 185 RKLYMMALKRKFEFKTTKSTTKLLLVECFDKECKWRVRATKLGISNMFQIMKFYSTHTC 243
>gi|448935101|gb|AGE58652.1| hypothetical protein PBCVNYs1_235R [Paramecium bursaria Chlorella
virus NYs1]
Length = 367
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 136/327 (41%), Gaps = 39/327 (11%)
Query: 387 EKPNSSYGLVKQKWNSF--------------DSSVQLNPTVELRNGTDVIWQIPDSPKAV 432
+K + + +++QKW + L P V +G +Q P +PK +
Sbjct: 26 KKHDDAISILRQKWTDIGPINAKRLQDGYKRQTEAGLKPVVITVDGRQHAYQSPSNPKGL 85
Query: 433 LFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGR---CWTLG 489
+ HGC W RS N P GL E+ AL G+AI+ IS + C++
Sbjct: 86 VVFLHGCARSVFGGWPRSSN-PKFFGLSEDVSRTKQALKAGYAILYISPEDQKTNCFSAK 144
Query: 490 EERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQM 549
+ ++++I L + PL G S+GG LA+ R + + G+F++
Sbjct: 145 TDPDTIKKVINQVRSALHLNEKPLYIGGCSAGG----GLAQ--RLVASGFLSCNGMFNES 198
Query: 550 DIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDR 605
D P +L+ + Q E + L + G A + + + +F+D+
Sbjct: 199 ATTADPTNKTPASLWTVLSTPKEFQVAT-ERVNALHHFGKPAAVLVSGKRKIYPEYFSDQ 257
Query: 606 IPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALRESNAI------FLNKNLV 659
I + NS K+ ++ ++ GF+D +G ++SD W L +S I F + ++V
Sbjct: 258 IASVSVQNSIKMVDVLKKIGFVDSSGNIKSDKDDKSWYVTLGKSVPIPETTISFWDSSVV 317
Query: 660 HHVQEELNLAFAYHEMTSLPSEQIFKW 686
+A+A H+ ++ KW
Sbjct: 318 QATM----VAYAVHDAIAVYMTVFLKW 340
>gi|448931239|gb|AGE54801.1| hypothetical protein PBCVMA1D_131R [Paramecium bursaria Chlorella
virus MA-1D]
Length = 367
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 136/327 (41%), Gaps = 39/327 (11%)
Query: 387 EKPNSSYGLVKQKWNSF--------------DSSVQLNPTVELRNGTDVIWQIPDSPKAV 432
+K + + +++QKW + L P V +G +Q P +PK +
Sbjct: 26 KKHDEAISILRQKWTDIGPINAKRLQDGYKRQTEAGLKPVVITVDGRQHAYQSPSNPKGL 85
Query: 433 LFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGR---CWTLG 489
+ HGC W RS N P GL E+ AL G+AI+ IS + C++
Sbjct: 86 VVFLHGCARSVFGGWPRSSN-PKFFGLSEDVSRTKQALKAGYAILYISPEDQKTNCFSAK 144
Query: 490 EERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQM 549
+ ++++I L + PL G S+GG LA+ R + + G+F++
Sbjct: 145 TDPDTIKKVINQVRSALHLNEKPLYIGGCSAGG----GLAQ--RLVASGFLSCNGMFNES 198
Query: 550 DIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDR 605
D P +L+ + Q E + L + G A + + + +F+D+
Sbjct: 199 ATTADPTNKTPASLWTVLSTPKEFQVAT-ERVNALHHFGKPAAVLVSGKRKIYPEYFSDQ 257
Query: 606 IPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTRWKEALRESNAI------FLNKNLV 659
I + NS K+ ++ ++ GF+D +G ++SD W L +S I F + ++V
Sbjct: 258 IASVSVQNSIKMVDVLKKIGFVDSSGNIKSDKDDKSWYVTLGKSVPIPETTISFWDSSVV 317
Query: 660 HHVQEELNLAFAYHEMTSLPSEQIFKW 686
+A+A H+ ++ KW
Sbjct: 318 QATM----VAYAVHDAIAVYMTVFLKW 340
>gi|448932404|gb|AGE55963.1| hypothetical protein ATCVMO0605SPH_394L [Acanthocystis turfacea
Chlorella virus MO0605SPH]
Length = 446
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 124/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 138 SDAGLKPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 196
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 197 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 256
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D PP+L+ + + R G +
Sbjct: 257 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPPSLWTVLSTPGEKARAEGH-VK 309
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 310 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 369
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 370 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 420
>gi|9631740|ref|NP_048519.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|1181334|gb|AAC96539.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
Length = 387
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 130/294 (44%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 82 LKPVVITVDGRQHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 140
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS + C++ + ++++I +L PL G S+GG
Sbjct: 141 ALKVGYAILYISPTDQKNNCFSAKTDPETIKKVINQVRSGLRLNDKPLFIGGCSAGG--- 197
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ + PK+ + E + V
Sbjct: 198 -GLAQ--RLVASGFLTCNGMFNESATSADPSSKTPASLWTVLSTPKE---LQVANERVNV 251
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFID +G +++D +
Sbjct: 252 LRRFGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFIDASGNIKTDPKD 311
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + +V V +A+A H+ ++ KW
Sbjct: 312 DKSWYFTLGKTVPIPETTIGFWDSGVVMSVM----VAYAVHDAVAVYMTTFLKW 361
>gi|448931643|gb|AGE55204.1| hypothetical protein PBCVMA1E_187R [Paramecium bursaria Chlorella
virus MA-1E]
Length = 387
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 130/294 (44%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 82 LKPIVITVDGRQHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 140
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS + C++ + ++++I +L PL G S+GG
Sbjct: 141 ALKAGYAILYISPTDQKNNCFSAKTDPETIKKVINQVRSGLRLNDKPLFIGGCSAGG--- 197
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ + PK+ + E + V
Sbjct: 198 -GLAQ--RLVASGFLACNGMFNESATSADPSSKTPASLWTVLSTPKE---LQVANERVNV 251
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFID +G +++D +
Sbjct: 252 LRRFGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFIDASGNIKTDPKD 311
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + +V V +A+A H+ ++ KW
Sbjct: 312 DKSWYFTLGKTVQIPETTIGFWDSGVVMSVM----VAYAVHDAVAVYMTTFLKW 361
>gi|448928881|gb|AGE52450.1| hypothetical protein PBCVCvsA1_190R [Paramecium bursaria Chlorella
virus CvsA1]
Length = 401
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 130/294 (44%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 96 LKPIVITVDGRQHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 154
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS + C++ + ++++I +L PL G S+GG
Sbjct: 155 ALKAGYAILYISPTDQKNNCFSAKTDPETIKKVINQVRSGLRLNDNPLFIGGCSAGG--- 211
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ + PK+ + E + V
Sbjct: 212 -GLAQ--RLVASGFLACNGMFNESATSADPSSKTPASLWTVLSTPKE---LQVANERVNV 265
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFID +G +++D +
Sbjct: 266 LRRIGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFIDASGNIKTDPKD 325
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + +V V +A+A H+ ++ KW
Sbjct: 326 DKSWYFTLGKTVQIPETTIGFWDSGVVMSVM----VAYAVHDAVAVYMTTFLKW 375
>gi|448927863|gb|AGE51435.1| hypothetical protein PBCVCviKI_183R [Paramecium bursaria Chlorella
virus CviKI]
Length = 387
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 130/294 (44%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 82 LKPIVITVDGRQHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 140
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS + C++ + ++++I +L PL G S+GG
Sbjct: 141 ALKAGYAILYISPTDQKNNCFSAKTDPETIKKVINQVRSGLRLNDNPLFIGGCSAGG--- 197
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ + PK+ + E + V
Sbjct: 198 -GLAQ--RLVASGFLACNGMFNESATSADPSSKTPASLWTVLSTPKE---LQVANERVNV 251
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFID +G +++D +
Sbjct: 252 LRRIGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFIDASGNIKTDPKD 311
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + +V V +A+A H+ ++ KW
Sbjct: 312 DKSWYFTLGKTVQIPETTIGFWDSGVVMSVM----VAYAVHDAVAVYMTTFLKW 361
>gi|448933399|gb|AGE56955.1| hypothetical protein ATCVNEJV3_385L [Acanthocystis turfacea
Chlorella virus NE-JV-3]
Length = 446
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 124/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 138 SDAGLRPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 196
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 197 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 256
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D PP+L+ + + R G +
Sbjct: 257 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPPSLWTVLSTPGEKARAEGH-VK 309
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 310 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 369
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 370 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 420
>gi|155371277|ref|YP_001426811.1| hypothetical protein ATCV1_Z330L [Acanthocystis turfacea Chlorella
virus 1]
gi|155124597|gb|ABT16464.1| hypothetical protein ATCV1_Z330L [Acanthocystis turfacea Chlorella
virus 1]
Length = 462
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 124/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 154 SDAGLRPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 212
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 213 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 272
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D PP+L+ + + R G +
Sbjct: 273 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPPSLWTVLSTPGEKARAEGH-VK 325
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 326 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 385
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 386 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 436
>gi|448935855|gb|AGE59404.1| hypothetical protein ATCVOR07043_385L [Acanthocystis turfacea
Chlorella virus OR0704.3]
Length = 482
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 124/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 174 SDAGLRPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 232
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 233 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 292
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQ----MDIPEDYPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D PP+L+ + + R G +
Sbjct: 293 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPPSLWTVLSTPGEKARAEGH-VK 345
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 346 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 405
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 406 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 456
>gi|448925897|gb|AGE49475.1| hypothetical protein ATCVCan0610SP_390L [Acanthocystis turfacea
Chlorella virus Can0610SP]
Length = 466
Score = 79.3 bits (194), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 125/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 158 SDAGLRPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 216
Query: 465 LVLHALSRGFAIITIS--SAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S +A C++ + + I L+ PL G S+GG
Sbjct: 217 RTKQCLANGYAILYLSPEAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 276
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D PP+L+ + + R G +
Sbjct: 277 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPPSLWTVLSTPGEKARAEGH-VK 329
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 330 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 389
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 390 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 440
>gi|448933681|gb|AGE57236.1| hypothetical protein PBCVNEJV4_198R [Paramecium bursaria Chlorella
virus NE-JV-4]
Length = 387
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 129/294 (43%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 82 LKPVVITVDGRHHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 140
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
L G+AI+ IS + C++ + ++++I +L PL G S+GG
Sbjct: 141 VLKAGYAILYISPTDQKNNCFSAKTDSETIKKVINQVRSGLRLNDKPLFIGGCSAGG--- 197
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ + PK+ + E + V
Sbjct: 198 -GLAQ--RLVASGFLTCNGMFNESATSADPSSKTPASLWTVLSTPKE---LQVANERVNV 251
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFID +G +++D +
Sbjct: 252 LRRFGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFIDASGNIKTDPKD 311
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + +V V +A+A H+ ++ KW
Sbjct: 312 DKSWYFTLGKTVPIPETTIGFWDSGVVMSVM----VAYAVHDAVAVYMTTFLKW 361
>gi|448925565|gb|AGE49144.1| hypothetical protein ATCVBr0604L_380L [Acanthocystis turfacea
Chlorella virus Br0604L]
Length = 492
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 126/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S V L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 184 SDVGLKPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-QFFGMPEDVG 242
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 243 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 302
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQ----MDIPEDYPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + +G+F++ D + PP+L+ + + R G +
Sbjct: 303 GMIQ------RLVADGSIKCDGMFNESATSADPSQKTPPSLWTVLSTPGEKARAEGH-VK 355
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ G A + + ++ +FF ++ + NS K+ + G ID +G + D +
Sbjct: 356 SMKGFGTPAAVLVSGKRIITPDFFYNQYASISMENSKKIAASLRSSGLIDASGNIVKDPK 415
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL ++A H+ S KW
Sbjct: 416 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTSYAVHDAVSCYMTTFLKW 466
>gi|448929955|gb|AGE53521.1| hypothetical protein ATCVGM07011_388L, partial [Acanthocystis
turfacea Chlorella virus GM0701.1]
Length = 383
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 126/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 75 SDAGLKPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-QFFGMPEDVG 133
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 134 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGNPLYLGGCSAGG 193
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQ----MDIPEDYPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + +G+F++ D + PP+L+ + S + R G +
Sbjct: 194 GMIQ------RLVADGSIKCDGMFNESATSADPSQKTPPSLWTVLSTPSEKARAEGH-VK 246
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ G A + + ++ +FF ++ + NS K+ + G ID +G + D +
Sbjct: 247 AMKGFGTPAAVLVSGKRIITPDFFYNQYTSISMENSKKIAASLRSSGLIDASGNIVKDPK 306
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL ++A H+ S KW
Sbjct: 307 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTSYAVHDAVSCYMTTFLKW 357
>gi|448924811|gb|AGE48392.1| hypothetical protein PBCVAN69C_192R [Paramecium bursaria Chlorella
virus AN69C]
gi|448930241|gb|AGE53806.1| hypothetical protein PBCVIL3A_188R [Paramecium bursaria Chlorella
virus IL-3A]
Length = 387
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 129/294 (43%), Gaps = 30/294 (10%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 82 LKPVVITVDGRHHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 140
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
L G+AI+ IS + C++ + ++++I +L PL G S+GG
Sbjct: 141 VLKAGYAILYISPTDQKNNCFSAKTDPETIKKVINQVRSGLRLNDKPLFIGGCSAGG--- 197
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEFLVV 579
LA+ R + + G+F++ D P +L+ + PK+ + E + V
Sbjct: 198 -GLAQ--RLVASGFLTCNGMFNESATSADPSSKTPASLWTVLSTPKE---LQVANERVNV 251
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
LR G + + + +F+D+I + NS K+ ++ ++ GFID +G +++D +
Sbjct: 252 LRRFGKPAGVLVSGKRKIYPEYFSDQIASISVQNSIKMVDVLKKVGFIDASGNIKTDPKD 311
Query: 640 TR-WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ W L ++ I F + +V V +A+A H+ ++ KW
Sbjct: 312 DKSWYFTLGKTVPIPETTIGFWDSGVVMSVM----VAYAVHDAVAVYMTTFLKW 361
>gi|307107113|gb|EFN55357.1| hypothetical protein CHLNCDRAFT_134386 [Chlorella variabilis]
Length = 498
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 88/197 (44%), Gaps = 7/197 (3%)
Query: 498 IIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDI---PED 554
I+ ++++ +L LPL ALG S GG FV+ LA ++ + + Q D+
Sbjct: 24 ILGSFLDQQRLRHLPLYALGVSVGGGFVAKLAAHIKMDGVVSEVLGPKPVQWDLDVYQYG 83
Query: 555 YPPTLFVHMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANS 614
PP +FV M +D +I LR +G A + + FF+DR L S
Sbjct: 84 LPPIVFVSMDQDPPMAARISACSARLRQRGGLTATVRVAPRVVYPTFFSDRSSRLSPEVS 143
Query: 615 TKLFELFQEKGFIDENGYMRSDGRRTR--WKEALRESNAIFLNKNLVHHVQ--EELNLAF 670
K+ E +DE G + D R TR W + L E + H Q EE+NLA+
Sbjct: 144 AKVVAALHEIDMLDERGMVTVDPRHTRRPWVKQLVELVPELNGGMMADHSQVWEEMNLAW 203
Query: 671 AYHEMTSLPSEQIFKWF 687
+ HE+ S + F WF
Sbjct: 204 SSHEIISDFATPAFVWF 220
>gi|147789349|emb|CAN62351.1| hypothetical protein VITISV_004945 [Vitis vinifera]
Length = 684
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 128/298 (42%), Gaps = 57/298 (19%)
Query: 69 LVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLY 128
+ +Y+G+ + +Y + G + KGI + I+YKEL+ ++ I+K+D ++MK ++
Sbjct: 1 MXLYEGEXVRDGNVFY-FEGSKGKGIEIPKTISYKELLGVVHHILKLDPTNCFLSMKYVF 59
Query: 129 KANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQT 188
ANIPT +++ D+ DV+FFI N + +L P+CI +E++ + + +
Sbjct: 60 NANIPTSLIQLTDNGDVKFFIGLNCTNG-KLSVPLCIIVEKRIDNHNQKSICNFYFECHM 118
Query: 189 PAASRSESNQISVVVPG---MQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANI 245
P+ E N S+++ + VE ++ NG F E+ G
Sbjct: 119 PSEIDKELNGDSMLMQKSRHIHCDSVETLTVDGENG----PRFQNESLEG---------- 164
Query: 246 EVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKL 305
+ N AIN ED TS K +
Sbjct: 165 ----------------------YKVHDWNXNETAINE----EDYRMNTNPTSDKQVT--- 195
Query: 306 KLGAAKKNFAFKVSKSTKDRFEVVCADTN--CKWRLRATKTAEDEYFEIRRFSNIHIC 361
++G +F+ S ST+ + DTN CKWR+RATK F+I +F + H C
Sbjct: 196 QIG------SFRTS-STQSAEILTMIDTNDECKWRVRATKLGISNMFQIMKFYSTHTC 246
>gi|448936198|gb|AGE59746.1| hypothetical protein ATCVTN60342_379L [Acanthocystis turfacea
Chlorella virus TN603.4.2]
Length = 490
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/295 (23%), Positives = 125/295 (42%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 182 SDAGLKPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-QFFGMPEDVG 240
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 241 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 300
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQ----MDIPEDYPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + +G+F++ D + PP+L+ + + R G +
Sbjct: 301 GMIQ------RLVADGSIKCDGMFNESATSADPSQKTPPSLWTVLSTPGEKARAEGH-VK 353
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ G A + + ++ +FF ++ + NS K+ + G ID +G + D +
Sbjct: 354 AMKGFGTPAAVLVSGKRIITPDFFYNQYASISMENSKKIAASLRSSGLIDASGNIVKDPK 413
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL ++A H+ S KW
Sbjct: 414 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTSYAVHDAVSCYMTTFLKW 464
>gi|157952528|ref|YP_001497420.1| hypothetical protein NY2A_B224R [Paramecium bursaria Chlorella
virus NY2A]
gi|155122755|gb|ABT14623.1| hypothetical protein NY2A_B224R [Paramecium bursaria Chlorella
virus NY2A]
Length = 368
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 129/292 (44%), Gaps = 26/292 (8%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC W RS N P GL E+
Sbjct: 62 LKPVVITVDGRQHAYQSPSNPKGLVVFLHGCVRSVFGGWPRSSN-PKFFGLSEDVSRTKQ 120
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS + C++ + ++++I +L + PL G S+GG
Sbjct: 121 ALKAGYAILYISPEDQKTNCFSAKTDPDTIKKVINQVRSSLRLNEKPLYIGGCSAGG--- 177
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ D P +L+ + Q + + L
Sbjct: 178 -GLAQ--RLVASGFLSCNGMFNESATTADPSNKTPASLWTVLSTPQELQVAT-DRVNALH 233
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
+ G A + + + +F+D+I + NS K+ ++ ++ GF+D +G ++S+ + +
Sbjct: 234 HFGKPAAVLVSGKRKIYPEYFSDQIASVSVQNSIKMVDVLKKIGFVDSSGNIKSEPKDDK 293
Query: 642 -WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
W L +S I F + ++V +A+A+H+ ++ KW
Sbjct: 294 SWYVTLGKSVPIPETTISFWDSSVVQATM----VAYAHHDAVAVYMTTFLKW 341
>gi|157953397|ref|YP_001498288.1| hypothetical protein AR158_C206R [Paramecium bursaria Chlorella
virus AR158]
gi|156068045|gb|ABU43752.1| hypothetical protein AR158_C206R [Paramecium bursaria Chlorella
virus AR158]
gi|448930592|gb|AGE54156.1| hypothetical protein PBCVIL52s1_246R [Paramecium bursaria Chlorella
virus IL-5-2s1]
Length = 368
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 129/292 (44%), Gaps = 26/292 (8%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L V +G +Q P +PK ++ HGC W RS N P GL E+
Sbjct: 62 LKSVVITVDGRQHAYQSPSNPKGLVVFLHGCARSVFGGWPRSSN-PKFFGLSEDVSRTKQ 120
Query: 469 ALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
AL G+AI+ IS + C++ + ++++I +L + PL G S+GG
Sbjct: 121 ALKAGYAILYISPEDQKTNCFSAKTDPDTIKKVINQVRSSLRLNEKPLYIGGCSAGG--- 177
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLVVLR 581
LA+ R + + G+F++ D P +L+ + Q I E + L
Sbjct: 178 -GLAQ--RLIASGFLSCNGMFNESATTADPTNKTPASLWTVLSTPKELQVAI-ERVNALH 233
Query: 582 NKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRRTR 641
+ G A + + + +F+D+I + NS K+ ++ ++ GF+D +G ++S+ + +
Sbjct: 234 HFGKPAAVLVSGKRKIYPEYFSDQIASVSVQNSIKMVDVLKKIGFVDSSGNIKSEPKDDK 293
Query: 642 -WKEALRESNAI------FLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
W L +S I F + ++V +A+A+H+ ++ KW
Sbjct: 294 SWYVTLGKSVPIPETTISFWDSSVVQATM----VAYAHHDAVAVYMTTFLKW 341
>gi|448933065|gb|AGE56622.1| hypothetical protein ATCVNEJV2_418L [Acanthocystis turfacea
Chlorella virus NE-JV-2]
Length = 454
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 123/295 (41%), Gaps = 25/295 (8%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 146 SDAGLRPTIVNIDGRSQIYQSPKNAKGLVVFLHGCSRTAFGGWPMSSGS-KFFGMPEDVG 204
Query: 465 LVLHALSRGFAIITISSAGR--CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S A C++ + + I L+ PL G S+GG
Sbjct: 205 RTKQCLANGYAILYLSPAAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 264
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLFVHMPKDSYRQRKIGEFLV 578
+ R + + G+F++ D P +L+ + + R G +
Sbjct: 265 GMIQ------RLVADGSIQCNGMFNESATSADPSRKTPASLWTVLSTPGEKARAEGH-VK 317
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
++ GI A + + ++ ++F ++ + NS K+ + G ID NG + D +
Sbjct: 318 AMKGFGIPAAVLVSGKRVITPDYFYNQYASISMENSKKIAASLRSSGLIDANGNIVKDPK 377
Query: 639 RTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W L+ E++A F N +V +EL A+A H+ S KW
Sbjct: 378 QPREWFMKLQKDVRIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLKW 428
>gi|384247558|gb|EIE21044.1| hypothetical protein COCSUDRAFT_48249 [Coccomyxa subellipsoidea
C-169]
Length = 1465
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 126/310 (40%), Gaps = 59/310 (19%)
Query: 426 PDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSA--- 482
P K VLF GC + +S CP+C G+PEE L V A RG+A++ + +
Sbjct: 71 PGDLKGVLFALPGCLQLTTEWGFQSATCPSCHGMPEEMLSVYRAAKRGYALVALGAHLSE 130
Query: 483 ----GRCWTLG---EERLVVREIIR----------WWVERHKLEKLPLVALGASSGGYFV 525
C+ E + + E++R WW LP A G+S GG
Sbjct: 131 AEQNFHCFNTTWPPEAHIELPELVRTLRAVLSARDWW-------HLPRFAFGSSRGGAMT 183
Query: 526 SALAKGLRFSSIALMIA----EGLFDQMDIPED--------YPPTLFVHMPKDSYRQRKI 573
LA F ++ M+ E + + P + +PPT+ + D Q I
Sbjct: 184 LILALRFPFQAVGSMVMGMRPEEVMEAALQPRNLASGAQWAFPPTMLMSARND---QDDI 240
Query: 574 GEFL----VVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDE 629
+ L+++G+ V I + LS +FF R+ G+++ S +FE + G + E
Sbjct: 241 IGLINRTEAHLQSQGVLVQRIVMEPYALSPSFFNQRMKGIDEETSAAIFEELKAVGVVLE 300
Query: 630 NGYMRSDGRRTRWKEALRESNAIFL------------NKNLVHHVQEELNLAFAYHEMTS 677
+ GR WK+A + I L ++ +E L + A HE+T+
Sbjct: 301 DNQSHF-GRYETWKKAEAQLKKILLRHFPADLGAKNGSRGYERAFREMLWASEAIHELTA 359
Query: 678 LPSEQIFKWF 687
+++ +F
Sbjct: 360 EHMDEMLDFF 369
>gi|147784962|emb|CAN70818.1| hypothetical protein VITISV_037122 [Vitis vinifera]
Length = 610
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 116/274 (42%), Gaps = 46/274 (16%)
Query: 127 LYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDH 186
++ ANIP P+++ DD DV+FFI N + +L +CIT+E++ + ++ ++
Sbjct: 36 VFNANIPIGPIQLTDDGDVKFFIGLNCTNG-KLHVALCITVEKRIDNHNQKSICNSYFEY 94
Query: 187 QTPAASRSESNQISVVVPG---MQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAA 243
+ E N S+++ + VE ++ NG P N E G +
Sbjct: 95 HISSEIDKELNGNSMLMHKSRHIHCESVETLTVDGENG-PRFQNESLE-----GYKVHDW 148
Query: 244 NIEVSEQQDDIQRLDLSQDDCGDKLPI---RRSNTRSAAINPFHSYED------------ 288
N+ + ++ R+D + + I R S+ +S I D
Sbjct: 149 NMNETSINEENYRMDTNPTNDKQVTQIGTFRTSSAQSVEILTMIDTNDGFIQDNPTIIEG 208
Query: 289 ------------IVCG---------KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFE 327
IV G + ++SKK+L +KL + A K F FK +KST
Sbjct: 209 VANEGQNIMQQPIVSGISDDHLEEHQIYSSKKELQRKLYMMALKMKFEFKTTKSTTKLLL 268
Query: 328 VVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
V C D CKWR+RATK F+I ++ + H C
Sbjct: 269 VECFDKECKWRVRATKLEIFNMFQIMKYYSTHTC 302
>gi|307108700|gb|EFN56939.1| hypothetical protein CHLNCDRAFT_143478 [Chlorella variabilis]
Length = 436
Score = 75.9 bits (185), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 92/228 (40%), Gaps = 50/228 (21%)
Query: 435 LAHGCNGRAVHFWDRSP-NCPNCIGLPEERLLVLHALSRGFAIITISSAGRCWTLGEERL 493
L HGC A F+ P NCP C GLPE AL+RG+A+
Sbjct: 119 LIHGCGREARSFFPYDPVNCPECTGLPEHVAQAKQALARGYAVAAT-------------- 164
Query: 494 VVREIIRWWVERHKLEKLPLVALGASSGGYFVSAL--------------------AKGLR 533
VV ++ H L LP+ G SSGG L GLR
Sbjct: 165 VVD-----FLSNHSLAGLPVFFHGFSSGGTMSLKLPGYLHDCCGEGREEGEAPGPGAGLR 219
Query: 534 FSSIALMIA--EGLFDQMD------IPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
I + A EG F D +P +PPTLFV M + RQR + + +L++ G+
Sbjct: 220 IDGIVSVDAAPEGGFGAEDEEGRFKLP-SFPPTLFVVMERGKSRQRAPPQ-IEMLQDGGV 277
Query: 586 DVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYM 633
I + FF+DRIP + + S L ++ G +D GY+
Sbjct: 278 PAEMIVSPPRAVGPTFFSDRIPTMSRRQSEALAGALRQIGLLDAQGYL 325
>gi|448932081|gb|AGE55641.1| hypothetical protein ATCVMN08101_414L [Acanthocystis turfacea
Chlorella virus MN0810.1]
Length = 476
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 126/297 (42%), Gaps = 29/297 (9%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L PT+ +G I+Q P + K ++ HGC+ A W S G+PE+
Sbjct: 168 SDAGLKPTIVNIDGRSQIYQSPANAKGLVVFLHGCSRTAFGGWPMSSGS-RFFGMPEDVG 226
Query: 465 LVLHALSRGFAIITIS--SAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGG 522
L+ G+AI+ +S +A C++ + + I L+ PL G S+GG
Sbjct: 227 RTKQCLANGYAILYLSPVAATGCFSAKTDANTAIKSINQVRSSLGLQGKPLYLGGCSAGG 286
Query: 523 YFVSALAKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSYRQRKIGEF 576
+ R + + +G+F++ D PP+L+ + P++ K +
Sbjct: 287 GMIQ------RLVADGAIQCDGMFNESATSADPSRKTPPSLWTVLSTPREKANADKHAD- 339
Query: 577 LVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
LR G A + + ++ ++F ++ + NS K+ + G ID +G + D
Sbjct: 340 --ALRGFGTAAAVLVSGKRVITPDYFYNQYASISMDNSKKIAASLRSSGLIDASGNIVKD 397
Query: 637 GRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
++ R W L+ E++A F N +V +EL A+A H+ S +W
Sbjct: 398 PKQPREWFAKLQRDVKIPETSAKFWNSGIV----QELMTAYAVHDAVSCYMTTFLRW 450
>gi|339484079|ref|YP_004695865.1| cell surface receptor IPT/TIG domain-containing protein
[Nitrosomonas sp. Is79A3]
gi|338806224|gb|AEJ02466.1| cell surface receptor IPT/TIG domain protein [Nitrosomonas sp.
Is79A3]
Length = 637
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 120/293 (40%), Gaps = 51/293 (17%)
Query: 420 DVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITI 479
DV+ PD V+F+ HG G A F+D++ E R + A +RG+AI+ +
Sbjct: 123 DVVTYFPDPHIGVIFMFHGMGGCAACFFDQT----------ETRSFIQDATARGYAIVAL 172
Query: 480 SSAGR---CWTLGEERLVVREIIRWWVERHKL-------EKLPLVALGASSGGYFVSALA 529
+S R W L + ++ R R L P +G SSGG+F S
Sbjct: 173 NSYDRQNKAWNLEMQPTNNPDLQRVAALRQALIGQGSISSADPTYLVGISSGGFFASLFT 232
Query: 530 KG----LRF--SSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNK 583
+ L+F ++AL IA G ++ I PT+FV D+ L + N
Sbjct: 233 QSVQDYLKFPIEAMALYIASGNWNS--ILSASTPTIFVAGVNDT---------LTLYNNV 281
Query: 584 GIDVAEIECMEFPLSRNFFTD---------RIPGLEQANSTKLFELFQEKGFIDENGYMR 634
A++ ++ P T RI GL +S ++ GF+D Y+
Sbjct: 282 QNSYAQLLSIDIPTQLISSTPSQLYPERFWRIEGLSALDSQTIYNAIDNAGFLDGKDYLL 341
Query: 635 SDGRRTRWKEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWF 687
S+ + W+ +L + ++ V +L +AFA H ++ +F +F
Sbjct: 342 SNPTTSGWEASLPA-----MYESYKDQVSNQLKVAFAEHSFMGHLNKPVFDFF 389
>gi|147778960|emb|CAN62542.1| hypothetical protein VITISV_042506 [Vitis vinifera]
Length = 702
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 73 DGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANI 132
DG+W + +Y + G + K I + I+YKEL+ ++ I+K+D ++MK + ANI
Sbjct: 275 DGEWVRDGNVFY-FEGSKGKXIEIXKTISYKELLGVVHHILKLDLTNCFLSMKYEFNANI 333
Query: 133 PTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERK 170
PT P+++ DD DV+FFI N +L P+CI +E++
Sbjct: 334 PTSPIQLTDDGDVKFFIGLNCT-KGKLXXPLCIXVEKR 370
>gi|449463928|ref|XP_004149682.1| PREDICTED: uncharacterized protein LOC101207197 [Cucumis sativus]
Length = 749
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 112/241 (46%), Gaps = 24/241 (9%)
Query: 138 EIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESN 197
E+ +D D++F++ + +P E+ P+ I+ E K +Q ++ D+ + + S N
Sbjct: 18 ELSNDRDLKFYLL--SENPLEV--PLYISFEPKSNQSKKV----LNKDYNSISGSNQAHN 69
Query: 198 ---QISVVVPGMQATEVEEQNLNAFNGDPV----AANFGTEAPLGCGDGILAAN-IEVSE 249
+V+ + EV + + D V +A + + D +E++
Sbjct: 70 LNPHPPIVMDTLNENEVHVREVEVGLCDNVIGTTSAIWESYESYDSKDETFTWEPVEMNS 129
Query: 250 QQDDI-QRLDLSQDDCGDKLPIRRSNT-------RSAAINPFHSYEDIVCGKTFTSKKDL 301
+ DI Q D DC K +R S++ R+ + E+ G+ F SKKDL
Sbjct: 130 ESFDIPQHRDGPTKDCKGKCKVRYSSSSQKLKTDRNDWSEESSTSEEFDVGQIFFSKKDL 189
Query: 302 IKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
+L + A KKNF F V KSTK+ V C D C WRLRA + + F+I+++ +H C
Sbjct: 190 SMRLSVLAMKKNFQFVVKKSTKEVLFVRCIDNKCGWRLRAMRLKDSNIFKIKKYVKVHSC 249
Query: 362 T 362
+
Sbjct: 250 S 250
>gi|147767861|emb|CAN60223.1| hypothetical protein VITISV_027940 [Vitis vinifera]
Length = 802
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ ++ ++++ +G W + + YY + G Q KGI + TY+EL+ + I+KVDT ++ +
Sbjct: 8 VDEMGIMLLNEGDWVQNGNMYY-FEGCQGKGIELKKSTTYEELLKIVCHILKVDTTKHNL 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAH 155
+MK ++ NIP+ P+++ DD DV+FFI N +
Sbjct: 67 SMKYVFNGNIPSTPIQLRDDGDVKFFIHLNCTN 99
>gi|307107114|gb|EFN55358.1| hypothetical protein CHLNCDRAFT_134387 [Chlorella variabilis]
Length = 153
Score = 72.4 bits (176), Expect = 9e-10, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 48/78 (61%)
Query: 398 QKWNSFDSSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCI 457
Q++ + ++ P V G +V++Q+PDSP+A LF+A GC+ A +W SP CP+C+
Sbjct: 74 QRYEAMFPVPRMVPVVTRMMGHEVLYQVPDSPRATLFIAPGCSHAAHDWWPASPACPDCL 133
Query: 458 GLPEERLLVLHALSRGFA 475
GLPEE AL R +A
Sbjct: 134 GLPEELAHTQQALGRRYA 151
>gi|307104309|gb|EFN52563.1| hypothetical protein CHLNCDRAFT_138550 [Chlorella variabilis]
Length = 366
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/316 (23%), Positives = 111/316 (35%), Gaps = 80/316 (25%)
Query: 409 LNPT-VELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVL 467
L PT +L G ++ ++ P P +L + H C W RS CP+C+GLP
Sbjct: 67 LTPTKTQLSTGHELFYESPPRPVGLLLILHKCGRSGSDHWPRSQPCPDCLGLPHSLAKTK 126
Query: 468 HALSRGFAIITISS-------AGRCWTLGEERLVVREIIRWWVERHKL-EKLPLVALGAS 519
ALSRG+A+ +SS GRC+ + E++R +L P+ GAS
Sbjct: 127 QALSRGYAVAAMSSLDRRAGGGGRCFAWAADARAAAEVVRSLPRTLRLPAGAPVYVDGAS 186
Query: 520 SGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVV 579
SGG LA + F+ G ++ P P L
Sbjct: 187 SGGSIALRLASIVDFN--------GAIGEVIAPPGLPEML-------------------- 218
Query: 580 LRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGRR 639
E ME D P + S L + ++ G +D G++ +D
Sbjct: 219 -----------EKME--------ADLFPSISPTLSASLHQQLEQLGLLDAKGFISADPGT 259
Query: 640 TRW--------------------KEALRESNAIFLNKNLVHHVQEELNLAFAYHEMTSLP 679
W K A RE + ++K H+ E + LA+A HE
Sbjct: 260 AAWDTFKSQVASFAALRKLLTGMKAAQREQDVRLVDK----HIWECILLAWARHETVGEH 315
Query: 680 SEQIFKWFLPIGSLDL 695
W G D+
Sbjct: 316 MTAALVWLEGGGRQDI 331
>gi|242072085|ref|XP_002451319.1| hypothetical protein SORBIDRAFT_05g027680 [Sorghum bicolor]
gi|241937162|gb|EES10307.1| hypothetical protein SORBIDRAFT_05g027680 [Sorghum bicolor]
Length = 817
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/336 (23%), Positives = 144/336 (42%), Gaps = 38/336 (11%)
Query: 66 VRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMK 125
V + Y G++ PSG +Y GG+ + V+ D + + D L + VD + +++K
Sbjct: 7 VVAICQYGGEFASGPSGNLIYRGGEAHAVDVTRDSSLEGFKDELSKVFHVDVTD--MSLK 64
Query: 126 SLYKANIPTMPVEIVDDDDVRFFIQ--ENAAHP-------TELRSPICITIERKGS---- 172
N+ T+ + I D D++ + NAAH E RS + + GS
Sbjct: 65 YFLPNNMKTL-ITISCDRDLQRMVDFTANAAHVDVFLISRQENRSIVTQSGATSGSVASG 123
Query: 173 -QCQETEGYHAALDHQTPAASRSESNQISVVVPGMQ---ATEVEEQNLNAFNGDPVAANF 228
+ + + + PAAS + N + V+ ++ + E ++ F F
Sbjct: 124 DKRKRPTSKNKVIRSSKPAAS-APGNAVQVITDSLRQPISVTTENEDNRVFQ-----LEF 177
Query: 229 GTEAPLGCGDGILAANIEVSEQQ-----DDIQRLDLSQDDCGDKLPIRRSNTR---SAAI 280
G + G + ++ +QQ D+I R +S D +P S
Sbjct: 178 GNDIAFTTTAGGASFTSDILDQQKLALVDNIPREAVSLFD-DAFIPYVGSEIMHEPQGPN 236
Query: 281 NPFHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA-DTNCKW 337
NP ++DI+ G+ F + KD +L A K F ++ K+ R V CA + C W
Sbjct: 237 NPIALWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETSRVTVKCAAEGGCTW 296
Query: 338 RLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
RL A++++ ++ F I+R ++ H C+ + +++ T
Sbjct: 297 RLHASESSRNKKFVIKRMTDEHTCSGGSGEGQRRAT 332
>gi|147842177|emb|CAN75858.1| hypothetical protein VITISV_017441 [Vitis vinifera]
Length = 210
Score = 69.3 bits (168), Expect = 7e-09, Method: Composition-based stats.
Identities = 37/109 (33%), Positives = 67/109 (61%), Gaps = 6/109 (5%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+++V ++++Y+G W + + YY + G Q K I + TY+ L+ + I+KVD E+ I
Sbjct: 8 INEVGIMLLYEGDWVQNGNMYY-FEGCQGKSIKLKKTTTYEGLLKIVCHILKVDPIEHNI 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPI--CITIER 169
+ ++ NIP+ P+++ DD DV+FFI+ N + + P+ CITIE+
Sbjct: 67 LINYVFNGNIPSTPIQLRDDGDVKFFIRLNC---IDGKFPVLLCITIEK 112
>gi|307109825|gb|EFN58062.1| hypothetical protein CHLNCDRAFT_142323 [Chlorella variabilis]
Length = 646
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 126/328 (38%), Gaps = 48/328 (14%)
Query: 409 LNPTVELRNGTDVIWQIPD-SPKAVLFLAHGCNGRAVHFWDRSP-NCPNCIGLPEERLLV 466
L P L G +Q+P SP+ + GC A W P C C GLPEE L
Sbjct: 125 LRPQRLLGGGRTHYYQMPQGSPRGTVVSFPGCARAARGHWPYDPVYCKECFGLPEELSLT 184
Query: 467 LHALSRGFAIITISSAGR---CWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGY 523
AL RG+A++ +S + CW+ G + I + L ASS
Sbjct: 185 KQALKRGYALLALSPKDQKHLCWSSGTD-------ISANDQSQARPPASLAQCWASSQTS 237
Query: 524 FVSALAKGLR---FSSIALMIAEG--------------LFDQ----MDIPED-YPPTLFV 561
A + R ++ + ++ G +F + + +P YPPTLFV
Sbjct: 238 CWRATCRACRSVTYAKLDVIKLRGVVSVVSTPSVNTWSVFQRDGRTLRVPPGVYPPTLFV 297
Query: 562 HMPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELF 621
M D + + + +LR + + + E + + +DRIP + S + +
Sbjct: 298 TMADDPWGVSQAPVCVDLLRQNNVPSSYVVSPERIIRPSTLSDRIPVITPQQSVLIVQAL 357
Query: 622 QEKGFIDENG--------YMRSDGRRTRWKEALRE-----SNAIFLNKNL-VHHVQEELN 667
Q+ G +D G Y R + W A+ E +N+ LN NL + +EL
Sbjct: 358 QQVGLLDAEGWVVQDPKDYNRKGSKVYGWTRAVYELLPWMANSHSLNLNLRTSLIFQELT 417
Query: 668 LAFAYHEMTSLPSEQIFKWFLPIGSLDL 695
A A HE + W G+ +L
Sbjct: 418 AADARHESMHDYATACLVWLESGGTANL 445
>gi|384252928|gb|EIE26403.1| hypothetical protein COCSUDRAFT_46074 [Coccomyxa subellipsoidea
C-169]
Length = 917
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 88/206 (42%), Gaps = 45/206 (21%)
Query: 464 LLVLHALSRGFAIITIS-----SAGRCWT----------LGEERLVVREIIR---WWVER 505
L+V A+ RG+A++T+ ++ RCWT + E + +R ++ WW
Sbjct: 2 LMVSSAVQRGYAVVTVGPHLSGASDRCWTKTWPPEASIEIPELKRAIRHVLEEQDWW--- 58
Query: 506 HKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQM-----------DIPED 554
LP LGAS+GG A + A M+ D++ D PE
Sbjct: 59 ----GLPRYLLGASAGGAVALLFASRFPVQAAASMLMGFRPDEVFQEEQLVSRNGDQPEG 114
Query: 555 ------YPPTLFVHMPKDS--YRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRI 606
+PP L + D Y++R I + +L +G+ + +PL FF+DR
Sbjct: 115 QQKSWPFPPVLLLEAQNDQDIYKER-ILRSIQLLNQQGLHAENVTMQPYPLHSTFFSDRS 173
Query: 607 PGLEQANSTKLFELFQEKGFIDENGY 632
P ++ S ++ Q G IDE+GY
Sbjct: 174 PEIDPEQSQAVYGDLQAAGIIDESGY 199
>gi|147857559|emb|CAN78664.1| hypothetical protein VITISV_019818 [Vitis vinifera]
Length = 1433
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 127/327 (38%), Gaps = 45/327 (13%)
Query: 63 MSQVRVLVMY-DGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYV 121
M + +V Y DG+ P+G YS KG++V++ I Y EL D+L ++ +D +
Sbjct: 1 MXSLLTIVCYKDGEIIDGPNGV-CYSCPPKKGVLVNNLIKYDELEDKLCHVMSIDRTHTM 59
Query: 122 ITM-------KSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQC 174
++M S+ NI + + I DDDDVR A + P TIE
Sbjct: 60 LSMIFRYPILMSIGNGNINYIQLPIKDDDDVRLMFHVVA------QIPPSNTIE------ 107
Query: 175 QETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPL 234
L + S S + G E+ + +A D + N +
Sbjct: 108 -------MYLQTRPRDHSSELSPSFDQEIMGHDDVEIPAKGNSAVQIDEMDENLAHNDEM 160
Query: 235 GCGDGILAANIEVSEQQ--DDIQRLDLSQDDCGDKLP------IRRSNTRSAAINP---- 282
G G+ + + E++E D+ +S DD +P + S P
Sbjct: 161 GGGEFGSSNSDEINEMHYDDEPPTNKVSSDDGEHIMPSPMFKQLNWDAINSMTAEPLTPR 220
Query: 283 ---FHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA--DTNCKW 337
++ ++ G F SK+DL +K A +N V +S + V C C W
Sbjct: 221 TGLWNESNELFKGLRFESKEDLQYAVKRYAICRNQHLVVCESEPQLWAVRCKKWQEGCNW 280
Query: 338 RLRATKTAEDEYFEIRRFSNIHICTQP 364
RLRA + FEI +++ H C P
Sbjct: 281 RLRACRRKSHGMFEITKYAGPHTCVYP 307
>gi|448934626|gb|AGE58179.1| hypothetical protein PBCVNW6652_943L [Paramecium bursaria Chlorella
virus NW665.2]
Length = 487
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 127/301 (42%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 177 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 235
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 236 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 295
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG V L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 296 GGGMVQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 348
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 349 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 404
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N +V + + +A+A H+ SL K
Sbjct: 405 VLKDPKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLK 460
Query: 686 W 686
W
Sbjct: 461 W 461
>gi|4038033|gb|AAC97215.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 503
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 117/297 (39%), Gaps = 48/297 (16%)
Query: 94 IIVSDDITYKELVDRLYGIVKVD--TNEYVITMKSLYKAN-IPTMPVEIVDDDDVRFFIQ 150
+++ + ITY L D L ++D TN ++ K + + I + P+ I +D + F+
Sbjct: 16 VLIDEHITYHALFDILVEDFQIDSSTNSLKLSYKLVSTSTTIQSHPLYIRNDRQLATFMH 75
Query: 151 ENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATE 210
+ + L+ +C+T+E+ ++ E P S S VP QA
Sbjct: 76 KFSQSGGLLQ--LCLTVEQISVTVEQAE------TTSNPFVSHS--------VPNAQA-- 117
Query: 211 VEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPI 270
E N F + T P I A E S D Q P
Sbjct: 118 --ETTSNPFASHSAS----TTQPRLTSRIIEACVSETSSILD--------QQYSSSAFPT 163
Query: 271 RRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC 330
S+ S I GK F K +++ L++ A K +F F KS R+ + C
Sbjct: 164 GLSDADSLFI-----------GKFFKDKDEMVFTLRMFAVKHSFEFHTVKSDLTRYVLHC 212
Query: 331 ADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNE 387
D NC WRLRAT+ E + IR++ + H C KN + A R G R I N
Sbjct: 213 IDENCSWRLRATRAGGSESYVIRKYVSHHNC-DSSLKNVSHRQAFARTLG-RLISNH 267
>gi|428182484|gb|EKX51345.1| hypothetical protein GUITHDRAFT_134815 [Guillardia theta CCMP2712]
Length = 341
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 82/185 (44%), Gaps = 59/185 (31%)
Query: 428 SPKAVLFLAHG-----------------CNGRAVHFWDRSPNCPNCIGLPEERLLVLHAL 470
+ +A+LFLAHG C+ A F+D+S +C CIGLP ER +V AL
Sbjct: 114 TSRAILFLAHGGNIEMLRGGSGTDSSAGCSHSATDFFDKSDDCKECIGLPIERKIVSTAL 173
Query: 471 SRGFAIITISSAGRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAK 530
+G +++ GE+ V E V HK + LVALGASSGG V AL
Sbjct: 174 DKGLSVVA----------GEQSRVKTEAAS--VGTHK--TMSLVALGASSGGAIVHAL-- 217
Query: 531 GLRF---SSIALMI------------------AEGLFD-----QMDIPEDYPPTLFVHMP 564
G+ ++IA+ I A +F + + E YPP +F HM
Sbjct: 218 GISSEPPAAIAVQIMPVPPEVGNFIIFVQDETASDVFGCHWYTMLHLTEKYPPAIFWHMV 277
Query: 565 KDSYR 569
+ + R
Sbjct: 278 RRNRR 282
>gi|147855432|emb|CAN81754.1| hypothetical protein VITISV_020921 [Vitis vinifera]
Length = 2116
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 54/84 (64%), Gaps = 2/84 (2%)
Query: 99 DITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTE 158
+I+Y EL+ +Y I+++D E I+MK + NIPT P+++ DD DV+FFI+ N + +
Sbjct: 4 NISYHELMRVVYRILQLDPVECSISMKYAFSGNIPTSPIQLRDDGDVKFFIRLN--YTNK 61
Query: 159 LRSPICITIERKGSQCQETEGYHA 182
L +P+CIT++R+ E+ H
Sbjct: 62 LPAPLCITMDRRSENNAESMFMHG 85
>gi|397632782|gb|EJK70694.1| hypothetical protein THAOC_07926, partial [Thalassiosira oceanica]
Length = 374
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 58/115 (50%), Gaps = 4/115 (3%)
Query: 430 KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGR---CW 486
K ++ L H C A F+ RS C C+GL EE + AL G+ + +SSA R CW
Sbjct: 120 KGLVLLLHACTHSAYKFFARSGRCEECVGLSEEMRIARIALRNGYLPVAVSSADRDTGCW 179
Query: 487 TLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALAKGLRFSSIALMI 541
+ G + VR ++ R + L +VA+GASSGG + LA G +M+
Sbjct: 180 SKGRDAERVRAVLDAQFLR-RYRDLSVVAVGASSGGALAAELALGGLADGALVMV 233
>gi|448927766|gb|AGE51339.1| hypothetical protein PBCVCVG1_909L [Paramecium bursaria Chlorella
virus CVG-1]
Length = 481
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 171 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPPSAN-PRFYGMPEDVS 229
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 230 RTKQSLKAGYAILYVSPENQKTGCFSQNGSDPATTKSVIDQVRNTLRIQNKALYIGGCSA 289
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQMDIPED----YPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 290 GGGMAQRLVAKG-------FIQCDGMFNESATSADPSNKTPASLWTVLSTPKEKADAEKK 342
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 343 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 398
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N +V + + +A+A H+ SL K
Sbjct: 399 VLKDPKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLK 454
Query: 686 W 686
W
Sbjct: 455 W 455
>gi|448928442|gb|AGE52013.1| hypothetical protein PBCVCVM1_941L [Paramecium bursaria Chlorella
virus CVM-1]
Length = 487
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 177 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 235
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 236 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 295
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 296 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 348
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 349 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 404
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N +V + + +A+A H+ SL K
Sbjct: 405 VLKDPKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLK 460
Query: 686 W 686
W
Sbjct: 461 W 461
>gi|448925413|gb|AGE48993.1| hypothetical protein PBCVAP110A_958L [Paramecium bursaria Chlorella
virus AP110A]
Length = 481
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 171 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 229
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 230 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 289
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 290 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 342
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 343 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 398
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N +V + + +A+A H+ SL K
Sbjct: 399 VLKDPKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLK 454
Query: 686 W 686
W
Sbjct: 455 W 455
>gi|155370893|ref|YP_001426427.1| hypothetical protein FR483_N795L [Paramecium bursaria Chlorella
virus FR483]
gi|155124213|gb|ABT16080.1| hypothetical protein FR483_N795L [Paramecium bursaria Chlorella
virus FR483]
Length = 487
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 177 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 235
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 236 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 295
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 296 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 348
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 349 AG----ALRRFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 404
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N +V + + +A+A H+ SL K
Sbjct: 405 VLKDPKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLK 460
Query: 686 W 686
W
Sbjct: 461 W 461
>gi|147778655|emb|CAN71718.1| hypothetical protein VITISV_016812 [Vitis vinifera]
Length = 96
Score = 65.5 bits (158), Expect = 9e-08, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 56/90 (62%), Gaps = 1/90 (1%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++Y G W + + YY + G Q KGI + TY+EL+ + I+KVD E+ +
Sbjct: 8 IDEVGIMLLYKGDWVQNGNTYY-FEGCQGKGIELKKTTTYEELLKIVCHILKVDPTEHNL 66
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQEN 152
+MK ++ NIP+ P+++ D + + FI+ N
Sbjct: 67 SMKYVFNGNIPSTPIQLRRDGNXKIFIRLN 96
>gi|448929454|gb|AGE53022.1| hypothetical protein PBCVCZ2_919L [Paramecium bursaria Chlorella
virus CZ-2]
Length = 457
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 147 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPPSAN-PRFYGMPEDVS 205
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 206 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPDTTKAVINQVRSTLRIQNKALYIGGCSA 265
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 266 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 318
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 319 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 374
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N +V + + +A+A H+ SL K
Sbjct: 375 VLKDPKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLK 430
Query: 686 W 686
W
Sbjct: 431 W 431
>gi|414868070|tpg|DAA46627.1| TPA: hypothetical protein ZEAMMB73_712403 [Zea mays]
gi|414868071|tpg|DAA46628.1| TPA: hypothetical protein ZEAMMB73_712403 [Zea mays]
Length = 821
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/335 (20%), Positives = 135/335 (40%), Gaps = 31/335 (9%)
Query: 66 VRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMK 125
V + Y G++ P+G Y GG+ + V+ D+ + D + + VD + +++K
Sbjct: 7 VVAICQYGGEFTSGPNGNLTYKGGEAHAVDVTRDMPLENFKDEVSKVFHVDVTD--MSLK 64
Query: 126 SLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKG-----SQCQETEGY 180
N T+ + I D D++ + + T++ + +E + G
Sbjct: 65 YFLPNNYRTL-ITISCDRDLQRMVDFTTS-ATQVEVFLISRVENRSIVTYSGASASKPGS 122
Query: 181 HAALDHQT--PAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAAN----------F 228
+A D + P+ ++ + P T V+ N P+A F
Sbjct: 123 NAPGDKRKRPPSKNKVSKSNKKTPTPSAPGTSVQPSANNVKQPRPIAIETNDNRVFQLEF 182
Query: 229 GTEAPLGCGDGILAANIEVSEQQ-----DDIQRLDLSQ-DDCGDKLPIRR--SNTRSAAI 280
G++ G + ++ +QQ D R + DD GD + +
Sbjct: 183 GSDVAFATPAGGASTAPDILDQQNLALVDTAHREPIGHFDDSGDSYVGSEITVDPQHGLD 242
Query: 281 NPFHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWR 338
NP ++DI+ G+ F + KD +L A K FA++ K+ R V C C WR
Sbjct: 243 NPVMFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFAYRFIKNETTRVTVKCVAEGCTWR 302
Query: 339 LRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
+ A++++ ++ F I++ ++ H C + +++ T
Sbjct: 303 MHASESSRNKKFVIKKMTDEHTCGGGNGEGQRRAT 337
>gi|448926436|gb|AGE50013.1| hypothetical protein PBCVCan184_959L [Paramecium bursaria Chlorella
virus Can18-4]
Length = 470
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 126/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 160 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPPSAN-PRFYGMPEDVS 218
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 219 RTKQSLKAGYAILYVSPENQKTGCFSQNGSDPATTKSVIDQVRNTLRIQNKALYIGGCSA 278
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQ----MDIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 279 GGGMAQRLVAKG-------FIQCDGMFNESATTADPSNKTPASLWTVLSTPKEKADAEKK 331
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
+G LR G A + + ++ +FF +++ + NS ++ + ++ G I+ G
Sbjct: 332 VG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKRIADSLRKSGMINAAGD 387
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N + + + +A+A H+ SL K
Sbjct: 388 VLKDPKANRQWYAALGKDVKIPETRLSFWNSG----ITQAMLVAWAVHDACSLYMTAFLK 443
Query: 686 W 686
W
Sbjct: 444 W 444
>gi|147767515|emb|CAN66712.1| hypothetical protein VITISV_041525 [Vitis vinifera]
Length = 518
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 42/69 (60%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ ++SKK+L +KL + A K+ F FK +KST V C D CKWR+RATK F+I
Sbjct: 19 QIYSSKKELQRKLCMMALKRKFEFKTTKSTTKLLLVECFDKECKWRVRATKLGISNMFQI 78
Query: 353 RRFSNIHIC 361
+F + H C
Sbjct: 79 MKFYSTHTC 87
>gi|4585914|gb|AAD25575.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 607
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + GK F K +++ L++ A K NF F KS R+ + C D NC WRLRAT+
Sbjct: 46 DSLFIGKIFKDKDEMVFTLRMFAVKHNFEFHTVKSDLTRYVLHCIDENCSWRLRATRAGG 105
Query: 347 DEYFEIRRFSNIHIC 361
E + IR++ + H C
Sbjct: 106 SESYAIRKYVSHHSC 120
>gi|155122489|gb|ABT14357.1| hypothetical protein MT325_M803L [Paramecium bursaria chlorella
virus MT325]
Length = 500
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 125/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 190 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 248
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 249 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 308
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 309 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 361
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G
Sbjct: 362 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGD 417
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N + + + +A+A H+ SL K
Sbjct: 418 VLKDPKANRQWYAALGKDVKIPETRLSFWNSG----IAQAMLVAWAVHDACSLYMTAFLK 473
Query: 686 W 686
W
Sbjct: 474 W 474
>gi|4585910|gb|AAD25571.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 580
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + GK F K +++ L++ A K NF F KS R+ + C D NC WRLRAT+
Sbjct: 46 DSLFIGKNFKDKDEMVFTLRMFAVKHNFEFHTVKSDLTRYVLHCIDENCSWRLRATRAGG 105
Query: 347 DEYFEIRRFSNIHIC 361
E + IR++ + H C
Sbjct: 106 SESYVIRKYVSHHSC 120
>gi|4646228|gb|AAD26892.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 580
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + GK F K +++ L++ A K NF F KS R+ + C D NC WRLRAT+
Sbjct: 46 DSLFIGKIFKDKDEMVFTLRMFAVKHNFKFHTVKSDLTRYVLHCIDENCSWRLRATRAGG 105
Query: 347 DEYFEIRRFSNIHIC 361
E + IR++ + H C
Sbjct: 106 SESYVIRKYVSHHSC 120
>gi|448932923|gb|AGE56481.1| hypothetical protein PBCVNEJV1_943R [Paramecium bursaria Chlorella
virus NE-JV-1]
Length = 418
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 125/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 108 SDAGLKPILINVGGRQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 166
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 167 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 226
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 227 GGGMAQRLVAKG-------FLQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 279
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +FF +++ + NS ++ + ++ G I+ G
Sbjct: 280 AG----ALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKRIADSLRKSGMINAAGD 335
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N + + + +A+A H+ SL K
Sbjct: 336 VLKDPKANRQWYAALGKDVKIPETRLSFWNSG----IAQAMLVAWAVHDACSLYMTAFLK 391
Query: 686 W 686
W
Sbjct: 392 W 392
>gi|147832845|emb|CAN66129.1| hypothetical protein VITISV_003261 [Vitis vinifera]
Length = 1362
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 127/332 (38%), Gaps = 51/332 (15%)
Query: 63 MSQVRVLVMY-DGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYV 121
M + +V Y DG+ P+G YS KG++V++ I Y EL D+L ++ +D +
Sbjct: 1 MDSLLTIVCYKDGEIIDGPNGV-CYSCPPKKGVLVNNLIKYDELEDKLCHVMSIDRTHTM 59
Query: 122 ITM-------KSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQC 174
++M S+ NI + + I DDDDVR A P + + +
Sbjct: 60 LSMIFRYPILMSIGNGNINYIQLPIKDDDDVRLMFHVVAQIPPSNTIEMYLQTRPRDHSS 119
Query: 175 QETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPL 234
+ + + + + + + G A +++E + N + D + N +
Sbjct: 120 ELSPSFDQEI----------MGHDVEIPAKGNSAMQIDEMDENLAHNDEMGGNLAIVSQS 169
Query: 235 GCGDGILAANIEVSEQQDDIQRLD-----------------LSQDDCGDKLPIRR----- 272
G +I + + DD++ D +S DD +P
Sbjct: 170 VMGATNNYVDIPFTNENDDVEFYDEDEINEMHYDDEPPTNKVSSDDGEYIMPSPMFKQLN 229
Query: 273 ----SNTRSAAINP----FHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKD 324
+N + + P ++ ++ G F SK+DL +K A +N V +S
Sbjct: 230 WDAINNMTAEPLTPRTGLWNESNELFKGLRFESKEDLQYAVKRYAICRNQHLVVCESKPQ 289
Query: 325 RFEVVCA--DTNCKWRLRATKTAEDEYFEIRR 354
+ V C C WRLRA + FEI +
Sbjct: 290 LWAVRCKKWQEGCNWRLRACRRKSHGMFEITK 321
>gi|242034881|ref|XP_002464835.1| hypothetical protein SORBIDRAFT_01g027410 [Sorghum bicolor]
gi|241918689|gb|EER91833.1| hypothetical protein SORBIDRAFT_01g027410 [Sorghum bicolor]
Length = 820
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/339 (20%), Positives = 133/339 (39%), Gaps = 39/339 (11%)
Query: 66 VRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMK 125
V + Y G++ P+G Y GG+ + VS D+ + D + + VD + +++K
Sbjct: 7 VVAICQYGGEFTSGPNGNLTYKGGEAHAVDVSRDMPLESFKDEVSKVFHVDVTD--MSLK 64
Query: 126 SLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALD 185
N T+ D D R +A E+ I R ++ T +A+
Sbjct: 65 YFLPNNYRTLITISCDRDLQRMVDFTISAAQVEV-----FLISRVENRSVVTYSGASAIK 119
Query: 186 ----------HQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAAN-------- 227
+ P+ ++ + P T V+ N P+ +
Sbjct: 120 PGSNAPGDKRKKPPSKNKVSKSNKKTPTPSAPGTSVQPSASNVKQPRPIVTDNDDNRVFQ 179
Query: 228 --FGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQ-------DDCGDKLPIRR--SNTR 276
FG++ G + V +QQ+ + +D + DD GD ++ +
Sbjct: 180 LEFGSDVAFATPAGGASTAPGVLDQQN-LALVDTAPREPIGLFDDSGDPYVGSEITADPQ 238
Query: 277 SAAINPFHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTN 334
NP ++DI+ G+ F + KD +L A K FA++ K+ R V C
Sbjct: 239 HGLDNPVMFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFAYRFIKNETTRVTVKCVAEG 298
Query: 335 CKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
C WRL A++++ ++ F I++ ++ H C + +++ T
Sbjct: 299 CTWRLHASESSRNKKFVIKKMTDEHTCGGGNGEGQRRAT 337
>gi|448935712|gb|AGE59262.1| hypothetical protein PBCVOR070422_910L [Paramecium bursaria
Chlorella virus OR0704.2.2]
Length = 463
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 125/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 153 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPPSAN-PRFYGMPEDVS 211
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 212 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPDTTKAVINQVRSTLRIQNKALYIGGCSA 271
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 272 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 324
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +F+ +++ + NS K+ + ++ G I+ G
Sbjct: 325 AG----ALRKFGKPAAVLVSPKRKITPDFYYNQMASISLENSKKIADSLRKSGMINAAGD 380
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N + + + +A+A H+ SL K
Sbjct: 381 VLKDPKANRQWYAALGKDVKIPETQLSFWNSG----IAQAMLVAWAVHDACSLYMTAFLK 436
Query: 686 W 686
W
Sbjct: 437 W 437
>gi|3319368|gb|AAC28217.1| similar to maize transposon MuDR mudrA protein (GB:AL021710)
[Arabidopsis thaliana]
gi|7267154|emb|CAB77866.1| putative transposon protein [Arabidopsis thaliana]
Length = 580
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 48/94 (51%), Gaps = 1/94 (1%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + GK F K +++ L++ A K +F F KS R+ + C D NC WRLRATK
Sbjct: 46 DSLFIGKKFKDKDEMVFTLRMFAVKHSFEFHTVKSDLTRYVLHCIDENCSWRLRATKAGG 105
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQTALQRPDG 380
E + IR++ + H C +N + A R G
Sbjct: 106 SESYVIRKYVSHHSC-DSSLRNVSHRQAFARTLG 138
>gi|5541695|emb|CAB51200.1| putative protein [Arabidopsis thaliana]
Length = 739
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 62/271 (22%), Positives = 108/271 (39%), Gaps = 46/271 (16%)
Query: 94 IIVSDDITYKELVDRLYGIVKVD--TNEYVITMKSLYKAN-IPTMPVEIVDDDDVRFFIQ 150
+++ + ITY LVD L ++D TN ++ K + + I + P+ I +D + F+
Sbjct: 16 VLIDEHITYHALVDILVEDFQIDSSTNSLKLSYKLVSTSTTIQSHPLYIRNDRQLATFMH 75
Query: 151 ENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATE 210
+ + L+ +C+ +E+ ++ E P S S VP QA
Sbjct: 76 KFSQSGGLLQ--LCLIVEQISVTVEQAETT------SNPFVSHS--------VPNAQAET 119
Query: 211 VEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPI 270
+P A++ + I+ A + + D Q
Sbjct: 120 T---------SNPFASHSASTTQPRLTSRIIEACVSETSSILDQQY-------------- 156
Query: 271 RRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC 330
+ SA + + GK F K +++ L+ A K +F F KS R+ + C
Sbjct: 157 ----SSSAFSTGLSDADSLFIGKIFKDKDEMVFTLRKFAVKHSFKFHTVKSDLTRYVLHC 212
Query: 331 ADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
D NC WRLRAT+ E + IR++ + H C
Sbjct: 213 IDENCSWRLRATRAGGSESYVIRKYVSHHSC 243
>gi|448927439|gb|AGE51013.1| hypothetical protein PBCVCVB1_905L [Paramecium bursaria Chlorella
virus CVB-1]
Length = 481
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/297 (22%), Positives = 120/297 (40%), Gaps = 27/297 (9%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 171 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 229
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 230 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 289
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQMDI---PEDYPPTLFVHMPKDSYRQRKIGEF 576
GG L AKG + +G+F++ P + P + + + +
Sbjct: 290 GGGMAQRLVAKG-------FIQCDGMFNESATSADPSNKTPASLWTVLSTPKEKAEAEKK 342
Query: 577 LVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G + D
Sbjct: 343 AGALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGDVLKD 402
Query: 637 GRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W AL E+ F N +V + + +A+A H+ SL KW
Sbjct: 403 PKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLKW 455
>gi|448929798|gb|AGE53365.1| hypothetical protein PBCVFr5L_913L [Paramecium bursaria Chlorella
virus Fr5L]
Length = 457
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 125/301 (41%), Gaps = 35/301 (11%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 147 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPPSAN-PRFYGMPEDVS 205
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 206 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPDTTKAVINQVRSTLRIQNKALYIGGCSA 265
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQM----DIPEDYPPTLF--VHMPKDSY-RQRK 572
GG L AKG + +G+F++ D P +L+ + PK+ ++K
Sbjct: 266 GGGMAQRLVAKG-------FIQCDGMFNESATTGDPSNKTPASLWTVLSTPKEKADAEKK 318
Query: 573 IGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGY 632
G LR G A + + ++ +F+ +++ + NS K+ + ++ G I+ G
Sbjct: 319 AG----ALRKFGKPAAVLVSPKRKITPDFYYNQMASISLENSKKIADSLRKSGMINAAGD 374
Query: 633 MRSDGRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFK 685
+ D + R W AL E+ F N + + + +A+A H+ SL K
Sbjct: 375 VLKDPKANRQWYAALGKDVKIPETRLSFWNSG----IAQAMLVAWAVHDACSLYMTAFLK 430
Query: 686 W 686
W
Sbjct: 431 W 431
>gi|448927101|gb|AGE50676.1| hypothetical protein PBCVCVA1_959L [Paramecium bursaria Chlorella
virus CVA-1]
gi|448928784|gb|AGE52354.1| hypothetical protein PBCVCVR1_965L [Paramecium bursaria Chlorella
virus CVR-1]
Length = 469
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 67/297 (22%), Positives = 120/297 (40%), Gaps = 27/297 (9%)
Query: 405 SSVQLNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERL 464
S L P + G +Q P PK ++ HGC W S N P G+PE+
Sbjct: 159 SDAGLKPVLINVGGKQHAYQSPKDPKGLVVFLHGCARSIYGGWPASAN-PKFYGMPEDVS 217
Query: 465 LVLHALSRGFAIITISSAGR---CWTL-GEERLVVREIIRWWVERHKLEKLPLVALGASS 520
+L G+AI+ +S + C++ G + + +I +++ L G S+
Sbjct: 218 RTKQSLKAGYAILYVSPENQKTGCFSQNGPDPATTKSVIDQVRNTLRIQNKALYIGGCSA 277
Query: 521 GGYFVSAL-AKGLRFSSIALMIAEGLFDQMDI---PEDYPPTLFVHMPKDSYRQRKIGEF 576
GG L AKG + +G+F++ P + P + + + +
Sbjct: 278 GGGMAQRLVAKG-------FIQCDGMFNESATSADPSNKTPASLWTVLSTPKEKAEAEKK 330
Query: 577 LVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSD 636
LR G A + + ++ +FF +++ + NS K+ + ++ G I+ G + D
Sbjct: 331 AGALRKFGKPAAVLVSPKRKITPDFFYNQMASISLENSKKIADSLRKSGMINAAGDVLKD 390
Query: 637 GRRTR-WKEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKW 686
+ R W AL E+ F N +V + + +A+A H+ SL KW
Sbjct: 391 PKANRQWYAALGKDVKIPETRLSFWNSGIV----QAMLVAWAVHDACSLYMTAFLKW 443
>gi|147800487|emb|CAN68581.1| hypothetical protein VITISV_011863 [Vitis vinifera]
Length = 276
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 41/69 (59%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ ++SKK+L +KL + A K+ F FK +KST + C D CKWR+ ATK F+I
Sbjct: 19 QIYSSKKELQRKLYMMALKRKFEFKTTKSTTKLLLIECFDKECKWRVXATKLGISNMFQI 78
Query: 353 RRFSNIHIC 361
+F + H C
Sbjct: 79 MKFYSTHTC 87
>gi|147845568|emb|CAN78489.1| hypothetical protein VITISV_004932 [Vitis vinifera]
Length = 477
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 27/113 (23%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++Y+G W + + YY + G Q D E+ +
Sbjct: 8 IDEVGIMLLYEGDWVQNGNMYY-FEGYQ-------------------------DPTEHNL 41
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQ 175
+MK ++ NIPT P+++ DD DV+FFI N +L+ P+CITIE+K Q
Sbjct: 42 SMKYVFNGNIPTTPIQLRDDGDVKFFIHLNCT-DGKLQVPLCITIEKKNGNHQ 93
>gi|147815191|emb|CAN74432.1| hypothetical protein VITISV_040275 [Vitis vinifera]
Length = 855
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 129/355 (36%), Gaps = 69/355 (19%)
Query: 64 SQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVIT 123
S VL +G+ P+G YS K I++S+ TY EL + + +D +
Sbjct: 3 SHTTVLCYMNGEIIDRPNGV-CYSCPPRKAILISNIKTYDELETNICHALSIDRTCIKLR 61
Query: 124 MKSLY-----KANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETE 178
M Y +I V I DD DVR NA + RS I + +E
Sbjct: 62 MIFRYPICVANGSINYEQVPIGDDGDVRVMF--NAVTQSSPRSTIEMYLET--------- 110
Query: 179 GYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNL--NAFNGDPVAANFGTEAPLGC 236
+ DHQ+ + S + + V P MQ E L N+ + ++ L
Sbjct: 111 ---SPRDHQSVSISYTRPMHVEVAGPSMQMKEENTSPLGSNSATSPRIVHSYDDNVELKI 167
Query: 237 G-------DGILAA-----NIEVSEQQDDIQRLD-------------------------- 258
G D ++ A +I D++ D
Sbjct: 168 GMNLKTVTDSVMMAAKNFVDIPAGNDDADVKLFDEDEDNEDDEDNVDTDSPTKEVTSLGG 227
Query: 259 -------LSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAK 311
+S++ D L R + T + + D+ G F SK DL +K
Sbjct: 228 EHEVSSLMSKELNWDALNARCAKTLTTCSGLWSESNDLFKGLRFKSKADLQYAVKRYLIH 287
Query: 312 KNFAFKVSKSTKDRFEVVCADTN--CKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
KN V++S + + V C N CKWRLRA + + FEI +++N H C P
Sbjct: 288 KNQQMIVTESEPNLWAVKCRKWNEGCKWRLRACRLKTHDMFEITKYTNSHTCVYP 342
>gi|6910579|gb|AAF31284.1|AC006424_13 MUDRA-like transposase [Arabidopsis thaliana]
Length = 517
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 42/75 (56%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + GK F K +++ L++ A K +F F KS R+ + C D NC WRLRAT+
Sbjct: 46 DSLFIGKFFKDKDEMVFTLRMFAVKHSFEFHTVKSDLTRYVLHCIDENCSWRLRATRAGG 105
Query: 347 DEYFEIRRFSNIHIC 361
E + IR++ + H C
Sbjct: 106 SESYVIRKYVSHHSC 120
>gi|307108593|gb|EFN56833.1| hypothetical protein CHLNCDRAFT_144401 [Chlorella variabilis]
Length = 202
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 37/76 (48%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P+ G +V +++P K +L HGC A FW CP C GLPEE
Sbjct: 116 LTPSTATVEGREVYYEVPQDAKGLLVFFHGCYHNAYDFWPEQEACPECRGLPEEVSHTKQ 175
Query: 469 ALSRGFAIITISSAGR 484
AL+ G+A I+S R
Sbjct: 176 ALAMGYAFAAINSLDR 191
>gi|4585938|gb|AAD25598.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 616
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 42/75 (56%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + GK F K +++ L++ A K +F F KS ++ + C D NC WRLRAT+
Sbjct: 46 DSLFIGKFFKDKDEMVFTLRMSAVKHSFEFHTVKSDLTKYVLHCIDENCSWRLRATRAGG 105
Query: 347 DEYFEIRRFSNIHIC 361
E + IR++ + H C
Sbjct: 106 SESYVIRKYVSHHSC 120
>gi|4309756|gb|AAD15525.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 550
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
ED+ GK + SK D L + A + F FK +++ D F + CAD C WR+ A + E
Sbjct: 328 EDVYKGKVYASKSDCQIGLAIYAIRHMFHFKQTRTKLDSFVLNCADEKCDWRVTAHEMPE 387
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
Y+EIR+ H C+ K KK+
Sbjct: 388 SGYYEIRKAKLDHTCSVETRKMYKKK 413
>gi|8778971|gb|AAF79886.1|AC021198_6 Strong similarity to a mutator-like transposase gi|4063759 from
Arabidopsis thaliana BAC T14A4 gb|AC005561 [Arabidopsis
thaliana]
Length = 962
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+DI G+ F SK+D KL + A F F +KS + VC C WR+ ATK +
Sbjct: 462 DDIYIGRIFKSKEDCATKLAIHAIHWKFHFIYAKSGPNIVAAVCVSDTCPWRVYATKLPD 521
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQTA 374
++FE+R + H C+ + KQ +
Sbjct: 522 SDWFEVRSTTQTHTCSVDARGDFHKQAS 549
>gi|4581175|gb|AAD24658.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 942
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G F +K+ + + ++L A KK F +K ++S R E+VC D NC W L A + F+
Sbjct: 379 GSVFKNKQVIQQTMRLVAIKKCFRYKQTRSCPKRLEMVCVDRNCSWHLTAHVIPNSKCFK 438
Query: 352 IRRFSNIHICTQPPAKNRKKQTALQ 376
I + +IH+C K+ +K +
Sbjct: 439 ITGYDSIHVCKIKTRKDYRKHATYK 463
>gi|195970396|gb|ACG60684.1| maize transposon MuDR-like protein [Brassica oleracea var.
alboglabra]
Length = 622
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 121/318 (38%), Gaps = 46/318 (14%)
Query: 69 LVMYDGKWFCSPSGYYLYSGGQTKG---IIVSDDITYKELVDRLYGIVKVDTNE------ 119
++++ GKW L+ +G + +++ Y++ V +Y +VD E
Sbjct: 4 IILFCGKWKFERKKMILFEVDNDQGSKLLAANEETRYEDFVKTIYEDYEVDFAEHKLELM 63
Query: 120 YVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQET-- 177
YV+ L K N+ T PV++ +D + F+ H E +C+ + K + +E
Sbjct: 64 YVLPKPKLVKENLDTPPVKVRNDRQLHGFL---CLHKVE-NVRLCVEFKLKKKKVEEASK 119
Query: 178 --EGYHAALDHQTPAASRS-ESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPL 234
E + D + S E N +++N ++ P +P
Sbjct: 120 EPEEFLQKDDLLSAEYEDSDEDNDRFDYCDDSDVATSDDENFTSYGFPPDQVQESQGSPT 179
Query: 235 GCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKT 294
A ++ + R DLS KL + G++
Sbjct: 180 KMSS---ATVLKTPKGDCTHNRFDLS----SFKLEV---------------------GQS 211
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRR 354
F SK L +LK+ + F F V KST+ + V C C W+LRAT E F IR
Sbjct: 212 FDSKDALATRLKICSVVHKFNFDVDKSTRTLWFVKCWVKGCTWKLRATPVGESSRFTIRI 271
Query: 355 FSNIHICTQPPAKNRKKQ 372
+ + H C+ +R +Q
Sbjct: 272 YVDEHSCSVTERSSRSRQ 289
>gi|4038062|gb|AAC97243.1| Mutator-like transposase [Arabidopsis thaliana]
gi|20198026|gb|AAM15358.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 942
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 2/120 (1%)
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAF 316
LD D D L R + ++ S + G F +K+ + + ++L A KK F +
Sbjct: 346 LDTDADPVFDDLANVRVDVTEVELST--SGHPLYVGSVFKNKQVIQQTMRLVAIKKCFRY 403
Query: 317 KVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQTALQ 376
K ++S R E+ C D NC W L A + F+I + +IH+C K+ +K +
Sbjct: 404 KQTRSCPKRLEMFCVDRNCSWHLTAHVIPNSKCFKITGYDSIHVCKIDTRKDYRKHATYK 463
>gi|7767661|gb|AAF69158.1|AC007915_10 F27F5.15 [Arabidopsis thaliana]
Length = 1071
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 2/120 (1%)
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAF 316
LD D D L R + ++ S + G F +K+ + + ++L A KK F +
Sbjct: 346 LDTDADPVFDDLANVRVDVTEVELST--SGHPLYVGSVFKNKQVIQQTMRLVAIKKCFRY 403
Query: 317 KVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQTALQ 376
K ++S R E+ C D NC W L A + F+I + +IH+C K+ +K +
Sbjct: 404 KQTRSCPKRLEMFCVDRNCSWHLTAHVIPNSKCFKITGYDSIHVCKIDTRKDYRKHATYK 463
>gi|11994228|dbj|BAB01350.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 811
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/291 (20%), Positives = 116/291 (39%), Gaps = 56/291 (19%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
++ITY LVD++ +K+D + + + P + I DDDDV ++
Sbjct: 40 EEITYSWLVDKICTKIKIDESTMKRKLSYIPTGVKPPTYLYIRDDDDVYVYL--TTLDEE 97
Query: 158 ELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLN 217
+ RS + + + G++ + + ++ Q R S ++ EE+N
Sbjct: 98 KSRSVLHVEV---GNELEMVD-----VNEQQSRVQRKSSVGVNY----------EEENFG 139
Query: 218 AFNGDPVAANFGT-----EAPLGCG--------------------DGILAANIEVSEQQD 252
+ D AN G PLG D + +N+ ++++ D
Sbjct: 140 GLDNDD-RANVGAITLYANEPLGEQLAEPLETLEPLEPLEPLRDVDNGVDSNLSMTKEND 198
Query: 253 DIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKK 312
I+ + + R + S P+ D+V G+ F + D+ ++ GA +
Sbjct: 199 GIR----------EYVEEPRDSQHSKFKGPWDDGLDLVIGQEFENSDDVKYLVETGANRN 248
Query: 313 NFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
F + S K R+ V CA+ C+W +R +K + F +R + +H C +
Sbjct: 249 CFGLTLLNSNKKRYVVKCAEEGCEWYIRFSKAKDSTRFSVRTYRKLHTCVR 299
>gi|357127369|ref|XP_003565354.1| PREDICTED: uncharacterized protein LOC100825703 [Brachypodium
distachyon]
Length = 828
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/340 (22%), Positives = 142/340 (41%), Gaps = 43/340 (12%)
Query: 66 VRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMK 125
V + Y G++ P+G +Y GG+ + VS +++ D + + VD + ++ K
Sbjct: 22 VVAICQYGGEFTSGPNGL-IYKGGEAHAVDVSREMSLDSFKDEVSKVFHVDVTD--MSFK 78
Query: 126 SLYKANIPTMPVEIVDDDDVRFFIQENAAHP---------TELRSPICIT---IERKGSQ 173
N T+ + I D D++ ++ A+ E RS + T + GS
Sbjct: 79 YFLPNNNKTL-ITISCDRDLQRMVEFTASAAQADVFVISRVENRSMVTYTGASTVKAGSS 137
Query: 174 CQ-------ETEGYHAALDHQTPAASRS--ESNQISVVVPGMQATEVEEQNLNAFNGDPV 224
+ +++ + +TP AS + +++ + V P T LN +N D
Sbjct: 138 ARSDKRKRPDSKNKASKSSKKTPDASGAAVQASTVDVNEPRPVLT------LNDYNED-F 190
Query: 225 AANFGTEAPLGCGDGILAANIEVSEQQ-----DDIQRLDLSQDDCG----DKLPIRRSNT 275
FG EA G +++ + Q+ D+ QR + D D I
Sbjct: 191 PLEFGREAAFDTAAGAISSAPDTLNQEKLALVDNTQREPIGFFDDAINPYDGSEIIIDPP 250
Query: 276 RSAAINPFHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADT 333
+ NP ++DI+ G+ F + KD +L A K F ++ K+ R V C
Sbjct: 251 QELNNNPTVFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETTRVTVKCVGD 310
Query: 334 NCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
C WRL A++++ ++ F I++ ++ H C + +++ T
Sbjct: 311 GCTWRLHASESSRNKKFVIKKMTDEHTCGGGGGEGQRRAT 350
>gi|4587619|gb|AAD25847.1| Mutator-like transposase [Arabidopsis thaliana]
gi|20198206|gb|AAM15459.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 764
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 64/128 (50%), Gaps = 12/128 (9%)
Query: 241 LAANIEVSEQQDDIQRLDLSQD--DCGDKLPIRRSNTRSAAINPFHSYED---IVCGKTF 295
L A +E Q+D R D S D D G++ A+ P +ED + + F
Sbjct: 161 LEAIVEDHGTQEDKTRYDESMDNSDRGEQYV-----ESPPAVEP--EWEDGIGLTLRQEF 213
Query: 296 TSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRF 355
+K L + + A +F + + KS K+R+ + CA +C WRLRA+ + + F IRR+
Sbjct: 214 PNKAALHEVVDRAAFANSFGYVIKKSDKERYVLKCAKESCSWRLRASNISNTDIFSIRRY 273
Query: 356 SNIHICTQ 363
+ +H CT+
Sbjct: 274 NKMHSCTR 281
>gi|18542933|gb|AAK00423.2| Putative maize transposon MuDR mudrA-like protein [Oryza sativa
Japonica Group]
gi|125573769|gb|EAZ15053.1| hypothetical protein OsJ_30463 [Oryza sativa Japonica Group]
Length = 937
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/333 (19%), Positives = 136/333 (40%), Gaps = 26/333 (7%)
Query: 64 SQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVIT 123
S V + Y G++ P+G +Y GG+ + V+ +++ D + + V+ ++ ++
Sbjct: 124 SLVVAICQYGGEFTSGPNGNLIYKGGEAHAVDVTREMSLDNFKDEVSKVFHVEVSD--VS 181
Query: 124 MKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETE---GY 180
+K N T+ + I D D++ + A+ I R +Q + G
Sbjct: 182 LKYFLPNNNRTL-ITISCDRDLQRMVDFTASSAQVDVFLISRVENRSITQTGASTAKPGS 240
Query: 181 HAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAA----------NFGT 230
+A D + S++++++ P T V+ N V FG+
Sbjct: 241 NARGDKRKTPTSKNKASKNKKKTPSATGTAVQANANNVKQPRQVVTENDDNRVFPLEFGS 300
Query: 231 EAPLGCGDGILAANIEVSEQQ-----DDIQR--LDLSQDDCGDKLPIRRSNTRSAAI-NP 282
+ G + ++ QQ D+ R + L D + + + + NP
Sbjct: 301 DIAFANTAGAGSTAPDILNQQKLALVDNTARESVGLFDDSVNPYVGSEITTEPTQGLNNP 360
Query: 283 FHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLR 340
++DI+ G+ F + KD +L A K F ++ K+ R V C C WRL
Sbjct: 361 IVFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETTRVTVKCVGEGCTWRLH 420
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
A++++ ++ F I++ ++ H C + +++ T
Sbjct: 421 ASESSRNKKFVIKKMTDEHTCGGGSGEGQRRAT 453
>gi|357134769|ref|XP_003568988.1| PREDICTED: uncharacterized protein LOC100842134 [Brachypodium
distachyon]
Length = 819
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/334 (21%), Positives = 131/334 (39%), Gaps = 30/334 (8%)
Query: 66 VRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMK 125
V + Y G++ P+G +Y GG+ + VS +++ D + + VD + ++ K
Sbjct: 7 VVAICQYGGEFTSGPNGNLIYKGGEAHAVDVSREMSLDSFKDEVSKVFHVDVTD--MSFK 64
Query: 126 SLYKANIPTMPVEIVDDDDVRFFIQENAAHP---------TELRSPICITIERKGSQCQE 176
N T+ + I D D++ + A+ E RS + +
Sbjct: 65 YFLPNNNRTL-ITISCDKDLQRMVDFTASAAQADVFVISRVENRSIVTYSGALTVKPGSS 123
Query: 177 TEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQ------NLNAFNGDPVAANFGT 230
G AS+S +N + +QA + LN +NGD FG
Sbjct: 124 ANGDKRKRPSPKKKASKSNTNTPTATATAVQANTNDVNQPRPVVTLNDYNGD-FQLEFGH 182
Query: 231 EAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCG---------DKLPIRRSNTRSAAIN 281
+ L +++ +V Q+ + +D G D I + N
Sbjct: 183 DVALATTAEAVSSAPDVLNQEKLALVDNTPRDPVGLFDDAINAYDGSEIIIDPPQEFNDN 242
Query: 282 PFHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRL 339
P S++DI+ G+ F + KD +L A K F ++ K+ R V C C WRL
Sbjct: 243 PTVSWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETTRVTVKCVGDGCTWRL 302
Query: 340 RATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
A++++ ++ F I++ + H C + +++ T
Sbjct: 303 HASESSRNKKFVIKKMTGEHTCGGGDGEGQRRAT 336
>gi|115480856|ref|NP_001064021.1| Os10g0104700 [Oryza sativa Japonica Group]
gi|113638630|dbj|BAF25935.1| Os10g0104700, partial [Oryza sativa Japonica Group]
Length = 826
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/333 (19%), Positives = 136/333 (40%), Gaps = 26/333 (7%)
Query: 64 SQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVIT 123
S V + Y G++ P+G +Y GG+ + V+ +++ D + + V+ ++ ++
Sbjct: 13 SLVVAICQYGGEFTSGPNGNLIYKGGEAHAVDVTREMSLDNFKDEVSKVFHVEVSD--VS 70
Query: 124 MKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETE---GY 180
+K N T+ + I D D++ + A+ I R +Q + G
Sbjct: 71 LKYFLPNNNRTL-ITISCDRDLQRMVDFTASSAQVDVFLISRVENRSITQTGASTAKPGS 129
Query: 181 HAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAA----------NFGT 230
+A D + S++++++ P T V+ N V FG+
Sbjct: 130 NARGDKRKTPTSKNKASKNKKKTPSATGTAVQANANNVKQPRQVVTENDDNRVFPLEFGS 189
Query: 231 EAPLGCGDGILAANIEVSEQQ-----DDIQR--LDLSQDDCGDKLPIRRSNTRSAAI-NP 282
+ G + ++ QQ D+ R + L D + + + + NP
Sbjct: 190 DIAFANTAGAGSTAPDILNQQKLALVDNTARESVGLFDDSVNPYVGSEITTEPTQGLNNP 249
Query: 283 FHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLR 340
++DI+ G+ F + KD +L A K F ++ K+ R V C C WRL
Sbjct: 250 IVFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETTRVTVKCVGEGCTWRLH 309
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
A++++ ++ F I++ ++ H C + +++ T
Sbjct: 310 ASESSRNKKFVIKKMTDEHTCGGGSGEGQRRAT 342
>gi|8885626|dbj|BAA97556.1| mutator-like transposase [Arabidopsis thaliana]
Length = 893
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
E + G F ++K L + + L A K+ F FK KS ++VC D C W+L A +
Sbjct: 308 ETLFVGSVFKNRKVLQQTMSLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWQLTARVVKD 367
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
E F+I ++ H C KN K
Sbjct: 368 SESFKITSYATTHTCNIDSRKNYNKH 393
>gi|78707609|gb|ABB46584.1| transposon protein, putative, Mutator sub-class, expressed [Oryza
sativa Japonica Group]
Length = 818
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/333 (19%), Positives = 136/333 (40%), Gaps = 26/333 (7%)
Query: 64 SQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVIT 123
S V + Y G++ P+G +Y GG+ + V+ +++ D + + V+ ++ ++
Sbjct: 5 SLVVAICQYGGEFTSGPNGNLIYKGGEAHAVDVTREMSLDNFKDEVSKVFHVEVSD--VS 62
Query: 124 MKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETE---GY 180
+K N T+ + I D D++ + A+ I R +Q + G
Sbjct: 63 LKYFLPNNNRTL-ITISCDRDLQRMVDFTASSAQVDVFLISRVENRSITQTGASTAKPGS 121
Query: 181 HAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAA----------NFGT 230
+A D + S++++++ P T V+ N V FG+
Sbjct: 122 NARGDKRKTPTSKNKASKNKKKTPSATGTAVQANANNVKQPRQVVTENDDNRVFPLEFGS 181
Query: 231 EAPLGCGDGILAANIEVSEQQ-----DDIQR--LDLSQDDCGDKLPIRRSNTRSAAI-NP 282
+ G + ++ QQ D+ R + L D + + + + NP
Sbjct: 182 DIAFANTAGAGSTAPDILNQQKLALVDNTARESVGLFDDSVNPYVGSEITTEPTQGLNNP 241
Query: 283 FHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLR 340
++DI+ G+ F + KD +L A K F ++ K+ R V C C WRL
Sbjct: 242 IVFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETTRVTVKCVGEGCTWRLH 301
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
A++++ ++ F I++ ++ H C + +++ T
Sbjct: 302 ASESSRNKKFVIKKMTDEHTCGGGSGEGQRRAT 334
>gi|3252817|gb|AAC24187.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 895
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
E + G F ++K L + + L A K+ F FK KS ++VC D C W+L A +
Sbjct: 300 ETLFVGSVFKNRKVLQQTMSLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWQLTARVVKD 359
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
E F+I ++ H C KN K
Sbjct: 360 SESFKITSYATTHTCNIDSRKNYNKH 385
>gi|3805769|gb|AAC69125.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 590
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
++I G+ F +K+D KL + A ++ F F +KS + VC C WR+ ATK +
Sbjct: 51 DNIFIGRLFKNKEDCATKLAIHAIRRKFNFITAKSCPNIVLAVCVSHTCPWRVYATKLED 110
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQTA 374
E FEI+ + H C+ + KQ +
Sbjct: 111 SERFEIKCATQQHTCSVDARGDFHKQAS 138
>gi|326515038|dbj|BAJ99880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 881
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 71/335 (21%), Positives = 136/335 (40%), Gaps = 32/335 (9%)
Query: 66 VRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMK 125
V + + G++ P+G +Y GG+ + VS ++ D + + VD + ++ K
Sbjct: 71 VVAICQHGGEFTSGPNGNLVYKGGEAHAVDVSREMPLDSFKDEVSKVFHVDVTD--MSFK 128
Query: 126 SLYKANIPTMPVEIVDDDDVRFFIQENAAHPT---------ELRSPICIT---IERKGSQ 173
N T+ + I D D++ + A+ E RS + T + GS
Sbjct: 129 YFLPNNNRTL-ITISCDRDLQRMVDFTASAAQVDVFVISTGENRSVVTYTGPSTVKAGSN 187
Query: 174 CQ-----------ETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGD 222
Q + + + T AA ++ ++ ++ P + E F D
Sbjct: 188 AQGDKRKRLPSKNKASKSNKKTPNATGAAVQANTHDVNQPRPAATINDYNEDFQLEFGHD 247
Query: 223 PVAANFGTEAPLGCGDGILAANIEV--SEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAI 280
VA TEA D + + + + Q++ I D S + D I +
Sbjct: 248 -VAFVTTTEAVSSAPDVMSQEKLALVDTTQRELIGPFDDSIN-AYDGSEIIIDPPQELTN 305
Query: 281 NPFHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWR 338
NP ++DI+ G+ F + KD +L + K FA++ K+ R V C C WR
Sbjct: 306 NPTMFWDDIIKGVGQEFDNVKDFRAQLCKYSISKGFAYRFIKNETTRVTVKCVGDGCTWR 365
Query: 339 LRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
L A++++ + F I++ +++H C + +++ T
Sbjct: 366 LHASESSRSKKFVIKKMTDVHTCGGEGGEGQRRAT 400
>gi|147779611|emb|CAN61155.1| hypothetical protein VITISV_011233 [Vitis vinifera]
Length = 237
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 40/67 (59%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ ++SKK+L +KL + A K+ F FK +KST V D CKWR+RATK F+I
Sbjct: 19 QIYSSKKELQRKLYMMALKRKFEFKTTKSTTKLLLVEFFDKECKWRVRATKLGISNMFQI 78
Query: 353 RRFSNIH 359
++ + H
Sbjct: 79 MKYYSTH 85
>gi|125530899|gb|EAY77464.1| hypothetical protein OsI_32508 [Oryza sativa Indica Group]
Length = 818
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 66/333 (19%), Positives = 135/333 (40%), Gaps = 26/333 (7%)
Query: 64 SQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVIT 123
S V + Y G++ P+G +Y GG+ + V+ +++ D + + V+ + ++
Sbjct: 5 SLVVAICQYGGEFTSGPNGNLIYKGGEAHAVDVTREMSLDNFKDEVSKVFHVEVAD--VS 62
Query: 124 MKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETE---GY 180
+K N T+ + I D D++ + A+ I R +Q + G
Sbjct: 63 LKYFLPNNNRTL-ITISCDRDLQRMVDFTASSAQVDVFLITRVENRSITQTGASTAKPGS 121
Query: 181 HAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAA----------NFGT 230
+A D + S++++++ P T V+ N V FG+
Sbjct: 122 NARGDKRKTPTSKNKASKNKKKTPSATGTAVQANANNVKQPRQVVTENDDNRVFPLEFGS 181
Query: 231 EAPLGCGDGILAANIEVSEQQ-----DDIQR--LDLSQDDCGDKLPIRRSNTRSAAI-NP 282
+ G + ++ QQ D+ R + L D + + + + NP
Sbjct: 182 DIAFANTAGAGSTAPDILNQQKLALVDNTARESVGLFDDSVNPYVGSEITTEPTQGLNNP 241
Query: 283 FHSYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLR 340
++DI+ G+ F + KD +L A K F ++ K+ R V C C WRL
Sbjct: 242 IVFWDDIIKGVGQEFDNVKDFRAQLCKYAIGKGFVYRFIKNETTRVTVKCVGEGCTWRLH 301
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
A++++ ++ F I++ ++ H C + +++ T
Sbjct: 302 ASESSRNKKFVIKKMTDEHTCGGGSGEGQRRAT 334
>gi|4325366|gb|AAD17362.1| similar to maize transposon MuDR-like proteins [Arabidopsis
thaliana]
gi|7267531|emb|CAB78013.1| putative transposon protein [Arabidopsis thaliana]
Length = 1028
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 40/86 (46%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
E + G F ++K L + + L A K+ F FK KS ++VC D C W L A +
Sbjct: 307 ETLFVGSVFKNRKVLQQTMYLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWHLTARIVKD 366
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
E F+I ++ H C KN K
Sbjct: 367 SESFKITSYATTHTCNIDSRKNYNKH 392
>gi|448930972|gb|AGE54535.1| hypothetical protein PBCVKS1B_197L, partial [Paramecium bursaria
Chlorella virus KS1B]
Length = 461
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 127/299 (42%), Gaps = 20/299 (6%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P + G ++Q P++PK + + C W RS + P G+PE+
Sbjct: 155 LKPVLAEFLGRLCVYQTPENPKGLFVFLNACTKSIYGGWPRSSD-PRFTGMPEDVSRTKQ 213
Query: 469 ALSRGFAIITISSA---GRCWTLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFV 525
L G+AI+ IS+A +C++ G + + II + L PL GASSGG
Sbjct: 214 TLKAGYAILYISAADEVSKCFSWGPDGNTIVGIIEIVRNQLGLINKPLYLGGASSGGGLA 273
Query: 526 SALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQRKIGEFLVVLRNKGI 585
L R + MI E + + E P ++++ M Q+ E + LR+ G
Sbjct: 274 QRLVAKGRI-KVDGMINE-VATTAEPTEKTPNSVWITMASPD-EQKVAEERVTALRSFGK 330
Query: 586 DVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR---RTRW 642
+ + + ++F +IP + + NS + + +E + G + + + RW
Sbjct: 331 KGEMLVSGKRNVLNDYFYYQIPTISKENSKIIVDTMKEFNMLSSTGEIIVNPKPDINQRW 390
Query: 643 KEALR------ESNAIFLNKNLVHHVQEELNLAFAYHEMTSLPSEQIFKWFLPIGSLDL 695
E LR E + FL+ V + + +A+A HE T++ + W G++ +
Sbjct: 391 FEVLRTKLDIPEISGPFLD----TPVAQSIMVAWAVHEPTAMYTTVCLNWLESNGAIPI 445
>gi|20197381|gb|AAM15051.1| hypothetical protein [Arabidopsis thaliana]
gi|20197514|gb|AAM15104.1| hypothetical protein [Arabidopsis thaliana]
Length = 382
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 46/94 (48%)
Query: 281 NPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLR 340
+P ++I G+ F +K+D KL + A ++ F F +KS + C C WR+
Sbjct: 26 DPGEEGDNIFIGRLFKNKEDCATKLAIHAIRRKFHFIYAKSYPNILLAACVSHTCPWRIY 85
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQTA 374
ATK + E FEI+ + H C+ + KQ +
Sbjct: 86 ATKMEDSERFEIKCATQQHTCSVDARGDFHKQAS 119
>gi|3242713|gb|AAC23765.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 784
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 42/71 (59%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ F +K L + + A +F + + KS K+R+ + CA +C WRLRA+ + + F I
Sbjct: 203 QEFPNKAALHEVVDRAAFANSFGYVIKKSDKERYVLKCAKESCSWRLRASNISTTDIFSI 262
Query: 353 RRFSNIHICTQ 363
RR++ +H CT+
Sbjct: 263 RRYNKMHSCTR 273
>gi|4581167|gb|AAD24650.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 569
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
E + G F ++K L + + L A K+ F FK KS ++VC D C W+L A +
Sbjct: 343 ETLFVGCVFKNRKVLQQTMSLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWQLTARVVKD 402
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
E F+I ++ H C KN K
Sbjct: 403 SESFKITSYATTHTCNIDSWKNYNKH 428
>gi|4309749|gb|AAD15518.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 633
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 110/259 (42%), Gaps = 31/259 (11%)
Query: 94 IIVSDDITYKELVDRLYGIVKVD--TNEYVITMKSLYKAN-IPTMPVEIVDDDDVRFFIQ 150
+++ + ITY LVD L ++D TN ++ K + + I + P+ +D + F+
Sbjct: 16 VLIDEHITYHALVDILVEDFQIDSSTNSLKLSYKLVSTSTTIQSHPLYFRNDRQLATFMH 75
Query: 151 ENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQ-ISVVVPGMQAT 209
+ + L+ +C+T+E+ ++ E + SN +S VP QA
Sbjct: 76 KFSQSGGLLQ--LCLTVEQISVTVEQAE---------------TTSNPFVSHFVPTAQA- 117
Query: 210 EVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLP 269
E N F F T+A + + + ++ + + +++ + L
Sbjct: 118 ---ETTSNPFASHSA---FTTQAE-TTSNPFASHSASTTQPRLTSRIIEVCVSETSSILD 170
Query: 270 IRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVV 329
+ S+ SA + + GK F K ++ L++ A K NF F KS R+ +
Sbjct: 171 QQYSS--SAFSTGLSDADSLFIGKIFKDKDKMVFTLRMFAVKHNFEFHTVKSDLTRYVLH 228
Query: 330 CADTNCKWRLRATKTAEDE 348
C D NC WRLRAT+ E
Sbjct: 229 CIDENCIWRLRATRAGGSE 247
>gi|5732431|gb|AAD49099.1|AF177535_3 contains similarity to maize transposon MuDR (GB:M76978)
[Arabidopsis thaliana]
Length = 664
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ F +K L +LK+ + F F VSKST + V C C+WR+RAT E
Sbjct: 233 LAVGQCFETKDHLETRLKILTVLQKFDFDVSKSTPTLYFVKCWIKGCRWRVRATPVDEYP 292
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQ 372
F +R + + H C+ +R +Q
Sbjct: 293 KFHVRVYVSEHTCSVTKRSSRSRQ 316
>gi|8778244|gb|AAF79253.1|AC023279_2 F12K21.4 [Arabidopsis thaliana]
Length = 857
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 77/196 (39%), Gaps = 55/196 (28%)
Query: 185 DHQTPAASRSESNQISVVVPGMQATEVEEQN--------------LNAFNGDPVAANFGT 230
D A N +S + + T+++ QN L+ F+ DP+ FG
Sbjct: 222 DSSRAAVGTPHGNHVSPIGNAVAETDLQRQNILFWGRAQEALNTILSDFSDDPIL--FGR 279
Query: 231 EAPL----GCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSY 286
+AP G G+G+ +A +V + GDKL +
Sbjct: 280 DAPPVFNDGKGEGVDSAFFDVKYE--------------GDKLFV---------------- 309
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
G+ F SK D K+ + A + F F+ ++ST + C C WR+ A+K
Sbjct: 310 -----GRVFKSKSDCKIKIAIHAINRKFHFRTARSTPKFMVLKCISKTCPWRVYASKVDT 364
Query: 347 DEYFEIRRFSNIHICT 362
+ F++R+ + H CT
Sbjct: 365 SDSFQVRQANQRHTCT 380
>gi|10177931|dbj|BAB11196.1| mutator-like transposase [Arabidopsis thaliana]
Length = 797
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ F +K L +LK+ + F F VSKST + V C C+WR+RAT E
Sbjct: 233 LAVGQCFETKDHLETRLKILTVLEKFDFDVSKSTPTLYFVKCWIKGCRWRVRATPVDEYP 292
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQ 372
F++R + + H C+ +R +Q
Sbjct: 293 KFQVRVYVSEHTCSVTKRSSRSRQ 316
>gi|12320742|gb|AAG50520.1|AC084221_2 mutator-like transposase, putative [Arabidopsis thaliana]
Length = 1127
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 75/198 (37%), Gaps = 23/198 (11%)
Query: 182 AALDHQTPAASRSESNQISV-------VVPGMQATEVEEQNLNAFNGDPVAANFGTEAPL 234
+ L + T + +SN I + V+ G+ A+ QN N A E P
Sbjct: 279 SQLGYGTRCEGQRDSNLICIDEVTGDEVIEGIGASSTGTQNDNGVVTAAEAYQVYNEFPK 338
Query: 235 GCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKT 294
G + +V+ DD+ D+ + S T E + G
Sbjct: 339 LHGGELANLGNDVAPVFDDLLNFR------ADETQVEISTTG----------ETLFVGSV 382
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRR 354
F ++K L + + L A K+ F FK KS ++VC D C W+L A + E F+I
Sbjct: 383 FKNRKVLQQTMSLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWQLTARVVKDSESFKITS 442
Query: 355 FSNIHICTQPPAKNRKKQ 372
++ H C K K
Sbjct: 443 YATTHTCNIDSRKYYNKH 460
>gi|3047071|gb|AAC13585.1| similar to maize transposon MuDR (GB:M76978) [Arabidopsis thaliana]
Length = 882
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 39/86 (45%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
E + G F ++K L + + L K+ F FK KS ++VC D C W L A +
Sbjct: 306 ETLFVGSVFKNRKVLQQTMSLQPIKQCFCFKQPKSCPKTLKMVCVDETCPWHLTACVVKD 365
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
E F+I ++ H C KN K
Sbjct: 366 SESFKITSYATTHTCNIDSRKNYNKH 391
>gi|7267514|emb|CAB77997.1| putative protein [Arabidopsis thaliana]
gi|7321061|emb|CAB82108.1| putative protein [Arabidopsis thaliana]
Length = 848
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 41/69 (59%)
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRR 354
F S+ L + + GA +F + + KS K+R+ + C+ NC WR++A+ + + IRR
Sbjct: 199 FPSRAALQEVVDKGAFANSFDYVIKKSDKERYVLTCSKANCDWRIQASCVQDTSIYSIRR 258
Query: 355 FSNIHICTQ 363
++ +H CT+
Sbjct: 259 YNKMHSCTR 267
>gi|384252930|gb|EIE26405.1| hypothetical protein COCSUDRAFT_58942 [Coccomyxa subellipsoidea
C-169]
Length = 304
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 96/204 (47%), Gaps = 30/204 (14%)
Query: 458 GLPEERLLVLHALSRGFAIITIS-----SAGRCWTLG---EERLVVREI---IRWWVERH 506
GL EE +V A+ RG+A++++ ++ RCWT E + + E+ IR +E
Sbjct: 40 GLAEEIAIVSTAVQRGYAVVSVGPHLSGASDRCWTKTWPPEASIEIPELKRAIRHVLEEQ 99
Query: 507 KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIA----EGLFDQ-------MDIPED- 554
K LP LG+S+GG A ++A ++ + LF++ +D P
Sbjct: 100 KWWGLPRYLLGSSAGGAAALLFATRFPVQAVASLVMGFRPDELFEEKQLVSRNVDQPGGN 159
Query: 555 ----YPPTLFVHM--PKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPG 608
+PP F+ +D Y++R I + L+ +G+ + + L+ F++R+P
Sbjct: 160 ASWPFPPVFFLEARNDQDIYKER-IHRSIYFLKQQGLHAEHVMMEPYALTPATFSERVPD 218
Query: 609 LEQANSTKLFELFQEKGFIDENGY 632
+ A S+ L+E Q G ++ G+
Sbjct: 219 ITFALSSTLYEELQAAGIVNATGF 242
>gi|147805683|emb|CAN73907.1| hypothetical protein VITISV_029159 [Vitis vinifera]
Length = 359
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 5/74 (6%)
Query: 112 IVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKG 171
I+K D E+ ++MK ++ NIP+ P+++ DD DV+ FI+ N + +L P+CITIE++
Sbjct: 16 ILKADPTEHNLSMKYVFNGNIPSTPIQLRDDGDVKNFIRLNCTN-GKLSVPLCITIEKRS 74
Query: 172 SQCQETEGYHAALD 185
GY + ++
Sbjct: 75 GN----HGYESIIN 84
>gi|5732067|gb|AAD48966.1|AF147263_8 contains similarity to Medicago truncatula N7 protein (GB:Y17613)
[Arabidopsis thaliana]
gi|7267315|emb|CAB81097.1| AT4g05550 [Arabidopsis thaliana]
Length = 500
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 40/73 (54%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
I G+ + SK+D L + A K+ F FK S++ + F + CAD NC+WR+ + +
Sbjct: 274 IYKGRVYASKQDCQISLAIHAIKEQFYFKQSRTKRHYFVITCADENCQWRIMSHEMKTCG 333
Query: 349 YFEIRRFSNIHIC 361
Y+ IR+ H C
Sbjct: 334 YYLIRKADLEHTC 346
>gi|125536589|gb|EAY83077.1| hypothetical protein OsI_38297 [Oryza sativa Indica Group]
Length = 605
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 54/119 (45%), Gaps = 10/119 (8%)
Query: 246 EVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYED--IVCGKTFTSKKDLIK 303
EV +D + L +D+ D +P P Y++ I + F K +K
Sbjct: 409 EVEIGEDPTEHLLEREDEITDSVPCEPC--------PVVDYQNPAIKINEVFPDKISFLK 460
Query: 304 KLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICT 362
L+ A K F F + S K +F C D +CKWR+ A++ ++ F+I+ + H CT
Sbjct: 461 ALRHFAILKQFEFGIDYSDKTKFGASCCDPDCKWRIHASRLQDEHSFQIKTVGSDHECT 519
>gi|147782711|emb|CAN77300.1| hypothetical protein VITISV_042537 [Vitis vinifera]
Length = 1435
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/352 (20%), Positives = 130/352 (36%), Gaps = 55/352 (15%)
Query: 63 MSQVRVLVMY-DGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYV 121
M + +V Y DG+ P+G Y+ KG++V++ I Y EL D+L ++ +D +
Sbjct: 377 MDSLLTIVCYKDGEIIHGPNGV-CYNCPPKKGVLVNNLIKYDELEDKLCHVMLIDRAHTM 435
Query: 122 ITMKSLY-------KANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQC 174
++M Y NI + + I D DDVR A P + + +
Sbjct: 436 LSMIFRYPILMPIGNGNINYIQLPIKDVDDVRLMFHVVAQIPPSNTIEMYLQTRPRDHSS 495
Query: 175 QETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFG--TEA 232
+ + + + + + + G E + N + + + N T++
Sbjct: 496 ELSPSFDQEI----------MGHDVEIPAKGNLVVHTNEMDENLAHNEEMRGNLAVVTQS 545
Query: 233 PLGCGDGILAANIEVSEQQDDIQRLD-----------------LSQDDCGDKLPIRRSNT 275
+ + + +I + + DD++ D S DD +P S
Sbjct: 546 IMRATNNYV--DIPFTNENDDVEFYDEDEINEMHYDDEPPTNKASLDDGEHIMPSPMSKQ 603
Query: 276 RSA-AINP------------FHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKST 322
+ AIN ++ ++ G F S++DL +K A +N V +S
Sbjct: 604 LNCDAINSMTAKPLTSRTGLWNESNELFKGLRFESEEDLQYAIKRYAICRNQHLVVCESE 663
Query: 323 KDRFEVVCA--DTNCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQ 372
+ V C C WRLRA + FEI +++ H C P Q
Sbjct: 664 PQLWAVRCKKWQEGCNWRLRACRRKSHGMFEITKYAGPHTCVYPKLSQDHSQ 715
>gi|4567298|gb|AAD23710.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 667
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 50/107 (46%), Gaps = 4/107 (3%)
Query: 260 SQDDCGDKLPIRRSNTR----SAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFA 315
+ D G P R+S T SA + + + + G+ F SK D K+ + A F
Sbjct: 166 TSDKAGPAGPSRQSGTEDSDNSAFFDVKYEGDKLFVGRVFKSKFDCKIKIAIHAINCKFH 225
Query: 316 FKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICT 362
F+ ++ST + C C WR+ A+K + F++R+ + H CT
Sbjct: 226 FRTARSTPKFMVLKCISKTCPWRVYASKVDSSDSFQVRQANQRHTCT 272
>gi|147854235|emb|CAN83434.1| hypothetical protein VITISV_010136 [Vitis vinifera]
Length = 315
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 54/237 (22%), Positives = 99/237 (41%), Gaps = 45/237 (18%)
Query: 145 VRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVP 204
+ FFI+ N + +L +P+CIT++R+ E+ H P S S+ V
Sbjct: 5 LNFFIRLNCTN--KLPAPLCITMDRRSENNAESMFMHG----NGPVNDGSIE---SLNVV 55
Query: 205 GMQATEVEEQNLNAFNGDPVAAN---------FGTEAPLGCGDGILAANIEVSEQQD--- 252
G ++++ FN +P+ + + + D L +N++V+E +D
Sbjct: 56 G-------DESIKKFNYEPLERSNVVEWNMNRYAIDDDYHVLDTNLTSNVQVTENRDSSN 108
Query: 253 ------DIQRLDLSQDDCGDKLP--IRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKK 304
+I + +D + +P I + ++ + + CG + D I+
Sbjct: 109 KIAQIMEIHSIMNIKDRLMNDVPTMIEEVSNNDQDMSRIGTSD---CG----TNNDHIED 161
Query: 305 LKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
++ K F F+ KST + C D CKWR RATK YF++ ++ H C
Sbjct: 162 KQIYFKK--FEFRTIKSTTKLLVLQCVDNECKWRFRATKLGSSNYFQVMKYHPTHTC 216
>gi|147816100|emb|CAN68488.1| hypothetical protein VITISV_043691 [Vitis vinifera]
Length = 1329
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 3/84 (3%)
Query: 284 HSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA---DTNCKWRLR 340
H + + G+ F SK DL +K+ + + V STK + C + C WRLR
Sbjct: 233 HPTGEFIVGQIFNSKGDLQHAVKMYXINSHQEYIVLSSTKKLLVLRCKKXEQSQCPWRLR 292
Query: 341 ATKTAEDEYFEIRRFSNIHICTQP 364
AT FEI ++S H C P
Sbjct: 293 ATVVKGTSLFEINKYSGPHTCVNP 316
>gi|8655994|gb|AAF78267.1|AC020576_11 Contains weak similarity to 25.7 kDa protein from Cicer arietinum
gb|AJ276422 and contains a transposase mutator PF|00872
domain. ESTs gb|T13756, gb|AA712647, gb|AA585980 come
from this gene [Arabidopsis thaliana]
Length = 1206
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 64/150 (42%), Gaps = 10/150 (6%)
Query: 217 NAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTR 276
+ N + V + + +GC G + D ++L+ D+ D P+
Sbjct: 561 SGVNAEKVKKEWAAKTKVGCKAG------SSHGLRGDARKLEEIDDEEFDTPPLFEDIEY 614
Query: 277 SAAINPFHSYED----IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCAD 332
P ED I GK + SK+D L + A K F FK +++ + F + C+D
Sbjct: 615 EVENIPDLDIEDDGKGIYKGKVYASKEDCQIGLAIYAIKNMFHFKQTRTKWNYFVLSCSD 674
Query: 333 TNCKWRLRATKTAEDEYFEIRRFSNIHICT 362
C WR+ AT Y+EI++ S H C+
Sbjct: 675 EKCDWRILATLMKGTGYYEIKKASLDHTCS 704
>gi|9802586|gb|AAF99788.1|AC012463_5 T2E6.8 [Arabidopsis thaliana]
Length = 708
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 42/84 (50%)
Query: 278 AAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
A +N + I G+ F++K +L K L + + K F F+++KS K R VC D C W
Sbjct: 334 ANMNLRKGGDAIYIGRVFSNKGELQKALTVYSMKMLFNFRINKSDKIRVIAVCHDKKCDW 393
Query: 338 RLRATKTAEDEYFEIRRFSNIHIC 361
R+ T E EIR + H C
Sbjct: 394 RVYTTFHENSENMEIRTATLKHTC 417
>gi|7263611|emb|CAB81577.1| putative transposase [Arabidopsis thaliana]
Length = 675
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
E + F ++K + + + L A K+ F FK KS ++VC D C W+L +
Sbjct: 375 ETLFVESVFKNRKVMQQTMSLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWQLTTRVVKD 434
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
E F+I ++ H C KN K
Sbjct: 435 SESFKITSYATTHTCNIDSRKNYNKH 460
>gi|4263652|gb|AAD15374.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 343
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 1/89 (1%)
Query: 274 NTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADT 333
+ ++A +N +++ G+TF S++D L + A F FK ++ K V C D
Sbjct: 42 DAKAAHVNLSKENDNMFVGRTFASREDFRIALSIYAINFIFRFKFTRYEKHYLVVECYDK 101
Query: 334 N-CKWRLRATKTAEDEYFEIRRFSNIHIC 361
C WR+RA + + E +E R+ + H+C
Sbjct: 102 QCCDWRVRAHQVGDSEEYEFRKAKSEHMC 130
>gi|384254164|gb|EIE27638.1| hypothetical protein COCSUDRAFT_39250 [Coccomyxa subellipsoidea
C-169]
Length = 282
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 78/189 (41%), Gaps = 43/189 (22%)
Query: 456 CIGLPEERLLVLHALSRGFAIITISS-----AGRCWTLG---EERLVVREIIRWWVERH- 506
C+G PE LV A+ RG+A++ + + RC+ EE + II+ V +H
Sbjct: 5 CVGSPEVMTLVYKAVQRGYAVVAVGAHRGGDKWRCYQTHWPPEEYHEISGIIK--VMKHL 62
Query: 507 ----KLEKLPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVH 562
K LP A G+SSGG AL LRF + L Q+ + +PP
Sbjct: 63 MTERKWWHLPRYAFGSSSGG--CIALELALRFP-----LQASLSSQL---QYFPP----- 107
Query: 563 MPKDSYRQRKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQ 622
I FL G +IE + L+ FF R+ G+ S ++F Q
Sbjct: 108 ----------ISRFLDF---AGTVTRKIELKPYALTPQFFAGRMRGVSAEISQQIFARLQ 154
Query: 623 EKGFIDENG 631
IDENG
Sbjct: 155 AIHVIDENG 163
>gi|147779774|emb|CAN74680.1| hypothetical protein VITISV_039715 [Vitis vinifera]
Length = 634
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 35/51 (68%), Gaps = 1/51 (1%)
Query: 120 YVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERK 170
Y +K ++ ANIPT P+++ DD DV+FFI N + +L P+CIT+E++
Sbjct: 29 YFEEVKYVFNANIPTSPIQLTDDGDVKFFIDLNCTNG-KLHVPLCITVEKR 78
>gi|225452988|ref|XP_002262784.1| PREDICTED: uncharacterized protein LOC100258751 [Vitis vinifera]
gi|147858253|emb|CAN83921.1| hypothetical protein VITISV_011842 [Vitis vinifera]
Length = 749
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 55/133 (41%), Gaps = 20/133 (15%)
Query: 245 IEVSEQQDDIQRLDLSQDDCGDKLPIRRSN----------------TRSAAINPFHSYED 288
+ VSE Q+ LDL+ DD ++ I ++ R+ NP SYE
Sbjct: 119 LAVSENQELDDNLDLAVDD-HQEMGIESTSDMVQQHQLVVSTPVLQARTVVANP--SYE- 174
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+V G+ F K + L+ A +F + KS K RF CA C WR+ A K
Sbjct: 175 LVVGQEFPDVKSCRRALRDTAIALHFEMQTIKSDKTRFTAKCASDGCPWRIHAAKLPGVP 234
Query: 349 YFEIRRFSNIHIC 361
F IR H C
Sbjct: 235 TFTIRTIHEAHTC 247
>gi|356575297|ref|XP_003555778.1| PREDICTED: uncharacterized protein LOC100797259 [Glycine max]
Length = 758
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 66/302 (21%), Positives = 118/302 (39%), Gaps = 42/302 (13%)
Query: 63 MSQVRVLVMYD--GKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEY 120
M+ ++++ + GK+ G Y GG I + D + + E + + + + +
Sbjct: 1 MAAMKIIAICQSGGKFVTGKDGCLSYKGGDAHAIDIGDLMKFDEFKEEVAEMFNIRADS- 59
Query: 121 VITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGY 180
+++K N + + I +D D++ I+ + + T I I IE
Sbjct: 60 -MSIKYFLPGNKKIL-ITISNDKDLQRMIKFHVSSST---VDIYILIEEV---------- 104
Query: 181 HAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGI 240
AA + ASRS +S V V + LNAF+ DG+
Sbjct: 105 -AAPEVSNMPASRSSRTTLSETVA------VAPEPLNAFH-------------TAVADGV 144
Query: 241 LAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIV-CGKTFTSKK 299
L + + Q D +D + +P+R SN A I G+ F+S
Sbjct: 145 LDV-VHDTNQIDTNTDMDTPLE--VPPVPLRSSNDEKYAKGAQQWQNTITGVGQRFSSVH 201
Query: 300 DLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIH 359
+ + L+ A FAFK K+ R V C C WR+ A++ + + I++ ++ H
Sbjct: 202 EFRESLRKYAIAHQFAFKYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNSTH 261
Query: 360 IC 361
C
Sbjct: 262 TC 263
>gi|20197833|gb|AAM15272.1| putative Mutator-like transposase [Arabidopsis thaliana]
gi|20198268|gb|AAD32762.2| putative Mutator-like transposase [Arabidopsis thaliana]
Length = 696
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/184 (22%), Positives = 76/184 (41%), Gaps = 43/184 (23%)
Query: 185 DHQTPAASRSESNQIS--VVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPL----GCGD 238
+H +P + E +I +++ +A + L+ F+ +P+ FG +APL G G+
Sbjct: 175 NHVSPITNDVEETEIQRQIIMFWGRAKDALNTILSDFSDEPIL--FGRDAPLVFNDGKGE 232
Query: 239 GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSK 298
G+ +A +V ++ E + G F SK
Sbjct: 233 GVGSAFFDVKYER-----------------------------------EKLFVGIVFKSK 257
Query: 299 KDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNI 358
D K+ + A + F F+ ++ST + C C WR+ A+K + F++R+ +
Sbjct: 258 SDCKIKIVIHAINRKFHFRSARSTPKFMVLKCISRTCPWRVYASKVDSSDSFQVRQANQK 317
Query: 359 HICT 362
H CT
Sbjct: 318 HTCT 321
>gi|159491476|ref|XP_001703691.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270540|gb|EDO96382.1| predicted protein [Chlamydomonas reinhardtii]
Length = 208
Score = 50.8 bits (120), Expect = 0.003, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 40/72 (55%), Gaps = 10/72 (13%)
Query: 420 DVIWQIPDSP---KAVLFLAHGCNG----RAVHFWDRSPNCPNCIGLPEERLLVLHALSR 472
+V+WQ+P + + V+F+AHGC R V W R + LPEE + LH L R
Sbjct: 42 EVVWQVPAADTRLRGVMFIAHGCGCVPGCRRVGVWGRGAATGS---LPEEMNVTLHLLLR 98
Query: 473 GFAIITISSAGR 484
G+A + +SSA R
Sbjct: 99 GYAAVAVSSADR 110
>gi|147767879|emb|CAN69012.1| hypothetical protein VITISV_015954 [Vitis vinifera]
Length = 389
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 60/123 (48%), Gaps = 31/123 (25%)
Query: 63 MSQVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
+ +V ++++ +G W + + YY + G Q D E+ +
Sbjct: 8 IDEVGIMLLNEGDWVQNGNIYY-FEGCQ-------------------------DPTEHNL 41
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHA 182
+MK ++ NIP+ P+++ DD DV+ FI+ N + +L P+CITIE++ GY +
Sbjct: 42 SMKYVFIGNIPSTPIQLRDDGDVKIFIRLNCTN-GKLAVPLCITIEKRSGN----HGYES 96
Query: 183 ALD 185
++
Sbjct: 97 IIN 99
>gi|147864334|emb|CAN83003.1| hypothetical protein VITISV_003697 [Vitis vinifera]
Length = 1006
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 3/92 (3%)
Query: 284 HSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA---DTNCKWRLR 340
H + + G+ F SK DL +K+ + + + + STK + C + C WRL
Sbjct: 244 HPIGEFIVGQIFNSKGDLQHVVKMYSINSHQEYIILSSTKKLLVLRCKKAEQSQCPWRLH 303
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQ 372
AT FEI ++S H C P Q
Sbjct: 304 ATVVKGTSLFEINKYSGPHTCVNPCMNQNHHQ 335
>gi|147770608|emb|CAN75668.1| hypothetical protein VITISV_016268 [Vitis vinifera]
Length = 1139
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 3/84 (3%)
Query: 284 HSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA---DTNCKWRLR 340
H + + G+ F SK DL +K+ + + + V STK + C + C WRLR
Sbjct: 211 HPIGEFIVGQIFNSKGDLQHAVKMYSINSHQEYIVLSSTKKLLVLRCKKAEQSQCPWRLR 270
Query: 341 ATKTAEDEYFEIRRFSNIHICTQP 364
AT FEI ++S + C P
Sbjct: 271 ATVVKGTSLFEINKYSGSYTCINP 294
>gi|8953702|dbj|BAA98060.1| mutator-like transposase [Arabidopsis thaliana]
Length = 941
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%)
Query: 299 KDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNI 358
K L + + L A K+ F FK KS ++VC D C W+L A + E F+I ++
Sbjct: 338 KVLQQTMSLQAIKQCFCFKQPKSCPKTLKMVCVDETCPWQLTARVVKDSESFKITSYATT 397
Query: 359 HICTQPPAKNRKKQ 372
H C KN K
Sbjct: 398 HTCNIDSRKNYNKH 411
>gi|4773891|gb|AAD29764.1|AF076243_11 putative transposon protein [Arabidopsis thaliana]
gi|7267209|emb|CAB77920.1| putative transposon protein [Arabidopsis thaliana]
Length = 586
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 36/73 (49%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
I G F +K +L K L + + K+ F F++ S K R VC D C WR+ AT E
Sbjct: 233 IYIGWVFNNKVELHKTLTMYSMKRLFNFRIKASDKTRVIAVCDDKKCDWRVYATFHENSE 292
Query: 349 YFEIRRFSNIHIC 361
EIR + H C
Sbjct: 293 KVEIRTATLKHTC 305
>gi|2565002|gb|AAB81872.1| putative MuDR-like transposon protein [Arabidopsis thaliana]
gi|7267508|emb|CAB77991.1| putative MuDR-like transposon protein [Arabidopsis thaliana]
Length = 714
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 42/86 (48%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ + SK +L +LKL A + F F V S K C C+WR+RA++ ++
Sbjct: 260 LAVGQQYRSKFELEYRLKLLAIRDGFDFDVPTSNKTTVYYECWVDRCRWRVRASRQGDNP 319
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQTA 374
F + + H C+ +R +QT
Sbjct: 320 NFYVYIYDTEHTCSVTERSDRSRQTT 345
>gi|7529246|emb|CAB86476.1| putative protein [Arabidopsis thaliana]
Length = 608
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/277 (20%), Positives = 112/277 (40%), Gaps = 30/277 (10%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
++I Y LVD++ + +DT + + + P P+ I +DDD+ +++ N +
Sbjct: 43 EEINYLALVDKIGRKMMLDTTTSEMRFSYMPISVNPKKPIYISNDDDLLCYLEWNKIEFS 102
Query: 158 ELRSPICITI--ERKGSQCQETEGYHAALDHQ---TPAASRSESNQISVVVPGMQATEVE 212
L + ++ Q + +EG ++ + +R + + + + + +E+
Sbjct: 103 VLHVELVKDDGENQRSEQIRVSEGGEVGASNKEFVSCGITRRDHDDVMI----LYESELC 158
Query: 213 EQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRR 272
EQN N + VA E + E++D + D+ G + R
Sbjct: 159 EQNKQVENSEAVACEEQEEMAC-----EEREEMACVEREDGM-------DNQGIGVCKRV 206
Query: 273 SNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCAD 332
+ +N + G+ F SKK + L+ GA K F + + KS + V C
Sbjct: 207 NKEWEDGVN-------LTIGQEFGSKKAVQDLLEKGAHKNCFEYDIMKSDPMLYVVKCTG 259
Query: 333 TN--CKWRLRATKTAEDEYFEIRRFSNIHICTQPPAK 367
CKW +R K E F +R ++ +H C + K
Sbjct: 260 KKFGCKWFVRVAKVKNSECFSVRTYNKMHSCYRSTTK 296
>gi|147776572|emb|CAN63024.1| hypothetical protein VITISV_030782 [Vitis vinifera]
Length = 1433
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 3/84 (3%)
Query: 284 HSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA---DTNCKWRLR 340
H + + G+ F SK DL +K+ + + + V STK + C + C WRLR
Sbjct: 244 HPTGEFIVGQIFNSKGDLQHAVKMYSINSHQEYIVLSSTKKLLVLRCKKAEQSQCAWRLR 303
Query: 341 ATKTAEDEYFEIRRFSNIHICTQP 364
AT FEI ++S C P
Sbjct: 304 ATVVKGTSLFEINKYSGPXTCVNP 327
>gi|225001249|gb|ACN78480.1| putative mutator sub-class protein [Arachis hypogaea]
Length = 620
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/155 (23%), Positives = 67/155 (43%), Gaps = 31/155 (20%)
Query: 217 NAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIR----- 271
NA G P P+G GDG+L ++DD+ + D+ +DD GD+ +
Sbjct: 182 NALQGVP---------PVGVGDGLLG-----DAEEDDV-KPDMIEDDSGDEEGVSGHSVG 226
Query: 272 ---RSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEV 328
R +A + F G+ F K + + +K + ++ +KV +S R+
Sbjct: 227 FGARDAEGTAGLTEFQ------VGQQFQDKDEALLSVKTYSIRRGVQYKVVESDYRRYVG 280
Query: 329 VCAD--TNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
C++ C W +R + +E++R++ H C
Sbjct: 281 KCSEFGNGCTWLIRLSLRQRKGIWEVKRYNGPHTC 315
>gi|90399130|emb|CAJ86059.1| H0821G03.10 [Oryza sativa Indica Group]
Length = 1779
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 1/103 (0%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADT-NCKWRLRATKTAED 347
V TF + + ++ A K F VS S+K R+ CA T +C WR+ A K +
Sbjct: 1227 FVVDATFPNGEAFKLAMRQFAIKGEFEVNVSHSSKARYIAKCAKTKDCTWRIYARKETKS 1286
Query: 348 EYFEIRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPN 390
++I R S IH C + K ++ D + I +PN
Sbjct: 1287 SVWKISRLSEIHTCATTSKLSGKMASSGWIADRAKDIMTTEPN 1329
>gi|449520056|ref|XP_004167050.1| PREDICTED: uncharacterized LOC101204073 [Cucumis sativus]
Length = 582
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 1/87 (1%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
V G+ F K +K A ++F ++ KS R+ CA+ C WR+RA K
Sbjct: 6 FVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEEGCPWRIRAVKLPNSP 65
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQTAL 375
F IR H C Q A+N Q ++
Sbjct: 66 IFTIRSLQGTHTCGQ-NAQNGHHQASI 91
>gi|147852615|emb|CAN81689.1| hypothetical protein VITISV_009754 [Vitis vinifera]
Length = 771
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 3/84 (3%)
Query: 284 HSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA---DTNCKWRLR 340
H + + G+ F SK DL +K+ + + + V STK + C + C WRLR
Sbjct: 233 HPTGEFIVGQIFNSKGDLQHAVKMYSINSHQEYIVLSSTKKLLVLRCKKXEQSQCPWRLR 292
Query: 341 ATKTAEDEYFEIRRFSNIHICTQP 364
A FEI ++S H C P
Sbjct: 293 AXVVKGTSLFEINKYSGPHTCVNP 316
>gi|449459624|ref|XP_004147546.1| PREDICTED: uncharacterized protein LOC101204073 [Cucumis sativus]
Length = 582
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 1/87 (1%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
V G+ F K +K A ++F ++ KS R+ CA+ C WR+RA K
Sbjct: 6 FVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEEGCPWRIRAVKLPNSP 65
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQTAL 375
F IR H C Q A+N Q ++
Sbjct: 66 IFTIRSLQGTHTCGQ-NAQNGHHQASI 91
>gi|242085466|ref|XP_002443158.1| hypothetical protein SORBIDRAFT_08g012913 [Sorghum bicolor]
gi|241943851|gb|EES16996.1| hypothetical protein SORBIDRAFT_08g012913 [Sorghum bicolor]
Length = 354
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
DI G TFT KK + +K A K+ F V S K R+ CAD+ C W++ A K
Sbjct: 259 DIKEGSTFTDKKAFVLMIKQYAIKREFQTFVEHSDKSRYRARCADSECGWKVHAKKLLGC 318
Query: 348 EYFEIRRFSNIHIC 361
F + + H C
Sbjct: 319 PTFMVVSIGSSHTC 332
>gi|6967100|emb|CAB72483.1| putative protein [Arabidopsis thaliana]
Length = 693
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 290 VCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC--ADTNCKWRLRATKTAED 347
+ G+ F SK+ + + + ++ F K KS R + C A C W LR TKT+E
Sbjct: 189 LVGQEFRSKEAVWDLINRASKEEVFGVKTIKSDTGRLMLECSQASKGCDWYLRVTKTSET 248
Query: 348 EYFEIRRFSNIHICTQPPAKNR-KKQTALQR 377
+++ +++ + IH C++ P R +KQ R
Sbjct: 249 DFWCVKKHTQIHKCSRCPEATRYEKQKGTPR 279
>gi|11994447|dbj|BAB02449.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 819
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/93 (26%), Positives = 46/93 (49%), Gaps = 2/93 (2%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A + F F+ +S K + C +C WR+ K +
Sbjct: 342 DELFVGRVFKNKQDCNVKLAVHALNRRFHFRCDRSCKKLMTLTCISESCPWRVYIFKLED 401
Query: 347 DEYFEIRRFSNIHICTQPPAKN--RKKQTALQR 377
+ ++IR + H CT N R++Q + R
Sbjct: 402 TDNYQIRSATLEHTCTVEERCNYHREQQRGIIR 434
>gi|448931409|gb|AGE54971.1| hypothetical protein PBCVMA1D_566L [Paramecium bursaria Chlorella
virus MA-1D]
gi|448935035|gb|AGE58586.1| hypothetical protein PBCVNYs1_065R [Paramecium bursaria Chlorella
virus NYs1]
Length = 174
Score = 48.5 bits (114), Expect = 0.013, Method: Composition-based stats.
Identities = 29/112 (25%), Positives = 56/112 (50%), Gaps = 6/112 (5%)
Query: 579 VLRNKGIDVAEIECMEFPLSRNFFTDRIPGLEQANSTKLFELFQEKGFIDENGYMRSDGR 638
VLR G A + + + +F+D+I + NS K+ + Q+ GFID +G +R D +
Sbjct: 38 VLRRFGKPAAVLVSGKRKIYPEYFSDQIASISVQNSIKIVQYLQKIGFIDASGNIRKDPK 97
Query: 639 RTRWKEALRESNAIFLNKNLVHH----VQEELNLAFAYHEMTSLPSEQIFKW 686
W L+ NA+ + + + + V + + +A+A+H+ ++ KW
Sbjct: 98 DKGWYNGLK--NAVRIPETTISYWNSSVVQAMMVAYAHHDAVAVYMTTFLKW 147
>gi|242091025|ref|XP_002441345.1| hypothetical protein SORBIDRAFT_09g024820 [Sorghum bicolor]
gi|241946630|gb|EES19775.1| hypothetical protein SORBIDRAFT_09g024820 [Sorghum bicolor]
Length = 506
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
+I G TFT KK + +K A K+ F V S K R+ CAD+ C W++ A K
Sbjct: 259 EIKEGSTFTDKKAFVLMIKQYAIKREFQTFVEHSDKSRYRARCADSECGWKVHAKKLLGC 318
Query: 348 EYFEIRRFSNIHIC 361
F + + H C
Sbjct: 319 PTFMVVSIGSSHTC 332
>gi|3377831|gb|AAC28204.1| similar to maize transposon MuDR mudrA (GB:M76978) [Arabidopsis
thaliana]
gi|7267170|emb|CAB77882.1| putative transposon protein [Arabidopsis thaliana]
Length = 946
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 2/90 (2%)
Query: 274 NTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCAD- 332
+ ++A ++ +++ G+TF S++D L + A + F FK ++ K C D
Sbjct: 305 DAKAAYVDLTKEDDNMFVGRTFASREDFRIALSIYAINRIFRFKFTRYEKHYLVAECYDK 364
Query: 333 TNCKWRLRATKTAED-EYFEIRRFSNIHIC 361
NC WR+RA + D E +E+R H+C
Sbjct: 365 KNCDWRVRAHQVGGDSEEYEVRLAKLEHVC 394
>gi|4220448|gb|AAD12675.1| Similar to gi|3047071 F7N22.10 maize transposon MuDR homolog from
Arabidopsis thaliana BAC gb|AF058825 [Arabidopsis
thaliana]
Length = 729
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%)
Query: 299 KDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNI 358
K L + + L A K+ F FK +S ++VC D C W+L A + E F+I ++
Sbjct: 338 KVLQQTMSLQAIKQCFCFKQPQSCPKTLKMVCVDETCPWQLTARVVKDSESFKITSYATT 397
Query: 359 HICTQPPAKNRKKQ 372
H C KN K
Sbjct: 398 HTCNIDSRKNYNKH 411
>gi|77554727|gb|ABA97523.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1307
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 73/165 (44%), Gaps = 17/165 (10%)
Query: 205 GMQATEVE----EQNLNAFNGDPVAANFGTEAPLGCGD-----GILAANIEVSEQQDDIQ 255
M+ +VE E + +G+ + G +G D + +E + ++
Sbjct: 117 AMETEDVEGPSHEGDFVKPSGEDEGNDTGVVQAMGVADEGERISTIVDEMEREDSDNEQT 176
Query: 256 RLDLSQDDCGDKLPIRRSNT--RSAAINPFH----SYED--IVCGKTFTSKKDLIKKLKL 307
D S D+ GD +P +N AI+ H Y+D ++ G T+ K+++ + +K
Sbjct: 177 EGDASSDEEGDVMPTDWTNEDFSGLAISEGHHVPWEYKDNEVIEGATYAHKEEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K +Y+++
Sbjct: 237 WAVSLMREFRVVKSTNYVYEVRCVKEDCLWRVHAYKGKWKDYWKV 281
>gi|54261828|gb|AAV31178.1| Putative transposase, identical [Solanum tuberosum]
Length = 587
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 54/291 (18%), Positives = 114/291 (39%), Gaps = 73/291 (25%)
Query: 90 QTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFI 149
Q +GII Y LV + + +D N + I +K + P PV I +D VR F+
Sbjct: 8 QMEGIIYEYSSLYSGLVSAITSQLMIDANIHNIEIKYIVSDRCP--PVSIHNDVGVRVFL 65
Query: 150 QENAAHPTEL-RSPICITIERKGSQCQETEGYHAALDHQTPAASRSESN-----QISVVV 203
+ ++ + P+CIT++ Q++ +D + SR+ +N S+ +
Sbjct: 66 DQKKSNVDFFTKYPLCITLKNIAIDNQDSI---VVVDRRHSDVSRTLTNFDLYSSNSIRL 122
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILA--ANIEVSEQQDDIQRLDLSQ 261
G+ V ++N + G+G+++ +N+ ++E+Q
Sbjct: 123 IGVNLDGVVDENTDE------------------GNGVISDLSNLFIAEKQ---------- 154
Query: 262 DDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKS 321
+ K+ L++ ++ + F F V++S
Sbjct: 155 --------------------------------IYNDKETLMEVMRHYGVVEKFKFLVTRS 182
Query: 322 TKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQ 372
+ + + C NC W + ++ + + F+IR++ H C+ N ++Q
Sbjct: 183 SSSCYYLKCPADNCSWMMNSSCLNQSKLFKIRKYCVEHTCSVRDRVNARRQ 233
>gi|8886998|gb|AAF80658.1|AC012190_14 Similar to At2g29230 Mutator-like transposase gi|3980409 from
Arabidopsis thaliana gb|AC004561. It is a member of
Transposase mutator family PF|00872 [Arabidopsis
thaliana]
Length = 904
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 38/76 (50%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A + F F+ +S K + C C WR+ K +
Sbjct: 342 DELFVGRVFKNKQDCNVKLAVHALNRRFHFRRDRSCKKLMTLTCISETCPWRVYIVKLED 401
Query: 347 DEYFEIRRFSNIHICT 362
+ ++IR + H CT
Sbjct: 402 SDNYQIRSANLEHTCT 417
>gi|4585931|gb|AAD25591.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 731
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 64/272 (23%), Positives = 107/272 (39%), Gaps = 34/272 (12%)
Query: 100 ITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTEL 159
ITY LVD++ V +D E + +K Y + IVDD+DV F+ E+ E
Sbjct: 42 ITYLMLVDKIMKKVAID--EASLKLKLSYNLSKVRRETYIVDDEDVFIFLTES---DEES 96
Query: 160 RSPICITIERKGSQCQETEGYHAALDHQTPAASRSES--NQISVVVPGMQATEVEEQNLN 217
R P+ E G + E R + N++ +V +V + N
Sbjct: 97 RIPVLHVEELNGIGVERVEISVPERQSSVGVNVREDDVLNEVCEMVHDGYENDVNDCE-N 155
Query: 218 AFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRS 277
+ VA E P + E++D+ R+D DD D +P RS
Sbjct: 156 VVGMEIVAVQRPMERP-----------VNFDEEEDEDVRIDY--DDIHD-IP------RS 195
Query: 278 AAINP-FHSYEDIVC---GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC--A 331
+ P ++D C + F S++ + + + A ++ F KS R + C A
Sbjct: 196 VEVTPHVKEWDDGTCIEIDQEFCSREAVWELVNKAAKQEVFGVNTIKSEPLRLMLRCRQA 255
Query: 332 DTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
C W LR +T + + +R +H C++
Sbjct: 256 SKGCTWYLRVARTKKSHFSSVRVHRKVHTCSR 287
>gi|4063759|gb|AAC98466.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 715
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 38/84 (45%), Gaps = 5/84 (5%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+DI G+ F SK+D KL + A + F F +KS + VC C W + ATK +
Sbjct: 395 DDIYIGRIFKSKEDCATKLGIQAIHRKFHFIYTKSGPNVVAAVCVSNTCPWWVYATKLPD 454
Query: 347 DEYFEIRRFSNIHICTQPPAKNRK 370
+ RF C Q A RK
Sbjct: 455 SD-----RFERHDACVQGYAYMRK 473
>gi|9802561|gb|AAF99763.1|AC003981_13 F22O13.21 [Arabidopsis thaliana]
gi|9293930|dbj|BAB01833.1| Mutator-like transposase [Arabidopsis thaliana]
gi|10177478|dbj|BAB10869.1| mutator-like transposase [Arabidopsis thaliana]
Length = 915
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 38/76 (50%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A + F F+ +S K + C C WR+ K +
Sbjct: 342 DELFVGRVFKNKQDCNVKLAVHALNRRFHFRRDRSCKKLMTLTCISETCPWRVYIVKLED 401
Query: 347 DEYFEIRRFSNIHICT 362
+ ++IR + H CT
Sbjct: 402 SDNYQIRSANLEHTCT 417
>gi|3980409|gb|AAC95212.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 915
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 38/76 (50%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A + F F+ +S K + C C WR+ K +
Sbjct: 342 DELFVGRVFKNKQDCNVKLAVHALNRRFHFRRDRSCKKLMTLTCISETCPWRVYIVKLED 401
Query: 347 DEYFEIRRFSNIHICT 362
+ ++IR + H CT
Sbjct: 402 SDNYQIRSANLEHTCT 417
>gi|53749362|gb|AAU90221.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1542
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|108707714|gb|ABF95509.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1440
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|38344204|emb|CAE05769.2| OSJNBa0064G10.20 [Oryza sativa Japonica Group]
Length = 1602
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|50511430|gb|AAT77353.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1620
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P M+ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMENEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K D+ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDDMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|2660662|gb|AAC79133.1| putative Mutator-like transposase, 3' partial [Arabidopsis
thaliana]
Length = 792
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 22/85 (25%), Positives = 40/85 (47%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A + F F+ +S K + C C WR+ K +
Sbjct: 342 DELFVGRVFKNKQDCNVKLAVHALNRRFHFRRDRSCKKLMTLTCISETCPWRVYIVKLED 401
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKK 371
+ ++IR + H CT N +
Sbjct: 402 SDNYQIRSANLEHTCTVEERSNYHR 426
>gi|297720761|ref|NP_001172742.1| Os01g0947400 [Oryza sativa Japonica Group]
gi|255674069|dbj|BAH91472.1| Os01g0947400 [Oryza sativa Japonica Group]
Length = 1876
Score = 47.8 bits (112), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKAKWNDYWKV 281
>gi|110288976|gb|ABG66035.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1575
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 71/164 (43%), Gaps = 22/164 (13%)
Query: 209 TEVEEQNLNAFNGDPVAANFGTE-------APLGCGD-----GILAANIEVSEQQDDIQR 256
TE E NA GD V + G E +G D GI+ +E + ++
Sbjct: 120 TEDVEGTSNA--GDVVGPSMGNEENQPREEQAMGMADEGERVGIIVDEMEREDSDNEQVE 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + SN + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWSNEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|18266653|gb|AAL67599.1|AC018929_21 mutator-like transposase [Oryza sativa Japonica Group]
gi|31433669|gb|AAP55153.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1638
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|62701796|gb|AAX92869.1| transposon protein, putative, mutator sub-class [Oryza sativa
Japonica Group]
gi|77549851|gb|ABA92648.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1562
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 72/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN+ VV P M+ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNEGDVVGPSMENEENQPREEQAMG----MADEGERV------GIIVDEMERKDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +I+ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEIIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|77552377|gb|ABA95174.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1344
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANKDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|29788839|gb|AAP03385.1| putative mutator-like transposase [Oryza sativa Japonica Group]
gi|108710328|gb|ABF98123.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1656
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVNEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|108862633|gb|ABG22009.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1280
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 33 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 82
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 83 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEIEEAVKH 142
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 143 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 187
>gi|77557073|gb|ABA99869.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1656
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|77553913|gb|ABA96709.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1537
Score = 47.4 bits (111), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 8/103 (7%)
Query: 258 DLSQDDCGDKLPIRRSNTRSAA--INPFH----SYED--IVCGKTFTSKKDLIKKLKLGA 309
D S D+ GD +P +N + I+ H Y+D ++ G T+ K+D+ + +K A
Sbjct: 179 DASSDEEGDVMPTDWTNEDFSGLIISEGHHLPWEYKDNEVIEGATYAHKEDMKEAVKHWA 238
Query: 310 AKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
F+V KST +EV C +C WR+ A K +Y+++
Sbjct: 239 VSLIREFRVVKSTNYVYEVRCVKEHCPWRVHAYKGKWKDYWKV 281
>gi|52353740|gb|AAU44306.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1175
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----------MADEGERVGIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|242082275|ref|XP_002445906.1| hypothetical protein SORBIDRAFT_07g027820 [Sorghum bicolor]
gi|241942256|gb|EES15401.1| hypothetical protein SORBIDRAFT_07g027820 [Sorghum bicolor]
Length = 846
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 57/133 (42%), Gaps = 4/133 (3%)
Query: 245 IEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYED---IVCGKTFTSKKDL 301
+ V++ D I D +D C D + R F S + F SKK+L
Sbjct: 219 VPVNDFGDGIAYFDSDEDVCYDDDSETNAKRRKCRFPIFDSRAETPQFTLDMCFRSKKEL 278
Query: 302 IKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
++ A K K K+ K R VC+ C W + A+ ++ ++F+I ++ H C
Sbjct: 279 KDAIERYALKMKVNIKFPKNDKKRLRAVCSWKGCPWLVHASYNSKSDWFQIVTYNPNHAC 338
Query: 362 TQPPAKNRKKQTA 374
P KN++ TA
Sbjct: 339 C-PVLKNKRLSTA 350
>gi|6691198|gb|AAF24536.1|AC007534_17 F7F22.3 [Arabidopsis thaliana]
Length = 818
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 41/81 (50%)
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRR 354
F +K+D KL + ++ F F +KS + +C C WR+ AT+ E + FEI+
Sbjct: 505 FKNKEDCTTKLAIHPIRRKFHFIYAKSCPNIVFAMCVSHTCLWRVYATELEEIDRFEIKC 564
Query: 355 FSNIHICTQPPAKNRKKQTAL 375
+ +H C+ + KQ ++
Sbjct: 565 ATQLHTCSVDARGDFHKQASI 585
>gi|3047068|gb|AAC13582.1| similar to maize transposon MuDR (GB:M76978) [Arabidopsis thaliana]
gi|8843876|dbj|BAA97402.1| mutator-like transposase [Arabidopsis thaliana]
Length = 806
Score = 47.0 bits (110), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ + SK +L +LKL A + F F V S K C C WR+RA++ +
Sbjct: 260 LAVGQQYRSKFELEYRLKLLAIRDGFDFDVPTSNKTTVSYECWVDRCLWRVRASRQGNNP 319
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQ 372
F + + + H C+ R +Q
Sbjct: 320 NFHVYIYDSEHTCSVTERSGRSRQ 343
>gi|218184683|gb|EEC67110.1| hypothetical protein OsI_33908 [Oryza sativa Indica Group]
Length = 388
Score = 47.0 bits (110), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 32/73 (43%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
I G TF K + ++ A K F + S KDR+ CAD +CKW + A + D
Sbjct: 70 IKAGCTFVDKSAFVLTIRQHAIKNEFETNIKHSNKDRYRAKCADPDCKWVVYAKRVLGDV 129
Query: 349 YFEIRRFSNIHIC 361
F I H C
Sbjct: 130 MFMIVNIGPAHTC 142
>gi|8778375|gb|AAF79383.1|AC007887_42 F15O4.15 [Arabidopsis thaliana]
Length = 739
Score = 47.0 bits (110), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 38/76 (50%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A + F F+ +S K + C C WR+ K +
Sbjct: 268 DELFVGRVFKNKQDCNVKLAVHALNRRFYFRRDRSCKKLTTLTCISETCPWRVYIVKLED 327
Query: 347 DEYFEIRRFSNIHICT 362
+ ++IR + H CT
Sbjct: 328 SDNYQIRSANLEHTCT 343
>gi|18855019|gb|AAL79711.1|AC091774_2 putative transposon protein [Oryza sativa Japonica Group]
Length = 1656
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEECPWRVHAYKGKWNDYWKV 281
>gi|108706173|gb|ABF93968.1| retrotransposon protein, putative, unclassified [Oryza sativa
Japonica Group]
Length = 2453
Score = 46.6 bits (109), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 65 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 114
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 115 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 174
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 175 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 219
>gi|4850412|gb|AAD31079.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 819
Score = 46.6 bits (109), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 38/84 (45%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ + SK +L +LKL A + F F V S K C C WR+RA + D
Sbjct: 255 LAVGQQYRSKFELEYRLKLLAIRDGFDFDVPTSNKTTVSYECWVDRCLWRVRACRQGNDP 314
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQ 372
F + + + H C+ R +Q
Sbjct: 315 NFYVYIYDSEHTCSVRERSGRSRQ 338
>gi|15451606|gb|AAK98730.1|AC090485_9 Putative mutator-like transposase [Oryza sativa Japonica Group]
Length = 2421
Score = 46.6 bits (109), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 33 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 82
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 83 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKEYEVIEGARYAHKDEMKEAVKH 142
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 143 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 187
>gi|356534979|ref|XP_003536027.1| PREDICTED: uncharacterized protein LOC100776861 [Glycine max]
Length = 759
Score = 46.6 bits (109), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 67/306 (21%), Positives = 119/306 (38%), Gaps = 49/306 (16%)
Query: 63 MSQVRVLVMYD--GKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEY 120
M+ ++++ + GK+ G Y GG I + D + + E + + + + +
Sbjct: 1 MAAMKIIAICQSGGKFVTGKDGSLSYKGGDAHAIDIDDLMKFNEFKEEVAEMFSIRADS- 59
Query: 121 VITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGY 180
I++K N + + I +D D++ I+ + ++ T I I IE
Sbjct: 60 -ISIKYFLPGNKKIL-ITISNDKDLQRMIKFHGSYST---VDIYILIEEV---------- 104
Query: 181 HAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGI 240
AA + ASRS +S V V LNAF+ D +
Sbjct: 105 -AAPELSNMPASRSSRTTLSETV-----VAVAPAPLNAFHTH-------------VADDV 145
Query: 241 LAANIEVSEQQDDIQRLDLSQD-DCGDKLP---IRRSNTRSAAINPFHSYEDIV-CGKTF 295
L + D ++D + D D ++P +R SN A I G+ F
Sbjct: 146 L-------DVVHDTNQIDTNMDIDIPLEVPPVSLRSSNDVKYAKGAQQWQNTITGVGQRF 198
Query: 296 TSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRF 355
+S + + L+ A FAFK K+ R V C C WR+ A++ + + I++
Sbjct: 199 SSVHEFRESLRKYAIAHQFAFKYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKM 258
Query: 356 SNIHIC 361
++ H C
Sbjct: 259 NSTHNC 264
>gi|38344783|emb|CAE02984.2| OSJNBa0043L09.3 [Oryza sativa Japonica Group]
Length = 1468
Score = 46.6 bits (109), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEECPWRVHAYKGKWNDYWKV 281
>gi|116309063|emb|CAH66173.1| H0725E11.4 [Oryza sativa Indica Group]
Length = 794
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 40/170 (23%), Positives = 71/170 (41%), Gaps = 28/170 (16%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGD-----GILAANIEVSEQ 250
SN VV P MQ E + + E +G D GI+ +E +
Sbjct: 127 SNAGDVVGPSMQNEENQPRE---------------EQAMGMADEGERVGIIVDEMEREDS 171
Query: 251 QDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLI 302
++ D S D+ GD + +N + + P+ E +++ G + K ++
Sbjct: 172 DNEQAEDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMK 231
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ +K A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 232 EAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|38344773|emb|CAE01499.2| OSJNBb0026L04.4 [Oryza sativa Japonica Group]
Length = 794
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 40/170 (23%), Positives = 71/170 (41%), Gaps = 28/170 (16%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGD-----GILAANIEVSEQ 250
SN VV P MQ E + + E +G D GI+ +E +
Sbjct: 127 SNAGDVVGPSMQNEENQPRE---------------EQAMGMADEGERVGIIVDEMEREDS 171
Query: 251 QDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLI 302
++ D S D+ GD + +N + + P+ E +++ G + K ++
Sbjct: 172 DNEQAEDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMK 231
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ +K A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 232 EAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|110289148|gb|AAP53990.2| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1613
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 71/170 (41%), Gaps = 28/170 (16%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGD-----GILAANIEVSEQ 250
SN VV P M E EE L E +G D GI+ +E +
Sbjct: 127 SNAGDVVGPSM---ENEENQLR------------EEQAMGMADEGERVGIIVDEMEREDS 171
Query: 251 QDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLI 302
++ D S D+ GD + +N + + P+ E +++ G + K ++
Sbjct: 172 DNEQAEDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMK 231
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
+ +K A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 232 EAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|116309874|emb|CAH66911.1| OSIGBa0126B18.4 [Oryza sativa Indica Group]
Length = 1194
Score = 46.2 bits (108), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P M+ E + + A A+ G GI+ +E + ++
Sbjct: 33 SNAGDVVGPSMENEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 82
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 83 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 142
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 143 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 187
>gi|297795759|ref|XP_002865764.1| octicosapeptide/Phox/Bem1p domain-containing protein [Arabidopsis
lyrata subsp. lyrata]
gi|297311599|gb|EFH42023.1| octicosapeptide/Phox/Bem1p domain-containing protein [Arabidopsis
lyrata subsp. lyrata]
Length = 285
Score = 46.2 bits (108), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 6/95 (6%)
Query: 64 SQVRVLVMYDGKWFCSPS-GYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVI 122
S+V+ + + G+ PS G Y GG+T+ I VS DI++ ELV +L I + D ++
Sbjct: 9 SKVKFMCSFGGRILPRPSDGVLKYVGGETRVIAVSPDISFSELVKKLAAITEND----IV 64
Query: 123 TMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
+ ++ + V + D+DV+ I+E H T
Sbjct: 65 LKYQIIPEDLDAL-VSVKSDEDVKHMIEEYNRHET 98
>gi|77552340|gb|ABA95137.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1905
Score = 46.2 bits (108), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P M+ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMENEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|108862177|gb|ABA96424.2| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 972
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAMKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|225429092|ref|XP_002269076.1| PREDICTED: uncharacterized protein LOC100253276 isoform 1 [Vitis
vinifera]
gi|359475510|ref|XP_003631693.1| PREDICTED: uncharacterized protein LOC100253276 isoform 2 [Vitis
vinifera]
gi|147838852|emb|CAN72363.1| hypothetical protein VITISV_011058 [Vitis vinifera]
Length = 583
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 1/88 (1%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
+ V G+ F K +K A ++F ++ KS R+ CA C WR+RA K
Sbjct: 5 NFVVGQEFADVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCATEGCPWRIRAVKLPNA 64
Query: 348 EYFEIRRFSNIHICTQPPAKNRKKQTAL 375
F IR H C + A+N Q ++
Sbjct: 65 PTFTIRSLEGTHTCGK-NAQNGHHQASV 91
>gi|326519080|dbj|BAJ96539.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531422|dbj|BAK05586.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 720
Score = 46.2 bits (108), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 285 SYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
S+E+ + G+ F K+ L A F ++ K+ R V C D C WR+ A+
Sbjct: 139 SWENTITGVGQVFEGPKEFRDALHKYAIAHRFHYRFVKNDSSRVTVECTDEGCPWRIHAS 198
Query: 343 KTAEDEYFEIRRFSNIHICTQPPAKNRK 370
K++ ++ F I++ H C K+ +
Sbjct: 199 KSSANQEFMIKKVVGSHTCESESVKSNR 226
>gi|77552337|gb|ABA95134.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1615
Score = 46.2 bits (108), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 71/165 (43%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P M+ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMENEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANENFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|218184054|gb|EEC66481.1| hypothetical protein OsI_32567 [Oryza sativa Indica Group]
Length = 405
Score = 46.2 bits (108), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 5/119 (4%)
Query: 238 DGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYED----IVCGK 293
D ++ N ++ Q+ + + QD + +P AI P SY+ I G
Sbjct: 180 DQEMSKNHQICHDQEMSKNQQIHQDHGTNNVPGDEV-VLDTAIVPHTSYDRDDPAIKAGC 238
Query: 294 TFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
TF K + ++ A K F + S KDR+ CAD +CKW + A + D F +
Sbjct: 239 TFVDKSAFVLTIRQHAIKNEFETNIKHSNKDRYRAKCADPDCKWVVYAKRVLGDVMFMV 297
>gi|147853693|emb|CAN83825.1| hypothetical protein VITISV_043699 [Vitis vinifera]
Length = 1349
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 8/77 (10%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
++ G+ F+SK DL +K+ + K + F V +S A +C+WRLRA +
Sbjct: 267 ELSIGQIFSSKSDLQHAVKMFSIKAHQEFTVYRSN--------ATPHCQWRLRAMTVKDT 318
Query: 348 EYFEIRRFSNIHICTQP 364
F I ++ H C P
Sbjct: 319 GMFRITKYKGPHTCVNP 335
>gi|108864689|gb|ABG22596.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1468
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEQAEDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|31432149|gb|AAP53819.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1421
Score = 45.8 bits (107), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 209 TEVEEQNLNAFNGDPVAANFGTE-------APLGCGD-----GILAANIEVSEQQDDIQR 256
TE E ++NA GD V + E +G D GI+ +E + ++
Sbjct: 58 TEDVEGHINA--GDVVGPSMQNEENQPREEQAMGMADEGERVGIIVDEMEREDSDNEEAD 115
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 116 DDASSDEEGDVMATDWANEDFSGLVVSEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 175
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 176 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 219
>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 640
Score = 45.8 bits (107), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 7/82 (8%)
Query: 415 LRNGTDV----IWQIP-DSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHA 469
L NGT++ I+ +P D P+A+LF C+G F+ S + P + LP + L A
Sbjct: 505 LLNGTELSNWKIYPLPLDHPEAILFKNAPCSGPC--FFRTSISAPAAMTLPADTFLETTA 562
Query: 470 LSRGFAIITISSAGRCWTLGEE 491
+ +GFA + + GR W +G +
Sbjct: 563 IHKGFAWVNAVALGRAWNIGPQ 584
>gi|77551584|gb|ABA94381.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1503
Score = 45.8 bits (107), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 66 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEAD 115
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 116 DDASSDEEGDVMATDWANEDFSGLVILEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 175
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 176 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 219
>gi|356505949|ref|XP_003521751.1| PREDICTED: uncharacterized protein LOC100795835 [Glycine max]
Length = 758
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 38/86 (44%), Gaps = 3/86 (3%)
Query: 276 RSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNC 335
R+ AI+P +YE + G+ F K + L+ A +F + KS K RF C C
Sbjct: 168 RTLAISP--NYE-LAVGQEFPDVKSCRRALRDTAIALHFEMQTIKSDKTRFTAKCRSEGC 224
Query: 336 KWRLRATKTAEDEYFEIRRFSNIHIC 361
WR+ A K F IR H C
Sbjct: 225 PWRIHAAKLPGVPTFTIRTVHESHTC 250
>gi|42491376|gb|AAS16882.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1433
Score = 45.4 bits (106), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|125534636|gb|EAY81184.1| hypothetical protein OsI_36364 [Oryza sativa Indica Group]
Length = 637
Score = 45.4 bits (106), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 34/73 (46%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
GK+F SKK L + KN K S S++ + C D +C WRL AT T +
Sbjct: 113 GKSFDSKKHLQIAIGEFHICKNSEIKYSTSSQSKLVAECTDNSCMWRLYATPTKIGSGWM 172
Query: 352 IRRFSNIHICTQP 364
IR+ H C P
Sbjct: 173 IRKCPYAHTCRAP 185
>gi|38344160|emb|CAD41880.2| OSJNBa0041A02.27 [Oryza sativa Japonica Group]
Length = 1473
Score = 45.4 bits (106), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSYNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|48843800|gb|AAT47059.1| putative mutator-like transposase [Oryza sativa Japonica Group]
Length = 1403
Score = 45.4 bits (106), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|29244695|gb|AAO73287.1| putative mutator-like transposase [Oryza sativa Japonica Group]
gi|108712091|gb|ABF99886.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1153
Score = 45.4 bits (106), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 18/165 (10%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQ 255
SN VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 127 SNAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEQA 176
Query: 256 RLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKL 307
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 177 EDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKH 236
Query: 308 GAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C C WR+ A K ++Y+++
Sbjct: 237 WAVSLQREFRVIKSTNYVYEVRCMKEECPWRVHAYKGKWNDYWKV 281
>gi|108706867|gb|ABF94662.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1055
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|22758265|gb|AAN05493.1| Putative transposase protein [Oryza sativa Japonica Group]
gi|108706053|gb|ABF93848.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 779
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|77552251|gb|ABA95048.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1769
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEQAEDDASSDEEGDVMATDWANEDFSGLVILEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|4388831|gb|AAD19786.1| Mutator-like transposase [Arabidopsis thaliana]
Length = 874
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 37/76 (48%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+++ G+ F +K+D KL + A F F+ +S K + C C WR+ K +
Sbjct: 327 DELFVGRVFKNKQDCNVKLAVHALNSRFHFRRDRSYKKLMTLTCISELCLWRVYIVKLED 386
Query: 347 DEYFEIRRFSNIHICT 362
+ ++IR + H CT
Sbjct: 387 SDNYQIRSATLEHTCT 402
>gi|77551288|gb|ABA94085.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1698
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIDGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|28372672|gb|AAO39856.1| hypothetical protein [Oryza sativa Japonica Group]
gi|31249756|gb|AAP46248.1| putative mutator-like transposase [Oryza sativa Japonica Group]
gi|108711898|gb|ABF99693.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 342
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|10140679|gb|AAG13514.1|AC068924_19 mutator-like transposase [Oryza sativa Japonica Group]
Length = 1626
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEKEDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|38345409|emb|CAE03100.2| OSJNBa0017B10.15 [Oryza sativa Japonica Group]
Length = 1405
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|28393406|gb|AAO42126.1| unknown protein [Arabidopsis thaliana]
Length = 288
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 61 RKMSQVRVLVMYDGKWFCSPSGYYL-YSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNE 119
R S+V+ + + G+ PS L Y GG+T+ + VS DI++ EL+ +L I + D
Sbjct: 5 RSPSKVKFMCSFGGRILPRPSDSVLKYVGGETRVVAVSPDISFSELMKKLTAITEND--- 61
Query: 120 YVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
++ + ++ + V + D+DV+ I+E H T
Sbjct: 62 -IVLKYQIIPEDLDAL-VSVKSDEDVKHMIEEYNRHET 97
>gi|297603310|ref|NP_001053788.2| Os04g0604400 [Oryza sativa Japonica Group]
gi|255675754|dbj|BAF15702.2| Os04g0604400 [Oryza sativa Japonica Group]
Length = 1626
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSYNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|110289405|gb|AAP54600.2| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1644
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEKEDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|297725499|ref|NP_001175113.1| Os07g0238100 [Oryza sativa Japonica Group]
gi|255677626|dbj|BAH93841.1| Os07g0238100 [Oryza sativa Japonica Group]
Length = 1467
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|108862711|gb|ABA98631.2| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 778
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEQAEDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|15240580|ref|NP_199803.1| octicosapeptide/Phox/Bem1p domain-containing protein [Arabidopsis
thaliana]
gi|8777423|dbj|BAA97013.1| unnamed protein product [Arabidopsis thaliana]
gi|56381921|gb|AAV85679.1| At5g49920 [Arabidopsis thaliana]
gi|58331811|gb|AAW70403.1| At5g49920 [Arabidopsis thaliana]
gi|332008488|gb|AED95871.1| octicosapeptide/Phox/Bem1p domain-containing protein [Arabidopsis
thaliana]
Length = 288
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 61 RKMSQVRVLVMYDGKWFCSPSGYYL-YSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNE 119
R S+V+ + + G+ PS L Y GG+T+ + VS DI++ EL+ +L I + D
Sbjct: 5 RSPSKVKFMCSFGGRILPRPSDSVLKYVGGETRVVAVSPDISFSELMKKLTAITEND--- 61
Query: 120 YVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
++ + ++ + V + D+DV+ I+E H T
Sbjct: 62 -IVLKYQIIPEDLDAL-VSVKSDEDVKHMIEEYNRHET 97
>gi|108864455|gb|ABA94106.2| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1080
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCIKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|359479587|ref|XP_003632297.1| PREDICTED: uncharacterized protein LOC100261309 [Vitis vinifera]
Length = 752
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 5/93 (5%)
Query: 269 PIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEV 328
P+ +S T A N ++V G+ F+ + + L+ A +F + KS K RF
Sbjct: 162 PVLQSRTVVPAPN-----HELVVGQEFSDVQSCRRALRDTAIALHFEIQTVKSDKTRFTA 216
Query: 329 VCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
CA C WR+ K F IR H C
Sbjct: 217 KCASDGCPWRIHCAKLPGVPTFTIRTIHESHTC 249
>gi|297795297|ref|XP_002865533.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311368|gb|EFH41792.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 182
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 53/116 (45%), Gaps = 13/116 (11%)
Query: 226 ANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHS 285
A G + + + + A E+ + D + D ++D GD I SN
Sbjct: 71 ATSGEKYKVQVANELFATRGEIRDHVDLGDQYD--KNDGGDNNNILSSNNT--------- 119
Query: 286 YEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRA 341
DI G+ F SK ++ ++L + + K F ++VSKS + VVC + C WR+ A
Sbjct: 120 --DIKVGEYFKSKYEIQQRLWMLSVKYKFGWRVSKSDPTKLVVVCKNNGCSWRVLA 173
>gi|108707097|gb|ABF94892.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1644
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|326511513|dbj|BAJ91901.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528689|dbj|BAJ97366.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 740
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 40/87 (45%), Gaps = 2/87 (2%)
Query: 286 YEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATK 343
++D++ G+ F S KD L A F ++ K+ R C +C WR+ A+K
Sbjct: 141 WKDVITGVGQVFESPKDFRDALHKYAIAHRFHYRFIKNDSTRVTAECTGEDCPWRIHASK 200
Query: 344 TAEDEYFEIRRFSNIHICTQPPAKNRK 370
+ + F I++ S H C K+ +
Sbjct: 201 SPANIDFMIKKISETHTCESETVKSHR 227
>gi|77552357|gb|ABA95154.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1597
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCSWRVHAYKGKWNDYWKV 281
>gi|356576716|ref|XP_003556476.1| PREDICTED: uncharacterized protein LOC100811471 isoform 1 [Glycine
max]
gi|356576718|ref|XP_003556477.1| PREDICTED: uncharacterized protein LOC100811471 isoform 2 [Glycine
max]
Length = 750
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 1/77 (1%)
Query: 285 SYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKT 344
SYE + G+ F K + L+ A +F + KS K RF CA C WR+ A K
Sbjct: 173 SYE-LSVGQEFPDVKSCRRALRDTAIALHFEMQTIKSDKTRFTAKCASEGCPWRIHAAKL 231
Query: 345 AEDEYFEIRRFSNIHIC 361
F IR H C
Sbjct: 232 PGVPTFTIRTIHENHTC 248
>gi|147784902|emb|CAN75224.1| hypothetical protein VITISV_024668 [Vitis vinifera]
Length = 347
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 63/146 (43%), Gaps = 6/146 (4%)
Query: 283 FHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA--DTNCKWRLR 340
++ + ++ G F SK+DL +K A +N V +S + V C C WRLR
Sbjct: 131 WNEFNELFKGLRFESKEDLQYVVKRYAICRNQHLVVCESEPQLWAVRCKKWQEGCNWRLR 190
Query: 341 ATKTAEDEYFEIRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPNSSYGLVKQKW 400
A FEI +++ H C P K + L R I+NE + V W
Sbjct: 191 AYCRKSHRMFEITKYAGSHTCVYP--KLSQDHFQLDSTLIAREIQNETKRKTMLKVFGDW 248
Query: 401 N-SFDSSVQLNPTVELRN-GTDVIWQ 424
+ S+ + + ++L N G ++W+
Sbjct: 249 DESYQALPKWMNILKLTNPGIKIVWK 274
>gi|222626119|gb|EEE60251.1| hypothetical protein OsJ_13260 [Oryza sativa Japonica Group]
Length = 974
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 54/132 (40%), Gaps = 4/132 (3%)
Query: 246 EVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDI---VCGKTFTSKKDLI 302
+V+++ D I D D D + R F ++ ++ F K L
Sbjct: 138 QVNDEGDGIAYFDSEDDASYDDDSGDSAERRKCRFPIFDNHAELPQNAVDMCFRGKMQLK 197
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICT 362
+ A KK K K+ +DR VC C W L A+ + ++F+I ++ H C
Sbjct: 198 DAITRYALKKKVNIKFIKNDRDRLRAVCMRKGCTWLLHASYNSRSDWFQIVTYNGNHSCC 257
Query: 363 QPPAKNRKKQTA 374
P KN++ T+
Sbjct: 258 -PDLKNKRLSTS 268
>gi|356535187|ref|XP_003536130.1| PREDICTED: uncharacterized protein LOC100786116 [Glycine max]
Length = 752
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 1/77 (1%)
Query: 285 SYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKT 344
SYE + G+ F K + L+ A +F + KS K RF CA C WR+ A K
Sbjct: 175 SYE-LSVGQEFPDVKSCRRALRDTAIALHFEMQTIKSDKTRFTAKCASEGCPWRIHAAKL 233
Query: 345 AEDEYFEIRRFSNIHIC 361
F IR H C
Sbjct: 234 PGVPTFTIRTIHENHTC 250
>gi|110289375|gb|AAP54539.2| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1539
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREKQAMG----MADEGERV------GIIVDEMEREDLDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYNENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|38344455|emb|CAE04926.2| OSJNBa0017P10.3 [Oryza sativa Japonica Group]
gi|38345440|emb|CAE03292.2| OSJNBb0046P18.8 [Oryza sativa Japonica Group]
Length = 1619
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 209 TEVEEQNLNAFNGDPVAANFGTE-------APLGCGD-----GILAANIEVSEQQDDIQR 256
TE E ++NA GD V + E +G D GI+ +E + ++
Sbjct: 120 TEDVEGSINA--GDVVGPSMQNEENQPREEQAMGMADEGERVGIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|297725729|ref|NP_001175228.1| Os07g0521200 [Oryza sativa Japonica Group]
gi|255677821|dbj|BAH93956.1| Os07g0521200 [Oryza sativa Japonica Group]
Length = 756
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|297729019|ref|NP_001176873.1| Os12g0254450 [Oryza sativa Japonica Group]
gi|255670190|dbj|BAH95601.1| Os12g0254450 [Oryza sativa Japonica Group]
Length = 893
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 33/73 (45%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
KTF SK+ L + KN K S S++ + C DT+C WRL AT +
Sbjct: 273 AKTFDSKEHLQIAIGEFHINKNSEIKYSTSSQSKLIAQCTDTSCMWRLYATPIGIGSCWM 332
Query: 352 IRRFSNIHICTQP 364
IR+ H C P
Sbjct: 333 IRKCPYAHTCRAP 345
>gi|22758321|gb|AAN05525.1| putative transposase [Oryza sativa Japonica Group]
gi|31432538|gb|AAP54160.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1597
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|224105815|ref|XP_002313941.1| predicted protein [Populus trichocarpa]
gi|222850349|gb|EEE87896.1| predicted protein [Populus trichocarpa]
Length = 589
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 1/87 (1%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
V G+ F K +K A ++F ++ KS R+ CA C WR+RA K
Sbjct: 6 FVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCASEGCPWRIRAVKLPNVP 65
Query: 349 YFEIRRFSNIHICTQPPAKNRKKQTAL 375
F +R H C + A+N Q ++
Sbjct: 66 TFTVRSLEGTHTCGR-NAQNGHHQASV 91
>gi|125597938|gb|EAZ37718.1| hypothetical protein OsJ_22060 [Oryza sativa Japonica Group]
Length = 754
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 34/79 (43%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F KD L A F ++ K+ R C +C WR+ A+K+ + F
Sbjct: 150 GQVFDGPKDFRDALHKYAIAHKFHYRFIKNDSSRVTAECTGEDCPWRIHASKSPAKKQFM 209
Query: 352 IRRFSNIHICTQPPAKNRK 370
I++ S H C K+ +
Sbjct: 210 IKKISESHTCESETVKSHR 228
>gi|116308842|emb|CAH65979.1| H1005F08.8 [Oryza sativa Indica Group]
Length = 1589
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|17047024|gb|AAL34929.1|AC079037_2 Putative mutator-like transposase [Oryza sativa]
gi|31429884|gb|AAP51873.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1597
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|108707436|gb|ABF95231.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 584
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYED-IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E+ ++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENKVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|449449645|ref|XP_004142575.1| PREDICTED: uncharacterized protein LOC101203810 [Cucumis sativus]
gi|449532723|ref|XP_004173330.1| PREDICTED: uncharacterized LOC101203810 [Cucumis sativus]
Length = 770
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 56/127 (44%), Gaps = 11/127 (8%)
Query: 241 LAANIEVSEQQDD---IQRLD--LSQDDCGDKLP-IRRSNTRSAAINPFHSYEDIVCGKT 294
L N+E++ Q+D IQ +D SQ P + ++ T AA +YE + G+
Sbjct: 147 LDDNLELAVDQNDELAIQTVDDLTSQSQMMVSTPSVLQARTVMAAP----TYE-LSVGQE 201
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRR 354
F K + L+ A +F + KS K RF CA C WR+ A K F IR
Sbjct: 202 FPDVKSCRRALRDTAIALHFEVQTIKSDKTRFTAKCAAEGCPWRIHAAKLPGVPTFTIRT 261
Query: 355 FSNIHIC 361
+ H C
Sbjct: 262 IHDTHTC 268
>gi|218198598|gb|EEC81025.1| hypothetical protein OsI_23801 [Oryza sativa Indica Group]
Length = 754
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 34/79 (43%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F KD L A F ++ K+ R C +C WR+ A+K+ + F
Sbjct: 150 GQVFDGPKDFRDALHKYAIAHKFHYRFIKNDSSRVTAECTGEDCPWRIHASKSPAKKQFM 209
Query: 352 IRRFSNIHICTQPPAKNRK 370
I++ S H C K+ +
Sbjct: 210 IKKISESHTCESETVKSHR 228
>gi|40539045|gb|AAR87302.1| putative MuDR family transposase [Oryza sativa Japonica Group]
gi|108711011|gb|ABF98806.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1493
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSRLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|297721953|ref|NP_001173340.1| Os03g0240375 [Oryza sativa Japonica Group]
gi|255674354|dbj|BAH92068.1| Os03g0240375 [Oryza sativa Japonica Group]
Length = 1632
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKCNDYWKV 281
>gi|62701735|gb|AAX92808.1| transposon protein, putative, mutator sub-class [Oryza sativa
Japonica Group]
gi|77550564|gb|ABA93361.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1597
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|108707089|gb|ABF94884.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1654
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKCNDYWKV 281
>gi|21743069|emb|CAD40702.1| OSJNBa0083D01.20 [Oryza sativa Japonica Group]
Length = 1590
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|77554219|gb|ABA97015.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1653
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|20303617|gb|AAM19044.1|AC099774_6 putative transposase related protein [Oryza sativa Japonica Group]
Length = 1557
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGKRV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDIMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|110288887|gb|AAP53004.2| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1538
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGKRV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDIMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|22324952|gb|AAM95679.1| putative Mu transposable element [Oryza sativa Japonica Group]
Length = 1536
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREKQAMG----MADEGERV------GIIVDEMEREDLDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYNENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|53749448|gb|AAU90304.1| MuDR family transposase containing protein [Solanum demissum]
Length = 392
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 60/312 (19%), Positives = 115/312 (36%), Gaps = 84/312 (26%)
Query: 65 QVRVLVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITM 124
+ +L+ + G W Y + GI+V ++I++ L+ + + +D + I +
Sbjct: 2 NISILLRHSGSWESDVR----YEQYISDGIVVCENISFVNLISAIKAELGIDEFKKNIEV 57
Query: 125 KSLYKANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAAL 184
+ + K N + P+ I +D V+ +I+ P+CI K +
Sbjct: 58 RYVVKGN--SSPLCIRNDMGVKLYIEFKKHEVGFGMYPLCIDTSDKSDE----------- 104
Query: 185 DHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAAN 244
E QN +A G V G
Sbjct: 105 ---------------------------EIQNFDATTGAIVCVEGG--------------- 122
Query: 245 IEVSEQQDDIQRLDLSQDDCGDKLPIRR-------SNTRSAAINPFHSYEDIVCGKTFTS 297
+ D + L + + GD I S+T + + Y+D +
Sbjct: 123 ------KSDAKALTIVESKIGDSYYIPEMEVKNYISDTNISVVEVKQMYKDK------AT 170
Query: 298 KKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSN 357
K +++K K+ K +F FKV +S + +VC +C W+L+A+ + F++R F++
Sbjct: 171 LKAVMEKYKI---KNSFNFKVKRSDNKSYVLVCYSDDCCWKLKASVRKNSDIFKVRYFNS 227
Query: 358 IHICTQPPAKNR 369
H C P ++R
Sbjct: 228 EHRC---PLRDR 236
>gi|38345521|emb|CAE01805.2| OSJNBa0039K24.24 [Oryza sativa Japonica Group]
Length = 1286
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 53 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 112
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 113 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 172
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 173 RVHAYKGKWNDYWKV 187
>gi|77556931|gb|ABA99727.1| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 932
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|224008901|ref|XP_002293409.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970809|gb|EED89145.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 532
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 70/172 (40%), Gaps = 42/172 (24%)
Query: 427 DSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSAGR-- 484
+ P+ VL+L HGC+ A F+ P+ R +V A G I+ +
Sbjct: 214 NCPRGVLYLFHGCHRYAASFFYS----------PQGRQIVSMANDAGLIIVVFTKESELG 263
Query: 485 CWTLGEERLVVREIIRWWVERHKLEKL----------PLVALGASSGGYFVSALAKGLR- 533
CW ++ +R+I + +++ P+ A GASSGG F+ LA ++
Sbjct: 264 CWDWDKDWETIRQIGKKFIKGRIQGSCFAPDGSHFYPPMYAFGASSGGAFIVNLAAKMKE 323
Query: 534 ---------FSSIALMIAEGLFDQMDIPEDYP---PTLFVHMPKDSYRQRKI 573
FS+I + I M PE+ PT+F M D ++ +
Sbjct: 324 DKESFGPFLFSAINVQI-------MAPPENLEWDIPTVFTVMNGDERTKQMV 368
>gi|47900422|gb|AAT39216.1| putative mutator-like transposase [Oryza sativa Japonica Group]
Length = 1684
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEVD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|242072830|ref|XP_002446351.1| hypothetical protein SORBIDRAFT_06g014620 [Sorghum bicolor]
gi|241937534|gb|EES10679.1| hypothetical protein SORBIDRAFT_06g014620 [Sorghum bicolor]
Length = 584
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 26/106 (24%), Positives = 45/106 (42%), Gaps = 2/106 (1%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G F + K L + + + K K KD+ + C C W + +KT+ +F+
Sbjct: 200 GMAFRDSRQFKKALVKYGLRSHKSLKFPKDEKDKVKATCDSFGCNWMIYGSKTSRSRWFK 259
Query: 352 IRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPNSSYGLVK 397
+ F + H C PP+++ K T+ + P S GL+K
Sbjct: 260 VVTFVDEHTC--PPSRDNKLVTSTLIAKHYYNQIKDNPTWSVGLMK 303
>gi|115469058|ref|NP_001058128.1| Os06g0632700 [Oryza sativa Japonica Group]
gi|113596168|dbj|BAF20042.1| Os06g0632700 [Oryza sativa Japonica Group]
gi|215704898|dbj|BAG94926.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 722
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 34/79 (43%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F KD L A F ++ K+ R C +C WR+ A+K+ + F
Sbjct: 150 GQVFDGPKDFRDALHKYAIAHKFHYRFIKNDSSRVTAECTGEDCPWRIHASKSPAKKQFM 209
Query: 352 IRRFSNIHICTQPPAKNRK 370
I++ S H C K+ +
Sbjct: 210 IKKISESHTCESETVKSHR 228
>gi|70671596|gb|AAZ06229.1| putative mutator-like transposase [Oryza sativa Indica Group]
gi|70671602|gb|AAZ06234.1| putative mutator-like transposase [Oryza sativa Indica Group]
Length = 1358
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 54/113 (47%), Gaps = 12/113 (10%)
Query: 248 SEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKK 299
+E+ DD D S D+ GD + +N + + P+ E +++ G + K
Sbjct: 111 NEEADD----DASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKD 166
Query: 300 DLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
++ + +K A FKV KST +EV C +C WR+ A K ++Y+++
Sbjct: 167 EMKEAVKHWAVSLQREFKVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 219
>gi|110289230|gb|AAP54225.2| Transposable element protein, putative, MuDR [Oryza sativa Japonica
Group]
Length = 1579
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 69/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ +E + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMEREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIDGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 VVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|108707912|gb|ABF95707.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 829
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G F S + K L + K +D+ C+ CKW + +KT++ E+F+
Sbjct: 337 GMAFRSSRQFKKALVKYGLTTHRHIIFPKDERDKVRATCSWPGCKWLIYGSKTSKSEWFQ 396
Query: 352 IRRFSNIHICTQPPAKNRKKQTA 374
+ F+N H C PP ++ K T+
Sbjct: 397 VASFNNEHCC--PPRRDNKLVTS 417
>gi|147780218|emb|CAN72427.1| hypothetical protein VITISV_008825 [Vitis vinifera]
Length = 1436
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA--DTNCKWRLRATKTAEDEY 349
G F SK+DL +K A +N V +S + V C C WRLRA +
Sbjct: 103 GLRFESKEDLQYAVKRYAICRNQHLVVCESEPQLWAVRCKKWQEGCNWRLRACRRKSHGM 162
Query: 350 FEIRRFSNIHICTQP 364
FEI +++ H C P
Sbjct: 163 FEITKYAGPHTCVYP 177
>gi|308081269|ref|NP_001183476.1| uncharacterized protein LOC100501908 [Zea mays]
gi|238011782|gb|ACR36926.1| unknown [Zea mays]
Length = 796
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 31/74 (41%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ F + ++ A +F + KS K RF CA C WR+ A K A
Sbjct: 219 LAVGQEFPDVMSCRRAIRSTAIACHFEIQTVKSDKTRFTAKCAAEGCPWRIHAAKLAGVP 278
Query: 349 YFEIRRFSNIHICT 362
F IR + H C
Sbjct: 279 TFSIRTIHDTHSCV 292
>gi|116309814|emb|CAH66852.1| OSIGBa0103M18.4 [Oryza sativa Indica Group]
Length = 1229
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVIKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|222624845|gb|EEE58977.1| hypothetical protein OsJ_10678 [Oryza sativa Japonica Group]
Length = 785
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G F S + K L + K +D+ C+ CKW + +KT++ E+F+
Sbjct: 293 GMAFRSSRQFKKALVKYGLTTHRHIIFPKDERDKVRATCSWPGCKWLIYGSKTSKSEWFQ 352
Query: 352 IRRFSNIHICTQPPAKNRKKQTA 374
+ F+N H C PP ++ K T+
Sbjct: 353 VASFNNEHCC--PPRRDNKLVTS 373
>gi|78708488|gb|ABB47463.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1597
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ ++ + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMDREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|357123787|ref|XP_003563589.1| PREDICTED: uncharacterized protein LOC100844444 [Brachypodium
distachyon]
Length = 733
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 34/79 (43%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F KD L A F ++ K+ R C +C WR+ A+K+ + F
Sbjct: 149 GQVFEGPKDFRDALHKYAIAHRFHYRFVKNDSSRVTAECTGEDCPWRIHASKSPAKQDFM 208
Query: 352 IRRFSNIHICTQPPAKNRK 370
I++ S H C K+ +
Sbjct: 209 IKKISESHTCESETVKSHR 227
>gi|25407496|pir||G85095 hypothetical protein AT4g09410 [imported] - Arabidopsis thaliana
gi|7267583|emb|CAB78064.1| putative protein [Arabidopsis thaliana]
Length = 673
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/85 (25%), Positives = 43/85 (50%), Gaps = 2/85 (2%)
Query: 290 VCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC--ADTNCKWRLRATKTAED 347
+ G+ F SK+ + + + ++ F K KS R + C A C W LR T T+
Sbjct: 182 LVGQEFRSKEAVWDLINRASKEEVFGVKTIKSDPGRLMLECSQASLGCDWYLRVTNTSST 241
Query: 348 EYFEIRRFSNIHICTQPPAKNRKKQ 372
+++ +++ + IH C++ P R ++
Sbjct: 242 DFWCVKKHTQIHKCSRCPEATRYEK 266
>gi|115480681|ref|NP_001063934.1| Os09g0562400 [Oryza sativa Japonica Group]
gi|113632167|dbj|BAF25848.1| Os09g0562400 [Oryza sativa Japonica Group]
Length = 1972
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEVDDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|13129433|gb|AAK13091.1|AC078839_7 Mutator-like transposase [Oryza sativa Japonica Group]
Length = 1641
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 70/164 (42%), Gaps = 18/164 (10%)
Query: 197 NQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQR 256
N VV P MQ E + + A A+ G GI+ ++ + ++
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMG----MADEGERV------GIIVDEMDREDSDNEEAD 177
Query: 257 LDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLIKKLKLG 308
D S D+ GD + +N + + P+ E +++ G + K ++ + +K
Sbjct: 178 DDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVIEGARYAHKDEMKEAVKHW 237
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 238 AVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|47900530|gb|AAT39265.1| putative mutator-like transposase [Oryza sativa Japonica Group]
Length = 1385
Score = 43.5 bits (101), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIEGARYAYKDEMKEAVKHWAVFLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAEDEYFEI 352
R+ A K ++Y+++
Sbjct: 267 RVHAYKGKWNDYWKV 281
>gi|22267590|gb|AAM94925.1| mutator-like transposase, 3'-partial [Oryza sativa Japonica Group]
Length = 655
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 57/122 (46%), Gaps = 8/122 (6%)
Query: 239 GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIV 290
GI+ +E + ++ D S D+ GD + +N + + P+ E +++
Sbjct: 160 GIIVDEMEREDLDNEEADDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYNENEVI 219
Query: 291 CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYF 350
G + K ++ + +K A F+V KST +EV C +C WR+ A K ++Y+
Sbjct: 220 EGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYW 279
Query: 351 EI 352
++
Sbjct: 280 KV 281
>gi|10177197|dbj|BAB10385.1| unnamed protein product [Arabidopsis thaliana]
Length = 733
Score = 43.5 bits (101), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 288 DIVCGKTFTSK---KDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKT 344
D G+ F SK KD I+K + K N FK +S +++ E VC CKWR+ A+ T
Sbjct: 104 DFYVGQEFVSKEKCKDTIEKYAI-REKVNIHFK--RSERNKIEGVCVQDYCKWRIYASIT 160
Query: 345 AEDEYFEIRRFSNIHIC 361
+ + ++ + IH C
Sbjct: 161 SRSDKMVVQSYKGIHSC 177
>gi|8920571|gb|AAF81293.1|AC027656_10 Strong similarity to a mutator-like transposase from Arabidopsis
thaliana gb|AC006067. It contains a zinc finger, CCHC
class domain PF|00098 [Arabidopsis thaliana]
Length = 753
Score = 43.5 bits (101), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 288 DIVCGKTFTSK---KDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKT 344
D G+ F SK KD I+K + K N FK +S +++ E VC CKWR+ A+ T
Sbjct: 143 DFYVGQEFVSKEKCKDTIEKYAI-REKVNIHFK--RSERNKIEGVCVQDCCKWRIYASIT 199
Query: 345 AEDEYFEIRRFSNIHIC 361
+ + ++ + IH C
Sbjct: 200 SRSDKMVVQSYKGIHSC 216
>gi|397624101|gb|EJK67272.1| hypothetical protein THAOC_11725 [Thalassiosira oceanica]
Length = 496
Score = 43.5 bits (101), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 19/103 (18%)
Query: 430 KAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLHALSRGFAIITISSA---GRCW 486
+ ++ L H C G A F+ R C C+GL EE L G+ + +SSA CW
Sbjct: 173 RGLVLLLHDCGGSAFDFFARGDKCEECLGLSEEMRTARLVLLNGYLPMAVSSADARSGCW 232
Query: 487 TLGEERLVVREIIRWWVERHKLEKLPLVALGASSGGYFVSALA 529
+ ++ R+++ LE L GGY + A A
Sbjct: 233 SAEDDADRFRKVL-------ALEDL---------GGYHIGAYA 259
>gi|357139057|ref|XP_003571102.1| PREDICTED: uncharacterized protein LOC100835418 [Brachypodium
distachyon]
Length = 720
Score = 43.5 bits (101), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 2/79 (2%)
Query: 285 SYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
S+E+ + G+ F K+ L A + F ++ K+ R V C D C WR+ A+
Sbjct: 139 SWENAITGAGQVFEGPKEFRDALHKYAIARRFHYRFIKNDSTRVTVECTDEGCPWRIHAS 198
Query: 343 KTAEDEYFEIRRFSNIHIC 361
K+ + F I++ H C
Sbjct: 199 KSPSKKEFMIKKVVGSHTC 217
>gi|218200194|gb|EEC82621.1| hypothetical protein OsI_27204 [Oryza sativa Indica Group]
Length = 573
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 33/73 (45%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F SK+ L + N +VS S K R + C D+ CKW+L A T ++
Sbjct: 236 GRIFYSKQHLQFAVNEFHISNNMEVRVSTSNKSRLDFKCKDSTCKWKLYAKTTKIGSSWK 295
Query: 352 IRRFSNIHICTQP 364
I+ H C P
Sbjct: 296 IQTCQFPHTCRAP 308
>gi|38569201|emb|CAE04557.3| OSJNBa0052P16.6 [Oryza sativa Japonica Group]
Length = 1489
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 28/122 (22%), Positives = 57/122 (46%), Gaps = 8/122 (6%)
Query: 239 GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIV 290
G++ +E + ++ D S D+ GD + +N + + P+ E +++
Sbjct: 9 GLIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVI 68
Query: 291 CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYF 350
G + K ++ + +K A F+V KST +EV C +C WR+ A K ++Y+
Sbjct: 69 EGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYW 128
Query: 351 EI 352
++
Sbjct: 129 KV 130
>gi|297722083|ref|NP_001173405.1| Os03g0325300 [Oryza sativa Japonica Group]
gi|255674470|dbj|BAH92133.1| Os03g0325300 [Oryza sativa Japonica Group]
Length = 905
Score = 43.1 bits (100), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 2/79 (2%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G F S + K L + K +D+ C+ CKW + +KT++ E+F+
Sbjct: 332 GMAFRSSRQFKKALVKYGLTTHRHIIFPKDERDKVRATCSWPGCKWLIYGSKTSKSEWFQ 391
Query: 352 IRRFSNIHICTQPPAKNRK 370
+ F+N H C PP ++ K
Sbjct: 392 VASFNNEHCC--PPRRDNK 408
>gi|10177835|dbj|BAB11264.1| unnamed protein product [Arabidopsis thaliana]
Length = 733
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 288 DIVCGKTFTSK---KDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKT 344
D G+ F SK KD I+K + K N FK +S +++ E VC CKWR+ A+ T
Sbjct: 104 DFYVGQEFVSKEKCKDTIEKYAI-REKVNIHFK--RSERNKIEGVCVQDCCKWRIYASIT 160
Query: 345 AEDEYFEIRRFSNIHIC 361
+ + ++ + IH C
Sbjct: 161 SRSDKMVVQSYKGIHSC 177
>gi|147818139|emb|CAN73564.1| hypothetical protein VITISV_024011 [Vitis vinifera]
Length = 468
Score = 43.1 bits (100), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%)
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
+ ++SKK+L KKL + A K+ F FK KST + C D CKW
Sbjct: 139 QIYSSKKELQKKLYIIALKEKFEFKTIKSTTKLLVLQCVDNECKW 183
>gi|38345701|emb|CAE01929.2| OSJNBb0085C12.3 [Oryza sativa Japonica Group]
Length = 1356
Score = 43.1 bits (100), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 83/195 (42%), Gaps = 33/195 (16%)
Query: 199 ISVVVPGMQ---------ATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIE-VS 248
I+V+ PG+Q + EV E ++ N + G AP+G G +A E V
Sbjct: 65 INVIDPGVQMNMDDGEGPSAEVNETSVEEVN---AREDGGVVAPVGIQPGGVADEGETVG 121
Query: 249 EQQDDIQRLDL---------SQDDCGDKLPIRRSN--------TRSAAINPFHSYEDIVC 291
D+++R D S DD D P ++ + ++ + +++
Sbjct: 122 AIVDEMEREDSDNERVEEGDSSDDETDINPAEWASQDFSGLIVSEEDSVRWEYKKNEVIQ 181
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEY 349
G ++ +D+ + +K A + F+V+KS + ++EV C C WR+ A K +Y
Sbjct: 182 GAIYSRAEDMKEAVKHFAVSLHREFRVAKSNRSQYEVRCVKEKDGCPWRVHAYKGKWKDY 241
Query: 350 FEIRRFSNIHICTQP 364
+ + + H C P
Sbjct: 242 WTVSVVTK-HTCFLP 255
>gi|7523705|gb|AAF63144.1|AC011001_14 Similar to maize transposon mudrA protein [Arabidopsis thaliana]
gi|46518483|gb|AAS99723.1| At1g06740 [Arabidopsis thaliana]
gi|62319905|dbj|BAD93972.1| mudrA-like protein [Arabidopsis thaliana]
Length = 726
Score = 43.1 bits (100), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 28/115 (24%), Positives = 48/115 (41%), Gaps = 1/115 (0%)
Query: 248 SEQQDDIQRLDLSQD-DCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLK 306
S+ DD L++SQ + K + +++ +++ + ++V G F+ + +K
Sbjct: 114 SQDTDDKLELEMSQSTEFSQKSVMPSPSSQCWSMSGAGTDHEMVVGMEFSDAYACRRAIK 173
Query: 307 LGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
A F + KS K RF C C WR+ K + F IR H C
Sbjct: 174 NAAISLRFEMRTIKSDKTRFTAKCNSKGCPWRIHCAKVSNAPTFTIRTIHGSHTC 228
>gi|125557789|gb|EAZ03325.1| hypothetical protein OsI_25468 [Oryza sativa Indica Group]
Length = 345
Score = 43.1 bits (100), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
GKTF SK+ L + +KN K S S+K + C D +C WRL AT T D
Sbjct: 240 GKTFDSKEHLKIAIGEFHIEKNAEVKYSTSSKSKIVAECTDNSCTWRLYATPTGID 295
>gi|297831584|ref|XP_002883674.1| hypothetical protein ARALYDRAFT_899307 [Arabidopsis lyrata subsp.
lyrata]
gi|297329514|gb|EFH59933.1| hypothetical protein ARALYDRAFT_899307 [Arabidopsis lyrata subsp.
lyrata]
Length = 227
Score = 43.1 bits (100), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 19/67 (28%), Positives = 36/67 (53%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+DI K F +K +L++K++ + F F+ S K+R +V D C R+RA +
Sbjct: 113 DDIYVNKYFMNKAELMQKMRTWELEYKFEFRYRWSNKERVVLVSVDDKCTLRMRAIRVDS 172
Query: 347 DEYFEIR 353
++F ++
Sbjct: 173 CDFFVVK 179
>gi|147816297|emb|CAN61956.1| hypothetical protein VITISV_032645 [Vitis vinifera]
Length = 625
Score = 43.1 bits (100), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 53/275 (19%), Positives = 103/275 (37%), Gaps = 62/275 (22%)
Query: 86 YSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDV 145
Y GG+T +VS +I+ E V ++ G++ +D+N I ++ K + P+ + + +D D+
Sbjct: 25 YKGGRTNCSVVSKNISQSEFVSKVCGVLNLDSNS--IKLEFTVKFD-PSCLLPLHNDGDI 81
Query: 146 RFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPG 205
+ N ++ + I SQC E ++ P
Sbjct: 82 VNMFKFN-----DMFCHVYI------SQCTECGD--------------------DLICPT 110
Query: 206 MQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCG 265
T + VA+N + +G + +S + IQ + SQ
Sbjct: 111 SGPTPI------------VASNSAHVSSIG------EPPLHISNESPTIQSVGFSQ---- 148
Query: 266 DKLPIRRSNTRSAAINPFHSYEDIV-CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKD 324
R + T + + P IV G TF + + + L + F + +++
Sbjct: 149 -----RCAMTNTVQLQPSRFEHSIVGSGHTFPNASEFRDAIYLMSLAGKFRYSYKRNSPK 203
Query: 325 RFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIH 359
VVC +C W++ A + ++ F N+H
Sbjct: 204 HMTVVCTIEDCPWKITARAIGDSNIVQVHTFRNVH 238
>gi|58532005|emb|CAI44652.1| OSJNBa0096F01.20 [Oryza sativa Japonica Group]
Length = 1422
Score = 43.1 bits (100), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/122 (22%), Positives = 57/122 (46%), Gaps = 8/122 (6%)
Query: 239 GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIV 290
G++ +E + ++ D S D+ GD + +N + + P+ E +++
Sbjct: 9 GLIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVI 68
Query: 291 CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYF 350
G + K ++ + +K A F+V KST +EV C +C WR+ A K ++Y+
Sbjct: 69 EGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYW 128
Query: 351 EI 352
++
Sbjct: 129 KV 130
>gi|38346407|emb|CAE54572.1| OSJNBa0011F23.13 [Oryza sativa Japonica Group]
Length = 1396
Score = 43.1 bits (100), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 56/122 (45%), Gaps = 8/122 (6%)
Query: 239 GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIV 290
GI+ +E ++ D S D+ GD + +N + + P+ E +++
Sbjct: 9 GIIVDEMERENSDNEEVDDDASSDEEGDVMATDWANEDFSGLVISEGDHVPWEYKENEVI 68
Query: 291 CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYF 350
G + K ++ + +K A F+V KST +EV C +C WR+ A K ++Y+
Sbjct: 69 EGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYW 128
Query: 351 EI 352
++
Sbjct: 129 KV 130
>gi|297608625|ref|NP_001061869.2| Os08g0432600 [Oryza sativa Japonica Group]
gi|255678470|dbj|BAF23783.2| Os08g0432600 [Oryza sativa Japonica Group]
Length = 892
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 55/141 (39%), Gaps = 6/141 (4%)
Query: 237 GDGILAANIEVSEQQDDIQRLD----LSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCG 292
G I N +V E D + D S D+ D + RR R + F
Sbjct: 293 GGDIPLGNNQVLEGGDGAEYFDSDGDASYDEDSDGVFTRR-KCRFPIFDSFADTPQFAVD 351
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
F K L ++ A KK + K+ + R VC C W L A+ + ++F+I
Sbjct: 352 MCFRGKDQLKDAIERYALKKKINIRYVKNEQKRIRAVCRWKGCPWLLYASHNSRSDWFQI 411
Query: 353 RRFSNIHICTQPPAKNRKKQT 373
++ H C P KN++ T
Sbjct: 412 VTYNPNHACC-PELKNKRLST 431
>gi|31415943|gb|AAP50964.1| putative mutator-like transposase [Oryza sativa Japonica Group]
gi|108711252|gb|ABF99047.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
Japonica Group]
Length = 773
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 60/135 (44%), Gaps = 13/135 (9%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 53 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEEGDVMATDWANEDFSGLVISEG 112
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 113 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 172
Query: 338 RLRATKTAEDEYFEI 352
+ A K ++Y+++
Sbjct: 173 HVHAYKGKWNDYWKV 187
>gi|147845721|emb|CAN80111.1| hypothetical protein VITISV_040515 [Vitis vinifera]
Length = 655
Score = 42.7 bits (99), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 20/85 (23%), Positives = 36/85 (42%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + G+ F + L A F +S + R CA +C+WR+ A+K +
Sbjct: 134 DTLTVGQEFPDARCFRDALVASAIASKFELTFIRSDRSRVTARCAADDCQWRIHASKLPD 193
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKK 371
E F+I+ H C P + ++
Sbjct: 194 GETFQIKTLKGKHSCVWPERSSHRQ 218
>gi|449510833|ref|XP_004163775.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101203656
[Cucumis sativus]
Length = 770
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 106/292 (36%), Gaps = 37/292 (12%)
Query: 74 GKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIP 133
G++ G Y GG I V D + + E + + D + +++K N
Sbjct: 14 GEFETGRDGMLSYHGGDAHAIDVDDKMKFNEFKMEIAEMFNFDVDN--VSIKYFLPGNRK 71
Query: 134 TMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASR 193
T+ + + +D D++ ++ H I + +E A PA+
Sbjct: 72 TL-ITLSNDKDLKRMLK---FHGDSTTVDIFVIMEE----------VMAPNISNLPASRS 117
Query: 194 SESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDD 253
S + VVP +G P+ G E D L ++V DD
Sbjct: 118 SRTTLSETVVP--------------VDGTPLTVVHGIEDDNIESDIPLDGALDVV---DD 160
Query: 254 IQRLDLSQDDCGDKLPIRRSNTRSAAINP--FHSYEDIV--CGKTFTSKKDLIKKLKLGA 309
L D GD PI S N +++ + G+ F+S + + L+ A
Sbjct: 161 TNPLVNHIDIAGDITPILPLLGSSDEKNGKGVQQWQNTITGVGQRFSSVHEFRESLRKYA 220
Query: 310 AKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
FAF+ K+ R V C C WR+ A++ + + I++ + H C
Sbjct: 221 IAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPNHTC 272
>gi|242060818|ref|XP_002451698.1| hypothetical protein SORBIDRAFT_04g006220 [Sorghum bicolor]
gi|241931529|gb|EES04674.1| hypothetical protein SORBIDRAFT_04g006220 [Sorghum bicolor]
Length = 719
Score = 42.7 bits (99), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 3/88 (3%)
Query: 285 SYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
S+E+ + G+ F S K+ L A F +K K+ R C D C WR+ A+
Sbjct: 139 SWENAITGAGQVFQSAKEFRDALHKYAIAHRFHYKFVKNDSSRVTAECTDGGCAWRIHAS 198
Query: 343 KTAEDEYFEIRRFSNIHICTQPPAKNRK 370
K+ E F +++ H C K+ +
Sbjct: 199 KSHSKE-FMVKKVFGTHTCESETIKSHR 225
>gi|39546253|emb|CAE04262.3| OSJNBa0089N06.23 [Oryza sativa Japonica Group]
Length = 1597
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 66/166 (39%), Gaps = 22/166 (13%)
Query: 197 NQISVVVPGMQATE---VEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDD 253
N VV P MQ E EEQ + + E D A + S+++ D
Sbjct: 34 NAGDVVGPSMQNEENQPREEQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEESD 93
Query: 254 IQRLDLSQDDC-------GDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLK 306
+ D + +D GD +P + +++ G + K ++ + +K
Sbjct: 94 VMATDWANEDFSGLVISEGDHVPWE------------YKENEVIEGARYAHKDEMKEAVK 141
Query: 307 LGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 142 HWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 187
>gi|262411014|gb|ACY66871.1| P20Sh249C12 [Saccharum hybrid cultivar R570]
Length = 720
Score = 42.7 bits (99), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 33/79 (41%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F KD L A F ++ K+ R C C WR+ A+K+ + F
Sbjct: 149 GQVFEGPKDFRDALHKYAIAHRFHYRFIKNDSSRVTAECTGEGCPWRIHASKSPAKKEFM 208
Query: 352 IRRFSNIHICTQPPAKNRK 370
I++ S H C K+ +
Sbjct: 209 IKKISESHTCESETVKSNR 227
>gi|147780251|emb|CAN65744.1| hypothetical protein VITISV_037761 [Vitis vinifera]
Length = 1076
Score = 42.7 bits (99), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 58/293 (19%), Positives = 109/293 (37%), Gaps = 38/293 (12%)
Query: 74 GKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIP 133
G++ G Y GG I + D + + E + + + +++K N
Sbjct: 14 GEFEADKDGSLSYRGGDAHAIDIDDQMKFNEFKMEVAEMFNCSIS--TMSIKYFLPKNKK 71
Query: 134 TMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASR 193
T+ + I +D D++ I+ H + I + E ALD ASR
Sbjct: 72 TL-ITISNDKDLKRMIK---FHVDSVTVDIYVMTEEV-----------VALDVSNMPASR 116
Query: 194 SESNQIS-VVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGI----LAANIEVS 248
S +S VVP ++++ ++ D + PL D + A I +
Sbjct: 117 SSRTTLSEAVVPVDAPLDMKDDMVD----DTTYPDVSLGLPLDVVDDTTHVDVDAQITMP 172
Query: 249 EQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLG 308
+ + L +S ++ K + NT + G+ F+ + + L+
Sbjct: 173 NEISPVLPLSISNEEKHVKAAQQWQNTITGV------------GQRFSGVHEFREALRKY 220
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
A FAF+ K+ R V C C WR+ A++ + + I++ + H C
Sbjct: 221 AIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNATHTC 273
>gi|242096428|ref|XP_002438704.1| hypothetical protein SORBIDRAFT_10g024700 [Sorghum bicolor]
gi|241916927|gb|EER90071.1| hypothetical protein SORBIDRAFT_10g024700 [Sorghum bicolor]
Length = 720
Score = 42.7 bits (99), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 33/79 (41%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F KD L A F ++ K+ R C C WR+ A+K+ + F
Sbjct: 149 GQVFEGPKDFRDALHKYAIAHRFHYRFIKNDSSRVTAECTGEGCPWRIHASKSPAKKEFM 208
Query: 352 IRRFSNIHICTQPPAKNRK 370
I++ S H C K+ +
Sbjct: 209 IKKISESHTCESETVKSNR 227
>gi|449456080|ref|XP_004145778.1| PREDICTED: uncharacterized protein LOC101203656 [Cucumis sativus]
Length = 770
Score = 42.7 bits (99), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 106/292 (36%), Gaps = 37/292 (12%)
Query: 74 GKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIP 133
G++ G Y GG I V D + + E + + D + +++K N
Sbjct: 14 GEFETGRDGMLSYHGGDAHAIDVDDKMKFNEFKMEIAEMFNFDVDN--VSIKYFLPGNRK 71
Query: 134 TMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASR 193
T+ + + +D D++ ++ H I + +E A PA+
Sbjct: 72 TL-ITLSNDKDLKRMLK---FHGDSTTVDIFVIMEE----------VMAPNISNLPASRS 117
Query: 194 SESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDD 253
S + VVP +G P+ G E D L ++V DD
Sbjct: 118 SRTTLSETVVP--------------VDGTPLTVVHGIEDDNIESDIPLDGALDVV---DD 160
Query: 254 IQRLDLSQDDCGDKLPIRRSNTRSAAINP--FHSYEDIV--CGKTFTSKKDLIKKLKLGA 309
L D GD PI S N +++ + G+ F+S + + L+ A
Sbjct: 161 TNPLVNHIDIAGDITPILPLLGSSDEKNGKGVQQWQNTITGVGQRFSSVHEFRESLRKYA 220
Query: 310 AKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
FAF+ K+ R V C C WR+ A++ + + I++ + H C
Sbjct: 221 IAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPNHTC 272
>gi|6691195|gb|AAF24533.1|AC007534_14 F7F22.11 [Arabidopsis thaliana]
Length = 612
Score = 42.7 bits (99), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 11/82 (13%)
Query: 282 PFHSYED----IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ ED I GK + SK+D L + A K F FK +++ C+D C W
Sbjct: 155 PYLDIEDDGKGIYKGKVYASKEDCQIGLAIYAIKNMFHFKQTRTN-------CSDEKCDW 207
Query: 338 RLRATKTAEDEYFEIRRFSNIH 359
R+ AT YFEI++ S H
Sbjct: 208 RILATLMKGTGYFEIKKASLQH 229
>gi|224068699|ref|XP_002302803.1| predicted protein [Populus trichocarpa]
gi|222844529|gb|EEE82076.1| predicted protein [Populus trichocarpa]
Length = 573
Score = 42.7 bits (99), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 286 YEDIVCG--KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATK 343
+E+ + G + F + +++ L+ + + F K K+ R V C+ C WR+ A++
Sbjct: 4 WENCITGLHQQFNNVREVRDALRKYSIAQGFTVKFKKNDSMRVSVKCSVDGCPWRIFASR 63
Query: 344 TAEDEYFEIRRFSNIHIC 361
+ F I+R + IH C
Sbjct: 64 LSTTHLFRIKRLNEIHTC 81
>gi|115435016|ref|NP_001042266.1| Os01g0190900 [Oryza sativa Japonica Group]
gi|113531797|dbj|BAF04180.1| Os01g0190900 [Oryza sativa Japonica Group]
Length = 1255
Score = 42.7 bits (99), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 67/166 (40%), Gaps = 22/166 (13%)
Query: 197 NQISVVVPGMQATE---VEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDD 253
N VV P MQ E EEQ + + + E D A + S+++ D
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMGMADEGERVSIIVDEMEREDSDNEEADDDASSDEEGD 187
Query: 254 IQRLDLSQDDC-------GDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLK 306
+ D + +D GD +P + +++ G + K ++ + +K
Sbjct: 188 VMATDWANEDFSGLVISEGDHVPWE------------YKENEVIEGARYAHKDEMKEAVK 235
Query: 307 LGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 236 HWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|222640607|gb|EEE68739.1| hypothetical protein OsJ_27421 [Oryza sativa Japonica Group]
Length = 569
Score = 42.7 bits (99), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 58/149 (38%), Gaps = 14/149 (9%)
Query: 229 GTEAPLGCGDGILAANIEVSEQQDDIQRLD----LSQDDCGDKLPIRRSNTRSAAINPFH 284
G + PLG N +V E D + D S D+ D + RR R + F
Sbjct: 231 GGDIPLG--------NNQVLEGGDGAEYFDSDGDASYDEDSDGVFTRRK-CRFPIFDSFA 281
Query: 285 SYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKT 344
F K L ++ A KK + K+ + R VC C W L A+
Sbjct: 282 DTPQFAVDMCFRGKDQLKDAIERYALKKKINIRYVKNEQKRIRAVCRWKGCPWLLYASHN 341
Query: 345 AEDEYFEIRRFSNIHICTQPPAKNRKKQT 373
+ ++F+I ++ H C P KN++ T
Sbjct: 342 SRSDWFQIVTYNPNHACC-PELKNKRLST 369
>gi|147835944|emb|CAN68410.1| hypothetical protein VITISV_022914 [Vitis vinifera]
Length = 744
Score = 42.7 bits (99), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 2/79 (2%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA--DTNCKWRLRATKTA 345
++ G F SK+DL +K A +N V +S + V C C WRLRA +
Sbjct: 105 ELFKGLRFESKEDLQYAVKRYAICRNQHLVVCESEPQLWAVRCKKWQEGCNWRLRACRRK 164
Query: 346 EDEYFEIRRFSNIHICTQP 364
FEI +++ H C P
Sbjct: 165 SHGMFEITKYAGPHTCVYP 183
>gi|147832830|emb|CAN61900.1| hypothetical protein VITISV_029814 [Vitis vinifera]
Length = 689
Score = 42.7 bits (99), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 52/275 (18%), Positives = 99/275 (36%), Gaps = 62/275 (22%)
Query: 86 YSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDV 145
Y GG+T +VS +I+ E V ++ G++ +D+N I ++ K + P+ + + +D D+
Sbjct: 25 YKGGRTNCSVVSKNISQSEFVSKVCGVLNLDSNS--IKLEFTVKFD-PSCLLPLHNDGDI 81
Query: 146 RFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPG 205
+ N C H + H T + P
Sbjct: 82 VNMFKFNDM--------FC----------------HVYISHCTECGD-------DFICPT 110
Query: 206 MQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDDIQRLDLSQDDCG 265
T + VA+N + +G + +S + IQ + SQ
Sbjct: 111 SGPTPI------------VASNSAHVSSIG------QPPLHISNESPTIQSVGFSQ---- 148
Query: 266 DKLPIRRSNTRSAAINPFHSYEDIV-CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKD 324
R + T + + P IV G TF + + + L + F + +++
Sbjct: 149 -----RCAMTNTVQLQPSRFEHSIVGSGHTFPNASEFRDAIYLMSLAGKFRYSYKRNSPK 203
Query: 325 RFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIH 359
VVC +C W++ A + ++ F N+H
Sbjct: 204 HMTVVCTIEDCPWKITARAIGDSNIVQVHTFRNVH 238
>gi|22202805|dbj|BAC07461.1| hypothetical protein [Oryza sativa Japonica Group]
gi|50510005|dbj|BAD30618.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 325
Score = 42.7 bits (99), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
GKTF SK+ L + +KN K S S+K + C D +C WRL AT T D
Sbjct: 220 GKTFDSKEHLKIAIGEFHIEKNAEVKYSTSSKSKIVAECTDNSCTWRLYATPTGID 275
>gi|9759134|dbj|BAB09619.1| mutator-like transposase-like protein [Arabidopsis thaliana]
gi|16648945|gb|AAL24324.1| mutator-like transposase-like protein [Arabidopsis thaliana]
gi|20259876|gb|AAM13285.1| mutator-like transposase-like protein [Arabidopsis thaliana]
Length = 597
Score = 42.7 bits (99), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 34/73 (46%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ GK F + + LK A +F ++ KS + RF C+ C WR+ A K +
Sbjct: 25 LAVGKEFPDVETCRRTLKDLAIALHFDLRIVKSDRSRFIAKCSKEGCPWRIHAAKCPGVQ 84
Query: 349 YFEIRRFSNIHIC 361
F +R ++ H C
Sbjct: 85 TFTVRTLNSEHTC 97
>gi|242117499|dbj|BAH79982.1| transposon mutator sub-class [Oryza sativa Indica Group]
Length = 1114
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 65/161 (40%), Gaps = 28/161 (17%)
Query: 196 SNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGD-----GILAANIEVSEQ 250
SN VV P MQ E + + E +G D GI+ +E +
Sbjct: 127 SNAGDVVGPSMQNEENQPRE---------------ELAMGMADEGERVGIIVDEMEREDS 171
Query: 251 QDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN-------PFHSYE-DIVCGKTFTSKKDLI 302
++ D S D+ GD + +N + + P+ E +++ G + K ++
Sbjct: 172 DNEQAEDDASSDEEGDVMATDWANEDFSELVISVGDHVPWEYKENEVIEGARYAHKDEMK 231
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATK 343
+ +K A F+V KST +EV C +C WR+ A K
Sbjct: 232 EAMKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYK 272
>gi|125553424|gb|EAY99133.1| hypothetical protein OsI_21092 [Oryza sativa Indica Group]
Length = 1005
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 55/141 (39%), Gaps = 6/141 (4%)
Query: 237 GDGILAANIEVSEQQDDIQRLD----LSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCG 292
G I N +V E D + D S D+ D + RR R + F
Sbjct: 231 GGDIPLGNNQVLEGGDGAEYFDSDGDASYDEDSDGVFTRRK-CRFPIFDSFADTPQFAVD 289
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
F K L ++ A KK + K+ + R VC C W L A+ + ++F+I
Sbjct: 290 MCFRGKDQLKDAIERYALKKKINIRYVKNEQKRIRAVCRWKGCPWLLYASHNSRSDWFQI 349
Query: 353 RRFSNIHICTQPPAKNRKKQT 373
++ H C P KN++ T
Sbjct: 350 VTYNPNHACC-PELKNKRLST 369
>gi|147765965|emb|CAN76929.1| hypothetical protein VITISV_024833 [Vitis vinifera]
Length = 952
Score = 42.4 bits (98), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 34/78 (43%), Gaps = 3/78 (3%)
Query: 298 KKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA---DTNCKWRLRATKTAEDEYFEIRR 354
++DL +K+ + + + V STK + C + C WRLRAT FEI +
Sbjct: 162 REDLQYAVKMYSINSHQEYIVLSSTKKLLALRCKKAEQSQCPWRLRATVVKGTSLFEINK 221
Query: 355 FSNIHICTQPPAKNRKKQ 372
+S H C P Q
Sbjct: 222 YSGPHTCVNPCMNQDHHQ 239
>gi|297791969|ref|XP_002863869.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297309704|gb|EFH40128.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 279
Score = 42.4 bits (98), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 32/60 (53%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
DI K F +K +L++K++ + F F+ S K+R +V D C WR+R + A +
Sbjct: 200 DIYVNKYFMNKAELMQKMRTWELEYKFEFRYRWSNKERVVLVFVDDKCTWRMRTMRFASN 259
>gi|449449158|ref|XP_004142332.1| PREDICTED: serine/threonine-protein phosphatase 7 long form homolog
[Cucumis sativus]
Length = 501
Score = 42.4 bits (98), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 1/77 (1%)
Query: 69 LVMYDGKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLY 128
+ ++G P G Y G K I V IT+ EL+DR+YG+ D + + + Y
Sbjct: 1 MCYWNGTTMSGPHGIS-YEGAAPKPIRVGYGITHNELIDRIYGVTGFDKQSFKLKIICRY 59
Query: 129 KANIPTMPVEIVDDDDV 145
A +PV I DD+ +
Sbjct: 60 PACREYIPVPIDDDESI 76
>gi|31432298|gb|AAP53948.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1005
Score = 42.4 bits (98), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 55/141 (39%), Gaps = 6/141 (4%)
Query: 237 GDGILAANIEVSEQQDDIQRLD----LSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCG 292
G I N +V E D + D S D+ D + RR R + F
Sbjct: 231 GGDIPLGNNQVLEGGDGAEYFDSDGDASYDEDSDGVFTRRK-CRFPIFDSFADTPQFAVD 289
Query: 293 KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
F K L ++ A KK + K+ + R VC C W L A+ + ++F+I
Sbjct: 290 MCFRGKDQLKDAIERYALKKKINIRYVKNEQKRIRAVCRWKGCPWLLYASHNSRSDWFQI 349
Query: 353 RRFSNIHICTQPPAKNRKKQT 373
++ H C P KN++ T
Sbjct: 350 VTYNPNHACC-PELKNKRLST 369
>gi|356521263|ref|XP_003529276.1| PREDICTED: uncharacterized protein LOC100800865 [Glycine max]
Length = 582
Score = 42.4 bits (98), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 29/70 (41%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F K +K A ++F + KS R+ CA C WR+RA K F
Sbjct: 9 GQEFPDVKAFRNAIKEAAIAQHFELRTIKSDLIRYFAKCASDGCPWRIRAVKLPNAPTFT 68
Query: 352 IRRFSNIHIC 361
IR H C
Sbjct: 69 IRSIDGTHTC 78
>gi|242096220|ref|XP_002438600.1| hypothetical protein SORBIDRAFT_10g022527 [Sorghum bicolor]
gi|241916823|gb|EER89967.1| hypothetical protein SORBIDRAFT_10g022527 [Sorghum bicolor]
Length = 338
Score = 42.4 bits (98), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
D++ G + D L+ A K+F KS K RF C C W +RA KT D
Sbjct: 202 DMLVGTHYPCMDDFRMALRQHAIVKDFELGTKKSDKGRFSGYCTAAVCPWVIRA-KTQRD 260
Query: 348 EYFEIRRFSNIHICTQPPAKNRK 370
+ I+ S+ H C P+ NR+
Sbjct: 261 KSVRIQINSDDHKC---PSMNRE 280
>gi|194688834|gb|ACF78501.1| unknown [Zea mays]
Length = 863
Score = 42.4 bits (98), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKS-TKDRFEVVCADTNCKWRLRATKTAE 346
DI G F L + L+ + + FKV S + R+ VVC + C WR+ A + E
Sbjct: 294 DIKVGLLFKDLSTLRQWLQEYSVNRKRPFKVRHSYAQRRYTVVCEVSECNWRVCARRQKE 353
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
F+I + H C Q ++ +Q
Sbjct: 354 TGKFKITKIVGPHTCAQTELSSKHRQ 379
>gi|6598590|gb|AAF18645.1|AC006228_16 F5J5.10 [Arabidopsis thaliana]
Length = 727
Score = 42.4 bits (98), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 59/274 (21%), Positives = 107/274 (39%), Gaps = 38/274 (13%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
+ ITY LVD++ V +D E + +K Y + IVDD+DV F+ E+
Sbjct: 40 EKITYSMLVDKIMKKVAID--EASVKLKLSYNLSKVRRETYIVDDEDVFIFLTES---DE 94
Query: 158 ELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLN 217
E R P+ E G + E + R S ++V+ EV E +
Sbjct: 95 ESRIPVLHVEELNGIGVERRE--------EISVPERRISVGVNVL------NEVYEMVHD 140
Query: 218 AFNGDPVAANFGTEAPLGCGDGILAANIEVSEQ--QDDIQRLDLSQDDCGDKLPIRRSNT 275
++ D + + ++ I E+ + + ++ +D
Sbjct: 141 GYDND-----------VNDCENVVGTKIVAVERPMERPVNSVEEEDEDDRIDYDDIHDIP 189
Query: 276 RSAAINP-FHSYED---IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC- 330
RS + P ++D I G+ F S++ + + + A K+ F KS R + C
Sbjct: 190 RSVEVTPQVIEWDDGTGIEIGQEFCSREAVWELVNRAAKKEVFGVYTIKSDPLRLMLRCR 249
Query: 331 -ADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
A C W LR +T + ++ +R +H C++
Sbjct: 250 QASKGCTWYLRVARTKKSHFWSVRVHRKMHTCSR 283
>gi|225450448|ref|XP_002276424.1| PREDICTED: uncharacterized protein LOC100251030 [Vitis vinifera]
Length = 768
Score = 42.4 bits (98), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 58/293 (19%), Positives = 109/293 (37%), Gaps = 38/293 (12%)
Query: 74 GKWFCSPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIP 133
G++ G Y GG I + D + + E + + + +++K N
Sbjct: 14 GEFEADKDGSLSYRGGDAHAIDIDDQMKFNEFKMEVAEMFNCSIS--TMSIKYFLPKNKK 71
Query: 134 TMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQETEGYHAALDHQTPAASR 193
T+ + I +D D++ I+ H + I + E ALD ASR
Sbjct: 72 TL-ITISNDKDLKRMIK---FHVDSVTVDIYVMTEEV-----------VALDVSNMPASR 116
Query: 194 SESNQIS-VVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGI----LAANIEVS 248
S +S VVP ++++ ++ D + PL D + A I +
Sbjct: 117 SSRTTLSEAVVPVDAPLDMKDDMVD----DTTYPDVSLGLPLDVVDDTTHVDVDAQITMP 172
Query: 249 EQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLG 308
+ + L +S ++ K + NT + G+ F+ + + L+
Sbjct: 173 NEISPVLPLSISNEEKHVKAAQQWQNTITGV------------GQRFSGVHEFREALRKY 220
Query: 309 AAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
A FAF+ K+ R V C C WR+ A++ + + I++ + H C
Sbjct: 221 AIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNATHTC 273
>gi|356527977|ref|XP_003532582.1| PREDICTED: uncharacterized protein LOC100775909 [Glycine max]
Length = 582
Score = 42.4 bits (98), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 30/74 (40%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
+ G+ F K +K A ++F + KS R+ CA C WR+RA K
Sbjct: 5 NFFVGQKFHDVKAFRNAIKEAAIAQHFELRTIKSDLIRYFAKCASDGCPWRIRAVKLPNA 64
Query: 348 EYFEIRRFSNIHIC 361
F IR H C
Sbjct: 65 PTFTIRSIDGTHTC 78
>gi|413954737|gb|AFW87386.1| hypothetical protein ZEAMMB73_887474 [Zea mays]
Length = 723
Score = 42.4 bits (98), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 2/88 (2%)
Query: 285 SYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
S+++++ G+ F KD L A F ++ K+ R C C WR+ A+
Sbjct: 140 SWKNVITGVGQVFEGPKDFRDALHKYAIAHRFHYRFIKNDSSRVTAECTGEGCPWRIHAS 199
Query: 343 KTAEDEYFEIRRFSNIHICTQPPAKNRK 370
K+ + F +++ S H C K+ +
Sbjct: 200 KSPAKKEFMVKKISESHTCESETVKSNR 227
>gi|448931652|gb|AGE55213.1| hypothetical protein PBCVMA1E_205R, partial [Paramecium bursaria
Chlorella virus MA-1E]
Length = 155
Score = 42.4 bits (98), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 1/72 (1%)
Query: 409 LNPTVELRNGTDVIWQIPDSPKAVLFLAHGCNGRAVHFWDRSPNCPNCIGLPEERLLVLH 468
L P V +G +Q P +PK ++ HGC+ W RS N P G E+
Sbjct: 82 LKPVVITVDGRHHAYQSPANPKGLVVFLHGCSRSIFGAWPRSSN-PKFFGYSEDVSRTKQ 140
Query: 469 ALSRGFAIITIS 480
L G+AI+ IS
Sbjct: 141 VLKAGYAILYIS 152
>gi|225431049|ref|XP_002274082.1| PREDICTED: uncharacterized protein LOC100261536 [Vitis vinifera]
Length = 602
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 32/73 (43%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+V G+ F + + LK A +F ++ KS + RF C+ C WR+ K
Sbjct: 28 LVIGQEFPDVETCRRTLKDIAIALHFDLRIVKSDRSRFIAKCSKEGCPWRVHVAKCPGVP 87
Query: 349 YFEIRRFSNIHIC 361
F IR +H C
Sbjct: 88 TFSIRTLHGVHTC 100
>gi|8777291|dbj|BAA96881.1| mutator-like transposase [Arabidopsis thaliana]
Length = 875
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
I GK + SK+D L + A K F FK +++ + F + C+D C WR+ AT
Sbjct: 349 IYQGKVYASKEDCQIGLAIFAIKNQFHFKQTRTKWNYFVLSCSDEKCDWRILAT 402
>gi|147854673|emb|CAN80243.1| hypothetical protein VITISV_031021 [Vitis vinifera]
Length = 587
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 32/73 (43%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+V G+ F + + LK A +F ++ KS + RF C+ C WR+ K
Sbjct: 13 LVIGQEFPDVETCRRTLKDIAIALHFDLRIVKSDRSRFIAKCSKEGCPWRVHVAKCPGVP 72
Query: 349 YFEIRRFSNIHIC 361
F IR +H C
Sbjct: 73 TFSIRTLHGVHTC 85
>gi|7267580|emb|CAB78061.1| putative protein [Arabidopsis thaliana]
Length = 960
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/274 (20%), Positives = 107/274 (39%), Gaps = 33/274 (12%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
+ I+Y LVD++ V +D E + +K Y + IVDD+DV F+ E+
Sbjct: 40 EKISYSMLVDKIMKKVAID--EASVKLKLSYNLSKVRRETYIVDDEDVFIFLTES---DE 94
Query: 158 ELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLN 217
E R P+ +E G + R S ++V+ + EV E +
Sbjct: 95 ESRIPVL--------HVEELNGIGVERREEISVPERRSSVGVNVIEDDV-LNEVCEMVHD 145
Query: 218 AFNGDPVAANFGTEAPLGCGDGILAANIEVSEQ--QDDIQRLDLSQDDCGDKLPIRRSNT 275
++ D + + ++ I E+ + + ++ +D
Sbjct: 146 GYDND-----------VNDCENVVGTEIVAVERPMERPVNSVEEEDEDDRIDYDDIHDIP 194
Query: 276 RSAAINP-FHSYED---IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC- 330
RS + P ++D I G+ F S++ + + + A K+ F KS R + C
Sbjct: 195 RSVEVTPQVIEWDDGTGIEIGQEFCSREAVWELVNRAAKKEVFGVYTIKSDPLRLMLRCR 254
Query: 331 -ADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
A C W LR +T + ++ +R +H C++
Sbjct: 255 QASKGCTWYLRVARTKKSHFWSVRMHRKMHTCSR 288
>gi|108862710|gb|ABA98628.2| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 947
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 67/166 (40%), Gaps = 22/166 (13%)
Query: 197 NQISVVVPGMQATE---VEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDD 253
N VV P MQ E EEQ + + + E D A + S+++ D
Sbjct: 128 NAGDVVGPSMQNEENQPKEEQAMGMADEGERVSIIVDEMEREDSDNEEADDDASSDEEGD 187
Query: 254 IQRLDLSQDDC-------GDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLK 306
+ D + +D GD +P + +++ G + K ++ + +K
Sbjct: 188 VMATDWANEDFSGLVISEGDHVPWE------------YKENEVIEGARYAHKDEMKEAVK 235
Query: 307 LGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
A F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 236 HWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWNDYWKV 281
>gi|5732430|gb|AAD49098.1|AF177535_2 contains similarity to maize transposon MuDR (GB:M76978)
[Arabidopsis thaliana]
Length = 872
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
I GK + SK+D L + A K F FK +++ + F + C+D C WR+ AT
Sbjct: 349 IYQGKVYASKEDCQIGLAIFAIKNQFHFKQTRTKWNYFVLSCSDEKCDWRILAT 402
>gi|224122824|ref|XP_002318925.1| predicted protein [Populus trichocarpa]
gi|222857301|gb|EEE94848.1| predicted protein [Populus trichocarpa]
Length = 580
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/70 (27%), Positives = 34/70 (48%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F+S + + L+ A FAF+ K+ R V C C WR+ A++ + +
Sbjct: 19 GQRFSSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLIC 78
Query: 352 IRRFSNIHIC 361
I++ + H C
Sbjct: 79 IKKMNPTHTC 88
>gi|5690095|emb|CAB51950.1| transposase related protein [Zea mays]
Length = 863
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKS-TKDRFEVVCADTNCKWRLRATKTAE 346
DI G F L + L+ + + FKV S + R+ VVC + C WR+ A + E
Sbjct: 294 DIKVGLLFKDLPTLRQWLQEYSVNRKRPFKVRHSYAQRRYTVVCEVSECNWRVCARRQKE 353
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
F+I + H C Q ++ +Q
Sbjct: 354 TGKFKITKIVGPHTCAQTELSSKHRQ 379
>gi|90399302|emb|CAH68172.1| H0323C08.14 [Oryza sativa Indica Group]
Length = 1532
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 37/166 (22%), Positives = 65/166 (39%), Gaps = 22/166 (13%)
Query: 197 NQISVVVPGMQATE---VEEQNLNAFNGDPVAANFGTEAPLGCGDGILAANIEVSEQQDD 253
N VV P MQ E EEQ + + E D A + S+++ D
Sbjct: 128 NAGDVVGPSMQNEENQPREEQAMGMADEGERVGTIVDEMEREDSDNEEADDDASSDEEGD 187
Query: 254 IQRLDLSQDDC-------GDKLPIRRSNTRSAAINPFHSYEDIVCGKTFTSKKDLIKKLK 306
+ D + +D GD +P + +++ G + K ++ + +K
Sbjct: 188 VMATDWANEDFSGLVISEGDHVPWE------------YKENEVIEGARYAHKDEMKEAVK 235
Query: 307 LGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEI 352
F+V KST +EV C +C WR+ A K ++Y+++
Sbjct: 236 HWTVSLQREFRVVKSTNYVYEVRCIKEDCPWRVHAYKGKWNDYWKV 281
>gi|14165348|gb|AAK55480.1|AC084295_13 transposon protein, putative, mutator sub-class [Oryza sativa
Japonica Group]
Length = 883
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 231 EAPLGCGD-----GILAANIEVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAIN---- 281
E +G D GI+ +E + ++ D S D+ GD + +N + +
Sbjct: 147 EQAMGMADEGERVGIIVDEMEREDSDNEEADDDASSDEKGDVMATDWANEDFSGLVISEG 206
Query: 282 ---PFHSYE-DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKW 337
P+ E +++ G + K ++ + +K A F+V KST +EV C +C W
Sbjct: 207 DHVPWEYKENEVIDGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPW 266
Query: 338 RLRATKTAE 346
R+ A K E
Sbjct: 267 RVHAYKGVE 275
>gi|9369392|gb|AAF87140.1|AC002423_5 T23E23.9 [Arabidopsis thaliana]
Length = 972
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 58/274 (21%), Positives = 108/274 (39%), Gaps = 33/274 (12%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
+ I+Y LVD++ V +D E + +K Y + IVDD+DV F+ E+
Sbjct: 40 EKISYSMLVDKIMKKVAID--EAFVKLKLSYNLSKFRRETYIVDDEDVFIFLTES---DE 94
Query: 158 ELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLN 217
E R P+ E G + E + R S ++V+ + EV E +
Sbjct: 95 ESRIPVLHVEELNGIGVERRE--------EISVPERRSSVGVNVIEDDV-LNEVCEMVHD 145
Query: 218 AFNGDPVAANFGTEAPLGCGDGILAANIEVSEQ--QDDIQRLDLSQDDCGDKLPIRRSNT 275
++ D + + ++ I E+ + + ++ +D
Sbjct: 146 GYDND-----------VNDCENVVGTEIVAVERPMERPVNSVEEEDEDDRIDYDDIHDIP 194
Query: 276 RSAAINP-FHSYED---IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC- 330
RS + P ++D I G+ F S++ + + + A K+ F KS R + C
Sbjct: 195 RSVEVTPQVIEWDDGTGIEIGQEFCSREAVWELVNRAAKKEVFGVYTIKSDPMRLMLRCR 254
Query: 331 -ADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
A C W LR +T + ++ +R +H C++
Sbjct: 255 QASKGCTWYLRVARTKKSHFWSVRVHRKMHTCSR 288
>gi|242072298|ref|XP_002446085.1| hypothetical protein SORBIDRAFT_06g001530 [Sorghum bicolor]
gi|241937268|gb|EES10413.1| hypothetical protein SORBIDRAFT_06g001530 [Sorghum bicolor]
Length = 473
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 45/106 (42%), Gaps = 2/106 (1%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G TF + K L + + K K KD+ + C C W + +KT++ +F+
Sbjct: 254 GMTFRDSRQFKKTLVKYGLRAHKLLKFLKDEKDKVKATCDWFGCNWMIYRSKTSKSRWFK 313
Query: 352 IRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPNSSYGLVK 397
+ F + H C PP ++ K T+ + P S GL+K
Sbjct: 314 VVTFVDEHTC--PPRRDNKLVTSTVIAKHYYNQIKDNPTWSVGLMK 357
>gi|242054657|ref|XP_002456474.1| hypothetical protein SORBIDRAFT_03g037000 [Sorghum bicolor]
gi|241928449|gb|EES01594.1| hypothetical protein SORBIDRAFT_03g037000 [Sorghum bicolor]
Length = 406
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKS-TKDRFEVVCADTNCKWRLRATKTAE 346
+I G F L + L+ + ++ FKV S + R+ VVC +C WR+ A K
Sbjct: 42 EIQKGMVFKDLHALRRWLQDYSVRRKRPFKVRHSYVERRYTVVCEMADCNWRVCACKQKA 101
Query: 347 DEYFEIRRFSNIHICTQPPAKNRKKQ 372
F+I R H C Q + + +Q
Sbjct: 102 TGKFKITRIVGPHTCAQTDLQQKHRQ 127
>gi|29788866|gb|AAP03412.1| putative mutator-like transposase [Oryza sativa Japonica Group]
Length = 844
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/65 (29%), Positives = 35/65 (53%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
+I+ G + K ++ + +K A F+V KST +EV C +C WR+ A K +
Sbjct: 191 EIIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAYKGKWN 250
Query: 348 EYFEI 352
+Y+++
Sbjct: 251 DYWKV 255
>gi|20197172|gb|AAC17090.2| Mutator-like transposase [Arabidopsis thaliana]
Length = 764
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 57/274 (20%), Positives = 107/274 (39%), Gaps = 33/274 (12%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
+ I+Y LVD++ V +D E + +K Y + IVDD+DV F+ E+
Sbjct: 13 EKISYSMLVDKIMKKVAID--EASVKLKLSYNLSKVRRETYIVDDEDVFIFLTES---DE 67
Query: 158 ELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLN 217
E R P+ +E G + R S ++V+ + EV E +
Sbjct: 68 ESRIPVL--------HVEELNGIGVERREEISVPERRSSVGVNVIEDDV-LNEVCEMVHD 118
Query: 218 AFNGDPVAANFGTEAPLGCGDGILAANIEVSEQ--QDDIQRLDLSQDDCGDKLPIRRSNT 275
++ D + + ++ I E+ + + ++ +D
Sbjct: 119 GYDND-----------VNDCENVVGTEIVAVERPMERPVNSVEEEDEDDRIDYDDIHDIP 167
Query: 276 RSAAINP-FHSYED---IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC- 330
RS + P ++D I G+ F S++ + + + A K+ F KS R + C
Sbjct: 168 RSVEVTPQVIEWDDGTGIEIGQEFCSREAVWELVNRAAKKEVFGVYTIKSDPLRLMLRCR 227
Query: 331 -ADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
A C W LR +T + ++ +R +H C++
Sbjct: 228 QASKGCTWYLRVARTKKSHFWSVRVHRKMHTCSR 261
>gi|6175165|gb|AAF04891.1|AC011437_6 Mutator-like transposase [Arabidopsis thaliana]
gi|18389286|gb|AAL67086.1| putative Mutator transposase [Arabidopsis thaliana]
gi|20465953|gb|AAM20162.1| putative mutator transposase [Arabidopsis thaliana]
Length = 757
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 30/73 (41%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ F K + L+ A +F + KS K RF C+ C WR+ A K
Sbjct: 182 LTVGQEFPDVKSCRRALRDMAIALHFEMQTIKSDKTRFTAKCSSDGCPWRVHAAKLPGVP 241
Query: 349 YFEIRRFSNIHIC 361
F IR H C
Sbjct: 242 TFTIRTIHESHSC 254
>gi|147768131|emb|CAN60604.1| hypothetical protein VITISV_020063 [Vitis vinifera]
Length = 944
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 34/83 (40%), Gaps = 2/83 (2%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCA--DTNCKWRLRATKTAEDEY 349
G F SK+DL +K A +N V + + V C C WRLRA +
Sbjct: 22 GLRFESKEDLQYAVKRYAICRNQHLVVCELEPQLWAVRCKKWQEGCNWRLRACRRKSHGM 81
Query: 350 FEIRRFSNIHICTQPPAKNRKKQ 372
FEI +++ H C P Q
Sbjct: 82 FEITKYAGPHTCVYPKLSQDHSQ 104
>gi|125538436|gb|EAY84831.1| hypothetical protein OsI_06197 [Oryza sativa Indica Group]
Length = 638
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/116 (23%), Positives = 43/116 (37%), Gaps = 10/116 (8%)
Query: 265 GDKLPIRRSNTRSAAINPFHSYEDIVC----------GKTFTSKKDLIKKLKLGAAKKNF 314
G+ + N AA P S +C G+ F K+ L A F
Sbjct: 29 GEAHAMHVKNHAVAATTPDGSKRQKICASWENAITGAGQVFEGPKEFRDALHKYAIAHRF 88
Query: 315 AFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQPPAKNRK 370
++ K+ R V C C WR+ A+K+ + F I++ H C K+ +
Sbjct: 89 HYRFVKNDSSRVTVECTAEGCPWRIHASKSPAKKDFMIKKVFGSHTCESESVKSHR 144
>gi|225457205|ref|XP_002284005.1| PREDICTED: uncharacterized protein LOC100267211 [Vitis vinifera]
Length = 267
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/78 (25%), Positives = 33/78 (42%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ + G+ F + L A F +S + R CA +C+WR+ A+K +
Sbjct: 32 DTLTVGQEFPDARCFRDALVASAIASKFELTFIRSDRSRVTARCAADDCQWRIHASKLPD 91
Query: 347 DEYFEIRRFSNIHICTQP 364
E F+I+ H C P
Sbjct: 92 GETFQIKTLKGKHSCVWP 109
>gi|42562928|ref|NP_176608.2| MuDR family transposase [Arabidopsis thaliana]
gi|12323472|gb|AAG51711.1|AC066689_10 hypothetical protein; 95918-93759 [Arabidopsis thaliana]
gi|332196098|gb|AEE34219.1| MuDR family transposase [Arabidopsis thaliana]
Length = 719
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/131 (23%), Positives = 59/131 (45%), Gaps = 5/131 (3%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
D+ G F + +L K + ++ V ++ K+ + C CKW LRA + E
Sbjct: 184 DMHLGLCFKDRDELKKAVDWWCIRRRRNCIVRETEKEMYTFECVRWKCKWSLRAARMEEH 243
Query: 348 EYFEIRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPNSSYGLVKQKWNSFDSSV 407
EI +++ H C+ + + + A + ER +R + P S +K+ W +
Sbjct: 244 GLVEITKYTGPHTCSHEYPNDFESEFAAD--EIERVVRIQ-PTLSIAELKKWWKE-KTGY 299
Query: 408 QLNPTVELRNG 418
+L T ++R+G
Sbjct: 300 ELQ-TSKMRDG 309
>gi|125537276|gb|EAY83764.1| hypothetical protein OsI_38982 [Oryza sativa Indica Group]
Length = 747
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 31/74 (41%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+V G+ F + ++ A +F + KS K RF C+ C WR+ A K
Sbjct: 172 LVVGQEFPDVMSCRRAIRNTAIACHFEIQTVKSDKTRFTAKCSADGCPWRIHAAKLPGVP 231
Query: 349 YFEIRRFSNIHICT 362
F IR + H C
Sbjct: 232 TFSIRTIHDNHSCV 245
>gi|115489412|ref|NP_001067193.1| Os12g0597300 [Oryza sativa Japonica Group]
gi|77556405|gb|ABA99201.1| transposon protein, putative, Mutator sub-class, expressed [Oryza
sativa Japonica Group]
gi|113649700|dbj|BAF30212.1| Os12g0597300 [Oryza sativa Japonica Group]
gi|125579958|gb|EAZ21104.1| hypothetical protein OsJ_36747 [Oryza sativa Japonica Group]
Length = 749
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 31/74 (41%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+V G+ F + ++ A +F + KS K RF C+ C WR+ A K
Sbjct: 174 LVVGQEFPDVMSCRRAIRNTAIACHFEIQTVKSDKTRFTAKCSADGCPWRIHAAKLPGVP 233
Query: 349 YFEIRRFSNIHICT 362
F IR + H C
Sbjct: 234 TFSIRTIHDNHSCV 247
>gi|293337036|ref|NP_001168705.1| uncharacterized protein LOC100382497 [Zea mays]
gi|223950329|gb|ACN29248.1| unknown [Zea mays]
gi|224030155|gb|ACN34153.1| unknown [Zea mays]
Length = 751
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 31/74 (41%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ F + ++ A +F + KS K RF CA +C WR+ A K
Sbjct: 179 LAVGQQFPDVVSCRRAIRNTAIACHFEIQTVKSDKSRFTAKCAAESCPWRIHAAKLPGVP 238
Query: 349 YFEIRRFSNIHICT 362
F IR + H C
Sbjct: 239 TFSIRTIHDNHSCV 252
>gi|55168183|gb|AAV44049.1| putative polyprotein [Oryza sativa Japonica Group]
gi|55168324|gb|AAV44190.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1456
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/176 (21%), Positives = 71/176 (40%), Gaps = 17/176 (9%)
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG--ILAANIEVSEQQDDIQRLDLSQ 261
P + E + +NA V A G + P G D + A ++ E++D
Sbjct: 166 PSAEVNETSVEEVNAREDGGVVAPVGIQ-PGGVADEGETVGAIVDEMEREDSNNERVEEG 224
Query: 262 DDCGDKLPIRRSNTRSAAINPF-----------HSYEDIVCGKTFTSKKDLIKKLKLGAA 310
D D++ I + S + + +++ G ++ +D+ + +K A
Sbjct: 225 DSSDDEMDINPAEWASEDFSGLIVSEEDSVRWEYKENEVIQGAIYSRAEDMKEAVKHFAV 284
Query: 311 KKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
+ F+V+KS + ++EV C C WR+ A K +Y+ + + H C P
Sbjct: 285 SLHREFRVAKSNRSQYEVRCVKEKDGCPWRVHAYKGKWKDYWTVSVVTK-HTCFLP 339
>gi|218194045|gb|EEC76472.1| hypothetical protein OsI_14206 [Oryza sativa Indica Group]
Length = 1029
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 29/132 (21%), Positives = 52/132 (39%), Gaps = 4/132 (3%)
Query: 246 EVSEQQDDIQRLDLSQDDCGDKLPIRRSNTRSAAINPFHSYEDI---VCGKTFTSKKDLI 302
+V+++ D I D D D + R F ++ ++ F K L
Sbjct: 193 QVNDEGDGIAYFDSEDDASYDDDSGDSAERRKCRFPIFDNHAELPQNAVDMCFRGKMQLK 252
Query: 303 KKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHICT 362
+ A K + + +DR VC C W L A+ + ++F+I ++ H C
Sbjct: 253 DAITRYALHKKVTLQFITNDRDRLRAVCMRKGCTWLLHASYNSRSDWFQIVTYNGNHSCC 312
Query: 363 QPPAKNRKKQTA 374
P KN++ T+
Sbjct: 313 -PDLKNKRLSTS 323
>gi|6692119|gb|AAF24584.1|AC007764_26 F22C12.1 [Arabidopsis thaliana]
Length = 3290
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 31/131 (23%), Positives = 59/131 (45%), Gaps = 5/131 (3%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
D+ G F + +L K + ++ V ++ K+ + C CKW LRA + E
Sbjct: 2755 DMHLGLCFKDRDELKKAVDWWCIRRRRNCIVRETEKEMYTFECVRWKCKWSLRAARMEEH 2814
Query: 348 EYFEIRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPNSSYGLVKQKWNSFDSSV 407
EI +++ H C+ + + + A + ER +R + P S +K+ W +
Sbjct: 2815 GLVEITKYTGPHTCSHEYPNDFESEFAAD--EIERVVRIQ-PTLSIAELKKWWKE-KTGY 2870
Query: 408 QLNPTVELRNG 418
+L T ++R+G
Sbjct: 2871 ELQ-TSKMRDG 2880
>gi|224124326|ref|XP_002329995.1| predicted protein [Populus trichocarpa]
gi|222871420|gb|EEF08551.1| predicted protein [Populus trichocarpa]
Length = 580
Score = 40.4 bits (93), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/70 (27%), Positives = 33/70 (47%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F S + + L+ A FAF+ K+ R V C C WR+ A++ + +
Sbjct: 19 GQRFRSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLIC 78
Query: 352 IRRFSNIHIC 361
I++ + H C
Sbjct: 79 IKKMNPAHTC 88
>gi|224125676|ref|XP_002319648.1| predicted protein [Populus trichocarpa]
gi|222858024|gb|EEE95571.1| predicted protein [Populus trichocarpa]
Length = 581
Score = 40.4 bits (93), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 30/74 (40%)
Query: 288 DIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAED 347
++ G+ F + L+ A +F + KS K RF CA C WR+ A K
Sbjct: 6 ELAVGQEFPDVFSCRRALRDTAIALHFEMQTIKSDKTRFTAKCATEGCPWRIHAAKLPGV 65
Query: 348 EYFEIRRFSNIHIC 361
F IR H C
Sbjct: 66 PTFTIRTIHESHTC 79
>gi|39545839|emb|CAE04747.3| OSJNBb0060E08.10 [Oryza sativa Japonica Group]
Length = 889
Score = 40.4 bits (93), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 283 FHSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
+ YE ++ G + K ++ + +K A F+V KST +EV C +C WR+ A
Sbjct: 83 YKEYE-VIEGARYAHKDEMKEAVKHWAVSLQREFRVVKSTNYVYEVRCMKEDCPWRVHAY 141
Query: 343 KTAEDEYFEI 352
K ++Y+++
Sbjct: 142 KGKWNDYWKV 151
>gi|55168240|gb|AAV44106.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 909
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 38/177 (21%), Positives = 74/177 (41%), Gaps = 19/177 (10%)
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGC---GDGILAANIEVSEQQDDIQRLDLS 260
P + E + +NA V A G + P G G+ + A E+ + D +R++
Sbjct: 126 PSAEVNETSVEEVNAREDGGVVAPVGIQ-PGGVADEGETVGAIVNEMEREDSDNERVE-E 183
Query: 261 QDDCGDKLPIRRSNTRSAAINPF-----------HSYEDIVCGKTFTSKKDLIKKLKLGA 309
D D+ I ++ S + + +++ G ++ +D+ + +K A
Sbjct: 184 GDSSDDETDINQAEWASEDFSGLIVSEEDSVRWEYKENEVIQGAIYSRAEDMKEAVKHFA 243
Query: 310 AKKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
+ F+V+KS + ++EV C C WR+ A K +Y+ + + H C P
Sbjct: 244 VSLHREFRVAKSNRSQYEVRCVKEKDGCPWRVHAYKGKWKDYWTVSVVTK-HTCFLP 299
>gi|357151529|ref|XP_003575819.1| PREDICTED: uncharacterized protein LOC100832575 [Brachypodium
distachyon]
Length = 886
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 19/70 (27%), Positives = 36/70 (51%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G F SK+ + K L K++ +K KD+ C+ +CKW + A+KT ++++ +
Sbjct: 377 GMVFRSKEHIKKALIRYGIKQHVHLLFTKDEKDKIRAHCSWPDCKWMIYASKTTKNKWLQ 436
Query: 352 IRRFSNIHIC 361
+ + H C
Sbjct: 437 VSTLVDEHHC 446
>gi|2565007|gb|AAB81877.1| predicted transposon protein [Arabidopsis thaliana]
gi|7267503|emb|CAB77986.1| predicted transposon protein [Arabidopsis thaliana]
Length = 907
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 63/317 (19%), Positives = 122/317 (38%), Gaps = 49/317 (15%)
Query: 81 SGYY-----LYSGGQTKGIIV-SDDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPT 134
+GYY S G+++ + +TY LV ++ + +D +T+ Y
Sbjct: 12 NGYYSKEHEWISNNSLSGVLLRTSKVTYSMLVKKICKKIGIDETSRKVTLS--YILCTSQ 69
Query: 135 MPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGS----QCQETEGYHA-ALDHQTP 189
IV DDD ++ + +R + + + K S Q EG + ++++
Sbjct: 70 RETSIVHDDDFLGYLM--VVNAKGIRPALNVEVVNKESERVEQISRVEGRSSVGVNYEEL 127
Query: 190 AASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDGILAA------ 243
A E+ + ++ G E N N +P+ E P + I A
Sbjct: 128 NAFDVENGGVVTILRG------EIAGGNCVNDEPI-----IEEPNSVKEAIENADSRVFE 176
Query: 244 ----NIEVSEQQDDIQRLDLSQDDCGD------KLPI----RRSNTRSAAINPFHSYEDI 289
N+ + +++ R+D+++D+ D +LP +S+ +S ++ + I
Sbjct: 177 EAMENVVIEDREHVDSRMDVAEDNGVDGRIEYYELPCVGEGSQSSEKSQSVKEWEDGTGI 236
Query: 290 VCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC---ADTNCKWRLRATKTAE 346
+ F SK+ + + + A K FA R + C T C+W LR K
Sbjct: 237 ELNQEFVSKEVVQDLVNIVANKYCFATATIVLDPQRLMLRCRQSKTTGCQWYLRCAKVKS 296
Query: 347 DEYFEIRRFSNIHICTQ 363
+ F +R +H C +
Sbjct: 297 SDCFSVRVHRKMHTCKR 313
>gi|356504141|ref|XP_003520857.1| PREDICTED: uncharacterized protein LOC100808214 [Glycine max]
Length = 599
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 32/75 (42%)
Query: 287 EDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAE 346
+ +V G+ F + + LK A +F ++ KS + RF C+ C WR+ K
Sbjct: 22 QPLVIGQEFADVETCRRTLKDIAIAMHFDLRIVKSDRSRFIAKCSKEGCPWRVHVAKCPG 81
Query: 347 DEYFEIRRFSNIHIC 361
F +R H C
Sbjct: 82 VPTFTVRTLHGEHTC 96
>gi|357454463|ref|XP_003597512.1| hypothetical protein MTR_2g098840 [Medicago truncatula]
gi|355486560|gb|AES67763.1| hypothetical protein MTR_2g098840 [Medicago truncatula]
Length = 196
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 70/171 (40%), Gaps = 12/171 (7%)
Query: 64 SQVRVLVMYDGKWFC---SPSGYYLYSGGQTKGIIVSDDITYKELVDRLYGIVKVDTNEY 120
S V +V Y+GK C + SG Y S + KG VS T+ L R+ K+ N+
Sbjct: 5 SNVFFIVHYNGK--CERNNVSGVYFESKVR-KGFNVSPKCTFAYLKQRIEEKFKLGNNKV 61
Query: 121 V---ITMKSLY--KANIPTMPVEIVDDDDVRFFIQENAAHPTELRSPICITIERKGSQCQ 175
V I ++ K+NI +I D+DDV F + L +C+TIE
Sbjct: 62 VSEIICRIPMFIGKSNILYESYKITDNDDVEFMFDIYKKYDVILSIELCVTIEDGNPSSN 121
Query: 176 ETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLNAFNGDPVAA 226
+ ++ TP S+ + +P E+QN A N P A
Sbjct: 122 FFQPSSSSQYFHTPKTHYGPSSSQPMSIPTPSPNAAEQQNA-ATNTHPAAT 171
>gi|242039793|ref|XP_002467291.1| hypothetical protein SORBIDRAFT_01g023040 [Sorghum bicolor]
gi|241921145|gb|EER94289.1| hypothetical protein SORBIDRAFT_01g023040 [Sorghum bicolor]
Length = 934
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 48/114 (42%), Gaps = 5/114 (4%)
Query: 284 HSYEDIVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATK 343
+SYE+ G+ K+ + K L A K + K K R C C W + +K
Sbjct: 251 YSYEEDSDGEIRRQFKNALVKYGLKAHK---SLKFDIDEKLRVRAKCDWIGCNWMIYGSK 307
Query: 344 TAEDEYFEIRRFSNIHICTQPPAKNRKKQTALQRPDGERGIRNEKPNSSYGLVK 397
T ++F++ FS++H C PP ++ T+ + P GL+K
Sbjct: 308 TTRSQWFKVVTFSDVHTC--PPRRDNNLVTSTVIAKHYYSQIKDNPTWCAGLIK 359
>gi|115444765|ref|NP_001046162.1| Os02g0192200 [Oryza sativa Japonica Group]
gi|113535693|dbj|BAF08076.1| Os02g0192200 [Oryza sativa Japonica Group]
gi|125581125|gb|EAZ22056.1| hypothetical protein OsJ_05714 [Oryza sativa Japonica Group]
gi|215678777|dbj|BAG95214.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704861|dbj|BAG94889.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 720
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 2/88 (2%)
Query: 285 SYEDIV--CGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRAT 342
S+E+ + G+ F K+ L A F ++ K+ R V C C WR+ A+
Sbjct: 139 SWENAITGAGQVFEGPKEFRDALHKYAIAHRFHYRFVKNDSSRVTVECTAEGCPWRIHAS 198
Query: 343 KTAEDEYFEIRRFSNIHICTQPPAKNRK 370
K+ + F I++ H C K+ +
Sbjct: 199 KSPAKKDFMIKKVFGSHTCESESVKSHR 226
>gi|242032183|ref|XP_002463486.1| hypothetical protein SORBIDRAFT_01g000626 [Sorghum bicolor]
gi|241917340|gb|EER90484.1| hypothetical protein SORBIDRAFT_01g000626 [Sorghum bicolor]
Length = 593
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKS-TKDRFEVVCADTNCKWRLRATKTAEDEYFEIR 353
FT L + L+ + ++ FKV S + R+ VVC ++C W++ A K F+I
Sbjct: 3 FTDLPSLKRWLQEYSVRRKRPFKVRHSYVERRYTVVCEMSDCNWKVCACKQKATGKFKIT 62
Query: 354 RFSNIHICTQPPAKNRKKQ 372
R H C Q +++ +Q
Sbjct: 63 RIVGPHTCAQMDLQHKHRQ 81
>gi|62734555|gb|AAX96664.1| transposon protein, putative, mutator sub-class [Oryza sativa
Japonica Group]
gi|77549474|gb|ABA92271.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 1210
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 39/176 (22%), Positives = 78/176 (44%), Gaps = 17/176 (9%)
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG--ILAANIEVSEQQD-DIQRLDL- 259
P + E + +NA V A G + P G D + A ++ E++D D +R++
Sbjct: 126 PSTEVNETSVEEVNAREDGGVVAPVGIQ-PGGVADEGETVGAIVDEMEREDSDNERVEEG 184
Query: 260 -SQDDCGDKLPIRRSN--------TRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAA 310
S DD + P ++ + ++ + +++ G ++ +D+ + +K A
Sbjct: 185 DSSDDETNINPAEWASEDFSGLIVSEEDSVRWEYKENEVIQGAIYSRAEDMKEAVKHFAV 244
Query: 311 KKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
+ F+V+KS + ++EV C C WR+ A K +Y+ + + H C P
Sbjct: 245 SLHREFRVAKSNRSQYEVRCVKEKDGCPWRVHAYKGKWKDYWTVSVVTK-HTCFLP 299
>gi|62732780|gb|AAX94899.1| transposon protein, putative, mutator sub-class [Oryza sativa
Japonica Group]
Length = 932
Score = 40.0 bits (92), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 39/176 (22%), Positives = 78/176 (44%), Gaps = 17/176 (9%)
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG--ILAANIEVSEQQD-DIQRLDL- 259
P + E + +NA V A G + P G D + A ++ E++D D +R++
Sbjct: 126 PSTEVNETSVEEVNAREDGGVVAPVGIQ-PGGVADEGETVGAIVDEMEREDSDNERVEEG 184
Query: 260 -SQDDCGDKLPIRRSN--------TRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAA 310
S DD + P ++ + ++ + +++ G ++ +D+ + +K A
Sbjct: 185 DSSDDETNINPAEWASEDFSGLIVSEEDSVRWEYKENEVIQGAIYSRAEDMKEAVKHFAV 244
Query: 311 KKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
+ F+V+KS + ++EV C C WR+ A K +Y+ + + H C P
Sbjct: 245 SLHREFRVAKSNRSQYEVRCVKEKDGCPWRVHAYKGKWKDYWTVSVVTK-HTCFLP 299
>gi|222622739|gb|EEE56871.1| hypothetical protein OsJ_06504 [Oryza sativa Japonica Group]
Length = 945
Score = 39.7 bits (91), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 4/64 (6%)
Query: 302 IKKLKLGAA----KKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSN 357
I++ +L A K F + + KS +RF C C WR+ A+++A+ + E++
Sbjct: 520 IEEFRLAIAQYSIKHEFEYAIEKSEPNRFRAHCPVKGCNWRIHASRSADKKSIEVKVHVA 579
Query: 358 IHIC 361
H C
Sbjct: 580 DHEC 583
>gi|4972086|emb|CAB43911.1| putative protein [Arabidopsis thaliana]
gi|7269795|emb|CAB79655.1| putative protein [Arabidopsis thaliana]
Length = 914
Score = 39.7 bits (91), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 57/274 (20%), Positives = 108/274 (39%), Gaps = 33/274 (12%)
Query: 98 DDITYKELVDRLYGIVKVDTNEYVITMKSLYKANIPTMPVEIVDDDDVRFFIQENAAHPT 157
+ I+Y LVD++ V +D E + +K Y + IV+D+DV F+ E+
Sbjct: 40 EKISYSMLVDKIMKKVAID--EASVKLKLSYNLSKVRRETYIVNDEDVFIFLTES---DE 94
Query: 158 ELRSPICITIERKGSQCQETEGYHAALDHQTPAASRSESNQISVVVPGMQATEVEEQNLN 217
E R P+ E G + E + R S ++V+ + EV E +
Sbjct: 95 ESRIPVLHVEELNGIGVERRE--------EISVPERRSSVGVNVIEDDV-LNEVCEMVHD 145
Query: 218 AFNGDPVAANFGTEAPLGCGDGILAANIEVSEQ--QDDIQRLDLSQDDCGDKLPIRRSNT 275
++ D + + ++ I E+ + + ++ +D
Sbjct: 146 GYDND-----------VNDCENVVGTEIVAIERPMERPVNSVEEQDEDDRIDYDDIHDIP 194
Query: 276 RSAAINP-FHSYED---IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVC- 330
RS + P ++D I G+ F S++ + + + A K+ F KS R + C
Sbjct: 195 RSVEVTPQVIEWDDGTGIEIGQEFCSREAVWELVNRAAKKEVFGVYTIKSDPLRLMLRCR 254
Query: 331 -ADTNCKWRLRATKTAEDEYFEIRRFSNIHICTQ 363
A C W LR +T + ++ +R +H C++
Sbjct: 255 QASKGCTWYLRVARTKKSHFWSVRVHRKMHTCSR 288
>gi|224120502|ref|XP_002331063.1| predicted protein [Populus trichocarpa]
gi|222872993|gb|EEF10124.1| predicted protein [Populus trichocarpa]
Length = 1018
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 14/101 (13%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F + L+ + + FA+++ K+ +R C C WR+ A+ ++ F
Sbjct: 364 GQDFKDVAEFRDALQKYSIARRFAYRLKKNDTNRASGRCVVEGCSWRIHASWVESEQVFR 423
Query: 352 IRRFSNIHICT------QPPAKN--------RKKQTALQRP 378
I++ + H C P KN R +QT Q+P
Sbjct: 424 IKKMNKSHTCEGESWKRATPNKNWLVSIIKDRLRQTPRQKP 464
>gi|224128171|ref|XP_002320261.1| predicted protein [Populus trichocarpa]
gi|222861034|gb|EEE98576.1| predicted protein [Populus trichocarpa]
Length = 573
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 18/78 (23%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 286 YEDIVCG--KTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATK 343
+E+ + G + F + +++ + + + F K K+ R C+ C WR+ A++
Sbjct: 4 WENCITGLHQQFNNVREVRDAFRKYSIAQGFTIKFKKNDSMRVSAKCSVDGCPWRIFASR 63
Query: 344 TAEDEYFEIRRFSNIHIC 361
+ F I+R + IH C
Sbjct: 64 LSTTHLFRIKRLNEIHTC 81
>gi|116308910|emb|CAH66041.1| OSIGBa0107A02.2 [Oryza sativa Indica Group]
Length = 863
Score = 39.7 bits (91), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 40/176 (22%), Positives = 77/176 (43%), Gaps = 17/176 (9%)
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG--ILAANIEVSEQQD-DIQRLDL- 259
P + E + +NA V A G + P G D + A ++ E++D D +R++
Sbjct: 126 PSAEVNETSVEEVNAREDRGVVAPVGIQ-PGGVADEGETVGAIVDEMEREDSDNERVEEG 184
Query: 260 -SQDDCGDKLPIRRSN--------TRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAA 310
S DD D P ++ + ++ + +++ G ++ +D+ + +K A
Sbjct: 185 DSSDDETDINPAEWASEDFSGLIVSEEDSVRWEYKENEVIQGAIYSRAEDMKEAVKYFAV 244
Query: 311 KKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
+ F+V+KS ++EV C C WR+ A K +Y+ + + H C P
Sbjct: 245 SLHREFRVAKSNLSQYEVRCVKEKDGCPWRVHAYKGKWKDYWTVSVVTK-HTCFLP 299
>gi|224115958|ref|XP_002332013.1| predicted protein [Populus trichocarpa]
gi|222875238|gb|EEF12369.1| predicted protein [Populus trichocarpa]
Length = 572
Score = 39.7 bits (91), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 29/73 (39%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+ G+ F + L+ A +F + KS K RF CA C WR+ A K
Sbjct: 1 MAVGQEFPDVFSCRRALRDTAIALHFEMQTIKSDKTRFTAKCASEGCPWRIHAAKLPGVP 60
Query: 349 YFEIRRFSNIHIC 361
F IR H C
Sbjct: 61 TFTIRTIHESHTC 73
>gi|224144106|ref|XP_002325188.1| predicted protein [Populus trichocarpa]
gi|222866622|gb|EEF03753.1| predicted protein [Populus trichocarpa]
Length = 1041
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 16/70 (22%), Positives = 34/70 (48%)
Query: 292 GKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFE 351
G+ F + + L+ + + FA+++ K+ +R C C WR+ A+ ++ F
Sbjct: 362 GQDFKNVAEFRDALQKYSIARRFAYRLKKNDTNRASGRCVVEGCSWRIHASWVESEQVFR 421
Query: 352 IRRFSNIHIC 361
I++ + H C
Sbjct: 422 IKKMNKSHTC 431
>gi|426405570|ref|YP_007024541.1| dipeptidyl aminopeptidase/acylaminoacyl-peptidase [Bdellovibrio
bacteriovorus str. Tiberius]
gi|425862238|gb|AFY03274.1| dipeptidyl aminopeptidase/acylaminoacyl-peptidase [Bdellovibrio
bacteriovorus str. Tiberius]
Length = 271
Score = 39.3 bits (90), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 71/167 (42%), Gaps = 25/167 (14%)
Query: 465 LVLHALSRGFAIITISSA-GRCWTL---------GEERLVVREIIRWWV--ERHKLEKLP 512
L+ L GFA+I + G W E+R V E++ E KL
Sbjct: 99 LIQTLLDSGFAVIAPPAIEGVAWMTNIVGIDYDTSEDRYFVEELLAAMGNGEFGKLNMDR 158
Query: 513 LVALGASSGGYFVSALA-------KGLRFSSIALMIAEG--LFDQMDIPEDYPPTLFVHM 563
L A G SSGGY S +A K L S + G F +PE++PPT+F+H
Sbjct: 159 LYATGISSGGYHSSRMAVAFPGVFKALAVHSASYADCGGPMCFVPAQVPENHPPTIFLHG 218
Query: 564 PKDSYRQ-RKIGEFLVVLRNKGIDVAEIECMEFPLSRNFFTDRIPGL 609
D R + + VL+++G+ E E P +R+ + + P L
Sbjct: 219 RLDPVVPVRTMYPYHEVLKSQGV---ETEMFVSPWARHEWLEEAPEL 262
>gi|31432047|gb|AAP53739.1| transposon protein, putative, Mutator sub-class [Oryza sativa
Japonica Group]
Length = 865
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 40/176 (22%), Positives = 77/176 (43%), Gaps = 17/176 (9%)
Query: 204 PGMQATEVEEQNLNAFNGDPVAANFGTEAPLGCGDG--ILAANIEVSEQQD-DIQRLDL- 259
P + E + +NA V A G + P G D + A ++ E++D D +R++
Sbjct: 117 PSAEVNETSVEEVNAREDGGVVAPVGIQ-PGGVADEGETVGAIVDEMEREDSDNERVEEG 175
Query: 260 -SQDDCGDKLPIRRSN--------TRSAAINPFHSYEDIVCGKTFTSKKDLIKKLKLGAA 310
S DD D P ++ + ++ + +++ G ++ +D+ + +K A
Sbjct: 176 DSSDDETDINPAEWASEDFSGLIVSEEDSVRWEYKENEVIQGAIYSRAEDMKEAVKHFAV 235
Query: 311 KKNFAFKVSKSTKDRFEVVCADT--NCKWRLRATKTAEDEYFEIRRFSNIHICTQP 364
+ F+V+KS ++EV C C WR+ A K +Y+ + + H C P
Sbjct: 236 SLHREFRVAKSNWSQYEVRCVKEKDGCPWRVHAYKGKWKDYWTVSVVTK-HTCFLP 290
>gi|284434710|gb|ADB85408.1| putative transposon protein [Phyllostachys edulis]
Length = 547
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 295 FTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATK 343
F K L +KL A +F+V KS+++++ VVC C WR+ A +
Sbjct: 65 FADKGSLQDAIKLWALSMQRSFRVLKSSQEQYTVVCEQAGCPWRVHARR 113
>gi|449442265|ref|XP_004138902.1| PREDICTED: uncharacterized protein LOC101220272 [Cucumis sativus]
gi|449506289|ref|XP_004162705.1| PREDICTED: uncharacterized protein LOC101228133 [Cucumis sativus]
Length = 594
Score = 38.9 bits (89), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 31/73 (42%)
Query: 289 IVCGKTFTSKKDLIKKLKLGAAKKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDE 348
+V G+ F + + LK A +F ++ KS + RF C+ C WR+ K
Sbjct: 20 LVIGQEFPDVETCRRMLKDIAIAMHFDIRIVKSDRSRFIAKCSKEGCPWRVHVAKCPGVP 79
Query: 349 YFEIRRFSNIHIC 361
F +R H C
Sbjct: 80 TFTVRTLHGEHTC 92
>gi|407474172|ref|YP_006788572.1| hypothetical protein Curi_c17180 [Clostridium acidurici 9a]
gi|407050680|gb|AFS78725.1| hypothetical protein Curi_c17180 [Clostridium acidurici 9a]
Length = 231
Score = 38.9 bits (89), Expect = 8.8, Method: Composition-based stats.
Identities = 42/151 (27%), Positives = 65/151 (43%), Gaps = 17/151 (11%)
Query: 511 LPLVALGASSGGYFVSALAKGLRFSSIALMIAEGLFDQMDIPEDYPPTLFVHMPKDSYRQ 570
+PLV L S G F A+G + LMI + M D T + + KDS R
Sbjct: 56 IPLVLLFISIKGIFNLKRAEGRNKLLLFLMIITVIMPMMKWSLDITRTGYYWIKKDSLRS 115
Query: 571 RKIGEFLVVL---RNKGIDVAEIECMEFPLSRNFFTDRI---PGLEQANSTKLFELFQEK 624
IG+ V L +K +E +++ +++N F R+ L+ N+ +EL
Sbjct: 116 VDIGDSKVSLSGSNDKLTMTLNLELIDYGMTQNNFKIRVYLPKALKTYNNQDFYEL---- 171
Query: 625 GFIDENGYMRSDGRRTRWKEALRESNAIFLN 655
EN Y+ + RR R +RE + LN
Sbjct: 172 ----ENEYLTNGNRRKR---TIREEIVLNLN 195
>gi|109289903|gb|AAP45178.2| Zinc knuckle family protein [Solanum bulbocastanum]
Length = 558
Score = 38.9 bits (89), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 311 KKNFAFKVSKSTKDRFEVVCADTNCKWRLRATKTAEDEYFEIRRFSNIHIC 361
KK+ FK ++S++ R +C NCKW + A+K EDE F I+ H C
Sbjct: 75 KKDIKFKKNESSRAR--AICKVLNCKWFIYASKANEDEPFMIKTIGPDHSC 123
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,275,617,130
Number of Sequences: 23463169
Number of extensions: 492617754
Number of successful extensions: 1285163
Number of sequences better than 100.0: 469
Number of HSP's better than 100.0 without gapping: 316
Number of HSP's successfully gapped in prelim test: 153
Number of HSP's that attempted gapping in prelim test: 1284508
Number of HSP's gapped (non-prelim): 538
length of query: 700
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 550
effective length of database: 8,839,720,017
effective search space: 4861846009350
effective search space used: 4861846009350
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)