BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007176
         (614 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255550299|ref|XP_002516200.1| DNA binding protein, putative [Ricinus communis]
 gi|223544686|gb|EEF46202.1| DNA binding protein, putative [Ricinus communis]
          Length = 749

 Score = 1092 bits (2823), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/619 (84%), Positives = 556/619 (89%), Gaps = 5/619 (0%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
            +GVPYVIYWK +FSCYAA HF QALLSVVQSSCSHT DAFQLAHASF LYCVRNN  ++
Sbjct: 131 FKGVPYVIYWKSTFSCYAAAHFRQALLSVVQSSCSHTCDAFQLAHASFSLYCVRNNTGLS 190

Query: 61  SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           SN+QK   K GP LLG+PPKIDI L E DVQ EE+S   LPAIKIYDDDVTMRFLVC +P
Sbjct: 191 SNNQKVGGKPGPRLLGEPPKIDITLPEADVQDEESSSGTLPAIKIYDDDVTMRFLVCELP 250

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
            TLD  LLG LEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG FSRGVVTMRCDLSTCSS
Sbjct: 251 STLDACLLGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGVVTMRCDLSTCSS 310

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           AHISLLVSGSAQ CFNDQLLENHIKNELIENSQLVHALP+S +++L  SEPRKSASI CG
Sbjct: 311 AHISLLVSGSAQACFNDQLLENHIKNELIENSQLVHALPSSEESKLLTSEPRKSASIGCG 370

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           ASVFEV +KV +WASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK+D ERLLFFCTR
Sbjct: 371 ASVFEVCLKVPSWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKEDTERLLFFCTR 430

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRE-----SKGVESENVCNVRPKLNSAAM 355
           QGK  +  NS++ +PP WL  PAPSRKRSEPCRE     SKG+E EN  +V+ KLN AAM
Sbjct: 431 QGKELYPNNSIIIKPPCWLIPPAPSRKRSEPCRETKLFTSKGLERENGGSVKQKLNVAAM 490

Query: 356 RPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQ 415
           RPIPHTRH+KMLPFSGF+E ERYDGDQ K +LPVAP KH   GP PV+HRKSLSSSYQAQ
Sbjct: 491 RPIPHTRHHKMLPFSGFAEGERYDGDQGKPSLPVAPAKHGVVGPAPVSHRKSLSSSYQAQ 550

Query: 416 QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 475
           QIISLNPLPLKKHGCGRAPIQ CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA
Sbjct: 551 QIISLNPLPLKKHGCGRAPIQACSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 610

Query: 476 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE 535
           KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE
Sbjct: 611 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE 670

Query: 536 YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 595
           YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+C
Sbjct: 671 YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYIC 730

Query: 596 PQCSVTNFKKKSQKTSNGY 614
           P CS+ NF+KKSQKT+NGY
Sbjct: 731 PHCSIANFRKKSQKTANGY 749


>gi|224141545|ref|XP_002324130.1| predicted protein [Populus trichocarpa]
 gi|222865564|gb|EEF02695.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score = 1047 bits (2708), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 519/617 (84%), Positives = 543/617 (88%), Gaps = 5/617 (0%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GVPYVIYWK +FSCYAA HF QALLSVVQSSCSHT DAFQLAHASFRLYCV+NN   AS
Sbjct: 130 KGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFRLYCVQNNNTPAS 189

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
           NSQK   K GP LLGDPPK DI+L E D QGEE S   LPAIKIYDDDVTMRFLVCG+  
Sbjct: 190 NSQKVGGKPGPRLLGDPPKFDISLPEADDQGEEGSSGALPAIKIYDDDVTMRFLVCGLTG 249

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD   LG LEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG FSRGVVTMRCDLSTCSSA
Sbjct: 250 TLDACALGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGVVTMRCDLSTCSSA 309

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           HISLLVSGSAQ CFNDQLLENHIK+ELIENSQLVHA  +S +++ P SEPRKSASIACGA
Sbjct: 310 HISLLVSGSAQNCFNDQLLENHIKSELIENSQLVHASTSSDESKSPSSEPRKSASIACGA 369

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           SVFEVSMKV TWASQVLRQLAPDV+YRSLVMLGIASIQGLSVASFEKDDA+RLLFFCT+Q
Sbjct: 370 SVFEVSMKVPTWASQVLRQLAPDVTYRSLVMLGIASIQGLSVASFEKDDADRLLFFCTKQ 429

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK----GVESENVCNVRPKLNSAAMRP 357
            K  H  N VLTR PSWL  PAP RKR EP RE+K    G   EN  N + KL  AAMRP
Sbjct: 430 SKDPHPRNPVLTRHPSWLIPPAPCRKRYEPSRETKPLTFGCGGENGGNFKQKLYVAAMRP 489

Query: 358 IPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQI 417
           IPHTR +KMLPFSGF E ERYDG+Q K +LP  P KHS  GP PVTHRKSLS+SYQAQQI
Sbjct: 490 IPHTRRHKMLPFSGFLEAERYDGEQTKPSLPPPP-KHSVVGPAPVTHRKSLSNSYQAQQI 548

Query: 418 ISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKR 477
           ISLNPLPLKKHGCGR+PIQ CSEEEFLRDVMQFLILRGH+RLVPQGGLAEFPDAILNAKR
Sbjct: 549 ISLNPLPLKKHGCGRSPIQACSEEEFLRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKR 608

Query: 478 LDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
           LDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE
Sbjct: 609 LDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 668

Query: 538 LAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQ 597
           LAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP 
Sbjct: 669 LAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPH 728

Query: 598 CSVTNFKKKSQKTSNGY 614
           CS+ NFKKKSQK +NGY
Sbjct: 729 CSIANFKKKSQKNANGY 745


>gi|225444591|ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis
           vinifera]
 gi|297738501|emb|CBI27746.3| unnamed protein product [Vitis vinifera]
          Length = 739

 Score = 1023 bits (2645), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 502/620 (80%), Positives = 540/620 (87%), Gaps = 14/620 (2%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GV YVIYWK++FSCYAACHF QAL SVVQSSCSHTWDAFQLAHASFRLYCV+NN V  S
Sbjct: 127 KGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRLYCVQNNTV-PS 185

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
           N+QK S KLGP LLGDPPKI++   E+D   EE+ P  LP IKIYD DV+MRFLVCG P 
Sbjct: 186 NNQKVSGKLGPCLLGDPPKINVVPPEVD--EEESLPATLPVIKIYDADVSMRFLVCGAPS 243

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
            LD  LLG LEDGLNALL IEIRGSKLHNR SAPPPPLQAG FSRGVVTMRCDLSTCSSA
Sbjct: 244 ALDACLLGSLEDGLNALLCIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRCDLSTCSSA 303

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           HISLLVSGSAQTC NDQLLE++IKNELIE SQLVHA+P+  +++L  SEPR+SASIACGA
Sbjct: 304 HISLLVSGSAQTCLNDQLLESYIKNELIEKSQLVHAVPSCEESKLSSSEPRRSASIACGA 363

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           SVFEV +KV TWASQVLRQLAPDVSYRSLV LGIASIQGLSVASFEKDDA+RLLFFCTR 
Sbjct: 364 SVFEVRIKVPTWASQVLRQLAPDVSYRSLVTLGIASIQGLSVASFEKDDADRLLFFCTRH 423

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESENVCNVRPKLNSAA 354
            K  +  NS+L RPPSWL +P  SRKRS PC E+K       G  +  V   +PK+  AA
Sbjct: 424 AKQLNQNNSILPRPPSWLIAPPASRKRSGPCHETKPSGYKVLGGVNGGVLQQKPKI--AA 481

Query: 355 MRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQA 414
           MRPIPHTR++KMLPFSG SE  R DGDQ K NL V P KH+  G TPVTHRK LSSS+QA
Sbjct: 482 MRPIPHTRNHKMLPFSGISEASRCDGDQAKGNLSVVPAKHN--GTTPVTHRKLLSSSFQA 539

Query: 415 QQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILN 474
           QQIISLNPLPLKKHGCGR+PIQ+CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILN
Sbjct: 540 QQIISLNPLPLKKHGCGRSPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILN 599

Query: 475 AKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL 534
           AKRLDL+NLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL
Sbjct: 600 AKRLDLYNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL 659

Query: 535 EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYV 594
           EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+
Sbjct: 660 EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYI 719

Query: 595 CPQCSVTNFKKKSQKTSNGY 614
           CP CS+TNF+KKSQKT+NGY
Sbjct: 720 CPHCSITNFQKKSQKTANGY 739


>gi|297742936|emb|CBI35803.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  991 bits (2562), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 483/615 (78%), Positives = 524/615 (85%), Gaps = 2/615 (0%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYWK++FSCYAACHF  AL SVVQSS +HTWDAFQLA+ASFRLYCVRNN V+ +
Sbjct: 131 KGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFRLYCVRNNHVLPA 190

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           NS K S KLGP LLGDP  ID+   E+D  + EE S   LPAIKIYDDDV +RFLVCG P
Sbjct: 191 NSHKVSGKLGPRLLGDPATIDVPPPEVDAGEDEEGSLGTLPAIKIYDDDVGIRFLVCGEP 250

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           C LD+ L   LEDGLNALL+IEIRGSKLHNR SAPPPPLQAG FSRGVVTMRCDLSTCSS
Sbjct: 251 CMLDSCLFESLEDGLNALLSIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRCDLSTCSS 310

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           AHISLLVSGSAQTCF+DQLLEN+IK E+ E SQLVHALP S  N+ P SEPR+SASIACG
Sbjct: 311 AHISLLVSGSAQTCFDDQLLENNIKKEVTEQSQLVHALPYSEGNKPPLSEPRRSASIACG 370

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           A+VFEV  KV  WASQVLRQLAPDVSYRSLV LGIASIQGL+VASFEKDDA RLLFFCTR
Sbjct: 371 AAVFEVCAKVPAWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDDANRLLFFCTR 430

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
           QGK  H  N   +R PSWL  P PSRKR EP +++    +  +     +L  AAMRPIPH
Sbjct: 431 QGKYIHPNNFTPSRLPSWLKPPPPSRKRVEPSQDTMNGVTMPLLPAGQRLKVAAMRPIPH 490

Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRKSLSSSYQAQQIIS 419
            RH+KMLPFSG SE + +DG QVKANL V P  KHS  G T   HRKS SSSYQA+QIIS
Sbjct: 491 IRHHKMLPFSGISEADGHDGGQVKANLSVPPPTKHSIVGSTSAMHRKSFSSSYQAKQIIS 550

Query: 420 LNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLD 479
           LNPLPLKKHGCGR+PI+VCSEEEFL+DVMQFL LRGHTRL+PQGGLAEFPDAILNAKRLD
Sbjct: 551 LNPLPLKKHGCGRSPIRVCSEEEFLKDVMQFLNLRGHTRLIPQGGLAEFPDAILNAKRLD 610

Query: 480 LFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELA 539
           L+NLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELA
Sbjct: 611 LYNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELA 670

Query: 540 HDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
           HDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPQCS
Sbjct: 671 HDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCS 730

Query: 600 VTNFKKKSQKTSNGY 614
           VTNFKKK+ K  NG+
Sbjct: 731 VTNFKKKANKAPNGF 745


>gi|356530473|ref|XP_003533805.1| PREDICTED: uncharacterized protein LOC100808357 [Glycine max]
          Length = 752

 Score =  974 bits (2519), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 468/619 (75%), Positives = 522/619 (84%), Gaps = 7/619 (1%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GVPY IYWK+ FS YAA HF  +L SV QS+ SHTWDAFQLA ASFRLYC+ NN V+ S
Sbjct: 135 KGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-VLPS 193

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           N  KG+ KLGP +LG PP ID++    D+ + EE+SPE + A+KIYDDDV MRFL+CGVP
Sbjct: 194 NCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKIYDDDVNMRFLICGVP 253

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           CTLD  LLG LEDGLNALL  EIRG KLHNRTSA PPPLQAG FSRGVVTMRCD+STCSS
Sbjct: 254 CTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDISTCSS 313

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           AHISLLVSGSA TCFNDQLLENHIK ELIE SQLV A PN   ++ P SEPR+SAS+ACG
Sbjct: 314 AHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASVACG 373

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           +SVFEV M+V  WASQVLRQLAP++SYRSLVMLGIASIQGL VASF KDDAERLLFFCTR
Sbjct: 374 SSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFFCTR 433

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVES-----ENVCNVRPKLNSAAM 355
           Q K +   + V +  PSWL  P+ SRKRSEPC  SK +       E + + R K N A+M
Sbjct: 434 QEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGVEAIGSHRQKFNLASM 493

Query: 356 RPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQ 415
           RPIPH+  +K+LPFSG SE  RYDGD  K+NLP+AP+KH+ +GPT VT+RKS+S+S+QA 
Sbjct: 494 RPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLPLAPIKHNVSGPTSVTNRKSVSNSFQAH 553

Query: 416 QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 475
           QIISLNPLP+KKHGC RAPI+ CSEEEFLRDVMQFLILRGH RL+P GGLAEFPDAILNA
Sbjct: 554 QIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRGHNRLIPPGGLAEFPDAILNA 613

Query: 476 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE 535
           KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLE
Sbjct: 614 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLE 673

Query: 536 YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 595
           YEL+HDDVDGECCL+CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC
Sbjct: 674 YELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 733

Query: 596 PQCSVTNFKKKSQKTSNGY 614
           P+CS   F KKSQKT+NG+
Sbjct: 734 PRCSALKFSKKSQKTANGF 752


>gi|356556430|ref|XP_003546529.1| PREDICTED: uncharacterized protein LOC100775894 [Glycine max]
          Length = 750

 Score =  974 bits (2517), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 464/618 (75%), Positives = 521/618 (84%), Gaps = 6/618 (0%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GVPY IYWK+ FS YAA HF  +  SV QS+ SHTWDAFQLA ASFRLYCV+NN V+ S
Sbjct: 134 KGVPYTIYWKNDFSKYAASHFRHSFFSVAQSTSSHTWDAFQLALASFRLYCVQNN-VLPS 192

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
           NS KG+ KLGP +LG PP ID++    D++ EE SPE + ++KIYDDDV MRFL+CGVPC
Sbjct: 193 NSHKGAGKLGPQILGVPPNIDVSPCVADMKEEEGSPETISSLKIYDDDVNMRFLICGVPC 252

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD  LLG LEDGLNALL  EIRG KLHNRTSA PPPLQAG FSRGVVTMRCD+STCSSA
Sbjct: 253 TLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDISTCSSA 312

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           HISLLVSGSA TCFNDQLLENHIK ELIE SQLV A PN   ++ P SEPR+SAS+ACG+
Sbjct: 313 HISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHQQSKAPSSEPRRSASVACGS 372

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           SVFEV M+V  WASQVLRQLAP+++YRSLVMLGIASIQ L VASF KDDAERLLFFCTRQ
Sbjct: 373 SVFEVCMRVPAWASQVLRQLAPNLAYRSLVMLGIASIQALPVASFSKDDAERLLFFCTRQ 432

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVES-----ENVCNVRPKLNSAAMR 356
            K +  ++ V +  PSWL  P PSRKRSEPC  SK + +     E + + R K   A+MR
Sbjct: 433 EKENCPKDHVFSGIPSWLKPPPPSRKRSEPCSSSKSINASGRGVEAIGSHRQKFIVASMR 492

Query: 357 PIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQ 416
           PIPH+  +K+LPFSG SE  RYDGD  K+NLP+A +KH+ +GPT VT+RKS+S+S+QA Q
Sbjct: 493 PIPHSNRHKILPFSGLSEGTRYDGDHGKSNLPLALIKHNVSGPTSVTNRKSVSNSFQAHQ 552

Query: 417 IISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK 476
           IISLNPLP+KKHGC RAPI+ CSEEEFLRDVMQFLILRGH RL+P GGL+EFPDAILNAK
Sbjct: 553 IISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRGHNRLIPPGGLSEFPDAILNAK 612

Query: 477 RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
           RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEY
Sbjct: 613 RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEY 672

Query: 537 ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP 596
           EL+HDDVDGECCL+CHSSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP
Sbjct: 673 ELSHDDVDGECCLMCHSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP 732

Query: 597 QCSVTNFKKKSQKTSNGY 614
           +CS   F KKSQKT+NGY
Sbjct: 733 RCSALKFSKKSQKTANGY 750


>gi|42565391|ref|NP_189910.2| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
           thaliana]
 gi|75223288|sp|Q6NQ79.1|ARID4_ARATH RecName: Full=AT-rich interactive domain-containing protein 4;
           Short=ARID domain-containing protein 4
 gi|34849889|gb|AAQ82841.1| At3g43240 [Arabidopsis thaliana]
 gi|332644257|gb|AEE77778.1| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
           thaliana]
          Length = 747

 Score =  948 bits (2450), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 462/613 (75%), Positives = 527/613 (85%), Gaps = 4/613 (0%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GV YVIYWK+ FS YAACHF  +L SV+QSSCS TWD F +A ASFRLYC  +N V+ S
Sbjct: 133 KGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 192

Query: 62  NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           NS +K + ++GP LLG+PPKID+   E D   EENS E+LP+IKIYD+DVT+RFL+CG P
Sbjct: 193 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVRFLLCGPP 252

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           CT+DT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 253 CTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 312

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           AHIS+LVSG+AQTCF+DQLLENHIK+E++E  QLVH++ NS + +   SEPR+SASIACG
Sbjct: 313 AHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRGFSEPRRSASIACG 372

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 373 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 432

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
           Q       +++L++ P+WLT P P+RKRSEPCRESK  E EN      K+N AA+RPIPH
Sbjct: 433 QINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 490

Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
           TR +KM+PFSG+SEI R+DGD  K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 491 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 549

Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
           NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGHTRLVP GGLAEFPDA+LN+KRLDL
Sbjct: 550 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDL 609

Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
           FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 610 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 669

Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 670 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 729

Query: 601 TNFKKKSQKTSNG 613
           +N++KKSQKTSNG
Sbjct: 730 SNYRKKSQKTSNG 742


>gi|297818776|ref|XP_002877271.1| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
           lyrata subsp. lyrata]
 gi|297323109|gb|EFH53530.1| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 748

 Score =  942 bits (2435), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 461/613 (75%), Positives = 523/613 (85%), Gaps = 4/613 (0%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GV YVIYWK  FS YAACHF  AL SV+QSSCS TWD F +A ASFRLYC  +N V+ S
Sbjct: 134 KGVQYVIYWKDVFSKYAACHFRHALFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 193

Query: 62  NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           NS +K + ++GP LLG+PPKID+   E D   EENS E+LP+IKIYD+DVT+RFL+CG P
Sbjct: 194 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVRFLLCGPP 253

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           CTLDT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 254 CTLDTLLLGSLVDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 313

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           AHIS+LVSG+AQTCF+DQLLENHIK+E++E  QLVH + NS + +   SEPR+SASIACG
Sbjct: 314 AHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHPVGNSEETKRGFSEPRRSASIACG 373

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 374 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 433

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
           Q       +++L++ P WLT P P+RKRSEPCRESK  E EN      K+N AA+RPIPH
Sbjct: 434 QINDTSNHDTLLSKIPHWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 491

Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
           TR +KM+PFSG+SEI R+DGD  K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 492 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 550

Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
           NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGH+RLVP GGLAEFPDA+LN+KRLDL
Sbjct: 551 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHSRLVPPGGLAEFPDAVLNSKRLDL 610

Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
           FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 611 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 670

Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 671 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 730

Query: 601 TNFKKKSQKTSNG 613
           +N++KK+ KTSNG
Sbjct: 731 SNYRKKTPKTSNG 743


>gi|7649364|emb|CAB89045.1| putative protein [Arabidopsis thaliana]
          Length = 717

 Score =  874 bits (2257), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/613 (71%), Positives = 496/613 (80%), Gaps = 32/613 (5%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GV YVIYWK+ FS YAACHF  +L SV+QSSCS TWD F +A ASFRLYC  +N V+ S
Sbjct: 131 KGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 190

Query: 62  NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           NS +K + ++GP LLG+PPKID+   E D   EENS E+LP+IKIYD+DVT+ +    + 
Sbjct: 191 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVSYKAFSL- 249

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
              DT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 250 -FQDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 308

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           AHIS+LVSG+AQTCF +                           +   SEPR+SASIACG
Sbjct: 309 AHISMLVSGNAQTCFKE--------------------------TKRGFSEPRRSASIACG 342

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 343 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 402

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
           Q       +++L++ P+WLT P P+RKRSEPCRESK  E EN      K+N AA+RPIPH
Sbjct: 403 QINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 460

Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
           TR +KM+PFSG+SEI R+DGD  K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 461 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 519

Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
           NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGHTRLVP GGLAEFPDA+LN+KRLDL
Sbjct: 520 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDL 579

Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
           FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 580 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 639

Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 640 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 699

Query: 601 TNFKKKSQKTSNG 613
           +N++KKSQKTSNG
Sbjct: 700 SNYRKKSQKTSNG 712


>gi|357514291|ref|XP_003627434.1| Fiber protein Fb21 [Medicago truncatula]
 gi|355521456|gb|AET01910.1| Fiber protein Fb21 [Medicago truncatula]
          Length = 769

 Score =  858 bits (2216), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/635 (67%), Positives = 496/635 (78%), Gaps = 25/635 (3%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
           ++G+PYV++WK++FS YAACHF QAL SVVQSS +HTWDAF LA ASF LYCV+NN V+ 
Sbjct: 129 LKGIPYVVFWKNAFSQYAACHFRQALFSVVQSSSTHTWDAFHLARASFELYCVQNNQVLP 188

Query: 61  SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPEN--LPAIKIYDDDVTMRFLVCG 118
           ++S    S +GPHLLG+  KI++   EMD + ++    +  LP+I+I+DD+V +RFL+CG
Sbjct: 189 TDSNDADSDMGPHLLGECLKINVDPPEMDEEDDDEESSSGSLPSIQIHDDEVNLRFLICG 248

Query: 119 VPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTC 178
            P T+D SLL  LEDGL ALL IE+R  KLH + SAPPPPLQA +FSRGVVTMRCD+STC
Sbjct: 249 APSTVDESLLRSLEDGLRALLTIEMRSCKLHGKYSAPPPPLQAASFSRGVVTMRCDISTC 308

Query: 179 SSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIA 238
           SSAHISLLVSGS Q CFNDQLLENHIKNE+IE SQ+VHA  N   N    SEPR+SASIA
Sbjct: 309 SSAHISLLVSGSPQACFNDQLLENHIKNEIIEKSQIVHARLNGEANTQIISEPRRSASIA 368

Query: 239 CGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFC 298
           CGA++FEVSMK+  WA Q+LRQLAPDVSYRSLV LGIASIQGL VASFEKDDAERLLFF 
Sbjct: 369 CGATIFEVSMKLPQWALQILRQLAPDVSYRSLVALGIASIQGLPVASFEKDDAERLLFFY 428

Query: 299 TRQGKAD-HTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVE------------------ 339
               K      N V +RPP WL  P P+RKR   C  S+G                    
Sbjct: 429 QSSAKDGCDNGNIVFSRPPVWLKPPPPTRKR---CESSQGASPDIHNDEEEKDRKMVNGI 485

Query: 340 SENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGP 399
           S  +   R +L  +AMRPIPH R ++M PFSG S +  + G  V+A +P+ P+K SS G 
Sbjct: 486 STPLTPARQRLKVSAMRPIPHVRRHRMTPFSGPSGVNGFGGPHVEAYVPLVPVKRSSIGS 545

Query: 400 TPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRL 459
           +  T RKS SSS Q +Q+ISLNPLPLKKHGC R  +Q CSEEEF++DVM+FLILRGH+RL
Sbjct: 546 SSATQRKSFSSSSQPKQVISLNPLPLKKHGCSRGSVQTCSEEEFIKDVMEFLILRGHSRL 605

Query: 460 VPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMT 519
           +PQGGLAEFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T TNRMT
Sbjct: 606 IPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMGNYTSTNRMT 665

Query: 520 GVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGL 579
           GVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGL
Sbjct: 666 GVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGL 725

Query: 580 GAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
           GAFKDYAKTDGLEY+CP CSVTNFKKK Q  +NGY
Sbjct: 726 GAFKDYAKTDGLEYICPHCSVTNFKKK-QSVANGY 759


>gi|356559199|ref|XP_003547888.1| PREDICTED: uncharacterized protein LOC100816540 [Glycine max]
          Length = 782

 Score =  855 bits (2208), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 446/645 (69%), Positives = 512/645 (79%), Gaps = 32/645 (4%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
           ++G+PYVI+WK++FSCYAACHF QA LSVVQSS +HTWDAF LA ASF LYCV+NN V+ 
Sbjct: 129 LKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYCVQNNQVLP 188

Query: 61  SNSQKGSSKLGPHLLGDPPKIDI-ALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGV 119
           S+S   SS++GPHLLGD  KI++      +   +E+S  +LPAIKI++D+V +RFL+CG 
Sbjct: 189 SDSDDASSEMGPHLLGDCLKINVDPPEIDEEDDDESSSGSLPAIKIHEDEVNLRFLICGA 248

Query: 120 PCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCS 179
           P T+D SLL  LEDGL ALL IEIRG KLH + SAPPPPLQA AFSRGVVTMRCD+STCS
Sbjct: 249 PSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAAAFSRGVVTMRCDISTCS 308

Query: 180 SAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIAC 239
           SAHISLLVSGSAQTCFNDQLLENHIKNE+IE SQLVHA  N+  N+    EPR+SASIAC
Sbjct: 309 SAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQLNNEGNKENICEPRRSASIAC 368

Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
           GASVFE+ MK+  WA Q+LRQLAP+VSYRSLV LGIASIQGL +ASFEKDDAERLLFF  
Sbjct: 369 GASVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDDAERLLFFYQ 428

Query: 300 RQGKADHTE--NSVLTRPPSWLTSPAPSRKRSEPCRES-----KGVESEN--VCNV---- 346
              K   T   N + + PP WL  P P+RKR EP +E+     +GV +    VC +    
Sbjct: 429 NCEKDSCTNKNNIIFSSPPGWLKPPPPTRKRCEPRQEASPGLHEGVFAGQGGVCKLNEEE 488

Query: 347 ----------------RPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLP-V 389
                           R +L  +AMRPIPH R ++M PF G SE + +DG QV+A LP V
Sbjct: 489 KDRKIVNGISMPLTPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGTQVEAILPLV 548

Query: 390 APLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQ 449
           AP K +S G T  THRKS SS+ Q++Q+ISLNPLPLKKHGCGR P+Q CSEEEFL+DVM+
Sbjct: 549 APTKRTSIGSTSGTHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEEEFLKDVME 608

Query: 450 FLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKM 509
           FLILRGH RL+PQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM
Sbjct: 609 FLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKM 668

Query: 510 RNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWA 569
           RN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWA
Sbjct: 669 RNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWA 728

Query: 570 HFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
           HFGCDRRQGLGAFKDYAKTDGLEY+CP CSVTNFKKK Q  +NGY
Sbjct: 729 HFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVANGY 772


>gi|449517916|ref|XP_004165990.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
           [Cucumis sativus]
          Length = 772

 Score =  854 bits (2207), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/632 (67%), Positives = 499/632 (78%), Gaps = 20/632 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PY+IYW  +FSCYAA HF  ALLSVVQSS +HTWDAFQLA A+FRLY V +N  +  
Sbjct: 130 KGIPYLIYWNSTFSCYAAAHFRHALLSVVQSSSTHTWDAFQLARAAFRLYSVGSNYGLPG 189

Query: 62  NSQKGS-SKLGPHLLGDPPKIDIALSEMDV---QGEENSPENLPAIKIYDDDVTMRFLVC 117
            +   + S L P L+G+P KID+   E+DV   + E+ S E LPAI I+D++VTMRFL+C
Sbjct: 190 IADDSTMSDLEPQLIGEPLKIDVEPPELDVGEGEDEDGSLEALPAINIHDNNVTMRFLIC 249

Query: 118 GVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLST 177
           GVPCT DT LL  LEDGL+ALL IE+RGSKL  + SAPPPPLQAG+FSRGVVTMRCD+ T
Sbjct: 250 GVPCTPDTCLLRSLEDGLDALLKIEMRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVT 309

Query: 178 CSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASI 237
           CSSAHIS+LVSGSA TCF+DQLLE HIK+E+IE++QLVHA+ +   N+    +PRKSASI
Sbjct: 310 CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEHNQLVHAIHDCEGNKHHMHKPRKSASI 369

Query: 238 ACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFF 297
           ACGA+VFEVSMKV  WASQVLRQLAPD+SYRSLV LGI  +QGL VASFEK+DAERLLFF
Sbjct: 370 ACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDAERLLFF 429

Query: 298 CTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------ 345
           C+  G   H+E  +++  PSW   P PSRKR EP +  +   S ++              
Sbjct: 430 CSGDGNDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRNSLSHDIGREDPVPMNGFKAS 489

Query: 346 ---VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLP-VAPLKHSSAGPTP 401
               R KL  A+MRP+P     KM PF+G +E++  +G   KA+L  V P KH + G T 
Sbjct: 490 LHPARKKLKVASMRPVPRLHRNKMTPFAGLTEVDGNNGGLSKASLSIVTPPKHVTVGSTS 549

Query: 402 VTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVP 461
            THRKS SSS Q++QIISLNPLPLKKHGCGR PIQ CSEEEFL+DVM+FL+LRGHTRL+P
Sbjct: 550 ATHRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHTRLIP 609

Query: 462 QGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGV 521
           QGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGV
Sbjct: 610 QGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGV 669

Query: 522 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 581
           GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA
Sbjct: 670 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 729

Query: 582 FKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
           FKDYAKTDGLEYVCP CS+T +KKK  + +NG
Sbjct: 730 FKDYAKTDGLEYVCPHCSITTYKKKPHRVANG 761


>gi|449453463|ref|XP_004144477.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
           [Cucumis sativus]
          Length = 772

 Score =  846 bits (2186), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/632 (67%), Positives = 496/632 (78%), Gaps = 20/632 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PY+IYW  +FSCYAA HF  ALLSVVQSS +HTWDAFQLA A+FRLY V +N  +  
Sbjct: 130 KGIPYLIYWNSTFSCYAAAHFRHALLSVVQSSSTHTWDAFQLARAAFRLYSVGSNYGLPG 189

Query: 62  NSQKGS-SKLGPHLLGDPPKIDIALSEMDV---QGEENSPENLPAIKIYDDDVTMRFLVC 117
            +     S L P L+G+P KID+   E+DV   + E+ S E LPAI I+D++VTMRFL+C
Sbjct: 190 IADDSMMSDLEPQLIGEPLKIDVEPPELDVGEGEDEDGSLEALPAINIHDNNVTMRFLIC 249

Query: 118 GVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLST 177
           GVPCT DT LL  LEDGL+ALL IE+RGSKL  + SAPPPPLQAG+FSRGVVTMRCD+ T
Sbjct: 250 GVPCTPDTCLLRSLEDGLDALLKIEMRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVT 309

Query: 178 CSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASI 237
           CSSAHIS+LVSGSA TCF+DQLLE HIK+E+IE++QLVHA+ +   N+    +PRKSASI
Sbjct: 310 CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEHNQLVHAIHDCEGNKHHMHKPRKSASI 369

Query: 238 ACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFF 297
           ACGA+VFEVSMKV  WASQVLRQLAPD+SYRSLV LGI  +QGL VASFEK+DAERLLFF
Sbjct: 370 ACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDAERLLFF 429

Query: 298 CTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------ 345
           C+  G   H+E  +++  PSW   P PSRKR EP +  +   S ++              
Sbjct: 430 CSGDGNDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRNSLSHDIGREDPVPMNGFKAS 489

Query: 346 ---VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLP-VAPLKHSSAGPTP 401
               R KL  A+MRP+P     KM PF+G +E++  +G   KA+L  V P KH + G T 
Sbjct: 490 LHPARKKLKVASMRPVPRLHRNKMTPFAGLTEVDGNNGGLSKASLSIVTPPKHVTVGSTS 549

Query: 402 VTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVP 461
            THRKS SSS Q++QIISLNPLPLKKHGCGR PIQ CSEEEFL+DVM+FL+LRGHTRL+P
Sbjct: 550 ATHRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHTRLIP 609

Query: 462 QGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGV 521
           QGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGV
Sbjct: 610 QGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGV 669

Query: 522 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 581
           GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA
Sbjct: 670 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 729

Query: 582 FKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
           FK YAKTDGLEYVC  CS+T +KKK  + +NG
Sbjct: 730 FKVYAKTDGLEYVCSHCSITTYKKKPHRVANG 761


>gi|449448140|ref|XP_004141824.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
           [Cucumis sativus]
          Length = 781

 Score =  830 bits (2144), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 415/645 (64%), Positives = 487/645 (75%), Gaps = 38/645 (5%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW+ +F+CYAACHF  A LSV+QSS +HTWDAFQLAHASFR+YC+ NN V+ S
Sbjct: 132 KGIPYVIYWRSAFTCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRMYCLGNNFVLPS 191

Query: 62  NSQKGSSK-LGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           +S K  S+ LGPHLLG+  KI++   E +V  +E S     ++ I D+DV MRFLVCG P
Sbjct: 192 SSHKEVSEDLGPHLLGERLKINVEPLEKEVADDEESSSEGISVNILDNDVEMRFLVCGEP 251

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
            +LD  +L  LEDGLNALL+IEIRGSKLH + SAPPPPLQAG  S GVVTMRCDLSTCS 
Sbjct: 252 GSLDAYVLEALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGTLSNGVVTMRCDLSTCSF 311

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNS-GDNRLPPSEPRKSASIAC 239
           AHISLLVSGSAQ CF+DQL EN+IK E+I+  +LV  L +S G   L  SEPRKS SIAC
Sbjct: 312 AHISLLVSGSAQACFDDQLFENYIKTEIIDRGELVQTLLDSEGSKHL--SEPRKSTSIAC 369

Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
           GA+VFEVS+KV +WASQ+ RQLAPDVSYRSLV LGIASIQGLSVASFEKDDAERLLFFC+
Sbjct: 370 GATVFEVSLKVPSWASQIFRQLAPDVSYRSLVGLGIASIQGLSVASFEKDDAERLLFFCS 429

Query: 300 RQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNV------------- 346
           R+       N   +  PSWL  PAP RKR +  +++     E + ++             
Sbjct: 430 RKENDLFLSNLTDSTLPSWLKPPAP-RKRPKYIKDTSLGSHEIIEHLKVSPGSRIHGANM 488

Query: 347 -----------------RPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPV 389
                            R  +  AAMRPIPH   +KM+ F G SE   ++G  +KA++P 
Sbjct: 489 EIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFHGISETGGHNGSLLKASVPS 548

Query: 390 A-PLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVM 448
           + P KH + G   V  +K   S+   +QII +NPLPLKKHGCGR+ IQ C EEEFL+D+M
Sbjct: 549 SNPTKHVTVGSASVFQQKVFPSASHYKQIIPMNPLPLKKHGCGRSHIQACFEEEFLKDLM 608

Query: 449 QFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSK 508
           QFL LRGH+RL+P GGLAEFPDAILN KRLDL+NLY+EVVSRGGF VGNGINWKGQ+FSK
Sbjct: 609 QFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGINWKGQIFSK 668

Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
           MRN+T+TNRMTGVGNTLKRHYETYLLEYELAH+DVDGECCLLCHSSAAGDWVNCGICGEW
Sbjct: 669 MRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWVNCGICGEW 728

Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
           AHFGCDRRQGLGAFKDYAKTDGLEY+CP CSV N+KKK  K +NG
Sbjct: 729 AHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKK--KVANG 771


>gi|449480653|ref|XP_004155958.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
           [Cucumis sativus]
          Length = 781

 Score =  830 bits (2143), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 415/645 (64%), Positives = 487/645 (75%), Gaps = 38/645 (5%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW+ +F+CYAACHF  A LSV+QSS +HTWDAFQLAHASFR+YC+ NN V+ S
Sbjct: 132 KGIPYVIYWRSAFTCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRMYCLGNNFVLPS 191

Query: 62  NSQKGSSK-LGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           +S K  S+ LGPHLLG+  KI++   E +V  +E S     ++ I D+DV MRFLVCG P
Sbjct: 192 SSHKEVSEDLGPHLLGEHLKINVEPLEKEVADDEESSSEGISVNILDNDVEMRFLVCGEP 251

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
            +LD  +L  LEDGLNALL+IEIRGSKLH + SAPPPPLQAG  S GVVTMRCDLSTCS 
Sbjct: 252 GSLDAYVLEALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGTLSNGVVTMRCDLSTCSF 311

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNS-GDNRLPPSEPRKSASIAC 239
           AHISLLVSGSAQ CF+DQL EN+IK E+I+  +LV  L +S G   L  SEPRKS SIAC
Sbjct: 312 AHISLLVSGSAQACFDDQLFENYIKTEIIDRGELVQTLLDSEGSKHL--SEPRKSTSIAC 369

Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
           GA+VFEVS+KV +WASQ+ RQLAPDVSYRSLV LGIASIQGLSVASFEKDDAERLLFFC+
Sbjct: 370 GATVFEVSLKVPSWASQIFRQLAPDVSYRSLVGLGIASIQGLSVASFEKDDAERLLFFCS 429

Query: 300 RQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNV------------- 346
           R+       N   +  PSWL  PAP RKR +  +++     E + ++             
Sbjct: 430 RKENDLFLSNLTDSTLPSWLKPPAP-RKRPKYIKDTSLGSHEIIEHLKVSPGSRIHGANM 488

Query: 347 -----------------RPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPV 389
                            R  +  AAMRPIPH   +KM+ F G SE   ++G  +KA++P 
Sbjct: 489 EIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMVSFHGISETGGHNGSLLKASVPS 548

Query: 390 A-PLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVM 448
           + P KH + G   V  +K   S+   +QII +NPLPLKKHGCGR+ IQ C EEEFL+D+M
Sbjct: 549 SNPTKHVTVGSASVFQQKVFPSASHYKQIIPMNPLPLKKHGCGRSHIQACFEEEFLKDLM 608

Query: 449 QFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSK 508
           QFL LRGH+RL+P GGLAEFPDAILN KRLDL+NLY+EVVSRGGF VGNGINWKGQ+FSK
Sbjct: 609 QFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGINWKGQIFSK 668

Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
           MRN+T+TNRMTGVGNTLKRHYETYLLEYELAH+DVDGECCLLCHSSAAGDWVNCGICGEW
Sbjct: 669 MRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWVNCGICGEW 728

Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
           AHFGCDRRQGLGAFKDYAKTDGLEY+CP CSV N+KKK  K +NG
Sbjct: 729 AHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKK--KVANG 771


>gi|222641636|gb|EEE69768.1| hypothetical protein OsJ_29479 [Oryza sativa Japonica Group]
          Length = 765

 Score =  829 bits (2142), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 422/630 (66%), Positives = 484/630 (76%), Gaps = 27/630 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 136 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 193

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PKI+I   E ++  EE S +  PAIKIYDDD+ M+FL+CGVP 
Sbjct: 194 -----SVKLGPRLLGDAPKINITPPENEMVEEEGSSDVFPAIKIYDDDINMKFLLCGVPS 248

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           T D  LLG LEDGLNALLNIEIRG KL NR SA PPPL A +  RG+VTMRCD++TCSS+
Sbjct: 249 TPDPCLLGSLEDGLNALLNIEIRGCKLQNRISASPPPLHAASLPRGMVTMRCDITTCSSS 308

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+SLLVSGSAQTCF+DQLLE+HIK+E+IE SQLVHALPN+ D++L  S P  S S ACGA
Sbjct: 309 HVSLLVSGSAQTCFDDQLLESHIKDEIIEKSQLVHALPNN-DDKLSSSVPFTSMSTACGA 367

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LAPD+SYRSLV LGI  I G  VASF++ DA+RLLFFCT Q
Sbjct: 368 STFEVWMTLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFDRRDADRLLFFCTNQ 427

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN-------VCNVR 347
            K    EN      P W  S + +++R +   ESK       G+  +          + +
Sbjct: 428 HKDLAIENGPYFHLPRW--SASLTKERVKVGLESKPNLLGANGIPEDKKRLIEGPSSSSK 485

Query: 348 PKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRK 406
            KL  A MRPIPH+R  +M PF GF E   ++  QVK NLP AP +KH+S    P THRK
Sbjct: 486 AKLKPATMRPIPHSRKQQMHPFMGFLEATVHETSQVKPNLPAAPPVKHNSVPAAPATHRK 545

Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
           S S    AQ II LNPLPLKKHGC R PIQ+CSEE+FL+DVMQFLI RGH RLVP GGLA
Sbjct: 546 STSGPSHAQSIIQLNPLPLKKHGCDRLPIQMCSEEDFLKDVMQFLIQRGHNRLVPHGGLA 605

Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
           EFP+A+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTLK
Sbjct: 606 EFPEAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLK 665

Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
           RHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYA
Sbjct: 666 RHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYA 725

Query: 587 KTDGLEYVCPQCSVTNFKKK--SQKTSNGY 614
           KTDGLEY+CP CS+ N+KKK   QK +NG+
Sbjct: 726 KTDGLEYICPHCSLANYKKKPPPQKVANGF 755


>gi|218202201|gb|EEC84628.1| hypothetical protein OsI_31490 [Oryza sativa Indica Group]
          Length = 766

 Score =  817 bits (2110), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/617 (67%), Positives = 474/617 (76%), Gaps = 25/617 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 136 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 193

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PKI+I   E ++  EE S +  PAIKIYDDD+ M+FL+CGVP 
Sbjct: 194 -----SVKLGPRLLGDAPKINITPPENEMVEEEGSSDVFPAIKIYDDDINMKFLLCGVPS 248

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           T D  LLG LEDGLNALLNIEIRG KL NR SA PPPL A +  RG+VTMRCD++TCSS+
Sbjct: 249 TPDPCLLGSLEDGLNALLNIEIRGCKLQNRISASPPPLHAASLPRGMVTMRCDITTCSSS 308

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+SLLVSGSAQTCF+DQLLE+HIK+E+IE SQLVHALPN+ D++L    P  S S ACGA
Sbjct: 309 HVSLLVSGSAQTCFDDQLLESHIKDEIIEKSQLVHALPNN-DDKLSSIVPFTSMSTACGA 367

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LAPD+SYRSLV LGI  I G  VASF++ DA+RLLFFCT Q
Sbjct: 368 STFEVWMTLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFDRRDADRLLFFCTNQ 427

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN-------VCNVR 347
            K    EN      P W  S + +++R +   ESK       G+  +          + +
Sbjct: 428 HKDLAIENGPYFHLPRW--SASLTKERVKVGLESKPKLLGANGIPEDKKRLIEGPSSSSK 485

Query: 348 PKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRK 406
            KL  A MRPIPH+R  +M PF GF E   ++  QVK NLP AP +KH+S    P THRK
Sbjct: 486 AKLKPATMRPIPHSRKQQMHPFMGFLEATVHETSQVKPNLPAAPPVKHNSVPAAPTTHRK 545

Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
           S S    AQ II LNPLPLKKHGC R PIQ+CSEE+FL+DVMQFLI RGH RLVP GGLA
Sbjct: 546 STSGPSHAQSIIQLNPLPLKKHGCDRLPIQMCSEEDFLKDVMQFLIQRGHNRLVPHGGLA 605

Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
           EFP+A+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTLK
Sbjct: 606 EFPEAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLK 665

Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
           RHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYA
Sbjct: 666 RHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYA 725

Query: 587 KTDGLEYVCPQCSVTNF 603
           KTDGLEY+CP CS+ N+
Sbjct: 726 KTDGLEYICPHCSLANY 742


>gi|326493346|dbj|BAJ85134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 771

 Score =  816 bits (2109), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 417/632 (65%), Positives = 481/632 (76%), Gaps = 27/632 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  AL SVVQSS SHTWD+FQLAHASFRLYCVRNN V   
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRNALFSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ-- 194

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
                S KLGP LLGD PK+++ + E ++ + E +S E  PAIKIYDDD+ M+FL+CGVP
Sbjct: 195 -----SVKLGPRLLGDAPKVNVVIPENEMAEEEGSSSEVSPAIKIYDDDINMKFLLCGVP 249

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
             LD+ LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A +  RG+VTMRCD++TCSS
Sbjct: 250 SALDSCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASVPRGMVTMRCDMTTCSS 309

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           +H+SLLVSGSAQTCF+DQLLE+HIKNELIE SQLV ALPNS D +L  +EP  S S ACG
Sbjct: 310 SHVSLLVSGSAQTCFDDQLLESHIKNELIEKSQLVRALPNSED-KLSSTEPFTSMSTACG 368

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           AS FEV M +  WA+QVL+ LAP++SYRSLV LG+  I    VASF++ DA+RLLFFCT 
Sbjct: 369 ASTFEVWMTLPKWAAQVLKHLAPEISYRSLVALGVGCINATPVASFDRQDADRLLFFCTG 428

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRS-----------EPCRESKGVESENVCNVRP- 348
           Q K    E+      P W  S A  R ++               E +    E   ++ P 
Sbjct: 429 QQKDLAGESGPYFHLPRWSASLAKDRAKTGSESKSNLLGVNGTLEDRKAPVEGPSSLTPF 488

Query: 349 --KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHR 405
             KL  A MRPIPH+R  +M PF GF E   ++  Q K NLP AP +KHSSA  +   HR
Sbjct: 489 KGKLKPATMRPIPHSRQQQMHPFMGFPEANVHETSQAKPNLPSAPPVKHSSAPASTAAHR 548

Query: 406 KSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGL 465
           KS S    AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GGL
Sbjct: 549 KSTSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHHRLVPHGGL 608

Query: 466 AEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTL 525
           AEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTL
Sbjct: 609 AEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTL 668

Query: 526 KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDY 585
           KRHYETYLLEYEL+HDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDY
Sbjct: 669 KRHYETYLLEYELSHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDY 728

Query: 586 AKTDGLEYVCPQCSVTNFKKK---SQKTSNGY 614
           AKTDGLEY+CP CS+ N+KKK    QK +NG+
Sbjct: 729 AKTDGLEYICPHCSIANYKKKPLPPQKVTNGF 760


>gi|357158540|ref|XP_003578160.1| PREDICTED: uncharacterized protein LOC100824739 [Brachypodium
           distachyon]
          Length = 769

 Score =  813 bits (2101), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/631 (65%), Positives = 480/631 (76%), Gaps = 26/631 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  ALLSVVQSS SHTWD+FQLAHASFRLYCVRNN V   
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRHALLSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ-- 194

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PKI+I+  E ++  EE S +  PAIKIYDDD+ M+FL+CGVP 
Sbjct: 195 -----SVKLGPRLLGDAPKINISAPENEMAEEEGSSDVSPAIKIYDDDIKMQFLLCGVPS 249

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD  LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A +  RG+VTMRCD++TCSS+
Sbjct: 250 TLDPCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASLPRGMVTMRCDITTCSSS 309

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+SLLVSGSAQTCF+DQLLE+HIKNELIE +QLV ALPNS D +L  +EP  S S+ACGA
Sbjct: 310 HVSLLVSGSAQTCFDDQLLESHIKNELIEKNQLVRALPNSMD-KLSSTEPLTSMSVACGA 368

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LA ++SYRSLV LG+  I    VASF+++DA+R+LFFCT Q
Sbjct: 369 STFEVWMTLPKWAAQVLKHLAREISYRSLVALGVGCINATPVASFDREDADRVLFFCTGQ 428

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR----SEP-------CRESKGVESENVCNVRP-- 348
            K    ++      P W  S    R +    S+P         E K +  E   ++ P  
Sbjct: 429 QKDLVADSGPYFHLPRWSASLTKDRVKRGSESKPNVFGANGTSEDKELPIEGPSSLTPLK 488

Query: 349 -KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRK 406
            K   A MRPIPH+R  +M PF GF E   ++   VK NLP AP +KH+        HRK
Sbjct: 489 AKFKPATMRPIPHSRQQQMHPFMGFPEATIHENSHVKPNLPAAPPVKHNLIPAATAAHRK 548

Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
           S S    AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GGLA
Sbjct: 549 STSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHNRLVPHGGLA 608

Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
           EFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTLK
Sbjct: 609 EFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLK 668

Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
           RHYETYLLEYEL+HDDVDGECCLLC+SSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYA
Sbjct: 669 RHYETYLLEYELSHDDVDGECCLLCNSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYA 728

Query: 587 KTDGLEYVCPQCSVTNFKKK---SQKTSNGY 614
           KTDGLEY+CP CS+ N+KKK    QK +NG+
Sbjct: 729 KTDGLEYICPHCSLANYKKKPQQPQKVANGF 759


>gi|414589557|tpg|DAA40128.1| TPA: hypothetical protein ZEAMMB73_432104 [Zea mays]
          Length = 768

 Score =  810 bits (2092), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/632 (65%), Positives = 477/632 (75%), Gaps = 29/632 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 194

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PKI+I+    ++  EE S E  PAIKIYD+++ M+ L+CGV C
Sbjct: 195 -----SVKLGPRLLGDSPKINISPPGTEMVDEEGSSEVTPAIKIYDEEINMKLLLCGVSC 249

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD  LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A +  RG+VTMRCD++TCSS+
Sbjct: 250 TLDPCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASLPRGMVTMRCDITTCSSS 309

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+SLLVSGSAQTCF+DQLLE+HIKNE+IE SQLV AL NS D +LP SEP  S S ACGA
Sbjct: 310 HVSLLVSGSAQTCFDDQLLESHIKNEIIEKSQLVRALSNSED-KLPSSEPLTSMSTACGA 368

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LAPD+SYRSLV LGI  I G  VASFE+ DA+RLLFFCT Q
Sbjct: 369 STFEVWMSLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFERRDADRLLFFCTSQ 428

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESE---------NVCN 345
            K    EN      P W  S + ++ R++   ESK       GV  +         +   
Sbjct: 429 CKDLANENGPYFHLPRW--SASLTKDRTKLGSESKQNLLGANGVLEDKKHMMEGPSSFSA 486

Query: 346 VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTH 404
           V+ KL  A MRPIPH++  +M PF    E   ++   VK +L VAP +KH+S    P TH
Sbjct: 487 VKAKLKPATMRPIPHSQKQQMHPFMCLPETSFHETSIVKPSLTVAPTVKHNSVSYAPTTH 546

Query: 405 RKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGG 464
           RKS S       +I LNPLP+KKHGC R  IQVCSEE+FL+DVMQFLI RGH RLVP GG
Sbjct: 547 RKSTSGPSHTPSVIQLNPLPMKKHGCDRLSIQVCSEEDFLKDVMQFLIQRGHNRLVPHGG 606

Query: 465 LAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNT 524
           LAEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNT
Sbjct: 607 LAEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRMTGVGNT 666

Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKD 584
           LKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKD
Sbjct: 667 LKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKD 726

Query: 585 YAKTDGLEYVCPQCSVTNFKKK--SQKTSNGY 614
           YAKTDGLEY+CP CS+ N+KK     K +NG+
Sbjct: 727 YAKTDGLEYICPHCSLANYKKPPLPPKVANGF 758


>gi|326505012|dbj|BAK02893.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 641

 Score =  809 bits (2089), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/619 (66%), Positives = 472/619 (76%), Gaps = 24/619 (3%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
            QG+PYVIYW++SFS YAA HF  AL SVVQSS SHTWD+FQLAHASFRLYCVRNN V  
Sbjct: 6   FQGIPYVIYWRNSFSSYAASHFRNALFSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ- 64

Query: 61  SNSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGV 119
                 S KLGP LLGD PK+++ + E ++ + E +S E  PAIKIYDDD+ M+FL+CGV
Sbjct: 65  ------SVKLGPRLLGDAPKVNVVIPENEMAEEEGSSSEVSPAIKIYDDDINMKFLLCGV 118

Query: 120 PCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCS 179
           P  LD+ LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A +  RG+VTMRCD++TCS
Sbjct: 119 PSALDSCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASVPRGMVTMRCDMTTCS 178

Query: 180 SAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIAC 239
           S+H+SLLVSGSAQTCF+DQLLE+HIKNELIE SQLV ALPNS D +L  +EP  S S AC
Sbjct: 179 SSHVSLLVSGSAQTCFDDQLLESHIKNELIEKSQLVRALPNSED-KLSSTEPFTSMSTAC 237

Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
           GAS FEV M +  WA+QVL+ LAP++SYRSLV LG+  I    VASF++ DA+RLLFFCT
Sbjct: 238 GASTFEVWMTLPKWAAQVLKHLAPEISYRSLVALGVGCINATPVASFDRQDADRLLFFCT 297

Query: 300 RQGKADHTENSVLTRPPSWLTSPAPSRKRS-----------EPCRESKGVESENVCNVRP 348
            Q K    E+      P W  S A  R ++               E +    E   ++ P
Sbjct: 298 GQQKDLAGESGPYFHLPRWSASLAKDRAKTGSESKSNLLGVNGTLEDRKAPVEGPSSLTP 357

Query: 349 ---KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTH 404
              KL  A MRPIPH+R  +M PF GF E   ++  Q K NLP AP +KHSSA  +   H
Sbjct: 358 FKGKLKPATMRPIPHSRQQQMHPFMGFPEANVHETSQAKPNLPSAPPVKHSSAPASTAAH 417

Query: 405 RKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGG 464
           RKS S    AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GG
Sbjct: 418 RKSTSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHHRLVPHGG 477

Query: 465 LAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNT 524
           LAEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNT
Sbjct: 478 LAEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNT 537

Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKD 584
           LKRHYETYLLEYEL+HDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKD
Sbjct: 538 LKRHYETYLLEYELSHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKD 597

Query: 585 YAKTDGLEYVCPQCSVTNF 603
           YAKTDGLEY+CP CS+ N+
Sbjct: 598 YAKTDGLEYICPHCSIANY 616


>gi|326502772|dbj|BAJ99014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 771

 Score =  806 bits (2082), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/618 (66%), Positives = 472/618 (76%), Gaps = 24/618 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  AL SVVQSS SHTWD+FQLAHASFRLYCVRNN V   
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRNALFSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ-- 194

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
                S KLGP LLGD PK+++ + E ++ + E +S E  PAIKIYDDD+ M+FL+CGVP
Sbjct: 195 -----SVKLGPRLLGDAPKVNVVIPENEMAEEEGSSSEVSPAIKIYDDDINMKFLLCGVP 249

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
             LD+ LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A +  RG+VTMRCD++TCSS
Sbjct: 250 SALDSCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASVPRGMVTMRCDMTTCSS 309

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           +H+SLLVSGSAQTCF+DQLLE+HIKNELIE SQLV ALPNS D +L  +EP  S S ACG
Sbjct: 310 SHVSLLVSGSAQTCFDDQLLESHIKNELIEKSQLVRALPNSED-KLSSTEPFTSMSTACG 368

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           AS FEV M +  WA+QVL+ LAP++SYRSLV LG+  I    VASF++ DA+RLLFFCT 
Sbjct: 369 ASTFEVWMTLPKWAAQVLKHLAPEISYRSLVALGVGCINATPVASFDRQDADRLLFFCTG 428

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRS-----------EPCRESKGVESENVCNVRP- 348
           Q K    E+      P W  S A  R ++               E +    E   ++ P 
Sbjct: 429 QQKDLAGESGPYFHLPRWSASLAKDRAKTGSESKSNLLGVNGTLEDRKAPVEGPSSLTPF 488

Query: 349 --KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHR 405
             KL  A MRPIPH+R  +M PF GF E   ++  Q K NLP AP +KHSSA  +   HR
Sbjct: 489 KGKLKPATMRPIPHSRQQQMHPFMGFPEANVHETSQAKPNLPSAPPVKHSSAPASTAAHR 548

Query: 406 KSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGL 465
           KS S    AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GGL
Sbjct: 549 KSTSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHHRLVPHGGL 608

Query: 466 AEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTL 525
           AEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTL
Sbjct: 609 AEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTL 668

Query: 526 KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDY 585
           KRHYETYLLEYEL+HDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDY
Sbjct: 669 KRHYETYLLEYELSHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDY 728

Query: 586 AKTDGLEYVCPQCSVTNF 603
           AKTDGLEY+CP CS+ N+
Sbjct: 729 AKTDGLEYICPHCSIANY 746


>gi|218201239|gb|EEC83666.1| hypothetical protein OsI_29439 [Oryza sativa Indica Group]
          Length = 761

 Score =  787 bits (2032), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/628 (61%), Positives = 465/628 (74%), Gaps = 25/628 (3%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
           ++G+PYV+YW++SFS YAA HF QAL+SVVQSSCSHTWDAFQLA ASFRLYC RNN    
Sbjct: 136 LKGIPYVLYWRNSFSSYAASHFRQALISVVQSSCSHTWDAFQLAQASFRLYCARNN---- 191

Query: 61  SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
                 S KLGP LLGD PKI+I L E ++  EE S E+ PAIKIYD+DV M+ L+CG P
Sbjct: 192 ---DAQSVKLGPRLLGDAPKINIFLPENEMVEEEGSSEHFPAIKIYDEDVNMKLLICGAP 248

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           C LD SLLG LEDGLNALLNIEIRG +L NR SA PPPL A     GVVTMRCD++TCSS
Sbjct: 249 CILDASLLGSLEDGLNALLNIEIRGCRLQNRVSAAPPPLHAETLPHGVVTMRCDITTCSS 308

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           +H+SLLVSGS QTCF+D+LLENHIK E++E  QLV A+    D++    EP  S S+A G
Sbjct: 309 SHVSLLVSGSPQTCFDDKLLENHIKKEIVEKGQLVRAVLVRADDKPSSVEPLTSISVASG 368

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           AS FEV M +  WA QVL+ LAP++SY+SLV LGIA + G  V+SF++ D +RLLFFC  
Sbjct: 369 ASTFEVWMTLPKWAGQVLKYLAPEISYKSLVPLGIACVNGTPVSSFDRQDVDRLLFFC-- 426

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------VRP 348
             K +   N + +  P W  S    R +  P  +S    +  V              V+P
Sbjct: 427 --KNEAIVNGLYSHLPRWSASLVKDRLKGTPESKSSTFSANGVGEYQKHPMKGTSLLVKP 484

Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSL 408
           KL SA MRPIPH+   +M PF G      +D  QVK +LP  P++H++    P T RK  
Sbjct: 485 KLKSAKMRPIPHSSKRQMHPFVGIPPSFIHDASQVKPSLPAPPVRHNALPVAPTTQRKLS 544

Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
           S + + +  + LNPLP+KKHGC R PI +CSEE+FL+DVMQFL+ RGHTRLVPQGGLAEF
Sbjct: 545 SGTSRVEPAVPLNPLPMKKHGCDRLPIGICSEEDFLKDVMQFLLQRGHTRLVPQGGLAEF 604

Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
           PDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKRH
Sbjct: 605 PDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMSNHTVTNKMTGVGNTLKRH 664

Query: 529 YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
           YETYLLEYEL+HDDV GECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYAKT
Sbjct: 665 YETYLLEYELSHDDVGGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYAKT 724

Query: 589 DGLEYVCPQCSVTNFKKK--SQKTSNGY 614
           DGLEY+CP CS+ N+KKK    +++NG+
Sbjct: 725 DGLEYICPHCSLANYKKKPPPPESANGF 752


>gi|115476700|ref|NP_001061946.1| Os08g0451400 [Oryza sativa Japonica Group]
 gi|113623915|dbj|BAF23860.1| Os08g0451400 [Oryza sativa Japonica Group]
 gi|222640662|gb|EEE68794.1| hypothetical protein OsJ_27533 [Oryza sativa Japonica Group]
          Length = 761

 Score =  786 bits (2030), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/628 (61%), Positives = 465/628 (74%), Gaps = 25/628 (3%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
           ++G+PYV+YW++SFS YAA HF QAL+SVVQSSCSHTWDAFQLA ASFRLYC RNN    
Sbjct: 136 LKGIPYVLYWRNSFSSYAASHFRQALISVVQSSCSHTWDAFQLAQASFRLYCARNN---- 191

Query: 61  SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
                 S KLGP LLGD PKI+I L E ++  EE S E+ PAIKIYD+DV M+ L+CG P
Sbjct: 192 ---DAQSVKLGPRLLGDAPKINIFLPENEMVEEEGSSEHFPAIKIYDEDVNMKLLICGAP 248

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           C LD SLLG LEDGLNALLNIEIRG +L NR SA PPPL A     GVVTMRCD++TCSS
Sbjct: 249 CILDASLLGSLEDGLNALLNIEIRGCRLQNRVSAAPPPLHAETLPHGVVTMRCDITTCSS 308

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           +H+SLLVSGS QTCF+D+LLENHIK E++E  QLV A+    D++    EP  S S+A G
Sbjct: 309 SHVSLLVSGSPQTCFDDKLLENHIKKEIVEKGQLVRAVLVREDDKPSSVEPLTSISVASG 368

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           AS FEV M +  WA QVL+ LAP++SY+SLV LGIA + G  V+SF++ D +RLLFFC  
Sbjct: 369 ASTFEVWMTLPKWAGQVLKYLAPEISYKSLVPLGIACVNGTPVSSFDRQDVDRLLFFC-- 426

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------VRP 348
             K +   N + +  P W  S    R +  P  +S    +  V              V+P
Sbjct: 427 --KNEAIVNGLYSHLPRWSASLVKDRLKGTPESKSSTFSANGVGEYQKHPMKGTSLVVKP 484

Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSL 408
           KL SA MRPIPH+   +M PF G      +D  QVK +LP  P++H++    P T RK  
Sbjct: 485 KLKSAKMRPIPHSSKRQMHPFVGIPPSFIHDASQVKPSLPAPPVRHNALPVAPTTQRKLS 544

Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
           S + + +  + LNPLP+KKHGC R PI +CSEE+FL+DVMQFL+ RGHTRLVPQGGLAEF
Sbjct: 545 SGTSRVEPAVPLNPLPMKKHGCDRLPIGICSEEDFLKDVMQFLLQRGHTRLVPQGGLAEF 604

Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
           PDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKRH
Sbjct: 605 PDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMSNHTVTNKMTGVGNTLKRH 664

Query: 529 YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
           YETYLLEYEL+HDDV GECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYAKT
Sbjct: 665 YETYLLEYELSHDDVGGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYAKT 724

Query: 589 DGLEYVCPQCSVTNFKKK--SQKTSNGY 614
           DGLEY+CP CS+ N+KKK    +++NG+
Sbjct: 725 DGLEYICPHCSLANYKKKPPPPESANGF 752


>gi|42407626|dbj|BAD08741.1| fiber protein-like [Oryza sativa Japonica Group]
          Length = 683

 Score =  786 bits (2029), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/628 (61%), Positives = 465/628 (74%), Gaps = 25/628 (3%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
           ++G+PYV+YW++SFS YAA HF QAL+SVVQSSCSHTWDAFQLA ASFRLYC RNN    
Sbjct: 58  LKGIPYVLYWRNSFSSYAASHFRQALISVVQSSCSHTWDAFQLAQASFRLYCARNN---- 113

Query: 61  SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
                 S KLGP LLGD PKI+I L E ++  EE S E+ PAIKIYD+DV M+ L+CG P
Sbjct: 114 ---DAQSVKLGPRLLGDAPKINIFLPENEMVEEEGSSEHFPAIKIYDEDVNMKLLICGAP 170

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           C LD SLLG LEDGLNALLNIEIRG +L NR SA PPPL A     GVVTMRCD++TCSS
Sbjct: 171 CILDASLLGSLEDGLNALLNIEIRGCRLQNRVSAAPPPLHAETLPHGVVTMRCDITTCSS 230

Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
           +H+SLLVSGS QTCF+D+LLENHIK E++E  QLV A+    D++    EP  S S+A G
Sbjct: 231 SHVSLLVSGSPQTCFDDKLLENHIKKEIVEKGQLVRAVLVREDDKPSSVEPLTSISVASG 290

Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
           AS FEV M +  WA QVL+ LAP++SY+SLV LGIA + G  V+SF++ D +RLLFFC  
Sbjct: 291 ASTFEVWMTLPKWAGQVLKYLAPEISYKSLVPLGIACVNGTPVSSFDRQDVDRLLFFC-- 348

Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------VRP 348
             K +   N + +  P W  S    R +  P  +S    +  V              V+P
Sbjct: 349 --KNEAIVNGLYSHLPRWSASLVKDRLKGTPESKSSTFSANGVGEYQKHPMKGTSLVVKP 406

Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSL 408
           KL SA MRPIPH+   +M PF G      +D  QVK +LP  P++H++    P T RK  
Sbjct: 407 KLKSAKMRPIPHSSKRQMHPFVGIPPSFIHDASQVKPSLPAPPVRHNALPVAPTTQRKLS 466

Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
           S + + +  + LNPLP+KKHGC R PI +CSEE+FL+DVMQFL+ RGHTRLVPQGGLAEF
Sbjct: 467 SGTSRVEPAVPLNPLPMKKHGCDRLPIGICSEEDFLKDVMQFLLQRGHTRLVPQGGLAEF 526

Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
           PDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKRH
Sbjct: 527 PDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMSNHTVTNKMTGVGNTLKRH 586

Query: 529 YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
           YETYLLEYEL+HDDV GECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYAKT
Sbjct: 587 YETYLLEYELSHDDVGGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYAKT 646

Query: 589 DGLEYVCPQCSVTNFKKK--SQKTSNGY 614
           DGLEY+CP CS+ N+KKK    +++NG+
Sbjct: 647 DGLEYICPHCSLANYKKKPPPPESANGF 674


>gi|357147984|ref|XP_003574575.1| PREDICTED: uncharacterized protein LOC100844696 [Brachypodium
           distachyon]
          Length = 766

 Score =  785 bits (2026), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/619 (63%), Positives = 466/619 (75%), Gaps = 22/619 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PY+IYW++SFS YAA HF   L SV+QSSCSH WDAFQLA ASFRLYCVRNN V   
Sbjct: 134 KGIPYLIYWRNSFSSYAATHFRHVLTSVIQSSCSHIWDAFQLARASFRLYCVRNNHVR-- 191

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S +LGPHLLGD PKI+IA  +  +   E S E   AIKI+D DV M+FL+CGVPC
Sbjct: 192 -----SVELGPHLLGDAPKINIAPPDNGMADGEGSSEAFSAIKIHDKDVNMKFLICGVPC 246

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD  LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A     GVVTMR D++TCSS+
Sbjct: 247 TLDACLLGSLEDGLNALLNIEIRGSKLRNRVSAAPPPLEAETLPCGVVTMRSDITTCSSS 306

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           ++SLLVSGSAQTCF+D++LE+HIKNE+IE SQLV +LPNS DN+   +EP  S SIACGA
Sbjct: 307 YMSLLVSGSAQTCFDDKILESHIKNEIIEKSQLVRSLPNSEDNKPSSAEPLTSMSIACGA 366

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S+FEV M +  WA+++L+QLAPD+SY+SLV LGIAS+    V+SF + DA+RLLFFCT Q
Sbjct: 367 SIFEVWMTLPKWAAEILKQLAPDISYKSLVGLGIASVNSTPVSSFNRQDADRLLFFCTSQ 426

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRS-----EP---CRESKGVESENVCN------VR 347
            K    E       P W  S    R+       EP   C    G + +++ +      V+
Sbjct: 427 YKDRPIEVGPYLHLPRWSASLTKERRMKGSLEVEPNVSCANGVGDDKKHLMDGSSLPSVK 486

Query: 348 PKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPL-KHSSAGPTPVTHRK 406
            +L  A MRPIP +    + PF+GF++       Q K++ P  P  KH+S      T RK
Sbjct: 487 TRLKPATMRPIPRSSKQHLHPFTGFTQSIINGAYQTKSSFPARPPGKHNSVPAASATRRK 546

Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
           S S S ++QQ + LNPLP+KKHGC R PIQ C EE+FL+D+MQFL+ RGHTRLVPQGGLA
Sbjct: 547 SSSGSSRSQQALLLNPLPMKKHGCDRLPIQTCHEEDFLKDLMQFLLQRGHTRLVPQGGLA 606

Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
           EFPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLK
Sbjct: 607 EFPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMCNHTVTNKMTGVGNTLK 666

Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
           RHYETYLLEY+L HDDVDGECCLLCHS+A GDWVNCG+CG+WAHFGCD+RQGLGAFKDY+
Sbjct: 667 RHYETYLLEYQLCHDDVDGECCLLCHSTAPGDWVNCGLCGDWAHFGCDKRQGLGAFKDYS 726

Query: 587 KTDGLEYVCPQCSVTNFKK 605
           KTDGLEY+CP CS TN+KK
Sbjct: 727 KTDGLEYICPHCSGTNYKK 745


>gi|357141526|ref|XP_003572256.1| PREDICTED: uncharacterized protein LOC100821621 [Brachypodium
           distachyon]
          Length = 735

 Score =  784 bits (2025), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/618 (63%), Positives = 465/618 (75%), Gaps = 22/618 (3%)

Query: 3   GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
           G+PY+IYW++SFS YAA HF   L SV+QSSCSH WDAFQLA ASFRLYCVRNN V    
Sbjct: 104 GIPYLIYWRNSFSSYAATHFRHVLTSVIQSSCSHIWDAFQLARASFRLYCVRNNHVR--- 160

Query: 63  SQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCT 122
               S +LGPHLLGD PKI+IA  +  +   E S E   AIKI+D DV M+FL+CGVPCT
Sbjct: 161 ----SVELGPHLLGDAPKINIAPPDNGMADGEGSSEAFSAIKIHDKDVNMKFLICGVPCT 216

Query: 123 LDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAH 182
           LD  LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A     GVVTMR D++TCSS++
Sbjct: 217 LDACLLGSLEDGLNALLNIEIRGSKLRNRVSAAPPPLEAETLPCGVVTMRSDITTCSSSY 276

Query: 183 ISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGAS 242
           +SLLVSGSAQTCF+D++LE+HIKNE+IE SQLV +LPNS DN+   +EP  S SIACGAS
Sbjct: 277 MSLLVSGSAQTCFDDKILESHIKNEIIEKSQLVRSLPNSEDNKPSSAEPLTSMSIACGAS 336

Query: 243 VFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQG 302
           +FEV M +  WA+++L+QLAPD+SY+SLV LGIAS+    V+SF + DA+RLLFFCT Q 
Sbjct: 337 IFEVWMTLPKWAAEILKQLAPDISYKSLVGLGIASVNSTPVSSFNRQDADRLLFFCTSQY 396

Query: 303 KADHTENSVLTRPPSWLTSPAPSRKRS-----EP---CRESKGVESENVCN------VRP 348
           K    E       P W  S    R+       EP   C    G + +++ +      V+ 
Sbjct: 397 KDRPIEVGPYLHLPRWSASLTKERRMKGSLEVEPNVSCANGVGDDKKHLMDGSSLPSVKT 456

Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPL-KHSSAGPTPVTHRKS 407
           +L  A MRPIP +    + PF+GF++       Q K++ P  P  KH+S      T RKS
Sbjct: 457 RLKPATMRPIPRSSKQHLHPFTGFTQSIINGAYQTKSSFPARPPGKHNSVPAASATRRKS 516

Query: 408 LSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAE 467
            S S ++QQ + LNPLP+KKHGC R PIQ C EE+FL+D+MQFL+ RGHTRLVPQGGLAE
Sbjct: 517 SSGSSRSQQALLLNPLPMKKHGCDRLPIQTCHEEDFLKDLMQFLLQRGHTRLVPQGGLAE 576

Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
           FPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKR
Sbjct: 577 FPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMCNHTVTNKMTGVGNTLKR 636

Query: 528 HYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAK 587
           HYETYLLEY+L HDDVDGECCLLCHS+A GDWVNCG+CG+WAHFGCD+RQGLGAFKDY+K
Sbjct: 637 HYETYLLEYQLCHDDVDGECCLLCHSTAPGDWVNCGLCGDWAHFGCDKRQGLGAFKDYSK 696

Query: 588 TDGLEYVCPQCSVTNFKK 605
           TDGLEY+CP CS TN+KK
Sbjct: 697 TDGLEYICPHCSGTNYKK 714


>gi|357450177|ref|XP_003595365.1| Fiber protein-like protein [Medicago truncatula]
 gi|355484413|gb|AES65616.1| Fiber protein-like protein [Medicago truncatula]
          Length = 844

 Score =  768 bits (1984), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/486 (76%), Positives = 416/486 (85%), Gaps = 13/486 (2%)

Query: 142 EIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQ--- 198
           +IRG KL NRTSAPPPPLQA  FSRGVVTMRCD+ST SSAHISLLVSGSA  CFNDQ   
Sbjct: 359 KIRGCKLQNRTSAPPPPLQAATFSRGVVTMRCDISTSSSAHISLLVSGSADACFNDQARV 418

Query: 199 ---LLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWAS 255
              LLENHIK EL ENSQLV A+P+   N+LP  EPR+SAS+ACG+SVFEV M+V TWAS
Sbjct: 419 NCILLENHIKKELTENSQLVQAIPSHEQNKLPSYEPRRSASVACGSSVFEVCMRVPTWAS 478

Query: 256 QVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVLTRP 315
           QVLRQLAP+VSYR LVMLG+ASIQGLSVASF KDDAERLLFFC RQ K + +++ V +  
Sbjct: 479 QVLRQLAPNVSYRCLVMLGVASIQGLSVASFTKDDAERLLFFCNRQEKDNCSKDIVFSSH 538

Query: 316 PSWLTSPAPSRKRSEP--CRESK-----GVESENVCNVRPKLNSAAMRPIPHTRHYKMLP 368
           PSWL  PAPSRKRSEP  C   K     GV+ E+  + R KL+ AAMRPIP +   K+LP
Sbjct: 539 PSWLMPPAPSRKRSEPESCSRVKSINASGVKVEDNGSDRQKLSFAAMRPIPQSHRQKLLP 598

Query: 369 FSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKH 428
           FSG+ E E+YDGD+ K+N P+ P+KH+  G   VT+RKS+S+S+QA QIISLNPLP+KKH
Sbjct: 599 FSGYFEGEKYDGDRGKSNQPLVPIKHNGLGSRSVTNRKSVSNSFQAHQIISLNPLPMKKH 658

Query: 429 GCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVV 488
           GC RAPI+VCSEEEFL+DVMQFL+LRGHTRL+PQGGLAEFPDA+LNAKRLDLFNLYREVV
Sbjct: 659 GCDRAPIRVCSEEEFLKDVMQFLVLRGHTRLIPQGGLAEFPDAVLNAKRLDLFNLYREVV 718

Query: 489 SRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC 548
           SRGGFHVGNGINWKGQVFSKM NHTL++RMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Sbjct: 719 SRGGFHVGNGINWKGQVFSKMSNHTLSHRMTGVGNTLKRHYETYLLEYELAHDDVDGECC 778

Query: 549 LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQ 608
           LLCHSSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP CS++NF KKSQ
Sbjct: 779 LLCHSSAAGDWVNCGMCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSMSNFSKKSQ 838

Query: 609 KTSNGY 614
           KT+NGY
Sbjct: 839 KTANGY 844



 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 141/396 (35%), Positives = 189/396 (47%), Gaps = 53/396 (13%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +GVPY IYWK+ FS  AA HF QA  SV QS+ SHTWDAFQLA +SFRLYCV+N ++   
Sbjct: 139 KGVPYTIYWKNEFSKSAASHFHQAFFSVAQSTSSHTWDAFQLAQSSFRLYCVQNEVI-PH 197

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
           NSQKGS K+GP +LG+PP I++   E D  + EE+SPE   +IKIYDDDV  RFL+CG P
Sbjct: 198 NSQKGSDKVGPKILGEPPNIEVGPCEADTKEDEEDSPETSSSIKIYDDDVNTRFLLCGFP 257

Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
           CT+D  LL  LEDGLNALL  E    K  +                 VV MR     C  
Sbjct: 258 CTVDACLLESLEDGLNALLCTECWRVKNQHENKI------------SVVEMRMLRWMCER 305

Query: 181 AHISLLV------SGSAQTCFNDQLLENHIKNELIENS---QLVH-ALPNSGD------- 223
            H+  +V        S +T   D  +    +N + + +    L+H A P   D       
Sbjct: 306 RHVDFVVWRVDLEKLSLETIKKDLEINELDRNMVYDRTLWRNLIHVANPTYWDKIRGCKL 365

Query: 224 -NRL-PPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGL 281
            NR   P  P ++A+ + G     V+M+              D+S  S   + +  + G 
Sbjct: 366 QNRTSAPPPPLQAATFSRGV----VTMRC-------------DISTSSSAHISLL-VSGS 407

Query: 282 SVASFEKDDAERLLFFCTRQGKADHTENSVLTRP-PSWLTSPAPSRKRSEPCRESKGVES 340
           + A F  D A           K + TENS L +  PS   +  PS +       + G   
Sbjct: 408 ADACF-NDQARVNCILLENHIKKELTENSQLVQAIPSHEQNKLPSYEPRRSASVACGSSV 466

Query: 341 ENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIE 376
             VC   P   S  +R +     Y+ L   G + I+
Sbjct: 467 FEVCMRVPTWASQVLRQLAPNVSYRCLVMLGVASIQ 502


>gi|212274905|ref|NP_001130362.1| uncharacterized protein LOC100191457 [Zea mays]
 gi|194688938|gb|ACF78553.1| unknown [Zea mays]
          Length = 749

 Score =  768 bits (1982), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/617 (61%), Positives = 458/617 (74%), Gaps = 24/617 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYV+YW++ FS Y A HF  AL+SV+QSSCSHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 137 KGIPYVVYWRNLFSSYTASHFRHALMSVIQSSCSHTWDAFQLAHASFRLYCVRNNHVQ-- 194

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PK ++  +  +V  E+   E  P I+IYD+DV+M+FL+CGVPC
Sbjct: 195 -----SVKLGPCLLGDAPKTNVIPAGDEVNEEQGCSEGFPDIRIYDEDVSMKFLLCGVPC 249

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD  LLG LEDGLNALLNIEIR SKL NR SA PP LQA     GVVTMRCD++TCSSA
Sbjct: 250 TLDACLLGALEDGLNALLNIEIRASKLQNRASAHPP-LQAEDLPHGVVTMRCDITTCSSA 308

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+S LVS SAQTCF+DQLL + IKNE+IE  QLV AL N+ DN+    EP  S  +ACGA
Sbjct: 309 HVSFLVSCSAQTCFDDQLLGSRIKNEIIEKRQLVRALLNNEDNKPSSYEPLPSMCVACGA 368

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV + +  WA+QVL+ LAP++SYRSLV LGIA + G  V+SF++ DA+RLLFFC+ Q
Sbjct: 369 STFEVWITMPKWAAQVLKHLAPEISYRSLVALGIAWVNGTPVSSFDRQDADRLLFFCSNQ 428

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN---VCNVRPKLN 351
                 +N       SW  S   ++ R+    ESK       GV  +N   + +++P+L 
Sbjct: 429 CGDQAIQNGSYAHVSSW--SAPLTKDRTTGSLESKQMSFGAKGVVGDNTISLSSLKPRLK 486

Query: 352 SAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSS 411
            A MRP+PH+R  +M PF GF +   +D  QVK +LP  P+KH++   T VT RKS S S
Sbjct: 487 PATMRPLPHSRKQQMHPFMGFPQSIIHDASQVKPSLPAPPVKHNAVPVTAVTPRKSSSGS 546

Query: 412 Y----QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAE 467
                + Q  I LNPLP+KKH C R PI  CSEE+FL+DVMQFL+ RGH RLVPQGGLAE
Sbjct: 547 PSSWSRVQLPIPLNPLPIKKHECNRLPIHSCSEEDFLKDVMQFLLQRGHIRLVPQGGLAE 606

Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
           FPDA+LN+KRLDL+NLY+EVV RGGF+VGNGINWKGQ+FSKM NHT+TN+MTGVGNTLKR
Sbjct: 607 FPDAVLNSKRLDLYNLYKEVVYRGGFYVGNGINWKGQIFSKMHNHTVTNKMTGVGNTLKR 666

Query: 528 HYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAK 587
           HYETYLLEYELAHDD+DGECCL+CHS A GDWVNCG+CGEWAH GCDRR+GL  FKDY+K
Sbjct: 667 HYETYLLEYELAHDDIDGECCLICHSGALGDWVNCGLCGEWAHLGCDRRKGLSTFKDYSK 726

Query: 588 TDGLEYVCPQCSVTNFK 604
            DG+EY+CPQCS+  +K
Sbjct: 727 ADGMEYICPQCSLAKYK 743


>gi|242081649|ref|XP_002445593.1| hypothetical protein SORBIDRAFT_07g022155 [Sorghum bicolor]
 gi|241941943|gb|EES15088.1| hypothetical protein SORBIDRAFT_07g022155 [Sorghum bicolor]
          Length = 729

 Score =  763 bits (1969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/612 (61%), Positives = 455/612 (74%), Gaps = 25/612 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYV+YW++SFS Y A HF  AL+SV++SSCSHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 127 KGIPYVVYWRNSFSSYTASHFRHALMSVIESSCSHTWDAFQLAHASFRLYCVRNNYVQ-- 184

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PKI++  +  +V  EE   E  P I+IYD+DV M+FLVCG PC
Sbjct: 185 -----SVKLGPCLLGDAPKINVIPAGNEVNKEEGCSEGFPPIRIYDEDVNMKFLVCGAPC 239

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           TLD  LLG LEDGL ALLNIE+ G +  N+ S+ PPPLQA     GVVTMRCD+STCSSA
Sbjct: 240 TLDACLLGALEDGLTALLNIEVCGHRCSNQLSSAPPPLQAENLPHGVVTMRCDISTCSSA 299

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+S LVS SAQTCF+D+LLE+ IKNE+IE  QLV AL N+ DN+ P  EP  S  +ACGA
Sbjct: 300 HVSFLVSASAQTCFDDKLLESRIKNEIIEKRQLVRALSNTEDNK-PSYEPLPSMCVACGA 358

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV + +  WA+QVL+QLAP++ YRSLV LGIA + G  V+SF++ DA+RLLFFC+ Q
Sbjct: 359 STFEVWITLPKWAAQVLKQLAPEICYRSLVALGIAWVNGTPVSSFDRQDADRLLFFCSNQ 418

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN---VCNVRPKLN 351
            K    +N       SW  S + ++ R+    ESK       GV  +N   + +++P+L 
Sbjct: 419 CKDKAIQNVSYAHLSSW--SASLTKDRTTGSIESKQMSFGAKGVGGDNKMSLSSLKPRLK 476

Query: 352 SAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSS 411
            A MRPIPH+R  +M PF GF++   +D   VK +LP  P+K ++     VT RKS S S
Sbjct: 477 PATMRPIPHSRKQQMHPFMGFAQ-SIHDASPVKPSLPAPPVKDNTVPVIKVTQRKSSSGS 535

Query: 412 Y----QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAE 467
                + Q  I LNPLP+KKH C R PI +CSEE+FL+DVMQFL+ RGH RLVPQGG AE
Sbjct: 536 PSSSSRVQPSIPLNPLPMKKHECNRLPINICSEEDFLKDVMQFLLQRGHDRLVPQGGPAE 595

Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
           FPDA+LN+KRLDL+NLY+EVV RGGFHVGNGINWKGQVFSKMRNHT+TN+MTGVGNTLKR
Sbjct: 596 FPDAVLNSKRLDLYNLYKEVVYRGGFHVGNGINWKGQVFSKMRNHTVTNKMTGVGNTLKR 655

Query: 528 HYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAK 587
           HYETYLLEYELAHDD++ ECCL+CHSS  GDWVNCG+CGEWAH GCDRRQGLG FKDY+K
Sbjct: 656 HYETYLLEYELAHDDIEVECCLICHSSDLGDWVNCGVCGEWAHLGCDRRQGLGNFKDYSK 715

Query: 588 TDGLEYVCPQCS 599
           T GLEY+CP CS
Sbjct: 716 TGGLEYICPHCS 727


>gi|302825149|ref|XP_002994207.1| hypothetical protein SELMODRAFT_449331 [Selaginella moellendorffii]
 gi|300137918|gb|EFJ04712.1| hypothetical protein SELMODRAFT_449331 [Selaginella moellendorffii]
          Length = 733

 Score =  567 bits (1461), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 322/624 (51%), Positives = 400/624 (64%), Gaps = 47/624 (7%)

Query: 3   GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
           GV +V +W  S +  AA HF Q+L++ +++     WDAF+LA+AS  ++  +      S 
Sbjct: 133 GVRHVAHWDGSVTSLAAAHFRQSLVACLRTPGCDPWDAFELANASLEIHYGQT-----SG 187

Query: 63  SQKGSSKLGPHLLGD-PPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
           SQ       P LLG+ PP +D  L +M     E+ P   P I+IYD++  +R LVC   C
Sbjct: 188 SQ-------PMLLGEGPPVLDDPLKDM-----EDDP---PLIQIYDEETEIRLLVCAEAC 232

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
             D+S L  +E  L +L  IE+RG +L +R SAPPPP  A  F+RGVVTMRCDL T SSA
Sbjct: 233 ASDSSSLQAVEVALTSLFAIEVRGMRLIHRISAPPPPSAASTFARGVVTMRCDLCTSSSA 292

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
            ISL+VSGSAQTCF D  LE+ I+  L+E SQ +  L  S DN   P+E R+S SIACGA
Sbjct: 293 RISLVVSGSAQTCFADHFLESTIRKGLMEKSQALQ-LIMSEDNM--PAEIRRSTSIACGA 349

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           +V E   KV  WA+Q LRQL+ D SY++LV LGIA I+G  VA+F+++DAERL       
Sbjct: 350 AVVETRAKVPNWAAQTLRQLSTDTSYKTLVALGIAGIEGSPVAAFQQEDAERLGLLRNEP 409

Query: 302 GKADHTE--NSVLTRP--PSWLTSPAPSRKRSEPCRESKGVESENVCNVRPK-LNSAAMR 356
            KA  +   N     P  P+WLT  AP+RKR   C  S     +   N   K +  AAM+
Sbjct: 410 PKALGSPRCNGFSEAPIIPAWLTPAAPTRKRQNLCLSSINFNGDAPLNGDSKSVFLAAMK 469

Query: 357 PIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQ 416
           PIPH    K++PF+G        G  +K N           G T  T  +  S+     +
Sbjct: 470 PIPHATRRKLMPFAGVVSAGAQAGWSMKLN-----------GNTKSTRPEGTSAPSGHGR 518

Query: 417 IISLNPLPL---KKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL 473
                P+ L   KKH C R  +  C EEEFL DV+QFL+ RGH RL+P  G+  FPD +L
Sbjct: 519 GGHTFPIVLPSVKKHHCSRPSMLECPEEEFLNDVVQFLVSRGHGRLIPPTGIEAFPDVVL 578

Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYL 533
           N KRLDL+NLYREVVSRGGF VGNGINWKGQ+FSKMRNHT TNRMTGVGNTLK+HYETYL
Sbjct: 579 NGKRLDLYNLYREVVSRGGFRVGNGINWKGQIFSKMRNHTTTNRMTGVGNTLKKHYETYL 638

Query: 534 LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
           LEYELAHDDVDGECC+LCHSSA GDWVNCGICGEWAHFGCDRR GL  FK+YAKTDGLEY
Sbjct: 639 LEYELAHDDVDGECCILCHSSAEGDWVNCGICGEWAHFGCDRRTGLATFKEYAKTDGLEY 698

Query: 594 VCPQCSVTNFK----KKSQKTSNG 613
           +CP+CSV + +    +K Q+ S+G
Sbjct: 699 ICPRCSVGSARGVTSRKKQRPSSG 722


>gi|302823192|ref|XP_002993250.1| hypothetical protein SELMODRAFT_431374 [Selaginella moellendorffii]
 gi|300138920|gb|EFJ05671.1| hypothetical protein SELMODRAFT_431374 [Selaginella moellendorffii]
          Length = 753

 Score =  566 bits (1459), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 321/624 (51%), Positives = 399/624 (63%), Gaps = 47/624 (7%)

Query: 3   GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
           GV +V +W  S +  AA HF Q+L++ +++     WDAF+LA+AS  ++  +      S 
Sbjct: 153 GVRHVAHWDGSVTSLAAAHFRQSLVACLRTPGCDPWDAFELANASLEIHYGQT-----SG 207

Query: 63  SQKGSSKLGPHLLGD-PPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
           SQ       P LLG+ PP +D  L +M     E+ P   P I+IYD++  +R LVC   C
Sbjct: 208 SQ-------PMLLGEGPPVLDDPLKDM-----EDDP---PLIQIYDEETEIRLLVCAEAC 252

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
             D+S L  +E  L +L  IE+RG +L +R SAPPPP  A  F+RGVVTMRCDL T SSA
Sbjct: 253 ASDSSSLQAVEVALTSLFAIEVRGMRLIHRISAPPPPSAASTFARGVVTMRCDLCTSSSA 312

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
            ISL+VSGSAQTCF D  LE+ I+  L+E SQ +  L  S DN   P+E R+S SIACGA
Sbjct: 313 RISLVVSGSAQTCFADHFLESTIRKGLMEKSQALQ-LIMSEDNM--PAEIRRSTSIACGA 369

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           +V E   KV  WA+Q LRQL+ D SY++LV LGIA I+G  VA+F+++DAERL       
Sbjct: 370 AVVETRAKVPNWAAQTLRQLSTDTSYKTLVALGIAGIEGSPVAAFQQEDAERLGLLRNEP 429

Query: 302 GKADHTE--NSVLTRP--PSWLTSPAPSRKRSEPCRESKGVESENVCNVRPK-LNSAAMR 356
            K   +   N     P  P+WLT  AP+RKR   C  S     +   N   K +  AAM+
Sbjct: 430 PKPLGSPRCNGFSEAPIIPAWLTPAAPTRKRQNLCLSSINFNGDGPLNGDSKSVFLAAMK 489

Query: 357 PIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQ 416
           PIPH    K++PF+G        G  +K N           G T  T  +  S+     +
Sbjct: 490 PIPHATRRKLMPFAGVVSAGAQAGWSMKLN-----------GNTKSTRPEGTSAPSGHGR 538

Query: 417 IISLNPLPL---KKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL 473
                P+ L   KKH C R  +  C EEEFL DV+QFL+ RGH RL+P  G+  FPD +L
Sbjct: 539 GGHTFPIVLPSVKKHHCSRPSMLECPEEEFLNDVVQFLVSRGHGRLIPPTGIEAFPDVVL 598

Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYL 533
           N KRLDL+NLYREVVSRGGF VGNGINWKGQ+FSKMRNHT TNRMTGVGNTLK+HYETYL
Sbjct: 599 NGKRLDLYNLYREVVSRGGFRVGNGINWKGQIFSKMRNHTTTNRMTGVGNTLKKHYETYL 658

Query: 534 LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
           LEYELAHDDVDGECC+LCHSSA GDWVNCGICGEWAHFGCDRR GL  FK+YAKTDGLEY
Sbjct: 659 LEYELAHDDVDGECCILCHSSAEGDWVNCGICGEWAHFGCDRRTGLATFKEYAKTDGLEY 718

Query: 594 VCPQCSVTNFK----KKSQKTSNG 613
           +CP+CSV + +    +K Q+ S+G
Sbjct: 719 ICPRCSVGSARGVTSRKKQRPSSG 742


>gi|168019329|ref|XP_001762197.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
           patens subsp. patens]
 gi|162686601|gb|EDQ72989.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
           patens subsp. patens]
          Length = 645

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 294/624 (47%), Positives = 399/624 (63%), Gaps = 20/624 (3%)

Query: 4   VPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASNS 63
           VP+VI WK S +   A HF +ALL+ ++S  +   DAFQ+A ASF+++C +      +  
Sbjct: 1   VPHVICWKGSPTFSLATHFRRALLATLRSCATGAKDAFQIASASFQMHCGQTKSASVNGL 60

Query: 64  QKGSSKLGPHLLGDPPKIDIALSEM-DVQGEENSPENLPA----IKIYDDDVTMRFLVCG 118
           +K  S + P LL + P   ++   M D  GEEN  E + +    ++IYD++ ++R LVC 
Sbjct: 61  EKTISVI-PVLL-EVPLSKVSEDRMSDSSGEENGNEPISSDTAPVQIYDENHSIRLLVCC 118

Query: 119 VPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTC 178
                 ++ +G +EDGL+ALL IE +G++L +R SAPPPP+ A +  RGVVTMRCD+ T 
Sbjct: 119 EASRPSSAWIGAIEDGLSALLTIEAKGARLLHRVSAPPPPIAATSLLRGVVTMRCDVCTS 178

Query: 179 SSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEP--RKSAS 236
           + A + LLVSG+AQTCF+DQ+LEN +K E+++  + +  +    ++R P      R+S S
Sbjct: 179 TFARVPLLVSGAAQTCFDDQVLENSVKKEMLDKCEHIQVVVED-ESRRPALNLNLRRSVS 237

Query: 237 IACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLF 296
           +ACGAS+ E+ +K  TW  QVLRQLA + SYR LV +GIA ++G  VA+F+++DA RL+ 
Sbjct: 238 VACGASIVELRIKAPTWVGQVLRQLAAESSYRGLVAMGIAGVEGAPVAAFQEEDAARLVS 297

Query: 297 F-CTRQGKADHTENSVLTRPPSWLTSPAPSRKRSE--PCRESKGVESENVCNVRPKLNS- 352
             CT    +D+   S     P WL  PA SRKR +  P  +S    +E         N  
Sbjct: 298 LKCTWNDASDNDRLSNHLALPPWLAPPAASRKRLKLLPSVKSSDNFAEEDRKASKASNGG 357

Query: 353 ------AAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRK 406
                 AAM+P+PHT   K +PF+      ++ G  +K ++     +  SA P       
Sbjct: 358 GNIALLAAMKPLPHTGRRKFMPFANAVMAAQHAGWSMKLDVSSRQRRPESARPRSAYFMP 417

Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
             +          L PL +K HGC R  ++ C+EEEFL D++ FL  RGH RL+P  G+ 
Sbjct: 418 GSAGHNAGGAAPFLPPLNVKSHGCKRPHMERCTEEEFLADLVSFLESRGHGRLIPPAGVD 477

Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
            FP+ +LN KRLDL+NLYREVVSRGGFHVGNGINWKGQ+F+KMRNHT  N+MTGVGNTLK
Sbjct: 478 AFPEVVLNGKRLDLYNLYREVVSRGGFHVGNGINWKGQIFAKMRNHTSVNKMTGVGNTLK 537

Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
           +HYE YLLEYELAHDDV G+CC++CHS A GDWVNCG C EWAHFGCD+R GLGAFKDYA
Sbjct: 538 KHYEVYLLEYELAHDDVGGDCCIICHSGAEGDWVNCGSCSEWAHFGCDQRTGLGAFKDYA 597

Query: 587 KTDGLEYVCPQCSVTNFKKKSQKT 610
           KTDGLEY+CP+CS  N K   ++ 
Sbjct: 598 KTDGLEYICPRCSAGNGKPNPRRA 621


>gi|168048799|ref|XP_001776853.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
           patens subsp. patens]
 gi|162671857|gb|EDQ58403.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
           patens subsp. patens]
          Length = 682

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 293/609 (48%), Positives = 386/609 (63%), Gaps = 20/609 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           QGVP+VIYWK + +   A HF QALL+ ++SS +   DAFQ+A+ASF+++C +      +
Sbjct: 83  QGVPHVIYWKGAPTLTLASHFRQALLAALRSSATEAKDAFQIANASFQIHCGQTKTASVN 142

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPA----IKIYDDDVTMRFLVC 117
             +K S+ L   L    PK     S      EE+  + +PA    ++IYDDD+ +R LVC
Sbjct: 143 GHEKASNVLPVLLAPADPKASDEDSSTSAD-EEDEKQTIPADSAPVQIYDDDINIRLLVC 201

Query: 118 GVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLST 177
                   +    LE GL+ALL IE     LH+  SAP PP  A +  RGVVTMRCD+ T
Sbjct: 202 SEASRPTPAWFAALETGLSALLTIE-----LHHTCSAPAPPQAASSLHRGVVTMRCDICT 256

Query: 178 CSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEP-RKSAS 236
            + A I+LLVSG+AQTCF+DQLLEN ++ EL+E    +  + +S   +   +   R+S S
Sbjct: 257 TTFARIALLVSGAAQTCFDDQLLENSVRKELLERCVKIRLVEDSESQKPRLNHNLRRSVS 316

Query: 237 IACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLF 296
           IA G+SV E+ ++  TW  QVLRQLA + SYRSLV LGIA ++G  V +F K+D+  L+ 
Sbjct: 317 IASGSSVVELKIRAPTWVGQVLRQLASEPSYRSLVALGIAGVEGAPVNAFLKEDSAHLVS 376

Query: 297 FCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSE--PCRESKGVESENVCNVRPKLNSAA 354
               +      E    +  PSWLT P  SRKR        +  ++  +  + R   + AA
Sbjct: 377 LKRSRDCPSIKEIPSSSSVPSWLTPPPASRKRMRVISSHSTDSLKEHDRRSSRANAD-AA 435

Query: 355 MRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQA 414
           M+P+PH+   + +PF+      ++ G  +K  L             P T R   +++  +
Sbjct: 436 MKPVPHSNRRRFMPFANAVMAAQHAGWTMK--LDQTTRVERRRQRRPHTGRPRTATTQGS 493

Query: 415 QQII----SLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPD 470
             ++    S+ P  +K HGC R P++ C+E+EFL+D++ FL  RGH RLVP  GL  FP+
Sbjct: 494 GGLVNPHLSITPAMVKPHGCSRPPMEECTEDEFLQDLVNFLESRGHNRLVPPAGLEAFPE 553

Query: 471 AILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYE 530
            +LN KRLDL+NLY+EVVSRGGFHVGNGINWKGQVFSKM NHT  N+MTGVGNTLK+HYE
Sbjct: 554 VVLNGKRLDLYNLYKEVVSRGGFHVGNGINWKGQVFSKMHNHTSVNKMTGVGNTLKKHYE 613

Query: 531 TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDG 590
           TYLLEYELAHDDVDGECC+LCHS + GDWVNCG CGEWAHFGCD R GLGAFKDYAKTDG
Sbjct: 614 TYLLEYELAHDDVDGECCILCHSGSEGDWVNCGSCGEWAHFGCDHRLGLGAFKDYAKTDG 673

Query: 591 LEYVCPQCS 599
           LEY+CP CS
Sbjct: 674 LEYICPCCS 682


>gi|226508456|ref|NP_001146332.1| uncharacterized protein LOC100279908 [Zea mays]
 gi|219886665|gb|ACL53707.1| unknown [Zea mays]
          Length = 407

 Score =  515 bits (1326), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 254/385 (65%), Positives = 286/385 (74%), Gaps = 19/385 (4%)

Query: 236 SIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLL 295
           S ACGAS FEV M +  WA+QVL+ LAPD+SYRSLV LGI  I G  VASFE+ DA+RLL
Sbjct: 2   STACGASTFEVWMSLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFERRDADRLL 61

Query: 296 FFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESE------- 341
           FFCT Q K    EN      P W  S + ++ R++   ESK       GV  +       
Sbjct: 62  FFCTSQCKDLANENGPYFHLPRW--SASLTKDRTKVGSESKQNLLGANGVLEDKKHMMEG 119

Query: 342 --NVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAG 398
             +   V+ KL  A MRPIPH+R  +M PF GF E   ++   VK +LP AP +KH+S  
Sbjct: 120 PSSWSAVKAKLKPATMRPIPHSRKQQMHPFMGFPETSLHETSIVKPSLPAAPTVKHNSVS 179

Query: 399 PTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTR 458
               THRKS S       +I LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH R
Sbjct: 180 SATTTHRKSTSGPSHTPSVIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHNR 239

Query: 459 LVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRM 518
           LVP GGLAEFPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRM
Sbjct: 240 LVPHGGLAEFPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRM 299

Query: 519 TGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQG 578
           TGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQG
Sbjct: 300 TGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQG 359

Query: 579 LGAFKDYAKTDGLEYVCPQCSVTNF 603
           LG FKDYAKTDGLEY+CP CS+ N+
Sbjct: 360 LGTFKDYAKTDGLEYICPHCSLANY 384


>gi|408690212|gb|AFU81566.1| ARID-type transcription factor, partial [Zea mays subsp. mays]
 gi|414885599|tpg|DAA61613.1| TPA: hypothetical protein ZEAMMB73_042311 [Zea mays]
          Length = 292

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 197/259 (76%), Positives = 216/259 (83%), Gaps = 1/259 (0%)

Query: 346 VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTH 404
           V+ KL  A MRPIPH+R  +M PF GF E   ++   VK +LP AP +KH+S      TH
Sbjct: 11  VKAKLKPATMRPIPHSRKQQMHPFMGFPETSLHETSIVKPSLPAAPTVKHNSVSSATTTH 70

Query: 405 RKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGG 464
           RKS S       +I LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GG
Sbjct: 71  RKSTSGPSHTPSVIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHNRLVPHGG 130

Query: 465 LAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNT 524
           LAEFPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNT
Sbjct: 131 LAEFPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRMTGVGNT 190

Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKD 584
           LKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKD
Sbjct: 191 LKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKD 250

Query: 585 YAKTDGLEYVCPQCSVTNF 603
           YAKTDGLEY+CP CS+ N+
Sbjct: 251 YAKTDGLEYICPHCSLANY 269


>gi|224034479|gb|ACN36315.1| unknown [Zea mays]
          Length = 273

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 193/263 (73%), Positives = 216/263 (82%), Gaps = 3/263 (1%)

Query: 355 MRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRKSLSSSYQ 413
           MRPIPH++  +M PF    E   ++   VK +L VAP +KH+S    P THRKS S    
Sbjct: 1   MRPIPHSQKQQMHPFMCLPETSFHETSIVKPSLTVAPTVKHNSVSYAPTTHRKSTSGPSH 60

Query: 414 AQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL 473
              +I LNPLP+KKHGC R  IQVCSEE+FL+DVMQFLI RGH RLVP GGLAEFPDA+L
Sbjct: 61  TPSVIQLNPLPMKKHGCDRLSIQVCSEEDFLKDVMQFLIQRGHNRLVPHGGLAEFPDAVL 120

Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYL 533
           NAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNTLKRHYETYL
Sbjct: 121 NAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYL 180

Query: 534 LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
           LEYELAHDDVDGECCLLCHSSA GDW+NCG+CGEWAHFGCDRRQGLG FKDYAKTDGLEY
Sbjct: 181 LEYELAHDDVDGECCLLCHSSAPGDWMNCGLCGEWAHFGCDRRQGLGTFKDYAKTDGLEY 240

Query: 594 VCPQCSVTNFKKK--SQKTSNGY 614
           +CP CS+ N+KK     K +NG+
Sbjct: 241 ICPHCSLANYKKPPLPPKVANGF 263


>gi|408690214|gb|AFU81567.1| ARID-type transcription factor, partial [Zea mays subsp. mays]
 gi|414870277|tpg|DAA48834.1| TPA: hypothetical protein ZEAMMB73_381937 [Zea mays]
          Length = 287

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 184/277 (66%), Positives = 222/277 (80%), Gaps = 7/277 (2%)

Query: 335 SKGVESEN---VCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP 391
           +KGV  +N   + +++P+L  A MRP+PH+R  +M PF GF +   +D  QVK +LP  P
Sbjct: 5   AKGVVGDNTISLSSLKPRLKPATMRPLPHSRKQQMHPFMGFPQSIIHDASQVKPSLPAPP 64

Query: 392 LKHSSAGPTPVTHRKSLSSSY----QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDV 447
           +KH++   T VT RKS S S     + Q  I LNPLP+KKH C R PI  CSEE+FL+DV
Sbjct: 65  VKHNAVPVTAVTPRKSSSGSPSSWSRVQLPIPLNPLPIKKHECNRLPIHSCSEEDFLKDV 124

Query: 448 MQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFS 507
           MQFL+ RGH RLVPQGGLAEFPDA+LN+KRLDL+NLY+EVV RGGF+VGNGINWKGQ+FS
Sbjct: 125 MQFLLQRGHIRLVPQGGLAEFPDAVLNSKRLDLYNLYKEVVYRGGFYVGNGINWKGQIFS 184

Query: 508 KMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGE 567
           KM NHT+TN+MTGVGNTLKRHYETYLLEYELAHDD+DGECCL+CHS A GDWVNCG+CGE
Sbjct: 185 KMHNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDIDGECCLICHSGALGDWVNCGLCGE 244

Query: 568 WAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFK 604
           WAH GCDRR+GL  FKDY+K DG+EY+CPQCS+  +K
Sbjct: 245 WAHLGCDRRKGLSTFKDYSKADGMEYICPQCSLAKYK 281


>gi|115476706|ref|NP_001061949.1| Os08g0451700 [Oryza sativa Japonica Group]
 gi|42407649|dbj|BAD08781.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623918|dbj|BAF23863.1| Os08g0451700 [Oryza sativa Japonica Group]
 gi|215678864|dbj|BAG95301.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215695192|dbj|BAG90383.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 464

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 175/336 (52%), Positives = 224/336 (66%), Gaps = 11/336 (3%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+ Y++YW++S S YAA HF  +LLSVV+SSCSH WDAFQLA+ASF  YCVRNN V   
Sbjct: 136 KGISYIMYWRYSLSSYAASHFRHSLLSVVRSSCSHAWDAFQLAYASFEQYCVRNNDVQ-- 193

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                   LGPHLLGD P+I I      +  EE++ E  P IKIYD+DV ++ L+CG  C
Sbjct: 194 -----RLMLGPHLLGDAPRIYITPPGNKMAEEEDTSEYFPDIKIYDEDVHLKLLICGAHC 248

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           T D+S+L  LEDGLNALLNIE R  KL +R SA PP     A   GVVT+ CD++T SS+
Sbjct: 249 TPDSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSALLDGVVTICCDITTSSSS 308

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+SLL+SGS QTCF+D+LLE HIK ELIE+ +LV  +  S D+    +EP  S S+A GA
Sbjct: 309 HVSLLLSGSPQTCFDDELLEKHIKKELIESRRLVRVVSVSEDDGPSSAEPLTSMSVASGA 368

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LA + SY+SLV LGIAS+    V+SF+K+D +RLLFFCT Q
Sbjct: 369 STFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNDTPVSSFDKEDVDRLLFFCTNQ 428

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR----SEPCR 333
            + +   N +    P W  S A  R +    S+P R
Sbjct: 429 DEDEAIGNGLYHHLPRWSASLAKDRVKRNFVSKPAR 464


>gi|222640664|gb|EEE68796.1| hypothetical protein OsJ_27535 [Oryza sativa Japonica Group]
          Length = 684

 Score =  311 bits (797), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 172/327 (52%), Positives = 220/327 (67%), Gaps = 7/327 (2%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+ Y++YW++S S YAA HF  +LLSVV+SSCSH WDAFQLA+ASF  YCVRNN V   
Sbjct: 144 KGISYIMYWRYSLSSYAASHFRHSLLSVVRSSCSHAWDAFQLAYASFEQYCVRNNDVQ-- 201

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                   LGPHLLGD P+I I      +  EE++ E  P IKIYD+DV ++ L+CG  C
Sbjct: 202 -----RLMLGPHLLGDAPRIYITPPGNKMAEEEDTSEYFPDIKIYDEDVHLKLLICGAHC 256

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
           T D+S+L  LEDGLNALLNIE R  KL +R SA PP     A   GVVT+ CD++T SS+
Sbjct: 257 TPDSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSALLDGVVTICCDITTSSSS 316

Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
           H+SLL+SGS QTCF+D+LLE HIK ELIE+ +LV  +  S D+    +EP  S S+A GA
Sbjct: 317 HVSLLLSGSPQTCFDDELLEKHIKKELIESRRLVRVVSVSEDDGPSSAEPLTSMSVASGA 376

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LA + SY+SLV LGIAS+    V+SF+K+D +RLLFFCT Q
Sbjct: 377 STFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNDTPVSSFDKEDVDRLLFFCTNQ 436

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR 328
            + +   N +    P W  S A  R +
Sbjct: 437 DEDEAIGNGLYHHLPRWSASLAKDRVK 463


>gi|115476084|ref|NP_001061638.1| Os08g0361000 [Oryza sativa Japonica Group]
 gi|113623607|dbj|BAF23552.1| Os08g0361000, partial [Oryza sativa Japonica Group]
          Length = 325

 Score =  308 bits (788), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 178/336 (52%), Positives = 225/336 (66%), Gaps = 16/336 (4%)

Query: 3   GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
           G+ Y++YW++S S YAA HF  +L+SVVQSSCSH WDAFQ+A+A+F+LYCVRNN V    
Sbjct: 1   GISYIMYWRYSLSSYAASHFRHSLMSVVQSSCSHAWDAFQVAYATFQLYCVRNNEVQ--- 57

Query: 63  SQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCT 122
                  LGPHLLGD P+I I     ++  EE++ E  P IKIYD+DV ++ L+CG  CT
Sbjct: 58  ----RLMLGPHLLGDAPRIYITPPGNEIAEEEDTSEYFPDIKIYDEDVNLKLLICGAHCT 113

Query: 123 LDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAH 182
           LD+SLL  LEDGLNALLNIE R  KL +R SA PP         G+VT+ CD++T SS+H
Sbjct: 114 LDSSLLNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGMVTICCDITTSSSSH 173

Query: 183 ISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPS-EPRKSASIACGA 241
           +SLL+SGS QTCF+D+LLE HIK ELIE+ QLV  +  S D   P S EP  S S+A GA
Sbjct: 174 VSLLLSGSPQTCFDDKLLEKHIKKELIESRQLVRVVSVSEDG--PSSAEPLTSMSVASGA 231

Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
           S FEV M +  WA+QVL+ LA + SY+SLV LGIAS+ G  V SF+  D + LLFF T Q
Sbjct: 232 STFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVCSFDSQDVDWLLFFRTIQ 291

Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR----SEPCR 333
            +A  T  S+   PP W  S A +R +    S+P R
Sbjct: 292 DEAIGT--SLYPHPPRWSASLAKNRVKGSMVSKPAR 325


>gi|384251328|gb|EIE24806.1| hypothetical protein COCSUDRAFT_46949 [Coccomyxa subellipsoidea
           C-169]
          Length = 708

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 201/623 (32%), Positives = 278/623 (44%), Gaps = 79/623 (12%)

Query: 2   QGVPYVIYWKHSFS----CYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNI 57
           QGV +V+YW    +          F  A  + +++  +   +AF +A  S   +C  N  
Sbjct: 135 QGVAHVVYWDKKLTPSPKALHVTQFSHAFFATLRNRSATIPEAFAVALHSGHAHCGSNPG 194

Query: 58  VMASNSQKGSSKLGPHLLGD----PPKIDIA----LSEMDVQ-GEENSPENLPAIKIYDD 108
            M   S        P L+ D     P ID      +   D + G   +      +++   
Sbjct: 195 PMLRPSWL------PELVSDFEPLLPSIDSVPLPLVRGADFKNGVAAAVPGWENVRLLAP 248

Query: 109 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKL--HNRTSAPPPPLQAGAFSR 166
              +R LVCG    +D   L  L + L ALL +E+R   L  HN     P  L A A   
Sbjct: 249 HAELRLLVCGQSTIIDAHRLSYLGEALRALLVLEVRQLVLASHNPCERTPAHLPANA--- 305

Query: 167 GVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENS-QLVHALPNSGDNR 225
                RC +   S+AH ++++ G  Q   +D L+E  ++  L+ ++  L   LP +G   
Sbjct: 306 --TATRCSMRWASNAHATVVLGGPPQVLGHDSLVEYALRQTLVADALSLQFRLPPAG--- 360

Query: 226 LPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVAS 285
           +P    R+SA+IA   +V +  +  S WA  +LR L+   S+R LV LG   +    VA 
Sbjct: 361 IPAPVARRSAAIANNTAVVDALVVTSVWAVHLLRTLSQYTSFRGLVALGFGGVGSARVAG 420

Query: 286 FEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSP-----APSRKRSEPCRES-KGVE 339
           F   DA R           +  + +           P     APS   S P   + K  +
Sbjct: 421 FNSTDAARYSALSDAYDTPEALQAAGALPAALPGGQPGAGPSAPSGPASYPTMLAPKHPD 480

Query: 340 SENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGP 399
           ++   + +P  +S+  +P     H K          E  D D+              AG 
Sbjct: 481 AQKDADAQPGGSSSQAQPA--NGHAK----------EEADEDE-------------GAGA 515

Query: 400 TPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRL 459
                RK   ++      +S  P            +  CSE +FL D+  F   R    +
Sbjct: 516 GAAKRRKLTEAAAAVPVWVSKRPR-----------MWECSEAQFLEDLCAFHCERNGKNV 564

Query: 460 VPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMT 519
            P+     FPDAILN  RLDL+NLY+EV SRGG  VGNGINWKGQVF KM N T  NRMT
Sbjct: 565 TPE----TFPDAILNGSRLDLYNLYKEVTSRGGIKVGNGINWKGQVFPKMNNFTAHNRMT 620

Query: 520 GVGNTLKRHYETYLLEYELAH-DDVDGECCLLCHSS--AAGDWVNCGICGEWAHFGCDRR 576
           GVGN LKRHY  YLLEYE AH +DV  + C +C  S  AA DW++C  C  W HF CD R
Sbjct: 621 GVGNALKRHYTMYLLEYEQAHPEDVIRDTCAICGGSDEAATDWISCDSCNNWVHFSCDGR 680

Query: 577 QGLGAFKDYAKTDGLEYVCPQCS 599
             LG FKDYAK +G  Y C  C+
Sbjct: 681 TNLGTFKDYAKGNGATYNCINCA 703


>gi|222640433|gb|EEE68565.1| hypothetical protein OsJ_27051 [Oryza sativa Japonica Group]
          Length = 349

 Score =  243 bits (621), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 149/287 (51%), Positives = 188/287 (65%), Gaps = 12/287 (4%)

Query: 43  LAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPA 102
           +A+A+F+LYCVRNN V           LGPHLLGD P+I I     ++  EE++ E  P 
Sbjct: 66  VAYATFQLYCVRNNEVQ-------RLMLGPHLLGDAPRIYITPPGNEIAEEEDTSEYFPD 118

Query: 103 IKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG 162
           IKIYD+DV ++ L+CG  CTLD+SLL  LEDGLNALLNIE R  KL +R SA PP     
Sbjct: 119 IKIYDEDVNLKLLICGAHCTLDSSLLNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDS 178

Query: 163 AFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSG 222
               G+VT+ CD++T SS+H+SLL+SGS QTCF+D+LLE HIK ELIE+ QLV  +  S 
Sbjct: 179 TLLDGMVTICCDITTSSSSHVSLLLSGSPQTCFDDKLLEKHIKKELIESRQLVRVVSVSE 238

Query: 223 DNRLPPS-EPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGL 281
           D   P S EP  S S+A GAS FEV M +  WA+QVL+ LA + SY+SLV LGIAS+ G 
Sbjct: 239 DG--PSSAEPLTSMSVASGASTFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGT 296

Query: 282 SVASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKR 328
            V SF+  D + LLFF T Q +A  T  S+   PP W  S A +R +
Sbjct: 297 PVCSFDSQDVDWLLFFRTIQDEAIGT--SLYPHPPRWSASLAKNRVK 341


>gi|414870278|tpg|DAA48835.1| TPA: hypothetical protein ZEAMMB73_381937 [Zea mays]
          Length = 137

 Score =  238 bits (606), Expect = 1e-59,   Method: Composition-based stats.
 Identities = 95/119 (79%), Positives = 109/119 (91%)

Query: 486 EVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDG 545
           +VV RGGF+VGNGINWKGQ+FSKM NHT+TN+MTGVGNTLKRHYETYLLEYELAHDD+DG
Sbjct: 13  QVVYRGGFYVGNGINWKGQIFSKMHNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDIDG 72

Query: 546 ECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFK 604
           ECCL+CHS A GDWVNCG+CGEWAH GCDRR+GL  FKDY+K DG+EY+CPQCS+  +K
Sbjct: 73  ECCLICHSGALGDWVNCGLCGEWAHLGCDRRKGLSTFKDYSKADGMEYICPQCSLAKYK 131


>gi|218201035|gb|EEC83462.1| hypothetical protein OsI_28965 [Oryza sativa Indica Group]
          Length = 522

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 188/316 (59%), Gaps = 33/316 (10%)

Query: 33  SCSHTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQG 92
           SCSH WDAFQ+A+A+F LYCVRNN V           LGPHLLGD P+I I     ++  
Sbjct: 212 SCSHAWDAFQVAYATFELYCVRNNEVQ-------RLMLGPHLLGDAPRIYITPPGNEMAE 264

Query: 93  EENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIE---------- 142
           EE++ E  P IKIYD++V ++ L+CG  CTLD+SLL  LEDGLNALLNIE          
Sbjct: 265 EEDTSEYFPDIKIYDENVNLKLLICGAHCTLDSSLLNSLEDGLNALLNIETMQHTHIEKL 324

Query: 143 --------IRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTC 194
                    R  KL +R SA PP         G+VT+ CD++T SS+    L+SGS Q C
Sbjct: 325 SSGGGDWWFRWCKLQDRVSAAPPLHVDSTLLDGMVTICCDITT-SSSSHVSLLSGSPQIC 383

Query: 195 FNDQLLENHIKNELIENSQLVHALPNSGDNRLPPS--EPRKSASIACGASVFEVSMKVST 252
           F+D+LLE HIK ELI++ +LV  +  S D    PS  EP  S S+A GAS FEV M +  
Sbjct: 384 FDDKLLEKHIKKELIDSRRLVRVVSVSKDG---PSSAEPLTSMSVASGASTFEVLMTLPR 440

Query: 253 WASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVL 312
           W +QVL+ LA + SY+SLV LGIAS+ G  V  F+  D + LLFF T Q +A  T  S+ 
Sbjct: 441 WVAQVLKYLAQETSYKSLVPLGIASVNGTPVCLFDSQDVDWLLFFRTIQDEAIGT--SLY 498

Query: 313 TRPPSWLTSPAPSRKR 328
             PP W  S A  R +
Sbjct: 499 PHPPRWSASLAKDRVK 514


>gi|147799373|emb|CAN61485.1| hypothetical protein VITISV_043023 [Vitis vinifera]
          Length = 106

 Score =  236 bits (601), Expect = 4e-59,   Method: Composition-based stats.
 Identities = 101/106 (95%), Positives = 105/106 (99%)

Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
           MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW
Sbjct: 1   MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 60

Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
           AHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+TNF+KKSQKT+NGY
Sbjct: 61  AHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFQKKSQKTANGY 106


>gi|218201241|gb|EEC83668.1| hypothetical protein OsI_29441 [Oryza sativa Indica Group]
          Length = 736

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 142/267 (53%), Positives = 176/267 (65%), Gaps = 9/267 (3%)

Query: 19  ACHFLQALLSVVQSSCS--HTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLG 76
           A HF  AL+SV     S  H WDAFQLA+ASF  YCVRNN V           LGPHLLG
Sbjct: 247 ASHFCHALVSVSVCQSSCSHAWDAFQLAYASFEQYCVRNNEVQCL-------MLGPHLLG 299

Query: 77  DPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLN 136
           D P+I +      +  EE++ E  P IKIYD+DV ++ L+CG  CT D+S+L  +EDGLN
Sbjct: 300 DAPRIYMTPPGSKMAEEEDTSEYFPDIKIYDEDVHLKLLICGAHCTPDSSVLNSVEDGLN 359

Query: 137 ALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFN 196
           ALLNIE R  KL +R SA PP     A   GVVT+ CD++T SS+H+SLL+SGS QTCF+
Sbjct: 360 ALLNIEFRWCKLQDRVSAAPPLHVDSALLDGVVTICCDITTSSSSHVSLLLSGSPQTCFD 419

Query: 197 DQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQ 256
           D+LLE HIK ELIE  +LV  +  S D+R   +EP  S S+A GAS FEV M +  WA+Q
Sbjct: 420 DELLEKHIKKELIETRRLVRVVSVSEDDRPSSAEPLTSMSVASGASTFEVLMTLPKWAAQ 479

Query: 257 VLRQLAPDVSYRSLVMLGIASIQGLSV 283
           VL+ LA + SY+SLV LGIAS+ G  V
Sbjct: 480 VLKYLAQETSYKSLVPLGIASVNGCYV 506



 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 120/236 (50%), Positives = 158/236 (66%)

Query: 93  EENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRT 152
           EE++ E  P I IYD+DV ++ L+CG  CT D+S+L  LEDGLNALLNIE R  KL +R 
Sbjct: 4   EEDTSEYFPDITIYDEDVHLKLLICGAHCTPDSSILNSLEDGLNALLNIEFRWCKLQDRV 63

Query: 153 SAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENS 212
           SA  P     A   GVVT+ CD++T SS+H+SLL+SGS QTCF+D+LLE HIK ELIE+ 
Sbjct: 64  SAVAPLHVDSALLDGVVTICCDITTSSSSHVSLLLSGSPQTCFDDELLEKHIKKELIESR 123

Query: 213 QLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVM 272
           +LV  +  S D+    +EP  S S+A GAS FEV M +  WA+QVL+ LA + SY+SLV 
Sbjct: 124 RLVRVVSVSEDDGPSSAEPLTSMSVASGASTFEVLMTLPKWAAQVLKYLAQETSYKSLVP 183

Query: 273 LGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKR 328
           LGIAS+ G  V+SF+K+D +RLLFF T Q + +   + +    P W  S A  R +
Sbjct: 184 LGIASVNGTPVSSFDKEDVDRLLFFYTNQDEDEAIGSGLYHHLPRWSASLAKDRVK 239


>gi|413936076|gb|AFW70627.1| hypothetical protein ZEAMMB73_228285 [Zea mays]
          Length = 933

 Score =  213 bits (542), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 113/208 (54%), Positives = 141/208 (67%), Gaps = 1/208 (0%)

Query: 89  DVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKL 148
           +V  EE   E   AI+I+D+DV ++FL+CGVPCTLD  LLG L+DGLNALLNI+I  SKL
Sbjct: 307 EVDEEEGGFEGFSAIRIHDEDVNLKFLLCGVPCTLDACLLGTLKDGLNALLNIDICASKL 366

Query: 149 HNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEL 208
            NR SA PP LQA   S GV T RC+++T SSAH++ LVSGSAQTCF DQLL + IKNE+
Sbjct: 367 QNRASAHPP-LQAEDLSHGVATRRCEITTWSSAHVAYLVSGSAQTCFGDQLLGSRIKNEI 425

Query: 209 IENSQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYR 268
           IE  QLV  L N+ D +    EP  S  +AC  S FE+ + +  WA+QVL+ LAP++SYR
Sbjct: 426 IEKRQLVCVLLNNDDIKTSSFEPWPSMGVACETSTFELWITMPKWAAQVLKHLAPEISYR 485

Query: 269 SLVMLGIASIQGLSVASFEKDDAERLLF 296
           SLV LGIA I   +      +    LLF
Sbjct: 486 SLVALGIAWINACTRQEVRLECKNVLLF 513


>gi|30841464|gb|AAP34368.1| fiber protein Fb21 [Gossypium barbadense]
          Length = 104

 Score =  213 bits (541), Expect = 3e-52,   Method: Composition-based stats.
 Identities = 93/102 (91%), Positives = 96/102 (94%)

Query: 513 TLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFG 572
           TLTNRMTGVGNTLKRHYET LLEYELAHDDVDGECCLLCHSSAAGDWVNCG+C EWAHFG
Sbjct: 3   TLTNRMTGVGNTLKRHYETSLLEYELAHDDVDGECCLLCHSSAAGDWVNCGVCDEWAHFG 62

Query: 573 CDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
           CDRRQGLGAFKDYAKTDGLEYVCP CS++NFKKK  KT NGY
Sbjct: 63  CDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPHKTVNGY 104


>gi|159489410|ref|XP_001702690.1| DNA binding protein [Chlamydomonas reinhardtii]
 gi|158280712|gb|EDP06469.1| DNA binding protein [Chlamydomonas reinhardtii]
          Length = 729

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 96/178 (53%), Positives = 121/178 (67%), Gaps = 11/178 (6%)

Query: 428 HGCGRAPIQVCSEEEFLRDVMQFL-ILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYRE 486
           + C R P+  CSE +F  D+ +FL +LRG T        ++FP+A+LN   LDLF LYRE
Sbjct: 542 YTCRRPPLSACSEAQFYDDLCEFLTLLRGKTV-----ERSKFPEAVLNGVSLDLFALYRE 596

Query: 487 VVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DDVDG 545
           VVSRGGF VGNGINWKGQVF +MRN T +N+ TGVGN LKRHY+ YL EYE+AH +DV  
Sbjct: 597 VVSRGGFRVGNGINWKGQVFPRMRNWTESNKQTGVGNALKRHYQNYLWEYEVAHPEDVTL 656

Query: 546 ECCLLCHS----SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
           + C+LC++      A DW+ C  C  W H  CD+R GLG +KDY + +G  YVCP CS
Sbjct: 657 DRCVLCNARDREGGAADWLCCDCCENWVHHSCDKRPGLGQYKDYTQGNGRVYVCPSCS 714



 Score = 79.0 bits (193), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 136/309 (44%), Gaps = 28/309 (9%)

Query: 2   QGVPYVIYWKH--SFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVM 59
           +GV  V+ W    +     A H+  A  + + ++ +   +AF  A  S + +C    +V+
Sbjct: 128 RGVANVVVWPAGTAIPTAVAAHYNCAFAAFLINTLASPPEAFAAASHSTQAHC----MVL 183

Query: 60  ASNSQKGSSKLGPHLLG-------DPPKIDIALSEMDVQGEENSPENLPAIKIYDD---- 108
            S+       L P  +G       D   I        +Q ++  P NLPA   + D    
Sbjct: 184 GSDGTHTVPAL-PAFVGAARPALPDSASIPPLAVPPGLQ-QQLDPNNLPAFPGWTDVRLL 241

Query: 109 --DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSR 166
                +R L+ G    +D++ L  L + L ALL +E RG +L  RT     P        
Sbjct: 242 APRAELRLLLTGGSGLVDSTRLSFLGEALRALLVMEARGLQLTARTPLDKLPRN---LPA 298

Query: 167 GVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENS-QLVHALPNSGDNR 225
           G    RCD +T S A  S+++        +  L+E+ ++  L+ ++  L   LP  G + 
Sbjct: 299 GCAAARCDYTTSSGAAGSVVLGAPPAVLEHPTLVEHALRMTLVTDAVSLQFRLPPPGLS- 357

Query: 226 LPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVAS 285
           LP   PR S ++A G  V +  +  S WA +VLR L  +  YRSL  LG+A++   +  +
Sbjct: 358 LP--TPRSSKAVAGGTPVVDTVVVTSVWAVEVLRALCKEPRYRSLAALGVAAVGNAADCA 415

Query: 286 FEKDDAERL 294
           F   D  R+
Sbjct: 416 FTPADVARM 424


>gi|194705224|gb|ACF86696.1| unknown [Zea mays]
          Length = 118

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 86/95 (90%), Positives = 90/95 (94%)

Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
           MRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEW
Sbjct: 1   MRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEW 60

Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
           AHFGCDRRQGLG FKDYAKTDGLEY+CP CS+ N+
Sbjct: 61  AHFGCDRRQGLGTFKDYAKTDGLEYICPHCSLANY 95


>gi|115479313|ref|NP_001063250.1| Os09g0434100 [Oryza sativa Japonica Group]
 gi|113631483|dbj|BAF25164.1| Os09g0434100 [Oryza sativa Japonica Group]
          Length = 295

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/148 (65%), Positives = 110/148 (74%), Gaps = 7/148 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYVIYW++SFS YAA HF  AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 136 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 193

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PKI+I   E ++  EE S +  PAIKIYDDD+ M+FL+CGVP 
Sbjct: 194 -----SVKLGPRLLGDAPKINITPPENEMVEEEGSSDVFPAIKIYDDDINMKFLLCGVPS 248

Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLH 149
           T D  LLG LEDGLNALLNIE+    L 
Sbjct: 249 TPDPCLLGSLEDGLNALLNIEVWNKALQ 276


>gi|302845046|ref|XP_002954062.1| hypothetical protein VOLCADRAFT_121257 [Volvox carteri f.
           nagariensis]
 gi|300260561|gb|EFJ44779.1| hypothetical protein VOLCADRAFT_121257 [Volvox carteri f.
           nagariensis]
          Length = 680

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/173 (54%), Positives = 116/173 (67%), Gaps = 9/173 (5%)

Query: 425 LKKHGCGRAPIQVCSEEEFLRDVMQFL-ILRGHTRLVPQGGLAEFPDAILNAKRLDLFNL 483
           L  + C R P+  C+E +F  D+++FL +LRG            FP+A+LN   LDLF L
Sbjct: 435 LPGYVCRRPPLSSCTESQFYDDLVEFLTLLRGKPI-----DRVRFPEAVLNGVSLDLFGL 489

Query: 484 YREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DD 542
           YREVV+RGGF VGNGINWKGQVF +MRN T TN+ TGVGN LKRHY+ YL EYELAH +D
Sbjct: 490 YREVVTRGGFRVGNGINWKGQVFPRMRNWTETNKQTGVGNALKRHYQNYLWEYELAHPED 549

Query: 543 VDGECCLLCHSSAA--GDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
           V  + C+LC+   A   DWV C  C  WAHF CD+R G+G FKDYA+  G  Y
Sbjct: 550 VTLDRCVLCNGGDAVSSDWVCCDACENWAHFSCDKRPGIGMFKDYAQGQGRVY 602



 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 61/216 (28%), Positives = 99/216 (45%), Gaps = 8/216 (3%)

Query: 79  PKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNAL 138
           P + +   E+D  G ++ P  +  +++      +R L+ G    +D   L  L + L AL
Sbjct: 57  PAVVVPGLELDEHGLKSFPGWM-DVRLLAPRAELRLLLVGTSSLIDARRLSSLGEALRAL 115

Query: 139 LNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQ 198
           L +E+R  KL +  +    P    A SR V   RC   T S     +++        ++ 
Sbjct: 116 LVMEVRTLKLVSLGAVDSLPRNLPAGSRAV---RCQFLTASGVSAGVVLGTPPSVLQHEA 172

Query: 199 LLENHIKNELIENS-QLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQV 257
           L+E+ ++  L+ +S  L   LP  G + LP   PR S  +A G  V +     S WA +V
Sbjct: 173 LVEHALRMTLVTDSVGLQFRLPPPGLS-LP--APRSSRLVAGGLPVMDTVAVTSVWAVEV 229

Query: 258 LRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAER 293
           LR L  D  YR+L  LG+A++  ++   F   D  R
Sbjct: 230 LRSLCKDPRYRTLAALGVAAVGNVADGVFTAADVRR 265


>gi|297608671|ref|NP_001061947.2| Os08g0451500 [Oryza sativa Japonica Group]
 gi|255678493|dbj|BAF23861.2| Os08g0451500 [Oryza sativa Japonica Group]
          Length = 298

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 111/214 (51%), Positives = 142/214 (66%), Gaps = 4/214 (1%)

Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
           D+S+L  LEDGLNALLNIE R  KL +R SA PP         GVVT+ CD++T SS+H+
Sbjct: 85  DSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 144

Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
           SLL SGS QTCF+D+LLE HIK ELIE+ +LV  +  S D+R   +EP  S S+A GAS 
Sbjct: 145 SLLHSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 204

Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
           FEV M +  WA+QVL+ LA + SY+SLV LGIAS+ G  V+SF++ D +RLLFFCT Q +
Sbjct: 205 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 264

Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR----SEPCR 333
            +   N +   PP W  S A  R +    S+P R
Sbjct: 265 DEAIANGLYHHPPRWSASLAKDRVKGNMVSKPAR 298


>gi|42407647|dbj|BAD08779.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 301

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 108/205 (52%), Positives = 138/205 (67%)

Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
           D+S+L  LEDGLNALLNIE R  KL +R SA PP         GVVT+ CD++T SS+H+
Sbjct: 85  DSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 144

Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
           SLL SGS QTCF+D+LLE HIK ELIE+ +LV  +  S D+R   +EP  S S+A GAS 
Sbjct: 145 SLLHSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 204

Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
           FEV M +  WA+QVL+ LA + SY+SLV LGIAS+ G  V+SF++ D +RLLFFCT Q +
Sbjct: 205 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 264

Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR 328
            +   N +   PP W  S A  R +
Sbjct: 265 DEAIANGLYHHPPRWSASLAKDRVK 289


>gi|222640663|gb|EEE68795.1| hypothetical protein OsJ_27534 [Oryza sativa Japonica Group]
          Length = 353

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 108/205 (52%), Positives = 138/205 (67%)

Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
           D+S+L  LEDGLNALLNIE R  KL +R SA PP         GVVT+ CD++T SS+H+
Sbjct: 137 DSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 196

Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
           SLL SGS QTCF+D+LLE HIK ELIE+ +LV  +  S D+R   +EP  S S+A GAS 
Sbjct: 197 SLLHSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 256

Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
           FEV M +  WA+QVL+ LA + SY+SLV LGIAS+ G  V+SF++ D +RLLFFCT Q +
Sbjct: 257 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 316

Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR 328
            +   N +   PP W  S A  R +
Sbjct: 317 DEAIANGLYHHPPRWSASLAKDRVK 341


>gi|414883857|tpg|DAA59871.1| TPA: hypothetical protein ZEAMMB73_492055 [Zea mays]
          Length = 884

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/238 (46%), Positives = 142/238 (59%), Gaps = 10/238 (4%)

Query: 93  EENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRT 152
           EE  PE   AI++YD+DV M+FL+CGVPCTLD  L+G LEDGLN+LLNIEI  SKL NR 
Sbjct: 529 EEGCPEGFSAIRVYDEDVNMKFLLCGVPCTLDACLMGALEDGLNSLLNIEICASKLQNRV 588

Query: 153 SAPPPPLQAGAFSRGVV-TMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIEN 211
           SA P  LQA    + ++ +      TC  +   L      QTCF+DQLL + I+NE+IE 
Sbjct: 589 SAHPL-LQAEDLPQELLRSHNLQFCTCVVSCFWL-----RQTCFDDQLLGSRIRNEIIEK 642

Query: 212 SQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLV 271
            QLV  L  + DN+    E   S  +AC  S FEV + +  WA+QVL+ LAP++SYRSLV
Sbjct: 643 WQLVCVLLYNEDNKPSSFECWPSTGVACRDSTFEVWITMPKWAAQVLKHLAPEISYRSLV 702

Query: 272 MLGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRS 329
            LGI+ + G  V+SF+  DA RLLF C+ +       N       +W    AP  K S
Sbjct: 703 ALGISWVNGTPVSSFDWQDAGRLLFLCSNKCGDQAILNGSFAHGSNW---AAPLTKDS 757


>gi|414870276|tpg|DAA48833.1| TPA: hypothetical protein ZEAMMB73_656706 [Zea mays]
          Length = 276

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 90/141 (63%), Positives = 107/141 (75%), Gaps = 7/141 (4%)

Query: 2   QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
           +G+PYV+YW++ FS Y A HF  AL+SV+QSSCSHTWDAFQLAHASFRLYCVRNN V   
Sbjct: 137 KGIPYVVYWRNLFSSYTASHFRHALMSVIQSSCSHTWDAFQLAHASFRLYCVRNNHVQ-- 194

Query: 62  NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
                S KLGP LLGD PK ++  +  +V  E+   E  P I+IYD+DV+M+FL+CGVPC
Sbjct: 195 -----SVKLGPCLLGDAPKTNVIPAGDEVNEEQGCSEGFPDIRIYDEDVSMKFLLCGVPC 249

Query: 122 TLDTSLLGPLEDGLNALLNIE 142
           TLD  LLG LEDGLNALLNIE
Sbjct: 250 TLDACLLGALEDGLNALLNIE 270


>gi|218201240|gb|EEC83667.1| hypothetical protein OsI_29440 [Oryza sativa Indica Group]
          Length = 353

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 107/205 (52%), Positives = 138/205 (67%)

Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
           D+S+L  LEDG NALLNIE R  KL +R SA PP         GVVT+ CD++T SS+H+
Sbjct: 137 DSSILNSLEDGSNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 196

Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
           SLL+SGS QTCF+D+LLE HIK ELIE+ +LV  +  S D+R   +EP  S S+A GAS 
Sbjct: 197 SLLLSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 256

Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
           FEV M +  WA+QVL+ LA + SY+SLV LGIAS+ G  V+SF++ D +RLLFFCT Q +
Sbjct: 257 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 316

Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR 328
            +   N +   PP W  S A  R +
Sbjct: 317 DEAIANGLYHHPPRWSASLAKDRVK 341


>gi|345451024|gb|AEN94092.1| DNA binding protein [Pyrus pyrifolia]
          Length = 93

 Score =  171 bits (432), Expect = 1e-39,   Method: Composition-based stats.
 Identities = 71/82 (86%), Positives = 76/82 (92%)

Query: 533 LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLE 592
           L  YE AHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLE
Sbjct: 2   LTAYEFAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLE 61

Query: 593 YVCPQCSVTNFKKKSQKTSNGY 614
           Y+CP CS++NFKKK QK +NG+
Sbjct: 62  YICPHCSISNFKKKPQKIANGF 83


>gi|414870275|tpg|DAA48832.1| TPA: hypothetical protein ZEAMMB73_656706 [Zea mays]
          Length = 351

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/111 (65%), Positives = 83/111 (74%), Gaps = 7/111 (6%)

Query: 32  SSCSHTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQ 91
           SSCSHTWDAFQLAHASFRLYCVRNN V        S KLGP LLGD PK ++  +  +V 
Sbjct: 242 SSCSHTWDAFQLAHASFRLYCVRNNHVQ-------SVKLGPCLLGDAPKTNVIPAGDEVN 294

Query: 92  GEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIE 142
            E+   E  P I+IYD+DV+M+FL+CGVPCTLD  LLG LEDGLNALLNIE
Sbjct: 295 EEQGCSEGFPDIRIYDEDVSMKFLLCGVPCTLDACLLGALEDGLNALLNIE 345


>gi|414885600|tpg|DAA61614.1| TPA: hypothetical protein ZEAMMB73_011257, partial [Zea mays]
          Length = 91

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/97 (62%), Positives = 72/97 (74%), Gaps = 7/97 (7%)

Query: 27  LSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALS 86
           +S +QSS SHTWDAFQLAHASFRLYCVRNN V        S KLGP LLGD PKI+I+  
Sbjct: 1   MSAIQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-------SVKLGPRLLGDAPKINISPP 53

Query: 87  EMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTL 123
           E ++  EE S E  PAIKIYD+++ M+ L+CGVPCTL
Sbjct: 54  ETEMVDEEGSSEVTPAIKIYDEEINMKLLLCGVPCTL 90


>gi|307104284|gb|EFN52539.1| hypothetical protein CHLNCDRAFT_15191, partial [Chlorella
           variabilis]
          Length = 90

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/90 (54%), Positives = 62/90 (68%), Gaps = 3/90 (3%)

Query: 513 TLTNRMTGVGNTLKRHYETYLLEYELAH-DDVDGECCLLCH--SSAAGDWVNCGICGEWA 569
           T  ++MTGVGN LKRHY+ +LLEYE AH +DV G+ C +C      A DW++C +C  W 
Sbjct: 1   TAHHKMTGVGNALKRHYQVFLLEYEQAHPEDVTGDRCGICGRGDEHAADWLSCDMCDCWV 60

Query: 570 HFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
           HF CD RQG G+FKDY+K  G  Y CP+CS
Sbjct: 61  HFSCDTRQGRGSFKDYSKGRGRLYHCPRCS 90


>gi|357515069|ref|XP_003627823.1| hypothetical protein MTR_8g038730 [Medicago truncatula]
 gi|92885128|gb|ABE87648.1| DNA binding , related [Medicago truncatula]
 gi|355521845|gb|AET02299.1| hypothetical protein MTR_8g038730 [Medicago truncatula]
          Length = 204

 Score = 95.1 bits (235), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 40/59 (67%), Positives = 51/59 (86%)

Query: 1   MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVM 59
           ++G+PYV++WK++FS YAACHF QAL SVVQSS +HTWDAF LA ASF+ YCV+NN V+
Sbjct: 129 LKGIPYVVFWKNAFSQYAACHFHQALFSVVQSSSTHTWDAFHLARASFQPYCVQNNQVL 187


>gi|384246380|gb|EIE19870.1| hypothetical protein COCSUDRAFT_48713 [Coccomyxa subellipsoidea
           C-169]
          Length = 1111

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/127 (40%), Positives = 66/127 (51%), Gaps = 19/127 (14%)

Query: 430 CGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRL--DLFNLYREV 487
           C R P+ +C+E EF  D+  FL  RG   L       +  +  +N   L  D+F LYREV
Sbjct: 301 CRRPPLGICTEIEFEEDLYAFLSQRGEAELA-----NDLRNKRINCNNLPIDIFGLYREV 355

Query: 488 VSRGGFHVGN-----------GINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
           V  GGF V N           GIN+ G+VF +M+N+T  NR T VGN L  +Y  +LL Y
Sbjct: 356 VRHGGF-VDNERYDDYNRWTGGINFGGKVFPRMKNYTPYNRATSVGNQLLNNYRKFLLAY 414

Query: 537 ELAHDDV 543
           E  H D 
Sbjct: 415 EPTHSDA 421


>gi|255075697|ref|XP_002501523.1| predicted protein [Micromonas sp. RCC299]
 gi|226516787|gb|ACO62781.1| predicted protein [Micromonas sp. RCC299]
          Length = 1085

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 448 MQFLILRGHTRLVPQGGLAE-FPDAILNAKRLDLFNLYREVVSRGGFHVGN-------GI 499
           + FL  +G  +L P+G   E FP   LN  +LDLF LYR V  RGGF   +        +
Sbjct: 70  ISFLQAKGRFKLAPEGCDPESFPRRTLNGFKLDLFGLYRAVCVRGGFPRADRKKSDAPKL 129

Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
           NW   VF+++ NHT T++ T VG+ L  HY+ YL EYE  H
Sbjct: 130 NWTRDVFTRLANHTDTHKATSVGHDLITHYQNYLAEYEDEH 170


>gi|255089563|ref|XP_002506703.1| predicted protein [Micromonas sp. RCC299]
 gi|226521976|gb|ACO67961.1| predicted protein [Micromonas sp. RCC299]
          Length = 764

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 70/154 (45%), Gaps = 13/154 (8%)

Query: 429 GCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL--NAKRLDLFNLYRE 486
           G  R P+  C+ E+F+ D M+F   +G    V    + +  D  L  + + +D    Y+E
Sbjct: 234 GWRRPPLSRCTPEDFMLDFMRFTDSQGDDWEV----VWDILDNSLRRSGRPMDALGFYKE 289

Query: 487 VVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DD 542
           +  RGGF        +     VF ++ NH + +  T +GN L   YE Y   YE AH +D
Sbjct: 290 ICWRGGFVNRESAKKRIKMPDVFKQLHNHYVNHTYTDIGNNLLNTYERYFWAYERAHPED 349

Query: 543 VDGECCLLCHSS---AAGDWVNCGICGEWAHFGC 573
           V  + C+ C        G  + C  CG+W H  C
Sbjct: 350 VSNQRCVTCEQDRVEKPGQMLRCDGCGDWHHAEC 383


>gi|145353915|ref|XP_001421244.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|145353987|ref|XP_001421278.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581481|gb|ABO99537.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581515|gb|ABO99571.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 638

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 67/125 (53%), Gaps = 14/125 (11%)

Query: 432 RAPIQVCSEEEFLRDVMQFLILRGH---TRLVPQG-GLAEFPDAILNAKRLDLFNLYREV 487
           R  ++   E EFL D++  L LR      R VP+G   ++FP   +N  +LDLF LY  V
Sbjct: 19  RRSLEALGEREFLDDLI--LTLRKDPALAREVPEGVDYSQFPQRTVNGFKLDLFGLYHNV 76

Query: 488 VSRGGFHV-GNG------INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
            +RGGF V  NG       +W  ++F  M N T  +R T VG+ L   Y  YLL YE AH
Sbjct: 77  CARGGFPVETNGSASKVPWSWSKEIFKGMANWTDNHRCTSVGHDLICSYRKYLLAYEQAH 136

Query: 541 -DDVD 544
            +DVD
Sbjct: 137 PEDVD 141


>gi|303277379|ref|XP_003057983.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460640|gb|EEH57934.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1199

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/86 (45%), Positives = 47/86 (54%), Gaps = 13/86 (15%)

Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNG-------------INWKGQVFSKMRNHTL 514
           FP   LN  +LDLF LYR VVSRGGF    G             +NW   VF  + N+T 
Sbjct: 117 FPTRTLNGAKLDLFGLYRAVVSRGGFPSSKGGVNGGGGGDKREKLNWTRDVFPCLSNYTE 176

Query: 515 TNRMTGVGNTLKRHYETYLLEYELAH 540
            +R T VG+ L  HY+ YL EYELA+
Sbjct: 177 NHRATSVGHDLITHYQNYLCEYELAN 202


>gi|303278346|ref|XP_003058466.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459626|gb|EEH56921.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 569

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 77/190 (40%), Gaps = 29/190 (15%)

Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRL-----DLFNLYRE 486
           R P+  CS E+F+ D  +F   RG    +         D + NA R      D   LYR+
Sbjct: 236 RPPLSECSPEDFMLDFFRFHESRGDDWEI-------VWDILDNALRRSGRPHDTLGLYRQ 288

Query: 487 VVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           V +RGGF        +    +VF  + N+ + +  T +GN L   YE Y L YE+AH + 
Sbjct: 289 VCARGGFISRESAKARIKMPEVFGHLFNYWIGHTYTDIGNNLLNVYERYFLPYEVAHPED 348

Query: 544 DGE-------CCLLCHSS-------AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTD 589
             E       C  + H+        A   W+ C  CGE  H  C  R       +     
Sbjct: 349 TPEQTTPCVACARVVHAGGDAVPMYAQSTWIECDACGERRHECCASRDDHVRSAERDGGR 408

Query: 590 GLEYVCPQCS 599
              YVC +C+
Sbjct: 409 ATSYVCEKCA 418


>gi|412986545|emb|CCO14971.1| predicted protein [Bathycoccus prasinos]
          Length = 995

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 48/82 (58%), Gaps = 5/82 (6%)

Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGN-----GINWKGQVFSKMRNHTLTNRMTGVG 522
           +P   LN  +LDLF +YR+VV+RGG+          INW  ++F K+ N T T++ T VG
Sbjct: 8   WPAKTLNGYKLDLFGIYRQVVARGGWGEDEITCKRKINWSREIFPKLANFTSTHKATSVG 67

Query: 523 NTLKRHYETYLLEYELAHDDVD 544
           + L   Y+ YL  YE  + DVD
Sbjct: 68  HDLITSYKNYLHSYERKYADVD 89


>gi|308810917|ref|XP_003082767.1| fiber protein-like (ISS) [Ostreococcus tauri]
 gi|116061236|emb|CAL56624.1| fiber protein-like (ISS) [Ostreococcus tauri]
          Length = 327

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 86/203 (42%), Gaps = 25/203 (12%)

Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL--NAKRLDLFNLYREVVS 489
           R  ++ CS+E+F+ D M+F   RG         L    D  L      +D  +LYR + +
Sbjct: 56  RPALRECSQEDFMLDFMRFHHARGRDWERCWSNL----DRALARTGAPMDCMHLYRMICA 111

Query: 490 RGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DDVDG 545
            GGF        +    ++F  M N+   + MT VGN L   YE Y  EYELAH DDV  
Sbjct: 112 LGGFVNRESAKTRLSMTEIFKLMFNYYENHTMTDVGNRLLNAYENYFWEYELAHEDDVSR 171

Query: 546 ECCLLC-----HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDG--LEYVCPQC 598
             C  C     HSS +G    C IC    H GC   +         K  G  + +VC  C
Sbjct: 172 GTCRSCGGGIFHSSDSGLLHVCKICHANYHRGCQPPERFPETFGTGKDAGCSMHFVCWAC 231

Query: 599 S--------VTNFKKKSQKTSNG 613
           +        V +FK +  +T + 
Sbjct: 232 TRSRPAEAGVESFKVRDAQTVDA 254


>gi|145353421|ref|XP_001421012.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581248|gb|ABO99305.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 55/123 (44%), Gaps = 23/123 (18%)

Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHT---------RLVPQGGLAEFPDAILNAKRLDLFN 482
           R  +  CS E+F  D+M+F   RG           R+V + G             LD  +
Sbjct: 80  RPALAACSLEDFTFDLMRFHQERGRDWEQCWANLDRVVGRTGAP-----------LDCLH 128

Query: 483 LYREVVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELA 539
           LYR+V S GGF        +    ++F  M NH   + MT  GN +   YE Y LEYE A
Sbjct: 129 LYRQVCSLGGFASRTSAKERIKAAEIFRNMFNHYENHTMTDCGNRMLTAYENYFLEYEAA 188

Query: 540 HDD 542
           H++
Sbjct: 189 HEE 191


>gi|145357351|ref|XP_001422883.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144583127|gb|ABP01242.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 281

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 55/123 (44%), Gaps = 23/123 (18%)

Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHT---------RLVPQGGLAEFPDAILNAKRLDLFN 482
           R  +  CS E+F  D+M+F   RG           R+V + G             LD  +
Sbjct: 33  RPALAACSLEDFTFDLMRFHQERGRDWEQCWANLDRVVGRTGAP-----------LDCLH 81

Query: 483 LYREVVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELA 539
           LYR+V S GGF        +    ++F  M NH   + MT  GN +   YE Y LEYE A
Sbjct: 82  LYRQVCSLGGFASRTSAKERIKAAEIFRNMFNHYENHTMTDCGNRMLTAYENYFLEYEAA 141

Query: 540 HDD 542
           H++
Sbjct: 142 HEE 144


>gi|194758138|ref|XP_001961319.1| GF11056 [Drosophila ananassae]
 gi|190622617|gb|EDV38141.1| GF11056 [Drosophila ananassae]
          Length = 1718

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 58/121 (47%), Gaps = 18/121 (14%)

Query: 418 ISLNPLPLK-KHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK 476
           I L P+P K +     AP      EEF RD+ QF   RG     P    A+     +N K
Sbjct: 52  IQLPPVPEKIEEQPASAP------EEFWRDLQQFHERRG----TPMTQTAK-----INGK 96

Query: 477 RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
           ++DL+ LY+EV  RGGF+  N  +   +V+S +   TL  R       +K  Y  YL +Y
Sbjct: 97  QVDLYKLYQEVTERGGFNKVNLRDEWDEVYSAL--ETLRERCVNGTAGIKHIYRRYLDKY 154

Query: 537 E 537
           E
Sbjct: 155 E 155


>gi|194864076|ref|XP_001970758.1| GG23207 [Drosophila erecta]
 gi|190662625|gb|EDV59817.1| GG23207 [Drosophila erecta]
          Length = 1696

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 16/129 (12%)

Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
           ++  +A+  + L  LP K      +P      EEF RD+ QF   RG     P       
Sbjct: 40  TTPVRARNPVQLQILPEKVEEMPASP-----PEEFWRDLQQFHERRGTPLTQP------- 87

Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
             A ++ K++DL+ LY EV  RGGF+  N  +   +V+S M   TL  R       +K  
Sbjct: 88  --ARISGKQVDLYKLYNEVTERGGFNKVNMRDEWDEVYSSM--ETLRERCVNGTAGIKHI 143

Query: 529 YETYLLEYE 537
           Y  YL +YE
Sbjct: 144 YRRYLDKYE 152


>gi|195474131|ref|XP_002089345.1| GE19061 [Drosophila yakuba]
 gi|194175446|gb|EDW89057.1| GE19061 [Drosophila yakuba]
          Length = 1695

 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 11/97 (11%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
           EEF RD+ QF   RG     P         A ++ K++DL+ LY EV  RGGF+  N  +
Sbjct: 67  EEFWRDLQQFHERRGTPLTQP---------ARISGKQVDLYKLYNEVTERGGFNKVNMRD 117

Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
              +V+S M   TL  R       +K  Y  YL +YE
Sbjct: 118 EWDEVYSAM--ETLRERCVNGTAGIKHIYRRYLDKYE 152


>gi|198461268|ref|XP_001361964.2| GA17110 [Drosophila pseudoobscura pseudoobscura]
 gi|198137288|gb|EAL26543.2| GA17110 [Drosophila pseudoobscura pseudoobscura]
          Length = 1695

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 61/141 (43%), Gaps = 20/141 (14%)

Query: 397 AGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGH 456
           A  TP+  R    ++YQ Q      P P         P    + EEF RD+ QF     H
Sbjct: 44  ATTTPLRAR----NAYQQQ---PQTPAPFTPENTAGKP--ASAPEEFWRDLQQF-----H 89

Query: 457 TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTN 516
            R     G A    A ++ K +DL+ LY EV  RGGF+  N  +   +V+S M   TL  
Sbjct: 90  ER----RGTALTHAAKISGKHVDLYKLYTEVTDRGGFNKVNMRDEWDEVYSAM--ETLRE 143

Query: 517 RMTGVGNTLKRHYETYLLEYE 537
           R       +K  Y  YL +YE
Sbjct: 144 RCVNGTAGIKHIYRRYLDKYE 164


>gi|195580956|ref|XP_002080300.1| GD10411 [Drosophila simulans]
 gi|194192309|gb|EDX05885.1| GD10411 [Drosophila simulans]
          Length = 1684

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 11/97 (11%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
           EEF RD+ QF   RG     P         A ++ K +DL+ LY EV  RGGF+     +
Sbjct: 67  EEFWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRD 117

Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
              +V+S M   TL  R      ++K  Y  YL +YE
Sbjct: 118 EWDEVYSAM--DTLRERCVNGTASIKHIYRRYLDKYE 152


>gi|195331853|ref|XP_002032613.1| GM20880 [Drosophila sechellia]
 gi|194124583|gb|EDW46626.1| GM20880 [Drosophila sechellia]
          Length = 1692

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 11/97 (11%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
           EEF RD+ QF   RG     P         A ++ K +DL+ LY EV  RGGF+     +
Sbjct: 67  EEFWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRD 117

Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
              +V+S M   TL  R      ++K  Y  YL +YE
Sbjct: 118 EWDEVYSAM--DTLRERCVNGTASIKHIYRRYLDKYE 152


>gi|195171065|ref|XP_002026331.1| GL20250 [Drosophila persimilis]
 gi|194111233|gb|EDW33276.1| GL20250 [Drosophila persimilis]
          Length = 376

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 61/141 (43%), Gaps = 20/141 (14%)

Query: 397 AGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGH 456
           A  TP+  R    ++YQ Q      P P         P    + EEF RD+ QF     H
Sbjct: 44  ATTTPLRAR----NAYQQQ---PQTPAPFTPENTAGKP--ASAPEEFWRDLQQF-----H 89

Query: 457 TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTN 516
            R     G A    A ++ K +DL+ LY EV  RGGF+  N  +   +V+S M   TL  
Sbjct: 90  ERR----GTALTHAAKISGKHVDLYKLYTEVTDRGGFNKVNMRDEWDEVYSAM--ETLRE 143

Query: 517 RMTGVGNTLKRHYETYLLEYE 537
           R       +K  Y  YL +YE
Sbjct: 144 RCVNGTAGIKHIYRRYLDKYE 164


>gi|303289831|ref|XP_003064203.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454519|gb|EEH51825.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 564

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 35/69 (50%), Gaps = 5/69 (7%)

Query: 476 KRLDLFNLYREVVSRGGFHVGNGINWKG----QVFSKMRNHTLTNRMTGVGNTLKRHYET 531
           K  D + LYR V +RGGF V      K     +VF +M NH   +  T +G  L   YE 
Sbjct: 121 KPFDTYGLYRAVTNRGGF-VERPTARKNLSMVEVFREMSNHYDGHTYTDIGTLLLNTYEK 179

Query: 532 YLLEYELAH 540
           Y LEYE AH
Sbjct: 180 YFLEYERAH 188


>gi|195124688|ref|XP_002006823.1| GI18380 [Drosophila mojavensis]
 gi|193911891|gb|EDW10758.1| GI18380 [Drosophila mojavensis]
          Length = 1765

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 15/116 (12%)

Query: 426 KKHGCGRAP----IQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLF 481
           +K   G AP     Q+ + +EF RD+ QF   RG     P    A+     ++ + +DL+
Sbjct: 65  EKSQAGEAPGAEEQQLSAPDEFWRDLHQFHERRG----TPLTQAAK-----ISGQHVDLY 115

Query: 482 NLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            LY+EV  RGGF+  N  +   +V+S +   TL +R       +K  Y  YL +YE
Sbjct: 116 RLYQEVTERGGFNKVNMRDEWDEVYSAL--DTLRDRCVNGTAGIKHIYRRYLDKYE 169


>gi|384251312|gb|EIE24790.1| hypothetical protein COCSUDRAFT_62200 [Coccomyxa subellipsoidea
           C-169]
          Length = 88

 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 4/50 (8%)

Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREV 487
           CSE +F+ D+  F   R   ++ P+      PDAILN  RLDL+NLY+E 
Sbjct: 23  CSEAQFMEDLCAFHCERNGKKVTPETS----PDAILNGSRLDLYNLYKET 68


>gi|195380922|ref|XP_002049205.1| GJ21457 [Drosophila virilis]
 gi|194144002|gb|EDW60398.1| GJ21457 [Drosophila virilis]
          Length = 1779

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 11/102 (10%)

Query: 436 QVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV 495
           Q+ + EEF RD+ QF   RG     P    A+     ++ + +DL+ LY+EV  RGGF+ 
Sbjct: 92  QLRAPEEFWRDLHQFHERRG----TPLTQAAK-----ISGQHVDLYRLYQEVTERGGFNK 142

Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            N  +   +V+S +   TL +R       +K  Y  YL +YE
Sbjct: 143 VNMRDEWDEVYSAL--DTLRDRCVNGTAGIKHIYRRYLDKYE 182


>gi|253741679|gb|EES98544.1| ARID1 AT-rich interaction domain protein [Giardia intestinalis ATCC
           50581]
          Length = 467

 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 6/66 (9%)

Query: 472 ILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
           I+  K+LD++ L+R V +RGG    N I WK ++  K+    L   +T VG TL+  YE+
Sbjct: 37  IVGHKQLDMYQLFRAVQARGG--AKNVIQWK-EIGKKL---GLPASVTNVGYTLRTKYES 90

Query: 532 YLLEYE 537
           Y+L YE
Sbjct: 91  YILPYE 96


>gi|19921692|ref|NP_610216.1| brahma associated protein 170kD [Drosophila melanogaster]
 gi|7302272|gb|AAF57363.1| brahma associated protein 170kD [Drosophila melanogaster]
          Length = 1688

 Score = 46.2 bits (108), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 44/95 (46%), Gaps = 11/95 (11%)

Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
           F RD+ QF   RG     P         A ++ K +DL+ LY EV  RGGF+     +  
Sbjct: 69  FWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRDEW 119

Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            +V+S M   TL  R      ++K  Y  YL +YE
Sbjct: 120 DEVYSAM--ETLRERCVNGTASIKHIYRRYLDKYE 152


>gi|432885673|ref|XP_004074709.1| PREDICTED: uncharacterized protein LOC101173321 [Oryzias latipes]
          Length = 525

 Score = 45.8 bits (107), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 14/107 (13%)

Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRG 491
           R  ++  +EE+FL+D+  F+  R          +   P+  L  K++DLF +++ V   G
Sbjct: 12  RDQMEDITEEKFLKDLYIFMKKRD-------TPIERIPN--LGFKQIDLFVMFKTVNDLG 62

Query: 492 GFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
           G+H V +   WK QV++ +  +    R T      +RHYE  LL YE
Sbjct: 63  GYHKVTSHQLWK-QVYNILGGNP---RSTSAATCTRRHYERLLLPYE 105


>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
 gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 6753

 Score = 45.8 bits (107), Expect = 0.070,   Method: Composition-based stats.
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 6/66 (9%)

Query: 548  CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
            C++C+     D    WV C +C  W H  CD+ +      +      ++Y CP CS+  F
Sbjct: 1764 CIMCNEKYDEDDSKKWVQCDVCKFWIHLSCDKNESRNI--ETLSNKNIDYKCPTCSIGTF 1821

Query: 604  KKKSQK 609
              K ++
Sbjct: 1822 HDKIER 1827


>gi|383856249|ref|XP_003703622.1| PREDICTED: uncharacterized protein LOC100880603 [Megachile
           rotundata]
          Length = 1208

 Score = 45.4 bits (106), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 51/113 (45%), Gaps = 14/113 (12%)

Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
             E++FL DV  F+  RG     P G +      +L  +++DLF  Y +V   GG+  V 
Sbjct: 314 AEEKKFLTDVQNFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQLLGGYDSVS 364

Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 549
            G  WK  ++  +  +T +   T      +RHYE  LL YE      D +  L
Sbjct: 365 AGRLWK-TIYDDIGGNTGS---TSAATITRRHYERLLLPYERYQRGEDAKVKL 413


>gi|327272142|ref|XP_003220845.1| PREDICTED: lysine-specific demethylase 5A-like [Anolis
           carolinensis]
          Length = 1695

 Score = 45.4 bits (106), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL++L + V S+GGF V +    
Sbjct: 93  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYSLSKIVASKGGFEVVS---- 138

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +          G+ LK HYE  L  YEL    V
Sbjct: 139 KEKKWSKVASRLAYMPGKATGSLLKAHYERILYPYELFQSGV 180


>gi|74193153|dbj|BAE20593.1| unnamed protein product [Mus musculus]
          Length = 215

 Score = 45.4 bits (106), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|348552013|ref|XP_003461823.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A-like
           [Cavia porcellus]
          Length = 1635

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF V      
Sbjct: 164 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEVVT---- 209

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 210 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 251


>gi|350414064|ref|XP_003490196.1| PREDICTED: hypothetical protein LOC100746223 [Bombus impatiens]
          Length = 1200

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 437 VCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
              E++FL DV  F+  RG     P G +      +L  +++DLF  Y +V   GG+  V
Sbjct: 312 TAEEKKFLTDVQNFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQLLGGYDSV 362

Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
             G  WK  ++  +  +T +   T      +RHYE  LL YE
Sbjct: 363 SAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 400


>gi|340717787|ref|XP_003397357.1| PREDICTED: hypothetical protein LOC100652122 [Bombus terrestris]
          Length = 1202

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 437 VCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
              E++FL DV  F+  RG     P G +      +L  +++DLF  Y +V   GG+  V
Sbjct: 312 TAEEKKFLTDVQNFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQLLGGYDSV 362

Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
             G  WK  ++  +  +T +   T      +RHYE  LL YE
Sbjct: 363 SAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 400


>gi|195429395|ref|XP_002062748.1| GK19622 [Drosophila willistoni]
 gi|194158833|gb|EDW73734.1| GK19622 [Drosophila willistoni]
          Length = 1685

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 11/97 (11%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
           +EF RD+ QF     H R     G      A ++ + +DL+ LY+EV  RGGF   N  +
Sbjct: 72  DEFWRDLHQF-----HER----RGTPITQTAKISGQHVDLYKLYQEVTERGGFSKVNMRD 122

Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
              +V++ M   TL  R       +K  Y  YL +YE
Sbjct: 123 EWDEVYTAM--ETLRERCVNGTAGIKHIYRRYLDKYE 157


>gi|380015168|ref|XP_003691580.1| PREDICTED: uncharacterized protein LOC100865061 [Apis florea]
          Length = 1122

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 52/119 (43%), Gaps = 23/119 (19%)

Query: 429 GCGRAPIQV---------CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLD 479
             GR P  V           E++F+ DV  F+  RG     P G +      +L  +++D
Sbjct: 227 SVGRPPSSVVRRSERKTSAEEKKFITDVQNFMNSRG----TPVGKMP-----LLGYRQID 277

Query: 480 LFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
           LF  Y +V   GG+  V  G  WK  ++  +  +T +   T      +RHYE  LL YE
Sbjct: 278 LFLFYTKVQMLGGYDSVSAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 332


>gi|328865792|gb|EGG14178.1| ARID/BRIGHT DNA binding domain-containing protein [Dictyostelium
           fasciculatum]
          Length = 978

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 43/97 (44%), Gaps = 12/97 (12%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
           E F   + QFL  +G         L  FP    + + LDL+ L+ EV  RGG H     N
Sbjct: 369 EVFASQLCQFLEEKG-------TPLEHFPK--YDGRDLDLYKLFVEVNRRGGLHEVTKTN 419

Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
               + + ++   +  R     NTLK HY  YLL YE
Sbjct: 420 SWHDILTALKTTDICQRPI---NTLKHHYNDYLLAYE 453


>gi|392347603|ref|XP_002729471.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
           [Rattus norvegicus]
          Length = 1722

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|344278059|ref|XP_003410814.1| PREDICTED: lysine-specific demethylase 5A [Loxodonta africana]
          Length = 1693

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          I+  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------IVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|46409136|gb|AAS93725.1| RE64461p [Drosophila melanogaster]
          Length = 902

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 44/95 (46%), Gaps = 11/95 (11%)

Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
           F RD+ QF   RG     P         A ++ K +DL+ LY EV  RGGF+     +  
Sbjct: 69  FWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRDEW 119

Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            +V+S M   TL  R      ++K  Y  YL +YE
Sbjct: 120 DEVYSAM--ETLRERCVNGTASIKHIYRRYLDKYE 152


>gi|74150603|dbj|BAE32322.1| unnamed protein product [Mus musculus]
 gi|74227277|dbj|BAE21737.1| unnamed protein product [Mus musculus]
          Length = 264

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|148667219|gb|EDK99635.1| mCG129751 [Mus musculus]
          Length = 1710

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|392340216|ref|XP_002726529.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
           [Rattus norvegicus]
          Length = 1639

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|14715099|gb|AAH10717.1| Jarid1a protein [Mus musculus]
          Length = 621

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|226958545|ref|NP_666109.2| lysine-specific demethylase 5A [Mus musculus]
 gi|150383496|sp|Q3UXZ9.2|KDM5A_MOUSE RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
           demethylase JARID1A; AltName: Full=Jumonji/ARID
           domain-containing protein 1A; AltName:
           Full=Retinoblastoma-binding protein 2; Short=RBBP-2
          Length = 1690

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|354476339|ref|XP_003500382.1| PREDICTED: lysine-specific demethylase 5A [Cricetulus griseus]
          Length = 1671

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 70  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 115

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 116 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 157


>gi|149049582|gb|EDM02036.1| rCG29703 [Rattus norvegicus]
          Length = 1526

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|195056152|ref|XP_001994976.1| GH22895 [Drosophila grimshawi]
 gi|193899182|gb|EDV98048.1| GH22895 [Drosophila grimshawi]
          Length = 1762

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 49/97 (50%), Gaps = 11/97 (11%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
           +EF RD+ QF   RG     P    A+     ++ + +DL+ LY+EV  RGGF+  N  +
Sbjct: 95  DEFWRDLHQFHERRG----TPLTQAAK-----ISGQHVDLYRLYQEVTERGGFNKVNVRD 145

Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
              +V+S +   TL +R       +K  Y  YL +YE
Sbjct: 146 EWDEVYSAL--DTLRDRCVNGTAGIKHIYRRYLDKYE 180


>gi|307187859|gb|EFN72795.1| AT-rich interactive domain-containing protein 5B [Camponotus
           floridanus]
          Length = 1073

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 437 VCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
              E++F+ DV  F+  RG     P G +      +L  +++DLF  Y +V   GG+  V
Sbjct: 174 TAEEKKFITDVQSFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQMLGGYDSV 224

Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
             G  WK  ++  +  +T +   T      +RHYE  LL YE
Sbjct: 225 SAGRLWK-TIYDDIGGNTGS---TSAATITRRHYERLLLPYE 262


>gi|344253284|gb|EGW09388.1| Lysine-specific demethylase 5A [Cricetulus griseus]
          Length = 1608

 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 7   DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 52

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 53  KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 94


>gi|328776925|ref|XP_003249244.1| PREDICTED: hypothetical protein LOC100576405 [Apis mellifera]
          Length = 1273

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 23/118 (19%)

Query: 430 CGRAPIQV---------CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
            GR P  V           E++F+ DV  F+  RG     P G +      +L  +++DL
Sbjct: 296 VGRPPSSVVRRSERKTSAEEKKFIADVQNFMNSRG----TPVGKMP-----LLGYRQIDL 346

Query: 481 FNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
           F  Y +V   GG+  V  G  WK  ++  +  +T +   T      +RHYE  LL YE
Sbjct: 347 FLFYTKVQMLGGYDSVSAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 400


>gi|402884752|ref|XP_003905839.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
           [Papio anubis]
          Length = 1842

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 242 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 287

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 288 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 329


>gi|332024185|gb|EGI64399.1| AT-rich interactive domain-containing protein 5B [Acromyrmex
           echinatior]
          Length = 1093

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 14/101 (13%)

Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
             E++F+ DV  F+  RG     P G +      +L  +++DLF  Y +V   GG+  V 
Sbjct: 173 AEEKKFITDVQSFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQMLGGYDSVS 223

Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            G  WK  ++  +  +T +   T      +RHYE  LL YE
Sbjct: 224 AGRLWK-TIYDDIGGNTGS---TSAATITRRHYERLLLPYE 260


>gi|426227070|ref|XP_004007650.1| PREDICTED: lysine-specific demethylase 5A [Ovis aries]
          Length = 1793

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 192 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 237

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 238 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 279


>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
           sativus]
 gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
           sativus]
          Length = 1055

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 17/140 (12%)

Query: 462 QGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV-GNGINWKGQVFSKMRNHTLTNRMTG 520
           +  ++  PDA  + + L+ ++  ++VV++G  H  G G+    +   K++  T   ++  
Sbjct: 357 EADVSGLPDASTSNQDLEYYS-EKKVVNKGSRHCDGCGLLSLCKTLKKVKGPTSATQLLC 415

Query: 521 VGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLG 580
                 R  + Y            G C  + H S  G+WV C  C  W H  CD+     
Sbjct: 416 KHCHKLRQSKQYC-----------GVCKKIWHHSDGGNWVCCDGCNVWVHAECDKISS-K 463

Query: 581 AFKDYAKTDGLEYVCPQCSV 600
            FKD A +   EY CP C V
Sbjct: 464 LFKDLAHS---EYYCPDCKV 480


>gi|417406677|gb|JAA49985.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|329663691|ref|NP_001192554.1| lysine-specific demethylase 5A [Bos taurus]
 gi|296487060|tpg|DAA29173.1| TPA: JARID1A variant protein-like [Bos taurus]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|359323071|ref|XP_854690.3| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
           [Canis lupus familiaris]
          Length = 1688

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|197246491|gb|AAI69044.1| Jarid1a protein [Rattus norvegicus]
          Length = 579

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|410963533|ref|XP_003988319.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
           [Felis catus]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|297261517|ref|XP_002798486.1| PREDICTED: lysine-specific demethylase 5A-like [Macaca mulatta]
          Length = 1687

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|417406655|gb|JAA49977.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
          Length = 1649

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|194211564|ref|XP_001914992.1| PREDICTED: lysine-specific demethylase 5A isoform 1 [Equus
           caballus]
          Length = 1692

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 91  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 136

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 137 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 178


>gi|403286751|ref|XP_003934640.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
           [Saimiri boliviensis boliviensis]
          Length = 1952

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 351 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 396

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 397 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 438


>gi|291392891|ref|XP_002712829.1| PREDICTED: retinoblastoma binding protein 2 isoform 1 [Oryctolagus
           cuniculus]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|301756621|ref|XP_002914152.1| PREDICTED: lysine-specific demethylase 5A-like [Ailuropoda
           melanoleuca]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|431892152|gb|ELK02599.1| Lysine-specific demethylase 5A [Pteropus alecto]
          Length = 1692

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|126340225|ref|XP_001373115.1| PREDICTED: lysine-specific demethylase 5A [Monodelphis domestica]
          Length = 1689

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|290990959|ref|XP_002678103.1| predicted protein [Naegleria gruberi]
 gi|284091714|gb|EFC45359.1| predicted protein [Naegleria gruberi]
          Length = 702

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 10/98 (10%)

Query: 440 EEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGI 499
           E++FL+   +++ L  +T   P       P  +L +  LD+F+LY  V+ +GGF   +  
Sbjct: 185 EKQFLQLFFEYM-LDKNTEDFPS------PVPMLRSVPLDIFHLYGRVLEKGGFLYMDSG 237

Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
           +W+  VF K+RN+     M      LK  YE YL  +E
Sbjct: 238 DWE-LVFKKIRNY--DELMKKPEEMLKSMYEQYLKNFE 272


>gi|435778|gb|AAB28544.1| retinoblastoma binding protein 2 [Homo sapiens]
          Length = 1722

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|397499371|ref|XP_003820427.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A [Pan
           paniscus]
          Length = 1717

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|332249134|ref|XP_003273720.1| PREDICTED: lysine-specific demethylase 5A [Nomascus leucogenys]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|110618244|ref|NP_001036068.1| lysine-specific demethylase 5A [Homo sapiens]
 gi|215274124|sp|P29375.3|KDM5A_HUMAN RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
           demethylase JARID1A; AltName: Full=Jumonji/ARID
           domain-containing protein 1A; AltName:
           Full=Retinoblastoma-binding protein 2; Short=RBBP-2
 gi|168275608|dbj|BAG10524.1| histone demethylase JARID1A [synthetic construct]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|355563873|gb|EHH20373.1| Lysine-specific demethylase 5A [Macaca mulatta]
 gi|383420467|gb|AFH33447.1| lysine-specific demethylase 5A [Macaca mulatta]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|71122345|gb|AAH99835.1| Jarid1a protein [Rattus norvegicus]
          Length = 1099

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|410354793|gb|JAA44000.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
          Length = 1688

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|355785795|gb|EHH65978.1| Lysine-specific demethylase 5A [Macaca fascicularis]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|332838263|ref|XP_003313474.1| PREDICTED: lysine-specific demethylase 5A [Pan troglodytes]
          Length = 1688

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|68533053|dbj|BAE06081.1| JARID1A variant protein [Homo sapiens]
          Length = 1731

 Score = 43.9 bits (102), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 130 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 175

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 176 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 217


>gi|410261294|gb|JAA18613.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|351710793|gb|EHB13712.1| Lysine-specific demethylase 5A [Heterocephalus glaber]
          Length = 1694

 Score = 43.9 bits (102), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|410223430|gb|JAA08934.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
 gi|410300510|gb|JAA28855.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
 gi|410354795|gb|JAA44001.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
          Length = 1690

 Score = 43.9 bits (102), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|449269524|gb|EMC80287.1| Lysine-specific demethylase 5A, partial [Columba livia]
          Length = 1650

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G    +P          ++  K LDL+ L + V S+GGF V      
Sbjct: 48  DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVASKGGFEVVT---- 93

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 94  KEKKWSKVASRLGYLPGKGTGSLLKSHYERLLYPYELFQSGV 135


>gi|74201318|dbj|BAE26113.1| unnamed protein product [Mus musculus]
          Length = 1094

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|51593641|gb|AAH80691.1| Jarid1a protein, partial [Mus musculus]
          Length = 1102

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|74141547|dbj|BAE38548.1| unnamed protein product [Mus musculus]
          Length = 1093

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|355697494|gb|AES00689.1| lysine -specific demethylase 5A [Mustela putorius furo]
          Length = 1504

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 109 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 154

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 155 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 196


>gi|308162935|gb|EFO65302.1| Hypothetical protein GLP15_4319 [Giardia lamblia P15]
          Length = 618

 Score = 43.5 bits (101), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 12/63 (19%)

Query: 545 GECCLLCHS------SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
            + C+LC +         GDW+ C +C EW H  CD         +    + + Y CP+C
Sbjct: 65  SDLCVLCSNHKCEFIETGGDWIQCDLCKEWVHQQCD------GIYNALLAEKIHYTCPRC 118

Query: 599 SVT 601
            V 
Sbjct: 119 RVV 121


>gi|395743750|ref|XP_002822777.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A,
           partial [Pongo abelii]
          Length = 1613

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 129 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 174

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 175 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 216


>gi|380798887|gb|AFE71319.1| lysine-specific demethylase 5A, partial [Macaca mulatta]
          Length = 1653

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 52  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 97

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 98  KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 139


>gi|119609377|gb|EAW88971.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_b
           [Homo sapiens]
 gi|162319392|gb|AAI56462.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
 gi|225000544|gb|AAI72533.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
          Length = 1641

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|159112278|ref|XP_001706368.1| Hypothetical protein GL50803_8381 [Giardia lamblia ATCC 50803]
 gi|157434464|gb|EDO78694.1| hypothetical protein GL50803_8381 [Giardia lamblia ATCC 50803]
          Length = 618

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 12/62 (19%)

Query: 545 GECCLLCHS------SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
            + C+LC +         GDW+ C +C EW H  CD         +    + + Y CP+C
Sbjct: 65  SDLCILCSNHKCELIETGGDWIQCDLCKEWLHQQCD------GIYNALLAEKIHYTCPRC 118

Query: 599 SV 600
            V
Sbjct: 119 RV 120


>gi|427793739|gb|JAA62321.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
          Length = 374

 Score = 43.5 bits (101), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 48/98 (48%), Gaps = 14/98 (14%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGI 499
           +EFL D+  F+  RG     P   +   P  I+  + LDLF LYR VVSRGG   V N  
Sbjct: 100 KEFLDDLFSFMQKRG----TP---VNRIP--IMAKQVLDLFELYRLVVSRGGLVEVINKK 150

Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            W+    +K  N  L + +T    TL+  Y  YL  YE
Sbjct: 151 IWR--EITKGLN--LPSSITSAAFTLRTQYMKYLYPYE 184


>gi|291392893|ref|XP_002712830.1| PREDICTED: retinoblastoma binding protein 2 isoform 2 [Oryctolagus
           cuniculus]
          Length = 1648

 Score = 43.5 bits (101), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|253748038|gb|EET02406.1| Hypothetical protein GL50581_289 [Giardia intestinalis ATCC 50581]
          Length = 612

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 12/62 (19%)

Query: 545 GECCLLCHS------SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
            + C LC +         GDW+ C +C EW H  CD         +    + ++Y CP+C
Sbjct: 65  SDLCALCSNHKCEFIETEGDWIQCDLCKEWVHQQCD------GIYNALLAEKIQYTCPRC 118

Query: 599 SV 600
            V
Sbjct: 119 RV 120


>gi|255083290|ref|XP_002504631.1| histone demethylase [Micromonas sp. RCC299]
 gi|226519899|gb|ACO65889.1| histone demethylase [Micromonas sp. RCC299]
          Length = 827

 Score = 43.5 bits (101), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 41/64 (64%), Gaps = 4/64 (6%)

Query: 478 LDLFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
           LDL ++YREVV+RGG+  V +   WK +V + +  H LT + T  G  ++++YE  LLEY
Sbjct: 112 LDLGHVYREVVARGGYKAVCDNRRWK-EVCATL-GHDLTGQ-TSAGFQMRQNYERCLLEY 168

Query: 537 ELAH 540
           E A 
Sbjct: 169 EFAE 172


>gi|427796997|gb|JAA63950.1| Putative dna-binding bright/brcaa1/rbp1, partial [Rhipicephalus
           pulchellus]
          Length = 349

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 48/98 (48%), Gaps = 14/98 (14%)

Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGI 499
           +EFL D+  F+  RG     P   +   P  I+  + LDLF LYR VVSRGG   V N  
Sbjct: 75  KEFLDDLFSFMQKRG----TP---VNRIP--IMAKQVLDLFELYRLVVSRGGLVEVINKK 125

Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            W+    +K  N  L + +T    TL+  Y  YL  YE
Sbjct: 126 IWR--EITKGLN--LPSSITSAAFTLRTQYMKYLYPYE 159


>gi|189234904|ref|XP_968114.2| PREDICTED: similar to AT-rich interactive domain-containing protein
           5B (ARID domain-containing protein 5B) (Mrf1-like)
           (Modulator recognition factor 2) (MRF-2) [Tribolium
           castaneum]
 gi|270002738|gb|EEZ99185.1| serine protease H6 [Tribolium castaneum]
          Length = 1176

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 10/101 (9%)

Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
            +E    +D ++ L +   T   P G +       L  K LDL++ Y +V   GG+  V 
Sbjct: 306 ATESNESKDFIKELTVFMKTNYTPIGKIPS-----LGYKELDLYSFYTKVQKLGGYDSVT 360

Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
               WK  +F  M  H  +   T     ++RHYE +LL YE
Sbjct: 361 ANRLWKS-IFDDMSGHASS---TSAATVIRRHYERFLLPYE 397


>gi|119609376|gb|EAW88970.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_a
           [Homo sapiens]
          Length = 1315

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|29387265|gb|AAH48307.1| JARID1A protein [Homo sapiens]
 gi|31753201|gb|AAH53893.1| JARID1A protein [Homo sapiens]
 gi|83406060|gb|AAI10917.1| JARID1A protein [Homo sapiens]
          Length = 1102

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|432091446|gb|ELK24528.1| Lysine-specific demethylase 5A [Myotis davidii]
          Length = 1097

 Score = 42.7 bits (99), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 89  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|122114608|ref|NP_001073670.1| uncharacterized protein LOC572528 [Danio rerio]
 gi|120537766|gb|AAI29398.1| Zgc:158706 [Danio rerio]
          Length = 603

 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 16/102 (15%)

Query: 438 CSEEEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
            SEE FL+D+  F+  RG     +P  G           K++D+F +Y+ V   GG+  V
Sbjct: 18  MSEERFLKDLYLFMKDRGTPIERIPHLGF----------KQIDMFLMYKTVKELGGYQQV 67

Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
                WK +V++ +  +    R T      +RHYE  LL YE
Sbjct: 68  TTQQLWK-KVYNILGGNP---RSTSAATCTRRHYEKLLLPYE 105


>gi|224096413|ref|XP_002198181.1| PREDICTED: lysine-specific demethylase 5A [Taeniopygia guttata]
          Length = 1690

 Score = 42.7 bits (99), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G    +P          ++  K LDL+ L + V ++GGF V      
Sbjct: 89  DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVANKGGFEVVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVASRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|429964439|gb|ELA46437.1| hypothetical protein VCUG_02073 [Vavraia culicis 'floridensis']
          Length = 816

 Score = 42.7 bits (99), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 59/128 (46%), Gaps = 16/128 (12%)

Query: 439 SEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGN 497
           +E+EF R   +++++    + +P          +   K +++++LYR V   GG  +V N
Sbjct: 218 TEQEFERAYKEYVLM--DDKFIPDSEKNSANIIVPGTKEINMYSLYRIVSENGGMENVTN 275

Query: 498 GINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE---LAHDDVDGECCLLCHSS 554
              WK   +  MR        T V  T++  Y+ +L E+E    A   V+ EC  +C   
Sbjct: 276 EQKWKSLFYGAMRK-------TNVSYTVRTFYKKFLYEFEQYRRARHYVNCECTGMC--- 325

Query: 555 AAGDWVNC 562
              D+V+C
Sbjct: 326 ECNDFVDC 333


>gi|350415404|ref|XP_003490629.1| PREDICTED: protein dead ringer-like [Bombus impatiens]
          Length = 651

 Score = 42.7 bits (99), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 63/147 (42%), Gaps = 30/147 (20%)

Query: 394 HSSAGPTPVTHRKSLSSSYQAQ--QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFL 451
           HS +  T   H++  S S++ Q  Q+  +N  P +K              EFL D+  ++
Sbjct: 194 HSQSNSTGTNHQQQTSWSFEEQFKQLYEINDDPKRK--------------EFLDDLFSYM 239

Query: 452 ILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMR 510
             RG         +   P  I+    LDL+ LY  V++RGG   V N   W+ ++   + 
Sbjct: 240 QKRGTP-------INRLP--IMAKSVLDLYELYNLVIARGGLVDVINKKLWQ-EIIKGLH 289

Query: 511 NHTLTNRMTGVGNTLKRHYETYLLEYE 537
              L + +T    TL+  Y  YL  YE
Sbjct: 290 ---LPSSITSAAFTLRTQYMKYLYPYE 313


>gi|391337315|ref|XP_003743015.1| PREDICTED: lysine-specific demethylase 5A-like [Metaseiulus
           occidentalis]
          Length = 1479

 Score = 42.7 bits (99), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 48/97 (49%), Gaps = 16/97 (16%)

Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VGNGINW 501
           FL  + +F  LRG++  +P          I+  + LDLF L+R V S GGF  VG    W
Sbjct: 85  FLDKIAKFWHLRGNSLKIP----------IVERRSLDLFKLHRIVESEGGFERVGKERKW 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
             +V  K+    L   ++ V   L+ HYE  LL Y++
Sbjct: 135 -FRVVQKL-GLPLVKSLSTV---LRNHYERLLLPYDI 166


>gi|308159465|gb|EFO61994.1| ARID1 protein [Giardia lamblia P15]
          Length = 462

 Score = 42.4 bits (98), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 6/66 (9%)

Query: 472 ILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
           I+  ++LD++ L+R V +RGG    N   WK ++  K+    L   +T  G TL+  YE+
Sbjct: 37  IVGHRQLDMYQLFRAVQARGG--AKNVTQWK-EIGKKL---GLPASVTNAGYTLRTKYES 90

Query: 532 YLLEYE 537
           Y+L YE
Sbjct: 91  YILPYE 96


>gi|354543992|emb|CCE40714.1| hypothetical protein CPAR2_107490 [Candida parapsilosis]
          Length = 1720

 Score = 42.4 bits (98), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 39/128 (30%)

Query: 436 QVCSEEEFLRDVMQFLILRGHTRLVPQGG---------------LAEFPDAILNAKRLDL 480
           +V +  +F RD+++F     H       G               L++ P  +++ + LDL
Sbjct: 177 EVLARLKFYRDLLEFHQRDQHYHFEDSEGTKKKEKNGASNLQSHLSKLP--MIDKRPLDL 234

Query: 481 FNLYREVVSRGGF----------HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYE 530
           F LY+ V+ RGGF           +G  + +KG++            MT + ++LK  Y+
Sbjct: 235 FKLYQSVIVRGGFIEVINKKLWAQIGRELGYKGKI------------MTSLSSSLKASYQ 282

Query: 531 TYLLEYEL 538
             L  YEL
Sbjct: 283 KILYPYEL 290


>gi|159117256|ref|XP_001708848.1| ARID1 AT-rich interaction domain [Giardia lamblia ATCC 50803]
 gi|114803613|gb|ABI81170.1| ARID1 [Giardia intestinalis]
 gi|157436962|gb|EDO81174.1| ARID1 AT-rich interaction domain protein [Giardia lamblia ATCC
           50803]
          Length = 469

 Score = 42.4 bits (98), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 6/66 (9%)

Query: 472 ILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
           I+  ++LD++ L+R V +RGG    N   WK ++  K+    L   +T  G TL+  YE+
Sbjct: 37  IVGHRQLDMYQLFRAVQARGG--AKNVTQWK-EIGKKL---GLPASVTNAGYTLRTKYES 90

Query: 532 YLLEYE 537
           Y+L YE
Sbjct: 91  YILPYE 96


>gi|326912327|ref|XP_003202505.1| PREDICTED: lysine-specific demethylase 5A-like [Meleagris
           gallopavo]
          Length = 1487

 Score = 42.4 bits (98), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G    +P          ++  K LDL+ L + V  +GGF V      
Sbjct: 68  DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVAGKGGFEVVT---- 113

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 114 KEKKWSKVASRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 155


>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
 gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
          Length = 2678

 Score = 42.0 bits (97), Expect = 0.88,   Method: Composition-based stats.
 Identities = 24/69 (34%), Positives = 30/69 (43%), Gaps = 6/69 (8%)

Query: 545 GECCLLCH----SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           G+ C +C     SS  GDW+ C  C  W H  CD    L    +  K+  + Y CP C  
Sbjct: 767 GQYCAICRQIWSSSWDGDWLQCDTCRFWIHTECD--TNLINSIEVLKSPSVSYHCPVCRS 824

Query: 601 TNFKKKSQK 609
              K K QK
Sbjct: 825 NEKKPKYQK 833


>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
           PHD-bromo-3xPHD-SET domain and associated cysteine
           cluster at the C-terminus [Cryptosporidium parvum Iowa
           II]
 gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
           PHD-bromo-3xPHD-SET domain and associated cysteine
           cluster at the C-terminus [Cryptosporidium parvum Iowa
           II]
          Length = 2244

 Score = 42.0 bits (97), Expect = 0.88,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 6/69 (8%)

Query: 545 GECCLLCH----SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           G+ C +C     SS  G+W+ C IC  W H+ CD+       + Y+    L Y CP C  
Sbjct: 609 GQFCSICRKIWTSSWEGEWLQCDICKFWVHYDCDKDLN-EPIEFYSNVKNL-YNCPACRS 666

Query: 601 TNFKKKSQK 609
            +   K Q+
Sbjct: 667 NDNSVKYQR 675


>gi|403216349|emb|CCK70846.1| hypothetical protein KNAG_0F01780 [Kazachstania naganishii CBS 8797]
          Length = 1421

 Score = 42.0 bits (97), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 7/115 (6%)

Query: 500  NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 559
            +W  ++   ++   +TN +T V   L   ++ +     LA  D   + C  C    +G  
Sbjct: 1206 SWTTELNKCLKRQKITNILTLVTACLDLDFDCFTKRGSLAEKD--AKYCF-CRKGESGTM 1262

Query: 560  VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
            + C IC EW H  C  ++G    K  A T+ + +VC  C++ +    S++    Y
Sbjct: 1263 IECDICKEWYHTNCIGQKGS---KSTATTNSV-FVCSLCNINSNTVMSKRNGVNY 1313


>gi|118082971|ref|XP_416379.2| PREDICTED: lysine-specific demethylase 5A [Gallus gallus]
          Length = 1691

 Score = 42.0 bits (97), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G    +P          ++  K LDL+ L + V  +GGF V      
Sbjct: 89  DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVAGKGGFEVVT---- 134

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 135 KEKKWSKVASRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176


>gi|195434725|ref|XP_002065353.1| GK15405 [Drosophila willistoni]
 gi|194161438|gb|EDW76339.1| GK15405 [Drosophila willistoni]
          Length = 1900

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 14/96 (14%)

Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
           FL  + +F  L+G +  +P          ++  K LDL+ L+R V   GG         K
Sbjct: 268 FLDQIAKFWELQGASLKIP----------MVERKALDLYTLHRIVQEEGGMETAT----K 313

Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
            + ++K+ N         VG TLK HYE  L  +E+
Sbjct: 314 ERKWAKVANRMQYPSSKSVGATLKAHYERILHPFEV 349


>gi|403344053|gb|EJY71365.1| DUF1423 domain containing protein [Oxytricha trifallax]
          Length = 663

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 14/41 (34%), Positives = 20/41 (48%)

Query: 558 DWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           +WV C  C +W H  C+   G     +  + DG +Y CP C
Sbjct: 446 EWVQCENCNKWEHTDCEVINGFTELPEKLQQDGWKYFCPLC 486


>gi|340507679|gb|EGR33604.1| hypothetical protein IMG5_048210 [Ichthyophthirius multifiliis]
          Length = 479

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 37/87 (42%), Gaps = 13/87 (14%)

Query: 476 KRLDLFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL 534
           + LD F LY+ V+ RGG   V N   WK  V          +  T    TL+ HY+ YLL
Sbjct: 40  RELDFFKLYKSVIRRGGAQAVSNNKMWKEIV----NEFGFPSTCTSASFTLRNHYQKYLL 95

Query: 535 EYELAH--------DDVDGECCLLCHS 553
            YE  +         ++  E  +  HS
Sbjct: 96  GYEFKYFFHQLQSTKNIKQESQIQSHS 122


>gi|412991505|emb|CCO16350.1| predicted protein [Bathycoccus prasinos]
          Length = 436

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 6/76 (7%)

Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWK-----GQVFSKMRNHTLTNRMTGVGNTLKRH 528
           + K  DL   Y EV  RGGF   N +  K      ++F  M N+   +  T +GN L   
Sbjct: 288 SGKPFDLLTFYGEVCRRGGFG-RNRVEAKCRFSISKIFKVMFNYFDQHSYTDIGNKLFDV 346

Query: 529 YETYLLEYELAHDDVD 544
           YE + L YE AH   D
Sbjct: 347 YELFFLPYERAHSQED 362


>gi|395541541|ref|XP_003772701.1| PREDICTED: lysine-specific demethylase 5A-like, partial
           [Sarcophilus harrisii]
          Length = 200

 Score = 41.2 bits (95), Expect = 1.4,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 65  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 110

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
           K + +SK+ +        G G+ LK HYE  L  YEL    V
Sbjct: 111 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 152


>gi|208435489|pdb|2JXJ|A Chain A, Nmr Structure Of The Arid Domain From The Histone H3k4
           Demethylase Rbp2
          Length = 96

 Score = 41.2 bits (95), Expect = 1.6,   Method: Composition-based stats.
 Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 14/97 (14%)

Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
           +FL  + +F  L+G T  +P          ++  K LDL+ L + V S+GGF +      
Sbjct: 10  DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 55

Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
           K + +SK+ +        G G+ LK HYE  L  YEL
Sbjct: 56  KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYEL 92


>gi|355753250|gb|EHH57296.1| AT-rich interactive domain-containing protein 3C [Macaca
           fascicularis]
          Length = 411

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 44/99 (44%), Gaps = 16/99 (16%)

Query: 441 EEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNG 498
           +EFL D+  F+  RG     VP          I+  + LDL+ L+R V ++GG   V N 
Sbjct: 116 KEFLDDLFSFMQKRGTPVNRVP----------IMAKQVLDLYALFRLVTAKGGLVEVINR 165

Query: 499 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
             W+       R  +L   +T    TL+  Y  YL  YE
Sbjct: 166 KVWR----EVTRGLSLPTTITSAAFTLRTQYMKYLYPYE 200


>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 989

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + H S  G+WV C  C  W H  CD+      FKD   TD   Y CP C
Sbjct: 375 GICKRIWHHSDGGNWVCCDGCNVWVHAECDKISS-KVFKDLENTD---YYCPDC 424


>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
 gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
          Length = 1149

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + H S  G+WV C  C  W H  CD +     FKD   TD   Y CP C
Sbjct: 323 GICKKIWHHSDGGNWVCCDGCNVWVHAECD-KISTEHFKDLENTD---YYCPDC 372


>gi|297821026|ref|XP_002878396.1| hypothetical protein ARALYDRAFT_907712 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324234|gb|EFH54655.1| hypothetical protein ARALYDRAFT_907712 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 645

 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 25/56 (44%), Gaps = 4/56 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           G C  + H S  GDWV C  C  W H GCD        K + + +   Y CP C V
Sbjct: 337 GICKRMWHPSDDGDWVCCDGCDVWVHAGCDNISN----KHFKELEHNNYYCPNCKV 388


>gi|356560272|ref|XP_003548417.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 954

 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + H S  G+WV C  C  W H  CD+      FKD   TD   Y CP C
Sbjct: 337 GICKRIWHHSDGGNWVCCDGCNVWVHAECDKISS-KLFKDLENTD---YYCPDC 386


>gi|312075048|ref|XP_003140243.1| hypothetical protein LOAG_04658 [Loa loa]
          Length = 388

 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 75/194 (38%), Gaps = 43/194 (22%)

Query: 350 LNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLS 409
           L  A+  P PHT           S++ +     V AN P+ P  +    PT +  R    
Sbjct: 232 LPGASPNPWPHTP----------SQMSQGHRTPVPANRPITPTVNKRQDPTSIVDRL--- 278

Query: 410 SSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHT-RLVPQGGLAEF 468
                  + ++NP  +           +     F   ++QF   +G     VPQ      
Sbjct: 279 ----VGPVTAMNPQHI-----------MPERRAFFEKLVQFCEQQGEPITQVPQ------ 317

Query: 469 PDAILNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
               ++ + +DL  LY  V+ RGGF  V     WK QV ++  +    +  +  G  L+R
Sbjct: 318 ----VSKQTVDLHRLYLAVMKRGGFEQVTREKTWK-QVCTEANSEM--SESSAAGYQLRR 370

Query: 528 HYETYLLEYELAHD 541
           HY+ YLL  E   D
Sbjct: 371 HYQKYLLGLECLED 384


>gi|241948971|ref|XP_002417208.1| conserved hypothetical protein [Candida dubliniensis CD36]
 gi|223640546|emb|CAX44800.1| conserved hypothetical protein [Candida dubliniensis CD36]
          Length = 1668

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 8/62 (12%)

Query: 539  AHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
              D+V  +   +C     G  V C  C EW H  C         KD +  D  +Y+CP C
Sbjct: 1365 TEDEVTPKYYCVCREYEYGTMVECDKCNEWYHVQC--------VKDVSNPDADKYICPTC 1416

Query: 599  SV 600
             V
Sbjct: 1417 LV 1418


>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 985

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + H S  G+WV C  C  W H  CD+      FKD   TD   Y CP C
Sbjct: 373 GICKKIWHHSDGGNWVCCDGCNVWVHAECDKISS-KHFKDLENTD---YYCPDC 422


>gi|357436505|ref|XP_003588528.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
 gi|355477576|gb|AES58779.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
          Length = 973

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 5/61 (8%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVT-NF 603
           G C  + + S +G WV C  C  W H  CD+   +  FK+   TD   Y CP C V  NF
Sbjct: 408 GICKKVWNQSDSGSWVRCDGCKVWVHAECDKISSI-LFKNLGSTD---YFCPACKVKFNF 463

Query: 604 K 604
           +
Sbjct: 464 E 464


>gi|357127667|ref|XP_003565500.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
           [Brachypodium distachyon]
          Length = 972

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 34/87 (39%), Gaps = 9/87 (10%)

Query: 516 NRMTGVGNTLKRHYETYLL----EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHF 571
           NR+T   +  K+     LL    E  L      G C  + H +  G+WV C  C  W H 
Sbjct: 331 NRLTSKISKKKKQEVEQLLCRHCEKLLQSKQYCGICKKIWHHTDGGNWVCCDECEIWVHV 390

Query: 572 GCDRRQGLGAFKDYAKTDGLEYVCPQC 598
            CDR        D    +  EY CP C
Sbjct: 391 ECDR-----TCSDLEDLENTEYFCPDC 412


>gi|403345100|gb|EJY71909.1| ARID/BRIGHT DNA binding domain containing protein [Oxytricha
           trifallax]
          Length = 673

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 39/86 (45%), Gaps = 9/86 (10%)

Query: 464 GLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVG 522
           G+  F    +  K LDL  LY+ V+ RGG   V N   WK  V        + +  T   
Sbjct: 36  GVENFKVPSIGGKELDLCKLYKAVIQRGGSQRVSNNKLWKEIV----NEFEIPSSCTSAS 91

Query: 523 NTLKRHYETYLLEYE----LAHDDVD 544
            TL+ HY   LL+YE    L H ++D
Sbjct: 92  FTLRNHYNKCLLQYEKKYFLGHSNLD 117


>gi|340386800|ref|XP_003391896.1| PREDICTED: hypothetical protein LOC100633834, partial [Amphimedon
           queenslandica]
          Length = 192

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 5/51 (9%)

Query: 548 CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           CL+  ++    WV C  C +W H  C      G  K  A  +G +Y CP+C
Sbjct: 145 CLMKRNNPNSTWVLCEKCPQWVHIRC-----AGITKHKASKEGFKYFCPKC 190


>gi|145479355|ref|XP_001425700.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392772|emb|CAK58302.1| unnamed protein product [Paramecium tetraurelia]
          Length = 570

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 19/102 (18%)

Query: 440 EEE--FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV-- 495
           EEE  F++ + +F   RG T  +PQ G           + L++F LY+ V  RGG  V  
Sbjct: 5   EEEKVFIQQLQKFWEQRGVTIKIPQIG----------GRELEVFKLYKAVTKRGGLKVVS 54

Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
            N + WK  V       +     T    TL+ HY+  LL YE
Sbjct: 55  ANKL-WKEIV----DQFSFPATCTSASFTLRNHYQKLLLAYE 91


>gi|68474912|ref|XP_718497.1| hypothetical protein CaO19.2476 [Candida albicans SC5314]
 gi|46440265|gb|EAK99573.1| hypothetical protein CaO19.2476 [Candida albicans SC5314]
          Length = 1684

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 8/60 (13%)

Query: 541  DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
            D++  +   +C     G  V C  C EW H  C         KD +  D  +Y+CP C V
Sbjct: 1377 DEITPQYYCVCREYEYGTMVECDKCNEWYHVQC--------VKDVSNPDADKYICPTCLV 1428


>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
 gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
          Length = 6442

 Score = 40.0 bits (92), Expect = 4.1,   Method: Composition-based stats.
 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)

Query: 548  CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
            C++C+     D    WV C +C  W H  CD+ +      +      + Y CP C   +F
Sbjct: 1683 CIMCNEKYEIDDSNKWVQCDVCKFWIHLSCDKNENRNI--ETLSIKSINYKCPTCRSGSF 1740

Query: 604  KKKSQK 609
              K ++
Sbjct: 1741 HDKIER 1746


>gi|389584530|dbj|GAB67262.1| SET domain containing protein [Plasmodium cynomolgi strain B]
          Length = 5788

 Score = 40.0 bits (92), Expect = 4.1,   Method: Composition-based stats.
 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)

Query: 548  CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
            C++C+     D    WV C +C  W H  CD+ +      +      + Y CP C   +F
Sbjct: 1214 CIMCNEKYEIDDSNKWVQCDVCKFWIHLSCDKNESRNI--ETLSIKSINYKCPTCRSGSF 1271

Query: 604  KKKSQK 609
              K ++
Sbjct: 1272 HDKIER 1277


>gi|56566281|gb|AAN75616.2| ZNF1 [Cryptococcus neoformans var. neoformans]
          Length = 846

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 44/106 (41%), Gaps = 10/106 (9%)

Query: 506 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE--CCLLCHSSAAGDWVNCG 563
           F  +R   + N  + V + L R YE  + E E+  D  D    CC     S+  D V C 
Sbjct: 121 FEVIRTANVNNIPSTVKDVLMRRYEYLITEREMVADLTDDFRICCCCRQWSSFQDSVKCE 180

Query: 564 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQK 609
            C E+ H  C +   L          G  +VCP C    F++ +Q+
Sbjct: 181 SCREYYHMTCLQPPLLA-----KPAKGYSWVCPSCF---FQRNTQQ 218


>gi|340500107|gb|EGR27007.1| PHD-finger family protein, putative [Ichthyophthirius multifiliis]
          Length = 544

 Score = 39.7 bits (91), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 33/75 (44%), Gaps = 6/75 (8%)

Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD-WVNCGICGEWAHFGCDRRQGLGAFK 583
           +K   E Y   YE  ++D D   C+       GD  + C IC EW HF C     LG   
Sbjct: 371 IKETEEKYAENYESVNEDSDQLYCICRQKYTYGDQMMACEICNEWFHFKC-----LGYKG 425

Query: 584 DYAKTDGLEYVCPQC 598
              + + ++++C  C
Sbjct: 426 SIEEAEKIQFICTLC 440


>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1002

 Score = 39.3 bits (90), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 25/56 (44%), Gaps = 4/56 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
           G C  + H S  GDWV C  C  W H GCD       FK+    +   Y CP C V
Sbjct: 369 GICKRIWHPSDDGDWVCCDGCNVWVHAGCDNITN-ERFKELEHNN---YYCPDCKV 420


>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
 gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
          Length = 6587

 Score = 39.3 bits (90), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)

Query: 548  CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
            C++C+     D    WV C +C  W H  CD+ +      +      + Y CP C   +F
Sbjct: 1730 CIMCNEKYEIDDSNKWVQCDVCKFWIHLSCDKNENRNI--ETLSIKSINYKCPTCRSGSF 1787

Query: 604  KKKSQK 609
              K ++
Sbjct: 1788 HDKIER 1793


>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
           sativus]
          Length = 1073

 Score = 39.3 bits (90), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + + S +G WV C  C  W H  CD+      FKD   TD   Y CP C
Sbjct: 448 GICKKIWNHSDSGSWVRCDGCKVWVHAECDKISS-NLFKDLGSTD---YFCPTC 497


>gi|255555783|ref|XP_002518927.1| hypothetical protein RCOM_1314010 [Ricinus communis]
 gi|223541914|gb|EEF43460.1| hypothetical protein RCOM_1314010 [Ricinus communis]
          Length = 868

 Score = 39.3 bits (90), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 14/82 (17%)

Query: 533 LLEYELAHDDVDG----ECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
           L  ++L  D +DG    +C    +     + VNC  CG W H  C R         Y K 
Sbjct: 8   LQSHDLHEDWIDGSWTVDCVCGVNFDDGEEMVNCDECGVWVHTRCSR---------YVKG 58

Query: 589 DGLEYVCPQCSVTNFKKKSQKT 610
           D L + C +C   N K  S++T
Sbjct: 59  DEL-FTCDKCKSKNNKDDSEET 79


>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 985

 Score = 39.3 bits (90), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 24/54 (44%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + H S  G+WV C  C  W H  CD+      FKD    D   Y CP C
Sbjct: 372 GICKKIWHHSDGGNWVCCDGCNVWVHAECDKISS-KLFKDLENAD---YYCPDC 421


>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
 gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
          Length = 1018

 Score = 39.3 bits (90), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + + S +G WV C  C  W H  CD+      FKD   TD   Y CP C
Sbjct: 391 GICKKIWNHSDSGSWVRCDGCKVWVHAECDKISN-SRFKDLGATD---YYCPAC 440


>gi|321262771|ref|XP_003196104.1| ZNF1p [Cryptococcus gattii WM276]
 gi|54112196|gb|AAV28798.1| ZNF1p [Cryptococcus gattii]
 gi|317462579|gb|ADV24317.1| ZNF1p [Cryptococcus gattii WM276]
          Length = 850

 Score = 39.3 bits (90), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 37/95 (38%), Gaps = 7/95 (7%)

Query: 506 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDG--ECCLLCHSSAAGDWVNCG 563
           F  +R  ++ N    V   L R YE  + E E+  D +D    CC     S+  D V C 
Sbjct: 121 FEVIRTASVNNVPATVKEVLLRRYEYLITEREMVADLMDNFRNCCRCSQWSSFQDSVKCE 180

Query: 564 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
            C E  H  C +   L          G  +VCP C
Sbjct: 181 SCREHYHMSCLQPPLLA-----KPAKGYSWVCPSC 210


>gi|307196235|gb|EFN77881.1| AT-rich interactive domain-containing protein 5B [Harpegnathos
           saltator]
          Length = 1163

 Score = 39.3 bits (90), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 26/119 (21%)

Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
             E++F+ DV  F+  RG     P G +      +L  K++DLF  Y +V   GG+  V 
Sbjct: 225 AEEKKFIIDVQGFMNSRG----TPFGKMP-----LLGNKQIDLFLFYTKVQMLGGYDSVS 275

Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTL------KRHYETYLLEYELAHDDVDGECCL 549
            G  WK          T+ + + G   +       +RHYE  LL YE      D +  L
Sbjct: 276 TGRLWK----------TIYDDIGGSTTSTNAATTTRRHYERLLLPYERYQKGEDTKIKL 324


>gi|91087509|ref|XP_968902.1| PREDICTED: similar to GA18855-PA [Tribolium castaneum]
          Length = 917

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 52/126 (41%), Gaps = 28/126 (22%)

Query: 413 QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAI 472
           Q +Q+  +N  P +K              EFL D+  F+  RG     P   L      I
Sbjct: 144 QVRQLYEINDDPKRK--------------EFLDDLFSFMQKRG----TPINRLP-----I 180

Query: 473 LNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
           +    LDL+ LY  V++RGG   V N   W+      ++   L + +T    TL+  Y  
Sbjct: 181 MAKSVLDLYELYNLVIARGGLVDVINKKLWQ----EIIKGLHLPSSITSAAFTLRTQYMK 236

Query: 532 YLLEYE 537
           YL  YE
Sbjct: 237 YLYPYE 242


>gi|145527630|ref|XP_001449615.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417203|emb|CAK82218.1| unnamed protein product [Paramecium tetraurelia]
          Length = 204

 Score = 38.9 bits (89), Expect = 7.8,   Method: Composition-based stats.
 Identities = 16/75 (21%), Positives = 33/75 (44%), Gaps = 11/75 (14%)

Query: 546 ECCLLCHSSAAGD---------WVNCG--ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYV 594
           +CC  C    +           W+ C    C +W H  C+ + G+ + +   +    +Y+
Sbjct: 48  KCCYFCAQVYSDQNENFLDGQKWIACDQEKCDKWTHLSCEAKNGIHSIECLVEDMKFKYI 107

Query: 595 CPQCSVTNFKKKSQK 609
           CP C + + K+++ K
Sbjct: 108 CPWCRIEDQKQRNMK 122


>gi|328872173|gb|EGG20540.1| hypothetical protein DFA_00401 [Dictyostelium fasciculatum]
          Length = 436

 Score = 38.9 bits (89), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 31/72 (43%), Gaps = 8/72 (11%)

Query: 548 CLLC--------HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
           CLLC          +++  W+ C  C +W    CD   G  +  D +  + L Y CP+C 
Sbjct: 314 CLLCGIVYNSKEAMASSISWIRCDDCKQWVMSKCDSGMGDISLYDDSNPNPLHYSCPKCR 373

Query: 600 VTNFKKKSQKTS 611
               K K+ + +
Sbjct: 374 TDPSKPKTTRNN 385


>gi|56566239|gb|AAN75157.2| ZNF1 [Cryptococcus neoformans var. grubii]
          Length = 850

 Score = 38.9 bits (89), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 7/95 (7%)

Query: 506 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE--CCLLCHSSAAGDWVNCG 563
           F  +R   + N  + V + L R YE  + E E+  D  D    CC     S+  D V C 
Sbjct: 121 FEVIRTVNVNNIPSTVKDVLMRRYEYLITEREMVADLTDDFRICCCCRQWSSFQDSVKCE 180

Query: 564 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
            C E+ H  C +   L          G  +VCP C
Sbjct: 181 SCREYYHMTCLQPPLLA-----KPAKGYSWVCPSC 210


>gi|134114177|ref|XP_774336.1| hypothetical protein CNBG3170 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256971|gb|EAL19689.1| hypothetical protein CNBG3170 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 1249

 Score = 38.9 bits (89), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 550 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTN 602
           +C    +G  + C +C +W HF C     +   +D A+    +YVCP C+++N
Sbjct: 55  VCKQDTSGPMIECDVCSDWFHFKC-----INLAEDDAEKIH-KYVCPSCTLSN 101


>gi|58269444|ref|XP_571878.1| transcriptional activator [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|57228114|gb|AAW44571.1| transcriptional activator, putative [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 1242

 Score = 38.9 bits (89), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 550 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTN 602
           +C    +G  + C +C +W HF C     +   +D A+    +YVCP C+++N
Sbjct: 55  VCKQDTSGPMIECDVCSDWFHFKC-----INLAEDDAEKI-HKYVCPSCTLSN 101


>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
 gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
          Length = 1053

 Score = 38.5 bits (88), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
           G C  + + S +G WV C  C  W H  CD+      FKD   TD   Y CP C
Sbjct: 429 GICKKVSNHSDSGSWVRCDGCKVWVHAECDKISS-NHFKDLETTD---YFCPTC 478


>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
 gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
          Length = 1078

 Score = 38.5 bits (88), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVT-NF 603
           G C  + + S +G WV C  C  W H  CD+      FKD   TD   Y CP C    NF
Sbjct: 447 GICKKVWNHSDSGSWVRCDGCKVWVHAECDKISS-NRFKDLGGTD---YYCPACKAKFNF 502

Query: 604 K 604
           +
Sbjct: 503 E 503


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.413 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,864,351,716
Number of Sequences: 23463169
Number of extensions: 418522381
Number of successful extensions: 908587
Number of sequences better than 100.0: 235
Number of HSP's better than 100.0 without gapping: 72
Number of HSP's successfully gapped in prelim test: 163
Number of HSP's that attempted gapping in prelim test: 908150
Number of HSP's gapped (non-prelim): 370
length of query: 614
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 465
effective length of database: 8,863,183,186
effective search space: 4121380181490
effective search space used: 4121380181490
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 80 (35.4 bits)