BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007176
(614 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255550299|ref|XP_002516200.1| DNA binding protein, putative [Ricinus communis]
gi|223544686|gb|EEF46202.1| DNA binding protein, putative [Ricinus communis]
Length = 749
Score = 1092 bits (2823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/619 (84%), Positives = 556/619 (89%), Gaps = 5/619 (0%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
+GVPYVIYWK +FSCYAA HF QALLSVVQSSCSHT DAFQLAHASF LYCVRNN ++
Sbjct: 131 FKGVPYVIYWKSTFSCYAAAHFRQALLSVVQSSCSHTCDAFQLAHASFSLYCVRNNTGLS 190
Query: 61 SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
SN+QK K GP LLG+PPKIDI L E DVQ EE+S LPAIKIYDDDVTMRFLVC +P
Sbjct: 191 SNNQKVGGKPGPRLLGEPPKIDITLPEADVQDEESSSGTLPAIKIYDDDVTMRFLVCELP 250
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
TLD LLG LEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG FSRGVVTMRCDLSTCSS
Sbjct: 251 STLDACLLGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGVVTMRCDLSTCSS 310
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHISLLVSGSAQ CFNDQLLENHIKNELIENSQLVHALP+S +++L SEPRKSASI CG
Sbjct: 311 AHISLLVSGSAQACFNDQLLENHIKNELIENSQLVHALPSSEESKLLTSEPRKSASIGCG 370
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
ASVFEV +KV +WASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK+D ERLLFFCTR
Sbjct: 371 ASVFEVCLKVPSWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKEDTERLLFFCTR 430
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRE-----SKGVESENVCNVRPKLNSAAM 355
QGK + NS++ +PP WL PAPSRKRSEPCRE SKG+E EN +V+ KLN AAM
Sbjct: 431 QGKELYPNNSIIIKPPCWLIPPAPSRKRSEPCRETKLFTSKGLERENGGSVKQKLNVAAM 490
Query: 356 RPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQ 415
RPIPHTRH+KMLPFSGF+E ERYDGDQ K +LPVAP KH GP PV+HRKSLSSSYQAQ
Sbjct: 491 RPIPHTRHHKMLPFSGFAEGERYDGDQGKPSLPVAPAKHGVVGPAPVSHRKSLSSSYQAQ 550
Query: 416 QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 475
QIISLNPLPLKKHGCGRAPIQ CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA
Sbjct: 551 QIISLNPLPLKKHGCGRAPIQACSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 610
Query: 476 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE 535
KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE
Sbjct: 611 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE 670
Query: 536 YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 595
YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+C
Sbjct: 671 YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYIC 730
Query: 596 PQCSVTNFKKKSQKTSNGY 614
P CS+ NF+KKSQKT+NGY
Sbjct: 731 PHCSIANFRKKSQKTANGY 749
>gi|224141545|ref|XP_002324130.1| predicted protein [Populus trichocarpa]
gi|222865564|gb|EEF02695.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 1047 bits (2708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/617 (84%), Positives = 543/617 (88%), Gaps = 5/617 (0%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GVPYVIYWK +FSCYAA HF QALLSVVQSSCSHT DAFQLAHASFRLYCV+NN AS
Sbjct: 130 KGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFRLYCVQNNNTPAS 189
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
NSQK K GP LLGDPPK DI+L E D QGEE S LPAIKIYDDDVTMRFLVCG+
Sbjct: 190 NSQKVGGKPGPRLLGDPPKFDISLPEADDQGEEGSSGALPAIKIYDDDVTMRFLVCGLTG 249
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LG LEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG FSRGVVTMRCDLSTCSSA
Sbjct: 250 TLDACALGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGVVTMRCDLSTCSSA 309
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
HISLLVSGSAQ CFNDQLLENHIK+ELIENSQLVHA +S +++ P SEPRKSASIACGA
Sbjct: 310 HISLLVSGSAQNCFNDQLLENHIKSELIENSQLVHASTSSDESKSPSSEPRKSASIACGA 369
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
SVFEVSMKV TWASQVLRQLAPDV+YRSLVMLGIASIQGLSVASFEKDDA+RLLFFCT+Q
Sbjct: 370 SVFEVSMKVPTWASQVLRQLAPDVTYRSLVMLGIASIQGLSVASFEKDDADRLLFFCTKQ 429
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK----GVESENVCNVRPKLNSAAMRP 357
K H N VLTR PSWL PAP RKR EP RE+K G EN N + KL AAMRP
Sbjct: 430 SKDPHPRNPVLTRHPSWLIPPAPCRKRYEPSRETKPLTFGCGGENGGNFKQKLYVAAMRP 489
Query: 358 IPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQI 417
IPHTR +KMLPFSGF E ERYDG+Q K +LP P KHS GP PVTHRKSLS+SYQAQQI
Sbjct: 490 IPHTRRHKMLPFSGFLEAERYDGEQTKPSLPPPP-KHSVVGPAPVTHRKSLSNSYQAQQI 548
Query: 418 ISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKR 477
ISLNPLPLKKHGCGR+PIQ CSEEEFLRDVMQFLILRGH+RLVPQGGLAEFPDAILNAKR
Sbjct: 549 ISLNPLPLKKHGCGRSPIQACSEEEFLRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKR 608
Query: 478 LDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
LDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE
Sbjct: 609 LDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 668
Query: 538 LAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQ 597
LAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP
Sbjct: 669 LAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPH 728
Query: 598 CSVTNFKKKSQKTSNGY 614
CS+ NFKKKSQK +NGY
Sbjct: 729 CSIANFKKKSQKNANGY 745
>gi|225444591|ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis
vinifera]
gi|297738501|emb|CBI27746.3| unnamed protein product [Vitis vinifera]
Length = 739
Score = 1023 bits (2645), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 502/620 (80%), Positives = 540/620 (87%), Gaps = 14/620 (2%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GV YVIYWK++FSCYAACHF QAL SVVQSSCSHTWDAFQLAHASFRLYCV+NN V S
Sbjct: 127 KGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRLYCVQNNTV-PS 185
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
N+QK S KLGP LLGDPPKI++ E+D EE+ P LP IKIYD DV+MRFLVCG P
Sbjct: 186 NNQKVSGKLGPCLLGDPPKINVVPPEVD--EEESLPATLPVIKIYDADVSMRFLVCGAPS 243
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
LD LLG LEDGLNALL IEIRGSKLHNR SAPPPPLQAG FSRGVVTMRCDLSTCSSA
Sbjct: 244 ALDACLLGSLEDGLNALLCIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRCDLSTCSSA 303
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
HISLLVSGSAQTC NDQLLE++IKNELIE SQLVHA+P+ +++L SEPR+SASIACGA
Sbjct: 304 HISLLVSGSAQTCLNDQLLESYIKNELIEKSQLVHAVPSCEESKLSSSEPRRSASIACGA 363
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
SVFEV +KV TWASQVLRQLAPDVSYRSLV LGIASIQGLSVASFEKDDA+RLLFFCTR
Sbjct: 364 SVFEVRIKVPTWASQVLRQLAPDVSYRSLVTLGIASIQGLSVASFEKDDADRLLFFCTRH 423
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESENVCNVRPKLNSAA 354
K + NS+L RPPSWL +P SRKRS PC E+K G + V +PK+ AA
Sbjct: 424 AKQLNQNNSILPRPPSWLIAPPASRKRSGPCHETKPSGYKVLGGVNGGVLQQKPKI--AA 481
Query: 355 MRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQA 414
MRPIPHTR++KMLPFSG SE R DGDQ K NL V P KH+ G TPVTHRK LSSS+QA
Sbjct: 482 MRPIPHTRNHKMLPFSGISEASRCDGDQAKGNLSVVPAKHN--GTTPVTHRKLLSSSFQA 539
Query: 415 QQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILN 474
QQIISLNPLPLKKHGCGR+PIQ+CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILN
Sbjct: 540 QQIISLNPLPLKKHGCGRSPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILN 599
Query: 475 AKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL 534
AKRLDL+NLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL
Sbjct: 600 AKRLDLYNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL 659
Query: 535 EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYV 594
EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+
Sbjct: 660 EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYI 719
Query: 595 CPQCSVTNFKKKSQKTSNGY 614
CP CS+TNF+KKSQKT+NGY
Sbjct: 720 CPHCSITNFQKKSQKTANGY 739
>gi|297742936|emb|CBI35803.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 991 bits (2562), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/615 (78%), Positives = 524/615 (85%), Gaps = 2/615 (0%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYWK++FSCYAACHF AL SVVQSS +HTWDAFQLA+ASFRLYCVRNN V+ +
Sbjct: 131 KGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFRLYCVRNNHVLPA 190
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
NS K S KLGP LLGDP ID+ E+D + EE S LPAIKIYDDDV +RFLVCG P
Sbjct: 191 NSHKVSGKLGPRLLGDPATIDVPPPEVDAGEDEEGSLGTLPAIKIYDDDVGIRFLVCGEP 250
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
C LD+ L LEDGLNALL+IEIRGSKLHNR SAPPPPLQAG FSRGVVTMRCDLSTCSS
Sbjct: 251 CMLDSCLFESLEDGLNALLSIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRCDLSTCSS 310
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHISLLVSGSAQTCF+DQLLEN+IK E+ E SQLVHALP S N+ P SEPR+SASIACG
Sbjct: 311 AHISLLVSGSAQTCFDDQLLENNIKKEVTEQSQLVHALPYSEGNKPPLSEPRRSASIACG 370
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
A+VFEV KV WASQVLRQLAPDVSYRSLV LGIASIQGL+VASFEKDDA RLLFFCTR
Sbjct: 371 AAVFEVCAKVPAWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDDANRLLFFCTR 430
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
QGK H N +R PSWL P PSRKR EP +++ + + +L AAMRPIPH
Sbjct: 431 QGKYIHPNNFTPSRLPSWLKPPPPSRKRVEPSQDTMNGVTMPLLPAGQRLKVAAMRPIPH 490
Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRKSLSSSYQAQQIIS 419
RH+KMLPFSG SE + +DG QVKANL V P KHS G T HRKS SSSYQA+QIIS
Sbjct: 491 IRHHKMLPFSGISEADGHDGGQVKANLSVPPPTKHSIVGSTSAMHRKSFSSSYQAKQIIS 550
Query: 420 LNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLD 479
LNPLPLKKHGCGR+PI+VCSEEEFL+DVMQFL LRGHTRL+PQGGLAEFPDAILNAKRLD
Sbjct: 551 LNPLPLKKHGCGRSPIRVCSEEEFLKDVMQFLNLRGHTRLIPQGGLAEFPDAILNAKRLD 610
Query: 480 LFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELA 539
L+NLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELA
Sbjct: 611 LYNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELA 670
Query: 540 HDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
HDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPQCS
Sbjct: 671 HDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCS 730
Query: 600 VTNFKKKSQKTSNGY 614
VTNFKKK+ K NG+
Sbjct: 731 VTNFKKKANKAPNGF 745
>gi|356530473|ref|XP_003533805.1| PREDICTED: uncharacterized protein LOC100808357 [Glycine max]
Length = 752
Score = 974 bits (2519), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/619 (75%), Positives = 522/619 (84%), Gaps = 7/619 (1%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GVPY IYWK+ FS YAA HF +L SV QS+ SHTWDAFQLA ASFRLYC+ NN V+ S
Sbjct: 135 KGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-VLPS 193
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
N KG+ KLGP +LG PP ID++ D+ + EE+SPE + A+KIYDDDV MRFL+CGVP
Sbjct: 194 NCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKIYDDDVNMRFLICGVP 253
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
CTLD LLG LEDGLNALL EIRG KLHNRTSA PPPLQAG FSRGVVTMRCD+STCSS
Sbjct: 254 CTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDISTCSS 313
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHISLLVSGSA TCFNDQLLENHIK ELIE SQLV A PN ++ P SEPR+SAS+ACG
Sbjct: 314 AHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASVACG 373
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
+SVFEV M+V WASQVLRQLAP++SYRSLVMLGIASIQGL VASF KDDAERLLFFCTR
Sbjct: 374 SSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFFCTR 433
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVES-----ENVCNVRPKLNSAAM 355
Q K + + V + PSWL P+ SRKRSEPC SK + E + + R K N A+M
Sbjct: 434 QEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGVEAIGSHRQKFNLASM 493
Query: 356 RPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQ 415
RPIPH+ +K+LPFSG SE RYDGD K+NLP+AP+KH+ +GPT VT+RKS+S+S+QA
Sbjct: 494 RPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLPLAPIKHNVSGPTSVTNRKSVSNSFQAH 553
Query: 416 QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 475
QIISLNPLP+KKHGC RAPI+ CSEEEFLRDVMQFLILRGH RL+P GGLAEFPDAILNA
Sbjct: 554 QIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRGHNRLIPPGGLAEFPDAILNA 613
Query: 476 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLE 535
KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLE
Sbjct: 614 KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLE 673
Query: 536 YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 595
YEL+HDDVDGECCL+CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC
Sbjct: 674 YELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 733
Query: 596 PQCSVTNFKKKSQKTSNGY 614
P+CS F KKSQKT+NG+
Sbjct: 734 PRCSALKFSKKSQKTANGF 752
>gi|356556430|ref|XP_003546529.1| PREDICTED: uncharacterized protein LOC100775894 [Glycine max]
Length = 750
Score = 974 bits (2517), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/618 (75%), Positives = 521/618 (84%), Gaps = 6/618 (0%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GVPY IYWK+ FS YAA HF + SV QS+ SHTWDAFQLA ASFRLYCV+NN V+ S
Sbjct: 134 KGVPYTIYWKNDFSKYAASHFRHSFFSVAQSTSSHTWDAFQLALASFRLYCVQNN-VLPS 192
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
NS KG+ KLGP +LG PP ID++ D++ EE SPE + ++KIYDDDV MRFL+CGVPC
Sbjct: 193 NSHKGAGKLGPQILGVPPNIDVSPCVADMKEEEGSPETISSLKIYDDDVNMRFLICGVPC 252
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LLG LEDGLNALL EIRG KLHNRTSA PPPLQAG FSRGVVTMRCD+STCSSA
Sbjct: 253 TLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDISTCSSA 312
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
HISLLVSGSA TCFNDQLLENHIK ELIE SQLV A PN ++ P SEPR+SAS+ACG+
Sbjct: 313 HISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHQQSKAPSSEPRRSASVACGS 372
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
SVFEV M+V WASQVLRQLAP+++YRSLVMLGIASIQ L VASF KDDAERLLFFCTRQ
Sbjct: 373 SVFEVCMRVPAWASQVLRQLAPNLAYRSLVMLGIASIQALPVASFSKDDAERLLFFCTRQ 432
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVES-----ENVCNVRPKLNSAAMR 356
K + ++ V + PSWL P PSRKRSEPC SK + + E + + R K A+MR
Sbjct: 433 EKENCPKDHVFSGIPSWLKPPPPSRKRSEPCSSSKSINASGRGVEAIGSHRQKFIVASMR 492
Query: 357 PIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQ 416
PIPH+ +K+LPFSG SE RYDGD K+NLP+A +KH+ +GPT VT+RKS+S+S+QA Q
Sbjct: 493 PIPHSNRHKILPFSGLSEGTRYDGDHGKSNLPLALIKHNVSGPTSVTNRKSVSNSFQAHQ 552
Query: 417 IISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK 476
IISLNPLP+KKHGC RAPI+ CSEEEFLRDVMQFLILRGH RL+P GGL+EFPDAILNAK
Sbjct: 553 IISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRGHNRLIPPGGLSEFPDAILNAK 612
Query: 477 RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEY
Sbjct: 613 RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEY 672
Query: 537 ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP 596
EL+HDDVDGECCL+CHSSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP
Sbjct: 673 ELSHDDVDGECCLMCHSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP 732
Query: 597 QCSVTNFKKKSQKTSNGY 614
+CS F KKSQKT+NGY
Sbjct: 733 RCSALKFSKKSQKTANGY 750
>gi|42565391|ref|NP_189910.2| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
thaliana]
gi|75223288|sp|Q6NQ79.1|ARID4_ARATH RecName: Full=AT-rich interactive domain-containing protein 4;
Short=ARID domain-containing protein 4
gi|34849889|gb|AAQ82841.1| At3g43240 [Arabidopsis thaliana]
gi|332644257|gb|AEE77778.1| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
thaliana]
Length = 747
Score = 948 bits (2450), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/613 (75%), Positives = 527/613 (85%), Gaps = 4/613 (0%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GV YVIYWK+ FS YAACHF +L SV+QSSCS TWD F +A ASFRLYC +N V+ S
Sbjct: 133 KGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 192
Query: 62 NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
NS +K + ++GP LLG+PPKID+ E D EENS E+LP+IKIYD+DVT+RFL+CG P
Sbjct: 193 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVRFLLCGPP 252
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
CT+DT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 253 CTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 312
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHIS+LVSG+AQTCF+DQLLENHIK+E++E QLVH++ NS + + SEPR+SASIACG
Sbjct: 313 AHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRGFSEPRRSASIACG 372
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 373 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 432
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
Q +++L++ P+WLT P P+RKRSEPCRESK E EN K+N AA+RPIPH
Sbjct: 433 QINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 490
Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
TR +KM+PFSG+SEI R+DGD K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 491 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 549
Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGHTRLVP GGLAEFPDA+LN+KRLDL
Sbjct: 550 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDL 609
Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 610 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 669
Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 670 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 729
Query: 601 TNFKKKSQKTSNG 613
+N++KKSQKTSNG
Sbjct: 730 SNYRKKSQKTSNG 742
>gi|297818776|ref|XP_002877271.1| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
lyrata subsp. lyrata]
gi|297323109|gb|EFH53530.1| ARID/BRIGHT DNA-binding domain-containing protein [Arabidopsis
lyrata subsp. lyrata]
Length = 748
Score = 942 bits (2435), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/613 (75%), Positives = 523/613 (85%), Gaps = 4/613 (0%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GV YVIYWK FS YAACHF AL SV+QSSCS TWD F +A ASFRLYC +N V+ S
Sbjct: 134 KGVQYVIYWKDVFSKYAACHFRHALFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 193
Query: 62 NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
NS +K + ++GP LLG+PPKID+ E D EENS E+LP+IKIYD+DVT+RFL+CG P
Sbjct: 194 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVRFLLCGPP 253
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
CTLDT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 254 CTLDTLLLGSLVDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 313
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHIS+LVSG+AQTCF+DQLLENHIK+E++E QLVH + NS + + SEPR+SASIACG
Sbjct: 314 AHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHPVGNSEETKRGFSEPRRSASIACG 373
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 374 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 433
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
Q +++L++ P WLT P P+RKRSEPCRESK E EN K+N AA+RPIPH
Sbjct: 434 QINDTSNHDTLLSKIPHWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 491
Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
TR +KM+PFSG+SEI R+DGD K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 492 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 550
Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGH+RLVP GGLAEFPDA+LN+KRLDL
Sbjct: 551 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHSRLVPPGGLAEFPDAVLNSKRLDL 610
Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 611 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 670
Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 671 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 730
Query: 601 TNFKKKSQKTSNG 613
+N++KK+ KTSNG
Sbjct: 731 SNYRKKTPKTSNG 743
>gi|7649364|emb|CAB89045.1| putative protein [Arabidopsis thaliana]
Length = 717
Score = 874 bits (2257), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/613 (71%), Positives = 496/613 (80%), Gaps = 32/613 (5%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GV YVIYWK+ FS YAACHF +L SV+QSSCS TWD F +A ASFRLYC +N V+ S
Sbjct: 131 KGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 190
Query: 62 NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
NS +K + ++GP LLG+PPKID+ E D EENS E+LP+IKIYD+DVT+ + +
Sbjct: 191 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVSYKAFSL- 249
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
DT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 250 -FQDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 308
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHIS+LVSG+AQTCF + + SEPR+SASIACG
Sbjct: 309 AHISMLVSGNAQTCFKE--------------------------TKRGFSEPRRSASIACG 342
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 343 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 402
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
Q +++L++ P+WLT P P+RKRSEPCRESK E EN K+N AA+RPIPH
Sbjct: 403 QINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 460
Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
TR +KM+PFSG+SEI R+DGD K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 461 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 519
Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGHTRLVP GGLAEFPDA+LN+KRLDL
Sbjct: 520 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDL 579
Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 580 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 639
Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 640 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 699
Query: 601 TNFKKKSQKTSNG 613
+N++KKSQKTSNG
Sbjct: 700 SNYRKKSQKTSNG 712
>gi|357514291|ref|XP_003627434.1| Fiber protein Fb21 [Medicago truncatula]
gi|355521456|gb|AET01910.1| Fiber protein Fb21 [Medicago truncatula]
Length = 769
Score = 858 bits (2216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/635 (67%), Positives = 496/635 (78%), Gaps = 25/635 (3%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
++G+PYV++WK++FS YAACHF QAL SVVQSS +HTWDAF LA ASF LYCV+NN V+
Sbjct: 129 LKGIPYVVFWKNAFSQYAACHFRQALFSVVQSSSTHTWDAFHLARASFELYCVQNNQVLP 188
Query: 61 SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPEN--LPAIKIYDDDVTMRFLVCG 118
++S S +GPHLLG+ KI++ EMD + ++ + LP+I+I+DD+V +RFL+CG
Sbjct: 189 TDSNDADSDMGPHLLGECLKINVDPPEMDEEDDDEESSSGSLPSIQIHDDEVNLRFLICG 248
Query: 119 VPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTC 178
P T+D SLL LEDGL ALL IE+R KLH + SAPPPPLQA +FSRGVVTMRCD+STC
Sbjct: 249 APSTVDESLLRSLEDGLRALLTIEMRSCKLHGKYSAPPPPLQAASFSRGVVTMRCDISTC 308
Query: 179 SSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIA 238
SSAHISLLVSGS Q CFNDQLLENHIKNE+IE SQ+VHA N N SEPR+SASIA
Sbjct: 309 SSAHISLLVSGSPQACFNDQLLENHIKNEIIEKSQIVHARLNGEANTQIISEPRRSASIA 368
Query: 239 CGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFC 298
CGA++FEVSMK+ WA Q+LRQLAPDVSYRSLV LGIASIQGL VASFEKDDAERLLFF
Sbjct: 369 CGATIFEVSMKLPQWALQILRQLAPDVSYRSLVALGIASIQGLPVASFEKDDAERLLFFY 428
Query: 299 TRQGKAD-HTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVE------------------ 339
K N V +RPP WL P P+RKR C S+G
Sbjct: 429 QSSAKDGCDNGNIVFSRPPVWLKPPPPTRKR---CESSQGASPDIHNDEEEKDRKMVNGI 485
Query: 340 SENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGP 399
S + R +L +AMRPIPH R ++M PFSG S + + G V+A +P+ P+K SS G
Sbjct: 486 STPLTPARQRLKVSAMRPIPHVRRHRMTPFSGPSGVNGFGGPHVEAYVPLVPVKRSSIGS 545
Query: 400 TPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRL 459
+ T RKS SSS Q +Q+ISLNPLPLKKHGC R +Q CSEEEF++DVM+FLILRGH+RL
Sbjct: 546 SSATQRKSFSSSSQPKQVISLNPLPLKKHGCSRGSVQTCSEEEFIKDVMEFLILRGHSRL 605
Query: 460 VPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMT 519
+PQGGLAEFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T TNRMT
Sbjct: 606 IPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMGNYTSTNRMT 665
Query: 520 GVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGL 579
GVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGL
Sbjct: 666 GVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGL 725
Query: 580 GAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
GAFKDYAKTDGLEY+CP CSVTNFKKK Q +NGY
Sbjct: 726 GAFKDYAKTDGLEYICPHCSVTNFKKK-QSVANGY 759
>gi|356559199|ref|XP_003547888.1| PREDICTED: uncharacterized protein LOC100816540 [Glycine max]
Length = 782
Score = 855 bits (2208), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/645 (69%), Positives = 512/645 (79%), Gaps = 32/645 (4%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
++G+PYVI+WK++FSCYAACHF QA LSVVQSS +HTWDAF LA ASF LYCV+NN V+
Sbjct: 129 LKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYCVQNNQVLP 188
Query: 61 SNSQKGSSKLGPHLLGDPPKIDI-ALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGV 119
S+S SS++GPHLLGD KI++ + +E+S +LPAIKI++D+V +RFL+CG
Sbjct: 189 SDSDDASSEMGPHLLGDCLKINVDPPEIDEEDDDESSSGSLPAIKIHEDEVNLRFLICGA 248
Query: 120 PCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCS 179
P T+D SLL LEDGL ALL IEIRG KLH + SAPPPPLQA AFSRGVVTMRCD+STCS
Sbjct: 249 PSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAAAFSRGVVTMRCDISTCS 308
Query: 180 SAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIAC 239
SAHISLLVSGSAQTCFNDQLLENHIKNE+IE SQLVHA N+ N+ EPR+SASIAC
Sbjct: 309 SAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQLNNEGNKENICEPRRSASIAC 368
Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
GASVFE+ MK+ WA Q+LRQLAP+VSYRSLV LGIASIQGL +ASFEKDDAERLLFF
Sbjct: 369 GASVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDDAERLLFFYQ 428
Query: 300 RQGKADHTE--NSVLTRPPSWLTSPAPSRKRSEPCRES-----KGVESEN--VCNV---- 346
K T N + + PP WL P P+RKR EP +E+ +GV + VC +
Sbjct: 429 NCEKDSCTNKNNIIFSSPPGWLKPPPPTRKRCEPRQEASPGLHEGVFAGQGGVCKLNEEE 488
Query: 347 ----------------RPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLP-V 389
R +L +AMRPIPH R ++M PF G SE + +DG QV+A LP V
Sbjct: 489 KDRKIVNGISMPLTPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGTQVEAILPLV 548
Query: 390 APLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQ 449
AP K +S G T THRKS SS+ Q++Q+ISLNPLPLKKHGCGR P+Q CSEEEFL+DVM+
Sbjct: 549 APTKRTSIGSTSGTHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEEEFLKDVME 608
Query: 450 FLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKM 509
FLILRGH RL+PQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM
Sbjct: 609 FLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKM 668
Query: 510 RNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWA 569
RN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWA
Sbjct: 669 RNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWA 728
Query: 570 HFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
HFGCDRRQGLGAFKDYAKTDGLEY+CP CSVTNFKKK Q +NGY
Sbjct: 729 HFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVANGY 772
>gi|449517916|ref|XP_004165990.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
[Cucumis sativus]
Length = 772
Score = 854 bits (2207), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/632 (67%), Positives = 499/632 (78%), Gaps = 20/632 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PY+IYW +FSCYAA HF ALLSVVQSS +HTWDAFQLA A+FRLY V +N +
Sbjct: 130 KGIPYLIYWNSTFSCYAAAHFRHALLSVVQSSSTHTWDAFQLARAAFRLYSVGSNYGLPG 189
Query: 62 NSQKGS-SKLGPHLLGDPPKIDIALSEMDV---QGEENSPENLPAIKIYDDDVTMRFLVC 117
+ + S L P L+G+P KID+ E+DV + E+ S E LPAI I+D++VTMRFL+C
Sbjct: 190 IADDSTMSDLEPQLIGEPLKIDVEPPELDVGEGEDEDGSLEALPAINIHDNNVTMRFLIC 249
Query: 118 GVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLST 177
GVPCT DT LL LEDGL+ALL IE+RGSKL + SAPPPPLQAG+FSRGVVTMRCD+ T
Sbjct: 250 GVPCTPDTCLLRSLEDGLDALLKIEMRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVT 309
Query: 178 CSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASI 237
CSSAHIS+LVSGSA TCF+DQLLE HIK+E+IE++QLVHA+ + N+ +PRKSASI
Sbjct: 310 CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEHNQLVHAIHDCEGNKHHMHKPRKSASI 369
Query: 238 ACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFF 297
ACGA+VFEVSMKV WASQVLRQLAPD+SYRSLV LGI +QGL VASFEK+DAERLLFF
Sbjct: 370 ACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDAERLLFF 429
Query: 298 CTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------ 345
C+ G H+E +++ PSW P PSRKR EP + + S ++
Sbjct: 430 CSGDGNDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRNSLSHDIGREDPVPMNGFKAS 489
Query: 346 ---VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLP-VAPLKHSSAGPTP 401
R KL A+MRP+P KM PF+G +E++ +G KA+L V P KH + G T
Sbjct: 490 LHPARKKLKVASMRPVPRLHRNKMTPFAGLTEVDGNNGGLSKASLSIVTPPKHVTVGSTS 549
Query: 402 VTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVP 461
THRKS SSS Q++QIISLNPLPLKKHGCGR PIQ CSEEEFL+DVM+FL+LRGHTRL+P
Sbjct: 550 ATHRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHTRLIP 609
Query: 462 QGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGV 521
QGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGV
Sbjct: 610 QGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGV 669
Query: 522 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 581
GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA
Sbjct: 670 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 729
Query: 582 FKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
FKDYAKTDGLEYVCP CS+T +KKK + +NG
Sbjct: 730 FKDYAKTDGLEYVCPHCSITTYKKKPHRVANG 761
>gi|449453463|ref|XP_004144477.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
[Cucumis sativus]
Length = 772
Score = 846 bits (2186), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/632 (67%), Positives = 496/632 (78%), Gaps = 20/632 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PY+IYW +FSCYAA HF ALLSVVQSS +HTWDAFQLA A+FRLY V +N +
Sbjct: 130 KGIPYLIYWNSTFSCYAAAHFRHALLSVVQSSSTHTWDAFQLARAAFRLYSVGSNYGLPG 189
Query: 62 NSQKGS-SKLGPHLLGDPPKIDIALSEMDV---QGEENSPENLPAIKIYDDDVTMRFLVC 117
+ S L P L+G+P KID+ E+DV + E+ S E LPAI I+D++VTMRFL+C
Sbjct: 190 IADDSMMSDLEPQLIGEPLKIDVEPPELDVGEGEDEDGSLEALPAINIHDNNVTMRFLIC 249
Query: 118 GVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLST 177
GVPCT DT LL LEDGL+ALL IE+RGSKL + SAPPPPLQAG+FSRGVVTMRCD+ T
Sbjct: 250 GVPCTPDTCLLRSLEDGLDALLKIEMRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVT 309
Query: 178 CSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASI 237
CSSAHIS+LVSGSA TCF+DQLLE HIK+E+IE++QLVHA+ + N+ +PRKSASI
Sbjct: 310 CSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEHNQLVHAIHDCEGNKHHMHKPRKSASI 369
Query: 238 ACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFF 297
ACGA+VFEVSMKV WASQVLRQLAPD+SYRSLV LGI +QGL VASFEK+DAERLLFF
Sbjct: 370 ACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDAERLLFF 429
Query: 298 CTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------ 345
C+ G H+E +++ PSW P PSRKR EP + + S ++
Sbjct: 430 CSGDGNDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRNSLSHDIGREDPVPMNGFKAS 489
Query: 346 ---VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLP-VAPLKHSSAGPTP 401
R KL A+MRP+P KM PF+G +E++ +G KA+L V P KH + G T
Sbjct: 490 LHPARKKLKVASMRPVPRLHRNKMTPFAGLTEVDGNNGGLSKASLSIVTPPKHVTVGSTS 549
Query: 402 VTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVP 461
THRKS SSS Q++QIISLNPLPLKKHGCGR PIQ CSEEEFL+DVM+FL+LRGHTRL+P
Sbjct: 550 ATHRKSFSSSSQSKQIISLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHTRLIP 609
Query: 462 QGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGV 521
QGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKM N+T+TNRMTGV
Sbjct: 610 QGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGV 669
Query: 522 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 581
GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA
Sbjct: 670 GNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGA 729
Query: 582 FKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
FK YAKTDGLEYVC CS+T +KKK + +NG
Sbjct: 730 FKVYAKTDGLEYVCSHCSITTYKKKPHRVANG 761
>gi|449448140|ref|XP_004141824.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
[Cucumis sativus]
Length = 781
Score = 830 bits (2144), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/645 (64%), Positives = 487/645 (75%), Gaps = 38/645 (5%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW+ +F+CYAACHF A LSV+QSS +HTWDAFQLAHASFR+YC+ NN V+ S
Sbjct: 132 KGIPYVIYWRSAFTCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRMYCLGNNFVLPS 191
Query: 62 NSQKGSSK-LGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
+S K S+ LGPHLLG+ KI++ E +V +E S ++ I D+DV MRFLVCG P
Sbjct: 192 SSHKEVSEDLGPHLLGERLKINVEPLEKEVADDEESSSEGISVNILDNDVEMRFLVCGEP 251
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
+LD +L LEDGLNALL+IEIRGSKLH + SAPPPPLQAG S GVVTMRCDLSTCS
Sbjct: 252 GSLDAYVLEALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGTLSNGVVTMRCDLSTCSF 311
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNS-GDNRLPPSEPRKSASIAC 239
AHISLLVSGSAQ CF+DQL EN+IK E+I+ +LV L +S G L SEPRKS SIAC
Sbjct: 312 AHISLLVSGSAQACFDDQLFENYIKTEIIDRGELVQTLLDSEGSKHL--SEPRKSTSIAC 369
Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
GA+VFEVS+KV +WASQ+ RQLAPDVSYRSLV LGIASIQGLSVASFEKDDAERLLFFC+
Sbjct: 370 GATVFEVSLKVPSWASQIFRQLAPDVSYRSLVGLGIASIQGLSVASFEKDDAERLLFFCS 429
Query: 300 RQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNV------------- 346
R+ N + PSWL PAP RKR + +++ E + ++
Sbjct: 430 RKENDLFLSNLTDSTLPSWLKPPAP-RKRPKYIKDTSLGSHEIIEHLKVSPGSRIHGANM 488
Query: 347 -----------------RPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPV 389
R + AAMRPIPH +KM+ F G SE ++G +KA++P
Sbjct: 489 EIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFHGISETGGHNGSLLKASVPS 548
Query: 390 A-PLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVM 448
+ P KH + G V +K S+ +QII +NPLPLKKHGCGR+ IQ C EEEFL+D+M
Sbjct: 549 SNPTKHVTVGSASVFQQKVFPSASHYKQIIPMNPLPLKKHGCGRSHIQACFEEEFLKDLM 608
Query: 449 QFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSK 508
QFL LRGH+RL+P GGLAEFPDAILN KRLDL+NLY+EVVSRGGF VGNGINWKGQ+FSK
Sbjct: 609 QFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGINWKGQIFSK 668
Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
MRN+T+TNRMTGVGNTLKRHYETYLLEYELAH+DVDGECCLLCHSSAAGDWVNCGICGEW
Sbjct: 669 MRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWVNCGICGEW 728
Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
AHFGCDRRQGLGAFKDYAKTDGLEY+CP CSV N+KKK K +NG
Sbjct: 729 AHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKK--KVANG 771
>gi|449480653|ref|XP_004155958.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
[Cucumis sativus]
Length = 781
Score = 830 bits (2143), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/645 (64%), Positives = 487/645 (75%), Gaps = 38/645 (5%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW+ +F+CYAACHF A LSV+QSS +HTWDAFQLAHASFR+YC+ NN V+ S
Sbjct: 132 KGIPYVIYWRSAFTCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRMYCLGNNFVLPS 191
Query: 62 NSQKGSSK-LGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
+S K S+ LGPHLLG+ KI++ E +V +E S ++ I D+DV MRFLVCG P
Sbjct: 192 SSHKEVSEDLGPHLLGEHLKINVEPLEKEVADDEESSSEGISVNILDNDVEMRFLVCGEP 251
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
+LD +L LEDGLNALL+IEIRGSKLH + SAPPPPLQAG S GVVTMRCDLSTCS
Sbjct: 252 GSLDAYVLEALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGTLSNGVVTMRCDLSTCSF 311
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNS-GDNRLPPSEPRKSASIAC 239
AHISLLVSGSAQ CF+DQL EN+IK E+I+ +LV L +S G L SEPRKS SIAC
Sbjct: 312 AHISLLVSGSAQACFDDQLFENYIKTEIIDRGELVQTLLDSEGSKHL--SEPRKSTSIAC 369
Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
GA+VFEVS+KV +WASQ+ RQLAPDVSYRSLV LGIASIQGLSVASFEKDDAERLLFFC+
Sbjct: 370 GATVFEVSLKVPSWASQIFRQLAPDVSYRSLVGLGIASIQGLSVASFEKDDAERLLFFCS 429
Query: 300 RQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNV------------- 346
R+ N + PSWL PAP RKR + +++ E + ++
Sbjct: 430 RKENDLFLSNLTDSTLPSWLKPPAP-RKRPKYIKDTSLGSHEIIEHLKVSPGSRIHGANM 488
Query: 347 -----------------RPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPV 389
R + AAMRPIPH +KM+ F G SE ++G +KA++P
Sbjct: 489 EIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMVSFHGISETGGHNGSLLKASVPS 548
Query: 390 A-PLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVM 448
+ P KH + G V +K S+ +QII +NPLPLKKHGCGR+ IQ C EEEFL+D+M
Sbjct: 549 SNPTKHVTVGSASVFQQKVFPSASHYKQIIPMNPLPLKKHGCGRSHIQACFEEEFLKDLM 608
Query: 449 QFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSK 508
QFL LRGH+RL+P GGLAEFPDAILN KRLDL+NLY+EVVSRGGF VGNGINWKGQ+FSK
Sbjct: 609 QFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGINWKGQIFSK 668
Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
MRN+T+TNRMTGVGNTLKRHYETYLLEYELAH+DVDGECCLLCHSSAAGDWVNCGICGEW
Sbjct: 669 MRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWVNCGICGEW 728
Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 613
AHFGCDRRQGLGAFKDYAKTDGLEY+CP CSV N+KKK K +NG
Sbjct: 729 AHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKK--KVANG 771
>gi|222641636|gb|EEE69768.1| hypothetical protein OsJ_29479 [Oryza sativa Japonica Group]
Length = 765
Score = 829 bits (2142), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/630 (66%), Positives = 484/630 (76%), Gaps = 27/630 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V
Sbjct: 136 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 193
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PKI+I E ++ EE S + PAIKIYDDD+ M+FL+CGVP
Sbjct: 194 -----SVKLGPRLLGDAPKINITPPENEMVEEEGSSDVFPAIKIYDDDINMKFLLCGVPS 248
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
T D LLG LEDGLNALLNIEIRG KL NR SA PPPL A + RG+VTMRCD++TCSS+
Sbjct: 249 TPDPCLLGSLEDGLNALLNIEIRGCKLQNRISASPPPLHAASLPRGMVTMRCDITTCSSS 308
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+SLLVSGSAQTCF+DQLLE+HIK+E+IE SQLVHALPN+ D++L S P S S ACGA
Sbjct: 309 HVSLLVSGSAQTCFDDQLLESHIKDEIIEKSQLVHALPNN-DDKLSSSVPFTSMSTACGA 367
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LAPD+SYRSLV LGI I G VASF++ DA+RLLFFCT Q
Sbjct: 368 STFEVWMTLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFDRRDADRLLFFCTNQ 427
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN-------VCNVR 347
K EN P W S + +++R + ESK G+ + + +
Sbjct: 428 HKDLAIENGPYFHLPRW--SASLTKERVKVGLESKPNLLGANGIPEDKKRLIEGPSSSSK 485
Query: 348 PKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRK 406
KL A MRPIPH+R +M PF GF E ++ QVK NLP AP +KH+S P THRK
Sbjct: 486 AKLKPATMRPIPHSRKQQMHPFMGFLEATVHETSQVKPNLPAAPPVKHNSVPAAPATHRK 545
Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
S S AQ II LNPLPLKKHGC R PIQ+CSEE+FL+DVMQFLI RGH RLVP GGLA
Sbjct: 546 STSGPSHAQSIIQLNPLPLKKHGCDRLPIQMCSEEDFLKDVMQFLIQRGHNRLVPHGGLA 605
Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
EFP+A+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTLK
Sbjct: 606 EFPEAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLK 665
Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
RHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYA
Sbjct: 666 RHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYA 725
Query: 587 KTDGLEYVCPQCSVTNFKKK--SQKTSNGY 614
KTDGLEY+CP CS+ N+KKK QK +NG+
Sbjct: 726 KTDGLEYICPHCSLANYKKKPPPQKVANGF 755
>gi|218202201|gb|EEC84628.1| hypothetical protein OsI_31490 [Oryza sativa Indica Group]
Length = 766
Score = 817 bits (2110), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/617 (67%), Positives = 474/617 (76%), Gaps = 25/617 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V
Sbjct: 136 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 193
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PKI+I E ++ EE S + PAIKIYDDD+ M+FL+CGVP
Sbjct: 194 -----SVKLGPRLLGDAPKINITPPENEMVEEEGSSDVFPAIKIYDDDINMKFLLCGVPS 248
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
T D LLG LEDGLNALLNIEIRG KL NR SA PPPL A + RG+VTMRCD++TCSS+
Sbjct: 249 TPDPCLLGSLEDGLNALLNIEIRGCKLQNRISASPPPLHAASLPRGMVTMRCDITTCSSS 308
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+SLLVSGSAQTCF+DQLLE+HIK+E+IE SQLVHALPN+ D++L P S S ACGA
Sbjct: 309 HVSLLVSGSAQTCFDDQLLESHIKDEIIEKSQLVHALPNN-DDKLSSIVPFTSMSTACGA 367
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LAPD+SYRSLV LGI I G VASF++ DA+RLLFFCT Q
Sbjct: 368 STFEVWMTLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFDRRDADRLLFFCTNQ 427
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN-------VCNVR 347
K EN P W S + +++R + ESK G+ + + +
Sbjct: 428 HKDLAIENGPYFHLPRW--SASLTKERVKVGLESKPKLLGANGIPEDKKRLIEGPSSSSK 485
Query: 348 PKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRK 406
KL A MRPIPH+R +M PF GF E ++ QVK NLP AP +KH+S P THRK
Sbjct: 486 AKLKPATMRPIPHSRKQQMHPFMGFLEATVHETSQVKPNLPAAPPVKHNSVPAAPTTHRK 545
Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
S S AQ II LNPLPLKKHGC R PIQ+CSEE+FL+DVMQFLI RGH RLVP GGLA
Sbjct: 546 STSGPSHAQSIIQLNPLPLKKHGCDRLPIQMCSEEDFLKDVMQFLIQRGHNRLVPHGGLA 605
Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
EFP+A+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTLK
Sbjct: 606 EFPEAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLK 665
Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
RHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYA
Sbjct: 666 RHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYA 725
Query: 587 KTDGLEYVCPQCSVTNF 603
KTDGLEY+CP CS+ N+
Sbjct: 726 KTDGLEYICPHCSLANY 742
>gi|326493346|dbj|BAJ85134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 771
Score = 816 bits (2109), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/632 (65%), Positives = 481/632 (76%), Gaps = 27/632 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF AL SVVQSS SHTWD+FQLAHASFRLYCVRNN V
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRNALFSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ-- 194
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
S KLGP LLGD PK+++ + E ++ + E +S E PAIKIYDDD+ M+FL+CGVP
Sbjct: 195 -----SVKLGPRLLGDAPKVNVVIPENEMAEEEGSSSEVSPAIKIYDDDINMKFLLCGVP 249
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
LD+ LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A + RG+VTMRCD++TCSS
Sbjct: 250 SALDSCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASVPRGMVTMRCDMTTCSS 309
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
+H+SLLVSGSAQTCF+DQLLE+HIKNELIE SQLV ALPNS D +L +EP S S ACG
Sbjct: 310 SHVSLLVSGSAQTCFDDQLLESHIKNELIEKSQLVRALPNSED-KLSSTEPFTSMSTACG 368
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
AS FEV M + WA+QVL+ LAP++SYRSLV LG+ I VASF++ DA+RLLFFCT
Sbjct: 369 ASTFEVWMTLPKWAAQVLKHLAPEISYRSLVALGVGCINATPVASFDRQDADRLLFFCTG 428
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRS-----------EPCRESKGVESENVCNVRP- 348
Q K E+ P W S A R ++ E + E ++ P
Sbjct: 429 QQKDLAGESGPYFHLPRWSASLAKDRAKTGSESKSNLLGVNGTLEDRKAPVEGPSSLTPF 488
Query: 349 --KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHR 405
KL A MRPIPH+R +M PF GF E ++ Q K NLP AP +KHSSA + HR
Sbjct: 489 KGKLKPATMRPIPHSRQQQMHPFMGFPEANVHETSQAKPNLPSAPPVKHSSAPASTAAHR 548
Query: 406 KSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGL 465
KS S AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GGL
Sbjct: 549 KSTSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHHRLVPHGGL 608
Query: 466 AEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTL 525
AEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTL
Sbjct: 609 AEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTL 668
Query: 526 KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDY 585
KRHYETYLLEYEL+HDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDY
Sbjct: 669 KRHYETYLLEYELSHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDY 728
Query: 586 AKTDGLEYVCPQCSVTNFKKK---SQKTSNGY 614
AKTDGLEY+CP CS+ N+KKK QK +NG+
Sbjct: 729 AKTDGLEYICPHCSIANYKKKPLPPQKVTNGF 760
>gi|357158540|ref|XP_003578160.1| PREDICTED: uncharacterized protein LOC100824739 [Brachypodium
distachyon]
Length = 769
Score = 813 bits (2101), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/631 (65%), Positives = 480/631 (76%), Gaps = 26/631 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF ALLSVVQSS SHTWD+FQLAHASFRLYCVRNN V
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRHALLSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ-- 194
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PKI+I+ E ++ EE S + PAIKIYDDD+ M+FL+CGVP
Sbjct: 195 -----SVKLGPRLLGDAPKINISAPENEMAEEEGSSDVSPAIKIYDDDIKMQFLLCGVPS 249
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A + RG+VTMRCD++TCSS+
Sbjct: 250 TLDPCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASLPRGMVTMRCDITTCSSS 309
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+SLLVSGSAQTCF+DQLLE+HIKNELIE +QLV ALPNS D +L +EP S S+ACGA
Sbjct: 310 HVSLLVSGSAQTCFDDQLLESHIKNELIEKNQLVRALPNSMD-KLSSTEPLTSMSVACGA 368
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LA ++SYRSLV LG+ I VASF+++DA+R+LFFCT Q
Sbjct: 369 STFEVWMTLPKWAAQVLKHLAREISYRSLVALGVGCINATPVASFDREDADRVLFFCTGQ 428
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR----SEP-------CRESKGVESENVCNVRP-- 348
K ++ P W S R + S+P E K + E ++ P
Sbjct: 429 QKDLVADSGPYFHLPRWSASLTKDRVKRGSESKPNVFGANGTSEDKELPIEGPSSLTPLK 488
Query: 349 -KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRK 406
K A MRPIPH+R +M PF GF E ++ VK NLP AP +KH+ HRK
Sbjct: 489 AKFKPATMRPIPHSRQQQMHPFMGFPEATIHENSHVKPNLPAAPPVKHNLIPAATAAHRK 548
Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
S S AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GGLA
Sbjct: 549 STSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHNRLVPHGGLA 608
Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
EFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTLK
Sbjct: 609 EFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLK 668
Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
RHYETYLLEYEL+HDDVDGECCLLC+SSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYA
Sbjct: 669 RHYETYLLEYELSHDDVDGECCLLCNSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYA 728
Query: 587 KTDGLEYVCPQCSVTNFKKK---SQKTSNGY 614
KTDGLEY+CP CS+ N+KKK QK +NG+
Sbjct: 729 KTDGLEYICPHCSLANYKKKPQQPQKVANGF 759
>gi|414589557|tpg|DAA40128.1| TPA: hypothetical protein ZEAMMB73_432104 [Zea mays]
Length = 768
Score = 810 bits (2092), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/632 (65%), Positives = 477/632 (75%), Gaps = 29/632 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 194
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PKI+I+ ++ EE S E PAIKIYD+++ M+ L+CGV C
Sbjct: 195 -----SVKLGPRLLGDSPKINISPPGTEMVDEEGSSEVTPAIKIYDEEINMKLLLCGVSC 249
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A + RG+VTMRCD++TCSS+
Sbjct: 250 TLDPCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASLPRGMVTMRCDITTCSSS 309
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+SLLVSGSAQTCF+DQLLE+HIKNE+IE SQLV AL NS D +LP SEP S S ACGA
Sbjct: 310 HVSLLVSGSAQTCFDDQLLESHIKNEIIEKSQLVRALSNSED-KLPSSEPLTSMSTACGA 368
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LAPD+SYRSLV LGI I G VASFE+ DA+RLLFFCT Q
Sbjct: 369 STFEVWMSLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFERRDADRLLFFCTSQ 428
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESE---------NVCN 345
K EN P W S + ++ R++ ESK GV + +
Sbjct: 429 CKDLANENGPYFHLPRW--SASLTKDRTKLGSESKQNLLGANGVLEDKKHMMEGPSSFSA 486
Query: 346 VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTH 404
V+ KL A MRPIPH++ +M PF E ++ VK +L VAP +KH+S P TH
Sbjct: 487 VKAKLKPATMRPIPHSQKQQMHPFMCLPETSFHETSIVKPSLTVAPTVKHNSVSYAPTTH 546
Query: 405 RKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGG 464
RKS S +I LNPLP+KKHGC R IQVCSEE+FL+DVMQFLI RGH RLVP GG
Sbjct: 547 RKSTSGPSHTPSVIQLNPLPMKKHGCDRLSIQVCSEEDFLKDVMQFLIQRGHNRLVPHGG 606
Query: 465 LAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNT 524
LAEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNT
Sbjct: 607 LAEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRMTGVGNT 666
Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKD 584
LKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKD
Sbjct: 667 LKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKD 726
Query: 585 YAKTDGLEYVCPQCSVTNFKKK--SQKTSNGY 614
YAKTDGLEY+CP CS+ N+KK K +NG+
Sbjct: 727 YAKTDGLEYICPHCSLANYKKPPLPPKVANGF 758
>gi|326505012|dbj|BAK02893.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 641
Score = 809 bits (2089), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/619 (66%), Positives = 472/619 (76%), Gaps = 24/619 (3%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
QG+PYVIYW++SFS YAA HF AL SVVQSS SHTWD+FQLAHASFRLYCVRNN V
Sbjct: 6 FQGIPYVIYWRNSFSSYAASHFRNALFSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ- 64
Query: 61 SNSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGV 119
S KLGP LLGD PK+++ + E ++ + E +S E PAIKIYDDD+ M+FL+CGV
Sbjct: 65 ------SVKLGPRLLGDAPKVNVVIPENEMAEEEGSSSEVSPAIKIYDDDINMKFLLCGV 118
Query: 120 PCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCS 179
P LD+ LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A + RG+VTMRCD++TCS
Sbjct: 119 PSALDSCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASVPRGMVTMRCDMTTCS 178
Query: 180 SAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIAC 239
S+H+SLLVSGSAQTCF+DQLLE+HIKNELIE SQLV ALPNS D +L +EP S S AC
Sbjct: 179 SSHVSLLVSGSAQTCFDDQLLESHIKNELIEKSQLVRALPNSED-KLSSTEPFTSMSTAC 237
Query: 240 GASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCT 299
GAS FEV M + WA+QVL+ LAP++SYRSLV LG+ I VASF++ DA+RLLFFCT
Sbjct: 238 GASTFEVWMTLPKWAAQVLKHLAPEISYRSLVALGVGCINATPVASFDRQDADRLLFFCT 297
Query: 300 RQGKADHTENSVLTRPPSWLTSPAPSRKRS-----------EPCRESKGVESENVCNVRP 348
Q K E+ P W S A R ++ E + E ++ P
Sbjct: 298 GQQKDLAGESGPYFHLPRWSASLAKDRAKTGSESKSNLLGVNGTLEDRKAPVEGPSSLTP 357
Query: 349 ---KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTH 404
KL A MRPIPH+R +M PF GF E ++ Q K NLP AP +KHSSA + H
Sbjct: 358 FKGKLKPATMRPIPHSRQQQMHPFMGFPEANVHETSQAKPNLPSAPPVKHSSAPASTAAH 417
Query: 405 RKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGG 464
RKS S AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GG
Sbjct: 418 RKSTSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHHRLVPHGG 477
Query: 465 LAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNT 524
LAEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNT
Sbjct: 478 LAEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNT 537
Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKD 584
LKRHYETYLLEYEL+HDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKD
Sbjct: 538 LKRHYETYLLEYELSHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKD 597
Query: 585 YAKTDGLEYVCPQCSVTNF 603
YAKTDGLEY+CP CS+ N+
Sbjct: 598 YAKTDGLEYICPHCSIANY 616
>gi|326502772|dbj|BAJ99014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 771
Score = 806 bits (2082), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/618 (66%), Positives = 472/618 (76%), Gaps = 24/618 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF AL SVVQSS SHTWD+FQLAHASFRLYCVRNN V
Sbjct: 137 KGIPYVIYWRNSFSSYAASHFRNALFSVVQSSVSHTWDSFQLAHASFRLYCVRNNHVQ-- 194
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
S KLGP LLGD PK+++ + E ++ + E +S E PAIKIYDDD+ M+FL+CGVP
Sbjct: 195 -----SVKLGPRLLGDAPKVNVVIPENEMAEEEGSSSEVSPAIKIYDDDINMKFLLCGVP 249
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
LD+ LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A + RG+VTMRCD++TCSS
Sbjct: 250 SALDSCLLGSLEDGLNALLNIEIRGSKLQNRISASPPPLEAASVPRGMVTMRCDMTTCSS 309
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
+H+SLLVSGSAQTCF+DQLLE+HIKNELIE SQLV ALPNS D +L +EP S S ACG
Sbjct: 310 SHVSLLVSGSAQTCFDDQLLESHIKNELIEKSQLVRALPNSED-KLSSTEPFTSMSTACG 368
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
AS FEV M + WA+QVL+ LAP++SYRSLV LG+ I VASF++ DA+RLLFFCT
Sbjct: 369 ASTFEVWMTLPKWAAQVLKHLAPEISYRSLVALGVGCINATPVASFDRQDADRLLFFCTG 428
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRS-----------EPCRESKGVESENVCNVRP- 348
Q K E+ P W S A R ++ E + E ++ P
Sbjct: 429 QQKDLAGESGPYFHLPRWSASLAKDRAKTGSESKSNLLGVNGTLEDRKAPVEGPSSLTPF 488
Query: 349 --KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHR 405
KL A MRPIPH+R +M PF GF E ++ Q K NLP AP +KHSSA + HR
Sbjct: 489 KGKLKPATMRPIPHSRQQQMHPFMGFPEANVHETSQAKPNLPSAPPVKHSSAPASTAAHR 548
Query: 406 KSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGL 465
KS S AQ II LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GGL
Sbjct: 549 KSTSGPSHAQSIIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHHRLVPHGGL 608
Query: 466 AEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTL 525
AEFPDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT+TNRMTGVGNTL
Sbjct: 609 AEFPDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTVTNRMTGVGNTL 668
Query: 526 KRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDY 585
KRHYETYLLEYEL+HDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDY
Sbjct: 669 KRHYETYLLEYELSHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDY 728
Query: 586 AKTDGLEYVCPQCSVTNF 603
AKTDGLEY+CP CS+ N+
Sbjct: 729 AKTDGLEYICPHCSIANY 746
>gi|218201239|gb|EEC83666.1| hypothetical protein OsI_29439 [Oryza sativa Indica Group]
Length = 761
Score = 787 bits (2032), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/628 (61%), Positives = 465/628 (74%), Gaps = 25/628 (3%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
++G+PYV+YW++SFS YAA HF QAL+SVVQSSCSHTWDAFQLA ASFRLYC RNN
Sbjct: 136 LKGIPYVLYWRNSFSSYAASHFRQALISVVQSSCSHTWDAFQLAQASFRLYCARNN---- 191
Query: 61 SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
S KLGP LLGD PKI+I L E ++ EE S E+ PAIKIYD+DV M+ L+CG P
Sbjct: 192 ---DAQSVKLGPRLLGDAPKINIFLPENEMVEEEGSSEHFPAIKIYDEDVNMKLLICGAP 248
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
C LD SLLG LEDGLNALLNIEIRG +L NR SA PPPL A GVVTMRCD++TCSS
Sbjct: 249 CILDASLLGSLEDGLNALLNIEIRGCRLQNRVSAAPPPLHAETLPHGVVTMRCDITTCSS 308
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
+H+SLLVSGS QTCF+D+LLENHIK E++E QLV A+ D++ EP S S+A G
Sbjct: 309 SHVSLLVSGSPQTCFDDKLLENHIKKEIVEKGQLVRAVLVRADDKPSSVEPLTSISVASG 368
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
AS FEV M + WA QVL+ LAP++SY+SLV LGIA + G V+SF++ D +RLLFFC
Sbjct: 369 ASTFEVWMTLPKWAGQVLKYLAPEISYKSLVPLGIACVNGTPVSSFDRQDVDRLLFFC-- 426
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------VRP 348
K + N + + P W S R + P +S + V V+P
Sbjct: 427 --KNEAIVNGLYSHLPRWSASLVKDRLKGTPESKSSTFSANGVGEYQKHPMKGTSLLVKP 484
Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSL 408
KL SA MRPIPH+ +M PF G +D QVK +LP P++H++ P T RK
Sbjct: 485 KLKSAKMRPIPHSSKRQMHPFVGIPPSFIHDASQVKPSLPAPPVRHNALPVAPTTQRKLS 544
Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
S + + + + LNPLP+KKHGC R PI +CSEE+FL+DVMQFL+ RGHTRLVPQGGLAEF
Sbjct: 545 SGTSRVEPAVPLNPLPMKKHGCDRLPIGICSEEDFLKDVMQFLLQRGHTRLVPQGGLAEF 604
Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
PDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKRH
Sbjct: 605 PDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMSNHTVTNKMTGVGNTLKRH 664
Query: 529 YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
YETYLLEYEL+HDDV GECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYAKT
Sbjct: 665 YETYLLEYELSHDDVGGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYAKT 724
Query: 589 DGLEYVCPQCSVTNFKKK--SQKTSNGY 614
DGLEY+CP CS+ N+KKK +++NG+
Sbjct: 725 DGLEYICPHCSLANYKKKPPPPESANGF 752
>gi|115476700|ref|NP_001061946.1| Os08g0451400 [Oryza sativa Japonica Group]
gi|113623915|dbj|BAF23860.1| Os08g0451400 [Oryza sativa Japonica Group]
gi|222640662|gb|EEE68794.1| hypothetical protein OsJ_27533 [Oryza sativa Japonica Group]
Length = 761
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/628 (61%), Positives = 465/628 (74%), Gaps = 25/628 (3%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
++G+PYV+YW++SFS YAA HF QAL+SVVQSSCSHTWDAFQLA ASFRLYC RNN
Sbjct: 136 LKGIPYVLYWRNSFSSYAASHFRQALISVVQSSCSHTWDAFQLAQASFRLYCARNN---- 191
Query: 61 SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
S KLGP LLGD PKI+I L E ++ EE S E+ PAIKIYD+DV M+ L+CG P
Sbjct: 192 ---DAQSVKLGPRLLGDAPKINIFLPENEMVEEEGSSEHFPAIKIYDEDVNMKLLICGAP 248
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
C LD SLLG LEDGLNALLNIEIRG +L NR SA PPPL A GVVTMRCD++TCSS
Sbjct: 249 CILDASLLGSLEDGLNALLNIEIRGCRLQNRVSAAPPPLHAETLPHGVVTMRCDITTCSS 308
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
+H+SLLVSGS QTCF+D+LLENHIK E++E QLV A+ D++ EP S S+A G
Sbjct: 309 SHVSLLVSGSPQTCFDDKLLENHIKKEIVEKGQLVRAVLVREDDKPSSVEPLTSISVASG 368
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
AS FEV M + WA QVL+ LAP++SY+SLV LGIA + G V+SF++ D +RLLFFC
Sbjct: 369 ASTFEVWMTLPKWAGQVLKYLAPEISYKSLVPLGIACVNGTPVSSFDRQDVDRLLFFC-- 426
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------VRP 348
K + N + + P W S R + P +S + V V+P
Sbjct: 427 --KNEAIVNGLYSHLPRWSASLVKDRLKGTPESKSSTFSANGVGEYQKHPMKGTSLVVKP 484
Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSL 408
KL SA MRPIPH+ +M PF G +D QVK +LP P++H++ P T RK
Sbjct: 485 KLKSAKMRPIPHSSKRQMHPFVGIPPSFIHDASQVKPSLPAPPVRHNALPVAPTTQRKLS 544
Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
S + + + + LNPLP+KKHGC R PI +CSEE+FL+DVMQFL+ RGHTRLVPQGGLAEF
Sbjct: 545 SGTSRVEPAVPLNPLPMKKHGCDRLPIGICSEEDFLKDVMQFLLQRGHTRLVPQGGLAEF 604
Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
PDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKRH
Sbjct: 605 PDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMSNHTVTNKMTGVGNTLKRH 664
Query: 529 YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
YETYLLEYEL+HDDV GECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYAKT
Sbjct: 665 YETYLLEYELSHDDVGGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYAKT 724
Query: 589 DGLEYVCPQCSVTNFKKK--SQKTSNGY 614
DGLEY+CP CS+ N+KKK +++NG+
Sbjct: 725 DGLEYICPHCSLANYKKKPPPPESANGF 752
>gi|42407626|dbj|BAD08741.1| fiber protein-like [Oryza sativa Japonica Group]
Length = 683
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/628 (61%), Positives = 465/628 (74%), Gaps = 25/628 (3%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMA 60
++G+PYV+YW++SFS YAA HF QAL+SVVQSSCSHTWDAFQLA ASFRLYC RNN
Sbjct: 58 LKGIPYVLYWRNSFSSYAASHFRQALISVVQSSCSHTWDAFQLAQASFRLYCARNN---- 113
Query: 61 SNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
S KLGP LLGD PKI+I L E ++ EE S E+ PAIKIYD+DV M+ L+CG P
Sbjct: 114 ---DAQSVKLGPRLLGDAPKINIFLPENEMVEEEGSSEHFPAIKIYDEDVNMKLLICGAP 170
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
C LD SLLG LEDGLNALLNIEIRG +L NR SA PPPL A GVVTMRCD++TCSS
Sbjct: 171 CILDASLLGSLEDGLNALLNIEIRGCRLQNRVSAAPPPLHAETLPHGVVTMRCDITTCSS 230
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
+H+SLLVSGS QTCF+D+LLENHIK E++E QLV A+ D++ EP S S+A G
Sbjct: 231 SHVSLLVSGSPQTCFDDKLLENHIKKEIVEKGQLVRAVLVREDDKPSSVEPLTSISVASG 290
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
AS FEV M + WA QVL+ LAP++SY+SLV LGIA + G V+SF++ D +RLLFFC
Sbjct: 291 ASTFEVWMTLPKWAGQVLKYLAPEISYKSLVPLGIACVNGTPVSSFDRQDVDRLLFFC-- 348
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCN------------VRP 348
K + N + + P W S R + P +S + V V+P
Sbjct: 349 --KNEAIVNGLYSHLPRWSASLVKDRLKGTPESKSSTFSANGVGEYQKHPMKGTSLVVKP 406
Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSL 408
KL SA MRPIPH+ +M PF G +D QVK +LP P++H++ P T RK
Sbjct: 407 KLKSAKMRPIPHSSKRQMHPFVGIPPSFIHDASQVKPSLPAPPVRHNALPVAPTTQRKLS 466
Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
S + + + + LNPLP+KKHGC R PI +CSEE+FL+DVMQFL+ RGHTRLVPQGGLAEF
Sbjct: 467 SGTSRVEPAVPLNPLPMKKHGCDRLPIGICSEEDFLKDVMQFLLQRGHTRLVPQGGLAEF 526
Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
PDA+LNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKRH
Sbjct: 527 PDAVLNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMSNHTVTNKMTGVGNTLKRH 586
Query: 529 YETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
YETYLLEYEL+HDDV GECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKDYAKT
Sbjct: 587 YETYLLEYELSHDDVGGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKDYAKT 646
Query: 589 DGLEYVCPQCSVTNFKKK--SQKTSNGY 614
DGLEY+CP CS+ N+KKK +++NG+
Sbjct: 647 DGLEYICPHCSLANYKKKPPPPESANGF 674
>gi|357147984|ref|XP_003574575.1| PREDICTED: uncharacterized protein LOC100844696 [Brachypodium
distachyon]
Length = 766
Score = 785 bits (2026), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/619 (63%), Positives = 466/619 (75%), Gaps = 22/619 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PY+IYW++SFS YAA HF L SV+QSSCSH WDAFQLA ASFRLYCVRNN V
Sbjct: 134 KGIPYLIYWRNSFSSYAATHFRHVLTSVIQSSCSHIWDAFQLARASFRLYCVRNNHVR-- 191
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S +LGPHLLGD PKI+IA + + E S E AIKI+D DV M+FL+CGVPC
Sbjct: 192 -----SVELGPHLLGDAPKINIAPPDNGMADGEGSSEAFSAIKIHDKDVNMKFLICGVPC 246
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A GVVTMR D++TCSS+
Sbjct: 247 TLDACLLGSLEDGLNALLNIEIRGSKLRNRVSAAPPPLEAETLPCGVVTMRSDITTCSSS 306
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
++SLLVSGSAQTCF+D++LE+HIKNE+IE SQLV +LPNS DN+ +EP S SIACGA
Sbjct: 307 YMSLLVSGSAQTCFDDKILESHIKNEIIEKSQLVRSLPNSEDNKPSSAEPLTSMSIACGA 366
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S+FEV M + WA+++L+QLAPD+SY+SLV LGIAS+ V+SF + DA+RLLFFCT Q
Sbjct: 367 SIFEVWMTLPKWAAEILKQLAPDISYKSLVGLGIASVNSTPVSSFNRQDADRLLFFCTSQ 426
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRS-----EP---CRESKGVESENVCN------VR 347
K E P W S R+ EP C G + +++ + V+
Sbjct: 427 YKDRPIEVGPYLHLPRWSASLTKERRMKGSLEVEPNVSCANGVGDDKKHLMDGSSLPSVK 486
Query: 348 PKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPL-KHSSAGPTPVTHRK 406
+L A MRPIP + + PF+GF++ Q K++ P P KH+S T RK
Sbjct: 487 TRLKPATMRPIPRSSKQHLHPFTGFTQSIINGAYQTKSSFPARPPGKHNSVPAASATRRK 546
Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
S S S ++QQ + LNPLP+KKHGC R PIQ C EE+FL+D+MQFL+ RGHTRLVPQGGLA
Sbjct: 547 SSSGSSRSQQALLLNPLPMKKHGCDRLPIQTCHEEDFLKDLMQFLLQRGHTRLVPQGGLA 606
Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
EFPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLK
Sbjct: 607 EFPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMCNHTVTNKMTGVGNTLK 666
Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
RHYETYLLEY+L HDDVDGECCLLCHS+A GDWVNCG+CG+WAHFGCD+RQGLGAFKDY+
Sbjct: 667 RHYETYLLEYQLCHDDVDGECCLLCHSTAPGDWVNCGLCGDWAHFGCDKRQGLGAFKDYS 726
Query: 587 KTDGLEYVCPQCSVTNFKK 605
KTDGLEY+CP CS TN+KK
Sbjct: 727 KTDGLEYICPHCSGTNYKK 745
>gi|357141526|ref|XP_003572256.1| PREDICTED: uncharacterized protein LOC100821621 [Brachypodium
distachyon]
Length = 735
Score = 784 bits (2025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/618 (63%), Positives = 465/618 (75%), Gaps = 22/618 (3%)
Query: 3 GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
G+PY+IYW++SFS YAA HF L SV+QSSCSH WDAFQLA ASFRLYCVRNN V
Sbjct: 104 GIPYLIYWRNSFSSYAATHFRHVLTSVIQSSCSHIWDAFQLARASFRLYCVRNNHVR--- 160
Query: 63 SQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCT 122
S +LGPHLLGD PKI+IA + + E S E AIKI+D DV M+FL+CGVPCT
Sbjct: 161 ----SVELGPHLLGDAPKINIAPPDNGMADGEGSSEAFSAIKIHDKDVNMKFLICGVPCT 216
Query: 123 LDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAH 182
LD LLG LEDGLNALLNIEIRGSKL NR SA PPPL+A GVVTMR D++TCSS++
Sbjct: 217 LDACLLGSLEDGLNALLNIEIRGSKLRNRVSAAPPPLEAETLPCGVVTMRSDITTCSSSY 276
Query: 183 ISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGAS 242
+SLLVSGSAQTCF+D++LE+HIKNE+IE SQLV +LPNS DN+ +EP S SIACGAS
Sbjct: 277 MSLLVSGSAQTCFDDKILESHIKNEIIEKSQLVRSLPNSEDNKPSSAEPLTSMSIACGAS 336
Query: 243 VFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQG 302
+FEV M + WA+++L+QLAPD+SY+SLV LGIAS+ V+SF + DA+RLLFFCT Q
Sbjct: 337 IFEVWMTLPKWAAEILKQLAPDISYKSLVGLGIASVNSTPVSSFNRQDADRLLFFCTSQY 396
Query: 303 KADHTENSVLTRPPSWLTSPAPSRKRS-----EP---CRESKGVESENVCN------VRP 348
K E P W S R+ EP C G + +++ + V+
Sbjct: 397 KDRPIEVGPYLHLPRWSASLTKERRMKGSLEVEPNVSCANGVGDDKKHLMDGSSLPSVKT 456
Query: 349 KLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPL-KHSSAGPTPVTHRKS 407
+L A MRPIP + + PF+GF++ Q K++ P P KH+S T RKS
Sbjct: 457 RLKPATMRPIPRSSKQHLHPFTGFTQSIINGAYQTKSSFPARPPGKHNSVPAASATRRKS 516
Query: 408 LSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAE 467
S S ++QQ + LNPLP+KKHGC R PIQ C EE+FL+D+MQFL+ RGHTRLVPQGGLAE
Sbjct: 517 SSGSSRSQQALLLNPLPMKKHGCDRLPIQTCHEEDFLKDLMQFLLQRGHTRLVPQGGLAE 576
Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
FPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKM NHT+TN+MTGVGNTLKR
Sbjct: 577 FPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMCNHTVTNKMTGVGNTLKR 636
Query: 528 HYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAK 587
HYETYLLEY+L HDDVDGECCLLCHS+A GDWVNCG+CG+WAHFGCD+RQGLGAFKDY+K
Sbjct: 637 HYETYLLEYQLCHDDVDGECCLLCHSTAPGDWVNCGLCGDWAHFGCDKRQGLGAFKDYSK 696
Query: 588 TDGLEYVCPQCSVTNFKK 605
TDGLEY+CP CS TN+KK
Sbjct: 697 TDGLEYICPHCSGTNYKK 714
>gi|357450177|ref|XP_003595365.1| Fiber protein-like protein [Medicago truncatula]
gi|355484413|gb|AES65616.1| Fiber protein-like protein [Medicago truncatula]
Length = 844
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/486 (76%), Positives = 416/486 (85%), Gaps = 13/486 (2%)
Query: 142 EIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQ--- 198
+IRG KL NRTSAPPPPLQA FSRGVVTMRCD+ST SSAHISLLVSGSA CFNDQ
Sbjct: 359 KIRGCKLQNRTSAPPPPLQAATFSRGVVTMRCDISTSSSAHISLLVSGSADACFNDQARV 418
Query: 199 ---LLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWAS 255
LLENHIK EL ENSQLV A+P+ N+LP EPR+SAS+ACG+SVFEV M+V TWAS
Sbjct: 419 NCILLENHIKKELTENSQLVQAIPSHEQNKLPSYEPRRSASVACGSSVFEVCMRVPTWAS 478
Query: 256 QVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVLTRP 315
QVLRQLAP+VSYR LVMLG+ASIQGLSVASF KDDAERLLFFC RQ K + +++ V +
Sbjct: 479 QVLRQLAPNVSYRCLVMLGVASIQGLSVASFTKDDAERLLFFCNRQEKDNCSKDIVFSSH 538
Query: 316 PSWLTSPAPSRKRSEP--CRESK-----GVESENVCNVRPKLNSAAMRPIPHTRHYKMLP 368
PSWL PAPSRKRSEP C K GV+ E+ + R KL+ AAMRPIP + K+LP
Sbjct: 539 PSWLMPPAPSRKRSEPESCSRVKSINASGVKVEDNGSDRQKLSFAAMRPIPQSHRQKLLP 598
Query: 369 FSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKH 428
FSG+ E E+YDGD+ K+N P+ P+KH+ G VT+RKS+S+S+QA QIISLNPLP+KKH
Sbjct: 599 FSGYFEGEKYDGDRGKSNQPLVPIKHNGLGSRSVTNRKSVSNSFQAHQIISLNPLPMKKH 658
Query: 429 GCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVV 488
GC RAPI+VCSEEEFL+DVMQFL+LRGHTRL+PQGGLAEFPDA+LNAKRLDLFNLYREVV
Sbjct: 659 GCDRAPIRVCSEEEFLKDVMQFLVLRGHTRLIPQGGLAEFPDAVLNAKRLDLFNLYREVV 718
Query: 489 SRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECC 548
SRGGFHVGNGINWKGQVFSKM NHTL++RMTGVGNTLKRHYETYLLEYELAHDDVDGECC
Sbjct: 719 SRGGFHVGNGINWKGQVFSKMSNHTLSHRMTGVGNTLKRHYETYLLEYELAHDDVDGECC 778
Query: 549 LLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQ 608
LLCHSSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCP CS++NF KKSQ
Sbjct: 779 LLCHSSAAGDWVNCGMCGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSMSNFSKKSQ 838
Query: 609 KTSNGY 614
KT+NGY
Sbjct: 839 KTANGY 844
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 141/396 (35%), Positives = 189/396 (47%), Gaps = 53/396 (13%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GVPY IYWK+ FS AA HF QA SV QS+ SHTWDAFQLA +SFRLYCV+N ++
Sbjct: 139 KGVPYTIYWKNEFSKSAASHFHQAFFSVAQSTSSHTWDAFQLAQSSFRLYCVQNEVI-PH 197
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDV-QGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
NSQKGS K+GP +LG+PP I++ E D + EE+SPE +IKIYDDDV RFL+CG P
Sbjct: 198 NSQKGSDKVGPKILGEPPNIEVGPCEADTKEDEEDSPETSSSIKIYDDDVNTRFLLCGFP 257
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
CT+D LL LEDGLNALL E K + VV MR C
Sbjct: 258 CTVDACLLESLEDGLNALLCTECWRVKNQHENKI------------SVVEMRMLRWMCER 305
Query: 181 AHISLLV------SGSAQTCFNDQLLENHIKNELIENS---QLVH-ALPNSGD------- 223
H+ +V S +T D + +N + + + L+H A P D
Sbjct: 306 RHVDFVVWRVDLEKLSLETIKKDLEINELDRNMVYDRTLWRNLIHVANPTYWDKIRGCKL 365
Query: 224 -NRL-PPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGL 281
NR P P ++A+ + G V+M+ D+S S + + + G
Sbjct: 366 QNRTSAPPPPLQAATFSRGV----VTMRC-------------DISTSSSAHISLL-VSGS 407
Query: 282 SVASFEKDDAERLLFFCTRQGKADHTENSVLTRP-PSWLTSPAPSRKRSEPCRESKGVES 340
+ A F D A K + TENS L + PS + PS + + G
Sbjct: 408 ADACF-NDQARVNCILLENHIKKELTENSQLVQAIPSHEQNKLPSYEPRRSASVACGSSV 466
Query: 341 ENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIE 376
VC P S +R + Y+ L G + I+
Sbjct: 467 FEVCMRVPTWASQVLRQLAPNVSYRCLVMLGVASIQ 502
>gi|212274905|ref|NP_001130362.1| uncharacterized protein LOC100191457 [Zea mays]
gi|194688938|gb|ACF78553.1| unknown [Zea mays]
Length = 749
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/617 (61%), Positives = 458/617 (74%), Gaps = 24/617 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYV+YW++ FS Y A HF AL+SV+QSSCSHTWDAFQLAHASFRLYCVRNN V
Sbjct: 137 KGIPYVVYWRNLFSSYTASHFRHALMSVIQSSCSHTWDAFQLAHASFRLYCVRNNHVQ-- 194
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PK ++ + +V E+ E P I+IYD+DV+M+FL+CGVPC
Sbjct: 195 -----SVKLGPCLLGDAPKTNVIPAGDEVNEEQGCSEGFPDIRIYDEDVSMKFLLCGVPC 249
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LLG LEDGLNALLNIEIR SKL NR SA PP LQA GVVTMRCD++TCSSA
Sbjct: 250 TLDACLLGALEDGLNALLNIEIRASKLQNRASAHPP-LQAEDLPHGVVTMRCDITTCSSA 308
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+S LVS SAQTCF+DQLL + IKNE+IE QLV AL N+ DN+ EP S +ACGA
Sbjct: 309 HVSFLVSCSAQTCFDDQLLGSRIKNEIIEKRQLVRALLNNEDNKPSSYEPLPSMCVACGA 368
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV + + WA+QVL+ LAP++SYRSLV LGIA + G V+SF++ DA+RLLFFC+ Q
Sbjct: 369 STFEVWITMPKWAAQVLKHLAPEISYRSLVALGIAWVNGTPVSSFDRQDADRLLFFCSNQ 428
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN---VCNVRPKLN 351
+N SW S ++ R+ ESK GV +N + +++P+L
Sbjct: 429 CGDQAIQNGSYAHVSSW--SAPLTKDRTTGSLESKQMSFGAKGVVGDNTISLSSLKPRLK 486
Query: 352 SAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSS 411
A MRP+PH+R +M PF GF + +D QVK +LP P+KH++ T VT RKS S S
Sbjct: 487 PATMRPLPHSRKQQMHPFMGFPQSIIHDASQVKPSLPAPPVKHNAVPVTAVTPRKSSSGS 546
Query: 412 Y----QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAE 467
+ Q I LNPLP+KKH C R PI CSEE+FL+DVMQFL+ RGH RLVPQGGLAE
Sbjct: 547 PSSWSRVQLPIPLNPLPIKKHECNRLPIHSCSEEDFLKDVMQFLLQRGHIRLVPQGGLAE 606
Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
FPDA+LN+KRLDL+NLY+EVV RGGF+VGNGINWKGQ+FSKM NHT+TN+MTGVGNTLKR
Sbjct: 607 FPDAVLNSKRLDLYNLYKEVVYRGGFYVGNGINWKGQIFSKMHNHTVTNKMTGVGNTLKR 666
Query: 528 HYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAK 587
HYETYLLEYELAHDD+DGECCL+CHS A GDWVNCG+CGEWAH GCDRR+GL FKDY+K
Sbjct: 667 HYETYLLEYELAHDDIDGECCLICHSGALGDWVNCGLCGEWAHLGCDRRKGLSTFKDYSK 726
Query: 588 TDGLEYVCPQCSVTNFK 604
DG+EY+CPQCS+ +K
Sbjct: 727 ADGMEYICPQCSLAKYK 743
>gi|242081649|ref|XP_002445593.1| hypothetical protein SORBIDRAFT_07g022155 [Sorghum bicolor]
gi|241941943|gb|EES15088.1| hypothetical protein SORBIDRAFT_07g022155 [Sorghum bicolor]
Length = 729
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/612 (61%), Positives = 455/612 (74%), Gaps = 25/612 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYV+YW++SFS Y A HF AL+SV++SSCSHTWDAFQLAHASFRLYCVRNN V
Sbjct: 127 KGIPYVVYWRNSFSSYTASHFRHALMSVIESSCSHTWDAFQLAHASFRLYCVRNNYVQ-- 184
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PKI++ + +V EE E P I+IYD+DV M+FLVCG PC
Sbjct: 185 -----SVKLGPCLLGDAPKINVIPAGNEVNKEEGCSEGFPPIRIYDEDVNMKFLVCGAPC 239
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
TLD LLG LEDGL ALLNIE+ G + N+ S+ PPPLQA GVVTMRCD+STCSSA
Sbjct: 240 TLDACLLGALEDGLTALLNIEVCGHRCSNQLSSAPPPLQAENLPHGVVTMRCDISTCSSA 299
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+S LVS SAQTCF+D+LLE+ IKNE+IE QLV AL N+ DN+ P EP S +ACGA
Sbjct: 300 HVSFLVSASAQTCFDDKLLESRIKNEIIEKRQLVRALSNTEDNK-PSYEPLPSMCVACGA 358
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV + + WA+QVL+QLAP++ YRSLV LGIA + G V+SF++ DA+RLLFFC+ Q
Sbjct: 359 STFEVWITLPKWAAQVLKQLAPEICYRSLVALGIAWVNGTPVSSFDRQDADRLLFFCSNQ 418
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESEN---VCNVRPKLN 351
K +N SW S + ++ R+ ESK GV +N + +++P+L
Sbjct: 419 CKDKAIQNVSYAHLSSW--SASLTKDRTTGSIESKQMSFGAKGVGGDNKMSLSSLKPRLK 476
Query: 352 SAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSS 411
A MRPIPH+R +M PF GF++ +D VK +LP P+K ++ VT RKS S S
Sbjct: 477 PATMRPIPHSRKQQMHPFMGFAQ-SIHDASPVKPSLPAPPVKDNTVPVIKVTQRKSSSGS 535
Query: 412 Y----QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAE 467
+ Q I LNPLP+KKH C R PI +CSEE+FL+DVMQFL+ RGH RLVPQGG AE
Sbjct: 536 PSSSSRVQPSIPLNPLPMKKHECNRLPINICSEEDFLKDVMQFLLQRGHDRLVPQGGPAE 595
Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
FPDA+LN+KRLDL+NLY+EVV RGGFHVGNGINWKGQVFSKMRNHT+TN+MTGVGNTLKR
Sbjct: 596 FPDAVLNSKRLDLYNLYKEVVYRGGFHVGNGINWKGQVFSKMRNHTVTNKMTGVGNTLKR 655
Query: 528 HYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAK 587
HYETYLLEYELAHDD++ ECCL+CHSS GDWVNCG+CGEWAH GCDRRQGLG FKDY+K
Sbjct: 656 HYETYLLEYELAHDDIEVECCLICHSSDLGDWVNCGVCGEWAHLGCDRRQGLGNFKDYSK 715
Query: 588 TDGLEYVCPQCS 599
T GLEY+CP CS
Sbjct: 716 TGGLEYICPHCS 727
>gi|302825149|ref|XP_002994207.1| hypothetical protein SELMODRAFT_449331 [Selaginella moellendorffii]
gi|300137918|gb|EFJ04712.1| hypothetical protein SELMODRAFT_449331 [Selaginella moellendorffii]
Length = 733
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 322/624 (51%), Positives = 400/624 (64%), Gaps = 47/624 (7%)
Query: 3 GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
GV +V +W S + AA HF Q+L++ +++ WDAF+LA+AS ++ + S
Sbjct: 133 GVRHVAHWDGSVTSLAAAHFRQSLVACLRTPGCDPWDAFELANASLEIHYGQT-----SG 187
Query: 63 SQKGSSKLGPHLLGD-PPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
SQ P LLG+ PP +D L +M E+ P P I+IYD++ +R LVC C
Sbjct: 188 SQ-------PMLLGEGPPVLDDPLKDM-----EDDP---PLIQIYDEETEIRLLVCAEAC 232
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
D+S L +E L +L IE+RG +L +R SAPPPP A F+RGVVTMRCDL T SSA
Sbjct: 233 ASDSSSLQAVEVALTSLFAIEVRGMRLIHRISAPPPPSAASTFARGVVTMRCDLCTSSSA 292
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
ISL+VSGSAQTCF D LE+ I+ L+E SQ + L S DN P+E R+S SIACGA
Sbjct: 293 RISLVVSGSAQTCFADHFLESTIRKGLMEKSQALQ-LIMSEDNM--PAEIRRSTSIACGA 349
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
+V E KV WA+Q LRQL+ D SY++LV LGIA I+G VA+F+++DAERL
Sbjct: 350 AVVETRAKVPNWAAQTLRQLSTDTSYKTLVALGIAGIEGSPVAAFQQEDAERLGLLRNEP 409
Query: 302 GKADHTE--NSVLTRP--PSWLTSPAPSRKRSEPCRESKGVESENVCNVRPK-LNSAAMR 356
KA + N P P+WLT AP+RKR C S + N K + AAM+
Sbjct: 410 PKALGSPRCNGFSEAPIIPAWLTPAAPTRKRQNLCLSSINFNGDAPLNGDSKSVFLAAMK 469
Query: 357 PIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQ 416
PIPH K++PF+G G +K N G T T + S+ +
Sbjct: 470 PIPHATRRKLMPFAGVVSAGAQAGWSMKLN-----------GNTKSTRPEGTSAPSGHGR 518
Query: 417 IISLNPLPL---KKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL 473
P+ L KKH C R + C EEEFL DV+QFL+ RGH RL+P G+ FPD +L
Sbjct: 519 GGHTFPIVLPSVKKHHCSRPSMLECPEEEFLNDVVQFLVSRGHGRLIPPTGIEAFPDVVL 578
Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYL 533
N KRLDL+NLYREVVSRGGF VGNGINWKGQ+FSKMRNHT TNRMTGVGNTLK+HYETYL
Sbjct: 579 NGKRLDLYNLYREVVSRGGFRVGNGINWKGQIFSKMRNHTTTNRMTGVGNTLKKHYETYL 638
Query: 534 LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
LEYELAHDDVDGECC+LCHSSA GDWVNCGICGEWAHFGCDRR GL FK+YAKTDGLEY
Sbjct: 639 LEYELAHDDVDGECCILCHSSAEGDWVNCGICGEWAHFGCDRRTGLATFKEYAKTDGLEY 698
Query: 594 VCPQCSVTNFK----KKSQKTSNG 613
+CP+CSV + + +K Q+ S+G
Sbjct: 699 ICPRCSVGSARGVTSRKKQRPSSG 722
>gi|302823192|ref|XP_002993250.1| hypothetical protein SELMODRAFT_431374 [Selaginella moellendorffii]
gi|300138920|gb|EFJ05671.1| hypothetical protein SELMODRAFT_431374 [Selaginella moellendorffii]
Length = 753
Score = 566 bits (1459), Expect = e-158, Method: Compositional matrix adjust.
Identities = 321/624 (51%), Positives = 399/624 (63%), Gaps = 47/624 (7%)
Query: 3 GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
GV +V +W S + AA HF Q+L++ +++ WDAF+LA+AS ++ + S
Sbjct: 153 GVRHVAHWDGSVTSLAAAHFRQSLVACLRTPGCDPWDAFELANASLEIHYGQT-----SG 207
Query: 63 SQKGSSKLGPHLLGD-PPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
SQ P LLG+ PP +D L +M E+ P P I+IYD++ +R LVC C
Sbjct: 208 SQ-------PMLLGEGPPVLDDPLKDM-----EDDP---PLIQIYDEETEIRLLVCAEAC 252
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
D+S L +E L +L IE+RG +L +R SAPPPP A F+RGVVTMRCDL T SSA
Sbjct: 253 ASDSSSLQAVEVALTSLFAIEVRGMRLIHRISAPPPPSAASTFARGVVTMRCDLCTSSSA 312
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
ISL+VSGSAQTCF D LE+ I+ L+E SQ + L S DN P+E R+S SIACGA
Sbjct: 313 RISLVVSGSAQTCFADHFLESTIRKGLMEKSQALQ-LIMSEDNM--PAEIRRSTSIACGA 369
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
+V E KV WA+Q LRQL+ D SY++LV LGIA I+G VA+F+++DAERL
Sbjct: 370 AVVETRAKVPNWAAQTLRQLSTDTSYKTLVALGIAGIEGSPVAAFQQEDAERLGLLRNEP 429
Query: 302 GKADHTE--NSVLTRP--PSWLTSPAPSRKRSEPCRESKGVESENVCNVRPK-LNSAAMR 356
K + N P P+WLT AP+RKR C S + N K + AAM+
Sbjct: 430 PKPLGSPRCNGFSEAPIIPAWLTPAAPTRKRQNLCLSSINFNGDGPLNGDSKSVFLAAMK 489
Query: 357 PIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQ 416
PIPH K++PF+G G +K N G T T + S+ +
Sbjct: 490 PIPHATRRKLMPFAGVVSAGAQAGWSMKLN-----------GNTKSTRPEGTSAPSGHGR 538
Query: 417 IISLNPLPL---KKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL 473
P+ L KKH C R + C EEEFL DV+QFL+ RGH RL+P G+ FPD +L
Sbjct: 539 GGHTFPIVLPSVKKHHCSRPSMLECPEEEFLNDVVQFLVSRGHGRLIPPTGIEAFPDVVL 598
Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYL 533
N KRLDL+NLYREVVSRGGF VGNGINWKGQ+FSKMRNHT TNRMTGVGNTLK+HYETYL
Sbjct: 599 NGKRLDLYNLYREVVSRGGFRVGNGINWKGQIFSKMRNHTTTNRMTGVGNTLKKHYETYL 658
Query: 534 LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
LEYELAHDDVDGECC+LCHSSA GDWVNCGICGEWAHFGCDRR GL FK+YAKTDGLEY
Sbjct: 659 LEYELAHDDVDGECCILCHSSAEGDWVNCGICGEWAHFGCDRRTGLATFKEYAKTDGLEY 718
Query: 594 VCPQCSVTNFK----KKSQKTSNG 613
+CP+CSV + + +K Q+ S+G
Sbjct: 719 ICPRCSVGSARGVTSRKKQRPSSG 742
>gi|168019329|ref|XP_001762197.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
patens subsp. patens]
gi|162686601|gb|EDQ72989.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
patens subsp. patens]
Length = 645
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 294/624 (47%), Positives = 399/624 (63%), Gaps = 20/624 (3%)
Query: 4 VPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASNS 63
VP+VI WK S + A HF +ALL+ ++S + DAFQ+A ASF+++C + +
Sbjct: 1 VPHVICWKGSPTFSLATHFRRALLATLRSCATGAKDAFQIASASFQMHCGQTKSASVNGL 60
Query: 64 QKGSSKLGPHLLGDPPKIDIALSEM-DVQGEENSPENLPA----IKIYDDDVTMRFLVCG 118
+K S + P LL + P ++ M D GEEN E + + ++IYD++ ++R LVC
Sbjct: 61 EKTISVI-PVLL-EVPLSKVSEDRMSDSSGEENGNEPISSDTAPVQIYDENHSIRLLVCC 118
Query: 119 VPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTC 178
++ +G +EDGL+ALL IE +G++L +R SAPPPP+ A + RGVVTMRCD+ T
Sbjct: 119 EASRPSSAWIGAIEDGLSALLTIEAKGARLLHRVSAPPPPIAATSLLRGVVTMRCDVCTS 178
Query: 179 SSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEP--RKSAS 236
+ A + LLVSG+AQTCF+DQ+LEN +K E+++ + + + ++R P R+S S
Sbjct: 179 TFARVPLLVSGAAQTCFDDQVLENSVKKEMLDKCEHIQVVVED-ESRRPALNLNLRRSVS 237
Query: 237 IACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLF 296
+ACGAS+ E+ +K TW QVLRQLA + SYR LV +GIA ++G VA+F+++DA RL+
Sbjct: 238 VACGASIVELRIKAPTWVGQVLRQLAAESSYRGLVAMGIAGVEGAPVAAFQEEDAARLVS 297
Query: 297 F-CTRQGKADHTENSVLTRPPSWLTSPAPSRKRSE--PCRESKGVESENVCNVRPKLNS- 352
CT +D+ S P WL PA SRKR + P +S +E N
Sbjct: 298 LKCTWNDASDNDRLSNHLALPPWLAPPAASRKRLKLLPSVKSSDNFAEEDRKASKASNGG 357
Query: 353 ------AAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRK 406
AAM+P+PHT K +PF+ ++ G +K ++ + SA P
Sbjct: 358 GNIALLAAMKPLPHTGRRKFMPFANAVMAAQHAGWSMKLDVSSRQRRPESARPRSAYFMP 417
Query: 407 SLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLA 466
+ L PL +K HGC R ++ C+EEEFL D++ FL RGH RL+P G+
Sbjct: 418 GSAGHNAGGAAPFLPPLNVKSHGCKRPHMERCTEEEFLADLVSFLESRGHGRLIPPAGVD 477
Query: 467 EFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLK 526
FP+ +LN KRLDL+NLYREVVSRGGFHVGNGINWKGQ+F+KMRNHT N+MTGVGNTLK
Sbjct: 478 AFPEVVLNGKRLDLYNLYREVVSRGGFHVGNGINWKGQIFAKMRNHTSVNKMTGVGNTLK 537
Query: 527 RHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYA 586
+HYE YLLEYELAHDDV G+CC++CHS A GDWVNCG C EWAHFGCD+R GLGAFKDYA
Sbjct: 538 KHYEVYLLEYELAHDDVGGDCCIICHSGAEGDWVNCGSCSEWAHFGCDQRTGLGAFKDYA 597
Query: 587 KTDGLEYVCPQCSVTNFKKKSQKT 610
KTDGLEY+CP+CS N K ++
Sbjct: 598 KTDGLEYICPRCSAGNGKPNPRRA 621
>gi|168048799|ref|XP_001776853.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
patens subsp. patens]
gi|162671857|gb|EDQ58403.1| ARID/BRIGHT DNA-binding domain-containing protein [Physcomitrella
patens subsp. patens]
Length = 682
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 293/609 (48%), Positives = 386/609 (63%), Gaps = 20/609 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
QGVP+VIYWK + + A HF QALL+ ++SS + DAFQ+A+ASF+++C + +
Sbjct: 83 QGVPHVIYWKGAPTLTLASHFRQALLAALRSSATEAKDAFQIANASFQIHCGQTKTASVN 142
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPA----IKIYDDDVTMRFLVC 117
+K S+ L L PK S EE+ + +PA ++IYDDD+ +R LVC
Sbjct: 143 GHEKASNVLPVLLAPADPKASDEDSSTSAD-EEDEKQTIPADSAPVQIYDDDINIRLLVC 201
Query: 118 GVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLST 177
+ LE GL+ALL IE LH+ SAP PP A + RGVVTMRCD+ T
Sbjct: 202 SEASRPTPAWFAALETGLSALLTIE-----LHHTCSAPAPPQAASSLHRGVVTMRCDICT 256
Query: 178 CSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEP-RKSAS 236
+ A I+LLVSG+AQTCF+DQLLEN ++ EL+E + + +S + + R+S S
Sbjct: 257 TTFARIALLVSGAAQTCFDDQLLENSVRKELLERCVKIRLVEDSESQKPRLNHNLRRSVS 316
Query: 237 IACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLF 296
IA G+SV E+ ++ TW QVLRQLA + SYRSLV LGIA ++G V +F K+D+ L+
Sbjct: 317 IASGSSVVELKIRAPTWVGQVLRQLASEPSYRSLVALGIAGVEGAPVNAFLKEDSAHLVS 376
Query: 297 FCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSE--PCRESKGVESENVCNVRPKLNSAA 354
+ E + PSWLT P SRKR + ++ + + R + AA
Sbjct: 377 LKRSRDCPSIKEIPSSSSVPSWLTPPPASRKRMRVISSHSTDSLKEHDRRSSRANAD-AA 435
Query: 355 MRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQA 414
M+P+PH+ + +PF+ ++ G +K L P T R +++ +
Sbjct: 436 MKPVPHSNRRRFMPFANAVMAAQHAGWTMK--LDQTTRVERRRQRRPHTGRPRTATTQGS 493
Query: 415 QQII----SLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPD 470
++ S+ P +K HGC R P++ C+E+EFL+D++ FL RGH RLVP GL FP+
Sbjct: 494 GGLVNPHLSITPAMVKPHGCSRPPMEECTEDEFLQDLVNFLESRGHNRLVPPAGLEAFPE 553
Query: 471 AILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYE 530
+LN KRLDL+NLY+EVVSRGGFHVGNGINWKGQVFSKM NHT N+MTGVGNTLK+HYE
Sbjct: 554 VVLNGKRLDLYNLYKEVVSRGGFHVGNGINWKGQVFSKMHNHTSVNKMTGVGNTLKKHYE 613
Query: 531 TYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDG 590
TYLLEYELAHDDVDGECC+LCHS + GDWVNCG CGEWAHFGCD R GLGAFKDYAKTDG
Sbjct: 614 TYLLEYELAHDDVDGECCILCHSGSEGDWVNCGSCGEWAHFGCDHRLGLGAFKDYAKTDG 673
Query: 591 LEYVCPQCS 599
LEY+CP CS
Sbjct: 674 LEYICPCCS 682
>gi|226508456|ref|NP_001146332.1| uncharacterized protein LOC100279908 [Zea mays]
gi|219886665|gb|ACL53707.1| unknown [Zea mays]
Length = 407
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 254/385 (65%), Positives = 286/385 (74%), Gaps = 19/385 (4%)
Query: 236 SIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLL 295
S ACGAS FEV M + WA+QVL+ LAPD+SYRSLV LGI I G VASFE+ DA+RLL
Sbjct: 2 STACGASTFEVWMSLPKWAAQVLKHLAPDISYRSLVALGIGCINGTPVASFERRDADRLL 61
Query: 296 FFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESK-------GVESE------- 341
FFCT Q K EN P W S + ++ R++ ESK GV +
Sbjct: 62 FFCTSQCKDLANENGPYFHLPRW--SASLTKDRTKVGSESKQNLLGANGVLEDKKHMMEG 119
Query: 342 --NVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAG 398
+ V+ KL A MRPIPH+R +M PF GF E ++ VK +LP AP +KH+S
Sbjct: 120 PSSWSAVKAKLKPATMRPIPHSRKQQMHPFMGFPETSLHETSIVKPSLPAAPTVKHNSVS 179
Query: 399 PTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTR 458
THRKS S +I LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH R
Sbjct: 180 SATTTHRKSTSGPSHTPSVIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHNR 239
Query: 459 LVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRM 518
LVP GGLAEFPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRM
Sbjct: 240 LVPHGGLAEFPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRM 299
Query: 519 TGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQG 578
TGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQG
Sbjct: 300 TGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQG 359
Query: 579 LGAFKDYAKTDGLEYVCPQCSVTNF 603
LG FKDYAKTDGLEY+CP CS+ N+
Sbjct: 360 LGTFKDYAKTDGLEYICPHCSLANY 384
>gi|408690212|gb|AFU81566.1| ARID-type transcription factor, partial [Zea mays subsp. mays]
gi|414885599|tpg|DAA61613.1| TPA: hypothetical protein ZEAMMB73_042311 [Zea mays]
Length = 292
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 197/259 (76%), Positives = 216/259 (83%), Gaps = 1/259 (0%)
Query: 346 VRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTH 404
V+ KL A MRPIPH+R +M PF GF E ++ VK +LP AP +KH+S TH
Sbjct: 11 VKAKLKPATMRPIPHSRKQQMHPFMGFPETSLHETSIVKPSLPAAPTVKHNSVSSATTTH 70
Query: 405 RKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGG 464
RKS S +I LNPLP+KKHGC R PIQVCSEE+FL+DVMQFLI RGH RLVP GG
Sbjct: 71 RKSTSGPSHTPSVIQLNPLPMKKHGCDRLPIQVCSEEDFLKDVMQFLIQRGHNRLVPHGG 130
Query: 465 LAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNT 524
LAEFPDAILNAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNT
Sbjct: 131 LAEFPDAILNAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRMTGVGNT 190
Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKD 584
LKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEWAHFGCDRRQGLG FKD
Sbjct: 191 LKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRQGLGTFKD 250
Query: 585 YAKTDGLEYVCPQCSVTNF 603
YAKTDGLEY+CP CS+ N+
Sbjct: 251 YAKTDGLEYICPHCSLANY 269
>gi|224034479|gb|ACN36315.1| unknown [Zea mays]
Length = 273
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 193/263 (73%), Positives = 216/263 (82%), Gaps = 3/263 (1%)
Query: 355 MRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRKSLSSSYQ 413
MRPIPH++ +M PF E ++ VK +L VAP +KH+S P THRKS S
Sbjct: 1 MRPIPHSQKQQMHPFMCLPETSFHETSIVKPSLTVAPTVKHNSVSYAPTTHRKSTSGPSH 60
Query: 414 AQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL 473
+I LNPLP+KKHGC R IQVCSEE+FL+DVMQFLI RGH RLVP GGLAEFPDA+L
Sbjct: 61 TPSVIQLNPLPMKKHGCDRLSIQVCSEEDFLKDVMQFLIQRGHNRLVPHGGLAEFPDAVL 120
Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYL 533
NAKRLDL+NLY+EVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNTLKRHYETYL
Sbjct: 121 NAKRLDLYNLYKEVVSRGGFYVGNGINWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYL 180
Query: 534 LEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
LEYELAHDDVDGECCLLCHSSA GDW+NCG+CGEWAHFGCDRRQGLG FKDYAKTDGLEY
Sbjct: 181 LEYELAHDDVDGECCLLCHSSAPGDWMNCGLCGEWAHFGCDRRQGLGTFKDYAKTDGLEY 240
Query: 594 VCPQCSVTNFKKK--SQKTSNGY 614
+CP CS+ N+KK K +NG+
Sbjct: 241 ICPHCSLANYKKPPLPPKVANGF 263
>gi|408690214|gb|AFU81567.1| ARID-type transcription factor, partial [Zea mays subsp. mays]
gi|414870277|tpg|DAA48834.1| TPA: hypothetical protein ZEAMMB73_381937 [Zea mays]
Length = 287
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 184/277 (66%), Positives = 222/277 (80%), Gaps = 7/277 (2%)
Query: 335 SKGVESEN---VCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAP 391
+KGV +N + +++P+L A MRP+PH+R +M PF GF + +D QVK +LP P
Sbjct: 5 AKGVVGDNTISLSSLKPRLKPATMRPLPHSRKQQMHPFMGFPQSIIHDASQVKPSLPAPP 64
Query: 392 LKHSSAGPTPVTHRKSLSSSY----QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDV 447
+KH++ T VT RKS S S + Q I LNPLP+KKH C R PI CSEE+FL+DV
Sbjct: 65 VKHNAVPVTAVTPRKSSSGSPSSWSRVQLPIPLNPLPIKKHECNRLPIHSCSEEDFLKDV 124
Query: 448 MQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFS 507
MQFL+ RGH RLVPQGGLAEFPDA+LN+KRLDL+NLY+EVV RGGF+VGNGINWKGQ+FS
Sbjct: 125 MQFLLQRGHIRLVPQGGLAEFPDAVLNSKRLDLYNLYKEVVYRGGFYVGNGINWKGQIFS 184
Query: 508 KMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGE 567
KM NHT+TN+MTGVGNTLKRHYETYLLEYELAHDD+DGECCL+CHS A GDWVNCG+CGE
Sbjct: 185 KMHNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDIDGECCLICHSGALGDWVNCGLCGE 244
Query: 568 WAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFK 604
WAH GCDRR+GL FKDY+K DG+EY+CPQCS+ +K
Sbjct: 245 WAHLGCDRRKGLSTFKDYSKADGMEYICPQCSLAKYK 281
>gi|115476706|ref|NP_001061949.1| Os08g0451700 [Oryza sativa Japonica Group]
gi|42407649|dbj|BAD08781.1| unknown protein [Oryza sativa Japonica Group]
gi|113623918|dbj|BAF23863.1| Os08g0451700 [Oryza sativa Japonica Group]
gi|215678864|dbj|BAG95301.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215695192|dbj|BAG90383.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 464
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 175/336 (52%), Positives = 224/336 (66%), Gaps = 11/336 (3%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+ Y++YW++S S YAA HF +LLSVV+SSCSH WDAFQLA+ASF YCVRNN V
Sbjct: 136 KGISYIMYWRYSLSSYAASHFRHSLLSVVRSSCSHAWDAFQLAYASFEQYCVRNNDVQ-- 193
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
LGPHLLGD P+I I + EE++ E P IKIYD+DV ++ L+CG C
Sbjct: 194 -----RLMLGPHLLGDAPRIYITPPGNKMAEEEDTSEYFPDIKIYDEDVHLKLLICGAHC 248
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
T D+S+L LEDGLNALLNIE R KL +R SA PP A GVVT+ CD++T SS+
Sbjct: 249 TPDSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSALLDGVVTICCDITTSSSS 308
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+SLL+SGS QTCF+D+LLE HIK ELIE+ +LV + S D+ +EP S S+A GA
Sbjct: 309 HVSLLLSGSPQTCFDDELLEKHIKKELIESRRLVRVVSVSEDDGPSSAEPLTSMSVASGA 368
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LA + SY+SLV LGIAS+ V+SF+K+D +RLLFFCT Q
Sbjct: 369 STFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNDTPVSSFDKEDVDRLLFFCTNQ 428
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR----SEPCR 333
+ + N + P W S A R + S+P R
Sbjct: 429 DEDEAIGNGLYHHLPRWSASLAKDRVKRNFVSKPAR 464
>gi|222640664|gb|EEE68796.1| hypothetical protein OsJ_27535 [Oryza sativa Japonica Group]
Length = 684
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 172/327 (52%), Positives = 220/327 (67%), Gaps = 7/327 (2%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+ Y++YW++S S YAA HF +LLSVV+SSCSH WDAFQLA+ASF YCVRNN V
Sbjct: 144 KGISYIMYWRYSLSSYAASHFRHSLLSVVRSSCSHAWDAFQLAYASFEQYCVRNNDVQ-- 201
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
LGPHLLGD P+I I + EE++ E P IKIYD+DV ++ L+CG C
Sbjct: 202 -----RLMLGPHLLGDAPRIYITPPGNKMAEEEDTSEYFPDIKIYDEDVHLKLLICGAHC 256
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSA 181
T D+S+L LEDGLNALLNIE R KL +R SA PP A GVVT+ CD++T SS+
Sbjct: 257 TPDSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSALLDGVVTICCDITTSSSS 316
Query: 182 HISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGA 241
H+SLL+SGS QTCF+D+LLE HIK ELIE+ +LV + S D+ +EP S S+A GA
Sbjct: 317 HVSLLLSGSPQTCFDDELLEKHIKKELIESRRLVRVVSVSEDDGPSSAEPLTSMSVASGA 376
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LA + SY+SLV LGIAS+ V+SF+K+D +RLLFFCT Q
Sbjct: 377 STFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNDTPVSSFDKEDVDRLLFFCTNQ 436
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR 328
+ + N + P W S A R +
Sbjct: 437 DEDEAIGNGLYHHLPRWSASLAKDRVK 463
>gi|115476084|ref|NP_001061638.1| Os08g0361000 [Oryza sativa Japonica Group]
gi|113623607|dbj|BAF23552.1| Os08g0361000, partial [Oryza sativa Japonica Group]
Length = 325
Score = 308 bits (788), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 178/336 (52%), Positives = 225/336 (66%), Gaps = 16/336 (4%)
Query: 3 GVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASN 62
G+ Y++YW++S S YAA HF +L+SVVQSSCSH WDAFQ+A+A+F+LYCVRNN V
Sbjct: 1 GISYIMYWRYSLSSYAASHFRHSLMSVVQSSCSHAWDAFQVAYATFQLYCVRNNEVQ--- 57
Query: 63 SQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCT 122
LGPHLLGD P+I I ++ EE++ E P IKIYD+DV ++ L+CG CT
Sbjct: 58 ----RLMLGPHLLGDAPRIYITPPGNEIAEEEDTSEYFPDIKIYDEDVNLKLLICGAHCT 113
Query: 123 LDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAH 182
LD+SLL LEDGLNALLNIE R KL +R SA PP G+VT+ CD++T SS+H
Sbjct: 114 LDSSLLNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGMVTICCDITTSSSSH 173
Query: 183 ISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPS-EPRKSASIACGA 241
+SLL+SGS QTCF+D+LLE HIK ELIE+ QLV + S D P S EP S S+A GA
Sbjct: 174 VSLLLSGSPQTCFDDKLLEKHIKKELIESRQLVRVVSVSEDG--PSSAEPLTSMSVASGA 231
Query: 242 SVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQ 301
S FEV M + WA+QVL+ LA + SY+SLV LGIAS+ G V SF+ D + LLFF T Q
Sbjct: 232 STFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVCSFDSQDVDWLLFFRTIQ 291
Query: 302 GKADHTENSVLTRPPSWLTSPAPSRKR----SEPCR 333
+A T S+ PP W S A +R + S+P R
Sbjct: 292 DEAIGT--SLYPHPPRWSASLAKNRVKGSMVSKPAR 325
>gi|384251328|gb|EIE24806.1| hypothetical protein COCSUDRAFT_46949 [Coccomyxa subellipsoidea
C-169]
Length = 708
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 201/623 (32%), Positives = 278/623 (44%), Gaps = 79/623 (12%)
Query: 2 QGVPYVIYWKHSFS----CYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNI 57
QGV +V+YW + F A + +++ + +AF +A S +C N
Sbjct: 135 QGVAHVVYWDKKLTPSPKALHVTQFSHAFFATLRNRSATIPEAFAVALHSGHAHCGSNPG 194
Query: 58 VMASNSQKGSSKLGPHLLGD----PPKIDIA----LSEMDVQ-GEENSPENLPAIKIYDD 108
M S P L+ D P ID + D + G + +++
Sbjct: 195 PMLRPSWL------PELVSDFEPLLPSIDSVPLPLVRGADFKNGVAAAVPGWENVRLLAP 248
Query: 109 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKL--HNRTSAPPPPLQAGAFSR 166
+R LVCG +D L L + L ALL +E+R L HN P L A A
Sbjct: 249 HAELRLLVCGQSTIIDAHRLSYLGEALRALLVLEVRQLVLASHNPCERTPAHLPANA--- 305
Query: 167 GVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENS-QLVHALPNSGDNR 225
RC + S+AH ++++ G Q +D L+E ++ L+ ++ L LP +G
Sbjct: 306 --TATRCSMRWASNAHATVVLGGPPQVLGHDSLVEYALRQTLVADALSLQFRLPPAG--- 360
Query: 226 LPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVAS 285
+P R+SA+IA +V + + S WA +LR L+ S+R LV LG + VA
Sbjct: 361 IPAPVARRSAAIANNTAVVDALVVTSVWAVHLLRTLSQYTSFRGLVALGFGGVGSARVAG 420
Query: 286 FEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSP-----APSRKRSEPCRES-KGVE 339
F DA R + + + P APS S P + K +
Sbjct: 421 FNSTDAARYSALSDAYDTPEALQAAGALPAALPGGQPGAGPSAPSGPASYPTMLAPKHPD 480
Query: 340 SENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGP 399
++ + +P +S+ +P H K E D D+ AG
Sbjct: 481 AQKDADAQPGGSSSQAQPA--NGHAK----------EEADEDE-------------GAGA 515
Query: 400 TPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRL 459
RK ++ +S P + CSE +FL D+ F R +
Sbjct: 516 GAAKRRKLTEAAAAVPVWVSKRPR-----------MWECSEAQFLEDLCAFHCERNGKNV 564
Query: 460 VPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMT 519
P+ FPDAILN RLDL+NLY+EV SRGG VGNGINWKGQVF KM N T NRMT
Sbjct: 565 TPE----TFPDAILNGSRLDLYNLYKEVTSRGGIKVGNGINWKGQVFPKMNNFTAHNRMT 620
Query: 520 GVGNTLKRHYETYLLEYELAH-DDVDGECCLLCHSS--AAGDWVNCGICGEWAHFGCDRR 576
GVGN LKRHY YLLEYE AH +DV + C +C S AA DW++C C W HF CD R
Sbjct: 621 GVGNALKRHYTMYLLEYEQAHPEDVIRDTCAICGGSDEAATDWISCDSCNNWVHFSCDGR 680
Query: 577 QGLGAFKDYAKTDGLEYVCPQCS 599
LG FKDYAK +G Y C C+
Sbjct: 681 TNLGTFKDYAKGNGATYNCINCA 703
>gi|222640433|gb|EEE68565.1| hypothetical protein OsJ_27051 [Oryza sativa Japonica Group]
Length = 349
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 149/287 (51%), Positives = 188/287 (65%), Gaps = 12/287 (4%)
Query: 43 LAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPA 102
+A+A+F+LYCVRNN V LGPHLLGD P+I I ++ EE++ E P
Sbjct: 66 VAYATFQLYCVRNNEVQ-------RLMLGPHLLGDAPRIYITPPGNEIAEEEDTSEYFPD 118
Query: 103 IKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG 162
IKIYD+DV ++ L+CG CTLD+SLL LEDGLNALLNIE R KL +R SA PP
Sbjct: 119 IKIYDEDVNLKLLICGAHCTLDSSLLNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDS 178
Query: 163 AFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSG 222
G+VT+ CD++T SS+H+SLL+SGS QTCF+D+LLE HIK ELIE+ QLV + S
Sbjct: 179 TLLDGMVTICCDITTSSSSHVSLLLSGSPQTCFDDKLLEKHIKKELIESRQLVRVVSVSE 238
Query: 223 DNRLPPS-EPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGL 281
D P S EP S S+A GAS FEV M + WA+QVL+ LA + SY+SLV LGIAS+ G
Sbjct: 239 DG--PSSAEPLTSMSVASGASTFEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGT 296
Query: 282 SVASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKR 328
V SF+ D + LLFF T Q +A T S+ PP W S A +R +
Sbjct: 297 PVCSFDSQDVDWLLFFRTIQDEAIGT--SLYPHPPRWSASLAKNRVK 341
>gi|414870278|tpg|DAA48835.1| TPA: hypothetical protein ZEAMMB73_381937 [Zea mays]
Length = 137
Score = 238 bits (606), Expect = 1e-59, Method: Composition-based stats.
Identities = 95/119 (79%), Positives = 109/119 (91%)
Query: 486 EVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDG 545
+VV RGGF+VGNGINWKGQ+FSKM NHT+TN+MTGVGNTLKRHYETYLLEYELAHDD+DG
Sbjct: 13 QVVYRGGFYVGNGINWKGQIFSKMHNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDIDG 72
Query: 546 ECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFK 604
ECCL+CHS A GDWVNCG+CGEWAH GCDRR+GL FKDY+K DG+EY+CPQCS+ +K
Sbjct: 73 ECCLICHSGALGDWVNCGLCGEWAHLGCDRRKGLSTFKDYSKADGMEYICPQCSLAKYK 131
>gi|218201035|gb|EEC83462.1| hypothetical protein OsI_28965 [Oryza sativa Indica Group]
Length = 522
Score = 236 bits (602), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 188/316 (59%), Gaps = 33/316 (10%)
Query: 33 SCSHTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQG 92
SCSH WDAFQ+A+A+F LYCVRNN V LGPHLLGD P+I I ++
Sbjct: 212 SCSHAWDAFQVAYATFELYCVRNNEVQ-------RLMLGPHLLGDAPRIYITPPGNEMAE 264
Query: 93 EENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIE---------- 142
EE++ E P IKIYD++V ++ L+CG CTLD+SLL LEDGLNALLNIE
Sbjct: 265 EEDTSEYFPDIKIYDENVNLKLLICGAHCTLDSSLLNSLEDGLNALLNIETMQHTHIEKL 324
Query: 143 --------IRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTC 194
R KL +R SA PP G+VT+ CD++T SS+ L+SGS Q C
Sbjct: 325 SSGGGDWWFRWCKLQDRVSAAPPLHVDSTLLDGMVTICCDITT-SSSSHVSLLSGSPQIC 383
Query: 195 FNDQLLENHIKNELIENSQLVHALPNSGDNRLPPS--EPRKSASIACGASVFEVSMKVST 252
F+D+LLE HIK ELI++ +LV + S D PS EP S S+A GAS FEV M +
Sbjct: 384 FDDKLLEKHIKKELIDSRRLVRVVSVSKDG---PSSAEPLTSMSVASGASTFEVLMTLPR 440
Query: 253 WASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVL 312
W +QVL+ LA + SY+SLV LGIAS+ G V F+ D + LLFF T Q +A T S+
Sbjct: 441 WVAQVLKYLAQETSYKSLVPLGIASVNGTPVCLFDSQDVDWLLFFRTIQDEAIGT--SLY 498
Query: 313 TRPPSWLTSPAPSRKR 328
PP W S A R +
Sbjct: 499 PHPPRWSASLAKDRVK 514
>gi|147799373|emb|CAN61485.1| hypothetical protein VITISV_043023 [Vitis vinifera]
Length = 106
Score = 236 bits (601), Expect = 4e-59, Method: Composition-based stats.
Identities = 101/106 (95%), Positives = 105/106 (99%)
Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW
Sbjct: 1 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 60
Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
AHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+TNF+KKSQKT+NGY
Sbjct: 61 AHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFQKKSQKTANGY 106
>gi|218201241|gb|EEC83668.1| hypothetical protein OsI_29441 [Oryza sativa Indica Group]
Length = 736
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 142/267 (53%), Positives = 176/267 (65%), Gaps = 9/267 (3%)
Query: 19 ACHFLQALLSVVQSSCS--HTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLG 76
A HF AL+SV S H WDAFQLA+ASF YCVRNN V LGPHLLG
Sbjct: 247 ASHFCHALVSVSVCQSSCSHAWDAFQLAYASFEQYCVRNNEVQCL-------MLGPHLLG 299
Query: 77 DPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLN 136
D P+I + + EE++ E P IKIYD+DV ++ L+CG CT D+S+L +EDGLN
Sbjct: 300 DAPRIYMTPPGSKMAEEEDTSEYFPDIKIYDEDVHLKLLICGAHCTPDSSVLNSVEDGLN 359
Query: 137 ALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFN 196
ALLNIE R KL +R SA PP A GVVT+ CD++T SS+H+SLL+SGS QTCF+
Sbjct: 360 ALLNIEFRWCKLQDRVSAAPPLHVDSALLDGVVTICCDITTSSSSHVSLLLSGSPQTCFD 419
Query: 197 DQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQ 256
D+LLE HIK ELIE +LV + S D+R +EP S S+A GAS FEV M + WA+Q
Sbjct: 420 DELLEKHIKKELIETRRLVRVVSVSEDDRPSSAEPLTSMSVASGASTFEVLMTLPKWAAQ 479
Query: 257 VLRQLAPDVSYRSLVMLGIASIQGLSV 283
VL+ LA + SY+SLV LGIAS+ G V
Sbjct: 480 VLKYLAQETSYKSLVPLGIASVNGCYV 506
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 120/236 (50%), Positives = 158/236 (66%)
Query: 93 EENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRT 152
EE++ E P I IYD+DV ++ L+CG CT D+S+L LEDGLNALLNIE R KL +R
Sbjct: 4 EEDTSEYFPDITIYDEDVHLKLLICGAHCTPDSSILNSLEDGLNALLNIEFRWCKLQDRV 63
Query: 153 SAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENS 212
SA P A GVVT+ CD++T SS+H+SLL+SGS QTCF+D+LLE HIK ELIE+
Sbjct: 64 SAVAPLHVDSALLDGVVTICCDITTSSSSHVSLLLSGSPQTCFDDELLEKHIKKELIESR 123
Query: 213 QLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVM 272
+LV + S D+ +EP S S+A GAS FEV M + WA+QVL+ LA + SY+SLV
Sbjct: 124 RLVRVVSVSEDDGPSSAEPLTSMSVASGASTFEVLMTLPKWAAQVLKYLAQETSYKSLVP 183
Query: 273 LGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKR 328
LGIAS+ G V+SF+K+D +RLLFF T Q + + + + P W S A R +
Sbjct: 184 LGIASVNGTPVSSFDKEDVDRLLFFYTNQDEDEAIGSGLYHHLPRWSASLAKDRVK 239
>gi|413936076|gb|AFW70627.1| hypothetical protein ZEAMMB73_228285 [Zea mays]
Length = 933
Score = 213 bits (542), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 113/208 (54%), Positives = 141/208 (67%), Gaps = 1/208 (0%)
Query: 89 DVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKL 148
+V EE E AI+I+D+DV ++FL+CGVPCTLD LLG L+DGLNALLNI+I SKL
Sbjct: 307 EVDEEEGGFEGFSAIRIHDEDVNLKFLLCGVPCTLDACLLGTLKDGLNALLNIDICASKL 366
Query: 149 HNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEL 208
NR SA PP LQA S GV T RC+++T SSAH++ LVSGSAQTCF DQLL + IKNE+
Sbjct: 367 QNRASAHPP-LQAEDLSHGVATRRCEITTWSSAHVAYLVSGSAQTCFGDQLLGSRIKNEI 425
Query: 209 IENSQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYR 268
IE QLV L N+ D + EP S +AC S FE+ + + WA+QVL+ LAP++SYR
Sbjct: 426 IEKRQLVCVLLNNDDIKTSSFEPWPSMGVACETSTFELWITMPKWAAQVLKHLAPEISYR 485
Query: 269 SLVMLGIASIQGLSVASFEKDDAERLLF 296
SLV LGIA I + + LLF
Sbjct: 486 SLVALGIAWINACTRQEVRLECKNVLLF 513
>gi|30841464|gb|AAP34368.1| fiber protein Fb21 [Gossypium barbadense]
Length = 104
Score = 213 bits (541), Expect = 3e-52, Method: Composition-based stats.
Identities = 93/102 (91%), Positives = 96/102 (94%)
Query: 513 TLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFG 572
TLTNRMTGVGNTLKRHYET LLEYELAHDDVDGECCLLCHSSAAGDWVNCG+C EWAHFG
Sbjct: 3 TLTNRMTGVGNTLKRHYETSLLEYELAHDDVDGECCLLCHSSAAGDWVNCGVCDEWAHFG 62
Query: 573 CDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
CDRRQGLGAFKDYAKTDGLEYVCP CS++NFKKK KT NGY
Sbjct: 63 CDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPHKTVNGY 104
>gi|159489410|ref|XP_001702690.1| DNA binding protein [Chlamydomonas reinhardtii]
gi|158280712|gb|EDP06469.1| DNA binding protein [Chlamydomonas reinhardtii]
Length = 729
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 96/178 (53%), Positives = 121/178 (67%), Gaps = 11/178 (6%)
Query: 428 HGCGRAPIQVCSEEEFLRDVMQFL-ILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYRE 486
+ C R P+ CSE +F D+ +FL +LRG T ++FP+A+LN LDLF LYRE
Sbjct: 542 YTCRRPPLSACSEAQFYDDLCEFLTLLRGKTV-----ERSKFPEAVLNGVSLDLFALYRE 596
Query: 487 VVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DDVDG 545
VVSRGGF VGNGINWKGQVF +MRN T +N+ TGVGN LKRHY+ YL EYE+AH +DV
Sbjct: 597 VVSRGGFRVGNGINWKGQVFPRMRNWTESNKQTGVGNALKRHYQNYLWEYEVAHPEDVTL 656
Query: 546 ECCLLCHS----SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
+ C+LC++ A DW+ C C W H CD+R GLG +KDY + +G YVCP CS
Sbjct: 657 DRCVLCNARDREGGAADWLCCDCCENWVHHSCDKRPGLGQYKDYTQGNGRVYVCPSCS 714
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 136/309 (44%), Gaps = 28/309 (9%)
Query: 2 QGVPYVIYWKH--SFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVM 59
+GV V+ W + A H+ A + + ++ + +AF A S + +C +V+
Sbjct: 128 RGVANVVVWPAGTAIPTAVAAHYNCAFAAFLINTLASPPEAFAAASHSTQAHC----MVL 183
Query: 60 ASNSQKGSSKLGPHLLG-------DPPKIDIALSEMDVQGEENSPENLPAIKIYDD---- 108
S+ L P +G D I +Q ++ P NLPA + D
Sbjct: 184 GSDGTHTVPAL-PAFVGAARPALPDSASIPPLAVPPGLQ-QQLDPNNLPAFPGWTDVRLL 241
Query: 109 --DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSR 166
+R L+ G +D++ L L + L ALL +E RG +L RT P
Sbjct: 242 APRAELRLLLTGGSGLVDSTRLSFLGEALRALLVMEARGLQLTARTPLDKLPRN---LPA 298
Query: 167 GVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENS-QLVHALPNSGDNR 225
G RCD +T S A S+++ + L+E+ ++ L+ ++ L LP G +
Sbjct: 299 GCAAARCDYTTSSGAAGSVVLGAPPAVLEHPTLVEHALRMTLVTDAVSLQFRLPPPGLS- 357
Query: 226 LPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVAS 285
LP PR S ++A G V + + S WA +VLR L + YRSL LG+A++ + +
Sbjct: 358 LP--TPRSSKAVAGGTPVVDTVVVTSVWAVEVLRALCKEPRYRSLAALGVAAVGNAADCA 415
Query: 286 FEKDDAERL 294
F D R+
Sbjct: 416 FTPADVARM 424
>gi|194705224|gb|ACF86696.1| unknown [Zea mays]
Length = 118
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 86/95 (90%), Positives = 90/95 (94%)
Query: 509 MRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 568
MRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+CGEW
Sbjct: 1 MRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEW 60
Query: 569 AHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
AHFGCDRRQGLG FKDYAKTDGLEY+CP CS+ N+
Sbjct: 61 AHFGCDRRQGLGTFKDYAKTDGLEYICPHCSLANY 95
>gi|115479313|ref|NP_001063250.1| Os09g0434100 [Oryza sativa Japonica Group]
gi|113631483|dbj|BAF25164.1| Os09g0434100 [Oryza sativa Japonica Group]
Length = 295
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/148 (65%), Positives = 110/148 (74%), Gaps = 7/148 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYVIYW++SFS YAA HF AL+SVVQSS SHTWDAFQLAHASFRLYCVRNN V
Sbjct: 136 KGIPYVIYWRNSFSSYAASHFRHALMSVVQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-- 193
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PKI+I E ++ EE S + PAIKIYDDD+ M+FL+CGVP
Sbjct: 194 -----SVKLGPRLLGDAPKINITPPENEMVEEEGSSDVFPAIKIYDDDINMKFLLCGVPS 248
Query: 122 TLDTSLLGPLEDGLNALLNIEIRGSKLH 149
T D LLG LEDGLNALLNIE+ L
Sbjct: 249 TPDPCLLGSLEDGLNALLNIEVWNKALQ 276
>gi|302845046|ref|XP_002954062.1| hypothetical protein VOLCADRAFT_121257 [Volvox carteri f.
nagariensis]
gi|300260561|gb|EFJ44779.1| hypothetical protein VOLCADRAFT_121257 [Volvox carteri f.
nagariensis]
Length = 680
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/173 (54%), Positives = 116/173 (67%), Gaps = 9/173 (5%)
Query: 425 LKKHGCGRAPIQVCSEEEFLRDVMQFL-ILRGHTRLVPQGGLAEFPDAILNAKRLDLFNL 483
L + C R P+ C+E +F D+++FL +LRG FP+A+LN LDLF L
Sbjct: 435 LPGYVCRRPPLSSCTESQFYDDLVEFLTLLRGKPI-----DRVRFPEAVLNGVSLDLFGL 489
Query: 484 YREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DD 542
YREVV+RGGF VGNGINWKGQVF +MRN T TN+ TGVGN LKRHY+ YL EYELAH +D
Sbjct: 490 YREVVTRGGFRVGNGINWKGQVFPRMRNWTETNKQTGVGNALKRHYQNYLWEYELAHPED 549
Query: 543 VDGECCLLCHSSAA--GDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY 593
V + C+LC+ A DWV C C WAHF CD+R G+G FKDYA+ G Y
Sbjct: 550 VTLDRCVLCNGGDAVSSDWVCCDACENWAHFSCDKRPGIGMFKDYAQGQGRVY 602
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 99/216 (45%), Gaps = 8/216 (3%)
Query: 79 PKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNAL 138
P + + E+D G ++ P + +++ +R L+ G +D L L + L AL
Sbjct: 57 PAVVVPGLELDEHGLKSFPGWM-DVRLLAPRAELRLLLVGTSSLIDARRLSSLGEALRAL 115
Query: 139 LNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQ 198
L +E+R KL + + P A SR V RC T S +++ ++
Sbjct: 116 LVMEVRTLKLVSLGAVDSLPRNLPAGSRAV---RCQFLTASGVSAGVVLGTPPSVLQHEA 172
Query: 199 LLENHIKNELIENS-QLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQV 257
L+E+ ++ L+ +S L LP G + LP PR S +A G V + S WA +V
Sbjct: 173 LVEHALRMTLVTDSVGLQFRLPPPGLS-LP--APRSSRLVAGGLPVMDTVAVTSVWAVEV 229
Query: 258 LRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAER 293
LR L D YR+L LG+A++ ++ F D R
Sbjct: 230 LRSLCKDPRYRTLAALGVAAVGNVADGVFTAADVRR 265
>gi|297608671|ref|NP_001061947.2| Os08g0451500 [Oryza sativa Japonica Group]
gi|255678493|dbj|BAF23861.2| Os08g0451500 [Oryza sativa Japonica Group]
Length = 298
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 111/214 (51%), Positives = 142/214 (66%), Gaps = 4/214 (1%)
Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
D+S+L LEDGLNALLNIE R KL +R SA PP GVVT+ CD++T SS+H+
Sbjct: 85 DSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 144
Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
SLL SGS QTCF+D+LLE HIK ELIE+ +LV + S D+R +EP S S+A GAS
Sbjct: 145 SLLHSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 204
Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
FEV M + WA+QVL+ LA + SY+SLV LGIAS+ G V+SF++ D +RLLFFCT Q +
Sbjct: 205 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 264
Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR----SEPCR 333
+ N + PP W S A R + S+P R
Sbjct: 265 DEAIANGLYHHPPRWSASLAKDRVKGNMVSKPAR 298
>gi|42407647|dbj|BAD08779.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 301
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 108/205 (52%), Positives = 138/205 (67%)
Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
D+S+L LEDGLNALLNIE R KL +R SA PP GVVT+ CD++T SS+H+
Sbjct: 85 DSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 144
Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
SLL SGS QTCF+D+LLE HIK ELIE+ +LV + S D+R +EP S S+A GAS
Sbjct: 145 SLLHSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 204
Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
FEV M + WA+QVL+ LA + SY+SLV LGIAS+ G V+SF++ D +RLLFFCT Q +
Sbjct: 205 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 264
Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR 328
+ N + PP W S A R +
Sbjct: 265 DEAIANGLYHHPPRWSASLAKDRVK 289
>gi|222640663|gb|EEE68795.1| hypothetical protein OsJ_27534 [Oryza sativa Japonica Group]
Length = 353
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/205 (52%), Positives = 138/205 (67%)
Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
D+S+L LEDGLNALLNIE R KL +R SA PP GVVT+ CD++T SS+H+
Sbjct: 137 DSSILNSLEDGLNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 196
Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
SLL SGS QTCF+D+LLE HIK ELIE+ +LV + S D+R +EP S S+A GAS
Sbjct: 197 SLLHSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 256
Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
FEV M + WA+QVL+ LA + SY+SLV LGIAS+ G V+SF++ D +RLLFFCT Q +
Sbjct: 257 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 316
Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR 328
+ N + PP W S A R +
Sbjct: 317 DEAIANGLYHHPPRWSASLAKDRVK 341
>gi|414883857|tpg|DAA59871.1| TPA: hypothetical protein ZEAMMB73_492055 [Zea mays]
Length = 884
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/238 (46%), Positives = 142/238 (59%), Gaps = 10/238 (4%)
Query: 93 EENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRT 152
EE PE AI++YD+DV M+FL+CGVPCTLD L+G LEDGLN+LLNIEI SKL NR
Sbjct: 529 EEGCPEGFSAIRVYDEDVNMKFLLCGVPCTLDACLMGALEDGLNSLLNIEICASKLQNRV 588
Query: 153 SAPPPPLQAGAFSRGVV-TMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIEN 211
SA P LQA + ++ + TC + L QTCF+DQLL + I+NE+IE
Sbjct: 589 SAHPL-LQAEDLPQELLRSHNLQFCTCVVSCFWL-----RQTCFDDQLLGSRIRNEIIEK 642
Query: 212 SQLVHALPNSGDNRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLV 271
QLV L + DN+ E S +AC S FEV + + WA+QVL+ LAP++SYRSLV
Sbjct: 643 WQLVCVLLYNEDNKPSSFECWPSTGVACRDSTFEVWITMPKWAAQVLKHLAPEISYRSLV 702
Query: 272 MLGIASIQGLSVASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRS 329
LGI+ + G V+SF+ DA RLLF C+ + N +W AP K S
Sbjct: 703 ALGISWVNGTPVSSFDWQDAGRLLFLCSNKCGDQAILNGSFAHGSNW---AAPLTKDS 757
>gi|414870276|tpg|DAA48833.1| TPA: hypothetical protein ZEAMMB73_656706 [Zea mays]
Length = 276
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/141 (63%), Positives = 107/141 (75%), Gaps = 7/141 (4%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+G+PYV+YW++ FS Y A HF AL+SV+QSSCSHTWDAFQLAHASFRLYCVRNN V
Sbjct: 137 KGIPYVVYWRNLFSSYTASHFRHALMSVIQSSCSHTWDAFQLAHASFRLYCVRNNHVQ-- 194
Query: 62 NSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPC 121
S KLGP LLGD PK ++ + +V E+ E P I+IYD+DV+M+FL+CGVPC
Sbjct: 195 -----SVKLGPCLLGDAPKTNVIPAGDEVNEEQGCSEGFPDIRIYDEDVSMKFLLCGVPC 249
Query: 122 TLDTSLLGPLEDGLNALLNIE 142
TLD LLG LEDGLNALLNIE
Sbjct: 250 TLDACLLGALEDGLNALLNIE 270
>gi|218201240|gb|EEC83667.1| hypothetical protein OsI_29440 [Oryza sativa Indica Group]
Length = 353
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 107/205 (52%), Positives = 138/205 (67%)
Query: 124 DTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSSAHI 183
D+S+L LEDG NALLNIE R KL +R SA PP GVVT+ CD++T SS+H+
Sbjct: 137 DSSILNSLEDGSNALLNIEFRWCKLQDRVSAAPPLHVDSTLLDGVVTICCDITTSSSSHV 196
Query: 184 SLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACGASV 243
SLL+SGS QTCF+D+LLE HIK ELIE+ +LV + S D+R +EP S S+A GAS
Sbjct: 197 SLLLSGSPQTCFDDKLLEKHIKKELIESRRLVRVVSVSEDDRPSSAEPLTSMSVASGAST 256
Query: 244 FEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTRQGK 303
FEV M + WA+QVL+ LA + SY+SLV LGIAS+ G V+SF++ D +RLLFFCT Q +
Sbjct: 257 FEVLMTLPKWAAQVLKYLAQETSYKSLVPLGIASVNGTPVSSFDRQDVDRLLFFCTNQDE 316
Query: 304 ADHTENSVLTRPPSWLTSPAPSRKR 328
+ N + PP W S A R +
Sbjct: 317 DEAIANGLYHHPPRWSASLAKDRVK 341
>gi|345451024|gb|AEN94092.1| DNA binding protein [Pyrus pyrifolia]
Length = 93
Score = 171 bits (432), Expect = 1e-39, Method: Composition-based stats.
Identities = 71/82 (86%), Positives = 76/82 (92%)
Query: 533 LLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLE 592
L YE AHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLE
Sbjct: 2 LTAYEFAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLE 61
Query: 593 YVCPQCSVTNFKKKSQKTSNGY 614
Y+CP CS++NFKKK QK +NG+
Sbjct: 62 YICPHCSISNFKKKPQKIANGF 83
>gi|414870275|tpg|DAA48832.1| TPA: hypothetical protein ZEAMMB73_656706 [Zea mays]
Length = 351
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/111 (65%), Positives = 83/111 (74%), Gaps = 7/111 (6%)
Query: 32 SSCSHTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQ 91
SSCSHTWDAFQLAHASFRLYCVRNN V S KLGP LLGD PK ++ + +V
Sbjct: 242 SSCSHTWDAFQLAHASFRLYCVRNNHVQ-------SVKLGPCLLGDAPKTNVIPAGDEVN 294
Query: 92 GEENSPENLPAIKIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIE 142
E+ E P I+IYD+DV+M+FL+CGVPCTLD LLG LEDGLNALLNIE
Sbjct: 295 EEQGCSEGFPDIRIYDEDVSMKFLLCGVPCTLDACLLGALEDGLNALLNIE 345
>gi|414885600|tpg|DAA61614.1| TPA: hypothetical protein ZEAMMB73_011257, partial [Zea mays]
Length = 91
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/97 (62%), Positives = 72/97 (74%), Gaps = 7/97 (7%)
Query: 27 LSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALS 86
+S +QSS SHTWDAFQLAHASFRLYCVRNN V S KLGP LLGD PKI+I+
Sbjct: 1 MSAIQSSVSHTWDAFQLAHASFRLYCVRNNHVQ-------SVKLGPRLLGDAPKINISPP 53
Query: 87 EMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVPCTL 123
E ++ EE S E PAIKIYD+++ M+ L+CGVPCTL
Sbjct: 54 ETEMVDEEGSSEVTPAIKIYDEEINMKLLLCGVPCTL 90
>gi|307104284|gb|EFN52539.1| hypothetical protein CHLNCDRAFT_15191, partial [Chlorella
variabilis]
Length = 90
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/90 (54%), Positives = 62/90 (68%), Gaps = 3/90 (3%)
Query: 513 TLTNRMTGVGNTLKRHYETYLLEYELAH-DDVDGECCLLCH--SSAAGDWVNCGICGEWA 569
T ++MTGVGN LKRHY+ +LLEYE AH +DV G+ C +C A DW++C +C W
Sbjct: 1 TAHHKMTGVGNALKRHYQVFLLEYEQAHPEDVTGDRCGICGRGDEHAADWLSCDMCDCWV 60
Query: 570 HFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
HF CD RQG G+FKDY+K G Y CP+CS
Sbjct: 61 HFSCDTRQGRGSFKDYSKGRGRLYHCPRCS 90
>gi|357515069|ref|XP_003627823.1| hypothetical protein MTR_8g038730 [Medicago truncatula]
gi|92885128|gb|ABE87648.1| DNA binding , related [Medicago truncatula]
gi|355521845|gb|AET02299.1| hypothetical protein MTR_8g038730 [Medicago truncatula]
Length = 204
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 40/59 (67%), Positives = 51/59 (86%)
Query: 1 MQGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVM 59
++G+PYV++WK++FS YAACHF QAL SVVQSS +HTWDAF LA ASF+ YCV+NN V+
Sbjct: 129 LKGIPYVVFWKNAFSQYAACHFHQALFSVVQSSSTHTWDAFHLARASFQPYCVQNNQVL 187
>gi|384246380|gb|EIE19870.1| hypothetical protein COCSUDRAFT_48713 [Coccomyxa subellipsoidea
C-169]
Length = 1111
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 66/127 (51%), Gaps = 19/127 (14%)
Query: 430 CGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRL--DLFNLYREV 487
C R P+ +C+E EF D+ FL RG L + + +N L D+F LYREV
Sbjct: 301 CRRPPLGICTEIEFEEDLYAFLSQRGEAELA-----NDLRNKRINCNNLPIDIFGLYREV 355
Query: 488 VSRGGFHVGN-----------GINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
V GGF V N GIN+ G+VF +M+N+T NR T VGN L +Y +LL Y
Sbjct: 356 VRHGGF-VDNERYDDYNRWTGGINFGGKVFPRMKNYTPYNRATSVGNQLLNNYRKFLLAY 414
Query: 537 ELAHDDV 543
E H D
Sbjct: 415 EPTHSDA 421
>gi|255075697|ref|XP_002501523.1| predicted protein [Micromonas sp. RCC299]
gi|226516787|gb|ACO62781.1| predicted protein [Micromonas sp. RCC299]
Length = 1085
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 448 MQFLILRGHTRLVPQGGLAE-FPDAILNAKRLDLFNLYREVVSRGGFHVGN-------GI 499
+ FL +G +L P+G E FP LN +LDLF LYR V RGGF + +
Sbjct: 70 ISFLQAKGRFKLAPEGCDPESFPRRTLNGFKLDLFGLYRAVCVRGGFPRADRKKSDAPKL 129
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
NW VF+++ NHT T++ T VG+ L HY+ YL EYE H
Sbjct: 130 NWTRDVFTRLANHTDTHKATSVGHDLITHYQNYLAEYEDEH 170
>gi|255089563|ref|XP_002506703.1| predicted protein [Micromonas sp. RCC299]
gi|226521976|gb|ACO67961.1| predicted protein [Micromonas sp. RCC299]
Length = 764
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 70/154 (45%), Gaps = 13/154 (8%)
Query: 429 GCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL--NAKRLDLFNLYRE 486
G R P+ C+ E+F+ D M+F +G V + + D L + + +D Y+E
Sbjct: 234 GWRRPPLSRCTPEDFMLDFMRFTDSQGDDWEV----VWDILDNSLRRSGRPMDALGFYKE 289
Query: 487 VVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DD 542
+ RGGF + VF ++ NH + + T +GN L YE Y YE AH +D
Sbjct: 290 ICWRGGFVNRESAKKRIKMPDVFKQLHNHYVNHTYTDIGNNLLNTYERYFWAYERAHPED 349
Query: 543 VDGECCLLCHSS---AAGDWVNCGICGEWAHFGC 573
V + C+ C G + C CG+W H C
Sbjct: 350 VSNQRCVTCEQDRVEKPGQMLRCDGCGDWHHAEC 383
>gi|145353915|ref|XP_001421244.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|145353987|ref|XP_001421278.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581481|gb|ABO99537.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581515|gb|ABO99571.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 638
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 67/125 (53%), Gaps = 14/125 (11%)
Query: 432 RAPIQVCSEEEFLRDVMQFLILRGH---TRLVPQG-GLAEFPDAILNAKRLDLFNLYREV 487
R ++ E EFL D++ L LR R VP+G ++FP +N +LDLF LY V
Sbjct: 19 RRSLEALGEREFLDDLI--LTLRKDPALAREVPEGVDYSQFPQRTVNGFKLDLFGLYHNV 76
Query: 488 VSRGGFHV-GNG------INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
+RGGF V NG +W ++F M N T +R T VG+ L Y YLL YE AH
Sbjct: 77 CARGGFPVETNGSASKVPWSWSKEIFKGMANWTDNHRCTSVGHDLICSYRKYLLAYEQAH 136
Query: 541 -DDVD 544
+DVD
Sbjct: 137 PEDVD 141
>gi|303277379|ref|XP_003057983.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460640|gb|EEH57934.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1199
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 47/86 (54%), Gaps = 13/86 (15%)
Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGNG-------------INWKGQVFSKMRNHTL 514
FP LN +LDLF LYR VVSRGGF G +NW VF + N+T
Sbjct: 117 FPTRTLNGAKLDLFGLYRAVVSRGGFPSSKGGVNGGGGGDKREKLNWTRDVFPCLSNYTE 176
Query: 515 TNRMTGVGNTLKRHYETYLLEYELAH 540
+R T VG+ L HY+ YL EYELA+
Sbjct: 177 NHRATSVGHDLITHYQNYLCEYELAN 202
>gi|303278346|ref|XP_003058466.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459626|gb|EEH56921.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 569
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 77/190 (40%), Gaps = 29/190 (15%)
Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRL-----DLFNLYRE 486
R P+ CS E+F+ D +F RG + D + NA R D LYR+
Sbjct: 236 RPPLSECSPEDFMLDFFRFHESRGDDWEI-------VWDILDNALRRSGRPHDTLGLYRQ 288
Query: 487 VVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
V +RGGF + +VF + N+ + + T +GN L YE Y L YE+AH +
Sbjct: 289 VCARGGFISRESAKARIKMPEVFGHLFNYWIGHTYTDIGNNLLNVYERYFLPYEVAHPED 348
Query: 544 DGE-------CCLLCHSS-------AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTD 589
E C + H+ A W+ C CGE H C R +
Sbjct: 349 TPEQTTPCVACARVVHAGGDAVPMYAQSTWIECDACGERRHECCASRDDHVRSAERDGGR 408
Query: 590 GLEYVCPQCS 599
YVC +C+
Sbjct: 409 ATSYVCEKCA 418
>gi|412986545|emb|CCO14971.1| predicted protein [Bathycoccus prasinos]
Length = 995
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 48/82 (58%), Gaps = 5/82 (6%)
Query: 468 FPDAILNAKRLDLFNLYREVVSRGGFHVGN-----GINWKGQVFSKMRNHTLTNRMTGVG 522
+P LN +LDLF +YR+VV+RGG+ INW ++F K+ N T T++ T VG
Sbjct: 8 WPAKTLNGYKLDLFGIYRQVVARGGWGEDEITCKRKINWSREIFPKLANFTSTHKATSVG 67
Query: 523 NTLKRHYETYLLEYELAHDDVD 544
+ L Y+ YL YE + DVD
Sbjct: 68 HDLITSYKNYLHSYERKYADVD 89
>gi|308810917|ref|XP_003082767.1| fiber protein-like (ISS) [Ostreococcus tauri]
gi|116061236|emb|CAL56624.1| fiber protein-like (ISS) [Ostreococcus tauri]
Length = 327
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 86/203 (42%), Gaps = 25/203 (12%)
Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAIL--NAKRLDLFNLYREVVS 489
R ++ CS+E+F+ D M+F RG L D L +D +LYR + +
Sbjct: 56 RPALRECSQEDFMLDFMRFHHARGRDWERCWSNL----DRALARTGAPMDCMHLYRMICA 111
Query: 490 RGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH-DDVDG 545
GGF + ++F M N+ + MT VGN L YE Y EYELAH DDV
Sbjct: 112 LGGFVNRESAKTRLSMTEIFKLMFNYYENHTMTDVGNRLLNAYENYFWEYELAHEDDVSR 171
Query: 546 ECCLLC-----HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDG--LEYVCPQC 598
C C HSS +G C IC H GC + K G + +VC C
Sbjct: 172 GTCRSCGGGIFHSSDSGLLHVCKICHANYHRGCQPPERFPETFGTGKDAGCSMHFVCWAC 231
Query: 599 S--------VTNFKKKSQKTSNG 613
+ V +FK + +T +
Sbjct: 232 TRSRPAEAGVESFKVRDAQTVDA 254
>gi|145353421|ref|XP_001421012.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581248|gb|ABO99305.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 55/123 (44%), Gaps = 23/123 (18%)
Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHT---------RLVPQGGLAEFPDAILNAKRLDLFN 482
R + CS E+F D+M+F RG R+V + G LD +
Sbjct: 80 RPALAACSLEDFTFDLMRFHQERGRDWEQCWANLDRVVGRTGAP-----------LDCLH 128
Query: 483 LYREVVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELA 539
LYR+V S GGF + ++F M NH + MT GN + YE Y LEYE A
Sbjct: 129 LYRQVCSLGGFASRTSAKERIKAAEIFRNMFNHYENHTMTDCGNRMLTAYENYFLEYEAA 188
Query: 540 HDD 542
H++
Sbjct: 189 HEE 191
>gi|145357351|ref|XP_001422883.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583127|gb|ABP01242.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 281
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 55/123 (44%), Gaps = 23/123 (18%)
Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHT---------RLVPQGGLAEFPDAILNAKRLDLFN 482
R + CS E+F D+M+F RG R+V + G LD +
Sbjct: 33 RPALAACSLEDFTFDLMRFHQERGRDWEQCWANLDRVVGRTGAP-----------LDCLH 81
Query: 483 LYREVVSRGGFHVGNGINWK---GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELA 539
LYR+V S GGF + ++F M NH + MT GN + YE Y LEYE A
Sbjct: 82 LYRQVCSLGGFASRTSAKERIKAAEIFRNMFNHYENHTMTDCGNRMLTAYENYFLEYEAA 141
Query: 540 HDD 542
H++
Sbjct: 142 HEE 144
>gi|194758138|ref|XP_001961319.1| GF11056 [Drosophila ananassae]
gi|190622617|gb|EDV38141.1| GF11056 [Drosophila ananassae]
Length = 1718
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 58/121 (47%), Gaps = 18/121 (14%)
Query: 418 ISLNPLPLK-KHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK 476
I L P+P K + AP EEF RD+ QF RG P A+ +N K
Sbjct: 52 IQLPPVPEKIEEQPASAP------EEFWRDLQQFHERRG----TPMTQTAK-----INGK 96
Query: 477 RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
++DL+ LY+EV RGGF+ N + +V+S + TL R +K Y YL +Y
Sbjct: 97 QVDLYKLYQEVTERGGFNKVNLRDEWDEVYSAL--ETLRERCVNGTAGIKHIYRRYLDKY 154
Query: 537 E 537
E
Sbjct: 155 E 155
>gi|194864076|ref|XP_001970758.1| GG23207 [Drosophila erecta]
gi|190662625|gb|EDV59817.1| GG23207 [Drosophila erecta]
Length = 1696
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 16/129 (12%)
Query: 409 SSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEF 468
++ +A+ + L LP K +P EEF RD+ QF RG P
Sbjct: 40 TTPVRARNPVQLQILPEKVEEMPASP-----PEEFWRDLQQFHERRGTPLTQP------- 87
Query: 469 PDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRH 528
A ++ K++DL+ LY EV RGGF+ N + +V+S M TL R +K
Sbjct: 88 --ARISGKQVDLYKLYNEVTERGGFNKVNMRDEWDEVYSSM--ETLRERCVNGTAGIKHI 143
Query: 529 YETYLLEYE 537
Y YL +YE
Sbjct: 144 YRRYLDKYE 152
>gi|195474131|ref|XP_002089345.1| GE19061 [Drosophila yakuba]
gi|194175446|gb|EDW89057.1| GE19061 [Drosophila yakuba]
Length = 1695
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 11/97 (11%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
EEF RD+ QF RG P A ++ K++DL+ LY EV RGGF+ N +
Sbjct: 67 EEFWRDLQQFHERRGTPLTQP---------ARISGKQVDLYKLYNEVTERGGFNKVNMRD 117
Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V+S M TL R +K Y YL +YE
Sbjct: 118 EWDEVYSAM--ETLRERCVNGTAGIKHIYRRYLDKYE 152
>gi|198461268|ref|XP_001361964.2| GA17110 [Drosophila pseudoobscura pseudoobscura]
gi|198137288|gb|EAL26543.2| GA17110 [Drosophila pseudoobscura pseudoobscura]
Length = 1695
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 61/141 (43%), Gaps = 20/141 (14%)
Query: 397 AGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGH 456
A TP+ R ++YQ Q P P P + EEF RD+ QF H
Sbjct: 44 ATTTPLRAR----NAYQQQ---PQTPAPFTPENTAGKP--ASAPEEFWRDLQQF-----H 89
Query: 457 TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTN 516
R G A A ++ K +DL+ LY EV RGGF+ N + +V+S M TL
Sbjct: 90 ER----RGTALTHAAKISGKHVDLYKLYTEVTDRGGFNKVNMRDEWDEVYSAM--ETLRE 143
Query: 517 RMTGVGNTLKRHYETYLLEYE 537
R +K Y YL +YE
Sbjct: 144 RCVNGTAGIKHIYRRYLDKYE 164
>gi|195580956|ref|XP_002080300.1| GD10411 [Drosophila simulans]
gi|194192309|gb|EDX05885.1| GD10411 [Drosophila simulans]
Length = 1684
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 11/97 (11%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
EEF RD+ QF RG P A ++ K +DL+ LY EV RGGF+ +
Sbjct: 67 EEFWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRD 117
Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V+S M TL R ++K Y YL +YE
Sbjct: 118 EWDEVYSAM--DTLRERCVNGTASIKHIYRRYLDKYE 152
>gi|195331853|ref|XP_002032613.1| GM20880 [Drosophila sechellia]
gi|194124583|gb|EDW46626.1| GM20880 [Drosophila sechellia]
Length = 1692
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 11/97 (11%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
EEF RD+ QF RG P A ++ K +DL+ LY EV RGGF+ +
Sbjct: 67 EEFWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRD 117
Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V+S M TL R ++K Y YL +YE
Sbjct: 118 EWDEVYSAM--DTLRERCVNGTASIKHIYRRYLDKYE 152
>gi|195171065|ref|XP_002026331.1| GL20250 [Drosophila persimilis]
gi|194111233|gb|EDW33276.1| GL20250 [Drosophila persimilis]
Length = 376
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 61/141 (43%), Gaps = 20/141 (14%)
Query: 397 AGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGH 456
A TP+ R ++YQ Q P P P + EEF RD+ QF H
Sbjct: 44 ATTTPLRAR----NAYQQQ---PQTPAPFTPENTAGKP--ASAPEEFWRDLQQF-----H 89
Query: 457 TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTN 516
R G A A ++ K +DL+ LY EV RGGF+ N + +V+S M TL
Sbjct: 90 ERR----GTALTHAAKISGKHVDLYKLYTEVTDRGGFNKVNMRDEWDEVYSAM--ETLRE 143
Query: 517 RMTGVGNTLKRHYETYLLEYE 537
R +K Y YL +YE
Sbjct: 144 RCVNGTAGIKHIYRRYLDKYE 164
>gi|303289831|ref|XP_003064203.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454519|gb|EEH51825.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 564
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 35/69 (50%), Gaps = 5/69 (7%)
Query: 476 KRLDLFNLYREVVSRGGFHVGNGINWKG----QVFSKMRNHTLTNRMTGVGNTLKRHYET 531
K D + LYR V +RGGF V K +VF +M NH + T +G L YE
Sbjct: 121 KPFDTYGLYRAVTNRGGF-VERPTARKNLSMVEVFREMSNHYDGHTYTDIGTLLLNTYEK 179
Query: 532 YLLEYELAH 540
Y LEYE AH
Sbjct: 180 YFLEYERAH 188
>gi|195124688|ref|XP_002006823.1| GI18380 [Drosophila mojavensis]
gi|193911891|gb|EDW10758.1| GI18380 [Drosophila mojavensis]
Length = 1765
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 15/116 (12%)
Query: 426 KKHGCGRAP----IQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLF 481
+K G AP Q+ + +EF RD+ QF RG P A+ ++ + +DL+
Sbjct: 65 EKSQAGEAPGAEEQQLSAPDEFWRDLHQFHERRG----TPLTQAAK-----ISGQHVDLY 115
Query: 482 NLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
LY+EV RGGF+ N + +V+S + TL +R +K Y YL +YE
Sbjct: 116 RLYQEVTERGGFNKVNMRDEWDEVYSAL--DTLRDRCVNGTAGIKHIYRRYLDKYE 169
>gi|384251312|gb|EIE24790.1| hypothetical protein COCSUDRAFT_62200 [Coccomyxa subellipsoidea
C-169]
Length = 88
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREV 487
CSE +F+ D+ F R ++ P+ PDAILN RLDL+NLY+E
Sbjct: 23 CSEAQFMEDLCAFHCERNGKKVTPETS----PDAILNGSRLDLYNLYKET 68
>gi|195380922|ref|XP_002049205.1| GJ21457 [Drosophila virilis]
gi|194144002|gb|EDW60398.1| GJ21457 [Drosophila virilis]
Length = 1779
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 11/102 (10%)
Query: 436 QVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV 495
Q+ + EEF RD+ QF RG P A+ ++ + +DL+ LY+EV RGGF+
Sbjct: 92 QLRAPEEFWRDLHQFHERRG----TPLTQAAK-----ISGQHVDLYRLYQEVTERGGFNK 142
Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
N + +V+S + TL +R +K Y YL +YE
Sbjct: 143 VNMRDEWDEVYSAL--DTLRDRCVNGTAGIKHIYRRYLDKYE 182
>gi|253741679|gb|EES98544.1| ARID1 AT-rich interaction domain protein [Giardia intestinalis ATCC
50581]
Length = 467
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 6/66 (9%)
Query: 472 ILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
I+ K+LD++ L+R V +RGG N I WK ++ K+ L +T VG TL+ YE+
Sbjct: 37 IVGHKQLDMYQLFRAVQARGG--AKNVIQWK-EIGKKL---GLPASVTNVGYTLRTKYES 90
Query: 532 YLLEYE 537
Y+L YE
Sbjct: 91 YILPYE 96
>gi|19921692|ref|NP_610216.1| brahma associated protein 170kD [Drosophila melanogaster]
gi|7302272|gb|AAF57363.1| brahma associated protein 170kD [Drosophila melanogaster]
Length = 1688
Score = 46.2 bits (108), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 44/95 (46%), Gaps = 11/95 (11%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
F RD+ QF RG P A ++ K +DL+ LY EV RGGF+ +
Sbjct: 69 FWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRDEW 119
Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V+S M TL R ++K Y YL +YE
Sbjct: 120 DEVYSAM--ETLRERCVNGTASIKHIYRRYLDKYE 152
>gi|432885673|ref|XP_004074709.1| PREDICTED: uncharacterized protein LOC101173321 [Oryzias latipes]
Length = 525
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 14/107 (13%)
Query: 432 RAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRG 491
R ++ +EE+FL+D+ F+ R + P+ L K++DLF +++ V G
Sbjct: 12 RDQMEDITEEKFLKDLYIFMKKRD-------TPIERIPN--LGFKQIDLFVMFKTVNDLG 62
Query: 492 GFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
G+H V + WK QV++ + + R T +RHYE LL YE
Sbjct: 63 GYHKVTSHQLWK-QVYNILGGNP---RSTSAATCTRRHYERLLLPYE 105
>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 6753
Score = 45.8 bits (107), Expect = 0.070, Method: Composition-based stats.
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 6/66 (9%)
Query: 548 CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
C++C+ D WV C +C W H CD+ + + ++Y CP CS+ F
Sbjct: 1764 CIMCNEKYDEDDSKKWVQCDVCKFWIHLSCDKNESRNI--ETLSNKNIDYKCPTCSIGTF 1821
Query: 604 KKKSQK 609
K ++
Sbjct: 1822 HDKIER 1827
>gi|383856249|ref|XP_003703622.1| PREDICTED: uncharacterized protein LOC100880603 [Megachile
rotundata]
Length = 1208
Score = 45.4 bits (106), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 51/113 (45%), Gaps = 14/113 (12%)
Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
E++FL DV F+ RG P G + +L +++DLF Y +V GG+ V
Sbjct: 314 AEEKKFLTDVQNFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQLLGGYDSVS 364
Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 549
G WK ++ + +T + T +RHYE LL YE D + L
Sbjct: 365 AGRLWK-TIYDDIGGNTGS---TSAATITRRHYERLLLPYERYQRGEDAKVKL 413
>gi|327272142|ref|XP_003220845.1| PREDICTED: lysine-specific demethylase 5A-like [Anolis
carolinensis]
Length = 1695
Score = 45.4 bits (106), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL++L + V S+GGF V +
Sbjct: 93 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYSLSKIVASKGGFEVVS---- 138
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G+ LK HYE L YEL V
Sbjct: 139 KEKKWSKVASRLAYMPGKATGSLLKAHYERILYPYELFQSGV 180
>gi|74193153|dbj|BAE20593.1| unnamed protein product [Mus musculus]
Length = 215
Score = 45.4 bits (106), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|348552013|ref|XP_003461823.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A-like
[Cavia porcellus]
Length = 1635
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF V
Sbjct: 164 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEVVT---- 209
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 210 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 251
>gi|350414064|ref|XP_003490196.1| PREDICTED: hypothetical protein LOC100746223 [Bombus impatiens]
Length = 1200
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 437 VCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
E++FL DV F+ RG P G + +L +++DLF Y +V GG+ V
Sbjct: 312 TAEEKKFLTDVQNFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQLLGGYDSV 362
Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
G WK ++ + +T + T +RHYE LL YE
Sbjct: 363 SAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 400
>gi|340717787|ref|XP_003397357.1| PREDICTED: hypothetical protein LOC100652122 [Bombus terrestris]
Length = 1202
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 437 VCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
E++FL DV F+ RG P G + +L +++DLF Y +V GG+ V
Sbjct: 312 TAEEKKFLTDVQNFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQLLGGYDSV 362
Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
G WK ++ + +T + T +RHYE LL YE
Sbjct: 363 SAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 400
>gi|195429395|ref|XP_002062748.1| GK19622 [Drosophila willistoni]
gi|194158833|gb|EDW73734.1| GK19622 [Drosophila willistoni]
Length = 1685
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 11/97 (11%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
+EF RD+ QF H R G A ++ + +DL+ LY+EV RGGF N +
Sbjct: 72 DEFWRDLHQF-----HER----RGTPITQTAKISGQHVDLYKLYQEVTERGGFSKVNMRD 122
Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V++ M TL R +K Y YL +YE
Sbjct: 123 EWDEVYTAM--ETLRERCVNGTAGIKHIYRRYLDKYE 157
>gi|380015168|ref|XP_003691580.1| PREDICTED: uncharacterized protein LOC100865061 [Apis florea]
Length = 1122
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 52/119 (43%), Gaps = 23/119 (19%)
Query: 429 GCGRAPIQV---------CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLD 479
GR P V E++F+ DV F+ RG P G + +L +++D
Sbjct: 227 SVGRPPSSVVRRSERKTSAEEKKFITDVQNFMNSRG----TPVGKMP-----LLGYRQID 277
Query: 480 LFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
LF Y +V GG+ V G WK ++ + +T + T +RHYE LL YE
Sbjct: 278 LFLFYTKVQMLGGYDSVSAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 332
>gi|328865792|gb|EGG14178.1| ARID/BRIGHT DNA binding domain-containing protein [Dictyostelium
fasciculatum]
Length = 978
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 43/97 (44%), Gaps = 12/97 (12%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
E F + QFL +G L FP + + LDL+ L+ EV RGG H N
Sbjct: 369 EVFASQLCQFLEEKG-------TPLEHFPK--YDGRDLDLYKLFVEVNRRGGLHEVTKTN 419
Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+ + ++ + R NTLK HY YLL YE
Sbjct: 420 SWHDILTALKTTDICQRPI---NTLKHHYNDYLLAYE 453
>gi|392347603|ref|XP_002729471.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Rattus norvegicus]
Length = 1722
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|344278059|ref|XP_003410814.1| PREDICTED: lysine-specific demethylase 5A [Loxodonta africana]
Length = 1693
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P I+ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------IVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|46409136|gb|AAS93725.1| RE64461p [Drosophila melanogaster]
Length = 902
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 44/95 (46%), Gaps = 11/95 (11%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
F RD+ QF RG P A ++ K +DL+ LY EV RGGF+ +
Sbjct: 69 FWRDLQQFHERRGTPLTQP---------ARISGKHVDLYKLYNEVTERGGFNKVTMRDEW 119
Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V+S M TL R ++K Y YL +YE
Sbjct: 120 DEVYSAM--ETLRERCVNGTASIKHIYRRYLDKYE 152
>gi|74150603|dbj|BAE32322.1| unnamed protein product [Mus musculus]
gi|74227277|dbj|BAE21737.1| unnamed protein product [Mus musculus]
Length = 264
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|148667219|gb|EDK99635.1| mCG129751 [Mus musculus]
Length = 1710
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|392340216|ref|XP_002726529.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Rattus norvegicus]
Length = 1639
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|14715099|gb|AAH10717.1| Jarid1a protein [Mus musculus]
Length = 621
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|226958545|ref|NP_666109.2| lysine-specific demethylase 5A [Mus musculus]
gi|150383496|sp|Q3UXZ9.2|KDM5A_MOUSE RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
demethylase JARID1A; AltName: Full=Jumonji/ARID
domain-containing protein 1A; AltName:
Full=Retinoblastoma-binding protein 2; Short=RBBP-2
Length = 1690
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|354476339|ref|XP_003500382.1| PREDICTED: lysine-specific demethylase 5A [Cricetulus griseus]
Length = 1671
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 70 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 115
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 116 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 157
>gi|149049582|gb|EDM02036.1| rCG29703 [Rattus norvegicus]
Length = 1526
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|195056152|ref|XP_001994976.1| GH22895 [Drosophila grimshawi]
gi|193899182|gb|EDV98048.1| GH22895 [Drosophila grimshawi]
Length = 1762
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 49/97 (50%), Gaps = 11/97 (11%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 500
+EF RD+ QF RG P A+ ++ + +DL+ LY+EV RGGF+ N +
Sbjct: 95 DEFWRDLHQFHERRG----TPLTQAAK-----ISGQHVDLYRLYQEVTERGGFNKVNVRD 145
Query: 501 WKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+V+S + TL +R +K Y YL +YE
Sbjct: 146 EWDEVYSAL--DTLRDRCVNGTAGIKHIYRRYLDKYE 180
>gi|307187859|gb|EFN72795.1| AT-rich interactive domain-containing protein 5B [Camponotus
floridanus]
Length = 1073
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 437 VCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
E++F+ DV F+ RG P G + +L +++DLF Y +V GG+ V
Sbjct: 174 TAEEKKFITDVQSFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQMLGGYDSV 224
Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
G WK ++ + +T + T +RHYE LL YE
Sbjct: 225 SAGRLWK-TIYDDIGGNTGS---TSAATITRRHYERLLLPYE 262
>gi|344253284|gb|EGW09388.1| Lysine-specific demethylase 5A [Cricetulus griseus]
Length = 1608
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 7 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 52
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 53 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 94
>gi|328776925|ref|XP_003249244.1| PREDICTED: hypothetical protein LOC100576405 [Apis mellifera]
Length = 1273
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 23/118 (19%)
Query: 430 CGRAPIQV---------CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
GR P V E++F+ DV F+ RG P G + +L +++DL
Sbjct: 296 VGRPPSSVVRRSERKTSAEEKKFIADVQNFMNSRG----TPVGKMP-----LLGYRQIDL 346
Query: 481 FNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
F Y +V GG+ V G WK ++ + +T + T +RHYE LL YE
Sbjct: 347 FLFYTKVQMLGGYDSVSAGRLWKN-IYDDIGGNTGS---TSAATITRRHYERLLLPYE 400
>gi|402884752|ref|XP_003905839.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Papio anubis]
Length = 1842
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 242 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 287
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 288 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 329
>gi|332024185|gb|EGI64399.1| AT-rich interactive domain-containing protein 5B [Acromyrmex
echinatior]
Length = 1093
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 14/101 (13%)
Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
E++F+ DV F+ RG P G + +L +++DLF Y +V GG+ V
Sbjct: 173 AEEKKFITDVQSFMNSRG----TPVGKMP-----LLGYRQIDLFLFYTKVQMLGGYDSVS 223
Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
G WK ++ + +T + T +RHYE LL YE
Sbjct: 224 AGRLWK-TIYDDIGGNTGS---TSAATITRRHYERLLLPYE 260
>gi|426227070|ref|XP_004007650.1| PREDICTED: lysine-specific demethylase 5A [Ovis aries]
Length = 1793
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 192 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 237
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 238 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 279
>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
Length = 1055
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 17/140 (12%)
Query: 462 QGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV-GNGINWKGQVFSKMRNHTLTNRMTG 520
+ ++ PDA + + L+ ++ ++VV++G H G G+ + K++ T ++
Sbjct: 357 EADVSGLPDASTSNQDLEYYS-EKKVVNKGSRHCDGCGLLSLCKTLKKVKGPTSATQLLC 415
Query: 521 VGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLG 580
R + Y G C + H S G+WV C C W H CD+
Sbjct: 416 KHCHKLRQSKQYC-----------GVCKKIWHHSDGGNWVCCDGCNVWVHAECDKISS-K 463
Query: 581 AFKDYAKTDGLEYVCPQCSV 600
FKD A + EY CP C V
Sbjct: 464 LFKDLAHS---EYYCPDCKV 480
>gi|417406677|gb|JAA49985.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
Length = 1690
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|329663691|ref|NP_001192554.1| lysine-specific demethylase 5A [Bos taurus]
gi|296487060|tpg|DAA29173.1| TPA: JARID1A variant protein-like [Bos taurus]
Length = 1690
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|359323071|ref|XP_854690.3| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Canis lupus familiaris]
Length = 1688
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|197246491|gb|AAI69044.1| Jarid1a protein [Rattus norvegicus]
Length = 579
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|410963533|ref|XP_003988319.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Felis catus]
Length = 1690
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|297261517|ref|XP_002798486.1| PREDICTED: lysine-specific demethylase 5A-like [Macaca mulatta]
Length = 1687
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|417406655|gb|JAA49977.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
Length = 1649
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|194211564|ref|XP_001914992.1| PREDICTED: lysine-specific demethylase 5A isoform 1 [Equus
caballus]
Length = 1692
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 91 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 136
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 137 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 178
>gi|403286751|ref|XP_003934640.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Saimiri boliviensis boliviensis]
Length = 1952
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 351 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 396
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 397 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 438
>gi|291392891|ref|XP_002712829.1| PREDICTED: retinoblastoma binding protein 2 isoform 1 [Oryctolagus
cuniculus]
Length = 1690
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|301756621|ref|XP_002914152.1| PREDICTED: lysine-specific demethylase 5A-like [Ailuropoda
melanoleuca]
Length = 1690
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|431892152|gb|ELK02599.1| Lysine-specific demethylase 5A [Pteropus alecto]
Length = 1692
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|126340225|ref|XP_001373115.1| PREDICTED: lysine-specific demethylase 5A [Monodelphis domestica]
Length = 1689
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|290990959|ref|XP_002678103.1| predicted protein [Naegleria gruberi]
gi|284091714|gb|EFC45359.1| predicted protein [Naegleria gruberi]
Length = 702
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 10/98 (10%)
Query: 440 EEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGI 499
E++FL+ +++ L +T P P +L + LD+F+LY V+ +GGF +
Sbjct: 185 EKQFLQLFFEYM-LDKNTEDFPS------PVPMLRSVPLDIFHLYGRVLEKGGFLYMDSG 237
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
+W+ VF K+RN+ M LK YE YL +E
Sbjct: 238 DWE-LVFKKIRNY--DELMKKPEEMLKSMYEQYLKNFE 272
>gi|435778|gb|AAB28544.1| retinoblastoma binding protein 2 [Homo sapiens]
Length = 1722
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|397499371|ref|XP_003820427.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A [Pan
paniscus]
Length = 1717
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|332249134|ref|XP_003273720.1| PREDICTED: lysine-specific demethylase 5A [Nomascus leucogenys]
Length = 1690
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|110618244|ref|NP_001036068.1| lysine-specific demethylase 5A [Homo sapiens]
gi|215274124|sp|P29375.3|KDM5A_HUMAN RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
demethylase JARID1A; AltName: Full=Jumonji/ARID
domain-containing protein 1A; AltName:
Full=Retinoblastoma-binding protein 2; Short=RBBP-2
gi|168275608|dbj|BAG10524.1| histone demethylase JARID1A [synthetic construct]
Length = 1690
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|355563873|gb|EHH20373.1| Lysine-specific demethylase 5A [Macaca mulatta]
gi|383420467|gb|AFH33447.1| lysine-specific demethylase 5A [Macaca mulatta]
Length = 1690
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|71122345|gb|AAH99835.1| Jarid1a protein [Rattus norvegicus]
Length = 1099
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|410354793|gb|JAA44000.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1688
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|355785795|gb|EHH65978.1| Lysine-specific demethylase 5A [Macaca fascicularis]
Length = 1690
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|332838263|ref|XP_003313474.1| PREDICTED: lysine-specific demethylase 5A [Pan troglodytes]
Length = 1688
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|68533053|dbj|BAE06081.1| JARID1A variant protein [Homo sapiens]
Length = 1731
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 130 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 175
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 176 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 217
>gi|410261294|gb|JAA18613.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1690
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|351710793|gb|EHB13712.1| Lysine-specific demethylase 5A [Heterocephalus glaber]
Length = 1694
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|410223430|gb|JAA08934.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
gi|410300510|gb|JAA28855.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
gi|410354795|gb|JAA44001.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1690
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|449269524|gb|EMC80287.1| Lysine-specific demethylase 5A, partial [Columba livia]
Length = 1650
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G +P ++ K LDL+ L + V S+GGF V
Sbjct: 48 DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVASKGGFEVVT---- 93
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 94 KEKKWSKVASRLGYLPGKGTGSLLKSHYERLLYPYELFQSGV 135
>gi|74201318|dbj|BAE26113.1| unnamed protein product [Mus musculus]
Length = 1094
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|51593641|gb|AAH80691.1| Jarid1a protein, partial [Mus musculus]
Length = 1102
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|74141547|dbj|BAE38548.1| unnamed protein product [Mus musculus]
Length = 1093
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|355697494|gb|AES00689.1| lysine -specific demethylase 5A [Mustela putorius furo]
Length = 1504
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 109 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 154
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 155 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 196
>gi|308162935|gb|EFO65302.1| Hypothetical protein GLP15_4319 [Giardia lamblia P15]
Length = 618
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 12/63 (19%)
Query: 545 GECCLLCHS------SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
+ C+LC + GDW+ C +C EW H CD + + + Y CP+C
Sbjct: 65 SDLCVLCSNHKCEFIETGGDWIQCDLCKEWVHQQCD------GIYNALLAEKIHYTCPRC 118
Query: 599 SVT 601
V
Sbjct: 119 RVV 121
>gi|395743750|ref|XP_002822777.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A,
partial [Pongo abelii]
Length = 1613
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 129 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 174
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 175 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 216
>gi|380798887|gb|AFE71319.1| lysine-specific demethylase 5A, partial [Macaca mulatta]
Length = 1653
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 52 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 97
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 98 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 139
>gi|119609377|gb|EAW88971.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_b
[Homo sapiens]
gi|162319392|gb|AAI56462.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
gi|225000544|gb|AAI72533.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
Length = 1641
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|159112278|ref|XP_001706368.1| Hypothetical protein GL50803_8381 [Giardia lamblia ATCC 50803]
gi|157434464|gb|EDO78694.1| hypothetical protein GL50803_8381 [Giardia lamblia ATCC 50803]
Length = 618
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 12/62 (19%)
Query: 545 GECCLLCHS------SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
+ C+LC + GDW+ C +C EW H CD + + + Y CP+C
Sbjct: 65 SDLCILCSNHKCELIETGGDWIQCDLCKEWLHQQCD------GIYNALLAEKIHYTCPRC 118
Query: 599 SV 600
V
Sbjct: 119 RV 120
>gi|427793739|gb|JAA62321.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 374
Score = 43.5 bits (101), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 48/98 (48%), Gaps = 14/98 (14%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGI 499
+EFL D+ F+ RG P + P I+ + LDLF LYR VVSRGG V N
Sbjct: 100 KEFLDDLFSFMQKRG----TP---VNRIP--IMAKQVLDLFELYRLVVSRGGLVEVINKK 150
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ +K N L + +T TL+ Y YL YE
Sbjct: 151 IWR--EITKGLN--LPSSITSAAFTLRTQYMKYLYPYE 184
>gi|291392893|ref|XP_002712830.1| PREDICTED: retinoblastoma binding protein 2 isoform 2 [Oryctolagus
cuniculus]
Length = 1648
Score = 43.5 bits (101), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|253748038|gb|EET02406.1| Hypothetical protein GL50581_289 [Giardia intestinalis ATCC 50581]
Length = 612
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 12/62 (19%)
Query: 545 GECCLLCHS------SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
+ C LC + GDW+ C +C EW H CD + + ++Y CP+C
Sbjct: 65 SDLCALCSNHKCEFIETEGDWIQCDLCKEWVHQQCD------GIYNALLAEKIQYTCPRC 118
Query: 599 SV 600
V
Sbjct: 119 RV 120
>gi|255083290|ref|XP_002504631.1| histone demethylase [Micromonas sp. RCC299]
gi|226519899|gb|ACO65889.1| histone demethylase [Micromonas sp. RCC299]
Length = 827
Score = 43.5 bits (101), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 41/64 (64%), Gaps = 4/64 (6%)
Query: 478 LDLFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 536
LDL ++YREVV+RGG+ V + WK +V + + H LT + T G ++++YE LLEY
Sbjct: 112 LDLGHVYREVVARGGYKAVCDNRRWK-EVCATL-GHDLTGQ-TSAGFQMRQNYERCLLEY 168
Query: 537 ELAH 540
E A
Sbjct: 169 EFAE 172
>gi|427796997|gb|JAA63950.1| Putative dna-binding bright/brcaa1/rbp1, partial [Rhipicephalus
pulchellus]
Length = 349
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 48/98 (48%), Gaps = 14/98 (14%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGI 499
+EFL D+ F+ RG P + P I+ + LDLF LYR VVSRGG V N
Sbjct: 75 KEFLDDLFSFMQKRG----TP---VNRIP--IMAKQVLDLFELYRLVVSRGGLVEVINKK 125
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ +K N L + +T TL+ Y YL YE
Sbjct: 126 IWR--EITKGLN--LPSSITSAAFTLRTQYMKYLYPYE 159
>gi|189234904|ref|XP_968114.2| PREDICTED: similar to AT-rich interactive domain-containing protein
5B (ARID domain-containing protein 5B) (Mrf1-like)
(Modulator recognition factor 2) (MRF-2) [Tribolium
castaneum]
gi|270002738|gb|EEZ99185.1| serine protease H6 [Tribolium castaneum]
Length = 1176
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 10/101 (9%)
Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
+E +D ++ L + T P G + L K LDL++ Y +V GG+ V
Sbjct: 306 ATESNESKDFIKELTVFMKTNYTPIGKIPS-----LGYKELDLYSFYTKVQKLGGYDSVT 360
Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
WK +F M H + T ++RHYE +LL YE
Sbjct: 361 ANRLWKS-IFDDMSGHASS---TSAATVIRRHYERFLLPYE 397
>gi|119609376|gb|EAW88970.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_a
[Homo sapiens]
Length = 1315
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|29387265|gb|AAH48307.1| JARID1A protein [Homo sapiens]
gi|31753201|gb|AAH53893.1| JARID1A protein [Homo sapiens]
gi|83406060|gb|AAI10917.1| JARID1A protein [Homo sapiens]
Length = 1102
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|432091446|gb|ELK24528.1| Lysine-specific demethylase 5A [Myotis davidii]
Length = 1097
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|122114608|ref|NP_001073670.1| uncharacterized protein LOC572528 [Danio rerio]
gi|120537766|gb|AAI29398.1| Zgc:158706 [Danio rerio]
Length = 603
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 16/102 (15%)
Query: 438 CSEEEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-V 495
SEE FL+D+ F+ RG +P G K++D+F +Y+ V GG+ V
Sbjct: 18 MSEERFLKDLYLFMKDRGTPIERIPHLGF----------KQIDMFLMYKTVKELGGYQQV 67
Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
WK +V++ + + R T +RHYE LL YE
Sbjct: 68 TTQQLWK-KVYNILGGNP---RSTSAATCTRRHYEKLLLPYE 105
>gi|224096413|ref|XP_002198181.1| PREDICTED: lysine-specific demethylase 5A [Taeniopygia guttata]
Length = 1690
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G +P ++ K LDL+ L + V ++GGF V
Sbjct: 89 DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVANKGGFEVVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVASRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|429964439|gb|ELA46437.1| hypothetical protein VCUG_02073 [Vavraia culicis 'floridensis']
Length = 816
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 59/128 (46%), Gaps = 16/128 (12%)
Query: 439 SEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGN 497
+E+EF R +++++ + +P + K +++++LYR V GG +V N
Sbjct: 218 TEQEFERAYKEYVLM--DDKFIPDSEKNSANIIVPGTKEINMYSLYRIVSENGGMENVTN 275
Query: 498 GINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE---LAHDDVDGECCLLCHSS 554
WK + MR T V T++ Y+ +L E+E A V+ EC +C
Sbjct: 276 EQKWKSLFYGAMRK-------TNVSYTVRTFYKKFLYEFEQYRRARHYVNCECTGMC--- 325
Query: 555 AAGDWVNC 562
D+V+C
Sbjct: 326 ECNDFVDC 333
>gi|350415404|ref|XP_003490629.1| PREDICTED: protein dead ringer-like [Bombus impatiens]
Length = 651
Score = 42.7 bits (99), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 63/147 (42%), Gaps = 30/147 (20%)
Query: 394 HSSAGPTPVTHRKSLSSSYQAQ--QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFL 451
HS + T H++ S S++ Q Q+ +N P +K EFL D+ ++
Sbjct: 194 HSQSNSTGTNHQQQTSWSFEEQFKQLYEINDDPKRK--------------EFLDDLFSYM 239
Query: 452 ILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMR 510
RG + P I+ LDL+ LY V++RGG V N W+ ++ +
Sbjct: 240 QKRGTP-------INRLP--IMAKSVLDLYELYNLVIARGGLVDVINKKLWQ-EIIKGLH 289
Query: 511 NHTLTNRMTGVGNTLKRHYETYLLEYE 537
L + +T TL+ Y YL YE
Sbjct: 290 ---LPSSITSAAFTLRTQYMKYLYPYE 313
>gi|391337315|ref|XP_003743015.1| PREDICTED: lysine-specific demethylase 5A-like [Metaseiulus
occidentalis]
Length = 1479
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 48/97 (49%), Gaps = 16/97 (16%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VGNGINW 501
FL + +F LRG++ +P I+ + LDLF L+R V S GGF VG W
Sbjct: 85 FLDKIAKFWHLRGNSLKIP----------IVERRSLDLFKLHRIVESEGGFERVGKERKW 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
+V K+ L ++ V L+ HYE LL Y++
Sbjct: 135 -FRVVQKL-GLPLVKSLSTV---LRNHYERLLLPYDI 166
>gi|308159465|gb|EFO61994.1| ARID1 protein [Giardia lamblia P15]
Length = 462
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 6/66 (9%)
Query: 472 ILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
I+ ++LD++ L+R V +RGG N WK ++ K+ L +T G TL+ YE+
Sbjct: 37 IVGHRQLDMYQLFRAVQARGG--AKNVTQWK-EIGKKL---GLPASVTNAGYTLRTKYES 90
Query: 532 YLLEYE 537
Y+L YE
Sbjct: 91 YILPYE 96
>gi|354543992|emb|CCE40714.1| hypothetical protein CPAR2_107490 [Candida parapsilosis]
Length = 1720
Score = 42.4 bits (98), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 39/128 (30%)
Query: 436 QVCSEEEFLRDVMQFLILRGHTRLVPQGG---------------LAEFPDAILNAKRLDL 480
+V + +F RD+++F H G L++ P +++ + LDL
Sbjct: 177 EVLARLKFYRDLLEFHQRDQHYHFEDSEGTKKKEKNGASNLQSHLSKLP--MIDKRPLDL 234
Query: 481 FNLYREVVSRGGF----------HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYE 530
F LY+ V+ RGGF +G + +KG++ MT + ++LK Y+
Sbjct: 235 FKLYQSVIVRGGFIEVINKKLWAQIGRELGYKGKI------------MTSLSSSLKASYQ 282
Query: 531 TYLLEYEL 538
L YEL
Sbjct: 283 KILYPYEL 290
>gi|159117256|ref|XP_001708848.1| ARID1 AT-rich interaction domain [Giardia lamblia ATCC 50803]
gi|114803613|gb|ABI81170.1| ARID1 [Giardia intestinalis]
gi|157436962|gb|EDO81174.1| ARID1 AT-rich interaction domain protein [Giardia lamblia ATCC
50803]
Length = 469
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 6/66 (9%)
Query: 472 ILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
I+ ++LD++ L+R V +RGG N WK ++ K+ L +T G TL+ YE+
Sbjct: 37 IVGHRQLDMYQLFRAVQARGG--AKNVTQWK-EIGKKL---GLPASVTNAGYTLRTKYES 90
Query: 532 YLLEYE 537
Y+L YE
Sbjct: 91 YILPYE 96
>gi|326912327|ref|XP_003202505.1| PREDICTED: lysine-specific demethylase 5A-like [Meleagris
gallopavo]
Length = 1487
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G +P ++ K LDL+ L + V +GGF V
Sbjct: 68 DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVAGKGGFEVVT---- 113
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 114 KEKKWSKVASRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 155
>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 2678
Score = 42.0 bits (97), Expect = 0.88, Method: Composition-based stats.
Identities = 24/69 (34%), Positives = 30/69 (43%), Gaps = 6/69 (8%)
Query: 545 GECCLLCH----SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
G+ C +C SS GDW+ C C W H CD L + K+ + Y CP C
Sbjct: 767 GQYCAICRQIWSSSWDGDWLQCDTCRFWIHTECD--TNLINSIEVLKSPSVSYHCPVCRS 824
Query: 601 TNFKKKSQK 609
K K QK
Sbjct: 825 NEKKPKYQK 833
>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
Length = 2244
Score = 42.0 bits (97), Expect = 0.88, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 6/69 (8%)
Query: 545 GECCLLCH----SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
G+ C +C SS G+W+ C IC W H+ CD+ + Y+ L Y CP C
Sbjct: 609 GQFCSICRKIWTSSWEGEWLQCDICKFWVHYDCDKDLN-EPIEFYSNVKNL-YNCPACRS 666
Query: 601 TNFKKKSQK 609
+ K Q+
Sbjct: 667 NDNSVKYQR 675
>gi|403216349|emb|CCK70846.1| hypothetical protein KNAG_0F01780 [Kazachstania naganishii CBS 8797]
Length = 1421
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 7/115 (6%)
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 559
+W ++ ++ +TN +T V L ++ + LA D + C C +G
Sbjct: 1206 SWTTELNKCLKRQKITNILTLVTACLDLDFDCFTKRGSLAEKD--AKYCF-CRKGESGTM 1262
Query: 560 VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 614
+ C IC EW H C ++G K A T+ + +VC C++ + S++ Y
Sbjct: 1263 IECDICKEWYHTNCIGQKGS---KSTATTNSV-FVCSLCNINSNTVMSKRNGVNY 1313
>gi|118082971|ref|XP_416379.2| PREDICTED: lysine-specific demethylase 5A [Gallus gallus]
Length = 1691
Score = 42.0 bits (97), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G +P ++ K LDL+ L + V +GGF V
Sbjct: 89 DFLDQLAKFWELQGSNLKIP----------VVERKILDLYGLSKIVAGKGGFEVVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVASRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>gi|195434725|ref|XP_002065353.1| GK15405 [Drosophila willistoni]
gi|194161438|gb|EDW76339.1| GK15405 [Drosophila willistoni]
Length = 1900
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 14/96 (14%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
FL + +F L+G + +P ++ K LDL+ L+R V GG K
Sbjct: 268 FLDQIAKFWELQGASLKIP----------MVERKALDLYTLHRIVQEEGGMETAT----K 313
Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
+ ++K+ N VG TLK HYE L +E+
Sbjct: 314 ERKWAKVANRMQYPSSKSVGATLKAHYERILHPFEV 349
>gi|403344053|gb|EJY71365.1| DUF1423 domain containing protein [Oxytricha trifallax]
Length = 663
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 14/41 (34%), Positives = 20/41 (48%)
Query: 558 DWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
+WV C C +W H C+ G + + DG +Y CP C
Sbjct: 446 EWVQCENCNKWEHTDCEVINGFTELPEKLQQDGWKYFCPLC 486
>gi|340507679|gb|EGR33604.1| hypothetical protein IMG5_048210 [Ichthyophthirius multifiliis]
Length = 479
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 37/87 (42%), Gaps = 13/87 (14%)
Query: 476 KRLDLFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLL 534
+ LD F LY+ V+ RGG V N WK V + T TL+ HY+ YLL
Sbjct: 40 RELDFFKLYKSVIRRGGAQAVSNNKMWKEIV----NEFGFPSTCTSASFTLRNHYQKYLL 95
Query: 535 EYELAH--------DDVDGECCLLCHS 553
YE + ++ E + HS
Sbjct: 96 GYEFKYFFHQLQSTKNIKQESQIQSHS 122
>gi|412991505|emb|CCO16350.1| predicted protein [Bathycoccus prasinos]
Length = 436
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 6/76 (7%)
Query: 474 NAKRLDLFNLYREVVSRGGFHVGNGINWK-----GQVFSKMRNHTLTNRMTGVGNTLKRH 528
+ K DL Y EV RGGF N + K ++F M N+ + T +GN L
Sbjct: 288 SGKPFDLLTFYGEVCRRGGFG-RNRVEAKCRFSISKIFKVMFNYFDQHSYTDIGNKLFDV 346
Query: 529 YETYLLEYELAHDDVD 544
YE + L YE AH D
Sbjct: 347 YELFFLPYERAHSQED 362
>gi|395541541|ref|XP_003772701.1| PREDICTED: lysine-specific demethylase 5A-like, partial
[Sarcophilus harrisii]
Length = 200
Score = 41.2 bits (95), Expect = 1.4, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 65 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 110
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 111 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 152
>gi|208435489|pdb|2JXJ|A Chain A, Nmr Structure Of The Arid Domain From The Histone H3k4
Demethylase Rbp2
Length = 96
Score = 41.2 bits (95), Expect = 1.6, Method: Composition-based stats.
Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 14/97 (14%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 10 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 55
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
K + +SK+ + G G+ LK HYE L YEL
Sbjct: 56 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYEL 92
>gi|355753250|gb|EHH57296.1| AT-rich interactive domain-containing protein 3C [Macaca
fascicularis]
Length = 411
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 44/99 (44%), Gaps = 16/99 (16%)
Query: 441 EEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNG 498
+EFL D+ F+ RG VP I+ + LDL+ L+R V ++GG V N
Sbjct: 116 KEFLDDLFSFMQKRGTPVNRVP----------IMAKQVLDLYALFRLVTAKGGLVEVINR 165
Query: 499 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ R +L +T TL+ Y YL YE
Sbjct: 166 KVWR----EVTRGLSLPTTITSAAFTLRTQYMKYLYPYE 200
>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 989
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + H S G+WV C C W H CD+ FKD TD Y CP C
Sbjct: 375 GICKRIWHHSDGGNWVCCDGCNVWVHAECDKISS-KVFKDLENTD---YYCPDC 424
>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
Length = 1149
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + H S G+WV C C W H CD + FKD TD Y CP C
Sbjct: 323 GICKKIWHHSDGGNWVCCDGCNVWVHAECD-KISTEHFKDLENTD---YYCPDC 372
>gi|297821026|ref|XP_002878396.1| hypothetical protein ARALYDRAFT_907712 [Arabidopsis lyrata subsp.
lyrata]
gi|297324234|gb|EFH54655.1| hypothetical protein ARALYDRAFT_907712 [Arabidopsis lyrata subsp.
lyrata]
Length = 645
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
G C + H S GDWV C C W H GCD K + + + Y CP C V
Sbjct: 337 GICKRMWHPSDDGDWVCCDGCDVWVHAGCDNISN----KHFKELEHNNYYCPNCKV 388
>gi|356560272|ref|XP_003548417.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 954
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + H S G+WV C C W H CD+ FKD TD Y CP C
Sbjct: 337 GICKRIWHHSDGGNWVCCDGCNVWVHAECDKISS-KLFKDLENTD---YYCPDC 386
>gi|312075048|ref|XP_003140243.1| hypothetical protein LOAG_04658 [Loa loa]
Length = 388
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 75/194 (38%), Gaps = 43/194 (22%)
Query: 350 LNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLS 409
L A+ P PHT S++ + V AN P+ P + PT + R
Sbjct: 232 LPGASPNPWPHTP----------SQMSQGHRTPVPANRPITPTVNKRQDPTSIVDRL--- 278
Query: 410 SSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHT-RLVPQGGLAEF 468
+ ++NP + + F ++QF +G VPQ
Sbjct: 279 ----VGPVTAMNPQHI-----------MPERRAFFEKLVQFCEQQGEPITQVPQ------ 317
Query: 469 PDAILNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKR 527
++ + +DL LY V+ RGGF V WK QV ++ + + + G L+R
Sbjct: 318 ----VSKQTVDLHRLYLAVMKRGGFEQVTREKTWK-QVCTEANSEM--SESSAAGYQLRR 370
Query: 528 HYETYLLEYELAHD 541
HY+ YLL E D
Sbjct: 371 HYQKYLLGLECLED 384
>gi|241948971|ref|XP_002417208.1| conserved hypothetical protein [Candida dubliniensis CD36]
gi|223640546|emb|CAX44800.1| conserved hypothetical protein [Candida dubliniensis CD36]
Length = 1668
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 8/62 (12%)
Query: 539 AHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
D+V + +C G V C C EW H C KD + D +Y+CP C
Sbjct: 1365 TEDEVTPKYYCVCREYEYGTMVECDKCNEWYHVQC--------VKDVSNPDADKYICPTC 1416
Query: 599 SV 600
V
Sbjct: 1417 LV 1418
>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + H S G+WV C C W H CD+ FKD TD Y CP C
Sbjct: 373 GICKKIWHHSDGGNWVCCDGCNVWVHAECDKISS-KHFKDLENTD---YYCPDC 422
>gi|357436505|ref|XP_003588528.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355477576|gb|AES58779.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 973
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 5/61 (8%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVT-NF 603
G C + + S +G WV C C W H CD+ + FK+ TD Y CP C V NF
Sbjct: 408 GICKKVWNQSDSGSWVRCDGCKVWVHAECDKISSI-LFKNLGSTD---YFCPACKVKFNF 463
Query: 604 K 604
+
Sbjct: 464 E 464
>gi|357127667|ref|XP_003565500.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
[Brachypodium distachyon]
Length = 972
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 34/87 (39%), Gaps = 9/87 (10%)
Query: 516 NRMTGVGNTLKRHYETYLL----EYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHF 571
NR+T + K+ LL E L G C + H + G+WV C C W H
Sbjct: 331 NRLTSKISKKKKQEVEQLLCRHCEKLLQSKQYCGICKKIWHHTDGGNWVCCDECEIWVHV 390
Query: 572 GCDRRQGLGAFKDYAKTDGLEYVCPQC 598
CDR D + EY CP C
Sbjct: 391 ECDR-----TCSDLEDLENTEYFCPDC 412
>gi|403345100|gb|EJY71909.1| ARID/BRIGHT DNA binding domain containing protein [Oxytricha
trifallax]
Length = 673
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 39/86 (45%), Gaps = 9/86 (10%)
Query: 464 GLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VGNGINWKGQVFSKMRNHTLTNRMTGVG 522
G+ F + K LDL LY+ V+ RGG V N WK V + + T
Sbjct: 36 GVENFKVPSIGGKELDLCKLYKAVIQRGGSQRVSNNKLWKEIV----NEFEIPSSCTSAS 91
Query: 523 NTLKRHYETYLLEYE----LAHDDVD 544
TL+ HY LL+YE L H ++D
Sbjct: 92 FTLRNHYNKCLLQYEKKYFLGHSNLD 117
>gi|340386800|ref|XP_003391896.1| PREDICTED: hypothetical protein LOC100633834, partial [Amphimedon
queenslandica]
Length = 192
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 5/51 (9%)
Query: 548 CLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
CL+ ++ WV C C +W H C G K A +G +Y CP+C
Sbjct: 145 CLMKRNNPNSTWVLCEKCPQWVHIRC-----AGITKHKASKEGFKYFCPKC 190
>gi|145479355|ref|XP_001425700.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392772|emb|CAK58302.1| unnamed protein product [Paramecium tetraurelia]
Length = 570
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 19/102 (18%)
Query: 440 EEE--FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV-- 495
EEE F++ + +F RG T +PQ G + L++F LY+ V RGG V
Sbjct: 5 EEEKVFIQQLQKFWEQRGVTIKIPQIG----------GRELEVFKLYKAVTKRGGLKVVS 54
Query: 496 GNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
N + WK V + T TL+ HY+ LL YE
Sbjct: 55 ANKL-WKEIV----DQFSFPATCTSASFTLRNHYQKLLLAYE 91
>gi|68474912|ref|XP_718497.1| hypothetical protein CaO19.2476 [Candida albicans SC5314]
gi|46440265|gb|EAK99573.1| hypothetical protein CaO19.2476 [Candida albicans SC5314]
Length = 1684
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 8/60 (13%)
Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
D++ + +C G V C C EW H C KD + D +Y+CP C V
Sbjct: 1377 DEITPQYYCVCREYEYGTMVECDKCNEWYHVQC--------VKDVSNPDADKYICPTCLV 1428
>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
Length = 6442
Score = 40.0 bits (92), Expect = 4.1, Method: Composition-based stats.
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)
Query: 548 CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
C++C+ D WV C +C W H CD+ + + + Y CP C +F
Sbjct: 1683 CIMCNEKYEIDDSNKWVQCDVCKFWIHLSCDKNENRNI--ETLSIKSINYKCPTCRSGSF 1740
Query: 604 KKKSQK 609
K ++
Sbjct: 1741 HDKIER 1746
>gi|389584530|dbj|GAB67262.1| SET domain containing protein [Plasmodium cynomolgi strain B]
Length = 5788
Score = 40.0 bits (92), Expect = 4.1, Method: Composition-based stats.
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)
Query: 548 CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
C++C+ D WV C +C W H CD+ + + + Y CP C +F
Sbjct: 1214 CIMCNEKYEIDDSNKWVQCDVCKFWIHLSCDKNESRNI--ETLSIKSINYKCPTCRSGSF 1271
Query: 604 KKKSQK 609
K ++
Sbjct: 1272 HDKIER 1277
>gi|56566281|gb|AAN75616.2| ZNF1 [Cryptococcus neoformans var. neoformans]
Length = 846
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 44/106 (41%), Gaps = 10/106 (9%)
Query: 506 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE--CCLLCHSSAAGDWVNCG 563
F +R + N + V + L R YE + E E+ D D CC S+ D V C
Sbjct: 121 FEVIRTANVNNIPSTVKDVLMRRYEYLITEREMVADLTDDFRICCCCRQWSSFQDSVKCE 180
Query: 564 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQK 609
C E+ H C + L G +VCP C F++ +Q+
Sbjct: 181 SCREYYHMTCLQPPLLA-----KPAKGYSWVCPSCF---FQRNTQQ 218
>gi|340500107|gb|EGR27007.1| PHD-finger family protein, putative [Ichthyophthirius multifiliis]
Length = 544
Score = 39.7 bits (91), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 33/75 (44%), Gaps = 6/75 (8%)
Query: 525 LKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD-WVNCGICGEWAHFGCDRRQGLGAFK 583
+K E Y YE ++D D C+ GD + C IC EW HF C LG
Sbjct: 371 IKETEEKYAENYESVNEDSDQLYCICRQKYTYGDQMMACEICNEWFHFKC-----LGYKG 425
Query: 584 DYAKTDGLEYVCPQC 598
+ + ++++C C
Sbjct: 426 SIEEAEKIQFICTLC 440
>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1002
Score = 39.3 bits (90), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
G C + H S GDWV C C W H GCD FK+ + Y CP C V
Sbjct: 369 GICKRIWHPSDDGDWVCCDGCNVWVHAGCDNITN-ERFKELEHNN---YYCPDCKV 420
>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
Length = 6587
Score = 39.3 bits (90), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)
Query: 548 CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
C++C+ D WV C +C W H CD+ + + + Y CP C +F
Sbjct: 1730 CIMCNEKYEIDDSNKWVQCDVCKFWIHLSCDKNENRNI--ETLSIKSINYKCPTCRSGSF 1787
Query: 604 KKKSQK 609
K ++
Sbjct: 1788 HDKIER 1793
>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
sativus]
Length = 1073
Score = 39.3 bits (90), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + + S +G WV C C W H CD+ FKD TD Y CP C
Sbjct: 448 GICKKIWNHSDSGSWVRCDGCKVWVHAECDKISS-NLFKDLGSTD---YFCPTC 497
>gi|255555783|ref|XP_002518927.1| hypothetical protein RCOM_1314010 [Ricinus communis]
gi|223541914|gb|EEF43460.1| hypothetical protein RCOM_1314010 [Ricinus communis]
Length = 868
Score = 39.3 bits (90), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 14/82 (17%)
Query: 533 LLEYELAHDDVDG----ECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKT 588
L ++L D +DG +C + + VNC CG W H C R Y K
Sbjct: 8 LQSHDLHEDWIDGSWTVDCVCGVNFDDGEEMVNCDECGVWVHTRCSR---------YVKG 58
Query: 589 DGLEYVCPQCSVTNFKKKSQKT 610
D L + C +C N K S++T
Sbjct: 59 DEL-FTCDKCKSKNNKDDSEET 79
>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 24/54 (44%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + H S G+WV C C W H CD+ FKD D Y CP C
Sbjct: 372 GICKKIWHHSDGGNWVCCDGCNVWVHAECDKISS-KLFKDLENAD---YYCPDC 421
>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
Length = 1018
Score = 39.3 bits (90), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + + S +G WV C C W H CD+ FKD TD Y CP C
Sbjct: 391 GICKKIWNHSDSGSWVRCDGCKVWVHAECDKISN-SRFKDLGATD---YYCPAC 440
>gi|321262771|ref|XP_003196104.1| ZNF1p [Cryptococcus gattii WM276]
gi|54112196|gb|AAV28798.1| ZNF1p [Cryptococcus gattii]
gi|317462579|gb|ADV24317.1| ZNF1p [Cryptococcus gattii WM276]
Length = 850
Score = 39.3 bits (90), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 37/95 (38%), Gaps = 7/95 (7%)
Query: 506 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDG--ECCLLCHSSAAGDWVNCG 563
F +R ++ N V L R YE + E E+ D +D CC S+ D V C
Sbjct: 121 FEVIRTASVNNVPATVKEVLLRRYEYLITEREMVADLMDNFRNCCRCSQWSSFQDSVKCE 180
Query: 564 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
C E H C + L G +VCP C
Sbjct: 181 SCREHYHMSCLQPPLLA-----KPAKGYSWVCPSC 210
>gi|307196235|gb|EFN77881.1| AT-rich interactive domain-containing protein 5B [Harpegnathos
saltator]
Length = 1163
Score = 39.3 bits (90), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 26/119 (21%)
Query: 438 CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFH-VG 496
E++F+ DV F+ RG P G + +L K++DLF Y +V GG+ V
Sbjct: 225 AEEKKFIIDVQGFMNSRG----TPFGKMP-----LLGNKQIDLFLFYTKVQMLGGYDSVS 275
Query: 497 NGINWKGQVFSKMRNHTLTNRMTGVGNTL------KRHYETYLLEYELAHDDVDGECCL 549
G WK T+ + + G + +RHYE LL YE D + L
Sbjct: 276 TGRLWK----------TIYDDIGGSTTSTNAATTTRRHYERLLLPYERYQKGEDTKIKL 324
>gi|91087509|ref|XP_968902.1| PREDICTED: similar to GA18855-PA [Tribolium castaneum]
Length = 917
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 52/126 (41%), Gaps = 28/126 (22%)
Query: 413 QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAI 472
Q +Q+ +N P +K EFL D+ F+ RG P L I
Sbjct: 144 QVRQLYEINDDPKRK--------------EFLDDLFSFMQKRG----TPINRLP-----I 180
Query: 473 LNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
+ LDL+ LY V++RGG V N W+ ++ L + +T TL+ Y
Sbjct: 181 MAKSVLDLYELYNLVIARGGLVDVINKKLWQ----EIIKGLHLPSSITSAAFTLRTQYMK 236
Query: 532 YLLEYE 537
YL YE
Sbjct: 237 YLYPYE 242
>gi|145527630|ref|XP_001449615.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417203|emb|CAK82218.1| unnamed protein product [Paramecium tetraurelia]
Length = 204
Score = 38.9 bits (89), Expect = 7.8, Method: Composition-based stats.
Identities = 16/75 (21%), Positives = 33/75 (44%), Gaps = 11/75 (14%)
Query: 546 ECCLLCHSSAAGD---------WVNCG--ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYV 594
+CC C + W+ C C +W H C+ + G+ + + + +Y+
Sbjct: 48 KCCYFCAQVYSDQNENFLDGQKWIACDQEKCDKWTHLSCEAKNGIHSIECLVEDMKFKYI 107
Query: 595 CPQCSVTNFKKKSQK 609
CP C + + K+++ K
Sbjct: 108 CPWCRIEDQKQRNMK 122
>gi|328872173|gb|EGG20540.1| hypothetical protein DFA_00401 [Dictyostelium fasciculatum]
Length = 436
Score = 38.9 bits (89), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 19/72 (26%), Positives = 31/72 (43%), Gaps = 8/72 (11%)
Query: 548 CLLC--------HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCS 599
CLLC +++ W+ C C +W CD G + D + + L Y CP+C
Sbjct: 314 CLLCGIVYNSKEAMASSISWIRCDDCKQWVMSKCDSGMGDISLYDDSNPNPLHYSCPKCR 373
Query: 600 VTNFKKKSQKTS 611
K K+ + +
Sbjct: 374 TDPSKPKTTRNN 385
>gi|56566239|gb|AAN75157.2| ZNF1 [Cryptococcus neoformans var. grubii]
Length = 850
Score = 38.9 bits (89), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 7/95 (7%)
Query: 506 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE--CCLLCHSSAAGDWVNCG 563
F +R + N + V + L R YE + E E+ D D CC S+ D V C
Sbjct: 121 FEVIRTVNVNNIPSTVKDVLMRRYEYLITEREMVADLTDDFRICCCCRQWSSFQDSVKCE 180
Query: 564 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
C E+ H C + L G +VCP C
Sbjct: 181 SCREYYHMTCLQPPLLA-----KPAKGYSWVCPSC 210
>gi|134114177|ref|XP_774336.1| hypothetical protein CNBG3170 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256971|gb|EAL19689.1| hypothetical protein CNBG3170 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1249
Score = 38.9 bits (89), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 6/53 (11%)
Query: 550 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTN 602
+C +G + C +C +W HF C + +D A+ +YVCP C+++N
Sbjct: 55 VCKQDTSGPMIECDVCSDWFHFKC-----INLAEDDAEKIH-KYVCPSCTLSN 101
>gi|58269444|ref|XP_571878.1| transcriptional activator [Cryptococcus neoformans var. neoformans
JEC21]
gi|57228114|gb|AAW44571.1| transcriptional activator, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1242
Score = 38.9 bits (89), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 6/53 (11%)
Query: 550 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTN 602
+C +G + C +C +W HF C + +D A+ +YVCP C+++N
Sbjct: 55 VCKQDTSGPMIECDVCSDWFHFKC-----INLAEDDAEKI-HKYVCPSCTLSN 101
>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 1053
Score = 38.5 bits (88), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
G C + + S +G WV C C W H CD+ FKD TD Y CP C
Sbjct: 429 GICKKVSNHSDSGSWVRCDGCKVWVHAECDKISS-NHFKDLETTD---YFCPTC 478
>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
Length = 1078
Score = 38.5 bits (88), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 28/61 (45%), Gaps = 5/61 (8%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVT-NF 603
G C + + S +G WV C C W H CD+ FKD TD Y CP C NF
Sbjct: 447 GICKKVWNHSDSGSWVRCDGCKVWVHAECDKISS-NRFKDLGGTD---YYCPACKAKFNF 502
Query: 604 K 604
+
Sbjct: 503 E 503
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,864,351,716
Number of Sequences: 23463169
Number of extensions: 418522381
Number of successful extensions: 908587
Number of sequences better than 100.0: 235
Number of HSP's better than 100.0 without gapping: 72
Number of HSP's successfully gapped in prelim test: 163
Number of HSP's that attempted gapping in prelim test: 908150
Number of HSP's gapped (non-prelim): 370
length of query: 614
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 465
effective length of database: 8,863,183,186
effective search space: 4121380181490
effective search space used: 4121380181490
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 80 (35.4 bits)