BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000404
(1562 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 819 bits (2115), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/937 (47%), Positives = 582/937 (62%), Gaps = 94/937 (10%)
Query: 289 MQMEVSNETNGAK-----GRKKVKRKRGRPPKMLENDESEGEQTKIS-------KVKPGR 336
M++E +E G++ G +KVKRKRGRP K + E KI K K GR
Sbjct: 167 MKLERRSEEQGSEQQLFSGDQKVKRKRGRPRKTEKEVEEVVVSPKIVVSPMKKLKRKRGR 226
Query: 337 PPKLNKIDES---CEQRRKKVKDKRGRPRHKTHESGESDGEQRKRLKNKLGRPRKDGSDD 393
PPKL CE R KK+K KRGRPR E+ S ++ N L
Sbjct: 227 PPKLESEKNHQFVCELRNKKLKRKRGRPRKIDKENDNSLFDELNSELNTL---------- 276
Query: 394 RGSKRLKKKRGRPPKLQGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPE 453
+ K+ RGRPPKLQ N LK + +G+KV RK L ++ +VPT L +
Sbjct: 277 ----KPKRGRGRPPKLQKSNGALKNEHTEGRKVRLARKLSM-KLRNKVRSNVPTDRLSSD 331
Query: 454 KRH------------GGTEFNAQRFAPD-------KKNSCAETGEAISRQTMKTVNQREK 494
KRH G + + + P+ K SC E + + + + E
Sbjct: 332 KRHIRKEIHMKKTLQAGNDLSQEILEPEATLTASSKVISCGEKTKKVKKVKKPKIEVDEC 391
Query: 495 KCLETHQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWS 554
K + AK LLR+RI E+L AGW ++YRPR REY DAVYV+PEG+THWS
Sbjct: 392 K-----------RSIAKNLLRERITEILKTAGWTVQYRPRFNREYKDAVYVSPEGRTHWS 440
Query: 555 ITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLG 614
ITLAY+VLK HYE+ G S KTGF FTPIP++E+ L + K+ R+ K
Sbjct: 441 ITLAYNVLKRHYEEGDGDSTVYKTGFIFTPIPDEEIMTLTRCNEKASYSRSPVSKSTKRK 500
Query: 615 TDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQ 674
++ + NS KG PS TQ
Sbjct: 501 RKKAMLHQDVHNSDCNNSLE-----------KGFPSSF-------------------RTQ 530
Query: 675 QRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVML 734
R+RCALLVRN+ E +S+ DG++ Y+GK TLLAWMID G + L+EKVQY NQRKTRV L
Sbjct: 531 NRQRCALLVRNTEETADSSNDGYLLYNGKRTLLAWMIDLGILSLDEKVQYMNQRKTRVKL 590
Query: 735 QGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDES 794
+GR+ RDGI C CC E+ TISKF+ H+ S++ P +N+Y +GSSLLQC+L+SWNKQ+E
Sbjct: 591 EGRLTRDGIHCSCCDEVITISKFEMHAGSRVGQPLENIYVHTGSSLLQCLLESWNKQNEP 650
Query: 795 KRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
+ KG++FV+ D EDPNDDTCGICGDGGDLICCD CPSTFHQ+CLDIKKFPSG WHC+YCS
Sbjct: 651 QCKGYNFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYCS 710
Query: 855 CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
C+ CG++ ++D ++A L C LCEEKYH C Q + A + ++ FCGKKC
Sbjct: 711 CKVCGQVTIGLHPMDDHHEAAADVLCKCDLCEEKYHPICVQMNNASGDDVNNPLFCGKKC 770
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSL-SDVCQKVECNARLAVALSVMDE 973
Q + ERL++LLGV+ D++ G++W+L+ R DV +D+SL ++V QK++CN+ LAVAL VMDE
Sbjct: 771 QMLHERLQRLLGVRQDMKEGFSWTLIRRSDVDSDVSLCNEVAQKIKCNSELAVALFVMDE 830
Query: 974 CFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAE 1033
CFLP+ DHRSGINLIHNILYN GSNF RLN+ GF+TAILE+DDE+I AAS+RIHG ELAE
Sbjct: 831 CFLPVIDHRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAE 890
Query: 1034 MPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSS 1093
MPFIGTR+MYRRQGMCRR L+ IES L SLNVEKL+IPAISE+R+TW SVFGF+PL+ ++
Sbjct: 891 MPFIGTRYMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDETT 950
Query: 1094 KQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKG--LRSSELENPRTADEVGKNSDE 1151
KQ+MR MSLLVFPGV+MLQK ++K+ P E +G +S EL +T + V + +E
Sbjct: 951 KQRMRKMSLLVFPGVEMLQKLLLKDHLPMECTTLGEGSISKSPELSEHQTLEVVANSPEE 1010
Query: 1152 KYSAGFDLNACINAAAPHVCKIHDKSAPGESTLQFPN 1188
+ S LN+C A I A ES+++ PN
Sbjct: 1011 RGSPCSCLNSCSEGTAQDGMGISGDPAVIESSVK-PN 1046
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 794 bits (2051), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/755 (54%), Positives = 512/755 (67%), Gaps = 51/755 (6%)
Query: 521 LLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGF 580
+L+ AGW I+YRPR +EY DAVY +P G+ +WS+TLAY+VLK+HYE + GF
Sbjct: 1 MLMRAGWTIDYRPRRDKEYNDAVYTSPTGRGYWSVTLAYNVLKSHYED-----GHCEPGF 55
Query: 581 TFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKS 640
TFTPIP+ L+ LK+ N S+ + + K + + GE+ KKK N
Sbjct: 56 TFTPIPDGVLTKLKR--NASKGKKRRLKLEQEYDSGGEMKCCIVKKKSGKN--------- 104
Query: 641 QKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAY 700
+H +TQ KR ALL R+S EG ++ DG+V Y
Sbjct: 105 --------------------------KHAGGKTQNTKRFALLARHSKEGLTTDTDGYVPY 138
Query: 701 DGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH 760
GK TLL+WM+D GTVPLN KVQY N+RKTR +L+G I+RDGIRC CCSEIFTISKF+ H
Sbjct: 139 SGKRTLLSWMVDLGTVPLNAKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIH 198
Query: 761 SKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDG 820
+ KLC P QN+ E+G SLLQC LDSWNKQ+ES+R GFH V+ +DPNDDTCGICGDG
Sbjct: 199 AGMKLCEPSQNIILETGISLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDG 258
Query: 821 GDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQ 880
GDLICCDGCPSTFHQ+CLDI+KFPSG WHC+YCSC+FCG + +T +N D S L
Sbjct: 259 GDLICCDGCPSTFHQSCLDIQKFPSGDWHCIYCSCKFCGMFSGNTDQMNYNLDVNDSALL 318
Query: 881 ICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLV 940
C LCEEKYH C+Q + ++ + SS SFCGK C+E+FE+L+ LLGVKH+LE G++W+LV
Sbjct: 319 TCQLCEEKYHHMCTQGEDSILDDSSSPSFCGKTCRELFEQLQMLLGVKHELEDGFSWTLV 378
Query: 941 HRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK 1000
R +V D+SL+ + QKVECN++LAVALS+MDECFLP+ D RSGINLIHN+LYN GSNF
Sbjct: 379 QRTEVGFDISLNGIPQKVECNSKLAVALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFN 438
Query: 1001 RLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
RLNY GFFTAILER +EIISAASIRIHG +LAEMPFIGTRH+YRRQGMCRRLL IESAL
Sbjct: 439 RLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGMCRRLLNAIESAL 498
Query: 1061 CSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
SLNVEKL+IPAISEL +TWTSVFGF+PLEVSS+++MRNM++LVF G DMLQKP++K+Q
Sbjct: 499 HSLNVEKLVIPAISELMQTWTSVFGFKPLEVSSRKEMRNMNMLVFHGTDMLQKPLLKDQS 558
Query: 1121 PRENMISAKGLRSSELENPRTADE-VGKNSDEKYSAGFDLNACINAAAPHVCKIHDKSAP 1179
E+MI + L S+EL+ V NSD+ S G DLN A + I + A
Sbjct: 559 AEESMIPSAVLESNELKKDLDIKHGVANNSDKTCSPGSDLNISSKGANLSL-AICNGPAA 617
Query: 1180 GESTLQFPNGSTHDASGLTSETVNFPESTTDTNCI--DQL-----GVTSNDLQANDKIAV 1232
ES Q GS +D+S +TSET NFPES T+ + D L V A D AV
Sbjct: 618 VESGSQLNEGSLNDSSDITSETTNFPESATNEKSLVHDNLEGKNRTVICPQPSACDAHAV 677
Query: 1233 NTLGSPSDADEQTEDTDDPNASSLAVKSIASEVQI 1267
N + D+ DD + A +++ S+ ++
Sbjct: 678 NAHSATEGIDKHQTAVDDSIILAPAERTVESDSKL 712
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/983 (43%), Positives = 587/983 (59%), Gaps = 107/983 (10%)
Query: 257 KRGSEKESNQAEIEERGSNRVFDGSENERDVDMQMEVSNETNGAKGRKKVKRKRGRPPKM 316
KRG +S ++++ ++ + V G ++D + K KK+KRKRGRPPK
Sbjct: 331 KRGRPPKSQESDVFQQKCDDVEKG-----EIDQSAGQESHQFNNKVSKKIKRKRGRPPKA 385
Query: 317 LENDESE---------------GEQTK------ISKVKP--GRPPKLNKIDESCEQRRKK 353
++D SE G+Q++ I+ +KP GRP + K D S ++++
Sbjct: 386 QQSDGSEKKWGGVEKEVIDLSIGQQSRAPNNEAINFLKPRRGRPSRAKKSDLS--EKKRA 443
Query: 354 VKDKRGRPRHKTHESGESDGEQRKRLKNKLGRPRKDGSDDRGSKRLKKKRGRPPKLQGIN 413
++ R +S + D E R+ LK+ L RP K + KKK GRP K
Sbjct: 444 DPEEEACDRVADEKSDQLDNEVRENLKHNLERPFKLNKAKKVGALRKKKMGRPSK----- 498
Query: 414 EVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPEKRHGGTEFNAQRFAPDKKNS 473
H + ++R+ + + +R + N+
Sbjct: 499 ------------------DNIHDVNHNIRRN--------------SSLSGKRLLVKENNT 526
Query: 474 CAETGEAISRQTMKTVNQREKKCLETHQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRP 533
G+ I + + V ++K E + +Q +RD+IV++LL AGW+I++RP
Sbjct: 527 KLFPGKKIKDNSEENVGNTKQKAGEITHSRS-----ERQAVRDKIVDMLLGAGWEIQHRP 581
Query: 534 RNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSIL 593
RNGR+Y DAVYVNPEG+THWS+TLAY VLK HYE G S + F FTPIPE+ELSIL
Sbjct: 582 RNGRQYMDAVYVNPEGRTHWSVTLAYRVLKKHYEDGNGGSKMHNSSFQFTPIPEEELSIL 641
Query: 594 KKVINKSRSDRNKKKKGKNLG-----TDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGK 648
KV+ K RSD+NKKKK N G T G + KK K + + A + G S K +KG+
Sbjct: 642 TKVMIKERSDKNKKKKKWNKGEKGDKTAGAVNKKKWKLQKRKLGAWA--GVSHKM-LKGR 698
Query: 649 PSVSE--------GGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAY 700
+ T G ++ R HK ET RKRC+L+ R S +G ES+ DG+V Y
Sbjct: 699 KKLKNRHCQQDDLAATLGEGSTVSVRGHKRLETHGRKRCSLIARKSQDGIESDKDGYVLY 758
Query: 701 DGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH 760
+GK T+LAWMID GTVPL+ KVQY +RK R + +G I DGI+CDCC++ FT ++F+ H
Sbjct: 759 NGKRTVLAWMIDLGTVPLDGKVQYLKRRKARFVTKGSITTDGIQCDCCNKTFTSAEFEAH 818
Query: 761 SKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDG 820
+ K C PF+N+Y E+GSSLLQC LDSW K+D+S KGFHF++ DGEDPNDDTCGICGDG
Sbjct: 819 AGGKSCQPFENIYLETGSSLLQCQLDSWYKEDDSAHKGFHFIDIDGEDPNDDTCGICGDG 878
Query: 821 GDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQ 880
GDLICCD CPSTFHQ+CL+I+KFPSG WHC+YC C+FCG + +TC + + L
Sbjct: 879 GDLICCDSCPSTFHQSCLEIRKFPSGLWHCMYCLCKFCGMVGGNTCQRDGNMAAVSHALV 938
Query: 881 ICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLV 940
C LCE+KYH SC Q + +P S SFCG CQE++ERL+ L GVK +LE G++W+ V
Sbjct: 939 TCHLCEDKYHHSCFQEKDIINADPGSPSFCGNNCQELYERLQMLFGVKQELEAGFSWTFV 998
Query: 941 HRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK 1000
RFDVS+D+S+S + KV+CN+++AVAL +MDECF+P+ DH+SG+NLI NI+Y+FGSNF
Sbjct: 999 RRFDVSSDISVSGMSWKVDCNSKVAVALQIMDECFVPMVDHKSGVNLIRNIVYSFGSNFN 1058
Query: 1001 RLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
RLNY GFF A+LER DE+I+AASIR MP + + L
Sbjct: 1059 RLNYSGFFNAVLERGDEMIAAASIRY----FYSMPV--------------SFHSSLSMGL 1100
Query: 1061 CSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
CSLNV KL+IPAISEL TWTSVFGF+ LE S KQ MRNM+++VFPGVDMLQKP++K+ F
Sbjct: 1101 CSLNVGKLVIPAISELTGTWTSVFGFKHLEGSDKQIMRNMNMMVFPGVDMLQKPLLKHPF 1160
Query: 1121 PRENMISAKGLRSSELENPRTADEVGKNSDEKYSAGFDLNACINAAAPHVCKIHDKSAPG 1180
ENM +GL S++ E T +E+ K +E SAG DL + H I ++ A
Sbjct: 1161 TEENMHPIEGLNSTKREEFHTKEEMKKFFNENCSAGCDLKGSSESDVTHSGNIMNEHAAV 1220
Query: 1181 ESTLQFPNGSTHDASGLTSETVN 1203
ES+ +G +D S +T++ +
Sbjct: 1221 ESS-SVLDGCLNDISDITAQNAS 1242
Score = 44.3 bits (103), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 19/26 (73%), Positives = 23/26 (88%)
Query: 1537 VALHCASGGGNSHGTPEVMVLSNKAS 1562
V LHCASGGGNS G PEV++LSN+A+
Sbjct: 1675 VILHCASGGGNSCGAPEVVILSNQAN 1700
Score = 42.4 bits (98), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 55/117 (47%), Gaps = 26/117 (22%)
Query: 305 KVKRKRGRPPKMLENDESEG-----EQTKISKVKPGRPPKLNKIDESCEQRRKKVKDKRG 359
++KRKRGRP K E+DE +G E+ +I + LNK + ++K KRG
Sbjct: 238 RIKRKRGRPRKSQESDEFQGTWGDVEKGEIDQSAGHESHHLNK------KLSNEIKRKRG 291
Query: 360 RPRHKTHESGESDGEQRKRLKNKLGRPRKD------GSDDRGSKRLKKKRGRPPKLQ 410
RP K+ ES E + K G K G ++ S K KRGRPPK Q
Sbjct: 292 RPP-KSQESNE--------FQQKWGDVEKGEVDQYAGLHNKSSNEKKCKRGRPPKSQ 339
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/787 (50%), Positives = 504/787 (64%), Gaps = 33/787 (4%)
Query: 497 LETHQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSIT 556
LE ++ + ++ KQL+RD+I E L AAGW +++RPRNGREY DAVYV+ +G THWSIT
Sbjct: 268 LEKKEDSLVKRNAVKQLVRDQIKEQLSAAGWTVDFRPRNGREYHDAVYVSLDGHTHWSIT 327
Query: 557 LAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTD 616
LAY LK +YE G K GF FTPI E++ +L KV+NK R K K D
Sbjct: 328 LAYKRLKEYYEAGNGEGKAYKPGFKFTPIAEEDFKMLTKVMNKQRKKGGKGGKK----VD 383
Query: 617 GEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQR 676
G GK N S +G + +K K TS N M + R HK Q+TQ +
Sbjct: 384 G--------VNGKKNKEKSGYGAGMGKSMKRKMKRK---TSPNRMPV-VRDHKRQKTQNK 431
Query: 677 KRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQG 736
KRCA L RN+ E +S +G+V Y GK TLLAWMID+GTV N KV Y + +L G
Sbjct: 432 KRCAPLARNAEE-IDSETEGYVLYCGKRTLLAWMIDSGTVLQNGKVHYMPHKSKSAVLDG 490
Query: 737 RIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKR 796
I +GI C CC +IFTIS F+ H+ SKL P +N+Y G+SLLQC+LDSWNKQDES+R
Sbjct: 491 EITGNGIHCGCCDKIFTISDFELHAGSKLADPLKNIYVGEGTSLLQCLLDSWNKQDESER 550
Query: 797 KGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQ 856
KGFHFV+ GEDPNDDTCG+CGDGGDLICCDGCPSTFHQ CLDIKKFPSG WHC+YC C+
Sbjct: 551 KGFHFVDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHQGCLDIKKFPSGDWHCIYCCCK 610
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQE 916
FCG ++ S+ +D D+ +S L C LCEEKYH+SC + + A + + FCG +CQE
Sbjct: 611 FCGSVSGSSNQRDDNDELIVSKLLTCQLCEEKYHRSCIEANDANTDDSRDVFFCGNRCQE 670
Query: 917 IFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFL 976
+ ERLE LLGVKH++E GY+W+ + R DV D S Q VECN++LAVA+S+MDECF+
Sbjct: 671 LSERLEMLLGVKHEMEDGYSWTFIRRSDVGFDASQIKP-QMVECNSKLAVAVSIMDECFM 729
Query: 977 PLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPF 1036
P DHRSGINLIH+ILYN GSNF RLNY GF TAILER DEIISAASIRI G +LAEMPF
Sbjct: 730 PYIDHRSGINLIHSILYNRGSNFNRLNYSGFVTAILERGDEIISAASIRIRGNQLAEMPF 789
Query: 1037 IGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQK 1096
IGTR+MYRRQGMCRRLL +E L SLNVE L+IPAISELRETWTSVFGF+ LE +SKQ
Sbjct: 790 IGTRYMYRRQGMCRRLLNAVEWGLGSLNVELLVIPAISELRETWTSVFGFESLESTSKQI 849
Query: 1097 MRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKGLRSSE---LENP---------RTADE 1144
+ N +LLVFP VDMLQK + K++ +N+ ++G + E +E+ + D
Sbjct: 850 LHNKNLLVFPHVDMLQKKISKHKLAGQNLNPSEGQINDETISIESGCHLPDGSLNKVPDI 909
Query: 1145 VGKNSDEKYSAGFDLNACINAAAPHVCKIHDKSAPGESTLQFPNGSTHDA--SGLTSETV 1202
D + S+ D + AAP + + D ++ E+ + G+ A S L S V
Sbjct: 910 ASNAKDHRKSSTDDTCQVVCQAAPEILVMEDNNSI-ENARDYVTGTCCRAGDSQLNSCWV 968
Query: 1203 NFPESTTDTNCIDQLGVTSNDLQANDKIAVNTLGSPSDADEQTEDTDDPNASSLAVKSIA 1262
E T+CI N + + ++ N L A+E ++ N++S + I
Sbjct: 969 PCEERQCLTSCIRSEATEGNCVSVSLEVTENFLNQVKVANEAISNSSAENSASCGAERIP 1028
Query: 1263 SEVQIEY 1269
+ QI +
Sbjct: 1029 LDSQIAF 1035
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/803 (46%), Positives = 500/803 (62%), Gaps = 52/803 (6%)
Query: 465 RFAPDKKNSCAETGEAISRQTMKTVNQREKKCLETHQEETLSKHGA---KQLLRDRIVEL 521
+ P + T A S + NQR L T +E K G KQ LR+RI E+
Sbjct: 391 KLGPKNEEPHKSTKGASSSGEITPSNQR----LPTRSKEGKIKRGTGTEKQKLRERIREM 446
Query: 522 LLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLK---NHYEQEGGSSDTSKT 578
LL AGW I+YRPR R+Y DAVY+NP G +WSI AY L N E+E S D S
Sbjct: 447 LLNAGWTIDYRPRRNRDYLDAVYINPTGTAYWSIIKAYDALLKQLNDEEEEARSKDES-- 504
Query: 579 GFTFTPIPEDELSIL-----------------KKVINKSRSDRNKKKKGKNLGTDGEIVT 621
F P+ ++ LS L ++ +++S + R + + E
Sbjct: 505 ---FMPLSDEVLSQLTRKTRKKMEKEMKMKKKQRDVSESENARETAARKSSSSRHDEESM 561
Query: 622 KKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMS----------IPARRHKLQ 671
+ K +S GKS K + G S + + N + + H+ +
Sbjct: 562 DSGSHEEKLSSFIKQGGKSLKSRMNGNSSFNLNTKNQNSIHPLHGAVEQTFSGSNSHQGR 621
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
++++ RC LLVRNS EG S DGFV Y GK TLL+W+ID G V L++KV+Y N+R+T+
Sbjct: 622 KSRKLGRCTLLVRNSNEGLNSESDGFVPYAGKRTLLSWLIDCGAVQLSQKVRYMNRRRTK 681
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQ 791
VML+G + RDGI C CCS+I T+SKF+ H+ SKL PFQN+Y +SG SLL+C +D+WN+Q
Sbjct: 682 VMLEGWVTRDGIHCGCCSKILTVSKFEIHAGSKLRQPFQNIYLDSGVSLLECQIDAWNRQ 741
Query: 792 DESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCV 851
+ +R GFH VN DG+DPNDDTCGICGDGGDLICCDGCPSTFHQ+CLDI P G WHC
Sbjct: 742 ESIERIGFHSVNTDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIMMLPPGDWHCP 801
Query: 852 YCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSF 909
C+C+FCG +E +D + +S L CSLC +KYH+SC Q A + + S+ F
Sbjct: 802 NCTCKFCGIASEDFVQ---EDGTNVSELLTCSLCAKKYHKSCLQDVDAPCIDFNNSTPCF 858
Query: 910 CGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALS 969
CGK C+E+FE+L+K LG+KH+LE G++WSLVHR D+ D+SL + Q+VECN++LAVALS
Sbjct: 859 CGKTCRELFEQLQKYLGIKHELESGFSWSLVHRMDIDLDMSLQGLPQRVECNSKLAVALS 918
Query: 970 VMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGK 1029
VMDECFLP+ D RSGIN+I N+LYN GSNF RLNY GF+ AILER DEIISAASIR HG
Sbjct: 919 VMDECFLPIVDRRSGINIIQNVLYNCGSNFNRLNYSGFYAAILERGDEIISAASIRFHGT 978
Query: 1030 ELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL 1089
+LAEMPFIGTRH+YRRQGMCRRL + IESALCSL V+KLIIPAISEL TWT VFGF L
Sbjct: 979 QLAEMPFIGTRHVYRRQGMCRRLFSAIESALCSLKVQKLIIPAISELTHTWTGVFGFTTL 1038
Query: 1090 EVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKGLRSSELENPR-TADEVGKN 1148
S KQ++++M++LVFPG+DMLQK +++ + NM + G + SELE+ + EV
Sbjct: 1039 SDSLKQELKSMNMLVFPGIDMLQKQLLEKENTDGNMTLSAGFKGSELEDSQCVTPEVAAK 1098
Query: 1149 SDEKYSAGFDLNAC-INAAAPHVCKIHDKSAPGESTLQFPNGSTHDASGLTSETVNFPES 1207
SD SA DL+ IN H + +D+ S F + +D S ++S + +
Sbjct: 1099 SDIDSSAMHDLDKYDINGDLEHASRANDEVVTANSDSHFLDVPMNDTSVISS---SLDST 1155
Query: 1208 TTDTNCIDQLGVTSNDLQANDKI 1230
N + + + + D + DK+
Sbjct: 1156 QEQKNLVLLIEMVNADFDSGDKL 1178
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 714 bits (1843), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/894 (45%), Positives = 533/894 (59%), Gaps = 122/894 (13%)
Query: 308 RKRGRPPKM-LENDESEGEQTKISKVKPGRPPKLNKIDESCEQRRKKVKDKRGRPRH--K 364
RK GRP K L+ D+ +Q + K GRP R+ ++KD PR +
Sbjct: 431 RKLGRPHKTELKEDDRLVDQ---PQRKLGRP------------RKTELKDHDQSPRKLPR 475
Query: 365 THESG-ESDGEQRKRLKNKLGRPRK-DGSDDRGSKRLKKKRGRPPK------LQGINEVL 416
T +S + DG+ + KLGRPRK + ++D S R KRGRP K L +
Sbjct: 476 TRKSELKGDGQFVDQSPRKLGRPRKTELNEDDQSPR---KRGRPFKADQQKHLMTVAHNS 532
Query: 417 KGKVGK--GKKVNGIRKSQRHTLAVGLKRDVPTYGLIPEKRHGGTEFNAQRFAPDKKNSC 474
KGK+ + GKKV + S A + D G R G + +RF+P +KN+
Sbjct: 533 KGKMSRENGKKVLTVTDS-----ASTNEVDDTCSG-----RSSGKKLKEKRFSPVRKNN- 581
Query: 475 AETGEAISRQTMKTVN-QREKKCLETHQEETLSKHGAK----QLLRDRIVELLLAAGWKI 529
R+ +KT N + + T + ++K G + QL+RD+I E L AAGW +
Sbjct: 582 -------PRKVLKTENDEMVSPLVSTTVKAAVAKEGPRRKEQQLVRDKIRECLFAAGWTV 634
Query: 530 EYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDE 589
++RPRNGR Y D+VYV+ +G THWS+TLAY LK HYE G K GF FTPI +++
Sbjct: 635 DHRPRNGRNYVDSVYVSLDGTTHWSVTLAYKRLKQHYEAGDGEGKLYKPGFIFTPILQED 694
Query: 590 LSILKKVINKSRSDRNKK------KKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKS--- 640
+ L +V+ KS+ N K K GK +G K+K+KK K +S A GKS
Sbjct: 695 FNRLTRVVTKSKKGSNVKCEPFEEKGGKKVGV------KRKEKKIKPDSGAG-KGKSVKG 747
Query: 641 --QKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFV 698
+++ + +P EG T+ ++ P R K +TQ + R LLVR++ E +S +G+V
Sbjct: 748 KMKRKLKRKRPLPEEGNTN---VTSPNRDRKRHKTQNKTRSTLLVRDATEEVDSEINGYV 804
Query: 699 AYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFD 758
Y GK T+L+WMID GT+ N KV Y R L+G+I DGI C CC+EI TIS F
Sbjct: 805 PYSGKRTILSWMIDLGTILQNGKVHYTQDRLENASLEGKITGDGIHCGCCNEIVTISDFG 864
Query: 759 THSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICG 818
H+ SK P +N+Y E +SLLQC+LDSWNKQDES+ K FHF + GEDPNDDTCG+CG
Sbjct: 865 AHAGSKQSDPLKNIYTEEETSLLQCLLDSWNKQDESELKSFHFFDVAGEDPNDDTCGVCG 924
Query: 819 DGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALST 878
DGGDLICCDGCPSTFH++CLDIKKFPSG WHC YC C+FC + S+ D+ +
Sbjct: 925 DGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAYCCCKFCRLVGGSSNQSVVNDEFTMPA 984
Query: 879 LQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWS 938
L C LCEEK+H SC + +G + FCG KCQE+ ERLE LLGVKH++E G++WS
Sbjct: 985 LLTCHLCEEKFHISCVEANGGKTDDSKDALFCGNKCQELSERLEMLLGVKHEIEDGFSWS 1044
Query: 939 LVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSN 998
+ R DV DLSL++ Q VECN++LAVALS+M+ECF+P DHRSG NL+ +ILYN GSN
Sbjct: 1045 FIRRSDVGCDLSLTNP-QLVECNSKLAVALSIMNECFMPYIDHRSGTNLLRSILYNCGSN 1103
Query: 999 FKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIES 1058
FKRL+Y GF T ILER DEII ASIR+HG LAEMP+IGTR+MYRRQGMCRRLL IES
Sbjct: 1104 FKRLDYSGFITVILERGDEIICVASIRVHGNRLAEMPYIGTRYMYRRQGMCRRLLNAIES 1163
Query: 1059 ----------------------------------------------ALCSLNVEKLIIPA 1072
AL SL+VE L+IPA
Sbjct: 1164 EAVYGLSVNLGACPMVVTVRVCNYSMNGSRGFVFDVLRGNDWGGPKALSSLDVELLVIPA 1223
Query: 1073 ISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMI 1126
ISELRETWTSVFGF+PL+ +SKQ NM+LLVFP VD+LQK + K+ EN+I
Sbjct: 1224 ISELRETWTSVFGFEPLKQTSKQITNNMNLLVFPHVDLLQKKISKHAIANENLI 1277
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/787 (47%), Positives = 492/787 (62%), Gaps = 72/787 (9%)
Query: 353 KVKDKRGRPRHKTHESGESDGEQRKRLKNKLGRP---RKDGSDDRGSKRLKKKRGRPPKL 409
+VK KRGRPR K S +SD E R KL R S DR S L + RGRPPK
Sbjct: 243 QVKRKRGRPR-KVQISSQSD-ESRPNTNCKLARTPELSSQSSVDRIS--LSRLRGRPPKT 298
Query: 410 QGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPEK-RHGGTEFNAQRFAP 468
+ + L + KG + NG R + KR P P+K R G + A
Sbjct: 299 KETSVSLY--IEKGPESNGRRMVR--------KRGRPP---TPQKKRKSGMTDESDWKAK 345
Query: 469 DKKNSCAETGEAISRQTMKTVNQREKKCLETHQEET----LSKHGAKQLLRDRIVELLLA 524
+ C E SR ++ E+ E ++T S+ +K++L DRI++LLL
Sbjct: 346 KRLKLCESPLE--SRHNNPLIDD-ERMIGEQRSKQTEAGGHSRSKSKKMLSDRILQLLLT 402
Query: 525 AGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTP 584
AGW +EYRPRNGR Y DAVY+NPEGKTHWS+T AY V K E S TG F
Sbjct: 403 AGWTVEYRPRNGRAYQDAVYLNPEGKTHWSVTKAYQVYKKQLESNPNDQKNSTTGSGFGL 462
Query: 585 IPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRG 644
+PE++L +L++ I K RSD K++ K K TN
Sbjct: 463 LPEEDLHLLERTIQKKRSDTGKQRS--------------KLKDRDTNDIL---------- 498
Query: 645 IKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKW 704
VS GT + K +E RKRC R+S++ +S DG++ ++GK
Sbjct: 499 ------VSTKGTG---------KIKREEKHSRKRCTPSARSSLKDVDSKEDGYILFEGKR 543
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L WMID+ VPLN KVQ + +KT ++L+G I ++GIRC+CC E+F++ F+ H+
Sbjct: 544 TMLGWMIDSTIVPLNGKVQCMDCKKTDILLEGIITKEGIRCNCCDEVFSVLDFEVHAGGN 603
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
PF++LY E G+SLLQC+ +S NKQ ES+ KG+HFV+F DPNDDTCGICGDGGDLI
Sbjct: 604 RNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYHFVDFGSGDPNDDTCGICGDGGDLI 663
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCDGCPSTFHQ+CLDIKKFPSG W+C CSC+FC + +E+ H + S L +L C L
Sbjct: 664 CCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK-DEAAKH----ETSTLPSLSSCRL 718
Query: 885 CEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFD 944
CEEKYHQ+C DG V E S+ SFCGK CQE+FE L+ +GVKH L G++WS + RF+
Sbjct: 719 CEEKYHQACINQDGTVPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRRFE 778
Query: 945 VSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNY 1004
+ ++++ D+ +K+ NA++AVA SVMDECF PL DHRSG+NL+ NI+YNFGSNF RL++
Sbjct: 779 LPSEVADCDISEKIAYNAKMAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRLDF 838
Query: 1005 KGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLN 1064
F TA+LER DEII+ ASIRIHG +LAEMPFIGTR+MYRRQGMCRRL+ GIESAL SL
Sbjct: 839 SSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALGSLK 898
Query: 1065 VEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPREN 1124
V+KL+IPA+ EL +TWTS FGF P+ S K+ ++N++LLVFPGVDML K ++K +
Sbjct: 899 VDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTIKNLNLLVFPGVDMLGKSLVKEKITDSV 958
Query: 1125 MISAKGL 1131
+ S GL
Sbjct: 959 VSSPNGL 965
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/822 (46%), Positives = 505/822 (61%), Gaps = 84/822 (10%)
Query: 353 KVKDKRGRPRHKTHESGESDGEQRKRLKNKLGR-PRKDGSDDRGSKRLKKKRGRPPKLQG 411
+VK KRGRPR K S +SD E R KL R P L ++RGRP K +
Sbjct: 270 QVKRKRGRPR-KLQISSQSD-ESRPNTNCKLARTPELSSPSSVDRISLSRRRGRPSKTK- 326
Query: 412 INEVLKGKVGKGKKVNGI-------------------RKSQRHTLAVGLKRD----VPTY 448
E G + VN + R+S+R+ A G + D V
Sbjct: 327 --ETAVGLYIEKDNVNCMHGSPLIMPDQSMSDSIDESRRSKRNCRAKGPESDGKKMVRKR 384
Query: 449 GLIPEKRHGGTEFNAQRFAPDKKNSCAETGEAISRQT---------MKTVNQR-----EK 494
G P P KK T E++ ++ +++ N E+
Sbjct: 385 GRPP--------------TPQKKRKSGMTDESVCKEKKRLKLCESPLESQNNNSSIDGER 430
Query: 495 KCLETHQEET----LSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK 550
E H ++T S+ +K++L DRI++LLLAAGW +EYRPRNGR Y DAVY+NPEGK
Sbjct: 431 MIGEQHNKQTEAGTQSRSKSKKMLSDRILQLLLAAGWTVEYRPRNGRAYEDAVYLNPEGK 490
Query: 551 THWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKK-K 609
THWS+T AY V K H E S TG +PE++L +L + I K RSD K + K
Sbjct: 491 THWSVTKAYQVYKKHLESSMNDQMNSTTGSGCGLLPEEDLHLLGRRIQKKRSDTGKHRPK 550
Query: 610 GKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGI-KGKPSVSEGGTSHNGMSIPARRH 668
K+ T+ +V+ K + K S HG +GI + K G + +S+ R+
Sbjct: 551 LKDRDTNENVVSTKGRGK------RSLHGNRLLKGIMQNKRKGDHGYHNVERISVSVRKI 604
Query: 669 KLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQR 728
K +E RKRC R+S+E ++ G++ ++GK T+L WMID+ VPLN KVQ + +
Sbjct: 605 KREEKHNRKRCTPSARSSLEDADAKDGGYILFEGKRTMLGWMIDSTIVPLNGKVQCMDCK 664
Query: 729 KTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSW 788
KT ++L+G I ++GIRC+CC E+F++ F+ H+ K PF++LY E G+SLLQC+ D
Sbjct: 665 KTDMLLEGIITKEGIRCNCCDEVFSVLDFEVHAGGKRNQPFKSLYLEGGNSLLQCLHDFM 724
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
NKQ ES+ KG+HFV+F DPNDDTCGICGDGGDLICCDGCPSTFHQ+CLDIKKFPSG W
Sbjct: 725 NKQSESQHKGYHFVDFCSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAW 784
Query: 849 HCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKY----------HQSCSQTDG 898
+C CSC+FC ++ E+ H D SAL +L C LCEEK HQ+C DG
Sbjct: 785 YCCNCSCKFCEKV-EAAIH----DTSALHSLSSCRLCEEKCSNHYPHTLADHQACINQDG 839
Query: 899 AVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKV 958
V E S+ SFCGK CQE+FE L+ L+GVKH L G++WS + RF++ ++++ D+ +K+
Sbjct: 840 TVPGERSTDSFCGKYCQELFEELQLLIGVKHPLPEGFSWSFLRRFELPSEVADCDISEKI 899
Query: 959 ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
NA++AVA SVMDECF PL DHRSG+NL+ NI+YNF SNF RLN+ F TA+LER DEI
Sbjct: 900 AYNAKMAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFWSNFHRLNFSSFLTAVLERGDEI 959
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
I+ ASIRIHG +LAEMPFIGTR+MYRRQGMCRRL+ GIESAL SL V KL+IPA+ EL +
Sbjct: 960 IAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALGSLKVAKLVIPAVPELID 1019
Query: 1079 TWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
TWTS FGF P+ S K+ ++N++LLVFPGVDML K ++K Q
Sbjct: 1020 TWTSGFGFTPVNESEKKTIKNLNLLVFPGVDMLGKSLVKEQI 1061
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 690 bits (1781), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/1122 (41%), Positives = 602/1122 (53%), Gaps = 199/1122 (17%)
Query: 521 LLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGF 580
+L+ AGW I+YRPR +EY DAVY +P G+ +WS+TLAY+VLK+HYE + GF
Sbjct: 1 MLMRAGWTIDYRPRRDKEYNDAVYTSPTGRGYWSVTLAYNVLKSHYED-----GHCEPGF 55
Query: 581 TFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEI-VTKKKKKKGKTNSAASPHGK 639
TFTPIP+ L+ LK+ N S+ + + K + + GE+ KKK GK A GK
Sbjct: 56 TFTPIPDGVLTKLKR--NASKGKKRRLKLEQEYDSGGEMKCCIVKKKSGKNKHAG---GK 110
Query: 640 SQKRGIKGKPSVSE----GGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGD 695
S R +KG+ S+S G H G+ R KLQ TQ KR ALL R+S EG ++ D
Sbjct: 111 SSNRKMKGRSSLSGQDNLTGMLHKGILTSVRNRKLQRTQNTKRFALLARHSKEGLTTDTD 170
Query: 696 GFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTIS 755
G+V Y GK TLL+WM+D GTVPLN KVQY N+RKTR +L+G I+RDGIRC CCSEIFTIS
Sbjct: 171 GYVPYSGKRTLLSWMVDLGTVPLNAKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTIS 230
Query: 756 KFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCG 815
KF+ H+ KLC P QN+ E+G SLLQC LDSWNKQ+ES+R GFH V+ +DPNDDTCG
Sbjct: 231 KFEIHAGMKLCEPSQNIILETGISLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCG 290
Query: 816 ICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSA 875
I C DG
Sbjct: 291 I--------CGDG----------------------------------------------- 295
Query: 876 LSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGY 935
L C C +HQSC ++FE+L+ LLGVKH+LE G+
Sbjct: 296 -GDLICCDGCPSTFHQSCLDI-------------------QLFEQLQMLLGVKHELEDGF 335
Query: 936 TWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNF 995
+W+LV R +V D+SL+ + QKVECN++LAVALS+MDECFLP+ D RSGINLIHN+LYN
Sbjct: 336 SWTLVQRTEVGFDISLNGIPQKVECNSKLAVALSIMDECFLPIVDQRSGINLIHNVLYNC 395
Query: 996 GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTG 1055
GSNF RLNY GFFTAILER +EIISAASIRIHG +LAEMPFIGTRH+YRRQGMCRRLL
Sbjct: 396 GSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGMCRRLLNA 455
Query: 1056 IESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
IESAL SLNVEKL+IPAISEL +TWTSVFGF+PLEVSS+++MRNM++LVF G DMLQKP+
Sbjct: 456 IESALHSLNVEKLVIPAISELMQTWTSVFGFKPLEVSSRKEMRNMNMLVFHGTDMLQKPL 515
Query: 1116 MKNQFPRENMISAKGLRSSELENPRTADE-VGKNSDEKYSAGFDLNACINAAAPHVCKIH 1174
+K+Q E+MI + L S+EL+ V NSD+ S G DLN A + I
Sbjct: 516 LKDQSAEESMIPSAVLESNELKKDLDIKHGVANNSDKTCSPGSDLNISSKGANLSL-AIC 574
Query: 1175 DKSAPGESTLQFPNGSTHDASGLTSETVNFPESTTDTNCI--DQL-----GVTSNDLQAN 1227
+ A ES Q GS +D+S +TSET NFPES T+ + D L V A
Sbjct: 575 NGPAAVESGSQLNEGSLNDSSDITSETTNFPESATNEKSLVHDNLEGKNRTVICPQPSAC 634
Query: 1228 DKIAVNTLGSPSDADEQTEDTDDPNASSLAVKSIASEVQIE------------------- 1268
D AVN + D+ DD + A +++ S+ ++
Sbjct: 635 DAHAVNAHSATEGIDKHQTAVDDSIILAPAERTVESDSKLNQQRTCDMEKKPLGVSCLGS 694
Query: 1269 --YGSAKQSLNSCDEASAQQSAE----IIK-------------HQSLGFL--SELNVSSE 1307
G K+ ++C E E IIK H + FL S N +
Sbjct: 695 EATGCEKEVFHACKEGKETVGFELKNDIIKQSNNSLDDNINPQHLDVDFLHGSGENTACL 754
Query: 1308 NVVAHDSVSIDCRPLDANVIIDQDHQQLPVVGCISVSGERKLETDKVKSDSVFSQISSVV 1367
N A +S D + D D QQL V+G LE D +++ ++ V
Sbjct: 755 NFEARSQMSDDAK--DGK---HSDSQQLQVIG-------YTLEADDKVLNTLKAEDGVAV 802
Query: 1368 AGDMKQALTESIKANHILRETIITSSCNADQAS------PSAPQRAQ---NDDC------ 1412
G A + + I+ C + + PS PQ+ + ND C
Sbjct: 803 VGPNPSAPGGACPTSTQCCSEILCQRCTSSNDAVSHLMPPSTPQKVRDVANDPCAALLVG 862
Query: 1413 --INSSRQANGLDDHDI--VLSPINTDCHSPCVPFSDAPNKPD---FPPCMSNGLCLSED 1465
+ SS Q +GLD H + V + +NT+C SDA +P+ P +S+G L +
Sbjct: 863 CDLGSSCQGDGLDSHKLEYVAASVNTNCQP-----SDAATEPNEHLQPMELSSG--LDRN 915
Query: 1466 EMMSCKACVDDGPNLKDKPVSVGGSQICDAVFQVRIESSEQPESDSQVHLVGV----TNT 1521
++ AC P VG +C Q +S + E+ H G T
Sbjct: 916 DVPESDACAFTCP--------VG---MC---HQTPKDSHKASETGLCSHDDGFPPDRPQT 961
Query: 1522 NCNCEPLCNL-SPGSGVALHCASGGGNSHGTPEVMVLSNKAS 1562
+ + + S GSG+AL CASGG + G PEVMV+SN+AS
Sbjct: 962 VIESDSVADHSSSGSGMALLCASGGCKAGGAPEVMVVSNQAS 1003
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/675 (52%), Positives = 457/675 (67%), Gaps = 32/675 (4%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQLLR+RI +L+ AGW I+YRPR R+Y DAVY+NP G +WSI AY L+ + E
Sbjct: 366 KQLLRERIRGMLVNAGWTIDYRPRRNRDYLDAVYINPTGTAYWSIIKAYDALQKQIDDEE 425
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKK------ 624
S S F+PI ++ LS L + K K+K+ + GT K
Sbjct: 426 SKSKPSGDLSPFSPIADEVLSKLTRQTRKKIEKEMKRKQKDHAGTKNTDAYTKDDSEDAD 485
Query: 625 --KKKGKTNSAASPHGKSQKRGIKG----KPSVSEGGTSHNGMSIPARRHKLQETQQRKR 678
K + K +S +GKS KR ++ K S + H +++++ R
Sbjct: 486 DIKHEEKLSSFIKQNGKSIKRTLRHDRGEKLSFASNSLVHG-----------RKSRKIGR 534
Query: 679 CALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRI 738
C LLVRNS +G DGFV Y GK TLL+W+ID+GTV L+EKVQY N+R+T+VML+G I
Sbjct: 535 CTLLVRNSGKGLNLETDGFVPYTGKRTLLSWLIDSGTVQLSEKVQYMNRRRTKVMLEGWI 594
Query: 739 ARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKG 798
RDGI C CCS+I T+SKF+ H+ SKL PFQN+ +SG SLLQC +D+WN+Q+ES+R G
Sbjct: 595 TRDGIHCRCCSKILTVSKFEIHAGSKLRQPFQNIVLDSGVSLLQCQVDAWNRQEESERSG 654
Query: 799 FHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFC 858
FH ++ DG+DPNDDTCGICGDGGDLICCDGCPSTFHQ+CL+I+ PSG WHC C+C+FC
Sbjct: 655 FHPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCTCKFC 714
Query: 859 GRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPS--SLSFCGKKCQE 916
G + S N +DD+ +S L CSLCE+KYH SC Q AV + + S SFCG+ C+E
Sbjct: 715 GMADGS----NAEDDTTVSELVTCSLCEKKYHTSCIQGVDAVLSDTNNPSTSFCGQGCRE 770
Query: 917 IFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFL 976
+FE L+K +GVK +LE G++WSL+HR D +D S+ Q+VE N++LA+AL+VMDECFL
Sbjct: 771 LFEHLQKFIGVKQELEAGFSWSLIHRTDPGSDTSVRGFPQRVESNSKLAIALTVMDECFL 830
Query: 977 PLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPF 1036
+ D RS INLIHN+LYN GSNF RLNY GF+TAILER DEII AASIRIHG +LAEMPF
Sbjct: 831 SIVDRRSEINLIHNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIHGTQLAEMPF 890
Query: 1037 IGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQK 1096
IGTRH+YRRQGMCRRL IESALCSL VE LIIPAISEL TWT FGF PLE S KQ+
Sbjct: 891 IGTRHIYRRQGMCRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFNPLEESHKQE 950
Query: 1097 MRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKGLRSSELE--NPRTADEVGKNSDEKYS 1154
+R++++LVFPG DMLQK +++ + NM ++ G +S E + N T D K SD S
Sbjct: 951 LRSLNMLVFPGTDMLQKLLLEQETADGNMTASPGTKSVESKGNNCNTPDLENK-SDIDSS 1009
Query: 1155 AGFDLNACINAAAPH 1169
G DL+ ++ + H
Sbjct: 1010 NGHDLSIHNHSISQH 1024
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1179
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/797 (46%), Positives = 491/797 (61%), Gaps = 82/797 (10%)
Query: 353 KVKDKRGRPRHKTHESGESDGEQRKRLKNKLGRP---RKDGSDDRGSKRLKKKRGRPPKL 409
+VK KRGRPR K S +SD E R KL R S DR S L + RGRPPK
Sbjct: 243 QVKRKRGRPR-KVQISSQSD-ESRPNTNCKLARTPELSSQSSVDRIS--LSRLRGRPPKT 298
Query: 410 QGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPEK-RHGGTEFNAQRFAP 468
+ + L + KG + NG R + KR P P+K R G + A
Sbjct: 299 KETSVSLY--IEKGPESNGRRMVR--------KRGRPP---TPQKKRKSGMTDESDWKAK 345
Query: 469 DKKNSCAETGEAISRQTMKTVNQREKKCLETHQEET----LSKHGAKQLLRDRIVELLLA 524
+ C E SR ++ E+ E ++T S+ +K++L DRI++LLL
Sbjct: 346 KRLKLCESPLE--SRHNNPLIDD-ERMIGEQRSKQTEAGGHSRSKSKKMLSDRILQLLLT 402
Query: 525 AGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTP 584
AGW +EYRPRNGR Y DAVY+NPEGKTHWS+T AY V K E S TG F
Sbjct: 403 AGWTVEYRPRNGRAYQDAVYLNPEGKTHWSVTKAYQVYKKQLESNPNDQKNSTTGSGFGL 462
Query: 585 IPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRG 644
+PE++L +L++ I K RSD K++ K K TN
Sbjct: 463 LPEEDLHLLERTIQKKRSDTGKQRS--------------KLKDRDTNDIL---------- 498
Query: 645 IKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKW 704
VS GT + K +E RKRC R+S++ +S DG++ ++GK
Sbjct: 499 ------VSTKGTG---------KIKREEKHSRKRCTPSARSSLKDVDSKEDGYILFEGKR 543
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L WMID+ VPLN KVQ + +KT ++L+G I ++GIRC+CC E+F++ F+ H+
Sbjct: 544 TMLGWMIDSTIVPLNGKVQCMDCKKTDILLEGIITKEGIRCNCCDEVFSVLDFEVHAGGN 603
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
PF++LY E G+SLLQC+ +S NKQ ES+ KG+HFV+F DPNDDTCGICGDGGDLI
Sbjct: 604 RNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYHFVDFGSGDPNDDTCGICGDGGDLI 663
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCDGCPSTFHQ+CLDIKKFPSG W+C CSC+FC + +E+ H + S L +L C L
Sbjct: 664 CCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK-DEAAKH----ETSTLPSLSSCRL 718
Query: 885 CEEKY----------HQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGG 934
CEEK HQ+C DG V E S+ SFCGK CQE+FE L+ +GVKH L G
Sbjct: 719 CEEKCSKHYPHTLADHQACINQDGTVPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEG 778
Query: 935 YTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYN 994
++WS + RF++ ++++ D+ +K+ NA++AVA SVMDECF PL DHRSG+NL+ NI+YN
Sbjct: 779 FSWSFLRRFELPSEVADCDISEKIAYNAKMAVAFSVMDECFSPLVDHRSGVNLLQNIVYN 838
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
FGSNF RL++ F TA+LER DEII+ ASIRIHG +LAEMPFIGTR+MYRRQGMCRRL+
Sbjct: 839 FGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMD 898
Query: 1055 GIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKP 1114
GIESAL SL V+KL+IPA+ EL +TWTS FGF P+ S K+ ++N++LLVFPGVDML K
Sbjct: 899 GIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTIKNLNLLVFPGVDMLGKS 958
Query: 1115 MMKNQFPRENMISAKGL 1131
++K + + S GL
Sbjct: 959 LVKEKITDSVVSSPNGL 975
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/812 (46%), Positives = 493/812 (60%), Gaps = 81/812 (9%)
Query: 408 KLQGINEVLKGKVGKGKKVNGIRKSQRHT----------LAVGLKRDVPTYGLI------ 451
++QG N VLK V K K V+G H LKR V +
Sbjct: 237 RVQGKNGVLKVMVNKKKNVSGASDIYEHRKLEESRKGLRTEDTLKRKVLVSPSLHPETKP 296
Query: 452 ---------PEKRHGGTEFNAQRFAPDKKNSCAETGE-AISRQTMKTVNQREKK------ 495
PEK H T+F + K S ++G+ ++S + K V + +K
Sbjct: 297 NVKQDLFSKPEKDH--TDFQTSASTKNVKGSSWDSGDGSVSLKARKKVVEAQKSTKKAAC 354
Query: 496 ------CLET---HQEETLSKHGA---KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAV 543
C +T +E K G+ KQ LR+RI +LL AGWKI+YRPR R+Y DAV
Sbjct: 355 EVEKVPCEDTPPSTAKEGKVKRGSGTEKQKLRERIRGMLLGAGWKIDYRPRRNRDYLDAV 414
Query: 544 YVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSIL---------K 594
YVNP G +WSI AY L+ EG + G +FTPI +D LS L K
Sbjct: 415 YVNPTGTAYWSIIKAYDALQKQL-NEGAEAKPIADG-SFTPISDDILSQLTRKTRKKIEK 472
Query: 595 KVINKSRSDRNKKKKG-----KNLGTDGEI-VTKKKKKKGKTNSAASPHGKSQKRGIK-- 646
+ NK R D + + ++ GT ++ + K +S GKS K +
Sbjct: 473 EWKNKRRDDSDSENAKDASALRSAGTKNDMDSMDSDSNEEKLSSFIKQGGKSLKNKLNDN 532
Query: 647 GKPSVSEGGTSHNGMSIPA-------RRHKLQETQQRKRCALLVRNSVEGEESNGDGFVA 699
G PSV+ G + + S A ++ ++ ++ LLVR S G +S DG+V
Sbjct: 533 GLPSVNSKGQTSSKYSRDAIVKSSSGSNSRVLHGRKGRKLGLLVRGSSRGLDSENDGYVP 592
Query: 700 YDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDT 759
Y GK TLL+W+ID+GTV L++KV+Y N+R+TRVML+G I RDGI C CCS+I T+SKF+
Sbjct: 593 YTGKRTLLSWLIDSGTVQLSQKVRYMNRRQTRVMLEGWITRDGIHCGCCSKILTVSKFEI 652
Query: 760 HSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGD 819
H+ SKL PFQN++ ESG SLLQC D+WN+Q+ESK FH V DG+DPNDDTCGICGD
Sbjct: 653 HAGSKLRQPFQNIFLESGLSLLQCQRDAWNRQEESKLLSFHTVEIDGDDPNDDTCGICGD 712
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTL 879
GGDLICCDGCPSTFHQ+CLDI P G WHC C+C++CG + C D++++S +
Sbjct: 713 GGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCTCKYCGVASIDICQ---GDNTSVSEI 769
Query: 880 QICSLCEEKYHQSCS-QTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWS 938
C LCE+K+H+SC+ + D V SFCGK C+E+FE L+K LGVKH+L+ G++WS
Sbjct: 770 STCILCEKKFHESCNLEMDTPVHSSGLVTSFCGKSCRELFESLQKNLGVKHELDAGFSWS 829
Query: 939 LVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSN 998
L+ R +D+S+ + Q++E N++LAVAL+VMDECFLP+ D RSGINLIHN+LYN GSN
Sbjct: 830 LIRRTSEDSDVSVRGLSQRIESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSN 889
Query: 999 FKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIES 1058
F RLNY GF+TAILER DEIISAA+IR HG +LAEMPFIGTRH+YRRQGMCRRL IES
Sbjct: 890 FYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIES 949
Query: 1059 ALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKN 1118
AL VEKLIIPAI+EL TW +FGF PLE S KQ+MR M++LVFPG DMLQK +++
Sbjct: 950 ALRVFKVEKLIIPAIAELMHTWNVIFGFSPLEPSLKQEMRLMNMLVFPGTDMLQKLLIQE 1009
Query: 1119 QFPRENMISAKGL-----RSSELENPRTADEV 1145
EN + G RS+E +P+ E
Sbjct: 1010 TIVEENTSNGSGAKQTDCRSTEFSSPKMETET 1041
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/812 (46%), Positives = 493/812 (60%), Gaps = 81/812 (9%)
Query: 408 KLQGINEVLKGKVGKGKKVNGIRKSQRHT----------LAVGLKRDVPTYGLI------ 451
++QG N VLK V K K V+G H LKR V +
Sbjct: 119 RVQGKNGVLKVMVNKKKNVSGASDIYEHRKLEESRKGLRTEDTLKRKVLVSPSLHPETKP 178
Query: 452 ---------PEKRHGGTEFNAQRFAPDKKNSCAETGE-AISRQTMKTVNQREKK------ 495
PEK H T+F + K S ++G+ ++S + K V + +K
Sbjct: 179 NVKQDLFSKPEKDH--TDFQTSASTKNVKGSSWDSGDGSVSLKARKKVVEAQKSTKKAAC 236
Query: 496 ------CLET---HQEETLSKHGA---KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAV 543
C +T +E K G+ KQ LR+RI +LL AGWKI+YRPR R+Y DAV
Sbjct: 237 EVEKVPCEDTPPSTAKEGKVKRGSGTEKQKLRERIRGMLLGAGWKIDYRPRRNRDYLDAV 296
Query: 544 YVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSIL---------K 594
YVNP G +WSI AY L+ EG + G +FTPI +D LS L K
Sbjct: 297 YVNPTGTAYWSIIKAYDALQKQL-NEGAEAKPIADG-SFTPISDDILSQLTRKTRKKIEK 354
Query: 595 KVINKSRSDRNKKKKG-----KNLGTDGEI-VTKKKKKKGKTNSAASPHGKSQKRGIK-- 646
+ NK R D + + ++ GT ++ + K +S GKS K +
Sbjct: 355 EWKNKRRDDSDSENAKDASALRSAGTKNDMDSMDSDSNEEKLSSFIKQGGKSLKNKLNDN 414
Query: 647 GKPSVSEGGTSHNGMSIPA-------RRHKLQETQQRKRCALLVRNSVEGEESNGDGFVA 699
G PSV+ G + + S A ++ ++ ++ LLVR S G +S DG+V
Sbjct: 415 GLPSVNSKGQTSSKYSRDAIVKSSSGSNSRVLHGRKGRKLGLLVRGSSRGLDSENDGYVP 474
Query: 700 YDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDT 759
Y GK TLL+W+ID+GTV L++KV+Y N+R+TRVML+G I RDGI C CCS+I T+SKF+
Sbjct: 475 YTGKRTLLSWLIDSGTVQLSQKVRYMNRRQTRVMLEGWITRDGIHCGCCSKILTVSKFEI 534
Query: 760 HSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGD 819
H+ SKL PFQN++ ESG SLLQC D+WN+Q+ESK FH V DG+DPNDDTCGICGD
Sbjct: 535 HAGSKLRQPFQNIFLESGLSLLQCQRDAWNRQEESKLLSFHTVEIDGDDPNDDTCGICGD 594
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTL 879
GGDLICCDGCPSTFHQ+CLDI P G WHC C+C++CG + C D++++S +
Sbjct: 595 GGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCTCKYCGVASIDICQ---GDNTSVSEI 651
Query: 880 QICSLCEEKYHQSCS-QTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWS 938
C LCE+K+H+SC+ + D V SFCGK C+E+FE L+K LGVKH+L+ G++WS
Sbjct: 652 STCILCEKKFHESCNLEMDTPVHSSGLVTSFCGKSCRELFESLQKNLGVKHELDAGFSWS 711
Query: 939 LVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSN 998
L+ R +D+S+ + Q++E N++LAVAL+VMDECFLP+ D RSGINLIHN+LYN GSN
Sbjct: 712 LIRRTSEDSDVSVRGLSQRIESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSN 771
Query: 999 FKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIES 1058
F RLNY GF+TAILER DEIISAA+IR HG +LAEMPFIGTRH+YRRQGMCRRL IES
Sbjct: 772 FYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIES 831
Query: 1059 ALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKN 1118
AL VEKLIIPAI+EL TW +FGF PLE S KQ+MR M++LVFPG DMLQK +++
Sbjct: 832 ALRVFKVEKLIIPAIAELMHTWNVIFGFSPLEPSLKQEMRLMNMLVFPGTDMLQKLLIQE 891
Query: 1119 QFPRENMISAKGL-----RSSELENPRTADEV 1145
EN + G RS+E +P+ E
Sbjct: 892 TIVEENTSNGSGAKQTDCRSTEFSSPKMETET 923
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 670 bits (1729), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/677 (48%), Positives = 455/677 (67%), Gaps = 49/677 (7%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR+RI E+LL +GW I+YRPR R+Y DAVY+NP G +WSI AY L+ ++
Sbjct: 387 KQKLRERIREMLLDSGWTIDYRPRRNRDYLDAVYINPAGTAYWSIIKAYEALQKQLNEDA 446
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINK-------------SRSDRNKKKKGKNLG--- 614
+ +F PI ++ L+ L + K S SD K+ + ++
Sbjct: 447 NEAKPKGDSSSFAPIADEVLNQLTRKTRKKMEKELKKKKKYDSESDNEKEPQIRSASHKR 506
Query: 615 ----TDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKL 670
TDG+ +K K S S K + I PS + T+H+G I
Sbjct: 507 DMNSTDGDNNEEKLSSFIKQGSK-SMKNKMFENTIISAPSKIQNATNHSGDGIEKSLFGC 565
Query: 671 ------QETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQY 724
+++++ RC LLVR+S +G S DGFV Y GK T+LAW+ID+GTV L++KVQY
Sbjct: 566 DPQIHGRKSKKHGRCTLLVRSSNKGSNSESDGFVPYTGKRTVLAWLIDSGTVELSQKVQY 625
Query: 725 WNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCI 784
+R+ +VML+G I RDGI C CCS+I T+SKF+ H+ SKL P+QN+Y ESG SLLQC
Sbjct: 626 --RRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAGSKLPQPYQNIYLESGVSLLQCQ 683
Query: 785 LDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFP 844
+D+WN+Q+ +++ GFH V+ DG DPNDDTCGICGDGGDLICCDGCPSTFHQ+CLDI+ P
Sbjct: 684 IDAWNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIQMLP 743
Query: 845 SGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAV--QY 902
G+W C+ C+C+FCG + ++ ++DD+++ L IC+LCE+KYH SC++ +
Sbjct: 744 PGEWRCMNCTCKFCGIASGTS----EKDDASVCVLHICNLCEKKYHDSCTKEMDTLPNNI 799
Query: 903 EPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNA 962
SSLSFCGK+C+E+ E L+K LG KH+LE G++WSL+HR D ++ + + Q+VECN+
Sbjct: 800 NSSSLSFCGKECKELSEHLKKYLGTKHELESGFSWSLIHRTDDDSEAACRGISQRVECNS 859
Query: 963 RLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAA 1022
+LA+ L+VMDECFLP+ D RSGINLI N+LYN GSNF RL+Y GF+TAILER DEII+AA
Sbjct: 860 KLAITLTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLSYSGFYTAILERGDEIIAAA 919
Query: 1023 SIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS 1082
SIR HG ++AEMPFIGTRH+YRRQGMCRRL + IES LCSL VEKL+IPAI+E+ TWT+
Sbjct: 920 SIRFHGTQIAEMPFIGTRHIYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAEVTNTWTT 979
Query: 1083 VFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM-----------MKNQ---FPRENMISA 1128
VFGF L+ S +Q+M++++++VFPG+DMLQK + M+N+ F + M S
Sbjct: 980 VFGFTHLDKSLRQEMKSLNMMVFPGIDMLQKLLVEQGNHEGSEKMENEDDDFIKTKMESR 1039
Query: 1129 KGLRSSELENPRTADEV 1145
+ SS ++P +D+V
Sbjct: 1040 SDVGSSTPQDPHGSDDV 1056
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 665 bits (1717), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/870 (42%), Positives = 518/870 (59%), Gaps = 77/870 (8%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR++I E+LL +GW I+YRPR R+Y DAVY+NP G +WSI AY L+ +
Sbjct: 384 KQKLREQIREMLLNSGWTIDYRPRRNRDYLDAVYINPAGTAYWSIIKAYDALQKQSNDDA 443
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEI-------VTKK 623
+F PI ++ LS L + K+R K+ K K D E + +
Sbjct: 444 DEVKPKGDSSSFAPIADEVLSQLTR---KTRKKMEKELKKKKKRHDSESDNEKEPQIRRS 500
Query: 624 KKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQ-------- 675
K NS S + + + + S SI AR T Q
Sbjct: 501 ASHKRDMNSMDSDSNEEKLSSFIKQGNRSMKNKMFENTSISARSKIQNATHQSSDGIEKP 560
Query: 676 ----------RK-----RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNE 720
RK RC LLVR+S +G S DGFV Y GK T+LAW+ID+GTV L++
Sbjct: 561 LFGCDPHIHGRKSKKHGRCTLLVRSSNKGSNSESDGFVPYMGKRTVLAWLIDSGTVELSQ 620
Query: 721 KVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSL 780
KVQY +R+ +VML+G I RDGI C CCS+I T+SKF+ H+ SKL P+QN+Y ESG SL
Sbjct: 621 KVQY--RRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAGSKLPQPYQNIYLESGVSL 678
Query: 781 LQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDI 840
LQC +D+WN+Q+ +++ GFH V+ DG DPNDDTCGICGDGGDLICCDGCPSTFHQ+CLDI
Sbjct: 679 LQCQIDAWNRQEHAEKIGFHSVDIDGNDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 738
Query: 841 KKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAV 900
+ P G+WHC C+C+FCG +E++ D+DD++++ L+ C LCE+KYH SC++ +
Sbjct: 739 QMLPPGEWHCPNCTCKFCGIASETS----DKDDASVNVLRTCILCEKKYHDSCTKEMDTL 794
Query: 901 --QYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKV 958
SSLSFCGK+C+E+ E L+K LG KH+LE G++W L+HR D ++ + + Q+V
Sbjct: 795 PNNINSSSLSFCGKECKELSEYLKKYLGTKHELEAGFSWCLIHRSDEDSEAACRGLTQRV 854
Query: 959 ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
ECN++LA+AL+VMDECFLP+ D RSGINLI NILYN GSNF RL+Y GF+TAILER DEI
Sbjct: 855 ECNSKLAIALTVMDECFLPVIDRRSGINLIRNILYNSGSNFSRLSYSGFYTAILERGDEI 914
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
I+AASIR HG ++AEMPFIGTRH+YRRQGMCRRL + IE ALCSL VEKL+IPA++EL
Sbjct: 915 IAAASIRFHGTKIAEMPFIGTRHIYRRQGMCRRLFSAIELALCSLKVEKLVIPAVAELTH 974
Query: 1079 TWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMK--------------NQFPREN 1124
TWT+VFGF L+ S +Q+M++++++VFPG+DMLQK +++ N F +
Sbjct: 975 TWTTVFGFTYLDESLRQEMKSLNMMVFPGIDMLQKLLVEQGNHEGSEKMENGDNDFIKTK 1034
Query: 1125 MISAKGLRSSELENPRTADEVGKN-----SDEKYSAGFDLNACINAAAPHVCKIHDK--- 1176
M + + SS ++ +D+V N +DE A +LN + K H +
Sbjct: 1035 MGNKSDMGSSTPQDSHGSDDVSSNPANETNDECSDASQELNNQVLVDGIICSKSHSEEMM 1094
Query: 1177 SAPGESTLQFPNGSTHDASGLTSETVNFP-----ESTTDTNCIDQLGVTSNDLQANDKIA 1231
S P P+ +++ + ++ P +S+T I + TS D +
Sbjct: 1095 SDPISDKCDSPSRTSNSELEMKNKVAAAPPVDRLDSSTKCQSISPID-TSVSSHPVDILK 1153
Query: 1232 VNTL---GSPSDADEQTEDTDDPNASSLAVKSIASEVQIEYGSAKQSLNSCDEASAQQSA 1288
V L + SD + E+ D SS A+ +SE+ I LNS + +
Sbjct: 1154 VQALVQETTYSDPCSEEENLDKKCHSSTAMNCDSSELDI-----NPVLNSQMADNTLPTK 1208
Query: 1289 EIIKHQSLGFLSELNVSSENVVAHDSVSID 1318
E+ + +L + N+S +N+ ++ ++D
Sbjct: 1209 EVCMNDTLEVVPSGNISEDNITKRNNRNVD 1238
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/519 (59%), Positives = 397/519 (76%), Gaps = 4/519 (0%)
Query: 673 TQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRV 732
TQ R+RCALLVRN+ E +S+ DG++ Y+GK TLLAWMID G + L+EKVQY NQRKTRV
Sbjct: 502 TQNRQRCALLVRNTEETADSSNDGYLLYNGKRTLLAWMIDLGILSLDEKVQYMNQRKTRV 561
Query: 733 MLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQD 792
L+GR+ RDGI C CC E+ TISKF+ H+ S++ P +N+Y +GSSLLQC+L+SWNKQ+
Sbjct: 562 KLEGRLTRDGIHCSCCDEVITISKFEMHAGSRVGQPLENIYVHTGSSLLQCLLESWNKQN 621
Query: 793 ESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVY 852
E + KG++FV+ D EDPNDDTCGICGDGGDLICCD CPSTFHQ+CLDIKKFPSG WHC+Y
Sbjct: 622 EPQCKGYNFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLY 681
Query: 853 CSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGK 912
CSC+ CG++ ++D ++A L C LCEEKYH C Q + A + ++ FCGK
Sbjct: 682 CSCKVCGQVTIGLHPMDDHHEAAADVLCKCDLCEEKYHPICVQMNNASGDDVNNPLFCGK 741
Query: 913 KCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSL-SDVCQKVECNARLAVALSVM 971
KCQ + ERL++LLGV+ D++ G++W+L+ R DV +D+SL ++V QK++CN+ LAVAL VM
Sbjct: 742 KCQMLHERLQRLLGVRQDMKEGFSWTLIRRSDVDSDVSLCNEVAQKIKCNSELAVALFVM 801
Query: 972 DECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKEL 1031
DECFLP+ DHRSGINLIHNILYN GSNF RLN+ GF+TAILE+DDE+I AAS+RIHG EL
Sbjct: 802 DECFLPVIDHRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNEL 861
Query: 1032 AEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEV 1091
AEMPFIGTR+MYRRQGMCRR L+ IES L SLNVEKL+IPAISE+R+TW SVFGF+PL+
Sbjct: 862 AEMPFIGTRYMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDE 921
Query: 1092 SSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKG--LRSSELENPRTADEVGKNS 1149
++KQ+MR MSLLVFPGV+MLQK ++K+ P E +G +S EL +T + V +
Sbjct: 922 TTKQRMRKMSLLVFPGVEMLQKLLLKDHLPMECTTLGEGSISKSPELSEHQTLEVVANSP 981
Query: 1150 DEKYSAGFDLNACINAAAPHVCKIHDKSAPGESTLQFPN 1188
+E+ S LN+C A I A ES+++ PN
Sbjct: 982 EERGSPCSCLNSCSEGTAQDGMGISGDPAVIESSVK-PN 1019
Score = 180 bits (456), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 174/346 (50%), Gaps = 60/346 (17%)
Query: 289 MQMEVSNETNGAK-----GRKKVKRKRGRPPKMLENDESEGEQTKIS-------KVKPGR 336
M++E +E G++ G +KVKRKRGRP K + E KI K K GR
Sbjct: 167 MKLERRSEEQGSEQQLFSGDQKVKRKRGRPRKTEKEVEEVVVSPKIVVSPMKKLKRKRGR 226
Query: 337 PPKLNKIDES---CEQRRKKVKDKRGRPRHKTHESGESDGEQRKRLKNKLGRPRKDGSDD 393
PPKL CE R KK+K KRGRPR E+ S ++ N L
Sbjct: 227 PPKLESEKNHQFVCELRNKKLKRKRGRPRKIDKENDNSLFDELNSELNTL---------- 276
Query: 394 RGSKRLKKKRGRPPKLQGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPE 453
+ K+ RGRPPKLQ N LK + +G+KV RK L ++ +VPT L +
Sbjct: 277 ----KPKRGRGRPPKLQKSNGALKNEHTEGRKVRLARKLS-MKLRNKVRSNVPTDRLSSD 331
Query: 454 KRH------------GGTEFNAQRFAPD-------KKNSCAETGEAISRQTMKTVNQREK 494
KRH G + + + P+ K SC E + + + + E
Sbjct: 332 KRHIRKEIHMKKTLQAGNDLSQEILEPEATLTASSKVISCGEKTKKVKKVKKPKIEVDEC 391
Query: 495 KCLETHQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWS 554
K + AK LLR+RI E+L AGW ++YRPR REY DAVYV+PEG+THWS
Sbjct: 392 K-----------RSIAKNLLRERITEILKTAGWTVQYRPRFNREYKDAVYVSPEGRTHWS 440
Query: 555 ITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKS 600
ITLAY+VLK HYE+ G S KTGF FTPIP++E+ L + ++ S
Sbjct: 441 ITLAYNVLKRHYEEGDGDSTVYKTGFIFTPIPDEEIMTLTRDVHNS 486
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/676 (48%), Positives = 453/676 (67%), Gaps = 47/676 (6%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR+RI E+LL +GW I+YRPR R+Y DAVY+NP G +WSI AY L+ ++
Sbjct: 402 KQKLRERIREMLLNSGWTIDYRPRRNRDYLDAVYINPAGTAYWSIIKAYEALQKQLNEDA 461
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINK-------------SRSDRNKKKKGKNLGTDG 617
+ +F PI ++ L+ L + K S SD K+ + ++
Sbjct: 462 NEAKPKGDSSSFAPIADEVLNQLTRKTRKKMEKELKKKKKYDSESDNEKEPQIRSASNKR 521
Query: 618 EI-VTKKKKKKGKTNSAASPHGKSQKR-----GIKGKPSVSEGGTSHNGMSIPARRHKL- 670
++ T + K +S KS K I S + T+H+G I +
Sbjct: 522 DLNSTDGDNNEEKLSSFIKQGSKSMKNKMFEDTIMSASSKIQNATNHSGDGIEKSLFECD 581
Query: 671 -----QETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYW 725
+++++ RC LLVR+S +G S DGFV Y GK T+L+W+ID+GTV L++KVQY
Sbjct: 582 PQIHGRKSKKHGRCTLLVRSSKKGSNSESDGFVPYMGKRTVLSWLIDSGTVELSQKVQY- 640
Query: 726 NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCIL 785
+R+ +VML+G I RDGI C CCS+I T+SKF+ H+ SKL P+QN+Y ESG SLLQC +
Sbjct: 641 -RRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAGSKLPQPYQNIYLESGVSLLQCQI 699
Query: 786 DSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPS 845
++WN+Q+ S++ FH V+ DG+DPNDDTCGICGDGGDLICCDGCPSTFHQ+CLDI+ P
Sbjct: 700 EAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIQMLPL 759
Query: 846 GKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAV--QYE 903
G+WHC C+C+FCG + ++ ++DD+++ LQIC+LCE+KYH SC++ +
Sbjct: 760 GEWHCPNCTCKFCGIASGNS----EKDDASVYVLQICNLCEKKYHDSCTKEMDNLPNNIN 815
Query: 904 PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNAR 963
SSLSFCGK+C+E+ E L+K LG KH+LE G++WSL+HR D ++ + + Q+VECN++
Sbjct: 816 TSSLSFCGKECKELSEHLKKYLGTKHELEAGFSWSLIHRIDEDSEAACRGISQRVECNSK 875
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
LA+AL+VMDECFLP+ D RSGINLI N+LYN GSNF RLNY GF+TA LER DEII++AS
Sbjct: 876 LAIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLNYSGFYTATLERGDEIIASAS 935
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
IR HG ++AEMPFIGTRHMYRRQGMCRRL + IES LCSL VEKL+IPAI+EL TWT+V
Sbjct: 936 IRFHGTQIAEMPFIGTRHMYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAELTNTWTTV 995
Query: 1084 FGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMK--------------NQFPRENMISAK 1129
FGF L+ S +Q+M++++++VFPG+DML KP+ + N F + M +
Sbjct: 996 FGFTHLDESLRQEMKSLNMMVFPGIDMLMKPLAEQGNHEGSEKLENGDNDFIKTKMENKS 1055
Query: 1130 GLRSSELENPRTADEV 1145
+ SS ++P +D++
Sbjct: 1056 DMGSSTPQDPHGSDDI 1071
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/801 (45%), Positives = 478/801 (59%), Gaps = 86/801 (10%)
Query: 353 KVKDKRGRPRHKTHESGESDGEQRKRLKNKLGR-PRKDGSDDRGSKRLKKKRGRPPKLQG 411
+VK KRGRPR K S +SD E R KL R P L + RGRPPK +
Sbjct: 90 QVKRKRGRPR-KVQISSQSD-ESRPNTNCKLARTPELSSQSSVDRISLSRLRGRPPKTKE 147
Query: 412 INEVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPEK-RHGGTEFNAQRFAPDK 470
+ L + KG + NG R + KR P P+K R G + A +
Sbjct: 148 TSVSLY--IEKGPESNGRRMVR--------KRGRPP---TPQKKRKSGMTDESDWKAKKR 194
Query: 471 KNSCAETGEAISRQTMKTVNQREKKCLETHQEET----LSKHGAKQLLRDRIVELLLAAG 526
C E SR ++ E+ E ++T S+ +K++L DRI++LLL AG
Sbjct: 195 LKLCESPLE--SRHNNPLIDD-ERMIGEQRSKQTEAGGHSRSKSKKMLSDRILQLLLTAG 251
Query: 527 WKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIP 586
W +EYRPRNGR Y DAVY+NPEGKTHWS+T AY V K E S TG F +P
Sbjct: 252 WTVEYRPRNGRAYQDAVYLNPEGKTHWSVTKAYQVYKKQLESNPNDQKNSTTGSGFGLLP 311
Query: 587 EDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIK 646
E++L +L++ I K RSD K++ K K TN
Sbjct: 312 EEDLHLLERTIQKKRSDTGKQRS--------------KLKDRDTNDIL------------ 345
Query: 647 GKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTL 706
VS GT + K +E RKRC R+S++ +S DG++ ++GK T+
Sbjct: 346 ----VSTKGTG---------KIKREEKHSRKRCTPSARSSLKDVDSKEDGYILFEGKRTM 392
Query: 707 LAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLC 766
L WMID+ VPLN KVQ + +KT ++L+G I ++GIRC+CC E+F++ F+ H+
Sbjct: 393 LGWMIDSTIVPLNGKVQCMDCKKTDILLEGIITKEGIRCNCCDEVFSVLDFEVHAGGNRN 452
Query: 767 HPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICC 826
PF++LY E G+SLLQC+ +S NKQ ES+ KG+HFV+F DPNDDTCGICGDGGDLICC
Sbjct: 453 QPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYHFVDFGSGDPNDDTCGICGDGGDLICC 512
Query: 827 DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCE 886
DGCPSTFHQ+CLDIKKFPSG W+C CSC+FC + +E+ H + S L +L C LCE
Sbjct: 513 DGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK-DEAAKH----ETSTLPSLSSCRLCE 567
Query: 887 EKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVS 946
EKYHQ+C DG V E S+ SFCGK CQE+FE L+ +GVKH L G++WS + RF++
Sbjct: 568 EKYHQACINQDGTVPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRRFELP 627
Query: 947 TDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKG 1006
++++ D+ +K+ NA++AVA SVMDECF PL DHRSG+NL+ NI+YNFGSNF RL++
Sbjct: 628 SEVADCDISEKIAYNAKMAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRLDFSS 687
Query: 1007 FFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVE 1066
F TA+LER DEII+ ASIRIHG +LAEMPFIGTR+MYRRQGMCRRL+ GIES + +
Sbjct: 688 FLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESFVAYFSQM 747
Query: 1067 KLIIPAISELRETWT----------------SVFGFQPLEVSSKQKMRNMSLLVFPGVDM 1110
L I + L + W S FGF P+ S K+ ++N++LLVFPGVDM
Sbjct: 748 FLAISEV--LLDVWQFCCYPACFGDGPFCFFSGFGFAPVNDSEKKTIKNLNLLVFPGVDM 805
Query: 1111 LQKPMMKNQFPRENMISAKGL 1131
L K ++K + + S GL
Sbjct: 806 LGKSLVKEKITDSVVSSPNGL 826
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 363/813 (44%), Positives = 480/813 (59%), Gaps = 100/813 (12%)
Query: 353 KVKDKRGRPRHKTHESGESDGEQRKRLKNKLGRP---RKDGSDDRGSKRLKKKRGRPPKL 409
+VK KRGRPR K S +SD E R KL R S DR S L + RGRPPK
Sbjct: 243 QVKRKRGRPR-KVQISSQSD-ESRPNTNCKLARTPELSSQSSVDRIS--LSRLRGRPPKT 298
Query: 410 QGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLKRDVPTYGLIPEK-RHGGTEFNAQRFAP 468
+ + L + KG + NG R + KR P P+K R G + A
Sbjct: 299 KETSVSLY--IEKGPESNGRRMVR--------KRGRPP---TPQKKRKSGMTDESDWKAK 345
Query: 469 DKKNSCAETGEAISRQTMKTVNQREKKCLETHQEET----LSKHGAKQLLRDRIVELLLA 524
+ C E SR ++ E+ E ++T S+ +K++L DRI++LLL
Sbjct: 346 KRLKLCESPLE--SRHNNPLIDD-ERMIGEQRSKQTEAGGHSRSKSKKMLSDRILQLLLT 402
Query: 525 AGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFTFTP 584
AGW +EYRPRNGR Y DAVY+NPEGKTHWS+T AY V K E S TG F
Sbjct: 403 AGWTVEYRPRNGRAYQDAVYLNPEGKTHWSVTKAYQVYKKQLESNPNDQKNSTTGSGFGL 462
Query: 585 IPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRG 644
+PE++L +L++ I K RSD K++ K K TN
Sbjct: 463 LPEEDLHLLERTIQKKRSDTGKQRS--------------KLKDRDTNDIL---------- 498
Query: 645 IKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKW 704
VS GT + K +E RKRC R+S++ +S DG++ ++GK
Sbjct: 499 ------VSTKGTG---------KIKREEKHSRKRCTPSARSSLKDVDSKEDGYILFEGKR 543
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L WMID+ VPLN KVQ + +KT ++L+G I ++GIRC+CC E+F++ F+ H+
Sbjct: 544 TMLGWMIDSTIVPLNGKVQCMDCKKTDILLEGIITKEGIRCNCCDEVFSVLDFEVHAGGN 603
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
PF++LY E G+SLLQC+ +S NKQ ES+ KG+HFV+F DPNDDTCGICGDGGDLI
Sbjct: 604 RNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYHFVDFGSGDPNDDTCGICGDGGDLI 663
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCDGCPSTFHQ+CLDIKKFPSG W+C CSC+FC + +E+ H + S L +L C L
Sbjct: 664 CCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK-DEAAKH----ETSTLPSLSSCRL 718
Query: 885 CEEKY----------HQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGG 934
CEEK HQ+C DG V E S+ SFCGK CQE+FE L+ +GVKH L G
Sbjct: 719 CEEKCSKHYPHTLADHQACINQDGTVPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEG 778
Query: 935 YTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYN 994
++WS + RF++ ++++ D+ +K+ NA++AVA SVMDECF PL DHRSG+NL+ NI+YN
Sbjct: 779 FSWSFLRRFELPSEVADCDISEKIAYNAKMAVAFSVMDECFSPLVDHRSGVNLLQNIVYN 838
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
FGSNF RL++ F TA+LER DEII+ ASIRIHG +LAEMPFIGTR+MYRRQGMCRRL+
Sbjct: 839 FGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMD 898
Query: 1055 GIESALCSLNVEKLIIPAISELRETWT----------------SVFGFQPLEVSSKQKMR 1098
GIES + + L I + L + W S FGF P+ S K+ ++
Sbjct: 899 GIESFVAYFSQMFLAISEV--LLDVWQFCCYPACFGDGPFCFFSGFGFAPVNDSEKKTIK 956
Query: 1099 NMSLLVFPGVDMLQKPMMKNQFPRENMISAKGL 1131
N++LLVFPGVDML K ++K + + S GL
Sbjct: 957 NLNLLVFPGVDMLGKSLVKEKITDSVVSSPNGL 989
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 313/631 (49%), Positives = 417/631 (66%), Gaps = 23/631 (3%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR+RI +L AGW I+Y+PR + Y DAVYVNP G +WSI AY L + EG
Sbjct: 295 KQRLRERIKGMLTDAGWTIDYKPRRNQSYLDAVYVNPSGTAYWSIIKAYDALLKQLKDEG 354
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVT----KKKKK 626
+ K + ED ++ L + K+R++ KK K + G+D E +
Sbjct: 355 VDARPRKDTAAVASVSEDIVNKLARKAKKTRTEMTKKWKKNSSGSDSENKSDGGAYTDTS 414
Query: 627 KGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARR-------HKLQETQQRK-- 677
+ + S+ GKS K+G G S + + R H L + +K
Sbjct: 415 EERIRSSIKLGGKSTKKGRNGIDWDELHTKSKRSLYYKSARPSSGSDSHYLHGRKSKKIG 474
Query: 678 RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGR 737
RC LLVR+S + + DGF Y GK TLL+W+I++G V L +KVQY +R +VML+G
Sbjct: 475 RCTLLVRSSKDKKNPAIDGFNPYSGKRTLLSWLIESGVVQLRQKVQYMKRRGAKVMLEGW 534
Query: 738 IARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRK 797
I R+GI CDCCS+I T+S+F+ H+ SK C PFQN+Y ESG+SLLQC + +WN Q ++
Sbjct: 535 ITREGIHCDCCSKILTVSRFEIHAGSKACQPFQNIYLESGASLLQCQVRAWNMQKDATNV 594
Query: 798 GFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF 857
G H V+ DG+DPNDD CGICGDGGDLICCDGCPST+HQ CL ++ PSG WHC C+C+F
Sbjct: 595 GLHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHCPNCTCKF 654
Query: 858 CGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQE 916
C ++ +D + LS L CS+CE +YHQ C S VQ S+ SFCG KC E
Sbjct: 655 C----DAAVASGGKDGNFLSLLS-CSMCERRYHQLCLSDEAQKVQSFGSASSFCGPKCLE 709
Query: 917 IFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFL 976
+FE+L+K LGVK+++EGGY+WSL+HR D +D++ Q++E N++LAV L++MDECFL
Sbjct: 710 LFEKLQKYLGVKNEIEGGYSWSLIHRVDTDSDINSQLSAQRIENNSKLAVGLAIMDECFL 769
Query: 977 PLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPF 1036
P+ D RSG+NLI N+LYN GSNF R+NY GF+TAILER DEIISAAS+R HG +LAEMPF
Sbjct: 770 PIVDRRSGVNLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGTQLAEMPF 829
Query: 1037 IGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQK 1096
IGTRH+YRRQGMCRRL IESA+ SL VEKL+IPAI + WT FGF PL+ S +++
Sbjct: 830 IGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSVRKE 889
Query: 1097 MRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
MR+++ LVFPG+DMLQKP++ +EN+I+
Sbjct: 890 MRSLNTLVFPGIDMLQKPLLH----KENIIA 916
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 617 bits (1591), Expect = e-173, Method: Compositional matrix adjust.
Identities = 335/760 (44%), Positives = 463/760 (60%), Gaps = 57/760 (7%)
Query: 406 PPKLQGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLK-RDVPTYGLIPEKRHGGTEFNAQ 464
P ++QGIN VLK KV ++ +TLA + RD Y P R N
Sbjct: 193 PMRVQGINGVLKVKV----------NNKTNTLAASINPRDAEIYERPPSSRKAQRRENVV 242
Query: 465 RFAPDKKNSCAETGEAISRQTMKTVNQREKKCLETHQEETLSKHGA-------------- 510
P +K++ + M ++R+K ++E +K +
Sbjct: 243 VKPPFRKSNNVDNNSESEESDMSRKSKRKKSEYSKPKKEFNTKSKSTFPELVNPDVREER 302
Query: 511 ---------KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSV 561
KQ LR+RI +L AGW I+Y+PR + Y DAVYVNP G +WSI AY
Sbjct: 303 RGRRGGGTDKQRLRERIKGMLTDAGWTIDYKPRRNQSYLDAVYVNPSGTAYWSIIKAYDA 362
Query: 562 LKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVT 621
L + EG + K + E+ ++ L + K+RS+ KK K + G+D E +
Sbjct: 363 LLKQLKDEGVDARPRKDTAAVASVSEEIVNKLARKAKKTRSEMTKKWKQNSSGSDSENKS 422
Query: 622 K----KKKKKGKTNSAASPHGKSQKRGIKGKP-----SVSEGGTSHNGM--SIPARRHKL 670
+ + + S+ GKS K+G G S+ +N + S + H L
Sbjct: 423 EGGAYTDTSEERIRSSIKLGGKSTKKGRNGADWDELHKKSKRSLYYNNVRPSCGSDSHYL 482
Query: 671 --QETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQR 728
++T++ RC LLVR+S + + +GF Y GK TLL+W+I++G V L +KVQY +R
Sbjct: 483 HGRKTKKIGRCTLLVRSSKDKKNPAINGFNPYSGKRTLLSWLIESGVVQLRQKVQYMRRR 542
Query: 729 KTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSW 788
+VML+G I R+GI CDCCS+I T+S+F+ H+ SK C PFQN+Y ESG+SLLQC + +W
Sbjct: 543 GAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSCQPFQNIYLESGASLLQCQVRAW 602
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
N Q ++ H V+ DG+DPNDD CGICGDGGDLICCDGCPST+HQNCL ++ PSG W
Sbjct: 603 NMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDW 662
Query: 849 HCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDG-AVQYEPSSL 907
HC C+C+FC ++ +D +++S L C +CE +YHQ C + VQ S+
Sbjct: 663 HCPNCTCKFC----DAAVASGGKDGNSISLLS-CGMCERRYHQLCLNDEAHKVQSFGSAS 717
Query: 908 SFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVA 967
SFCG KC E+FE+L+K LGVK ++EGGY+WSL+HR D +D + Q++E N++LAV
Sbjct: 718 SFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQMSAQRIENNSKLAVG 777
Query: 968 LSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIH 1027
L++MDECFLP+ D RSG++LI N+LYN GSNF R+NY GF+TAILER DEIISAAS+R H
Sbjct: 778 LAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFH 837
Query: 1028 GKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQ 1087
G +LAEMPFIGTRH+YRRQGMCRRL IESA+ SL VEKL+IPAI + WT FGF
Sbjct: 838 GMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFT 897
Query: 1088 PLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
PL+ S +++MR+++ LVFPG+DMLQKP++ EN+I+
Sbjct: 898 PLDDSVRKEMRSLNTLVFPGIDMLQKPLLHE----ENIIA 933
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 616 bits (1588), Expect = e-173, Method: Compositional matrix adjust.
Identities = 335/760 (44%), Positives = 461/760 (60%), Gaps = 57/760 (7%)
Query: 406 PPKLQGINEVLKGKVGKGKKVNGIRKSQRHTLAVGLK-RDVPTYGLIPEKRHGGTEFNAQ 464
P ++QGIN VLK KV ++ +TLA + RD Y P R N
Sbjct: 193 PMRVQGINGVLKVKV----------NNKTNTLAASINPRDAEIYERPPSSRKAQRRENVV 242
Query: 465 RFAPDKKNSCAETGEAISRQTMKTVNQREKKCLETHQEETLSKHGA-------------- 510
P +K++ + M ++R+K ++E +K +
Sbjct: 243 VKPPFRKSNNVDNNSESEESDMSRKSKRKKSEYSKPKKEFNTKSKSTFPELVNPDVREER 302
Query: 511 ---------KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSV 561
KQ LR+RI +L AGW I+Y+PR + Y DAVYVNP G +WSI AY
Sbjct: 303 RGRRGGGTDKQRLRERIKGMLTDAGWTIDYKPRRNQSYLDAVYVNPSGTAYWSIIKAYDA 362
Query: 562 LKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVT 621
L + EG + K + E+ ++ L + K+RS+ KK K + G+D E +
Sbjct: 363 LLKQLKDEGVDARPRKDTAAVASVSEEIVNKLARKAKKTRSEMTKKWKQNSSGSDSENKS 422
Query: 622 K----KKKKKGKTNSAASPHGKSQKRGIKG-------KPSVSEGGTSHNGMSIPARRHKL 670
+ + + S+ GKS K+G G K S ++ S + H L
Sbjct: 423 EGGAYTDTSEERIRSSIKLGGKSTKKGRNGADWDELHKKSKRSLYYNNARPSCGSDSHYL 482
Query: 671 --QETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQR 728
++T++ RC LLVR+S + + +GF Y GK TLL+W+I++G V L +KVQY +R
Sbjct: 483 HGRKTKKIGRCTLLVRSSKDKKNPAINGFNPYSGKRTLLSWLIESGVVQLRQKVQYMRRR 542
Query: 729 KTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSW 788
+VML+G I R+GI CDCCS+I T+S+F+ H+ SK C PFQN+Y ESG+SLLQC + +W
Sbjct: 543 GAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSCQPFQNIYLESGASLLQCQVRAW 602
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
N Q ++ H V+ DG+DPNDD CGICGDGGDLICCDGCPST+HQNCL ++ PSG W
Sbjct: 603 NMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDW 662
Query: 849 HCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDG-AVQYEPSSL 907
HC C+C+FC ++ +D + +S L C +CE +YHQ C + VQ S+
Sbjct: 663 HCPNCTCKFC----DAAVASGGKDGNFISLLS-CGMCERRYHQLCLNDEAHKVQSFGSAS 717
Query: 908 SFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVA 967
SFCG KC E+FE+L+K LGVK ++EGGY+WSL+HR D +D + Q++E N++LAV
Sbjct: 718 SFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQMSAQRIENNSKLAVG 777
Query: 968 LSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIH 1027
L++MDECFLP+ D RSG++LI N+LYN GSNF R+NY GF+TAILER DEIISAAS+R H
Sbjct: 778 LAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFH 837
Query: 1028 GKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQ 1087
G +LAEMPFIGTRH+YRRQGMCRRL IESA+ SL VEKL+IPAI + WT FGF
Sbjct: 838 GMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFT 897
Query: 1088 PLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
PL+ S +++MR+++ LVFPG+DMLQKP++ EN+I+
Sbjct: 898 PLDDSVRKEMRSLNTLVFPGIDMLQKPLLHE----ENIIA 933
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 299/496 (60%), Positives = 373/496 (75%), Gaps = 9/496 (1%)
Query: 678 RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGR 737
RC LLVRNS +G DGFV Y GK TLL+W+ID+GTV L+EKVQY N+R+T+VML+G
Sbjct: 366 RCTLLVRNSGKGLNLETDGFVPYTGKRTLLSWLIDSGTVQLSEKVQYMNRRRTKVMLEGW 425
Query: 738 IARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRK 797
I RDGI C CCS+I T+SKF+ H+ SKL PFQN+ +SG SLLQC +D+WN+Q+ES+R
Sbjct: 426 ITRDGIHCRCCSKILTVSKFEIHAGSKLRQPFQNIVLDSGVSLLQCQVDAWNRQEESERS 485
Query: 798 GFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF 857
GFH ++ DG+DPNDDTCGICGDGGDLICCDGCPSTFHQ+CL+I+ PSG WHC C+C+F
Sbjct: 486 GFHPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCTCKF 545
Query: 858 CGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPS--SLSFCGKKCQ 915
CG + S N +DD+ +S L CSLCE+KYH SC Q AV + + S SFCG+ C+
Sbjct: 546 CGMADGS----NAEDDTTVSELVTCSLCEKKYHTSCIQGVDAVLSDTNNPSTSFCGQGCR 601
Query: 916 EIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECF 975
E+FE L+K +GVK +LE G++WSL+HR D +D S+ Q+VE N++LA+AL+VMDECF
Sbjct: 602 ELFEHLQKFIGVKQELEAGFSWSLIHRTDPGSDTSVRGFPQRVESNSKLAIALTVMDECF 661
Query: 976 LPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMP 1035
L + D RS INLIHN+LYN GSNF RLNY GF+TAILER DEII AASIRIHG +LAEMP
Sbjct: 662 LSIVDRRSEINLIHNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIHGTQLAEMP 721
Query: 1036 FIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQ 1095
FIGTRH+YRRQGMCRRL IESALCSL VE LIIPAISEL TWT FGF PLE S KQ
Sbjct: 722 FIGTRHIYRRQGMCRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFNPLEESHKQ 781
Query: 1096 KMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKGLRSSELE--NPRTADEVGKNSDEKY 1153
++R++++LVFPG DMLQK +++ + NM ++ G +S E + N T D K SD
Sbjct: 782 ELRSLNMLVFPGTDMLQKLLLEQETADGNMTASPGTKSVESKGNNCNTPDLENK-SDIDS 840
Query: 1154 SAGFDLNACINAAAPH 1169
S G DL+ ++ + H
Sbjct: 841 SNGHDLSIHNHSISQH 856
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/59 (54%), Positives = 41/59 (69%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQE 569
KQLLR+RI +L+ AGW I+YRPR R+Y DAVY+NP G +WSI AY L+ + E
Sbjct: 277 KQLLRERIRGMLVNAGWTIDYRPRRNRDYLDAVYINPTGTAYWSIIKAYDALQKQIDDE 335
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 597 bits (1540), Expect = e-167, Method: Compositional matrix adjust.
Identities = 305/635 (48%), Positives = 407/635 (64%), Gaps = 47/635 (7%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR+RI E+LL AGW I+YRPR R+Y DAVY++P G +WSI AY L
Sbjct: 368 KQRLRERIREMLLEAGWTIDYRPRRNRDYLDAVYISPRGTAYWSIIKAYEALLKQLNSGE 427
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKK--------GK----------- 611
+ TF+ I ++ LS L + KS+ +++ K++ GK
Sbjct: 428 KVAKPCDDSSTFSLISDEILSQLTRK-TKSKIEKDMKRELHSASDSDGKATFARNFLAIK 486
Query: 612 -NLGTDGEIVTKKKKK----KGKTNSAASPHGKSQKR--GIKGKPSVSEGGTSHNGMSIP 664
+G D V K+++ K + NS S G + K + + S G +SH
Sbjct: 487 NEVGNDDRYVHKQQRNVMSVKNEVNSRDSSQGTTSKSESPLHHQTEKSTGSSSH------ 540
Query: 665 ARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQY 724
R ++ + R LLVR SV G+ S DGFV K T+LAW+ID+GT+ L+EKV Y
Sbjct: 541 --RVDGGKSSKHGRSTLLVRRSVRGDNSESDGFVPSSEKRTVLAWLIDSGTLQLSEKVMY 598
Query: 725 WNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCI 784
NQR+TR ML+G I RDGI C CCS+I +SKF+ H+ SKL PFQN++ SG SLLQC
Sbjct: 599 MNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAGSKLRQPFQNIFLNSGVSLLQCQ 658
Query: 785 LDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFP 844
+D+W+KQ + GF V+ +DPNDD CGICGDGGDL+CCDGCPSTFHQ CLDI+ FP
Sbjct: 659 IDAWDKQKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRMFP 718
Query: 845 SGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC---SQTDGAVQ 901
G WHC C+C+FC + E D + C +CE+KYH+SC + A
Sbjct: 719 LGDWHCPNCTCKFCKAVIE--------DVTQTVGANTCKMCEKKYHKSCMPKANVTPADT 770
Query: 902 YEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECN 961
EP + SFCGKKC+ + E ++K +GVKH+LE G++WSLVHR ++DLSLS VE N
Sbjct: 771 TEPIT-SFCGKKCKALSEGVKKYVGVKHELEAGFSWSLVHRECTNSDLSLSGHPHIVENN 829
Query: 962 ARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISA 1021
++LA+AL+VMDECFLP+ D RSG+N++ N+LYN GSNF RLN+ GF+TA+LER DEI+++
Sbjct: 830 SKLALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNFNRLNFGGFYTALLERGDEIVAS 889
Query: 1022 ASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWT 1081
ASIR HG LAEMPFIGTRH+YR QGMCRRL + +ESAL L V+ LIIPA ++ W
Sbjct: 890 ASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESALQHLKVKLLIIPATADFSHVWI 949
Query: 1082 SVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
S FGF+ +E S K++MR+M+LL FPG+D+LQK ++
Sbjct: 950 SKFGFRQVEDSLKKEMRSMNLLTFPGIDVLQKELL 984
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 588 bits (1515), Expect = e-164, Method: Compositional matrix adjust.
Identities = 263/390 (67%), Positives = 320/390 (82%)
Query: 727 QRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILD 786
+RKTR +L+G+I DGI+CDCC E F IS F+ H+ SK C P +N++ E+G SLL C L+
Sbjct: 3 RRKTRTVLKGKITTDGIQCDCCGETFAISDFEAHAGSKSCQPLKNIFLENGPSLLHCQLE 62
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
SW++QDES RKGFHFV+ DG+DPNDDTCGICGDGG+LICCD CPSTFHQ+CL+IKK PSG
Sbjct: 63 SWHRQDESDRKGFHFVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSG 122
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSS 906
W+C YCSC+FCG C +++ D +A L C LCEEKYH SC + + SS
Sbjct: 123 VWNCTYCSCKFCGMAGGDACQMDENDAAARPALLTCCLCEEKYHHSCIPAEDTINDYHSS 182
Query: 907 LSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAV 966
LSFCGKKCQE+ ++L+ LLGVKH++E G+ W++V RFDV +D++LS + +KVECN+++AV
Sbjct: 183 LSFCGKKCQELHDKLQALLGVKHEMEEGFAWTVVRRFDVGSDITLSGMHRKVECNSKVAV 242
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
AL +MDECFLP+PDHRSG+NLI NI+YNFGSNF RLNY GF TAILER DE+ISAASIRI
Sbjct: 243 ALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNYCGFLTAILERGDEVISAASIRI 302
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
HG +LAEMPFIGTRHMYRRQGMCRRLL IE+ALCSLNVEKL+IPAISELRETWTSVFGF
Sbjct: 303 HGNQLAEMPFIGTRHMYRRQGMCRRLLGAIETALCSLNVEKLVIPAISELRETWTSVFGF 362
Query: 1087 QPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
+ LE SKQKMR M ++ FPGVDMLQKP++
Sbjct: 363 KQLEGLSKQKMRYMKMVAFPGVDMLQKPLL 392
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 585 bits (1509), Expect = e-164, Method: Compositional matrix adjust.
Identities = 271/390 (69%), Positives = 325/390 (83%)
Query: 727 QRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILD 786
+RKTR +L+G+I DGI+CDCC E F I F++H+ SK C P +N+ E+G SLLQC L+
Sbjct: 3 RRKTRTVLKGKITTDGIQCDCCGETFAILDFESHAGSKSCQPLKNICLENGHSLLQCQLE 62
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
SWNKQDES RKGFHFV+ D +DPNDDTCGICGDGG+LICCD CPSTFHQ+CL+IKKFPSG
Sbjct: 63 SWNKQDESDRKGFHFVDTDDQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKFPSG 122
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSS 906
W+C YCSC+FCG TC +++ D +A L C LCEEKYH SC + V SS
Sbjct: 123 VWNCTYCSCKFCGMAGGDTCQMDENDTAAQPALLACCLCEEKYHHSCILAENTVNDGYSS 182
Query: 907 LSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAV 966
+SFCGKKCQE++++L+ LLGVKH++E G+ W+LV RFDV +D+SLS + +KVECN+++AV
Sbjct: 183 VSFCGKKCQELYDKLQALLGVKHEMEEGFAWTLVRRFDVGSDISLSGMHRKVECNSKVAV 242
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
AL +MDECFLP+PDHRSG+NLI NI+YNFGSNF RLNY GF TAILER DEIISAASIRI
Sbjct: 243 ALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNYSGFLTAILERGDEIISAASIRI 302
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
HG LAEMPFIGTRHMYRRQGMCRRLL+ IE+ALCSLNVEKL+IPAISELRETWTSVFGF
Sbjct: 303 HGNHLAEMPFIGTRHMYRRQGMCRRLLSAIETALCSLNVEKLVIPAISELRETWTSVFGF 362
Query: 1087 QPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
+PLE SSKQKMRNM ++ FPG+DMLQKP++
Sbjct: 363 KPLEGSSKQKMRNMKMVAFPGIDMLQKPLL 392
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 293/665 (44%), Positives = 420/665 (63%), Gaps = 45/665 (6%)
Query: 499 THQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITL 557
+ Q T S KQ + D I +LL AGW I+ RPRNGR Y D+VY+ P GK ++WS+T
Sbjct: 663 SEQINTGSLREEKQKISDHIKGMLLDAGWTIDLRPRNGRNYLDSVYIPPSGKGSYWSVTK 722
Query: 558 AYSVLKNHYEQE--GGSSDTSKTGFTF-----TPIPEDELSILKK-VINKSRSDRNKKKK 609
AY+V E E G + D + + + + E+ LS LK+ V+NK R
Sbjct: 723 AYAVFLEGMESEKKGRAKDQRPSKKSVGSPGKSHVSEEILSKLKRIVVNKRR-------- 774
Query: 610 GKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHK 669
T E+ KK+K G K+ KR +G S N +S + H
Sbjct: 775 -----TKVELQKLKKRKHGLLKKQ-----KTSKRNSRG---------SKNKISNSRKLHL 815
Query: 670 LQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRK 729
E ++R CALL R S + S+ +GFV Y+ K T+L+W+ID + +N K++ ++
Sbjct: 816 GSERKKRGGCALLARGSNKDGGSSTNGFVPYEWKRTVLSWLIDLDIIDINAKLKCVDETH 875
Query: 730 TRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSW 788
++V+L+G RDGI C CCS++FT+ +F H+ + P++N+ + + LL C++++W
Sbjct: 876 SKVLLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAW 935
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
+KQ +S+R+ F ++ + +DPNDDTCGICGDGG+LICCDGCPSTFH +CL+++ PS W
Sbjct: 936 DKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDW 995
Query: 849 HCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEPSS 906
C CSC+FC + + DS+L T CS CEEKYH CS T+ + +
Sbjct: 996 RCAKCSCKFCQEHSRQDAQDIAEVDSSLCT---CSQCEEKYHPGCSPETTNTSNVSSQAC 1052
Query: 907 LSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAV 966
FC + C+ +FE L LL VK DLE ++ ++ R + ++ + ++VECN+++AV
Sbjct: 1053 DLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDERVECNSKIAV 1112
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
ALS+MDECFLP+ D R+GINLI N++YN GSNF R+++ GF+ +LER DEII+AAS+RI
Sbjct: 1113 ALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRI 1172
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
HG +LAEMPFIGTR+MYRRQGMCRRLL GIE L SLNVEKLIIPAI+EL +TWTS FGF
Sbjct: 1173 HGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGF 1232
Query: 1087 QPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAK--GLRSSELENPRTADE 1144
L+VS KQ++++ S+LVFPG +LQKP++K P EN S + G+ SELE+ +T++
Sbjct: 1233 SSLDVSEKQEVKSTSMLVFPGTGLLQKPLLKKTSPGENSSSQEVDGV-FSELESGKTSNV 1291
Query: 1145 VGKNS 1149
++S
Sbjct: 1292 ANEDS 1296
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 554 bits (1427), Expect = e-154, Method: Compositional matrix adjust.
Identities = 293/665 (44%), Positives = 420/665 (63%), Gaps = 45/665 (6%)
Query: 499 THQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITL 557
+ Q T S KQ + D I +LL AGW I+ RPRNGR Y D+VY+ P GK ++WS+T
Sbjct: 508 SEQINTGSLREEKQKISDHIKGMLLDAGWTIDLRPRNGRNYLDSVYIPPSGKGSYWSVTK 567
Query: 558 AYSVLKNHYEQE--GGSSDTSKTGFTF-----TPIPEDELSILKK-VINKSRSDRNKKKK 609
AY+V E E G + D + + + + E+ LS LK+ V+NK R
Sbjct: 568 AYAVFLEGMESEKKGRAKDQRPSKKSVGSPGKSHVSEEILSKLKRIVVNKRR-------- 619
Query: 610 GKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHK 669
T E+ KK+K G K+ KR +G S N +S + H
Sbjct: 620 -----TKVELQKLKKRKHGLLKKQ-----KTSKRNSRG---------SKNKISNSRKLHL 660
Query: 670 LQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRK 729
E ++R CALL R S + S+ +GFV Y+ K T+L+W+ID + +N K++ ++
Sbjct: 661 GSERKKRGGCALLARGSNKDGGSSTNGFVPYEWKRTVLSWLIDLDIIDINAKLKCVDETH 720
Query: 730 TRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSW 788
++V+L+G RDGI C CCS++FT+ +F H+ + P++N+ + + LL C++++W
Sbjct: 721 SKVLLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAW 780
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
+KQ +S+R+ F ++ + +DPNDDTCGICGDGG+LICCDGCPSTFH +CL+++ PS W
Sbjct: 781 DKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDW 840
Query: 849 HCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEPSS 906
C CSC+FC + + DS+L T CS CEEKYH CS T+ + +
Sbjct: 841 RCAKCSCKFCQEHSRQDAQDIAEVDSSLCT---CSQCEEKYHPGCSPETTNTSNVSSQAC 897
Query: 907 LSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAV 966
FC + C+ +FE L LL VK DLE ++ ++ R + ++ + ++VECN+++AV
Sbjct: 898 DLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDERVECNSKIAV 957
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
ALS+MDECFLP+ D R+GINLI N++YN GSNF R+++ GF+ +LER DEII+AAS+RI
Sbjct: 958 ALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRI 1017
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
HG +LAEMPFIGTR+MYRRQGMCRRLL GIE L SLNVEKLIIPAI+EL +TWTS FGF
Sbjct: 1018 HGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGF 1077
Query: 1087 QPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAK--GLRSSELENPRTADE 1144
L+VS KQ++++ S+LVFPG +LQKP++K P EN S + G+ SELE+ +T++
Sbjct: 1078 SSLDVSEKQEVKSTSMLVFPGTGLLQKPLLKKTSPGENSSSQEVDGV-FSELESGKTSNV 1136
Query: 1145 VGKNS 1149
++S
Sbjct: 1137 ANEDS 1141
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 281/642 (43%), Positives = 402/642 (62%), Gaps = 76/642 (11%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITLAYSVLKNHYEQE 569
KQ + D+I +LL AGW I+ RPRNGR Y D+VY++P GK ++WS+T AY V + E E
Sbjct: 731 KQKISDQIKAMLLDAGWTIDLRPRNGRNYMDSVYIHPSGKGSYWSVTKAYYVFRADMESE 790
Query: 570 ------------------GGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGK 611
G +S +G T T +D LS LK+V+ R+ + + ++
Sbjct: 791 QKESSKDHILSKKSVGSPGKRQVSSSSGCTLT---DDILSKLKRVVVNKRTTKVEIQR-- 845
Query: 612 NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQ 671
+ +KK+KK TNS R H
Sbjct: 846 --------LRQKKEKKNTTNST--------------------------------RLHLGN 865
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALLVR S + S DGFV Y+ K T+ +W+ID + +N K+ ++ ++
Sbjct: 866 ERKKRGGCALLVRGSNKESGSTTDGFVPYEWKRTIFSWLIDLNVLSVNTKLNCLDESHSK 925
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNK 790
V+L+G + RDGI C CCSE+ ++ +F TH+ S++ P++N+ + LL C++++WN
Sbjct: 926 VLLEGFVTRDGINCSCCSEVISVPEFVTHAGSEVNKPYRNILVDGLDIDLLHCLINAWNM 985
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
Q +++R+ F V+ +G+DPNDDTCGICGDGG+LICCDGCPSTFH +CL ++ P+ W C
Sbjct: 986 QSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCC 1045
Query: 851 VYCSCQFCGRINESTCHVNDQDDSAL--STLQICSLCEEKYHQSCSQTDGAVQYEPSSLS 908
CSC+FC + +D +D+A S+L CS CEE+ ++CS ++ SS +
Sbjct: 1046 SNCSCKFCHEHSS-----DDAEDTADVDSSLHTCSQCEEQCTEACSPDIDSIATNLSSQT 1100
Query: 909 ---FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLA 965
FC + C+ +FE L+ LL VK DLE Y+ +V R + + ++VECN+++A
Sbjct: 1101 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLALDKRVECNSKIA 1160
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
VALS+MDECFLP+ D R+GINLI N++YN GSNF RL+++GF+ ILER DEII+AAS+R
Sbjct: 1161 VALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVR 1220
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
IHG +LAEMPFIGTR+MYRRQGMCRRL+ GIE L SLN+EKLIIPAI+EL +TWTS FG
Sbjct: 1221 IHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFG 1280
Query: 1086 FQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
F PL+ S KQ+++++S+LVFPG +LQKP++K P E+ S
Sbjct: 1281 FSPLDDSEKQEVKSVSMLVFPGTGLLQKPLLKTS-PSEDQCS 1321
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 290/657 (44%), Positives = 411/657 (62%), Gaps = 48/657 (7%)
Query: 499 THQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITL 557
+ Q T S KQ + D I +LL AGW I+ RPRNGR Y D+VY+ P GK ++WS+T
Sbjct: 663 SEQINTGSLREEKQKISDHIKGMLLDAGWTIDLRPRNGRNYLDSVYIPPSGKGSYWSVTK 722
Query: 558 AYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDG 617
AY+V E E K G P + ++ K K + KK+K L
Sbjct: 723 AYAVFLEGMESE-------KKGRAKDQRPSKKSVVVNKRRTKVELQKLKKRKHGLL---- 771
Query: 618 EIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRK 677
KK+K K NS RG K K S S + H E ++R
Sbjct: 772 -----KKQKTSKRNS----------RGSKNKISNSR------------KLHLGSERKKRG 804
Query: 678 RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGR 737
CALL R S + S+ +GFV Y+ K T+L+W+ID + +N K++ ++ ++V+L+G
Sbjct: 805 GCALLARGSNKDGGSSTNGFVPYEWKRTVLSWLIDLDIIDINAKLKCVDETHSKVLLEGV 864
Query: 738 IARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNKQDESKR 796
RDGI C CCS++FT+ +F H+ + P++N+ + + LL C++++W+KQ +S+R
Sbjct: 865 TTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQSDSER 924
Query: 797 KGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQ 856
+ F ++ + +DPNDDTCGICGDGG+LICCDGCPSTFH +CL+++ PS W C CSC+
Sbjct: 925 QAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCK 984
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEPSSLSFCGKKC 914
FC + + DS+L T CS CEEKYH CS T+ + + FC + C
Sbjct: 985 FCQEHSRQDAQDIAEVDSSLCT---CSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSC 1041
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDEC 974
+ +FE L LL VK DLE ++ ++ R + ++ + ++VECN+++AVALS+MDEC
Sbjct: 1042 RLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDERVECNSKIAVALSLMDEC 1101
Query: 975 FLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEM 1034
FLP+ D R+GINLI N++YN GSNF R++++GF+ +LER DEII+AAS+RIHG +LAEM
Sbjct: 1102 FLPIVDQRTGINLIRNVVYNCGSNFVRMDFRGFYIFVLERGDEIIAAASVRIHGTKLAEM 1161
Query: 1035 PFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSK 1094
PFIGTR+MYRRQGMCRRLL GIE L SLNVEKLIIPAI+EL +TWTS FGF L+VS K
Sbjct: 1162 PFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEK 1221
Query: 1095 QKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAK--GLRSSELENPRTADEVGKNS 1149
Q++++ S+LVFPG +LQKP++K P EN S + G+ SELE+ +T++ ++S
Sbjct: 1222 QEVKSTSMLVFPGTGLLQKPLLKKTSPGENSSSQEVDGV-FSELESGKTSNVANEDS 1277
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 290/677 (42%), Positives = 413/677 (61%), Gaps = 69/677 (10%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR++I +LL GWKI+ R R ++Y D+VYV+P G +WSIT AY+V + ++ G
Sbjct: 464 KQDLRNKIKRVLLDNGWKIDLRQRKNKDYEDSVYVSPAGVGYWSITKAYAVFQEQFQNMG 523
Query: 571 GSS--DTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKG 628
SS + +K G + I +D+L++LKK I K R+++ EI + +KK
Sbjct: 524 RSSKLNNTKPGAS-DAISKDDLALLKKNIVKRRTNK-------------EICSAEKKP-- 567
Query: 629 KTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQR---KRCALLVRN 685
G S+ R K ++ G+ R+K Q + R +RC LLVR
Sbjct: 568 ---------GVSRNRSRSSKDILANRGS----------RNKHQNKEDRVKDRRCGLLVRG 608
Query: 686 SVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRC 745
S E N DG++ Y+ K T+ +WMID G V + +V+Y N +TR ML G+I R+GI C
Sbjct: 609 STHDAEDNMDGYIPYEWKRTVYSWMIDLGVVSEDMQVKYMNNNRTREMLAGKITREGIFC 668
Query: 746 DCCSEIFTISKFDTHSKSKLCHPFQNLYFESGS-SLLQCILDSWNKQDESKRKGFHFVNF 804
CCS+I T+ KF+ H+ SK P+ N++ E G SLLQC+LD+W K + KGF+ ++
Sbjct: 669 GCCSKILTVGKFELHAGSKEKKPYANIFLEGGKVSLLQCLLDAWEKHTLCENKGFYKID- 727
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINES 864
GED +DDTC ICGDGGDL+CCD C STFH +CL IK PSG W+C C C+FCG E
Sbjct: 728 KGEDEHDDTCAICGDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSCLCRFCGFPQE- 785
Query: 865 TCHVNDQDDSALSTLQICSLCEEKYHQSCSQ---TDGAVQYEPSSLS-FCGKKCQEIFER 920
+ S+ L C C KYHQ+CS TD +S+ FC C++I++R
Sbjct: 786 ------KPSSSPELLLSCLQCSRKYHQTCSSGTGTDFDCTIPGTSIDCFCSPGCRKIYKR 839
Query: 921 LEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPD 980
L KLLG+K+ +E G++WSLVH F + + + +CN+++A+A +V+DECF P D
Sbjct: 840 LNKLLGIKNHMEAGFSWSLVHCFPNDQAMPPKNKEKMAQCNSKIALAFTVLDECFQPHID 899
Query: 981 HRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTR 1040
RSGIN+IHN+ YN GS+F RL++ GF+ ILER DE+ISAAS+RIHG +LAEMPFIGTR
Sbjct: 900 ERSGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTR 959
Query: 1041 HMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNM 1100
MYR QGM RRLL GIESALCSLNV+KL++ A++E+ TWT+VFGF+P++ S KQ+++++
Sbjct: 960 GMYRHQGMLRRLLNGIESALCSLNVQKLVVSAVTEMENTWTTVFGFKPVQPSKKQRIKSL 1019
Query: 1101 SLLVFPGVDMLQKPMMKNQFPRENMISAKGLRSSELENPRTADEVGKNSDEKYSAGFDLN 1160
+LL+ G +L+K +++ Q + + +TA+ VG N K A
Sbjct: 1020 NLLIMNGTGLLEKRLLQTQTVDGQITT------------KTANTVGSN---KMDAKMLGE 1064
Query: 1161 ACINAAAPHVCKIHDKS 1177
A + HV +HD +
Sbjct: 1065 ASGSVTPVHVSSVHDAA 1081
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 293/665 (44%), Positives = 420/665 (63%), Gaps = 45/665 (6%)
Query: 499 THQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITL 557
+ Q T S KQ + D I +LL AGW I+ RPRNGR Y D+VY+ P GK ++WS+T
Sbjct: 116 SEQINTGSLREEKQKISDHIKGMLLDAGWTIDLRPRNGRNYLDSVYIPPSGKGSYWSVTK 175
Query: 558 AYSVLKNHYEQE--GGSSDTSKTGFTF-----TPIPEDELSILKK-VINKSRSDRNKKKK 609
AY+V E E G + D + + + + E+ LS LK+ V+NK R
Sbjct: 176 AYAVFLEGMESEKKGRAKDQRPSKKSVGSPGKSHVSEEILSKLKRIVVNKRR-------- 227
Query: 610 GKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHK 669
T E+ KK+K G K+ KR +G S N +S + H
Sbjct: 228 -----TKVELQKLKKRKHGLLKKQ-----KTSKRNSRG---------SKNKISNSRKLHL 268
Query: 670 LQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRK 729
E ++R CALL R S + S+ +GFV Y+ K T+L+W+ID + +N K++ ++
Sbjct: 269 GSERKKRGGCALLARGSNKDGGSSTNGFVPYEWKRTVLSWLIDLDIIDINAKLKCVDETH 328
Query: 730 TRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSW 788
++V+L+G RDGI C CCS++FT+ +F H+ + P++N+ + + LL C++++W
Sbjct: 329 SKVLLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAW 388
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
+KQ +S+R+ F ++ + +DPNDDTCGICGDGG+LICCDGCPSTFH +CL+++ PS W
Sbjct: 389 DKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDW 448
Query: 849 HCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEPSS 906
C CSC+FC + + DS+L T CS CEEKYH CS T+ + +
Sbjct: 449 RCAKCSCKFCQEHSRQDAQDIAEVDSSLCT---CSQCEEKYHPGCSPETTNTSNVSSQAC 505
Query: 907 LSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAV 966
FC + C+ +FE L LL VK DLE ++ ++ R + ++ + ++VECN+++AV
Sbjct: 506 DLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDERVECNSKIAV 565
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
ALS+MDECFLP+ D R+GINLI N++YN GSNF R+++ GF+ +LER DEII+AAS+RI
Sbjct: 566 ALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRI 625
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
HG +LAEMPFIGTR+MYRRQGMCRRLL GIE L SLNVEKLIIPAI+EL +TWTS FGF
Sbjct: 626 HGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGF 685
Query: 1087 QPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAK--GLRSSELENPRTADE 1144
L+VS KQ++++ S+LVFPG +LQKP++K P EN S + G+ SELE+ +T++
Sbjct: 686 SSLDVSEKQEVKSTSMLVFPGTGLLQKPLLKKTSPGENSSSQEVDGV-FSELESGKTSNV 744
Query: 1145 VGKNS 1149
++S
Sbjct: 745 ANEDS 749
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 545 bits (1403), Expect = e-151, Method: Compositional matrix adjust.
Identities = 278/639 (43%), Positives = 404/639 (63%), Gaps = 61/639 (9%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITLAYSV---LKNHY 566
K+ + ++I +LL AGWKI+ RPRNGR+Y D+VY+ P GK ++WSIT AY++ +++
Sbjct: 751 KRKISEQIKAILLDAGWKIDLRPRNGRDYLDSVYIPPTGKGSYWSITKAYAIYQSMQSEQ 810
Query: 567 EQEGGSSDTSKT---------GFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDG 617
+ E S + SK T + +PE+ LS LK+V+ NK+K T
Sbjct: 811 KDEAASENLSKKSPGSPGKIHASTVSSLPEEILSKLKRVV------VNKRK------TKI 858
Query: 618 EIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRK 677
E+ KK+K G ++ + G+ +++ K GG
Sbjct: 859 ELQKSKKRKHGLLKNSKNLKGRPKEKKKISKERKKRGG---------------------- 896
Query: 678 RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGR 737
CALL R S + E + +GF Y+ K T+ +W+ID + +N +++ ++ +++V+L+G
Sbjct: 897 -CALLARGSNQ-EAGSSNGFAPYEWKRTVFSWLIDLDVLSVNARLKCMDESRSKVLLEGL 954
Query: 738 IARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNKQDESKR 796
I RDGI C CCS++ T+ +F H+ +L P++N+ + + LL C++ +W+KQ +S+R
Sbjct: 955 ITRDGINCSCCSKVVTVLEFVAHAGGQLSKPYRNILVDGLDNDLLHCLIIAWDKQSDSER 1014
Query: 797 KGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQ 856
+ F V+ +G+DPNDDTCGICGDGG+LICCDGCPSTFH +CL++++ PS W C CSC+
Sbjct: 1015 QAFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRCTNCSCK 1074
Query: 857 FCGRINESTCHVNDQ--DDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLS--FCGK 912
C H+N D++ + L CS CE+KYH SCS + S FC +
Sbjct: 1075 LCHE------HLNHDAPDNAEIDPLHSCSQCEKKYHPSCSPETEKLSSVSSQAGNHFCQQ 1128
Query: 913 KCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMD 972
C+ +FE L+ LL V+ DL Y ++ ++ D+ +VECN+++AVALS+MD
Sbjct: 1129 SCRLLFEELQNLLAVEKDLGPEYACRIIQCIHEDAPETVLDLDGRVECNSKIAVALSLMD 1188
Query: 973 ECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELA 1032
ECFLP+ D R+GINLI N++YN GSNF RL+++GF+ ILER DEI+SAAS+RIHG +LA
Sbjct: 1189 ECFLPIVDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKLA 1248
Query: 1033 EMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVS 1092
EMPFIGTRHMYRRQGMCRRLL GIE L SL VEKLIIPAI+EL +TWTS FGF PLEVS
Sbjct: 1249 EMPFIGTRHMYRRQGMCRRLLDGIEMILSSLKVEKLIIPAINELVDTWTSKFGFSPLEVS 1308
Query: 1093 SKQKMRNMSLLVFPGVDMLQKPMMKNQF-PRENMISAKG 1130
KQ+++++++LVFPG +LQK ++ Q P+E+ S G
Sbjct: 1309 DKQEVKSINMLVFPGTGLLQKSLLNKQASPQEHPGSEGG 1347
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 545 bits (1403), Expect = e-151, Method: Compositional matrix adjust.
Identities = 286/641 (44%), Positives = 398/641 (62%), Gaps = 70/641 (10%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITLAYSVLKNHYEQE 569
KQ + D+I +LL AGW I+ RPRNGR Y D+VY+ P GK ++WS+T AY + E E
Sbjct: 721 KQKISDQIKAMLLDAGWTIDLRPRNGRNYMDSVYIPPSGKGSYWSVTKAYYAFRAGMESE 780
Query: 570 G--------------GSSD----TSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGK 611
GS D +S +G+T T ED LS LK+V+ R+
Sbjct: 781 QQESPKDQSLSTKSVGSPDKRQVSSSSGYTLT---EDILSKLKRVVVNKRT--------- 828
Query: 612 NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQ 671
T EI ++ ++ + S R + H
Sbjct: 829 ---TKLEI---QRLRQKRFKKEKKKKNVSNSRSL----------------------HLGN 860
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + S+ DGFV Y+ K T+ +W+ID + +N K++ + ++
Sbjct: 861 ERKKRGGCALLARGSNKESGSSTDGFVPYEWKRTIFSWLIDLDVLSVNTKLKCMDGSHSK 920
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNK 790
V+L+G RDGI C CC+E++++ +F TH+ S++ P++N+ + LL C++++WN
Sbjct: 921 VLLEGFATRDGINCSCCNEVYSVLEFVTHAGSEVNKPYRNILVDGLDIDLLHCLINAWNM 980
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
Q + +R+ F V+ +G+DPNDDTCGICGDGG+LICCDGCPSTFH +CL ++ PS W C
Sbjct: 981 QSDVERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEVLPSDYWCC 1040
Query: 851 VYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS-QTDGAVQYEPSSLS- 908
CSC+FC S+ D D S L CS CEE+YH++CS +TD + +
Sbjct: 1041 ANCSCKFCHE--HSSDGAEDTADVDYS-LHTCSQCEEQYHEACSPETDSITNLSSQTGNL 1097
Query: 909 FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF--DVSTDLSLSDVCQKVECNARLAV 966
FC + C+ +FE L+ LL VK DLE Y+ +V R DV ++ D ++VECN+R+AV
Sbjct: 1098 FCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLALD--KRVECNSRIAV 1155
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
ALS+MDECFLP+ D R+GINLI N++Y+ GSNF RL+++GF+ ILER DEII+AAS+RI
Sbjct: 1156 ALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRI 1215
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
HG +LAEMPFIGTR+MYRRQGMCRRL+ GIE L SLNVEKLIIPAI+EL +TWTS FGF
Sbjct: 1216 HGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGF 1275
Query: 1087 QPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
PLE S KQ+++++S+LVFPG +LQKP++K P E S
Sbjct: 1276 SPLEDSEKQEVKSISMLVFPGTGLLQKPLLK-ALPNEEQCS 1315
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 544 bits (1401), Expect = e-151, Method: Compositional matrix adjust.
Identities = 281/634 (44%), Positives = 398/634 (62%), Gaps = 59/634 (9%)
Query: 499 THQEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITL 557
+ Q T+ + K+ + + I +LL AGWKI+ RPRNGR Y D+VY+ P GK ++WS+T
Sbjct: 690 SEQVNTVGRFEEKRKISEHIKGILLDAGWKIDLRPRNGRNYLDSVYIPPSGKGSYWSVTK 749
Query: 558 AYSVLKNHYEQEGGSSDTSKTGFTFTP-----IPEDELSILKK-VINKSRSDRNKKKKGK 611
AYSV +++G S +S +P +PE++LS LK+ V+NK R
Sbjct: 750 AYSVFMQSQQKDGAKSQSSPKKSACSPDKSSSLPEEDLSKLKRMVLNKRR---------- 799
Query: 612 NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQ 671
T+ E+ +KKK G S K + N +S +
Sbjct: 800 ---TEPELQKLRKKKHGLLKKFKSSKSKPK--------------ERKNKIS--------K 834
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + S+ DGF Y+ K ++ +W+ID + +N ++ ++ ++
Sbjct: 835 ERKKRGGCALLARGSNQEAGSSTDGFAPYEWKRSVFSWLIDLDILSVNAGLKCMDESCSK 894
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNK 790
V+L+G + RDGI+C CCS+ F + +F H+ ++ P++N+ + LL C++ +W+K
Sbjct: 895 VLLEGLVTRDGIQCRCCSKDFALLEFVAHAGGQVSKPYRNVLVDGLDKDLLHCLISAWDK 954
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
Q +S+R+ F V+ +G+DPNDDTCGICGDGG+LICCDGCPSTFH +CL++++ PS W C
Sbjct: 955 QSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRC 1014
Query: 851 VYCSCQFCGRINESTCHVNDQ--DDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLS 908
C C+FC H ND D + + +L CS CEE YH CS PS
Sbjct: 1015 ANCCCKFCQE------HSNDDAPDIAEVDSLCTCSQCEENYHPVCSPETENPSSVPSQAG 1068
Query: 909 --FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLV---HRFDVSTDLSLSDVCQKVECNAR 963
FC + C+ +FE L+ LL VK DLE + ++ H T L+L ++VECN++
Sbjct: 1069 DLFCQQSCRLLFEELQNLLAVKKDLEPEFACRIIKCIHEDVPETALALD---ERVECNSK 1125
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
+AVALS+MDECFLP+ D R+GINLI N++YN GSNF RL+++GF+ ILER DEI+SAAS
Sbjct: 1126 IAVALSLMDECFLPIIDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAAS 1185
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
+RIHG + AEMPFIGTR+MYRRQGMCRRLL GIE L SL V+KLIIPAISEL +TWTS
Sbjct: 1186 VRIHGTKCAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLKVQKLIIPAISELVDTWTSK 1245
Query: 1084 FGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
FGF PLEVS KQ+++++S+LVFPG +LQKP++K
Sbjct: 1246 FGFSPLEVSEKQEVKSISMLVFPGTGLLQKPLLK 1279
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 543 bits (1400), Expect = e-151, Method: Compositional matrix adjust.
Identities = 281/634 (44%), Positives = 390/634 (61%), Gaps = 62/634 (9%)
Query: 497 LETH---QEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHW 553
LE H +E ++ KQ LR +I ++LL GWKI+ R R ++Y D+VYV+P+G +W
Sbjct: 447 LEKHGIKKERKGPRYTMKQKLRGQIKDILLNNGWKIDLRRRKNKDYEDSVYVSPQGNGYW 506
Query: 554 SITLAYSVLKNHYE--QEGGSSDTSKTGFT---FTPIPEDELSILKKVINKSRSDRNKKK 608
SIT AY+V + + + G S K G I E++L++L++ + K R+
Sbjct: 507 SITKAYAVFQEQSKSGKHTGKSSKHKAGVADAACNAISENDLAMLQRNVVKRRTK----- 561
Query: 609 KGKNLGTDGEIVTKKKKKKGKTNSAASPHG-KSQKRGIKGKPSVSEGGTSHNGMSIPARR 667
K LG + + K N+A G K Q G++G
Sbjct: 562 --KELGASKKKYEDSSSRNSKDNNAGRSSGNKHQSSGVRG-------------------- 599
Query: 668 HKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQ 727
CALLVR S E N DG+ Y K T+L+WMID G V + KV+Y N+
Sbjct: 600 -----------CALLVRGSTHSMEGNVDGYFPYRWKRTVLSWMIDMGVVSEDAKVKYMNK 648
Query: 728 RKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFE-SGSSLLQCILD 786
+ TR L+GRI RDGI C CCS+I T++KF+ H+ SK P++N++ E G++L QC++D
Sbjct: 649 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENIFLEDGGATLSQCLVD 708
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
+W KQ +S++KGF+ V+ + +DDTCGICGDGGDL+CCD CPSTFH CL IK PSG
Sbjct: 709 AWKKQSQSEKKGFYKVDPGDDP-DDDTCGICGDGGDLLCCDNCPSTFHLACLGIK-MPSG 766
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEP 904
WHC C C+FCG E T ++ + L C C KYHQ C+ +V+ E
Sbjct: 767 DWHCSSCICRFCGSTQEIT--------TSSAELLSCLQCSRKYHQVCAPGTMKDSVKAES 818
Query: 905 SSLS--FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNA 962
+S + FC C++I++ L KLLGVK+ +E G++WSLV F + CN+
Sbjct: 819 NSSTDCFCSPGCRKIYKHLRKLLGVKNAIEAGFSWSLVRCFPDKLAAPPKGKAHLIHCNS 878
Query: 963 RLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAA 1022
+ AVA SVMDECFLP D RSGIN+IHN++YN GS+F RLN+ F+T ILER DE+ISAA
Sbjct: 879 KTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAA 938
Query: 1023 SIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS 1082
++RIHG +LAEMPFIGTR +YRRQGMC RLL IESAL SLNV +L+IPAI EL+ TWT+
Sbjct: 939 AVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTT 998
Query: 1083 VFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
VFGF+P+E S +QK++++++L+ G +L+K ++
Sbjct: 999 VFGFKPVEPSKRQKIKSLNILIIHGTGLLEKRLL 1032
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 543 bits (1399), Expect = e-151, Method: Compositional matrix adjust.
Identities = 251/392 (64%), Positives = 307/392 (78%), Gaps = 5/392 (1%)
Query: 726 NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCIL 785
N+R+T+VML+G + RDGI C CCS+I T+SKF+ H+ SKL PFQN+Y ESG+SLL C +
Sbjct: 2 NRRRTKVMLEGWVTRDGIHCGCCSKILTVSKFEIHAGSKLRQPFQNIYLESGASLLDCQI 61
Query: 786 DSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPS 845
++WN+Q+ KR GF V+ DG DPNDDTCG+CGDGGDLICCDGCPSTFHQ+CLDIK P
Sbjct: 62 EAWNRQEPVKRLGFQAVDVDGNDPNDDTCGLCGDGGDLICCDGCPSTFHQSCLDIKMLPP 121
Query: 846 GKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEP- 904
G WHC CSC+FCG ++ +DD+ +S L CSLC +KYH+SC Q + +
Sbjct: 122 GDWHCPNCSCKFCGVASDKN---FQRDDTTVSKLLTCSLCVKKYHKSCMQEINTLSIDTN 178
Query: 905 -SSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNAR 963
S SFCGKKC+E+FE+L+K LGVKH+LE G++WSL+HR D +D SL + Q+VECN++
Sbjct: 179 NSVASFCGKKCRELFEQLQKYLGVKHELEAGFSWSLIHRTDADSDTSLQGLPQRVECNSK 238
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
LAV+LSVMDECFLP+ D RSGINLI N+LYN GSNF RLN+ GF+ ILER DEIISAAS
Sbjct: 239 LAVSLSVMDECFLPIVDRRSGINLIQNVLYNCGSNFNRLNFGGFYALILERGDEIISAAS 298
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
IR HG LAEMPFIGTRHMYRRQGMCRRL IES LCSL VEKLIIPAISEL TWT V
Sbjct: 299 IRFHGTRLAEMPFIGTRHMYRRQGMCRRLFYAIESTLCSLKVEKLIIPAISELMHTWTEV 358
Query: 1084 FGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
FGF L+ S KQ++++M++LVFPG+DMLQK +
Sbjct: 359 FGFTTLDESLKQELKSMNMLVFPGIDMLQKQL 390
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 540 bits (1392), Expect = e-150, Method: Compositional matrix adjust.
Identities = 275/632 (43%), Positives = 392/632 (62%), Gaps = 73/632 (11%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITLAYSVLKNHYEQE 569
KQ + D+I +LL AGW I+ RPRNGR+Y D+VY+ P GK ++WS+T AY E E
Sbjct: 724 KQKISDQIKAMLLDAGWTIDLRPRNGRDYMDSVYIPPSGKGSYWSVTKAYYAFCASMESE 783
Query: 570 ------------------GGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGK 611
G +S G T T ED LS LK+V+ R+ +
Sbjct: 784 QKESSKDQILSKKSVGSPGKKQVSSSPGCTLT---EDILSKLKRVVVNKRTTK------- 833
Query: 612 NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQ 671
V ++ K+ + + S+ R H
Sbjct: 834 --------VAIQRLKQKRFKKEKKKNANSR------------------------RLHLGN 861
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALLVR S + S DGFV Y+ K T+ +W+ID + +N K++ ++ ++
Sbjct: 862 ERKKRGGCALLVRGSNKESGSGTDGFVPYEWKRTIFSWLIDLDVLSVNTKLKCMDESNSK 921
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGS-SLLQCILDSWNK 790
V+L+G + RDGI C CCS++ ++ +F H+ S++ P++N+ + LL C++++WN
Sbjct: 922 VLLEGIVTRDGIDCSCCSKVLSVLEFVAHAGSEVNTPYRNILVDGQDIDLLHCLINAWNM 981
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
Q +++++ F V+ +G+DPNDDTCGICGDGG+LICCDGCPSTFH +CL +++ PS W C
Sbjct: 982 QSDAEKQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEELPSDYWCC 1041
Query: 851 VYCSCQFCGRINESTCHVND-QDDSAL--STLQICSLCEEKYHQSCSQTDGAVQYEPSSL 907
CSC+FC H ND +D+A S+L CS CEE+YH++CS + ++ S
Sbjct: 1042 ANCSCKFCHE------HSNDGAEDTADVDSSLHTCSQCEEQYHEACSPENDSITNLSSQT 1095
Query: 908 S--FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLA 965
FC + C+ +FE L+ LL VK DLE Y+ +V R + + +VECN+++A
Sbjct: 1096 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLPLDTRVECNSKIA 1155
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
VALS+MDECFLP+ D R+GINLI N++Y+ GSNF RL+++GF+ ILER DEII+AAS+R
Sbjct: 1156 VALSLMDECFLPIVDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVR 1215
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
IHG +LAEMPFIGTR+MYRRQGMCRRL+ GIE L SLNVEKLIIPAI+EL +TWTS FG
Sbjct: 1216 IHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSRFG 1275
Query: 1086 FQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
F PLE S K++++++S+LVFPG +LQKP++K
Sbjct: 1276 FSPLEDSEKEEVKSISMLVFPGTGLLQKPLLK 1307
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 282/634 (44%), Positives = 391/634 (61%), Gaps = 62/634 (9%)
Query: 497 LETH---QEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHW 553
LE H +E ++ KQ LR +I ++LL GWKI+ R R ++Y D+VYV+P+G +W
Sbjct: 24 LEKHGIKKERKGPRYTMKQKLRGQIKDILLNNGWKIDLRRRKNKDYEDSVYVSPQGNGYW 83
Query: 554 SITLAYSVLKNHYE--QEGGSSDTSKTGFT---FTPIPEDELSILKKVINKSRSDRNKKK 608
SIT AY+V + + + G S K G I E++L++L++ + K R+
Sbjct: 84 SITKAYAVFQEQSKSGKRTGKSSKHKAGVADAACNAISENDLAMLQRNVVKRRTK----- 138
Query: 609 KGKNLGTDGEIVTKKKKKKGKTNSAASPHG-KSQKRGIKGKPSVSEGGTSHNGMSIPARR 667
K LG + + K N+A G K Q G++G
Sbjct: 139 --KELGASKKKYEDSSSRNSKDNNAGRSSGNKHQSSGVRG-------------------- 176
Query: 668 HKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQ 727
CALLVR S E N DG+V Y K T+L+WMID G V + KV+Y N+
Sbjct: 177 -----------CALLVRGSTHSMEGNVDGYVPYRWKRTVLSWMIDMGVVSEDAKVKYMNK 225
Query: 728 RKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFE-SGSSLLQCILD 786
+ TR L+GRI RDGI C CCS+I T++KF+ H+ SK P++N++ E G++L QC++D
Sbjct: 226 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENIFLEDGGATLSQCLVD 285
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
+W KQ +S++KGF+ V+ + +DDTCGICGDGGDL+CCD CPSTFH CL IK PSG
Sbjct: 286 AWKKQSQSEKKGFYKVDPGDDP-DDDTCGICGDGGDLLCCDNCPSTFHLACLGIK-MPSG 343
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEP 904
WHC C C+FCG E T ++ + L C C KYHQ C+ +V+ E
Sbjct: 344 DWHCRSCICRFCGSTQEIT--------TSSAELLSCLQCSRKYHQVCAPGTMKDSVKAES 395
Query: 905 SSLS--FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNA 962
+S + FC C++I++ L KLLGVK+ +E G++WSLV F + CN+
Sbjct: 396 NSSTDCFCSPGCRKIYKHLRKLLGVKNAIEAGFSWSLVRCFPDKLAAPPKGKAHLIHCNS 455
Query: 963 RLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAA 1022
+ AVA SVMDECFLP D RSGIN+IHN++YN GS+F RLN+ F+T ILER DE+ISAA
Sbjct: 456 KTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAA 515
Query: 1023 SIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS 1082
++RIHG +LAEMPFIGTR +YRRQGMC RLL IESAL SLNV +L+IPAI EL+ TWT+
Sbjct: 516 AVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTT 575
Query: 1083 VFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
VFGF+P+E S +QK++++++L+ G +L+K ++
Sbjct: 576 VFGFKPVEPSKRQKIKSLNILIIHGTGLLEKRLL 609
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 538 bits (1385), Expect = e-149, Method: Compositional matrix adjust.
Identities = 279/629 (44%), Positives = 386/629 (61%), Gaps = 71/629 (11%)
Query: 497 LETH---QEETLSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHW 553
LE H +E ++ KQ LR +I ++LL GWKI+ R R ++Y D+VYV+P+G +W
Sbjct: 465 LEKHGIKKERKGPRYTMKQKLRGQIKDILLNNGWKIDLRRRKNKDYEDSVYVSPQGNGYW 524
Query: 554 SITLAYSVLKNHYEQEGGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNL 613
SIT AY+V + EQ T E++L++L++ + K R+ K L
Sbjct: 525 SITKAYAVFQ---EQSKSGKHT-----------ENDLAMLQRNVVKRRTK-------KEL 563
Query: 614 GTDGEIVTKKKKKKGKTNSAASPHG-KSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQE 672
G + + K N+A G K Q G++G
Sbjct: 564 GASKKKYEDSSSRNSKDNNAGRSSGNKHQSSGVRG------------------------- 598
Query: 673 TQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRV 732
CALLVR S E N DG+ Y K T+L+WMID G V + KV+Y N++ TR
Sbjct: 599 ------CALLVRGSTHSMEGNVDGYFPYRWKRTVLSWMIDMGVVSEDAKVKYMNKKGTRA 652
Query: 733 MLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFE-SGSSLLQCILDSWNKQ 791
L+GRI RDGI C CCS+I T++KF+ H+ SK P++N++ E G++L QC++D+W KQ
Sbjct: 653 RLEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENIFLEDGGATLSQCLVDAWKKQ 712
Query: 792 DESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCV 851
+S++KGF+ V+ + +DDTCGICGDGGDL+CCD CPSTFH CL IK PSG WHC
Sbjct: 713 SQSEKKGFYKVDPGDDP-DDDTCGICGDGGDLLCCDNCPSTFHLACLGIK-MPSGDWHCS 770
Query: 852 YCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEPSSLS- 908
C C+FCG E T ++ + L C C KYHQ C+ +V+ E +S +
Sbjct: 771 SCICRFCGSTQEIT--------TSSAELLSCLQCSRKYHQVCAPGTMKDSVKAESNSSTD 822
Query: 909 -FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVA 967
FC C++I++ L KLLGVK+ +E G++WSLV F + CN++ AVA
Sbjct: 823 CFCSPGCRKIYKHLRKLLGVKNAIEAGFSWSLVRCFPDKLAAPPKGKAHLIHCNSKTAVA 882
Query: 968 LSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIH 1027
SVMDECFLP D RSGIN+IHN++YN GS+F RLN+ F+T ILER DE+ISAA++RIH
Sbjct: 883 FSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIH 942
Query: 1028 GKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQ 1087
G +LAEMPFIGTR +YRRQGMC RLL IESAL SLNV +L+IPAI EL+ TWT+VFGF+
Sbjct: 943 GTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFK 1002
Query: 1088 PLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
P+E S +QK++++++L+ G +L+K ++
Sbjct: 1003 PVEPSKRQKIKSLNILIIHGTGLLEKRLL 1031
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 286/643 (44%), Positives = 411/643 (63%), Gaps = 45/643 (6%)
Query: 521 LLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITLAYSVLKNHYEQE--GGSSDTSK 577
+LL AGW I+ RPRNGR Y D+VY+ P GK ++WS+T AY+V E E G + D
Sbjct: 1 MLLDAGWTIDLRPRNGRNYLDSVYIPPSGKGSYWSVTKAYAVFLEGMESEKKGRAKDQRP 60
Query: 578 TGFTF-----TPIPEDELSILKK-VINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTN 631
+ + + + E+ LS LK+ V+NK R T E+ KK+K G
Sbjct: 61 SKKSVGSPGKSHVSEEILSKLKRIVVNKRR-------------TKVELQKLKKRKHGLLK 107
Query: 632 SAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEE 691
K+ KR +G S N +S + H E ++R CALL R S +
Sbjct: 108 KQ-----KTSKRNSRG---------SKNKISNSRKLHLGSERKKRGGCALLARGSNKDGG 153
Query: 692 SNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEI 751
S+ +GFV Y+ K T+L+W+ID + +N K++ ++ ++V+L+G RDGI C CCS++
Sbjct: 154 SSTNGFVPYEWKRTVLSWLIDLDIIDINAKLKCVDETHSKVLLEGVTTRDGINCRCCSKV 213
Query: 752 FTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPN 810
FT+ +F H+ + P++N+ + + LL C++++W+KQ +S+R+ F ++ + +DPN
Sbjct: 214 FTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPN 273
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVND 870
DDTCGICGDGG+LICCDGCPSTFH +CL+++ PS W C CSC+FC +
Sbjct: 274 DDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQDAQDIA 333
Query: 871 QDDSALSTLQICSLCEEKYHQSCS--QTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVK 928
+ DS+L T CS CEEKYH CS T+ + + FC + C+ +FE L LL VK
Sbjct: 334 EVDSSLCT---CSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVK 390
Query: 929 HDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLI 988
DLE ++ ++ R + ++ + ++VECN+++AVALS+MDECFLP+ D R+GINLI
Sbjct: 391 KDLEPEFSCRIIQRIHENVPETVVALDERVECNSKIAVALSLMDECFLPIVDQRTGINLI 450
Query: 989 HNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGM 1048
N++YN GSNF R+++ GF+ +LER DEII+AAS+RIHG +LAEMPFIGTR+MYRRQGM
Sbjct: 451 RNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGM 510
Query: 1049 CRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGV 1108
CRRLL GIE L SLNVEKLIIPAI+EL +TWTS FGF L+VS KQ++++ S+LVFPG
Sbjct: 511 CRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEVKSTSMLVFPGT 570
Query: 1109 DMLQKPMMKNQFPRENMISAK--GLRSSELENPRTADEVGKNS 1149
+LQKP++K P EN S + G+ SELE+ +T++ ++S
Sbjct: 571 GLLQKPLLKKTSPGENSSSQEVDGV-FSELESGKTSNVANEDS 612
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 272/620 (43%), Positives = 389/620 (62%), Gaps = 57/620 (9%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LRD+I +LL GW I+ RPR ++Y D+VYV+P+G +WSIT AY+V + ++
Sbjct: 573 KQKLRDQIKGILLDNGWTIDLRPRRSKDYDDSVYVSPKGTGYWSITKAYAVFQKQFKSSH 632
Query: 571 GSSDTSKTG---------FTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVT 621
T+++ +F I +D+L++L++ +
Sbjct: 633 DEKTTARSSKLNNRGAADASFDAISQDDLAMLQRNV------------------------ 668
Query: 622 KKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCAL 681
+K+K K + A + G ++ R +K S G +S N + K++ +R AL
Sbjct: 669 -RKRKSKKEHFADNKCGDNRSRNMK---DASAGRSSRNKYQNNKEKVKIKH----RRSAL 720
Query: 682 LVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARD 741
LVR S E + D ++ Y K T+L+WMID G V + KV+Y N++ TR L GR+ R+
Sbjct: 721 LVRGSTRSME-DMDDYIPYKWKRTVLSWMIDLGVVSKDAKVKYMNKKGTRARLVGRVTRE 779
Query: 742 GIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGS-SLLQCILDSWNKQDESKRKGFH 800
GI CDCCS+I T++KF+ H+ SK P+ N++ E G SL QC+L++W+KQ ++++KGF+
Sbjct: 780 GICCDCCSKILTVAKFELHAGSKEQQPYANIFLEDGGLSLFQCLLNAWDKQAQNEKKGFY 839
Query: 801 FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGR 860
++ + +DDTCGICGDGGDL+CCD C STFH CL I+ PSG W C C C+FCG
Sbjct: 840 KIDPADDP-DDDTCGICGDGGDLLCCDRCTSTFHVACLGIE-MPSGDWFCRNCICKFCGS 897
Query: 861 INESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQ--TDGAVQYEPSSL--SFCGKKCQE 916
E T S+ + L C C KYHQ C+Q V PS+ FC C +
Sbjct: 898 AEERT--------SSPAELLSCLQCSRKYHQVCAQGIEREFVSTTPSASIDCFCSPGCTK 949
Query: 917 IFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFL 976
I++RL++LLG+K+DLE G++WSLV F + S Q V CN++ A+A SV+DECFL
Sbjct: 950 IYKRLKRLLGLKNDLEAGFSWSLVRCFADTEATSTKKKAQLVHCNSKTALAFSVLDECFL 1009
Query: 977 PLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPF 1036
P D RSGIN+IHN++YN GS+F RLN+ GF+T ILER DE+ISAA++RIHG + AEMPF
Sbjct: 1010 PRIDERSGINIIHNVVYNCGSDFSRLNFSGFYTFILERGDEVISAATVRIHGTDFAEMPF 1069
Query: 1037 IGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQK 1096
IGTR MYR QGMC RLL IESALCSLNV +L+IPAI EL+ TW++VFGF+P+ + KQK
Sbjct: 1070 IGTRGMYRHQGMCHRLLDAIESALCSLNVRRLVIPAIPELQNTWSTVFGFKPVGPTKKQK 1129
Query: 1097 MRNMSLLVFPGVDMLQKPMM 1116
+++++LL+ G +L+K ++
Sbjct: 1130 IKSVNLLIIHGTGLLEKRLL 1149
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 241/424 (56%), Positives = 316/424 (74%), Gaps = 10/424 (2%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
TLL+W+I++G V L +KVQY +R +VML+G I R+GI CDCCS+I T+S+F+ H+ SK
Sbjct: 1 TLLSWLIESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSK 60
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
C PFQN+Y ESG+SLLQC + +WN Q ++ H V+ DG+DPNDD CGICGDGGDLI
Sbjct: 61 SCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLI 120
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCDGCPST+HQNCL ++ PSG WHC C+C+FC ++ +D + +S L C +
Sbjct: 121 CCDGCPSTYHQNCLGMQVLPSGDWHCPNCTCKFC----DAAVASGGKDGNFISLLS-CGM 175
Query: 885 CEEKYHQSCSQTDG-AVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF 943
CE +YHQ C + VQ S+ SFCG KC E+FE+L+K LGVK ++EGGY+WSL+HR
Sbjct: 176 CERRYHQLCLNDEAHKVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRV 235
Query: 944 DVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLN 1003
D +D + Q++E N++LAV L++MDECFLP+ D RSG++LI N+LYN GSNF R+N
Sbjct: 236 DTDSDTNSQMSAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRIN 295
Query: 1004 YKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSL 1063
Y GF+TAILER DEIISAAS+R HG +LAEMPFIGTRH+YRRQGMCRRL IESA+ SL
Sbjct: 296 YTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSL 355
Query: 1064 NVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRE 1123
VEKL+IPAI + WT FGF PL+ S +++MR+++ LVFPG+DMLQKP++ E
Sbjct: 356 KVEKLVIPAIPDFLHAWTGNFGFTPLDDSVRKEMRSLNTLVFPGIDMLQKPLLH----EE 411
Query: 1124 NMIS 1127
N+I+
Sbjct: 412 NIIA 415
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 484 bits (1246), Expect = e-133, Method: Compositional matrix adjust.
Identities = 229/402 (56%), Positives = 299/402 (74%), Gaps = 10/402 (2%)
Query: 727 QRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILD 786
+R +VML+G I R+GI CDCCS+I T+S+F+ H+ SK C PFQN+Y ESG+SLLQC +
Sbjct: 3 RRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSCQPFQNIYLESGASLLQCQVR 62
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
+WN Q ++ H V+ DG+DPNDD CGICGDGGDLICCDGCPST+HQNCL ++ PSG
Sbjct: 63 AWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSG 122
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDG-AVQYEPS 905
WHC C+C+FC ++ +D + +S L C +CE +YHQ C + VQ S
Sbjct: 123 DWHCPNCTCKFC----DAAVASGGKDGNFISLLS-CGMCERRYHQLCLNDEAHKVQSFGS 177
Query: 906 SLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLA 965
+ SFCG KC E+FE+L+K LGVK ++EGGY+WSL+HR D +D + Q++E N++LA
Sbjct: 178 ASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQMSAQRIENNSKLA 237
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
V L++MDECFLP+ D RSG++LI N+LYN GSNF R+NY GF+TAILER DEIISAAS+R
Sbjct: 238 VGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLR 297
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
HG +LAEMPFIGTRH+YRRQGMCRRL IESA+ SL VEKL+IPAI + WT FG
Sbjct: 298 FHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFG 357
Query: 1086 FQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
F PL+ S +++MR+++ LVFPG+DMLQKP++ EN+I+
Sbjct: 358 FTPLDDSVRKEMRSLNTLVFPGIDMLQKPLLH----EENIIA 395
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 253/568 (44%), Positives = 349/568 (61%), Gaps = 60/568 (10%)
Query: 505 LSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKN 564
L+KH KQ LR++I +LL GWKI+ R R ++Y D+VYV+P G +WSIT AY+V +
Sbjct: 480 LNKH--KQDLRNKIKHVLLDNGWKIDLRQRKNKDYEDSVYVSPTGTGYWSITKAYAVFQE 537
Query: 565 HYEQEGGSS--DTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTK 622
++ G SS + +K G + I +D+L++LKK I K R+++ EI
Sbjct: 538 QFQNMGRSSKLNNTKPGAS-DAISKDDLALLKKNIVKRRTNK-------------EIC-- 581
Query: 623 KKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQR-----K 677
G + KP VS +S + ++ R+K Q + R +
Sbjct: 582 ---------------------GAEKKPGVSRNRSSKDILANRDFRNKHQNKEDRVKIKDR 620
Query: 678 RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGR 737
RC LLVR S E N DG++ Y+ K TL +WMID G + + V+Y N +TR ML G+
Sbjct: 621 RCGLLVRGSTHDTEDNMDGYIPYEWKRTLCSWMIDLGVISEDMHVKYMNSNRTREMLAGK 680
Query: 738 IARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGS-SLLQCILDSWNKQDESKR 796
I R+GI C CCS+I TI+KF+ H+ SK P+ N++ E G SLLQC+LD+W K +S+
Sbjct: 681 ITREGIFCGCCSKILTIAKFELHAGSKEKKPYANIFLEGGKVSLLQCLLDAWEKHTQSEN 740
Query: 797 KGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQ 856
KGF+ ++ G+D +DDTC ICGDGGDL+CCD C STFH +CL IK PSG W+C C C+
Sbjct: 741 KGFYKID-KGDDEHDDTCAICGDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSCLCR 798
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQ---TDGAVQYEPSSLS-FCGK 912
FCG E + S+ L C C KYHQ+CS TD +S+ FC
Sbjct: 799 FCGFPQE-------KPSSSPELLLSCLQCSRKYHQTCSSGTGTDSGCTMPGTSIDCFCSP 851
Query: 913 KCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMD 972
C++I++RL KLLG+K+ +E G++WSLVH F + + + +CN++ A+A +V+D
Sbjct: 852 GCRKIYKRLNKLLGIKNHMEAGFSWSLVHCFANDQAMPNKNKEKLAQCNSKTALAFTVLD 911
Query: 973 ECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELA 1032
ECF P D RSGIN+IHN+ YN GS+F RL++ GF+ ILER DE+I+AAS+RIHG +LA
Sbjct: 912 ECFQPHIDDRSGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVIAAASVRIHGTDLA 971
Query: 1033 EMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
EMPFIGTR MYR QGMCRRLL GIES +
Sbjct: 972 EMPFIGTRGMYRHQGMCRRLLNGIESVI 999
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 457 bits (1177), Expect = e-125, Method: Compositional matrix adjust.
Identities = 249/573 (43%), Positives = 333/573 (58%), Gaps = 67/573 (11%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR+RI E+LL AGW I+YRPR R+Y DAVY++P G +WSI AY L G
Sbjct: 340 KQRLRERIREMLLEAGWTIDYRPRKNRDYLDAVYISPRGTAYWSIIKAYEALLKQL-NSG 398
Query: 571 GSSDTSKTGFTFTPIPEDELSILKK-------------------VINKSRSDRNKKKKGK 611
+ + TFT I ++ LS L + + K+ RN
Sbjct: 399 EKAKPCEDSSTFTLISDEILSQLTRKTKRKIEKDMKSEEQSASDSVGKATFARNFLAIKN 458
Query: 612 NLGTDGEIVTKKKKKKGKTNSAASP-HGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKL 670
+G D + +G T+ +ASP H +++K S G +SH+ + +H
Sbjct: 459 EVGND----DSRDSSRGTTSKSASPLHHQTEK---------STGSSSHHVDGGKSSKHV- 504
Query: 671 QETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKT 730
R L VR V G+ S GDGFV K T+LAW+ID+GT+ L+EKV Y NQR+T
Sbjct: 505 -------RSTLSVRRPVRGDNSEGDGFVPSSEKQTILAWLIDSGTLKLSEKVMYMNQRRT 557
Query: 731 RVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNK 790
ML+G I RDGI C CCS+I ++SKF+ H+ SKL PFQN++ +G SL QC +D+W+K
Sbjct: 558 HAMLEGWITRDGIHCGCCSKILSVSKFEIHAGSKLRQPFQNIFLNTGVSLFQCQIDAWDK 617
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
Q + GF V+ +DPNDD CGICGDGGDL+CCDGCPSTFHQ CLDI+ W
Sbjct: 618 QKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRGHLMPDWIF 677
Query: 851 VYCS--CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLS 908
+ + C H N S Q+ + Q ++
Sbjct: 678 LRFNYRCFLLVIGIAPIVHAN-------SVGQLLKMLLRPRMQIPAK------------- 717
Query: 909 FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVAL 968
C +K + E ++K +GVKH+LE G++WSLVHR +DL L + VE N++LA+AL
Sbjct: 718 -CVRK--NLSEGVKKYVGVKHELEAGFSWSLVHRECADSDLFLGEHPHIVENNSKLALAL 774
Query: 969 SVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHG 1028
+VMDECFLP+ D RSG+N++ N+LYN GSNF RLN+ GF+TA+LER DE++++ASIR HG
Sbjct: 775 TVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVASASIRFHG 834
Query: 1029 KELAEMPFIGTRHMYRRQGMCRRLLTGIESALC 1061
LAEMPFIGTRH+YR QGMCRRL + IES C
Sbjct: 835 NHLAEMPFIGTRHVYRHQGMCRRLFSVIESVKC 867
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 408 bits (1049), Expect = e-110, Method: Compositional matrix adjust.
Identities = 193/452 (42%), Positives = 282/452 (62%), Gaps = 18/452 (3%)
Query: 674 QQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVM 733
+++ C LL R+ +G DG G T+L+W+ID G + N+ +QY N + V+
Sbjct: 894 KRKGNCKLLPRSVGKGGRQATDGKWTSSGVRTVLSWLIDAGVISSNDVIQYRNLKDNAVV 953
Query: 734 LQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDE 793
G + RDGI C CC+E+F++ F H+ KL P +NL+ ESG S C L +W+ + +
Sbjct: 954 KDGYVTRDGIVCKCCTELFSVCNFKIHAGFKLNRPCRNLFMESGKSFTLCQLQAWSTEYK 1013
Query: 794 SKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
++ G V D D NDD+CG+CGDGG+LICCD CPSTFHQ CL K+ P G W+C C
Sbjct: 1014 VRKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC 1073
Query: 854 SCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKK 913
+C+ CG + V D++ S+ CS CE KYH C + + V+ FCG+
Sbjct: 1074 TCRICGDL------VKDREASSSFLALKCSQCEHKYHMPCLK-EKCVKEVGGDARFCGEN 1126
Query: 914 CQEIFERLEKLLGVKHDLEGGYTWSLV------HRFDVSTDLSLSDVCQKVECNARLAVA 967
CQEI+ L+ LLG + + G+TW+L+ + S L+L K ECN++LAVA
Sbjct: 1127 CQEIYSGLQGLLGFVNHIADGFTWTLLRCIHDDQKVHSSQKLAL-----KAECNSKLAVA 1181
Query: 968 LSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIH 1027
L++M+ECFL + D R+GI++I ++LYN GS+F RLN+ GF+T +LE+DD ++S ASIR+H
Sbjct: 1182 LTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVH 1241
Query: 1028 GKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQ 1087
G +AEMP I T +R +GMCR L+ IE L S+ VEK+++ AI L ETWT FGF+
Sbjct: 1242 GVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFK 1301
Query: 1088 PLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
P+E K ++ ++L+VFPG +L+K + +NQ
Sbjct: 1302 PVEDDEKASLKKINLMVFPGTILLKKSLYENQ 1333
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 28/40 (70%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK 550
+ LL D I +L+AAGW IE R RN R + ++VY +PEG+
Sbjct: 279 RPLLHDHINHVLIAAGWSIEKRKRNSRPHEESVYRSPEGR 318
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 408 bits (1048), Expect = e-110, Method: Compositional matrix adjust.
Identities = 193/452 (42%), Positives = 282/452 (62%), Gaps = 18/452 (3%)
Query: 674 QQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVM 733
+++ C LL R+ +G DG G T+L+W+ID G + N+ +QY N + V+
Sbjct: 830 KRKGNCKLLPRSVGKGGRHATDGKWTSSGVRTVLSWLIDAGVISSNDVIQYRNLKDNAVV 889
Query: 734 LQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDE 793
G + RDGI C CC+E+F++ F H+ KL P +NL+ ESG S C L +W+ + +
Sbjct: 890 KDGYVTRDGIVCKCCTELFSVCNFKIHAGFKLNRPCRNLFMESGKSFTLCQLQAWSTEYK 949
Query: 794 SKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
++ G V D D NDD+CG+CGDGG+LICCD CPSTFHQ CL K+ P G W+C C
Sbjct: 950 VRKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC 1009
Query: 854 SCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKK 913
+C+ CG + V D++ S+ CS CE KYH C + + V+ FCG+
Sbjct: 1010 TCRICGDL------VKDREASSSFLALKCSQCEHKYHMPCLK-EKCVKEVGGDARFCGEN 1062
Query: 914 CQEIFERLEKLLGVKHDLEGGYTWSLV------HRFDVSTDLSLSDVCQKVECNARLAVA 967
CQEI+ L+ LLG + + G+TW+L+ + S L+L K ECN++LAVA
Sbjct: 1063 CQEIYSGLQGLLGFVNHIADGFTWTLLRCIHDDQKVHSSQKLAL-----KAECNSKLAVA 1117
Query: 968 LSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIH 1027
L++M+ECFL + D R+GI++I ++LYN GS+F RLN+ GF+T +LE+DD ++S ASIR+H
Sbjct: 1118 LTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVH 1177
Query: 1028 GKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQ 1087
G +AEMP I T +R +GMCR L+ IE L S+ VEK+++ AI L ETWT FGF+
Sbjct: 1178 GVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFK 1237
Query: 1088 PLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
P+E K ++ ++L+VFPG +L+K + +NQ
Sbjct: 1238 PVEDDEKASLKKINLMVFPGTILLKKSLYENQ 1269
Score = 45.8 bits (107), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 28/40 (70%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK 550
+ LL D I +L+AAGW IE R RN R + ++VY +PEG+
Sbjct: 279 RPLLHDHINHVLIAAGWSIEKRKRNSRPHEESVYRSPEGR 318
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 241/588 (40%), Positives = 325/588 (55%), Gaps = 77/588 (13%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEG 570
KQ LR+RI E+LL AGW I+YRPR R+Y DAVY++P G +WSI AY L
Sbjct: 353 KQRLRERIREMLLEAGWTIDYRPRRNRDYLDAVYISPRGTAYWSIIKAYEALLKQLNSGE 412
Query: 571 GSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGKNLG-TDGEIVTKK-----K 624
+ TF+ I ++ LS L + KS+ +++ K++ + +DG+ + K
Sbjct: 413 KVAKPCDDSSTFSLISDEILSQLTRK-TKSKIEKDMKRELHSASDSDGKATFARNFLAIK 471
Query: 625 KKKGKTNSAASPHGKSQKR--GIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALL 682
+ G +S S G + K + + S G +SH R ++ + R LL
Sbjct: 472 NEVGNDDSRDSSQGTTSKSESPLHHQTEKSTGSSSH--------RVDGGKSSKHGRSTLL 523
Query: 683 VRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDG 742
VR SV G+ S DGFV K T+LAW+ID+GT+ L+EKV Y NQR+TR ML+G I RDG
Sbjct: 524 VRRSVRGDNSESDGFVPSSEKRTVLAWLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDG 583
Query: 743 IRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFV 802
I C CCS+I +SKF+ H+ SKL PFQN++ SG + GF V
Sbjct: 584 IHCGCCSKILAVSKFEIHAGSKLRQPFQNIFLNSGG---------------AGNIGFCSV 628
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS--CQFCGR 860
+ +DPNDD CGICGDGGDL+CCDGCPSTFHQ CLDI+ W + + C
Sbjct: 629 DVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRGHLMPDWIFLRFNYRCFLLVI 688
Query: 861 INESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFER 920
H N S Q+ + + Q ++ C +K + E
Sbjct: 689 GIAPIVHAN-------SVRQLLKMLLRLWVQIPAK--------------CVRK--NLSEG 725
Query: 921 LEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPD 980
++K +GVKH+LE G++WSLVHR ++DLSLS VE N++LA+AL+VMDECFLP+ D
Sbjct: 726 VKKYVGVKHELEAGFSWSLVHRECTNSDLSLSGHPHIVENNSKLALALTVMDECFLPIID 785
Query: 981 HRSG-------INLIHNILYNFG-------------SNFKRLNYKGFFTAILERDDEIIS 1020
RSG N I + SNF RLN+ GF+TA+LER DEI++
Sbjct: 786 RRSGHCKKFCLRNFTTVIFFGISLCWFVCLYIAFRRSNFNRLNFGGFYTALLERGDEIVA 845
Query: 1021 AASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKL 1068
+ASIR HG LAEMPFIGTRH+YR QGMCRRL + +ES + +V KL
Sbjct: 846 SASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESVSSTADVAKL 893
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 404 bits (1039), Expect = e-109, Method: Compositional matrix adjust.
Identities = 224/591 (37%), Positives = 333/591 (56%), Gaps = 47/591 (7%)
Query: 541 DAVYVNPEGKT-HWSITLAYSVLKNHYEQEGGSSDTSKT-----GFTFTPIP---EDELS 591
D V V P KT W I S L +Q+ +DT + +PI E + S
Sbjct: 685 DIVSVAPGNKTEFWDIDGTCSQLDMIEDQKSHIADTKNVDSHEKNLSLSPISCHSERKGS 744
Query: 592 ILKKVINKSRSDRNKKKKGK--NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKP 649
LKK + + + +KKK + D +V+ + K ++SAA
Sbjct: 745 KLKKNFDSHKGSKTRKKKLNECQIEDDDLLVSAIIRNKDVSSSAA--------------- 789
Query: 650 SVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAW 709
G SH +R K+ Q+ C LL+R+ GE++ DG G T+L+W
Sbjct: 790 -----GFSHVRKYFKSRA-KMNRKSQKSSCKLLLRSLGSGEKNYKDGKWYALGARTVLSW 843
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPF 769
++D G + N+ +QY + + V+ GRI DGI C+CCS+I +IS+F +H+ K
Sbjct: 844 LLDAGVISSNDIIQYQSPKDGSVVKYGRITGDGIICNCCSDILSISEFKSHAGFKFNRAC 903
Query: 770 QNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGC 829
NL+ +SG + C L +W+ + ++++ V D +D NDD+CGICGDGG+LICCD C
Sbjct: 904 SNLFLDSGRPFMLCQLQAWSTEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNC 963
Query: 830 PSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKY 889
PSTFH +CL I++ P G W+C+ C+C+ CG + VN ++ S+ S C CE+KY
Sbjct: 964 PSTFHHSCLSIQELPEGNWYCLNCTCRICGDL------VNFEEISSSSDALKCFQCEQKY 1017
Query: 890 HQSCSQ---TDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH--RFD 944
H C + D V+ S + FC CQ+I+ L+ LG+ + G++W+L+ +D
Sbjct: 1018 HGQCLKQRDIDSGVE---SHIWFCSGSCQKIYAALQSQLGLTNQFANGFSWTLLRCIHYD 1074
Query: 945 VSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNY 1004
LS + + ECN+RL VAL++M+ECFL + D R+GI++I +++Y++ S+F RL++
Sbjct: 1075 QKI-LSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDF 1133
Query: 1005 KGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLN 1064
GF+T ILE+DD ++ ASIR+HG ELAEMP I T YRRQGMCRRLL IE L S
Sbjct: 1134 HGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFK 1193
Query: 1065 VEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
V+KL+I AI L ETWT FGF +E KQ + +L+VFPG +L+K +
Sbjct: 1194 VKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHKFNLMVFPGTVLLKKAL 1244
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 402 bits (1032), Expect = e-108, Method: Compositional matrix adjust.
Identities = 221/588 (37%), Positives = 329/588 (55%), Gaps = 41/588 (6%)
Query: 541 DAVYVNPEGKT-HWSITLAYSVLKNHYEQEGGSSDTSKT-----GFTFTPIP---EDELS 591
D V V P KT W I S L +Q+ +DT + +PI E + S
Sbjct: 685 DIVSVAPGNKTEFWDIDGTCSQLDMIEDQKSHIADTKNVDSHEKNLSLSPISCHSERKGS 744
Query: 592 ILKKVINKSRSDRNKKKKGK--NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKP 649
L K + + + +KKK + D +V+ + K ++SAA
Sbjct: 745 KLNKNFDSHKGSKTRKKKLNECQIEDDDLLVSAIIRNKDVSSSAA--------------- 789
Query: 650 SVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAW 709
G SH +R K+ Q+ C LL+R+ GE++ DG G T+L+W
Sbjct: 790 -----GFSHVRKYFKSRA-KMNRKSQKSSCKLLLRSLGSGEKNYKDGKWYALGARTVLSW 843
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPF 769
++D G + N+ +QY + + V+ GRI DGI C+CCS+I +IS+F +H+ K
Sbjct: 844 LLDLGVISSNDIIQYQSPKDGSVVKYGRITGDGIICNCCSDILSISEFKSHAGFKFNRAC 903
Query: 770 QNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGC 829
NL+ +SG + C L +W+ + ++++ V D +D NDD+CGICGDGG+LICCD C
Sbjct: 904 SNLFLDSGRPFMLCQLQAWSTEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNC 963
Query: 830 PSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKY 889
PSTFH +CL I++ P G W+C+ C+C+ CG + VN ++ S+ S C CE+KY
Sbjct: 964 PSTFHHSCLSIQELPEGNWYCLNCTCRICGDL------VNFEEISSSSDALKCFQCEQKY 1017
Query: 890 HQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH--RFDVST 947
H C + S + FC CQ+I+ L+ LG+ + G++W+L+ +D
Sbjct: 1018 HGQCLKQRDINSGVESHIWFCSGSCQKIYAALQSQLGLTNQFANGFSWTLLRCIHYDQKI 1077
Query: 948 DLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGF 1007
LS + + ECN+RL VAL++M+ECFL + D R+GI++I +++Y++ S+F RL++ GF
Sbjct: 1078 -LSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDFHGF 1136
Query: 1008 FTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEK 1067
+T ILE+DD ++ ASIR+HG ELAEMP I T YRRQGMCRRLL IE L S V+K
Sbjct: 1137 YTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFKVKK 1196
Query: 1068 LIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
L+I AI L ETWT FGF +E KQ + +L+VFPG +L+K +
Sbjct: 1197 LVIAAIPSLVETWTEGFGFVTVENEEKQSLHKFNLMVFPGTVLLKKAL 1244
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 401 bits (1031), Expect = e-108, Method: Compositional matrix adjust.
Identities = 191/443 (43%), Positives = 276/443 (62%), Gaps = 9/443 (2%)
Query: 675 QRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVML 734
Q+ RC LL RN + N DG Y G T+L+W+ID G + L++ +QY N + V+
Sbjct: 662 QKGRCRLLPRNPSNAGKHNKDGNRFYLGARTILSWLIDNGVISLSDVIQYRNPKDNVVIK 721
Query: 735 QGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDES 794
GRI +DGI C CC ++ T+S+F H+ L P N++ ESG C+L +W+ + ++
Sbjct: 722 DGRITKDGIICICCGKVLTLSEFKFHAGFTLNRPCLNIFMESGEPFTLCLLQAWSTEYKA 781
Query: 795 KRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
++ V+ D D NDD+CG+CG+GG+LICCD CPSTFH CL ++ P G W+C C+
Sbjct: 782 RKSQNQAVHADENDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT 841
Query: 855 CQFCGRINESTCHVNDQDD-SALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKK 913
C+ CG + V D+D A +LQ CS CE KYH+ C + + FCG+
Sbjct: 842 CRICGNL------VIDKDTLDAHDSLQ-CSQCEHKYHEKCLEDRDKQEGAILDTWFCGQS 894
Query: 914 CQEIFERLEKLLGVKHDLEGGYTWSLVHRF-DVSTDLSLSDVCQKVECNARLAVALSVMD 972
CQE++ L+ +G+ + + G +W+L+ D S K CN +LAVAL++M+
Sbjct: 895 CQEVYSGLQSQVGLVNQVADGISWTLLRCIHDDQKVHSAQWFALKAVCNTKLAVALTIME 954
Query: 973 ECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELA 1032
ECF+ + D R+GI+LI +LYN+GS F RLN++GF+T +LE+DD +IS ASIR+HG +A
Sbjct: 955 ECFVSMFDPRTGIHLIPQVLYNWGSEFARLNFQGFYTIVLEKDDVLISVASIRVHGTTVA 1014
Query: 1033 EMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVS 1092
EMP I T YRRQGMCR L+T IE L S VEKL+I AI +L ETWT FGF P++
Sbjct: 1015 EMPLIATCSQYRRQGMCRLLVTAIEQVLISFKVEKLVISAIPDLVETWTKGFGFIPVDDI 1074
Query: 1093 SKQKMRNMSLLVFPGVDMLQKPM 1115
+Q++ ++L+VFPG +L K +
Sbjct: 1075 ERQRLNKINLMVFPGTVLLVKSL 1097
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%)
Query: 505 LSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAY 559
L + + +L+ IV+LL AGW IE R R R Y ++VY PEGKT T A+
Sbjct: 269 LEEEDPRTILQVHIVQLLKMAGWSIEKRQRPSRRYPESVYRTPEGKTVREFTKAW 323
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 185/421 (43%), Positives = 269/421 (63%), Gaps = 18/421 (4%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+W+ID G + N+ +QY N + V+ G + RDGI C CC+E+F++ F H+ K
Sbjct: 764 TVLSWLIDAGVISSNDVIQYRNLKDNAVVKDGYVTRDGIVCKCCTELFSVCNFKIHAGFK 823
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
L P +NL+ ESG S C L +W+ + + ++ G V D D NDD+CG+CGDGG+LI
Sbjct: 824 LNRPCRNLFMESGKSFTLCQLQAWSTEYKVRKGGIKNVQIDEIDQNDDSCGLCGDGGELI 883
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCD CPSTFHQ CL K+ P G W+C C+C+ CG + V D++ S+ CS
Sbjct: 884 CCDNCPSTFHQACLSAKELPEGNWYCPNCTCRICGDL------VKDREASSSFLALKCSQ 937
Query: 885 CEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLV---- 940
CE KYH C + + V+ FCG+ CQEI+ L+ LLG + + G+TW+L+
Sbjct: 938 CEHKYHMPCLK-EKCVKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIH 996
Query: 941 --HRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSN 998
+ S L+L K ECN++LAVAL++M+ECFL + D R+GI++I ++LYN GS+
Sbjct: 997 DDQKVHSSQKLAL-----KAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSD 1051
Query: 999 FKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIES 1058
F RLN+ GF+T +LE+DD ++S ASIR+HG +AEMP I T +R +GMCR L+ IE
Sbjct: 1052 FARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEK 1111
Query: 1059 ALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKN 1118
L S+ VEK+++ AI L ETWT FGF+P+E K ++ ++L+VFPG +L+K + +N
Sbjct: 1112 MLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKKINLMVFPGTILLKKSLYEN 1171
Query: 1119 Q 1119
Q
Sbjct: 1172 Q 1172
Score = 45.8 bits (107), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 28/40 (70%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK 550
+ LL D I +L+AAGW IE R RN R + ++VY +PEG+
Sbjct: 259 RPLLHDHINHVLIAAGWSIEKRKRNSRPHEESVYRSPEGR 298
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 188/443 (42%), Positives = 275/443 (62%), Gaps = 9/443 (2%)
Query: 675 QRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVML 734
Q+ RC LL RN + N D Y G T+L+W+ID G + LN+ +QY N + V+
Sbjct: 662 QKGRCRLLPRNPCNAGKHNKDCNRFYLGARTILSWLIDNGVISLNDVIQYRNPKDNVVIK 721
Query: 735 QGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDES 794
GRI +DGI C CC ++ T+S+F H+ + P N++ ESG C+L +W+ + ++
Sbjct: 722 DGRITKDGIICTCCDKVLTLSEFKFHAGFTVNRPCLNIFMESGEPFTLCLLQAWSAEYKA 781
Query: 795 KRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
+R V+ D D NDD+CG+CG+GG+LICCD CPSTFH CL ++ P G W+C C+
Sbjct: 782 RRSQNQAVHADDNDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT 841
Query: 855 CQFCGRINESTCHVNDQDDS-ALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKK 913
C+ CG + V D+D S A +LQ CS CE KYH+ C + + S FCG+
Sbjct: 842 CRICGNL------VIDKDTSDAHDSLQ-CSQCEHKYHEKCLEDRDKQEVAISDTWFCGQS 894
Query: 914 CQEIFERLEKLLGVKHDLEGGYTWSLVHRF-DVSTDLSLSDVCQKVECNARLAVALSVMD 972
CQE++ L+ +G+ + + G +W+L+ D S K CN +LAVAL++M+
Sbjct: 895 CQEVYSGLQTQVGLVNQVADGISWTLLRCIHDDQKVHSAQWFALKAVCNTKLAVALTIME 954
Query: 973 ECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELA 1032
ECF+ + D R+GI++I +LYN+GS F RLN++GF+T +LE+ D +IS ASIR+HG +A
Sbjct: 955 ECFVSMFDPRTGIHMIPQVLYNWGSEFARLNFQGFYTIVLEKKDVLISVASIRVHGTTVA 1014
Query: 1033 EMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVS 1092
EMP I T YRRQGMCR L++ IE L S VEKL++ AI +L ETWT FGF ++
Sbjct: 1015 EMPLIATCSQYRRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDI 1074
Query: 1093 SKQKMRNMSLLVFPGVDMLQKPM 1115
+Q++ ++L+VFPG +L K +
Sbjct: 1075 ERQRLNKINLMVFPGTVLLVKSL 1097
Score = 47.0 bits (110), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%)
Query: 505 LSKHGAKQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAY 559
L + + +L+ I++LL AGW IE R R R Y ++VY PEGKT T A+
Sbjct: 269 LEEEDPRTILQFHILQLLKMAGWSIEKRQRPSRRYPESVYRTPEGKTIREFTKAW 323
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 191/455 (41%), Positives = 277/455 (60%), Gaps = 11/455 (2%)
Query: 670 LQETQQRKR----CALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYW 725
L+ +RKR C LL RN + + G + G T+L+W+ID G + + + VQY
Sbjct: 74 LRAGSKRKRKKGGCRLLPRNLGKLGKHYVGGKWSRMGSRTVLSWLIDAGVLSVKDVVQYR 133
Query: 726 NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCIL 785
N + V+ G + +DGI C CC+ + +++KF +H+ KL P NL+ ESG C L
Sbjct: 134 NLKDDFVIKDGVVTKDGIMCKCCNMVLSVTKFKSHAGFKLNRPCSNLFMESGKPFTLCQL 193
Query: 786 DSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPS 845
+W+ + +S++ G V D +D NDD+CG+CGDGG+LICCD CPSTFHQ CL + P
Sbjct: 194 QAWSAEYKSRKSGTQVVRADEDDKNDDSCGLCGDGGELICCDNCPSTFHQACLCTEDLPE 253
Query: 846 GKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPS 905
G W+C C+C CG + VND++ S+ C CE KYH +C Q + S
Sbjct: 254 GSWYCPNCTCWICGDL------VNDKEASSSVGAYKCLQCEHKYHGACQQGKQTHEGLVS 307
Query: 906 SLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF-DVSTDLSLSDVCQKVECNARL 964
FC CQE++ L +G+ + + G+ W+L+ + LS + K ECN++L
Sbjct: 308 DAWFCSGSCQEVYSGLHSRVGINNPIADGFCWTLLRCIHEDQKVLSAQRLALKAECNSKL 367
Query: 965 AVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASI 1024
AVAL++M+ECF + D R+GI++I + LYN+GS+F RLN+ GF+T +LE+DD ++SAAS+
Sbjct: 368 AVALTIMEECFQSMVDPRTGIDMIPHALYNWGSDFARLNFFGFYTVVLEKDDVLVSAASV 427
Query: 1025 RIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVF 1084
R+HG +AEMP I T YRRQGMCR L+T IE L S VEKL+I AI +L ETWT F
Sbjct: 428 RVHGVTVAEMPLIATCSNYRRQGMCRHLMTAIEEMLISYKVEKLVISAIPDLVETWTKGF 487
Query: 1085 GFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
GF P+ KQ + ++ +VFPG +L+K + K +
Sbjct: 488 GFIPVSKDEKQSLNKINFMVFPGTILLKKQLYKTK 522
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 392 bits (1006), Expect = e-105, Method: Compositional matrix adjust.
Identities = 192/473 (40%), Positives = 280/473 (59%), Gaps = 8/473 (1%)
Query: 645 IKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKW 704
IK K +S G S R K + Q+ C LL+RN + + DG + G
Sbjct: 699 IKNKDFISNGPKSTYKKKAFKSRAKTRTKSQKGSCRLLLRNLSKVGKHCNDGKWSIMGPR 758
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+W+ID + LN+ +QY N V+ G I ++GI C CC+ + +++ F H+ K
Sbjct: 759 TVLSWLIDIEAISLNDVIQYRNPTDDTVIKDGLIKKEGIMCKCCNMVLSVTNFKNHAGFK 818
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKR-KGFHFVNFDGEDPNDDTCGICGDGGDL 823
P N++ +SG C L +W+ + ++++ + V +D NDD+CG+CGDGG+L
Sbjct: 819 QSRPCLNVFMKSGKPFTLCQLQAWSAEYKTRKSRTIKVVRTADDDENDDSCGLCGDGGEL 878
Query: 824 ICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICS 883
ICCD CPSTFHQ CL ++ P G W+C C+C CG + VND++D S CS
Sbjct: 879 ICCDNCPSTFHQACLSTEELPEGSWYCPNCTCWICGEL------VNDKEDINSSNAFKCS 932
Query: 884 LCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF 943
CE KYH SC + + S FCG CQ ++ L+ +G+ + + G W+L+
Sbjct: 933 QCEHKYHDSCWKNKTIGKGGASDTWFCGGSCQAVYFGLQSRVGIINHIADGVCWTLLKCI 992
Query: 944 DVSTDL-SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL 1002
+ S + K ECN++LAVAL++M+ECF + D R+GI++I ++LYN+ S F RL
Sbjct: 993 HEDQKVHSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHVLYNWRSEFARL 1052
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
N+ GF+T +LE+DD ++S ASIRIHG +AEMP I T YRRQGMCRRL+T IE L S
Sbjct: 1053 NFHGFYTVVLEKDDVLLSVASIRIHGATVAEMPLIATCSNYRRQGMCRRLMTAIEEMLIS 1112
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
VEKL++ AI +L ETWT FGF P+ KQ + ++L+VFPG +L+KP+
Sbjct: 1113 FKVEKLVVSAIPDLVETWTEGFGFTPMSNDEKQSLNKINLMVFPGTILLKKPL 1165
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 365 bits (936), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 195/545 (35%), Positives = 300/545 (55%), Gaps = 32/545 (5%)
Query: 667 RHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWN 726
+ + ++ R C LL R+S E G + G T+L+W+I T + +E +Q +
Sbjct: 582 KARTKKRNNRGGCRLLPRSSSNVENHFFQGNWSILGPRTVLSWLIATKVISRDEVIQLRD 641
Query: 727 QRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILD 786
V+ G + +DG+ C CC++ ++S+F H+ P NL+ SG C L+
Sbjct: 642 PDDDTVVKTGLVTKDGVVCTCCNKTVSLSEFKNHAGFNQNCPCLNLFMGSGKPFASCQLE 701
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
+W+ + +++R G+ +DPNDD+CG+CGDGG+LICCD CPSTFHQ CL ++ P G
Sbjct: 702 AWSAEYKARRNGWRLEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEG 761
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSS 906
W+C C+C C + D++ S CS C KYH +C Q +
Sbjct: 762 SWYCSSCTCWICSELV--------SDNAERSQDFKCSQCAHKYHGTCLQGISKRRKLFPE 813
Query: 907 LSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDL-SLSDVCQKVECNARLA 965
FCGK C++++ L +G+ + G +WS++ F + S + K ECN++LA
Sbjct: 814 TYFCGKNCEKVYNGLSSRVGIINPNADGLSWSILKCFQEDGMVHSARRLALKAECNSKLA 873
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
VALS+M+E FL + D R+GI++I ++LYN+GS F RL++ GF+T ++E+DD +IS ASIR
Sbjct: 874 VALSIMEESFLSMVDPRTGIDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIR 933
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
+HG +AEMP + T YRRQGMCR L+ IE L SL VEKL++ A+ L ETWT FG
Sbjct: 934 VHGVTIAEMPLVATCSKYRRQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFG 993
Query: 1086 FQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMISAKGL-RSSELENPRTADE 1144
F+P++ + ++ ++L+VFPG +L+K + ++ P + KG+ S E NP
Sbjct: 994 FKPMDDEERDALKRINLMVFPGTTLLKKTLYESTKPS----TMKGVCLSKERNNP----- 1044
Query: 1145 VGKNSDEKYSAGFDLNACINAAAPHVCKIH--DKSAPG----ESTLQFPNGSTHDASGLT 1198
N + G D A +P ++ D+ P E + FP D + T
Sbjct: 1045 --SNKEADLEPGLD-----KAGSPMSTQVESCDQMVPAGSDDEPSPGFPVPLGADQTEPT 1097
Query: 1199 SETVN 1203
SET N
Sbjct: 1098 SETEN 1102
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 364 bits (935), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 177/426 (41%), Positives = 265/426 (62%), Gaps = 29/426 (6%)
Query: 705 TLLAWMIDTGTVPLNEKVQYW----------NQRKTRVMLQGRIARDGIRCDCCSEIFTI 754
T+++ +ID G + NEK+ Y Q K V+++GR R+G+RC+CC+E+ T+
Sbjct: 91 TIMSRLIDHGMIIENEKIYYVADREGDSDARKQSKKEVLMKGRARREGVRCECCNEVMTV 150
Query: 755 SKFDTHSKSKLCHPFQNLYF-ESGSSLLQCILDSWNKQDESKRKGFH-FVNFDG-EDPND 811
F+TH+ S L P+++++ S SSLLQC + W E +R+ F+ V +G D +D
Sbjct: 151 WDFETHAGSVLQRPYEHIHVARSNSSLLQCQFEVWQSNVEVERRTFNEIVPRNGASDKHD 210
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
D C IC DGGDLICC+ C ST H C+ +++ P G W C YC C+ C + ND+
Sbjct: 211 DACLICADGGDLICCEKCWSTSHLKCMGLERIPQGDWICPYCVCKHCNK--------NDK 262
Query: 872 DDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSF-CGKKCQEIFERLEKLLGVKHD 930
D LQ C C++KYH C ++ + S + C C E++E+L+ L+GVKH+
Sbjct: 263 D------LQTCVQCDKKYHCQCLVSNKELDLNASGETLACDSHCGEVYEKLQSLVGVKHE 316
Query: 931 LEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHN 990
LEGG+ W+L+ R + +L D+ ECN+++A+A V+DECF + D + IN++ +
Sbjct: 317 LEGGFCWTLLQRME-PDNLDFKDLHLITECNSKIALAWEVLDECFTTIIDRHTQINVVQS 375
Query: 991 ILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCR 1050
+ Y+ GSN R+N++GF+TAILE++D+IISAA+IR+HG +LAEMPFIGTRH+YR+ GM R
Sbjct: 376 VAYSRGSNLNRINFRGFYTAILEKNDDIISAATIRVHGTDLAEMPFIGTRHLYRQNGMSR 435
Query: 1051 RLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDM 1110
LL +ES + VE LIIP++ EL E W GF P+E + QK+ N + L FP
Sbjct: 436 MLLVTLESIFSVMGVEHLIIPSVQELTEMWEGKCGFSPIEDAVSQKITNWNTLTFPSAVR 495
Query: 1111 LQKPMM 1116
LQK ++
Sbjct: 496 LQKALL 501
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 363 bits (931), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 183/458 (39%), Positives = 274/458 (59%), Gaps = 15/458 (3%)
Query: 661 MSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNE 720
+S A+ KL+ +++ R LL+R +G DG + T+L W+I TG V L +
Sbjct: 510 VSSKAKFRKLKSSKKDSR--LLIR---KGGTDVLDGRQIVLARKTVLCWLIATGFVTLKD 564
Query: 721 KVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSL 780
VQY N V+ G + DGI C CCS+ +IS F H+ L NL +SG S
Sbjct: 565 IVQYRNPENNEVLKDGWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSLNLCLQSGKSF 624
Query: 781 LQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDI 840
C +++WN + +R + D NDDTCG CGDGG+L+CCD CPST+HQ+CL +
Sbjct: 625 TLCQIEAWNAEYMDRRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSV 684
Query: 841 KKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAV 900
K+ P W+C C C+ CG C V +++ S+ S + C C +H +C + GA
Sbjct: 685 KELPDDSWYCHNCICRICG------CPVTEKEISSFSAIIKCLQCGAAHHDTCVEM-GAT 737
Query: 901 QYEP--SSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDL-SLSDVCQK 957
+E S FCG C+EI+ L +GV+ L G +W+++ + S+ +
Sbjct: 738 AFEEMDSDEWFCGTHCKEIYLGLHGCVGVESSLGDGLSWTILRCNSGGQKMHSVQKIAHA 797
Query: 958 VECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDE 1017
+ECN++LAVAL++M+ECF + D R+GIN+I ++LYN GS + RLNY+GF+T ILE+ +E
Sbjct: 798 IECNSKLAVALTLMEECFAQMVDTRTGINMIPHVLYNQGSKYARLNYQGFYTVILEKGEE 857
Query: 1018 IISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELR 1077
I+ AASIR+HG + AE+PFI T +RR+GMCRRL+ IE L S +V+ L++ AI EL
Sbjct: 858 ILCAASIRVHGMKAAELPFIATCREHRRKGMCRRLINTIEEMLKSFHVKMLVLSAIPELV 917
Query: 1078 ETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
TW S FGF+P+E ++++ ++L++FPG +L K +
Sbjct: 918 STWVSGFGFKPIEEYERKQLDTINLMLFPGTSLLIKSL 955
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 360 bits (924), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 179/448 (39%), Positives = 274/448 (61%), Gaps = 22/448 (4%)
Query: 675 QRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVML 734
Q++ L VR +G + G + T+L W+I TG + + + +QY N R V+
Sbjct: 900 QKRGSKLFVRTCGKGGTNLLGGKRISLARKTVLCWLIATGFLTVKDVIQYRNLRSNEVIK 959
Query: 735 QGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDES 794
G++ +GI C+CC++ +IS F H+ +L L+ +SG S C +++W+ + S
Sbjct: 960 DGQVTWEGILCNCCTKTLSISDFKAHAGCRLRLSSLGLFLQSGKSYTLCQVEAWSAELMS 1019
Query: 795 KRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
+R + + D NDDTCG CGDGG+L+CCD CPST+H+ CL ++ P G W+C C+
Sbjct: 1020 RRSDAYGRKVEAVDENDDTCGFCGDGGELLCCDNCPSTYHEACLSSQELPEGSWYCHNCT 1079
Query: 855 CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQ-----TDGAVQYEPSSLSF 909
C+ CG VN+++ S+ S + C C + YH +C +DG + S F
Sbjct: 1080 CRSCGN------PVNEKEVSSFSDILKCLQCGDAYHNTCIDRVMLPSDG----KRSDTWF 1129
Query: 910 CGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKV----ECNARLA 965
CG+ C+EIF L +GV++ + +W+++ ++D QK+ ECN +LA
Sbjct: 1130 CGRYCKEIFMGLHSQVGVENVINNDLSWTILR---CNSDGQRLHSAQKIGLMTECNTKLA 1186
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
VAL++++ECF+ + D R+G+++I ++LYN GSNF RL+YKGF+T ILE+ DEI+ ASIR
Sbjct: 1187 VALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARLDYKGFYTVILEKGDEILCVASIR 1246
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
+HG + AE+PFI T YRRQGMCRRLL IE L S +VE L++ AI EL TW S FG
Sbjct: 1247 LHGTKAAELPFIATSVDYRRQGMCRRLLDIIEKMLRSFHVEMLVLSAIPELVNTWVSGFG 1306
Query: 1086 FQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
F+P+E K+++RN++L++FPG +L K
Sbjct: 1307 FKPIEDDEKKQLRNVNLMLFPGASLLTK 1334
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 358 bits (920), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 187/510 (36%), Positives = 286/510 (56%), Gaps = 32/510 (6%)
Query: 702 GKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS 761
G T+L+W+I T + +E +Q + V+ G + +DG+ C CC++ ++S+F H+
Sbjct: 573 GPRTVLSWLIATKVISRDEVIQLRDPDDDTVVKTGLVTKDGVVCTCCNKTVSLSEFKNHA 632
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
P NL+ SG C L++W+ + +++R G+ +DPNDD+CG+CGDGG
Sbjct: 633 GFNQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWRLEKASDDDPNDDSCGVCGDGG 692
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
+LICCD CPSTFHQ CL ++ P G W+C C+C C + D++ S
Sbjct: 693 ELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWICSELV--------SDNAERSQDFK 744
Query: 882 CSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH 941
CS C KYH +C Q + FCGK C++++ L +G+ + G +WS++
Sbjct: 745 CSQCAHKYHGTCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWSILK 804
Query: 942 RFDVSTDL-SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK 1000
F + S + K ECN++LAVALS+M+E FL + D R+GI++I ++LYN+GS F
Sbjct: 805 CFQEDGMVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWGSTFA 864
Query: 1001 RLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
RL++ GF+T ++E+DD +IS ASIR+HG +AEMP + T YRRQGMCR L+ IE L
Sbjct: 865 RLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILVAAIEEML 924
Query: 1061 CSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
SL VEKL++ A+ L ETWT FGF+P++ + ++ ++L+VFPG +L+K + ++
Sbjct: 925 MSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKRINLMVFPGTTLLKKTLYESTK 984
Query: 1121 PRENMISAKGL-RSSELENPRTADEVGKNSDEKYSAGFDLNACINAAAPHVCKIH--DKS 1177
P + KG+ S E NP N + G D A +P ++ D+
Sbjct: 985 PS----TMKGVCLSKERNNP-------SNKEADLEPGLD-----KAGSPMSTQVESCDQM 1028
Query: 1178 APG----ESTLQFPNGSTHDASGLTSETVN 1203
P E + FP D + TSET N
Sbjct: 1029 VPAGSDDEPSPGFPVPLGADQTEPTSETEN 1058
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 348 bits (894), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 161/411 (39%), Positives = 257/411 (62%), Gaps = 8/411 (1%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T++ W++ TG + + + +QY + + +++ G + +GI C+CC + ++S F H+
Sbjct: 852 TVICWLLATGFLSVKDVIQYRDPKSNKIIKDGMVTWEGIVCNCCKKNLSVSDFMAHAGRS 911
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
L+ ESG S C++++W+ + S+R + D +DDTCG CGDGG+L+
Sbjct: 912 HPQSSLGLFLESGKSYTLCLVEAWSAESMSRRSNAWGRKVEAIDESDDTCGFCGDGGELL 971
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCD CPST+HQ CL K+ P G W+C C+CQ CG ++++ S S + C
Sbjct: 972 CCDNCPSTYHQACLSAKELPEGSWYCHNCTCQVCGGP------FSEKEVSTFSAIFKCFQ 1025
Query: 885 CEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF 943
C + YH +C Q ++ + S FCGK C+EIF L +G + L+ +WS++
Sbjct: 1026 CGDAYHDTCIEQEKLPLEDQISQTWFCGKYCKEIFIGLRSHVGTDNILDSDLSWSILRCN 1085
Query: 944 DVSTDL-SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL 1002
+ L S+ + ECN +LAVAL++++ECF+ + D R+G+++I ++LYN GSNF R+
Sbjct: 1086 NDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARV 1145
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
+Y+GF+T ILE+ DEI+ ASIR+HG + AE+PFI T +RRQGMCR L++ IE LCS
Sbjct: 1146 DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDFRRQGMCRILMSIIEKMLCS 1205
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
NV+ L++ AI EL TW S FGF+P+E + ++++ N++L++FPG +L K
Sbjct: 1206 FNVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQLHNVNLMLFPGTSLLTK 1256
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 345 bits (886), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 170/426 (39%), Positives = 264/426 (61%), Gaps = 28/426 (6%)
Query: 703 KWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK 762
K T+L+W+ID+ V N K+ Y N+ ++ LQG + DGI C CC+ + T+S+F H+
Sbjct: 153 KRTILSWLIDSRVVENNAKIVYKNEAGEQI-LQGVLTGDGIWCSCCNTVITVSEFQLHAG 211
Query: 763 SKLCHPFQNLYF-ESGSSLLQCILDSWNKQDESKRKGFHFVNF--DGEDPNDDTCGICGD 819
+ P+Q ++ E+G SLL C ++WN+Q + +G+H + D D DD C +C D
Sbjct: 212 DEPNRPYQRIFISETGLSLLTCQAEAWNQQGIPELQGYHLIEPREDVSDKYDDACVVCAD 271
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTL 879
GG+LICCD CPST+H +CL ++ P G+W C C+C+FC H + D S +
Sbjct: 272 GGNLICCDKCPSTYHISCLQMEDEPQGEWRCPACACKFC--------HTHAFDISVFT-- 321
Query: 880 QICSLCEEKYHQSCSQTDGAVQYE-----PS-SLSFCGKKCQEIFERLEKLLGVKHDLEG 933
CS C++KYH C + + + + PS S FC C +I+E+LE+L+GV+++L+
Sbjct: 322 --CSQCDKKYHWECFRENEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVGVRNELDE 379
Query: 934 GYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILY 993
G TW+L+ R D + L + + CN+++AVA++VM+ECF P+ D + IN++ +++Y
Sbjct: 380 GLTWTLLRRMDPEAGVYLEESYDRTLCNSKIAVAVAVMEECFEPVIDRHTQINVVRSVIY 439
Query: 994 NFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLL 1053
N G+NF R++++GF+TAILE+ DE IS AS+RIHG +LAEMPFI TR YRR GMC +LL
Sbjct: 440 NCGANFPRISFEGFYTAILEKGDETISVASMRIHGNKLAEMPFIATRPSYRRLGMCHKLL 499
Query: 1054 TGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
IES V+ L+IP+I + W +GFQ +E ++ + L+F LQK
Sbjct: 500 VAIES------VQYLVIPSIEQRVRRWEESYGFQAIENKVMGELIKVKSLMFHCAIRLQK 553
Query: 1114 PMMKNQ 1119
P++ ++
Sbjct: 554 PLLVHE 559
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 345 bits (885), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 164/413 (39%), Positives = 258/413 (62%), Gaps = 12/413 (2%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T++ W++ TG + + + +QY + + V+ G + +G+ C+CC + ++S F H+
Sbjct: 876 TVICWLLATGFLTVKDVIQYRDPKSNEVIKDGLVTWEGVVCNCCKKTLSVSGFMAHAG-- 933
Query: 765 LCHPFQNL--YFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGD 822
HP +L + ESG S C +++W+ + S+R + D +DDTCG CGDGG+
Sbjct: 934 FSHPQSSLGLFLESGKSYTLCQVEAWSAEFMSRRSNAWGRKVEAIDESDDTCGFCGDGGE 993
Query: 823 LICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQIC 882
L+CCD CPST+H CL K+ P G W+C C+CQ CG V++++ S S + C
Sbjct: 994 LLCCDNCPSTYHPACLSAKELPEGSWYCHNCTCQICGGP------VSEKEVSTFSAIFKC 1047
Query: 883 SLCEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH 941
C + YH +C Q ++ + S FCGK C+EIF L +G ++ L+ +WS++
Sbjct: 1048 FQCGDAYHDTCIEQEKLPLEDQISQTWFCGKYCKEIFIGLRSHVGTENILDSELSWSILR 1107
Query: 942 RFDVSTDL-SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK 1000
+ L S+ + ECN +LAVAL++++ECF+ + D R+G+++I ++LYN GSNF
Sbjct: 1108 CNNDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFA 1167
Query: 1001 RLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
R++Y+GF+T ILE+ DEI+ ASIR+HG + AE+PFI T YRRQGMCR L+ IE L
Sbjct: 1168 RVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRILMNIIEKML 1227
Query: 1061 CSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
CS NV+ L++ AI EL TW S FGF+P+E + ++++ N++L++FPG +L K
Sbjct: 1228 CSFNVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQLHNVNLMLFPGTSLLTK 1280
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 344 bits (882), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 171/435 (39%), Positives = 265/435 (60%), Gaps = 12/435 (2%)
Query: 681 LLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIAR 740
LL R S +G + G + T+L W+I TG + + + +QY N + V+ G++
Sbjct: 895 LLARMSGKGGSNLLGGKRVSLARKTVLCWLIATGFLTVKDVIQYQNLKSNEVVKDGQVTW 954
Query: 741 DGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFH 800
+GI C+CC++ F+IS F H L L+ +SG S C +++W+ + S++
Sbjct: 955 EGILCNCCTKTFSISDFKVHGGCSLPKSSLGLFLQSGKSYTLCQVEAWSAEFLSRKCDAS 1014
Query: 801 FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGR 860
+ D NDDTCG CGDGG+L+CCD CPST+HQ CL ++ P G W+C C+C+ CG
Sbjct: 1015 GRKVEAMDENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCTCRSCGN 1074
Query: 861 INESTCHVNDQDDSALSTLQICSLCEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFE 919
+++++ S S + C C + YH +C Q + S++ FCG+ C+EIF
Sbjct: 1075 ------PLSEKEVSTFSAILKCLQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFI 1128
Query: 920 RLEKLLGVKHDLEGGYTWSLVH-RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPL 978
L +G+++ L+ +WS++ D S + ECN +LAVAL++++ECF+ +
Sbjct: 1129 GLHNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRM 1188
Query: 979 PDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIG 1038
D R+G+++I ++L SNF RL+Y+GF+T ILE+ DEI+ ASIR+HG + AE+PFI
Sbjct: 1189 VDPRTGVDMIPHVL----SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIA 1244
Query: 1039 TRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMR 1098
T YRRQGMCRRL+ IE L S +VE L++ AI EL TW S FGF+P+E + K+++R
Sbjct: 1245 TSVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLR 1304
Query: 1099 NMSLLVFPGVDMLQK 1113
N++L++FPG +L K
Sbjct: 1305 NVNLMLFPGTSLLTK 1319
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 343 bits (880), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 176/444 (39%), Positives = 262/444 (59%), Gaps = 9/444 (2%)
Query: 679 CALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRI 738
C LL R++ E G + G T+L+W+I T + +E +Q + V+ G +
Sbjct: 582 CRLLPRSTSNVENQFCQGNWSILGPRTVLSWLIATKVISRDEVIQLRDPDDDTVVKTGLV 641
Query: 739 ARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKG 798
+DG+ C CC+ ++S+F H+ P NL+ SG C L++W+ + +++R G
Sbjct: 642 TKDGVVCTCCNRTVSLSEFKNHAGFNQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNG 701
Query: 799 FHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFC 858
+ +DPNDD+CG+CGDGG+LICCD CPSTFHQ CL ++ P G W+C CSCQ C
Sbjct: 702 WRSEEASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCSCQIC 761
Query: 859 GRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIF 918
+ D+ S CS C KYH C Q + FCGK C++++
Sbjct: 762 SELV--------SDNGERSQDFKCSQCAHKYHGICLQGISKRRKLFPETYFCGKNCEKVY 813
Query: 919 ERLEKLLGVKHDLEGGYTWSLVHRFDVSTDL-SLSDVCQKVECNARLAVALSVMDECFLP 977
L +GV + G +WS++ F + S + K ECN++LAVALS+M+E FL
Sbjct: 814 TGLSSRVGVINPNADGLSWSILKCFQEDGKVHSARRLALKAECNSKLAVALSIMEESFLS 873
Query: 978 LPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFI 1037
+ D R+GI++I ++LYN+GSNF RL++ GF+T +LE+DD +IS ASIR+HG +AEMP +
Sbjct: 874 MVDPRTGIDMIPHVLYNWGSNFARLDFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLV 933
Query: 1038 GTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM 1097
T YRRQGMCR L+ IE L SL VEKL++ A+ L ETWT FGF+P++ + +
Sbjct: 934 ATCSKYRRQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDAL 993
Query: 1098 RNMSLLVFPGVDMLQKPMMKNQFP 1121
+ ++L+VFPG +L K + ++ P
Sbjct: 994 KRINLMVFPGTILLMKTLYESTKP 1017
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 334 bits (856), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 176/430 (40%), Positives = 263/430 (61%), Gaps = 18/430 (4%)
Query: 700 YDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDT 759
Y+ K T+L+W+++T + +KV Y +++ +V+L+G + +GI C CC +++++S F+
Sbjct: 11 YEAKRTILSWLLETKMLHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEA 70
Query: 760 HSKSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGD 819
HS + +++ G SLL + +W D SK V D NDD CG+CGD
Sbjct: 71 HSGTSQRRACASIFNNKGESLLDLQVQAWELLD-SKVNPKENVKAAPSDENDDACGVCGD 129
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALS-- 877
GG LICCD CPST+H +CL +K+ P G+W C C C CG + D S+ +
Sbjct: 130 GGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCRCAICGGSE------YNADGSSFNEM 183
Query: 878 TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGY 935
T+ +C CE +YH SC + G + P FCG C +IF+ L KL+G+ +++ G
Sbjct: 184 TVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFQGLRKLVGISNNIGEGL 243
Query: 936 TWSLVHRF--DVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILY 993
+W+L+ D+ + S++ Q E ++LAVAL VM ECFLP+ D R+ I+L+ +ILY
Sbjct: 244 SWTLLRSGEDDLPSANSMNRE-QMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILY 302
Query: 994 NFG-SNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRL 1052
N G + RLN++GF+T +LE+DDE+IS ASIRIHG LAEMP IGTR +RRQGMCRRL
Sbjct: 303 NRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGTRFHHRRQGMCRRL 362
Query: 1053 LTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQ 1112
+ IE L L + ++PA+ EL TW + FGFQ + + + ++ +S++ FPGV +LQ
Sbjct: 363 VRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLELVKLSVVSFPGVTLLQ 422
Query: 1113 KPMMKNQFPR 1122
KP+ Q PR
Sbjct: 423 KPL---QTPR 429
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 327 bits (839), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 167/440 (37%), Positives = 250/440 (56%), Gaps = 19/440 (4%)
Query: 702 GKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS 761
G T+L +++ G V +QY V+ G I + GIRC CC +FT+S F H+
Sbjct: 935 GARTVLGKLLEMGIVCKVNILQYRRPGSKNVLKDGNITKKGIRCRCCDMVFTMSMFKYHA 994
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ P NL+ SG S C L +W+ + +++++ D NDDTCG+CGDGG
Sbjct: 995 GLRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAKCTMPLQADENDDTCGLCGDGG 1054
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQD-DSALSTLQ 880
+LICCD CP+++HQ+CL + P G W+C C C CG + +N ++ S+L L+
Sbjct: 1055 ELICCDNCPASYHQDCLPCQDIPDGSWYCYRCLCDICGEV------INLKELRSSLPALE 1108
Query: 881 ICSLCEEKYHQSCSQTDGAVQYEPSSLS-FCGKKCQEIFERLEKLLGVKHDLEGGYTWSL 939
C+ CE +YH C E + FCG++CQ+I+ L +G+ G++ ++
Sbjct: 1109 -CAQCERQYHAKCIYGKLLCNEEGGPCAWFCGRRCQQIYMNLRSRVGIPIHTIDGFSCTV 1167
Query: 940 VHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNF 999
+ + +D+ ECN +L +ALS+M+ECFLP+ D R+GI++I ILYN+ S+F
Sbjct: 1168 LRNNGDQRVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILYNWRSDF 1227
Query: 1000 KRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESA 1059
L+YKGF+T +LE DD IIS ASIR+HG +AEMP I T R+QGMCRRL+ IE
Sbjct: 1228 VHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQM 1287
Query: 1060 LCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
L SL VE L++ AI L +TWT FGF P++ ++ + + L+ PG +L++
Sbjct: 1288 LKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLRLVSVPGTVLLKR------ 1341
Query: 1120 FPRENMISAKGLRSSELENP 1139
N+ G + EL NP
Sbjct: 1342 ----NLYECPGTNAGELRNP 1357
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 323 bits (828), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 161/414 (38%), Positives = 242/414 (58%), Gaps = 9/414 (2%)
Query: 702 GKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS 761
G T+L +++ G V +QY V+ G I + GIRC CC +FT+S F H+
Sbjct: 884 GARTVLGKLLEMGIVCKVNILQYRRPGSKNVLKDGNITKKGIRCRCCDMVFTMSMFKYHA 943
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ P NL+ SG S C L +W+ + +++++ D NDDTCG+CGDGG
Sbjct: 944 GLRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAKCTMPLQADENDDTCGLCGDGG 1003
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQD-DSALSTLQ 880
+LICCD CP+++HQ+CL + P G W+C C C CG + +N ++ S+L L+
Sbjct: 1004 ELICCDNCPASYHQDCLPCQDIPDGSWYCYRCLCDICGEV------INLKELRSSLPALE 1057
Query: 881 ICSLCEEKYHQSCSQTDGAVQYEPSSLS-FCGKKCQEIFERLEKLLGVKHDLEGGYTWSL 939
C+ CE +YH C E + FCG++CQ+I+ L +G+ G++ ++
Sbjct: 1058 -CAQCERQYHAKCIYGKLLCNEEGGPCAWFCGRRCQQIYMNLRSRVGIPIHTIDGFSCTV 1116
Query: 940 VHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNF 999
+ + +D+ ECN +L +ALS+M+ECFLP+ D R+GI++I ILYN+ S+F
Sbjct: 1117 LRNNGDQRVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILYNWRSDF 1176
Query: 1000 KRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESA 1059
L+YKGF+T +LE DD IIS ASIR+HG +AEMP I T R+QGMCRRL+ IE
Sbjct: 1177 VHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQM 1236
Query: 1060 LCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
L SL VE L++ AI L +TWT FGF P++ ++ + + L+ PG +L++
Sbjct: 1237 LKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLRLVSVPGTVLLKR 1290
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 322 bits (826), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 170/435 (39%), Positives = 253/435 (58%), Gaps = 37/435 (8%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
++ +W+ID + V Y N+ +V G I+RDGI C CC+E+F+++ F H+ +
Sbjct: 2 SIFSWLIDGEILSEGAAVSYVNKDSNQVA-SGVISRDGILCKCCNEVFSMTSFQVHAGDE 60
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESK--------------------RKGFHFVNF 804
+ L E G S+L+C + K +++K K V
Sbjct: 61 VHRTAALLTLEDGRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVD 120
Query: 805 DGE-DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
D E D NDDTC +CGDGG L+CCD CPSTFH CL ++ P G W C C C CGR
Sbjct: 121 DVEMDENDDTCAVCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRCCCASCGR--- 177
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEK 923
+ D + + + C CE +YH +C A++YE S FC +KC +IF L K
Sbjct: 178 -----SLYDPTIQTEILYCDQCEREYHSNCV-PGSAMKYESSDNQFCSRKCLKIFRGLRK 231
Query: 924 LLGVKHDLEGGYTWSLVH--RFDVSTDLS-LSDVCQKVECNARLAVALSVMDECFLPLPD 980
L+G + ++ Y+W+L+ +D S + S L V + N RLA+AL+V+ ECF P+ D
Sbjct: 232 LVGRVNKVDDMYSWTLLRSEHYDQSEENSKLESV---ADLNTRLALALTVIQECFRPMID 288
Query: 981 HRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTR 1040
RS I+++ +ILYN KR++++GF+T +LE++ E+IS AS+R+HG AE+PFIGTR
Sbjct: 289 PRSNIDMVSHILYNRRGEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTR 348
Query: 1041 HMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNM 1100
YR+QGMCRRL+ I+ L +L V+ L++PAI+E ETWTS FGFQ L + ++ +
Sbjct: 349 SQYRKQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLTAAQGIQLMEL 408
Query: 1101 SLLVFPGVDMLQKPM 1115
+++ FPG +LQKP+
Sbjct: 409 NIVTFPGSSVLQKPL 423
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 322 bits (826), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 168/403 (41%), Positives = 244/403 (60%), Gaps = 15/403 (3%)
Query: 720 EKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFESGSS 779
+KV Y +++ +V+L+G + +GI C CC +++++S F+ HS + +++ G S
Sbjct: 6 QKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRACASIFNNKGES 65
Query: 780 LLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD 839
LL + +W D SK V D NDD CG+CGDGG LICCD CPST+H +CL
Sbjct: 66 LLDLQVQAWELLD-SKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLL 124
Query: 840 IKKFPSGKWHCVYCSCQFCG--RINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTD 897
+K+ P G+W C C C CG N N+ T+ +C CE +YH SC +
Sbjct: 125 LKELPEGEWFCPSCRCAICGGSEYNADGSSFNEM------TVLLCDQCEREYHVSCLYSR 178
Query: 898 GAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF--DVSTDLSLSD 953
G + P FCG C +IFE L KL+G+ + + G +W+L+ D+ + S++
Sbjct: 179 GMAKMTSCPDDSWFCGDHCDKIFEGLRKLVGISNTIGEGLSWTLLRSGEDDLPSASSMNR 238
Query: 954 VCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFG-SNFKRLNYKGFFTAIL 1012
Q E ++LAVAL VM ECFLP+ D R+ I+L+ +ILYN G + RLN++GF+T +L
Sbjct: 239 E-QMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVL 297
Query: 1013 ERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPA 1072
E+DDE+IS ASIRIHG LAEMP IGTR +RRQGMCRRL+ IE L L + ++PA
Sbjct: 298 EKDDEVISVASIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPA 357
Query: 1073 ISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+ EL TW + FGFQ + + + ++ +S++ FPGV +LQKP+
Sbjct: 358 VPELLHTWKNAFGFQEMAPTQRLELVKLSVVSFPGVTLLQKPL 400
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 317 bits (811), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 158/412 (38%), Positives = 236/412 (57%), Gaps = 6/412 (1%)
Query: 702 GKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS 761
G T+L +++ G V +Q V+ G I GIRC CC FT+SKF H+
Sbjct: 863 GARTVLNKLLEMGIVARVNILQCRGPGGKIVLKDGNITTSGIRCRCCGTTFTMSKFKCHA 922
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ P NL+ +G S C+L +W+ + + +++ D NDDTCG CGDGG
Sbjct: 923 GLRQEVPSLNLFLGTGKSYSLCLLQAWSIEQKVRKERVKDTMSLQADQNDDTCGSCGDGG 982
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
+LICCD CP+++HQ CL + P G W+C C C CG + S + ++L L+
Sbjct: 983 ELICCDNCPASYHQACLPCQDIPDGNWYCSSCLCNICGEVITS-----KELRTSLPALE- 1036
Query: 882 CSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH 941
CS CE +YH C + + FCG+KCQ+I+ +GV ++ + +++
Sbjct: 1037 CSQCERQYHVKCVSAKVSCNEDGPGTWFCGRKCQQIYMIFRSRVGVPDHVDNDLSCTILR 1096
Query: 942 RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKR 1001
+ ++ ECN +L +ALS+M+ECFLP+ D R+GI++I +ILYN+ S+F
Sbjct: 1097 NNGDKKVRTAGEIALMAECNMKLMIALSIMEECFLPILDPRTGIDIIPSILYNWRSDFIH 1156
Query: 1002 LNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALC 1061
N+KGF+T +LE DD ++S ASIR+HG +AEMP + T R+QGMCRRL+ IE L
Sbjct: 1157 FNHKGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVATSTENRQQGMCRRLMDYIEEMLK 1216
Query: 1062 SLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
SL VE L++ AI L ETWTS FGF+ ++ S K+++ + L PG +L+K
Sbjct: 1217 SLKVEMLLLSAIPHLVETWTSTFGFREIDDSDKKRLSMVRLAAVPGTVLLKK 1268
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 313 bits (803), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 158/412 (38%), Positives = 235/412 (57%), Gaps = 7/412 (1%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L +++ G V + +Q V+ G I GI+C CC FT+SKF H+ +
Sbjct: 877 TVLNKLLEMGIVARVDILQCHGPGGKNVLKDGNITASGIQCQCCGTTFTMSKFKCHAGLR 936
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
P NL+ ++G S C L +W+ + + +++ D NDDTCG CGDGG+LI
Sbjct: 937 QEVPSLNLFLDTGKSYSLCQLQAWSIEQKVRKEHAKDTMSLQADQNDDTCGSCGDGGELI 996
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCD CP+++HQ CL + P G W+C C C CG + +S V ++L L CS
Sbjct: 997 CCDNCPASYHQACLSCQDIPDGNWYCSSCLCDICGEVIDSKELV-----TSLPALD-CSQ 1050
Query: 885 CEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFD 944
CE +YH C + S FCG+KC EI+ +GV ++ ++++
Sbjct: 1051 CERQYHVKCVSAKVPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNG 1110
Query: 945 VSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNY 1004
+ ++ ECN +L +A S+M+ECFLP+ D R+GI++I +ILYN+ S+ NY
Sbjct: 1111 DKKVRTAEEIALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDL-HFNY 1169
Query: 1005 KGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLN 1064
KGF+T +LE DD ++S ASIR+HG LAEMP + T R+QGMCRRL+ IE L SL
Sbjct: 1170 KGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLKSLK 1229
Query: 1065 VEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
VE L++ AI L ETWTS FGF+ ++ S K+++ + L PG +L+K ++
Sbjct: 1230 VEMLLLSAIPHLAETWTSTFGFREIDESDKKRLSKVRLAAVPGTVLLKKDLL 1281
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 309 bits (791), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 168/435 (38%), Positives = 247/435 (56%), Gaps = 44/435 (10%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
++ +W+ID + V Y N+ +V G I+RDGI C CC+E+F+++ F H+ +
Sbjct: 2 SIFSWLIDGEILSEGAAVSYVNKDSNQVA-SGVISRDGILCKCCNEVFSMTSFQVHAGDE 60
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESK--------------------RKGFHFVNF 804
+ L E G S+L+C + K +++K K V
Sbjct: 61 VHRTAALLTLEDGRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVD 120
Query: 805 DGE-DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
D E D NDDTC +CGDGG L+CCD CPSTFH CL ++ P G W C C C CGR
Sbjct: 121 DVEMDENDDTCAVCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRCCCASCGR--- 177
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEK 923
D + T E Y+ S A++YE S FC +KC +IF L K
Sbjct: 178 ------SLYDPTIQT-------EILYYHSNCVPGCAMKYESSDNQFCSRKCFKIFRGLRK 224
Query: 924 LLGVKHDLEGGYTWSLVH--RFDVSTDLS-LSDVCQKVECNARLAVALSVMDECFLPLPD 980
L+G + ++ Y+W+L+ +D S + S L V + N RLA+AL+V+ ECF P+ D
Sbjct: 225 LVGRVNKVDDMYSWTLLRSEHYDQSAENSKLESV---ADLNTRLALALTVIQECFRPMID 281
Query: 981 HRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTR 1040
RS I+++ +ILYN KR++++GF+T +LE++ E+IS AS+R+HG AE+PFIGTR
Sbjct: 282 PRSNIDMVSHILYNRRGEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTR 341
Query: 1041 HMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNM 1100
YR+QGMCRRL+ I+ L +L V+ L++PAI+E ETWTS FGFQ L + ++ +
Sbjct: 342 SQYRKQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLTAAQGIQLMEL 401
Query: 1101 SLLVFPGVDMLQKPM 1115
+++ FPG +LQKP+
Sbjct: 402 NIVTFPGSSVLQKPL 416
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 308 bits (789), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 165/473 (34%), Positives = 250/473 (52%), Gaps = 68/473 (14%)
Query: 692 SNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYW----NQRKTRVMLQGRIARDGIRCDC 747
S D K T+L W+ID G V E V Y ++ G + R G+ C C
Sbjct: 357 SGTDAVAPATKKHTILTWLIDGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVHCSC 416
Query: 748 CSEIFTISKFDTHSKSKLC-------HPFQNLYFESGSSLLQCILDSW------------ 788
C + + F+ H+ ++ P++ L SG+SLL+C+ ++W
Sbjct: 417 CDGVVPLPVFEAHAGARRRDPGPGQRQPWEKLLLVSGNSLLRCMQEAWEMEKVRTFHAQA 476
Query: 789 ----------NKQDESKRKGF--HF----------------VNFDGEDPNDDTCGICGDG 820
+K ++KR+ H + +D +DD CG+C DG
Sbjct: 477 KVRAALEQEEDKCSQAKRRLLAKHLKKGVVVERIMSPRMEKIKAGEKDSSDDACGVCADG 536
Query: 821 GDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQ 880
G+L+CCD C STFH CL IK P G W C YC C C +++ D LST
Sbjct: 537 GELLCCDSCTSTFHPECLAIK-VPEGSWSCHYCRCVLC---------MSNDDLQGLST-- 584
Query: 881 ICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLV 940
C C KYH+SC G ++CG+ C+++F +L ++ GV + G+ W+L+
Sbjct: 585 -CQQCARKYHESCRPLPG---NGCDIGTYCGETCKKLFSQLAQVTGVTNPTGDGFWWALL 640
Query: 941 HRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK 1000
R S ++ +E N +LAVAL V +ECF P+ D R+ I+++H +Y+ GS FK
Sbjct: 641 -RIQKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQAVYSLGSQFK 699
Query: 1001 RLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
RL+Y+GF+T +LE+D EI+SAA +RIHG ++AEMPF GT YR+QGM RRL++ +E L
Sbjct: 700 RLSYEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRRLVSAVEQVL 759
Query: 1061 CSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
S+ VEKL+IPAI L +TW F F+P++ +++++ +SL+V G +L K
Sbjct: 760 ASVQVEKLVIPAIDSLVDTWKRSFFFRPVDPQLREELKRLSLVVITGTTLLHK 812
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 303 bits (777), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 163/462 (35%), Positives = 256/462 (55%), Gaps = 71/462 (15%)
Query: 705 TLLAWMIDTGTVPLNEKVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS- 761
T+L W+ID G + EKV Y + + +V + G + R G+ C CC+ + ++ F+ H+
Sbjct: 359 TILTWLIDAGFLSDKEKVFYVPVDGGEGKV-VSGAVTRTGVHCGCCNVVVSLPAFEAHAG 417
Query: 762 -----KSKLCHPFQNLYFESGSSLLQCILDSWNK-------------------QDESKRK 797
+ ++ L SG+SLL + ++W K Q++S +
Sbjct: 418 RGDPGHGQQQRSWEKLLLVSGNSLLNRMQEAWEKERVKIFLVQEKARAALEQEQEKSAQA 477
Query: 798 GFHFVN------FDG---------------EDPNDDTCGICGDGGDLICCDGCPSTFHQN 836
+ +G +D +DD CG+C DGG+L+CCD CPSTFH
Sbjct: 478 KRRLLAKQKKGAVEGVITSPRIRTKLRSGEKDSSDDACGVCADGGELLCCDSCPSTFHPA 537
Query: 837 CLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC--S 894
CL +K P G W C YC C C DD LS Q C+L KYH+ C S
Sbjct: 538 CLAMK-VPEGLWACHYCRCVLC----------MANDDQGLSRCQHCTL---KYHEICRPS 583
Query: 895 QTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDV 954
++G + ++C + C+++ +L ++G+ + E G++W+L+ + +S +
Sbjct: 584 LSNGR-----GNGAYCSETCKKVSAQLSDMIGITNHTEDGFSWALL-KIQKDEPVSSQNS 637
Query: 955 CQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILER 1014
+ECN +LAVAL V++ECF P+ D R+ I+++H +Y+ GS FKR++Y+GF+T +LE+
Sbjct: 638 PDVLECNVKLAVALGVLNECFNPVKDRRTKIDMLHQAVYSLGSEFKRVSYEGFYTMVLEK 697
Query: 1015 DDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAIS 1074
+ EIISAA +RIHG ++AEMPF GT YR+QGM RRL+ +E L S+ VEKL+IPAI+
Sbjct: 698 NGEIISAALLRIHGTKVAEMPFAGTLPAYRKQGMMRRLVNAVEQVLASVQVEKLVIPAIA 757
Query: 1075 ELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
L +TW F F+ L+ K+++R SL+V G +LQKP++
Sbjct: 758 ALVDTWKKSFSFKALDPELKEEIRRRSLVVITGTTLLQKPVV 799
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 298 bits (763), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 160/466 (34%), Positives = 251/466 (53%), Gaps = 73/466 (15%)
Query: 703 KWTLLAWMIDTGTVPLNEKVQYW----NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFD 758
K T+L W+ID G + E V Y ++ G + R G+ C+CC + + F+
Sbjct: 278 KHTILTWLIDGGFLSDGETVYYVPGDSGGAGKEKIVSGAVTRAGVHCNCCDAVVPLPVFE 337
Query: 759 THS-------KSKLCHPFQNLYFESGSSLLQCILDSWN-----------------KQDES 794
H+ + + ++ L SG SLLQ + ++W +Q+E
Sbjct: 338 VHAGRVPGTGQQQQQVAWEKLLLVSGDSLLQSMQEAWQNEKVRTFHAQAKVRAALEQEEE 397
Query: 795 K-------------RKGFHF----------VNFDGEDPNDDTCGICGDGGDLICCDGCPS 831
K +KG + +D +DD CG+C DGG+L+CCD C S
Sbjct: 398 KNSQAKRRLLAKHQKKGVVVERIMSPRMEKIKAGEKDSSDDACGVCADGGELLCCDFCTS 457
Query: 832 TFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQ 891
TFH CL I+ P G W C YC C C DD LST C C KYH+
Sbjct: 458 TFHPECLAIE-VPDGSWSCHYCRCTLC----------MSNDDQDLST---CQECACKYHE 503
Query: 892 SCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSL--VHRFDVSTDL 949
SC G + ++CG+ C+++ +L +++GV + E G++WSL +H + ++
Sbjct: 504 SCRPLLGNGR---DIGAYCGEICKKLSAKLSEVIGVMNSTEDGFSWSLLRIHEDEPASSQ 560
Query: 950 SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFT 1009
+ V ++ N +LAVAL V+++CF P+ D R+ I+++H +Y+ GS FKRL+Y+GF+T
Sbjct: 561 GMPAVLER---NVKLAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLGSQFKRLSYEGFYT 617
Query: 1010 AILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLI 1069
ILE+D EI+S A +RIHG+++AEMPF GT YR+QGM R+++ +E L S+ VE LI
Sbjct: 618 MILEKDGEIVSTALLRIHGRKVAEMPFAGTLPAYRKQGMMHRVVSAVEQVLASVQVETLI 677
Query: 1070 IPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
IPAI+ + +TW F F+P++ +++++ +SL+V G ML KP+
Sbjct: 678 IPAIASMVDTWKRSFSFRPVDPQLREELKRLSLVVITGTTMLHKPV 723
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 297 bits (760), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 155/423 (36%), Positives = 245/423 (57%), Gaps = 18/423 (4%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
++L+W+ID + E ++Y NQ+ V+ +G++ R+GI+C CC IFTIS F+ H+
Sbjct: 598 SILSWLIDNNVILPRENIRYRNQKDDTVIKEGKLTREGIKCSCCRRIFTISGFEVHANGG 657
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNF-DGEDPNDDTCGICGDGGDL 823
C N++ + G SLL+C ++++ + +++ + GE ND C +C GG L
Sbjct: 658 SCRAAANIFLDDGRSLLECQVEAYETRKKAQPPDILKMKLRQGE--NDVICSVCHYGGKL 715
Query: 824 ICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICS 883
I CDGCPS FH NCL +++ P G W C C C CG+ +++ +S C
Sbjct: 716 ILCDGCPSAFHANCLGLEEVPDGDWFCESCCCGACGQFFLKATSKYAKEEKFIS----CK 771
Query: 884 LCEEKYHQSCSQTDGAVQYEPSSLS---FCGKKCQEIFERLEKLLGVKHDL-EGGYTWSL 939
CE KYH SC + DGA + L FC K C+EIF L +L+G ++ TW L
Sbjct: 772 QCELKYHPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELIGKPREVGVEKLTWRL 831
Query: 940 VHRFDVSTDLSLSDVCQKVECNA----RLAVALSVMDECFLPLPDHRSGINLIHNILYNF 995
V F+ + D K+E A +L+VAL VM E F P+ G +L +++++
Sbjct: 832 VQSFEPNM---YGDDAYKIEAVAENHCKLSVALDVMHELFEPVKRPHGGRDLAEDVIFSR 888
Query: 996 GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTG 1055
S FKRLN+ GF+T +LER++E+++ A++RI GK++AEMPFIGTR +R++GMCR L+
Sbjct: 889 WSKFKRLNFSGFYTVLLERNEELVTVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINE 948
Query: 1056 IESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+E L L VE+L++PA+ + TW + FGF + +S +++ +LL F + QK +
Sbjct: 949 LEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKEFLKFTLLEFGRTILCQKIL 1008
Query: 1116 MKN 1118
+K+
Sbjct: 1009 IKS 1011
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 296 bits (758), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 155/422 (36%), Positives = 240/422 (56%), Gaps = 18/422 (4%)
Query: 706 LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKL 765
+L+W++D V E ++ NQ+ T V +G++ R+GI+C CC +FTI+ F+ H+
Sbjct: 599 ILSWLMDNNVVLPRENIRCCNQKDTTVRKEGKLTREGIKCSCCRRVFTINGFEVHANGAS 658
Query: 766 CHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNF-DGEDPNDDTCGICGDGGDLI 824
C N++ + G SLL+C ++++ K+ +++ + GE ND C +C GG LI
Sbjct: 659 CSGAANIFLDDGRSLLECQVEAYKKRKKAQPPDMLKMKLRQGE--NDVFCSVCHYGGKLI 716
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CDGCPS FH NCL ++ P G W C C C CG+ T N +++ +S C
Sbjct: 717 LCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTTSTNAKEEKFIS----CKQ 772
Query: 885 CEEKYHQSCSQTDGAVQYEPSSLS---FCGKKCQEIFERLEKLLGVKHDLE-GGYTWSLV 940
CE KYH SC + DGA L FC K C+EIF L L+G ++ TW LV
Sbjct: 773 CELKYHPSCLRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPREVSVEKLTWRLV 832
Query: 941 HRFDVSTDLSLSDVCQKVECNAR----LAVALSVMDECFLPLPDHRSGINLIHNILYNFG 996
+ + D K+E A L+VAL VM E F P+ G +L +++++
Sbjct: 833 QSLEPNM---YGDDASKIEAAAENHCILSVALDVMHELFEPVKRPHGGRDLAEDVIFSRW 889
Query: 997 SNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGI 1056
S FKRLN+ GF+T +LER++E++S A++RI GK++AEMPFIGTR +R++GMCR L+ +
Sbjct: 890 SKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINEL 949
Query: 1057 ESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
E L L VE+L++PA+ + TW + FGF + +S ++ +LL F + +K ++
Sbjct: 950 EKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLKFTLLEFGRTILCEKILI 1009
Query: 1117 KN 1118
K+
Sbjct: 1010 KS 1011
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 295 bits (755), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 156/432 (36%), Positives = 242/432 (56%), Gaps = 26/432 (6%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+W+ID V KV Y ++R M GRI RDGI+C CC E+F++S+F+ H+ S
Sbjct: 812 TILSWLIDNNVVLPRAKVHYSSRRDHHPMADGRITRDGIKCSCCQEVFSLSRFEAHAGSS 871
Query: 765 LCHPFQNLYFESGSSLLQCILD--------SWNKQDESKRKGFHFVNFDGEDPNDDTCGI 816
N++ E G SLL+C + + K+ S++K + + ND C +
Sbjct: 872 YHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKESFSRKK-----SNERHHENDHICSV 926
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSAL 876
C GGDL+ CD CPS+FH++CL +K P G W C C C CG ++QD+
Sbjct: 927 CHYGGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSCCCGICGENKFDGG--SEQDNVVF 984
Query: 877 STLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDL-EG 933
S C CE Q C + G V+ P+ FC K+C++IF L+KLLG +
Sbjct: 985 S----CYQCER---QCCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVD 1037
Query: 934 GYTWSLVHRF-DVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNIL 992
TW+L+ ++ L D+ E ++L +AL VM ECF P+ + + +++ +++
Sbjct: 1038 NLTWTLLKPIRSKGLEIDLPDIEALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVI 1097
Query: 993 YNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRL 1052
+ GS+ RLN++GF+T +LER+DE+IS A++R++G+++AE+P IGTR YRR GMC L
Sbjct: 1098 FCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHIL 1157
Query: 1053 LTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQ 1112
+ +E L L VE+L++PA+ + TWT+ FGF + S + + + S L F M Q
Sbjct: 1158 MNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFLDYSFLDFQDTVMCQ 1217
Query: 1113 KPMMKNQFPREN 1124
K +MK + N
Sbjct: 1218 KLLMKIPLAKSN 1229
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 293 bits (750), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 156/432 (36%), Positives = 243/432 (56%), Gaps = 23/432 (5%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+W+ID V KV Y ++R M GRI RDGI+C CC E+F++S+F+ H+ S
Sbjct: 557 TILSWLIDNNVVLPRAKVHYSSRRDHHPMADGRITRDGIKCSCCQEVFSLSRFEAHAGSS 616
Query: 765 LCHPFQNLYFESGSSLLQCILD--------SWNKQDESKRKGFHFVNFDGEDPNDDTCGI 816
N++ E G SLL+C + + K+ S++K + + ND C +
Sbjct: 617 YHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKESFSRKK-----SNERHHENDHICSV 671
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSAL 876
C GGDL+ CD CPS+FH++CL +K G W C C C CG ++QD+
Sbjct: 672 CHYGGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSCCCGICGENKFDGG--SEQDNVVF 729
Query: 877 STLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDL-EG 933
S C CE +YH C + G V+ P+ FC K+C++IF L+KLLG +
Sbjct: 730 S----CYQCERQYHVGCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVD 785
Query: 934 GYTWSLVHRF-DVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNIL 992
TW+L+ ++ L D+ E ++L +AL VM ECF P+ + + +++ +++
Sbjct: 786 NLTWTLLKPIRSKGLEIDLPDIEALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVI 845
Query: 993 YNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRL 1052
+ GS+ RLN++GF+T +LER+DE+IS A++R++G+++AE+P IGTR YRR GMC L
Sbjct: 846 FCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHIL 905
Query: 1053 LTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQ 1112
+ +E L L VE+L++PA+ + TWT+ FGF + S + + + S L F M Q
Sbjct: 906 MNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFLDYSFLDFQDTVMCQ 965
Query: 1113 KPMMKNQFPREN 1124
K +MK + N
Sbjct: 966 KLLMKIPLAKSN 977
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 293 bits (749), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 147/366 (40%), Positives = 213/366 (58%), Gaps = 18/366 (4%)
Query: 760 HSKSKLCH-------PFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDD 812
H + CH P NL+ SG S C L +W + + + + D NDD
Sbjct: 943 HRLTARCHAGLKQDIPSLNLFLGSGKSYSLCQLQAWYIEHKVREERAKVTLLLQADQNDD 1002
Query: 813 TCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQD 872
TCG+CGDGG+LICCD CP+++H CL ++ P G W+C C C CG + S +
Sbjct: 1003 TCGLCGDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCRCDVCGEVVSSK-----EP 1057
Query: 873 DSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSS---LSFCGKKCQEIFERLEKLLGVKH 929
+ L + CS CE +YH C G V S FCG++CQ+I+ L +G+
Sbjct: 1058 RTPLHAFE-CSQCERQYHIKC--ISGKVLCNEESGPGTWFCGRRCQQIYTSLRSRVGIPD 1114
Query: 930 DLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIH 989
L+ G++ +++H +D+ ECN +L +ALS+++ECFLP+ D R+G++++
Sbjct: 1115 HLDDGFSCTILHNNGDQKVRMAADIALLAECNMKLIIALSILEECFLPIFDPRTGMDIMP 1174
Query: 990 NILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMC 1049
ILYN+ SNF L+YKGF+T +LE+DD IIS ASIR+HG +AEMP I T R+QGMC
Sbjct: 1175 LILYNWRSNFVHLDYKGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGMC 1234
Query: 1050 RRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVD 1109
RR++ IE L SL VE L++ AI L +TWTS FGF+P+E K+K+ + L PG
Sbjct: 1235 RRIVDYIEQMLKSLKVEMLLLSAIPSLVDTWTSAFGFRPIEDCDKKKLSKIRLASVPGTV 1294
Query: 1110 MLQKPM 1115
+L+K +
Sbjct: 1295 LLKKDL 1300
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 291 bits (746), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 206/310 (66%), Gaps = 8/310 (2%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
D NDDTCG CGDGG+L+CCD CPST+HQ CL ++ P G W+C C+C+ CG
Sbjct: 2 DENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCTCRSCGN------P 55
Query: 868 VNDQDDSALSTLQICSLCEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG 926
+++++ S S + C C + YH +C Q + S++ FCG+ C+EIF L +G
Sbjct: 56 LSEKEVSTFSAILKCLQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVG 115
Query: 927 VKHDLEGGYTWSLVH-RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGI 985
+++ L+ +WS++ D S + ECN +LAVAL++++ECF+ + D R+G+
Sbjct: 116 IENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGV 175
Query: 986 NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRR 1045
++I ++LYN GSNF RL+Y+GF+T ILE+ DEI+ ASIR+HG + AE+PFI T YRR
Sbjct: 176 DMIPHVLYNKGSNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRR 235
Query: 1046 QGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVF 1105
QGMCRRL+ IE L S +VE L++ AI EL TW S FGF+P+E + K+++RN++L++F
Sbjct: 236 QGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRNVNLMLF 295
Query: 1106 PGVDMLQKPM 1115
PG +L K +
Sbjct: 296 PGTSLLTKRL 305
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 286 bits (732), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 206/345 (59%), Gaps = 39/345 (11%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINEST 865
+D +DD CG+CGDGG+L+CCD CPSTFH CL +K P G W C YC C C
Sbjct: 448 AKDSSDDACGVCGDGGELLCCDSCPSTFHPACLAMK-VPQGWWACHYCRCVLC------- 499
Query: 866 CHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLS-------FCGKKCQEIF 918
DD LST Q CSL KYH+ C + SLS +C + C+++
Sbjct: 500 ---MANDDQGLSTCQHCSL---KYHEVCRR---------PSLSNGRGIGAYCSETCKKVS 544
Query: 919 ERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPL 978
RL ++GV + E G++W+L+ + +S D +ECN +LAVAL V++ECF P
Sbjct: 545 ARLSDMVGVTNHTEDGFSWALL-KIQKDEAVSSQDTAAVLECNVKLAVALGVLNECFNPA 603
Query: 979 PDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIG 1038
D R+ I+++H +Y+ GS FKR++Y+GF+T +L++D E I+AA +RIHG ++AEMPF
Sbjct: 604 KDRRTKIDMLHQAVYSLGSEFKRVSYEGFYTMVLDKDGETIAAALLRIHGTKVAEMPFAA 663
Query: 1039 TRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQP-LEVSSKQKM 1097
T YR+QGM RRL+ +E L S+ V+KL+IPAI+ L +TWT F F+P L+ S++++
Sbjct: 664 TLPAYRKQGMMRRLVNAVEQVLASVQVDKLVIPAIAALVDTWTRSFSFRPLLDPESREEI 723
Query: 1098 RNMSLLVFPGVDMLQKPMMKNQFPRENMISAKGLRSSELENPRTA 1142
R SL+V G +L KP+ + P + R +E P+ A
Sbjct: 724 RRRSLVVIAGTTLLHKPVAAARPPSPH-------RHAEAAAPKNA 761
Score = 44.7 bits (104), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS 761
T+LAW+ID G + EKV Y ++ G + R G+ C CC + + F+ H+
Sbjct: 306 TILAWLIDAGFLSDGEKVFYVPVDGK--VVSGAVTRTGVHCGCCDAVVPLPSFEAHA 360
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 286 bits (731), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 249/466 (53%), Gaps = 68/466 (14%)
Query: 703 KWTLLAWMIDTGTVPLNEKVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH 760
K T+L W+IDTG + KV Y + ++ G + + GIRC CC+ + ++ F+TH
Sbjct: 371 KHTVLTWLIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFETH 430
Query: 761 SK-SKLCHPFQNLYFESGSSLLQCILDSW----------------------NKQDESKRK 797
++ + P++ L SG L +C+ ++W K ++KRK
Sbjct: 431 ARCERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAREKAMASLEQEKEKSSQAKRK 490
Query: 798 -----------GFHFV-----------NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQ 835
G V N G+D +DD CG+C DGG L+CCD CPSTFH
Sbjct: 491 LAKTKKMQLLDGVVVVSTSSPRHQVKKNGGGKDCSDDACGVCADGGQLLCCDTCPSTFHP 550
Query: 836 NCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS- 894
+CL I+ F W + Q I + +A+S HQ C
Sbjct: 551 DCLAIQ-FMIKSW--LLFDRQQLTTIYGQQPWLQTAPGAAISA----------DHQYCRP 597
Query: 895 -QTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSD 953
Q+ G +E + +C + C+++ L ++GV + E G++W+L+ + ++ D
Sbjct: 598 LQSPG---FEIGA--YCSETCKKMSSHLSDMIGVMNHTEDGFSWALL-KIQKDELVTSED 651
Query: 954 VCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILE 1013
+ +E N +LAVAL V++ECF P+ D R+ I+++H +Y+ GS FKR+NY+GF+T +LE
Sbjct: 652 MPVILESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLE 711
Query: 1014 RDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAI 1073
+D EIIS A +R HG++LAEMPF GT Y++QGM RRL+ +E L SL VE L+IPA+
Sbjct: 712 KDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAV 771
Query: 1074 SELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
++L ETW F F+P++ + + + +SL+ G +LQKP+ +Q
Sbjct: 772 ADLVETWKRSFSFRPMQAEVRDEAKKLSLVAITGTTLLQKPISAHQ 817
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 285 bits (730), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 249/466 (53%), Gaps = 68/466 (14%)
Query: 703 KWTLLAWMIDTGTVPLNEKVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH 760
K T+L W+IDTG + KV Y + ++ G + + GIRC CC+ + ++ F+TH
Sbjct: 373 KHTVLTWLIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFETH 432
Query: 761 SK-SKLCHPFQNLYFESGSSLLQCILDSW----------------------NKQDESKRK 797
++ + P++ L SG L +C+ ++W K ++KRK
Sbjct: 433 ARCERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAREKAMASLEQEKEKSSQAKRK 492
Query: 798 -----------GFHFV-----------NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQ 835
G V N G+D +DD CG+C DGG L+CCD CPSTFH
Sbjct: 493 LAKTKKMQLLDGVVVVSTSSPRHQVKKNGGGKDCSDDACGVCADGGQLLCCDTCPSTFHP 552
Query: 836 NCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCS- 894
+CL I+ F W + Q I + +A+S HQ C
Sbjct: 553 DCLAIQ-FMIKSW--LLFDRQQLTTIYGQQPWLQTAPGAAISA----------DHQYCRP 599
Query: 895 -QTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSD 953
Q+ G +E + +C + C+++ L ++GV + E G++W+L+ + ++ D
Sbjct: 600 LQSPG---FEIGA--YCSETCKKMSSHLSDMIGVMNHTEDGFSWALL-KIQKDELVTSED 653
Query: 954 VCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILE 1013
+ +E N +LAVAL V++ECF P+ D R+ I+++H +Y+ GS FKR+NY+GF+T +LE
Sbjct: 654 MPVILESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLE 713
Query: 1014 RDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAI 1073
+D EIIS A +R HG++LAEMPF GT Y++QGM RRL+ +E L SL VE L+IPA+
Sbjct: 714 KDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAV 773
Query: 1074 SELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
++L ETW F F+P++ + + + +SL+ G +LQKP+ +Q
Sbjct: 774 ADLVETWKRSFSFRPMQAEVRDEAKKLSLVAITGTTLLQKPISAHQ 819
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 285 bits (729), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 158/450 (35%), Positives = 249/450 (55%), Gaps = 20/450 (4%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+W+ID V EKV Y + + M +GRI+R+GI+C CC++++TI+ F+ H
Sbjct: 792 TVLSWLIDNNMVLPREKVYYCKGKSRQPMAEGRISRNGIKCCCCNKLYTINGFEIHVSGT 851
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKR-------KGFHFVNFDGEDPNDDTCGIC 817
++ E G SLL C + WNK+ S + KG + +D ND C IC
Sbjct: 852 SSRSAAHILLEDGKSLLDCQI-LWNKKTRSFKNQASTCGKGDY-----SKDENDYICSIC 905
Query: 818 GDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALS 877
GG LI CD CPS+FHQ+CL +K P G W C C C CG+ N+ + H N D L+
Sbjct: 906 HFGGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCGICGQ-NKLSEHANIVDGPFLT 964
Query: 878 TLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEG-GYT 936
C CE KYH C + FC K C++I+ L+KLLG + G T
Sbjct: 965 ----CYQCECKYHVQCLRGTKKFGSCSKPHWFCNKHCKQIYWGLQKLLGKSIPVGGDNLT 1020
Query: 937 WSLVHRFDVSTD-LSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNF 995
WSL+ T+ + + E ++L VAL VM ECF P+ + + +++ +++++
Sbjct: 1021 WSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHECFEPVREQHTRRDIVEDVIFSR 1080
Query: 996 GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTG 1055
S KRLN++GF+T +LER++E+I+ A+IR++G+++AE+P +GTR YRR GMC L+
Sbjct: 1081 RSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVGTRFQYRRLGMCHILMNE 1140
Query: 1056 IESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+E L L V++L++PA+ + + WT+ FGF + S + + N + L F M QK +
Sbjct: 1141 LEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSERSEFLNYTFLNFQETVMCQKFL 1200
Query: 1116 MKNQFPRENMISAKGLRSSELENPRTADEV 1145
+KN ++ L + +N ++D +
Sbjct: 1201 LKNTVVPSSLSGKSELHDAVNKNSNSSDNI 1230
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 283 bits (723), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 164/438 (37%), Positives = 251/438 (57%), Gaps = 36/438 (8%)
Query: 699 AYDGKWTLLAWMIDTGTVPLNEKVQYW-NQRKTRVMLQGRIARDGIRCDCCSEIFTISKF 757
+Y +L+W+ID V KV YW + M +GRI R+GIRC CC ++ +S F
Sbjct: 672 SYQKPQNILSWLIDCNIVLPKYKVFYWETEGGNSPMFEGRITREGIRCTCCQNLYGLSGF 731
Query: 758 DTHSK-SKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGF---HFVNFDGEDPNDDT 813
H+ S C P ++ + G SLL C+++ QD R+ H F+GE ND+
Sbjct: 732 ANHAGGSSNCRPSACIFLKDGRSLLDCMMEV--MQDHRTREITEKPHNDLFEGE--NDNI 787
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG--RINESTCHVNDQ 871
C +C GG+LI CD CPS +H+NCL+++ P G W C C C CG +I E+ +
Sbjct: 788 CSVCNYGGELILCDQCPSAYHKNCLNLEGIPDGDWFCPSCRCGICGQNKIEET------E 841
Query: 872 DDSALSTLQICSLCEEKYHQSC---SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVK 928
D L+ +Q CE KYH C + D + + + FCG++C+ ++ L+ LLG K
Sbjct: 842 DGHFLTCIQ----CEHKYHVECLRNGEKDDSRRCMKNW--FCGEECERVYTGLQNLLG-K 894
Query: 929 HDLEGG--YTWSLVHRFDVST---DLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRS 983
L G TW+LV + T + SD+ VE ++L+VALSVM ECF PL + S
Sbjct: 895 PVLVGADNLTWTLVKYVNSETCGVGGAESDLV--VENYSKLSVALSVMHECFEPLHNPFS 952
Query: 984 GINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMY 1043
+++ ++++N S RLN++GF+T +LER++E+IS A++RI G+++AE+P +GTR Y
Sbjct: 953 SRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIAEVPLVGTRFQY 1012
Query: 1044 RRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLL 1103
RR GMCR L+ +E L L VE+L++PA+ + +TWT+ FGF+ + + + + S L
Sbjct: 1013 RRLGMCRVLMDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNFERSQFLDYSFL 1072
Query: 1104 VFPGVDMLQKPMMKNQFP 1121
F G M QK + +FP
Sbjct: 1073 DFQGTVMCQK--LLTRFP 1088
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 281 bits (718), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 204/311 (65%), Gaps = 18/311 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
D NDDTCG CGDGG+L+CCD CPST+HQ CL ++ P G W+C C+C+ CG
Sbjct: 2 DENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCTCRSCGN------P 55
Query: 868 VNDQDDSALSTLQICSLCEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG 926
+++++ S S + C C + YH +C Q + S++ FCG+ C+EIF L +G
Sbjct: 56 LSEKEVSTFSAILKCLQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVG 115
Query: 927 VKHDLEGGYTWSLVHRFDVSTD----LSLSDVCQKVECNARLAVALSVMDECFLPLPDHR 982
+++ L+ +WS++ +TD S + ECN +LAVAL++++ECF+ + D R
Sbjct: 116 IENFLDNELSWSIL---KCNTDGRKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPR 172
Query: 983 SGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHM 1042
+G+++I ++L SNF RL+Y+GF+T ILE+ DEI+ ASIR+HG + AE+PFI T
Sbjct: 173 TGVDMIPHVL----SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVD 228
Query: 1043 YRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSL 1102
YRRQGMCRRL+ IE L S +VE L++ AI EL TW S FGF+P+E + K+++RN++L
Sbjct: 229 YRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRNVNL 288
Query: 1103 LVFPGVDMLQK 1113
++FPG +L K
Sbjct: 289 MLFPGTSLLTK 299
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 281 bits (718), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 202/308 (65%), Gaps = 12/308 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
D NDDTCG CGDGG+L+CCD CPST+HQ CL ++ P G W+C C+C+ CG
Sbjct: 2 DENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCTCRSCGN------P 55
Query: 868 VNDQDDSALSTLQICSLCEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG 926
+++++ S S + C C + YH +C Q + S++ FCG+ C+EIF L +G
Sbjct: 56 LSEKEVSTFSAILKCLQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVG 115
Query: 927 VKHDLEGGYTWSLVH-RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGI 985
+++ L+ +WS++ D S + ECN +LAVAL++++ECF+ + D R+G+
Sbjct: 116 IENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGV 175
Query: 986 NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRR 1045
++I ++L SNF RL+Y+GF+T ILE+ DEI+ ASIR+HG + AE+PFI T YRR
Sbjct: 176 DMIPHVL----SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRR 231
Query: 1046 QGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVF 1105
QGMCRRL+ IE L S +VE L++ AI EL TW S FGF+P+E + K+++RN++L++F
Sbjct: 232 QGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRNVNLMLF 291
Query: 1106 PGVDMLQK 1113
PG +L K
Sbjct: 292 PGTSLLTK 299
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 280 bits (715), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 139/242 (57%), Positives = 182/242 (75%), Gaps = 7/242 (2%)
Query: 890 HQSCS-QTDGAVQYEPSSLS-FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF--DV 945
H++CS +TD + + FC + C+ +FE L+ LL VK DLE Y+ +V R DV
Sbjct: 6 HEACSPETDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDV 65
Query: 946 STDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYK 1005
++ D ++VECN+R+AVALS+MDECFLP+ D R+GINLI N++Y+ GSNF RL+++
Sbjct: 66 PEEVLALD--KRVECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFR 123
Query: 1006 GFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNV 1065
GF+ ILER DEII+AAS+RIHG +LAEMPFIGTR+MYRRQGMCRRL+ GIE L SLNV
Sbjct: 124 GFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNV 183
Query: 1066 EKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENM 1125
EKLIIPAI+EL +TWTS FGF PLE S KQ+++++S+LVFPG +LQKP++K P E
Sbjct: 184 EKLIIPAITELVDTWTSKFGFSPLEDSEKQEVKSISMLVFPGTGLLQKPLLK-ALPNEEQ 242
Query: 1126 IS 1127
S
Sbjct: 243 CS 244
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 279 bits (714), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 171/524 (32%), Positives = 272/524 (51%), Gaps = 55/524 (10%)
Query: 608 KKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARR 667
KK +L + T+K ++K K + H +Q +P S G + G+ I +R
Sbjct: 496 KKHHDLHEINVLTTRKARRKRKDSLHVETHSDAQN---TSRPK-SRSGITSRGL-IGSRN 550
Query: 668 HKLQETQQRK-RCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWN 726
K R R + V++ V + S+ + T+L+ +ID V KV Y +
Sbjct: 551 DKKHTKWVRVLRSSKRVQHVVAPDPSHHNPR-------TVLSLLIDNDIVLPRTKVHYGS 603
Query: 727 QRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS--------KSKLCHPFQNLYFESGS 778
Q+ ++GRIARDGI+C CC +++T+S F+ H+ SK C P +++ + G
Sbjct: 604 QKDRNPTVEGRIARDGIKCSCCGKVYTLSGFELHAGIKSCRSGASKYCKPAASIFLDDGR 663
Query: 779 SLLQCILDSWNKQDESKRKGFHFVNFDG---EDPNDDTCGICGDGGDLICCDGCPSTFHQ 835
SLL+C + ++ S K + G D ND C +C GG+LI CD CPS+FH+
Sbjct: 664 SLLECQIQMMRDKEMSNHKAETPDSLKGSWDRDGNDHVCSVCHYGGELILCDHCPSSFHK 723
Query: 836 NCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQ 895
CL +K P G W C C C+ CG+ N+ D D L+ C+ CE +YH C
Sbjct: 724 RCLGMKDVPDGDWFCPSCCCKICGQ-NKLKKDTKDFIDGVLN----CTQCEHQYHIMCLS 778
Query: 896 TDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSD 953
+++ P SFC KKC+ Y S H+ D D +L
Sbjct: 779 NSWTDKWKDHPKENSFCSKKCEV------------------YMQSDQHKLDAFDDETL-- 818
Query: 954 VCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILE 1013
VE ++L +AL V+ ECF P+ + R+G +L+ +++++ GS RLN++GF+T +LE
Sbjct: 819 ----VETYSKLKIALDVVHECFEPIEEPRTGRDLMKDVIFSNGSELNRLNFQGFYTILLE 874
Query: 1014 RDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAI 1073
++DE++S A++RIHG ++AE+P +GTR +R+ GMCR L+ +E L L V++L++PA+
Sbjct: 875 KNDELVSVATVRIHGDKVAEIPLVGTRFQFRQLGMCRILMDVLEKKLMELGVQRLVLPAV 934
Query: 1074 SELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
+ TWT FGF + S + + + + L F M QK +MK
Sbjct: 935 PGVLNTWTGSFGFSKMTDSERLQFVDYTFLDFQDTVMCQKLLMK 978
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 276 bits (706), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 157/431 (36%), Positives = 238/431 (55%), Gaps = 22/431 (5%)
Query: 706 LLAWMIDTGTVPLNEKVQY---WNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH-S 761
+L+++ID + KV Y RK + G+I RDGI+C+CC I++ F+ H S
Sbjct: 629 VLSYLIDNSIILPRCKVYYKVKGRHRKVCTLADGKITRDGIKCNCCMGIYSFVGFENHAS 688
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
S C P ++ E G SLL C + + + G F + ND C +C GG
Sbjct: 689 GSSTCRPSARIFLEDGRSLLDCQIKMMHDHKTRETTGKSFSGLSLVE-NDYICSVCHYGG 747
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
+LI CD CPS+FH+ CL ++ P+G W C C C CG+ D DD + L
Sbjct: 748 ELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQ------RKIDGDDE-VGQLLP 800
Query: 882 CSLCEEKYHQSCSQTDGAVQYEPSSLS--FCGKKCQEIFERLEKLLGVKHDL-EGGYTWS 938
C CE KYH C + +GA L FCGK C++I+E L KLLG + TW+
Sbjct: 801 CIQCEHKYHVRCLE-NGAADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGVDNLTWT 859
Query: 939 LVHRFDVST---DLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNF 995
LV + + D S SD+ E ++L +A+SVM ECF PL + + +L+ +++++
Sbjct: 860 LVKFINPDSCEHDSSKSDLL--AESYSKLNLAISVMHECFEPLKESLTNRDLVEDVIFSR 917
Query: 996 GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTG 1055
S RLN++GF+T +LER++E+IS A++R++GK++AE+P +GTR YRR+GMC L+
Sbjct: 918 WSELNRLNFQGFYTVLLERNEELISVATVRVYGKKVAEIPLVGTRLQYRRRGMCHILIEE 977
Query: 1056 IESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+E L L VE+L++PA+ + ETWT FGF + + + + + L F G M QK +
Sbjct: 978 LEKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMTNLERSQFLDYTFLDFQGAIMCQK-L 1036
Query: 1116 MKNQFPRENMI 1126
+ N P N +
Sbjct: 1037 LTNNNPSPNPV 1047
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 275 bits (704), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/421 (35%), Positives = 229/421 (54%), Gaps = 42/421 (9%)
Query: 706 LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKL 765
+L+W++D V E ++ NQ+ T V +G++ R+GI+C CC +FTI+ F+ H+
Sbjct: 590 ILSWLMDNNVVLPRENIRCCNQKDTTVRKEGKLTREGIKCSCCRRVFTINGFEVHANGAS 649
Query: 766 CHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNF-DGEDPNDDTCGICGDGGDLI 824
C N++ + G SLL+C ++++ K+ +++ + GE ND C +C GG LI
Sbjct: 650 CSGAANIFLDDGRSLLECQVEAYKKRKKAQPPDMLKMKLRQGE--NDVFCSVCHYGGKLI 707
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CDGCPS FH NCL ++ P G W C C C CG+ T N +++ +S C
Sbjct: 708 LCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTTSTNAKEEKFIS----CKQ 763
Query: 885 CEEKYHQSCSQTDGAVQYEPSSLS---FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH 941
CE KYH SC + DGA L FC K C+E E + G
Sbjct: 764 CELKYHPSCLRYDGACDSLDKILGEKWFCSKDCEESLE--PNMYG--------------- 806
Query: 942 RFDVSTDLSLSDVCQKVECNAR----LAVALSVMDECFLPLPDHRSGINLIHNILYNFGS 997
D K+E A L+VAL VM E F P+ G +L +++++ S
Sbjct: 807 -----------DDASKIEAAAENHCILSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWS 855
Query: 998 NFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIE 1057
FKRLN+ GF+T +LER++E++S A++RI GK++AEMPFIGTR +R++GMCR L+ +E
Sbjct: 856 KFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINELE 915
Query: 1058 SALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
L L VE+L++PA+ + TW + FGF + +S ++ +LL F + +K ++K
Sbjct: 916 KVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLKFTLLEFGRTILCEKILIK 975
Query: 1118 N 1118
+
Sbjct: 976 S 976
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 272 bits (696), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 158/438 (36%), Positives = 237/438 (54%), Gaps = 49/438 (11%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+ +ID V KV Y +Q+ M +G+IARDGI+ CC E+F+ F+ H+ S
Sbjct: 242 TILSQLIDNNVVLCRAKVHYSSQKDHHPMPEGKIARDGIKNSCCQEVFSPRGFEAHAGSS 301
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFV---NFDGEDP-----------N 810
N++ E SLL E +R+ H + +F E N
Sbjct: 302 FHQSDANIFLEDEGSLL-----------EGQRQMVHRITGKSFTKESSHGKKSNGDQCNN 350
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG--RINESTCHV 868
DD C +C GGDL+ CD CPS FHQ+CL +K+ P G W C C C+ CG R +E +
Sbjct: 351 DDICSVCHYGGDLVLCDQCPSCFHQSCLGLKELPEGDWFCPSCCCRICGENRFDEYS--- 407
Query: 869 NDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKL-- 924
++D+ S C CE +YH C + V+ E P FC +C++IF L KL
Sbjct: 408 -EEDNFKFS----CHQCELQYHVGCLRKQRHVKLETYPDGTRFCSTQCEKIFLGLLKLLG 462
Query: 925 ----LGVKHDLEGGYTWSLVH-RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLP 979
+GV + TW+L+ D+ + D E ++L +AL+VM ECF P+
Sbjct: 463 KPIPVGVDN-----LTWTLLKPTISEWFDMDVPDNKALTEVYSKLNIALNVMHECFEPIK 517
Query: 980 DHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGT 1039
+ +G +L+ ++++ GS+ KRLN++GF+ +LER+DE+IS A+IR+HG+++AE+P +GT
Sbjct: 518 EPHTGRDLVEDVIFCRGSDLKRLNFRGFYIVLLERNDELISVATIRVHGEKVAEVPLVGT 577
Query: 1040 RHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRN 1099
R YRR GMCR L+ IE L L VE+L +PA + +TW + FGF + S + +
Sbjct: 578 RSQYRRLGMCRILINEIEKKLVELGVERLTLPAAPSVLDTWVTSFGFSKMTDSERLTFLD 637
Query: 1100 MSLLVFPGVDMLQKPMMK 1117
+ L F M QK +MK
Sbjct: 638 YTFLDFQDTVMCQKLLMK 655
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 270 bits (690), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 215/763 (28%), Positives = 349/763 (45%), Gaps = 115/763 (15%)
Query: 497 LETHQEET-LSKHGAKQLLRDRIVELLLAAGWKIEYR----------PRNGREYCDAVYV 545
L ++QE T S+ ++R + + LLA GW I ++ P + Y Y
Sbjct: 422 LASYQEVTSFSQGHISDIIRMNLKKHLLALGWSIVFKEDEIIRPNGQPSIIKRY---RYK 478
Query: 546 NPEGKTHWSI-----TLAYSVLK-----NHYEQEGGSSDTSKTGFTFTPIPEDELSILKK 595
+P GKT+ S + A +K N + + + G +L+IL K
Sbjct: 479 SPVGKTYVSFLQVLRSFAVQCIKRVKGNNTEDIPDNHNYLAAHGVNLDASVSRDLAILGK 538
Query: 596 --------VINKS--------RSDRNKK----KKGKNLGTDGEIVTKK-KKKKGKTNSAA 634
V+ K ++DR KK K K L + G IV +K K K +
Sbjct: 539 RKREYKSDVVGKYVDYVEADVQNDRKKKLLRSKAKKFLKSAGWIVRQKMKSSKKRKLRYH 598
Query: 635 SPHGKSQK--------------------RGIKGKPSVSEGGTS----------------H 658
SPHGKS K GI ++ GG + H
Sbjct: 599 SPHGKSYKCLLAACKGYLEQDLKENNASSGITTDTFIALGGGARDTSGRKDPLVSVLDRH 658
Query: 659 NGMSIPARRHKLQETQQRKRCAL---LVRNSVEGEESNGDGFVAYDGKW-TLLAWMIDTG 714
+G+ H + ++ + V +S G+ + Y + T+L+ ++D
Sbjct: 659 DGLFTWPTCHVKSKKRKSSSVTMSHARVLSSTHGQ------ILPYQHRAKTVLSLLVDKN 712
Query: 715 TVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYF 774
+ K+ Y + + +G + +DGI+C CC+E+FT+ F+ H+ P +++
Sbjct: 713 ILLPRVKLTYKQRSDGPRLKEGAVTKDGIKCRCCNELFTLESFEVHAGCSTRLPAAHIFL 772
Query: 775 ESGSSLLQCILDSW--NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPST 832
+ G L QC+++ NK ES D E +D C IC +GG+++ CD CPS+
Sbjct: 773 KDGRPLSQCLVELMGENKPKESLHVRLKTNYSDTE--SDSICSICNEGGEILLCDNCPSS 830
Query: 833 FHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQS 892
FH C+ ++ P G W+C C C C +S+ + D + T+ C CE +YH
Sbjct: 831 FHHACVGLESTPEGSWYCPSCRCSIC----DSSDYDPDTNKFTEKTIMYCDQCEREYHVG 886
Query: 893 CSQTDG-AVQYEPSSLSFCGKKCQEIFERLEKLLG--VKHDLEGGYTWSLVHRFDVSTDL 949
C + G + P FC + C EIF+ L+ L+G + +EG + RFD
Sbjct: 887 CMRNKGDQLTCCPEGCWFCSRGCSEIFQHLQGLIGKSIPTPVEG--LSCTILRFDRENAS 944
Query: 950 SLSDVCQKV--ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGF 1007
D ++ E +L +AL V+ ECF+ + + + +L +I++N S +RLN++GF
Sbjct: 945 QHGDFYNEIIAEQYGKLCIALDVLHECFVTIIEPSTRRDLSEDIVFNRESGLRRLNFRGF 1004
Query: 1008 FTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEK 1067
+T IL++D E+IS + R+ GK+ AE+P IGTR YRRQGMCR L+ +E L L VE+
Sbjct: 1005 YTLILQKDGELISVGTFRVCGKKFAELPLIGTRVQYRRQGMCRLLMNELEKLLSGLGVER 1064
Query: 1068 LIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM-MKNQFPRENMI 1126
L++PAI +L ETWT FGF+ + S + ++ S+L F G + QK + + PR+ I
Sbjct: 1065 LVLPAIPQLLETWTGSFGFRAMSFSDRFELAESSILSFQGTTICQKILDVAVHNPRDMSI 1124
Query: 1127 SAKGLRSSELENPRTADEVGKNSDEKYSAGFDLNACINAAAPH 1169
L + E+ E+GKNS + + +N + H
Sbjct: 1125 QLV-LNAEEM-------ELGKNSIVSFERTTACDTVVNNVSNH 1159
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 268 bits (686), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 156/421 (37%), Positives = 237/421 (56%), Gaps = 23/421 (5%)
Query: 707 LAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLC 766
L+WMID V EKV Y + + +GRI R+GI C+CC +IFT++ F++H+ S
Sbjct: 515 LSWMIDNNLVSPGEKVSYRGSKGPGELTRGRITREGIECNCCQKIFTLTGFESHAGSTNH 574
Query: 767 HPFQNLYFESGSSLLQCI------LDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDG 820
P N+ E G SLL C + E+K KG GE D C +C DG
Sbjct: 575 RPAANIILEDGRSLLDCQRKKKPRIKMQRVTREAKWKGRQ-NQHQGE--TDYICSVCHDG 631
Query: 821 GDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQ 880
GDLI CD CPSTFH+NC+ ++ P G+W C C C CG N+ +V + DS L +
Sbjct: 632 GDLIVCDHCPSTFHKNCVGLEDIPEGEWFCPPCCCGICGE-NKFKYNVQEPKDSRLLS-- 688
Query: 881 ICSLCEEKYHQSCSQTDGAVQY---EPSSLSFCGKKCQEIFERLEKLLGVKHDLEG--GY 935
C CE KYH C + G V+ +P FC KC++IF L+ LLG K + G
Sbjct: 689 -CDQCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLG-KSVVVGPDNL 746
Query: 936 TWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNF 995
TW+L D + DV +++L +A+ V+ ECF P + +G ++ +++++
Sbjct: 747 TWTLWKFMDSDS----CDVEAPTGKHSKLDLAVEVIHECFEPATETYTGRDIAEDVIFSR 802
Query: 996 GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTG 1055
N RLN++GF+T +LER+DE+I+ A++R+ G ++AE+P +GTR ++RR GMC+ L+
Sbjct: 803 ECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGTRFLFRRLGMCKILMDE 862
Query: 1056 IESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+E L +L VE+L++PA+ + TW + FGF L + K + + + L FPG QK +
Sbjct: 863 LEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYLDHTFLDFPGTIKCQKVL 922
Query: 1116 M 1116
+
Sbjct: 923 L 923
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 268 bits (684), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/405 (36%), Positives = 223/405 (55%), Gaps = 73/405 (18%)
Query: 511 KQLLRDRIVELLLAAGWKIEYRPRNGREYCDAVYVNPEGK-THWSITLAYSVLKNHYEQE 569
KQ + D+I +LL AGW I+ RPRNGR Y D+VY++P GK ++WS+T AY V + E E
Sbjct: 731 KQKISDQIKAMLLDAGWTIDLRPRNGRNYMDSVYIHPSGKGSYWSVTKAYYVFRADMESE 790
Query: 570 ------------------GGSSDTSKTGFTFTPIPEDELSILKKVINKSRSDRNKKKKGK 611
G +S +G T T +D LS LK+V+ R+ + + ++
Sbjct: 791 QKESSKDHILSKKSVGSPGKRQVSSSSGCTLT---DDILSKLKRVVVNKRTTKVEIQR-- 845
Query: 612 NLGTDGEIVTKKKKKKGKTNSAASPHGKSQKRGIKGKPSVSEGGTSHNGMSIPARRHKLQ 671
+ +KK+KK TNS R H
Sbjct: 846 --------LRQKKEKKNTTNST--------------------------------RLHLGN 865
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALLVR S + S DGFV Y+ K T+ +W+ID + +N K+ ++ ++
Sbjct: 866 ERKKRGGCALLVRGSNKESGSTTDGFVPYEWKRTIFSWLIDLNVLSVNTKLNCLDESHSK 925
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWNK 790
V+L+G + RDGI C CCSE+ ++ +F TH+ S++ P++N+ + LL C++++WN
Sbjct: 926 VLLEGFVTRDGINCSCCSEVISVPEFVTHAGSEVNKPYRNILVDGLDIDLLHCLINAWNM 985
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
Q +++R+ F V+ +G+DPNDDTCGICGDGG+LICCDGCPSTFH +CL ++ P+ W C
Sbjct: 986 QSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCC 1045
Query: 851 VYCSCQFCGRINESTCHVNDQDDSA--LSTLQICSLCEE-KYHQS 892
CSC+FC + +D +D+A S+L CS YH+S
Sbjct: 1046 SNCSCKFCHEHSS-----DDAEDTADVDSSLHTCSQIHNIWYHES 1085
Score = 256 bits (654), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 132/279 (47%), Positives = 172/279 (61%), Gaps = 46/279 (16%)
Query: 890 HQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDL 949
H G V P L+ Q +FE L+ LL VK DLE Y+ +V R
Sbjct: 1215 HVDSKNKSGWVVLSPLELAVA----QALFEELQNLLAVKKDLEPEYSCRVVQRIHEEVPE 1270
Query: 950 SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFT 1009
+ + ++VECN+++AVALS+MDECFLP+ D R+GINLI N++YN GSNF RL+++GF+
Sbjct: 1271 EVLALDKRVECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYI 1330
Query: 1010 AILERDDEIISAASIR-----------------------------------------IHG 1028
ILER DEII+AAS+R IHG
Sbjct: 1331 IILERGDEIIAAASVRLKEKNILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHG 1390
Query: 1029 KELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQP 1088
+LAEMPFIGTR+MYRRQGMCRRL+ GIE L SLN+EKLIIPAI+EL +TWTS FGF P
Sbjct: 1391 TKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSP 1450
Query: 1089 LEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRENMIS 1127
L+ S KQ+++++S+LVFPG +LQKP++K P E+ S
Sbjct: 1451 LDDSEKQEVKSVSMLVFPGTGLLQKPLLKTS-PSEDQCS 1488
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 265 bits (678), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/436 (33%), Positives = 234/436 (53%), Gaps = 41/436 (9%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+W+ID+ V K++Y + + +GRI+ GI+C+CC +++T+ FD H+ K
Sbjct: 318 TVLSWLIDSTHVLPRAKIKYCCSKGQHSIAEGRISGSGIKCNCCGKVYTLCGFDYHASGK 377
Query: 765 ----LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDP----------- 809
F N++ E G SLL C + D +K G E+P
Sbjct: 378 QGRTATSIFSNIFLEDGRSLLDCQMQI--MHDHTKNLG--------EEPLERWQSSKDQV 427
Query: 810 -NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
ND C +C GG+LI CD CPS+FH++CL + P G W C C C+ CG+ + +
Sbjct: 428 ENDHICSVCHYGGELILCDQCPSSFHKSCLGLMDVPDGDWFCSSCCCKICGQCLKRDSDL 487
Query: 869 NDQDDSALSTLQICSLCEEKYHQSC--SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG 926
+ +DD L C+ CE KYH C ++ + ++Y P FC K+CQ+IF L +LLG
Sbjct: 488 SMEDDGVLD----CTQCERKYHVVCLGNKREECLEYFPKEHWFCSKRCQQIFLGLHELLG 543
Query: 927 VK-----HDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDH 981
K H+L TW+L+ + SD+ E + L +AL +M E F P+ +
Sbjct: 544 KKIPVGLHNL----TWTLLKSIQFNDQCEASDIEALSENYSMLNIALDMMHEFFDPVEEP 599
Query: 982 RSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRH 1041
+ +L+ +++++ S RLN+ GF+T +L++DDE IS A++R++G+++AE+P +GTR
Sbjct: 600 HTKRDLLKDVIFSKRSELNRLNFHGFYTVLLQKDDEFISVATVRVYGEKVAEIPLVGTRF 659
Query: 1042 MYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMS 1101
YRR GMC L+ +E L L V++LI+PA+ TW FGF L + ++ + +
Sbjct: 660 QYRRLGMCCILMNVLEKKLRELGVQRLILPAVPSALNTWIGSFGFSKLTELDRLQLLDYT 719
Query: 1102 LLVFPGVDMLQKPMMK 1117
L F M K + K
Sbjct: 720 FLDFQDTIMCHKLLTK 735
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 265 bits (677), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 187/309 (60%), Gaps = 7/309 (2%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
D NDDTCG CGDGG+LICCD CP+++HQ CL + P G W+C C C CG + +S
Sbjct: 6 DQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCLCDICGEVIDSKEL 65
Query: 868 VNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGV 927
V ++L L CS CE +YH C + S FCG+KC EI+ +GV
Sbjct: 66 V-----TSLPALD-CSQCERQYHVKCVSAKVPCNEDGSGTWFCGRKCHEIYMTFRSRVGV 119
Query: 928 KHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINL 987
++ ++++ + ++ ECN +L +A S+M+ECFLP+ D R+GI++
Sbjct: 120 PDHMDDDLCFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECFLPILDPRTGIDI 179
Query: 988 IHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQG 1047
I +ILYN+ S+ NYKGF+T +LE DD ++S ASIR+HG LAEMP + T R+QG
Sbjct: 180 IPSILYNWRSDL-HFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQG 238
Query: 1048 MCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPG 1107
MCRRL+ IE L SL VE L++ AI L ETWTS FGF+ ++ S K+++ + L PG
Sbjct: 239 MCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDKKRLSKVRLAAVPG 298
Query: 1108 VDMLQKPMM 1116
+L+K ++
Sbjct: 299 TVLLKKDLL 307
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 260 bits (665), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 155/428 (36%), Positives = 233/428 (54%), Gaps = 28/428 (6%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH-SKS 763
T+L+W+I+ V K+ Y + +GRI+ DGI+C+CC ++F++S F H + +
Sbjct: 31 TVLSWLIENNAVVPRSKLYYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHVTGN 90
Query: 764 KLCHPFQNLYFESGSSLLQCILDSWNKQ-------------DESKRKGFHFVN-FDGEDP 809
+C P +NL+ +G SL+ C ++ K+ R F + E+
Sbjct: 91 NICRPAENLFLGNGKSLVSCQVELMRKKIMRFNQEPVVRATGTGSRSKFRLLAPLGSENC 150
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
ND C IC GGDLICCD CPS+FH CL+I++ P G W C C C CG
Sbjct: 151 NDYVCSICHYGGDLICCDRCPSSFHATCLNIERVPEGDWFCPCCCCGICGDSQFDKMAEQ 210
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG--- 926
DDS L C CE ++H C + G V E FC K C+ + L++LLG
Sbjct: 211 FADDSLLR----CHQCERQFHARCKKEGGMVSSEEHW--FCCKTCEMMQWGLQQLLGKPI 264
Query: 927 -VKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGI 985
V H+L + ++ + D L+ + E ++L+VAL VM ECF P+ D ++
Sbjct: 265 LVGHNLTCTLIKPMQYQAEDRVDYDLAAM---AENYSKLSVALEVMHECFDPVKDPKTKR 321
Query: 986 NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRR 1045
+L+ ++L+ GSN RLN++GF+T +LER+DE+I+ A +RI+G ++AEMP IGTR +RR
Sbjct: 322 DLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRFQHRR 381
Query: 1046 QGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVF 1105
GMCR L+ IE L +L V+KL++PA + TWT+ FGF P+ S + + L F
Sbjct: 382 LGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGFTPMTESDRLDFLGFTFLDF 441
Query: 1106 PGVDMLQK 1113
M +K
Sbjct: 442 HDTIMCKK 449
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 260 bits (665), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 142/421 (33%), Positives = 231/421 (54%), Gaps = 20/421 (4%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T+L+ ++ V +KV Y Q + +G I RDGI+C CC+EIFT+ F+ H+ S
Sbjct: 571 TVLSMLVKKKIVVPGDKVTY-KQSDGPGIKEGSIRRDGIKCMCCNEIFTVENFEVHAGSS 629
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPN---DDTCGICGDGG 821
P +++ + G SL QC+++ H G++ + D C +C DGG
Sbjct: 630 TPLPSAHMFLKDGMSLSQCLVEFMGGNKPRDPHPLH-ARLKGKNSDLESDSICSVCHDGG 688
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALS--TL 879
DL+ CD CPS++H +C+ ++ P G W+C C C C + D D S + T+
Sbjct: 689 DLLLCDNCPSSYHHDCVGLEAIPEGNWYCPSCRCSICNLSD------YDPDTSQFTEKTI 742
Query: 880 QICSLCEEKYHQSCSQ-TDGAVQYEPSSLSFCGKKCQEIFERLEKLLG--VKHDLEGGYT 936
C CE +YH C++ +D + P FC + C +F+ L++L+G V +EG +
Sbjct: 743 VYCDQCEREYHVGCTRNSDNQLICRPEGCWFCSRGCSNVFQHLQELIGKSVPTPIEG-VS 801
Query: 937 WSLVHRFDVSTDLSLSDVCQKVECN--ARLAVALSVMDECFLPLPDHRSGINLIHNILYN 994
W+++ +F D ++ + +L VA+ ++ ECF+ + + R+ ++ +I++N
Sbjct: 802 WTIL-KFCSGNGSDHGDYDDEIMADHYGKLCVAVGILHECFVTIIEPRTQSDISEDIVFN 860
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
S +RLN++GF+T +L++ E IS + RI G++ AE+P IGT YRRQGMCR L+
Sbjct: 861 RESELRRLNFRGFYTILLQKGGEPISVGTFRICGQKFAELPLIGTSSPYRRQGMCRLLIN 920
Query: 1055 GIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKP 1114
+E L L VE+LI+PA+ EL ETWT FGF + S + ++ S+L F G M QK
Sbjct: 921 ELEKLLLDLGVERLILPAVPELLETWTCSFGFTIMSNSDRLELAGNSILSFQGTTMCQKI 980
Query: 1115 M 1115
+
Sbjct: 981 L 981
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 257 bits (657), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 153/425 (36%), Positives = 229/425 (53%), Gaps = 22/425 (5%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH-SKS 763
T+L+W+I+ V K+ Y + +GRI+ DGI+C+CC ++F+++ F H + +
Sbjct: 31 TVLSWLIENNAVVPRSKLYYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLTGFQAHVTGN 90
Query: 764 KLCHPFQNLYFESGSSLLQCILDSWNKQ--------------DESKRKGFHFVNFDGEDP 809
+C P +NL+ +G SL+ C ++ K+ S+ K E+
Sbjct: 91 NICRPAENLFLGNGKSLVSCQVELMRKKIMMFDQGPAVRAAGTGSRSKFRSLAPLGSENC 150
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
ND C IC GGDLICCD CPS+FH CL+I+ P G W C C C CG
Sbjct: 151 NDYVCSICHYGGDLICCDRCPSSFHAACLNIESVPEGDWFCPCCCCGICGDSQFDKMAEQ 210
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKH 929
DDS L C CE ++H C + G V E FC K C+ + L++LLG
Sbjct: 211 FADDSLLR----CHQCERQFHARCKKEGGMVSSEEHW--FCCKTCEMMQWGLQQLLGKPI 264
Query: 930 DLEGGYTWSLVHRFDV-STDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLI 988
+ T +L+ + D D+ E ++L+VAL VM ECF P+ D ++ +L+
Sbjct: 265 LVGQNLTCTLIKPMQYQAEDREDYDLAAMAENYSKLSVALEVMHECFDPVKDPKTKRDLV 324
Query: 989 HNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGM 1048
++L+ GSN RLN++GF+T +LER+DE+I+ A +RI+G ++AEMP IGTR +RR GM
Sbjct: 325 EDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRFQHRRLGM 384
Query: 1049 CRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGV 1108
CR L+ IE L +L V+KL++PA + TWT+ FGF P+ S + + L F
Sbjct: 385 CRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGFTPMTESDRLDFLGFTFLDFHDT 444
Query: 1109 DMLQK 1113
M +K
Sbjct: 445 IMCKK 449
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 248 bits (634), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 118/323 (36%), Positives = 193/323 (59%), Gaps = 8/323 (2%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
T++ W++ TG + + + +QY + + +++ G + +GI C+CC + ++S F H+
Sbjct: 852 TVICWLLATGFLSVKDVIQYRDPKSNKIIKDGMVTWEGIVCNCCKKNLSVSDFMAHAGRS 911
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
L+ ESG S C++++W+ + S+R + D +DDTCG CGDGG+L+
Sbjct: 912 HPQSSLGLFLESGKSYTLCLVEAWSAESMSRRSNAWGRKVEAIDESDDTCGFCGDGGELL 971
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSL 884
CCD CPST+HQ CL K+ P G W+C C+CQ CG ++++ S S + C
Sbjct: 972 CCDNCPSTYHQACLSAKELPEGSWYCHNCTCQVCGGP------FSEKEVSTFSAIFKCFQ 1025
Query: 885 CEEKYHQSC-SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF 943
C + YH +C Q ++ + S FCGK C+EIF L +G + L+ +WS++
Sbjct: 1026 CGDAYHDTCIEQEKLPLEDQISQTWFCGKYCKEIFIGLRSHVGTDNILDSDLSWSILRCN 1085
Query: 944 DVSTDL-SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL 1002
+ L S+ + ECN +LAVAL++++ECF+ + D R+G+++I ++LYN GSNF R+
Sbjct: 1086 NDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARV 1145
Query: 1003 NYKGFFTAILERDDEIISAASIR 1025
+Y+GF+T ILE+ DEI+ ASIR
Sbjct: 1146 DYQGFYTVILEKGDEILCVASIR 1168
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 238 bits (606), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 184/626 (29%), Positives = 299/626 (47%), Gaps = 82/626 (13%)
Query: 522 LLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFT 581
LLA GW+ Y + RE + Y P GK + S+ +A K + EQ+G
Sbjct: 83 LLAMGWEFFYIQKKTRE--ELRYKAPNGKVYISLRMA---CKAYIEQKGC---------- 127
Query: 582 FTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQ 641
V++ S + N + K +D ++ +K++ + K+
Sbjct: 128 --------------VVSHSTTQMNGSQPEKETPSDQDLGNEKQRPR-----------KAA 162
Query: 642 KRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESN-------- 693
K P ++ +N S P + ++ K LV++ V E +
Sbjct: 163 KGTPPRNPPTADFSVHYNVASEPEKETPASSFEKPKDKYELVKSPVLPSEQDVRNEKPQP 222
Query: 694 --GDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR-VMLQGRIARDG-IRCDCCS 749
+ + T L++++D + ++V R R V +G I +G I+CDCCS
Sbjct: 223 RKAAKGTSRRNQRTALSYLVDRELISPGDRVHCNVTRDGRLVTWRGSITNEGFIKCDCCS 282
Query: 750 EIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQC----ILDSWNKQDESK---RKGFHFV 802
+F ISKF+ H+ S P N++ E G SLL C + ++ Q E+K +K H
Sbjct: 283 NLFPISKFEAHTGSTKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETKATEKKVNHND 342
Query: 803 NFD----GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFC 858
N D G D ND C +C GG+LI CD CP+ FH +CL IK PSG W+C C C+ C
Sbjct: 343 NADSDTHGLDKNDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSCCCKIC 402
Query: 859 GRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLS----FCGKKC 914
G++ T +DQ S ++ C CE+ H C ++ ++ ++ FC ++C
Sbjct: 403 GQV---TYDFDDQVSSFDTSFVRCVQCEQNVHIGCVKSIQVLEDSNQTIDRENWFCTRRC 459
Query: 915 QEIFERLEKLLGVK---HDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVM 971
++I L+ LL + D TW+L+ +S+ +K +L AL VM
Sbjct: 460 EDIHMGLQNLLWKQIPVGDARENLTWTLMKH----CPYKVSEHNRK-----KLNKALGVM 510
Query: 972 DECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKEL 1031
+ F P+ D + +LI ++ + S KRLN++GF+TAILER + +++ A++R++G E+
Sbjct: 511 HKSFRPVKDPITKNDLIEDVFLSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEV 570
Query: 1032 AEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEV 1091
AE+P + TR YRR GMCRRLL +E L + V++L +PA+ E TWT FGF +
Sbjct: 571 AEIPLVATRLKYRRHGMCRRLLNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTD 630
Query: 1092 SSKQKMRNMSLLVFPGVDMLQKPMMK 1117
S + + + L F QK +++
Sbjct: 631 SDRLDLIKYTFLGFQHTVRCQKDLLE 656
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 238 bits (606), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 184/626 (29%), Positives = 299/626 (47%), Gaps = 82/626 (13%)
Query: 522 LLAAGWKIEYRPRNGREYCDAVYVNPEGKTHWSITLAYSVLKNHYEQEGGSSDTSKTGFT 581
LLA GW+ Y + RE + Y P GK + S+ +A K + EQ+G
Sbjct: 60 LLAMGWEFFYIQKKTRE--ELRYKAPNGKVYISLRMA---CKAYIEQKGC---------- 104
Query: 582 FTPIPEDELSILKKVINKSRSDRNKKKKGKNLGTDGEIVTKKKKKKGKTNSAASPHGKSQ 641
V++ S + N + K +D ++ +K++ + K+
Sbjct: 105 --------------VVSHSMTQMNGSQPEKETPSDQDLGNEKQRPR-----------KAA 139
Query: 642 KRGIKGKPSVSEGGTSHNGMSIPARRHKLQETQQRKRCALLVRNSVEGEESN-------- 693
K P ++ +N S P + ++ K LV++ V E +
Sbjct: 140 KGTPPRNPPTADFSVHYNVASKPEKETPASSFEKPKDKYELVKSPVLPSEQDVWNEKPQP 199
Query: 694 --GDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR-VMLQGRIARDG-IRCDCCS 749
+ + T L++++D + ++V R R V +G I +G I+CDCCS
Sbjct: 200 RKAAKGTSRRNQRTALSYLVDRELISPGDRVHCNVTRDGRLVTWRGSITNEGFIKCDCCS 259
Query: 750 EIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQC----ILDSWNKQDESK---RKGFHFV 802
+F ISKF+ H+ S P N++ E G SLL C + ++ Q E+K +K H
Sbjct: 260 NLFPISKFEAHTGSTKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETKATEKKVNHND 319
Query: 803 NFD----GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFC 858
N D G D ND C +C GG+LI CD CP+ FH +CL IK PSG W+C C C+ C
Sbjct: 320 NADSDTHGLDKNDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSCCCKIC 379
Query: 859 GRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLS----FCGKKC 914
G++ T +DQ S ++ C CE+ H C ++ ++ ++ FC ++C
Sbjct: 380 GQV---TYDFDDQVSSFDTSFVRCVQCEQNVHIGCVKSIQVLEDSNQTIDRENWFCTRRC 436
Query: 915 QEIFERLEKLLGVK---HDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVM 971
++I L+ LL + D TW+L+ +S+ +K +L AL VM
Sbjct: 437 EDIHMGLQNLLWKQIPVGDARENLTWTLMKH----CPYKVSEHNRK-----KLNKALGVM 487
Query: 972 DECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKEL 1031
+ F P+ D + +LI ++ + S KRLN++GF+TAILER + +++ A++R++G E+
Sbjct: 488 HKSFRPVKDPITKNDLIEDVFLSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEV 547
Query: 1032 AEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEV 1091
AE+P + TR YRR GMCRRLL +E L + V++L +PA+ E TWT FGF +
Sbjct: 548 AEIPLVATRLKYRRHGMCRRLLNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTD 607
Query: 1092 SSKQKMRNMSLLVFPGVDMLQKPMMK 1117
S + + + L F QK +++
Sbjct: 608 SDRLDLIKYTFLGFQHTVRCQKDLLE 633
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 229 bits (585), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 139/428 (32%), Positives = 220/428 (51%), Gaps = 24/428 (5%)
Query: 704 WTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKS 763
+++++W+I+ L + + ++ +G I DGI C+CC FT+S F+ H+
Sbjct: 382 YSIVSWLIENKV--LVSGTHVFCRGSENIVKRGSIFSDGIVCNCCRVNFTVSGFEAHAGC 439
Query: 764 KLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVN-FDGEDPNDDTCGICGDGGD 822
P ++ E G SL +C +++ +KG H + + E ND+ C ICG GGD
Sbjct: 440 TRHRPSISILLEDGRSLFKC------QREARDQKGSHCIGEANSEANNDNVCSICGFGGD 493
Query: 823 LICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQIC 882
L+ CD CPS FH CL + + P G W C C C+ C R C + D + +C
Sbjct: 494 LVLCDRCPSAFHLGCLGLDRVPDGDWFCPTCCCKICYR---PKCK-QECADGNENNFLVC 549
Query: 883 SLCEEKYHQSCSQTD--GAVQYEPSSLS---FCGKKCQEIFERLEKLLGVKHDLEGGYTW 937
CE+K+H C +T G+ E + FC C +F L+KLLG + W
Sbjct: 550 VQCEQKFHFGCVKTTRFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNINW 609
Query: 938 SLVHRF---DVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYN 994
+L+ D D + ++ Q+ +L AL V+ E F P D SG LI +++++
Sbjct: 610 TLLKNVSSDDDGGDFTSNEFSQE---KHKLNAALGVLYEGFNPTIDALSGRELIKDLVFS 666
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
S KRLN++GF+T ILE+ E+IS A+IRI G+++AE+ F+ T+ +R +GMCR L+
Sbjct: 667 RDSEHKRLNFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGRGMCRLLMD 726
Query: 1055 GIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKP 1114
+E L L V +L++ + + TWT FGF + K K+ + + L F M KP
Sbjct: 727 ELEEQLTRLGVGRLVLHSSEDAINTWTKSFGFARMTSEDKCKLIDNTFLEFHNSIMCLKP 786
Query: 1115 MMKNQFPR 1122
+ +PR
Sbjct: 787 LNIPIWPR 794
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 229 bits (585), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 142/436 (32%), Positives = 218/436 (50%), Gaps = 33/436 (7%)
Query: 706 LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKL 765
+++W+I + V + Q +V+ +G + D I CDCC FTI+ F++H+
Sbjct: 3 IVSWLIKNQVLVSGTNV--FCQGSNKVVKRGSLFSDAIVCDCCHVTFTITGFESHAGCTR 60
Query: 766 CHPFQNLYFESGSSLLQCILDSWNKQD--------------------ESKRKGFHFVNFD 805
P ++ E G SLL C ++ + D E+++K V
Sbjct: 61 HRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVVNENQKKNHSIVKENRKKNHCVVKEK 120
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINEST 865
E ND+ C ICG GGDL CD CPS FH CL + + P G+W C C C+ C R
Sbjct: 121 SEANNDNVCSICGFGGDLALCDRCPSAFHLGCLGLNRVPIGEWFCPTCCCKICYRPKCKQ 180
Query: 866 CHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLL 925
+ +D++ L +C CE+KYH C + G +++ FC C +F L+KLL
Sbjct: 181 ECKDHKDNNIL----VCVQCEQKYHFGCVKAVG-IEFNHMENWFCSVVCGNMFLCLKKLL 235
Query: 926 GVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGI 985
G + TW+LV D + QK ++L +AL V+ E F P D SG
Sbjct: 236 GKPIKVADNLTWTLVKNVSSVDDKEFN---QK---ESKLNMALGVLYEGFNPTFDALSGR 289
Query: 986 NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRR 1045
LI +++++ S RLN+ GF+ ILE+ E+IS A++RI+G+++AE+ F+ T+ YRR
Sbjct: 290 ELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVATKEQYRR 349
Query: 1046 QGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVF 1105
QGMC L+ IE L L VEKL++ + + TWT FGF + K + + + L F
Sbjct: 350 QGMCHLLMDEIEKQLTRLGVEKLLLHSSEDAMNTWTRSFGFARMASKDKCQFIDHTFLEF 409
Query: 1106 PGVDMLQKPMMKNQFP 1121
M K + ++P
Sbjct: 410 QNSTMCLKALKTPKWP 425
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 228 bits (581), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 102/147 (69%), Positives = 127/147 (86%)
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
MDECFLP+ D R+GINLI N++Y+ GSNF RL+++GF+ ILER DEII+AAS+RIHG +
Sbjct: 1 MDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTK 60
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLE 1090
LAEMPFIGTR+MYRRQGMCRRL+ GIE L SLNVEKLIIPAI+EL +TWTS FGF PLE
Sbjct: 61 LAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLE 120
Query: 1091 VSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
S KQ+++++S+LVFPG +LQKP++K
Sbjct: 121 DSEKQEVKSISMLVFPGTGLLQKPLLK 147
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 225 bits (573), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/430 (32%), Positives = 215/430 (50%), Gaps = 33/430 (7%)
Query: 706 LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKL 765
+++W+I + V + Q +V+ +G + D I CDCC FTI+ F++H+
Sbjct: 3 IVSWLIKNQVLVSGTNV--FCQGSNKVVKRGSLFSDAIVCDCCHVTFTITGFESHAGCTR 60
Query: 766 CHPFQNLYFESGSSLLQCILDSWNKQD--------------------ESKRKGFHFVNFD 805
P ++ E G SLL C ++ + D E+++K V +
Sbjct: 61 HRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVVNENQKKNHSIVKENRKKNHCVVKEN 120
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINEST 865
E ND+ C ICG GGDL CD CPS FH CL + + P G+W C C C+ C R
Sbjct: 121 SEAKNDNVCSICGFGGDLALCDRCPSAFHLGCLGLNRVPIGEWFCPTCCCKICYRPKCKQ 180
Query: 866 CHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLL 925
+ +D++ L +C CE+KYH C + G +++ FC C +F L+KLL
Sbjct: 181 ECKDHKDNNIL----VCVQCEQKYHFGCVKAVG-IEFNHMENWFCSVVCGNMFLCLKKLL 235
Query: 926 GVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGI 985
G + TW+LV D + QK ++L +AL V+ E F P D SG
Sbjct: 236 GKPIKVADNLTWTLVKNVSSVDDKEFN---QK---ESKLNMALGVLYEGFNPTFDALSGR 289
Query: 986 NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRR 1045
LI +++++ S RLN+ GF+ ILE+ E+IS A++RI+G+++AE+ F+ T+ YRR
Sbjct: 290 ELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVATKEQYRR 349
Query: 1046 QGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVF 1105
QG+C L+ IE L L VEKL++ + + WT FGF + K + + + L F
Sbjct: 350 QGICHLLMDEIEKQLTRLGVEKLLLHSSEDAMNIWTKSFGFARMASKDKCQFIDHTFLEF 409
Query: 1106 PGVDMLQKPM 1115
M K +
Sbjct: 410 QNSTMCLKAL 419
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 223 bits (567), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 179/330 (54%), Gaps = 18/330 (5%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
+D C C GGDLI CD CPSTFH CL++K P W C C C+ CG+
Sbjct: 142 SDTICSFCHYGGDLILCDKCPSTFHLGCLELKDVPLENWFCPSCCCELCGK--------- 192
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAV-QYEPSSLSFCGKKCQEIFERLEKLLGVK 928
DS+ ST C C YH C DG + + S +FC K C E+ +L +LLG+
Sbjct: 193 --GDSSTST-NACLQCARAYHVHCLTKDGCLLPTDYPSENFCSKSCYELCAQLHQLLGIS 249
Query: 929 HDLE-GGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINL 987
+ G TW+L S D+ + + + L VM ECF + + + ++
Sbjct: 250 NPTSVDGLTWTLTRS---SKDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDM 306
Query: 988 IHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQG 1047
+ +++YN GS FKRLN+ GF+ +L R D+I+S A++RIHG + AEMP + T +RRQG
Sbjct: 307 VTDLIYNSGSKFKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVATPFNFRRQG 366
Query: 1048 MCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPG 1107
MCR L+ + L VE+LI+PAI +LR+ W + FGF + +S +Q++ S + F G
Sbjct: 367 MCRLLMQEVLKLLNKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQLSGYSFVGFQG 426
Query: 1108 VDMLQKPMMKNQFPRENMISAKGLRSSELE 1137
MLQ + ++ E +++G+ S +LE
Sbjct: 427 TMMLQNVLTSSRITLEINANSEGM-SQDLE 455
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 222 bits (566), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/458 (30%), Positives = 222/458 (48%), Gaps = 55/458 (12%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSK 764
TLL+ +ID V +KV Y R G I +GIRC CC++ T+++F H+ ++
Sbjct: 391 TLLSVLIDKDIVVPRDKVTYRAARDRPAAKDGFITGEGIRCTCCNKTLTVAEFAAHATAR 450
Query: 765 LC----HPFQNLYFESGSSLLQCILDSWNKQDE--SKRKGFHFVNFDGEDP-NDDTCGIC 817
+ ++ + G SL QC+++ + + R G V DP D C IC
Sbjct: 451 RGSDRREAWARVFLKDGRSLSQCLVELMRRDVAVVAARNGDVRVKEKCSDPEGDSVCSIC 510
Query: 818 GDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALS 877
DGGDL+ CD CPS FH C+ ++ P G W C C C CG +D D +A
Sbjct: 511 NDGGDLLLCDNCPSAFHHACVGLQATPEGDWFCPSCRCGVCGG--------SDFDATAAG 562
Query: 878 -------TLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGK-------------KCQEI 917
T+ C CE +YH C + G+ + E S+ +C + +C E+
Sbjct: 563 GGGFTDKTIIYCDQCEREYHVGCVRRRGSEEEEESAAEWCRRPEEQEEWPWLCSPECGEV 622
Query: 918 FERLEKLLGVKHDLE-----------GGYTWSLVHRFDVS--TDLSLSDVCQ------KV 958
F L+ L V + G + S++ R + ++ CQ
Sbjct: 623 FRHLQGLAAVARERSIPIPTTVPTTVEGVSLSILRRRRRRPISMVATGSGCQEEEEEEDA 682
Query: 959 ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
+ +L AL V+ ECF+ L + R+ +L +I++N S +RLN++G++ LE+ E+
Sbjct: 683 AEHGQLCSALDVLHECFVTLIEPRTQTDLTADIVFNRESELRRLNFRGYYVVGLEKAGEL 742
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
I+ ++R+ G E+AE+P +GTR +RRQGMC L+T +E L + V +L++PA+ EL
Sbjct: 743 ITVGTLRVLGTEVAELPLVGTRFAHRRQGMCHLLVTELEKVLRQVGVRRLVLPAVPELLP 802
Query: 1079 TWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
WT+ GF P+ S ++ ++L F G M K +
Sbjct: 803 MWTASLGFHPMTRSDVMEIAAEHAILSFQGTTMCHKSL 840
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 134/459 (29%), Positives = 220/459 (47%), Gaps = 55/459 (11%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQ-GRIARDGI-RCDCCSEIFTISKFDTHSK 762
TLL+ +ID+G + V YW R + + G I +GI RC CC++ FT+++ + H+
Sbjct: 320 TLLSVLIDSGILAPRGNVTYWAVRDGQPACKYGFITGEGIIRCKCCNKTFTVAELEAHAT 379
Query: 763 SKLC-----HPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFD--------GEDP 809
+ + ++ E G SL C+++ + D N E+
Sbjct: 380 GGIGTDDRREAWARVFVEDGRSLSLCLMELMRRDDVGAAAANRNRNGSVMRVKEKCSEEE 439
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C D G+L+ CD CPS FH C+ ++ P G W C C C CG +
Sbjct: 440 GDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCRCGVCGG--------S 491
Query: 870 DQDDSALS-----TLQICSLCEEKYHQSC-----SQTDGAVQY-----------EPSSLS 908
D DD T+ C CE +YH C S+ + A ++ E
Sbjct: 492 DLDDDTAEGFTDKTIIYCEQCEREYHVGCMRRGGSEEESAAEWCRRLSESEGPEEEWRPW 551
Query: 909 FCGKKCQEIFERLEKLLG------VKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNA 962
C +C E+F+ L+ L+ + H G Y + R + ++ Q E +A
Sbjct: 552 LCSPECGEVFQHLQALVASSRARSIPHYSRGAYHSAPCGRRRYMSTVTRITRWQHEEEDA 611
Query: 963 ----RLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
+L AL V+ ECF + + R+ +L +I++N S +RLN++G++ LE+ E+
Sbjct: 612 ADHGQLCAALDVLHECFDDMVEPRTQTDLAADIVFNQESGLRRLNFRGYYVVGLEKAGEL 671
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
I+ ++R+ G ++AE+P +GTR +RRQGMCR L+T +E L + V +L++PA+ EL
Sbjct: 672 INVGTLRVFGNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELMP 731
Query: 1079 TWTSVFGFQPLEVSSKQKMR-NMSLLVFPGVDMLQKPMM 1116
WT+ GF + S +M ++L F G M QK ++
Sbjct: 732 MWTASLGFHAMTRSDVMEMAVEHAILSFKGTTMCQKTLL 770
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 220 bits (560), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 133/412 (32%), Positives = 195/412 (47%), Gaps = 75/412 (18%)
Query: 702 GKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS 761
G T+L +++ G V +QY V+ G I + GIRC CC +FT+S F H+
Sbjct: 935 GARTVLGKLLEMGIVCKVNILQYRRPGSKNVLKDGNITKKGIRCRCCDMVFTMSMFKYHA 994
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ P NL+ SG S C L +W+ + +++++ D NDDTCG+
Sbjct: 995 GLRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAKCTMPLQADENDDTCGL----- 1049
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
CG E C
Sbjct: 1050 ------------------------------------CGDGGELIC--------------- 1058
Query: 882 CSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVH 941
C C YHQ C CQ I+ L +G+ G++ +++
Sbjct: 1059 CDNCPASYHQDC------------------LPCQ-IYMNLRSRVGIPIHTIDGFSCTVLR 1099
Query: 942 RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKR 1001
+ +D+ ECN +L +ALS+M+ECFLP+ D R+GI++I ILYN+ S+F
Sbjct: 1100 NNGDQRVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILYNWRSDFVH 1159
Query: 1002 LNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALC 1061
L+YKGF+T +LE DD IIS ASIR+HG +AEMP I T R+QGMCRRL+ IE L
Sbjct: 1160 LDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLK 1219
Query: 1062 SLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
SL VE L++ AI L +TWT FGF P++ ++ + + L+ PG +L++
Sbjct: 1220 SLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLRLVSVPGTVLLKR 1271
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 207 bits (528), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 132/451 (29%), Positives = 226/451 (50%), Gaps = 60/451 (13%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDG-IRCDCCSEI----------FT 753
TLL+ +ID + +++ Y +R G I DG ++C C I +T
Sbjct: 283 TLLSLLIDKKILAPRDQLIYTTKR-------GLITGDGMVKCMCGGCINNNNKRRVAEYT 335
Query: 754 ISKFDTHSKSKLC-----HPFQNLYFESGSSLLQCILDSWNKQDES----KRKGFHFVNF 804
+++F H + P+ ++ G SL QC++ DE+ K+K ++ +
Sbjct: 336 VAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKKYLPY 395
Query: 805 ---------DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSC 855
E+ +D C +C D G+L+ CD CPS FH C+ ++ P G W C C+C
Sbjct: 396 VWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPACTC 455
Query: 856 QFCGR--INESTCHVNDQDDSALSTLQICSLCEEKYHQSC-SQTDGAVQYEPS---SLSF 909
CG +++ Q S+ + C C +YH C + D + Y +
Sbjct: 456 AICGSSDLDDPPATTTTQGFSSDRMVISCEQCRREYHVGCMRERDNGLWYPEADGEGPWL 515
Query: 910 CGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKV------ECNAR 963
C + C +I+ RLE+L V+ V++ LSL + + E +A+
Sbjct: 516 CSEACSKIYLRLEELAVVQAPCR-----------SVASGLSLVVLRRGAARDGEEEEHAK 564
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
L +AL V+ ECF+ L + R+ +L +I++N S +RL+++GF+ LE+ E+I+ A+
Sbjct: 565 LCMALDVLRECFVTLIEPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVAT 624
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS- 1082
+R++G+E+AE+P +GTR RRQGMCR L+ I+ L + VE+L++PA+ E+ TWT
Sbjct: 625 LRVYGEEVAEVPLVGTRFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGP 684
Query: 1083 VFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
FGF+ + + +Q + + ++L F G M K
Sbjct: 685 SFGFREMGQADRQDVAHHAILRFQGTIMCHK 715
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 183 bits (465), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 129/459 (28%), Positives = 210/459 (45%), Gaps = 73/459 (15%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ + + +L G GI C CC+ + S+F+ H+ + P+ ++Y
Sbjct: 518 LPDGTEVAYYARGQK--LLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYT 575
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL ++ S KG F D NDD C IC DGGDL+CCDGCP +FH
Sbjct: 576 SNGVSL--------HELSISLSKGRKFSLTD----NDDLCSICADGGDLLCCDGCPRSFH 623
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINES 864
++C+ ++ P+G W+C YC F C RI ++
Sbjct: 624 RDCVPLQCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGVDPIEQITTRCIRIVKT 683
Query: 865 T---------CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
C +D S T+ +C CE+++H C + + ++ P FC
Sbjct: 684 MEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQGKWFCC 743
Query: 912 KKCQEIFERLEKLLGVKHD-LEGGYTWSLVHRFDVSTDLSLSDV-----------CQKVE 959
+C I LEKL+ + + L S+ + + S++DV E
Sbjct: 744 PECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDVEIRWRVLNWKMLSSDE 803
Query: 960 CNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEII 1019
+ L+ A+S+ +CF P+ D SG + I ++LY G N + + G + A+L ++ ++
Sbjct: 804 TRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLY--GRNIRGQEFGGIYCAVLTVNESVV 861
Query: 1020 SAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRET 1079
S RI G E+AE+P + T ++ QG + L IE L LNV+ L++PA E
Sbjct: 862 SVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADEAESL 921
Query: 1080 WTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMK 1117
W + FGF L + R+ +++F G MLQK + K
Sbjct: 922 WINKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEVPK 960
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 183 bits (465), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 136/473 (28%), Positives = 212/473 (44%), Gaps = 80/473 (16%)
Query: 706 LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SK 764
L + + +P ++ Y+ + +L G GI C CC+E + S+F+ H+ +
Sbjct: 547 LHKLVFEADVLPDGTELAYYAHGQK--LLVGYKKGCGIFCTCCNEQVSASQFEAHAGWAS 604
Query: 765 LCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLI 824
P+ ++Y +G SL + S + + +R F N D NDD C IC DGGDL+
Sbjct: 605 RRKPYLHIYTSNGISLHEL---SISLSKDHRR----FSNND----NDDLCIICEDGGDLL 653
Query: 825 CCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF---------------CGRI-------- 861
CCDGCP FH +C+ + PSG W+C YC F GRI
Sbjct: 654 CCDGCPRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILEL 713
Query: 862 --------------NESTCHVNDQDDSALS----TLQICSLCEEKYHQSCSQTDGAVQYE 903
+ C + + + + S T+ IC CE++YH C + E
Sbjct: 714 MNKRCIRVVKTVEVDHGGCALCSRPNFSKSFGPRTVIICDQCEKEYHVGCLKEHNMENLE 773
Query: 904 --PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHR------FDVSTDLSL---- 951
P FC C I L L+ K SL+ + ++ L +
Sbjct: 774 KLPEGNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEKSLEIGAGLDVKWRV 833
Query: 952 ------SDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYK 1005
SD VE L+ A+++ E F P+ D SG + I +L FG N + ++
Sbjct: 834 MNWKLDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTML--FGRNIRGQDFS 891
Query: 1006 GFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNV 1065
G + A+L + +I+SA R+ G E+AE+P + T ++ QG + L + IE+ L SLNV
Sbjct: 892 GIYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNV 951
Query: 1066 EKLIIPAISELRETWTSVFGFQPL---EVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+ L++PA E WT FGF L E++ +K M ++F G +LQKP+
Sbjct: 952 KNLVLPAADEAESIWTGKFGFTKLPQDEINKYKKFYRM--MIFQGTSVLQKPV 1002
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 183 bits (465), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 128/459 (27%), Positives = 208/459 (45%), Gaps = 73/459 (15%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ + + +L G GI C CC+ + S+F+ H+ + P+ ++Y
Sbjct: 518 LPDGTEVAYYARGQK--LLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYT 575
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL ++ S KG F D NDD C IC DGGDL+CCDGCP +FH
Sbjct: 576 SNGVSL--------HELSISLSKGRKFSLTD----NDDLCSICADGGDLLCCDGCPRSFH 623
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINES 864
++C+ + P+G W+C YC F C RI ++
Sbjct: 624 RDCVPLPCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGVDPIEQITTRCIRIVKT 683
Query: 865 T-----------CHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
CH + T+ +C CE+++H C + + ++ P FC
Sbjct: 684 MEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQGKWFCC 743
Query: 912 KKCQEIFERLEKLLGVKHD-LEGGYTWSLVHRFDVSTDLSLSDV-----------CQKVE 959
+C I LEKL+ + + L S+ + + S++DV E
Sbjct: 744 PECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDVEIRWRVLNWKMLSSDE 803
Query: 960 CNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEII 1019
+ L+ A+S+ +CF P+ D SG + I ++LY G N + + G + A+L ++ ++
Sbjct: 804 TRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLY--GRNIRGQEFGGIYCAVLTVNESVV 861
Query: 1020 SAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRET 1079
S RI G E+AE+P + T ++ QG + L IE L LNV+ L++PA E
Sbjct: 862 SVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADEAESL 921
Query: 1080 WTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMK 1117
W + FGF L + R+ +++F G MLQK + K
Sbjct: 922 WINKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEVPK 960
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 183 bits (465), Expect = 7e-43, Method: Composition-based stats.
Identities = 112/388 (28%), Positives = 180/388 (46%), Gaps = 62/388 (15%)
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYF--ESGSSLLQC------ 783
++ +G + DG+ C CC E+F +S F+ H+ SKL P N++ E+ S+ C
Sbjct: 1 LLKEGMVTSDGLVCSCCDELFNLSGFEAHTGSKLRRPAANIFVGDEAQLSIADCQNVAFK 60
Query: 784 --ILDSWNKQDESKRKGFHF------------------VNFDGEDPNDDTCGICGDGGDL 823
L+S ++R+ F V+++ +D CGIC +GG+L
Sbjct: 61 METLESLPGLPVARRRKFDSYCQSDEDSGLTTVSSGSDVDYEAAANSDQCCGICNEGGEL 120
Query: 824 ICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICS 883
+CC+ CP TFH C+ + + P W C C C CG L T Q C
Sbjct: 121 VCCETCPLTFHMECVSLLEVPKDAWFCFRCLCCHCGE--------------PLRT-QPCE 165
Query: 884 LCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDL-EGGYTWSLVHR 942
CE +H C D A+ FC C +F+RL +++ + L +WSL+ R
Sbjct: 166 QCERCFHPGC--CDDAIL--AGDFFFCSSGCWNLFQRLAEMVATVNPLGRSELSWSLLRR 221
Query: 943 FDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL 1002
L LA AL ++ F P+ D + ++ + ++++ + RL
Sbjct: 222 GRCDDKL--------------LAEALQLISSRFDPVLDCWTQLDYLDAMVFSRSHHSPRL 267
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
++ GF+TA+L+R E++ A +RIH LAEMPFI T+ QG+CR L T +E L
Sbjct: 268 DFSGFYTAVLQRGAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLAR 327
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPLE 1090
L VE + + A + + W + F F ++
Sbjct: 328 LGVETMALLAAKDTEKMWKNSFEFHAVD 355
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 182 bits (462), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 138/471 (29%), Positives = 209/471 (44%), Gaps = 79/471 (16%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKL 765
+ + G +P ++ Y+ + + +L G GI C CC + S F+ H S+ K
Sbjct: 527 VFERGGLPEGTELGYYARGQK--LLGGYKMGAGIYCYCCKSEVSPSLFEAHAGWASRRK- 583
Query: 766 CHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLIC 825
P+ +Y +G SL + W RK + D N+D C IC DGG+L+
Sbjct: 584 --PYFYIYTSNGVSLHE-----WATTFSQGRK------YSAND-NNDLCVICADGGNLLL 629
Query: 826 CDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF---------------------------- 857
CD CP FH C+ + P G WHC YC +F
Sbjct: 630 CDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQSA 689
Query: 858 --CGRINEST---------CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE- 903
C R+ ++ C +D S T+ IC CE++YH C + V +
Sbjct: 690 GRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKE 749
Query: 904 -PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDL-SLSD-------V 954
P FC C I L+KLL + + ++ R TD+ S+SD +
Sbjct: 750 LPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSLGIIQRKQERTDVYSISDLDIRWRLI 809
Query: 955 CQKV---ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAI 1011
KV E L+ AL++ +CF P+ D SG NLI ++Y G + +Y G A+
Sbjct: 810 SGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGRNLIPRMVY--GKTMQGQDYGGICCAV 867
Query: 1012 LERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIP 1071
L + ++SA +R+ G+E+AE+P + TR R +G + L + IE L SLNVE +++P
Sbjct: 868 LTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVP 927
Query: 1072 AISELRETWTSVFGFQPL--EVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
A E W + FGF+ L E SK ++ F G MLQKP+ +Q
Sbjct: 928 AAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVHAHQI 978
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 181 bits (459), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 161/285 (56%), Gaps = 23/285 (8%)
Query: 840 IKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA 899
+++ P G+W C +C C CG+ N+ + QD LS C C K+H +C+++ G
Sbjct: 13 MEEVPDGEWFCPFCCCNICGQ-NKLLDNDVQQDGFILS----CDQCPRKFHVACARSRGL 67
Query: 900 VQYEPSSLS---FCGKKCQEIFERLEKLLGVKHDL-EGGYTWSLVHR-----FDVSTDLS 950
++ E FC KC+ +F L+ LLG + TW+L+ R FD+ LS
Sbjct: 68 IKLERKGTCYSWFCSDKCEYVFSGLQHLLGKSVPVGTDNLTWTLLKRVEPDCFDLEV-LS 126
Query: 951 LSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTA 1010
+ N++L +AL VM ECF P D +G +L+ +++++ GSN RLN+ GF+T
Sbjct: 127 AN--------NSKLKLALEVMHECFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTV 178
Query: 1011 ILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLII 1070
+LER++E+ + A++R+ G ++AE+PF+ T+ YRR GMCR L+ +E L +L VEKL++
Sbjct: 179 LLERNNELTTVANVRVFGDKVAEVPFVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVL 238
Query: 1071 PAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
PA ETW FGF + S K+ + +L F G + QK +
Sbjct: 239 PAAFSTLETWIKGFGFSVMTYSDKKAHSDYPILFFQGTVLCQKFL 283
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 181 bits (458), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 138/469 (29%), Positives = 211/469 (44%), Gaps = 83/469 (17%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
+ + +P ++ Y+ + +L G GI C CC+E + S+F+ H+ + P
Sbjct: 529 VFEADVLPDGTELAYYAHGQK--LLVGYKKGYGIFCTCCNEQVSASQFEAHAGWASRRKP 586
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
+ ++Y +G SL + S + + +R F N D NDD C IC DGGDL+CCDG
Sbjct: 587 YLHIYTSNGISLHEL---SISLSKDHRR----FSNND----NDDLCIICEDGGDLLCCDG 635
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQF---------------CGRI---------NES 864
CP FH +C+ + PSG W+C YC F GRI N+
Sbjct: 636 CPRAFHIDCVPLPCIPSGTWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELMNKR 695
Query: 865 TCHVN-----DQDDSAL------------STLQICSLCEEKYHQSCSQTDGAVQYE--PS 905
V D AL T+ IC CE++YH C + E P
Sbjct: 696 CIRVVRTLEVDHGGCALCSRPNFSKSFGPQTVIICDQCEKEYHVGCLKDHNMENLEELPV 755
Query: 906 SLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHR------FDVSTDLSL-------- 951
FC C +I L L+ K +L+ + D+ L +
Sbjct: 756 GNWFCSGNCSQIHTALMDLVASKEKDVPDPLLNLIKKKHEEKSLDIGAGLDVKWRVINWK 815
Query: 952 --SDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFT 1009
SD VE L+ A+++ E F P+ D SG + I +L FG N + ++ G +
Sbjct: 816 LDSD---SVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPAML--FGRNIRGQDFSGIYC 870
Query: 1010 AILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLI 1069
A+L + +I+SA R+ G E+AE+P + T ++ QG + L + IE+ L SLNV+ L+
Sbjct: 871 AVLTVNGDIVSAGVFRVFGLEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLV 930
Query: 1070 IPAISELRETWTSVFGFQPL---EVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+PA E WT FGF L E++ +K M ++F G +LQKP+
Sbjct: 931 LPAADEAESIWTGKFGFTKLPQDEINKYKKFYRM--MIFQGTSVLQKPV 977
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 179 bits (454), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 140/471 (29%), Positives = 208/471 (44%), Gaps = 79/471 (16%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKL 765
+ D G +P ++ Y+ + + +L G GI C CC + S F+ H S+ K
Sbjct: 527 VFDRGGLPEGTELGYYARGQK--LLGGYKMGAGIYCYCCKCEVSPSLFEAHAGWASRRK- 583
Query: 766 CHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLIC 825
P+ +Y +G SL + W RK + D N+D C IC DGG+L+
Sbjct: 584 --PYFYIYTSNGVSLHE-----WATTFSHGRK------YSAND-NNDLCVICADGGNLLL 629
Query: 826 CDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF---------------------------- 857
CD CP FH C+ + P G WHC YC +F
Sbjct: 630 CDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLA 689
Query: 858 --CGRINEST---------CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE- 903
C R+ ++ C +D S T+ IC CE++YH C + V +
Sbjct: 690 GRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKE 749
Query: 904 -PSSLSFCGKKCQEIFERLEKLL--GVK--HDLEGGYTWSLVHRFDVST----DLSLSDV 954
P FC C I L+KLL G + D G + R DV + D+ +
Sbjct: 750 LPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLI 809
Query: 955 CQKV---ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAI 1011
KV E L+ AL++ +CF P+ D SG NLI ++Y G + +Y G A+
Sbjct: 810 SGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVY--GKTMQGQDYGGICCAV 867
Query: 1012 LERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIP 1071
L + ++SA +R+ G+E+AE+P + TR R +G + L + IE L SLNVE +++P
Sbjct: 868 LTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVP 927
Query: 1072 AISELRETWTSVFGFQPL--EVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
A E W + FGF+ L E SK ++ F G MLQKP+ +Q
Sbjct: 928 AAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQI 978
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 176 bits (447), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 136/460 (29%), Positives = 202/460 (43%), Gaps = 82/460 (17%)
Query: 715 TVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLY 773
+P + Y+ + K +L+G I C CC + S+F+ H+ + P+ ++Y
Sbjct: 604 VLPDGTALAYYVRGKK--LLEGYKKGGAIFCYCCQSEVSPSQFEAHAGCASRRKPYSHIY 661
Query: 774 FESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTF 833
+G SL + + K +R D NDD C IC DGGDL+CCD CP F
Sbjct: 662 TSNGVSLHELSI----KLSMERR--------SSSDENDDLCSICADGGDLLCCDNCPRAF 709
Query: 834 HQNCLDIKKFPSGKWHCVYCSCQF-------------CGRINE----------------- 863
H C+ + P G W+C YC F GR+
Sbjct: 710 HTECVSLPNIPRGTWYCKYCENMFLKEKFDRSANAIAAGRVAGIDALEQITKCSIRIVDT 769
Query: 864 --------STCHVNDQDDSAL--STLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
C +D S T+ IC CE++YH C + ++ P FC
Sbjct: 770 LHAEVGVCVLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDDLKELPKDKWFCC 829
Query: 912 KKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVST------DLSLSDVCQKV------- 958
K+C I L+KL+ D E SL+ + D S++DV ++
Sbjct: 830 KECNSIHYALQKLVS---DGEQSLPDSLMGIINEKIKAKNLEDNSINDVKWRLLSGKNST 886
Query: 959 -ECNARLAVALSVMDECFLPLPDHR-SGINLIHNILYNFGSNFKRLNYKGFFTAILERDD 1016
E L+ A+S+ + F P+ D S ++LI ++Y G NFK ++ G AIL +
Sbjct: 887 EETRVWLSGAVSIFHDSFDPIADSSTSRLDLIPTMVY--GRNFKDQDFGGMLCAILMVNS 944
Query: 1017 EIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISEL 1076
++SA IRI GKE+AE+P + T + +G + L IE+ L SL V+ L++PA E
Sbjct: 945 LVVSAGVIRIFGKEVAELPLVATSLDCQGKGYFQSLFYSIENLLKSLGVKYLVLPAAEEA 1004
Query: 1077 RETWTSVFGFQ---PLEVSSKQKMRNMSLLVFPGVDMLQK 1113
WT FGFQ P E+ K N L++F G MLQK
Sbjct: 1005 ESIWTKKFGFQHITPEEL--KHYKDNYQLMIFQGTAMLQK 1042
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 174 bits (440), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 151/313 (48%), Gaps = 34/313 (10%)
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRI 861
V+++ +D CGIC +GG+L+CC+ CP TFH C+ + + P W C C C CG
Sbjct: 260 VDYEAAANSDQCCGICNEGGELVCCETCPLTFHMECVSLLEVPKDAWFCFRCLCCHCGE- 318
Query: 862 NESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERL 921
L T Q C CE +H C D A+ FC C IF+RL
Sbjct: 319 -------------PLRT-QPCEQCERCFHPGC--CDDAIL--AGDFFFCSSGCWNIFQRL 360
Query: 922 EKLLGVKHDL-EGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPD 980
+++ + L +WSL+ R L LA AL V+ F P+ D
Sbjct: 361 AEMVATVNPLGRSELSWSLLRRGRCDDKL--------------LAEALQVISSRFDPVLD 406
Query: 981 HRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTR 1040
+ ++ + ++++ + RL++ GF+TA+L+R E++ A +RIHG LAEMPFI T+
Sbjct: 407 CWTQLDYLDAMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATK 466
Query: 1041 HMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNM 1100
QG+CR L T +E L L VE +++ A + + W + F F ++ K + +
Sbjct: 467 AGMEGQGICRSLFTAVEEMLARLGVEMMVLLAAKDTEKMWKNSFEFHAMDRKLKARTVAL 526
Query: 1101 SLLVFPGVDMLQK 1113
L+ G LQK
Sbjct: 527 GLVALNGAGFLQK 539
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 2/71 (2%)
Query: 705 TLLAWMIDTGTVPLNEKVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK 762
T+ W+I G + EKV Y +R ++ +G + DG+ C CC E+F +S F+ H+
Sbjct: 56 TVFGWLIGLGLIAEQEKVVYIGGTKRFRHLLKEGMVTSDGLVCSCCDELFNLSGFEAHTG 115
Query: 763 SKLCHPFQNLY 773
SKL P N++
Sbjct: 116 SKLRRPAANIF 126
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 144/513 (28%), Positives = 226/513 (44%), Gaps = 97/513 (18%)
Query: 664 PARRHKLQETQQRKRCALLVRNSV-EGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKV 722
P RR K ++ + + A +S+ +++N G D L +M + +P ++
Sbjct: 379 PFRRQKYMDSSEEHKRAFRRPSSLSHPKKTNEGGTRRRDNDLHRLLFMPNG--LPDGAEL 436
Query: 723 QYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLL 781
Y+ K + ML G +GI C CC + S+F+ H+ + P++++Y +G +L
Sbjct: 437 AYY--IKGQKMLAGYKQGNGIVCSCCDREISPSQFEAHAGMAARRQPYRHIYTSNGLTLH 494
Query: 782 QCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIK 841
N Q+ + +DD C CGDGGDLI C+ CP FH CL +K
Sbjct: 495 DIATSLANGQNLTT------------GLSDDMCAECGDGGDLIFCESCPRAFHLVCLGLK 542
Query: 842 KFPSGKWHCVYCS-----------------------------CQFCGRINESTCHVNDQD 872
PS WHC C+ C FC + ST ND+
Sbjct: 543 YVPSDVWHCPNCNKFGHGGNFSRSIVIRLTRVVKTPEYEVGGCVFCRAHDFSTHTFNDR- 601
Query: 873 DSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLL--GV- 927
T+ +C CE ++H C + +G + P FC C I+E L+ + GV
Sbjct: 602 -----TVILCDQCEREFHVGCLRDNGLCDLKEIPKDNWFCSNDCNRIYEALQNFVSSGVQ 656
Query: 928 ------------KHDLEGGYT--------WS-LVHRFDVSTDLSLSDVCQKVECNARLAV 966
KH +G Y W L+ + DLSL L+
Sbjct: 657 MIPSLQLNIITGKHAEKGLYIDGQANDFQWRILMGKSRYQEDLSL------------LSA 704
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
A ++ ECF P+ +SG +LI ++Y G N + G + +L + ++SA +RI
Sbjct: 705 AAAIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQEFGGMYCVLLLVKNVVVSAGLLRI 761
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
G+++AE+P + T ++ +G + L + IE LCSLNV KL++PA E WT FGF
Sbjct: 762 FGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPAAEEAESIWTRRFGF 821
Query: 1087 QPL-EVSSKQKMRNMSLLVFPGVDMLQK--PMM 1116
+ + E Q R + L +F G ML+K P+M
Sbjct: 822 RKMTEEQLSQYTRELQLTIFKGTSMLEKEVPLM 854
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 169 bits (428), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/455 (27%), Positives = 198/455 (43%), Gaps = 76/455 (16%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ KT+ +LQG GI C CCS + S+F+ H+ + P+++++
Sbjct: 386 LPDGTELAYY--VKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFI 443
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
SG SL + N H + +DD C ICGDGGDL+ C GCP FH
Sbjct: 444 SSGLSLHDIAMSLANG---------HVIT---TGDSDDMCSICGDGGDLLLCAGCPQAFH 491
Query: 835 QNCLDIKKFPSGKWHCVYCS--------------------------------------CQ 856
CL + P G W+C C+ C
Sbjct: 492 TACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIGGCV 551
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKC 914
FC + S +D+ T+ +C CE++YH C + +G + P FC C
Sbjct: 552 FCRSHDFSIGKFDDR------TVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNC 605
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRFD----VSTDLS-------LSDVCQKVECNAR 963
I ++ + ++ R D + TD+ LS + E
Sbjct: 606 SRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEHLPL 665
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
L+ A + ECF P+ +SG +LI ++Y G N + G + +L + ++SAA
Sbjct: 666 LSRAAVIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQEFGGMYCLVLIVNSLVVSAAL 722
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
+RI G+E+AE+P + T Y+ +G + L +E+ L SLNVE L++PA E WT
Sbjct: 723 LRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKK 782
Query: 1084 FGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMK 1117
FGF + Q+ + + L +F G ML+K + K
Sbjct: 783 FGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 817
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 167 bits (424), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 164/356 (46%), Gaps = 47/356 (13%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF------------ 857
N+D C IC DGG+L+ CD CP FH C+ + P G WHC YC +F
Sbjct: 576 NNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNS 635
Query: 858 ------------------CGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA 899
C R+ ++ + T+ IC CE++YH C +
Sbjct: 636 SAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGSGFGPRTIIICDQCEKEYHIGCLSSQNI 695
Query: 900 VQYE--PSSLSFCGKKCQEIFERLEKLL--GVK--HDLEGGYTWSLVHRFDVST----DL 949
V + P FC C I L+KLL G + D G + R DV + D+
Sbjct: 696 VDLKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDI 755
Query: 950 SLSDVCQKV---ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKG 1006
+ KV E L+ AL++ +CF P+ D SG NLI ++Y G + +Y G
Sbjct: 756 RWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVY--GKTMQGQDYGG 813
Query: 1007 FFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVE 1066
A+L + ++SA +R+ G+E+AE+P + TR R +G + L + IE L SLNVE
Sbjct: 814 ICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVE 873
Query: 1067 KLIIPAISELRETWTSVFGFQPL--EVSSKQKMRNMSLLVFPGVDMLQKPMMKNQF 1120
+++PA E W + FGF+ L E SK ++ F G MLQKP+ +Q
Sbjct: 874 SIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQI 929
>gi|297805014|ref|XP_002870391.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316227|gb|EFH46650.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 351
Score = 167 bits (424), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 99/282 (35%), Positives = 137/282 (48%), Gaps = 84/282 (29%)
Query: 838 LDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTD 897
L I+ FP G WHC C+C+FC + E CSQT
Sbjct: 140 LHIQMFPHGDWHCPNCTCKFCRAVVEE----------------------------CSQT- 170
Query: 898 GAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQK 957
+FE ++K +GVKH+LE ++WSLVHR +D L
Sbjct: 171 -------------------LFEGVKKYVGVKHELEARFSWSLVHRECTDSDFILR--WTP 209
Query: 958 VECNARLAV---ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILER 1014
C + +L+VMDECFLP+ D RSG
Sbjct: 210 SYCGKQFQAGHSSLTVMDECFLPIIDRRSG------------------------------ 239
Query: 1015 DDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAIS 1074
+ + ++ HG LAEM FIGTRH+YR QGMCRRL + +ES L +L VE L+IPA +
Sbjct: 240 -GKYCTKCPLQFHGNRLAEMQFIGTRHVYRHQGMCRRLFSVVESTLQNLKVELLVIPATA 298
Query: 1075 ELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
+L W S FGF+ +E S K+++R+M+LL FPG+D+LQK ++
Sbjct: 299 DLSHVWISKFGFKYVEDSLKKELRSMNLLAFPGIDVLQKELL 340
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 165 bits (418), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 125/464 (26%), Positives = 207/464 (44%), Gaps = 77/464 (16%)
Query: 715 TVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLY 773
+P +V Y+++ + +L G GI C CC+ + S+F+ H+ + P+ ++Y
Sbjct: 585 VLPDGTEVAYYSRGQK--LLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIY 642
Query: 774 FESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTF 833
+G SL + + SK + F NDD C IC DGGDL+CCD CP +
Sbjct: 643 TSNGVSLHELAISL------SKSRKF------STHQNDDLCQICRDGGDLLCCDVCPRAY 690
Query: 834 HQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINE 863
H++CL + + P+G+W+C +C F C RI +
Sbjct: 691 HKDCLALPEIPTGRWYCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITRRCIRIVK 750
Query: 864 ST---------CHVNDQDD-SALSTLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
+ C +D D T+ +C CE+++H C + ++ P FC
Sbjct: 751 TMDADFGGCVFCRGHDFDKIFGPRTVLLCDQCEKEFHVGCLKDHNMEDLKELPKGNWFCC 810
Query: 912 KKCQEIFERLEKLLGVKHDLEGGYTWSLVHR-------------FDVSTDLSLSDVCQKV 958
C I LEKL+ + + +L+++ DV L +
Sbjct: 811 SDCCRIHSALEKLVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLLNDKINPAG 870
Query: 959 ECNARLAVALSVMDECFLPL----PDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILER 1014
+ A L+ AL+++ E F P+ ++ +LI +++ FG N K + G + A+L
Sbjct: 871 DTAALLSEALAILHEQFNPILVAGTSSKADRDLITSMV--FGDNLKGQEFGGMYCAVLMI 928
Query: 1015 DDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAIS 1074
+ ++S A IR G ELAE+P + T + +G + L T IE L LN++ L++PA
Sbjct: 929 NQAVVSCAIIRFFGLELAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNLVLPAAE 988
Query: 1075 ELRETWTSVFGFQPLEVSSKQKMR-NMSLLVFPGVDMLQKPMMK 1117
E W + FGF+ L K R + ++VF G ML KP+ K
Sbjct: 989 EAESIWINKFGFRKLTHEEFLKFRKDYQMMVFQGTSMLHKPVPK 1032
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 165 bits (418), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 136/477 (28%), Positives = 212/477 (44%), Gaps = 98/477 (20%)
Query: 715 TVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLY 773
+P +V Y++Q K +L G GI C CC+ + S+F+ H+ + P+ ++Y
Sbjct: 493 VLPDGTEVGYYSQGKK--LLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIY 550
Query: 774 FESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTF 833
+G SL + + + S ++ NDD C IC DGG L+CCD CP F
Sbjct: 551 TSNGVSLHELAISLSKCRRHSTKE------------NDDLCQICRDGGKLLCCDVCPRAF 598
Query: 834 HQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINE 863
HQ CL + P GKW+C YC F C RI +
Sbjct: 599 HQECLSLPSIPKGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGTDPIEQITRRCIRIVK 658
Query: 864 ST---------CHVNDQDDS-ALSTLQICSLCEEKYHQSC-----SQTDGAVQYEPSSLS 908
+ C +D + + T+ IC CE+++H C Q A+ P+
Sbjct: 659 TFEAEVGGCVFCRGHDFERTFGPRTVIICDQCEKEFHVGCLKEHQMQDLKAICELPTGKW 718
Query: 909 FCGKKCQEIFERLEKL---------------LGVKHDLE-----GG--YTWSLVHRFDVS 946
FC C+ I L+KL + KH+ GG W L+ +
Sbjct: 719 FCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKKHEESASESGGGDDIRWRLLSK---K 775
Query: 947 TDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLI-HNILYNF--GSNFKRLN 1003
TD S DV + + L+ A+++ E F P+ +S H+ + + G + K +
Sbjct: 776 TDPS--DVTESL-----LSEAVAIFHERFAPITVDKSKRKRDDHDFIPSMVKGGDMKGQD 828
Query: 1004 YKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSL 1063
G + A+L + E++SAA +RI G+ELAE+P + T + QG + L T IE L L
Sbjct: 829 LGGMYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSKSQGQGYFQTLFTCIEKLLGFL 888
Query: 1064 NVEKLIIPAISELRETWTSVFGFQPL---EVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
NV+ L++PA E+ WT+ FGF + EV + ++ ++ F G MLQKP+ K
Sbjct: 889 NVKNLVLPAAEEVESIWTNKFGFSTITQDEVRLMEYRKSYQIMEFQGSLMLQKPVPK 945
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 135/507 (26%), Positives = 229/507 (45%), Gaps = 73/507 (14%)
Query: 657 SHNGMSIPARRHKLQE--TQQRKRCA-----LLVRNSVEGEESNGDGFVAYDGKWTLLAW 709
S G +P ++ +L + TQ+RK A + + ++ ++S+ D L +
Sbjct: 406 SFTGSRLPLKQKELMKEMTQERKHAAKKPSSYIYGSGLQHKKSSEGAIKKRDNDLHRLLF 465
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
M + +P ++ Y+ K + +L G +GI C C + S+F+ H+ + P
Sbjct: 466 MPNG--LPDGAELAYY--VKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGWAARRQP 521
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
++++Y +G +L + N Q+ + +DD C +CGDGGDLI CDG
Sbjct: 522 YRHIYTSNGLTLHDIAISLANGQNCTT------------GDSDDMCTLCGDGGDLILCDG 569
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG--------RINEST-----------CHVN 869
CP FH CL+++ P G W C C FC RI + C V
Sbjct: 570 CPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKVARPIRIQLTRAVKAPESEIGGCVVC 629
Query: 870 DQDDSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLE 922
D ++S T+ +C CE+++H C + G + P FC C + L+
Sbjct: 630 RAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWFCCDDCSRVHVALQ 689
Query: 923 KLLGVKHDLEGGYTWSLVHRFDVSTDL---SLSDV---------CQKVECNARLAVALSV 970
L ++ S+++R ++ L + D+ C K E L+ ++
Sbjct: 690 NLASRGPEMIPASVSSMINRKNLEKGLIDGAADDIQWCILSGKSCYK-EHLPLLSRTTAI 748
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
ECF P+ SG +LI ++Y G N + G + +L ++SA IR+ G+E
Sbjct: 749 FRECFDPIVA-SSGRDLIPVMVY--GRNISGQEFGGMYCVVLLAKSTVVSAGLIRVFGQE 805
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLE 1090
+AE+P + T ++ +G R L + IE L SL V+ L++PA E WT+ GFQ +
Sbjct: 806 VAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEAIWTNKLGFQKM- 864
Query: 1091 VSSKQKM----RNMSLLVFPGVDMLQK 1113
S+++M R + L +F G ML+K
Sbjct: 865 --SEERMLKYTRELQLTIFKGTSMLEK 889
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 164 bits (416), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 135/507 (26%), Positives = 229/507 (45%), Gaps = 73/507 (14%)
Query: 657 SHNGMSIPARRHKLQE--TQQRKRCA-----LLVRNSVEGEESNGDGFVAYDGKWTLLAW 709
S G +P ++ +L + TQ+RK A + + ++ ++S+ D L +
Sbjct: 351 SFTGSRLPLKQKELMKEMTQERKHAAKKPSSYIYGSGLQHKKSSEGAIKKRDNDLHRLLF 410
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
M + +P ++ Y+ K + +L G +GI C C + S+F+ H+ + P
Sbjct: 411 MPNG--LPDGAELAYY--VKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGWAARRQP 466
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
++++Y +G +L + N Q+ + +DD C +CGDGGDLI CDG
Sbjct: 467 YRHIYTSNGLTLHDIAISLANGQNCTT------------GDSDDMCTLCGDGGDLILCDG 514
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG--------RINEST-----------CHVN 869
CP FH CL+++ P G W C C FC RI + C V
Sbjct: 515 CPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKVARPIRIQLTRAVKAPESEIGGCVVC 574
Query: 870 DQDDSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLE 922
D ++S T+ +C CE+++H C + G + P FC C + L+
Sbjct: 575 RAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWFCCDDCSRVHVALQ 634
Query: 923 KLLGVKHDLEGGYTWSLVHRFDVSTDL---SLSDV---------CQKVECNARLAVALSV 970
L ++ S+++R ++ L + D+ C K E L+ ++
Sbjct: 635 NLASRGPEMIPASVSSMINRKNLEKGLIDGAADDIQWCILSGKSCYK-EHLPLLSRTTAI 693
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
ECF P+ SG +LI ++Y G N + G + +L ++SA IR+ G+E
Sbjct: 694 FRECFDPIVA-SSGRDLIPVMVY--GRNISGQEFGGMYCVVLLAKSTVVSAGLIRVFGQE 750
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLE 1090
+AE+P + T ++ +G R L + IE L SL V+ L++PA E WT+ GFQ +
Sbjct: 751 VAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEAIWTNKLGFQKM- 809
Query: 1091 VSSKQKM----RNMSLLVFPGVDMLQK 1113
S+++M R + L +F G ML+K
Sbjct: 810 --SEERMLKYTRELQLTIFKGTSMLEK 834
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 164 bits (414), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 125/455 (27%), Positives = 198/455 (43%), Gaps = 76/455 (16%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ KT+ +LQG GI C CCS + S+F+ H+ + P+++++
Sbjct: 91 LPDGTELAYY--VKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFI 148
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
SG SL + N H + +DD C ICGDGGDL+ C GCP FH
Sbjct: 149 SSGLSLHDIAMSLANG---------HVIT---TGDSDDMCSICGDGGDLLLCAGCPQAFH 196
Query: 835 QNCLDIKKFPSGKWHCVYCS--------------------------------------CQ 856
CL + P G W+C C+ C
Sbjct: 197 TACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIGGCV 256
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKC 914
FC + S +D+ T+ +C CE++YH C + +G + P FC C
Sbjct: 257 FCRSHDFSIGKFDDR------TVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNC 310
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRFD----VSTDLS-------LSDVCQKVECNAR 963
I ++ + ++ R D + TD+ LS + E
Sbjct: 311 SRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEHLPL 370
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
L+ A + ECF P+ +SG +LI ++Y G N + G + +L + ++SAA
Sbjct: 371 LSRAAVIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQEFGGMYCLVLIVNSLVVSAAL 427
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
+RI G+E+AE+P + T Y+ +G + L +E+ L SLNVE L++PA E WT
Sbjct: 428 LRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKK 487
Query: 1084 FGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMK 1117
FGF + Q+ + + L +F G ML+K + K
Sbjct: 488 FGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 522
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 164 bits (414), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 153/314 (48%), Gaps = 47/314 (14%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
E+ D C +C DGG+L+ CD CPS FH C+ ++ P G W C C C CG
Sbjct: 560 EEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCRCGVCGG------ 613
Query: 867 HVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG 926
+D DD D A + ++ +C +
Sbjct: 614 --SDLDD-----------------------DTAEGFTDKTIIYCEAR------------S 636
Query: 927 VKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNA---RLAVALSVMDECFLPLPDHRS 983
+ +EG +L R +ST ++ + E A +L AL V+ ECF + + R+
Sbjct: 637 IPTTVEGVSLSTLRRRRYMSTVTRITRWQHEEEDAADHGQLCAALDVLHECFDDMVEPRT 696
Query: 984 GINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMY 1043
+L +I++N S RLN++G++ LE+ E+I+ ++R+ G ++AE+P +GTR +
Sbjct: 697 QTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVFGNQVAELPLVGTRFAH 756
Query: 1044 RRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMR-NMSL 1102
RRQGMCR L+T +E L + V +L++PA+ EL WT+ GF + S +M ++
Sbjct: 757 RRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELLPMWTASLGFHAMTRSDVMEMAVEHAI 816
Query: 1103 LVFPGVDMLQKPMM 1116
L F G M QK ++
Sbjct: 817 LSFKGTTMCQKTLL 830
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 164 bits (414), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 131/466 (28%), Positives = 198/466 (42%), Gaps = 80/466 (17%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
+ + G +P +V Y+ + + +L G GI C CC+ + S F+ H+ + P
Sbjct: 394 VFEDGGLPDGTEVAYYARGQK--LLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKP 451
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
+ +Y +G SL + + + S R NDD C +C DGG LI CDG
Sbjct: 452 YAYIYTSNGVSLHELAISLSKGRKYSARD------------NDDLCIVCADGGSLILCDG 499
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------C 858
CP FH+ C + P GKW C +C F C
Sbjct: 500 CPRAFHKGCASLSSIPRGKWFCQFCQNMFQREKFVEHNANAVAAGRISGVDPIEQITQRC 559
Query: 859 GRINEST---------CHVNDQDDSALS--TLQICSLCEEKYHQSC--SQTDGAVQYEPS 905
RI ++ C D S T+ +C C +++H C S ++ P
Sbjct: 560 IRIVKNIEAELTGCVLCRGYDFSRSGFGPRTIILCDQCGKEFHVGCLRSHKIANLKELPK 619
Query: 906 SLSFCGKKCQEIFERLEKLLG--------------VKHDLEGGYTWSLVHRFDVSTDLSL 951
FC C I L+KLL +K + E G V+ DV L L
Sbjct: 620 GKWFCCPDCGRIHSALKKLLAREAEIIPNKLLEVVMKKNEEKGL--ETVNNIDVRWKL-L 676
Query: 952 SDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAI 1011
+ E L+ AL++ ECF P+ D +G +LI ++Y G N K +Y G + A+
Sbjct: 677 TGKSASPETKLLLSQALAIFQECFDPIVD-TTGRDLIPLMVY--GKNSKGQDYGGMYCAV 733
Query: 1012 LERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIP 1071
L + ++SAA +RI G+E+AE+P + T + +G + L + IE L L V +++P
Sbjct: 734 LMVNSFVVSAAIVRIFGQEVAELPLVATSNGNHGKGYFQLLFSFIEKLLAYLKVHSIVLP 793
Query: 1072 AISELRETWTSVFGFQPLEVSSKQKMRN--MSLLVFPGVDMLQKPM 1115
A E WT FGFQ ++ K R +L F G MLQK +
Sbjct: 794 AAEEAESIWTDKFGFQKIKPDQLSKYRKSCCQILTFKGTSMLQKAV 839
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 163 bits (412), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 117/441 (26%), Positives = 195/441 (44%), Gaps = 56/441 (12%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ K + +L G +GI C CC + S+F++H+ S P++++Y
Sbjct: 441 LPDGTELAYY--VKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGMSARRQPYRHIYT 498
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+ +L + N Q+ + G DD C CGDGGDL+ C CP FH
Sbjct: 499 SNRLTLHDIAISLANGQNITTGIG------------DDMCAECGDGGDLMFCQSCPRAFH 546
Query: 835 QNCLDIKKFPSGKWHCVYCSCQFCG---------RINE------------STCHVNDQDD 873
CLD+ P G WHC C+ G R+ + C +D
Sbjct: 547 AACLDLHDTPEGAWHCPNCNKLGHGGNFARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSG 606
Query: 874 SALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKH 929
T+ +C CE+++H C + G + P FC + C I+ L +
Sbjct: 607 DTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGV 666
Query: 930 DLEGGYTWSLVHRFDVSTDLSLSDVCQKVECN------------ARLAVALSVMDECFLP 977
+ ++R V L + + V+ + L+ A ++ ECF P
Sbjct: 667 QTIPVSLLNTINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIFRECFDP 726
Query: 978 LPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFI 1037
+ ++G +LI ++Y G N + G + +L ++SA +RI G+E+AE+P +
Sbjct: 727 IVA-KTGRDLIPVMVY--GRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLV 783
Query: 1038 GTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM 1097
T ++ +G + L + IE LCSLNVE+L++PA E WT FGF+ + K
Sbjct: 784 ATNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAESIWTRRFGFRKMSEGQLLKY 843
Query: 1098 -RNMSLLVFPGVDMLQKPMMK 1117
R L +F G ML+K +++
Sbjct: 844 TREFQLTIFKGTSMLEKEVLR 864
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 163 bits (412), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 209/468 (44%), Gaps = 82/468 (17%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCS-EIFTISKFDTHSK-SKLCH 767
+ + G +P ++ Y+ + + +L G GI C CC+ E+ + S F+ H+ +
Sbjct: 275 VFEEGGLPDGTELAYYARGQK--LLGGYKRGFGILCHCCNCEVVSPSTFEAHAGWATRKK 332
Query: 768 PFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCD 827
P+ +Y +G SL D +S++ + +D NDD C IC DGGDL+ CD
Sbjct: 333 PYACIYTSNGVSLH----DLAISLSKSRK-------YSSQD-NDDLCIICADGGDLLLCD 380
Query: 828 GCPSTFHQNCLDIKKFPSGKWHCVYCSCQF--------------CGRINE---------- 863
GCP FH+ C + PSG W+C +C F GR++E
Sbjct: 381 GCPRAFHKGCASLSTVPSGDWYCQHCQNTFQREKFVEHNANAFAAGRVSEIDSIEQITKR 440
Query: 864 ---------------STCHVNDQDDSALS--TLQICSLCEEKYHQSC--SQTDGAVQYEP 904
+ C D S T+ +C CE+++H C S ++ P
Sbjct: 441 CFRIVKNVEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMANLKELP 500
Query: 905 SSLSFCGKKCQEIFERLEKLL---------------GVKHDLEGGYTWSLVHRFDVSTDL 949
FC C I L+KLL KH+ +G ++ + DV L
Sbjct: 501 KGNWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKG---LNISNSIDVRWTL 557
Query: 950 SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFT 1009
LS E L+ ALS+ ECF P+ D G +LI ++Y G N K +Y G +
Sbjct: 558 -LSGKIASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVY--GKNSKGQDYGGMYC 614
Query: 1010 AILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLI 1069
A+L + I+SA +R+ G+E+AE+P + TR+ +G + L + IE L LNV+ L+
Sbjct: 615 AVLIVNSCIVSAGILRVFGEEVAELPLVATRNGDHGKGYFQLLFSCIEKLLAFLNVQNLV 674
Query: 1070 IPAISELRETWTSVFGFQPL--EVSSKQKMRNMSLLVFPGVDMLQKPM 1115
+PA E W FGFQ + E SK + ++ F G MLQK +
Sbjct: 675 LPAAEEAESIWIEKFGFQKIKPEQLSKYRKNCCQMVRFEGTSMLQKAV 722
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 162 bits (411), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/456 (26%), Positives = 196/456 (42%), Gaps = 76/456 (16%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ KT+ +L G GI C CCS + S+F+ H+ + P+++++
Sbjct: 416 LPDGTELAYY--VKTQKLLHGYKQGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFI 473
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
SG SL + N H + +DD C ICGDGGDL+ C GCP FH
Sbjct: 474 SSGLSLHDIAMSLANG---------HVIT---TGDSDDMCSICGDGGDLLLCAGCPQAFH 521
Query: 835 QNCLDIKKFPSGKWHCVYCS--------------------------------------CQ 856
CL + P G W+C C+ C
Sbjct: 522 TACLKFQSVPEGTWYCSSCNDGPISSKKATATDPSGNARPIVIRLSRVVKAPESEIGGCV 581
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKC 914
FC + S +D+ T+ +C CE++YH C + +G + P FC C
Sbjct: 582 FCRSHDFSIGKFDDR------TVILCDQCEKEYHVGCLRENGLCDLKEIPQEKWFCCSDC 635
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRFD----VSTDLS-------LSDVCQKVECNAR 963
I ++ + ++ R D + TD LS + E
Sbjct: 636 SRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKGIFTDNGDIVEWRILSGKSRYPEHLPL 695
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
L+ A + ECF P+ +SG +LI ++Y G N + G + +L + ++SAA
Sbjct: 696 LSRAAVIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQEFGGMYCLVLIVNSLVVSAAL 752
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
+RI G+++AE+P + T Y+ +G + L +E+ L SLNVE L++PA E WT
Sbjct: 753 LRIFGQQVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKK 812
Query: 1084 FGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMKN 1118
FGF + Q+ + + L +F G ML+K + K
Sbjct: 813 FGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPKT 848
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 162 bits (411), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 130/461 (28%), Positives = 203/461 (44%), Gaps = 85/461 (18%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKLCHPFQN 771
+P +V Y+ + + +L+G GI C CC+ + S+F+ H S+ K P+
Sbjct: 295 LPNGAEVAYYARGQK--LLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRK---PYAF 349
Query: 772 LYFESGSSLLQ-CILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCP 830
+Y +G SL + I S + + +K+ ND C +C DGG+L+ CDGCP
Sbjct: 350 IYTSNGVSLHELAIFLSKDHKCTTKQ-------------NDYVCVVCWDGGNLLLCDGCP 396
Query: 831 STFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGR 860
FH+ C + P G+W+C C F C R
Sbjct: 397 RAFHKECASVSSIPRGEWYCQICQHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIR 456
Query: 861 INEST---------CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE---PSS 906
I + C +D S T+ IC CE++YH C + D + Y P
Sbjct: 457 IVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLR-DHKMAYLKELPEG 515
Query: 907 LSFCGKKCQEIF-----------ERL-EKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDV 954
FC C I ERL E LL V + G ++ DV L L+
Sbjct: 516 DWFCCNDCTRIHSTLENLLIRVAERLPESLLDVIKKKQVGRCLEPLNEIDVRWKL-LNGK 574
Query: 955 CQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILER 1014
E L A+S+ ECF P+ D +G +LI ++Y G N + ++ G + A+L
Sbjct: 575 IASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVY--GRNLQTQDFGGMYCALLIV 632
Query: 1015 DDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAIS 1074
+ ++SA +RI G+++AE+P + TR+ R +G + L IE L LNV+ L++PA
Sbjct: 633 NSSVVSAGMVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAE 692
Query: 1075 ELRETWTSVFGFQPLEVS--SKQKMRNMSLLVFPGVDMLQK 1113
E WT FGF ++ + + +M ++ F G +ML K
Sbjct: 693 EAASIWTEKFGFSKMKPNQLTNYRMNCHQIMAFKGTNMLHK 733
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 162 bits (409), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/446 (28%), Positives = 201/446 (45%), Gaps = 74/446 (16%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ K + +L G +GI C CC + S+F+ H+ + P++++Y
Sbjct: 472 LPDGAELAYY--VKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYA 529
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G +L L N Q N D +DD C +CGDGGDLI C+GCP FH
Sbjct: 530 SNGLTLHDIALSLANGQ-----------NLTTGD-SDDMCAVCGDGGDLILCNGCPRAFH 577
Query: 835 QNCLDIKKFPSGKWHCVYC-----------------------------SCQFCGRINEST 865
CL + P WHC+ C C C R N+ +
Sbjct: 578 AACLGLHSVPESGWHCLNCEDNTGDERGARPIMIRLTRVDKEPEYEVGGCVVC-RANDFS 636
Query: 866 CHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLE- 922
V+ DD T+ IC CE++YH C + G + E P FC C I+ L+
Sbjct: 637 --VDKFDDR---TVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQN 691
Query: 923 --------------KLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVAL 968
+L+ KH+ G T+ ++ D+ + LS + E L+ A
Sbjct: 692 SVSAGADTIPSSLSELIIRKHEDRGLCTYGDMN--DIQWRI-LSGKSRYAEHLPLLSRAA 748
Query: 969 SVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHG 1028
++ ECF P+ SG +LI ++Y G N + G + +L + ++SA +RI G
Sbjct: 749 AIFRECFDPIV-AISGRDLIPVMVY--GRNISGQEFGGMYCIVLIVNSIVVSAGLLRIFG 805
Query: 1029 KELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQP 1088
+ +AE+P + T ++ +G + L + IE L SLNVEKL++PA + WT GF
Sbjct: 806 RNIAELPLVATSREHQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFHK 865
Query: 1089 L-EVSSKQKMRNMSLLVFPGVDMLQK 1113
+ E + ++ + L +F +L+K
Sbjct: 866 MSEDQLTKHLKEVQLTLFNKTSVLEK 891
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 162 bits (409), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/460 (27%), Positives = 201/460 (43%), Gaps = 83/460 (18%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKLCHPFQN 771
+P +V Y+ + + +L+G R GI C CC+ + S+F+ H S+ K P+
Sbjct: 303 LPNGAEVAYYARGQK--LLEGIKTRCGIVCRCCNTEVSPSQFEVHAGWASRRK---PYAY 357
Query: 772 LYFESGSSLLQ-CILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCP 830
+Y +G SL + I S + + +K+ ND C +C DGG+L+ CDGCP
Sbjct: 358 IYTSNGVSLHELAIFLSKDHKCTTKQ-------------NDYVCVVCWDGGNLLLCDGCP 404
Query: 831 STFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGR 860
FH+ C + P G+W+C C F C R
Sbjct: 405 RAFHKECASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDPIEQIAKRCIR 464
Query: 861 INEST---------CHVNDQDDSALS--TLQICSLCEEKYHQSC--SQTDGAVQYEPSSL 907
I + C +D S T+ IC CE++YH C ++ P
Sbjct: 465 IVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKAYLKELPEGD 524
Query: 908 SFCGKKCQEIF-----------ERL-EKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVC 955
FC C I ERL E LL V + ++ DV L L+
Sbjct: 525 WFCCNDCTIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDVRWKL-LNGKI 583
Query: 956 QKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERD 1015
E L A+S+ ECF P+ D +G +LI ++Y G N + ++ G + A+L +
Sbjct: 584 ASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVY--GRNLQTQDFGGMYCALLIVN 641
Query: 1016 DEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISE 1075
++SA +RI G+++AE+P + TR+ R +G + L IE L LNV+ L++PA E
Sbjct: 642 SSVVSAGMVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEE 701
Query: 1076 LRETWTSVFGFQPLEVS--SKQKMRNMSLLVFPGVDMLQK 1113
WT FGF +++ + +M ++ F G +ML K
Sbjct: 702 AESIWTEKFGFSKMKLDQLTNYRMNCHQIMAFKGTNMLHK 741
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 161 bits (407), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 122/459 (26%), Positives = 203/459 (44%), Gaps = 82/459 (17%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ + + +L G GI C CC+ + S+F+ H+ + P+ ++Y
Sbjct: 383 LPEGTEVGYY--VRGQQLLVGYKRGSGIFCTCCNSEVSPSQFEAHAGWASRRKPYLHIYT 440
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL + + S +G D NDD C IC DGG+L+CCDGCP FH
Sbjct: 441 SNGVSLHEFSI--------SLSRGREISVSD----NDDLCSICLDGGNLLCCDGCPRVFH 488
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINES 864
+ C+ + P GKW C +C+ C RI +
Sbjct: 489 KECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRIVNT 548
Query: 865 TCHVNDQDDSALS-------------TLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSF 909
V++ AL T+ +C CE+++H C + ++ P F
Sbjct: 549 --QVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKWF 606
Query: 910 CGKKCQEIFERLEKLLGVKH---DLEGGYTWSLVHRF-----------DVSTDLSLSDVC 955
C C+ I L+KL V H +L ++ ++ D+ L
Sbjct: 607 CCHDCKRINSSLQKL--VVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICGRRA 664
Query: 956 QKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERD 1015
+E + L+ ALS+ E F P+ D +G +L+ ++++ G + + ++ G + AIL
Sbjct: 665 SSIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVH--GKSTREWDFGGMYCAILTIS 721
Query: 1016 DEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISE 1075
+++SAA+ RI GKE+AE+P + TR + QG + L + +E L L V L++PA
Sbjct: 722 SQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVLPAAEG 781
Query: 1076 LRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQK 1113
WT+ FGF + + R+ ++ F G MLQK
Sbjct: 782 AESIWTNKFGFNKVTQEQRNNFRRDYQMVTFQGTLMLQK 820
>gi|297817898|ref|XP_002876832.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322670|gb|EFH53091.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 386
Score = 160 bits (406), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 141/292 (48%), Gaps = 84/292 (28%)
Query: 834 HQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
H + IK FP G WHC C+C+FC + E
Sbjct: 130 HLHIQVIKMFPHGDWHCPNCTCKFCRAVVEDV---------------------------- 161
Query: 894 SQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSD 953
SQT GA KC +FE ++K +GVKH+LE ++WSLVHR +D L
Sbjct: 162 SQTVGA-------------KC--LFEGVKKYVGVKHELEARFSWSLVHRECTDSDFILR- 205
Query: 954 VCQKVECNARLAV---ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTA 1010
C + +L+VMDECFLP+ D RSG G + + F
Sbjct: 206 -WTPSYCGKQFQAGHSSLTVMDECFLPIIDRRSG-----------GKYCTKCPLQLF--- 250
Query: 1011 ILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIES------ALCSLN 1064
HG LAEM FIGTRH+YR QGMCRRL + +ES L +L
Sbjct: 251 ----------------HGNRLAEMQFIGTRHVYRHQGMCRRLFSVVESMSFDVKTLQNLK 294
Query: 1065 VEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
VE L+IPA ++L W S FGF+ +E S K+++R+M+LL FPG+D+LQK ++
Sbjct: 295 VELLVIPATADLSHVWISKFGFKYVEDSLKKELRSMNLLAFPGIDVLQKELL 346
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 160 bits (404), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 125/444 (28%), Positives = 204/444 (45%), Gaps = 67/444 (15%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ K + +L G +GI C CC + S+F+ H+ + P++++Y
Sbjct: 423 LPDGAELAYY--VKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYT 480
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G +L L N Q N D +DD C +CGDGGDLI C+GCP FH
Sbjct: 481 SNGLTLHDIALSLANGQ-----------NLTTGD-SDDMCAVCGDGGDLILCNGCPRAFH 528
Query: 835 QNCLDIKKFPSGKWHCVYCSCQFCGRINEST----------------------CHVNDQD 872
CL ++ P W C+ C G ES+ C V +
Sbjct: 529 AACLGLQCVPDSGWQCLNC-IDNAGNGRESSIVRPIMIRLTRVDKTPEVEMGGCVVCREH 587
Query: 873 DSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLE--- 922
D +++ T+ IC CE++YH C + G + E P FC C I+ L+
Sbjct: 588 DFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCDDCNRIYAALQNSV 647
Query: 923 ------------KLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSV 970
+L+ KH+ +G T+ ++ D+ + LS + E L+ A ++
Sbjct: 648 SAGAEIIPASFSELIIRKHEDKGLCTYGAMN--DIQWRI-LSGKSRYPEHLPLLSRAAAI 704
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
ECF P+ SG +LI ++Y G N + G + +L + ++SA +RI G+
Sbjct: 705 FRECFDPIVA-ISGRDLIPVMVY--GRNISGQEFGGMYCIVLIVNYVVVSAGLLRIFGRN 761
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL- 1089
+AE+P + T ++ +G + L + IE L SLNVEKL++PA + WT GF+ +
Sbjct: 762 VAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKMS 821
Query: 1090 EVSSKQKMRNMSLLVFPGVDMLQK 1113
E + +R + L +F ML+K
Sbjct: 822 EDQLSKHLREVQLTLFNKTSMLEK 845
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 159 bits (403), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 125/446 (28%), Positives = 205/446 (45%), Gaps = 67/446 (15%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ K + +L G +GI C CC + S+F+ H+ + P++++Y
Sbjct: 426 LPDGAELAYY--VKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYT 483
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G +L L N Q N D +DD C +CGDGGDLI C+GCP FH
Sbjct: 484 SNGLTLHDIALSLANGQ-----------NLTTGD-SDDMCAVCGDGGDLILCNGCPRAFH 531
Query: 835 QNCLDIKKFPSGKWHCVYCSCQFCGRINEST----------------------CHVNDQD 872
CL ++ P W C+ C G ES+ C V +
Sbjct: 532 AACLGLQCVPDSGWQCLNCR-DNAGNGRESSIVRPIMIRLTRVDKTPEFEMGGCVVCREH 590
Query: 873 DSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLE--- 922
D +++ T+ IC CE++YH C + G + E P FC C I+ L+
Sbjct: 591 DFSVAKFDERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQNSV 650
Query: 923 ------------KLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSV 970
+L+ KH+ +G T+ ++ D+ + LS + E L+ A ++
Sbjct: 651 AAGAEIIPASVSELIIRKHEDKGLCTYGAMN--DIQWRI-LSGKSRYPEHLPLLSRAAAI 707
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
ECF P+ SG +LI ++Y G N + G + +L + ++SA +RI G+
Sbjct: 708 FRECFDPIVA-ISGRDLIPVMVY--GRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFGRN 764
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL- 1089
+AE+P + T ++ +G + L + IE L SLNVEKL++PA + WT GF+ +
Sbjct: 765 VAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKMS 824
Query: 1090 EVSSKQKMRNMSLLVFPGVDMLQKPM 1115
E + +R + L +F ML+K +
Sbjct: 825 EDQLSKHLREVQLTLFNKTSMLEKTV 850
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 159 bits (403), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 132/473 (27%), Positives = 206/473 (43%), Gaps = 96/473 (20%)
Query: 715 TVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLY 773
+P +V Y+ Q K +L G GI C CC+ + S+F+ H+ + P+ N+Y
Sbjct: 517 VLPDGTEVGYYCQGKK--LLAGYKKGFGIFCSCCNSEVSPSQFEAHAGWASRRKPYLNIY 574
Query: 774 FESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTF 833
+G SL + + + S ++ NDD C IC DGG L+CCD CP F
Sbjct: 575 TSNGVSLHELAISLSKGRRHSIKE------------NDDLCQICRDGGKLLCCDVCPRAF 622
Query: 834 HQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINE 863
HQ CL + P GKW+C YC F C RI +
Sbjct: 623 HQECLSLPSIPRGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRIVK 682
Query: 864 S---------TCHVNDQDDS-ALSTLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
+ C +D + + T+ IC CE+++H C + ++ P FC
Sbjct: 683 TFEAEVGGCVFCRGHDFERTFGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKWFCC 742
Query: 912 KKCQEIFERLEKL--------------LGVKHD---LEGGYT----WSLVHRFDVSTDLS 950
C+ I L+KL KH+ E G + W L+ + S+D++
Sbjct: 743 TGCERIHSALQKLVIRGEEKLPDSSLNFIKKHEESASESGCSDDVRWRLLSKKTDSSDVT 802
Query: 951 LSDVCQKVECNARLAVALSVMDECFLPLP-----DHRSGINLIHNILYNFGSNFKRLNYK 1005
A L+ A+++ ECF P+ R + I +++ G N K +
Sbjct: 803 ----------EALLSDAVAIFHECFDPITVDKSKRRRDDHDFIPSMVK--GGNMKGQDLG 850
Query: 1006 GFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNV 1065
G + A+L + ++S A +RI G+ELAE+P + T ++ QG + L T IE L LNV
Sbjct: 851 GMYCAVLLVNHVVVSVAVVRIFGQELAELPIVATSSRWQGQGYFQTLFTCIEKLLGFLNV 910
Query: 1066 EKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMK 1117
+ L++PA E+ W + FGF + + R ++VF G MLQKP+ K
Sbjct: 911 KNLVLPAAEEVGSIWKNKFGFGAITQDELMEYRRRYQIMVFQGALMLQKPVPK 963
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 158 bits (400), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 162/366 (44%), Gaps = 66/366 (18%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF------------ 857
NDD C IC DGG+L+ CDGCP FH+ C I PSG W+C YC F
Sbjct: 326 NDDLCIICADGGNLLLCDGCPRAFHKGCASIPTVPSGDWYCQYCQNTFEREKLVEHNANA 385
Query: 858 ------------------CGRINE---------STCHVNDQDDSALS--TLQICSLCEEK 888
C RI + + C D S T+ +C CE++
Sbjct: 386 SAAGRDSGIDSIEQITKRCFRIVKNIEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKE 445
Query: 889 YHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLL---------------GVKHDL 931
+H C ++ + P FC C I L+KLL KH+
Sbjct: 446 FHVGCLRSHKMTNLKELPKGNWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEE 505
Query: 932 EGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNI 991
G ++ + DV L LS E L+ ALS+ ECF P+ D G +LI +
Sbjct: 506 RG---LNISNNIDVRWTL-LSGKIASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLM 561
Query: 992 LYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRR 1051
+Y G N K +Y G + A+L + I+SA +R+ G+E+AE+P + TR+ +G +
Sbjct: 562 VY--GKNSKGQDYGGMYCAVLTINSSIVSAGILRVFGEEVAELPLVATRNGEHGKGYFQL 619
Query: 1052 LLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRN--MSLLVFPGVD 1109
L + IE L LNV+ L++PA E WT FGFQ ++ K R ++ F G
Sbjct: 620 LFSCIEKLLAFLNVQNLVLPAAEEAESIWTEKFGFQKIKPEQLNKYRKSCCQMVRFEGTS 679
Query: 1110 MLQKPM 1115
MLQK +
Sbjct: 680 MLQKAV 685
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 158 bits (400), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 128/472 (27%), Positives = 198/472 (41%), Gaps = 81/472 (17%)
Query: 702 GKWT----LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKF 757
GKW L + +P ++ Y+ K++ +LQG GI C CC + S+F
Sbjct: 372 GKWKRDNDLHRLLFLPNGLPDGTELAYF--VKSQKLLQGYKQGSGIVCSCCDTEISPSQF 429
Query: 758 DTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGI 816
+ H+ + P+++++ SG SL + S G H + +DD C I
Sbjct: 430 EAHAGMAGRRQPYRHIHISSGLSLHDIAM--------SLADGGHVIT---TGDSDDMCSI 478
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC-----SCQFCG------------ 859
CGDGGDL+ C GCP FH CL + P G W+C C SC+
Sbjct: 479 CGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKTATATDPNLKSIVGS 538
Query: 860 ----------RINESTCHVNDQDDSALS---------TLQICSLCEEKYHQSCSQTDGAV 900
R+ S + D +L T+ +C CE++YH C + +
Sbjct: 539 IAIFSLSAHIRVLHSAYCFSPISDRSLDFSIGKFDDRTVILCDQCEKEYHVGCLRENDLC 598
Query: 901 QYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVC--- 955
+ P FC C I L+ G T V +S +C
Sbjct: 599 DLKGIPQDKWFCCSDCSRIHTALQSSASC-----GPQTIPTVLLDTISRKYREKGICIDN 653
Query: 956 -------------QKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL 1002
+ E L+ A ++ ECF P+ +SG +LI ++Y G N
Sbjct: 654 GDNVEWRMLSGKSRYAEHLPLLSRAATIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQ 710
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
+ G + +L + ++SAA +RI G+++AE+P + T Y+ +G + L +E+ L S
Sbjct: 711 EFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSS 770
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQK 1113
LNVE L++PA E WT FGF + QK R + L +F G ML+K
Sbjct: 771 LNVENLLLPAAEEAESIWTKKFGFTKMTEHQLQKYQREVQLTIFKGTSMLEK 822
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 157 bits (398), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 127/482 (26%), Positives = 202/482 (41%), Gaps = 99/482 (20%)
Query: 698 VAYDGKWT----LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFT 753
V+ GKW L + +P ++ Y+ K++ +LQG GI C CC +
Sbjct: 381 VSGGGKWKRDNDLHRLLFLPNGLPDGTELAYY--VKSQKLLQGYKQGSGIVCSCCDTKIS 438
Query: 754 ISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDD 812
S+F+ H+ + P++ ++ SG SL + S G H + +DD
Sbjct: 439 PSQFEAHAGMAGRRQPYRRIHISSGLSLHDIAV--------SLADGGHVIT---TGDSDD 487
Query: 813 TCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS------------------ 854
C ICG+GGDL+ C GCP FH CL + P G W+C C+
Sbjct: 488 MCSICGNGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKIATASDPNLKP 547
Query: 855 ------------------CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQT 896
C FC + S +D+ T+ +C CE++YH C +
Sbjct: 548 IVIRLTRVVKAPESEIGGCVFCRSHDFSIGKFDDR------TVILCDQCEKEYHVGCLRE 601
Query: 897 DGAVQYE--PSSLSFCGKKCQEIFERLEK---------------LLGVKHDLEGGYT--- 936
+ + P FC C I L+ + K+ +G Y
Sbjct: 602 NELCDLKGIPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNG 661
Query: 937 ----WSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNIL 992
W + LS + E L+ A ++ ECF P+ +SG +LI ++
Sbjct: 662 NTVEWRM-----------LSGKSRYPEHLPLLSRAATIFRECFDPIVA-KSGRDLIPVMV 709
Query: 993 YNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRL 1052
Y G N + G + +L + ++SAA +RI G+++AE+P + T Y+ +G + L
Sbjct: 710 Y--GRNISGQEFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGL 767
Query: 1053 LTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDML 1111
+E+ L SLNVE L++PA E WT+ FGF + Q+ R + L +F G ML
Sbjct: 768 FACVENLLSSLNVENLLLPAAEEAESIWTNKFGFTKMTEHRLQRYQREVQLTIFKGTSML 827
Query: 1112 QK 1113
+K
Sbjct: 828 EK 829
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 157 bits (396), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 116/461 (25%), Positives = 192/461 (41%), Gaps = 71/461 (15%)
Query: 713 TGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQN 771
+G +P V Y+ K +L G I GI C CC+ + + S+F+ H+ ++ P+ N
Sbjct: 295 SGILPEGTDVGYYVGGKR--LLDGYIKEPGIHCHCCNTVVSPSQFEGHAGRAARRKPYHN 352
Query: 772 LYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPS 831
+Y +G SL + + + S R+ +DD C IC DGG+L+ CD CP
Sbjct: 353 IYMSNGVSLHELSVSLSRGRKTSDRQ------------SDDLCSICSDGGELLLCDTCPR 400
Query: 832 TFHQNCLDIKKFPSGKWHCVYCSCQ--------------FCGRINE-------------- 863
FH+ C+D+ P G W C YC + GRI+
Sbjct: 401 AFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNAIAAGRIDGIDSMEQIFTRSIRI 460
Query: 864 -----------STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLS 908
+ C ++D S T+ +C C +YH C + P
Sbjct: 461 ATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGREYHVGCLKEHSMADLTALPEGAW 520
Query: 909 FCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF--------DVSTDLSLSDVCQKVEC 960
+C C I E ++ LL + L+ + D D+ + K
Sbjct: 521 YCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKREDKGLNEDGDLDVRWRVLRDKSSE 580
Query: 961 NARLAV--ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
+++L + A+++ E F P+ +G +LI ++Y G + + +Y G + A+L + +
Sbjct: 581 DSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVY--GRSVRDQDYTGMYCAVLTVGNTV 638
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
+SA RI G+E AE+P + T + G + L IE L SL V+ ++PA E
Sbjct: 639 VSAGLFRIMGREAAELPLVATSRDNQGFGYFQALFGCIERLLASLKVKYFVLPAADEAVS 698
Query: 1079 TWTSVFGFQPLEVSS-KQKMRNMSLLVFPGVDMLQKPMMKN 1118
WT FGF + + ++ VF G L K + +N
Sbjct: 699 IWTQRFGFSKISRDELLEHLKGARTTVFQGTSTLHKLIPEN 739
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 157 bits (396), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 126/456 (27%), Positives = 202/456 (44%), Gaps = 75/456 (16%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ + + +LQG GI C CC+ + + S+F+ H+ S P+ +Y
Sbjct: 487 LPDGTEVAYFARGQK--LLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAYIYT 544
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL + + S KG + D NDD C IC DGG+L+ CDGCP FH
Sbjct: 545 SNGVSLHELAI--------SLSKGRKYSAKD----NDDLCIICLDGGNLLLCDGCPRAFH 592
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINES 864
+ C + P G W+C +C F C RI +
Sbjct: 593 KECASLSSTPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRN 652
Query: 865 T---------CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTD--GAVQYEPSSLSFCG 911
C +D S T+ +C CE+++H C + ++ P FC
Sbjct: 653 IETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCS 712
Query: 912 KKCQEIFERLEKLL-----GVKHDLEGGYTWSL---VHRFDVSTDLSLSDVCQKV---EC 960
C I L+KLL + + L G L V D+S + K+ E
Sbjct: 713 IVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGKIASPET 772
Query: 961 NARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIIS 1020
L+ A+++ + F P+ D SG +LI ++Y G + + G + AIL + ++S
Sbjct: 773 RLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVY--GRDVGGQEFGGMYCAILIVNSFVVS 830
Query: 1021 AASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETW 1080
AA +R+ G+++AE+P + T + +G + L + IE L L V+ L++PA E W
Sbjct: 831 AAMLRVFGQDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIW 890
Query: 1081 TSVFGFQ---PLEVSSKQKMRNMSLLVFPGVDMLQK 1113
T FGF+ P ++SS ++ ++ F G MLQK
Sbjct: 891 TEKFGFERIKPDQLSSYRR-SCCQMVTFKGTSMLQK 925
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 156 bits (395), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 125/467 (26%), Positives = 200/467 (42%), Gaps = 94/467 (20%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ R R +L G I C CC+ + S+F+ HS + P+ ++Y
Sbjct: 408 LPEGSEVSYY-VRGER-LLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYT 465
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL + L ++ S + ND+ C IC DGG L+CCDGCP FH
Sbjct: 466 SNGVSLHELSLSLLRGREPSI------------NTNDEICSICLDGGTLLCCDGCPRVFH 513
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINE- 863
+ C+ ++ P GKW C +C C RI +
Sbjct: 514 KECVSLENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKN 573
Query: 864 --------STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
+ C ++ S T+ IC CE+++H C + ++ P FC
Sbjct: 574 QTDEAGGCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCC 633
Query: 912 KKCQEIFERLEKLL-------------------GVKHDLEGG---YTWSLVHRFDVSTDL 949
+ C++I L K++ G K + G W L+H S
Sbjct: 634 RDCKDINSSLRKIVVRQEEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRRASA-- 691
Query: 950 SLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFT 1009
E + L+ ALS+ E F P+ D G +L+ +++++ + L + G +
Sbjct: 692 --------TEAGSLLSQALSLFHEQFNPIADAE-GRDLLLDMVHSNSTG--ELEFGGMYC 740
Query: 1010 AILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLI 1069
AIL +++SAA+ R+ GKE+AE+P + TR + QG + L T IE LC L V L+
Sbjct: 741 AILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCFLQVNSLV 800
Query: 1070 IPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
+PA W + F F +E + R+ ++ F G MLQKP+
Sbjct: 801 LPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPV 847
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 129/474 (27%), Positives = 206/474 (43%), Gaps = 85/474 (17%)
Query: 698 VAYDGKWT----LLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFT 753
V+ GKW L + +P ++ Y+ K++ +LQG GI C CC +
Sbjct: 381 VSGGGKWKRDNDLHRLLFLPNGLPDGTELAYY--VKSQKLLQGYKQGSGIVCSCCDTKIS 438
Query: 754 ISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDD 812
S+F+ H+ + P++ ++ SG SL + S G H + +DD
Sbjct: 439 PSQFEAHAGMAGRRQPYRRIHISSGLSLHDIAV--------SLADGGHVIT---TGDSDD 487
Query: 813 TCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC-----SCQ----------F 857
C ICG+GGDL+ C GCP FH CL + P G W+C C SC+ F
Sbjct: 488 MCSICGNGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKIATASWLYTYF 547
Query: 858 CGRINESTCH-------VNDQD-DSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE- 903
N H ++D+ D ++ T+ +C CE++YH C + + +
Sbjct: 548 NLNANILVLHSAYSLSPISDRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKG 607
Query: 904 -PSSLSFCGKKCQEIFERLEK---------------LLGVKHDLEGGYT-------WSLV 940
P FC C I L+ + K+ +G Y W +
Sbjct: 608 IPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRM- 666
Query: 941 HRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK 1000
LS + E L+ A ++ ECF P+ +SG +LI ++Y G N
Sbjct: 667 ----------LSGKSRYPEHLPLLSRAATIFRECFDPIVA-KSGRDLIPVMVY--GRNIS 713
Query: 1001 RLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
+ G + +L + ++SAA +RI G+++AE+P + T Y+ +G + L +E+ L
Sbjct: 714 GQEFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLL 773
Query: 1061 CSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQK 1113
SLNVE L++PA E WT+ FGF + Q+ R + L +F G ML+K
Sbjct: 774 SSLNVENLLLPAAEEAESIWTNKFGFTKMTEHRLQRYQREVQLTIFKGTSMLEK 827
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 125/464 (26%), Positives = 202/464 (43%), Gaps = 80/464 (17%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
+ + G +P +V Y+ K +L G GI C CC + S+F+ H+ + P
Sbjct: 1956 VFEEGGLPDGTEVAYYAGGKK--LLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRKKP 2013
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
+ +Y +G SL + + S KG + D NDD C ICGDGG+L+ CDG
Sbjct: 2014 YSYIYTSNGVSLHELAI--------SLSKGRKYSARD----NDDLCSICGDGGNLLLCDG 2061
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------C 858
CP FH+ C + P W+C YC F C
Sbjct: 2062 CPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRC 2121
Query: 859 GRI-----NESTCHVNDQDDSALS-----TLQICSLCEEKYHQSCSQTDGA--VQYEPSS 906
RI S C + D + S T+ +C CE+++H C + ++ PS
Sbjct: 2122 IRIVNPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSG 2181
Query: 907 LSFCGKKCQEIFERLEKL------------LGV---KHDLEGGYTWSLVHRFDVSTDLSL 951
FC +C I L+KL L V KH+ +G + + ++V L L
Sbjct: 2182 KWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLES---IADYNVRWRL-L 2237
Query: 952 SDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAI 1011
S E L+ A+++ + F P+ D +G +LI ++Y G N + ++ G + A+
Sbjct: 2238 SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVY--GRNVRGQDFSGLYCAV 2295
Query: 1012 LERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIP 1071
+ + ++SA +R+ G+E+AE+P + T + +G + L + IE L LNV ++P
Sbjct: 2296 ITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLP 2355
Query: 1072 AISELRETWTSVFGFQPLEVSSKQKMRN--MSLLVFPGVDMLQK 1113
A E WT FGF+ + + R ++ F G ML+K
Sbjct: 2356 AAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEK 2399
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 557
Score = 154 bits (388), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 167/340 (49%), Gaps = 27/340 (7%)
Query: 784 ILDSWNKQDESKRKGFHFVNFD-GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK 842
+++ ++++ +++ F D D N D C +C GGDL+ CDGCPS FH CL +
Sbjct: 228 VMEKKQQKNKHEKESLRFCRKDCSPDMNCDVCCVCHWGGDLLLCDGCPSAFHHACLGLSS 287
Query: 843 FPS-GKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQ 901
P W C C C CG + + A S L C C+ ++H +C + D +
Sbjct: 288 LPEEDLWFCPCCCCDICGSM----------ESPANSKLMACEQCQRRFHLTCLKEDSCIV 337
Query: 902 YEPSSLSFCGKKCQEIFERLEKLLGVKHDL--EGGYTWSLVHRFDVSTDLSLSDVCQKVE 959
S FC +C +F LE LLG K + +G W+L+ + +
Sbjct: 338 --SSRGWFCSSQCNRVFSALENLLGSKIAVGNDGDLVWTLMRAPNEGEHYDDEQI----- 390
Query: 960 CNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEII 1019
++L A+ ++ + F P D SG +L+ ++Y + +GF+T ++ER +E I
Sbjct: 391 --SKLESAVEILHQGFEPTNDVFSGRDLVEELIYR---KDRTGVGRGFYTVLIERKNEPI 445
Query: 1020 SAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRET 1079
+ A++R+ K++ E+P + T YRR GMCR L+ +E + + V +L++PA E+ T
Sbjct: 446 TVAAVRV-DKDVVEIPLVATLSSYRRSGMCRVLMDELEKQMSQMGVCRLVLPAAKEVVTT 504
Query: 1080 WTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
WT FGF + S + ++ +L F G M K + K +
Sbjct: 505 WTERFGFSVMNSSERLELVKHGMLDFVGTIMCHKFLQKER 544
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 153 bits (387), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 117/442 (26%), Positives = 196/442 (44%), Gaps = 75/442 (16%)
Query: 733 MLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYFESGSSLLQCILDSWNKQ 791
ML G GI C CC+++ + S F+ H+ + PFQ++Y +G SL + L
Sbjct: 628 MLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTNGVSLHE--LSVALSM 685
Query: 792 DESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCV 851
D+ R H NDD C IC DGG+L+CCD CP ++H+ C + PS +W C
Sbjct: 686 DQ--RFSIH--------ENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERWSCK 735
Query: 852 YC--------------SCQFCGRIN--ESTCHVNDQDDSALS------------------ 877
YC + GR+ ++ + ++ +S
Sbjct: 736 YCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHSF 795
Query: 878 --------TLQICSLCEEKYHQSC-SQTDGA-VQYEPSSLSFCGKKCQEIFERLEKLLGV 927
T+ IC CE+++H C + D A ++ P FC C+EI L L+
Sbjct: 796 CRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIVR 855
Query: 928 KHDLEGGYTWSLVHR-----------FDVSTDLSLSDVCQKV----ECNARLAVALSVMD 972
+ + + + + + D+ + K+ + LA ALS++
Sbjct: 856 GEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALSILH 915
Query: 973 ECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELA 1032
E F P+ + + +LI ++Y G K ++ G + +L D+ I+S R+ G ELA
Sbjct: 916 ERFDPISESGTKGDLIPAMVY--GRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELA 973
Query: 1033 EMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL-EV 1091
E+P + T + QG + L IE L LNV+ +++PA E + WT FGF + +
Sbjct: 974 ELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDE 1033
Query: 1092 SSKQKMRNMSLLVFPGVDMLQK 1113
K+ ++ S+++F G ML+K
Sbjct: 1034 EVKEYRKDYSVMIFHGTSMLRK 1055
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 153 bits (386), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 125/464 (26%), Positives = 202/464 (43%), Gaps = 80/464 (17%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
+ + G +P +V Y+ K +L G GI C CC + S+F+ H+ + P
Sbjct: 424 VFEEGGLPDGTEVAYYAGGKK--LLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRKKP 481
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
+ +Y +G SL + + S KG + D NDD C ICGDGG+L+ CDG
Sbjct: 482 YSYIYTSNGVSLHELAI--------SLSKGRKYSARD----NDDLCSICGDGGNLLLCDG 529
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------C 858
CP FH+ C + P W+C YC F C
Sbjct: 530 CPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRC 589
Query: 859 GRI-----NESTCHVNDQDDSALS-----TLQICSLCEEKYHQSCSQTDGA--VQYEPSS 906
RI S C + D + S T+ +C CE+++H C + ++ PS
Sbjct: 590 IRIVNPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSG 649
Query: 907 LSFCGKKCQEIFERLEKL------------LGV---KHDLEGGYTWSLVHRFDVSTDLSL 951
FC +C I L+KL L V KH+ +G + + ++V L L
Sbjct: 650 KWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLES---IADYNVRWRL-L 705
Query: 952 SDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAI 1011
S E L+ A+++ + F P+ D +G +LI ++Y G N + ++ G + A+
Sbjct: 706 SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVY--GRNVRGQDFSGLYCAV 763
Query: 1012 LERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIP 1071
+ + ++SA +R+ G+E+AE+P + T + +G + L + IE L LNV ++P
Sbjct: 764 ITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLP 823
Query: 1072 AISELRETWTSVFGFQPLEVSSKQKMRN--MSLLVFPGVDMLQK 1113
A E WT FGF+ + + R ++ F G ML+K
Sbjct: 824 AAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEK 867
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 152 bits (384), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 133/463 (28%), Positives = 202/463 (43%), Gaps = 91/463 (19%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKLCHPFQN 771
+P +V Y+ + + +L+G GI C CC+ + S+F+ H S+ K P+
Sbjct: 348 LPDGTEVAYYARGQK--LLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKK---PYAY 402
Query: 772 LYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPS 831
+Y +G SL + + + S + NDD C +C DGG+L+ CDGCP
Sbjct: 403 IYTSNGVSLHELAISLSKDRKYSAKD------------NDDLCIVCWDGGNLLLCDGCPR 450
Query: 832 TFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRI 861
FH+ C + P G W+C +C F C RI
Sbjct: 451 AFHKECAALSSIPRGDWYCQFCQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRI 510
Query: 862 ------NESTCHVNDQDDSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE---PSSL 907
+ S+C + D + S T+ +C CE++YH C + D + Y P
Sbjct: 511 VKDIEADLSSCALCRGVDFSRSGFGPRTIILCDQCEKEYHVGCLR-DHKMAYLKELPEGN 569
Query: 908 SFCGKKCQEIF-----------ERL-EKLLGV--KHDLEGGYTWSLVHRFDVSTDLSLSD 953
C C I ERL E LLGV K E G L DV L L+
Sbjct: 570 WLCCNDCTRIHSTLENLLVKGAERLPESLLGVIKKKQEEKG----LEPIIDVRWRL-LNG 624
Query: 954 VCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILE 1013
E L A+S+ ECF P+ D SG +LI ++Y G N + + G + A+L
Sbjct: 625 KIASPETRPLLLEAVSIFHECFNPIVDAASGRDLIPAMVY--GRNVRGQEFGGMYCALLI 682
Query: 1014 RDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAI 1073
+ ++SA +RI G ++AE+P + T + +G + L + IE L LNV+ L++PA
Sbjct: 683 VNSSVVSAGMLRIFGSDVAELPLVATSNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAA 742
Query: 1074 SELRETWTSVFGF---QPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
E WT FGF P E+++ +K ++ F G +ML K
Sbjct: 743 EEAESIWTDKFGFSKMNPDELTNYRK-NCHQMVSFKGTNMLHK 784
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 150 bits (380), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 83/214 (38%), Positives = 130/214 (60%), Gaps = 7/214 (3%)
Query: 915 QEIFERLEKLLGVKHDL-EGGYTWSLV---HRFDVSTDLSLSDVCQKVECNARLAVALSV 970
++I+E L KLLG + TW+LV H D S SD+ E ++L +A+SV
Sbjct: 928 KKIYEGLHKLLGEPVSVGVDNLTWTLVKFIHPDRFEHDSSKSDLL--AESYSKLHLAISV 985
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
M ECF PL + S +L+ +++++ S RLN++GF+T +LER++E+IS A++R++GK+
Sbjct: 986 MHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVATVRVYGKK 1045
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLE 1090
+AE+P +GTR YRR GMC L+ +E L L VE+L++PA+ + ETWT FGF +
Sbjct: 1046 VAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMT 1105
Query: 1091 VSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPREN 1124
+ + + + L F G M QK ++ N P N
Sbjct: 1106 NLERSQFLDYTFLDFQGAIMCQK-LLTNNNPSPN 1138
Score = 130 bits (326), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 111/218 (50%), Gaps = 15/218 (6%)
Query: 706 LLAWMIDTGTVPLNEKVQY---WNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTH-S 761
+L+++ID + KV Y RK + G+I RDGI+C+CC I++ F+ H S
Sbjct: 533 VLSYLIDNSIILPRCKVYYKVKGRHRKVCTLADGKITRDGIKCNCCMGIYSFVGFENHAS 592
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ C P +++ E G SLL C++ + + G F + ND C +C GG
Sbjct: 593 GNSTCRPSASIFLEDGRSLLDCLIKMMHDHKTMETSGKSFSGLSLVE-NDYICSVCHYGG 651
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
+LI CD CPS+FH+ CL ++ P+G W C C C CG+ D+DD + L
Sbjct: 652 ELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQ------RKIDRDDE-VEQLLP 704
Query: 882 CSLCEEKYHQSCSQTDGAVQYEPSSLS--FCGKKCQEI 917
C CE KYH C + +GA L FCGK C+++
Sbjct: 705 CIQCEHKYHVRCLE-NGAADISTRYLGNWFCGKDCEKL 741
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101203549
[Cucumis sativus]
Length = 946
Score = 150 bits (378), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 126/465 (27%), Positives = 202/465 (43%), Gaps = 84/465 (18%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ + + +LQG GI C CC+ + + S+F+ H+ S P+ +Y
Sbjct: 487 LPDGTEVAYFARGQK--LLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAYIYT 544
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL + + S KG + D NDD C IC DGG+L+ CDGCP FH
Sbjct: 545 SNGVSLHELAI--------SLSKGRKYSAKD----NDDLCIICLDGGNLLLCDGCPRAFH 592
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINES 864
+ C + P G W+C +C F C RI +
Sbjct: 593 KECASLSSIPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRN 652
Query: 865 T---------CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTD--GAVQYEPSSLSFCG 911
C +D S T+ +C CE+++H C + ++ P FC
Sbjct: 653 IETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCS 712
Query: 912 KKCQEIFERLEKLL-----GVKHDLEGGYTWSL---VHRFDVSTDLSLSDVCQKV---EC 960
C I L+KLL + + L G L V D+S + K+ E
Sbjct: 713 IVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGKIASPET 772
Query: 961 NARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIIS 1020
L+ A+++ + F P+ D SG +LI ++Y G + + G + AIL + ++S
Sbjct: 773 RLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVY--GRDVGGQEFGGMYCAILIVNSFVVS 830
Query: 1021 AASIRIHGK---------ELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIP 1071
AA +R+ G+ ++AE+P + T + +G + L + IE L L V+ L++P
Sbjct: 831 AAMLRVFGQYCRAAIGCXDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLP 890
Query: 1072 AISELRETWTSVFGFQ---PLEVSSKQKMRNMSLLVFPGVDMLQK 1113
A E WT FGF+ P ++SS ++ ++ F G MLQK
Sbjct: 891 AAEEAESIWTEKFGFERIKPDQLSSYRR-SCCQMVTFKGTSMLQK 934
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 149 bits (377), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 112/416 (26%), Positives = 178/416 (42%), Gaps = 74/416 (17%)
Query: 755 SKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDT 813
S+F+ H+ + P+++++ SG SL + N H + +DD
Sbjct: 419 SQFEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANG---------HVIT---TGDSDDM 466
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS------------------- 854
C ICGDGGDL+ C GCP FH CL + P G W+C C+
Sbjct: 467 CSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNAR 526
Query: 855 -------------------CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQ 895
C FC + S +D+ T+ +C CE++YH C +
Sbjct: 527 PIVIRLSRVVKAPESDIGGCVFCRSHDFSIGKFDDR------TVILCDQCEKEYHVGCLR 580
Query: 896 TDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFD----VSTDL 949
+G + P FC C I ++ + ++ R D + TD+
Sbjct: 581 ENGFCDLKEIPQEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDI 640
Query: 950 S-------LSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL 1002
LS + E L+ A + ECF P+ +SG +LI ++Y G N
Sbjct: 641 GDTVEWRILSGKSRYPEHLPLLSRAAVIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQ 697
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
+ G + +L + ++SAA +RI G+E+AE+P + T Y+ +G + L +E+ L S
Sbjct: 698 EFGGMYCLVLIVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSS 757
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMK 1117
LNVE L++PA E WT FGF + Q+ + + L +F G ML+K + K
Sbjct: 758 LNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 813
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 147 bits (370), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 118/445 (26%), Positives = 196/445 (44%), Gaps = 81/445 (18%)
Query: 733 MLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKLCHPFQNLYFESGSSLLQCILDSW 788
ML G GI C CC+++ + S F+ H S+ K PFQ++Y +G SL + L
Sbjct: 192 MLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRK---PFQHIYTTNGVSLHE--LSVA 246
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
D+ R H NDD C IC DGG+L+CCD CP ++H+ C + PS +W
Sbjct: 247 LSMDQ--RFSIH--------ENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERW 296
Query: 849 HCVYC--------------SCQFCGRIN--ESTCHVNDQDDSALSTLQ------------ 880
C YC + GR+ ++ + ++ +S+
Sbjct: 297 SCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRG 356
Query: 881 --------------ICSLCEEKYHQSC-SQTDGA-VQYEPSSLSFCGKKCQEIFERLEKL 924
IC CE+++H C + D A ++ P FC C+EI L L
Sbjct: 357 HSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNL 416
Query: 925 LGVKHDLEGGYTWSLVHR-----------FDVSTDLSLSDVCQKV----ECNARLAVALS 969
+ + + + + + + D+ + K+ + LA ALS
Sbjct: 417 IVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALS 476
Query: 970 VMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGK 1029
++ E F P+ + + +LI ++Y G K ++ G + +L D+ I+S R+ G
Sbjct: 477 ILHERFDPISESGTKGDLIPAMVY--GRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGS 534
Query: 1030 ELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL 1089
ELAE+P + T + QG + L IE L LNV+ +++PA E + WT FGF +
Sbjct: 535 ELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKM 594
Query: 1090 -EVSSKQKMRNMSLLVFPGVDMLQK 1113
+ K+ ++ S+++F G ML+K
Sbjct: 595 TDEEVKEYRKDYSVMIFHGTSMLRK 619
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 146 bits (369), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 120/445 (26%), Positives = 196/445 (44%), Gaps = 61/445 (13%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHP 768
+ + G +P +V Y+ K +L G GI C CC + S+F+ H+ + P
Sbjct: 1991 VFEEGGLPDGTEVAYYAGGKK--LLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRKKP 2048
Query: 769 FQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDG 828
+ +Y +G SL + + S KG + D NDD C ICGDGG+L+ CDG
Sbjct: 2049 YSYIYTSNGVSLHELAI--------SLSKGRKYSARD----NDDLCSICGDGGNLLLCDG 2096
Query: 829 CPSTFHQNCLDIKKFPSGKWHCVYCSCQF--------------CGRIN-----ESTCHVN 869
CP FH+ C + P W+C YC F GR++ E
Sbjct: 2097 CPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRC 2156
Query: 870 DQDDSALSTLQICSLCE-EKYHQSCSQTDGAV---QYEPSSLSFCGKKCQEIFERLEK-- 923
+ + + + C LC + +S + Q PS FC +C I L+K
Sbjct: 2157 IRIVNPEAEVSACVLCRGYDFSKSGFGPRTIILCDQELPSGKWFCCLECIRIHSALQKLH 2216
Query: 924 ----------LLGV---KHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSV 970
LL V KH+ +G + + ++V L LS E L+ A+++
Sbjct: 2217 VRGEEKLPDSLLNVIKEKHERKGLES---IADYNVRWRL-LSGKLASPETRVLLSEAVAI 2272
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
+ F P+ D +G +LI ++Y G N + ++ G + A++ + ++SA +R+ G+E
Sbjct: 2273 FHDRFDPIIDSVTGRDLIPAMVY--GRNVRGQDFSGLYCAVITVNSHVVSAGILRVFGQE 2330
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLE 1090
+AE+P + T + +G + L + IE L LNV ++PA E WT FGF+ +
Sbjct: 2331 VAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGFKKIT 2390
Query: 1091 VSSKQKMRN--MSLLVFPGVDMLQK 1113
+ R ++ F G ML+K
Sbjct: 2391 PDQLSEYRKSFYQMISFQGTCMLEK 2415
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 145 bits (366), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 116/452 (25%), Positives = 194/452 (42%), Gaps = 77/452 (17%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P ++ Y+ K + +L G +GI C C+ + S+F+ H+ + P++++Y
Sbjct: 407 LPDGAELAYF--VKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYT 464
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G +L + + Q + +DD C CG+GGDLI CD CP +H
Sbjct: 465 TNGLTLHDIAISLASGQKLTT------------GDSDDMCAACGNGGDLIFCDRCPRAYH 512
Query: 835 QNCLDIKKFPSGKWHCVYC--------------------------------------SCQ 856
CL ++ P G W C C C
Sbjct: 513 TGCLHLQNVPEGVWSCPNCRDKVGSNSKAISGGSLSFSKPIVFRLTRVVKAPEYEIGGCV 572
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKC 914
C R + S +D+ T+ +C CE ++H C + G + P FC +C
Sbjct: 573 VCRRHDFSAAKFDDR------TVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDEC 626
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRFDVS----TDLSLSDV-CQKVECNAR------ 963
I L+ + + L+ R V D +L+DV Q + +R
Sbjct: 627 SNIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPEDLP 686
Query: 964 -LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAA 1022
L+ A ++ ECF P+ +SG +LI ++Y G N + G + +L ++SA
Sbjct: 687 FLSRATAIFRECFDPIVA-KSGRDLIPVMVY--GRNISGQEFGGMYCVVLIVRSIVVSAG 743
Query: 1023 SIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS 1082
+RI G+E+AE+P + T ++ +G + L + IE L SLNV+ L++PA + WT
Sbjct: 744 LLRIFGREVAELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTK 803
Query: 1083 VFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
GF+ + E + MR + L +F G ML+K
Sbjct: 804 KLGFRKMSEEQLIKYMREVQLTIFNGTSMLEK 835
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 144 bits (363), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 166/344 (48%), Gaps = 35/344 (10%)
Query: 784 ILDSWNKQDESKRKGFHFVNFDGE-DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK 842
+++ + ++ +++ F D D N D C +C GGDL+ CDGCPS FH CL +
Sbjct: 226 VMEKKQQMNKHEKESLRFCRKDSSPDMNCDVCCVCHWGGDLLLCDGCPSAFHHTCLGLSS 285
Query: 843 FPS-GKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQ 901
P W C C C CG + VN S L C C+ ++H C ++
Sbjct: 286 LPEEDLWFCPCCCCDICGSMESP---VN-------SKLMACEQCQRRFHLKC------LK 329
Query: 902 YEPSSLS----FCGKKCQEIFERLEKLLGVKHDL--EGGYTWSLVHRFDVSTDLSLSDVC 955
EP +S FC +C + LE L+G K + G W+L+ + +
Sbjct: 330 EEPGIVSCRGWFCSSQCNRVSSALENLIGCKIAVGNNGDLVWTLMRAPNEGEHYDDEQI- 388
Query: 956 QKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERD 1015
++L A+ ++ + F P D SG +L+ +++ + +GF+T ++ER
Sbjct: 389 ------SKLESAVEILHQGFEPTKDVFSGRDLVEELIFR---KDRTGVGRGFYTVLIERK 439
Query: 1016 DEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISE 1075
E I+ A++R+ K++ E+P + T YRR GMCR L+ +E + + V +L++PA E
Sbjct: 440 KEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMCRVLVDELEKQMSQMGVCRLVLPAAKE 498
Query: 1076 LRETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
+ TWT FGF +E S + ++ +L F G M K ++K +
Sbjct: 499 VVSTWTQRFGFSVMESSERLELVKHGMLDFVGTVMCHKFLVKER 542
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 142 bits (358), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 117/442 (26%), Positives = 192/442 (43%), Gaps = 79/442 (17%)
Query: 733 MLQGRIARDGIRCDCCSEIFTISKFDTH----SKSKLCHPFQNLYFESGSSLLQCILDSW 788
+L+G GI C CC+ + S+F+ H S+ K P+ +Y +G SL + +
Sbjct: 295 LLEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRKK---PYAYIYTSNGVSLHELSISL- 350
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKW 848
SK + + + NDD C +C DGG+L+ CDGCP FH+ C + P G W
Sbjct: 351 -----SKDRKY------SANDNDDLCVVCWDGGNLLLCDGCPRAFHKECASLSSIPRGDW 399
Query: 849 HCVYCSCQF------------------------------CGRINE------STCHVNDQD 872
+C +C F C RI + S C +
Sbjct: 400 YCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCIRIVKDIDAELSACALCRGV 459
Query: 873 DSALS-----TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLL 925
D + S T+ +C CE++YH C + + P C C I LE +L
Sbjct: 460 DFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKGNWLCCNDCTRIHSTLENVL 519
Query: 926 GVKHDLEGGYTWSLVHR------FDVSTDLS-----LSDVCQKVECNARLAVALSVMDEC 974
+ +++ + D D++ LS E L A+S+ EC
Sbjct: 520 VRGAERLPKSLLAVIKKKQEEKGLDPINDINVRWRLLSGKKASPETRPLLLEAVSIFHEC 579
Query: 975 FLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEM 1034
F P+ D SG +LI ++Y G + + + G + A+L + ++SA +RI G ++AE+
Sbjct: 580 FDPIVDAVSGRDLIRAMVY--GKSVRGQEFGGMYCALLIVNSSVVSAGMLRIFGTDIAEL 637
Query: 1035 PFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF---QPLEV 1091
P + T + +G + L + IE L + V+ L++PA E + WT FGF +P E+
Sbjct: 638 PLVATSNSQHGKGYFQALFSCIERLLAFMKVKNLVLPAAEEAQSIWTDKFGFSKIKPDEL 697
Query: 1092 SSKQKMRNMSLLVFPGVDMLQK 1113
++ ++ N + F G +ML K
Sbjct: 698 ANYRRNCN-QFVTFQGTNMLHK 718
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 141 bits (355), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 114/439 (25%), Positives = 188/439 (42%), Gaps = 75/439 (17%)
Query: 729 KTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDS 787
K + +L G +GI C C+ + S+F+ H+ + P++++Y +G +L +
Sbjct: 357 KGQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISL 416
Query: 788 WNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGK 847
+ Q + +DD C CG+GGDLI CD CP +H CL ++ P G
Sbjct: 417 ASGQKLTT------------GDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGV 464
Query: 848 WHCVYC--------------------------------------SCQFCGRINESTCHVN 869
W C C C C R + S +
Sbjct: 465 WSCPNCRDKVGSNSKAISGGSLSFSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAKFD 524
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGV 927
D+ T+ +C CE ++H C + G + P FC +C I L+ +
Sbjct: 525 DR------TVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLN 578
Query: 928 KHDLEGGYTWSLVHRFDVS----TDLSLSDV-CQKVECNAR-------LAVALSVMDECF 975
+ L+ R V D +L+DV Q + +R L+ A ++ ECF
Sbjct: 579 GAQIIPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRATAIFRECF 638
Query: 976 LPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMP 1035
P+ +SG +LI ++Y G N + G + +L ++SA +RI G+E+AE+P
Sbjct: 639 DPIVA-KSGRDLIPVMVY--GRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELP 695
Query: 1036 FIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL-EVSSK 1094
+ T ++ +G + L + IE L SLNV+ L++PA + WT GF+ + E
Sbjct: 696 IVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLI 755
Query: 1095 QKMRNMSLLVFPGVDMLQK 1113
+ MR + L +F G ML+K
Sbjct: 756 KYMREVQLTIFNGTSMLEK 774
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 140 bits (354), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 115/467 (24%), Positives = 198/467 (42%), Gaps = 100/467 (21%)
Query: 716 VPLNEKVQYWNQRKTRV-MLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLY 773
+P +V Y+ + + +L G GI C CC+ + S+F+ H+ + P+ ++Y
Sbjct: 415 LPEGTEVGYYVRGQVVTQLLVGYKRGSGIXCTCCNSEVSPSQFEAHAGWASRRKPYLHIY 474
Query: 774 FESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTF 833
+G SL + + S +G D NDD C IC DGG+L+CCDGCP F
Sbjct: 475 TSNGVSLHEFSI--------SLSRGREISVSD----NDDLCSICLDGGNLLCCDGCPRVF 522
Query: 834 HQNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINE 863
H+ C+ + P GKW C +C+ C RI
Sbjct: 523 HKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRIVN 582
Query: 864 STCHVNDQDDSALS-------------TLQICSLCEEKYHQSCSQTDGA--VQYEPSSLS 908
+ V++ AL T+ +C CE+++H C + ++ P
Sbjct: 583 T--QVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKW 640
Query: 909 FCGKKCQEIFERLEKL-LGVKHDLEGGYTWSLVHRF-----------DVSTDLSLSDVCQ 956
FC C+ I L+KL + + +L ++ ++ D+ L
Sbjct: 641 FCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICGRRAS 700
Query: 957 KVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDD 1016
+E + L+ ALS+ E F P+ D +G +L+ ++++
Sbjct: 701 SIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHG---------------------S 738
Query: 1017 EIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISEL 1076
+++SAA+ RI GKE+AE+P + TR + QG + L + +E L L V L++PA
Sbjct: 739 QVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVLPAAEGA 798
Query: 1077 RETWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFPRE 1123
WT+ FGF +V+ +Q + ++ + + QK + FPR+
Sbjct: 799 ESIWTNKFGFN--KVTQEQYIMDLFGIA---AEQFQKRLPDGDFPRD 840
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 139 bits (351), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/472 (24%), Positives = 194/472 (41%), Gaps = 115/472 (24%)
Query: 716 VPLNEKVQYW-----NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPF 769
+P +V Y+ +++ +L G I CDCC+ + S+F+ HS + P+
Sbjct: 400 LPEGSEVSYYVRGEVGTMRSKRLLSGHKKGCRILCDCCNSEVSPSQFEAHSGWASRRKPY 459
Query: 770 QNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGC 829
++Y +G SL + L ++ S + ND+ C IC DGG L+CCDGC
Sbjct: 460 LHIYTSNGVSLHELSLSLLRGREPSI------------NTNDEICSICLDGGTLLCCDGC 507
Query: 830 PSTFHQNCLDIKKFPSGKWHCVYCSCQF------------------------------CG 859
P FH+ C+ ++ P GKW C +C C
Sbjct: 508 PRVFHKECVSLENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCI 567
Query: 860 RINES---------TCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGA--VQYEPSS 906
RI +S C ++ S T+ IC CE+++H C + ++ P
Sbjct: 568 RIVKSQTDEAGGCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKAVPKG 627
Query: 907 LSFCGKKCQEIFERLEKLL-------------------GVKHDLEGG---YTWSLVHRFD 944
FC + C++I L K++ G K + G W L+H
Sbjct: 628 KWFCCRDCKDINSSLRKIVVRREEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRX 687
Query: 945 VSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNY 1004
S E + L+ ALS+ E F P+ D G +L+ +++++ ++ L +
Sbjct: 688 ASA----------TEAGSLLSQALSLFHEQFNPIADA-EGRDLLLDMVHS--NSTGELEF 734
Query: 1005 KGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLN 1064
G + AIL +++SAA+ R+ GKE+AE+P + TR + C
Sbjct: 735 GGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATR------------------SDCQGQ 776
Query: 1065 VEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
V L++PA W + F F +E + R+ ++ F G MLQKP+
Sbjct: 777 VNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPV 828
>gi|296084643|emb|CBI25766.3| unnamed protein product [Vitis vinifera]
Length = 126
Score = 138 bits (348), Expect = 2e-29, Method: Composition-based stats.
Identities = 63/125 (50%), Positives = 92/125 (73%), Gaps = 1/125 (0%)
Query: 944 DVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLN 1003
D + L + + CN+++AVA++VM+ECF P+ D + IN++ +++YN G+NF R++
Sbjct: 2 DPEAGVYLEESYDRTLCNSKIAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRIS 61
Query: 1004 YKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSL 1063
++GF+TAILE+ DE IS AS+RIHG +LAEMPFI TR YRRQGMC +LL IES + SL
Sbjct: 62 FEGFYTAILEKGDETISVASMRIHGNKLAEMPFIATRPSYRRQGMCHKLLVAIES-VSSL 120
Query: 1064 NVEKL 1068
++ L
Sbjct: 121 HINIL 125
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 137 bits (346), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 100/152 (65%)
Query: 962 ARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISA 1021
+L +A+ ++ ECF+ + + R+ ++ +I++N S +RLN++GF+ +L++ E++S
Sbjct: 6 GKLCIAVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSV 65
Query: 1022 ASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWT 1081
+ RI G++ AE+P IGTR +YRRQGMCR L+ +E L L VE+L++PA+ EL +TWT
Sbjct: 66 GTFRICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWT 125
Query: 1082 SVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQK 1113
FGF + S + ++ S+L F G M QK
Sbjct: 126 CSFGFTVMSNSERLELAGNSILSFQGTTMCQK 157
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 137 bits (344), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 112/423 (26%), Positives = 174/423 (41%), Gaps = 79/423 (18%)
Query: 755 SKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDT 813
S+F+ H+ ++ P+ N+Y +G SL + + Q S R+ +DD
Sbjct: 474 SQFEAHAGRAARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQ------------SDDL 521
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC-------SCQ---------- 856
C IC DGG L+ CD CP FH+ C+ + P G W C YC SC
Sbjct: 522 CSICSDGGQLLLCDTCPRAFHRECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAG 581
Query: 857 -----------FCGRINEST-----------CHVNDQDDSALST--LQICSLCEEKYHQS 892
F I +T C ++D ST + +C C +YH
Sbjct: 582 RVEGVDALEQIFTRSIRIATTPETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVG 641
Query: 893 CSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSL--------VHR 942
C + P +C C I + L+ LL H E T L V
Sbjct: 642 CLKEHNMADLTALPEGAWYCSTDCVRINQTLQDLL--NHGGEPVPTMDLDVIKKKREVKG 699
Query: 943 FDVSTDLSLSDVCQKV-------ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNF 995
F+ DL DV +V + L+ A+++ E F P+ +G +LI ++Y
Sbjct: 700 FNEDADL---DVRWRVLKDKSSDDSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVY-- 754
Query: 996 GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTG 1055
G + + +Y G + A+L ++ ++SA RI G E+AE+P + T + G + L +
Sbjct: 755 GRSARDQDYTGMYCAVLTVNNTVVSAGLFRIMGNEIAELPLVATSRDSQGLGYFQALFSC 814
Query: 1056 IESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSS-KQKMRNMSLLVFPGVDMLQKP 1114
IE L SL V+ ++PA E WT FGF + ++ ++ VF G L K
Sbjct: 815 IERLLASLEVKHFVLPAAEEAESIWTERFGFTKISQDELREYLKGGRTTVFQGTSNLHKL 874
Query: 1115 MMK 1117
+ K
Sbjct: 875 VAK 877
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 135 bits (340), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 108/443 (24%), Positives = 179/443 (40%), Gaps = 81/443 (18%)
Query: 742 GIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSL--LQCILDSWNKQDE-SKRK 797
GI C CC+ + + S+F+ H+ + P+ N++ +G SL L C++ + + S R
Sbjct: 489 GILCACCNGVISCSQFEAHAGRGSRRAPYDNIFTAAGVSLRKLACLMPASEAESPISHRP 548
Query: 798 GFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF 857
D + + G+ + H C+
Sbjct: 549 AALCAVADRRALEPELVTVSGE-----------AALHGGCV------------------- 578
Query: 858 CGRINESTCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE-----PSSLS-- 908
C V D T+ IC CE +YH C G +SL
Sbjct: 579 -------LCKVPDFLRGGFGERTMIICDQCEREYHIGCLAEHGRAHLTELPEGKASLYDI 631
Query: 909 ----------FCGKKCQEIFERL-EKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQK 957
C +C+ I R+ E++ V L+G Y+W ++ D + + +
Sbjct: 632 LLTLHLNGEWHCSPECKGIATRMRERVSSVPVPLQGEYSWQVLRGKDGTHATTWA----- 686
Query: 958 VECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDE 1017
L A ++ E F P+ D +G +L+ ++Y +Y G +TA+L R +
Sbjct: 687 ------LKAAQEILTESFDPILDLVTGADLMMAMVY--AQELGDWDYTGMYTAVLRRRGK 738
Query: 1018 IISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELR 1077
+ +A R+ G++LAE+P + TR RRQG R L+ E SL V+ L +PA
Sbjct: 739 AVCSAVFRVFGRQLAEVPLVATRLGARRQGHARVLMAAFEDYFRSLGVQSLCLPAAQSTV 798
Query: 1078 ETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMKNQFPRENM----ISAKGLR 1132
ETW FGF + + + +L+FPG ++LQKP+ N P M + A+
Sbjct: 799 ETWIHGFGFAAITPEEQAATCSELRVLIFPGTELLQKPL--NPTPTLEMAEPSVHAQDAP 856
Query: 1133 SSELENPRTADEVGKNSDEKYSA 1155
+ E P+ A EV +S +A
Sbjct: 857 AVTKEAPKAAKEVAGDSKGAVAA 879
Score = 75.5 bits (184), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 68/141 (48%), Gaps = 27/141 (19%)
Query: 724 YWNQRKTRVMLQGRIARDG-IRCDC--C----SEIFTISKFDTHSKSKLCHPFQNLYFES 776
++ R ++L G I +G I C C C + + S+F+ H+ S+ P +++Y +
Sbjct: 245 FFQSRHGDLLLNGSITEEGQIACPCKQCRAKKTPGVSCSEFEEHAGSRERRPGESIYLTN 304
Query: 777 GSSLLQ--CILDSWNKQDESKRKGFHFVNFDGE--DPNDDTCGICGDGGDLICCDGCPST 832
S L+ C L VN +G D + CG+C DGGDL+CCDGCP+
Sbjct: 305 LSISLKEFCAL----------------VNDEGRSADRHGSACGLCMDGGDLLCCDGCPTA 348
Query: 833 FHQNCLDIKKFPSGKWHCVYC 853
H C +++ P G W C C
Sbjct: 349 VHAYCAGLEEVPEGDWFCDAC 369
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 135 bits (339), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 56/92 (60%), Positives = 76/92 (82%)
Query: 1025 RIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVF 1084
RIHG +LAEMPFIGTR +YRRQGMC RLL IESAL SLNV +L+IPAI EL+ TWT+VF
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 1085 GFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMM 1116
GF+P+E S +QK++++++L+ G +L+K ++
Sbjct: 61 GFKPVEPSKRQKIKSLNILIIHGTGLLEKRLL 92
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 130 bits (327), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 103/156 (66%), Gaps = 1/156 (0%)
Query: 961 NARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIIS 1020
+A+L +AL V+ ECF+ L + R+ +L +I++N S +RL+++GF+ LE+ E+I+
Sbjct: 499 HAKLCMALDVLRECFVTLIEPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIA 558
Query: 1021 AASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETW 1080
A++R++G+E+AE+P +GTR RRQGMCR L+ I+ L + VE+L++PA+ E+ TW
Sbjct: 559 VATLRVYGEEVAEVPLVGTRFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATW 618
Query: 1081 TS-VFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
T FG + + + +Q + + ++L F G M K +
Sbjct: 619 TGPSFGIREMGQADRQDVAHHAILRFQGTIMCHKQL 654
Score = 77.8 bits (190), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 83/184 (45%), Gaps = 36/184 (19%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDG-IRCDCCSEI----------FT 753
TLL+ +ID + +++ Y +R G I DG ++C C I +T
Sbjct: 283 TLLSLLIDKKILAPRDQLIYTTKR-------GLITGDGMVKCMCGGCINNNNKRRVAEYT 335
Query: 754 ISKFDTHSKSKLC-----HPFQNLYFESGSSLLQCILDSWNKQDES----KRKGFHFVNF 804
+++F H + P+ ++ G SL QC++ DE+ K+K ++ +
Sbjct: 336 VAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKKYLPY 395
Query: 805 ---------DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSC 855
E+ +D C +C D G+L+ CD CPS FH C+ ++ P G W C C+C
Sbjct: 396 VWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPACTC 455
Query: 856 QFCG 859
CG
Sbjct: 456 AICG 459
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 130 bits (326), Expect = 8e-27, Method: Composition-based stats.
Identities = 61/169 (36%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 959 ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
E + +L +A V+ ECF+ L + + +L +I++N S +RL ++GF+ LE+ E+
Sbjct: 4 EEHGKLCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGEL 63
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
I+ ++R++GK++AE+P +GTR +RRQGMC L+ +E L VE+L++PA+ EL +
Sbjct: 64 ITVGTLRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQ 123
Query: 1079 TWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPMMKNQFP-RENMI 1126
TWT FGFQ + S K + +++ F G M K + P R N++
Sbjct: 124 TWTGSFGFQVMTQSQKLDIAQHTIMCFQGTTMCHKFITNTAAPQRRNIL 172
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/464 (24%), Positives = 196/464 (42%), Gaps = 100/464 (21%)
Query: 721 KVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESG 777
+V Y+ QRK + G I I C+ C+ + + S F+ H+ + P+ N++ +G
Sbjct: 586 EVAYYVDGQRK----VDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNG 641
Query: 778 SSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC 837
SL + + + S+R+ DD C CG GGD+ C CP +FH C
Sbjct: 642 VSLHELSMKISKDMELSERE------------TDDLCRECGQGGDIFPCKMCPRSFHPAC 689
Query: 838 LDIKKFPSGKWHCVYCS--------------CQFCGR-------------------INE- 863
+ + PS +W+C CS + GR I++
Sbjct: 690 VGLSGVPS-EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISDD 748
Query: 864 ----STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQ 915
+ C D ++S T+ +C CE++YH C ++ V + P FC C
Sbjct: 749 LGGCALCKQKDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCS 808
Query: 916 EIFERLEKLLG---------------VKHDLEGGYTWSLVHRFDVSTDLS---LSDVCQK 957
EI L+K++ KH+++G D +TDL L+
Sbjct: 809 EIRSSLDKIISDGALILAESDIDIIRKKHEMKG-------LSMDTNTDLRWRLLAGRKAS 861
Query: 958 VECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKR-------LNYKGFFTA 1010
+ + L+ A+ ++ + F P+ + +SG +LI ++ N +R +Y G + A
Sbjct: 862 EDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMV-----NGRRPKDGMPGQDYSGMYCA 916
Query: 1011 ILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLII 1070
+L ++SAA +R+ G E+AE+P + T + G + L + IE L SL ++ ++
Sbjct: 917 VLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFML 976
Query: 1071 PAISELRETWTSVFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
PA E W + FGF + + S + L +F G L K
Sbjct: 977 PAAQEAEGIWMNKFGFTKIPQEQSDAYLNGAHLTIFHGTSNLYK 1020
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 115/496 (23%), Positives = 201/496 (40%), Gaps = 86/496 (17%)
Query: 680 ALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIA 739
+LL R S S G G L+ ++ GT +V Y+ + ++ G I
Sbjct: 685 SLLKRMSASKGASTGKVTTKDTGLHKLVFKVLLDGT-----EVNYYVDGQKKI--DGYIK 737
Query: 740 RDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKG 798
I C+ C+++ + S F+ H+ + P+ N++ +G SL + + S+R+
Sbjct: 738 DQRIYCNHCNKVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELSMSISKDMQLSERE- 796
Query: 799 FHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS---- 854
DD C CG GGD+ C CP +FH C+ + PS +W C C+
Sbjct: 797 -----------TDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTILVQ 845
Query: 855 ---------------------------------CQFCGRINE-STCHVNDQDDSALS--T 878
C + + C D +++ T
Sbjct: 846 KEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICDDLGGCALCKKKDFNNAVFDERT 905
Query: 879 LQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYT 936
+ +C CE++YH C +++ V + P FC C EI L+K++ EG +
Sbjct: 906 VILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMIS-----EGAHP 960
Query: 937 WS-----LVHR--------FDVSTDLSLSDVCQKVEC---NARLAVALSVMDECFLPLPD 980
S ++ + D +T++ V + N+ L+ A+ V+ + F P+ +
Sbjct: 961 LSESDVDIIRKKHESKGLVMDANTEIRWQLVAGRSATEDGNSLLSSAVPVIHQSFDPIIE 1020
Query: 981 HRSGINLIHNILYN--FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIG 1038
+G +LI +++ +Y G + A+L ++SAA +R+ G ++AE+P +
Sbjct: 1021 AHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVA 1080
Query: 1039 TRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMR 1098
T + G + L + IE L SL V+ ++PA E W FGF + +
Sbjct: 1081 TSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAYL 1140
Query: 1099 NMS-LLVFPGVDMLQK 1113
N L VF G L K
Sbjct: 1141 NGGHLTVFHGTLNLYK 1156
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 125 bits (313), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/464 (24%), Positives = 196/464 (42%), Gaps = 100/464 (21%)
Query: 721 KVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESG 777
+V Y+ QRK + G I I C+ C+ + + S F+ H+ + P+ N++ +G
Sbjct: 577 EVAYYVDGQRK----VDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNG 632
Query: 778 SSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC 837
SL + + + S+R+ DD C CG GGD+ C CP +FH C
Sbjct: 633 VSLHELSMKISKDMELSERE------------TDDLCRECGQGGDIFPCKMCPRSFHPAC 680
Query: 838 LDIKKFPSGKWHCVYCS--------------CQFCGR-------------------INE- 863
+ + PS +W+C CS + GR I++
Sbjct: 681 VGLSGVPS-EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISDD 739
Query: 864 ----STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQ 915
+ C D ++S T+ +C CE++YH C ++ V + P FC C
Sbjct: 740 LGGCALCKQKDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCS 799
Query: 916 EIFERLEKLLG---------------VKHDLEGGYTWSLVHRFDVSTDLS---LSDVCQK 957
EI L+K++ KH+++G D +TDL L+
Sbjct: 800 EIRSSLDKIISDGALILAESDIDIIRKKHEMKG-------LSMDTNTDLRWRLLAGRKAS 852
Query: 958 VECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKR-------LNYKGFFTA 1010
+ + L+ A+ ++ + F P+ + +SG +LI ++ N +R +Y G + A
Sbjct: 853 EDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMV-----NGRRPKDGMPGQDYSGMYCA 907
Query: 1011 ILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLII 1070
+L ++SAA +R+ G E+AE+P + T + G + L + IE L SL ++ ++
Sbjct: 908 VLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFML 967
Query: 1071 PAISELRETWTSVFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
PA E W + FGF + + S + L +F G L K
Sbjct: 968 PAAQEAEGIWMNKFGFTKIPQEQSDAYLNGAHLTIFHGTSNLYK 1011
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 124 bits (311), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/454 (23%), Positives = 191/454 (42%), Gaps = 76/454 (16%)
Query: 721 KVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESG 777
+V Y+ QRK + G I I C+ C+ + + S F+ H+ + P+ N++ +G
Sbjct: 410 EVAYYVDGQRK----VDGYIKDHRIYCNHCNRVVSPSAFEAHAGEGSRRKPYDNIFTSNG 465
Query: 778 SSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC 837
SL + + + S+R+ DD C CG GGD+ C CP +FH C
Sbjct: 466 VSLHELAMKISKDMELSERE------------TDDLCRECGQGGDIFPCKICPRSFHPAC 513
Query: 838 LDIKKFPSGKWHCVYC--------------SCQFCGR-------------------INE- 863
+ + K P+ +W+C C + + GR I++
Sbjct: 514 VGLSKVPA-EWYCDSCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVPISDD 572
Query: 864 ----STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQ 915
+ C D +++ T+ +C CE++YH C Q+ V+ + P FC C
Sbjct: 573 LGGCALCKQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEEEWFCCSSCS 632
Query: 916 EIFERLEKLLGVKHDLEGGYTWSLVHR--------FDVSTDLS---LSDVCQKVECNARL 964
E L+K++ L ++ + D S DL LS + + L
Sbjct: 633 ETRSSLDKIISDGAQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSGKRATEDGSILL 692
Query: 965 AVALSVMDECFLPLPDHRSGINLIHNILYNFG--SNFKRLNYKGFFTAILERDDEIISAA 1022
+ A+ + + F P+ + +G +LI ++ G +Y G + A+L ++SAA
Sbjct: 693 SAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAA 752
Query: 1023 SIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS 1082
+R+ G ++AE+P + T + G + L + IE L SL ++ ++PA E W +
Sbjct: 753 LLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMN 812
Query: 1083 VFGFQPLEVSSKQKMRNMS-LLVFPGVDMLQKPM 1115
FGF + + N + L +F G + K +
Sbjct: 813 KFGFSRISPEELEAYLNGAHLTIFHGTSYMYKAV 846
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/429 (24%), Positives = 182/429 (42%), Gaps = 83/429 (19%)
Query: 731 RVMLQGRIARDGIRCDCCS---EIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILD 786
+ +L+G IA IRC C S + S F+ H+ + P+ N++ +G SL + +
Sbjct: 326 KAVLKGVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMK 385
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG 846
S+R+ DD C CG GGD+ C CP +FH C+ + PS
Sbjct: 386 ISKDMQLSERE------------TDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVPS- 432
Query: 847 KWHCVYCS--------------CQFCGR---------INESTCHV----------NDQDD 873
+W+C CS + GR I + + D ++
Sbjct: 433 EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELGQKDFNN 492
Query: 874 SALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLG--- 926
S T+ +C CE++YH C Q+ V + P FC C EI L+K++
Sbjct: 493 SVFDERTVILCDQCEKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGA 552
Query: 927 ------------VKHDLEGGYTWSLVHRFDVSTDLS---LSDVCQKVECNARLAVALSVM 971
KH+++G D +TDL L+ + + L+ A+ ++
Sbjct: 553 LILAESDIDIIRKKHEMKG-------LSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPII 605
Query: 972 DECFLPLPDHRSGINLIHNIL--YNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGK 1029
+ F P+ + +SG +LI ++ +Y G + A+L ++SAA +R+ G
Sbjct: 606 HQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGG 665
Query: 1030 ELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL 1089
E+AE+P + T + G + L + IE L SL ++ ++PA E W + FGF
Sbjct: 666 EVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFT-- 723
Query: 1090 EVSSKQKMR 1098
++ +Q +R
Sbjct: 724 KIPQEQLLR 732
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/468 (23%), Positives = 187/468 (39%), Gaps = 108/468 (23%)
Query: 721 KVQYW--NQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESG 777
+V Y+ QRK + G I I C+ CS + + S F+ H+ + P+ N++ +G
Sbjct: 747 EVAYYVDGQRK----VDGYIKDQRIYCNHCSRVVSPSAFEAHAGEGSRRKPYDNIFTSNG 802
Query: 778 SSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC 837
SL + + + S+R+ DD C CG GGD+ C CP +FH C
Sbjct: 803 VSLHELSMKISKDMELSERE------------TDDLCRECGLGGDIFPCKMCPRSFHPAC 850
Query: 838 LDIKKFPSGKWHCVYCS-------------------------------------CQFCGR 860
+ + +FPS +W C CS C
Sbjct: 851 VRLSEFPS-EWFCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKRAIRIVPICDD 909
Query: 861 INE-STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQ 915
+ + C D +++ T+ +C CE++YH C +T V + P FC C
Sbjct: 910 LGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELPDGEWFCCSSCS 969
Query: 916 EIFERLEKLL----------------------GVKHDLEGGYTWSLVHRFDVSTDLSLSD 953
EI L+K++ G+ D + W L+ + D SL
Sbjct: 970 EIRSCLDKMISDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAGRSATEDGSL-- 1027
Query: 954 VCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRL-------NYKG 1006
L+ A+ ++ + F P+ + +G +LI ++ N +R +Y G
Sbjct: 1028 ---------LLSSAVPIIHQSFDPIIEANTGRDLIPEMV-----NGRRPKEGMPGQDYSG 1073
Query: 1007 FFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVE 1066
+ A++ ++SAA +RI G ++AE+P + T + G + L + +E L SL ++
Sbjct: 1074 MYCAVITLGSTVVSAALLRIMGGDVAELPLVATSMDLQGLGYFQVLFSCMERMLISLKIK 1133
Query: 1067 KLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMS-LLVFPGVDMLQK 1113
++PA E W FGF + + N + L VF G L K
Sbjct: 1134 HFMLPAAQEAEAIWMKKFGFSRIPQEQLEAYLNGAHLTVFHGTSNLYK 1181
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 120 bits (302), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 128/252 (50%), Gaps = 18/252 (7%)
Query: 878 TLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLEKLLG---------VK 928
T+ +C C +YH C + G + P + +C CQ I + L +L+ +
Sbjct: 947 TMLVCDQCGREYHVGCLRESGILDELPEAEWYCQPNCQHIVQVLSQLVANGPELLSDNIV 1006
Query: 929 HDL--EGGYTWSLVHRFDVSTDLSLSDVCQKV---ECNAR-LAVALSVMDECFLPLPDHR 982
+DL + +V + S+ + + N R LA A+ + EC P+ D
Sbjct: 1007 NDLLESRQHQQGIVEMAESSSPVFGWQILHGAGENPVNGRTLAQAVEIFTECSDPIKDAP 1066
Query: 983 SGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHM 1042
SG N+I ++Y+ FK ++ G + +L +++++S A ++I G+E+AE+P I T
Sbjct: 1067 SGQNMIPIMVYS--RRFKDYDFDGIYCVVLTLNEKVVSTALLQIFGREVAEVPLIATSVD 1124
Query: 1043 YRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRN-MS 1101
++ QG C+ L+T IE L LNVE+L++PA W + FGF +E + + +R+ M
Sbjct: 1125 HQDQGFCKALMTTIERLLGVLNVERLVLPASKNAEFVWVNRFGFSRMEDAQLKHIRSMMG 1184
Query: 1102 LLVFPGVDMLQK 1113
LLVF G ML K
Sbjct: 1185 LLVFTGTTMLVK 1196
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 58/136 (42%), Gaps = 20/136 (14%)
Query: 734 LQGRIARDGIRCDC--C--SEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDSWN 789
L G + G+ C+C C +++ +IS F+ HS S HP N+Y E+G +L D +
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNL----RDILS 625
Query: 790 KQDESKRKGFHFVN--------FDGEDPNDDTCGICG--DGGDLICCD--GCPSTFHQNC 837
ES G + G + CG CG +GGD + C C + +H C
Sbjct: 626 AGQESADCGGDILGALKHAIGEIQGIPKKEGACGKCGKREGGDFVSCKEPKCSAVYHAEC 685
Query: 838 LDIKKFPSGKWHCVYC 853
+ + W C C
Sbjct: 686 VGLPSPHRVDWFCAKC 701
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 107/441 (24%), Positives = 177/441 (40%), Gaps = 119/441 (26%)
Query: 734 LQGRI--ARDGIRCDCCSEIF---------------TISKFDTH----SKSKLCHPFQNL 772
LQG I + GI CDCC E+ + SKF+ H S+ K P+ ++
Sbjct: 116 LQGEINIKQSGILCDCCKEVILEFFFCQIKKLDEQVSPSKFEAHAGWASRRK---PYFHI 172
Query: 773 YFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPST 832
G SL Q ++ H ++ +D+ C C G+L+CCDGC
Sbjct: 173 RTTDGVSLHQLAIN-------------HRISISN---SDEHCSKCKQRGNLLCCDGCQRA 216
Query: 833 FHQNCLDIKKFPSGKWHCVYCS------------------------------CQFCG--- 859
FH C+ ++ P KW+C YC + C
Sbjct: 217 FHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKIIESDPSEQIAKICTLSV 276
Query: 860 ---RINESTC------HVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE--PSSLS 908
+ S+C H N+ + S T+ IC CE+ YH C + + P
Sbjct: 277 KHKEVEHSSCALCSERHFNNGEFSPW-TVMICDQCEKDYHVGCLKDHNMANLKKVPKHYW 335
Query: 909 FCGKKCQEIFERLEKLL----------------------GVKHDLEGGYTWSLVHRFDVS 946
FCG C +I +L+ + G++ + W + +R
Sbjct: 336 FCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEFGLDIKWKVFNR---- 391
Query: 947 TDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFK-RLNYK 1005
+ K+ ++ L+ +++ E F + + I+LI ++ G K + +
Sbjct: 392 -----QLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVK--GRKIKDKYYFG 444
Query: 1006 GFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNV 1065
G + A+L + ++SA R+ GKE+AE+ I T+ Y++QG + LL+ IE+ L L V
Sbjct: 445 GMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIATKAEYQKQGFFKCLLSCIENVLKELKV 504
Query: 1066 EKLIIPAISELRETWTSVFGF 1086
E+L++PA E W FGF
Sbjct: 505 ERLVLPAAHEAESMWIDKFGF 525
>gi|224099799|ref|XP_002311623.1| predicted protein [Populus trichocarpa]
gi|222851443|gb|EEE88990.1| predicted protein [Populus trichocarpa]
Length = 80
Score = 118 bits (296), Expect = 2e-23, Method: Composition-based stats.
Identities = 51/80 (63%), Positives = 67/80 (83%)
Query: 917 IFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFL 976
+FE+L+K LGVKH+L+ G++WSL+HR D +D SL + Q+VECN++LAV+LSVMDECFL
Sbjct: 1 LFEQLQKYLGVKHELDAGFSWSLIHRVDADSDASLQGLPQRVECNSKLAVSLSVMDECFL 60
Query: 977 PLPDHRSGINLIHNILYNFG 996
P+ D RSGINLI N+LYN G
Sbjct: 61 PVVDRRSGINLIQNVLYNCG 80
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 115 bits (287), Expect = 3e-22, Method: Composition-based stats.
Identities = 54/133 (40%), Positives = 80/133 (60%), Gaps = 12/133 (9%)
Query: 997 SNFKRLNYKGFFTAILERDDEIISAAS------------IRIHGKELAEMPFIGTRHMYR 1044
S FKRL+++GF+ L++DDE + AA+ +RIHG ++AEMP + T YR
Sbjct: 56 SKFKRLDFRGFYIMALQKDDEFVCAATAFMNCVYEYLHGLRIHGHKVAEMPLVATAFKYR 115
Query: 1045 RQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMSLLV 1104
RQGMC+ L+ +E L L+VE+L++PAISE E W S+FGF + + + ++ L
Sbjct: 116 RQGMCQVLVHELEKMLSQLHVERLVLPAISERSELWQSLFGFSEMSSAERLELLRFPFLG 175
Query: 1105 FPGVDMLQKPMMK 1117
F G M QK + K
Sbjct: 176 FQGTTMFQKILSK 188
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 110 bits (275), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 87/154 (56%), Gaps = 14/154 (9%)
Query: 962 ARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISA 1021
+L +AL V+ E F+ + + R+ +L +I++ S ++LN++GF+T
Sbjct: 6 GKLCIALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYT------------ 53
Query: 1022 ASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWT 1081
I + GK+ AE+P IGTR YRRQGMCR L+ +E L L VE+L++P + +L ETWT
Sbjct: 54 --ILVCGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWT 111
Query: 1082 SVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
FGF + S + + +L F G M QK +
Sbjct: 112 GSFGFTEMSYSDRFQYAANIILSFQGTTMCQKIL 145
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 110 bits (275), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 146/345 (42%), Gaps = 53/345 (15%)
Query: 733 MLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQ 791
+L G +GI C CC + S+F++H+ S P++++Y +G +L + N Q
Sbjct: 19 ILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLANGQ 78
Query: 792 DESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCV 851
+ + G DD C CGDGGDL+ C CP FH CLD+ P G WHC
Sbjct: 79 NITTGIG------------DDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCP 126
Query: 852 YCSCQFCG---------RINE------------STCHVNDQDDSALS--TLQICSLCEEK 888
C+ G R+ + C +D T+ +C CE++
Sbjct: 127 NCNKLGHGGNFARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVILCDQCEKE 186
Query: 889 YHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVS 946
+H C + G + P FC + C I+ L + ++++R V
Sbjct: 187 FHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLNIINRKHVE 246
Query: 947 TDLSLSDVCQKVECN------------ARLAVALSVMDECFLPLPDHRSGINLIHNILYN 994
L + + V+ + L+ A ++ ECF P+ ++G +LI ++Y
Sbjct: 247 KGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIFRECFDPIVA-KTGRDLIPVMVY- 304
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGT 1039
G N + G + +L ++SA +RI G+E+AE+P + T
Sbjct: 305 -GRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVAT 348
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 109 bits (272), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 177/439 (40%), Gaps = 95/439 (21%)
Query: 729 KTRVMLQGRIARDGIRCDCC----SEIFTISKFDTHSKSKLC-HPFQN-LYFESGSSLLQ 782
K+++ L+G GI C CC S + +F+ H+ + +P+ N L G SL
Sbjct: 512 KSQLKLKGVKRGKGICCSCCNKEASSDISCFEFEQHAGCEARRNPYGNILVLVDGRSLKD 571
Query: 783 CILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK 842
D +K +++ N + + + C C G+L C+GC + NC ++
Sbjct: 572 VCKDLTHKNKLGEQQ-----NCEPLARDVNCCYECSSSGELKTCNGCEEAWCDNCTKGEE 626
Query: 843 FPS-GKWHCVYC---------------------------------------------SCQ 856
S KW+C C C
Sbjct: 627 VDSDSKWYCRMCRNDTLKVAQNGQKVSGKHQEESSSITEIDERGRCIRHLEGHREVGGCA 686
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCGKKC 914
C + N S D T+ +C C +YH SC + G + P FC K C
Sbjct: 687 ICKKWNLSKTGFVDG-----MTILVCDQCGREYHVSCLKDSGMDNLNELPEGEWFCQKGC 741
Query: 915 QEIFERLEKLLGVKHDLEGGYTWSLVHRF-------------------DVSTDLSLSDVC 955
+ I E L +L+ + + SL H +S +C
Sbjct: 742 KVIDEILTQLVAIGPE-------SLSHSIISELPENRQQKSGVIEKAESISPSFEWQILC 794
Query: 956 QKVECNAR---LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAIL 1012
K A LA A+++ EC P+ D ++G NLI ++ + K +++G F +L
Sbjct: 795 GKGSSPANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMVQS--RRTKDYDFEGVFCVVL 852
Query: 1013 ERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPA 1072
+ + +++SAA ++I G+E AE+P + T ++ QG C+ L+T IE L L+VE+L++P
Sbjct: 853 KLNGKVVSAALLQIFGREFAEVPLVATSLPHQGQGFCKALMTTIERLLGVLSVERLVLPT 912
Query: 1073 ISELRETWTSVFGFQPLEV 1091
+ W + FGF + V
Sbjct: 913 AKDTESLWVNKFGFSRVPV 931
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 58/134 (43%), Gaps = 12/134 (8%)
Query: 732 VMLQGRIARDGIRCDC--C--SEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDS 787
+ L G + G++CDC C S I +IS F+ HS S HP N+Y E+G +L +
Sbjct: 314 IQLTGMLKDMGVQCDCRNCRGSVIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILSAG 373
Query: 788 WNKQDES----KRKGFHFVNFDGEDPNDDTCGICGDG--GDLICCDG--CPSTFHQNCLD 839
D + + G + + C CG+ GDLI C G C H C++
Sbjct: 374 QEAADCGDNILRALKMAIGDVQGVEKSKSKCAKCGNPEEGDLIYCKGARCSVVAHSGCVE 433
Query: 840 IKKFPSGKWHCVYC 853
I G W C C
Sbjct: 434 IANPHLGDWFCGKC 447
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 106 bits (265), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 113/261 (43%), Gaps = 46/261 (17%)
Query: 847 KWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLC-EEKYHQSCSQTDGAVQYEPS 905
W + C CG D + L +CS C H C + V EP
Sbjct: 8 PWWSMPEICTHCGG--------GDVEPEGRRVLVLCSACFAAGTHTGCHED---VTGEPL 56
Query: 906 S--------LSFCGKKCQEIFERLEKLLGVK---HDLEGGYTWSLVHRFDVSTDLSLSDV 954
S L FCGK+CQ +E LE G + D YT+ LVH
Sbjct: 57 SSEITHGDGLYFCGKECQRSYEALEAATGRRSRIRDEPEQYTFELVHY------------ 104
Query: 955 CQKVECNARLAV--ALSVMDECFLPLPDHRSGINLIHNILYNF-------GSNFKRLNYK 1005
++ + R AV A+ + F PL +G +L+ + + N+
Sbjct: 105 -KQDDRTVRSAVETAMRMFRTSFAPL-IMENGRDLLEMVCTAYETPDEEVEEEGGGHNFS 162
Query: 1006 GFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNV 1065
F AIL II+AA++R+ G + AEMPF+ TR +RR G C+RL+ +E L + V
Sbjct: 163 AFRLAILRMGGTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGV 222
Query: 1066 EKLIIPAISELRETWTSVFGF 1086
L+IP+I+EL WT+ FGF
Sbjct: 223 HCLVIPSINELLPMWTNKFGF 243
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 106 bits (265), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 141/327 (43%), Gaps = 60/327 (18%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF------------ 857
+D+ C C G+L+ CD C +FH C+ ++ P K +C YC +F
Sbjct: 257 SDERCSHCEQVGNLLWCDRCQRSFHLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKEND 316
Query: 858 --CGRINE--------STCHVNDQ-------------------DDSALSTLQICSLCEEK 888
GRI E C ++++ ++S T+ IC CE++
Sbjct: 317 VATGRIAEGDPSEQITEVCTLSEKQKEVKDGPCALCSERDFNNNESGPRTVMICKQCEKE 376
Query: 889 YHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGV-KHDLEGGYTWSLVHRFDV 945
+H C + P FCG C +I +L+KL+ + +L W L++
Sbjct: 377 FHVECLKDHNMANLVELPKDKWFCGIDCDDIHMKLQKLMARGEAELGLDIKWRLLNTKLN 436
Query: 946 STDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYK 1005
+ ++S + K A ++ E F + D ++ I+LI +LY + +++
Sbjct: 437 NPKHNISPLISK---------ANAIFHERFKSIKDPKTKIDLIRAMLYGMEIE-GQYSFE 486
Query: 1006 GFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQ------GMCRRLLTGIESA 1059
G + A+L I A R+ G+E+AE+P + T Y+++ G R L + IE+
Sbjct: 487 GMYCAVLYFKKVIACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSGYFRSLFSCIENM 546
Query: 1060 LCSLNVEKLIIPAISELRETWTSVFGF 1086
L L V+ L++PA E W FGF
Sbjct: 547 LRHLKVKTLVLPAAHEAESMWIDKFGF 573
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 105 bits (261), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 142/341 (41%), Gaps = 76/341 (22%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD-IKKFPSGKWHCVYC----------------- 853
++C CG G+L C GC T+ +C ++ GKW+C C
Sbjct: 691 NSCYECGTRGELKNCHGCVETWCNSCTKGLETDSDGKWYCRMCRQDTLNVAQIEQKRSNK 750
Query: 854 ----------------------------SCQFCGRINESTCHVNDQDDSALSTLQICSLC 885
C C + N S D T+ +C C
Sbjct: 751 HIEGMSNIAETDERDRCVRHLEGHREVGGCAICKKWNLSKTGFVDG-----MTILVCDQC 805
Query: 886 EEKYHQSCSQTDGA--VQYEPSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF 943
+YH SC + G + P FC K C+ I E L +L+ +L T S++
Sbjct: 806 GREYHVSCLKDSGVDDLNELPEGEWFCQKDCKVIDEILTQLVANGPEL---LTDSIISEL 862
Query: 944 DVS----------TDLSLSDVCQKVEC--------NARLAVALSVMDECFLPLPDHRSGI 985
S + S ++ C LA A+++ EC P+ D ++G
Sbjct: 863 LESRQQQTGAKDKAESSCPSFAWQILCGKSGNTANTQTLAEAINIFTECSDPIRDAKTGK 922
Query: 986 NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRR 1045
NLI ++ + S K +++G F +L+ +++++SAA ++I G E+AE+P + T ++
Sbjct: 923 NLIPLMVQSRRS--KDHDFEGVFCIVLKLNEKVVSAALLQIFGGEIAEVPLVATSLTHQG 980
Query: 1046 QGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
QG C+ L+T IE L L+VE+L++P W + FGF
Sbjct: 981 QGFCKALMTTIERLLGVLSVERLVLPTAKNTESIWINKFGF 1021
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 57/134 (42%), Gaps = 12/134 (8%)
Query: 732 VMLQGRIARDGIRCDC----CSEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDS 787
+ L G + G+RCDC S I +IS F+ HS S HP N+Y E+G +L +
Sbjct: 415 IQLTGMLKDMGVRCDCRNCKSSVIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILSAG 474
Query: 788 WNKQD--ESKRKGFHFV--NFDGEDPNDDTCGIC--GDGGDLICCDG--CPSTFHQNCLD 839
D ++ + + G + TC C D GDLI C G C H C+
Sbjct: 475 QEAADCGDNILRALKMAIGDIQGVEKWKVTCAKCWNSDEGDLIYCKGARCSIIAHSRCIG 534
Query: 840 IKKFPSGKWHCVYC 853
I G W C C
Sbjct: 535 ISNPRLGDWFCDKC 548
>gi|255577029|ref|XP_002529399.1| hypothetical protein RCOM_0623590 [Ricinus communis]
gi|223531147|gb|EEF32995.1| hypothetical protein RCOM_0623590 [Ricinus communis]
Length = 275
Score = 104 bits (259), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 91/148 (61%), Gaps = 10/148 (6%)
Query: 917 IFERLEKLLGVKHDL-EGGYTWSLV---HRFDVSTDLSLSDVCQKVECNARLAVALSVMD 972
IF L +LLG + + TW+L+ H D D S +E ++L++AL VM
Sbjct: 11 IFLGLHELLGKQIPVGLNNLTWTLLKSNHSSDHKPDAS------DIENYSKLSIALHVMH 64
Query: 973 ECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELA 1032
ECF P+ + R+ + + ++++ S RLN++GF+T +L++DDE I+ A++R++G+++A
Sbjct: 65 ECFQPVEEPRTKGDFLKDVIFRKRSELNRLNFRGFYTVLLQKDDEFITVATVRVYGEKVA 124
Query: 1033 EMPFIGTRHMYRRQGMCRRLLTGIESAL 1060
E+P +GTR YRR GMC L+ +E L
Sbjct: 125 EIPLVGTRVQYRRLGMCGILMNVLEKNL 152
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 104 bits (259), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 100/406 (24%), Positives = 166/406 (40%), Gaps = 89/406 (21%)
Query: 768 PFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCD 827
PFQ++Y +G SL + L D+ R H NDD C IC DG C
Sbjct: 615 PFQHIYTTNGVSLHE--LSVALSMDQ--RFSIH--------ENDDLCSICRDG----VCA 658
Query: 828 GCPSTFHQNCLDIKKFPSGKWHCVYC--------------SCQFCGRIN--ESTCHVNDQ 871
PS PS +W C YC + GR+ ++ + ++
Sbjct: 659 SLPS-----------LPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNR 707
Query: 872 DDSALS--------------------------TLQICSLCEEKYHQSC-SQTDGA-VQYE 903
+S T+ IC CE+++H C + D A ++
Sbjct: 708 CIRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKEL 767
Query: 904 PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHR-----------FDVSTDLSLS 952
P FC C+EI L L+ + + + + + + D+
Sbjct: 768 PEEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWR 827
Query: 953 DVCQKV----ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFF 1008
+ K+ + LA ALS++ E F P+ + + +LI ++Y G K ++ G +
Sbjct: 828 VLSGKLTSSDDTKILLAKALSILHERFDPISESGTKGDLIPAMVY--GRQTKAQDFSGMY 885
Query: 1009 TAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKL 1068
+L D+ I+S R+ G ELAE+P + T + QG + L IE L LNV+ +
Sbjct: 886 CTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHI 945
Query: 1069 IIPAISELRETWTSVFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
++PA E + WT FGF + + K+ ++ S+++F G ML+K
Sbjct: 946 VLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRK 991
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 102 bits (254), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 148/366 (40%), Gaps = 58/366 (15%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSC----------- 855
E DD C CG GGD+ C CP +FH C+ + PS +W C C+
Sbjct: 19 ERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTILVQKEKALAAN 78
Query: 856 ----------------QFCGRINESTCHVNDQDDSALS-------------TLQICSLCE 886
Q R +D AL T+ +C CE
Sbjct: 79 KNAKAAGRQAGVDSIEQILKRAIRIVPICDDLGGCALCKKKDFNNAVFDERTVILCDQCE 138
Query: 887 EKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLL-GVKHDLEGGYTWSLVHR- 942
++YH C +++ V + P FC C EI L+K++ G H L ++ +
Sbjct: 139 KEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISGGAHPLSESDV-DIIRKK 197
Query: 943 -------FDVSTDLSLSDVCQKVEC---NARLAVALSVMDECFLPLPDHRSGINLIHNIL 992
D +T++ V + N+ L+ A+ V+ + F P+ + +G +LI ++
Sbjct: 198 HESKGLVMDANTEIRWQLVAGRSATEDGNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMV 257
Query: 993 YNF--GSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCR 1050
+ +Y G + A+L ++SAA +R+ G ++AE+P + T + G +
Sbjct: 258 HGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVATSMDLQGLGYFQ 317
Query: 1051 RLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMS-LLVFPGVD 1109
L + IE L SL V+ ++PA E W FGF + + N L VF G
Sbjct: 318 VLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAYLNGGHLTVFHGTL 377
Query: 1110 MLQKPM 1115
L K +
Sbjct: 378 NLYKAI 383
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 101 bits (252), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 152/388 (39%), Gaps = 75/388 (19%)
Query: 768 PFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCD 827
P L S +L +LD+ N+++ S R+ +DD C IC DGG+L+ CD
Sbjct: 212 PGSKLALRSLKPMLAVLLDA-NRRNMSNRQ------------SDDLCSICSDGGELLLCD 258
Query: 828 GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSA------------ 875
CP FH+ C+ P G W C YC + + ES+ N +A
Sbjct: 259 SCPRAFHRECVGFTTIPRGTWCCRYCENR---QQRESSLAYNHNAIAAGRIDGIDPMEQI 315
Query: 876 ------LSTLQI-----CSLCE----EKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFER 920
++T + C+LC K S Q P +C C I E
Sbjct: 316 FTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLLCDQALPEGAWYCTADCVRISET 375
Query: 921 LEKLLGVKHDLEGGYTWSLVHR--------FDVSTDLSLSDVCQKVECNARLAV--ALSV 970
L+ LL + ++ R D D+ + K +++L + A+++
Sbjct: 376 LKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGDLDVRWRVLKDKSSADSKLVLSKAVAI 435
Query: 971 MDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKE 1030
E F P+ +G +LI ++Y + ++SA R+ G E
Sbjct: 436 FHESFDPIIQIATGRDLIPAMVYG---------------------NTVVSAGLFRVMGSE 474
Query: 1031 LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL- 1089
+AE+P + T + G + L IE L SL V+ ++PA E WT FGF +
Sbjct: 475 IAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKIT 534
Query: 1090 EVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
+ ++ ++ VF G L K + K
Sbjct: 535 QDELREYLKGGRTTVFQGTSTLHKLVPK 562
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 100 bits (248), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 100/428 (23%), Positives = 169/428 (39%), Gaps = 90/428 (21%)
Query: 755 SKFDTHSK-SKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDT 813
S F+ H+ + PFQ++Y +G SL + L D+ R H NDD
Sbjct: 619 SSFEAHAGCASRRKPFQHIYTTNGVSLHE--LSVALSMDQ--RFSIH--------ENDDL 666
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC--------------SCQFCG 859
C IC DG C + PS +W C YC + G
Sbjct: 667 CSICRDG---------------VCASLSSLPSERWSCKYCVNMVEREKFVDSNLNAIAAG 711
Query: 860 RIN--ESTCHVNDQDDSALS--------------------------TLQICSLCEEKYHQ 891
R+ ++ + ++ +S T+ IC CE+++H
Sbjct: 712 RVQGVDAIAEITNRCIRVVSSFGTELPSVCVLCRGHSFCRLGFNSRTVIICDQCEKEFHV 771
Query: 892 SCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFD----- 944
C + + P FC C++I L L+ + + + +
Sbjct: 772 GCLKEHNIADLKELPEEKWFCSVDCEKINTTLGNLIIRGEEKLTNNILNFIRTKEKPNEE 831
Query: 945 -VSTDLSLSDVCQKV---------ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYN 994
D + D+ +V E LA A+S++ E F P+ + + +LI ++Y
Sbjct: 832 SCPDDNTTPDIRWRVLSGKLTSSDETKILLAKAVSILHERFDPISETGTRGDLIPAMVY- 890
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
G K ++ G + +L D+ I+S R+ G ELAE+P + T + QG + L
Sbjct: 891 -GRQAKGQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFA 949
Query: 1055 GIESALCSLNVEKLIIPAISELRETWTSVFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
IE L LNV+ +++PA E + WT FGF + + K+ ++ S+++F G ML+K
Sbjct: 950 CIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRK 1009
Query: 1114 PMMKNQFP 1121
+ P
Sbjct: 1010 SVPAPSAP 1017
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 100 bits (248), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 129/283 (45%), Gaps = 33/283 (11%)
Query: 855 CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCGK 912
C C + N S D T+ +C C +YH SC + G + P FC K
Sbjct: 1038 CAICKKWNLSKTGFVDG-----MTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHK 1092
Query: 913 KCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVEC-NAR-------- 963
C+ I E L +L+ +L T S + + + LS +K+E N R
Sbjct: 1093 DCKVIDEILAQLVANGPELLSNSTISGL----LESRQQLSSAKEKIESSNPRFEWQILCG 1148
Query: 964 ----------LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILE 1013
LA A ++ +C P+ D ++G NLI ++ + K +++G F +L+
Sbjct: 1149 KGSSPADVQTLAEAENIFTDCSDPIRDVKTGKNLIPLMVQS--RRTKDHDFEGVFCVVLK 1206
Query: 1014 RDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAI 1073
+ +++SAA ++I G+E+AE+P I T + Q C+ L+T IE L L+VE+L++P
Sbjct: 1207 LNGKVVSAALLQIFGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTA 1266
Query: 1074 SELRETWTSVFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQKPM 1115
W + FGF + E K + L F G M+ K +
Sbjct: 1267 KSTESVWINKFGFSRVQEDQLKSICTTIRLTTFTGTSMVVKAI 1309
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 57/134 (42%), Gaps = 12/134 (8%)
Query: 732 VMLQGRIARDGIRCDC--C--SEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDS 787
+ L G + G++C+C C S I +IS F+ HS S HP N+Y E+G +L +
Sbjct: 591 IQLTGMLKDMGVQCNCRNCKGSVIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILSAG 650
Query: 788 WNKQD--ESKRKGFHFV--NFDGEDPNDDTCGICGDG--GDLICCDG--CPSTFHQNCLD 839
D ++ + + G + TC C GDLI C G C H C+
Sbjct: 651 QEAADCGDNILRALKMAIGDIQGVEKRKVTCAKCESSQEGDLIYCKGARCSVVAHSRCVG 710
Query: 840 IKKFPSGKWHCVYC 853
I G W C C
Sbjct: 711 IANPQLGDWFCGKC 724
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 26/257 (10%)
Query: 878 TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGY 935
T+ IC CE +YH C + + P FC C I L+ LL L G
Sbjct: 13 TVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL-----LRGAQ 67
Query: 936 TWSL-----------VHRFDV--STDLS---LSDVCQKVECNARLAVALSVMDECFLPLP 979
L + F++ +TD+ LS E L+ A+++ E F P+
Sbjct: 68 PLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAESRLLLSKAVAIFHESFDPIV 127
Query: 980 DHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGT 1039
D +G +LI ++Y G + +Y G + A+L ++SA +R+ G E+AE+P + T
Sbjct: 128 DATTGRDLIPTMVY--GRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAELPLVAT 185
Query: 1040 RHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRN 1099
++ QG + L + IE L +L V+ ++PA E WT FGF + K N
Sbjct: 186 SREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDELHKYLN 245
Query: 1100 MS-LLVFPGVDMLQKPM 1115
+ VF G L KP+
Sbjct: 246 GARTTVFQGTSTLHKPV 262
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 98.2 bits (243), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 87/148 (58%), Gaps = 3/148 (2%)
Query: 967 ALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRI 1026
++ ++ ECF P+ D SG +LI ++Y + + ++ G + +L+ D+ ++S A IR+
Sbjct: 1324 SIFLLLECFDPIADGVSGGDLIPLMVYR--RSLRDQDFGGIYCVVLKYDNRVVSTALIRV 1381
Query: 1027 HGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGF 1086
G++LAE+P + T ++ QG C+ LL IE L L VE+L +PA W + FGF
Sbjct: 1382 FGRQLAELPLLATNPSHQGQGHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNKFGF 1441
Query: 1087 QPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
+ + E KQ +++++VF G ML+K
Sbjct: 1442 RRMAEGQVKQFHSDLNMMVFTGSFMLEK 1469
Score = 43.9 bits (102), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 722 VQYWNQRKTRVMLQGRIARDGIRCDCCS----EIFTISKFDTHSKSKLCHPFQNLYFESG 777
V+Y R +ML G I G+ CDC S ++ +S F+ H+ S HP ++ E+G
Sbjct: 607 VRYMG-RGGHIMLTGIIQEGGVLCDCSSCKGVQVVNVSAFEKHAGSSARHPSDFIFLENG 665
Query: 778 SSLLQCILDSWN 789
L + WN
Sbjct: 666 KCLKDILEIGWN 677
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 97.8 bits (242), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 82/157 (52%), Gaps = 29/157 (18%)
Query: 959 ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
E +L +A+ ++ ECF+ + + R+ ++ +I++N
Sbjct: 766 EHYGKLCIAVDILHECFVTIIEPRTQSDISEDIVFNR----------------------- 802
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
I G++ AE+P IGTR +YRRQGMCR L+ +E L L VE+L++PA+ EL +
Sbjct: 803 ------EICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQ 856
Query: 1079 TWTSVFGFQPLEVSSKQKMRNMSLLVFPGVDMLQKPM 1115
TWT FGF + S + ++ S+L F G M QK +
Sbjct: 857 TWTCSFGFTVMSNSERLELAGNSILSFQGTTMCQKIL 893
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 97.4 bits (241), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 101/430 (23%), Positives = 178/430 (41%), Gaps = 54/430 (12%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KS 763
TL + G + N + Y + V+ QG GI C+CCS+ F S F+ H+
Sbjct: 108 TLHPLIFKEGGLADNTLLTY-KLKNGEVLKQGYKWGTGIICNCCSQEFAPSHFEEHAGMG 166
Query: 764 KLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDL 823
+ P+ N+Y GS+L + L QD GF ++ + D +L
Sbjct: 167 RRRQPYHNIYTPEGSTLHKLALQ---LQDHLNSNGF----------DNASVSSFSDYPNL 213
Query: 824 ICCDGC---PSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQ 880
GC PST + +K+ G+ SC FCG + + ++ D D +
Sbjct: 214 TSASGCGRQPSTTSGPIVPLKRTLQGRVVETE-SCYFCGYGHTTIGNI-DPD-----MII 266
Query: 881 ICSLCEEKYHQSCSQTDGAVQYEPSSL--------SFCGKKCQEIFERLEKLL------G 926
C+ CE H C + P + C ++CQ + +RLE+ L
Sbjct: 267 FCNQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRFLCCQECQLLRDRLEEGLEKCEEIA 326
Query: 927 VKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGIN 986
+ W L+ D S D+ L + + + + F+ D S I
Sbjct: 327 FLRRIRSNICWRLLSGMDASRDVKLF-----------MPQVIDIFKDAFVESTDEHSDI- 374
Query: 987 LIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQ 1046
+ + N + + +++G + A+L ++SAA +++ +++AE+ I TR R++
Sbjct: 375 --FSDMVNCKNGDQEKDFRGMYCALLTASTHVVSAAILKVRMEQIAELVLIATRRECRKK 432
Query: 1047 GMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQK-MRNMSLLVF 1105
G LL IE+ L + NV L P E+ + W+ GF L K+ + + L++F
Sbjct: 433 GYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSAEEKESVLESHPLVMF 492
Query: 1106 PGVDMLQKPM 1115
+ ++QK +
Sbjct: 493 KNLVLVQKSL 502
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 95.9 bits (237), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/419 (22%), Positives = 173/419 (41%), Gaps = 57/419 (13%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P N + Y + +LQG GI C+CC++ + S+F+ H+ K P+QN+Y
Sbjct: 184 LPDNTLLTY-KLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEKHAGMGKRRQPYQNIYT 242
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGD----GGDLICCDGCP 830
G +L L + S V+ + PN + G CG G ++
Sbjct: 243 SQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPNLTSSG-CGKEPSVSGPIV---PLK 298
Query: 831 STFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYH 890
T + + + SC FCG + ++ + + C+ CE H
Sbjct: 299 RTLQERVVQTE------------SCYFCGYGHTELGKIDP------NMIVFCNQCERPCH 340
Query: 891 QSCSQTDGAVQYEPSSL--------SFCGKKCQEIFERLEKL-----LGVKHDLEGGYTW 937
C + + P + C ++CQ + RLE + + + W
Sbjct: 341 VKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGMEKCEEIAFLGRIRSNICW 400
Query: 938 SLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGS 997
L+ D S D+ L L+ + + + FL D S I+ +++Y
Sbjct: 401 RLLSSADASRDVKL-----------YLSQVIDIFKDAFLESTDAHSDIS---DMVYGKNR 446
Query: 998 NFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIE 1057
++ +++G + +L ++SAA +++ + AE+ I TR R++G R LL IE
Sbjct: 447 EGEK-DFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIE 505
Query: 1058 SALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
+ L + NV L+ P E+ + W+ GF L K+ M + L++F + ++QKP+
Sbjct: 506 TNLRACNVSLLMAPVDPEMAQIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 564
>gi|363722543|gb|AEW30713.1| hypothetical protein, partial [Cenchrus americanus]
Length = 189
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 78/132 (59%), Gaps = 1/132 (0%)
Query: 659 NGMSIPARRHKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPL 718
N ++ R H E ++R CALL R S + +S+ DGFV Y+ K T+ +W+ID + +
Sbjct: 58 NKITKSRRPHSGNERKKRGGCALLARGSNKESDSSTDGFVPYEWKRTIFSWLIDLDILSV 117
Query: 719 NEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-G 777
N K++ ++ + V+L+G I RDGI C CCS++ + +F H+ + P++N+ +
Sbjct: 118 NTKLKCLDESHSEVLLEGIITRDGINCSCCSKVLAVLEFVAHAGGDVKKPYRNIVVDGLD 177
Query: 778 SSLLQCILDSWN 789
L+ C+ ++WN
Sbjct: 178 IDLMHCLTNAWN 189
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 95.1 bits (235), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/419 (22%), Positives = 173/419 (41%), Gaps = 57/419 (13%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYF 774
+P N + Y + +LQG GI C+CC++ + S+F+ H+ K P+QN+Y
Sbjct: 153 LPDNTLLTY-KLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEKHAGMGKRRQPYQNIYT 211
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGD----GGDLICCDGCP 830
G +L L + S V+ + PN + G CG G ++
Sbjct: 212 SQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPNLTSSG-CGKEPSVSGPIV---PLK 267
Query: 831 STFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYH 890
T + + + SC FCG + ++ + + C+ CE H
Sbjct: 268 RTLQERVVQTE------------SCYFCGYGHTELGKIDP------NMIVFCNQCERPCH 309
Query: 891 QSCSQTDGAVQYEPSSL--------SFCGKKCQEIFERLEKL-----LGVKHDLEGGYTW 937
C + + P + C ++CQ + RLE + + + W
Sbjct: 310 VKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGMEKCEEIAFLGRIRSNICW 369
Query: 938 SLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGS 997
L+ D S D+ L L+ + + + FL D S I+ +++Y
Sbjct: 370 RLLSSADASRDVKL-----------YLSQVIDIFKDAFLESTDAHSDIS---DMVYGKNR 415
Query: 998 NFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIE 1057
++ +++G + +L ++SAA +++ + AE+ I TR R++G R LL IE
Sbjct: 416 EGEK-DFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIE 474
Query: 1058 SALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
+ L + NV L+ P E+ + W+ GF L K+ M + L++F + ++QKP+
Sbjct: 475 TNLRACNVSLLMAPVDPEMAQIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 533
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 95.1 bits (235), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 121/267 (45%), Gaps = 29/267 (10%)
Query: 866 CHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAV--QYEPSSLSFCGKKCQEIFERL 921
CH D D S T+ IC CE++YH C + V Q P FC +C I + L
Sbjct: 1515 CHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVRIRDAL 1574
Query: 922 EKLLGVKHDLEGG---YTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPL 978
+ + L G Y W + L + + L+ L+++ E F P+
Sbjct: 1575 GEDVAAGEVLMPGNPAYRWQI-----------LRGKNGRQQTWHALSTVLNILQESFDPI 1623
Query: 979 PDHRSGINLIHNILYNFGSNFKRLNYKGFFTAIL-------ERDDEIISAASIRIHGKEL 1031
D SG +L+ ++ +++G ++ +L E + + AA R+ G +
Sbjct: 1624 IDTGSGSDLLPAMVN--AETAGDYDFQGMYSILLRYRGPDKEARGKPVLAALFRVLGSSM 1681
Query: 1032 AEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEV 1091
AEMP + TR+ RRQG R L+ + L L V +++PA ++ + W + FQ L+
Sbjct: 1682 AEMPLVATRYDCRRQGHLRALVDAMRHKLLGLGVRAIVLPATADAQPAWRQLQ-FQDLDE 1740
Query: 1092 SSKQKMRN-MSLLVFPGVDMLQKPMMK 1117
S + R+ +++FP +L +P+++
Sbjct: 1741 PSTRVARSEHRMVIFPHTTVLARPLIR 1767
>gi|363722577|gb|AEW30730.1| hypothetical protein, partial [Cenchrus americanus]
Length = 189
Score = 94.4 bits (233), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 74/119 (62%), Gaps = 1/119 (0%)
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + +S+ DGFV Y+ K T+ +W+ID + +N K++ ++ +
Sbjct: 71 ERKKRGGCALLARGSNKESDSSTDGFVPYEWKRTIFSWLIDLDILSVNTKLKCLDESHSE 130
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWN 789
V+L+G I RDGI C CCS++ + +F H+ S + P++N+ + L+ C+ ++WN
Sbjct: 131 VLLEGIITRDGINCSCCSKVLAVLEFVAHAGSDVKKPYRNIVVDGLDIDLMHCLTNAWN 189
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 94.0 bits (232), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 135/322 (41%), Gaps = 41/322 (12%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
DT G+GG+ C+GC + D + + SC C H +
Sbjct: 1745 DTAHAVGEGGED--CEGCSDSVTLTGRDEEHVSEALAADLASSCVLC--------HQPEF 1794
Query: 872 DDSALS--TLQICSLCEEKYHQSCSQTDGAV--QYEPSSLSFCGKKCQEIFERLEKLLGV 927
D S T+ IC CE++YH C + V Q P FC +C I E L K L
Sbjct: 1795 DREGFSDQTVLICDQCEKEYHIGCLRKHKMVDMQAVPEGEWFCSDECVRIRELLTKSLEE 1854
Query: 928 KHDLEGG---YTWSLVHRFDVSTDLSLSDVCQKVECNAR-LAVALSVMDECFLPLPDHRS 983
G Y W + D + AR L L ++ E F P+ D+ S
Sbjct: 1855 GETTMSGNPAYRWQFIRGRD------------GTKATARALKTVLEILQESFDPIIDNGS 1902
Query: 984 GINLIHNILYNFGSNFKRLNYKGFFTAIL-------ERDDEIISAASIRIHGKELAEMPF 1036
G +L+ +++ + +++G ++ +L E + A +R+ G +AE+P
Sbjct: 1903 GEDLLPRMVH--AESAGDYDFQGMYSILLRYRGADKEARGRPVLAGLVRVLGSSMAEVPL 1960
Query: 1037 IGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQK 1096
+ TR+ RRQG R L+ G+ L +L V +++PA ++ W F L+ S +
Sbjct: 1961 VATRYDCRRQGHLRALVEGLRHRLIALGVRAMVLPATADALPAWRQ-LAFMDLDEGSVRV 2019
Query: 1097 MRN-MSLLVFPGVDMLQKPMMK 1117
R +++FP ++ + +++
Sbjct: 2020 ARGEHRMIIFPHTSVVVRQLIR 2041
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 94.0 bits (232), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 84/172 (48%), Gaps = 18/172 (10%)
Query: 684 RNSVEGEESNGDGFVAY-DGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDG 742
+N+ G ++ G + D LA+M +G +P V Y+ K +L G I G
Sbjct: 337 KNAFPGVKTTSTGRITRKDKGLHKLAFM--SGVLPEGTDVGYYVGGKR--LLDGYIKEFG 392
Query: 743 IRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHF 801
I C CC+ + + S+F+ H+ ++ P+ N+Y +G SL + + ++ S R+
Sbjct: 393 IYCHCCNTVVSPSQFEAHAGRAARRKPYHNIYMSNGVSLHELSVSLSKGRNMSNRQ---- 448
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
+DD C IC DGG+L+ CD CP FH+ C+ P G W C YC
Sbjct: 449 --------SDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYC 492
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 22/151 (14%)
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
L+ A+++ E F P+ +G +LI ++Y + ++SA
Sbjct: 664 LSKAVAIFHESFDPIIQIATGRDLIPAMVYG---------------------NTVVSAGL 702
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSV 1083
R+ G E+AE+P + T + G + L IE L SL V+ ++PA E WT
Sbjct: 703 FRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQR 762
Query: 1084 FGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK 1113
FGF + + ++ ++ VF G L K
Sbjct: 763 FGFVKITQDELREYLKGGRTTVFQGTSTLHK 793
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 56/87 (64%)
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
N+ GF A+L + +++AA++R+ G+ AE+PF+ TR YRR G CRRL+ +E L S
Sbjct: 266 NFSGFHLAVLRQRGAVVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLS 325
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPL 1089
V +L++P+I L W + FGF PL
Sbjct: 326 AGVGQLVMPSIKPLLPMWAAKFGFTPL 352
>gi|363722513|gb|AEW30698.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722515|gb|AEW30699.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722519|gb|AEW30701.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722531|gb|AEW30707.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722535|gb|AEW30709.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722557|gb|AEW30720.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722571|gb|AEW30727.1| hypothetical protein, partial [Cenchrus americanus]
Length = 189
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 73/119 (61%), Gaps = 1/119 (0%)
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + +S+ DGFV Y+ K T+ +W+ID + +N K++ ++ +
Sbjct: 71 ERKKRGGCALLARGSNKESDSSTDGFVPYEWKRTIFSWLIDLDILSVNTKLKCLDESHSE 130
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWN 789
V+L+G I RDGI C CCS++ + +F H+ + P++N+ + L+ C+ ++WN
Sbjct: 131 VLLEGIITRDGINCSCCSKVLAVLEFVAHAGGDVKKPYRNIVVDGLDIDLMHCLTNAWN 189
>gi|363722501|gb|AEW30692.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722505|gb|AEW30694.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722507|gb|AEW30695.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722509|gb|AEW30696.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722511|gb|AEW30697.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722517|gb|AEW30700.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722521|gb|AEW30702.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722523|gb|AEW30703.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722525|gb|AEW30704.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722527|gb|AEW30705.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722529|gb|AEW30706.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722533|gb|AEW30708.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722537|gb|AEW30710.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722539|gb|AEW30711.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722541|gb|AEW30712.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722545|gb|AEW30714.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722547|gb|AEW30715.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722549|gb|AEW30716.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722551|gb|AEW30717.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722553|gb|AEW30718.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722555|gb|AEW30719.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722559|gb|AEW30721.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722563|gb|AEW30723.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722565|gb|AEW30724.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722567|gb|AEW30725.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722569|gb|AEW30726.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722573|gb|AEW30728.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722575|gb|AEW30729.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722579|gb|AEW30731.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722581|gb|AEW30732.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722583|gb|AEW30733.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722585|gb|AEW30734.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722587|gb|AEW30735.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722589|gb|AEW30736.1| hypothetical protein, partial [Cenchrus americanus]
gi|363722591|gb|AEW30737.1| hypothetical protein, partial [Cenchrus americanus]
Length = 189
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 73/119 (61%), Gaps = 1/119 (0%)
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + +S+ DGFV Y+ K T+ +W+ID + +N K++ ++ +
Sbjct: 71 ERKKRGGCALLARGSNKESDSSTDGFVPYEWKRTIFSWLIDLDILSVNTKLKCLDESHSE 130
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWN 789
V+L+G I RDGI C CCS++ + +F H+ + P++N+ + L+ C+ ++WN
Sbjct: 131 VLLEGIITRDGINCSCCSKVLAVLEFVAHAGGDVKKPYRNIVVDGLDIDLMHCLTNAWN 189
>gi|363722561|gb|AEW30722.1| hypothetical protein, partial [Cenchrus americanus]
Length = 189
Score = 92.8 bits (229), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 73/119 (61%), Gaps = 1/119 (0%)
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + +S+ DGFV Y+ K T+ +W+ID + +N K++ ++ +
Sbjct: 71 ERKKRGGCALLARGSNKESDSSTDGFVPYEWKRTIFSWLIDLDILSVNTKLKCLDESHSE 130
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWN 789
V+L+G I RDGI C CCS++ + +F H+ + P++N+ + L+ C+ ++WN
Sbjct: 131 VLLEGIITRDGINCSCCSKVLAVLEFVAHAGGDVKKPYRNIVVDGLDIDLMHCLTNAWN 189
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 91.7 bits (226), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 94/391 (24%), Positives = 167/391 (42%), Gaps = 52/391 (13%)
Query: 743 IRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHF 801
I C+CC++ F+ S F+ H+ + P+ N+Y G SL + L QD GF
Sbjct: 430 IICNCCNQEFSPSHFEEHAGMGRRRQPYHNIYTLEGLSLHKLAL---QLQDHLNPNGFDN 486
Query: 802 VNFDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC-SCQFCG 859
+ D ++ T CG PST + +K+ + + V SC FCG
Sbjct: 487 ASVSSVSDYHNLTSSGCGRE---------PSTTSGPIVPLKR--TLQERVVETESCYFCG 535
Query: 860 RINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSL--------SFCG 911
+ + ++N T+ C+ CE H C + P + C
Sbjct: 536 YGHTTIGNINPD------TIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCC 589
Query: 912 KKCQEIFERLE----KLLGVKH--DLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLA 965
++CQ + RLE K +G+ + W L+ D S D+ L +
Sbjct: 590 QECQSLRARLEEGLEKCVGITFLRRIRSNICWRLLSGMDASRDVKL-----------YMP 638
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
+ + + F+ D S I I +++ N + + +++G + A+L ++SAA ++
Sbjct: 639 QVIDIFKDAFMDSTDEHSDI--ISDMV-NGKNGDQEKDFRGMYCALLTASTHVVSAAILK 695
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
+ +++AE+ I TR R++G LL IE+ L + NV L P E+ + W+ G
Sbjct: 696 VRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLG 755
Query: 1086 FQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
F L K+ M + L++F + ++QK +
Sbjct: 756 FTILSAEEKESMLESHPLVMFKNLVLVQKSL 786
>gi|363722503|gb|AEW30693.1| hypothetical protein, partial [Cenchrus americanus]
Length = 189
Score = 91.3 bits (225), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 72/119 (60%), Gaps = 1/119 (0%)
Query: 672 ETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTR 731
E ++R CALL R S + +S+ D FV Y+ K T+ +W+ID + +N K++ ++ +
Sbjct: 71 ERKKRGGCALLARGSNKESDSSTDAFVPYECKRTIFSWLIDLDILSVNTKLKCLDESHSE 130
Query: 732 VMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPFQNLYFES-GSSLLQCILDSWN 789
V+L+G I RDGI C CCS++ + +F H+ + P++N+ + L+ C+ ++WN
Sbjct: 131 VLLEGIITRDGINCSCCSKVLAVLEFVAHAGGDVKKPYRNIVVDGLDIDLMHCLTNAWN 189
>gi|255646689|gb|ACU23818.1| unknown [Glycine max]
Length = 296
Score = 90.5 bits (223), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 54/82 (65%)
Query: 1034 MPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSS 1093
MP I T YRRQGMCR L++ IE L S VEKL++ AI +L ETWT FGF ++
Sbjct: 1 MPLIATCSQYRRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDIE 60
Query: 1094 KQKMRNMSLLVFPGVDMLQKPM 1115
+Q++ ++L+VFPG +L K +
Sbjct: 61 RQRLNKINLMVFPGTVLLVKSL 82
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 90.5 bits (223), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/391 (24%), Positives = 167/391 (42%), Gaps = 52/391 (13%)
Query: 743 IRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHF 801
I C+CC++ F+ S F+ H+ + P+ N+Y G SL + L QD GF
Sbjct: 395 IICNCCNQEFSPSHFEEHAGMGRRRQPYHNIYTLEGLSLHKLAL---QLQDHLNPNGFDN 451
Query: 802 VNFDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC-SCQFCG 859
+ D ++ T CG PST + +K+ + + V SC FCG
Sbjct: 452 ASVSSVSDYHNLTSSGCGRE---------PSTTSGPIVPLKR--TLQERVVETESCYFCG 500
Query: 860 RINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSL--------SFCG 911
+ + ++N T+ C+ CE H C + P + C
Sbjct: 501 YGHTTIGNINPD------TIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCC 554
Query: 912 KKCQEIFERLE----KLLGVKH--DLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLA 965
++CQ + RLE K +G+ + W L+ D S D+ L +
Sbjct: 555 QECQSLRARLEEGLEKCVGITFLRRIRSNICWRLLSGMDASRDVKL-----------YMP 603
Query: 966 VALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
+ + + F+ D S I I +++ N + + +++G + A+L ++SAA ++
Sbjct: 604 QVIDIFKDAFMDSTDEHSDI--ISDMV-NGKNGDQEKDFRGMYCALLTASTHVVSAAILK 660
Query: 1026 IHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFG 1085
+ +++AE+ I TR R++G LL IE+ L + NV L P E+ + W+ G
Sbjct: 661 VRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLG 720
Query: 1086 FQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
F L K+ M + L++F + ++QK +
Sbjct: 721 FTILSAEEKESMLESHPLVMFKNLVLVQKSL 751
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 89.0 bits (219), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 134/309 (43%), Gaps = 33/309 (10%)
Query: 833 FHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQS 892
++C+ ++ P G+W C + C I+E + LS I L E + QS
Sbjct: 958 MQRHCITSQELPEGEWFCQ----KDCKVIDEILTQLVANGPELLSNSIISELLESRQQQS 1013
Query: 893 CSQTDGAVQYEPSSLSF-----CGK-----KCQEIFERLEKLLGVKHDLEGGYTWSLVHR 942
+ V+ E S+ F CG+ Q + E + D+ Y W +V
Sbjct: 1014 SVK----VKLESSNPRFGWQILCGEGGSSANVQTLAEAANIFTSID-DINLPYLWLVVGN 1068
Query: 943 FDVSTDLSLSDVCQ---KVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNF 999
+ ST +S C S EC I+ ++ L
Sbjct: 1069 YSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSEC-------HEDIHSLNVALVVCSRRA 1121
Query: 1000 KRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESA 1059
K +++G F +L+ +++++SAA ++I G+E+AE+P I T ++ QG C+ L+T IE
Sbjct: 1122 KDHDFEGVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCKALMTTIERL 1181
Query: 1060 LCSLNVEKLIIPAISELRETWTSVFGFQPL-EVSSKQKMRNMSLLVFPGVDMLQK---PM 1115
L L+VE+L++P W + FGF + E K+ + L+ F G ML K PM
Sbjct: 1182 LGVLSVERLVLPTAKNTESIWINKFGFSRVPEDQLKRICTTIRLMTFTGTRMLGKAITPM 1241
Query: 1116 MKNQFPREN 1124
N+ R++
Sbjct: 1242 TLNRIQRQS 1250
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 59/134 (44%), Gaps = 12/134 (8%)
Query: 732 VMLQGRIARDGIRCDC--C--SEIFTISKFDTHSKSKLCHPFQNLYFESGSSLLQCILDS 787
+ L G + G++C+C C S I +IS F+ HS S HP N+Y E+G +L +
Sbjct: 543 MQLTGMLKDMGVQCNCRNCKGSMIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDVLSAG 602
Query: 788 WNKQD--ESKRKGFHFV--NFDGEDPNDDTCGICG--DGGDLICCDG--CPSTFHQNCLD 839
D ++ + + G + + TC CG + GDLI C G C H C+
Sbjct: 603 QEAADCGDNILRALKMAIGDIQGVEKSKVTCAECGGSEEGDLIYCKGARCSVVSHSRCVG 662
Query: 840 IKKFPSGKWHCVYC 853
G W C C
Sbjct: 663 SANPQLGDWFCGKC 676
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 81/157 (51%), Gaps = 4/157 (2%)
Query: 959 ECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEI 1018
E L A+S+ ECF P+ D +G +LI ++ G N + ++ G + A+L + +
Sbjct: 30 ETKPLLLEAVSMFHECFDPIVDPATGRDLIPAMVN--GRNLRTQDFGGMYCALLMVNSSV 87
Query: 1019 ISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRE 1078
SAA +RI G ++AE+P I TR+ R +G + L + IE L L+V+ L++PA E
Sbjct: 88 ASAAMLRIFGGDIAELPLIATRNKNRGKGYFQTLFSCIERLLSFLSVKNLVLPAAEEAES 147
Query: 1079 TWTSVFGFQPLEVSSKQKMRN--MSLLVFPGVDMLQK 1113
W FGF +E R ++ F G ML K
Sbjct: 148 IWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGTVMLHK 184
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 84.0 bits (206), Expect = 7e-13, Method: Composition-based stats.
Identities = 46/132 (34%), Positives = 69/132 (52%), Gaps = 6/132 (4%)
Query: 1002 LNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALC 1061
L + G + AIL +++SAA+ R+ GKE+AE+P + TR + QG + L T IE LC
Sbjct: 16 LEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLC 75
Query: 1062 SLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPMMKNQF 1120
L V L++PA W + F F +E + R+ ++ F G MLQKP+
Sbjct: 76 FLQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPV----- 130
Query: 1121 PRENMISAKGLR 1132
P IS + L+
Sbjct: 131 PEYRRISQEKLK 142
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 82.8 bits (203), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/256 (25%), Positives = 113/256 (44%), Gaps = 18/256 (7%)
Query: 878 TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFERLEKLLGVKHDLEGGY 935
T+ +C CE++YH C Q+ V+ + P FC C E L+K++ L
Sbjct: 73 TVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSETRSSLDKIISDGAQLLAER 132
Query: 936 TWSLVHR--------FDVSTDLSLSDVCQKV---ECNARLAVALSVMDECFLPLPDHRSG 984
++ + D S DL + K E + L+ A+ + + F P+ + +G
Sbjct: 133 DLEIIRKKHETRGLCMDTSKDLKWQLLSGKRATEEGSILLSAAVPIFHQSFDPIREALTG 192
Query: 985 INLIHNILYNFG--SNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHM 1042
+LI ++ G +Y G + A+L ++SAA +R+ G ++AE+P + T
Sbjct: 193 RDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALMRVMGGDVAELPLVATSQD 252
Query: 1043 YRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNMS- 1101
+ G + L + IE L SL ++ ++PA E W FGF + + N +
Sbjct: 253 VQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMKKFGFSRIPPEELEACLNGAH 312
Query: 1102 LLVFPGVDMLQK--PM 1115
L +F G L K PM
Sbjct: 313 LTIFHGTSYLYKAVPM 328
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 81.6 bits (200), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 119/270 (44%), Gaps = 18/270 (6%)
Query: 864 STCHVNDQDDSALST--LQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFE 919
+ C ++D ST + +C C +YH C + P +C C I +
Sbjct: 71 ALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCSTDCVRINQ 130
Query: 920 RLEKLLG------VKHDLEGGYTWSLVHRF---DVSTDLSLSDVCQKVECNARLAV--AL 968
L+ LL + DL+ V F D D+ + K +++L + A+
Sbjct: 131 TLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDKSSDDSKLVLSKAV 190
Query: 969 SVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHG 1028
++ E F P+ +G +LI ++Y G + + +Y G + +L ++ ++SA RI G
Sbjct: 191 AIFHETFDPIIQVSTGRDLIPAMVY--GRSARDQDYTGMYCTVLTVNNIVVSAGLFRIMG 248
Query: 1029 KELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQP 1088
E+AE+P + T + G + L + IE L SL V+ ++PA E WT FGF
Sbjct: 249 SEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKHFVLPAAEEAESIWTERFGFAK 308
Query: 1089 L-EVSSKQKMRNMSLLVFPGVDMLQKPMMK 1117
+ + ++ ++ VF G L K + K
Sbjct: 309 ISQDELREYLKGGRTTVFQGTSNLHKLVAK 338
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 80.1 bits (196), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 120/267 (44%), Gaps = 31/267 (11%)
Query: 866 CHVNDQDDSAL--STLQICSLCEEKYHQSCSQT-DGAVQYEPSSL--------SFCGKKC 914
C D + L +T+ C+ CE H C + D V+ P + C ++C
Sbjct: 78 CRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEILKEYMCFRFLCCEEC 137
Query: 915 QEIFERLEKL-----LGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALS 969
Q + RLE + + + W L+ + D S D+ L ++ A+
Sbjct: 138 QSLRARLEGVEKGEEIAFLRQIRSNICWRLLSKADASRDVKLY-----------MSQAID 186
Query: 970 VMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGK 1029
+ + F+ D S I +++Y ++ +++G + +L ++SAA +++ +
Sbjct: 187 IFKDAFVESTDAHSDI--FSDMVYGKNGAGEK-DFRGMYCVVLTASTHVVSAAILKVRVE 243
Query: 1030 ELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPL 1089
+ AE+ I TR R++G R LL IE+ L + NV L+ P E+ + W+ GF L
Sbjct: 244 QFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSLLMAPVDPEMAQIWSEKLGFTIL 303
Query: 1090 EVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
K+ M + L++F + ++QKP+
Sbjct: 304 SAEEKKSMLESHPLVMFQDLVLVQKPL 330
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/254 (25%), Positives = 105/254 (41%), Gaps = 58/254 (22%)
Query: 716 VPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSK-SKLCHPFQNLYF 774
+P +V Y+ R R +L G I C CC+ + S+F+ HS + P+ ++Y
Sbjct: 470 LPEGSEVSYY-VRGER-LLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYT 527
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+G SL + L ++ S + ND+ C IC DGG L+CCDGCP FH
Sbjct: 528 SNGVSLHELSLSLLRGREPSI------------NTNDEICSICLDGGTLLCCDGCPRVFH 575
Query: 835 QNCLDIKKFPSGKWHCVYCSCQF------------------------------CGRINE- 863
+ C+ ++ P GKW C +C C RI +
Sbjct: 576 KECVSLENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKN 635
Query: 864 --------STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGA--VQYEPSSLSFCG 911
+ C ++ S T+ IC CE+++H C + ++ P FC
Sbjct: 636 QTDEAGGCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCC 695
Query: 912 KKCQEIFERLEKLL 925
+ C++I L K++
Sbjct: 696 RDCKDINSSLRKIV 709
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 122/280 (43%), Gaps = 36/280 (12%)
Query: 864 STCHVNDQDDSALS--TLQICSLCEEKYHQSCSQTDGAVQYE--PSSLSFCGKKCQEIFE 919
S C D +++ T+ +C CE++YH C + V+ + P FC C E
Sbjct: 11 SLCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRS 70
Query: 920 RLEKL------LGVKHDLEGGYTWSLVHR--------FDVSTDLSLSDVCQKVEC---NA 962
L+K+ L V+ DLE ++ + D S DL + K +
Sbjct: 71 SLDKIISDGAQLLVEPDLE------IIKKKHVTRGLCMDTSKDLKWQLLSGKRTTEDGSI 124
Query: 963 RLAVALSVMDECFLPLPDHRSGINLIHNILYNFG--SNFKRLNYKGFFTAILERDDEIIS 1020
L+ A+ + + F P+ + +G +LI ++ G +Y G + A+L ++S
Sbjct: 125 LLSAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVS 184
Query: 1021 AASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETW 1080
AA +R+ G ++AE+P + T + G + L + IE L SL ++ ++PA E W
Sbjct: 185 AALLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIW 244
Query: 1081 TSVFGFQ---PLEVSSKQKMRNMSLLVFPGVDMLQK--PM 1115
FGF P E+ + + L +F G L K PM
Sbjct: 245 MKKFGFSRTTPEELEA--YLNGAHLTIFHGTSYLYKAVPM 282
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 78.2 bits (191), Expect = 3e-11, Method: Composition-based stats.
Identities = 41/122 (33%), Positives = 68/122 (55%), Gaps = 4/122 (3%)
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
+G N + + G + A+L + ++SAA +RI G ++AE+P + T + +G + L +
Sbjct: 3 YGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTLFS 62
Query: 1055 GIESALCSLNVEKLIIPAISELRETWTSVFGF---QPLEVSSKQKMRNMSLLVFPGVDML 1111
IE L +NV+ L++PA E WT FGF +P E+S +K N ++ F G +ML
Sbjct: 63 CIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFN-QMVTFKGTNML 121
Query: 1112 QK 1113
K
Sbjct: 122 HK 123
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 69/157 (43%), Gaps = 8/157 (5%)
Query: 710 MIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKFDTHSKSKLCHPF 769
+I G V E ++Y + ++M GR GI S++ + F+ + SK P
Sbjct: 12 LIKQGFVADGELLRY-KSKHGQLMAVGRAREGGIEVAGTSKLLGYTAFEDIAGSKYHRPA 70
Query: 770 QNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGE-------DPNDDTCGICGDGGD 822
++ + +G +L L + K E D NDD C ICG GGD
Sbjct: 71 EHTHTTTGRTLQSLALLAGGGAGGGAGKRGGKAAPPPEPVVELLEDDNDDLCHICGLGGD 130
Query: 823 LICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
L+CC+ CP FH CL + P G +HC C C CG
Sbjct: 131 LMCCETCPGVFHAACLGLAAPPEGDYHCPLCRCSVCG 167
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 60/120 (50%), Gaps = 2/120 (1%)
Query: 956 QKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERD 1015
Q+ E L+ AL V+ C+ PLPD R+G +++ +L + +Y G TA+L
Sbjct: 417 QRQELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAG 476
Query: 1016 DEIISAASIRIHGKELAEMPFIGTR-HMYRRQGMCRRLLTGIESALCSLNVEKLIIPAIS 1074
++ A R G +LAE+P + R + RR G+ R LL +E L + + PA +
Sbjct: 477 PAAVAVAVFRSFG-DLAEVPVLAVRPELQRRNGLGRLLLAAVEQLLLLAGAKAIFTPAFT 535
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 121/277 (43%), Gaps = 35/277 (12%)
Query: 854 SCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSL------ 907
SC FCG + + ++N T+ C+ CE H C + P +
Sbjct: 81 SCYFCGYGHTTIGNINPD------TIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMC 134
Query: 908 --SFCGKKCQEIFERLE----KLLGVKH--DLEGGYTWSLVHRFDVSTDLSLSDVCQKVE 959
C ++CQ + RLE K +G+ + W L+ D S D+ L
Sbjct: 135 FHFLCCQECQSLRARLEEGLEKCVGITFLRRIRSNICWRLLSGMDASRDVKLY------- 187
Query: 960 CNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEII 1019
+ + + + F+ D S I I +++ N + + +++G + A+L ++
Sbjct: 188 ----MPQVIDIFKDAFMDSTDEHSDI--ISDMV-NGKNGDQEKDFRGMYCALLTASTHVV 240
Query: 1020 SAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRET 1079
SAA +++ +++AE+ I TR R++G LL IE+ L + NV L P E+ +
Sbjct: 241 SAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQI 300
Query: 1080 WTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
W+ GF L K+ M + L++F + ++QK +
Sbjct: 301 WSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 337
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 74.7 bits (182), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 56/115 (48%), Gaps = 19/115 (16%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
ED D C +CG GG+L+CCDGCP FH NC+ + + P +W C C+ Q CG C
Sbjct: 1235 EDQWDVDCSVCGLGGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNECNLQTCG-----AC 1289
Query: 867 HVNDQDDSALSTLQICSL------CEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQ 915
N L + IC C+ +H C++ D P+ +C KKC+
Sbjct: 1290 KKN---KIRLDSHVICGSEDGTKGCDRVFHLKCAKLDAV----PADDWYC-KKCR 1336
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 31/46 (67%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
D ++ C IC +GG+L+CCDGCP FH +C+ +++ P GK C C
Sbjct: 1111 DEWEEDCYICTEGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHEC 1156
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS 854
D N C IC DGG L+CCD CP FH +CL + P +W+C C+
Sbjct: 155 DHNRWYCNICKDGGQLLCCDRCPRAFHMSCLGMSVDMIPDSEWYCKMCT 203
>gi|296088520|emb|CBI37511.3| unnamed protein product [Vitis vinifera]
Length = 156
Score = 74.3 bits (181), Expect = 6e-10, Method: Composition-based stats.
Identities = 39/109 (35%), Positives = 68/109 (62%), Gaps = 8/109 (7%)
Query: 920 RLEKLLGVKHDLE-GGYTWSLVH--RFDVSTDLSLSDVCQKVECNARLAVALSVMDECFL 976
R+ +LLG+++ G TW L+ R D + L +++ + L+ AL++M ECF
Sbjct: 53 RIHQLLGIRNPTSVDGLTWKLLRSTRNDCNVYNEL-----RIDTCSELSQALNLMHECFE 107
Query: 977 PLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
P+ +H + +LI +I+YN S FKRL+++GF+ L++DDE + AA++R
Sbjct: 108 PIIEHHTKRDLIVDIVYNSVSKFKRLDFRGFYIMALQKDDEFVCAATVR 156
>gi|242055865|ref|XP_002457078.1| hypothetical protein SORBIDRAFT_03g000900 [Sorghum bicolor]
gi|241929053|gb|EES02198.1| hypothetical protein SORBIDRAFT_03g000900 [Sorghum bicolor]
Length = 103
Score = 73.9 bits (180), Expect = 6e-10, Method: Composition-based stats.
Identities = 37/83 (44%), Positives = 56/83 (67%), Gaps = 2/83 (2%)
Query: 964 LAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAAS 1023
L VAL V++ECF + + + I+++H+ +Y G FK L YKGF+T +LE+ EIIS A
Sbjct: 3 LVVALDVLNECFNLMKEWHTKIDMLHHDVYILGFGFKWLIYKGFYTMVLEKYGEIISVAL 62
Query: 1024 IRIHGKELAEMPFIGTRHMYRRQ 1046
+RIHG ++ + FIGT Y++Q
Sbjct: 63 LRIHGTKVVD--FIGTLPAYQKQ 83
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 73.9 bits (180), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 63/111 (56%), Gaps = 2/111 (1%)
Query: 1011 ILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLII 1070
I++ + +++A +R+ G+E+AE+P + TR R +G + L + IE L SLNV +++
Sbjct: 80 IIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIVV 139
Query: 1071 PAISELRETWTSVFGFQPL--EVSSKQKMRNMSLLVFPGVDMLQKPMMKNQ 1119
PA E W + FGF+ L E SK ++ F G MLQKP+ +Q
Sbjct: 140 PAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQ 190
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 73.6 bits (179), Expect = 1e-09, Method: Composition-based stats.
Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 13/109 (11%)
Query: 743 IRCDCCSEIFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHF 801
I C CC + S+F++H+ S P++++Y +G SL + N Q+ + G
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIG--- 60
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
DD C GDGGDL+ C CP FH CLD++ P G WHC
Sbjct: 61 ---------DDMCAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHC 100
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 23/129 (17%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSC 855
H N G +D D C +CG GG+L+CCDGCP FH C+ ++K P +W C C+
Sbjct: 1177 HVENHTGPKNPDDQWDVDCSVCGLGGELLCCDGCPRAFHVTCIGLEKIPETEWFCNECNL 1236
Query: 856 QFCGRINESTCHVNDQDDSALSTLQICSL------CEEKYHQSCSQTDGAVQYEPSSLSF 909
Q CG ++ ++ IC C+ +H C++ D P+ +
Sbjct: 1237 QTCGACKKNRIRLDSH--------VICGSEDGTKGCDRVFHLKCAKLDAV----PADDWY 1284
Query: 910 CGKKCQEIF 918
C KKC+
Sbjct: 1285 C-KKCRATL 1292
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 35/66 (53%), Gaps = 10/66 (15%)
Query: 799 FHFVNFDGEDPNDD--------TCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKW 848
H + DG D DD C IC DGG+L+CCD CP FH NCL + + P +W
Sbjct: 138 IHSADEDGNDTGDDGWADHNRWYCNICKDGGELLCCDRCPRAFHMNCLGMSEDMIPDSEW 197
Query: 849 HCVYCS 854
+C CS
Sbjct: 198 YCKMCS 203
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 31/46 (67%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
D ++ C IC +GG+L+CCDGCP FH +C+ +++ P GK C C
Sbjct: 1056 DEWEEDCYICSEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 17/103 (16%)
Query: 768 PFQNLYFESGSSLLQCILDSWNKQDE-----SKRKGFH---FVNFDGEDP---------N 810
P+ +L E+ ++L+ +LD E ++R + + G P N
Sbjct: 993 PYHHLSLEAKLTMLEFLLDDLLTVPEIAGEMTRRHQITDSIELPYGGPPPLSHEFAEIVN 1052
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
DD C +CG GDL+CCDGCP +FH+ C+ + + P GKW C C
Sbjct: 1053 DDECTVCGIEGDLLCCDGCPGSFHRQCIGVARLPEGKWLCPEC 1095
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 72.0 bits (175), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 41/70 (58%), Gaps = 8/70 (11%)
Query: 802 VNFDGEDP-----NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS 854
V +DP N+D C +CGDGG+LICCDGCP FH CL + + PSG W C C
Sbjct: 251 VPVHSQDPAPYQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCV 310
Query: 855 CQFCGRINES 864
+ GR+ E+
Sbjct: 311 AEL-GRLREA 319
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 72.0 bits (175), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 13/100 (13%)
Query: 755 SKFDTHS-KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDT 813
S+F+ H+ ++ P+ N+Y +G SL + + ++ S R+ +DD
Sbjct: 384 SQFEAHAGRAARRKPYHNIYMSNGVSLHELSVSLSKGRNMSNRQ------------SDDL 431
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C IC DGG+L+ CD CP FH+ C+ P G W C YC
Sbjct: 432 CSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYC 471
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRINES 864
N+D C +CGDGG+LICCDGCP FH CL + + PSG W C C + GR+ E+
Sbjct: 237 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL-GRLREA 292
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
DG NDD C C DGG+LICCDGCP FH CLD + PSG W C +C
Sbjct: 246 DGAHVNDDECAACKDGGELICCDGCPQAFHLTCLDPPLTSIPSGPWQCDWC 296
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 40/69 (57%), Gaps = 10/69 (14%)
Query: 787 SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFP 844
+++ Q E+ + H+ NDD C +C DGG+LICCDGCP FH CLD + P
Sbjct: 244 TFHHQGETATRMIHY--------NDDECTVCKDGGELICCDGCPRAFHLACLDPPLSSIP 295
Query: 845 SGKWHCVYC 853
SG W C +C
Sbjct: 296 SGSWQCEWC 304
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 70.9 bits (172), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF 857
D C IC DGGDL+CCD CP FH C+ + P G W+C YC F
Sbjct: 668 DLCSICADGGDLLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMF 713
>gi|146165350|ref|XP_001014826.2| SNF2 family N-terminal domain containing protein [Tetrahymena
thermophila]
gi|146145551|gb|EAR94494.2| SNF2 family N-terminal domain containing protein [Tetrahymena
thermophila SB210]
Length = 1612
Score = 70.9 bits (172), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 43/91 (47%), Gaps = 5/91 (5%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
N D N+ C IC D G LICC+ C TFH C+ IKK P+G W C YC + N
Sbjct: 288 NLSETDANESWCFICRDQGKLICCENCSKTFHLTCVGIKKPPTGAWECPYCREE-----N 342
Query: 863 ESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+ C ++ + CSLC H C
Sbjct: 343 KDICCACEKSTNEAEIKVTCSLCYRLMHFEC 373
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 70.5 bits (171), Expect = 7e-09, Method: Composition-based stats.
Identities = 39/122 (31%), Positives = 67/122 (54%), Gaps = 4/122 (3%)
Query: 995 FGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLT 1054
+G + + G + AIL + ++SAA +R+ G+++AE+P + T + +G + L +
Sbjct: 3 YGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTLFS 62
Query: 1055 GIESALCSLNVEKLIIPAISELRETWTSVFGFQ---PLEVSSKQKMRNMSLLVFPGVDML 1111
IE L L V+ L++PA E WT FGF+ P ++SS ++ ++ F G ML
Sbjct: 63 CIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRR-SCCQMVTFKGTSML 121
Query: 1112 QK 1113
QK
Sbjct: 122 QK 123
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 70.5 bits (171), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 40/58 (68%), Gaps = 3/58 (5%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ-FCGRI 861
ED + + C +C DGG+L+CCD CPS +H +CL+ +K P G+WHC CSC+ GR+
Sbjct: 379 EDDHMEFCRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPRCSCEPLKGRV 436
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 53/137 (38%), Gaps = 35/137 (25%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS---------- 854
E + D C +C GG++I CD CP +H C D +++ P GKW C +C
Sbjct: 321 ETEHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWSCPHCEGEGIKEQEED 380
Query: 855 -----CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSF 909
C+ C E C C C YH C + ++ P
Sbjct: 381 DHMEFCRVCKDGGELLC---------------CDTCPSAYHVHC--LNPPMKMIPDGEWH 423
Query: 910 CGK-KCQEIFERLEKLL 925
C + C+ + R+ K+L
Sbjct: 424 CPRCSCEPLKGRVAKIL 440
>gi|118344202|ref|NP_001071923.1| zinc finger protein [Ciona intestinalis]
gi|92081546|dbj|BAE93320.1| zinc finger protein [Ciona intestinalis]
Length = 1205
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 10/71 (14%)
Query: 785 LDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKK 842
WN E+ + + DPN+D CG+C +GGDL+CCD CP FH +C +I
Sbjct: 916 FSEWNNFTENNMRKY--------DPNEDWCGVCNNGGDLLCCDSCPKVFHVHCHVPEILA 967
Query: 843 FPSGKWHCVYC 853
P G W C+ C
Sbjct: 968 APPGIWQCMLC 978
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
NDD C +C DGG+LICCDGCP FH CL + + PSG W CV C
Sbjct: 281 NDDECAVCRDGGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRC 326
>gi|395535724|ref|XP_003769871.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Sarcophilus
harrisii]
Length = 962
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 718 EDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 766
>gi|395536811|ref|XP_003770405.1| PREDICTED: autoimmune regulator-like [Sarcophilus harrisii]
Length = 529
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 14/140 (10%)
Query: 750 EIFTISKFDTHS---KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDG 806
E +T KF+ S K++ P G + LQ S ++ H
Sbjct: 195 EFYTPGKFEEPSGKNKTRSPKPPTRTKATQGRAELQLSQQSGGPATPAQPPELHL----- 249
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINES 864
NDD C +C DGG+LICCDGCP FH CL+ + + PSG W C C G++++
Sbjct: 250 HQKNDDECAVCRDGGELICCDGCPRAFHLACLEPPLTEIPSGMWR---CGCCIVGKVHQD 306
Query: 865 TCHVNDQDDSALSTLQICSL 884
H DQ D TL L
Sbjct: 307 GRH-GDQRDGPSETLAPLGL 325
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 703 EDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 751
>gi|405960336|gb|EKC26267.1| E3 ubiquitin-protein ligase TRIM33 [Crassostrea gigas]
Length = 899
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 35/50 (70%), Gaps = 2/50 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCS 854
+DPN+D C +C +GGDL+CCD CP +H C ++K+FPS +W C C+
Sbjct: 669 DDPNEDYCAVCQNGGDLLCCDKCPKVYHLKCHIPELKEFPSDEWQCTMCT 718
>gi|345322044|ref|XP_001508010.2| PREDICTED: transcription intermediary factor 1-alpha
[Ornithorhynchus anatinus]
Length = 800
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 572 EDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFC 620
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 35/57 (61%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRI 861
DGE+ ++ C +CG GG ++ CD CP FH CL +K+ P GKW C+ C F ++
Sbjct: 815 DGENIWEEQCKVCGQGGKVLLCDTCPRVFHPRCLKLKEIPKGKWSCMICLSYFSRQV 871
>gi|145533979|ref|XP_001452734.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420433|emb|CAK85337.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 30/47 (63%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF 857
+D C CG GG +ICCD CP FH CL +K+ P GKW+C+ C F
Sbjct: 844 EDRCNKCGQGGKVICCDTCPKVFHTKCLGLKEVPKGKWNCLVCLSNF 890
>gi|327272268|ref|XP_003220907.1| PREDICTED: transcription intermediary factor 1-alpha-like [Anolis
carolinensis]
Length = 1038
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CCD CP FH +C ++ FPSG+W C +C
Sbjct: 809 DDPNEDWCAVCQNGGELLCCDRCPKVFHLSCHVPTLRNFPSGEWICTFC 857
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 69.3 bits (168), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/60 (48%), Positives = 38/60 (63%), Gaps = 3/60 (5%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRINES 864
+ N+D C +CGDGG+LICCDGCP FH CL + + PSG W C C + GR+ E+
Sbjct: 210 QQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL-GRLREA 268
>gi|291413605|ref|XP_002723061.1| PREDICTED: transcriptional intermediary factor 1 alpha [Oryctolagus
cuniculus]
Length = 903
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CCD CP FH +C + FPSG+W
Sbjct: 659 DASQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTNFPSGEW 718
Query: 849 HCVYC 853
C +C
Sbjct: 719 ICTFC 723
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 16/106 (15%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CLD+ K P GKW C + C CG+ +
Sbjct: 1234 EGKKESEDDCFRCGDGGQLVLCDRKCCTKAYHLSCLDLVKRPFGKWECPWHHCDVCGKPS 1293
Query: 863 ESTCHV-------NDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQ 901
S CH QD + ST Q LC CS+ D V+
Sbjct: 1294 VSFCHFCPNSFCKEHQDGTVFSTTQDGRLC-------CSEHDFGVE 1332
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W+C C
Sbjct: 831 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWYCNDC 873
Score = 40.8 bits (94), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C G+L+ C+G C FH +CL + + P GK+ C C+ +C V +
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSCLGLSRRPEGKFICSECTSGI------HSCFVCKE 723
Query: 872 DDSALSTLQICSLCEEKYHQSC 893
+ + + S C + YH++C
Sbjct: 724 KKTEVKRC-VVSQCGKFYHEAC 744
>gi|242055863|ref|XP_002457077.1| hypothetical protein SORBIDRAFT_03g000895 [Sorghum bicolor]
gi|241929052|gb|EES02197.1| hypothetical protein SORBIDRAFT_03g000895 [Sorghum bicolor]
Length = 72
Score = 68.9 bits (167), Expect = 2e-08, Method: Composition-based stats.
Identities = 30/71 (42%), Positives = 50/71 (70%)
Query: 963 RLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAA 1022
+L VAL ++ +CF + D + I++++ +Y GS F+ L+Y+GF+T +LE+D EIIS A
Sbjct: 2 KLVVALDMLIKCFNLVKDRCTKIDMLYQAMYILGSKFRWLSYEGFYTIVLEKDGEIISTA 61
Query: 1023 SIRIHGKELAE 1033
+RIHG ++ E
Sbjct: 62 LLRIHGTKVVE 72
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 43/81 (53%), Gaps = 3/81 (3%)
Query: 775 ESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
+SG LL + S R G N D +DPN+D C +C +GGDL+CC+ CP FH
Sbjct: 712 QSGHGLLNGKSPMRSLMHRSARTGGEGTNKD-DDPNEDWCAVCQNGGDLLCCEKCPKVFH 770
Query: 835 QNC--LDIKKFPSGKWHCVYC 853
C + FPSG W C +C
Sbjct: 771 LTCHVPTLLSFPSGDWICTFC 791
>gi|449490172|ref|XP_002194420.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Taeniopygia guttata]
Length = 912
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 664 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 716
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 68.6 bits (166), Expect = 3e-08, Method: Composition-based stats.
Identities = 29/60 (48%), Positives = 38/60 (63%), Gaps = 3/60 (5%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRINES 864
+ N+D C +CGDGG+LICCDGCP FH CL + + PSG W C C + GR+ E+
Sbjct: 221 QQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL-GRLREA 279
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N D C IC DGGDL+CCD CP +FH C+ +K P W+C C
Sbjct: 120 NRDHCNICKDGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRC 163
Score = 47.0 bits (110), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQ 856
D+ C C GG ++CC+ C H +C + K P +WHC C Q
Sbjct: 1302 DEACYKCNKGGKVLCCESCNHVCHLSCSGLLKKPLDEWHCEECLIQ 1347
>gi|449271665|gb|EMC81949.1| E3 ubiquitin-protein ligase TRIM33 [Columba livia]
Length = 878
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 651 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 699
>gi|67514537|ref|NP_001002870.2| tripartite motif-containing 24 [Danio rerio]
gi|66910275|gb|AAH96849.1| Tripartite motif-containing 24 [Danio rerio]
gi|182888610|gb|AAI63977.1| Trim24 protein [Danio rerio]
Length = 961
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+L+CCD CP FH +C + PSG+W+C +C
Sbjct: 693 EDPNEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTASPSGEWYCTFC 741
>gi|395842293|ref|XP_003793952.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Otolemur garnettii]
Length = 1138
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 890 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 942
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 846 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 894
>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Bos taurus]
gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
taurus]
gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
Length = 1365
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ P++D C CGDGG L+ CD C +H CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKKPSEDECFRCGDGGQLVLCDRKSCTKAYHLPCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
+ CH + QD +A S+ Q
Sbjct: 1293 TAFCHFCPNSFCKDHQDTAAFSSTQ 1317
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLSI-EMPDGSWFCNDC 872
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 846 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 894
>gi|298714147|emb|CBJ27328.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 3041
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 33/45 (73%), Gaps = 1/45 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFH-QNCLDIKKFPSGKWHCVYC 853
++ TC ICG+GG LICCD CP++FH + CLD+ K P+G W C C
Sbjct: 2964 HEATCSICGEGGALICCDYCPASFHMEPCLDLAKDPAGPWACPLC 3008
>gi|449269479|gb|EMC80242.1| Transcription intermediary factor 1-alpha, partial [Columba livia]
Length = 914
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C ++ FPSG+W C +C
Sbjct: 695 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLRSFPSGEWICTFC 743
>gi|334323908|ref|XP_001382045.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1
[Monodelphis domestica]
Length = 1143
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 899 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 947
>gi|149411557|ref|XP_001506399.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial
[Ornithorhynchus anatinus]
Length = 1050
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 806 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 854
>gi|145513166|ref|XP_001442494.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409847|emb|CAK75097.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 30/47 (63%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQF 857
+D C CG GG +ICCD CP FH CL +K+ P G+W+C+ C F
Sbjct: 844 EDQCKECGKGGKVICCDTCPKVFHAKCLGLKEIPKGRWNCLVCLSNF 890
>gi|358344833|ref|XP_003636491.1| hypothetical protein MTR_042s0033 [Medicago truncatula]
gi|355502426|gb|AES83629.1| hypothetical protein MTR_042s0033 [Medicago truncatula]
Length = 355
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 88/182 (48%), Gaps = 21/182 (11%)
Query: 908 SFCGKKCQEIFERLEKLLGVKHDLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAV- 966
+FC C+ F+ L+K L ++ SL+ R+D S + Q +C+ L
Sbjct: 192 TFCSNSCKGCFDDLQKQLFENVTIDDASASSLIIRYDTKK----SKINQ--QCDYELFKE 245
Query: 967 ALSVMDECFLPLPDHRSGI-NLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIR 1025
A + F P GI +++ ++YN K+ + GF+T I++++ + S A R
Sbjct: 246 AAETLCYGFGP------GIESMVEQMVYN-----KKGEHGGFYTCIIQKEGKCASVAVFR 294
Query: 1026 IHGKE-LAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVF 1084
H + E+P + T +R QG+C+RL+ IE+ + L + LI+P++ W F
Sbjct: 295 AHARNGFVEIPIVATMESFRSQGLCKRLIHSIENCVKRLGIHHLILPSLRTRLAMWEH-F 353
Query: 1085 GF 1086
GF
Sbjct: 354 GF 355
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 751 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 803
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 963 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 1015
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 886 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 938
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GRI +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRIQQ 345
>gi|334323910|ref|XP_003340460.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2
[Monodelphis domestica]
Length = 1126
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 899 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 947
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 515 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 563
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 39/65 (60%), Gaps = 5/65 (7%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCH 867
NDD C +C DGG+LICCDGCP FH CL+ + PSG W C C G++++ H
Sbjct: 286 NDDECAVCRDGGELICCDGCPRAFHLACLEPPLTDIPSGMWR---CGCCIVGKVHQDGHH 342
Query: 868 VNDQD 872
++D
Sbjct: 343 GEERD 347
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQ 856
NDD C +C DGG+LICCDGCP +FH +CL + PSG W C C+ Q
Sbjct: 28 NDDECSVCRDGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDACNTQ 76
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 37/60 (61%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C ICGDGG ++ C GCP +H +CL++ K P+G+W C + C CG+ S C +
Sbjct: 2175 EDECFICGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQCDICGKEAASFCEL 2234
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W+C C
Sbjct: 1750 HFTPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDC 1806
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
NDD C +C DGG+LICCDGCP FH CL+ + PSG W C C
Sbjct: 156 NDDECAVCKDGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERC 201
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 515 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 563
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 703 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 755
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/63 (46%), Positives = 38/63 (60%), Gaps = 5/63 (7%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCH 867
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C CSC G + + H
Sbjct: 293 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRC--CSC-LQGGVQQGLPH 349
Query: 868 VND 870
+
Sbjct: 350 AEE 352
>gi|297474056|ref|XP_002687040.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Bos
taurus]
gi|296488233|tpg|DAA30346.1| TPA: tripartite motif-containing 24 isoform 1 [Bos taurus]
Length = 1050
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 806 DASQKSPLHAGEMRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 865
Query: 849 HCVYC 853
C +C
Sbjct: 866 ICTFC 870
>gi|50235054|gb|AAT70733.1| transcriptional intermediary factor 1 alpha [Danio rerio]
Length = 961
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+LICCD CP FH +C + PSG+W+C C
Sbjct: 693 EDPNEDWCAVCQNGGELICCDKCPKVFHLSCHVPSLTASPSGEWYCTLC 741
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 306 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIPSGLWR---CSCCLQGRVQQ 358
>gi|432864384|ref|XP_004070295.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like, partial
[Oryzias latipes]
Length = 1042
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 790 KQDESKRKGFHFVNFDG--EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPS 845
K+ E+ R G N G +DPN+D C +C +GGDL+CCD CP FH C IK FP
Sbjct: 781 KEAETSRGGASEGNHGGREDDPNEDWCAVCINGGDLLCCDNCPKVFHMKCHVPTIKIFPK 840
Query: 846 GKWHCVYC 853
G + C +C
Sbjct: 841 GDFLCTFC 848
>gi|444728362|gb|ELW68820.1| Transcription intermediary factor 1-alpha, partial [Tupaia
chinensis]
Length = 869
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H + ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 650 DASQKSPLHVGDTRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 709
Query: 849 HCVYC 853
C +C
Sbjct: 710 ICTFC 714
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 896 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 948
>gi|417413221|gb|JAA52953.1| Putative transcription intermediary factor 1-alpha, partial
[Desmodus rotundus]
Length = 951
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 707 DASQKSPLHVGEARKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 766
Query: 849 HCVYC 853
C +C
Sbjct: 767 ICTFC 771
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 409 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSC 454
>gi|74151452|dbj|BAE38840.1| unnamed protein product [Mus musculus]
Length = 764
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 536 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFC 584
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1833 EDECFSCGDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQCDMCGKEAASFCEM 1892
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W+C C
Sbjct: 1407 HFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDC 1463
Score = 46.2 bits (108), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 29/113 (25%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCS--------CQFCGRI 861
++ C IC G+L+ C+ C FH CL + + P+GK+ C CS C+ CG+
Sbjct: 1258 ENVCQICEKPGELLLCEAQCCGAFHLQCLGLSEMPTGKFICNECSTGVHTCFVCKNCGQ- 1316
Query: 862 NESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ C + LC + YH+ C Q +Y P+ G +C
Sbjct: 1317 DVKRC--------------LLPLCGKYYHEECIQ-----KYPPTVTQNKGFRC 1350
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 717 DASQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 776
Query: 849 HCVYC 853
C +C
Sbjct: 777 ICTFC 781
>gi|348579257|ref|XP_003475397.1| PREDICTED: transcription intermediary factor 1-alpha-like [Cavia
porcellus]
Length = 1165
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H + ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 921 DTSQKSPLHVGDTRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 980
Query: 849 HCVYC 853
C +C
Sbjct: 981 ICTFC 985
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 894 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 946
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 892 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 944
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 808 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 860
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 959 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 1007
>gi|297474058|ref|XP_002687041.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Bos
taurus]
gi|296488234|tpg|DAA30347.1| TPA: tripartite motif-containing 24 isoform 2 [Bos taurus]
Length = 1016
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 772 DASQKSPLHAGEMRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 831
Query: 849 HCVYC 853
C +C
Sbjct: 832 ICTFC 836
>gi|358411974|ref|XP_617403.5| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Bos taurus]
Length = 1047
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 803 DASQKSPLHAGEMRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 862
Query: 849 HCVYC 853
C +C
Sbjct: 863 ICTFC 867
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/63 (46%), Positives = 37/63 (58%), Gaps = 5/63 (7%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRINESTCH 867
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C GR + T H
Sbjct: 303 NEDECAVCRDGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSC---LQGRALQDTPH 359
Query: 868 VND 870
+
Sbjct: 360 AEE 362
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 806 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 865
Query: 849 HCVYC 853
C +C
Sbjct: 866 ICTFC 870
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 879 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 637 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 685
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 689 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 748
Query: 849 HCVYC 853
C +C
Sbjct: 749 ICTFC 753
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 726 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFC 774
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1821 EDECFSCGDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQCDMCGKEAASFCEM 1880
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W+C C
Sbjct: 1395 HFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDC 1451
Score = 46.2 bits (108), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 29/113 (25%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCS--------CQFCGRI 861
++ C IC G+L+ C+ C FH CL + + P+GK+ C CS C+ CG+
Sbjct: 1246 ENVCQICEKPGELLLCEAQCCGAFHLQCLGLSEMPTGKFICNECSTGVHTCFVCKSCGQ- 1304
Query: 862 NESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ C + LC + YH+ C Q +Y P+ G +C
Sbjct: 1305 DVKRC--------------LLPLCGKYYHEECIQ-----KYPPTVTQNKGFRC 1338
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 903 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 955
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 879 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 717 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 776
Query: 849 HCVYC 853
C +C
Sbjct: 777 ICTFC 781
>gi|395837426|ref|XP_003791635.1| PREDICTED: transcription intermediary factor 1-alpha [Otolemur
garnettii]
Length = 1014
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 770 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 829
Query: 849 HCVYC 853
C +C
Sbjct: 830 ICTFC 834
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 711 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 770
Query: 849 HCVYC 853
C +C
Sbjct: 771 ICTFC 775
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 760 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 812
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 935
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 907 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 955
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 879 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 806 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 865
Query: 849 HCVYC 853
C +C
Sbjct: 866 ICTFC 870
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 801 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 860
Query: 849 HCVYC 853
C +C
Sbjct: 861 ICTFC 865
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 806 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 865
Query: 849 HCVYC 853
C +C
Sbjct: 866 ICTFC 870
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 892 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 944
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 806 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 865
Query: 849 HCVYC 853
C +C
Sbjct: 866 ICTFC 870
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 806 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 865
Query: 849 HCVYC 853
C +C
Sbjct: 866 ICTFC 870
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 112/532 (21%), Positives = 202/532 (37%), Gaps = 100/532 (18%)
Query: 624 KKKKGKTNSAASPHGKSQKRGIKG-------------KPSVSEGGTSHNGMSI---PARR 667
+ K+G T ++ +S ++GI G PS + G H+ ++I P R
Sbjct: 416 QYKRGGTATSTQTLSRSPEKGISGLSTGTSMKINTEETPSENTAGPLHSEVTIVQEPPRG 475
Query: 668 HKLQETQQRKRCALLVRNSVEGEESNGDGFVAYDGKWTLLAWMIDTGTVPLNEKVQYWNQ 727
H + + R + + NS L + G VP + +
Sbjct: 476 HSVGPKSKESRTSKVRDNS-------------------LHQLVFKEGGVP-ELTILTYKL 515
Query: 728 RKTRVMLQGRIARDGIRCDCCSE--IFTISKFDTHS-KSKLCHPFQNLYFESGSSLLQCI 784
+ V+ QG I CDCCSE FT S F+ H+ K P++N+Y G +L +
Sbjct: 516 KHGEVLKQGYKQGTCILCDCCSEEVQFTPSHFEEHAGMGKRRQPYRNIYTPEGLTLHELA 575
Query: 785 LDSWNKQDESKRKGFHFVNFDG--EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK 842
L Q G NF G E PN + +T+ + + +K+
Sbjct: 576 L---KLQGGLNSNGNSSANFPGGDEPPNLSSGSSRESS----------TTYRPSIVPLKR 622
Query: 843 ----FPSGKWHCVYC--SCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQT 896
C C +C G I+E + C+ CE H C
Sbjct: 623 TLQQIADKTESCRLCGDACTTIGTISEDM-------------IVFCNQCERPCHVKCYNN 669
Query: 897 DGAVQYEPSSL--------SFCGKKCQEIFERLEKLLGVKHDLE---GGYTWSLVHRFDV 945
Q P ++ FC +KCQ + L ++L + + W +++ +
Sbjct: 670 GLQKQKGPLNVLAEYMQFHFFCCQKCQLLRASLHEVLNKREKIRQKRSYVFWQILNGMNP 729
Query: 946 STDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYK 1005
++ + Q +E VA S +I +++ N +++
Sbjct: 730 GINVQ-KYIHQVIEI---FKVAFP---------KTAASDFGVIQDMV-NAKDVGGEKDFR 775
Query: 1006 GFFTAILERDDE-IISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLN 1064
G + A+L + ++SAA +++ +E+AE+ + T + +R++G LL IE+ L ++N
Sbjct: 776 GMYCAVLTTSSKLVVSAAVLKVRTEEVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMN 835
Query: 1065 VEKLIIPAISELRETWTSVFGFQPLEVSSKQKMRNM-SLLVFPGVDMLQKPM 1115
V L E+ W+ GF L K+ + L++F + ++QK +
Sbjct: 836 VRLLTALVDPEMESIWSKKLGFTILSGEEKETLLEAHPLVMFEDLTLMQKSL 887
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 438 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 486
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 880 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 932
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 774 DASQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 833
Query: 849 HCVYC 853
C +C
Sbjct: 834 ICTFC 838
>gi|47419909|ref|NP_003843.3| transcription intermediary factor 1-alpha isoform b [Homo sapiens]
gi|114616228|ref|XP_001149035.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3 [Pan
troglodytes]
gi|397484617|ref|XP_003813470.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Pan
paniscus]
gi|4325107|gb|AAD17258.1| transcriptional intermediary factor 1 alpha [Homo sapiens]
gi|51094800|gb|EAL24046.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|119604290|gb|EAW83884.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|119604292|gb|EAW83886.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|410249254|gb|JAA12594.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287762|gb|JAA22481.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1016
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 772 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 831
Query: 849 HCVYC 853
C +C
Sbjct: 832 ICTFC 836
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 840 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 892
>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Pongo abelii]
Length = 1221
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 977 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 1036
Query: 849 HCVYC 853
C +C
Sbjct: 1037 ICTFC 1041
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 736 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 784
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 700 DASQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 759
Query: 849 HCVYC 853
C +C
Sbjct: 760 ICTFC 764
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 772 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 831
Query: 849 HCVYC 853
C +C
Sbjct: 832 ICTFC 836
>gi|383422495|gb|AFH34461.1| transcription intermediary factor 1-alpha isoform b [Macaca
mulatta]
Length = 1016
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 772 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 831
Query: 849 HCVYC 853
C +C
Sbjct: 832 ICTFC 836
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 698 DASQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEW 757
Query: 849 HCVYC 853
C +C
Sbjct: 758 ICTFC 762
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 766 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 825
Query: 849 HCVYC 853
C +C
Sbjct: 826 ICTFC 830
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 879 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 365 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 413
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 788 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 836
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 772 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 831
Query: 849 HCVYC 853
C +C
Sbjct: 832 ICTFC 836
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 800 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 848
>gi|344252625|gb|EGW08729.1| E3 ubiquitin-protein ligase TRIM33 [Cricetulus griseus]
Length = 910
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 662 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 714
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 829 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 877
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 685 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 744
Query: 849 HCVYC 853
C +C
Sbjct: 745 ICTFC 749
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 348
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 349
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GRI +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRIQQ 345
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL + + PSG W C C
Sbjct: 332 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 377
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 884 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 932
>gi|449019235|dbj|BAM82637.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 540
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 28/43 (65%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
DD C ICG GGD++CCD CP +H C+ + PSG+W C C
Sbjct: 177 DDVCSICGLGGDIVCCDECPMGYHLQCIGLPSIPSGEWFCPAC 219
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 46/85 (54%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CLD+ K P GKW C + C CG+ +
Sbjct: 1234 EGKKESEDDCFRCGDGGQLVLCDRKCCTKAYHLSCLDLVKRPFGKWECPWHHCDVCGKPS 1293
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH + QD + ST Q
Sbjct: 1294 VSFCHFCPNSFCKDHQDGTVFSTTQ 1318
Score = 54.3 bits (129), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W+C C
Sbjct: 831 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWYCNDC 873
Score = 41.2 bits (95), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C G+L+ C+G C FH +CL + + P GK+ C C+ +C V +
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSCLGLSRRPEGKFICSECTSGI------HSCFVCKE 723
Query: 872 DDSALSTLQICSLCEEKYHQSC 893
+ + + S C + YH++C
Sbjct: 724 KKTEVKRC-VVSQCGKFYHEAC 744
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 768 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 816
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 800 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 848
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GRI +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRIQQ 345
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 882 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 930
>gi|431911670|gb|ELK13818.1| Transcription intermediary factor 1-alpha [Pteropus alecto]
Length = 1615
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 1389 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 1437
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 340
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 821 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 869
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 797 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 845
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 345
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 746 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 798
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 344
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 772 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 820
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 792 DESKRKGFHFVNFDGED-PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKW 848
D S++ H ED PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W
Sbjct: 768 DPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEW 827
Query: 849 HCVYC 853
C +C
Sbjct: 828 ICTFC 832
>gi|2660669|gb|AAC79140.1| human Mi-2 autoantigen-like protein [Arabidopsis thaliana]
gi|9758384|dbj|BAB08833.1| helicase-like protein [Arabidopsis thaliana]
Length = 2228
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 42/71 (59%), Gaps = 10/71 (14%)
Query: 786 DSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKF 843
DS +++ SKRKG D N C IC GGDL+CCD CP T+H CL+ +K+
Sbjct: 44 DSTPERNSSKRKG--------NDGNYFECVICDLGGDLLCCDSCPRTYHTACLNPPLKRI 95
Query: 844 PSGKWHCVYCS 854
P+GKW C CS
Sbjct: 96 PNGKWICPKCS 106
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 787 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 835
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 883 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 931
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 1014 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 1062
>gi|440893214|gb|ELR46066.1| Transcription intermediary factor 1-alpha, partial [Bos grunniens
mutus]
Length = 940
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 712 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 760
>gi|334188199|ref|NP_199293.3| chromatin remodeling 4 protein [Arabidopsis thaliana]
gi|332007781|gb|AED95164.1| chromatin remodeling 4 protein [Arabidopsis thaliana]
Length = 2223
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 42/71 (59%), Gaps = 10/71 (14%)
Query: 786 DSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKF 843
DS +++ SKRKG D N C IC GGDL+CCD CP T+H CL+ +K+
Sbjct: 58 DSTPERNSSKRKG--------NDGNYFECVICDLGGDLLCCDSCPRTYHTACLNPPLKRI 109
Query: 844 PSGKWHCVYCS 854
P+GKW C CS
Sbjct: 110 PNGKWICPKCS 120
>gi|189230248|ref|NP_001121448.1| tripartite motif containing 24 [Xenopus (Silurana) tropicalis]
gi|183985692|gb|AAI66206.1| LOC100158542 protein [Xenopus (Silurana) tropicalis]
Length = 1040
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 816 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFC 864
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 882 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 930
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 57/122 (46%), Gaps = 26/122 (21%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCH 867
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ ++
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQNLSQ 353
Query: 868 ----------VNDQDDSALSTLQICSLCEEKYH--QSCSQTDGAVQYEPSSLSFC---GK 912
D S L L +C L H +SC+ + PSS C G+
Sbjct: 354 PEVSRPPELPAETPDQSPLQIL-LCRLDSHARHTGRSCTHL-----WAPSSTWACQGRGR 407
Query: 913 KC 914
C
Sbjct: 408 LC 409
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 252 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 297
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 890 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 938
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 67.0 bits (162), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 54/115 (46%), Gaps = 23/115 (20%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCH 867
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ ++
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQNLSQ 349
Query: 868 ----------VNDQDDSALSTLQICSLCEEKYH--QSCSQTDGAVQYEPSSLSFC 910
D S L L +C L H +SC+ + PSS C
Sbjct: 350 PEVSRPPELPAETPDQSPLQIL-LCRLDSHARHTGRSCTHL-----WAPSSTWAC 398
>gi|347300253|ref|NP_001002871.2| E3 ubiquitin-protein ligase TRIM33 [Danio rerio]
Length = 1176
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CCD CP FH C +K PSG W C +C
Sbjct: 930 DDPNEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFC 978
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 57/122 (46%), Gaps = 26/122 (21%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCH 867
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ ++
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQNLSQ 352
Query: 868 ----------VNDQDDSALSTLQICSLCEEKYH--QSCSQTDGAVQYEPSSLSFC---GK 912
D S L L +C L H +SC+ + PSS C G+
Sbjct: 353 PEVSRPPELPAETPDQSPLQIL-LCRLDSHARHTGRSCTHL-----WAPSSTWACQGRGR 406
Query: 913 KC 914
C
Sbjct: 407 LC 408
>gi|395539485|ref|XP_003771699.1| PREDICTED: transcription intermediary factor 1-alpha [Sarcophilus
harrisii]
Length = 880
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 652 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFC 700
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL + K PSG W C C
Sbjct: 302 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCC 347
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 57/122 (46%), Gaps = 26/122 (21%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCH 867
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ ++
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQNLSQ 348
Query: 868 ----------VNDQDDSALSTLQICSLCEEKYH--QSCSQTDGAVQYEPSSLSFC---GK 912
D S L L +C L H +SC+ + PSS C G+
Sbjct: 349 PEVSRPPELPAETPDQSPLQIL-LCRLDSHARHTGRSCTHL-----WAPSSTWACQGRGR 402
Query: 913 KC 914
C
Sbjct: 403 LC 404
>gi|94420998|ref|NP_659542.3| transcription intermediary factor 1-alpha isoform 1 [Mus musculus]
gi|6175058|sp|Q64127.1|TIF1A_MOUSE RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase Trim24; AltName: Full=Tripartite motif-containing
protein 24
gi|998815|gb|AAB34290.1| TIF1 isoform [Mus sp.]
gi|34785214|gb|AAH56959.1| Tripartite motif-containing 24 [Mus musculus]
gi|148681701|gb|EDL13648.1| tripartite motif protein 24, isoform CRA_a [Mus musculus]
Length = 1051
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 823 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFC 871
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 265 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFC 313
>gi|440918679|ref|NP_001258993.1| transcription intermediary factor 1-alpha isoform 2 [Mus musculus]
gi|998813|gb|AAB34289.1| TIF1 [Mus sp.]
gi|148681703|gb|EDL13650.1| tripartite motif protein 24, isoform CRA_c [Mus musculus]
Length = 1017
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 789 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFC 837
>gi|82085579|sp|Q6E2N3.1|TRI33_DANRE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Protein
moonshine; AltName: Full=Transcription intermediary
factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|50235052|gb|AAT70732.1| transcriptional intermediary factor 1 gamma [Danio rerio]
Length = 1163
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CCD CP FH C +K PSG W C +C
Sbjct: 917 DDPNEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFC 965
>gi|363728093|ref|XP_416340.3| PREDICTED: transcription intermediary factor 1-alpha [Gallus
gallus]
Length = 939
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 711 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFC 759
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 697 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFC 745
>gi|149065269|gb|EDM15345.1| rCG27932, isoform CRA_a [Rattus norvegicus]
Length = 1050
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 822 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFC 870
>gi|81294213|gb|AAI07916.1| Trim24 protein [Rattus norvegicus]
Length = 607
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 379 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFC 427
>gi|149065270|gb|EDM15346.1| rCG27932, isoform CRA_b [Rattus norvegicus]
Length = 1016
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 788 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFC 836
>gi|126340851|ref|XP_001374515.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Monodelphis domestica]
Length = 1061
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 833 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFC 881
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 184 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 232
>gi|334348540|ref|XP_003342075.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3
[Monodelphis domestica]
Length = 1060
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 832 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFC 880
>gi|334348538|ref|XP_003342074.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Monodelphis domestica]
Length = 1027
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 799 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFC 847
>gi|410899062|ref|XP_003963016.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Takifugu
rubripes]
Length = 1148
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CCD CP FH C IK FP G++ C +C
Sbjct: 905 DDPNEDWCAVCINGGDLLCCDRCPKVFHMKCHVPTIKIFPKGEFLCTFC 953
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2013 EDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2072
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1587 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1643
Score = 44.3 bits (103), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1438 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1491
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1492 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVVQNKGFRC 1530
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 349
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 380 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 425
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 66.6 bits (161), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 98 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|397629214|gb|EJK69257.1| hypothetical protein THAOC_09499 [Thalassiosira oceanica]
Length = 734
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
D C +CGDGGDL+CCDGC +++HQ CL+ +++ P G+W C C
Sbjct: 672 DVCEVCGDGGDLLCCDGCINSYHQRCLNPPMEQVPEGQWFCPSC 715
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1808 EDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 1867
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1382 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1438
Score = 44.3 bits (103), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1233 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1286
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1287 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVVQNKGFRC 1325
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 348
>gi|89130583|gb|AAI14246.1| Trim33 protein [Danio rerio]
Length = 1058
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CCD CP FH C +K PSG W C +C
Sbjct: 812 DDPNEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFC 860
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 345
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ PSG W CSC GR+ +
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWR---CSCCLQGRVQQ 344
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 243 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 295
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 66.2 bits (160), Expect = 1e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 98 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|74221665|dbj|BAE21530.1| unnamed protein product [Mus musculus]
Length = 981
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 753 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCQVPTLTNFPSGEWICTFC 801
>gi|348526504|ref|XP_003450759.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1043
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+L+CCD CP FH C + + PSG+W C +C
Sbjct: 706 EDPNEDWCAVCQNGGELLCCDKCPKVFHLACHIPTLNESPSGEWFCSFC 754
>gi|440918697|ref|NP_001259005.1| transcription intermediary factor 1-alpha isoform 3 [Mus musculus]
Length = 981
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 753 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFC 801
>gi|326912229|ref|XP_003202456.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Meleagris gallopavo]
Length = 1051
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 823 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFC 871
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 847 EDECFSCGDGGQLVSCKKSGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 906
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W+C C
Sbjct: 421 HFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDC 477
Score = 45.4 bits (106), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 47/105 (44%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C IC G+L+ C+ C FH CL + + P GK+ C CS TC V
Sbjct: 272 ENVCQICEKPGELLLCEAQCCGAFHLQCLGLSEMPKGKFICNECS------TGVHTCFVC 325
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ + LC + YH++C Q +Y P+ + G +C
Sbjct: 326 KSCGEDVKRC-LLPLCGKYYHEACIQ-----KYPPTVMQNKGFRC 364
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|343961759|dbj|BAK62469.1| transcription intermediary factor 1-alpha [Pan troglodytes]
Length = 375
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 147 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFC 195
>gi|432942390|ref|XP_004082995.1| PREDICTED: uncharacterized protein LOC101161205 [Oryzias latipes]
Length = 1040
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+L+CCD CP FH +C + + PSG+W C +C
Sbjct: 710 EDPNEDWCAVCQNGGELLCCDKCPKVFHLSCHIPALNESPSGEWFCSFC 758
>gi|19343832|gb|AAH25482.1| Trim24 protein [Mus musculus]
Length = 516
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH C + FPSG+W C +C
Sbjct: 288 DDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFC 336
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 274 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 319
>gi|218675692|gb|AAI69321.2| tripartite motif protein 33 isoform 1 [synthetic construct]
Length = 360
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 112 NSKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 164
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 65.9 bits (159), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSC 344
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD CPS +H+ CL+ +++ P G+W C CSC
Sbjct: 430 DDEHQEFCRICKDGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRCSC 480
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 43/107 (40%), Gaps = 33/107 (30%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS-------- 854
+GE + D C +C GG++I CD CP +H CLD +++ P G+W C YC
Sbjct: 369 EGEQEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQAEGNQEQE 428
Query: 855 --------CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
C+ C E C C C YH+ C
Sbjct: 429 DDDEHQEFCRICKDGGELLC---------------CDSCPSAYHRFC 460
>gi|148232826|ref|NP_001089077.1| E3 ubiquitin-protein ligase TRIM33 [Xenopus laevis]
gi|82122015|sp|Q56R14.1|TRI33_XENLA RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|59891841|gb|AAX10105.1| ectodermin [Xenopus laevis]
Length = 1091
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 809 PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
PN+D C +C +GGDL+CC+ CP FH C + FPSG+W C +C
Sbjct: 848 PNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFC 894
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CLD+ K P GKW C + C CG+ +
Sbjct: 1238 EGKKESEDDCFRCGDGGQLVLCDRKSCTKAYHLSCLDLAKRPFGKWECPWHHCDVCGKPS 1297
Query: 863 ESTCHV-------NDQDDSAL-STL--QIC 882
S CH QD + L STL Q+C
Sbjct: 1298 VSFCHFCPNSFCKEHQDGTVLNSTLNGQLC 1327
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDGEDP----NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G+ N C +C GG L+CC+ CP+ FH +CL+I + P G W+C C
Sbjct: 821 HFTAMKGKSHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWYCNDC 877
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C GDL+ C+G C FH +CL + P+GK+ C C+ TC V +
Sbjct: 674 VCQLCEKTGDLLLCEGLCYRAFHVSCLGLSGRPAGKFVCSECTSGV------HTCFVCKE 727
Query: 872 DDSALSTLQICSLCEEKYHQSC 893
+ L + S C + YH++C
Sbjct: 728 RKADLKRC-VVSHCGKFYHEAC 748
>gi|301122693|ref|XP_002909073.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262099835|gb|EEY57887.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 751
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 801 FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG--KWHCVYCS 854
FVN D D N+D C CG G+LICCDGCP+ FH NC ++ P W C C+
Sbjct: 600 FVNTDQNDLNEDACFRCGLAGELICCDGCPAAFHLNCTNLAMVPPDGIPWFCSECT 655
>gi|335305258|ref|XP_003360173.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Sus scrofa]
Length = 298
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 70 DDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFC 118
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 65.9 bits (159), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL ++ PSG W C C
Sbjct: 307 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 352
Score = 40.8 bits (94), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 6/55 (10%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC-VYCSCQFCGRINESTCH 867
CG+CGDG D++ C C + FH C FP+G C+FC + + CH
Sbjct: 446 CGVCGDGADVLRCAHCAAAFHWRC----HFPAGAARPGASPRCRFCSG-DPAPCH 495
>gi|47211547|emb|CAF96112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 886
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+L+CCD CP FH +C + + PSG+W C +C
Sbjct: 702 EDPNEDWCAVCQNGGELLCCDRCPKVFHLSCHIPALHEPPSGEWFCSFC 750
>gi|224096253|ref|XP_002191494.1| PREDICTED: transcription intermediary factor 1-alpha [Taeniopygia
guttata]
Length = 1061
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GG L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 833 DDPNEDWCAVCQNGGKLLCCEKCPKVFHLSCHVPTLMSFPSGEWICTFC 881
>gi|403373758|gb|EJY86802.1| PHD domain containing protein [Oxytricha trifallax]
Length = 988
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDI-KKFPSGKWHCVYC 853
D N D CGIC G LICCD CPS FH +CL K+ P GKW C +C
Sbjct: 574 DLNADVCGICDLPGKLICCDDCPSAFHADCLGYEKQCPRGKWKCYFC 620
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSC 855
N + C +C DGG+L+CCD CPS++H+ CL + P G WHC C+C
Sbjct: 406 NMEFCRLCRDGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRCTC 453
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 51/106 (48%), Gaps = 20/106 (18%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGR 860
N+D + + D C +C GG++I CD CP +H CL D+++ P GKW C +C
Sbjct: 333 NYDTD--HQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEEPPGGKWSCPHCENDL--- 387
Query: 861 INESTCHVNDQDDSALS-TLQICSLCEE------------KYHQSC 893
+N++ + + A + ++ C LC + YH+ C
Sbjct: 388 VNDNDAVTSKEAAPAKAGNMEFCRLCRDGGELLCCDSCPSSYHRYC 433
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 65.5 bits (158), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 31/53 (58%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
E+ D C +C DGG+L+ CD CPS FH C+ ++ P G W C C C CG
Sbjct: 461 EEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCRCGVCG 513
>gi|189238110|ref|XP_001814014.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270008739|gb|EFA05187.1| hypothetical protein TcasGA2_TC015317 [Tribolium castaneum]
Length = 706
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 781 LQCILDSWNKQDESKRKG-----FHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQ 835
+Q ++ +D RKG + G+ N D+C CG+GGDLICCD CPS+FH
Sbjct: 18 IQALIAPPQSEDTKSRKGATATEHPYFKRPGKGHNHDSCDACGEGGDLICCDKCPSSFHL 77
Query: 836 NC----LDIKKFPSGKWHCVYC 853
C L+ K PSG+W C C
Sbjct: 78 QCHDPPLEEKDIPSGEWLCHSC 99
>gi|10438579|dbj|BAB15281.1| unnamed protein product [Homo sapiens]
Length = 699
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 180
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 65.5 bits (158), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 426 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSC 471
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 65.5 bits (158), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
ED + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 434 EDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRCTC 484
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D+ K P GKW C +C
Sbjct: 356 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC 404
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 65.5 bits (158), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD CPS +H +CL+ + + P G W C CSC
Sbjct: 427 DDEHQEFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRCSC 477
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 38/91 (41%), Gaps = 16/91 (17%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C GG++I CD CP +H CLD ++ P GKW C +C NE +
Sbjct: 374 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCE-------NEGPAEQD 426
Query: 870 DQDDSAL-------STLQICSLCEEKYHQSC 893
D + L C C YH C
Sbjct: 427 DDEHQEFCRICKDGGELLCCDSCPSAYHTHC 457
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 65.5 bits (158), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD CPS +H +CL+ + + P G W C CSC
Sbjct: 430 DDEHQEFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRCSC 480
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 38/91 (41%), Gaps = 16/91 (17%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C GG++I CD CP +H CLD ++ P GKW C +C NE +
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCE-------NEGPAEQD 429
Query: 870 DQDDSAL-------STLQICSLCEEKYHQSC 893
D + L C C YH C
Sbjct: 430 DDEHQEFCRICKDGGELLCCDSCPSAYHTHC 460
>gi|297791319|ref|XP_002863544.1| hypothetical protein ARALYDRAFT_917093 [Arabidopsis lyrata subsp.
lyrata]
gi|297309379|gb|EFH39803.1| hypothetical protein ARALYDRAFT_917093 [Arabidopsis lyrata subsp.
lyrata]
Length = 2251
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS 854
G D N C IC GGDL+CCD CP T+H CL+ +K+ P+GKW C CS
Sbjct: 70 GNDGNYFECVICDLGGDLLCCDSCPRTYHTACLNPPLKRIPNGKWICPKCS 120
>gi|145482349|ref|XP_001427197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394277|emb|CAK59799.1| unnamed protein product [Paramecium tetraurelia]
Length = 922
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRI 861
+D C C GG +ICCD CP FH C+++K+ P GKW+C+ C F +I
Sbjct: 860 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLTNFERQI 910
>gi|297737996|emb|CBI27197.3| unnamed protein product [Vitis vinifera]
Length = 1638
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCHV 868
C IC GG+L+CCD CP T+H CL+ +K+ P+GKW C C C +++ S V
Sbjct: 78 CVICDLGGNLLCCDSCPRTYHLQCLNPPLKRIPNGKWQCPKCFCSIEKKLDSSQIDV 134
>gi|410919013|ref|XP_003972979.1| PREDICTED: transcription intermediary factor 1-alpha-like [Takifugu
rubripes]
Length = 1006
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
EDPN+D C +C +GG+L+CCD CP FH C + PSG+W C +C
Sbjct: 687 EDPNEDWCAVCQNGGELLCCDKCPKVFHLTCHIPTLIASPSGEWFCSFC 735
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 96/429 (22%), Positives = 178/429 (41%), Gaps = 61/429 (14%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCS-EI-FTISKFDTH-S 761
TL + G +P + + + + V+ QG GI C+CCS E+ +T S+F+ H
Sbjct: 488 TLHPMLFKEGGLP-DYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVG 546
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ P++++Y G +L + L + + VN D E P + G G
Sbjct: 547 MGRRRQPYRSIYTSDGLTLHELALKLQDGLSSN-------VNID-ELPTLTS----GSGK 594
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
+ +T + +K+ + V SC C R + V D +
Sbjct: 595 EY------STTSRPIIVPLKRTLQERVLTVE-SCYMC-RKPHTVLGVISVD-----MIVF 641
Query: 882 CSLCEEKYHQSC-------SQTDGAVQYEPSSLSF-CGKKCQEIFERLEKLLGVKHDL-- 931
C+ CE H C + V E + +F C +KCQ + L + L + D+
Sbjct: 642 CNQCERALHVKCYNNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLRASLHEGLKKREDIAF 701
Query: 932 ----EGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINL 987
W L++ ++ SDV +V + + + F I+
Sbjct: 702 LRRIRYNICWQLLNGTNMR-----SDVQHQV---------IEIFKDAFAETAPQ--DIDD 745
Query: 988 IHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQG 1047
I N++ + + ++ +++G + A+L ++SAA +++ +E+AE+ I T + R++G
Sbjct: 746 IRNMVNSKDTTGEK-DFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKG 804
Query: 1048 MCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFP 1106
LL+ IE+ L + NV L P E+ W+ G+ L K M L++F
Sbjct: 805 YFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFA 864
Query: 1107 GVDMLQKPM 1115
+ ++QK +
Sbjct: 865 NLSLVQKSL 873
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG ++ C GCP +H +CL++ K P+G+W C + C CGR S C +
Sbjct: 1800 EDECFYCGDGGQIVSCKKPGCPKVYHADCLNLSKRPAGRWECPWHQCNECGREAASYCEM 1859
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W C C
Sbjct: 1375 HFTPRKGCKNHEHINVSWCFVCSEGGSLLCCESCPAAFHRECLNI-EMPQGSWFCNDC 1431
Score = 44.7 bits (104), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 48/105 (45%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C +C G+L+ C+G C FH C+ + + P G++ C C ++ TC V
Sbjct: 1226 ENVCQVCEKTGELLLCEGQCCGAFHLQCIGLTETPKGRFICQEC------KMGVHTCFVC 1279
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ D + I +C + YH C ++Y P+ G +C
Sbjct: 1280 KKPDKEVRRCMI-PVCGKFYHMDC-----ILKYSPTVAQNRGFRC 1318
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1834 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1893
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1408 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1464
Score = 45.1 bits (105), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1259 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1312
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1313 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVIQNKGFRC 1351
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
NDD C +C DGG+LICCDGCP FH +CL + P G W C C
Sbjct: 291 NDDECAVCKDGGELICCDGCPRAFHLSCLVPPLTSIPRGTWRCQLC 336
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2180
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 44.7 bits (104), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1600 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1638
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2180
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 45.1 bits (105), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1600 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1638
>gi|358255504|dbj|GAA57198.1| probable histone-lysine N-methyltransferase NSD2 [Clonorchis
sinensis]
Length = 1293
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 809 PNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
P++D C CGDGG+L+ CD CP +FH NCL + PSG W+C + C CG + C
Sbjct: 1056 PHEDFCYRCGDGGELLLCDKSSCPKSFHLNCLGLSSPPSGIWYCPWHYCDQCGHPSTHLC 1115
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 96/429 (22%), Positives = 178/429 (41%), Gaps = 61/429 (14%)
Query: 705 TLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCS-EI-FTISKFDTH-S 761
TL + G +P + + + + V+ QG GI C+CCS E+ +T S+F+ H
Sbjct: 587 TLHPMLFKEGGLP-DYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVG 645
Query: 762 KSKLCHPFQNLYFESGSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGG 821
+ P++++Y G +L + L + + VN D E P + G G
Sbjct: 646 MGRRRQPYRSIYTSDGLTLHELALKLQDGLSSN-------VNID-ELPTLTS----GSGK 693
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQI 881
+ +T + +K+ + V SC C R + V D +
Sbjct: 694 EY------STTSRPIIVPLKRTLQERVLTVE-SCYMC-RKPHTVLGVISVD-----MIVF 740
Query: 882 CSLCEEKYHQSC-------SQTDGAVQYEPSSLSF-CGKKCQEIFERLEKLLGVKHDL-- 931
C+ CE H C + V E + +F C +KCQ + L + L + D+
Sbjct: 741 CNQCERALHVKCYNNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLRASLHEGLKKREDIAF 800
Query: 932 ----EGGYTWSLVHRFDVSTDLSLSDVCQKVECNARLAVALSVMDECFLPLPDHRSGINL 987
W L++ ++ SDV +V + + + F I+
Sbjct: 801 LRRIRYNICWQLLNGTNMR-----SDVQHQV---------IEIFKDAFAETAPQ--DIDD 844
Query: 988 IHNILYNFGSNFKRLNYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQG 1047
I N++ + + ++ +++G + A+L ++SAA +++ +E+AE+ I T + R++G
Sbjct: 845 IRNMVNSKDTTGEK-DFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKG 903
Query: 1048 MCRRLLTGIESALCSLNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFP 1106
LL+ IE+ L + NV L P E+ W+ G+ L K M L++F
Sbjct: 904 YFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFA 963
Query: 1107 GVDMLQKPM 1115
+ ++QK +
Sbjct: 964 NLSLVQKSL 972
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2180
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 44.7 bits (104), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1600 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1638
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2180
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 45.1 bits (105), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLNEMPRGKFICNEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1600 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1638
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG ++ C GCP +H +CL++ K P+G+W C + C CG+ S C +
Sbjct: 2121 EDECFSCGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQCDICGKEAASFCEM 2180
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W+C C
Sbjct: 1698 HFTPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDC 1754
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2178
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 41.6 bits (96), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNECHTGI------HTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVTQNKGFRC 1636
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2122 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2181
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1696 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1752
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1547 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1600
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1601 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1639
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2016 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2075
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1590 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 41.6 bits (96), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C TC V
Sbjct: 1441 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNECHTGI------HTCFVC 1494
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ G +C
Sbjct: 1495 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVTQNKGFRC 1533
>gi|52545752|emb|CAH56331.1| hypothetical protein [Homo sapiens]
Length = 881
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 303 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 362
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2016 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2075
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1590 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 41.6 bits (96), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C TC V
Sbjct: 1441 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNECHTGI------HTCFVC 1494
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ G +C
Sbjct: 1495 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVTQNKGFRC 1533
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2178
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1636
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 64.7 bits (156), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 242 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCC 287
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2178
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 45.1 bits (105), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPKGKFICNEC------RTGIHTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1636
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2178
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 44.7 bits (104), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1636
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2178
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 44.7 bits (104), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1636
>gi|372467017|pdb|3U5M|A Chain A, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467018|pdb|3U5M|B Chain B, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467019|pdb|3U5M|C Chain C, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467020|pdb|3U5M|D Chain D, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467021|pdb|3U5M|E Chain E, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467022|pdb|3U5M|F Chain F, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467023|pdb|3U5M|G Chain G, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467024|pdb|3U5M|H Chain H, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467025|pdb|3U5M|I Chain I, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467026|pdb|3U5M|J Chain J, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467027|pdb|3U5M|K Chain K, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467028|pdb|3U5M|L Chain L, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467029|pdb|3U5N|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467031|pdb|3U5N|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467033|pdb|3U5O|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467034|pdb|3U5O|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467035|pdb|3U5O|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467036|pdb|3U5O|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467037|pdb|3U5O|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467038|pdb|3U5O|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467039|pdb|3U5O|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467040|pdb|3U5O|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467049|pdb|3U5P|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467051|pdb|3U5P|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467053|pdb|3U5P|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467055|pdb|3U5P|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467057|pdb|3U5P|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467059|pdb|3U5P|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467061|pdb|3U5P|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467063|pdb|3U5P|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
Length = 207
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+CC+ CP FH C + FPSG W C +C
Sbjct: 3 DDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 51
>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Equus caballus]
Length = 1365
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKKKSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH QD +A S+ Q
Sbjct: 1293 TSFCHFCPNSFCKEHQDGTAFSSTQ 1317
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
Score = 40.8 bits (94), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 8/82 (9%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C G L+ C+G C FH CL + + P G++ C C+ TC V +
Sbjct: 669 VCQLCEKTGSLVLCEGPCCGAFHLACLGLSRRPEGRFTCSECASGI------HTCFVCKE 722
Query: 872 DDSALSTLQICSLCEEKYHQSC 893
+ + + + S C + YH++C
Sbjct: 723 NKTDVKRC-VVSQCGKFYHEAC 743
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2118 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2177
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1692 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1543 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1596
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1597 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1635
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2117 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2176
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1691 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 44.7 bits (104), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1542 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1595
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1596 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1634
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2117 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2176
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1691 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 44.7 bits (104), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1542 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1595
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1596 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1634
>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
Length = 1048
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 8/87 (9%)
Query: 810 NDDTCGICGDGGDLICCD-GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
NDDTC ICG+ G LICCD GC FH CL I + P+G++ C CS TC+
Sbjct: 438 NDDTCRICGELGQLICCDGGCRGAFHLECLSILQPPTGEFRCDECS------TGNHTCYT 491
Query: 869 NDQDDSALSTLQICSLCEEKYHQSCSQ 895
D+ + L Q C + YH+ C++
Sbjct: 492 CDKVGADLIKCQF-PHCNKLYHRGCAE 517
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 15/97 (15%)
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
++ D DD C IC DGGDL+ CD C +H CL + K P+GK+ C + +C CG
Sbjct: 913 TTYEDLDRTDDECFICKDGGDLLMCDKKNCDKVYHLACLGMNKVPAGKFICPHHACLKCG 972
Query: 860 RIN--------ESTCHVNDQDDSALSTLQICSLCEEK 888
R E+ C + ++T ++ +LCE K
Sbjct: 973 RKATIFSETGPEAYCSMKH-----IATPEVRALCEGK 1004
Score = 48.1 bits (113), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 13/83 (15%)
Query: 777 GSSLLQCILDSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQN 836
G+ L++C NK +H + + D+ C +CG GGDL+ CDGCP +H
Sbjct: 496 GADLIKCQFPHCNKL-------YHRGCAEKQFKADNFCLVCGTGGDLVVCDGCPGAYHAA 548
Query: 837 CLDI------KKFPSGKWHCVYC 853
C+ K G+W C C
Sbjct: 549 CIKSTFAFTGKPDEQGQWFCHDC 571
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2120 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2179
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1694 HFAPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1750
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2178
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 44.7 bits (104), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1636
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1812 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1871
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1386 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1442
Score = 44.7 bits (104), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1237 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1290
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1291 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1329
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2119 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2178
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1693 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 42.4 bits (98), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 45/105 (42%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C TC V
Sbjct: 1544 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECHTGI------HTCFVC 1597
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1598 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1636
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C +C DGG+L+CCDGCP FH CL ++ PSG W C C
Sbjct: 293 NEDECAVCRDGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 338
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2117 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2176
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1691 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 44.7 bits (104), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1542 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1595
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1596 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1634
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
ED + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 435 EDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRCTC 485
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D+ K P GKW C +C
Sbjct: 357 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC 405
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2180
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFAPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH C+ + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKVGELLLCEAQCCGAFHLECIGLTEMPKGKFICKEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ + LC + YH++C Q +Y P+ L G +C
Sbjct: 1600 KTSGEDVKRC-LLPLCGKFYHEACIQ-----KYPPTVLQNKGFRC 1638
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1849 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1908
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1423 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1479
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1274 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1327
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1328 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1366
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1849 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1908
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1423 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1479
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1274 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1327
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1328 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1366
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2122 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2181
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1696 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1752
Score = 44.7 bits (104), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1547 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1600
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1601 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1639
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1751 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1810
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1336 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1392
Score = 44.7 bits (104), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1187 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1240
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1241 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1279
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1850 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1909
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1424 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1480
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1275 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1328
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1329 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1367
>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Otolemur garnettii]
Length = 1371
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
DG+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1239 DGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1298
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH QD +A S+ +
Sbjct: 1299 TSFCHFCPNSFCKEHQDGTAFSSTK 1323
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2118 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2177
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1692 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 42.4 bits (98), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 45/105 (42%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C TC V
Sbjct: 1543 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECHTGI------HTCFVC 1596
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1597 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1635
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1850 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1909
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1424 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1480
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1275 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1328
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1329 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1367
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2015 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2074
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1589 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1645
Score = 44.7 bits (104), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1440 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1493
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1494 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1532
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2016 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2075
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1590 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1441 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1494
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1495 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1533
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1848 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1907
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1422 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1478
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1273 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1326
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1327 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1365
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2015 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 2074
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1589 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1645
Score = 44.7 bits (104), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1440 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1493
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1494 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1532
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1853 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1912
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1427 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1483
Score = 44.7 bits (104), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1278 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1331
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1332 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1370
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2110 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2169
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1684 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1740
Score = 45.1 bits (105), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1535 ENVCQNCEKVGELLLCEAQCCGAFHLECLGLAEMPRGKFICNEC------RTGIHTCFVC 1588
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1589 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1627
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2180
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 43.5 bits (101), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ G +C
Sbjct: 1600 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVTQNKGFRC 1638
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1810 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 1869
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1384 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1440
Score = 41.6 bits (96), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C TC V
Sbjct: 1235 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNECHTGI------HTCFVC 1288
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ G +C
Sbjct: 1289 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVTQNKGFRC 1327
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 64.7 bits (156), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1931 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1990
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1505 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1561
Score = 44.7 bits (104), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1356 ENVCQNCEKVGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1409
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1410 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1448
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2118 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2177
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1692 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1543 ENVCQNCEKVGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1596
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1597 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1635
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1813 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 1872
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1387 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1443
Score = 44.7 bits (104), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1238 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1291
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1292 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1330
>gi|312371986|gb|EFR20039.1| hypothetical protein AND_20711 [Anopheles darlingi]
Length = 828
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 9/86 (10%)
Query: 780 LLQCILDSWNKQDESKRKGFH-----FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFH 834
L+Q ++ D R G + G N+DTC C +GGDL+CCD CPS+FH
Sbjct: 10 LIQALIKPPESTDSLGRTGLRRSSHPYYRRPGRGHNNDTCDACKEGGDLLCCDRCPSSFH 69
Query: 835 QNCLDI----KKFPSGKWHCVYCSCQ 856
C D ++ P G+W C C C+
Sbjct: 70 LGCHDPPLSEQEIPHGQWVCHTCKCK 95
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1749 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 1808
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1323 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1379
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1174 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1227
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ G +C
Sbjct: 1228 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVTQNKGFRC 1266
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 64.3 bits (155), Expect = 5e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL + + PSG W C C
Sbjct: 293 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 338
>gi|145547050|ref|XP_001459207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427031|emb|CAK91810.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRI 861
+D C C GG +ICCD CP FH C+++K+ P GKW+C+ C F +I
Sbjct: 865 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLRNFERQI 915
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG ++ C GCP +H +CL++ K P+G+W C + C CG+ S C +
Sbjct: 2041 EDECFSCGDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQCDLCGKEAASFCEM 2100
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W C C
Sbjct: 1615 HFTPRKGCKNHEHINVSWCFVCSEGGSLLCCEACPAAFHRECLNI-EMPQGSWFCNDC 1671
Score = 42.0 bits (97), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 38/92 (41%), Gaps = 9/92 (9%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C +C GDL+ C+G C FH C+ + P GK+ C C+ + C V
Sbjct: 1467 ENVCQVCERPGDLLVCEGHCYGAFHLQCVGLSVAPKGKFFCQECN------TGDHMCFVC 1520
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQ 901
+ I LC + YH C A Q
Sbjct: 1521 KKSGDVKRC--IIPLCGKFYHMDCILAFSATQ 1550
>gi|195055097|ref|XP_001994457.1| GH17187 [Drosophila grimshawi]
gi|193892220|gb|EDV91086.1| GH17187 [Drosophila grimshawi]
Length = 1128
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 889 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 939
>gi|443719422|gb|ELU09603.1| hypothetical protein CAPTEDRAFT_166178 [Capitella teleta]
Length = 1711
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 57/131 (43%), Gaps = 14/131 (10%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRINES 864
E + D C +C GG++I CD CP +H CL ++++ P GKW C +C E
Sbjct: 149 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEQAPEGKWSCPHCVRSSVNIAFEE 208
Query: 865 TCHVNDQDDSALSTLQICS---------LCEEKYHQSCSQTDGAVQYEPSSLSFCGK-KC 914
V DD + ++C C YH C + V P + C + C
Sbjct: 209 NGPVEQDDDEHMEFCRVCKDGGELLCCDTCTSAYHVFCLAPN--VANVPDGIWHCPRCSC 266
Query: 915 QEIFERLEKLL 925
Q + R++K+L
Sbjct: 267 QPLKARVQKIL 277
>gi|442620095|ref|NP_001262765.1| bonus, isoform C [Drosophila melanogaster]
gi|440217666|gb|AGB96145.1| bonus, isoform C [Drosophila melanogaster]
Length = 1207
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 893 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 943
>gi|390478260|ref|XP_003735458.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator, partial
[Callithrix jacchus]
Length = 330
Score = 64.3 bits (155), Expect = 5e-07, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 77 NEDECAVCRDGGELICCDGCPRAFHLACLTPPLREIPSGTWRCSSC 122
>gi|442620101|ref|NP_001262768.1| bonus, isoform F [Drosophila melanogaster]
gi|440217669|gb|AGB96148.1| bonus, isoform F [Drosophila melanogaster]
Length = 1122
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
N +DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 879 NTQEDDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 933
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 1852 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 1911
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1426 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1482
Score = 44.7 bits (104), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1277 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1330
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1331 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1369
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 2121 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDVCGKEAASFCEM 2180
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1695 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 44.3 bits (103), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1546 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1599
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1600 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1638
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ 856
D + + C +C DGG+L+CCD CP+ FH CL+ +K P+GKW+C CS +
Sbjct: 472 DEHMEFCRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPRCSVE 522
Score = 44.7 bits (104), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 7/60 (11%)
Query: 801 FVNFDGEDP-----NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
F N D E+ + D C +C GG++I CD CP +H C ++++ P GKW C +C
Sbjct: 402 FPNSDAENDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWSCPHC 461
>gi|195113037|ref|XP_002001076.1| GI10584 [Drosophila mojavensis]
gi|193917670|gb|EDW16537.1| GI10584 [Drosophila mojavensis]
Length = 1121
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 883 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 933
>gi|442620099|ref|NP_001262767.1| bonus, isoform E [Drosophila melanogaster]
gi|440217668|gb|AGB96147.1| bonus, isoform E [Drosophila melanogaster]
Length = 1125
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
N +DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 881 NTQEDDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 935
>gi|195391037|ref|XP_002054172.1| GJ22942 [Drosophila virilis]
gi|194152258|gb|EDW67692.1| GJ22942 [Drosophila virilis]
Length = 1119
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 882 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 932
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C C DGG+L+CCDGCP FH CL + + PSG W C C
Sbjct: 432 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 477
>gi|442620097|ref|NP_001262766.1| bonus, isoform D [Drosophila melanogaster]
gi|440217667|gb|AGB96146.1| bonus, isoform D [Drosophila melanogaster]
Length = 1119
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
N +DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 875 NTQEDDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 929
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 100/214 (46%), Gaps = 21/214 (9%)
Query: 909 FCGKKCQEIFERLE----KLLGVKH--DLEGGYTWSLVHRFDVSTDLSLSDVCQKVECNA 962
C ++CQ + RLE K +G+ + W L+ D S D+ L
Sbjct: 6 LCCQECQSLRARLEEGLEKCVGITFLRRIRSNICWRLLSGMDASRDVKLY---------- 55
Query: 963 RLAVALSVMDECFLPLPDHRSGINLIHNILYNFGSNFKRLNYKGFFTAILERDDEIISAA 1022
+ + + + F+ D S I I +++ N + + +++G + A+L ++SAA
Sbjct: 56 -MPQVIDIFKDAFMDSTDEHSDI--ISDMV-NGKNGDQEKDFRGMYCALLTASTHVVSAA 111
Query: 1023 SIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCSLNVEKLIIPAISELRETWTS 1082
+++ +++AE+ I TR R++G LL IE+ L + NV L P E+ + W+
Sbjct: 112 ILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSE 171
Query: 1083 VFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
GF L K+ M + L++F + ++QK +
Sbjct: 172 KLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 205
>gi|6631000|gb|AAF19646.1|AF210315_1 bonus [Drosophila melanogaster]
Length = 1133
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 893 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 943
>gi|24648457|ref|NP_524724.2| bonus, isoform A [Drosophila melanogaster]
gi|23176008|gb|AAF55786.2| bonus, isoform A [Drosophila melanogaster]
Length = 1133
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 893 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 943
>gi|195498245|ref|XP_002096441.1| GE25673 [Drosophila yakuba]
gi|194182542|gb|EDW96153.1| GE25673 [Drosophila yakuba]
Length = 1133
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 893 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 943
>gi|194744148|ref|XP_001954557.1| GF16683 [Drosophila ananassae]
gi|190627594|gb|EDV43118.1| GF16683 [Drosophila ananassae]
Length = 1183
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 892 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 942
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 64.3 bits (155), Expect = 6e-07, Method: Composition-based stats.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C CGDGG+LICCDGCP FH CL + PSG W C C
Sbjct: 246 NEDECAACGDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSC 291
>gi|195569415|ref|XP_002102705.1| GD19361 [Drosophila simulans]
gi|194198632|gb|EDX12208.1| GD19361 [Drosophila simulans]
Length = 1140
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 891 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 941
>gi|195353945|ref|XP_002043462.1| GM23122 [Drosophila sechellia]
gi|194127603|gb|EDW49646.1| GM23122 [Drosophila sechellia]
Length = 1131
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 891 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 941
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 64.3 bits (155), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG ++ C GCP +H +CL++ K P+G+W C + C CG+ S C +
Sbjct: 2000 EDECFSCGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQCDVCGKEAASFCEM 2059
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W+C C
Sbjct: 1575 HFTPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDC 1631
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL + PSG W C C
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSC 344
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG ++ C GCP +H +CL++ K P+G+W C + C CG+ S C +
Sbjct: 1822 EDECFSCGDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQCDVCGKEAASFCEM 1881
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL++ + P G W C C
Sbjct: 1396 HFTPRKGCKNHEHINVSWCFVCSEGGSLLCCEACPAAFHRECLNM-EMPQGSWFCNDC 1452
Score = 42.4 bits (98), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 37/92 (40%), Gaps = 8/92 (8%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
+ C +C GDL+ C+G C FH C+ + P GK+ C C+ C V
Sbjct: 1247 ESVCQVCERTGDLLVCEGHCYGAFHLQCIGLSAPPKGKFLCRECN------TGVHACFVC 1300
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQ 901
+ + I LC + YH C A Q
Sbjct: 1301 KKSGDGVKRC-IIPLCGKFYHTDCIMAYSATQ 1331
>gi|281362149|ref|NP_001163665.1| bonus, isoform B [Drosophila melanogaster]
gi|159884207|gb|ABX00782.1| RE48191p [Drosophila melanogaster]
gi|272477073|gb|ACZ94961.1| bonus, isoform B [Drosophila melanogaster]
Length = 1125
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 885 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 935
>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Crassostrea gigas]
Length = 1079
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 39/65 (60%), Gaps = 2/65 (3%)
Query: 804 FDGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRI 861
DG+ +DD C CG+GG+L+ CD GCP +H +CL + K P GKW C + C CG+
Sbjct: 881 IDGKKEHDDECFRCGEGGELVMCDRGGCPKVYHLHCLKLSKPPHGKWDCPWHHCDECGKP 940
Query: 862 NESTC 866
+ C
Sbjct: 941 AITMC 945
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
N C +C GG LICCD CP++FH CL+I P G W+C C+
Sbjct: 474 NVSWCFVCNLGGSLICCDSCPASFHAECLNINP-PEGNWYCNDCA 517
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D +D + + C IC DGG+L+CCD CPS +H CL + P G W C CSC
Sbjct: 452 DDDDEHQEFCRICKDGGELLCCDMCPSAYHTFCLTPPLDDIPDGDWRCPRCSC 504
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 46/100 (46%), Gaps = 19/100 (19%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRIN 862
DGE + D C +C GG++I CD CP +H CL +++ P GKW C +C+ G +
Sbjct: 393 DGEHEHQDFCEVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWSC--PTCEADGGVA 450
Query: 863 ESTCHVNDQDDSALSTLQI---------CSLCEEKYHQSC 893
E D DD +I C +C YH C
Sbjct: 451 E------DDDDEHQEFCRICKDGGELLCCDMCPSAYHTFC 484
>gi|340371689|ref|XP_003384377.1| PREDICTED: hypothetical protein LOC100637285 [Amphimedon
queenslandica]
Length = 400
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQ 856
D C IC DGG+L+CCD CP +H CL +KK P G WHC C+ +
Sbjct: 318 DYCHICQDGGELLCCDRCPKAYHLQCLYPPLKKIPDGTWHCPRCTVK 364
>gi|260792165|ref|XP_002591087.1| hypothetical protein BRAFLDRAFT_69357 [Branchiostoma floridae]
gi|229276287|gb|EEN47098.1| hypothetical protein BRAFLDRAFT_69357 [Branchiostoma floridae]
Length = 1005
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+ EDPN+D C +C +GGDL+CCD CP +H C +I PSG + C C
Sbjct: 784 NPEDPNEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLC 834
>gi|194899728|ref|XP_001979410.1| GG24000 [Drosophila erecta]
gi|190651113|gb|EDV48368.1| GG24000 [Drosophila erecta]
Length = 1131
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 891 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 941
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 39/56 (69%), Gaps = 5/56 (8%)
Query: 805 DGEDPND---DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D E+ +D + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 441 DAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRCTC 496
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 802 VNFDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
V DG E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 361 VAVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 415
>gi|195449701|ref|XP_002072186.1| GK22457 [Drosophila willistoni]
gi|194168271|gb|EDW83172.1| GK22457 [Drosophila willistoni]
Length = 1120
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 4/89 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYCSCQFCGRIN 862
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C +
Sbjct: 885 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLCVNLKELTKS 944
Query: 863 ESTCHVNDQDDSALSTLQICSLCEEKYHQ 891
ES+ V+ + S+L + +C E Y Q
Sbjct: 945 ESSDKVSQGELSSLELRILQRICLELYCQ 973
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 39/56 (69%), Gaps = 5/56 (8%)
Query: 805 DGEDPND---DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D E+ +D + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 434 DAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRCTC 489
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 802 VNFDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
V DG E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 354 VAVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 408
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 39/56 (69%), Gaps = 5/56 (8%)
Query: 805 DGEDPND---DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D E+ +D + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 441 DAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRCTC 496
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 802 VNFDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
V DG E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 361 VAVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 415
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 63.9 bits (154), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 39/56 (69%), Gaps = 5/56 (8%)
Query: 805 DGEDPND---DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D E+ +D + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 435 DAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRCTC 490
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 361 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 420
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ ++ ++DD + ++C C YH C
Sbjct: 421 DNSEGEETMEDAVGDAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHC 470
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 63.9 bits (154), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS +H CL+ + P G+W C CSC
Sbjct: 467 DDEHQEFCRVCKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGEWRCPRCSC 517
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ 856
+GE + D C +C GG++I CD CP +H CLD ++ P GKW C C +
Sbjct: 407 EGEHEHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCEAE 460
>gi|195158907|ref|XP_002020325.1| GL13567 [Drosophila persimilis]
gi|194117094|gb|EDW39137.1| GL13567 [Drosophila persimilis]
Length = 1053
Score = 63.9 bits (154), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 882 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 932
>gi|111226798|ref|XP_642746.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|90970774|gb|EAL68857.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 914
Score = 63.9 bits (154), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS 854
NDD C CGDGGDL+CCD C ++FH CL+ + + PSG W C C+
Sbjct: 60 NDDYCSGCGDGGDLLCCDNCQASFHLICLNPPLNEIPSGDWFCDSCT 106
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 63.9 bits (154), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 510 DDEHQEFCRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPRCSC 560
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 43/100 (43%), Gaps = 20/100 (20%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRIN 862
+GE + D C +C GG++I CD CP +H CLD ++ P GKW C C
Sbjct: 450 EGEHEHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC--------- 500
Query: 863 ESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
E+ ++ DD +IC C YH C
Sbjct: 501 EAEGPADEDDDEHQEFCRICKDGGELLCCDNCPSAYHTFC 540
>gi|302793688|ref|XP_002978609.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
gi|300153958|gb|EFJ20595.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
Length = 1349
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D N D C +CG G+LICCDGCP+ +H C+ + K P G W+C C+ Q
Sbjct: 469 DWNSDDCVLCGMDGNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPECTLQ 519
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C C DGG+L+CCDGCP FH CL + + PSG W C C
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 430
>gi|198449950|ref|XP_001357792.2| GA18737 [Drosophila pseudoobscura pseudoobscura]
gi|198130826|gb|EAL26927.2| GA18737 [Drosophila pseudoobscura pseudoobscura]
Length = 1128
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYC 853
+DPN+D C +C DGG+L+CCD CP FHQNC I P S W C+ C
Sbjct: 882 DDPNEDWCAVCLDGGELMCCDKCPKVFHQNCHIPAISSLPDESESWQCLLC 932
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C C DGG+L+CCDGCP FH CL + + PSG W C C
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 430
>gi|348523329|ref|XP_003449176.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1174
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 4/55 (7%)
Query: 803 NFDG--EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +G +DPN+D C +C +GGDL+CCD CP FH C IK P G + C +C
Sbjct: 925 NLEGKEDDPNEDWCAVCINGGDLLCCDRCPKVFHMKCHVPTIKIVPKGDFLCTFC 979
>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Canis lupus familiaris]
Length = 1364
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ + S CH
Sbjct: 1237 SEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTSFCH 1296
Query: 868 V-------NDQDDSALSTLQ 880
QD +A S+ Q
Sbjct: 1297 FCPNSFCKEHQDGTAFSSTQ 1316
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 829 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-DMPDGSWFCNDC 871
>gi|302774224|ref|XP_002970529.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
gi|300162045|gb|EFJ28659.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
Length = 1340
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D N D C +CG G+LICCDGCP+ +H C+ + K P G W+C C+ Q
Sbjct: 469 DWNSDDCVLCGMDGNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPECTLQ 519
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 13/64 (20%)
Query: 805 DGEDPNDDT-----------CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCV 851
+GED N D C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C
Sbjct: 471 EGEDDNGDVGEMEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICP 530
Query: 852 YCSC 855
C+C
Sbjct: 531 RCTC 534
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P G W C +C
Sbjct: 408 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC 456
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
ED + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 435 EDHHMEFCRVCKDGGELLCCDVCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 485
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D+ K P GKW C +C
Sbjct: 357 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC 405
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 15/102 (14%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS----CQFCGRINE 863
++D C +C G+L+CCDGCP +H +C+ + + P G W C C+ C CG+
Sbjct: 717 HEDYCIVCQQSGELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCPACARNDPCTICGKPLP 776
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPS 905
+S ++C C H +C + GA + P+
Sbjct: 777 W---------KPMSQRRVCVQCGVPMHPACFKATGAPKEFPA 809
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 63.5 bits (153), Expect = 1e-06, Method: Composition-based stats.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 7 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 52
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 9/90 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1237 EGKKESEDDCFRCGDGGQLVLCDRKSCTKAYHLSCLGLVKRPFGKWECPWHHCDVCGKPS 1296
Query: 863 ESTCHV-------NDQDDSALSTLQICSLC 885
S CH QD + L+++ LC
Sbjct: 1297 VSFCHFCPNSFCKEHQDRTVLNSMLNGQLC 1326
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDGEDP----NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G+ N C +C GG L+CC+ CP+ FH +CL+I + P G W+C C
Sbjct: 820 HFTAMKGKSHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWYCNDC 876
Score = 41.6 bits (96), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C GDL+ C+G C FH +CL + P+GK+ C C+ TC V +
Sbjct: 673 VCQLCEKTGDLLLCEGLCYRAFHVSCLGLSGRPAGKFICSECTSGV------HTCFVCKE 726
Query: 872 DDSALSTLQICSLCEEKYHQSC 893
+ + + S C + YH++C
Sbjct: 727 RKADVKRC-VVSHCGKFYHEAC 747
>gi|5834582|emb|CAB55313.1| rfg7 protein [Homo sapiens]
Length = 1052
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N +DPN+D C +C +GGDL+CC+ CP H C + FPSG W C +C
Sbjct: 804 NNKDDDPNEDWCAVCQNGGDLLCCEKCPKVSHLTCHVPTLLSFPSGDWICTFC 856
>gi|222637620|gb|EEE67752.1| hypothetical protein OsJ_25457 [Oryza sativa Japonica Group]
Length = 1646
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D ED N D C +CG G+L+CCDGCP+ FH C+ + + P G W C C Q
Sbjct: 422 DTEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPECLIQ 475
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 389 DDHMEFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 438
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 315 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 363
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 40/59 (67%), Gaps = 5/59 (8%)
Query: 802 VNFDGEDPND---DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
V D E+ +D + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 437 VGGDAEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 47/110 (42%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDAEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 462
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 342 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 387
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL + K P GKW C + C CGR + S CH
Sbjct: 1173 SEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGRPSTSFCH 1232
Query: 868 V-------NDQDDSALSTLQ 880
QD +AL + +
Sbjct: 1233 FCPNSFCKEHQDGAALGSTR 1252
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL I + P G W C C
Sbjct: 765 NVSWCFVCSKGGSLLCCESCPAAFHPDCLSI-EMPDGSWFCNEC 807
>gi|113195649|ref|NP_001037823.1| zinc finger protein LOC723799 [Ciona intestinalis]
gi|92081516|dbj|BAE93305.1| zinc finger protein [Ciona intestinalis]
Length = 1298
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSCQFCGRINEST 865
N +TC CG+GGDL+CCD CP+ FH C LD K P G+W C C ++ +
Sbjct: 53 NHETCDACGEGGDLLCCDFCPAAFHLQCCNPPLDEDKVPHGEWACHRCVVTNNFKVFDE- 111
Query: 866 CHVNDQDDSALSTL 879
CH + +AL+ +
Sbjct: 112 CHRHTLTKTALTII 125
>gi|50510226|dbj|BAD31424.1| PHD finger transcription factor-like protein [Oryza sativa Japonica
Group]
Length = 1696
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D ED N D C +CG G+L+CCDGCP+ FH C+ + + P G W C C Q
Sbjct: 437 DTEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPECLIQ 490
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 428 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 473
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 350 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 398
>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Cavia porcellus]
Length = 1367
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL ++K P GKW C + C CG+ + + CH
Sbjct: 1239 SEDECFRCGDGGQLVLCDRKLCTKAYHLSCLGLRKRPFGKWECPWHHCDVCGKPSAAFCH 1298
Query: 868 V-------NDQDDSALSTLQ 880
+ QD + L T Q
Sbjct: 1299 LCPNSLCKEHQDGAVLRTTQ 1318
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 831 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSWFCNDC 873
>gi|125559491|gb|EAZ05027.1| hypothetical protein OsI_27209 [Oryza sativa Indica Group]
Length = 1696
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D ED N D C +CG G+L+CCDGCP+ FH C+ + + P G W C C Q
Sbjct: 437 DTEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPECLIQ 490
>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cavia porcellus]
Length = 1366
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL ++K P GKW C + C CG+ + + CH
Sbjct: 1238 SEDECFRCGDGGQLVLCDRKLCTKAYHLSCLGLRKRPFGKWECPWHHCDVCGKPSAAFCH 1297
Query: 868 V-------NDQDDSALSTLQ 880
+ QD + L T Q
Sbjct: 1298 LCPNSLCKEHQDGAVLRTTQ 1317
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSWFCNDC 872
>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
scrofa]
Length = 1338
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 44/85 (51%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H CL + K P GKW C + C CG+ +
Sbjct: 1206 EGKKRSEDECFRCGDGGQLVLCDRKSCTKAYHLPCLGLGKRPFGKWECPWHHCDVCGKPS 1265
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH + QD +A + Q
Sbjct: 1266 TSFCHFCPNSFCKDHQDGTAFGSTQ 1290
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 803 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 845
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 469 DDHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 518
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL +++K P GKW C +C
Sbjct: 383 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHC 431
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 143 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 188
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 65 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 113
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
N+D C C DGG+L+CCDGCP FH CL + + PSG W C C
Sbjct: 303 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 348
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 893 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 952
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 467 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 523
Score = 43.5 bits (101), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 318 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 371
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 372 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 410
>gi|357116142|ref|XP_003559843.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
distachyon]
Length = 1679
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 34/54 (62%), Gaps = 2/54 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D +D N D C +CG G+L+CCDGCP+ FH C+ + + P G+W+C C Q
Sbjct: 416 DTDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGEWYCPECLMQ 469
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 415 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 460
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 340 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 385
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 419 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 464
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 341 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 389
>gi|357472039|ref|XP_003606304.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507359|gb|AES88501.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 856
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/58 (46%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 805 DGE-DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCSCQFCG 859
DG+ D N D C +CG G LICCDGCPS +H C+ + K P G W+C C G
Sbjct: 409 DGDVDRNGDECRLCGMDGTLICCDGCPSAYHSRCIGVMKMFIPEGPWYCPECKIDMAG 466
>gi|115473879|ref|NP_001060538.1| Os07g0661500 [Oryza sativa Japonica Group]
gi|113612074|dbj|BAF22452.1| Os07g0661500 [Oryza sativa Japonica Group]
Length = 1752
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
D ED N D C +CG G+L+CCDGCP+ FH C+ + + P G W C C Q
Sbjct: 437 DTEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPECLIQ 490
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 427 DDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCTC 477
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 21/108 (19%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGRI 861
E + D C +C GG++I CD CP +H CL D++K P G W C +C Q+ R
Sbjct: 350 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE 409
Query: 862 NESTCHVN-------DQDDSALSTLQICS---------LCEEKYHQSC 893
+ S + ++DD + ++C C YH C
Sbjct: 410 DASEGEEDNEAGGEAEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHC 457
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 10/65 (15%)
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKW 848
Q SK+KG F+ C +C GG+L+CCD CP T+H CL+ +K+ P GKW
Sbjct: 101 QFSSKKKGNDGYFFE--------CVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKW 152
Query: 849 HCVYC 853
HC C
Sbjct: 153 HCPTC 157
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+++ C +C DGGDL+CCD CPS +H+ CL +K P G W C C
Sbjct: 37 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 82
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 420 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 465
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 341 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 389
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 437 DDHMEFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 486
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 366 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 411
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 421 CRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 464
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 341 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 389
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 462
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 339 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 387
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 453 DDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCTC 503
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 21/108 (19%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGRI 861
E + D C +C GG++I CD CP +H CL D++K P G W C +C Q+ R
Sbjct: 376 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEARE 435
Query: 862 NESTCHVN-------DQDDSALSTLQICS---------LCEEKYHQSC 893
+ S + ++DD + ++C C YH C
Sbjct: 436 DVSDAEEDNGETGEMEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHC 483
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 462
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 342 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 387
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 391 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 436
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 313 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 361
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 462
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 342 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 387
>gi|356510796|ref|XP_003524120.1| PREDICTED: uncharacterized protein LOC100793933 [Glycine max]
Length = 2325
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
C IC GG+L+CCD CP T+H CLD +K+ P+GKW C C
Sbjct: 73 CVICDVGGNLLCCDSCPRTYHLQCLDPPLKRIPNGKWQCPSC 114
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 18/110 (16%)
Query: 809 PNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
P D C IC G+L+ CDGCP FH C+ IK +P +W C C Q CG V
Sbjct: 1174 PWDVECFICKLYGELLGCDGCPKAFHLACIGIKSWPQEEWFCDECDMQTCG--------V 1225
Query: 869 NDQDDSALSTLQICSL------CEEKYHQSCSQTDGAVQYEPSSLSFCGK 912
++ L++ IC C++ +H C + + P S FC K
Sbjct: 1226 CGRNKIKLNSHVICGSEDGSKGCDKVFHLKCVKLEKV----PESDWFCSK 1271
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 31/44 (70%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
++ C +C +GG ++ CDGC FH +CL+I++ P GK +C +CS
Sbjct: 1077 EEECYVCREGGQVVSCDGCQRVFHLSCLNIRRMPRGKLYCKHCS 1120
Score = 47.4 bits (111), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 29/68 (42%), Gaps = 21/68 (30%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNC---------------------LDIKKFPSG 846
D N C +C DGG+L+CCD CP FH L ++ P
Sbjct: 130 DHNRWYCNLCKDGGELLCCDRCPRAFHLKWYVGCFPSAVVAHQASRYASLGLQKEEIPES 189
Query: 847 KWHCVYCS 854
+W+C +C+
Sbjct: 190 EWYCKFCA 197
>gi|255577031|ref|XP_002529400.1| hypothetical protein RCOM_0623600 [Ricinus communis]
gi|223531148|gb|EEF32996.1| hypothetical protein RCOM_0623600 [Ricinus communis]
Length = 186
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%)
Query: 698 VAYDGKWTLLAWMIDTGTVPLNEKVQYWNQRKTRVMLQGRIARDGIRCDCCSEIFTISKF 757
VA T+L W+ID V EKV +++ + + QG I R GI+C+CC +++T+S F
Sbjct: 60 VANPSALTILTWLIDNNAVFPREKVYNCSRKGSPPVAQGCIHRGGIKCNCCGKVYTLSGF 119
Query: 758 DTHSKSKLCHPFQNLYFESGSSLLQC 783
+ H K ++ E G SLL C
Sbjct: 120 EFHVSGKCSRSAAKIFLEDGRSLLDC 145
>gi|194378472|dbj|BAG63401.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
+DPN+D C +C +GGDL+ C+ CP FH C + FPSG W C +C
Sbjct: 389 DDPNEDWCAVCQNGGDLLRCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 437
>gi|395538667|ref|XP_003771296.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Sarcophilus harrisii]
Length = 2011
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 518 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 568
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 10/65 (15%)
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKW 848
Q SK+KG F+ C +C GG+L+CCD CP T+H CL+ +K+ P GKW
Sbjct: 108 QFSSKKKGNDGYFFE--------CVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKW 159
Query: 849 HCVYC 853
HC C
Sbjct: 160 HCPTC 164
>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
gigas]
Length = 1488
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS 854
ND+ C +C GG L+CCD CP +H +C +KK P GKW C C+
Sbjct: 1226 NDNVCAVCSTGGMLVCCDTCPLVYHLDCAVPPLKKVPRGKWQCQLCT 1272
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 401 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 446
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 323 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 371
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 403 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 448
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 328 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 373
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 499
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 376 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 424
>gi|428182510|gb|EKX51370.1| hypothetical protein GUITHDRAFT_102641 [Guillardia theta CCMP2712]
Length = 1947
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
DD C IC +GG+LICCD CP T H CL + K P G ++C C
Sbjct: 1759 DDYCLICFEGGNLICCDSCPRTVHAACLGLSKIPKGDFYCFDC 1801
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 555 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLAEIPNGEWLCPRCTC 600
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 480 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 525
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 427 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 477
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 348 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 407
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 408 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 457
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 419 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 469
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 340 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDSDMEKAPEGKWSCPHCEKEGIQWEAKE 399
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 400 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 449
>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 3
[Callithrix jacchus]
Length = 1389
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1257 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1316
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1317 TSFCHLCPNSFCKEHQDGTAFS 1338
Score = 46.2 bits (108), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 875 GGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 907
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 426 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 476
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 347 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 395
>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Callithrix jacchus]
Length = 1400
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1268 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1327
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1328 TSFCHLCPNSFCKEHQDGTAFS 1349
Score = 46.2 bits (108), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 875 GGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 907
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 10/65 (15%)
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKW 848
Q SK+KG F+ C +C GG+L+CCD CP T+H CL+ +K+ P GKW
Sbjct: 101 QFSSKKKGNDGYFFE--------CVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKW 152
Query: 849 HCVYC 853
HC C
Sbjct: 153 HCPTC 157
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 499
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 376 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 424
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 418
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 419 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 468
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 491 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 550
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 65 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 121
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 432 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 482
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 47/107 (43%), Gaps = 23/107 (21%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR---- 860
+ D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 356 HQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNS 415
Query: 861 -----INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 416 EGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 462
>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cricetulus griseus]
Length = 1365
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFRSTQ 1317
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSWFCNDC 872
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 419 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 469
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 340 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 388
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|345570767|gb|EGX53588.1| hypothetical protein AOL_s00006g454 [Arthrobotrys oligospora ATCC
24927]
Length = 1515
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 14/94 (14%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIK-KFPSG---KWHCVYCSCQFCGRINEST 865
N D C +C DGGD++CC CP ++H +CLD + K+ S ++HC C C
Sbjct: 1129 NQDYCQVCTDGGDIVCCSSCPRSYHYDCLDEEHKYKSNGKMQYHCSQHECHDC------- 1181
Query: 866 CHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGA 899
+ A + L C CE Y + C D A
Sbjct: 1182 ---EQKTSDAGNMLYRCRFCERAYCEDCLDFDNA 1212
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 356 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 406
Score = 48.9 bits (115), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 820 GGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 290 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 325
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 6/54 (11%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQF----CGRI 861
C C DGGDL+ CD CP ++H NCL+ ++K P G+W C C+C C RI
Sbjct: 426 CSRCKDGGDLLICDTCPHSYHLNCLNPPVEKVPEGEWSCPRCTCPMLKGKCQRI 479
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CDGCP +H CL+ + + P G W C C
Sbjct: 340 ETDHQDYCEVCKQGGEIILCDGCPRAYHLVCLEPPLDQPPEGSWPCPTC 388
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 550
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 430 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 475
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 380 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 430
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 301 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 360
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 361 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 410
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 442 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 492
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 363 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 422
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 423 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 472
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 804 FDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DG E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 362 MDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 449 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 499
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 804 FDG-EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DG E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 MDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 418
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 810 CRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 853
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 733 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 778
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 443 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 493
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 364 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 423
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 424 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 473
>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
Length = 1366
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1234 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1293
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1294 TSFCHLCPNSFCKEHQDGTAFRSTQ 1318
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 831 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 873
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 299 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 349
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 223 HQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 268
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 439 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 489
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 360 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 408
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 435 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 485
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 356 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 415
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 416 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 465
>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
syndrome candidate 1 protein homolog; Short=WHSC1
Length = 1365
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFRSTQ 1317
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|115530816|emb|CAL49405.1| novel protein containing a PHD-finger domain [Xenopus (Silurana)
tropicalis]
Length = 587
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ C GCP +H CL + + P+GKW C + C C + S C
Sbjct: 80 HEDECFSCGDGGQLVSCKKPGCPKVYHAECLKLTRRPAGKWECPWHQCDICHKEAASLCE 139
Query: 868 V 868
+
Sbjct: 140 M 140
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
boliviensis boliviensis]
Length = 1368
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|260817522|ref|XP_002603635.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
gi|229288956|gb|EEN59646.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
Length = 190
Score = 62.8 bits (151), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/49 (48%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
EDPN+D C +C +GGDL+CCD CP +H C +I PSG + C C
Sbjct: 10 EDPNEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLC 58
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 444 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 489
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 414
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 418
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 419 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 468
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 418
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 419 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 468
>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus griseus]
Length = 1344
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1212 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1271
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1272 TSFCHLCPNSFCKEHQDGTAFRSTQ 1296
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 809 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSWFCNDC 851
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 466 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 516
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 387 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 435
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
[Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
[Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
[Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Gorilla gorilla gorilla]
Length = 1365
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 520 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 565
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 442 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 490
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 419 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 469
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 340 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 388
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
AltName: Full=Wolf-Hirschhorn syndrome candidate 1
protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Callithrix jacchus]
Length = 1365
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 1293 TSFCHLCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 442 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 492
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 363 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 422
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 423 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 472
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 495
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus norvegicus]
Length = 1346
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1214 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1273
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1274 TSFCHLCPNSFCKEHQDGTAFRSTQ 1298
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL I + P G W C C
Sbjct: 811 NVSWCFVCSKGGSLLCCEACPAAFHPDCLSI-EMPDGSWFCNDC 853
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 404 CRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 447
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 324 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 372
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C + H +
Sbjct: 429 CRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTCPPLKGKVQRILHWKWK 488
Query: 872 DDSALSTL 879
+ S L +L
Sbjct: 489 ESSPLLSL 496
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 351 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHC 399
>gi|242084736|ref|XP_002442793.1| hypothetical protein SORBIDRAFT_08g002920 [Sorghum bicolor]
gi|241943486|gb|EES16631.1| hypothetical protein SORBIDRAFT_08g002920 [Sorghum bicolor]
Length = 1897
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 61/148 (41%), Gaps = 33/148 (22%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC----------- 853
+D N D C ICG G L+CCDGCP +H C+ K P G W C C
Sbjct: 546 QDGNSDDCRICGMDGTLVCCDGCPWAYHSRCIGQNKAFLPQGDWFCPECVVNKLGPTSSR 605
Query: 854 -----------SCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQY 902
CGR+ TC+ ++AL E Y + +Q D A +
Sbjct: 606 IERGARGAQMFGIDMCGRLFLGTCNYLLVIEAALDA--------ESYARYYNQYDVAKVF 657
Query: 903 EPSSLSFCGKK-CQEIFERLEKLLGVKH 929
+ ++S C++I + + LLG+ H
Sbjct: 658 QRLAISDAYVDICRQIKDYWKHLLGINH 685
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 496 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 546
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 417 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 465
>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
Length = 1346
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1214 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1273
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1274 TSFCHLCPNSFCKEHQDGTAFRSTQ 1298
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 811 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 853
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 550
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 427 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 475
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 487 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 532
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 409 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 457
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 48/90 (53%), Gaps = 12/90 (13%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1217 EGKKESEDDCFRCGDGGQLVLCDRKSCTKAYHLSCLGLVKRPFGKWECPWHHCDVCGKPS 1276
Query: 863 ESTCHV-------NDQDDSAL-STL--QIC 882
S CH QD + L STL Q+C
Sbjct: 1277 VSFCHFCPNSFCKEHQDGTVLNSTLNGQLC 1306
Score = 55.1 bits (131), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDGEDP----NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G+ N C +C GG L+CC+ CP+ FH +CL+I + P G W+C C
Sbjct: 800 HFTAMKGKSHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWYCNDC 856
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVNDQ 871
C +C GDL+ C+G C FH +CL + P+GK+ C C+ TC V +
Sbjct: 653 VCQLCEKTGDLLLCEGLCYRAFHVSCLGLSGRPAGKFVCSECTSGV------HTCFVCKE 706
Query: 872 DDSALSTLQICSLCEEKYHQSC 893
+ L + S C + YH++C
Sbjct: 707 RKADLKRC-VVSHCGKFYHEAC 727
>gi|196010575|ref|XP_002115152.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
gi|190582535|gb|EDV22608.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
Length = 709
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 41/73 (56%), Gaps = 7/73 (9%)
Query: 792 DESK---RKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFP 844
D+SK RK V G N+DTC C +GGDL+CC+ CPS FH +C LD P
Sbjct: 22 DDSKKLNRKVVREVRRSGRSFNNDTCDSCNEGGDLLCCERCPSAFHLHCCDPPLDEDGIP 81
Query: 845 SGKWHCVYCSCQF 857
+G+W C C ++
Sbjct: 82 AGEWICKKCKAEY 94
>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
Length = 1037
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 847 EGKKESEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWVCPWHHCDVCGKPS 906
Query: 863 ESTCHV 868
S CH+
Sbjct: 907 ASFCHL 912
Score = 42.7 bits (99), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 822 DLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
DL+CC+ CP+ FH +CL I P G W C C
Sbjct: 456 DLLCCESCPAAFHPDCLSI-DMPDGSWFCNDC 486
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
NDD C C DGGDL+CCD C +FH CL+ +++ P G W+C C
Sbjct: 65 NDDFCSSCKDGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSC 110
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 29/41 (70%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
C +C DGGDL+ CD CP +FHQ+C+++ + P G W C C+
Sbjct: 460 CEVCKDGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICT 500
Score = 47.8 bits (112), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+D C +C GGD+I CD C FH CL+ + + P G W C C
Sbjct: 526 HDQFCKVCKRGGDVILCDFCSCVFHLRCLNPPLGEVPEGDWKCPRC 571
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 667 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 712
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 592 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 637
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG L+ C GCP +H +CL + + P+GKW C + C C + S C +
Sbjct: 1548 EDECFSCGDGGQLVSCKRPGCPKVYHADCLSLTRRPAGKWECPWHQCDVCRKEAASFCEM 1607
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I P G W+C C
Sbjct: 1123 HFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI-DMPEGSWYCNDC 1179
Score = 48.1 bits (113), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C IC G+L+ C+ C FH CL + + P GK+ C CS TC V
Sbjct: 974 ENVCQICEKPGELLLCEAQCCGAFHLECLGLSEMPKGKFICTECS------TGVHTCFVC 1027
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1028 KNSGENVKRC-LLPLCGKYYHEECIQ-----KYPPTVIQNKGFRC 1066
>gi|170058471|ref|XP_001864936.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877568|gb|EDS40951.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 830
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 44/82 (53%), Gaps = 8/82 (9%)
Query: 780 LLQCILDSWNKQDESK---RKGFH-FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQ 835
L+Q ++ + SK +K H + G N+DTC CG+GGDLICCD CPS+FH
Sbjct: 10 LIQALIKPPESGEASKPPGKKPHHPYYRKPGRGHNNDTCDACGEGGDLICCDRCPSSFHL 69
Query: 836 NCLDI----KKFPSGKWHCVYC 853
C D + P+G W C C
Sbjct: 70 GCHDPPLSEQDIPNGLWICHTC 91
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 421 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 468
>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
Length = 1298
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1166 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1225
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1226 TSFCHLCPNSFCKEHQDGTAFRSTQ 1250
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 763 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 805
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 427 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 474
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 407 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRCTC 452
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 330 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 378
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 440 CRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 483
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ C GCP +H CL + + P+GKW C + C C + S C
Sbjct: 1933 HEDECFSCGDGGQLVSCKKPGCPKVYHAECLKLTRRPAGKWECPWHQCDICHKEAASLCE 1992
Query: 868 V 868
+
Sbjct: 1993 M 1993
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I P G W C C
Sbjct: 1511 HFTPRRGCKNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI-DMPEGSWFCNDC 1567
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 11/106 (10%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN-ESTCHV 868
++ C +C G+L+ C+ C FH CL ++ P GK+ C CS G + TC V
Sbjct: 1358 ENVCQVCEKPGELLLCEAQCCGAFHLQCLGMEAMPQGKFVCTECS---SGTLQGYHTCFV 1414
Query: 869 NDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
+ D + + LC + YH+ C A++Y P++ G +C
Sbjct: 1415 CKESDQGVKRCML-PLCGKYYHEEC-----ALKYPPATQQNRGFRC 1454
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGD G L+ C GCP +H +CL++ K P+GKW C + C CG+ S C +
Sbjct: 698 EDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEM 757
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 270 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 326
Score = 42.7 bits (99), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 121 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 174
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 175 KQSGEDVKRCLL-PLCGKFYHEECVQ-----KYPPTVMQNKGFRC 213
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 440 CRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 483
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 441 CRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 484
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 423 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 470
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 358 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 408
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 347 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRCTC 392
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 127 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 172
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 49 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 97
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 888 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 933
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 813 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 858
>gi|242024147|ref|XP_002432491.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212517929|gb|EEB19753.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1049
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYCS 854
ED N+D C +C DGG+LICCD CP FH NC +K P + W C+ C+
Sbjct: 844 EDSNEDWCAVCMDGGELICCDNCPKVFHVNCHIPALKAMPGETETWQCLLCT 895
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 442 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 489
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 377 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 427
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 427 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 474
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 438 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 488
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 418
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 419 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 468
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 392 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRCTC 437
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 315 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 363
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 440 CRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 483
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 371 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|189236075|ref|XP_972162.2| PREDICTED: similar to Dual specificity
tyrosine-phosphorylation-regulated kinase [Tribolium
castaneum]
Length = 2981
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 34/52 (65%), Gaps = 6/52 (11%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCS 854
+DPN+D C +C DGG+L+CCD CP FHQ C L +++ + W C+ C+
Sbjct: 2780 DDPNEDWCAVCMDGGELVCCDKCPKVFHQYCHIPNLSVEE--NDTWQCLLCT 2829
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 410 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 455
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 332 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 380
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+++ C +C DGGDL+CCD CPS +H+ CL +K P G W C C
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
(predicted) [Rattus norvegicus]
Length = 1298
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1166 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1225
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 1226 TSFCHLCPNSFCKEHQDGTAFRSTQ 1250
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL I + P G W C C
Sbjct: 763 NVSWCFVCSKGGSLLCCEACPAAFHPDCLSI-EMPDGSWFCNDC 805
>gi|357476459|ref|XP_003608515.1| Chromodomain-helicase-DNA-binding protein, partial [Medicago
truncatula]
gi|355509570|gb|AES90712.1| Chromodomain-helicase-DNA-binding protein, partial [Medicago
truncatula]
Length = 1406
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
G D C IC GG+L+CCD CP T+H CLD +K+ P GKW C C
Sbjct: 65 GHDGYFYECVICDLGGNLLCCDSCPRTYHLQCLDPPLKRIPMGKWQCPSC 114
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 435 CRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 478
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 366 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 416
>gi|157136705|ref|XP_001656884.1| hypothetical protein AaeL_AAEL013616 [Aedes aegypti]
gi|108869892|gb|EAT34117.1| AAEL013616-PA [Aedes aegypti]
Length = 392
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 780 LLQCIL---DSWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQN 836
L+Q ++ DS + K+ + G N+DTC CG+GGDLICCD CPS+FH
Sbjct: 19 LIQALIKPPDSGEPKPICKKTTHSYYKKPGRGHNNDTCDACGEGGDLICCDRCPSSFHLG 78
Query: 837 C----LDIKKFPSGKWHCVYC 853
C LD P+G W C C
Sbjct: 79 CHDPPLDETDIPNGLWICHTC 99
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH QD +A S
Sbjct: 1293 TSFCHFCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH QD +A S
Sbjct: 1293 TSFCHFCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 433 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRCSC 480
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 368 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 418
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1233 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 1292
Query: 863 ESTCHV-------NDQDDSALS 877
S CH QD +A S
Sbjct: 1293 TSFCHFCPNSFCKEHQDGTAFS 1314
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 830 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 872
>gi|320167629|gb|EFW44528.1| hypothetical protein CAOG_02553 [Capsaspora owczarzaki ATCC 30864]
Length = 1716
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
D N+D+C +C G+L+CCDGCP +H C LD P G W C C
Sbjct: 424 DLNNDSCEVCSQTGELLCCDGCPRVYHATCLKLDTASLPQGDWFCPTC 471
Score = 50.8 bits (120), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 26/46 (56%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C C G+++CCD CP ++H CL + K P G W C C
Sbjct: 1434 NEDHCNACKARGNVLCCDYCPRSYHLKCLKPPMSKPPRGDWKCPIC 1479
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 417 CRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRCTC 460
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 335 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 383
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 431 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 478
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 366 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 416
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + +D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 435 DMEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCMC 489
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P G W C +C
Sbjct: 362 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC 410
>gi|255573016|ref|XP_002527438.1| DNA binding protein, putative [Ricinus communis]
gi|223533173|gb|EEF34930.1| DNA binding protein, putative [Ricinus communis]
Length = 1723
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCS 854
G D N D C +CG G L+CCDGCPS +H C+ + K P G W+C C+
Sbjct: 411 GMDGNSDECRLCGMDGTLLCCDGCPSAYHSRCIGVVKMYIPDGPWYCPECT 461
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 291 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRCSC 338
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
DGE + D C +C GG++I CD CP +H CL ++ + P GKW C +C
Sbjct: 226 DGEHEHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 276
>gi|307199491|gb|EFN80104.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Harpegnathos saltator]
Length = 1948
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D +D + + C +C DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 422 DDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 474
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 40/95 (42%), Gaps = 18/95 (18%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
+ D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 367 HQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCE-------GEGITG 419
Query: 868 VNDQDDSALSTLQICS---------LCEEKYHQSC 893
D DD + ++C C YH C
Sbjct: 420 AADDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHC 454
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 364 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 414
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 285 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 333
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+++ C +C DGGDL+CCD CPS +H+ CL +K P G W C C
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 328 DDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCTC 378
Score = 54.7 bits (130), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P G W C +C
Sbjct: 252 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC 300
>gi|224125208|ref|XP_002329920.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222871157|gb|EEF08288.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 2332
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS 854
C IC GG+L+CCD CP +H CLD +K+ P GKW C CS
Sbjct: 65 CVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCPKCS 107
>gi|359472760|ref|XP_003631193.1| PREDICTED: uncharacterized protein LOC100247555 [Vitis vinifera]
Length = 2355
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 31/45 (68%), Gaps = 3/45 (6%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ 856
C IC GG+L+CCD CP T+H CL+ +K+ P+GKW C C CQ
Sbjct: 78 CVICDLGGNLLCCDSCPRTYHLQCLNPPLKRIPNGKWQCPKC-CQ 121
>gi|307111603|gb|EFN59837.1| hypothetical protein CHLNCDRAFT_133598 [Chlorella variabilis]
Length = 1305
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 810 NDDTCGICG-DGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
++ C CG D GDL+CCDGCPS +H C + P G+W C CS
Sbjct: 1181 HEHQCAACGGDQGDLLCCDGCPSVYHPRCCGLGGVPPGRWFCPVCS 1226
>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
Length = 1363
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ + CH
Sbjct: 1236 SEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTVFCH 1295
Query: 868 V-------NDQDDSALSTLQ 880
QD +A S+ Q
Sbjct: 1296 FCPNSFCKEHQDGTAFSSTQ 1315
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 828 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-DMPDGSWFCNDC 870
Score = 40.4 bits (93), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 6/83 (7%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE-STCHVND 870
C +C G L+ C+G C FH CL + + P G++ C C+ GR + +C V
Sbjct: 663 VCQLCEKTGSLVLCEGPCCGAFHLACLGLSRRPEGRFTCSECA---SGRFSWIHSCFVCK 719
Query: 871 QDDSALSTLQICSLCEEKYHQSC 893
+ + + + S C + YH++C
Sbjct: 720 ESKTDVKRC-VVSQCGKFYHEAC 741
>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
Length = 1364
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ + CH
Sbjct: 1237 SEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTVFCH 1296
Query: 868 V-------NDQDDSALSTLQ 880
QD +A S+ Q
Sbjct: 1297 FCPNSFCKEHQDGTAFSSTQ 1316
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 829 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-DMPDGSWFCNDC 871
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 419 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRCTC 464
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 345 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHC 390
>gi|224071533|ref|XP_002303505.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222840937|gb|EEE78484.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 2327
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS 854
C IC GG+L+CCD CP +H CLD +K+ P GKW C CS
Sbjct: 75 CVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCPKCS 117
>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
Length = 1366
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 27/111 (24%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD GC +H +CLD K P G+W C + C CG+ +++ C
Sbjct: 1231 SEDECFRCGDGGQLVLCDKKGCTKAYHLSCLDRTKRPFGRWDCPWHHCDVCGKNSDAFC- 1289
Query: 868 VNDQDDSALSTLQIC--SLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQE 916
Q+C S C K HQ +GA++ P + C CQE
Sbjct: 1290 ------------QLCPNSFC--KAHQ-----EGALRSHPLTGQLC---CQE 1318
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C IC GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 827 NVSWCFICSKGGRLLCCESCPAAFHPDCLNI-AMPDGSWFCNDC 869
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 462 DDHHIEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCKC 512
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 381 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 429
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 463 DDHHIEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCKC 513
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 382 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 430
>gi|297734860|emb|CBI17094.3| unnamed protein product [Vitis vinifera]
Length = 1382
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCS 854
D +D N D C +CG G L+CCDGCPS +H C+ + K P G W C C+
Sbjct: 190 DDQDVNGDECRLCGMDGTLLCCDGCPSVYHSRCIGVSKMFIPDGPWFCPECT 241
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N + C IC G+LI CD CP +FH +C+D+K+ P G W C C
Sbjct: 1064 NQEVCNICESPGELILCDFCPKSFHLDCIDLKRLPRGTWKCPPC 1107
>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Takifugu rubripes]
Length = 1329
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 27/116 (23%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G ++D C CGDGG L+ CD C +H +CL++ K P G+W C + C CG+ +
Sbjct: 1191 EGRKKSEDECFRCGDGGQLVLCDKKTCTKAYHLSCLNLTKRPFGRWDCPWHHCDVCGKNS 1250
Query: 863 ESTCHVNDQDDSALSTLQIC--SLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQE 916
E+ C Q+C S C K HQ +GA++ P + C CQE
Sbjct: 1251 EAFC-------------QLCPNSFC--KCHQ-----EGALRPSPVTGQLC---CQE 1283
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 791 NVSWCFVCSKGGQLLCCESCPAAFHPDCLNI-AMPDGSWFCNDC 833
>gi|255544538|ref|XP_002513330.1| conserved hypothetical protein [Ricinus communis]
gi|223547238|gb|EEF48733.1| conserved hypothetical protein [Ricinus communis]
Length = 602
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 50/101 (49%), Gaps = 24/101 (23%)
Query: 801 FVNFD------GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVY 852
F +FD G+D + C IC +GGDL+CCD CP T+H CL ++ PSG W C
Sbjct: 45 FNSFDLPPRRIGDDGHYYECVICDNGGDLLCCDTCPGTYHLQCLTPPLELVPSGNWQCEN 104
Query: 853 CSCQFCGRI--------------NE-STCHVNDQDDSALST 878
C CQ + NE +T HV DQ ++ ST
Sbjct: 105 C-CQAADLLTPLKYLEGLKRNASNENATNHVGDQKETETST 144
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+++CCD CPS +H CL + P G W C CSC
Sbjct: 432 DDEHQEFCRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPRCSC 482
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 40/106 (37%), Gaps = 32/106 (30%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS-------- 854
DGE + D C +C GG++I CD CP +H CLD ++ P GKW C C
Sbjct: 372 DGEFEHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCEAEGPADED 431
Query: 855 -------CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
C+ C E C C C YH C
Sbjct: 432 DDEHQEFCRVCKDGGEMLC---------------CDSCPSAYHTWC 462
>gi|255956317|ref|XP_002568911.1| Pc21g19200 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590622|emb|CAP96817.1| Pc21g19200 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 986
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSCQ 856
N D C CG GGDL+CCD C +++H +C LD K P G+WHC CS +
Sbjct: 544 NSDDCHQCGKGGDLLCCDTCINSYHFDCLDPPLDPKNPPQGEWHCPKCSIR 594
>gi|359479395|ref|XP_002271257.2| PREDICTED: uncharacterized protein LOC100243147 [Vitis vinifera]
Length = 1582
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCS 854
D +D N D C +CG G L+CCDGCPS +H C+ + K P G W C C+
Sbjct: 416 DDQDVNGDECRLCGMDGTLLCCDGCPSVYHSRCIGVSKMFIPDGPWFCPECT 467
>gi|74149099|dbj|BAE32198.1| unnamed protein product [Mus musculus]
Length = 895
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 52 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 102
>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
carolinensis]
Length = 1438
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C+ CG S C
Sbjct: 1321 HEDNCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPFGKWECPWHQCEICGNPATSFC 1379
Score = 48.5 bits (114), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL I + P+G W+C C
Sbjct: 918 CEKGGKLLCCESCPASFHPECLHI-EMPNGSWNCNDC 953
Score = 40.8 bits (94), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 47/102 (46%), Gaps = 13/102 (12%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C IC G+ L+ C+G C FH CL + P GK+ C+ C G+
Sbjct: 695 GTSKKDSICQICESSGESLLSCEGECYRVFHTECLGLSSQPEGKFICIECK---NGQHTC 751
Query: 864 STCHVNDQDDSALSTLQICSL--CEEKYHQSCSQTDGAVQYE 903
+C + +D ++ CS+ C + YH++C + +E
Sbjct: 752 FSCKLPGKD------VKRCSVNACGKFYHEACVRKFATTVFE 787
>gi|159162701|pdb|1MM2|A Chain A, Solution Structure Of The 2nd Phd Domain From Mi2b
gi|319443275|pdb|2L75|A Chain A, Solution Structure Of Chd4-Phd2 In Complex With H3k9me3
Length = 61
Score = 61.6 bits (148), Expect = 4e-06, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 37/52 (71%), Gaps = 2/52 (3%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
G D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 4 GSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 55
>gi|348675982|gb|EGZ15800.1| hypothetical protein PHYSODRAFT_263017 [Phytophthora sojae]
Length = 823
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 801 FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSG--KWHCVYCS 854
++ D D N+D C CG G+LICCDGCP+ FH NC ++ P W C C+
Sbjct: 643 YIKSDQNDLNEDACFRCGLAGELICCDGCPAAFHLNCTNLAMVPPDGIPWFCSECT 698
>gi|315113501|pdb|3O33|A Chain A, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113502|pdb|3O33|B Chain B, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113503|pdb|3O33|C Chain C, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113504|pdb|3O33|D Chain D, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113505|pdb|3O34|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(13-32)k23ac Peptide
gi|315113507|pdb|3O35|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113508|pdb|3O35|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113511|pdb|3O36|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113512|pdb|3O36|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113515|pdb|3O37|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113516|pdb|3O37|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113517|pdb|3O37|C Chain C, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113518|pdb|3O37|D Chain D, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
Length = 184
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 809 PNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
PN+D C +C +GG+L+CC+ CP FH +C + FPSG+W C +C
Sbjct: 2 PNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFC 48
>gi|242051184|ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
gi|241926713|gb|EER99857.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
Length = 1688
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ 856
+D N D C +CG G+L+CCDGCP+ FH C+ + + P G W+C C Q
Sbjct: 422 DDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPECLIQ 473
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+++ C +C DGGDL+CCD CPS +H+ CL +K P G W C C
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|431897323|gb|ELK06585.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
Length = 502
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 41/80 (51%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H CL + K P GKW C + C CG+ + S CH
Sbjct: 307 SEDECFRCGDGGQLVLCDRRSCTKAYHLACLGLGKRPFGKWECPWHHCDVCGKPSTSFCH 366
Query: 868 V-------NDQDDSALSTLQ 880
QD +A S+ Q
Sbjct: 367 FCPNSFCKEHQDGTAFSSTQ 386
>gi|270005635|gb|EFA02083.1| hypothetical protein TcasGA2_TC007718 [Tribolium castaneum]
Length = 980
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 34/52 (65%), Gaps = 6/52 (11%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCS 854
+DPN+D C +C DGG+L+CCD CP FHQ C L +++ + W C+ C+
Sbjct: 779 DDPNEDWCAVCMDGGELVCCDKCPKVFHQYCHIPNLSVEE--NDTWQCLLCT 828
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ + C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 437 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 484
>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
Length = 1461
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 27/111 (24%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD GC +H +CLD K P G+W C + C CG+ + + C
Sbjct: 1326 SEDECFRCGDGGQLVLCDKKGCTKAYHLSCLDRTKRPFGRWDCPWHHCDVCGKNSAAFC- 1384
Query: 868 VNDQDDSALSTLQIC--SLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQE 916
Q+C S C K HQ +GA++ P + C CQE
Sbjct: 1385 ------------QLCPNSFC--KAHQ-----EGALRSHPLTGQLC---CQE 1413
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C IC GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 922 NVSWCFICSKGGRLLCCESCPAAFHPDCLNI-AMPDGSWFCNDC 964
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 462
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 339 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 387
>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
glaber]
Length = 1372
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 9/80 (11%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ + S CH
Sbjct: 1244 SEDECFRCGDGGQLVLCDRKLCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSMSFCH 1303
Query: 868 V-------NDQDDSALSTLQ 880
+ QD ++ T Q
Sbjct: 1304 LCPNSLCKEHQDGASFRTAQ 1323
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 843 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSWFCNDC 885
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+++ C +C DGGDL+CCD CPS +H+ CL +K P G W C C
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|118384512|ref|XP_001025404.1| PHD-finger family protein [Tetrahymena thermophila]
gi|89307171|gb|EAS05159.1| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 1453
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
D+ C +CG G+++ CD CPS FH C+ +K P G W C+ C
Sbjct: 1391 DEQCKVCGQHGEVLMCDTCPSVFHLKCIGLKSLPDGDWSCLEC 1433
>gi|194874449|ref|XP_001973400.1| GG13363 [Drosophila erecta]
gi|190655183|gb|EDV52426.1| GG13363 [Drosophila erecta]
Length = 711
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS +H CL+ + P G W C CSC
Sbjct: 440 CRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSC 483
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 811 DDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+D C CGDGG ++ C GCP +H +CL++ K P+G+W C + C CG+ S C +
Sbjct: 953 EDECFSCGDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQCDICGKEAASFCEM 1012
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CC+ CP+ FH+ CL+I + P G W C C
Sbjct: 527 HFTPRKGCKNHEHINVSWCFVCSEGGSLLCCESCPAAFHRECLNI-EMPQGSWFCNDC 583
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 39/92 (42%), Gaps = 8/92 (8%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C +C GDL+ CDG C FH C+ + P GK+ C CS +C V
Sbjct: 379 ENVCQVCERTGDLLACDGHCYGAFHPQCIGLSAAPKGKFFCRECSSGV------HSCFVC 432
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQ 901
+ + I LC + YH C A Q
Sbjct: 433 KTSGNGVKRCMI-PLCGKFYHNDCITAFSATQ 463
>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Megachile rotundata]
Length = 1967
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 426 DDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 476
Score = 48.1 bits (113), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 18/93 (19%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCE-------GEGITGAA 423
Query: 870 DQDDSALSTLQICS---------LCEEKYHQSC 893
+ DD + +IC C YH C
Sbjct: 424 EDDDEHMEFCRICKDGGELLCCDSCTSAYHTHC 456
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 190 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTC 240
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 111 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 159
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 35/50 (70%), Gaps = 2/50 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 398 DDHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCMC 447
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL +++K P GKW C +C
Sbjct: 325 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHC 373
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
DGE ++D C C GG+LICCD CP FH C ++K P G W C C
Sbjct: 1416 DGEIEHEDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENC 1466
>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Bombus impatiens]
Length = 1965
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 425 DDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 475
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 18/93 (19%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCE-------GEGIAGAA 422
Query: 870 DQDDSALSTLQICS---------LCEEKYHQSC 893
+ DD + +IC C YH C
Sbjct: 423 EDDDEHMEFCRICKDGGELLCCDSCTSAYHTHC 455
>gi|347965727|ref|XP_321817.5| AGAP001328-PA [Anopheles gambiae str. PEST]
gi|333470378|gb|EAA01183.5| AGAP001328-PA [Anopheles gambiae str. PEST]
Length = 1278
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYC 853
DPN+D C +C DGG+L+CCD CP FHQ C +D S W C+ C
Sbjct: 1076 DPNEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDESETWQCLLC 1125
>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Apis mellifera]
Length = 1966
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 425 DDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 475
Score = 48.1 bits (113), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 18/93 (19%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCE-------GEGIAGAA 422
Query: 870 DQDDSALSTLQICS---------LCEEKYHQSC 893
+ DD + +IC C YH C
Sbjct: 423 EDDDEHMEFCRICKDGGELLCCDSCTSAYHTHC 455
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 381 DDHHIEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCKC 431
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 300 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 348
>gi|156554524|ref|XP_001605295.1| PREDICTED: transcription intermediary factor 1-alpha-like [Nasonia
vitripennis]
Length = 1085
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 5/52 (9%)
Query: 808 DPNDDTCGICGDGGD-LICCDGCPSTFHQNC--LDIKKFP--SGKWHCVYCS 854
DPN+D C +C DGGD ++CCD CP FH C ++K FP S W C+ C+
Sbjct: 876 DPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPNLKSFPEESETWQCMLCT 927
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 426 CRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCMC 469
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 347 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 395
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 424 DDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 474
Score = 48.1 bits (113), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 18/93 (19%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCE-------GEGIAGAA 421
Query: 870 DQDDSALSTLQICS---------LCEEKYHQSC 893
+ DD + +IC C YH C
Sbjct: 422 EDDDEHMEFCRICKDGGELLCCDSCTSAYHTHC 454
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 397 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTC 442
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 322 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 367
>gi|312373820|gb|EFR21503.1| hypothetical protein AND_16980 [Anopheles darlingi]
Length = 1342
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYC 853
DPN+D C +C DGG+L+CCD CP FHQ C +D S W C+ C
Sbjct: 1004 DPNEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDESETWQCLLC 1053
>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Bombus terrestris]
Length = 1974
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C IC DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 434 DDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 484
Score = 48.1 bits (113), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 40/95 (42%), Gaps = 18/95 (18%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
+ D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 377 HQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCE-------GEGIAG 429
Query: 868 VNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ DD + +IC C YH C
Sbjct: 430 AAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHC 464
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 402 CRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCMC 445
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 323 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 371
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 39/60 (65%), Gaps = 3/60 (5%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
ED + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 529 EDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 588
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 452 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 500
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 39/60 (65%), Gaps = 3/60 (5%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
ED + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 275 EDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 334
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 198 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 246
>gi|347969095|ref|XP_563067.4| AGAP003029-PA [Anopheles gambiae str. PEST]
gi|333467701|gb|EAL40781.4| AGAP003029-PA [Anopheles gambiae str. PEST]
Length = 914
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 4/70 (5%)
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDI----KKFPSG 846
+++ KR + G N+DTC C +GG L+CCD CPS+FH C D ++ P G
Sbjct: 40 RNQQKRNNHPYYRKPGRGHNNDTCDSCKEGGALLCCDRCPSSFHLGCHDPPLSEEEIPYG 99
Query: 847 KWHCVYCSCQ 856
+W C C C+
Sbjct: 100 QWVCHTCKCK 109
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 29/45 (64%), Gaps = 2/45 (4%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
++ C +C DGGDL+CCD CPS +H+ CL +K P G W C C
Sbjct: 8 EEYCRVCRDGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRC 52
>gi|156544115|ref|XP_001605754.1| PREDICTED: PHD finger protein 12-like [Nasonia vitripennis]
Length = 661
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 4/54 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSC 855
G N D+C C DGG+LICCD CP++FH C L++ P+G+W C C C
Sbjct: 53 GRGHNRDSCDACHDGGELICCDKCPASFHLQCHDPPLELSDIPNGEWICHACRC 106
Score = 45.8 bits (107), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 5/42 (11%)
Query: 814 CGICGDG---GDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
C +C LI CD CP FHQ+CLD + FPSG+W C
Sbjct: 193 CFVCRKSCRKAPLIACDYCPLYFHQDCLDPPLTAFPSGRWMC 234
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 39/74 (52%), Gaps = 7/74 (9%)
Query: 783 CILD-SWNKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD-- 839
C++D + N D + G + DGE ++D C C GG+LICCD CP FH C
Sbjct: 1468 CVMDRTTNGADSEEEMGSN----DGEIEHEDVCSRCRHGGELICCDTCPKAFHMECCKPV 1523
Query: 840 IKKFPSGKWHCVYC 853
++K P G W C C
Sbjct: 1524 LRKVPKGHWECENC 1537
Score = 43.9 bits (102), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+++ C CG G LI C CP +H CLD + K W C C
Sbjct: 1423 HNELCQSCGHAGQLILCHDCPIVYHCECLDPPLSKLTQDHWFCPLC 1468
>gi|425777912|gb|EKV16064.1| hypothetical protein PDIP_37980 [Penicillium digitatum Pd1]
gi|425779981|gb|EKV18004.1| hypothetical protein PDIG_11750 [Penicillium digitatum PHI26]
Length = 988
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSCQ 856
N D C CG GGDL+CCD C +++H C LD K P G+WHC CS +
Sbjct: 543 NSDDCHQCGKGGDLLCCDTCINSYHFECLDPPLDPKNPPQGEWHCPKCSIR 593
>gi|159162702|pdb|1MM3|A Chain A, Solution Structure Of The 2nd Phd Domain From Mi2b With C-
Terminal Loop Replaced By Corresponding Loop From Wstf
Length = 61
Score = 60.8 bits (146), Expect = 7e-06, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 35/52 (67%), Gaps = 2/52 (3%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
G D + + C +C DGG+L+CCD CPS++H +CL + + P G+W C C+C
Sbjct: 4 GSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLRPALYEVPDGEWQCPRCTC 55
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 60.8 bits (146), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 2/54 (3%)
Query: 804 FDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ +D + + C +C DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 419 LEDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLTEIPDGDWKCPRCSC 472
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 41/105 (39%), Gaps = 31/105 (29%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCS-------- 854
DGE + D C +C GG++I CD CP +H C ++++ P GKW C +C
Sbjct: 363 DGEGDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCENDGALEDD 422
Query: 855 ------CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
C+ C E C C C YH C
Sbjct: 423 DEHMEFCRVCKDGGELLC---------------CDSCTSAYHTHC 452
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 60.8 bits (146), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 579 CRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCMC 622
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD ++K P GKW C +C
Sbjct: 500 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 548
>gi|321453629|gb|EFX64846.1| hypothetical protein DAPPUDRAFT_229741 [Daphnia pulex]
Length = 1431
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
D E +++ C +C +GG++ICCD CP+ +H C++ ++K P GKW C C
Sbjct: 1144 DVESVDNEICPVCQEGGEVICCDTCPAVYHLECINPPLRKVPRGKWSCPQC 1194
>gi|255565329|ref|XP_002523656.1| chromodomain helicase DNA binding protein, putative [Ricinus
communis]
gi|223537108|gb|EEF38742.1| chromodomain helicase DNA binding protein, putative [Ricinus
communis]
Length = 2257
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
C IC GG+L+CCD CP +H CLD +K+ P GKW C C
Sbjct: 78 CVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCPKC 119
>gi|380020007|ref|XP_003693890.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase
TRIM33-like [Apis florea]
Length = 1046
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 38/71 (53%), Gaps = 8/71 (11%)
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGD-LICCDGCPSTFHQNC--LDIKKFP- 844
N D SK + +DPN+D C +C DGGD ++CCD CP FH C +K FP
Sbjct: 821 NSSDTSKNTA---ASMTKDDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPD 877
Query: 845 -SGKWHCVYCS 854
S W C+ C+
Sbjct: 878 ESETWQCMLCT 888
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C C +GG+L+ CD CP +H CL++ K P GKW C + C CG++ C
Sbjct: 1313 HEDECFRCSEGGELVMCDRKTCPKAYHLTCLNLTKPPHGKWECPWHHCDVCGKLATVLCD 1372
Query: 868 V 868
+
Sbjct: 1373 I 1373
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 820 GGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
GGDL+CC+ CP+ FH CL ++ P G W C CS
Sbjct: 908 GGDLLCCEMCPAAFHPQCLGLEDLPEGTWFCRDCS 942
Score = 40.8 bits (94), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 22/84 (26%), Positives = 40/84 (47%), Gaps = 8/84 (9%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C +C G+L+ C+G C FH +C+ +++ P+G + C C TC V
Sbjct: 680 ENLCQVCEQVGELLLCEGSCCGAFHLDCIGLQQMPTGTFKCDECISGV------HTCFVC 733
Query: 870 DQDDSALSTLQICSLCEEKYHQSC 893
+ + I +C + YH+ C
Sbjct: 734 RKSEVTTKRCSI-PICGKYYHEDC 756
>gi|307180140|gb|EFN68184.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Camponotus floridanus]
Length = 1960
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD C S +H +CL+ + + P G W C CSC
Sbjct: 423 DDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRCSC 473
Score = 48.5 bits (114), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 18/95 (18%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
+ D C +C GG++I CD CP +H C ++++ P GKW C +C E
Sbjct: 366 HQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCAHCE-------GEGIAG 418
Query: 868 VNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ DD + ++C C YH C
Sbjct: 419 AAEDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHC 453
>gi|414887990|tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1679
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
+D N D C +CG G+L+CCDGCP+ FH C+ + + P G W+C C
Sbjct: 417 DDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 465
>gi|383860809|ref|XP_003705881.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Megachile rotundata]
Length = 1061
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 38/71 (53%), Gaps = 8/71 (11%)
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGD-LICCDGCPSTFHQNC--LDIKKFP- 844
N D SK + +DPN+D C +C DGGD ++CCD CP FH C +K FP
Sbjct: 836 NSSDNSKNAA---ASMAKDDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPD 892
Query: 845 -SGKWHCVYCS 854
S W C+ C+
Sbjct: 893 ESETWQCMLCT 903
>gi|340715100|ref|XP_003396058.1| PREDICTED: LOW QUALITY PROTEIN: e3 ubiquitin-protein ligase
TRIM33-like [Bombus terrestris]
Length = 1036
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 38/71 (53%), Gaps = 8/71 (11%)
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGD-LICCDGCPSTFHQNC--LDIKKFP- 844
N D SK + +DPN+D C +C DGGD ++CCD CP FH C +K FP
Sbjct: 811 NSSDNSKNAA---ASMAKDDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPD 867
Query: 845 -SGKWHCVYCS 854
S W C+ C+
Sbjct: 868 ESETWQCMLCT 878
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
N+D C +C DGG+LICCDGCP FH CL +++ PSG W C C
Sbjct: 4 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 49
>gi|168012021|ref|XP_001758701.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
gi|162690311|gb|EDQ76679.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
Length = 1220
Score = 60.5 bits (145), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
++D C IC GGDL+CCD C + FH CLD +K P GKW C C
Sbjct: 52 SEDFCSICHLGGDLLCCDTCTAVFHLGCLDPPMKVVPRGKWSCPKC 97
>gi|350414692|ref|XP_003490389.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Bombus
impatiens]
Length = 1036
Score = 60.5 bits (145), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 38/71 (53%), Gaps = 8/71 (11%)
Query: 789 NKQDESKRKGFHFVNFDGEDPNDDTCGICGDGGD-LICCDGCPSTFHQNC--LDIKKFP- 844
N D SK + +DPN+D C +C DGGD ++CCD CP FH C +K FP
Sbjct: 811 NSSDNSKNAA---ASMAKDDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPD 867
Query: 845 -SGKWHCVYCS 854
S W C+ C+
Sbjct: 868 ESETWQCMLCT 878
>gi|332024007|gb|EGI64225.1| E3 ubiquitin-protein ligase TRIM33 [Acromyrmex echinatior]
Length = 1083
Score = 60.5 bits (145), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 807 EDPNDDTCGICGDGGD-LICCDGCPSTFHQNCL--DIKKFP--SGKWHCVYCS 854
+DPN+D C +C DGGD ++CCD CP FH C +K FP S W C+ C+
Sbjct: 872 DDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHVPSLKSFPDESETWQCMLCT 924
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 60.5 bits (145), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 433 CRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCTC 476
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
D C +C GG++I CD CP +H CL D++K P G W C +C
Sbjct: 354 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC 397
>gi|168051599|ref|XP_001778241.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
gi|162670338|gb|EDQ56908.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
Length = 1245
Score = 60.5 bits (145), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
++D C IC GGDL+CCD C + FH CLD +K P GKW C C
Sbjct: 52 SEDFCSICHLGGDLLCCDTCTAVFHLGCLDPPMKVVPRGKWSCPKC 97
>gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1712
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
+D N D C +CG G+L+CCDGCP+ FH C+ + + P G W+C C
Sbjct: 417 DDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 465
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+C
Sbjct: 433 CRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCTC 476
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
D C +C GG++I CD CP +H CL D++K P G W C +C
Sbjct: 354 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC 397
>gi|47206539|emb|CAF92235.1| unnamed protein product [Tetraodon nigroviridis]
Length = 993
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 33/46 (71%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+ C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C C
Sbjct: 2 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCMC 47
>gi|145479813|ref|XP_001425929.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393001|emb|CAK58531.1| unnamed protein product [Paramecium tetraurelia]
Length = 1299
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 56/117 (47%), Gaps = 12/117 (10%)
Query: 808 DPNDDTCGICGDGGD-LICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
D N+D C C L+CCD C +FH C+ IKK P+G W+C C CQ+ +S C
Sbjct: 79 DYNEDYCWKCRQKNRPLLCCDSCYRSFHMACVGIKKMPAGSWYCPQC-CQY----EQSYC 133
Query: 867 HVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKK--CQEIFERL 921
D+ + ICS C H C D + +E S +K +E+F++L
Sbjct: 134 PYCDEQSTNEKI--ICSKCNTFIHLECILKD--IPFEAVLRSPVNRKHISEELFQQL 186
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 3/53 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRI 861
+ C +C DGG+L+CCD C S++H +CL+ + + P+G+W C C+C GR+
Sbjct: 515 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPEIPNGEWLCPRCTCPMLKGRV 567
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + K P GKW C +C
Sbjct: 434 ETDHQDYCEVCQQGGEIILCDSCPRAYHLVCLDPELDKAPEGKWSCPHC 482
>gi|297746282|emb|CBI16338.3| unnamed protein product [Vitis vinifera]
Length = 1452
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 44/77 (57%), Gaps = 8/77 (10%)
Query: 783 CILDSWNKQDESK--RKGFHFVNFDGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCL 838
C+ + ++ ESK K FH + G+D N TC +C GG L+CCD GC ++H CL
Sbjct: 416 CMTTGYAEKCESKIKEKEFHLDSQTGDDHN--TCVVCKLGGKLLCCDGKGCKRSYHLACL 473
Query: 839 D--IKKFPSGKWHCVYC 853
D + + P G WHC+ C
Sbjct: 474 DPPLGEIPPGIWHCMLC 490
>gi|40644808|emb|CAE53912.1| hypothetical protein [Triticum aestivum]
Length = 134
Score = 60.1 bits (144), Expect = 1e-05, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
+D N D C ICG G L+CCDGCP +H C+ + K P G W C C
Sbjct: 22 QDSNSDDCRICGMDGTLVCCDGCPWAYHSRCIGLNKAFLPQGLWFCPEC 70
>gi|157116322|ref|XP_001658420.1| hypothetical protein AaeL_AAEL007538 [Aedes aegypti]
gi|108876507|gb|EAT40732.1| AAEL007538-PA, partial [Aedes aegypti]
Length = 982
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYC 853
DPN+D C +C DGG+L+CCD CP FHQ C +D S W C+ C
Sbjct: 804 DPNEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDESETWQCLLC 853
>gi|440639055|gb|ELR08974.1| hypothetical protein GMDG_00592 [Geomyces destructans 20631-21]
Length = 1061
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 37/74 (50%), Gaps = 9/74 (12%)
Query: 791 QDESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD-----IKKFPS 845
Q +S G VNFD ND+ C CG GDL+CCDGC FH C+D + + P
Sbjct: 644 QRDSPVSGGPGVNFDE---NDEYCSACGGNGDLVCCDGCTRAFHLICVDPPMESVSQLPP 700
Query: 846 GKWHCVYCSCQFCG 859
W+CV C G
Sbjct: 701 -DWYCVSCGSLRTG 713
>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
(Silurana) tropicalis]
Length = 1298
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 12/90 (13%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
DG+ ++D C C DGG+L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 1177 DGKKQSEDYCFRCNDGGELVLCDRKFCTKAYHLSCLSLTKRPFGKWECPWHHCDVCGKAS 1236
Query: 863 ESTC----------HVNDQDDSALSTLQIC 882
S C H +D + + Q+C
Sbjct: 1237 VSCCSLCPNSFCKGHYDDSQFTRTAEGQLC 1266
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C +GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 774 NVSWCFVCSNGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 816
Score = 40.8 bits (94), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 37/87 (42%), Gaps = 18/87 (20%)
Query: 813 TCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYC-----SCQFCGRINESTC 866
C +C GDL+ C+G C S FH +C+ + P+GK+ C C SC C N
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLSCIGLSTRPAGKYLCKECTSGARSCFLCKESNRDV- 671
Query: 867 HVNDQDDSALSTLQICSLCEEKYHQSC 893
I C + YH+SC
Sbjct: 672 -----------KRCIVPHCGKFYHESC 687
>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
magnipapillata]
Length = 1214
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 37/59 (62%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICC--DGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
+D+ C +CGDGG+LI C + C T+H CL + K P G+W C + C CG++ +S C
Sbjct: 1020 HDNDCFVCGDGGELIMCSRNKCSKTYHIKCLKLDKRPYGRWECPWHYCDVCGKLAKSLC 1078
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL-DIKKFPSGKWHCVYC 853
N C IC GG L+CCD CP++FH C D+K+ P W C C
Sbjct: 613 NVTWCFICSQGGSLVCCDTCPASFHVECYEDLKEIPENSWQCNDC 657
>gi|449683391|ref|XP_002153791.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Hydra
magnipapillata]
Length = 1699
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
D E+ + D C IC DGG+L+CCD CP T+H C+ + P G+W C C
Sbjct: 221 DDENFHGDYCKICRDGGELLCCDFCPGTYHMRCVKPQLITVPEGEWKCPLC 271
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 42/100 (42%), Gaps = 19/100 (19%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINES 864
E ++ C +CG+GGD++ CD C +H CLD + + P G W C C + G + E
Sbjct: 160 EPIHNSVCDVCGEGGDILLCDTCTCVWHLTCLDPPLDEVPEGDWSCPKCEDELSGPVEE- 218
Query: 865 TCHVNDQDDSAL-----------STLQICSLCEEKYHQSC 893
D+DD L C C YH C
Sbjct: 219 -----DEDDENFHGDYCKICRDGGELLCCDFCPGTYHMRC 253
>gi|281205840|gb|EFA80029.1| hypothetical protein PPL_06850 [Polysphondylium pallidum PN500]
Length = 704
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDI--KKFPSGKWHCVYCSCQFCGRINESTC 866
N+D C C DGGDL+CC+ C +FH CL+ + P G W+C C+ + C + + C
Sbjct: 87 NNDFCYSCRDGGDLLCCENCELSFHLLCLNPPNPEVPDGDWYCTRCTNKICTNASHTYC 145
>gi|332838463|ref|XP_508960.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 isoform 3 [Pan troglodytes]
Length = 1825
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRI 861
E + D C +C GG++I CD CP +H CL D++K P GKW C +C G++
Sbjct: 359 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCCPALKGKV 415
>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
Length = 857
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 725 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 784
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 785 TSFCHLCPNSFCKEHQDGTAFRSTQ 809
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 322 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 364
>gi|402894298|ref|XP_003910303.1| PREDICTED: tripartite motif-containing protein 66, partial [Papio
anubis]
Length = 1041
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 962 NEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLC 1007
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 10/87 (11%)
Query: 810 NDDTCGICGDGGDLICCD-GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
+DD CGICG+ G+L+CC+ GC S++H C+ + P G + C C+ C
Sbjct: 676 HDDYCGICGEAGNLLCCEGGCLSSYHLFCVGLSCAPQGAFVCDACT------TGNHLCFA 729
Query: 869 NDQDDSALSTLQICSL--CEEKYHQSC 893
+Q L LQ CS+ C +KYH++C
Sbjct: 730 CEQ-PGGLEGLQTCSVRNCGKKYHRAC 755
Score = 47.8 bits (112), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 6/48 (12%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD----IKKFPS--GKWHCVYC 853
+ C CGDGG L CCD CP+ +HQ CL + PS W+C C
Sbjct: 831 NICLACGDGGRLFCCDTCPAAYHQECLKDVLALTGTPSEDSPWYCHEC 878
>gi|340718584|ref|XP_003397745.1| PREDICTED: PHD finger protein 12-like [Bombus terrestris]
Length = 638
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSCQFCGRI 861
G N D C C DGG+LICCD CP+++H C +D P+G+W C C C +
Sbjct: 51 GRGHNHDICDACRDGGELICCDKCPASYHLQCHYPAVDPADIPNGEWLCYACRCASKRNL 110
Query: 862 NESTCHVNDQDDSALSTLQICS 883
++ + ++ SAL L + +
Sbjct: 111 LDTKGNEKNKKKSALEVLALAA 132
Score = 45.4 bits (106), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
CG LI CD CP FHQ+CLD + FP G+W C
Sbjct: 197 CGRSCRKAPLIACDYCPLYFHQDCLDPPLTAFPIGRWMC 235
>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
homolog; Short=WHSC1-like protein 1
Length = 1439
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
V + + ++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG
Sbjct: 1314 VKAEAKPIHEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPHGKWECPWHRCDECG 1373
Query: 860 RINESTC 866
+ S C
Sbjct: 1374 SVAVSFC 1380
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL I P G W+C C
Sbjct: 917 CEKGGRLLCCESCPASFHPECLSI-DMPEGCWNCNDC 952
Score = 41.6 bits (96), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C +C GD L+ C+G C FH CL + P G + C C G+
Sbjct: 696 GTSKKDTVCQVCEKAGDCLVACEGECCRHFHVECLGLTAVPEGHFTCEECE---TGQHPC 752
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+C V+ +D S S+C + YH++C
Sbjct: 753 FSCKVSGKDVKRCSV----SVCGKFYHEAC 778
>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
construct]
Length = 1446
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
V + + ++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG
Sbjct: 1321 VKAEAKPIHEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPHGKWECPWHRCDECG 1380
Query: 860 RINESTC 866
+ S C
Sbjct: 1381 SVAVSFC 1387
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL I P G W+C C
Sbjct: 927 CEKGGRLLCCESCPASFHPECLSI-DMPEGCWNCNDC 962
Score = 41.6 bits (96), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C +C GD L+ C+G C FH CL + P G + C C G+
Sbjct: 706 GTSKKDTVCQVCEKAGDCLVACEGECCRHFHVECLGLTAVPEGHFTCEECE---TGQHPC 762
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+C V+ +D S S+C + YH++C
Sbjct: 763 FSCKVSGKDVKRCSV----SVCGKFYHEAC 788
>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
Length = 1562
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 814 CGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
C CG+GG+L+ CD CP +H CL++ K P G+W C + C C R S CH
Sbjct: 1437 CYCCGEGGELVMCDKKDCPKAYHLLCLNLSKPPYGRWECPWHQCSVCQRTATSFCHF 1493
Score = 47.0 bits (110), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL I + P G W C C
Sbjct: 1030 CQKGGRLLCCEACPASFHPECLKI-EMPEGTWLCGEC 1065
>gi|292621498|ref|XP_002664670.1| PREDICTED: PHD finger protein 12 [Danio rerio]
Length = 939
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 793 ESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKW 848
E +RK G N DTC C +GGDL+CCD CP+ FH C L + P G W
Sbjct: 37 EKRRKPERETRRGGRATNHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSREMLPPGDW 96
Query: 849 HCVYCS 854
C CS
Sbjct: 97 MCHRCS 102
>gi|307180087|gb|EFN68155.1| E3 ubiquitin-protein ligase TRIM33 [Camponotus floridanus]
Length = 1078
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 807 EDPNDDTCGICGDGGD-LICCDGCPSTFHQNCL--DIKKFP--SGKWHCVYCS 854
+DPN+D C +C DGGD ++CCD CP FH C +K FP S W C+ C+
Sbjct: 867 DDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCT 919
>gi|256075776|ref|XP_002574192.1| SET domain protein [Schistosoma mansoni]
Length = 1343
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 52/114 (45%), Gaps = 15/114 (13%)
Query: 789 NKQDESKRKGFHFVNFDGEDP--NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFP 844
N ++K +G N G +P ++D C CGDGG+LI CD C ++H NCL + P
Sbjct: 1011 NHTIQTKVEGNAESNTIGFNPAVHEDFCYRCGDGGELILCDKSTCSKSYHLNCLGLSVPP 1070
Query: 845 SGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDG 898
G W+C + C CG + C S CEE + +C Q D
Sbjct: 1071 LGIWYCPWHYCDLCGHPSNHLCWRCPN-----------SYCEEHANHNCIQVDS 1113
>gi|355752346|gb|EHH56466.1| hypothetical protein EGM_05877, partial [Macaca fascicularis]
Length = 1236
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 989 NEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLC 1034
>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
Length = 1381
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 802 VNFDGEDPNDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
V + + ++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG
Sbjct: 1256 VKAEAKPIHEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPHGKWECPWHRCDECG 1315
Query: 860 RINESTC 866
+ S C
Sbjct: 1316 SVAVSFC 1322
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP++FH CL I P G W+C C
Sbjct: 846 NVGFCFVCARGGRLLCCESCPASFHPECLSI-DMPEGCWNCNDC 888
Score = 41.6 bits (96), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C +C GD L+ C+G C FH CL + P G + C C G+
Sbjct: 681 GTSKKDTVCQVCEKAGDCLVACEGECCRHFHVECLGLTAVPEGHFTCEECE---TGQHPC 737
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+C V+ +D S S+C + YH++C
Sbjct: 738 FSCKVSGKDVKRCSV----SVCGKFYHEAC 763
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 881 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 940
Query: 863 ESTCHV-------NDQDDSALS 877
S CH QD +A S
Sbjct: 941 TSFCHFCPNSFCKEHQDGTAFS 962
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 478 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDC 520
>gi|356564737|ref|XP_003550605.1| PREDICTED: uncharacterized protein LOC100794210 [Glycine max]
Length = 1608
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCSCQFCG 859
D D N D C +CG G L+CCDGCP+ +H C+ + K P G W+C C G
Sbjct: 377 DDVDGNGDECRLCGMDGTLLCCDGCPAVYHSRCIGVMKMHIPEGAWYCPECKIDMIG 433
>gi|355697937|gb|EHH28485.1| hypothetical protein EGK_18929, partial [Macaca mulatta]
Length = 1236
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 989 NEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLC 1034
>gi|353231818|emb|CCD79173.1| putative set domain protein [Schistosoma mansoni]
Length = 1306
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 52/114 (45%), Gaps = 15/114 (13%)
Query: 789 NKQDESKRKGFHFVNFDGEDP--NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFP 844
N ++K +G N G +P ++D C CGDGG+LI CD C ++H NCL + P
Sbjct: 974 NHTIQTKVEGNAESNTIGFNPAVHEDFCYRCGDGGELILCDKSTCSKSYHLNCLGLSVPP 1033
Query: 845 SGKWHCVYCSCQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDG 898
G W+C + C CG + C S CEE + +C Q D
Sbjct: 1034 LGIWYCPWHYCDLCGHPSNHLCWRCPN-----------SYCEEHANHNCIQVDS 1076
>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
Length = 851
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 719 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 778
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 779 TSFCHLCPNSFCKEHQDGTAFRSTQ 803
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 316 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 358
>gi|297268466|ref|XP_001097566.2| PREDICTED: tripartite motif-containing protein 66 [Macaca mulatta]
Length = 1326
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 962 NEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLC 1007
>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
Length = 1461
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 27/111 (24%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C CGDGG L+ CD C +H +CLD K P G+W C + C CG+ +++ C
Sbjct: 1326 SEDECFRCGDGGQLVLCDKKSCTKAYHLSCLDRTKRPFGRWDCPWHHCDVCGKNSDAFC- 1384
Query: 868 VNDQDDSALSTLQIC--SLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQE 916
Q+C S C K HQ +GA++ P + C CQE
Sbjct: 1385 ------------QLCPNSFC--KAHQ-----EGALRSHPLTGQLC---CQE 1413
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C IC GG L+CC+ CP+ FH +CL+I P G W C C
Sbjct: 922 NVSWCFICSKGGRLLCCESCPAAFHPDCLNI-AMPDGSWFCNDC 964
>gi|13509324|emb|CAC35389.1| KIAA0298 protein [Homo sapiens]
Length = 1214
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 967 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1012
>gi|119589026|gb|EAW68620.1| hCG23889, isoform CRA_a [Homo sapiens]
Length = 1214
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 967 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1012
>gi|426367401|ref|XP_004050721.1| PREDICTED: tripartite motif-containing protein 66 [Gorilla gorilla
gorilla]
Length = 1085
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 927 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 972
>gi|321477322|gb|EFX88281.1| hypothetical protein DAPPUDRAFT_305689 [Daphnia pulex]
Length = 2229
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC 858
+G +D C +C GDL+CC+ CP+ +H +CLD ++ P+ W C C+ Q C
Sbjct: 353 EGNIKYNDHCRVCHKVGDLLCCETCPAVYHLHCLDPPLEHVPNEDWQCPICTAQLC 408
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 70/159 (44%), Gaps = 37/159 (23%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ-------- 856
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTPLEGRPERQF 504
Query: 857 FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCG----- 911
F S H S +S LQ+ C+ + + D EP S F G
Sbjct: 505 FVKWQGMSYWHC-----SWVSELQLELHCQVMFRNYQRKND---MDEPPSGDFGGDEEKS 556
Query: 912 -------KKCQEIFERLEKLLGVKHDLEGGYTWSLVHRF 943
K E+ ER + G+K + W ++HR
Sbjct: 557 RKRKNKDPKFAEMEERFYR-YGIKPE------WMMIHRI 588
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|209977097|ref|NP_055633.1| tripartite motif-containing protein 66 [Homo sapiens]
gi|269849740|sp|O15016.4|TRI66_HUMAN RecName: Full=Tripartite motif-containing protein 66
Length = 1216
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 969 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1014
>gi|397494634|ref|XP_003818179.1| PREDICTED: tripartite motif-containing protein 66 [Pan paniscus]
Length = 1216
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 969 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1014
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 451 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPMLKGRVQK 509
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 376 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 421
>gi|350409927|ref|XP_003488890.1| PREDICTED: PHD finger protein 12-like [Bombus impatiens]
Length = 638
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSCQFCGRI 861
G N D C C DGG+LICCD CP+++H C +D P+G+W C C C +
Sbjct: 51 GRGHNHDICDACRDGGELICCDKCPASYHLQCHYPAVDPADIPNGEWLCYACRCASKRNL 110
Query: 862 NESTCHVNDQDDSALSTLQICS 883
++ + ++ SAL L + +
Sbjct: 111 LDTKGNEKNKKKSALEVLALAA 132
Score = 45.1 bits (105), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
CG LI CD CP FHQ+CLD + FP G+W C
Sbjct: 197 CGRSCRKAPLIACDYCPLYFHQDCLDPPLTAFPIGRWMC 235
>gi|156392707|ref|XP_001636189.1| predicted protein [Nematostella vectensis]
gi|156223290|gb|EDO44126.1| predicted protein [Nematostella vectensis]
Length = 1071
Score = 59.7 bits (143), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 29/52 (55%), Gaps = 4/52 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIK----KFPSGKWHCVYC 853
G N D C CG+GGDL+CCD CP FH +C D P G+W C+ C
Sbjct: 43 GRSVNHDNCDSCGEGGDLLCCDQCPCAFHLSCCDPPLEEDDIPDGEWLCIEC 94
>gi|441646789|ref|XP_003254588.2| PREDICTED: tripartite motif-containing protein 66 [Nomascus
leucogenys]
Length = 1223
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 976 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1021
>gi|403254198|ref|XP_003919864.1| PREDICTED: tripartite motif-containing protein 66 [Saimiri
boliviensis boliviensis]
Length = 1207
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 960 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1005
>gi|241569706|ref|XP_002402600.1| PHD finger protein, putative [Ixodes scapularis]
gi|215502035|gb|EEC11529.1| PHD finger protein, putative [Ixodes scapularis]
Length = 712
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 31/53 (58%), Gaps = 4/53 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCS 854
G N D C C +GGDLICCD CP+TFH C LD P+G+W C CS
Sbjct: 43 GRTVNHDACDSCNEGGDLICCDRCPATFHLQCHDPPLDEDDLPAGEWICHRCS 95
>gi|357474041|ref|XP_003607305.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
gi|355508360|gb|AES89502.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
Length = 1573
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIK--KFPSGKWHCVYCS 854
D D N D C +C G+LICCDGCP+ FH C+ I P G W+C C+
Sbjct: 790 DTTDLNSDECCLCKMDGNLICCDGCPAAFHSRCVGIASDSLPEGDWYCPECA 841
>gi|156379809|ref|XP_001631648.1| predicted protein [Nematostella vectensis]
gi|156218692|gb|EDO39585.1| predicted protein [Nematostella vectensis]
Length = 824
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQ 856
ED + D C +C DGG L+CCD CP +H CL + + P+G+W C C +
Sbjct: 7 EDIHSDYCRVCKDGGQLLCCDKCPMAYHLKCLIPPMMRVPTGEWKCPRCQSE 58
>gi|432909970|ref|XP_004078255.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1882
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQFCGRI 861
E + D C +C GG++I CD CP +H CL D++K P GKW C +C G++
Sbjct: 380 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCCPPMKGKV 436
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC-QFCGRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P G+W C C+C Q G++ +
Sbjct: 474 DDHMEFCRVCKDGGELLCCDACVSSYHIHCLNPPLPDIPHGEWLCPRCTCPQLKGKVQK 532
Score = 55.1 bits (131), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CL ++++ P GKW C +C
Sbjct: 400 HQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELERAPQGKWSCPHC 445
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD C S++H +CL+ + + P+G+W C C+C
Sbjct: 476 DDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRCTC 526
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL ++ K P GKW C +C
Sbjct: 387 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC 435
>gi|363734262|ref|XP_420989.3| PREDICTED: tripartite motif-containing protein 66 [Gallus gallus]
Length = 1166
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 920 NEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLC 965
>gi|326920032|ref|XP_003206280.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Meleagris gallopavo]
Length = 1167
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 921 NEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLC 966
>gi|256052547|ref|XP_002569826.1| chromodomain helicase DNA binding protein [Schistosoma mansoni]
gi|353229870|emb|CCD76041.1| putative chromodomain helicase DNA binding protein [Schistosoma
mansoni]
Length = 1966
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCSCQ 856
+D + + C C DGGDLICCD CP+++H CL + P G W C C C+
Sbjct: 416 KDEHQEFCTECRDGGDLICCDNCPASYHIACLIPPLANIPEGVWLCPRCGCK 467
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG+++ CD CP +H CLD +++ P G W C +C
Sbjct: 325 ETDHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGTWSCPHC 373
>gi|427786979|gb|JAA58941.1| Putative sin3-type complex [Rhipicephalus pulchellus]
Length = 533
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCS 854
G N D C C +GGDLICCD CP+TFH C LD + PSG+W C C+
Sbjct: 41 GRAVNHDCCDSCKEGGDLICCDRCPATFHLQCHDPPLDEESLPSGEWICHRCT 93
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 38/59 (64%), Gaps = 5/59 (8%)
Query: 802 VNFDGEDPNDD---TCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
V E+ +DD C +C DGG+L+CCD C S++H +CL+ + + P+G+W C C+C
Sbjct: 351 VPMGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRCTC 409
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 9/65 (13%)
Query: 798 GFHFVNFDGEDPND-------DTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKW 848
GF G++ D D C +C GG++I CD CP +H CL ++ K P GKW
Sbjct: 254 GFDLAPVPGDEEGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKW 313
Query: 849 HCVYC 853
C +C
Sbjct: 314 SCPHC 318
>gi|410044930|ref|XP_003313014.2| PREDICTED: tripartite motif-containing protein 66 [Pan troglodytes]
Length = 1037
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 790 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 835
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
+D + + C +C DGG+L+CCD C S++H +CL+ + + P+G+W C C+C
Sbjct: 463 DDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRCTC 513
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL ++ K P GKW C +C
Sbjct: 374 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC 422
>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
latipes]
Length = 1597
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 2/64 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++ C CGDGG+L+ CD CP +H CL++ K P G+W C + C CG
Sbjct: 1468 EGKHTHEYFCFCCGDGGELVMCDRKDCPKAYHLLCLNLTKPPYGRWECPWHDCSVCGSPA 1527
Query: 863 ESTC 866
S C
Sbjct: 1528 SSHC 1531
Score = 48.1 bits (113), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 818 GDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
G GG L+CCD CP++FH CL++ P G W C C
Sbjct: 1071 GKGGKLLCCDSCPASFHPECLEM-DMPEGAWSCSDC 1105
>gi|428169481|gb|EKX38414.1| hypothetical protein GUITHDRAFT_144200 [Guillardia theta CCMP2712]
Length = 994
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 4/47 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL---DIKKFPSGKWHCVYC 853
N D C CGD GDL+CCDGCP+ +H C+ D K P W+C C
Sbjct: 433 NHDYCDDCGDAGDLLCCDGCPAAYHAACIGVADPSKLPD-PWYCNAC 478
>gi|307208076|gb|EFN85607.1| PHD finger protein 12 [Harpegnathos saltator]
Length = 633
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSC 855
G N D C C DGG+LICCD CP+++H C +D P+G+W C C C
Sbjct: 51 GRGHNHDFCDACKDGGELICCDRCPASYHLQCHYPAVDPMDIPNGEWLCYTCRC 104
Score = 45.4 bits (106), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
CG LI CD CP FHQ+CLD + FP G+W C
Sbjct: 187 CGRSCRKAPLIACDYCPLYFHQDCLDPPLTTFPIGRWMC 225
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSC 855
D C +C DGG L+CCD CP ++H CL+ ++ P G W C C C
Sbjct: 1 DFCHVCKDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRCLC 46
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 3/78 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINES 864
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 604 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQKI 663
Query: 865 TCHVNDQDDSALSTLQIC 882
+ A T Q+
Sbjct: 664 LYWRWGEPPVAAPTPQLA 681
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 526 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 574
>gi|356541435|ref|XP_003539182.1| PREDICTED: uncharacterized protein LOC100796377 [Glycine max]
Length = 1612
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 805 DGE-DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQFCG 859
DG+ D N D C +CG G L+CCDGCPS +H C+ + K P G W+C C G
Sbjct: 408 DGDVDRNGDECRLCGMDGILLCCDGCPSAYHSRCIGVLKNHIPEGPWYCPECKINMMG 465
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 275 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 333
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 198 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 246
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS 854
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCT 494
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS 854
+D + + C +C DGG+L+CCD CPS++H +CL+ + + P+G+W C C+
Sbjct: 445 DDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCT 494
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS---CQFCGR- 860
E + D C +C GG++I CD CP +H CL D++K P GKW C +C Q+ +
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE 425
Query: 861 --------INESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
+ E + ++DD + ++C C YH C
Sbjct: 426 DNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHC 475
>gi|327264985|ref|XP_003217289.1| PREDICTED: tripartite motif-containing protein 66-like [Anolis
carolinensis]
Length = 1265
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYCS 854
N+D C +C +GG+L+CCD CP FH C + FP G+W C CS
Sbjct: 909 NEDFCAVCLNGGELLCCDYCPKVFHLLCHVPALLSFPVGEWVCTLCS 955
>gi|119175045|ref|XP_001239819.1| hypothetical protein CIMG_09440 [Coccidioides immitis RS]
gi|392870013|gb|EAS28560.2| hypothetical protein CIMG_09440 [Coccidioides immitis RS]
Length = 893
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 792 DESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGK 847
DE++R H D N D C +C G+L+CCDGC +FH C LD P+G+
Sbjct: 509 DETRRSRAHAEEVD----NIDFCRVCNGTGNLLCCDGCVDSFHFGCLNPPLDANFPPAGR 564
Query: 848 WHCVYCSCQFCGRINES 864
W C C + G + E+
Sbjct: 565 WFCTTCEGKGPGAVFEA 581
>gi|449456180|ref|XP_004145828.1| PREDICTED: uncharacterized protein LOC101215849 [Cucumis sativus]
gi|449510841|ref|XP_004163779.1| PREDICTED: uncharacterized LOC101215849 [Cucumis sativus]
Length = 1719
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCS 854
N D C +CG G L+CCDGCPS +H C+ + K P G W+C CS
Sbjct: 427 NSDECRLCGMDGSLLCCDGCPSAYHLRCIGMVKVLIPQGPWYCPECS 473
>gi|224104983|ref|XP_002313643.1| predicted protein [Populus trichocarpa]
gi|222850051|gb|EEE87598.1| predicted protein [Populus trichocarpa]
Length = 1604
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCS 854
D N D C +CG G L+CCDGCPS++H C+ + K P G W+C C+
Sbjct: 417 DGNGDECRLCGIDGILLCCDGCPSSYHSRCIGVVKMYIPKGPWYCPECT 465
>gi|326669387|ref|XP_003199000.1| PREDICTED: hypothetical protein LOC566660 [Danio rerio]
Length = 1062
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C GGDL+CCD CP FH +C + FP G W C C
Sbjct: 828 NEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLHSFPVGDWICTLC 873
>gi|427788397|gb|JAA59650.1| Putative bromodomain adjacent to zinc finger domain protein 1a
[Rhipicephalus pulchellus]
Length = 1568
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS 854
N+D+C CG GG LICCD CP +H C +++ P G W+C C+
Sbjct: 1263 NEDSCDACGKGGTLICCDSCPLAYHLECTRPPLRRLPRGNWNCHKCT 1309
>gi|427788395|gb|JAA59649.1| Putative atp-dependent chromatin assembly factor large subunit
[Rhipicephalus pulchellus]
Length = 1568
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS 854
N+D+C CG GG LICCD CP +H C +++ P G W+C C+
Sbjct: 1263 NEDSCDACGKGGTLICCDSCPLAYHLECTRPPLRRLPRGNWNCHKCT 1309
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 459 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 517
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 384 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 429
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|223996275|ref|XP_002287811.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976927|gb|EED95254.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1562
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
+D C C +GGDLICCD CP FH NC I P G+W C+ C
Sbjct: 302 HDANCYTCEEGGDLICCDNCPRVFHSNCHIPKIYSLPEGEWFCMLC 347
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 37/79 (46%), Gaps = 11/79 (13%)
Query: 785 LDSWNKQDESKRKGFHFVN-------FDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC 837
L+ N E K K H+ FDGE ND C IC GGDLICCD C FH C
Sbjct: 440 LNKTNTPIEKKEKMSHYARQMQLQPVFDGEHDND--CYICFIGGDLICCDYCEKAFHMRC 497
Query: 838 L--DIKKFPSGKWHCVYCS 854
+ P+G W C CS
Sbjct: 498 HIPPLVNVPAGLWKCCECS 516
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 449 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 507
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 374 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 419
>gi|356544696|ref|XP_003540783.1| PREDICTED: uncharacterized protein LOC100808261 [Glycine max]
Length = 1644
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 805 DGE-DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQFCG 859
DG+ D N D C +CG G L+CCDGCPS +H C+ + K P G W+C C G
Sbjct: 416 DGDVDRNGDECRLCGMDGILLCCDGCPSAYHSRCIGVLKNHIPEGPWYCPECKINMMG 473
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRI 861
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+
Sbjct: 328 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRV 384
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 250 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 298
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|380013110|ref|XP_003690612.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 12-like [Apis
florea]
Length = 639
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSC 855
G N D C C DGG+LICCD CP+++H C +D P+G+W C C C
Sbjct: 51 GRGHNRDICDACRDGGELICCDKCPASYHLQCHYPAVDPADIPNGEWLCYACRC 104
Score = 45.1 bits (105), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
CG LI CD CP FHQ+CLD + FP G+W C
Sbjct: 197 CGRSCRKAPLIACDYCPLYFHQDCLDPPLTAFPIGRWMC 235
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 443 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 501
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 365 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 413
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRI 861
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+
Sbjct: 359 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRV 415
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 281 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 329
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|348553320|ref|XP_003462475.1| PREDICTED: tripartite motif-containing protein 66-like [Cavia
porcellus]
Length = 1392
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G+W C C
Sbjct: 1145 NEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLC 1190
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 424 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 478
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 342 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 390
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 498 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 556
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 423 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 468
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CG+GG+L+ CD CP +H CL + K P GKW C + C CG+ + C
Sbjct: 1499 HEDDCFRCGEGGELVMCDKKTCPKAYHVKCLGLSKRPYGKWECPWHHCDTCGKQSTKLC 1557
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 27/45 (60%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYCS 854
N C +C GG L+CC+ CP+ FH +C+ + P G W+C C+
Sbjct: 1085 NVSWCFMCSKGGSLMCCESCPAAFHPDCIGYDEIPDGSWYCRDCT 1129
Score = 42.4 bits (98), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 43/96 (44%), Gaps = 22/96 (22%)
Query: 806 GEDPNDDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYC-----SCQFCG 859
G ++ C +C G+L+ C+G C FH +C+ +++ PSG + C C SC C
Sbjct: 917 GASRKENVCQVCERPGELLLCEGQCCGAFHLDCIGLQQMPSGVFKCDECISGVHSCFICT 976
Query: 860 R--INESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+ I C V +LC + YH+ C
Sbjct: 977 KCDIEVKRCSV--------------ALCGKFYHEDC 998
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 428 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 486
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 350 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 398
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 468 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 522
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 386 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 434
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|357153170|ref|XP_003576362.1| PREDICTED: uncharacterized protein LOC100837721 [Brachypodium
distachyon]
Length = 2116
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
+D N D C ICG G L+CCDGCP +H C+ + K P G W C C
Sbjct: 741 QDGNSDDCRICGMDGTLVCCDGCPWAYHSRCIGLNKAFLPQGLWFCPEC 789
>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Cricetulus griseus]
Length = 1436
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C C R S C
Sbjct: 1319 HEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPYGKWECPWHQCDKCNRAAVSFC 1377
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL+I P G W C C
Sbjct: 917 CEKGGKLLCCESCPASFHPECLNI-DMPEGCWSCNDC 952
Score = 40.0 bits (92), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C IC GD L+ C+G C FH CL + P G++ C C G+
Sbjct: 696 GMSKKDTVCQICETAGDCLVSCEGECCRHFHWECLGLTAVPEGRFICEECK---TGQHPC 752
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+C + +D S S+C + YH++C
Sbjct: 753 FSCKESGEDVKRCSV----SVCGKFYHEAC 778
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 452 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 510
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 511 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 569
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 436 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 481
>gi|328781491|ref|XP_001121177.2| PREDICTED: PHD finger protein 12-like [Apis mellifera]
Length = 639
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSC 855
G N D C C DGG+LICCD CP+++H C +D P+G+W C C C
Sbjct: 51 GRGHNRDICDACRDGGELICCDKCPASYHLQCHYPAVDPADIPNGEWLCYACRC 104
Score = 45.1 bits (105), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
CG LI CD CP FHQ+CLD + FP G+W C
Sbjct: 197 CGRSCRKAPLIACDYCPLYFHQDCLDPPLTAFPIGRWMC 235
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 452 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 506
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 370 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 418
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 512 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 570
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 437 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 482
>gi|444728247|gb|ELW68711.1| Chromodomain-helicase-DNA-binding protein 5 [Tupaia chinensis]
Length = 2128
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRI 861
+ D C +C GG++I CD CP +H CLD ++K P GKW C +C G++
Sbjct: 364 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCCPPLKGKV 417
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRI 861
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+
Sbjct: 420 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRV 476
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 342 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 390
>gi|383847495|ref|XP_003699388.1| PREDICTED: PHD finger protein 12-like [Megachile rotundata]
Length = 637
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCSC 855
G N D C C DGG+LICCD CP+++H C +D P+G+W C C C
Sbjct: 51 GRGHNRDICDACRDGGELICCDKCPASYHLQCHYPAVDPADIPNGEWLCYACRC 104
Score = 45.1 bits (105), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHC 850
CG LI CD CP FHQ+CLD + FP G+W C
Sbjct: 197 CGRSCRKAPLIACDYCPLYFHQDCLDPPLTAFPIGRWMC 235
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 439 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 497
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 361 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 409
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 447 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 505
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 372 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 417
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 547 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 605
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 472 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 517
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 289 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 343
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 210 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 255
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 505 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 563
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 428 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 476
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|323456424|gb|EGB12291.1| hypothetical protein AURANDRAFT_61296 [Aureococcus anophagefferens]
Length = 757
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 26/46 (56%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
DPN+D C IC GG+L+CCD C FH C + P G W C C
Sbjct: 695 DPNEDHCFICRSGGELLCCDTCERAFHVTCCGLADVPEGAWSCYVC 740
Score = 41.6 bits (96), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 7/47 (14%)
Query: 814 CGICG---DGGDLICCDG----CPSTFHQNCLDIKKFPSGKWHCVYC 853
C +CG D G L+ CDG C ST H +C+ + P G W C C
Sbjct: 492 CAVCGTDHDAGSLLLCDGRDGRCVSTAHTHCIGLDAVPDGDWFCPQC 538
>gi|145539934|ref|XP_001455657.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423465|emb|CAK88260.1| unnamed protein product [Paramecium tetraurelia]
Length = 1292
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 8/91 (8%)
Query: 808 DPNDDTCGICGDGGD-LICCDGCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
D N+D C C L+CCD C +FH C+ IKK P G W+C C CQ+ +S C
Sbjct: 70 DYNEDYCWKCRQKNRPLLCCDSCYRSFHMACVGIKKMPPGSWYCPQC-CQY----EQSYC 124
Query: 867 HVNDQDDSALSTLQICSLCEEKYHQSCSQTD 897
D+ + ICS C H C D
Sbjct: 125 PYCDEQSTNEKI--ICSKCNTFIHFECILKD 153
>gi|10178250|dbj|BAB11682.1| unnamed protein product [Arabidopsis thaliana]
Length = 1516
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCSCQFCG 859
D N D C +CG G L+CCDGCP +H C+ + K P G W+C C+ + G
Sbjct: 410 DANSDECRLCGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWYCPECTIKKMG 463
>gi|440799225|gb|ELR20283.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 561
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ 856
N D C CG+GG+LICCD CP++FH CL+ + P G W C C Q
Sbjct: 26 NHDNCDSCGEGGELICCDRCPASFHLECLNPPLPCVPDGDWFCRACLLQ 74
>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Otolemur garnettii]
Length = 1427
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG S C
Sbjct: 1310 HEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPYGKWECPWHQCAECGSAAVSFC 1368
Score = 46.6 bits (109), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CCD CP++FH CL + P G W+C C
Sbjct: 918 CEKGGRLLCCDSCPASFHPECLSM-DVPEGCWNCSDC 953
Score = 40.4 bits (93), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 43/92 (46%), Gaps = 13/92 (14%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C IC GD L C+G C FH CL + P GK+ C C G+
Sbjct: 696 GMSKKDTVCQICESSGDSLTPCEGECCKHFHLECLGLTSVPDGKFICTECK---TGQHPC 752
Query: 864 STCHVNDQDDSALSTLQICSL--CEEKYHQSC 893
+C V+ +D ++ CS+ C + YH++C
Sbjct: 753 FSCKVSGKD------VKRCSVGACGKFYHEAC 778
>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Otolemur garnettii]
Length = 1438
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG S C
Sbjct: 1321 HEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPYGKWECPWHQCAECGSAAVSFC 1379
Score = 46.6 bits (109), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CCD CP++FH CL + P G W+C C
Sbjct: 918 CEKGGRLLCCDSCPASFHPECLSM-DVPEGCWNCSDC 953
Score = 40.4 bits (93), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 43/92 (46%), Gaps = 13/92 (14%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C IC GD L C+G C FH CL + P GK+ C C G+
Sbjct: 696 GMSKKDTVCQICESSGDSLTPCEGECCKHFHLECLGLTSVPDGKFICTECK---TGQHPC 752
Query: 864 STCHVNDQDDSALSTLQICSL--CEEKYHQSC 893
+C V+ +D ++ CS+ C + YH++C
Sbjct: 753 FSCKVSGKD------VKRCSVGACGKFYHEAC 778
>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Cricetulus griseus]
Length = 1387
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C C R S C
Sbjct: 1270 HEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPYGKWECPWHQCDKCNRAAVSFC 1328
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C +C GG L+CC+ CP++FH CL+I P G W C C
Sbjct: 859 CFVCARGGKLLCCESCPASFHPECLNI-DMPEGCWSCNDC 897
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 453 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 375 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 475 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 529
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 396 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 441
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 511
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 378 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 423
>gi|302796603|ref|XP_002980063.1| hypothetical protein SELMODRAFT_177985 [Selaginella moellendorffii]
gi|300152290|gb|EFJ18933.1| hypothetical protein SELMODRAFT_177985 [Selaginella moellendorffii]
Length = 1274
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+DD C +C GG L+CCD C + +H CLD +K P G W C+ C
Sbjct: 47 SDDHCSVCSLGGKLLCCDTCTAVYHLECLDPPMKSVPKGDWSCLKC 92
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 469 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 527
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 392 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 440
>gi|348543602|ref|XP_003459272.1| PREDICTED: PHD finger protein 12-like [Oreochromis niloticus]
Length = 1018
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 796 RKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCV 851
RK V G N DTC C +GGDL+CCD CP+ FH C L + P G+W C
Sbjct: 41 RKLVRDVRRSGRATNHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCH 100
Query: 852 YC 853
C
Sbjct: 101 RC 102
>gi|31418293|gb|AAH53454.1| Whsc1 protein, partial [Mus musculus]
Length = 558
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 426 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 485
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 486 TSFCHLCPNSFCKEHQDGTAFRSTQ 510
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 23 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 65
>gi|344241953|gb|EGV98056.1| Autoimmune regulator [Cricetulus griseus]
Length = 606
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 33/56 (58%), Gaps = 9/56 (16%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFCGRINE 863
N+D C +C DGG+LICCDGCP FH CL +++ P CSC GR+ +
Sbjct: 413 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIP-------RCSCCLQGRVQQ 461
>gi|410973162|ref|XP_003993024.1| PREDICTED: tripartite motif-containing protein 66 [Felis catus]
Length = 1211
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G W C C
Sbjct: 964 NEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLSFPGGDWVCTLC 1009
>gi|358342086|dbj|GAA49633.1| nucleosome-remodeling factor subunit BPTF, partial [Clonorchis
sinensis]
Length = 2682
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
DG P++D C +C GD++CCDGC + +H CL+ ++ PS W C C
Sbjct: 479 DGFLPHEDHCRVCHQSGDVLCCDGCTAVYHLTCLNPPLQSAPSSSWLCPIC 529
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 468 DDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 526
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 393 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 438
>gi|325188434|emb|CCA22970.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 609
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
D D N D C IC GD +CCD CP +FH CL +++ P G+W C C
Sbjct: 433 DEHDTNLDYCSICLKDGDFVCCDICPRSFHLECLRLRQEDLPKGEWQCEGC 483
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 49/115 (42%), Gaps = 21/115 (18%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCH 867
++D C C +GG+L CD CP +H +CL + K P GKW C + C CG+
Sbjct: 1412 HEDYCFRCAEGGELTMCDVKTCPKAYHLDCLGLTKQPYGKWQCPWHHCDVCGK------- 1464
Query: 868 VNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKCQEIFERLE 922
++++C C Y C G + FC + E ER+E
Sbjct: 1465 ---------GSVKLCQECPNSY---CKHHAGETTVIINDKVFCKEHDLEDIERME 1507
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 814 CGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C C GGDLICC+ CP+ +H CL P G W C C
Sbjct: 998 CFTCSLGGDLICCESCPAAYHAKCLGFDSVPDGNWFCRDC 1037
Score = 42.0 bits (97), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 41/96 (42%), Gaps = 22/96 (22%)
Query: 806 GEDPNDDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYC-----SCQFCG 859
G ++ C +C G L+ C+G C FH +C+ ++ PSG + C C SC C
Sbjct: 826 GAAKKENVCQVCERTGQLLLCEGGCCGAFHLDCIGLQVAPSGSFRCDECISGVHSCFVCK 885
Query: 860 RINEST--CHVNDQDDSALSTLQICSLCEEKYHQSC 893
++ CHV +C + YH+ C
Sbjct: 886 LSDQEVRRCHV--------------PVCGKYYHEGC 907
>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Otolemur garnettii]
Length = 1389
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG S C
Sbjct: 1272 HEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPYGKWECPWHQCAECGSAAVSFC 1330
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CCD CP++FH CL + P G W+C C
Sbjct: 862 NVGFCFVCARGGRLLCCDSCPASFHPECLSM-DVPEGCWNCSDC 904
Score = 40.4 bits (93), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 43/92 (46%), Gaps = 13/92 (14%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G D C IC GD L C+G C FH CL + P GK+ C C G+
Sbjct: 696 GMSKKDTVCQICESSGDSLTPCEGECCKHFHLECLGLTSVPDGKFICTECK---TGQHPC 752
Query: 864 STCHVNDQDDSALSTLQICSL--CEEKYHQSC 893
+C V+ +D ++ CS+ C + YH++C
Sbjct: 753 FSCKVSGKD------VKRCSVGACGKFYHEAC 778
>gi|302811536|ref|XP_002987457.1| hypothetical protein SELMODRAFT_158748 [Selaginella moellendorffii]
gi|300144863|gb|EFJ11544.1| hypothetical protein SELMODRAFT_158748 [Selaginella moellendorffii]
Length = 1292
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+DD C +C GG L+CCD C + +H CLD +K P G W C+ C
Sbjct: 47 SDDHCSVCSLGGKLLCCDTCTAVYHLECLDPPMKSVPKGDWSCLKC 92
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 413 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 467
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 332 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 380
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 413 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 467
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 332 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 380
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 40/62 (64%), Gaps = 6/62 (9%)
Query: 806 GEDPNDD---TCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ-FCG 859
G+D +DD C +C DGG+L+CCD C S++H +CL+ + + P+G+W C C Q G
Sbjct: 461 GDDEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRCLSQPIKG 520
Query: 860 RI 861
R+
Sbjct: 521 RV 522
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL ++ K P GKW C +C
Sbjct: 376 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC 424
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 412 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 466
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 331 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 379
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 415 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 469
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 334 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 382
>gi|384244681|gb|EIE18180.1| hypothetical protein COCSUDRAFT_20922, partial [Coccomyxa
subellipsoidea C-169]
Length = 72
Score = 58.5 bits (140), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYCS 854
D N D C +C GG L+CCDGCP+++H C+ K P G+W C C+
Sbjct: 1 DGNTDNCCLCNQGGSLLCCDGCPASYHLRCIGEQAKSLPEGEWLCPECA 49
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 812 DTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
+ C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 523 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 577
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
+ D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 445 HQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 490
>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Amphimedon queenslandica]
Length = 1171
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 807 EDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCG 859
EDP DD C IC +GGDL+ CD C +H +C+ +K+ P G +HC SC CG
Sbjct: 1075 EDPYDDECFICNEGGDLVICDYPDCYKVYHLSCIGLKELPDGDFHCRRHSCFVCG 1129
Score = 40.8 bits (94), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCL---DIKKFPSGKWHCVYC 853
N +TC CG+ G LICCD C + +H CL D + +W C C
Sbjct: 550 NMNTCLDCGEPGSLICCDSCSAAYHVKCLPDEDQSRAGQERWLCPNC 596
>gi|303314685|ref|XP_003067351.1| PHD-finger motif containing protein [Coccidioides posadasii C735
delta SOWgp]
gi|240107019|gb|EER25206.1| PHD-finger motif containing protein [Coccidioides posadasii C735
delta SOWgp]
Length = 893
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 792 DESKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGK 847
DE++R H D N D C +C G+L+CCDGC +FH C LD P+G+
Sbjct: 509 DETRRSRAHAEEVD----NIDFCRVCNGTGNLLCCDGCVDSFHFGCLNPPLDANFPPAGR 564
Query: 848 WHCVYCSCQFCGRINES 864
W C C + G + E+
Sbjct: 565 WFCTTCEEKGPGAVFEA 581
>gi|30688628|ref|NP_197668.2| PHD finger family protein [Arabidopsis thaliana]
gi|332005688|gb|AED93071.1| PHD finger family protein [Arabidopsis thaliana]
Length = 1566
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKF--PSGKWHCVYCSCQFCG 859
D N D C +CG G L+CCDGCP +H C+ + K P G W+C C+ + G
Sbjct: 410 DANSDECRLCGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWYCPECTIKKMG 463
>gi|26347387|dbj|BAC37342.1| unnamed protein product [Mus musculus]
Length = 601
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 469 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 528
Query: 863 ESTCHV-------NDQDDSALSTLQ 880
S CH+ QD +A + Q
Sbjct: 529 TSFCHLCPNSFCKEHQDGTAFRSTQ 553
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
N C +C GG L+CC+ CP+ FH +CL+I + P G W C C
Sbjct: 66 NVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSWFCNDC 108
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 803 NFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ 856
N +D + + C IC DGG+L+CCD C S +H CL+ + + P G W C CS +
Sbjct: 476 NAADDDEHSEFCRICKDGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRCSAK 531
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 41/100 (41%), Gaps = 13/100 (13%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ--FCGRIN 862
E + D C +C GG++I CD CP +H C D +++ P G+W C +C + +
Sbjct: 411 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEAPEGRWSCPHCEGEGITAATVT 470
Query: 863 ESTCHVNDQDDSALSTLQICS---------LCEEKYHQSC 893
E DD +IC C YH C
Sbjct: 471 EKAGRNAADDDEHSEFCRICKDGGELLCCDSCTSAYHTFC 510
>gi|321454459|gb|EFX65629.1| hypothetical protein DAPPUDRAFT_65368 [Daphnia pulex]
Length = 574
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC 858
+G +D C +C GDL+CC+ CP+ +H +CLD ++ P+ W C C+ Q C
Sbjct: 186 EGNIKYNDHCRVCHKVGDLLCCETCPAVYHLHCLDPPLEHVPNEDWQCPICTAQLC 241
>gi|410930175|ref|XP_003978474.1| PREDICTED: PHD finger protein 12-like [Takifugu rubripes]
Length = 1011
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 796 RKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCV 851
RK V G N DTC C +GGDL+CCD CP+ FH C L + P G+W C
Sbjct: 41 RKLVRDVRRSGRATNHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCH 100
Query: 852 YC 853
C
Sbjct: 101 RC 102
>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
Length = 1440
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG + C
Sbjct: 1323 HEDNCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPFGKWECPWHQCSVCGGPAAALC 1381
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL I P G W+C C
Sbjct: 908 CEKGGKLLCCESCPASFHPECLSI-DMPEGCWNCNDC 943
Score = 41.2 bits (95), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 43/100 (43%), Gaps = 9/100 (9%)
Query: 806 GEDPNDDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINE 863
G + D C IC G+ L+ C+G C FH CL +K P K+ C C G
Sbjct: 686 GTNKRDTVCQICESSGESLVSCEGECCGVFHPECLGLKSLPDEKFICAECK---NGEHTC 742
Query: 864 STCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE 903
+C + +D S S C + YH++C + +E
Sbjct: 743 FSCKLPGKDVKRCSV----STCGKFYHEACVRKFATALFE 778
>gi|432894203|ref|XP_004075956.1| PREDICTED: PHD finger protein 12-like [Oryzias latipes]
Length = 865
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 796 RKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCV 851
RK V G N DTC C +GGDL+CCD CP+ FH C L + P G+W C
Sbjct: 37 RKLVRDVRRSGRATNHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCH 96
Query: 852 YC 853
C
Sbjct: 97 RC 98
>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Strongylocentrotus purpuratus]
Length = 2202
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQ 856
D + + C +C DGG+L+CC+ CPS++H CL+ ++K P W C C+C+
Sbjct: 404 DEHMEFCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPRCACE 454
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 27/102 (26%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCS---------- 854
E + D C +C GG++I CD CP FH CLD ++ P GKW C C
Sbjct: 344 ETDHQDYCEVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWSCPNCEGEGIPEPEPA 403
Query: 855 ---CQFCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSC 893
+FC CH + L C C YH C
Sbjct: 404 DEHMEFC-----RVCHDGGE-------LLCCEQCPSSYHIFC 433
>gi|307205723|gb|EFN83968.1| E3 ubiquitin-protein ligase TRIM33 [Harpegnathos saltator]
Length = 1101
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 807 EDPNDDTCGICGDGGD-LICCDGCPSTFHQNCL--DIKKFP--SGKWHCVYCS 854
+DPN+D C +C DGGD ++CCD CP FH C ++ FP S W C+ C+
Sbjct: 890 DDPNEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLESFPDESETWQCMLCT 942
>gi|321466585|gb|EFX77580.1| hypothetical protein DAPPUDRAFT_9448 [Daphnia pulex]
Length = 58
Score = 58.5 bits (140), Expect = 3e-05, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 811 DDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC 858
+DTC +C GDL+CC+ CP+ +H +CLD ++ P+ W C C+ Q C
Sbjct: 7 NDTCRVCHKVGDLLCCETCPAVYHLHCLDPPLEHVPNEDWQCPICTAQLC 56
>gi|410913151|ref|XP_003970052.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Takifugu
rubripes]
Length = 1057
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 794 SKRKGFHFVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCV 851
++R F + D E ++D C +C GGDL+CCD CP FH +C + FPSG W C
Sbjct: 809 AQRGPEEFQSDDTEMESEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLLSFPSGDWVCS 868
Query: 852 YC 853
C
Sbjct: 869 LC 870
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 58.5 bits (140), Expect = 3e-05, Method: Composition-based stats.
Identities = 32/114 (28%), Positives = 60/114 (52%), Gaps = 1/114 (0%)
Query: 1003 NYKGFFTAILERDDEIISAASIRIHGKELAEMPFIGTRHMYRRQGMCRRLLTGIESALCS 1062
+++G + A+L ++SAA +++ +E+AE+ I T + R++G LL+ IE+ L +
Sbjct: 120 DFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKA 179
Query: 1063 LNVEKLIIPAISELRETWTSVFGFQPLEVSSKQKM-RNMSLLVFPGVDMLQKPM 1115
NV L P E+ W+ G+ L K M L++F + ++QK +
Sbjct: 180 WNVRLLTAPVDPEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFANLSLVQKSL 233
>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
Length = 1235
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C CG + C
Sbjct: 1118 HEDFCFRCGDGGELVMCDKKDCPKAYHLLCLNLPQPPYGKWECPWHQCSECGSAATAFC 1176
Score = 40.8 bits (94), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 41/95 (43%), Gaps = 9/95 (9%)
Query: 811 DDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
D C +C GD L+ C+G C FH CL + P GK+ CV C G +C
Sbjct: 555 DTLCQVCESAGDSLLPCEGDCCRHFHLECLGLTSLPDGKFTCVECQ---TGLHPCFSCKT 611
Query: 869 NDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYE 903
+D S S C + YH++C + +E
Sbjct: 612 PGKDVKRCSV----SACGKFYHEACVRKFPTAVFE 642
>gi|413916094|gb|AFW56026.1| hypothetical protein ZEAMMB73_379838 [Zea mays]
Length = 1869
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 61/149 (40%), Gaps = 33/149 (22%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYCSCQ-------- 856
+D N D C ICG G L+CCDGCP +H C+ K P G W C C
Sbjct: 554 QDGNSDDCRICGMDGTLVCCDGCPWAYHSRCIGQNKAFLPQGDWFCPECMVNKLGPTSSR 613
Query: 857 --------------FCGRINESTCHVNDQDDSALSTLQICSLCEEKYHQSCSQTDGAVQY 902
CGR+ TC+ L ++ S E Y + +Q D A
Sbjct: 614 IERGARGAQMFGIDMCGRLFLGTCNY-------LLVIETASDA-ESYARYYNQYDVAKVL 665
Query: 903 EPSSLSFCGKK-CQEIFERLEKLLGVKHD 930
+ ++S C++I + + LLG+ H+
Sbjct: 666 QSLAVSDAYVDICKQIKDYWKHLLGIIHN 694
>gi|13992535|emb|CAC38114.1| KIAA0298 protein [Mus musculus]
gi|148684989|gb|EDL16936.1| tripartite motif-containing 66, isoform CRA_b [Mus musculus]
Length = 1209
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 962 NEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1007
>gi|358398564|gb|EHK47915.1| hypothetical protein TRIATDRAFT_129087 [Trichoderma atroviride IMI
206040]
Length = 1065
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 32/48 (66%), Gaps = 4/48 (8%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK---FPSGKWHCVYCS 854
ND+ C CG+ GD++CCDGCP +FH C+D+ + P +W+C CS
Sbjct: 632 NDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDHLPD-EWYCNECS 678
>gi|321454453|gb|EFX65623.1| hypothetical protein DAPPUDRAFT_1432 [Daphnia pulex]
Length = 646
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC 858
+G +D C +C GDL+CC+ CP+ +H +CLD ++ P+ W C C+ Q C
Sbjct: 246 EGNIKYNDHCRVCHKVGDLLCCETCPAVYHLHCLDPPLEHVPNEDWQCPICTAQLC 301
>gi|431905346|gb|ELK10391.1| Chromodomain-helicase-DNA-binding protein 4 [Pteropus alecto]
Length = 1854
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCL--DIKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CL D++K P GKW C +C
Sbjct: 366 ETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 414
>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
porcellus]
Length = 1438
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 810 NDDTCGICGDGGDLICCDG--CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTC 866
++D C CGDGG+L+ CD CP +H CL++ + P GKW C + C C S C
Sbjct: 1321 HEDYCFQCGDGGELVMCDKKDCPKAYHLLCLNLTQPPYGKWECPWHQCNMCSSAAVSFC 1379
Score = 46.2 bits (108), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 817 CGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
C GG L+CC+ CP++FH CL+I P G W+C C
Sbjct: 918 CEKGGRLLCCESCPASFHPECLNIDT-PEGCWNCNDC 953
Score = 41.2 bits (95), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 13/97 (13%)
Query: 811 DDTCGICGDGGD-LICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHV 868
D C IC GD L+ C+G C FH CL P GK+ C+ C G+ +C V
Sbjct: 701 DTVCQICESSGDSLVPCEGECYRYFHLECLGWTSVPDGKFTCMECK---TGQHPCFSCKV 757
Query: 869 NDQDDSALSTLQICSL--CEEKYHQSCSQTDGAVQYE 903
+D ++ CS+ C + YH++C + +E
Sbjct: 758 PGED------VKRCSVGACGKFYHEACVRKFSTAIFE 788
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 800 HFVNFDG----EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
HF G E N C +C +GG L+CCD CP+ FH+ CL+I P G W+C C
Sbjct: 1336 HFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1392
Score = 45.1 bits (105), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 811 DDTCGICGDGGDLICCDG-CPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRINESTCHVN 869
++ C C G+L+ C+ C FH CL + + P GK+ C C R TC V
Sbjct: 1187 ENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNEC------RTGIHTCFVC 1240
Query: 870 DQDDSALSTLQICSLCEEKYHQSCSQTDGAVQYEPSSLSFCGKKC 914
Q + + LC + YH+ C Q +Y P+ + G +C
Sbjct: 1241 KQSGEDVKRC-LLPLCGKFYHEECVQ-----KYPPTVMQNKGFRC 1279
>gi|198467663|ref|XP_001354469.2| GA17705 [Drosophila pseudoobscura pseudoobscura]
gi|198149350|gb|EAL31522.2| GA17705 [Drosophila pseudoobscura pseudoobscura]
Length = 951
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 8/81 (9%)
Query: 781 LQCILDSWNKQDESKRKGFH----FVNFDGEDPNDDTCGICGDGGDLICCDGCPSTFHQN 836
++ +++ +DE KG + + G N D C C +GG+L+CCD CPS+FH
Sbjct: 18 IKVLIEPPPNEDEKAVKGTNTKHPYYRRPGRGHNHDYCDACEEGGNLLCCDRCPSSFHLQ 77
Query: 837 CLDI----KKFPSGKWHCVYC 853
C D + PSG+W C C
Sbjct: 78 CHDPPLSEEDIPSGQWLCHSC 98
>gi|291384596|ref|XP_002708841.1| PREDICTED: tripartite motif-containing 66 [Oryctolagus cuniculus]
Length = 1211
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 964 NEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1009
>gi|345787769|ref|XP_542497.3| PREDICTED: tripartite motif-containing protein 66 [Canis lupus
familiaris]
Length = 1207
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP FH +C + FP G W C C
Sbjct: 960 NEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLC 1005
>gi|302803789|ref|XP_002983647.1| hypothetical protein SELMODRAFT_155996 [Selaginella moellendorffii]
gi|300148484|gb|EFJ15143.1| hypothetical protein SELMODRAFT_155996 [Selaginella moellendorffii]
Length = 1296
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
D E+ +D C IC GG ++CCD C + +H CLD +K P G W C C
Sbjct: 41 DLEEAVEDFCTICKSGGKVLCCDACTAVYHLQCLDPPMKSVPKGSWRCPKC 91
>gi|356544359|ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791832 [Glycine max]
Length = 1702
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
D D N D C +C G LICCDGCP+ FH C+ I P G W+C C
Sbjct: 669 DTTDWNSDECCLCKMDGSLICCDGCPAAFHSRCVGIASDHLPEGDWYCPEC 719
>gi|395815210|ref|XP_003781127.1| PREDICTED: tripartite motif-containing protein 66 [Otolemur
garnettii]
Length = 1211
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 964 NEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1009
>gi|219110357|ref|XP_002176930.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411465|gb|EEC51393.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 2413
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
Query: 808 DPNDD---TCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHCVYC 853
DP D C +CG GDL+CCDGC + H C+ + FP G W C C
Sbjct: 1888 DPTDGHCVYCIVCGLSGDLLCCDGCANVVHGKCIGLTSFPEGDWFCEEC 1936
>gi|187950891|gb|AAI38033.1| Tripartite motif-containing 66 [Mus musculus]
gi|219519295|gb|AAI45107.1| Tripartite motif-containing 66 [Mus musculus]
Length = 1242
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 995 NEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1040
>gi|284004879|ref|NP_862901.3| tripartite motif-containing protein 66 isoform 2 [Mus musculus]
gi|342187123|sp|Q924W6.3|TRI66_MOUSE RecName: Full=Tripartite motif-containing protein 66; AltName:
Full=Transcriptional intermediary factor 1 delta
Length = 1242
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 995 NEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1040
>gi|45861758|gb|AAS78677.1| transcriptional intermediary factor 1 delta short isoform [Mus
musculus]
Length = 1242
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 995 NEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1040
>gi|301627494|ref|XP_002942909.1| PREDICTED: tripartite motif-containing protein 66-like [Xenopus
(Silurana) tropicalis]
Length = 1207
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+++CCD CP FH +C + FP G+W C C
Sbjct: 965 NEDFCAVCLNGGEMLCCDRCPKVFHLSCHVPALLSFPVGEWLCTLC 1010
>gi|149068363|gb|EDM17915.1| rCG40452, isoform CRA_b [Rattus norvegicus]
Length = 1210
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNC--LDIKKFPSGKWHCVYC 853
N+D C +C +GG+L+CCD CP +H +C + FP G+W C C
Sbjct: 963 NEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLC 1008
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 9/82 (10%)
Query: 805 DGEDPNDDTCGICGDGGDLICCD--GCPSTFHQNCLDIKKFPSGKWHCVYCSCQFCGRIN 862
+G+ ++D C CGDGG L+ CD C +H +CL + K P GKW C + C CG+ +
Sbjct: 583 EGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPS 642
Query: 863 ESTCHV-------NDQDDSALS 877
S CH+ QD +A S
Sbjct: 643 TSFCHLCPNSFCKEHQDGTAFS 664
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 810 NDDTCGICGDGGDLICCDGCPSTFHQNCLDIKKFPSGKWHC 850
N C +C GG L+CC+ CP+ FH +CL+I + P G W C
Sbjct: 180 NVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSWFC 219
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 450 DDHMEYCRVCKDGGELLCCDTCISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 508
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 372 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 420
>gi|357155257|ref|XP_003577059.1| PREDICTED: uncharacterized protein LOC100838217 [Brachypodium
distachyon]
Length = 1786
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLDIKK--FPSGKWHCVYC 853
D N D C ICG G L+CCDGCP +H C+ + K P G W C C
Sbjct: 424 DGNSDDCRICGMDGTLVCCDGCPWAYHSRCIGLNKAFLPQGLWFCPEC 471
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 3/59 (5%)
Query: 808 DPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYCSCQFC-GRINE 863
D + + C +C DGG+L+CCD C S++H +CL+ + P+G+W C C+C GR+ +
Sbjct: 450 DDHMEYCRVCKDGGELLCCDTCISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQK 508
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 807 EDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
E + D C +C GG++I CD CP +H CLD + + P GKW C +C
Sbjct: 372 ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC 420
>gi|302817780|ref|XP_002990565.1| hypothetical protein SELMODRAFT_185367 [Selaginella moellendorffii]
gi|300141733|gb|EFJ08442.1| hypothetical protein SELMODRAFT_185367 [Selaginella moellendorffii]
Length = 1296
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 805 DGEDPNDDTCGICGDGGDLICCDGCPSTFHQNCLD--IKKFPSGKWHCVYC 853
D E+ +D C IC GG ++CCD C + +H CLD +K P G W C C
Sbjct: 41 DLEEAVEDFCTICKSGGKVLCCDACTAVYHLQCLDPPMKSVPKGSWRCPKC 91
>gi|390346322|ref|XP_799097.3| PREDICTED: PHD finger protein 12-like [Strongylocentrotus
purpuratus]
Length = 1172
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 806 GEDPNDDTCGICGDGGDLICCDGCPSTFHQNC----LDIKKFPSGKWHCVYCS 854
G N D+C C +GGDLICCD CP++FH C + + PSG+W C C+
Sbjct: 32 GRATNRDSCDSCTEGGDLICCDRCPASFHLQCCNPPISEEDLPSGEWLCHRCT 84
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.313 0.131 0.382
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 25,260,961,062
Number of Sequences: 23463169
Number of extensions: 1146936544
Number of successful extensions: 2799234
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2328
Number of HSP's successfully gapped in prelim test: 9233
Number of HSP's that attempted gapping in prelim test: 2692682
Number of HSP's gapped (non-prelim): 83393
length of query: 1562
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1406
effective length of database: 8,698,941,003
effective search space: 12230711050218
effective search space used: 12230711050218
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)