BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002950
(863 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 990 bits (2560), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 534/889 (60%), Positives = 636/889 (71%), Gaps = 59/889 (6%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDN-VESDSFPNGKHAKEEALNDDIKS 59
MGE+AVC+++ + + + +S TE K+D + L DN E++SFPN K AKE N+DIKS
Sbjct: 1 MGEAAVCVDITTENDNN---TSGTELKRDHQFLIDNDTETESFPNKKQAKES--NEDIKS 55
Query: 60 EVSNPVVSPKEFTSS------LQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDE 113
EVSNP++SPKE ++ DITSQ T+ ++ + G + E T +
Sbjct: 56 EVSNPIISPKENNNNNASSSSWHDITSQPTE-------ELATANQLGGVGGEVTSTISGN 108
Query: 114 RTESSYAETSRTDNNS----GD---VSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYV 166
SS E S +NN+ GD VS SHVVLEIPKH +S++GIRKITFKFSKRKEDY
Sbjct: 109 SCPSS--EHSSENNNASICNGDCDSVSTSHVVLEIPKH-ASTTGIRKITFKFSKRKEDYD 165
Query: 167 APLAYE----EGRNYTLYD---DLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLL 219
L E R ++ D ++ +G D CA NME+KMSKKV+PN +P+NVKKLL
Sbjct: 166 TRLNQELSPSRSREFSWVDSGTEMPETG--DRYFCAPNMELKMSKKVLPNTFPSNVKKLL 223
Query: 220 STGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPN 279
STGILDGA VKYIS RE L GI++GGGYLCGCP CNFS+V++A+EFE HAGAKTRHPN
Sbjct: 224 STGILDGARVKYISPQRE--LYGIIDGGGYLCGCPSCNFSRVLTAYEFELHAGAKTRHPN 281
Query: 280 NHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHD 339
NHIYLENGKPI SIIQELK APLG ++EV+K AGSS NE F+VWKAS H G++ D
Sbjct: 282 NHIYLENGKPICSIIQELKAAPLGAVDEVIKDAAGSSINEEFFQVWKASLHQCNGIIGAD 341
Query: 340 EKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRP-- 397
EK LP PHS+ S SS LEES P SFV + + EE KR +RP
Sbjct: 342 EKCYSMLPYSPHSLGSYSSQGLEESGCPPCSSFVHSNPFRRQKYMDSSEEHKRAFRRPSS 401
Query: 398 FMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCN 457
H K+T EGGT++RDNDLHRLLF+PNGLPDG L Y +KGQ++ G KQGNGIVC CC+
Sbjct: 402 LSHPKKTNEGGTRRRDNDLHRLLFMPNGLPDGAELAYYIKGQKMLAGYKQGNGIVCSCCD 461
Query: 458 KEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDG 517
+EISPSQFEAHAGMAARRQPYRHIYTSNG+TLHDIA SLA GQ TTG SDDMC CGDG
Sbjct: 462 REISPSQFEAHAGMAARRQPYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDG 521
Query: 518 ENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ-GHSSSMSRSV--DLKGGLEAPGAE 574
+L+ C CP AFH CL +P W CPNC + GH + SRS+ L ++ P E
Sbjct: 522 GDLIFCESCPRAFHLVCLGLKYVPSDVWHCPNCNKFGHGGNFSRSIVIRLTRVVKTPEYE 581
Query: 575 VGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 634
VGGCV CR+HDFS TF+DRTVI CDQCE+EFHVGCLR NGLCDLKE
Sbjct: 582 VGGCV-------------FCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLKE 628
Query: 635 IPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQM 694
IPKD WFC +DCNRI+ ALQ+FVS+ Q IP+ L+ I KH EKG+ DG ND QW++
Sbjct: 629 IPKDNWFCSNDCNRIYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDFQWRI 688
Query: 695 LKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVIL 754
L ++E + SLLS+A AIFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY V+L
Sbjct: 689 LMGKSRYQE-DLSLLSAAAAIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCVLL 747
Query: 755 TVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPA 814
VK+VVVSAGLLRIFGR+VAELPLVAT RE+QGKG FQALFSCIERLLCSLNV LVLPA
Sbjct: 748 LVKNVVVSAGLLRIFGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPA 807
Query: 815 AEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
AE+AESIWT++FGFRKM+ E+L +Y R+ QLTIFKGTSMLEK+V + E
Sbjct: 808 AEEAESIWTRRFGFRKMTEEQLSQYTRELQLTIFKGTSMLEKEVPLMNE 856
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 978 bits (2529), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/884 (58%), Positives = 628/884 (71%), Gaps = 64/884 (7%)
Query: 1 MGESAVCL------EVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALN 54
MGE CL +V DG+ + E SSR E K+D E + N+E+++ P K KE ALN
Sbjct: 1 MGEGEGCLPVVKDIKVLKDGKTESENSSRMELKRDHECIAGNIETEASPRKKPVKE-ALN 59
Query: 55 DDIKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD 110
++ SEVSNP++SPK SS+Q ITSQ +L S NQ +++TS+GN S E+LSD
Sbjct: 60 EEGCSEVSNPILSPKYNASSVQTITSQVAELA--STNQAVLGEITSTSSGN-SVPESLSD 116
Query: 111 GDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLA 170
E SR ++ G VS + VVLEIPKHVSS+ GIRKITFKFSK KE Y
Sbjct: 117 ---------EEHSRNGSSDG-VSTTQVVLEIPKHVSST-GIRKITFKFSKSKEAY----- 160
Query: 171 YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVK 230
R + + + A NME+KMSKKVVP YPTNVKKLLSTGILDGA VK
Sbjct: 161 ---NRTNMRVNTCWNLETRNLHFRAPNMELKMSKKVVPKSYPTNVKKLLSTGILDGALVK 217
Query: 231 YISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPI 290
YISTSRE++L G++ GYLCGC CNF+KV++A+EFEQHAG +TRHPNNHIYLENGKPI
Sbjct: 218 YISTSREKELQGVIRESGYLCGCSACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPI 277
Query: 291 YSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLP 350
YSIIQ+LKTAPL L+EV+K +AGSS N F+ WKAS H G+ E DE ++ +L + P
Sbjct: 278 YSIIQQLKTAPLSDLDEVIKNIAGSSVNMECFKAWKASFHQNNGVTEADENYHAQLLNHP 337
Query: 351 HSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPF-------MHQKR 403
SI+S A+E+S + + +++ MKEM +ERK K+P + K+
Sbjct: 338 QSIVSFPVQAVEDSFTGSRLPL-----KQKELMKEMTQERKHAAKKPSSYIYGSGLQHKK 392
Query: 404 TAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPS 463
++EG KKRDNDLHRLLF+PNGLPDG L Y VKGQR+ G KQGNGIVC C+ E+SPS
Sbjct: 393 SSEGAIKKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDSEVSPS 452
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLC 523
QFEAHAG AARRQPYRHIYTSNG+TLHDIAISLA GQ TTG SDDMC +CGDG +L+LC
Sbjct: 453 QFEAHAGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILC 512
Query: 524 NGCPLAFHAACLDPLLIPESGWRCPNCRQGH--SSSMSRSV--DLKGGLEAPGAEVGGCV 579
+GCP AFH ACL+ +PE WRCP C + ++R + L ++AP +E+GGCV
Sbjct: 513 DGCPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKVARPIRIQLTRAVKAPESEIGGCV 572
Query: 580 ICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 639
+CR +HDFS + FDDRTV+ CDQCEKEFHVGCLR +GLCDLKE+PKDK
Sbjct: 573 VCR-------------AHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDK 619
Query: 640 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ 699
WFCCDDC+R+H ALQ+ S + IPAS S INRK++EKG L DG +D+QW +L
Sbjct: 620 WFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKG-LIDGAADDIQWCILSGKS 678
Query: 700 CFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSV 759
C++E LLS TAIFRECFDPI+A GRDLIPVMVYGRNISGQEFGGMY V+L KS
Sbjct: 679 CYKE-HLPLLSRTTAIFRECFDPIVASSGRDLIPVMVYGRNISGQEFGGMYCVVLLAKST 737
Query: 760 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 819
VVSAGL+R+FG+EVAELP+VAT +E+QGKG F+ALFSCIE LL SL V+ LVLPAAE+AE
Sbjct: 738 VVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAE 797
Query: 820 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
+IWT K GF+KMS ER+LKY R+ QLTIFKGTSMLEK+V C+ E
Sbjct: 798 AIWTNKLGFQKMSEERMLKYTRELQLTIFKGTSMLEKEVPCIVE 841
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 978 bits (2528), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/912 (56%), Positives = 635/912 (69%), Gaps = 84/912 (9%)
Query: 1 MGESAVCL------EVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALN 54
MGE CL +V DG+ + E SSR E K+D E + N+E+++ P K KE ALN
Sbjct: 20 MGEGEGCLPVVKDIKVLKDGKTESENSSRMELKRDHECIAGNIETEASPRKKPVKE-ALN 78
Query: 55 DDIKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD 110
++ SEVSNP++SPK SS+Q ITSQ +L S NQ +++TS+GN S E+LSD
Sbjct: 79 EEGCSEVSNPILSPKYNASSVQTITSQVAELA--STNQAVLGEITSTSSGN-SVPESLSD 135
Query: 111 GDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLA 170
E SR ++ G VS + VVLEIPKHV SS+GIRKITFKFSK KE Y + L+
Sbjct: 136 ---------EEHSRNGSSDG-VSTTQVVLEIPKHV-SSTGIRKITFKFSKSKEAYNSKLS 184
Query: 171 YEE-------GRNYTLYDDLGSSGANDGV---------------------LCARNMEIKM 202
E G +++ G G N A NME+KM
Sbjct: 185 SEPLHVLGRVGNSHSYIGYPGDPGRNIASPDTGTNMRVNTCWNLETRNLHFRAPNMELKM 244
Query: 203 SKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVV 262
SKKVVP YPTNVKKLLSTGILDGA VKYISTSRE++L G++ GYLCGC CNF+KV+
Sbjct: 245 SKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKELQGVIRESGYLCGCSACNFTKVL 304
Query: 263 SAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF 322
+A+EFEQHAG +TRHPNNHIYLENGKPIYSIIQ+LKTAPL L+EV+K +AGSS N F
Sbjct: 305 TAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAGSSVNMECF 364
Query: 323 RVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTH 382
+ WKAS H G+ E DE ++ +L + P SI+S A+E+S + + +++
Sbjct: 365 KAWKASFHQNNGVTEADENYHAQLLNHPQSIVSFPVQAVEDSFTGSRLPL-----KQKEL 419
Query: 383 MKEMLEERKRGVKRPF-------MHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYI 435
MKEM +ERK K+P + K+++EG KKRDNDLHRLLF+PNGLPDG L Y
Sbjct: 420 MKEMTQERKHAAKKPSSYIYGSGLQHKKSSEGAIKKRDNDLHRLLFMPNGLPDGAELAYY 479
Query: 436 VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS 495
VKGQR+ G KQGNGIVC C+ E+SPSQFEAHAG AARRQPYRHIYTSNG+TLHDIAIS
Sbjct: 480 VKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGWAARRQPYRHIYTSNGLTLHDIAIS 539
Query: 496 LAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH- 554
LA GQ TTG SDDMC +CGDG +L+LC+GCP AFH ACL+ +PE WRCP C +
Sbjct: 540 LANGQNCTTGDSDDMCTLCGDGGDLILCDGCPRAFHPACLELQCLPEGDWRCPCCVENFC 599
Query: 555 -SSSMSRS--VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIY 611
++R + L ++AP +E+GGCV+ CR+HDFS + FDDRTV+
Sbjct: 600 PDRKVARPIRIQLTRAVKAPESEIGGCVV-------------CRAHDFSVSKFDDRTVML 646
Query: 612 CDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLST 671
CDQCEKEFHVGCLR +GLCDLKE+PKDKWFCCDDC+R+H ALQ+ S + IPAS S
Sbjct: 647 CDQCEKEFHVGCLRDSGLCDLKELPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSM 706
Query: 672 INRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDL 731
INRK++EKG L DG +D+QW +L C++E LLS TAIFRECFDPI+A GRDL
Sbjct: 707 INRKNLEKG-LIDGAADDIQWCILSGKSCYKE-HLPLLSRTTAIFRECFDPIVASSGRDL 764
Query: 732 IPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 791
IPVMVYGRNISGQEFGGMY V+L KS VVSAGL+R+FG+EVAELP+VAT +E+QGKG F
Sbjct: 765 IPVMVYGRNISGQEFGGMYCVVLLAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGKGFF 824
Query: 792 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 851
+ALFSCIE LL SL V+ LVLPAAE+AE+IWT K GF+KMS ER+LKY R+ QLTIFKGT
Sbjct: 825 RALFSCIEELLSSLGVKTLVLPAAEEAEAIWTNKLGFQKMSEERMLKYTRELQLTIFKGT 884
Query: 852 SMLEKKVQCLPE 863
SMLEK+V C+ E
Sbjct: 885 SMLEKEVPCIVE 896
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 976 bits (2522), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/858 (60%), Positives = 610/858 (71%), Gaps = 58/858 (6%)
Query: 33 LTDNVESDSFPNGKHAK-EEALNDDIKSEVSNPV---VSPKEFTSSLQDITSQETKLVTE 88
+ +N + FPN K +K EEA NDD KSEVSNPV VSPK SS DI+ +
Sbjct: 31 VNENETGELFPNKKQSKQEEASNDDTKSEVSNPVRTLVSPKGNGSSSHDISEESPTNACP 90
Query: 89 SCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSS 148
S + L+ + G SS +E + + S ++ VS S VVL+IP+H +S+
Sbjct: 91 SSEETLTVSQEGGGSS----------SEDNTSHQSLRNDTCDSVSMSPVVLKIPEH-AST 139
Query: 149 SGIRKITFKFSKRKEDYVA------PL--AYEEG----RNYTLYDDLGSSGANDGVLCA- 195
+G+RKITFKFSKRKEDY PL ++G RN Y S N
Sbjct: 140 TGVRKITFKFSKRKEDYDTKTSSPHPLHGGIDQGLLYHRNGDYYPRNHSVWVNSCTEMPQ 199
Query: 196 ---RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCG 252
R +E+ MSKKVVPN YPTNVKKLL+TGILD A VKYI S ER+LDGI++GGGYLCG
Sbjct: 200 TRERYVELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERELDGIIDGGGYLCG 259
Query: 253 CPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKV 312
C C+FSKV+SA+EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPL +++ V+K V
Sbjct: 260 CSSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDV 319
Query: 313 AGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSF 372
AGSS NE FRVWKAS + LV D+K +LP LPHS +S +S AL+ES P S SF
Sbjct: 320 AGSSINEEFFRVWKASLNQSNALVGADKKSYSELPCLPHSHVSYASQALKESFCPISSSF 379
Query: 373 VQDTG--REQTHMKEMLEERKRGVKRPFMH-------QKRTAEGGTKKRDNDLHRLLFLP 423
+ + +QT+M E K+ KRP + QK+TAE G +KRDNDLHRLLF+P
Sbjct: 380 LYNNNFVSQQTNM-ETSGVNKQTSKRPSFYVPGSATKQKKTAESGVRKRDNDLHRLLFMP 438
Query: 424 NGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
NGLPDG L Y VKGQ++ G KQGNGIVC CC EISPSQFE+HAGM+ARRQPYRHIYT
Sbjct: 439 NGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGMSARRQPYRHIYT 498
Query: 484 SNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
SN +TLHDIAISLA GQ TTG DDMC CGDG +L+ C CP AFHAACLD PE
Sbjct: 499 SNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEG 558
Query: 544 GWRCPNCRQ-GHSSSMSRSV--DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFS 600
W CPNC + GH + +R + L ++ P +VGGC +CR +HDFS
Sbjct: 559 AWHCPNCNKLGHGGNFARPIVIRLTRVVKTPEYDVGGCAVCR-------------AHDFS 605
Query: 601 AATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNR 660
TFDDRTVI CDQCEKEFHVGCLR++GLCDLKEIPKD WFCC DCN I+ AL++ VS
Sbjct: 606 GDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTG 665
Query: 661 AQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECF 720
QTIP S L+TINRKH+EKG+L D DVQWQ+L + ++ SLLS A AIFRECF
Sbjct: 666 VQTIPVSLLNTINRKHVEKGLLVDEAAYDVQWQILM-GKSRNREDLSLLSGAAAIFRECF 724
Query: 721 DPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVA 780
DPI+A+ GRDLIPVMVYGRNISGQEFGGMY V+LTV+ VVVSAGLLRIFGREVAELPLVA
Sbjct: 725 DPIVAKTGRDLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVA 784
Query: 781 TCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQ 840
T RE+QGKG FQALFSCIERLLCSLNVE LVLPAAE+AESIWT++FGFRKMS +LLKY
Sbjct: 785 TNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAESIWTRRFGFRKMSEGQLLKYT 844
Query: 841 RDFQLTIFKGTSMLEKKV 858
R+FQLTIFKGTSMLEK+V
Sbjct: 845 REFQLTIFKGTSMLEKEV 862
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 947 bits (2448), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/900 (56%), Positives = 616/900 (68%), Gaps = 93/900 (10%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE VC+ +G+ + SRTE K+D + + E PN K AKE + ND+++SE
Sbjct: 1 MGEEVVCVHALEEGKQENNEESRTELKRDYDQCVADTEPHLSPNKKQAKEVS-NDEVRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD--GDER 114
VSNP VS E + QDI+SQ T+ +E+ N L++T N SS+ETLSD G++
Sbjct: 60 VSNPNVSAAELAQTFQDISSQPTE--SENVNHAECGELTSTCLENSSSDETLSDEAGEQN 117
Query: 115 TESSYAETSRTDNNSGDVS-KSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVA------ 167
++ TS++D ++ + S VV+EIPKH +SSSGIRKITFKFSK+KEDY
Sbjct: 118 NNNNNNNTSQSDKDTSSAAMTSRVVMEIPKH-ASSSGIRKITFKFSKKKEDYDYQPPPPM 176
Query: 168 --PLAYEEG---------RNYTLYDD-----------LGSSGANDGVLCARNMEIKMSKK 205
P Y +G Y DD +G D L RNME+KMSKK
Sbjct: 177 HHPALYNDGNYIGFHGDDEEYLARDDCSGGSLESPCGMGYVRDGDLDLYTRNMELKMSKK 236
Query: 206 VVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAH 265
VVPN YPTNVKKLLSTGILDGA VKYI + +L GI++GGGYLCGC +CN+S+V+SA+
Sbjct: 237 VVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQGIIDGGGYLCGCSMCNYSRVLSAY 296
Query: 266 EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW 325
EFEQHAGAKTRHPNNHI+LENG+PIYSIIQE+KTAPL +L+EV+K VAGSS NE SF+ W
Sbjct: 297 EFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIKNVAGSSVNEESFQAW 356
Query: 326 KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKE 385
K S G V+ + ++ KL +PH+ +I P+S + ++G
Sbjct: 357 KESLLQSNGKVQAHKSYSTKLVGMPHT-----------NIRPSS--YTSNSG-------- 395
Query: 386 MLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGC 445
+ QKR+A+G TK+RDNDLHRLLF+PNGLPDG L Y VKGQ+L G
Sbjct: 396 -------------VLQKRSADGCTKRRDNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGY 442
Query: 446 KQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG 505
KQGNGIVC CC+ EISPSQFEAHAGMAARRQPYRHIYTSNG+TLHDIA+SLA GQ TTG
Sbjct: 443 KQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYTSNGLTLHDIALSLANGQNLTTG 502
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC----RQGHSSSMSRS 561
SDDMC VCGDG +L+LCNGCP AFHAACL +P+SGW+C NC G SS+ R
Sbjct: 503 DSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVPDSGWQCLNCIDNAGNGRESSIVRP 562
Query: 562 VDLKGGL--EAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ ++ + P E+GGCV+ CR HDFS A FD+RTVI CDQCEKE+
Sbjct: 563 IMIRLTRVDKTPEVEMGGCVV-------------CREHDFSVAKFDERTVIICDQCEKEY 609
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
HVGCLR GLC+L+E+PKDKWFCCDDCNRI+AALQ+ VS A+ IPAS I RKH +K
Sbjct: 610 HVGCLRDMGLCELEELPKDKWFCCDDCNRIYAALQNSVSAGAEIIPASFSELIIRKHEDK 669
Query: 680 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGR 739
G+ G MND+QW++L + E LLS A AIFRECFDPI+A GRDLIPVMVYGR
Sbjct: 670 GLCTYGAMNDIQWRILSGKSRYPE-HLPLLSRAAAIFRECFDPIVAISGRDLIPVMVYGR 728
Query: 740 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 799
NISGQEFGGMY ++L V VVVSAGLLRIFGR VAELPLVAT R +QGKG FQ LFSCIE
Sbjct: 729 NISGQEFGGMYCIVLIVNYVVVSAGLLRIFGRNVAELPLVATSRAHQGKGYFQVLFSCIE 788
Query: 800 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
RLL SLNVE LVLPAA AESIWTKK GFRKMS ++L K+ R+ QLT+F TSMLEK VQ
Sbjct: 789 RLLSSLNVEKLVLPAAGDAESIWTKKLGFRKMSEDQLSKHLREVQLTLFNKTSMLEKTVQ 848
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 947 bits (2447), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 506/897 (56%), Positives = 620/897 (69%), Gaps = 97/897 (10%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE A L + + E +S TE K+D + L ++ E +S N K AKE + N+D++SE
Sbjct: 1 MGEEAAGLAAVTNETLGKENASSTELKRDHQCLDEDTEPESLHNKKQAKEVS-NEDVRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSDGDERTE 116
VSNPVVSPKE + DITSQ ++ E+ QV L++ +GN SSE+ +S G R +
Sbjct: 60 VSNPVVSPKE--NHFHDITSQPEEV--ENTTQVERGELTSACSGNSSSED-ISSGGVRCQ 114
Query: 117 SSYAETSRTDNNSGDVSK-SHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGR 175
+ + S+ D + DV++ S VV+EIPKH +SS+GIRKITFKFSK+K + A ++ ++
Sbjct: 115 N---DMSQNDVDMCDVNEVSRVVIEIPKH-ASSTGIRKITFKFSKKKGNNGASVSADKVH 170
Query: 176 NY-----------TLYDDL----------GSSGANDGVLCARNMEIKMSKKVVPNEYPTN 214
+Y +L DD GS+ ++ L ME+KMSKKV+PN YP+N
Sbjct: 171 SYGNSDRDGKPEPSLLDDACTETSAHSCEGSAESSRYSLGPNKMELKMSKKVLPNNYPSN 230
Query: 215 VKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAK 274
VKKLLSTGILDGA VKY+ST+ E +L GI+NGGGY+CGC CNF+ ++SA+EFEQHAG K
Sbjct: 231 VKKLLSTGILDGARVKYVSTTSEMKLQGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFK 290
Query: 275 TRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKG 334
TRHPNNHIYLENG+PIYS+IQE+K+APL IL+EV+ +VAGSS N SF WKAS H
Sbjct: 291 TRHPNNHIYLENGRPIYSVIQEIKSAPLSILDEVIMEVAGSSVNMNSFEAWKASFHQDSA 350
Query: 335 LVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGV 394
+ E H++KLP L H + E +P + V
Sbjct: 351 NIVV-ENHDVKLPKLSHPV---------ERPNPNFSNAV--------------------- 379
Query: 395 KRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCD 454
+ K+TAE GTK+RDNDLHRLLF+PNGLPDG L Y VKGQR+ G KQGNGI+C
Sbjct: 380 ----LQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVKGQRILGGFKQGNGILCS 435
Query: 455 CCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC 514
CN+EISPSQFEAHAGMAARRQPYRHIYT+NG+TLHDIAISLA GQ+ TTG SDDMC C
Sbjct: 436 HCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDDMCAAC 495
Query: 515 GDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ----------GHSSSMSRSV-- 562
G+G +L+ C+ CP A+H CL +PE W CPNCR G S S S+ +
Sbjct: 496 GNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDKVGSNSKAISGGSLSFSKPIVF 555
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
L ++AP E+GGCV+CR HDFSAA FDDRTV+ CDQCE+EFHVG
Sbjct: 556 RLTRVVKAPEYEIGGCVVCR-------------RHDFSAAKFDDRTVLLCDQCEREFHVG 602
Query: 623 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 682
CLR +GLCDLKE+PKDKWFCCD+C+ IH ALQ+ V N AQ IP S I RKH+ KG+L
Sbjct: 603 CLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLL 662
Query: 683 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNIS 742
D +NDV+WQ+L F E + LS ATAIFRECFDPI+A+ GRDLIPVMVYGRNIS
Sbjct: 663 VDEALNDVRWQILSGKSRFPE-DLPFLSRATAIFRECFDPIVAKSGRDLIPVMVYGRNIS 721
Query: 743 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 802
GQEFGGMY V+L V+S+VVSAGLLRIFGREVAELP+VAT RE+QGKG FQ LFSCIERLL
Sbjct: 722 GQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSREHQGKGYFQVLFSCIERLL 781
Query: 803 CSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
SLNV+NLVLPAAE AESIWTKK GFRKMS E+L+KY R+ QLTIF GTSMLEK V+
Sbjct: 782 SSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREVQLTIFNGTSMLEKVVE 838
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 942 bits (2436), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 512/903 (56%), Positives = 617/903 (68%), Gaps = 96/903 (10%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE A C+ +G+ + SRTE K+D + + E + PN K AKE + ND+++SE
Sbjct: 1 MGEEADCVHALEEGKKENNEESRTELKRDYDQCVADTEPNVSPNKKQAKEVS-NDEVRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD-----G 111
VSNP VS E + QDI+SQ T+ +E+ N L++T N SS+ETLSD
Sbjct: 60 VSNPNVSAAEHALTFQDISSQPTE--SENVNHAECGELTSTCLENSSSDETLSDEAGEHN 117
Query: 112 DERTESSYAETSRTDNNSGDVS-KSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVA--- 167
+ ++ TS++D ++G + S VV+EIPKHVSSS GIRKITFKFSK+KEDY
Sbjct: 118 NNNNNNNNNNTSQSDKDTGSAAMTSCVVMEIPKHVSSS-GIRKITFKFSKKKEDYDYQPP 176
Query: 168 -----PLAYEEGRNYTLYDD--------------------LGSSGANDGVLCARNMEIKM 202
P Y +G + + D +G D L RNME+KM
Sbjct: 177 PAVHHPALYNDGNHIGFHGDDEEYLARDDCSGGSLESPCGMGYVHDGDLDLYTRNMELKM 236
Query: 203 SKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVV 262
SKKVVPN YPTNVKKLLSTGILDGA VKYI + +L GI++GGGYLCGC +CN+S+V+
Sbjct: 237 SKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQGIIDGGGYLCGCSMCNYSRVL 296
Query: 263 SAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF 322
SA+EFEQHAGAKTRHPNNHI+LENG+PIYSIIQE+KTAPL IL+EV+K VAGSS NE SF
Sbjct: 297 SAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIKNVAGSSVNEESF 356
Query: 323 RVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTH 382
+ WK S G V+ + ++ KL +PH+ +I P+S + +TG
Sbjct: 357 QAWKESLLQSNGKVQAHKSYSTKLVGMPHT-----------NIRPSS--YTSNTG----- 398
Query: 383 MKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLR 442
+ QKR+A+G TK+RDNDLHRLLF+PNGLPDG L Y VKGQ+L
Sbjct: 399 ----------------VLQKRSADGCTKRRDNDLHRLLFMPNGLPDGAELAYYVKGQKLL 442
Query: 443 FGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR 502
G KQGNGIVC CC+ EISPSQFEAHAGMAARRQPYRHIYTSNG+TLHDIA+SLA GQ
Sbjct: 443 GGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYTSNGLTLHDIALSLANGQNL 502
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCR----QGHSSSM 558
TTG SDDMC VCGDG +L+LCNGCP AFHAACL +P+SGW+C NCR G SS+
Sbjct: 503 TTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVPDSGWQCLNCRDNAGNGRESSI 562
Query: 559 SRSVDLKGGL--EAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
R + ++ + P E+GGCV+ CR HDFS A FD+RTVI CDQCE
Sbjct: 563 VRPIMIRLTRVDKTPEFEMGGCVV-------------CREHDFSVAKFDERTVIICDQCE 609
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
KE+HVGCLR GLC+L+E+PKDKWFCCDDCNRI+ ALQ+ V+ A+ IPAS I RKH
Sbjct: 610 KEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQNSVAAGAEIIPASVSELIIRKH 669
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMV 736
+KG+ G MND+QW++L + E LLS A AIFRECFDPI+A GRDLIPVMV
Sbjct: 670 EDKGLCTYGAMNDIQWRILSGKSRYPE-HLPLLSRAAAIFRECFDPIVAISGRDLIPVMV 728
Query: 737 YGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 796
YGRNISGQEFGGMY ++L V SVVVSAGLLRIFGR VAELPLVAT R +QGKG FQ LFS
Sbjct: 729 YGRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFGRNVAELPLVATSRAHQGKGYFQVLFS 788
Query: 797 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
CIERLL SLNVE LVLPAA AESIWTKK GFRKMS ++L K+ R+ QLT+F TSMLEK
Sbjct: 789 CIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKMSEDQLSKHLREVQLTLFNKTSMLEK 848
Query: 857 KVQ 859
VQ
Sbjct: 849 TVQ 851
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/917 (55%), Positives = 609/917 (66%), Gaps = 81/917 (8%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE AVC++ DG+ + SR E K+D + E++ PN K AKE + ND+++SE
Sbjct: 1 MGEEAVCVQELVDGKTENTDESRLELKRDYNQCVADTEANVPPNKKQAKEVS-NDELRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQVLSTTSTG--NLSSEETLSD-GDERTES 117
V+NP VS E + DI+SQ T+ S + TSTG N SS +T+SD R
Sbjct: 60 VTNPNVSATEHAQTFHDISSQPTESENVSHAECGELTSTGLENSSSHDTVSDEAGVRNND 119
Query: 118 SYAETSRTDNNSGDVSK---SHVVLEIPKHVSSSSGIRKITFKFSKRKEDY---VAPLAY 171
S + N+ G S S VV+EIPKH +SS+GIRKITFKFSKRKEDY P Y
Sbjct: 120 SDNINNLCQNDKGTSSNDAVSRVVMEIPKH-ASSTGIRKITFKFSKRKEDYDDYQTPTGY 178
Query: 172 EEGRN------------------------YTLYDDLGSSG------------ANDGVLCA 195
+G Y DD ++G D L +
Sbjct: 179 TDGSGSDYGFGYGNGSGYGYGYGYHGDDEYLANDDYNNNGLVESSYGRGYVPYEDSELYS 238
Query: 196 RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL 255
NME+KMSKKVVPN +P NVKKLLSTGILDGA VKYI + +LDGI+ GGYLCGC +
Sbjct: 239 GNMELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYNPGKVELDGIIGDGGYLCGCSM 298
Query: 256 CNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGS 315
C++S+V+SA+EFEQHAGAKTRHPNNHI+LENGKPIYSII E+KTA +EV+K VAGS
Sbjct: 299 CSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKNVAGS 358
Query: 316 SFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQD 375
S NEGSF+VWK S V +K++ K +PH+ S S + S V++
Sbjct: 359 SINEGSFQVWKESLLQSNKKVPTQKKYSTKSTGIPHTYNSQSIESASSFSSLR----VRN 414
Query: 376 TGREQTHMKEMLEERKRGVKRPFMH-------QKRTAEGGTKKRDNDLHRLLFLPNGLPD 428
+Q ++ + +E KR VK+P + QKR+A+G TKKRDNDLHRLLF+PNGLPD
Sbjct: 415 HFEQQMYVNQTADEWKRVVKKPSTYTYYSGIPQKRSADGCTKKRDNDLHRLLFMPNGLPD 474
Query: 429 GERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT 488
G L Y VKGQ+L G KQGNGIVC CC+ EISPSQFEAHAGMAARRQPYRHIY SNG+T
Sbjct: 475 GAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYASNGLT 534
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
LHDIA+SLA GQ TTG SDDMC VCGDG +L+LCNGCP AFHAACL +PESGW C
Sbjct: 535 LHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLHSVPESGWHCL 594
Query: 549 NC------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAA 602
NC +G M R + + P EVGGCV+ CR++DFS
Sbjct: 595 NCEDNTGDERGARPIMIRLTRVD---KEPEYEVGGCVV-------------CRANDFSVD 638
Query: 603 TFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 662
FDDRTVI CDQCEKE+HVGCLR GLC+L+E+PKDKWFCCDDCNRI+ ALQ+ VS A
Sbjct: 639 KFDDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQNSVSAGAD 698
Query: 663 TIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 722
TIP+S I RKH ++G+ G MND+QW++L + E LLS A AIFRECFDP
Sbjct: 699 TIPSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKSRYAE-HLPLLSRAAAIFRECFDP 757
Query: 723 IIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATC 782
I+A GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSAGLLRIFGR +AELPLVAT
Sbjct: 758 IVAISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSIVVSAGLLRIFGRNIAELPLVATS 817
Query: 783 REYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD 842
RE+QGKG FQALFSCIERLL SLNVE LVLPAA AESIWTKK GF KMS ++L K+ ++
Sbjct: 818 REHQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFHKMSEDQLTKHLKE 877
Query: 843 FQLTIFKGTSMLEKKVQ 859
QLT+F TS+LEK VQ
Sbjct: 878 VQLTLFNKTSVLEKMVQ 894
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/904 (53%), Positives = 598/904 (66%), Gaps = 105/904 (11%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE +CLE+P + E SS + K RE L ++ + D FPN K AKE A NDDI SE
Sbjct: 1 MGEGTICLEMPME-----ENSSMGQFK--RERLDESNKGDHFPNKKQAKE-ASNDDITSE 52
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYA 120
+SNPV SP E TS +D++SQ K C S + SEET+SD ++S A
Sbjct: 53 ISNPVASPVESTSLFRDVSSQPVKSGLGEC-------SGSDFGSEETVSD-----DASVA 100
Query: 121 ETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLY 180
+S+T+ +S DV S VLEIPKH+SS+ GI KITFK SK K+++
Sbjct: 101 GSSQTEQSS-DVLPSRFVLEIPKHLSST-GITKITFKLSKPKKEF--------------- 143
Query: 181 DDL---GSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE 237
DDL + GV +KM KK+V YP+NVKKLL TGIL+GA VKYIST
Sbjct: 144 DDLPVIKDHTCDVGV-------VKMPKKIVALSYPSNVKKLLETGILEGAPVKYISTPPV 196
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R+L GI++ GGYLCGC C+FSKV+SA+EFE HAGAKTRHPNNHI+LENG+ +Y+I+QEL
Sbjct: 197 RELQGIIHSGGYLCGCTTCSFSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQEL 256
Query: 298 KTAPLGILEEVVKKVAGSSFNEGSFRVWKA-----------------SHHLRKGLVEHDE 340
KTAP +LEEV++ VAGS+ NE + WK S ++ ++ D
Sbjct: 257 KTAPRDVLEEVIRNVAGSALNEEGLQAWKGDGQQVSNRVIACLTGTISRNIPLSVIYLDP 316
Query: 341 --KHNMKLPSLPHSIISCSSLA------LEESISPTSCSFVQDTGREQTHMKEMLEERKR 392
++ L S S SCS L L+ES S T CS E+T+ K+ L+E KR
Sbjct: 317 VVTYHFVLLSETDSCESCSYLGSGTGPGLDESQSLTPCSVENHYFPEKTYAKDTLDEPKR 376
Query: 393 GVKRPFMH------QKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCK 446
K+ H K+ +EG +KRDNDLHRLLF+PNGLPDG L Y VK Q+L G K
Sbjct: 377 IAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYK 436
Query: 447 QGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG 506
QG+GIVC CC++EISPSQFEAHAGMAARRQPYRHI+ S+G++LHDIA+SLA G TTG
Sbjct: 437 QGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGD 496
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLK 565
SDDMC +CGDG +LLLC GCP AFH ACL +PE W C +C G SS + + D
Sbjct: 497 SDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSVPEGTWYCSSCNDGPISSKKATATDPS 556
Query: 566 GG-----------LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQ 614
G ++AP +E+GGCV CRSHDFS FDDRTVI CDQ
Sbjct: 557 GNARPIVIRLSRVVKAPESEIGGCV-------------FCRSHDFSIGKFDDRTVILCDQ 603
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 674
CEKE+HVGCLR+NGLCDLKEIP++KWFCC DC+RIH A+Q+ VS QTIP L I R
Sbjct: 604 CEKEYHVGCLRENGLCDLKEIPQEKWFCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICR 663
Query: 675 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPV 734
K EKGI D + V+W++L + E LLS A IFRECFDPI+A+ GRDLIPV
Sbjct: 664 KDREKGIFTDNG-DIVEWRILSGKSRYPE-HLPLLSRAAVIFRECFDPIVAKSGRDLIPV 721
Query: 735 MVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
MVYGRNISGQEFGGMY ++L V S+VVSA LLRIFG++VAELP+VAT REYQG+G FQ L
Sbjct: 722 MVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQQVAELPIVATSREYQGRGYFQGL 781
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
++C+E LL SLNVENLVLPAAE+AESIWTKKFGF KMS ++L +YQ++ QLTIFKGTSML
Sbjct: 782 YACVENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSML 841
Query: 855 EKKV 858
EKKV
Sbjct: 842 EKKV 845
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 874 bits (2259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/882 (54%), Positives = 589/882 (66%), Gaps = 91/882 (10%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVE----SDSFPNGKHAKEEALNDD 56
MGE +CLE+P + + K+DR L D+ + D FP+ K AKE A NDD
Sbjct: 1 MGEGTICLEMPKE--------ENGQLKRDR--LDDDTDEGNKGDHFPSKKQAKE-ASNDD 49
Query: 57 IKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTE 116
I SE+SNPV SP E TS +D++SQ K C S + SEET+SD +
Sbjct: 50 ITSEISNPVASPVESTSLFRDVSSQPVKSGLVEC-------SGSDFGSEETVSD-----D 97
Query: 117 SSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYV-APLAYEEGR 175
+S +S+T+ +S DV S VLEIPKH+SS+ GI KITFK SK K+++ PL
Sbjct: 98 ASVVGSSQTEQSS-DVLPSRFVLEIPKHLSST-GITKITFKLSKPKKEFDDLPL------ 149
Query: 176 NYTLYDDLGSSGANDGVLCARNMEIKM-SKKVVPNEYPTNVKKLLSTGILDGACVKYIST 234
+ D +G +KM KK+V YP+NVKKLL TGIL+GA VKYIST
Sbjct: 150 ---IKDHTWDAGV-----------VKMPKKKIVSLSYPSNVKKLLETGILEGARVKYIST 195
Query: 235 SRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSII 294
RQL GI++ GGYLCGC CNFSKV+SA+EFEQHAGAKTRHPNNHI+LEN + +Y+I+
Sbjct: 196 PPVRQLLGIIHSGGYLCGCTTCNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIV 255
Query: 295 QELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSII 354
QELKTAP +LEEV++ VAGS+ NE R WKAS + + + + HS +
Sbjct: 256 QELKTAPRVVLEEVIRNVAGSALNEEGLRAWKASFQQSNSMSDRNYITD-------HSTV 308
Query: 355 SCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAEGG 408
S L+ES S T CS E+T+ K+ L+E KR K+ H K+ +EG
Sbjct: 309 SYLGPGLDESQSLTPCSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGS 368
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
+KRDNDLHRLLF+PNGLPDG L Y VK Q+L G KQG+GIVC CC++EISPSQFEAH
Sbjct: 369 NRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAH 428
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
AGMAARRQPYRHI+ S+G++LHDIA+SLA G TTG SDDMC +CGDG +LLLC GCP
Sbjct: 429 AGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQ 488
Query: 529 AFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLKGG-----------LEAPGAEVG 576
AFH ACL +PE W C +C G SS + + D G ++AP +++G
Sbjct: 489 AFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIG 548
Query: 577 GCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 636
GCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+NG CDLKEIP
Sbjct: 549 GCV-------------FCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIP 595
Query: 637 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK 696
++KWFCC +C+RIH A+Q+ VS QT+P L I RK EKGI D + V+W++L
Sbjct: 596 QEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTD-IGDTVEWRILS 654
Query: 697 KAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTV 756
+ E LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V
Sbjct: 655 GKSRYPE-HLPLLSRAAVIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLIV 713
Query: 757 KSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAE 816
S+VVSA LLRIFG+EVAELP+VAT REYQG+G FQ L++C+E LL SLNVENLVLPAAE
Sbjct: 714 NSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAE 773
Query: 817 KAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+AESIWTKKFGF KMS ++L +YQ++ QLTIFKGTSMLEKKV
Sbjct: 774 EAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKV 815
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 827 bits (2135), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/888 (51%), Positives = 571/888 (64%), Gaps = 77/888 (8%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELL-TDNV-ESDSFPNGKHAKEEALNDDIK 58
MGE+ VC ++ + K+DR DN+ E + + + K +E NDD+K
Sbjct: 1 MGEATVCF-------AKENSETKKDLKRDRLCFEQDNLDEEELYSSNKRQTKEPSNDDMK 53
Query: 59 SEVSNPVVSPK-EFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTES 117
SE+SNPV SP + SS +DITS K + S ++V S + S ET++D
Sbjct: 54 SEISNPVPSPVVDNASSFRDITSNPAK--SSSGDRVGSCSG-----SYETITD------E 100
Query: 118 SYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNY 177
++E + NS D S EIPKH+S++ GI KITFK SKR ED+ +E
Sbjct: 101 KHSEYCSSLANS-DAVPSSFEREIPKHLSTT-GITKITFKLSKRNEDFCDLPMIQEHTWE 158
Query: 178 TLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE 237
++ SS + +KM KK+ + +NVKKLL TGILDGA VKY+STS
Sbjct: 159 GYPSNVASS----------TLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLSTSAA 208
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R+L GI++ GGYLCGC C+FSKV+ A+EFE+HAG KT+HPNNHIYLENG+P+Y++IQEL
Sbjct: 209 RELQGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQEL 268
Query: 298 KTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISC- 356
+ AP +LEEV++KVAGS+ +E F+ WK S K + E D H M HS S
Sbjct: 269 RIAPPDVLEEVIRKVAGSALSEEGFQAWKGSFQQDKNMTEDDSNHIMD-----HSFQSLV 323
Query: 357 ----SSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAE 406
S +L+ES S T C + RE+ K+ K K+ H K+ +
Sbjct: 324 SYPGSGWSLDESQSSTPCFPEDNYFREKICTKDTRHAHKPKAKKLTSHMFGMGCHKKVSG 383
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFE 466
GG KRDNDLHRLLFLPNGLPDG L Y VK Q+L G KQG+GIVC CC+ +ISPSQFE
Sbjct: 384 GGKWKRDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFE 443
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISLA-MGQRRTTGGSDDMCHVCGDGENLLLCNG 525
AHAGMA RRQPYR I+ S+G++LHDIA+SLA G TTG SDDMC +CG+G +LLLC G
Sbjct: 444 AHAGMAGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAG 503
Query: 526 CPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS----------VDLKGGLEAPGAEV 575
CP AFH ACL +PE W C +C G +S + + L ++AP +E+
Sbjct: 504 CPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKIATASDPNLKPIVIRLTRVVKAPESEI 563
Query: 576 GGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 635
GGCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+N LCDLK I
Sbjct: 564 GGCV-------------FCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGI 610
Query: 636 PKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQML 695
P+DKWFCC DC+RIH LQ S QTIP L TI+RK+ EKGI D N V+W+ML
Sbjct: 611 PQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNG-NTVEWRML 669
Query: 696 KKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILT 755
+ E LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L
Sbjct: 670 SGKSRYPE-HLPLLSRAATIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLM 728
Query: 756 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 815
V S+VVSA LLRIFG++VAELP+VAT REYQG+G FQ LF+C+E LL SLNVENL+LPAA
Sbjct: 729 VNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAA 788
Query: 816 EKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
E+AESIWT KFGF KM+ RL +YQR+ QLTIFKGTSMLEKKV E
Sbjct: 789 EEAESIWTNKFGFTKMTEHRLQRYQREVQLTIFKGTSMLEKKVPSFSE 836
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 817 bits (2111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/871 (53%), Positives = 564/871 (64%), Gaps = 112/871 (12%)
Query: 30 RELLTDNVE----SDSFPNGKHAKEEALNDDIKSEVSNPVVSPKEFTSSLQDITSQETKL 85
R+ L D+ + D FP+ K AKE A NDDI SE+SNPV SP E TS +D++SQ K
Sbjct: 11 RDRLDDDTDEGNKGDHFPSKKQAKE-ASNDDITSEISNPVASPVESTSLFRDVSSQPVKS 69
Query: 86 VTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHV 145
C S + SEET+SD ++S +S+T+ +S DV S VLEIPKH+
Sbjct: 70 GLVEC-------SGSDFGSEETVSD-----DASVVGSSQTEQSS-DVLPSRFVLEIPKHL 116
Query: 146 SSSSGIRKITFKFSKRKEDYV-APLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKM-S 203
SS+ GI KITFK SK K+++ PL + D +G +KM
Sbjct: 117 SST-GITKITFKLSKPKKEFDDLPL---------IKDHTWDAGV-----------VKMPK 155
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
KK+V YP+NVKKLL TGIL+GA VKYIST RQL GI++ GGYLCGC CNFSKV+S
Sbjct: 156 KKIVSLSYPSNVKKLLETGILEGARVKYISTPPVRQLLGIIHSGGYLCGCTTCNFSKVLS 215
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFR 323
A+EFEQHAGAKTRHPNNHI+LEN + +Y+I+QELKTAP +LEEV++ VAGS+ NE R
Sbjct: 216 AYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGSALNEEGLR 275
Query: 324 VWKA-----SHHLRKGL-----------VEHDEKHNMKLPSLPHSIISCSSL--ALEESI 365
WK S+ + L V ++ L S S SCS L L+ES
Sbjct: 276 AWKGDGQQVSNRVIACLTGIILRIIPLSVNALLSYHFVLLSETDSCESCSYLGPGLDESQ 335
Query: 366 SPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAEGGTKKRDNDLHRL 419
S T CS E+T+ K+ L+E KR K+ H K+ +EG +KRDNDLHRL
Sbjct: 336 SLTPCSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRL 395
Query: 420 LFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYR 479
LF+PNGLPDG L Y VK Q ISPSQFEAHAGMAARRQPYR
Sbjct: 396 LFMPNGLPDGTELAYYVKTQ--------------------ISPSQFEAHAGMAARRQPYR 435
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL 539
HI+ S+G++LHDIA+SLA G TTG SDDMC +CGDG +LLLC GCP AFH ACL
Sbjct: 436 HIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQS 495
Query: 540 IPESGWRCPNCRQGH-SSSMSRSVDLKGG-----------LEAPGAEVGGCVICRLSPSE 587
+PE W C +C G SS + + D G ++AP +++GGCV
Sbjct: 496 MPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIGGCV-------- 547
Query: 588 NFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
CRSHDFS FDDRTVI CDQCEKE+HVGCLR+NG CDLKEIP++KWFCC +C+
Sbjct: 548 -----FCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCS 602
Query: 648 RIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKS 707
RIH A+Q+ VS QT+P L I RK EKGI D + V+W++L + E
Sbjct: 603 RIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTD-IGDTVEWRILSGKSRYPE-HLP 660
Query: 708 LLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLR 767
LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA LLR
Sbjct: 661 LLSRAAVIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLR 720
Query: 768 IFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFG 827
IFG+EVAELP+VAT REYQG+G FQ L++C+E LL SLNVENLVLPAAE+AESIWTKKFG
Sbjct: 721 IFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFG 780
Query: 828 FRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
F KMS ++L +YQ++ QLTIFKGTSMLEKKV
Sbjct: 781 FTKMSDQQLQEYQKEVQLTIFKGTSMLEKKV 811
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 816 bits (2107), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/845 (53%), Positives = 557/845 (65%), Gaps = 43/845 (5%)
Query: 27 KQDRELL-TDNVESDSFPNGKHAKEEALNDDIKSEVSNPVVSPK-EFTSSLQDITSQETK 84
K+DR + DN+E + + + K +E NDDI+SE+SNPV SP + SS +DITS K
Sbjct: 10 KRDRLCIEQDNLEEELYSSNKRQAKEPSNDDIRSEISNPVASPVVDNASSFRDITSNPAK 69
Query: 85 LVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKH 144
+ S ++V S + + +S+E S+ Y + + S D S V EIPKH
Sbjct: 70 --SSSGDRVGSCSGSYEATSDEKRSE--------YCSSLADSSQSSDAVPSSFVREIPKH 119
Query: 145 VSSSSGIRKITFKFSKRKEDYV-APLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMS 203
+S++ GI KITFK SKR ED P+ E S +N + + + +KM
Sbjct: 120 LSTT-GITKITFKLSKRNEDVCDLPMIQEHTWE--------GSPSN---VASSTLGVKML 167
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
K+ +P+NVKKLL+TGILDGA VKY+S S R+L GI++ GGYLCGC +C+FSKV+
Sbjct: 168 DKIDSTNFPSNVKKLLATGILDGARVKYLSISPARELQGIIHSGGYLCGCTVCDFSKVLG 227
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFR 323
A+EFE+HAG KT+HPNNHIYLENG+P+Y++IQEL+ AP +LEEV++KVAGS+ +E F+
Sbjct: 228 AYEFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVAPPDVLEEVIRKVAGSALSEEGFQ 287
Query: 324 VWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHM 383
WK S + D H M S +++ES S T + R++
Sbjct: 288 AWKESFQ------QDDSNHIMDYSFQSLVSYPGSGWSIDESQSSTPYFPENNYFRKKIST 341
Query: 384 KEMLEERKRGVKRPFMH------QKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK 437
K+ E K K+ H K+ A GG KRDNDLHRLLFLPNGLPDG L Y VK
Sbjct: 342 KDTRHEHKPKAKKVTSHMFGMGCHKKAAGGGKWKRDNDLHRLLFLPNGLPDGTELAYFVK 401
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLA 497
Q+L G KQG+GIVC CC+ EISPSQFEAHAGMA RRQPYRHI+ S+G++LHDIA+SLA
Sbjct: 402 SQKLLQGYKQGSGIVCSCCDTEISPSQFEAHAGMAGRRQPYRHIHISSGLSLHDIAMSLA 461
Query: 498 -MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
G TTG SDDMC +CGDG +LLLC GCP AFH ACL +PE W C +C G +S
Sbjct: 462 DGGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPTS 521
Query: 557 SMSRSV---DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCD 613
+ + +LK + + I L + F RS DFS FDDRTVI CD
Sbjct: 522 CKTATATDPNLKSIVGSIAIFSLSAHIRVLHSAYCFSPISDRSLDFSIGKFDDRTVILCD 581
Query: 614 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 673
QCEKE+HVGCLR+N LCDLK IP+DKWFCC DC+RIH ALQ S QTIP L TI+
Sbjct: 582 QCEKEYHVGCLRENDLCDLKGIPQDKWFCCSDCSRIHTALQSSASCGPQTIPTVLLDTIS 641
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIP 733
RK+ EKGI D N V+W+ML + E LLS A IFRECFDPI+A+ GRDLIP
Sbjct: 642 RKYREKGICIDNGDN-VEWRMLSGKSRYAE-HLPLLSRAATIFRECFDPIVAKSGRDLIP 699
Query: 734 VMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
VMVYGRNISGQEFGGMY ++L V S+VVSA LLRIFG++VAELP+VAT REYQG+G FQ
Sbjct: 700 VMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQG 759
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 853
LF+C+E LL SLNVENL+LPAAE+AESIWTKKFGF KM+ +L KYQR+ QLTIFKGTSM
Sbjct: 760 LFACVENLLSSLNVENLLLPAAEEAESIWTKKFGFTKMTEHQLQKYQREVQLTIFKGTSM 819
Query: 854 LEKKV 858
LEKKV
Sbjct: 820 LEKKV 824
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 811 bits (2096), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/879 (51%), Positives = 565/879 (64%), Gaps = 61/879 (6%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELL-TDNV-ESDSFPNGKHAKEEALNDDIK 58
MGE+ VC ++ + K+DR DN+ E + + + K +E NDD+K
Sbjct: 1 MGEATVCF-------AKENSETKKDLKRDRLCFEQDNLDEEELYSSNKRQTKEPSNDDMK 53
Query: 59 SEVSNPVVSPK-EFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTES 117
SE+SNPV SP + SS +DITS K + S ++V S + S ET++D
Sbjct: 54 SEISNPVPSPVVDNASSFRDITSNPAK--SSSGDRVGSCSG-----SYETITD------E 100
Query: 118 SYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNY 177
++E + NS D S EIPKH+S++ GI KITFK SKR ED+ +E
Sbjct: 101 KHSEYCSSLANS-DAVPSSFEREIPKHLSTT-GITKITFKLSKRNEDFCDLPMIQEHTWE 158
Query: 178 TLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE 237
++ SS + +KM KK+ + +NVKKLL TGILDGA VKY+STS
Sbjct: 159 GYPSNVASS----------TLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLSTSAA 208
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R+L GI++ GGYLCGC C+FSKV+ A+EFE+HAG KT+HPNNHIYLENG+P+Y++IQEL
Sbjct: 209 RELQGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQEL 268
Query: 298 KTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISC- 356
+ AP +LEEV++KVAGS+ +E F+ WK S K + E D H M HS S
Sbjct: 269 RIAPPDVLEEVIRKVAGSALSEEGFQAWKGSFQQDKNMTEDDSNHIMD-----HSFQSLV 323
Query: 357 ----SSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAE 406
S +L+ES S T C + RE+ K+ K K+ H K+ +
Sbjct: 324 SYPGSGWSLDESQSSTPCFPEDNYFREKICTKDTRHAHKPKAKKLTSHMFGMGCHKKVSG 383
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFE 466
GG KRDNDLHRLLFLPNGLPDG L Y VK Q+L G KQG+GIVC CC+ +ISPSQFE
Sbjct: 384 GGKWKRDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFE 443
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISLA-MGQRRTTGGSDDMCHVCGDGENLLLCNG 525
AHAGMA RRQPYR I+ S+G++LHDIA+SLA G TTG SDDMC +CG+G +LLLC G
Sbjct: 444 AHAGMAGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAG 503
Query: 526 CPLAFHAACLDPLLIPESGWRCPNCRQGHSS-SMSRSVDLKGGLEAPGAEVGGCVICRLS 584
CP AFH ACL +PE W C +C G +S ++ + L + LS
Sbjct: 504 CPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKIATASWLYTYFNLNANILVLHSAYSLS 563
Query: 585 PSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 644
P + RSHDFS FDDRTVI CDQCEKE+HVGCLR+N LCDLK IP+DKWFCC
Sbjct: 564 PISD------RSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFCCS 617
Query: 645 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK 704
DC+RIH LQ S QTIP L TI+RK+ EKGI D N V+W+ML + E
Sbjct: 618 DCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNG-NTVEWRMLSGKSRYPE- 675
Query: 705 EKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAG 764
LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA
Sbjct: 676 HLPLLSRAATIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAA 735
Query: 765 LLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTK 824
LLRIFG++VAELP+VAT REYQG+G FQ LF+C+E LL SLNVENL+LPAAE+AESIWT
Sbjct: 736 LLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTN 795
Query: 825 KFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
KFGF KM+ RL +YQR+ QLTIFKGTSMLEKKV E
Sbjct: 796 KFGFTKMTEHRLQRYQREVQLTIFKGTSMLEKKVPSFSE 834
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 752 bits (1942), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/614 (61%), Positives = 441/614 (71%), Gaps = 61/614 (9%)
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++++SA+EFEQHAG KTRHPNNHIYLENG+PIYS+IQE+K+APL IL+EV+ +VAGSS
Sbjct: 213 ITEILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSAPLSILDEVIMEVAGSSV 272
Query: 318 NEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTG 377
N SF WKAS H + E H++KLP L H + E +P + V
Sbjct: 273 NMNSFEAWKASFHQDSANIVV-ENHDVKLPKLSHPV---------ERPNPNFSNAV---- 318
Query: 378 REQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK 437
+ K+TAE GTK+RDNDLHRLLF+PNGLPDG L Y VK
Sbjct: 319 ---------------------LQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVK 357
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLA 497
GQR+ G KQGNGI+C CN+EISPSQFEAHAGMAARRQPYRHIYT+NG+TLHDIAISLA
Sbjct: 358 GQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLA 417
Query: 498 MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ----- 552
GQ+ TTG SDDMC CG+G +L+ C+ CP A+H CL +PE W CPNCR
Sbjct: 418 SGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDKVGSN 477
Query: 553 -----GHSSSMSRSV--DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD 605
G S S S+ + L ++AP E+GGCV+CR HDFSAA FD
Sbjct: 478 SKAISGGSLSFSKPIVFRLTRVVKAPEYEIGGCVVCR-------------RHDFSAAKFD 524
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP 665
DRTV+ CDQCE+EFHVGCLR +GLCDLKE+PKDKWFCCD+C+ IH ALQ+ V N AQ IP
Sbjct: 525 DRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQIIP 584
Query: 666 ASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA 725
S I RKH+ KG+L D +NDV+WQ+L F E + LS ATAIFRECFDPI+A
Sbjct: 585 DSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPE-DLPFLSRATAIFRECFDPIVA 643
Query: 726 ECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
+ GRDLIPVMVYGRNISGQEFGGMY V+L V+S+VVSAGLLRIFGREVAELP+VAT RE+
Sbjct: 644 KSGRDLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSREH 703
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
QGKG FQ LFSCIERLL SLNV+NLVLPAAE AESIWTKK GFRKMS E+L+KY R+ QL
Sbjct: 704 QGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREVQL 763
Query: 846 TIFKGTSMLEKKVQ 859
TIF GTSMLEK V+
Sbjct: 764 TIFNGTSMLEKVVE 777
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 40/84 (47%), Gaps = 6/84 (7%)
Query: 218 LLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTR 276
+ G+ DGA + Y +R L G G G LC CN + +S +FE HAG A R
Sbjct: 341 FMPNGLPDGAELAYF-VKGQRILGGFKQGNGILCSH--CN--REISPSQFEAHAGMAARR 395
Query: 277 HPNNHIYLENGKPIYSIIQELKTA 300
P HIY NG ++ I L +
Sbjct: 396 QPYRHIYTTNGLTLHDIAISLASG 419
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/857 (50%), Positives = 522/857 (60%), Gaps = 121/857 (14%)
Query: 33 LTDNVESDSFPNGKHAK-EEALNDDIKSEVSNPV---VSPKEFTSSLQDITSQETKLVTE 88
+ +N + FPN K AK EEA NDD KSEVSNPV VSPK SS DI+ +
Sbjct: 34 VNENETGELFPNKKQAKQEEASNDDTKSEVSNPVITLVSPKGNGSSSHDISEESPTNACP 93
Query: 89 SCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSS 148
S + L+ + G G +E + + S ++ VS S VVLEIP+H +S+
Sbjct: 94 SSEETLTVSQEG----------GGSSSEDNTSNQSPRNDTCDSVSMSPVVLEIPEH-AST 142
Query: 149 SGIRKITFKFSKRKEDY---VAPLAYEEG--------RNYTLYDDLGSSGANDGVLCA-- 195
+G+RKITFKFSKRKEDY ++P G RN Y S N
Sbjct: 143 TGVRKITFKFSKRKEDYDTKISPHPLHGGIDQGLLYHRNGDYYPRNHSVWVNSCTEMPQT 202
Query: 196 --RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
R ME+ MSKKVVPN YPTNVKKLL+TGILD A VKYI S ER+LDGI++GGGYLCGC
Sbjct: 203 RERYMELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERELDGIIDGGGYLCGC 262
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CNFSKV+SA+EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPL +++ V+K VA
Sbjct: 263 SSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDVA 322
Query: 314 GSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFV 373
GSS NE FRVWKAS + LV D+K + +LP LPHS +S +S AL+ES P S SF+
Sbjct: 323 GSSINEEFFRVWKASLNQSNALVGADKKCHSELPCLPHSHVSYASQALKESFCPISSSFL 382
Query: 374 QDTG--REQTHMKEMLEERKRGVKRPFMH-------QKRTAEGGTKKRDNDLHRLLFLPN 424
+ +Q +M E K+ KRP ++ QK+TAE G +KRDNDLHRLLF+PN
Sbjct: 383 YNNNFVSQQMYM-ETSGVNKQTSKRPSLYFPGSATKQKKTAESGVRKRDNDLHRLLFMPN 441
Query: 425 GLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS 484
GLPDG L Y VKGQ++ G KQGNGIVC CC EISPSQFE+HAGM+ARRQPYRHIYTS
Sbjct: 442 GLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTS 501
Query: 485 NGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG 544
NG+TLHDIAISLA GQ TTG DDMC CGDG +L+ +H LL
Sbjct: 502 NGLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLM--------WHVWIYRILL----- 548
Query: 545 WRCPNCRQGHSSSMSRS--VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAA 602
+ Q + +R + L + P VG CV+CRL+ + F I DF+ A
Sbjct: 549 -KVLGIVQIDGGNFARPTVIRLTRVGKIPEYNVGDCVVCRLNLLK-FLI------DFTLA 600
Query: 603 TFDD-RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRA 661
V+ +KEFHVGCLR++GLCDL+EIP+D WFCC DCN I+ AL++ VS
Sbjct: 601 NSRKCLNVMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWFCCQDCNNIYVALRNSVSTGV 660
Query: 662 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFD 721
Q IPAS L+ INRKH+EKG+L D DVQWQ+L + ++ SLLS A AIFR
Sbjct: 661 QKIPASLLNIINRKHVEKGLLVDEAAYDVQWQILM-GKSRNREDLSLLSGAAAIFR---- 715
Query: 722 PIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 781
++ C L+ + L S+
Sbjct: 716 --VSSCTLHLMKCWFF----------------LAFNSL---------------------- 735
Query: 782 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 841
F LFSCIERLLCSLNVE LVLPAAE +IWT++FGFRKMS +LLKY R
Sbjct: 736 ---------FNPLFSCIERLLCSLNVEQLVLPAAE---TIWTRRFGFRKMSEGQLLKYTR 783
Query: 842 DFQLTIFKGTSMLEKKV 858
+FQLTIFKGTSMLEK+V
Sbjct: 784 EFQLTIFKGTSMLEKEV 800
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 313/526 (59%), Positives = 375/526 (71%), Gaps = 33/526 (6%)
Query: 351 HSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRT 404
HS +S L+ES S T CS E+T+ K+ L+E KR K+ H K+
Sbjct: 10 HSTVSYLGPGLDESQSLTPCSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKV 69
Query: 405 AEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
+EG +KRDNDLHRLLF+PNGLPDG L Y VK Q+L G KQG+GIVC CC++EISPSQ
Sbjct: 70 SEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQ 129
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FEAHAGMAARRQPYRHI+ S+G++LHDIA+SLA G TTG SDDMC +CGDG +LLLC
Sbjct: 130 FEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCA 189
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLKGG-----------LEAPG 572
GCP AFH ACL +PE W C +C G SS + + D G ++AP
Sbjct: 190 GCPQAFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPE 249
Query: 573 AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 632
+++GGCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+NG CDL
Sbjct: 250 SDIGGCV-------------FCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDL 296
Query: 633 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 692
KEIP++KWFCC +C+RIH A+Q+ VS QT+P L I RK EKGI D + V+W
Sbjct: 297 KEIPQEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTD-IGDTVEW 355
Query: 693 QMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSV 752
++L + E LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY +
Sbjct: 356 RILSGKSRYPE-HLPLLSRAAVIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCL 414
Query: 753 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 812
+L V S+VVSA LLRIFG+EVAELP+VAT REYQG+G FQ L++C+E LL SLNVENLVL
Sbjct: 415 VLIVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVL 474
Query: 813 PAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
PAAE+AESIWTKKFGF KMS ++L +YQ++ QLTIFKGTSMLEKKV
Sbjct: 475 PAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKV 520
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 218 LLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTR 276
+ G+ DG + Y + ++ L G G G +C C S+ +S +FE HAG A R
Sbjct: 86 FMPNGLPDGTELAYYVKT-QKLLQGYKQGSGIVCSC----CSREISPSQFEAHAGMAARR 140
Query: 277 HPNNHIYLENGKPIYSIIQEL 297
P HI++ +G ++ I L
Sbjct: 141 QPYRHIFISSGLSLHDIAMSL 161
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/731 (41%), Positives = 422/731 (57%), Gaps = 65/731 (8%)
Query: 152 RKITFKFSKRKEDYVAPLA--YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN 209
++ T K KED V L Y + + D ++GA + + + +KMSKK+ N
Sbjct: 1712 KRFTRSALKSKEDTVESLESDYNFCNSVAIGVDEKTNGAVRSLTSPKKLGLKMSKKIALN 1771
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQ 269
+ P ++ LL TG+L+G V Y + +L G + G G LC C LC S+VV +FE
Sbjct: 1772 KVPLTIRDLLETGMLEGYPVTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFEL 1831
Query: 270 HAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASH 329
HA RH +IYL+NGK ++ ++ K APL LE ++ GS F V ++
Sbjct: 1832 HACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGS------FPVKRS-- 1883
Query: 330 HLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEE 389
+ DE K+ L +S I ++ SP + + T +K +
Sbjct: 1884 ------LPADEA--AKMDPLGNSCIKRNN-------SPATS--IHRTSERARLLKPIPVT 1926
Query: 390 RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGN 449
+ G + + + G K+D LHRL+F GLPDG + Y G++L G K+G
Sbjct: 1927 KSSG-SALYNSSENKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGF 1985
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDD 509
GI C CC+ E+S SQFEAHAG A+R++PY +IYTSNG++LH++AISL+ G++ + +DD
Sbjct: 1986 GIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDD 2045
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--------GHSSSMSRS 561
+C +CGDG NLLLC+GCP AFH C IP+ W C C+ H+++ +
Sbjct: 2046 LCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAA 2105
Query: 562 VDLKG------------GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV 609
+ G + P AEV CV LCR +DFS + F RT+
Sbjct: 2106 GRVSGVDPIEQITKRCIRIVNPEAEVSACV-------------LCRGYDFSKSGFGPRTI 2152
Query: 610 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 669
I CDQCEKEFH+GCLR + + DLKE+P KWFCC +C RIH+ALQ + +P S L
Sbjct: 2153 ILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLL 2212
Query: 670 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
+ I KH KG+ N V+W++L + + + LLS A AIF + FDPII + G
Sbjct: 2213 NVIKEKHERKGLESIADYN-VRWRLL-SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTG 2270
Query: 729 RDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
RDLIP MVYGRN+ GQ+F G+Y ++TV S VVSAG+LR+FG+EVAELPLVAT + QG+
Sbjct: 2271 RDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGR 2330
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTI 847
G FQ LFSCIE+LL LNV + VLPAAE+AE IWTKKFGF+K++ ++L +Y++ F Q+
Sbjct: 2331 GYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMIS 2390
Query: 848 FKGTSMLEKKV 858
F+GT MLEK V
Sbjct: 2391 FQGTCMLEKGV 2401
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 303/731 (41%), Positives = 423/731 (57%), Gaps = 65/731 (8%)
Query: 152 RKITFKFSKRKEDYVAPLA--YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN 209
++ T K KED V L Y + + D ++GA + + + +KMSKK+ N
Sbjct: 180 KRFTRSALKSKEDTVESLESDYNFCNSVAIGVDEKTNGAVRSLTSPKKLGLKMSKKIALN 239
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQ 269
+ P ++ LL TG+L+G V Y + +L G + G G LC C LC S+VV +FE
Sbjct: 240 KVPLTIRDLLETGMLEGYPVTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFEL 299
Query: 270 HAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASH 329
HA RH +IYL+NGK ++ ++ K APL LE ++ GS F V ++
Sbjct: 300 HACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGS------FPVKRS-- 351
Query: 330 HLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEE 389
+ DE K+ L +S I ++ + SI TS E+ + + +
Sbjct: 352 ------LPADEA--AKMDPLGNSCIKRNN-SPATSIHRTS---------ERARLLKPIPV 393
Query: 390 RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGN 449
K + + + G K+D LHRL+F GLPDG + Y G++L G K+G
Sbjct: 394 TKSSGSALYNSSENKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGF 453
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDD 509
GI C CC+ E+S SQFEAHAG A+R++PY +IYTSNG++LH++AISL+ G++ + +DD
Sbjct: 454 GIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDD 513
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--------GHSSSMSRS 561
+C +CGDG NLLLC+GCP AFH C IP+ W C C+ H+++ +
Sbjct: 514 LCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAA 573
Query: 562 VDLKG------------GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV 609
+ G + P AEV CV LCR +DFS + F RT+
Sbjct: 574 GRVSGVDPIEQITKRCIRIVNPEAEVSACV-------------LCRGYDFSKSGFGPRTI 620
Query: 610 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 669
I CDQCEKEFH+GCLR + + DLKE+P KWFCC +C RIH+ALQ + +P S L
Sbjct: 621 ILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLL 680
Query: 670 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
+ I KH KG+ N V+W++L + + + LLS A AIF + FDPII + G
Sbjct: 681 NVIKEKHERKGLESIADYN-VRWRLL-SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTG 738
Query: 729 RDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
RDLIP MVYGRN+ GQ+F G+Y ++TV S VVSAG+LR+FG+EVAELPLVAT + QG+
Sbjct: 739 RDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGR 798
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTI 847
G FQ LFSCIE+LL LNV + VLPAAE+AE IWTKKFGF+K++ ++L +Y++ F Q+
Sbjct: 799 GYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMIS 858
Query: 848 FKGTSMLEKKV 858
F+GT MLEK V
Sbjct: 859 FQGTCMLEKGV 869
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 299/689 (43%), Positives = 398/689 (57%), Gaps = 26/689 (3%)
Query: 184 GSSGANDGVLCARN-MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYIS-TSRERQLD 241
G SG N + RN ME+KMSKK+V N P VKKL TG LDG V Y+ + L
Sbjct: 110 GGSGINGALGAPRNKMELKMSKKIVVNRKPMTVKKLFDTGFLDGVSVVYMGGIKKASGLR 169
Query: 242 GIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP 301
G++ GG LC C LCN +V+ +FE HA + R +I LENGK + +++ + A
Sbjct: 170 GVIRDGGILCSCCLCNGRRVIPPSQFEIHACKQYRRAAQYICLENGKSLLDLLRACRGAT 229
Query: 302 LGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLAL 361
L LE V+ S E F + VE S S +++
Sbjct: 230 LHTLEVTVQNFVCSPHEERYFTCKRCKGCFPSSFVERVGPICRSCVESRKSEESSNNVVG 289
Query: 362 EESISPTSCSFVQDTGREQTHMKEMLEE-RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLL 420
+ SP + + + ++ RK+ K F+ D LH+L+
Sbjct: 290 KRVRSPRPVVLSNPSSTSELSVSSQVKRHRKKRTKLVFISISSVL-------DQRLHKLV 342
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRH 480
F NGLPDG + Y +GQ+L G K G+GIVC CCN EISPSQFE HAG A+R++PY +
Sbjct: 343 FEENGLPDGTEVAYYARGQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAY 402
Query: 481 IYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLI 540
IYTSNG++LH++AISL+ ++ + +DD+C VC DG NLLLC+GCP AFH C I
Sbjct: 403 IYTSNGVSLHELAISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSI 462
Query: 541 PESGWRCPNCRQGHS----SSMSRSVDLKGGLEA--PGAEVGGCVICRLSPSENFDIR-- 592
P W C C+ + + + G +E P ++ I R+ D+
Sbjct: 463 PRGDWYCQFCQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCI-RIVKDIEADLSSC 521
Query: 593 -LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHA 651
LCR DFS + F RT+I CDQCEKE+HVGCLR + + LKE+P+ W CC+DC RIH+
Sbjct: 522 ALCRGVDFSRSGFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHS 581
Query: 652 ALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSS 711
L++ + A+ +P S L I +K EKG+ + DV+W++L + + + LL
Sbjct: 582 TLENLLVKGAERLPESLLGVIKKKQEEKGL---EPIIDVRWRLLN-GKIASPETRPLLLE 637
Query: 712 ATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFG 770
A +IF ECF+PI+ A GRDLIP MVYGRN+ GQEFGGMY +L V S VVSAG+LRIFG
Sbjct: 638 AVSIFHECFNPIVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFG 697
Query: 771 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 830
+VAELPLVAT GKG FQ LFSCIERLL LNV+NLVLPAAE+AESIWT KFGF K
Sbjct: 698 SDVAELPLVATSNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGFSK 757
Query: 831 MSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
M+ + L Y+++ Q+ FKGT+ML K V
Sbjct: 758 MNPDELTNYRKNCHQMVSFKGTNMLHKMV 786
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 281/696 (40%), Positives = 405/696 (58%), Gaps = 75/696 (10%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLCGCPLCNF 258
KKV ++P +K LL TGIL+G V+YI S+ E L G+++G G +C C C
Sbjct: 297 KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGGVISGSGIICFCNNCKG 356
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS FE HAG+ + P +IYLE G + I+ + EE ++ G S
Sbjct: 357 KEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSFDQTEEFIQSAIGRSL- 415
Query: 319 EGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGR 378
+ + + + E D M ++ CS + +S P + + +
Sbjct: 416 ---VKRTAICLNCKGRIPESDTGIAM--------LLCCSCM---DSKKPQAIDLLSLS-- 459
Query: 379 EQTHMKEMLEERKRGVKRPFMHQKRT------------AEGGTKKRDNDLHRLLFLPNGL 426
+MKE + +P + K + G ++D LH+L+F + L
Sbjct: 460 -HYYMKEFWADHLIITPKPNVLSKSSDTITKSVSTRGKIHGRITRKDLRLHKLVFEEDIL 518
Query: 427 PDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG 486
PDG + Y +GQ+L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG
Sbjct: 519 PDGTEVAYYARGQKLLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNG 578
Query: 487 MTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
++LH+++ISL+ G++ + +DD+C +C DG +LL C+GCP +FH C+ IP W
Sbjct: 579 VSLHELSISLSKGRKFSLTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLQCIPTGIWY 638
Query: 547 CPNCRQ--------GHSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRLSP 585
C C+ H+++ + + G ++ EVGGC
Sbjct: 639 CKYCQNLFQKEKFVEHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEVEVGGCA------ 692
Query: 586 SENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 645
LCR HDFS + F RTVI CDQCEKEFHVGCL++N + DLKE+P+ KWFCC +
Sbjct: 693 -------LCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQGKWFCCPE 745
Query: 646 CNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQ--WQMLKKAQCFEE 703
CNRIH+AL+ V + +P S L ++ +K ++G ++NDV+ W++L +
Sbjct: 746 CNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQG---SASINDVEIRWRVLNWKMLSSD 802
Query: 704 KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVS 762
+ +SLLS A +IF +CFDPI+ + GRD IP M+YGRNI GQEFGG+Y +LTV VVS
Sbjct: 803 ETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEFGGIYCAVLTVNESVVS 862
Query: 763 AGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIW 822
G+ RIFG EVAELPLVAT +QG+G FQ+L++CIER L LNV+NLVLPAA++AES+W
Sbjct: 863 VGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADEAESLW 922
Query: 823 TKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
KFGF K+ E +++++R +Q+ IF+GTSML+K+V
Sbjct: 923 INKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEV 958
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 72/152 (47%), Gaps = 23/152 (15%)
Query: 360 ALEESISPTSCSFVQ--DTGREQTHMKEMLEERKRGVKRPF------MHQKRTAEGGTKK 411
AL++++ PTS + +TG + E + + P + + + + KK
Sbjct: 244 ALKQNVEPTSLEHLSKCNTGVAMQVITNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKK 303
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRF-------GCKQGNGIVCDCCN---KEI- 460
L LL G+ +G R+ YI +G +++ G G+GI+C C N KE+
Sbjct: 304 FPAKLKDLL--DTGILEGLRVRYI-RGSKIKALGETGLGGVISGSGIICFCNNCKGKEVV 360
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
SP+ FE HAG + +R P +IY G TL DI
Sbjct: 361 SPTLFELHAGSSNKRPP-EYIYLETGNTLRDI 391
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 513 bits (1322), Expect = e-142, Method: Compositional matrix adjust.
Identities = 282/692 (40%), Positives = 406/692 (58%), Gaps = 67/692 (9%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLCGCPLCNF 258
KKV ++P +K LL TGIL+G V+YI S+ E L G+++G G +C C C
Sbjct: 297 KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGGVISGSGIICFCNNCKG 356
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS FE HAG+ + P +IYLE G + I+ + EE ++ G S
Sbjct: 357 KEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSFDQTEEFIQSAIGRSL- 415
Query: 319 EGSFRVWKASHHLRKGLVEHDEKHNMKL-----PSLPHSIISCSSLALEESISPTSCSFV 373
+ + + + E D M L S + S S + S +P S
Sbjct: 416 ---VKRTAICLNCKGRIPESDTGIAMLLCCSCMDSRKPQVSSSPSPSPSPSPTPIVFS-- 470
Query: 374 QDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLT 433
+++T +L + + + + + G ++D LH+L+F + LPDG +
Sbjct: 471 ----KDRTPKPNVLSKSSDTITKSVSTRGKI-HGRITRKDLRLHKLVFEEDILPDGTEVA 525
Query: 434 YIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA 493
Y +GQ+L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+++
Sbjct: 526 YYARGQKLLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELS 585
Query: 494 ISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ- 552
ISL+ G++ + +DD+C +C DG +LL C+GCP +FH C+ IP W C C+
Sbjct: 586 ISLSKGRKFSLTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLPCIPTGIWYCKYCQNL 645
Query: 553 -------GHSSS----------------MSRSVDLKGGLEAPGAEVGGCVICRLSPSENF 589
H+++ +R + + +E EVGGC
Sbjct: 646 FQKEKFVEHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEV---EVGGCA---------- 692
Query: 590 DIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 649
LCR HDFS + F RTVI CDQCEKEFHVGCL++N + DLKE+P+ KWFCC +CNRI
Sbjct: 693 ---LCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQGKWFCCPECNRI 749
Query: 650 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQ--WQMLKKAQCFEEKEKS 707
H+AL+ V + +P S L ++ +K ++G ++NDV+ W++L ++ +S
Sbjct: 750 HSALEKLVVLGGEKLPESILVSVQKKIEDQG---SASINDVEIRWRVLNWKMLSSDETRS 806
Query: 708 LLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLL 766
LLS A +IF +CFDPI+ + GRD IP M+YGRNI GQEFGG+Y +LTV VVS G+
Sbjct: 807 LLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEFGGIYCAVLTVNESVVSVGIF 866
Query: 767 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 826
RIFG EVAELPLVAT +QG+G FQ+L++CIER L LNV+NLVLPAA++AES+W KF
Sbjct: 867 RIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADEAESLWINKF 926
Query: 827 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
GF K+ E +++++R +Q+ IF+GTSML+K+V
Sbjct: 927 GFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEV 958
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 72/152 (47%), Gaps = 23/152 (15%)
Query: 360 ALEESISPTSCSFVQ--DTGREQTHMKEMLEERKRGVKRPF------MHQKRTAEGGTKK 411
AL++++ PTS + +TG + E + + P + + + + KK
Sbjct: 244 ALKQNVEPTSLEHLSKCNTGVAMQVITNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKK 303
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRF-------GCKQGNGIVCDCCN---KEI- 460
L LL G+ +G R+ YI +G +++ G G+GI+C C N KE+
Sbjct: 304 FPAKLKDLL--DTGILEGLRVRYI-RGSKIKALGETGLGGVISGSGIICFCNNCKGKEVV 360
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
SP+ FE HAG + +R P +IY G TL DI
Sbjct: 361 SPTLFELHAGSSNKRPP-EYIYLETGNTLRDI 391
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 324/856 (37%), Positives = 447/856 (52%), Gaps = 137/856 (16%)
Query: 86 VTESCNQVLSTTSTGNLSS------EETLSDGDERTESSYAETSRTDNNSGDVSKSHVVL 139
VTE+ V+ T + L S EE GD +T+S + D + +
Sbjct: 243 VTEAMMNVVKTDNEEQLDSLMEPTMEELPMSGDCKTKSEAIK--------ADCAYGSALA 294
Query: 140 EIPKHV-------SSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGANDGV 192
+P+ V SS +R+ T K K+ ++ E+ T +D S +
Sbjct: 295 TVPESVENSKVSTSSEKPLRRFTRSCLKTKQQAMSASPAED----TKAEDALESDEASAI 350
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGG 247
+E+KMSKKV + PT +K LL+TG+L+G V+Y+ ++ E+ L G++ G
Sbjct: 351 GTTSKLEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQGVIQGS 410
Query: 248 GYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEE 307
G LC C C +KVV+ ++FE HAG+ + P +IYL+NGK + ++ K AP LE
Sbjct: 411 GILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADALEA 470
Query: 308 VVKKVAG--------------SSFNEGSF----------RVWKASHHL----------RK 333
++ G +S E SF K S R
Sbjct: 471 AIRNATGAGDARKSTFCLNCKASLPEASFGRPRLQCDSCMTSKKSQTTPSQVGDANCSRD 530
Query: 334 GLVEH---------DEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
G +E D+ + + LP L L+ SP+S S ++ T R +
Sbjct: 531 GQLEFIFLLNYYWADDLYKLGLPDL---------RGLQ--WSPSSNSVLKSTERMSSGTC 579
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG 444
+ R ++ K EG LP+G L Y V+G++L G
Sbjct: 580 PPSKVHGRLTRKDLRMHKLVFEGDV------------LPDGTA----LAYYVRGKKLLEG 623
Query: 445 CKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT 504
K+G I C CC E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+++I L+M +R ++
Sbjct: 624 YKKGGAIFCYCCQSEVSPSQFEAHAGCASRRKPYSHIYTSNGVSLHELSIKLSMERRSSS 683
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVD 563
+DD+C +C DG +LL C+ CP AFH C+ IP W C C RS +
Sbjct: 684 DENDDLCSICADGGDLLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKFDRSAN 743
Query: 564 LKGGLEAPG-------------------AEVGGCVICRLSPSENFDIRLCRSHDFSAATF 604
G AEVG CV LCRSHDFS + F
Sbjct: 744 AIAAGRVAGIDALEQITKCSIRIVDTLHAEVGVCV-------------LCRSHDFSTSGF 790
Query: 605 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 664
+TVI CDQCEKE+HV CL ++ + DLKE+PKDKWFCC +CN IH ALQ VS+ Q++
Sbjct: 791 GPQTVIICDQCEKEYHVKCLEEHNMDDLKELPKDKWFCCKECNSIHYALQKLVSDGEQSL 850
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
P S + IN K I+ L D ++NDV+W++L EE + LS A +IF + FDPI
Sbjct: 851 PDSLMGIINEK-IKAKNLEDNSINDVKWRLLSGKNSTEET-RVWLSGAVSIFHDSFDPIA 908
Query: 725 --AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATC 782
+ DLIP MVYGRN Q+FGGM IL V S+VVSAG++RIFG+EVAELPLVAT
Sbjct: 909 DSSTSRLDLIPTMVYGRNFKDQDFGGMLCAILMVNSLVVSAGVIRIFGKEVAELPLVATS 968
Query: 783 REYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD 842
+ QGKG FQ+LF IE LL SL V+ LVLPAAE+AESIWTKKFGF+ ++ E L Y+ +
Sbjct: 969 LDCQGKGYFQSLFYSIENLLKSLGVKYLVLPAAEEAESIWTKKFGFQHITPEELKHYKDN 1028
Query: 843 FQLTIFKGTSMLEKKV 858
+QL IF+GT+ML+K+V
Sbjct: 1029 YQLMIFQGTAMLQKQV 1044
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 284/690 (41%), Positives = 394/690 (57%), Gaps = 63/690 (9%)
Query: 196 RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL 255
+N+E+KMSKK+ + P VK+L TG+L+G V Y+ + L G + G LC C
Sbjct: 186 KNLELKMSKKIALDNIPMTVKELFETGLLEGVPVVYMGGKKAFCLRGTIKDVGILCYCSF 245
Query: 256 CNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGS 315
C +V+ +FE HA + R +I ENGK + ++ + +PL LE ++
Sbjct: 246 CKGCRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISG 305
Query: 316 SFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQD 375
E +F + LV K+ L S + + E + SP + ++
Sbjct: 306 LPKEKTFTCKRCKGTYPTILVG-------KVGPLCSSCVE----SKESNGSPACETNIKS 354
Query: 376 TGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYI 435
+ + + L GV + + +D LH+L+F GLPDG + Y
Sbjct: 355 RSSKPATVSKSLNSALEGV-----SSENKCQWKITTKDQRLHKLVFEDGGLPDGTEVAYY 409
Query: 436 VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS 495
+GQ+L G K+G GI+C CCN E+SPS FEAHAG A R++PY +IYTSNG++LH++AIS
Sbjct: 410 ARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGVSLHELAIS 469
Query: 496 LAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--- 552
L+ G++ + +DD+C VC DG +L+LC+GCP AFH C IP W C C+
Sbjct: 470 LSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFCQFCQNMFQ 529
Query: 553 --------------GHSSSM-------SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDI 591
G S + R + + +EA E+ GCV
Sbjct: 530 REKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEA---ELTGCV------------ 574
Query: 592 RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHA 651
LCR +DFS + F RT+I CDQC KEFHVGCLR + + +LKE+PK KWFCC DC RIH+
Sbjct: 575 -LCRGYDFSRSGFGPRTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHS 633
Query: 652 ALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMN--DVQWQMLKKAQCFEEKEKSLL 709
AL+ ++ A+ IP L + +K+ EKG+ T+N DV+W++L E K LL
Sbjct: 634 ALKKLLAREAEIIPNKLLEVVMKKNEEKGL---ETVNNIDVRWKLLTGKSASPE-TKLLL 689
Query: 710 SSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIF 769
S A AIF+ECFDPI+ GRDLIP+MVYG+N GQ++GGMY +L V S VVSA ++RIF
Sbjct: 690 SQALAIFQECFDPIVDTTGRDLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIF 749
Query: 770 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 829
G+EVAELPLVAT GKG FQ LFS IE+LL L V ++VLPAAE+AESIWT KFGF+
Sbjct: 750 GQEVAELPLVATSNGNHGKGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQ 809
Query: 830 KMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
K+ ++L KY++ Q+ FKGTSML+K V
Sbjct: 810 KIKPDQLSKYRKSCCQILTFKGTSMLQKAV 839
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 317/879 (36%), Positives = 462/879 (52%), Gaps = 85/879 (9%)
Query: 27 KQDRELLTDNVESDSFPNGKHAKEEALNDDIKSEVSNPVVSPKEFTSSL---QDITSQET 83
++D L D + P +E NDD+K + + S +I + E
Sbjct: 190 EEDSRKLEDVTIKEEIPKRNEPEEVLGNDDLKRYADGNDQCEERISGSSPNSMNIDNFEN 249
Query: 84 KLVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPK 143
+ S N++ T+ L E+ SD + E + +S NS + + +
Sbjct: 250 QNGEHSKNEMEKVTAMNELL--ESKSDMNNVNEEGTSMSSVILMNSEGGAIDSLPINDST 307
Query: 144 HVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGA--NDGVLCARNMEIK 201
V +R+ T K K + A ++ + + DD GS A N G + +K
Sbjct: 308 KVEKEKPMRRFTRSLLKPKMEIGQEYAVKD--SSSAADDAGSPSAASNSGTM------LK 359
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLC 256
+ K ++PT +K LL +GIL+G VKY+ S+ R L G+++G LC C C
Sbjct: 360 VWKNDTSKKFPTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSC 419
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG-S 315
++VV+ FE HAG+ + P +IYLENG + ++ K A L L+E + G S
Sbjct: 420 RGNEVVTPSIFEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASLETLDEALWLSTGCS 479
Query: 316 SFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEESISPTSCSFVQ 374
S +F + R L E +M L CS + L++S + +
Sbjct: 480 SLKNSTFCL-----KCRGKLAEASTGRSMTL---------CSQCMVLKDSQASIPATTDT 525
Query: 375 DTGREQTHM---KEMLEERKRGVKRPF------MHQKRTAEGGTKKRDNDLHRLLFLPNG 425
D G ++ + + +L + V + + ++G +D +H+L+F +
Sbjct: 526 DKGYAESDVCAYRIVLTPKSHPVSKSSDSVLKCSTSRSKSQGRLTVKDLRMHKLVFEEDV 585
Query: 426 LPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSN 485
LPDG + Y +GQ+L G K+G GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSN
Sbjct: 586 LPDGTEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYTSN 645
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
G++LH++AISL+ ++ +T +DD+C +C DG +LL C+ CP A+H CL IP W
Sbjct: 646 GVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLCCDVCPRAYHKDCLALPEIPTGRW 705
Query: 546 RCPNCRQG--------HSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRLS 584
C C H+++ + + G ++ A+ GGCV
Sbjct: 706 YCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITRRCIRIVKTMDADFGGCV----- 760
Query: 585 PSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 644
CR HDF F RTV+ CDQCEKEFHVGCL+ + + DLKE+PK WFCC
Sbjct: 761 --------FCRGHDFD-KIFGPRTVLLCDQCEKEFHVGCLKDHNMEDLKELPKGNWFCCS 811
Query: 645 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK 704
DC RIH+AL+ V + + SSL+ IN+K EK D + DV+W++L
Sbjct: 812 DCCRIHSALEKLVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLLNDKINPAGD 871
Query: 705 EKSLLSSATAIFRECFDPII-----AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSV 759
+LLS A AI E F+PI+ ++ RDLI MV+G N+ GQEFGGMY +L +
Sbjct: 872 TAALLSEALAILHEQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGMYCAVLMINQA 931
Query: 760 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 819
VVS ++R FG E+AELPLVAT + QGKG FQALF+CIE+LL LN++NLVLPAAE+AE
Sbjct: 932 VVSCAIIRFFGLELAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNLVLPAAEEAE 991
Query: 820 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
SIW KFGFRK++ E LK+++D+Q+ +F+GTSML K V
Sbjct: 992 SIWINKFGFRKLTHEEFLKFRKDYQMMVFQGTSMLHKPV 1030
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 496 bits (1278), Expect = e-137, Method: Compositional matrix adjust.
Identities = 280/668 (41%), Positives = 383/668 (57%), Gaps = 65/668 (9%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKV 261
VP +P +K+LL TGIL+ V+YI SR R L G++ G G LC C C +KV
Sbjct: 204 VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGESGLRGVIKGSGILCSCNSCKGTKV 263
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
V+ + FE HAG+ + P +IYLENG + ++ K A L L+E ++ G S
Sbjct: 264 VTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAWKNAALDSLDEAIRVAIGCSM---- 319
Query: 322 FRVWKASHHLR-KGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQ 380
+ K++ L KG + N K+ + H ++ +S S S + GR
Sbjct: 320 --IKKSTFCLNCKGRISEAGIGNSKVLCMLHELL----------VSKCSSSGSKSYGR-- 365
Query: 381 THMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQR 440
K+D LH+L+F NGLP+G + Y V+GQ+
Sbjct: 366 ----------------------------VTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQ 397
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+ +ISL+ G+
Sbjct: 398 LLVGYKRGSGIFCTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGR 457
Query: 501 RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSR 560
+ +DD+C +C DG NLL C+GCP FH C+ IP+ W C C + +
Sbjct: 458 EISVSDNDDLCSICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCN--NMLQKEK 515
Query: 561 SVDLKGGLEAPGAEVG---------GCV-ICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
V+ A G G C+ I E LCR H+FS + F RTV+
Sbjct: 516 FVEHNANAVAAGRVAGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVM 575
Query: 611 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS 670
CDQCEKEFHVGCLR++ + DLKE+PK KWFCC DC RI+++LQ V + + +P + L+
Sbjct: 576 LCDQCEKEFHVGCLREHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLT 635
Query: 671 TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRD 730
TI K+ G D++W+++ + + SLLS A +IF E FDPI GRD
Sbjct: 636 TIKEKYGRNGSACSKD-PDIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADAAGRD 694
Query: 731 LIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
L+P MV+G++ +FGGMY ILT+ S VVSA RIFG+EVAELPLVAT + QG+G
Sbjct: 695 LLPDMVHGKSTREWDFGGMYCAILTISSQVVSAAAFRIFGKEVAELPLVATRSDCQGQGY 754
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
FQ LFSC+E LL L V +LVLPAAE AESIWT KFGF K+++E+ ++RD+Q+ F+G
Sbjct: 755 FQTLFSCLEGLLGVLEVRSLVLPAAEGAESIWTNKFGFNKVTQEQRNNFRRDYQMVTFQG 814
Query: 851 TSMLEKKV 858
T ML+K V
Sbjct: 815 TLMLQKLV 822
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 282/689 (40%), Positives = 391/689 (56%), Gaps = 69/689 (10%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRER--QLDGIVNGGGYLCGCPLCNFSKV 261
K +V ++ P VK+L TG+LDG V Y+ ++ +L G + GG LC C LCN +V
Sbjct: 87 KIIVVHKKPATVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
+ +FE HA + +I LENGK + +++ + APL LE ++ S E
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 322 FRV------WKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQD 375
F + +S+ R GL+ +SC E S +
Sbjct: 207 FTCKNCRGCFPSSNVERVGLL----------------CLSCVESRKSEK------SSIHA 244
Query: 376 TGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYI 435
G+ K ++ + + + + ++ KR LH+L+F +GLP+G + Y
Sbjct: 245 VGKRIRSSKLSVKLKTAPITSKCLSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAYY 304
Query: 436 VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS 495
+GQ+L G K +GIVC CCN EISPSQFE HAG A+RR+PY IYTSNG++LH++AI
Sbjct: 305 ARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYTSNGVSLHELAIF 364
Query: 496 LAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQG-- 553
L+ + TT +D +C VC DG NLLLC+GCP AFH C IP W C C+
Sbjct: 365 LSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGEWYCQICQHTFL 424
Query: 554 ------HSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRLSPSENFDIRLC 594
H++ + ++G ++ GAE+GGCV LC
Sbjct: 425 RERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCV-------------LC 471
Query: 595 RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQ 654
RS DFS + F RT+I CDQCEKE+HVGCLR + + LKE+P+ WFCC+DC RIH+ L+
Sbjct: 472 RSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLE 531
Query: 655 DFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATA 714
+ + A+ +P S L I +K + + L DV+W++L + + + LL A +
Sbjct: 532 NLLIRVAERLPESLLDVIKKKQVGR-CLEPLNEIDVRWKLL-NGKIASPETRPLLLEAVS 589
Query: 715 IFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 773
+F ECFDPI+ GRDLIP MVYGRN+ Q+FGGMY +L V S VVSAG++RIFGR++
Sbjct: 590 MFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDI 649
Query: 774 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 833
AELPLVAT + +GKG FQ LF+CIERLL LNV+NLVLPAAE+A SIWT+KFGF KM
Sbjct: 650 AELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTEKFGFSKMKP 709
Query: 834 ERLLKYQRD-FQLTIFKGTSMLEKKV-QC 860
+L Y+ + Q+ FKGT+ML K V QC
Sbjct: 710 NQLTNYRMNCHQIMAFKGTNMLHKTVPQC 738
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 281/689 (40%), Positives = 390/689 (56%), Gaps = 65/689 (9%)
Query: 200 IKMSKKV-VPNEYPTNVKKLLSTGILDGACVKYISTSRER--QLDGIVNGGGYLCGCPLC 256
+K +KK+ V ++ P VK+L TG+LDG V Y+ ++ +L G + GG LC C LC
Sbjct: 90 LKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCRLC 149
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSS 316
N +V+ +FE HA + +I LENGK + +++ + APL LE ++ S
Sbjct: 150 NGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDLMRACRAAPLHTLEATIQNFINSP 209
Query: 317 FNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDT 376
E F R L ++ P H ++ + + +++
Sbjct: 210 PEEKYF----TCKSCRGPLGQYYS------PIHVHVVLLNLNSVSLLKL--------RNS 251
Query: 377 GREQTHMKEMLEERKRGV--KRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTY 434
GR++ L + + V + + ++ KR LH+L+F +GLP+G + Y
Sbjct: 252 GRQEQSWSSKLSVKLKTVPITSKCLSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAY 311
Query: 435 IVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI 494
+GQ+L G K GIVC CCN E+SPSQFE HAG A+RR+PY +IYTSNG++LH++AI
Sbjct: 312 YARGQKLLEGIKTRCGIVCRCCNTEVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHELAI 371
Query: 495 SLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
L+ + TT +D +C VC DG NLLLC+GCP AFH C IP W C C+ H
Sbjct: 372 FLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGEWYCQICQ--H 429
Query: 555 SSSMSRSVDLKGGLEAP-----------------------GAEVGGCVICRLSPSENFDI 591
+ R V A GAE+GGCV
Sbjct: 430 TFLRERPVLYNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCV------------ 477
Query: 592 RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHA 651
LCRS DFS + F RT+I CDQCEKE+HVGCLR + LKE+P+ WFCC+DC IH+
Sbjct: 478 -LCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKAYLKELPEGDWFCCNDCTIIHS 536
Query: 652 ALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSS 711
L++ + A+ +P + L I +K +E+ L DV+W++L + + + LL
Sbjct: 537 TLENLLIRVAERLPEALLDVIKKKQVER-CLEPLNEIDVRWKLL-NGKIASPETRPLLLE 594
Query: 712 ATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFG 770
A ++F ECFDPI+ GRDLIP MVYGRN+ Q+FGGMY +L V S VVSAG++RIFG
Sbjct: 595 AVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFG 654
Query: 771 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 830
R++AELPLVAT + +GKG FQ LF+CIERLL LNV+NLVLPAAE+AESIWT+KFGF K
Sbjct: 655 RDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAESIWTEKFGFSK 714
Query: 831 MSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
M ++L Y+ + Q+ FKGT+ML K V
Sbjct: 715 MKLDQLTNYRMNCHQIMAFKGTNMLHKTV 743
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 289/731 (39%), Positives = 401/731 (54%), Gaps = 114/731 (15%)
Query: 152 RKITFKFSKRKEDYVAPLA--YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN 209
++ T K KED V L Y + + D ++GA + + + +KMSKK+ N
Sbjct: 1777 KRFTRSALKSKEDTVESLESDYNFCNSVAIGVDEKTNGAVRSLTSPKKLGLKMSKKIALN 1836
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQ 269
+ P ++ LL TG+L+G V Y + +L G + G G LC C LC S+VV +FE
Sbjct: 1837 KVPLTIRDLLETGMLEGYPVTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFEL 1896
Query: 270 HAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASH 329
HA RH +IYL+NGK ++ ++ K APL LE ++ GSF V ++
Sbjct: 1897 HACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAI------GSFPVKRS-- 1948
Query: 330 HLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEE 389
+ DE K+ L +S I ++ SP +
Sbjct: 1949 ------LPADEA--AKMDPLGNSCIKRNN-------SPATS------------------- 1974
Query: 390 RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGN 449
+H RT+E + RD LHRL+F GLPDG + Y G++L G K+G
Sbjct: 1975 ---------IH--RTSE---RARDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGF 2020
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDD 509
GI C CC+ E+S SQFEAHAG A+R++PY +IYTSNG++LH++AISL+ G++ + +DD
Sbjct: 2021 GIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDD 2080
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--------GHSSSMSRS 561
+C +CGDG NLLLC+GCP AFH C IP+ W C C+ H+++ +
Sbjct: 2081 LCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAA 2140
Query: 562 VDLKG------------GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV 609
+ G + P AEV CV LCR +DFS + F RT+
Sbjct: 2141 GRVSGVDPIEQITKRCIRIVNPEAEVSACV-------------LCRGYDFSKSGFGPRTI 2187
Query: 610 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 669
I CDQ E+P KWFCC +C RIH+ALQ + +P S L
Sbjct: 2188 ILCDQ-------------------ELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLL 2228
Query: 670 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
+ I KH KG+ N V+W++L + + + LLS A AIF + FDPII + G
Sbjct: 2229 NVIKEKHERKGLESIADYN-VRWRLL-SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTG 2286
Query: 729 RDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
RDLIP MVYGRN+ GQ+F G+Y ++TV S VVSAG+LR+FG+EVAELPLVAT + QG+
Sbjct: 2287 RDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGR 2346
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTI 847
G FQ LFSCIE+LL LNV + VLPAAE+AE IWTKKFGF+K++ ++L +Y++ F Q+
Sbjct: 2347 GYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMIS 2406
Query: 848 FKGTSMLEKKV 858
F+GT MLEK V
Sbjct: 2407 FQGTCMLEKGV 2417
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 234/353 (66%), Positives = 267/353 (75%), Gaps = 17/353 (4%)
Query: 439 QRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAM 498
Q++ G KQGNGIVC CC EISPSQFE+HAGM+ARRQPYRHIYTSNG+TLHDIAISLA
Sbjct: 17 QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLAN 76
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ-GHSSS 557
GQ TTG DDMC CGDG +L+ C CP AFHAACLD PE W CPNC + GH +
Sbjct: 77 GQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNKLGHGGN 136
Query: 558 MSRS--VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
+R + L ++ P +VGGC + CR+HDFS TFDDRTVI CDQC
Sbjct: 137 FARPIVIRLTRVVKTPEYDVGGCAV-------------CRAHDFSGDTFDDRTVILCDQC 183
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 675
EKEFHVGCLR++GLCDLKEIPKD WFCC DCN I+ AL++ VS QTIPAS L+ INRK
Sbjct: 184 EKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLNIINRK 243
Query: 676 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVM 735
H+EKG+L D DVQWQ+L + ++ SLLS A AIFRECFDPI+A+ GRDLIPVM
Sbjct: 244 HVEKGLLVDEAAYDVQWQIL-MGKSRNREDLSLLSGAAAIFRECFDPIVAKTGRDLIPVM 302
Query: 736 VYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
VYGRNISGQEFGGMY V+LTV+ VVVSAGLLRIFGREVAELPLVAT RE+QGK
Sbjct: 303 VYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGK 355
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 284/715 (39%), Positives = 395/715 (55%), Gaps = 72/715 (10%)
Query: 181 DDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSR---- 236
DD GSS V KM ++PT +K LL +GIL+G VKY+ +
Sbjct: 264 DDTGSSS----VAAITTTPTKMFAIDGLKKFPTKLKDLLDSGILEGQKVKYLRGPKVRGP 319
Query: 237 -ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQ 295
E+ L G+V G LC C C +VV+ FE HAG+ + P +I+LENG + ++
Sbjct: 320 GEKGLHGVVKESGILCFCDDCKGKEVVTPTIFELHAGSANKRPPEYIFLENGNTLRDVMN 379
Query: 296 ELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIIS 355
K + L IL+E ++ G F K S+ + ++ S +
Sbjct: 380 ACKNSSLDILDEAIRLSIG-------FTPSKKSNFCLSC------RGSITGAGTRKSKVL 426
Query: 356 CSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDND 415
CS LE S + DT +E+T + E + + + ++G K+D
Sbjct: 427 CSQ-CLELKDSQAILAPETDT-KERTPRPSPVPESSSALLKS-SPSRSNSQGRLTKKDIR 483
Query: 416 LHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARR 475
+H+L+F LPDG + Y +G++L G K+G GI C CCN E+SPSQFEAHAG A+RR
Sbjct: 484 MHKLVFEEEVLPDGTEVGYYSQGKKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRR 543
Query: 476 QPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACL 535
+PY HIYTSNG++LH++AISL+ +R +T +DD+C +C DG LL C+ CP AFH CL
Sbjct: 544 KPYLHIYTSNGVSLHELAISLSKCRRHSTKENDDLCQICRDGGKLLCCDVCPRAFHQECL 603
Query: 536 DPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPG-------------------AE 574
IP+ W C C + R+ + G AE
Sbjct: 604 SLPSIPKGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGTDPIEQITRRCIRIVKTFEAE 663
Query: 575 VGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK- 633
VGGCV CR HDF TF RTVI CDQCEKEFHVGCL+++ + DLK
Sbjct: 664 VGGCV-------------FCRGHDFE-RTFGPRTVIICDQCEKEFHVGCLKEHQMQDLKA 709
Query: 634 --EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQ 691
E+P KWFCC C RIH+ALQ V + +P SSL+ I +KH E+ G +D++
Sbjct: 710 ICELPTGKWFCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKKH-EESASESGGGDDIR 768
Query: 692 WQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR------DLIPVMVYGRNISGQE 745
W++L K + +SLLS A AIF E F PI + + D IP MV G ++ GQ+
Sbjct: 769 WRLLSKKTDPSDVTESLLSEAVAIFHERFAPITVDKSKRKRDDHDFIPSMVKGGDMKGQD 828
Query: 746 FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSL 805
GGMY +L V VVSA ++RIFG+E+AELP+VAT + QG+G FQ LF+CIE+LL L
Sbjct: 829 LGGMYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSKSQGQGYFQTLFTCIEKLLGFL 888
Query: 806 NVENLVLPAAEKAESIWTKKFGFRKMSRE--RLLKYQRDFQLTIFKGTSMLEKKV 858
NV+NLVLPAAE+ ESIWT KFGF ++++ RL++Y++ +Q+ F+G+ ML+K V
Sbjct: 889 NVKNLVLPAAEEVESIWTNKFGFSTITQDEVRLMEYRKSYQIMEFQGSLMLQKPV 943
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 294/766 (38%), Positives = 405/766 (52%), Gaps = 83/766 (10%)
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCA--RNMEIKMSKKVVPNEYPTNVKKLLSTGI 223
+ PLA + + D G + L +E+KMSKK+ N+ P V++L TG+
Sbjct: 172 ITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGL 231
Query: 224 LDGACVKYISTSR--ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNH 281
L+G V Y+ + + L G + G LC C CN +V+ +FE HA + + +
Sbjct: 232 LEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQY 291
Query: 282 IYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF--RVWKASHHLRKGLV--- 336
I LENGK + +++ K + LE V+ + SS E F R K G V
Sbjct: 292 ICLENGKSLLDLLKACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSSVGQVGPL 350
Query: 337 ------EHDEKHNMKLPSLPHSII-----------------------------------S 355
K + LP+ P S I S
Sbjct: 351 CPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKS 410
Query: 356 CSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTA---------- 405
SS S SP S R + K L+ RK+ +K M + +
Sbjct: 411 KSSEYTSISRSPRSAPM-----RIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAK 465
Query: 406 -EGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
+ +D LH+L+F +GLPDG + Y +GQ+L G K+G+GI+C CCN +SPSQ
Sbjct: 466 NQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQ 525
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FE HAG ++R++PY +IYTSNG++LH++AISL+ G++ + +DD+C +C DG NLLLC+
Sbjct: 526 FEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCD 585
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPGAE-----VGG 577
GCP AFH C P W C C+ + +V+ G +
Sbjct: 586 GCPRAFHKECASLSSTPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKR 645
Query: 578 CV-ICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 636
C+ I R ++ LCR DFS + F RT+I CDQCEKEFHVGCL+ + + LKE+P
Sbjct: 646 CIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELP 705
Query: 637 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTMNDVQWQM 694
+ KWFC C RIH+ALQ + + +P S L +NRK E I D DV W++
Sbjct: 706 RGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVD---VDVSWRL 762
Query: 695 LKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVI 753
+ + + + LLS A AIF + FDPI+ GRDLIP MVYGR++ GQEFGGMY I
Sbjct: 763 I-SGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAI 821
Query: 754 LTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLP 813
L V S VVSA +LR+FG+++AELPLVAT GKG FQ LFSCIERLL L V+ LVLP
Sbjct: 822 LIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLP 881
Query: 814 AAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
AAE+AESIWT+KFGF ++ ++L Y+R Q+ FKGTSML+K V
Sbjct: 882 AAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTV 927
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 275/710 (38%), Positives = 407/710 (57%), Gaps = 80/710 (11%)
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLCGCPLCNFSKVVSA 264
++P+ +K LL+TGIL+G V Y+ ++ E+ L G++ G LC C +CN +VV+
Sbjct: 312 KFPSKLKDLLATGILEGLPVMYMKGAKVLFAGEKGLQGVIQDSGVLCFCKICNGVEVVTP 371
Query: 265 HEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTA------PLGILEEVVKKVAGSSFN 318
FE HAG+ + P +IY+ +G ++++ A PL ++E V+K+ G
Sbjct: 372 TVFELHAGSANKRPPEYIYIHDGN-CGKTLRDVMNACCCCDFPLESMDEAVQKLLGD--- 427
Query: 319 EGSFRVWKASHHLR-KGLVEHDEKHNMKL---------------------PSLPHSIISC 356
F + K+S L +G + K L P P ++
Sbjct: 428 ---FTMKKSSICLNCRGACKGVSKLVCDLCLASPPQTAMASRKVISQPVQPRSPEPVVIQ 484
Query: 357 SSL-------ALEESISPTSC-SFVQ----DTG-REQTHMKEMLEERKRGVKRPFMHQKR 403
SL +L+ + P S + VQ DTG + ++ M RG
Sbjct: 485 KSLDNEVQPNSLDNEVPPNSLDNEVQPNSLDTGVQPKSFSNGMKHSASRG---------- 534
Query: 404 TAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPS 463
++G ++D LH+L+F + LPDG L Y GQ+L G K+G GI C CCN+++S S
Sbjct: 535 KSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKGCGIFCTCCNEQVSAS 594
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR-TTGGSDDMCHVCGDGENLLL 522
QFEAHAG A+RR+PY HIYTSNG++LH+++ISL+ RR + +DD+C +C DG +LL
Sbjct: 595 QFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLC 654
Query: 523 CNGCPLAFHAACLDPLLIPESGWRCPNC-----RQGHSSSMSRSVDLKGGLEAPGA-EVG 576
C+GCP AFH C+ IP W C C + H ++ G + P E+
Sbjct: 655 CDGCPRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELM 714
Query: 577 GCVICRLSPSENFD---IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 633
R+ + D LC +FS +F RTVI CDQCEKE+HVGCL+++ + +L+
Sbjct: 715 NKRCIRVVKTVEVDHGGCALCSRPNFSK-SFGPRTVIICDQCEKEYHVGCLKEHNMENLE 773
Query: 634 EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQ 693
++P+ WFC +C+ IH AL D V+++ + +P LS I +KH EK + G DV+W+
Sbjct: 774 KLPEGNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEKSLEI-GAGLDVKWR 832
Query: 694 MLKKAQCFEEKE----KSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGG 748
++ + + + LLS A AIF E FDPI+ + GRD IP M++GRNI GQ+F G
Sbjct: 833 VMNWKLDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTMLFGRNIRGQDFSG 892
Query: 749 MYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVE 808
+Y +LTV +VSAG+ R+FG E+AELPLVAT ++QG+G FQ LFSCIE LL SLNV+
Sbjct: 893 IYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVK 952
Query: 809 NLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
NLVLPAA++AESIWT KFGF K+ ++ + KY++ +++ IF+GTS+L+K V
Sbjct: 953 NLVLPAADEAESIWTGKFGFTKLPQDEINKYKKFYRMMIFQGTSVLQKPV 1002
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 277/744 (37%), Positives = 419/744 (56%), Gaps = 78/744 (10%)
Query: 183 LGSSGANDGV----------LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYI 232
+G +G +DGV + +K S + ++P ++ LL+TGIL+G V Y+
Sbjct: 244 VGVAGISDGVKRETEAGASLVMTTPSSVKFSNRGKLKKFPAKLRDLLATGILEGLPVMYM 303
Query: 233 STSR-----ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+ E+ L G++ G LC C +C +VV+ FE HAG+ + P +IY+ +G
Sbjct: 304 KGVKVLFDGEKGLQGVIQDSGVLCFCKICKGVEVVTPTVFELHAGSANKRPPEYIYIHDG 363
Query: 288 KPIYSIIQELKTA-----PLGILEEVVKKVAGSSFNEGSFRVWKASHHLR-----KGL-- 335
++ + PL ++E V+K+ G F + K+S L KG+
Sbjct: 364 NSGKTLRDVMNACCCCDFPLESMDEAVQKLLGD------FTMKKSSICLNCRGACKGVSR 417
Query: 336 --------------VEHDEKHNMKL-PSLPHSIISCSSLALEESISPTSC-SFVQ----D 375
V ++ + + P P ++ SL + + P S + VQ D
Sbjct: 418 LVCDSCLVSPAQTAVASNKGISQPVQPRSPEPVVIQKSL--DNEVQPNSLHNEVQPNKLD 475
Query: 376 TGREQTHMKEMLEE-------RKRGVKRPFMHQ--KRTAEGGTKKRDNDLHRLLFLPNGL 426
TG + + +E + + H + ++G ++D LH+L+F + L
Sbjct: 476 TGMQPNSLDNGMEPDSLNNSMKPKSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVL 535
Query: 427 PDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG 486
PDG L Y GQ+L G K+G GI C CCN+++S SQFEAHAG A+RR+PY HIYTSNG
Sbjct: 536 PDGTELAYYAHGQKLLVGYKKGYGIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNG 595
Query: 487 MTLHDIAISLAMGQRR-TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
++LH+++ISL+ RR + +DD+C +C DG +LL C+GCP AFH C+ IP W
Sbjct: 596 ISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGTW 655
Query: 546 RCPNC-----RQGHSSSMSRSVDLKGGLEAPGA-EVGG--CV-ICRLSPSENFDIRLCRS 596
C C + H ++ G + P E+ C+ + R ++ LC
Sbjct: 656 YCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELMNKRCIRVVRTLEVDHGGCALCSR 715
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
+FS +F +TVI CDQCEKE+HVGCL+ + + +L+E+P WFC +C++IH AL D
Sbjct: 716 PNFSK-SFGPQTVIICDQCEKEYHVGCLKDHNMENLEELPVGNWFCSGNCSQIHTALMDL 774
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK-KAQCFEEKEKSLLSSATAI 715
V+++ + +P L+ I +KH EK + G DV+W+++ K + + LLS A AI
Sbjct: 775 VASKEKDVPDPLLNLIKKKHEEKSLDI-GAGLDVKWRVINWKLDSDSVETRKLLSKAVAI 833
Query: 716 FRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 774
F E FDPI+ + GRD IP M++GRNI GQ+F G+Y +LTV +VSAG+ R+FG E+A
Sbjct: 834 FHERFDPIVDSTSGRDFIPAMLFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGLEIA 893
Query: 775 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 834
ELPLVAT ++QG+G FQ LFSCIE LL SLNV+NLVLPAA++AESIWT KFGF K+ ++
Sbjct: 894 ELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQD 953
Query: 835 RLLKYQRDFQLTIFKGTSMLEKKV 858
+ KY++ +++ IF+GTS+L+K V
Sbjct: 954 EINKYKKFYRMMIFQGTSVLQKPV 977
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101203549 [Cucumis sativus]
Length = 946
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 296/775 (38%), Positives = 406/775 (52%), Gaps = 92/775 (11%)
Query: 166 VAPLAYEEGRNYTLYDDLG--SSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGI 223
+ PLA + + D G S N +E+KMSKK+ N+ P V++L TG+
Sbjct: 172 ITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGL 231
Query: 224 LDGACVKYISTSR--ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNH 281
L+G V Y+ + + L G + G LC C CN +V+ +FE HA + + +
Sbjct: 232 LEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQY 291
Query: 282 IYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF--RVWKASHHLRKGLV--- 336
I LENGK + +++ K + LE V+ + SS E F R K G V
Sbjct: 292 ICLENGKSLLDLLKACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSSVGQVGPL 350
Query: 337 ------EHDEKHNMKLPSLPHSII-----------------------------------S 355
K + LP+ P S I S
Sbjct: 351 CPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKS 410
Query: 356 CSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTA---------- 405
SS S SP S R + K L+ RK+ +K M + +
Sbjct: 411 KSSEYTSISRSPRSAPM-----RIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAK 465
Query: 406 -EGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
+ +D LH+L+F +GLPDG + Y +GQ+L G K+G+GI+C CCN +SPSQ
Sbjct: 466 NQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQ 525
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FE HAG ++R++PY +IYTSNG++LH++AISL+ G++ + +DD+C +C DG NLLLC+
Sbjct: 526 FEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCD 585
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPGAE-----VGG 577
GCP AFH C IP W C C+ + +V+ G +
Sbjct: 586 GCPRAFHKECASLSSIPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKR 645
Query: 578 CV-ICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 636
C+ I R ++ LCR DFS + F RT+I CDQCEKEFHVGCL+ + + LKE+P
Sbjct: 646 CIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELP 705
Query: 637 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTMNDVQWQM 694
+ KWFC C RIH+ALQ + + +P S L +NRK E I D DV W++
Sbjct: 706 RGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVD---VDVSWRL 762
Query: 695 LKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVI 753
+ + + + LLS A AIF + FDPI+ GRDLIP MVYGR++ GQEFGGMY I
Sbjct: 763 I-SGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAI 821
Query: 754 LTVKSVVVSAGLLRIFGR---------EVAELPLVATCREYQGKGCFQALFSCIERLLCS 804
L V S VVSA +LR+FG+ ++AELPLVAT GKG FQ LFSCIERLL
Sbjct: 822 LIVNSFVVSAAMLRVFGQYCRAAIGCXDIAELPLVATSNGNHGKGYFQTLFSCIERLLAF 881
Query: 805 LNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
L V+ LVLPAAE+AESIWT+KFGF ++ ++L Y+R Q+ FKGTSML+K V
Sbjct: 882 LKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTV 936
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 460 bits (1184), Expect = e-126, Method: Compositional matrix adjust.
Identities = 276/695 (39%), Positives = 386/695 (55%), Gaps = 66/695 (9%)
Query: 197 NMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLC 251
N KM ++PT +K LL +GIL+G VKY+ ++ E+ L G+V G LC
Sbjct: 300 NTPTKMFSIDGSKKFPTKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILC 359
Query: 252 GCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKK 311
C C +VV+ F HAG+ + P +I LENG + ++ K + L L+E ++
Sbjct: 360 FCDDCKGKEVVTPAIFVLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRL 419
Query: 312 VAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLA-LEESISPTSC 370
G S ++ S W + R + + + L S C L + +P +
Sbjct: 420 STGFSPSKKSNFCW----NCRGSITGAGSRKSKVLCS------QCFGLKDFQAGSAPKTA 469
Query: 371 SFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGE 430
+E+T + E + + ++G K+D H+L+F LPDG
Sbjct: 470 K------KERTAKPHSVPESSCNLLKS-SLSGSKSQGRVTKKDIRTHKLVFEEEVLPDGT 522
Query: 431 RLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLH 490
+ Y +G++L G K+G GI C CCN E+SPSQFEAHAG A+RR+PY +IYTSNG++LH
Sbjct: 523 EVGYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAHAGWASRRKPYLNIYTSNGVSLH 582
Query: 491 DIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
++AISL+ G+R + +DD+C +C DG LL C+ CP AFH CL IP W C C
Sbjct: 583 ELAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPRGKWYCKYC 642
Query: 551 RQGHSSS--MSRSVDLKGGLEAPG-------------------AEVGGCVICRLSPSENF 589
+ R+ + G AEVGGCV
Sbjct: 643 LNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRIVKTFEAEVGGCV---------- 692
Query: 590 DIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 649
CR HDF TF RTVI CDQCEKEFHVGCL+++ + DLKE+PK KWFCC C RI
Sbjct: 693 ---FCRGHDFE-RTFGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKWFCCTGCERI 748
Query: 650 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 709
H+ALQ V + +P SSL+ I +KH E+ G +DV+W++L K + ++LL
Sbjct: 749 HSALQKLVIRGEEKLPDSSLNFI-KKH-EESASESGCSDDVRWRLLSKKTDSSDVTEALL 806
Query: 710 SSATAIFRECFDPIIAECGR------DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 763
S A AIF ECFDPI + + D IP MV G N+ GQ+ GGMY +L V VVVS
Sbjct: 807 SDAVAIFHECFDPITVDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLVNHVVVSV 866
Query: 764 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 823
++RIFG+E+AELP+VAT +QG+G FQ LF+CIE+LL LNV+NLVLPAAE+ SIW
Sbjct: 867 AVVRIFGQELAELPIVATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVGSIWK 926
Query: 824 KKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
KFGF ++++ L++Y+R +Q+ +F+G ML+K V
Sbjct: 927 NKFGFGAITQDELMEYRRRYQIMVFQGALMLQKPV 961
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 271/682 (39%), Positives = 386/682 (56%), Gaps = 54/682 (7%)
Query: 200 IKM-SKKVVPNE---YPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYL 250
+KM S +VP+ +PT +K+LL TGIL+ V+YI R ++ L G++ G G L
Sbjct: 197 VKMGSSALVPSTLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGIL 256
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
C C C + VV+ + FE HAG+ + P +IYLENG + S++ A L L+E ++
Sbjct: 257 CYCDTCKGTNVVTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIR 316
Query: 311 KVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEES-ISPT 368
GSS + +F + K ++ S++ C S + L+ES SP
Sbjct: 317 VAIGSSIKKSTFCF--------------NCKGSISEVGTSDSLVLCESCVGLKESHASPA 362
Query: 369 SCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPD 428
+ D + T + + + R K+D LH+L F N LP+
Sbjct: 363 QPTGTSDRSTKTTTVSKCSSSGSKNYGR------------VTKKDVGLHKLAFGENDLPE 410
Query: 429 GERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT 488
G ++Y V+G+RL G K+G I+C CCN E+SPSQFEAH+G A+RR+PY HIYTSNG++
Sbjct: 411 GSEVSYYVRGERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVS 470
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
LH++++SL G+ + +D++C +C DG LL C+GCP FH C+ IP+ W C
Sbjct: 471 LHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCK 530
Query: 549 NC-----------RQGHSSSMSRSVDLKGGLEAPGAEVGGCV-ICRLSPSENFDIRLCRS 596
C R ++ + R GG++ C+ I + E LCR
Sbjct: 531 FCLNTLQKGKFVERNANAVAAGR----MGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRR 586
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
H+FS + F TV+ CDQCEKEFHVGCL+ + + DLK +PK KWFCC DC I+++L+
Sbjct: 587 HEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKI 646
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 716
V + + +P L I +++ KG + G D++W++L + + SLLS A ++F
Sbjct: 647 VVRQEEELPDDVLRIIKKRYGRKGSVCSGN-PDIKWRLLHGRRASATEAGSLLSQALSLF 705
Query: 717 RECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAEL 776
E F+PI GRDL+ MV+ + EFGGMY ILTV VVSA R+ G+EVAEL
Sbjct: 706 HEQFNPIADAEGRDLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAEL 765
Query: 777 PLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERL 836
PLVAT + QG+G FQAL++CIERLLC L V +LVLPAAE AES+W KF F KM +E L
Sbjct: 766 PLVATRSDCQGQGYFQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEEL 825
Query: 837 LKYQRDFQLTIFKGTSMLEKKV 858
RDFQ+ F+GTSML+K V
Sbjct: 826 NHLCRDFQMMTFQGTSMLQKPV 847
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 454 bits (1167), Expect = e-124, Method: Compositional matrix adjust.
Identities = 269/653 (41%), Positives = 364/653 (55%), Gaps = 61/653 (9%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKV 261
VP +P +K+LL TGIL+ V+YI SR R L G++ G G LC C C +KV
Sbjct: 204 VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGESGLRGVIKGSGILCSCNSCKGTKV 263
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
V+ + FE HAG+ + P +IYLENG + ++ K A L L+E ++ G S
Sbjct: 264 VTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAWKNAALDSLDEAIRVAIGCSM---- 319
Query: 322 FRVWKASHHLR-KGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQ 380
+ K++ L KG + N K ++ S L L+ES + S Q TG
Sbjct: 320 --IKKSTFCLNCKGRISEAGIGNSK-------VLCLSCLQLKESQASPS----QVTGSSD 366
Query: 381 THM---KEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK 437
+H+ K R + G K+D LH+L+F NGLP+G + Y V+
Sbjct: 367 SHLRSPKPSTISRSAESVSKCSSSGSKSYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVR 426
Query: 438 GQ---RLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI 494
GQ +L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+ +I
Sbjct: 427 GQVVTQLLVGYKRGSGIXCTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSI 486
Query: 495 SLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
SL+ G+ + +DD+C +C DG NLL C+GCP FH C+ IP+ W C C +
Sbjct: 487 SLSRGREISVSDNDDLCSICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCN--N 544
Query: 555 SSSMSRSVDLKGGLEAPGAEVG---------GCV-ICRLSPSENFDIRLCRSHDFSAATF 604
+ V+ A G G C+ I E LCR H+FS + F
Sbjct: 545 MLQKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGF 604
Query: 605 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 664
RTV+ CDQCEKEFHVGCLR++ + DLKE+PK KWFCC DC RI+++LQ V + + +
Sbjct: 605 GPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEEL 664
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
P + L+TI K+ G D++W+++ + + SLLS A +IF E FDPI
Sbjct: 665 PHNVLTTIKEKYGRNGSACSKDP-DIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIA 723
Query: 725 AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCRE 784
GRDL+P MV+G S VVSA RIFG+EVAELPLVAT +
Sbjct: 724 DAAGRDLLPDMVHG-------------------SQVVSAAAFRIFGKEVAELPLVATRSD 764
Query: 785 YQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLL 837
QG+G FQ LFSC+E LL L V +LVLPAAE AESIWT KFGF K+++E+ +
Sbjct: 765 CQGQGYFQTLFSCLEGLLGVLEVRSLVLPAAEGAESIWTNKFGFNKVTQEQYI 817
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 273/702 (38%), Positives = 381/702 (54%), Gaps = 62/702 (8%)
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFS 259
MSKK+ P VK+L TG+L+G V Y+ + + L G + G LC C CN
Sbjct: 1 MSKKIALENVPMTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDAGILCSCAFCNGH 60
Query: 260 KVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKK-------- 311
+V+ +FE HA + R +I ENGK + ++ +TAPL LE ++
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 312 ------------VAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSL 359
+A F S + RK + +E+ ++ PS S+ S
Sbjct: 121 RTFTCKRCKEQVLALEYFKPASLSTSSQDNTPRKKKRKPEEQDSITKPSKSASVYLSSRK 180
Query: 360 ALEESISPTSCSFVQDT----GREQTH-------MKEMLEERKRGVKRPFMHQKRTAEGG 408
+ ISP F G +K + RK PF G
Sbjct: 181 RKYKKISPRLVCFFYPIDILFGLVMLSPFPFLWLVKIFVFIRKYAYLSPF-----CPFSG 235
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
+ +D LHRL+F GLPDG L Y +GQ + + NK S SQ +
Sbjct: 236 YQSQDQRLHRLVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIVNKINSSSQ-KLL 294
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
G Y +IYTSNG++LH++AISL+ ++ ++ +DD+C +C DG NLLLC+GCP
Sbjct: 295 GG-------YAYIYTSNGVSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPR 347
Query: 529 AFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSP--- 585
AFH C +P W C C+ ++ + V+ A G + G I +++
Sbjct: 348 AFHKGCASIPTVPSGDWYCQYCQ--NTFEREKLVEHNANASAAGRDSGIDSIEQITKRCF 405
Query: 586 -------SENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 638
+E LCR +DF + F RT+I CDQCEKEFHVGCLR + + +LKE+PK
Sbjct: 406 RIVKNIEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKG 465
Query: 639 KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKA 698
WFCC DC+RIH+ LQ + A+ +P S L+ I +KH E+G+ + DV+W +L
Sbjct: 466 NWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEERGLNISNNI-DVRWTLLS-G 523
Query: 699 QCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVK 757
+ + K LLS A +IF+ECFDPI+ + GRDLIP+MVYG+N GQ++GGMY +LT+
Sbjct: 524 KIASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTIN 583
Query: 758 SVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEK 817
S +VSAG+LR+FG EVAELPLVAT GKG FQ LFSCIE+LL LNV+NLVLPAAE+
Sbjct: 584 SSIVSAGILRVFGEEVAELPLVATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEE 643
Query: 818 AESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
AESIWT+KFGF+K+ E+L KY++ Q+ F+GTSML+K V
Sbjct: 644 AESIWTEKFGFQKIKPEQLNKYRKSCCQMVRFEGTSMLQKAV 685
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 261/697 (37%), Positives = 376/697 (53%), Gaps = 72/697 (10%)
Query: 198 MEIKMSKKVV-PNEYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPL 255
ME+KMSKKV +++P N + LLSTG+L+G V YI S++ L G++ G C C
Sbjct: 507 MEMKMSKKVAFLSKHPGNTRDLLSTGLLEGMPVMYIIPNSKKPVLKGVIAGCNIRCFCVK 566
Query: 256 CNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGS 315
C+ SK ++ + FE HAG+ +HP +IYL NG + +++ +++PL L++ ++
Sbjct: 567 CDGSKTITTYFFELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDP 626
Query: 316 SFNEGSFRVWKASHHLRKGLVE----HDEKHNMKLPSLPHSIISCSSLALEESISPTSCS 371
+ L E HD P P S ++CS
Sbjct: 627 MLIRTRMNCLNCNGELPSQTEEQFLCHDCCPESNQPQDPTSPLACSKS------------ 674
Query: 372 FVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGER 431
+ KE L +R K + G +D LH+L+F L DG
Sbjct: 675 ----SSSLTPSSKESLLKRMSA-------SKGASTGKVTTKDTGLHKLVF--KVLLDGTE 721
Query: 432 LTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHD 491
+ Y V GQ+ G + I C+ CNK +SPS FEAHAG +RR+PY +I+TSNG++LH+
Sbjct: 722 VNYYVDGQKKIDGYIKDQRIYCNHCNKVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHE 781
Query: 492 IAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC- 550
+++S++ + + +DD+C CG G ++ C CP +FH AC+ ++P W C NC
Sbjct: 782 LSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCT 841
Query: 551 -------------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDI 591
RQ S+ + + + ++GGC
Sbjct: 842 ILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICDDLGGCA------------ 889
Query: 592 RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHA 651
LC+ DF+ A FD+RTVI CDQCEKE+HVGCLR DLKE+P+ +WFCCD C+ I +
Sbjct: 890 -LCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRS 948
Query: 652 ALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSS 711
+L +S A + S + I +KH KG++ D +++WQ++ E+ SLLSS
Sbjct: 949 SLDKMISEGAHPLSESDVDIIRKKHESKGLVMDAN-TEIRWQLVAGRSATEDG-NSLLSS 1006
Query: 712 ATAIFRECFDPII-AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLL 766
A + + FDPII A GRDLIP MV+GR + GQ++ GMY +LTV S VVSA LL
Sbjct: 1007 AVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALL 1066
Query: 767 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 826
R+ G +VAELPLVAT + QG G FQ LFSCIERLL SL V++ +LPAA +AE+IW KKF
Sbjct: 1067 RVMGGDVAELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKF 1126
Query: 827 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
GF K+ ++++ Y LT+F GT L K + LPE
Sbjct: 1127 GFSKIPQDQMEAYLNGGHLTVFHGTLNLYKAIP-LPE 1162
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 244/516 (47%), Positives = 330/516 (63%), Gaps = 20/516 (3%)
Query: 359 LALEESISPTSCSFVQD-TGREQTHMKEM--LEERKRGVKRPFMHQKRTAEGGTKKRDND 415
LALE + +F QD T R++ E L + V + ++ RD
Sbjct: 211 LALEYFKPASLSTFSQDNTLRKKKRKPEEPDLIAKPSKVASVHLSPRKRKYKKISPRDQR 270
Query: 416 LHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEI-SPSQFEAHAGMAAR 474
LHRL+F GLPDG L Y +GQ+L G K+G GI+C CCN E+ SPS FEAHAG A R
Sbjct: 271 LHRLVFEEGGLPDGTELAYYARGQKLLGGYKRGFGILCHCCNCEVVSPSTFEAHAGWATR 330
Query: 475 RQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC 534
++PY IYTSNG++LHD+AISL+ ++ ++ +DD+C +C DG +LLLC+GCP AFH C
Sbjct: 331 KKPYACIYTSNGVSLHDLAISLSKSRKYSSQDNDDLCIICADGGDLLLCDGCPRAFHKGC 390
Query: 535 LDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG--AEVGG-------CV-ICRLS 584
+P W C +C+ ++ + V+ A G +E+ C I +
Sbjct: 391 ASLSTVPSGDWYCQHCQ--NTFQREKFVEHNANAFAAGRVSEIDSIEQITKRCFRIVKNV 448
Query: 585 PSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 644
+E LCR +DF + F RT+I CDQCEKEFHVGCLR + + +LKE+PK WFCC
Sbjct: 449 EAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMANLKELPKGNWFCCM 508
Query: 645 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK 704
DC+RIH+ LQ + A+ +P S L+ I +KH EKG+ ++ DV+W +L + +
Sbjct: 509 DCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKGLNISNSI-DVRWTLLS-GKIASPE 566
Query: 705 EKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 763
K LLS A +IF+ECFDPI+ + GRDLIP+MVYG+N GQ++GGMY +L V S +VSA
Sbjct: 567 NKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLIVNSCIVSA 626
Query: 764 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 823
G+LR+FG EVAELPLVAT GKG FQ LFSCIE+LL LNV+NLVLPAAE+AESIW
Sbjct: 627 GILRVFGEEVAELPLVATRNGDHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWI 686
Query: 824 KKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
+KFGF+K+ E+L KY+++ Q+ F+GTSML+K V
Sbjct: 687 EKFGFQKIKPEQLSKYRKNCCQMVRFEGTSMLQKAV 722
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 59/111 (53%), Gaps = 2/111 (1%)
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFS 259
MSKK+ + P VK+L TG+L+G V Y+ + + L G + G LC C CN
Sbjct: 1 MSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCAFCNGR 60
Query: 260 KVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
+V+ +FE HA + R +I ENGK + ++ +TAPL LE ++
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQ 111
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 44/95 (46%), Gaps = 5/95 (5%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
KK+ P + + G+ DG + Y + ++ L G G G LC C CN +VVS
Sbjct: 262 KKISPRDQRLHRLVFEEGGLPDGTELAYYARG-QKLLGGYKRGFGILCHC--CN-CEVVS 317
Query: 264 AHEFEQHAGAKTR-HPNNHIYLENGKPIYSIIQEL 297
FE HAG TR P IY NG ++ + L
Sbjct: 318 PSTFEAHAGWATRKKPYACIYTSNGVSLHDLAISL 352
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 444 bits (1141), Expect = e-121, Method: Compositional matrix adjust.
Identities = 264/700 (37%), Positives = 385/700 (55%), Gaps = 58/700 (8%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNF 258
KK + N +P +K + GIL+G V Y+ ++ R+ L G++ G G LC C C
Sbjct: 371 KKFLRN-FPAKLKDIFDCGILEGLIVYYVRGAKVREAGTRGLKGVIKGSGVLCFCSACIG 429
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS FE HA + + P +I LE+G + ++ K PL LEE ++ V G
Sbjct: 430 IQVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILK 489
Query: 319 EGSFRVW----------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPT 368
+ S + S + K +E E PS + ++ SS S+ P
Sbjct: 490 KSSLCLSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSS---RPSVDPK 546
Query: 369 SC------SFVQDTGREQTHMKE---------MLEERKRGVKRPFMHQKRTAEGGTKKRD 413
S S Q REQ K +L E K + H K T ++D
Sbjct: 547 SILRRSKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLT------RKD 600
Query: 414 NDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAA 473
LH+L+F + LPDG + Y V G+++ G K+G GI C CCNK +SPS FEAHAG A+
Sbjct: 601 LRLHKLVFEDDILPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCAS 660
Query: 474 RRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAA 533
RR+P++HIYT+NG++LH+++++L+M QR + +DD+C +C DG L+ C+ CP ++H
Sbjct: 661 RRKPFQHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKV 720
Query: 534 CLDPLLIPESGWRCPNC-----RQGHSSSMSRSVDLK--GGLEAPGAEVGGCVICRLSPS 586
C +P W C C R+ S ++ G++A C+ R+ S
Sbjct: 721 CASLPSLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCI--RIVSS 778
Query: 587 ENFDIR----LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
++ LCR H F F+ RTVI CDQCEKEFHVGCL++ + DLKE+P++KWFC
Sbjct: 779 FVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFC 838
Query: 643 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQC 700
C I+ L + + + + + L+ + +K E+ T D++W++L
Sbjct: 839 SLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLT 898
Query: 701 FEEKEKSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQEFGGMYSVILTVKS 758
+ K LL+ A +I E FDPI +E G DLIP MVYGR Q+F GMY +L V
Sbjct: 899 SSDDTKILLAKALSILHERFDPI-SESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDE 957
Query: 759 VVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKA 818
V+VS G+ R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A
Sbjct: 958 VIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEA 1017
Query: 819 ESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+SIWT KFGF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 1018 KSIWTDKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 1057
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 254/664 (38%), Positives = 383/664 (57%), Gaps = 41/664 (6%)
Query: 210 EYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFE 268
++P N+++LL+TG+L+G V YI S++ L G++ G C C CN +K +SA+ FE
Sbjct: 209 KHPGNIRELLNTGMLEGMPVMYIIPHSKKAVLKGVITGCNIRCFCLSCNGAKAISAYYFE 268
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
QHAG+ +HP ++IYL NG + +++ +PL LEE ++ SS +
Sbjct: 269 QHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIR----SSIDP--------- 315
Query: 329 HHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFV-QDTGREQTHMKEML 387
+++ + + + LPS +++ L ++ P + S+ + KE L
Sbjct: 316 -VVKRSRINCLNCNELVLPSSHENVLCQVCLESKQPQDPLTASYTCNGSSSLSRSSKEAL 374
Query: 388 EERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQ 447
++ +K + G +DN LH+L+F N L DG + Y V GQR G +
Sbjct: 375 ------LRNISSGKKGGSAGKVTNKDNRLHKLVF--NVLLDGTEVAYYVDGQRKVDGYIK 426
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGS 507
+ I C+ CN+ +SPS FEAHAG +RR+PY +I+TSNG++LH++A+ ++ + +
Sbjct: 427 DHRIYCNHCNRVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELAMKISKDMELSERET 486
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--GHSSSMSRSVDLK 565
DD+C CG G ++ C CP +FH AC+ +P + W C +CR +++++ + K
Sbjct: 487 DDLCRECGQGGDIFPCKICPRSFHPACVGLSKVP-AEWYCDSCRNLVQKEKALAKNKNAK 545
Query: 566 GGLEAPGAEVGGCVI---CRLSPSENFDI---RLCRSHDFSAATFDDRTVIYCDQCEKEF 619
G + ++ R+ P + D+ LC+ DF+ A FD+RTVI CDQCEKE+
Sbjct: 546 AAGRQAGVDSIEQIMKRAIRIVPISD-DLGGCALCKQKDFNNAVFDERTVILCDQCEKEY 604
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
HVGCL+ +LKE+P+++WFCC C+ ++L +S+ AQ + L I +KH +
Sbjct: 605 HVGCLQSQWQVELKELPEEEWFCCSSCSETRSSLDKIISDGAQLLADPDLEIIKKKHETR 664
Query: 680 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG 738
G+ D T D++WQ+L + E+ LLS+A IF + FDPI A GRDLIP MV G
Sbjct: 665 GLCMD-TSKDLKWQLLSGKRATEDGS-ILLSAAVPIFHQSFDPIREALTGRDLIPEMVNG 722
Query: 739 RN----ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
R + GQ++ GMY +LTV S VVSA LLR+ G +VAELPLVAT ++ QG G FQAL
Sbjct: 723 RGPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVMGGDVAELPLVATSQDVQGLGYFQAL 782
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
FSCIER+L SL +++ VLPAA +AE IW KFGF ++S E L Y LTIF GTS +
Sbjct: 783 FSCIERVLVSLKIKHFVLPAAHEAEGIWMNKFGFSRISPEELEAYLNGAHLTIFHGTSYM 842
Query: 855 EKKV 858
K V
Sbjct: 843 YKAV 846
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 253/685 (36%), Positives = 376/685 (54%), Gaps = 80/685 (11%)
Query: 210 EYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFE 268
++P+N+++LL+TG+L+G V+YI +S++ L G++ G C C CN SK V ++ FE
Sbjct: 382 KHPSNIRELLNTGLLEGMPVRYIIPSSKKAVLKGVITGCNIRCFCLSCNGSKDVCSYFFE 441
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
QHAG+ +HP +HIYL NG + +++ +++PL LE+ ++ +
Sbjct: 442 QHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCLNCN 501
Query: 329 HHLRKGLVE----------HDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGR 378
HL E + K + PS ++ S SSL P+S F+
Sbjct: 502 EHLSSSQTEIFGSFLCQRCLEPKQHQDPPSPSYACKSNSSLI------PSSKDFL----- 550
Query: 379 EQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKG 438
+K+ ++ K + G +D LH+L+F L DG + Y V G
Sbjct: 551 ---------------LKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDG 593
Query: 439 QRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAM 498
QR G + I C+ CN+ +SPS FEAHAG RR+PY +I+TSNG++LH++++ ++
Sbjct: 594 QRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISK 653
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------- 550
+ +DD+C CG G ++ C CP +FH AC+ +P S W C NC
Sbjct: 654 DMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEK 712
Query: 551 ------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHD 598
RQ S+ + + + ++GGC LC+ D
Sbjct: 713 ALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISDDLGGCA-------------LCKQKD 759
Query: 599 FSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVS 658
F+ + FD+RTVI CDQCEKE+HVGCLR DLKE+P+ +WFCC+ C+ I ++L +S
Sbjct: 760 FNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIIS 819
Query: 659 NRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRE 718
+ A + S + I +KH KG+ D T D++W++L + E+ + LLS+A I +
Sbjct: 820 DGALILAESDIDIIRKKHEMKGLSMD-TNTDLRWRLLAGRKASEDGDL-LLSAAVPIIHQ 877
Query: 719 CFDPII-AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 773
FDPII + GRDLIP MV GR + GQ++ GMY +LT+ + VVSA LLR+ G EV
Sbjct: 878 SFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEV 937
Query: 774 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 833
AELPLVAT ++ QG G FQALFSCIER+L SL +++ +LPAA++AE IW KFGF K+ +
Sbjct: 938 AELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQ 997
Query: 834 ERLLKYQRDFQLTIFKGTSMLEKKV 858
E+ Y LTIF GTS L K +
Sbjct: 998 EQSDAYLNGAHLTIFHGTSNLYKAI 1022
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 254/687 (36%), Positives = 375/687 (54%), Gaps = 82/687 (11%)
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQ---LDGIVNGGGYLCGCPLCNFSKVVSAHE 266
++P+N+++LL+TG+L+G V+YI S + Q L G++ G C C CN SK V ++
Sbjct: 371 KHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAVLKGVITGCNIRCFCLSCNGSKDVCSYF 430
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWK 326
FEQHAG+ +HP +HIYL NG + +++ +++PL LE+ ++
Sbjct: 431 FEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCLN 490
Query: 327 ASHHLRKGLVE----------HDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDT 376
+ HL E + K + PS ++ S SSL P+S F+
Sbjct: 491 CNEHLSSSQTEIFGSFLCQRCLEPKQHQDPPSPSYACKSNSSLI------PSSKDFL--- 541
Query: 377 GREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIV 436
+K+ ++ K + G +D LH+L+F L DG + Y V
Sbjct: 542 -----------------LKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYV 582
Query: 437 KGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISL 496
GQR G + I C+ CN+ +SPS FEAHAG RR+PY +I+TSNG++LH++++ +
Sbjct: 583 DGQRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKI 642
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC------ 550
+ + +DD+C CG G ++ C CP +FH AC+ +P S W C NC
Sbjct: 643 SKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQK 701
Query: 551 --------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRS 596
RQ S+ + + + ++GGC LC+
Sbjct: 702 EKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISDDLGGCA-------------LCKQ 748
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
DF+ + FD+RTVI CDQCEKE+HVGCLR DLKE+P+ +WFCC+ C+ I ++L
Sbjct: 749 KDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKI 808
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 716
+S+ A + S + I +KH KG+ D T D++W++L + E+ + LLS+A I
Sbjct: 809 ISDGALILAESDIDIIRKKHEMKGLSMD-TNTDLRWRLLAGRKASEDGDL-LLSAAVPII 866
Query: 717 RECFDPII-AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGR 771
+ FDPII + GRDLIP MV GR + GQ++ GMY +LT+ + VVSA LLR+ G
Sbjct: 867 HQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGG 926
Query: 772 EVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 831
EVAELPLVAT ++ QG G FQALFSCIER+L SL +++ +LPAA++AE IW KFGF K+
Sbjct: 927 EVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKI 986
Query: 832 SRERLLKYQRDFQLTIFKGTSMLEKKV 858
+E+ Y LTIF GTS L K +
Sbjct: 987 PQEQSDAYLNGAHLTIFHGTSNLYKAI 1013
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 228/464 (49%), Positives = 299/464 (64%), Gaps = 22/464 (4%)
Query: 413 DNDLHRLLFLPNGLPDGERLTYIVKGQ------RLRFGCKQGNGIVCDCCNKEISPSQFE 466
DN LH+L+F NGLPDG L Y GQ +L G K+G+GIVC CCN EISPSQFE
Sbjct: 261 DNRLHKLVFEENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFE 320
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGC 526
HAG A+R++PY +IYTSNG++LH+++ISL+ ++ + +DD+C VC DG NLLLC+GC
Sbjct: 321 VHAGWASRKKPYAYIYTSNGVSLHELSISLSKDRKYSANDNDDLCVVCWDGGNLLLCDGC 380
Query: 527 PLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG-------GLEAPGAEVGGCV 579
P AFH C IP W C C+ + ++ G++ C+
Sbjct: 381 PRAFHKECASLSSIPRGDWYCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCI 440
Query: 580 -ICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 638
I + +E LCR DFS + F RT+I CDQCEKE+HVGCLR + + LKE+PK
Sbjct: 441 RIVKDIDAELSACALCRGVDFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKG 500
Query: 639 KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND--VQWQMLK 696
W CC+DC RIH+ L++ + A+ +P S L+ I +K EKG+ +ND V+W++L
Sbjct: 501 NWLCCNDCTRIHSTLENVLVRGAERLPKSLLAVIKKKQEEKGL---DPINDINVRWRLLS 557
Query: 697 KAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILT 755
+ E + LL A +IF ECFDPI+ A GRDLI MVYG+++ GQEFGGMY +L
Sbjct: 558 GKKASPE-TRPLLLEAVSIFHECFDPIVDAVSGRDLIRAMVYGKSVRGQEFGGMYCALLI 616
Query: 756 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 815
V S VVSAG+LRIFG ++AELPLVAT GKG FQALFSCIERLL + V+NLVLPAA
Sbjct: 617 VNSSVVSAGMLRIFGTDIAELPLVATSNSQHGKGYFQALFSCIERLLAFMKVKNLVLPAA 676
Query: 816 EKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV 858
E+A+SIWT KFGF K+ + L Y+R+ Q F+GT+ML K V
Sbjct: 677 EEAQSIWTDKFGFSKIKPDELANYRRNCNQFVTFQGTNMLHKMV 720
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 43/83 (51%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 299
L G++ G LC C LC +V+S +FE HA + R +I ENGK + +++ +
Sbjct: 35 LRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRG 94
Query: 300 APLGILEEVVKKVAGSSFNEGSF 322
APL LE ++ + S E F
Sbjct: 95 APLHDLEATIQNIVCSPPEEKYF 117
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 16/107 (14%)
Query: 199 EIKMSKKVVPNEYPT----NVKKLL--STGILDGACVKY-----ISTSRERQLDGIVNGG 247
E K++KK+VP +PT + KL+ G+ DG+ + Y + + R++ L+G G
Sbjct: 244 EWKITKKLVPYSFPTCGDNRLHKLVFEENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGS 303
Query: 248 GYLCGCPLCNFSKVVSAHEFEQHAGAKTR-HPNNHIYLENGKPIYSI 293
G +C C CN +S +FE HAG +R P +IY NG ++ +
Sbjct: 304 GIVCRC--CNTE--ISPSQFEVHAGWASRKKPYAYIYTSNGVSLHEL 346
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 431 bits (1107), Expect = e-117, Method: Compositional matrix adjust.
Identities = 270/702 (38%), Positives = 377/702 (53%), Gaps = 91/702 (12%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-LDGIVNGGGYLCGCPLC 256
ME+KMSKK+ P N+K LL+TG+L+G VKYI +R L G++ G LC C C
Sbjct: 144 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKRAVLRGVIKRVGILCSCSSC 203
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSS 316
VVS + FE HAG+ +HP+++I+LENG ++ I++ A L +L+ ++ G +
Sbjct: 204 KGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGPA 263
Query: 317 FNEGSFR--VWKASHH-LRKGLVEHDEKHNMKLPSLPHSIISCSSLALEE-SISPTSCSF 372
+ +FR K+S LR G K + S S S +S + +PTS +
Sbjct: 264 PKKRTFRCQTCKSSFATLRTG------KFALLCDSCLESKGSQNSTRTSKIGRNPTSSA- 316
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERL 432
+ ++ + RG K F K T+ G ++D LH+L F+ LP+G +
Sbjct: 317 --RRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKGLHKLAFMSGVLPEGTDV 374
Query: 433 TYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
Y V G+RL G + GI C CCN +SPSQFEAHAG AARR+PY +IY SNG++LH++
Sbjct: 375 GYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARRKPYHNIYMSNGVSLHEL 434
Query: 493 AISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-- 550
++SL+ G+ + SDD+C +C DG LLLC+ CP AFH C+ IP W C C
Sbjct: 435 SVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCEN 494
Query: 551 RQGHSSSMSRS--------VDLKGGLE-----------APGAEVGGCVICRLSPSENFDI 591
RQ SS++ + +D +E P GGC +C +S +
Sbjct: 495 RQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCSMSGFMDKQS 554
Query: 592 RLCRS--------------HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 637
L RS HDFS F RTV+ CDQ +P+
Sbjct: 555 VLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA-------------------LPE 595
Query: 638 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKK 697
W+C DC RI L+D +S A+ I + + I RK+ +K + DG + DV+W++LK
Sbjct: 596 GAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGDL-DVRWRVLKD 654
Query: 698 AQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVYGRNISGQEFGGMYSVILTV 756
+ K +LS A AIF E FDPII GRDLIP MVYG
Sbjct: 655 KSSADSKL--VLSKAVAIFHESFDPIIQIATGRDLIPAMVYG------------------ 694
Query: 757 KSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAE 816
+ VVSAGL R+ G E+AELPLVAT R+ QG G FQALF CIERLL SL V++ VLPAA+
Sbjct: 695 -NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAAD 753
Query: 817 KAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+AESIWT++FGF K++++ L +Y + + T+F+GTS L K V
Sbjct: 754 EAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLV 795
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 216/466 (46%), Positives = 298/466 (63%), Gaps = 12/466 (2%)
Query: 402 KRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEIS 461
K T+ G ++D+ LH+L+FL LP+G + Y V G+RL G + GI C CCN +S
Sbjct: 274 KSTSAGRLTRKDHGLHKLVFLSGILPEGTDVGYYVGGKRLLDGYIKEPGIHCHCCNTVVS 333
Query: 462 PSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLL 521
PSQFE HAG AARR+PY +IY SNG++LH++++SL+ G++ + SDD+C +C DG LL
Sbjct: 334 PSQFEGHAGRAARRKPYHNIYMSNGVSLHELSVSLSRGRKTSDRQSDDLCSICSDGGELL 393
Query: 522 LCNGCPLAFHAACLDPLLIPESGWRCPNC--RQGHSSSMSRSVDLKGGLEAPGAEVGGCV 579
LC+ CP AFH C+D +P+ W C C RQ SS++ + + G + +
Sbjct: 394 LCDTCPRAFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNAIAAGRIDGIDSMEQI 453
Query: 580 ICR-----LSPSENFD-IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 633
R +P F LC+ HDF F RTV+ CDQC +E+HVGCL+++ + DL
Sbjct: 454 FTRSIRIATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGREYHVGCLKEHSMADLT 513
Query: 634 EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQ 693
+P+ W+C DC RI ++D +S A+ +PA I +K +KG+ DG + DV+W+
Sbjct: 514 ALPEGAWYCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKREDKGLNEDGDL-DVRWR 572
Query: 694 MLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSV 752
+L+ E K +LS A AIF E FDPII GRDLIP MVYGR++ Q++ GMY
Sbjct: 573 VLRDKS--SEDSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVYGRSVRDQDYTGMYCA 630
Query: 753 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 812
+LTV + VVSAGL RI GRE AELPLVAT R+ QG G FQALF CIERLL SL V+ VL
Sbjct: 631 VLTVGNTVVSAGLFRIMGREAAELPLVATSRDNQGFGYFQALFGCIERLLASLKVKYFVL 690
Query: 813 PAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
PAA++A SIWT++FGF K+SR+ LL++ + + T+F+GTS L K +
Sbjct: 691 PAADEAVSIWTQRFGFSKISRDELLEHLKGARTTVFQGTSTLHKLI 736
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 7/80 (8%)
Query: 216 KKLLSTGIL-DGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-A 273
K + +GIL +G V Y + R LDG + G C C CN VVS +FE HAG A
Sbjct: 290 KLVFLSGILPEGTDVGYYVGGK-RLLDGYIKEPGIHCHC--CN--TVVSPSQFEGHAGRA 344
Query: 274 KTRHPNNHIYLENGKPIYSI 293
R P ++IY+ NG ++ +
Sbjct: 345 ARRKPYHNIYMSNGVSLHEL 364
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 213/459 (46%), Positives = 295/459 (64%), Gaps = 12/459 (2%)
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
++D LH+L+F GLP+G L Y +GQ+L G K G GI C CC E+SPS FEAHAG
Sbjct: 518 RKDQGLHKLVFERGGLPEGTELGYYARGQKLLGGYKMGAGIYCYCCKSEVSPSLFEAHAG 577
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
A+RR+PY +IYTSNG++LH+ A + + G++ + ++D+C +C DG NLLLC+ CP AF
Sbjct: 578 WASRRKPYFYIYTSNGVSLHEWATTFSQGRKYSANDNNDLCVICADGGNLLLCDSCPRAF 637
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSM-------SRSVDLKGGLEAPGAEVGGCV-ICR 582
H C+ IP W C C +S + S +V G++ G C+ + +
Sbjct: 638 HIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQSAGRCIRVVK 697
Query: 583 LSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
+E LC DF + F RT+I CDQCEKE+H+GCL + DLKE+PK WFC
Sbjct: 698 NMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFC 757
Query: 643 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFE 702
DC RI++ LQ + A+T+ SSL I RK E+ ++ + D++W+++ +
Sbjct: 758 SMDCTRINSTLQKLLLGGAETLSDSSLGIIQRKQ-ERTDVYSISDLDIRWRLI-SGKVTS 815
Query: 703 EKEKSLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVV 761
+ + LLS A AIF +CFDPI+ GR+LIP MVYG+ + GQ++GG+ +LTV + VV
Sbjct: 816 PESRMLLSQALAIFHDCFDPIVDPLSGRNLIPRMVYGKTMQGQDYGGICCAVLTVNATVV 875
Query: 762 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 821
SAGLLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNVE++V+PAAE+AE +
Sbjct: 876 SAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPL 935
Query: 822 WTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKVQ 859
W KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 936 WMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVH 974
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 59/108 (54%), Gaps = 2/108 (1%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFSKV 261
K ++ P V++L TGILDG V Y+ T + + L GI+ GG LC C C+++ V
Sbjct: 253 KSILIRRRPETVRELFETGILDGLSVVYMGTVKSQAFGLRGIIKDGGILCSCSSCDWAHV 312
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVV 309
+S +FE HA + R + +I ENGK + ++ + PL LE +
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATI 360
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 251/679 (36%), Positives = 353/679 (51%), Gaps = 69/679 (10%)
Query: 209 NEYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEF 267
++P NVK+LL TG+L+G V YI S++ + G++ G C C CN S+ +S + F
Sbjct: 545 TKHPGNVKELLQTGLLEGMPVMYIIPNSKKAVVKGVITGCNIRCFCIKCNGSRALSTYFF 604
Query: 268 EQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKA 327
E HAG+ +HP HIYL NG + +++ + L LEE +
Sbjct: 605 ELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPMVIRSRPNCLNC 664
Query: 328 SHHLRKGLVEHDEKH---NMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
HL EH H + K P P S +
Sbjct: 665 GGHLPSSETEHFLCHCCLDSKQPQDPPS----------------------PSYSCSKSDS 702
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG 444
M K + +K + G +D LH+L+F L DG + Y V GQR G
Sbjct: 703 SMTPSFKSSSVKISSIKKAGSSGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDG 760
Query: 445 CKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT 504
+ I C+ C++ +SPS FEAHAG +RR+PY +I+TSNG++LH++++ ++ +
Sbjct: 761 YIKDQRIYCNHCSRVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELSMKISKDMELSE 820
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------------- 550
+DD+C CG G ++ C CP +FH AC+ P S W C NC
Sbjct: 821 RETDDLCRECGLGGDIFPCKMCPRSFHPACVRLSEFP-SEWFCDNCSNLVQKEKALAANK 879
Query: 551 ------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATF 604
RQ S+ + + + ++GGC LC+ DF+ A F
Sbjct: 880 NAKAAGRQAGVDSIEQIMKRAIRIVPICDDLGGCA-------------LCKKKDFNNAVF 926
Query: 605 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 664
D+RTVI CDQCEKE+HVGCLR DLKE+P +WFCC C+ I + L +S+ AQ +
Sbjct: 927 DERTVILCDQCEKEYHVGCLRTQWQVDLKELPDGEWFCCSSCSEIRSCLDKMISDGAQPL 986
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
S L I +KH +G+ D + D++WQ+L E+ LLSSA I + FDPII
Sbjct: 987 SGSDLEIIRKKHESRGLSMDTDI-DIRWQLLAGRSATEDGSL-LLSSAVPIIHQSFDPII 1044
Query: 725 -AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
A GRDLIP MV GR + GQ++ GMY ++T+ S VVSA LLRI G +VAELPLV
Sbjct: 1045 EANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAVITLGSTVVSAALLRIMGGDVAELPLV 1104
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 839
AT + QG G FQ LFSC+ER+L SL +++ +LPAA++AE+IW KKFGF ++ +E+L Y
Sbjct: 1105 ATSMDLQGLGYFQVLFSCMERMLISLKIKHFMLPAAQEAEAIWMKKFGFSRIPQEQLEAY 1164
Query: 840 QRDFQLTIFKGTSMLEKKV 858
LT+F GTS L K V
Sbjct: 1165 LNGAHLTVFHGTSNLYKAV 1183
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 257/673 (38%), Positives = 364/673 (54%), Gaps = 81/673 (12%)
Query: 211 YPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKVVSAH 265
+PT +K+LL TGIL+ V+YI R ++ L G++ G G LC C C + VV+ +
Sbjct: 212 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 271
Query: 266 EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW 325
FE HAG+ + P +IYLENG + S++ A L L+E ++ GSS + +F
Sbjct: 272 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTFCF- 330
Query: 326 KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEESISPTSCSFVQDTGREQTHMK 384
+ K ++ S++ C S + L+ES S Q TG+++ +
Sbjct: 331 -------------NCKGSISEVGTSDSLVLCESCVGLKES----HASPAQPTGQQKQRLC 373
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQ----- 439
HQ+ GG D LH+L F N LP+G ++Y V+G+
Sbjct: 374 PSA-----------AHQEVRIMGG----DVGLHKLAFGENDLPEGSEVSYYVRGEVGTMR 418
Query: 440 --RLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLA 497
RL G K+G I+CDCCN E+SPSQFEAH+G A+RR+PY HIYTSNG++LH++++SL
Sbjct: 419 SKRLLSGHKKGCRILCDCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLL 478
Query: 498 MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC------- 550
G+ + +D++C +C DG LL C+GCP FH C+ IP+ W C C
Sbjct: 479 RGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKG 538
Query: 551 ----RQGHSSSMSRSVDLKGGLEAPGAEVGGCV-ICRLSPSENFDIRLCRSHDFSAATFD 605
R ++ + R GG++ C+ I + E LCR H+FS + F
Sbjct: 539 KFVERNANAVAAGR----MGGVDPIEQIRKRCIRIVKSQTDEAGGCALCRRHEFSTSGFG 594
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP 665
TV+ CDQCEKEFHVGCL+ + + DLK +PK KWFCC DC I+++L+ V R + +P
Sbjct: 595 PHTVMICDQCEKEFHVGCLKAHNIDDLKAVPKGKWFCCRDCKDINSSLRKIVVRREEELP 654
Query: 666 ASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA 725
L I +++ KG + G D++W++L + SLLS A ++F E F+PI
Sbjct: 655 DDVLRIIKKRYGRKGSVCSGN-PDIKWRLLHGRXASATEAGSLLSQALSLFHEQFNPIAD 713
Query: 726 ECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
GRDL+ MV+ + EFGGMY ILTV VVSA R+ G+EVAELPLVAT +
Sbjct: 714 AEGRDLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDC 773
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
QG+ V +LVLPAAE AES+W KF F KM +E L RDFQ+
Sbjct: 774 QGQ------------------VNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQM 815
Query: 846 TIFKGTSMLEKKV 858
F+GTSML+K V
Sbjct: 816 MTFQGTSMLQKPV 828
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 210/458 (45%), Positives = 292/458 (63%), Gaps = 12/458 (2%)
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
++D LH+L+F GLP+G L Y +GQ+L G K G GI C CC E+SPS FEAHAG
Sbjct: 518 RKDQGLHKLVFDRGGLPEGTELGYYARGQKLLGGYKMGAGIYCYCCKCEVSPSLFEAHAG 577
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
A+RR+PY +IYTSNG++LH+ A + + G++ + ++D+C +C DG NLLLC+ CP AF
Sbjct: 578 WASRRKPYFYIYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAF 637
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSM-------SRSVDLKGGLEAPGAEVGGCV-ICR 582
H C+ IP W C C +S + S +V G++ G C+ + +
Sbjct: 638 HIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVK 697
Query: 583 LSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
+E LC DF + F RT+I CDQCEKE+H+GCL + DLKE+PK WFC
Sbjct: 698 NMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFC 757
Query: 643 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFE 702
DC RI++ LQ + A+ + SSL I K E+ ++ + D++W+++ +
Sbjct: 758 SMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQ-ERNDVYSISDLDIRWRLI-SGKVTS 815
Query: 703 EKEKSLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVV 761
+ + LLS A AIF +CFDPI+ G +LIP MVYG+ + GQ++GG+ +LTV + VV
Sbjct: 816 PESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVV 875
Query: 762 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 821
SAGLLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNVE++V+PAAE+AE +
Sbjct: 876 SAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPL 935
Query: 822 WTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
W KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 936 WMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPV 973
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 2/121 (1%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFSKV 261
K ++ P V+ L TG+LDG V Y+ T + + L GI+ GG LC C C+++ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
+S +FE HA + R + +I ENGK + ++ + PL LE + + E
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 322 F 322
F
Sbjct: 373 F 373
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 241/632 (38%), Positives = 350/632 (55%), Gaps = 52/632 (8%)
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW- 325
FE HA + + P +I LE+G + ++ K PL LEE ++ V G + S +
Sbjct: 2 FELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSSLCLSC 61
Query: 326 ---------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSC------ 370
S + K +E E PS + ++ SS S+ P S
Sbjct: 62 QGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSS---RPSVDPKSILRRSKS 118
Query: 371 SFVQDTGREQTHMKE---------MLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLF 421
S Q REQ K +L E K + H K T ++D LH+L+F
Sbjct: 119 SPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLT------RKDLRLHKLVF 172
Query: 422 LPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHI 481
+ LPDG + Y V G+++ G K+G GI C CCNK +SPS FEAHAG A+RR+P++HI
Sbjct: 173 EDDILPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHI 232
Query: 482 YTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP 541
YT+NG++LH+++++L+M QR + +DD+C +C DG L+ C+ CP ++H C +P
Sbjct: 233 YTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLP 292
Query: 542 ESGWRCPNC-----RQGHSSSMSRSVDLK--GGLEAPGAEVGGCVICRLSPSENFDIR-- 592
W C C R+ S ++ G++A C+ R+ S ++
Sbjct: 293 SERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCI--RIVSSFVTELPSV 350
Query: 593 --LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIH 650
LCR H F F+ RTVI CDQCEKEFHVGCL++ + DLKE+P++KWFC C I+
Sbjct: 351 CVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEIN 410
Query: 651 AALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQCFEEKEKSL 708
L + + + + + L+ + +K E+ T D++W++L + K L
Sbjct: 411 TTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKIL 470
Query: 709 LSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLL 766
L+ A +I E FDPI +E G DLIP MVYGR Q+F GMY +L V V+VS G+
Sbjct: 471 LAKALSILHERFDPI-SESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIF 529
Query: 767 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 826
R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A+SIWT KF
Sbjct: 530 RVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKF 589
Query: 827 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
GF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 590 GFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 621
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 225 DGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTRHPNNHIY 283
DG V Y + E+ L G G G C C CN KVVS FE HAG A R P HIY
Sbjct: 179 DGTEVGYF-VAGEKMLVGYKKGFGIHCSC--CN--KVVSPSTFEAHAGCASRRKPFQHIY 233
Query: 284 LENGKPIYSI 293
NG ++ +
Sbjct: 234 TTNGVSLHEL 243
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 262/705 (37%), Positives = 368/705 (52%), Gaps = 111/705 (15%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-LDGIVNGGGYLCGCPLC 256
ME+KMSKK+ P N+K LL+TG+L+G VKYI +R L G++ G LC C C
Sbjct: 143 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKRAVLRGVIKRVGILCSCSSC 202
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSS 316
VVS + FE HAG+ +HP+++I+LENG ++ I++ A L +L+ ++ G +
Sbjct: 203 KGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGPA 262
Query: 317 FNEGSFR--VWKASHH-LRKGLVEHDEKHNMKLPSLPHSIISCSSLALEE-SISPTSCSF 372
+ +FR K+S LR G K + S S S +S + +PTS +
Sbjct: 263 PKKRTFRCQTCKSSFATLRTG------KFALLCDSCLESKGSQNSTRTSKIGRNPTSSA- 315
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERL 432
+ ++ + RG K F K T+ G ++D LH+L F+ LP+G +
Sbjct: 316 --RRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKGLHKLAFMSGVLPEGTDV 373
Query: 433 TYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
Y V G+ +SPSQFEAHAG AARR+PY +IY SNG++LH++
Sbjct: 374 GYYVGGK--------------------VSPSQFEAHAGRAARRKPYHNIYMSNGVSLHEL 413
Query: 493 AISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-- 550
++SL+ G+ + SDD+C +C DG LLLC+ CP AFH C+ IP W C C
Sbjct: 414 SVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCEN 473
Query: 551 RQGHSSSMSRS--------VDLKGGLE-----------APGAEVGGCVICRLSPSENFDI 591
RQ SS++ + +D +E P GGC +C +S +
Sbjct: 474 RQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCSMSGFMDKQS 533
Query: 592 RLCRS--------------HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 637
L RS HDFS F RTV+ CDQ +P+
Sbjct: 534 VLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA-------------------LPE 574
Query: 638 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKK 697
W+C DC RI L+D +S A+ I + + I RK+ +K + DG + DV+W++LK
Sbjct: 575 GAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGDL-DVRWRVLKD 633
Query: 698 AQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVYGRNISGQEFGGMYSVILTV 756
+ K +LS A AIF E FDPII GRDLIP MVYG
Sbjct: 634 KSSADSKL--VLSKAVAIFHESFDPIIQIATGRDLIPAMVYG------------------ 673
Query: 757 KSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAE 816
+ VVSAGL R+ G E+AELPLVAT R+ QG G FQALF CIERLL SL V++ VLPAA+
Sbjct: 674 -NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAAD 732
Query: 817 KAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 861
+AESIWT++FGF K++++ L +Y + + T+F+GTS L K V L
Sbjct: 733 EAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLVPKL 777
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 245/693 (35%), Positives = 369/693 (53%), Gaps = 79/693 (11%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNF 258
KK + N +P +K++ + GIL+G V Y+ ++ R+ L G++ G G LC C C
Sbjct: 360 KKFLRN-FPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKG 418
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS +E HA + + P +I LE+G + ++ K P LEE ++ V G +
Sbjct: 419 IQVVSTAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVGPNLK 478
Query: 319 EGSFRVW----------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPT 368
+ S + S + K +E E PS + ++ SS S+ P
Sbjct: 479 KSSLCLNCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSS---RPSVDPK 535
Query: 369 SCSFVQDTGREQTHMKEMLEER--KRGV-------KRPFMHQKRTAEGGTKKRDNDLHRL 419
S + Q++ +E + + GV + K ++G ++D LH+L
Sbjct: 536 SILSRSKSSPRQSNRQEQPTRKSTEPGVVPGTILSESKSSSIKSNSQGKLTRKDVRLHKL 595
Query: 420 LFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYR 479
+F + LPDG + Y V G E+SPS FEAHAG A+RR+P++
Sbjct: 596 VFEDDILPDGTEVGYFVAG--------------------EVSPSSFEAHAGCASRRKPFQ 635
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL 539
HIYT+NG++LH+++++L+M QR + +DD+C +C DG C
Sbjct: 636 HIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDG---------------VCASLSS 680
Query: 540 IPESGWRCPNC-----RQGHSSSMSRSVDLK--GGLEAPGAEVGGCVICRLSPSENFDIR 592
+P W C C R+ S ++ G++A C+ R+ S ++
Sbjct: 681 LPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCI--RVVSSFGTELP 738
Query: 593 ----LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNR 648
LCR H F F+ RTVI CDQCEKEFHVGCL+++ + DLKE+P++KWFC DC +
Sbjct: 739 SVCVLCRGHSFCRLGFNSRTVIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDCEK 798
Query: 649 IHAALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQCFEEKEK 706
I+ L + + + + + L+ I K E+ D T D++W++L ++ K
Sbjct: 799 INTTLGNLIIRGEEKLTNNILNFIRTKEKPNEESCPDDNTTPDIRWRVLSGKLTSSDETK 858
Query: 707 SLLSSATAIFRECFDPIIAECGR-DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGL 765
LL+ A +I E FDPI R DLIP MVYGR GQ+F GMY +L V V+VS G+
Sbjct: 859 ILLAKAVSILHERFDPISETGTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSVGI 918
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A+SIWT K
Sbjct: 919 FRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDK 978
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
FGF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 979 FGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 1011
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 206/461 (44%), Positives = 281/461 (60%), Gaps = 35/461 (7%)
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGM 471
RD +H+++F+ L +G + Y V G+ +SPSQFEAHAG
Sbjct: 443 RDKGMHKVVFMSGVLQEGTDVGYYVGGK--------------------VSPSQFEAHAGR 482
Query: 472 AARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFH 531
AARR+PY +IY SNG++LH+++ISL GQ+ + SDD+C +C DG LLLC+ CP AFH
Sbjct: 483 AARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDTCPRAFH 542
Query: 532 AACLDPLLIPESGWRCPNC--RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICR-----LS 584
C+ P+ W C C RQ S ++ + + G + + R +
Sbjct: 543 RECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATT 602
Query: 585 PSENFD-IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCC 643
P F LC+ HDFS F RTV+ CDQC +E+HVGCL+++ + DL +P+ W+C
Sbjct: 603 PETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCS 662
Query: 644 DDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEE 703
DC RI+ LQD +++ + +P L I +K KG D + DV+W++LK +
Sbjct: 663 TDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADL-DVRWRVLKDKS--SD 719
Query: 704 KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVS 762
K +LS A AIF E FDPII GRDLIP MVYGR+ Q++ GMY +LTV + VVS
Sbjct: 720 DSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVNNTVVS 779
Query: 763 AGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIW 822
AGL RI G E+AELPLVAT R+ QG G FQALFSCIERLL SL V++ VLPAAE+AESIW
Sbjct: 780 AGLFRIMGNEIAELPLVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAEEAESIW 839
Query: 823 TKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK---KVQC 860
T++FGF K+S++ L +Y + + T+F+GTS L K KV+C
Sbjct: 840 TERFGFTKISQDELREYLKGGRTTVFQGTSNLHKLVAKVEC 880
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 41/64 (64%)
Query: 260 KVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNE 319
+VVS + FE HAG+ +HP+++I+LENG ++ +++ L +LE V+K G + +
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 320 GSFR 323
+FR
Sbjct: 406 RTFR 409
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 54/236 (22%), Positives = 83/236 (35%), Gaps = 71/236 (30%)
Query: 460 ISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA-----ISLAM---GQRRTTGGSDDMC 511
+SP FE HAG + ++ P +I+ NG LHD+ ++L M R+ G +
Sbjct: 348 VSPYYFEVHAG-STKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQ-- 404
Query: 512 HVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS-------SMSRSVDL 564
+ C GC +F C + C +C + + M + V +
Sbjct: 405 ------KRTFRCKGCKSSFSTLCSGKFAL-----FCDSCLESKGAKNNSRDKGMHKVVFM 453
Query: 565 KGGLEAPGAEVGGCVICRLSPSE--------------------------NFDIRLCRSHD 598
G L+ G +VG V ++SPS+ I L +
Sbjct: 454 SGVLQE-GTDVGYYVGGKVSPSQFEAHAGRAARRKPYHNIYMSNGVSLHELSISLLKGQK 512
Query: 599 FSAATFDDR--------TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDC 646
S DD ++ CD C + FH C+ L PK W CC C
Sbjct: 513 MSNRQSDDLCSICSDGGQLLLCDTCPRAFHRECV------SLSSAPKGTW-CCRYC 561
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 240/695 (34%), Positives = 360/695 (51%), Gaps = 63/695 (9%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFSKV 261
K ++ P V+ L TG+LDG V Y+ T + + L GI+ GG LC C C+++ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
+S +FE HA + R + +I ENGK + ++ + PL LE + + E
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 322 FRVWK------ASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISP----TSCS 371
F + SH R E + + SL +S + AL +++ ++ +
Sbjct: 373 FTCKRCKEGFCVSHARRPS--ESTSISPVFMSSLGNSTRKITRKALRQALVGKAYLSAST 430
Query: 372 FVQDTGREQTHMKEMLEERKRGVKRPFMHQK--RTAEGGTKKRDNDLHRLLFL------- 422
V + ++ K+ML +H AE + L L+ L
Sbjct: 431 NVSSQKKCRSKFKKMLVF--------LLHNDVLMLAEPTLMIKSLILLHLVSLYVLKVDP 482
Query: 423 ----PNGLPDGERLTYIV-------KGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGM 471
P G + L +V +G L + + I D E + E +
Sbjct: 483 TFCDPQGFEKDQGLHKLVFDRGGLPEGTELGYYARGQTYITVDRNCSEATKWALEYIVTV 542
Query: 472 AARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFH 531
A+ Y IYTSNG++LH+ A + + G++ + ++D+C +C DG NLLLC+ CP AFH
Sbjct: 543 ASASYFY--IYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFH 600
Query: 532 AACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLK----GGLEA--PGAEVGGCVICRLSP 585
C+ IP W C C +S ++ ++ G LE P ++ G I +
Sbjct: 601 IECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKN 660
Query: 586 SENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 645
E + + + F RT+I CDQCEKE+H+GCL + DLKE+PK WFC D
Sbjct: 661 ME---------AETNGSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMD 711
Query: 646 CNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE 705
C RI++ LQ + A+ + SSL I K E+ ++ + D++W+++ + +
Sbjct: 712 CTRINSTLQKLLLGGAEKLSDSSLGIIQTKQ-ERNDVYSISDLDIRWRLI-SGKVTSPES 769
Query: 706 KSLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAG 764
+ LLS A AIF +CFDPI+ G +LIP MVYG+ + GQ++GG+ +LTV + VVSAG
Sbjct: 770 RMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAG 829
Query: 765 LLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTK 824
LLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNVE++V+PAAE+AE +W
Sbjct: 830 LLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMN 889
Query: 825 KFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 890 KFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPV 924
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 366 bits (939), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 243/695 (34%), Positives = 352/695 (50%), Gaps = 112/695 (16%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
KK + N +P +K + GIL+G V Y+ + KVVS
Sbjct: 371 KKFLRN-FPAKLKDIFDCGILEGLIVYYVRGA------------------------KVVS 405
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFR 323
FE HA + + P +I LE+G + ++ K PL LEE ++ V G + S
Sbjct: 406 PAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSSLC 465
Query: 324 VW----------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSC--- 370
+ S + K +E E PS + ++ SS S+ P S
Sbjct: 466 LSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSS---RPSVDPKSILRR 522
Query: 371 ---SFVQDTGREQTHMKE---------MLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHR 418
S Q REQ K +L E K + H K T ++D LH+
Sbjct: 523 SKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLT------RKDLRLHK 576
Query: 419 LLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPY 478
L+F + LPDG + Y V G E+SPS FEAHAG A+RR+P+
Sbjct: 577 LVFEDDILPDGTEVGYFVAG--------------------EVSPSTFEAHAGCASRRKPF 616
Query: 479 RHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL 538
+HIYT+NG++LH+++++L+M QR + +DD+C +C DG +C P
Sbjct: 617 QHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDG----VCASLPS---------- 662
Query: 539 LIPESGWRCPNC-----RQGHSSSMSRSVDLK--GGLEAPGAEVGGCVICRLSPSENFDI 591
+P W C C R+ S ++ G++A C+ R+ S ++
Sbjct: 663 -LPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCI--RIVSSFVTEL 719
Query: 592 R----LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
LCR H F F+ RTVI CDQCEKEFHVGCL++ + DLKE+P++KWFC C
Sbjct: 720 PSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCE 779
Query: 648 RIHAALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQCFEEKE 705
I+ L + + + + + L+ + +K E+ T D++W++L +
Sbjct: 780 EINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDT 839
Query: 706 KSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 763
K LL+ A +I E FDPI +E G DLIP MVYGR Q+F GMY +L V V+VS
Sbjct: 840 KILLAKALSILHERFDPI-SESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSV 898
Query: 764 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 823
G+ R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A+SIWT
Sbjct: 899 GIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWT 958
Query: 824 KKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
KFGF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 959 DKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 993
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 332 bits (851), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 177/415 (42%), Positives = 258/415 (62%), Gaps = 19/415 (4%)
Query: 444 GCKQGNGIVCDC--CN-KEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
G G I C C CN ++SPS FEAHAG RR+PY +I+TSNG++LH++++ ++
Sbjct: 331 GVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDM 390
Query: 501 RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--GHSSSM 558
+ + +DD+C CG G ++ C CP +FH AC+ +P S W C NC ++
Sbjct: 391 QLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEKAL 449
Query: 559 SRSVDLKGGLEAPGAEVGGCVI---CRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
+ + + K G + ++ R+ P N I L + DF+ + FD+RTVI CDQC
Sbjct: 450 AENKNAKAAGRQAGVDSIEQIMKRAIRIVP--NLWIELGQK-DFNNSVFDERTVILCDQC 506
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 675
EKE+HVGCL+ DLKE+P+ +WFCC+ C+ I ++L +S+ A + S + I +K
Sbjct: 507 EKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKK 566
Query: 676 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPV 734
H KG+ D T D++W++L + E+ + LLS+A I + FDPII + GRDLIP
Sbjct: 567 HEMKGLSMD-TNTDLRWRLLAGRKASEDGDL-LLSAAVPIIHQSFDPIIEVQSGRDLIPE 624
Query: 735 MVYGRN----ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
MV GR + GQ++ GMY +LT+ + VVSA LLR+ G EVAELPLVAT ++ QG G
Sbjct: 625 MVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGY 684
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
FQALFSCIER+L SL +++ +LPAA++AE IW KFGF K+ +E+LL+ D +
Sbjct: 685 FQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQLLRKNVDLNV 739
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 166/402 (41%), Positives = 234/402 (58%), Gaps = 41/402 (10%)
Query: 487 MTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
++LH++++S++ + + +DD+C CG G ++ C CP +FH AC+ ++P W
Sbjct: 2 VSLHELSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWF 61
Query: 547 CPNC--------------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPS 586
C NC RQ S+ + + + ++GGC
Sbjct: 62 CDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICDDLGGCA------- 114
Query: 587 ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDC 646
LC+ DF+ A FD+RTVI CDQCEKE+HVGCLR DLKE+P+ +WFCCD C
Sbjct: 115 ------LCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSC 168
Query: 647 NRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEK 706
+ I ++L +S A + S + I +KH KG++ D +++WQ++ E+
Sbjct: 169 SEIRSSLDKMISGGAHPLSESDVDIIRKKHESKGLVMDAN-TEIRWQLVAGRSATEDG-N 226
Query: 707 SLLSSATAIFRECFDPII-AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVV 761
SLLSSA + + FDPII A GRDLIP MV+GR + GQ++ GMY +LTV S VV
Sbjct: 227 SLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVV 286
Query: 762 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 821
SA LLR+ G +VAELPLVAT + QG G FQ LFSCIERLL SL V++ +LPAA +AE+I
Sbjct: 287 SAALLRVMGGDVAELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAI 346
Query: 822 WTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
W KKFGF K+ ++++ Y LT+F GT L K + LPE
Sbjct: 347 WMKKFGFSKIPQDQMEAYLNGGHLTVFHGTLNLYKAIP-LPE 387
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 189/540 (35%), Positives = 288/540 (53%), Gaps = 54/540 (10%)
Query: 200 IKM-SKKVVPNE---YPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYL 250
+KM S +VP+ +PT +K+LL TGIL+ V+YI R ++ L G++ G G L
Sbjct: 259 VKMGSSALVPSTLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGIL 318
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
C C C + VV+ + FE HAG+ + P +IYLENG + S++ A L L+E ++
Sbjct: 319 CYCDTCKGTNVVTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIR 378
Query: 311 KVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEES-ISPT 368
GSS + +F + K ++ S++ C S + L+ES SP
Sbjct: 379 VAIGSSIKKSTFCF--------------NCKGSISEVGTSDSLVLCESCVGLKESHASPA 424
Query: 369 SCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPD 428
+ D + T + + + R K+D LH+L F N LP+
Sbjct: 425 QPTGTSDRSTKTTTVSKCSSSGSKNYGR------------VTKKDVGLHKLAFGENDLPE 472
Query: 429 GERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT 488
G ++Y V+G+RL G K+G I+C CCN E+SPSQFEAH+G A+RR+PY HIYTSNG++
Sbjct: 473 GSEVSYYVRGERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVS 532
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
LH++++SL G+ + +D++C +C DG LL C+GCP FH C+ IP+ W C
Sbjct: 533 LHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCK 592
Query: 549 NC-----------RQGHSSSMSRSVDLKGGLEAPGAEVGGCV-ICRLSPSENFDIRLCRS 596
C R ++ + R GG++ C+ I + E LCR
Sbjct: 593 FCLNTLQKGKFVERNANAVAAGR----MGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRR 648
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
H+FS + F TV+ CDQCEKEFHVGCL+ + + DLK +PK KWFCC DC I+++L+
Sbjct: 649 HEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKI 708
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 716
V + + +P L I +++ KG + G D++W++L + + SLLS A ++F
Sbjct: 709 VVRQEEELPDDVLRIIKKRYGRKGSVCSGN-PDIKWRLLHGRRASATEAGSLLSQALSLF 767
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 224/709 (31%), Positives = 344/709 (48%), Gaps = 80/709 (11%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRER----------------------------QLDGI 243
P N K+L++TG+L+G V+ +R L GI
Sbjct: 514 PRNAKELMATGLLEGRYVRCSCRGEQRINSEDCLLDEDEEELRVLSSSDFPDIPLPLTGI 573
Query: 244 VNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP-- 301
+ G +C C +C ++VVS FE H+G+ + HP+++IYLENGK + I+ + +
Sbjct: 574 LQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQESADC 633
Query: 302 ----LGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCS 357
LG L+ + ++ G EG+ + G E + + K P CS
Sbjct: 634 GGDILGALKHAIGEIQGIPKKEGACG--------KCGKREGGDFVSCKEPK-------CS 678
Query: 358 SLALEESI---SPTSCS-FVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGT---K 410
++ E + SP F + Q M + + + KR R E
Sbjct: 679 AVYHAECVGLPSPHRVDWFCAKCEKAQVKMPKTVLKMKRPPAVTDREDTRLKEKELTVRS 738
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
RD LH+ LFLP GL DG L Y + Q + G KQG GI C CCN+EIS S FE HAG
Sbjct: 739 ARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCKCCNQEISCSAFEQHAG 798
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLA----MGQR-RTTGGSDDMCHVCGDGENLLLCNG 525
+RR PY I ++G +L D+ LA +G R + D+ G E +L +
Sbjct: 799 CESRRNPYGSILLADGRSLKDMCKELAYQSKLGDRAHQVARTGDVKSSSGSEEQGVLASS 858
Query: 526 --CPLAFHAACLDPLLIPESGWRCPNCR------QGHSSSMSRSVDLKGGLEAPGAEVGG 577
C + E+ P C+ +G + + VD + A + G
Sbjct: 859 QRCESTWCINFGTRFSCQEADSGHPLCKICQKNVEGAHKTSKKRVDATANIPATD-DTGR 917
Query: 578 CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 637
V +P + +C+ FD T++ CDQC +E+HVGCLR++G+ D E+P+
Sbjct: 918 NVRLFQAPDSSSGCAICKKWTLKKCGFD-MTMLVCDQCGREYHVGCLRESGILD--ELPE 974
Query: 638 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI--NRKHIEKGI--LFDGTMNDVQWQ 693
+W+C +C I L V+N + + + ++ + +R+H ++GI + + + WQ
Sbjct: 975 AEWYCQPNCQHIVQVLSQLVANGPELLSDNIVNDLLESRQH-QQGIVEMAESSSPVFGWQ 1033
Query: 694 MLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSV 752
+L A ++L + A IF EC DPI A G+++IP+MVY R +F G+Y V
Sbjct: 1034 ILHGAGENPVNGRTL-AQAVEIFTECSDPIKDAPSGQNMIPIMVYSRRFKDYDFDGIYCV 1092
Query: 753 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 812
+LT+ VVS LL+IFGREVAE+PL+AT ++Q +G +AL + IERLL LNVE LVL
Sbjct: 1093 VLTLNEKVVSTALLQIFGREVAEVPLIATSVDHQDQGFCKALMTTIERLLGVLNVERLVL 1152
Query: 813 PAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 861
PA++ AE +W +FGF +M +L + L +F GT+ML K + +
Sbjct: 1153 PASKNAEFVWVNRFGFSRMEDAQLKHIRSMMGLLVFTGTTMLVKHIDAV 1201
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 183/461 (39%), Positives = 257/461 (55%), Gaps = 39/461 (8%)
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG---CKQGNGIVCDCC------------ 456
RD LH+L+F N L DG + Y V ++ G KQ +GI+CDCC
Sbjct: 86 RDQCLHKLVFQENVLEDGAAVGYFVYEEKQLQGEINIKQ-SGILCDCCKEVILEFFFCQI 144
Query: 457 ---NKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHV 513
++++SPS+FEAHAG A+RR+PY HI T++G++LH +AI+ R + SD+ C
Sbjct: 145 KKLDEQVSPSKFEAHAGWASRRKPYFHIRTTDGVSLHQLAIN----HRISISNSDEHCSK 200
Query: 514 CGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQG-----HSSSMSRSVDLKGGL 568
C NLL C+GC AFH C+ P+ W C CR + V + +
Sbjct: 201 CKQRGNLLCCDGCQRAFHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKII 260
Query: 569 EA-PGAEVGGCVICRLSPS----ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC 623
E+ P ++ IC LS E+ LC F+ F TV+ CDQCEK++HVGC
Sbjct: 261 ESDPSEQIAK--ICTLSVKHKEVEHSSCALCSERHFNNGEFSPWTVMICDQCEKDYHVGC 318
Query: 624 LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILF 683
L+ + + +LK++PK WFC DC IH L++F++ + S LS I K +KG+
Sbjct: 319 LKDHNMANLKKVPKHYWFCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLET 378
Query: 684 DGTMNDVQWQMLKKAQCFEEK-EKSLLSSATAIFRECFDPIIAECGR-DLIPVMVYGRNI 741
+ + D++W++ + + SLLS IF E FD I+ + DLIP MV GR I
Sbjct: 379 EFGL-DIKWKVFNRQLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVKGRKI 437
Query: 742 SGQE-FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIER 800
+ FGGMY +L V VVVSAG+ R+FG+EVAEL L+AT EYQ +G F+ L SCIE
Sbjct: 438 KDKYYFGGMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIATKAEYQKQGFFKCLLSCIEN 497
Query: 801 LLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 841
+L L VE LVLPAA +AES+W KFGF + ++ +Y R
Sbjct: 498 VLKELKVERLVLPAAHEAESMWIDKFGFTEPNQGLGRRYYR 538
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 144/263 (54%), Positives = 189/263 (71%), Gaps = 3/263 (1%)
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
HDFS + F DRTVI CDQCE+E+HVGCL+++ + DLKE+P+ +WFC DC RIH+ALQ
Sbjct: 2 HDFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTL 61
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 716
+ AQ +P + I +K KG G D++WQ+L + + + + LLS A AIF
Sbjct: 62 LLRGAQPLPLLDVDVIRKKCDIKGFNI-GANTDIRWQLLS-GKTADAESRLLLSKAVAIF 119
Query: 717 RECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAE 775
E FDPI+ A GRDLIP MVYGR + Q++GG+Y +LTV S VVSAG+LR+ G E+AE
Sbjct: 120 HESFDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAE 179
Query: 776 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRER 835
LPLVAT RE+QG+G FQ+LFSCIERLL +L V++ VLPAA++AESIWTKKFGF K++ +
Sbjct: 180 LPLVATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDE 239
Query: 836 LLKYQRDFQLTIFKGTSMLEKKV 858
L KY + T+F+GTS L K V
Sbjct: 240 LHKYLNGARTTVFQGTSTLHKPV 262
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 241/768 (31%), Positives = 356/768 (46%), Gaps = 97/768 (12%)
Query: 128 NSGDVSKSHV---VLEIPKHVSSSSGIRK---ITFKFSKRKEDYVAP-LAYEEGRNYTLY 180
NSG +S S V ++ V + G RK T + + + AP L EEG +
Sbjct: 195 NSGGLSDSVVSNGLVHDTDQVVKNGGSRKRKSKTIHYMNSRVPHAAPALESEEGSDGQKS 254
Query: 181 DDLGSSGANDGVLCARNMEIKMSKKVVPN-----EYPTNVKKLLSTGILDGACVKYISTS 235
+ R IK+ K +VP + P N K+L++T +++G V+ +
Sbjct: 255 PEERGDAIARPSQAPRAAAIKL-KSMVPVGDKLLKAPRNAKELMATRLMEGHHVR--CSC 311
Query: 236 RERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSII- 294
R QL G++ G C C C S +VS FE H+G+ + HP+++IYLENGK + I+
Sbjct: 312 RGIQLTGMLKDMGVQCDCRNCRGSVIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILS 371
Query: 295 --QELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHS 352
QE IL + + E S + +G + + + + + HS
Sbjct: 372 AGQEAADCGDNILRALKMAIGDVQGVEKSKSKCAKCGNPEEGDLIYCK--GARCSVVAHS 429
Query: 353 IISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPF----------MHQK 402
C +A + C + T + +K VKRP + +K
Sbjct: 430 --GCVEIA-NPHLGDWFCGKCEKTKKPHASVK---------VKRPISSGAEKEDSRVREK 477
Query: 403 RTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKE--- 459
RD LH+ LFLP GL +G L Y K Q G K+G GI C CCNKE
Sbjct: 478 DATVSARLSRDAHLHKALFLPGGLENGTELGYYTKSQLKLKGVKRGKGICCSCCNKEASS 537
Query: 460 -ISPSQFEAHAGMAARRQPYRHIYT-SNGMTLHDIAISLA----MGQRRT---TGGSDDM 510
IS +FE HAG ARR PY +I +G +L D+ L +G+++ +
Sbjct: 538 DISCFEFEQHAGCEARRNPYGNILVLVDGRSLKDVCKDLTHKNKLGEQQNCEPLARDVNC 597
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLI-PESGWRCPNCR-----------------Q 552
C+ C L CNGC A+ C + +S W C CR Q
Sbjct: 598 CYECSSSGELKTCNGCEEAWCDNCTKGEEVDSDSKWYCRMCRNDTLKVAQNGQKVSGKHQ 657
Query: 553 GHSSSMSRSVDLKG----GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATF-DDR 607
SSS++ +D +G LE EVGGC IC+ + S F D
Sbjct: 658 EESSSITE-IDERGRCIRHLEGH-REVGGCAICK-------------KWNLSKTGFVDGM 702
Query: 608 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPAS 667
T++ CDQC +E+HV CL+ +G+ +L E+P+ +WFC C I L V+ +++ S
Sbjct: 703 TILVCDQCGREYHVSCLKDSGMDNLNELPEGEWFCQKGCKVIDEILTQLVAIGPESLSHS 762
Query: 668 SLSTINRKHIEKGILFDGTMN---DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
+S + +K + + + +WQ+L + L+ A IF EC DPI
Sbjct: 763 IISELPENRQQKSGVIEKAESISPSFEWQILC-GKGSSPANIQTLAEAVNIFTECSDPIR 821
Query: 725 -AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
A+ G++LIP+MV R +F G++ V+L + VVSA LL+IFGRE AE+PLVAT
Sbjct: 822 DAKTGKNLIPLMVQSRRTKDYDFEGVFCVVLKLNGKVVSAALLQIFGREFAEVPLVATSL 881
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 831
+QG+G +AL + IERLL L+VE LVLP A+ ES+W KFGF ++
Sbjct: 882 PHQGQGFCKALMTTIERLLGVLSVERLVLPTAKDTESLWVNKFGFSRV 929
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 146/302 (48%), Positives = 194/302 (64%), Gaps = 19/302 (6%)
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+RS+ + LE GGC +C+L HDFS F RTV+ CDQC +
Sbjct: 53 FTRSIRIATTLETG---FGGCALCKL-------------HDFSKKKFSTRTVLLCDQCGR 96
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
E+HVGCL+++ + DL +P+ W+C DC RI+ LQD +++ + + A L I +K
Sbjct: 97 EYHVGCLKEHNMADLTALPEGAWYCSTDCVRINQTLQDLLNSGGEPVLAMDLDVIKKKRE 156
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
KG D DV+W++LK + K +LS A AIF E FDPII GRDLIP MV
Sbjct: 157 VKGFNDDDADLDVRWRVLKDKS--SDDSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMV 214
Query: 737 YGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 796
YGR+ Q++ GMY +LTV ++VVSAGL RI G E+AELPLVAT R+ QG G FQALFS
Sbjct: 215 YGRSARDQDYTGMYCTVLTVNNIVVSAGLFRIMGSEIAELPLVATSRDRQGLGYFQALFS 274
Query: 797 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
CIERLL SL V++ VLPAAE+AESIWT++FGF K+S++ L +Y + + T+F+GTS L K
Sbjct: 275 CIERLLSSLEVKHFVLPAAEEAESIWTERFGFAKISQDELREYLKGGRTTVFQGTSNLHK 334
Query: 857 KV 858
V
Sbjct: 335 LV 336
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 139/274 (50%), Positives = 181/274 (66%), Gaps = 7/274 (2%)
Query: 590 DIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 649
++ LCR DF+ A FD+RTVI CDQCEKE+HVGCL+ +LKE+P+ +WFCC C+
Sbjct: 9 EVSLCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSET 68
Query: 650 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 709
++L +S+ AQ + L I +KH+ +G+ D T D++WQ+L + E+ LL
Sbjct: 69 RSSLDKIISDGAQLLVEPDLEIIKKKHVTRGLCMD-TSKDLKWQLLSGKRTTEDG-SILL 126
Query: 710 SSATAIFRECFDPII-AECGRDLIPVMVYGRN----ISGQEFGGMYSVILTVKSVVVSAG 764
S+A IF + FDPI A GRDLIP MV GR + GQ++ GMY +LTV S VVSA
Sbjct: 127 SAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAA 186
Query: 765 LLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTK 824
LLR+ G +VAELPLVAT ++ QG G FQALFSCIER+L SL +++ VLPAA +AE IW K
Sbjct: 187 LLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMK 246
Query: 825 KFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
KFGF + + E L Y LTIF GTS L K V
Sbjct: 247 KFGFSRTTPEELEAYLNGAHLTIFHGTSYLYKAV 280
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 225/736 (30%), Positives = 329/736 (44%), Gaps = 96/736 (13%)
Query: 153 KITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN--- 209
K + R V L EEG + + G +G R K+ K +VP
Sbjct: 328 KTIHYMNSRVPHSVPTLESEEGSDGQESHEEGDTGPVRPSQAPRAAATKL-KSMVPIGDK 386
Query: 210 --EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEF 267
+ P N K+L++T +++G V+ + R QL G++ G C C C S +VS F
Sbjct: 387 LLKAPRNAKELMATRLMEGHFVR--CSCRGIQLTGMLKDMGVRCDCRNCKSSVIVSISAF 444
Query: 268 EQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP------LGILEEVVKKVAG-SSFNEG 320
E H+G+ + HP+++IYLENGK + I+ + A L L+ + + G +
Sbjct: 445 EAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKWKVT 504
Query: 321 SFRVWKASHH-------LRKGLVEHDEKHNMKLPSLPHSIIS-CSSLALEESISPTSCSF 372
+ W + R ++ H + P L C + + S
Sbjct: 505 CAKCWNSDEGDLIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATVKVKRSI 564
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERL 432
T ++ ++E K E RD LH+ LFLP GL DG L
Sbjct: 565 SSGTEKDDGRVRE----------------KDATESTRLNRDAHLHKALFLPGGLEDGTEL 608
Query: 433 TYIVKGQRLRFGCKQGNG---IVCDCCNKEISPSQFEAHAGMAARRQPYRHI-YTSNGMT 488
Y K Q G K+G +V EI+ +FE HAG ARR PY +I ++G +
Sbjct: 609 GYYTKSQLKLKGVKRGEAFKKVVV-----EINCYKFEQHAGCEARRNPYGNILLVADGRS 663
Query: 489 LHDIAISLA----MGQ---RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP 541
L D+ LA +G+ R G + C+ CG L C+GC + +C L
Sbjct: 664 LKDVCKELAHKNKLGEKEKRVARAGKVNSCYECGTRGELKNCHGCVETWCNSCTKGLETD 723
Query: 542 ESG-WRCPNCRQG--------------HSSSMSRSVDLK------GGLEAPGAEVGGCVI 580
G W C CRQ H MS + LE EVGGC I
Sbjct: 724 SDGKWYCRMCRQDTLNVAQIEQKRSNKHIEGMSNIAETDERDRCVRHLEGH-REVGGCAI 782
Query: 581 CRLSPSENFDIRLCRSHDFSAATF-DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 639
C+ + S F D T++ CDQC +E+HV CL+ +G+ DL E+P+ +
Sbjct: 783 -------------CKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPEGE 829
Query: 640 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMN---DVQWQMLK 696
WFC DC I L V+N + + S +S + ++ D + WQ+L
Sbjct: 830 WFCQKDCKVIDEILTQLVANGPELLTDSIISELLESRQQQTGAKDKAESSCPSFAWQIL- 888
Query: 697 KAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILT 755
+ L+ A IF EC DPI A+ G++LIP+MV R +F G++ ++L
Sbjct: 889 CGKSGNTANTQTLAEAINIFTECSDPIRDAKTGKNLIPLMVQSRRSKDHDFEGVFCIVLK 948
Query: 756 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 815
+ VVSA LL+IFG E+AE+PLVAT +QG+G +AL + IERLL L+VE LVLP A
Sbjct: 949 LNEKVVSAALLQIFGGEIAEVPLVATSLTHQGQGFCKALMTTIERLLGVLSVERLVLPTA 1008
Query: 816 EKAESIWTKKFGFRKM 831
+ ESIW KFGF ++
Sbjct: 1009 KNTESIWINKFGFSRV 1024
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 223/757 (29%), Positives = 333/757 (43%), Gaps = 124/757 (16%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHA 271
P N K+L++T +++G V+ + R QL G++ G C C C S +VS FE H+
Sbjct: 567 PRNAKELMATRLMEGHFVR--CSCRGIQLTGMLKDMGVQCNCRNCKGSVIVSISAFEAHS 624
Query: 272 GAKTRHPNNHIYLENGKPIYSIIQELKTAP------LGILEEVVKKVAG----------- 314
G+ + HP+++IYLENGK + I+ + A L L+ + + G
Sbjct: 625 GSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKRKVTCAKC 684
Query: 315 SSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIIS-CSSLALEESISPTSCSFV 373
S EG K + R +V H + P L C + + S
Sbjct: 685 ESSQEGDLIYCKGA---RCSVVAHSRCVGIANPQLGDWFCGKCEKTKKHHAAAKVKRSI- 740
Query: 374 QDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTK-KRDNDLHRLLFLPNGLPDGERL 432
G + K L +G P + + A + RD LH+ LFLP GL DG L
Sbjct: 741 -SGGADPEDGKVRLLRDLQGYASPSASKDKAATASVRLNRDAHLHKALFLPGGLVDGTEL 799
Query: 433 TYIVKGQRLRFGCKQGNGIVCDCCNKE------------ISPSQ---------------- 464
Y K Q G K+G GI C CCN+E + PSQ
Sbjct: 800 GYYTKSQLKLKGVKRGEGICCSCCNEETRHVQGGSRGGLMFPSQQDGTYYVFSQVKEIHV 859
Query: 465 ------------------------FEAHAGMAARRQP-YRHIY-----TSN--------- 485
FE HAG ARR P Y H+ TSN
Sbjct: 860 TTFNEEMKFLSNDLDDIQQISCYEFEQHAGCEARRNPGYYHVVAKYAETSNEYDSYGNIL 919
Query: 486 ----GMTLHDIAISLAMG------QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACL 535
G +L D+ LA ++R G + + + E + N +A + +
Sbjct: 920 LVPDGRSLKDVCKDLAHKNKLDEKEKRAARGGESRFSL--ELEQDSVFNNSTVAI-VSLI 976
Query: 536 DPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLE---------APGAEVGGCVICRLSPS 586
E W C CRQ S+ + + + G E A E G C+ P
Sbjct: 977 GIEKDSEGRWYCRMCRQ---DSLKVAQNGQKGSEKIMEGMSNIAETNEQGRCIRHLDGPR 1033
Query: 587 ENFDIRLCRSHDFSAATF-DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 645
E +C+ + S F D T++ CDQC +E+HV CL+ +G+ DL E+P +WFC D
Sbjct: 1034 EVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKD 1093
Query: 646 CNRIHAALQDFVSNRAQTIPASSLSTI--NRKHIEKGI-LFDGTMNDVQWQMLKKAQCFE 702
C I L V+N + + S++S + +R+ + + + +WQ+L +
Sbjct: 1094 CKVIDEILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILC-GKGSS 1152
Query: 703 EKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVV 761
+ L+ A IF +C DPI + G++LIP+MV R +F G++ V+L + VV
Sbjct: 1153 PADVQTLAEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVV 1212
Query: 762 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 821
SA LL+IFGRE+AE+PL+AT + G+ +AL + IERLL L+VE LVLP A+ ES+
Sbjct: 1213 SAALLQIFGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTESV 1272
Query: 822 WTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
W KFGF ++ ++L +LT F GTSM+ K +
Sbjct: 1273 WINKFGFSRVQEDQLKSICTTIRLTTFTGTSMVVKAI 1309
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 136/269 (50%), Positives = 175/269 (65%), Gaps = 7/269 (2%)
Query: 595 RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQ 654
R DF+ A FD+RTVI CDQCEKE+HVGCL+ +LKE+P+ +WFCC C+ ++L
Sbjct: 60 RQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSETRSSLD 119
Query: 655 DFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATA 714
+S+ AQ + L I +KH +G+ D T D++WQ+L + EE LLS+A
Sbjct: 120 KIISDGAQLLAERDLEIIRKKHETRGLCMD-TSKDLKWQLLSGKRATEEG-SILLSAAVP 177
Query: 715 IFRECFDPII-AECGRDLIPVMVYGRN----ISGQEFGGMYSVILTVKSVVVSAGLLRIF 769
IF + FDPI A GRDLIP MV GR + GQ++ GMY +LTV S VVSA L+R+
Sbjct: 178 IFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALMRVM 237
Query: 770 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 829
G +VAELPLVAT ++ QG G FQALFSCIER+L SL +++ VLPAA +AE IW KKFGF
Sbjct: 238 GGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMKKFGFS 297
Query: 830 KMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
++ E L LTIF GTS L K V
Sbjct: 298 RIPPEELEACLNGAHLTIFHGTSYLYKAV 326
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 163/438 (37%), Positives = 227/438 (51%), Gaps = 65/438 (14%)
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
++D LH+L+F N L DG + Y +G+ +SPS+FEAHAG
Sbjct: 185 RKDRGLHKLVFQENMLEDGAAVGYFDRGK--------------------VSPSKFEAHAG 224
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
A+RR+PY +I T++G++LH+ LA +R + SD+ C C NLL C+ C +F
Sbjct: 225 RASRRKPYSYIRTADGVSLHE----LANNRRISMSDSDERCSHCEQVGNLLWCDRCQRSF 280
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMS---RSVDLKGGLEAPGAEVGGCV-ICRLSPS 586
H C+ P+ C CR + + D+ G A G +C LS
Sbjct: 281 HLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKENDVATGRIAEGDPSEQITEVCTLSEK 340
Query: 587 ENFDIR-----LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 641
+ +++ LC DF+ RTV+ C QCEKEFHV CL+ + + +L E+PKDKWF
Sbjct: 341 QK-EVKDGPCALCSERDFNNNESGPRTVMICKQCEKEFHVECLKDHNMANLVELPKDKWF 399
Query: 642 CCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK-KAQC 700
C DC+ IH LQ ++ R E G+ D++W++L K
Sbjct: 400 CGIDCDDIHMKLQKLMA---------------RGEAELGL-------DIKWRLLNTKLNN 437
Query: 701 FEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKS 758
+ L+S A AIF E F I + DLI M+YG I GQ F GMY +L K
Sbjct: 438 PKHNISPLISKANAIFHERFKSIKDPKTKIDLIRAMLYGMEIEGQYSFEGMYCAVLYFKK 497
Query: 759 VVVSAGLLRIFGREVAELPLVATCREYQGK------GCFQALFSCIERLLCSLNVENLVL 812
V+ AG+ R+ G+EVAELPLVAT +YQ + G F++LFSCIE +L L V+ LVL
Sbjct: 498 VIACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSGYFRSLFSCIENMLRHLKVKTLVL 557
Query: 813 PAAEKAESIWTKKFGFRK 830
PAA +AES+W KFGF K
Sbjct: 558 PAAHEAESMWIDKFGFTK 575
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 150/361 (41%), Positives = 197/361 (54%), Gaps = 50/361 (13%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC--RQGHSSSMSRSVDL 564
SDD+C +C DG LLLC+ CP AFH C+ IP W C C RQ SS++ + +
Sbjct: 241 SDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCENRQQRESSLAYNHNA 300
Query: 565 KGGLEAPGAEVGGCVICR-----LSPSENFD-IRLCRSHDFSAATFDDRTVIYCDQCEKE 618
G + + R +P F LCR HDFS F RTV+ CDQ
Sbjct: 301 IAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLLCDQ---- 356
Query: 619 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 678
+P+ W+C DC RI L+D +S A+ I + + I RK+ +
Sbjct: 357 ---------------ALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQ 401
Query: 679 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY 737
K + DG + DV+W++LK K +LS A AIF E FDPII GRDLIP MVY
Sbjct: 402 KALNKDGDL-DVRWRVLKDKS--SADSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVY 458
Query: 738 GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSC 797
G + VVSAGL R+ G E+AELPLVAT R+ QG G FQALF C
Sbjct: 459 G-------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGC 499
Query: 798 IERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKK 857
IERLL SL V++ VLPAA++AESIWT++FGF K++++ L +Y + + T+F+GTS L K
Sbjct: 500 IERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKL 559
Query: 858 V 858
V
Sbjct: 560 V 560
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 49/84 (58%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 299
L G++ G LC C C VVS + FE HAG+ +HP+++I+LENG ++ I++
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSD 134
Query: 300 APLGILEEVVKKVAGSSFNEGSFR 323
A L +L+ ++ G + + +FR
Sbjct: 135 ATLDMLQSAIQNAIGPAPKKRTFR 158
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 92/229 (40%), Gaps = 59/229 (25%)
Query: 450 GIVCDCCN----KEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI------------- 492
GI+C C + +SP FE HAG + ++ P +I+ NG LHDI
Sbjct: 83 GILCSCSSCKGRTVVSPYYFEVHAG-STKKHPSDYIFLENGNNLHDILRACSDATLDMLQ 141
Query: 493 -AISLAMG---QRRTTGGSDDMCHVCGD-------GENLLLCNGCPLAFHAACLDPLLIP 541
AI A+G ++RT C C G+ LLC+ C + +
Sbjct: 142 SAIQNAIGPAPKKRTF-----RCQTCKSSFATLRTGKFALLCDSCLESKGS--------- 187
Query: 542 ESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVG----GCVICRLSPSENFDIRLCRSH 597
++ R + +SS RS + E+PG+++ ++ L + ++ +S
Sbjct: 188 QNSTRTSKIGRNPTSSARRSKN-----ESPGSKLALRSLKPMLAVLLDANRRNMSNRQSD 242
Query: 598 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDC 646
D + D ++ CD C + FH C+ IP+ W CC C
Sbjct: 243 DLCSICSDGGELLLCDSCPRAFHRECV------GFTTIPRGTW-CCRYC 284
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 278/966 (28%), Positives = 407/966 (42%), Gaps = 197/966 (20%)
Query: 31 ELLTDNVESDSFPNGKHAKEEAL--NDDIKSEVSNPVVSPKEFTSS---------LQDIT 79
E L+ NVE+ PNG +K+ A+ +D+ + + P P E + S +++
Sbjct: 332 EELSPNVEAH-VPNGNVSKKMAVPESDEAQGRNAQPAKDPGEISDSGLAQDTDRSVKNGR 390
Query: 80 SQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVL 139
S++ K T + L SEE SDG E E GD VL
Sbjct: 391 SRKRKSKTIHYMNSRVPHAVPALESEEG-SDGPESPEE------------GDD-----VL 432
Query: 140 EIPKHVSSSSGIRKI-TFKFSKRKEDYVAPLAYEEGRNYTLYDDLG----SSGANDGVLC 194
P S + + I K K ++V +A G Y LG SS G++
Sbjct: 433 VRPSQASRVAATKSIWALKHCKHIIEHV--VAQPPGVVYRFLSTLGQHVISSSCLVGLII 490
Query: 195 ARNMEIKMS------KKVVPN-----EYPTNVKKLLSTGILDGACVKYISTSRERQLDGI 243
+ I++ K +VP + P N K+L++T +++G V+ + R QL G+
Sbjct: 491 STLATIRLKGFALRLKSMVPIGDKLLKAPRNAKELMATKLMEGHFVR--CSCRGMQLTGM 548
Query: 244 VNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIY------------ 291
+ G C C C S +VS FE H+G+ + HP+++IYLENGK +
Sbjct: 549 LKDMGVQCNCRNCKGSMIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQEAADC 608
Query: 292 --SIIQELKTAPLGILEEVVK-KVA----GSSFNEGSFRVWKASHHLRKGLVEHDEKHNM 344
+I++ LK A +G ++ V K KV G S EG K + R +V H
Sbjct: 609 GDNILRALKMA-IGDIQGVEKSKVTCAECGGS-EEGDLIYCKGA---RCSVVSHSRCVGS 663
Query: 345 KLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRT 404
P L C + T + K + G + + K T
Sbjct: 664 ANPQLGDWF----------------CGKCEKTKKRHAAAK-VKRSISAGTEDSEVRDKAT 706
Query: 405 AEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
RD L + LFLP GL DG L Y K Q G K+G GI C CCNKEIS +
Sbjct: 707 TASARLNRDAHLRKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNKEISCYE 766
Query: 465 FEAHAGMAARRQPYRHI-YTSNGMTLHDIAISLA----MG--QRRTTGGSDDMCHVCGDG 517
FE HAG ARR PY +I ++G +L D++ LA +G ++R + C+ C +
Sbjct: 767 FEQHAGCEARRNPYGNILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEVCCYECSNS 826
Query: 518 ENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQ--------GHSSS------MSR-- 560
L C+ C A+ C + G W C CRQ GH + MS
Sbjct: 827 GELKRCHSCEEAWCDKCTKGMETDSEGRWYCRMCRQDSLKVAQNGHKGTDKIIEGMSNIA 886
Query: 561 SVDLKG----GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATF-DDRTVIYCDQC 615
D KG LE P EVGGC I C+ + S F D T++ CDQ
Sbjct: 887 ETDEKGRCVRHLEGP-REVGGCAI-------------CKKWNLSKTGFVDGMTILVCDQV 932
Query: 616 E--KEFHVG---CLRKNGLCD-----------LKEIPKDKWFCCDDCNRIHAALQDFVSN 659
+ G + NG D +E+P+ +WFC DC I L V+N
Sbjct: 933 RSLNQMLPGTRITWKVNGFTDPNILMQRHCITSQELPEGEWFCQKDCKVIDEILTQLVAN 992
Query: 660 RAQTIPASSLSTI---NRKHIEKGILFDGTMNDVQWQML-------KKAQCFEEKEKSLL 709
+ + S +S + ++ + + + WQ+L Q E
Sbjct: 993 GPELLSNSIISELLESRQQQSSVKVKLESSNPRFGWQILCGEGGSSANVQTLAEAANIFT 1052
Query: 710 S------------------------SATAIF---REC-----FDPII-AECGRDL----I 732
S S+T +F + C FDP +EC D+ +
Sbjct: 1053 SIDDINLPYLWLVVGNYSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNV 1112
Query: 733 PVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQ 792
++V R +F G++ V+L + VVSA LL+IFGRE+AE+PL+AT +QG+G +
Sbjct: 1113 ALVVCSRRAKDHDFEGVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCK 1172
Query: 793 ALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTS 852
AL + IERLL L+VE LVLP A+ ESIW KFGF ++ ++L + +L F GT
Sbjct: 1173 ALMTTIERLLGVLSVERLVLPTAKNTESIWINKFGFSRVPEDQLKRICTTIRLMTFTGTR 1232
Query: 853 MLEKKV 858
ML K +
Sbjct: 1233 MLGKAI 1238
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 202/704 (28%), Positives = 317/704 (45%), Gaps = 125/704 (17%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF-SKVVSAHEFEQHAG 272
+V+ LLSTG+L+G V Y+ E G +NG GY CGC CN+ S +++A EFE+H G
Sbjct: 114 DVRGLLSTGLLEGFRVTYMKDEVEEV--GRINGQGYSCGCSKCNYNSNIMNACEFEEHYG 171
Query: 273 AKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLR 332
+ +HI+L+ G ++ +++ LK L +L + +++ G N + WKAS R
Sbjct: 172 QSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKASFQKR 231
Query: 333 K-----------------GLVEHDEKHNMK---LPSLPHSIISCSSLALEE--------- 363
K GL + ++++ S+ +SI + + A +
Sbjct: 232 KDYLDAVASDGCLTQSSQGLAAGEMIYSLRDYLKDSVSNSISNLNWSASKRRSGRRFRQG 291
Query: 364 ---------SISPTSCSFVQDT------GREQTHMKEMLEERKRGVKRPFMH---QKRTA 405
S SP F T G E+TH + L + +RP + ++
Sbjct: 292 DTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETH-RLSLSSPVKITQRPLRNCSIDSKSK 350
Query: 406 EGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQ 464
E +K RD LH L+F +GL D LTY +K G+ L+ G K+G I+C+CCN+E SPS
Sbjct: 351 E--SKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSH 408
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FE HAGM RRQPY +IYT G++LH +A+ L ++ L N
Sbjct: 409 FEEHAGMGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPN 447
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLS 584
G A ++ D + SG C + S++ V LK L+ E C C
Sbjct: 448 GFDNASVSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYG 502
Query: 585 PSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKD 638
+ + + T+I+C+QCE+ H+ C +K L LKE
Sbjct: 503 HT-------------TIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCF 549
Query: 639 KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTM-NDVQWQML 695
+ CC +C + A L++ +EK GI F + +++ W++L
Sbjct: 550 HFLCCQECQSLRARLEE--------------------GLEKCVGITFLRRIRSNICWRLL 589
Query: 696 KKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVIL 754
+ K + IF++ F E D+I MV G+N + +F GMY +L
Sbjct: 590 SGMDASRDV-KLYMPQVIDIFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALL 647
Query: 755 TVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPA 814
T + VVSA +L++ ++AEL L+AT E + KG F L IE L + NV L P
Sbjct: 648 TASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPV 707
Query: 815 AEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+ IW++K GF +S E L +FK +++K +
Sbjct: 708 DPEMAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 751
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 223 bits (567), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 165/528 (31%), Positives = 233/528 (44%), Gaps = 158/528 (29%)
Query: 478 YRHIYTSNGMTLHDIAISLAMGQRRTTGGSD-------DMCHVCGDGENLLLCNGCPLAF 530
Y IY ++G +LHD A SL + Q G D C CGD +L LC GCP A+
Sbjct: 961 YTSIYLADGRSLHDAAQSLVVEQNVKQEGVTPAKIEHLDQCVECGDSGDLQLCTGCPNAY 1020
Query: 531 HAACLDP-----------LLIPE---------------SGWRCPNCRQ---GHSSSMSRS 561
H + P LL E + CP+C++ G + RS
Sbjct: 1021 HQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSFGEFFCPDCQEQRFGGTKDRRRS 1080
Query: 562 VDL---KGG-----------------LEAPGAEV-GGCVICRLSPSENFDIRLCRSHDFS 600
+ KG L+ P A V GGCV C+S DF+
Sbjct: 1081 MTKRRSKGAAKTLLTKDRMIGRCSRLLQVPEAIVLGGCV-------------FCKSGDFA 1127
Query: 601 AATFDDRTVIYCDQCEKEFHVGCLRKNGL------------------------------- 629
F +T + CDQCE+E+HVGCL+K+GL
Sbjct: 1128 KTGFGPKTTLLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGEVVFRCVATVRLR 1187
Query: 630 -CD----------------------LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 666
CD +E+P+ +WFC DC IH+ L VSN + +
Sbjct: 1188 DCDRGCESQGIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLAD 1247
Query: 667 SSLSTI---NRKHIEKG-ILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR----- 717
S +S + N+ +E+ + + + +WQ+L + + L+ A IF
Sbjct: 1248 SIISKVLRTNQARLERSEDATESSCSGFEWQLL-HGRGGDPSNGKALAEAVQIFSVRNLS 1306
Query: 718 -----------------------ECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVI 753
ECFDPI G DLIP+MVY R++ Q+FGG+Y V+
Sbjct: 1307 DPGFPVRTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLRDQDFGGIYCVV 1366
Query: 754 LTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLP 813
L + VVS L+R+FGR++AELPL+AT +QG+G +AL IERLL L VE L LP
Sbjct: 1367 LKYDNRVVSTALIRVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLLGVLRVERLALP 1426
Query: 814 AAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 861
AAE AE IW KFGFR+M+ ++ ++ D + +F G+ MLEK++ L
Sbjct: 1427 AAEGAEGIWLNKFGFRRMAEGQVKQFHSDLNMMVFTGSFMLEKEIPPL 1474
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 124/269 (46%), Gaps = 22/269 (8%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
K+V+ E P + K LL +G+LDG V+Y+ L GI+ GG LC C C +VV+
Sbjct: 582 KEVLLKEAPASAKLLLQSGLLDGHHVRYMGRGGHIMLTGIIQEGGVLCDCSSCKGVQVVN 641
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQ------ELKTAPLGILEEVVKKVAG--- 314
FE+HAG+ RHP++ I+LENGK + I++ + K + +L+ + +V G
Sbjct: 642 VSAFEKHAGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNIMDVLKSAIGEVGGVKV 701
Query: 315 --SSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSF 372
SS + A L + D K + + P + + +
Sbjct: 702 QISSLEHPLIAIQPALKKLPQPKSLLDTKPRVPVDVKPRTPQGDTKPKMPSDTKARFTPE 761
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDN---DLHRLLFLPNGLPDG 429
V+ G + L+ R +K TA R++ +LH+ LFLP GL D
Sbjct: 762 VKARGSDARASMPRLDRTPR--------EKETASPPVLSRESSGANLHKALFLPGGLEDD 813
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNK 458
+ Y VKGQ+ G K+G GI+C CC +
Sbjct: 814 TEVGYYVKGQKSLAGVKKGAGILCSCCQQ 842
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 146/487 (29%), Positives = 228/487 (46%), Gaps = 103/487 (21%)
Query: 403 RTAEGGTKKRDNDLHRLLFLPN---GLPDGERLTYIV-KGQRLRFGCKQ-------GNGI 451
R G ++R+++ H+ LFLP+ GL DGE ++YI +G+ L G + +GI
Sbjct: 431 RVVSGARRERNSNKHKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGI 490
Query: 452 VCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMC 511
+C CCN IS SQFEAHAG +RR PY +I+T+ G++L +A + + + +
Sbjct: 491 LCACCNGVISCSQFEAHAGRGSRRAPYDNIFTAAGVSLRKLACLMPASEAESP-----IS 545
Query: 512 HVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAP 571
H LC +A A L+P L+ SG EA
Sbjct: 546 H-----RPAALCA---VADRRA-LEPELVTVSG------------------------EA- 571
Query: 572 GAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 631
A GGCV+C++ DF F +RT+I CDQCE+E+H+GCL ++G
Sbjct: 572 -ALHGGCVLCKVP-------------DFLRGGFGERTMIICDQCEREYHIGCLAEHGRAH 617
Query: 632 LKEIPKDK---------------WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
L E+P+ K W C +C I +++ VS ++P
Sbjct: 618 LTELPEGKASLYDILLTLHLNGEWHCSPECKGIATRMRERVS----SVP----------- 662
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVM 735
+ G + WQ+L+ +L +A I E FDPI+ G DL+ M
Sbjct: 663 ----VPLQG---EYSWQVLRGKDGTHATTWAL-KAAQEILTESFDPILDLVTGADLMMAM 714
Query: 736 VYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 795
VY + + ++ GMY+ +L + V + + R+FGR++AE+PLVAT + +G + L
Sbjct: 715 VYAQELGDWDYTGMYTAVLRRRGKAVCSAVFRVFGRQLAEVPLVATRLGARRQGHARVLM 774
Query: 796 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 855
+ E SL V++L LPAA+ W FGF ++ E + ++ IF GT +L+
Sbjct: 775 AAFEDYFRSLGVQSLCLPAAQSTVETWIHGFGFAAITPEEQAATCSELRVLIFPGTELLQ 834
Query: 856 KKVQCLP 862
K + P
Sbjct: 835 KPLNPTP 841
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 63/133 (47%), Gaps = 20/133 (15%)
Query: 436 VKGQRLRFGCKQG----NG-------IVCDC--CNKEISP----SQFEAHAGMAARRQPY 478
++GQ + F + G NG I C C C + +P S+FE HAG RR P
Sbjct: 239 LRGQPVFFQSRHGDLLLNGSITEEGQIACPCKQCRAKKTPGVSCSEFEEHAGSRERR-PG 297
Query: 479 RHIYTSN-GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
IY +N ++L + +L + R+ C +C DG +LL C+GCP A HA C
Sbjct: 298 ESIYLTNLSISLKEF-CALVNDEGRSADRHGSACGLCMDGGDLLCCDGCPTAVHAYCAGL 356
Query: 538 LLIPESGWRCPNC 550
+PE W C C
Sbjct: 357 EEVPEGDWFCDAC 369
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDG-IVNGGGYLCGCPLCNFSKV--VSAHEFEQH 270
++++L G L G V + S + L+G I G C C C K VS EFE+H
Sbjct: 229 TLREVLKGGALRGQPVFFQSRHGDLLLNGSITEEGQIACPCKQCRAKKTPGVSCSEFEEH 288
Query: 271 AGAKTRHPNNHIYLEN 286
AG++ R P IYL N
Sbjct: 289 AGSRERRPGESIYLTN 304
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 145/459 (31%), Positives = 210/459 (45%), Gaps = 70/459 (15%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+KKRD LH L+F GLPD LTY +K G+ L G KQG GIVC+CCN+E+SPS+FE
Sbjct: 167 SKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEK 226
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY++IYTS G+TLHD+A+ L + G S+ D N L +GC
Sbjct: 227 HAGMGKRRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPN-LTSSGC- 284
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS-VDLKGGLEAPGAEVGGCVICRLSPS 586
G S+S V LK L+ + C C +
Sbjct: 285 -------------------------GKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHT 319
Query: 587 ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKW 640
E D +++C+QCE+ HV C +K L LK+ +
Sbjct: 320 E-------------LGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHF 366
Query: 641 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC 700
CC +C + L+ + E+ +++ W++L A
Sbjct: 367 LCCQECQSLRVRLEGM------------------EKCEEIAFLGRIRSNICWRLLSSADA 408
Query: 701 FEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSV 759
+ K LS IF++ F + + I MVYG+N G+ +F GMY V+LT +
Sbjct: 409 SRDV-KLYLSQVIDIFKDAF--LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTH 465
Query: 760 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 819
VVSA +L++ AEL L+AT E + KG F+ L IE L + NV L+ P +
Sbjct: 466 VVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMA 525
Query: 820 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
IW+ K GF +S + L +FK +++K +
Sbjct: 526 QIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 564
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 222 GILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTRHPNN 280
G+ D + Y + E L G G G +C C CN + VS EFE+HAG K R P
Sbjct: 183 GLPDNTLLTYKLKNGEALLQGYKQGAGIVCNC--CN--QEVSPSEFEKHAGMGKRRQPYQ 238
Query: 281 HIYLENGKPIYSIIQEL 297
+IY G ++ + +L
Sbjct: 239 NIYTSQGLTLHDVALQL 255
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 145/459 (31%), Positives = 210/459 (45%), Gaps = 70/459 (15%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+KKRD LH L+F GLPD LTY +K G+ L G KQG GIVC+CCN+E+SPS+FE
Sbjct: 136 SKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEK 195
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY++IYTS G+TLHD+A+ L + G S+ D N L +GC
Sbjct: 196 HAGMGKRRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPN-LTSSGC- 253
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS-VDLKGGLEAPGAEVGGCVICRLSPS 586
G S+S V LK L+ + C C +
Sbjct: 254 -------------------------GKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHT 288
Query: 587 ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKW 640
E D +++C+QCE+ HV C +K L LK+ +
Sbjct: 289 E-------------LGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHF 335
Query: 641 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC 700
CC +C + L+ + E+ +++ W++L A
Sbjct: 336 LCCQECQSLRVRLEGM------------------EKCEEIAFLGRIRSNICWRLLSSADA 377
Query: 701 FEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSV 759
+ K LS IF++ F + + I MVYG+N G+ +F GMY V+LT +
Sbjct: 378 SRDV-KLYLSQVIDIFKDAF--LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTH 434
Query: 760 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 819
VVSA +L++ AEL L+AT E + KG F+ L IE L + NV L+ P +
Sbjct: 435 VVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMA 494
Query: 820 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
IW+ K GF +S + L +FK +++K +
Sbjct: 495 QIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 533
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 222 GILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTRHPNN 280
G+ D + Y + E L G G G +C C CN + VS EFE+HAG K R P
Sbjct: 152 GLPDNTLLTYKLKNGEALLQGYKQGAGIVCNC--CN--QEVSPSEFEKHAGMGKRRQPYQ 207
Query: 281 HIYLENGKPIYSIIQEL 297
+IY G ++ + +L
Sbjct: 208 NIYTSQGLTLHDVALQL 224
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 196 bits (497), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 148/479 (30%), Positives = 213/479 (44%), Gaps = 83/479 (17%)
Query: 397 PFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK--GQRLRFGCKQGNGIVCD 454
P+ H+ +T + LL N L +LTY + G RL+ G +GI C
Sbjct: 696 PYQHRAKT-----------VLSLLVDKNILLPRVKLTYKQRSDGPRLKEGAVTKDGIKCR 744
Query: 455 CCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------------ 502
CCN+ + FE HAG + R P HI+ +G L + L MG+ +
Sbjct: 745 CCNELFTLESFEVHAGCSTR-LPAAHIFLKDGRPLSQCLVEL-MGENKPKESLHVRLKTN 802
Query: 503 -TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS 561
+ SD +C +C +G +LLC+ CP +FH AC+ PE W CP+CR
Sbjct: 803 YSDTESDSICSICNEGGEILLCDNCPSSFHHACVGLESTPEGSWYCPSCR---------- 852
Query: 562 VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
C IC S +D F ++T++YCDQCE+E+HV
Sbjct: 853 ----------------CSICDSS-----------DYDPDTNKFTEKTIMYCDQCEREYHV 885
Query: 622 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS-TINRKHIEKG 680
GC+R G L P+ WFC C+ I LQ + T P LS TI R E
Sbjct: 886 GCMRNKGD-QLTCCPEGCWFCSRGCSEIFQHLQGLIGKSIPT-PVEGLSCTILRFDRENA 943
Query: 681 ILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR 739
N++ + K L A + ECF II RDL +V+ R
Sbjct: 944 SQHGDFYNEIIAEQYGK-----------LCIALDVLHECFVTIIEPSTRRDLSEDIVFNR 992
Query: 740 --NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSC 797
+ F G Y++IL ++S G R+ G++ AELPL+ T +Y+ +G + L +
Sbjct: 993 ESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAELPLIGTRVQYRRQGMCRLLMNE 1052
Query: 798 IERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
+E+LL L VE LVLPA + WT FGFR MS + L+ F+GT++ +K
Sbjct: 1053 LEKLLSGLGVERLVLPAIPQLLETWTGSFGFRAMSFSDRFELAESSILS-FQGTTICQK 1110
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R +G V G C C CN ++ + FE HAG TR P HI+L++G+P+ + EL
Sbjct: 730 RLKEGAVTKDGIKCRC--CN--ELFTLESFEVHAGCSTRLPAAHIFLKDGRPLSQCLVEL 785
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/175 (56%), Positives = 125/175 (71%), Gaps = 4/175 (2%)
Query: 689 DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFG 747
DV+W++L + + K LL A ++F ECFDPI+ GRDLIP MV GRN+ Q+FG
Sbjct: 16 DVRWRLLN-GRVASPETKPLLLEAVSMFHECFDPIVDPATGRDLIPAMVNGRNLRTQDFG 74
Query: 748 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 807
GMY +L V S V SA +LRIFG ++AELPL+AT + +GKG FQ LFSCIERLL L+V
Sbjct: 75 GMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQTLFSCIERLLSFLSV 134
Query: 808 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV-QC 860
+NLVLPAAE+AESIW KFGF KM ++L Y++++ Q+ FKGT ML K V QC
Sbjct: 135 KNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGTVMLHKTVPQC 189
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 143/466 (30%), Positives = 218/466 (46%), Gaps = 72/466 (15%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEI--SPSQF 465
+K RDN LH+L+F G+P+ LTY +K G+ L+ G KQG I+CDCC++E+ +PS F
Sbjct: 488 SKVRDNSLHQLVFKEGGVPELTILTYKLKHGEVLKQGYKQGTCILCDCCSEEVQFTPSHF 547
Query: 466 EAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNG 525
E HAGM RRQPYR+IYT G+TLH++A+ L G + G+ GD L
Sbjct: 548 EEHAGMGKRRQPYRNIYTPEGLTLHELALKLQGG--LNSNGNSSANFPGGDEPPNLSSGS 605
Query: 526 CPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSP 585
+ P ++P LK L+ + C
Sbjct: 606 --SRESSTTYRPSIVP----------------------LKRTLQQIADKTESC------- 634
Query: 586 SENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDK 639
RLC + T + +++C+QCE+ HV C +K L L E +
Sbjct: 635 ------RLCGDACTTIGTISEDMIVFCNQCERPCHVKCYNNGLQKQKGPLNVLAEYMQFH 688
Query: 640 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ 699
+FCC C + A+L + ++ R + I +K + V WQ+L
Sbjct: 689 FFCCQKCQLLRASLHEVLNKREK---------IRQKR-----------SYVFWQILNGMN 728
Query: 700 CFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKS 758
+K + IF+ F P A +I MV +++ G+ +F GMY +LT S
Sbjct: 729 PGINVQK-YIHQVIEIFKVAF-PKTAASDFGVIQDMVNAKDVGGEKDFRGMYCAVLTTSS 786
Query: 759 -VVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEK 817
+VVSA +L++ EVAEL +VATC +++ KG F L IE L ++NV L +
Sbjct: 787 KLVVSAAVLKVRTEEVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMNVRLLTALVDPE 846
Query: 818 AESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 863
ESIW+KK GF +S E L +F+ ++++K + P+
Sbjct: 847 MESIWSKKLGFTILSGEEKETLLEAHPLVMFEDLTLMQKSLASKPD 892
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 26/139 (18%)
Query: 200 IKMSK-KVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
++M++ K P +V+ LLSTG+L+G V Y G + C
Sbjct: 258 VRMAETKYRPETLLKDVRGLLSTGLLEGFSVTY-------------KKNGKMNAC----- 299
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
EFEQHAG + + N+HI+L++G +Y +IQ LK L +L +++++ G N
Sbjct: 300 -------EFEQHAGQSSNNQNDHIFLDSGISLYKLIQALKYKKLHLLADLIEEQTGLPPN 352
Query: 319 EGSFRVWKASHHLRKGLVE 337
+ WKAS ++ +E
Sbjct: 353 LIEYGKWKASFEVQNDDLE 371
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 240 LDGIVNGGGYLCGCP-LCNF---SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQ 295
L+G V GGY+C C C++ K +SA +FE+HAGA +++ N HI+L NG +Y++
Sbjct: 26 LEGFVAEGGYVCACTSPCDYRRGGKALSALQFEKHAGAVSKNQNGHIFLRNGTSLYALFH 85
Query: 296 ELKTAPLGILEEVVKKVAG 314
L+ P E + AG
Sbjct: 86 ALREVPAQRFAEDFRMAAG 104
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 188 bits (478), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 144/497 (28%), Positives = 222/497 (44%), Gaps = 86/497 (17%)
Query: 408 GTKKRDNDLHRLLFLPNGLPDGERLTY--IVKGQ-RLRFGCKQGNGIV-CDCCNKEISPS 463
GT+ R L +L L +TY + GQ ++G G GI+ C CCNK + +
Sbjct: 313 GTQHRSRTLLSVLIDSGILAPRGNVTYWAVRDGQPACKYGFITGEGIIRCKCCNKTFTVA 372
Query: 464 QFEAHA----GMAARRQPYRHIYTSNGMTLHDIAISL-------AMGQRRTTGGS----- 507
+ EAHA G RR+ + ++ +G +L + L A R GS
Sbjct: 373 ELEAHATGGIGTDDRREAWARVFVEDGRSLSLCLMELMRRDDVGAAAANRNRNGSVMRVK 432
Query: 508 --------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
D +C VC D LLLC+ CP AFH AC+ PE W CP CR
Sbjct: 433 EKCSEEEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR-------- 484
Query: 560 RSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
C +C S + D D +A F D+T+IYC+QCE+E+
Sbjct: 485 ------------------CGVCGGS---DLD-------DDTAEGFTDKTIIYCEQCEREY 516
Query: 620 HVGCLRKNG--------LC---DLKEIPKDKW---FCCDDCNRIHAALQDFV-SNRAQTI 664
HVGC+R+ G C E P+++W C +C + LQ V S+RA++I
Sbjct: 517 HVGCMRRGGSEEESAAEWCRRLSESEGPEEEWRPWLCSPECGEVFQHLQALVASSRARSI 576
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
P S + + +WQ ++ + + L +A + ECFD ++
Sbjct: 577 PHYSRGAYHSAPCGRRRYMSTVTRITRWQHEEE----DAADHGQLCAALDVLHECFDDMV 632
Query: 725 -AECGRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 781
DL +V+ + + F G Y V L +++ G LR+FG +VAELPLV T
Sbjct: 633 EPRTQTDLAADIVFNQESGLRRLNFRGYYVVGLEKAGELINVGTLRVFGNQVAELPLVGT 692
Query: 782 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 841
++ +G + L + +E++L + V LVLPA + +WT GF M+R +++
Sbjct: 693 RFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELMPMWTASLGFHAMTRSDVMEMAV 752
Query: 842 DFQLTIFKGTSMLEKKV 858
+ + FKGT+M +K +
Sbjct: 753 EHAILSFKGTTMCQKTL 769
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 138/463 (29%), Positives = 208/463 (44%), Gaps = 80/463 (17%)
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEI--SPS 463
G K RDN LH +LF GLPD LTY +K G+ L+ G K G GI+C+CC+ E+ +PS
Sbjct: 480 GQPKVRDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPS 539
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLC 523
QFE H GM RRQPYR IYTS+G+TLH++A+ L G + G G+
Sbjct: 540 QFEKHVGMGRRRQPYRSIYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKE---- 595
Query: 524 NGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRL 583
++ P+++P LK L+ V C +CR
Sbjct: 596 -------YSTTSRPIIVP----------------------LKRTLQERVLTVESCYMCR- 625
Query: 584 SPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD-------LKEIP 636
+ H D +++C+QCE+ HV C NGL L E
Sbjct: 626 -----------KPHTVLGVISVD-MIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYT 672
Query: 637 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK 696
+ + CC+ C + A+L + + R E ++ WQ+L
Sbjct: 673 QFNFMCCEKCQLLRASLHEGLKKR-----------------EDIAFLRRIRYNICWQLLN 715
Query: 697 KAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILT 755
+ + ++ IF++ F A D I MV ++ +G+ +F G+Y +LT
Sbjct: 716 GTNMRSDVQHQVIE----IFKDAFAE-TAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLT 770
Query: 756 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 815
+ VVSA +L++ EVAEL L+AT E + KG F L S IE L + NV L P
Sbjct: 771 TSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVD 830
Query: 816 EKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+ IW++K G+ +S E+ L +F S+++K +
Sbjct: 831 PEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFANLSLVQKSL 873
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 94/171 (54%), Gaps = 9/171 (5%)
Query: 164 DYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSK-KVVPNEYPTNVKKLLSTG 222
D+ P A E+ RN L SS +L +++++++ K P +V+ LLSTG
Sbjct: 200 DHAMPDA-EQTRNPVLEQPWDSS-----MLITAPVKVRVTETKYRPESILKDVRGLLSTG 253
Query: 223 ILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHI 282
+L+G V Y ER G +NG GY CGC C + +++A EFEQH+G + + NNHI
Sbjct: 254 LLEGFRVTYKKNEVERI--GRINGQGYSCGCSECGYRNIMNACEFEQHSGESSNNQNNHI 311
Query: 283 YLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRK 333
+L++G +Y +IQ LK L +L +V+ KV N + WKAS L K
Sbjct: 312 FLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYEKWKASFQLEK 362
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 138/463 (29%), Positives = 208/463 (44%), Gaps = 80/463 (17%)
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEI--SPS 463
G K RDN LH +LF GLPD LTY +K G+ L+ G K G GI+C+CC+ E+ +PS
Sbjct: 579 GQPKVRDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPS 638
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLC 523
QFE H GM RRQPYR IYTS+G+TLH++A+ L G + G G+
Sbjct: 639 QFEKHVGMGRRRQPYRSIYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKE---- 694
Query: 524 NGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRL 583
++ P+++P LK L+ V C +CR
Sbjct: 695 -------YSTTSRPIIVP----------------------LKRTLQERVLTVESCYMCR- 724
Query: 584 SPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD-------LKEIP 636
+ H D +++C+QCE+ HV C NGL L E
Sbjct: 725 -----------KPHTVLGVISVD-MIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYT 771
Query: 637 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK 696
+ + CC+ C + A+L + + R E ++ WQ+L
Sbjct: 772 QFNFMCCEKCQLLRASLHEGLKKR-----------------EDIAFLRRIRYNICWQLLN 814
Query: 697 KAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILT 755
+ + ++ IF++ F A D I MV ++ +G+ +F G+Y +LT
Sbjct: 815 GTNMRSDVQHQVIE----IFKDAFAE-TAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLT 869
Query: 756 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 815
+ VVSA +L++ EVAEL L+AT E + KG F L S IE L + NV L P
Sbjct: 870 TSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVD 929
Query: 816 EKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+ IW++K G+ +S E+ L +F S+++K +
Sbjct: 930 PEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFANLSLVQKSL 972
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 213 TNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL---CNF-SKVVSAHEFE 268
T VK LL +L G V Y +L+G V GGY C CP C++ KV+SA +FE
Sbjct: 23 TCVKDLLP--LLQGVPVTYRFEKHNAKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFE 80
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG 314
+HAG +++ N HI+L NG+ +Y + +L+ P E + AG
Sbjct: 81 KHAGVTSKNQNGHIFLRNGRSLYELFHKLREVPAEKFPEAFRMAAG 126
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 83/171 (48%), Gaps = 32/171 (18%)
Query: 164 DYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSK-KVVPNEYPTNVKKLLSTG 222
D+ P A E+ RN L SS +L +++++++ K P +V+ LLSTG
Sbjct: 322 DHAMPDA-EQTRNPVLEQPWDSS-----MLITAPVKVRVTETKYRPESILKDVRGLLSTG 375
Query: 223 ILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHI 282
+L+G V Y +++++A EFEQH+G + + NNHI
Sbjct: 376 LLEGFRVTY-------------------------KKNEIMNACEFEQHSGESSNNQNNHI 410
Query: 283 YLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRK 333
+L++G +Y +IQ LK L +L +V+ KV N + WKAS L K
Sbjct: 411 FLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYEKWKASFQLEK 461
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 186 bits (471), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 133/451 (29%), Positives = 212/451 (47%), Gaps = 68/451 (15%)
Query: 426 LPDGERLTYIVKGQR--LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
L + ++++Y+ K R L G + GIVC CC K S S FEAH+G + RR I+
Sbjct: 2 LHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRR-ACASIFN 60
Query: 484 SNGMTLHDIAISL------AMGQRRTTGGS-----DDMCHVCGDGENLLLCNGCPLAFHA 532
+ G +L D+ + + + + DD C VCGDG L+ C+ CP +H
Sbjct: 61 NKGESLLDLQVQAWELLDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHL 120
Query: 533 ACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIR 592
+CL +PE W CP+CR C IC S
Sbjct: 121 SCLLLKELPEGEWFCPSCR--------------------------CAICGGS-------- 146
Query: 593 LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
++ ++F++ TV+ CDQCE+E+HV CL G+ + P D WFC D C++I
Sbjct: 147 ---EYNADGSSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFEG 203
Query: 653 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 712
L+ V + TI LS + E + +MN + Q E + K L+ A
Sbjct: 204 LRKLV-GISNTI-GEGLSWTLLRSGEDDLPSASSMN--------REQMAEHRSK--LAVA 251
Query: 713 TAIFRECFDPIIAECGR-DLIPVMVYGRN---ISGQEFGGMYSVILTVKSVVVSAGLLRI 768
+ +ECF P++ + DL+ ++Y R ++ F G Y+V+L V+S +RI
Sbjct: 252 LGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRI 311
Query: 769 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 828
G +AE+PL+ T ++ +G + L IE LL L + + VLPA + W FGF
Sbjct: 312 HGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGF 371
Query: 829 RKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
++M+ + L+ + + F G ++L+K +Q
Sbjct: 372 QEMAPTQRLELVK-LSVVSFPGVTLLQKPLQ 401
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 186 bits (471), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 135/450 (30%), Positives = 201/450 (44%), Gaps = 77/450 (17%)
Query: 429 GERLTYI-VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGM 487
G+++TY G ++ G + +GI C CCN+ + FE HAG ++ P H++ +GM
Sbjct: 585 GDKVTYKQSDGPGIKEGSIRRDGIKCMCCNEIFTVENFEVHAG-SSTPLPSAHMFLKDGM 643
Query: 488 TLHDIAISLAMGQR---------RTTGG-----SDDMCHVCGDGENLLLCNGCPLAFHAA 533
+L + G + R G SD +C VC DG +LLLC+ CP ++H
Sbjct: 644 SLSQCLVEFMGGNKPRDPHPLHARLKGKNSDLESDSICSVCHDGGDLLLCDNCPSSYHHD 703
Query: 534 CLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRL 593
C+ IPE W CP+CR C IC LS
Sbjct: 704 CVGLEAIPEGNWYCPSCR--------------------------CSICNLS--------- 728
Query: 594 CRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 653
+D + F ++T++YCDQCE+E+HVGC R N L P+ WFC C+ + L
Sbjct: 729 --DYDPDTSQFTEKTIVYCDQCEREYHVGCTR-NSDNQLICRPEGCWFCSRGCSNVFQHL 785
Query: 654 QDFVSNRAQT----IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 709
Q+ + T + + L + + G D M D L
Sbjct: 786 QELIGKSVPTPIEGVSWTILKFCSGNGSDHGDYDDEIMAD---------------HYGKL 830
Query: 710 SSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGLL 766
A I ECF II D+ +V+ R + F G Y+++L +S G
Sbjct: 831 CVAVGILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYTILLQKGGEPISVGTF 890
Query: 767 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 826
RI G++ AELPL+ T Y+ +G + L + +E+LL L VE L+LPA + WT F
Sbjct: 891 RICGQKFAELPLIGTSSPYRRQGMCRLLINELEKLLLDLGVERLILPAVPELLETWTCSF 950
Query: 827 GFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
GF MS L+ + L+ F+GT+M +K
Sbjct: 951 GFTIMSNSDRLELAGNSILS-FQGTTMCQK 979
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 201/425 (47%), Gaps = 66/425 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------ 502
+GI C CC+K ++ S+FE HAG + RQP+++IY +G++L + I Q
Sbjct: 691 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLDSGVSLLECQIDAWNRQESIERIGF 749
Query: 503 ---TTGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
T G +DD C +CGDG +L+ C+GCP FH +CLD +++P W CPNC
Sbjct: 750 HSVNTDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIMMLPPGDWHCPNCT----- 804
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C C ++ SE+F D + + ++ C C
Sbjct: 805 ---------------------CKFCGIA-SEDF-----VQEDGTNVS----ELLTCSLCA 833
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
K++H CL+ + FC C + LQ ++ + + S S ++R
Sbjct: 834 KKYHKSCLQDVDAPCIDFNNSTPCFCGKTCRELFEQLQKYLGIKHELESGFSWSLVHRMD 893
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVM 735
I+ + G + ++ +C S L+ A ++ ECF PI+ G ++I +
Sbjct: 894 IDLDMSLQG--------LPQRVEC-----NSKLAVALSVMDECFLPIVDRRSGINIIQNV 940
Query: 736 VY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
+Y G N + + G Y+ IL ++SA +R G ++AE+P + T Y+ +G +
Sbjct: 941 LYNCGSNFNRLNYSGFYAAILERGDEIISAASIRFHGTQLAEMPFIGTRHVYRRQGMCRR 1000
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 853
LFS IE LCSL V+ L++PA + WT FGF +S + L + + + +F G M
Sbjct: 1001 LFSAIESALCSLKVQKLIIPAISELTHTWTGVFGFTTLS-DSLKQELKSMNMLVFPGIDM 1059
Query: 854 LEKKV 858
L+K++
Sbjct: 1060 LQKQL 1064
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 48/83 (57%), Gaps = 11/83 (13%)
Query: 212 PTNVKKLLSTGILD-GAC-----VKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSA 264
P K+ L + ++D GA V+Y++ R + L+G V G CGC C SK+++
Sbjct: 649 PYAGKRTLLSWLIDCGAVQLSQKVRYMNRRRTKVMLEGWVTRDGIHCGC--C--SKILTV 704
Query: 265 HEFEQHAGAKTRHPNNHIYLENG 287
+FE HAG+K R P +IYL++G
Sbjct: 705 SKFEIHAGSKLRQPFQNIYLDSG 727
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 182 bits (463), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 131/451 (29%), Positives = 208/451 (46%), Gaps = 68/451 (15%)
Query: 426 LPDGERLTYIVKGQR--LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
L + ++++Y+ K R L G + GIVC CC K S S FEAH+G + RR I+
Sbjct: 27 LHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRA-CASIFN 85
Query: 484 SNGMTLHDIAISL------AMGQRRTTGGS-----DDMCHVCGDGENLLLCNGCPLAFHA 532
+ G +L D+ + + + + DD C VCGDG L+ C+ CP +H
Sbjct: 86 NKGESLLDLQVQAWELLDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHL 145
Query: 533 ACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIR 592
+CL +PE W CP+CR C IC S
Sbjct: 146 SCLLLKELPEGEWFCPSCR--------------------------CAICGGS-------- 171
Query: 593 LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
++ ++F++ TV+ CDQCE+E+HV CL G+ + P D WFC D C++I
Sbjct: 172 ---EYNADGSSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFQG 228
Query: 653 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 712
L+ V LS + E + +MN + Q E + K L+ A
Sbjct: 229 LRKLVGISNNI--GEGLSWTLLRSGEDDLPSANSMN--------REQMAEHRSK--LAVA 276
Query: 713 TAIFRECFDPII-AECGRDLIPVMVYGRN---ISGQEFGGMYSVILTVKSVVVSAGLLRI 768
+ +ECF P++ DL+ ++Y R ++ F G Y+V+L V+S +RI
Sbjct: 277 LGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRI 336
Query: 769 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 828
G +AE+PL+ T ++ +G + L IE LL L + + VLPA + W FGF
Sbjct: 337 HGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGF 396
Query: 829 RKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
++M+ + L+ + + F G ++L+K +Q
Sbjct: 397 QEMAPTQRLELVK-LSVVSFPGVTLLQKPLQ 426
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 124/428 (28%), Positives = 196/428 (45%), Gaps = 72/428 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS----------LAM 498
+GI C CC+K ++ S+FE HAG + RQP+++IY +G +L D I L
Sbjct: 17 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLESGASLLDCQIEAWNRQEPVKRLGF 75
Query: 499 GQRRTTGG--SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
G +DD C +CGDG +L+ C+GCP FH +CLD ++P W CPNC
Sbjct: 76 QAVDVDGNDPNDDTCGLCGDGGDLICCDGCPSTFHQSCLDIKMLPPGDWHCPNC------ 129
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV---IYCD 613
C C ++ +NF DD TV + C
Sbjct: 130 --------------------SCKFCGVASDKNFQ-------------RDDTTVSKLLTCS 156
Query: 614 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 673
C K++H C+++ + FC C + LQ ++ + + S S I+
Sbjct: 157 LCVKKYHKSCMQEINTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKHELEAGFSWSLIH 216
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLI 732
R + G + ++ +C S L+ + ++ ECF PI+ G +LI
Sbjct: 217 RTDADSDTSLQG--------LPQRVEC-----NSKLAVSLSVMDECFLPIVDRRSGINLI 263
Query: 733 PVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
++Y G N + FGG Y++IL ++SA +R G +AE+P + T Y+ +G
Sbjct: 264 QNVLYNCGSNFNRLNFGGFYALILERGDEIISAASIRFHGTRLAEMPFIGTRHMYRRQGM 323
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
+ LF IE LCSL VE L++PA + WT+ FGF + E L + + + +F G
Sbjct: 324 CRRLFYAIESTLCSLKVEKLIIPAISELMHTWTEVFGFTTLD-ESLKQELKSMNMLVFPG 382
Query: 851 TSMLEKKV 858
ML+K++
Sbjct: 383 IDMLQKQL 390
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPI 290
L+G V G CGC SK+++ +FE HAG+K R P +IYLE+G +
Sbjct: 10 LEGWVTRDGIHCGCC----SKILTVSKFEIHAGSKLRQPFQNIYLESGASL 56
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 162/568 (28%), Positives = 230/568 (40%), Gaps = 154/568 (27%)
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL------ 255
M K+V+ E P + K LL +G+LDG V+Y+ L GI+ GG LC C
Sbjct: 577 MVKEVLLKEAPASAKLLLQSGLLDGHHVRYLGRGGHIMLTGIIQEGGVLCDCSSCKGVQV 636
Query: 256 ---------------------------------CNFS------------KVVSAHEFEQH 270
C F KVV+ FE+H
Sbjct: 637 TCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFEKH 696
Query: 271 AGAKTRHPNNHIYLENGKPIYSIIQ------ELKTAPLGILEEVVKKVAGS-----SFNE 319
AG+ RHP++ I+LENGK + I++ + K + +L+ + +V G S +
Sbjct: 697 AGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNVMDVLKSAIGEVGGEKVQIISLDH 756
Query: 320 GSFRVWKASHHLRKGLVEHDEKHNMKL---PSLPHSIISCSSLALEESISP--TSCSFVQ 374
+ A L + + D K + + P +P + + S P T
Sbjct: 757 PLIAIQPAEKKLPQPRLVLDTKPRVPVDLKPRMPQVDMKPRVMLDTRSRMPLDTKAKSTS 816
Query: 375 DTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDN---DLHRLLFLPNGLPDGER 431
D ++ + R + +K A R++ +LH+ LFLP GL D
Sbjct: 817 DVKARGGDVRATIPRLDRTTR-----EKEAASPPVPSRESSGANLHKALFLPGGLEDDIE 871
Query: 432 LTYIVKGQRLRFGCKQGNGIVCDCCN------------------------KEISPSQFEA 467
+ Y VKGQ+ G K+G GI+C CC + IS S FE
Sbjct: 872 VGYYVKGQKFLAGLKRGAGILCSCCQQVARNGMSSILMDSVVCGLTYEGERMISCSLFEQ 931
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSD-------DMCHVCGDGENL 520
HAG +RR PY IY ++G +LHD A SL + Q G+ D C CGD +L
Sbjct: 932 HAGWGSRRNPYTSIYLADGRSLHDAAQSLVVEQTVKQEGNTPAKIEHLDQCVECGDRGDL 991
Query: 521 LLCNGCPLAFHAACLDPLLIPESG-WRCPNC---RQGHSSSMSRSV---DLKGG------ 567
LC CP A+H CL + SG + CP+C R G + RS+ KG
Sbjct: 992 QLCTRCPNAYHQDCLGKVDSYSSGEFFCPDCQEQRYGGTKDRRRSMVNRRSKGAAKTLLS 1051
Query: 568 -----------LEAPGAEV-GGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQ- 614
L+ P A V GGCV C+S DF+ F +T + CDQ
Sbjct: 1052 KDRVTGRCTRLLQVPEAVVLGGCV-------------FCKSGDFAKTGFGPKTTLLCDQV 1098
Query: 615 ---------CEKEFHVGCLRKNGLCDLK 633
CE+E+HVGCL+K+GL DLK
Sbjct: 1099 SVGDMKVKGCEREYHVGCLKKHGLEDLK 1126
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 47/70 (67%)
Query: 792 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 851
+AL IERLL L VE L LPAAE AE IW +FGFR+M+ E++ ++ D + +F G+
Sbjct: 1480 KALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMVFTGS 1539
Query: 852 SMLEKKVQCL 861
SMLEK + L
Sbjct: 1540 SMLEKDIPPL 1549
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 632 LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI---NRKHIEKG-ILFDGTM 687
+E+P+ +WFC DC IH+ L VSN + + S +S + N+ +E + +
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 688 NDVQWQML 695
+ +WQ+L
Sbjct: 1262 SGFEWQLL 1269
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 179 bits (455), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 124/444 (27%), Positives = 199/444 (44%), Gaps = 71/444 (15%)
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
+++ Y + + + G +GI C CC+K ++ S+FE HAG + QPY++IY +G++L
Sbjct: 621 QKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQNIYLESGVSL 679
Query: 490 HDIAISLAMGQRRT----------TGG--SDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
I Q GG +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 680 LQCQIDAWNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 739
Query: 538 LLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSP--SENFDIRLCR 595
++P WRC NC C C ++ SE D +C
Sbjct: 740 QMLPPGEWRCMNCT--------------------------CKFCGIASGTSEKDDASVCV 773
Query: 596 SHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD 655
H C+ CEK++H C ++ FC +C + L+
Sbjct: 774 LH-------------ICNLCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSEHLKK 820
Query: 656 FVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAI 715
++ + + S S I+R + G + ++ +C S L+ +
Sbjct: 821 YLGTKHELESGFSWSLIHRTDDDSEAACRG--------ISQRVEC-----NSKLAITLTV 867
Query: 716 FRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGRE 772
ECF P+I G +LI ++Y G N S + G Y+ IL +++A +R G +
Sbjct: 868 MDECFLPVIDRRSGINLIRNVLYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTQ 927
Query: 773 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 832
+AE+P + T Y+ +G + LFS IE LCSL VE LV+PA + + WT FGF +
Sbjct: 928 IAEMPFIGTRHIYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAEVTNTWTTVFGFTHLD 987
Query: 833 RERLLKYQRDFQLTIFKGTSMLEK 856
+ L + + + +F G ML+K
Sbjct: 988 KS-LRQEMKSLNMMVFPGIDMLQK 1010
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 9/81 (11%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKVVSAHE 266
P K+ + ++D V+ + R+ L+G + G CGC C SK+++ +
Sbjct: 600 PYTGKRTVLAWLIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGC--C--SKILTVSK 655
Query: 267 FEQHAGAKTRHPNNHIYLENG 287
FE HAG+K P +IYLE+G
Sbjct: 656 FELHAGSKLPQPYQNIYLESG 676
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 136/505 (26%), Positives = 220/505 (43%), Gaps = 76/505 (15%)
Query: 396 RPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQR--LRFGCKQGNGIVC 453
R + + ++R L +L + + +++TY R + G G GI C
Sbjct: 372 RVVLRPTNSTASACQRRSRTLLSVLIDKDIVVPRDKVTYRAARDRPAAKDGFITGEGIRC 431
Query: 454 DCCNKEISPSQFEAHAGM---AARRQPYRHIYTSNGMTL---------HDIAISLA---- 497
CCNK ++ ++F AHA + RR+ + ++ +G +L D+A+ A
Sbjct: 432 TCCNKTLTVAEFAAHATARRGSDRREAWARVFLKDGRSLSQCLVELMRRDVAVVAARNGD 491
Query: 498 --MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+ ++ + D +C +C DG +LLLC+ CP AFH AC+ PE W CP+CR G
Sbjct: 492 VRVKEKCSDPEGDSVCSICNDGGDLLLCDNCPSAFHHACVGLQATPEGDWFCPSCRCG-- 549
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
+ GG + GG F D+T+IYCDQC
Sbjct: 550 --------VCGGSDFDATAAGG------------------------GGFTDKTIIYCDQC 577
Query: 616 EKEFHVGCLRKNG----------LCDLKEIPKD-KWFCCDDCNRIHAALQDFVS-NRAQT 663
E+E+HVGC+R+ G C E ++ W C +C + LQ + R ++
Sbjct: 578 EREYHVGCVRRRGSEEEEESAAEWCRRPEEQEEWPWLCSPECGEVFRHLQGLAAVARERS 637
Query: 664 IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC------FEEKEKSLLSSATAIFR 717
IP + + + IL + + + C + E L SA +
Sbjct: 638 IPIPTTVPTTVEGVSLSILRRRRRRPIS-MVATGSGCQEEEEEEDAAEHGQLCSALDVLH 696
Query: 718 ECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 774
ECF +I DL +V+ R + F G Y V L +++ G LR+ G EVA
Sbjct: 697 ECFVTLIEPRTQTDLTADIVFNRESELRRLNFRGYYVVGLEKAGELITVGTLRVLGTEVA 756
Query: 775 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 834
ELPLV T ++ +G L + +E++L + V LVLPA + +WT GF M+R
Sbjct: 757 ELPLVGTRFAHRRQGMCHLLVTELEKVLRQVGVRRLVLPAVPELLPMWTASLGFHPMTRS 816
Query: 835 RLLKYQRDFQLTIFKGTSMLEKKVQ 859
+++ + + F+GT+M K +
Sbjct: 817 DVMEIAAEHAILSFQGTTMCHKSLH 841
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 133/433 (30%), Positives = 194/433 (44%), Gaps = 84/433 (19%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDI-AISLAMGQRRTTG 505
+GI+C CC+K ++ S+F+ HAG R P +I+ +G TL + A S RR+
Sbjct: 728 DGIICTCCDKVLTLSEFKFHAGFTVNR-PCLNIFMESGEPFTLCLLQAWSAEYKARRSQN 786
Query: 506 GS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ DD C +CG+G L+ C+ CP FH ACL IP+ W C NC
Sbjct: 787 QAVHADDNDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNC------ 840
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
CR+ + D +HD + C QCE
Sbjct: 841 -----------------------TCRICGNLVIDKDTSDAHD----------SLQCSQCE 867
Query: 617 KEFHVGCLRKNGLCDLKEIP-KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 675
++H CL D +E+ D WFC C +++ LQ V Q S + +
Sbjct: 868 HKYHEKCLEDR---DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGISWTLL--- 921
Query: 676 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECF----DPIIAECGRDL 731
+ I D ++ QW LK A C + L+ A I ECF DP G +
Sbjct: 922 ---RCIHDDQKVHSAQWFALK-AVC-----NTKLAVALTIMEECFVSMFDP---RTGIHM 969
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
IP ++Y G + F G Y+++L K V++S +R+ G VAE+PL+ATC +Y+ +G
Sbjct: 970 IPQVLYNWGSEFARLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQYRRQG 1029
Query: 790 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF---RKMSRERLLKYQRDFQLT 846
+ L S IE++L S VE LV+ A WTK FGF + R+RL K L
Sbjct: 1030 MCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDIERQRLNK----INLM 1085
Query: 847 IFKGTSMLEKKVQ 859
+F GT +L K +
Sbjct: 1086 VFPGTVLLVKSLH 1098
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/228 (40%), Positives = 126/228 (55%), Gaps = 24/228 (10%)
Query: 106 ETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDY 165
ETLS GD + E++ + R NN + S + + P H+ + + F R Y
Sbjct: 317 ETLSFGDCQKETAMGSSVRVSNNYENFSHDPAITKDPLHIEAEENM-----SFECRNPPY 371
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILD 225
+P D +L + + K +KK N +P+NVK LLSTGI D
Sbjct: 372 ASPRV-------------------DTLLVPKIKDTKTAKKGSTNTFPSNVKSLLSTGIFD 412
Query: 226 GACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLE 285
G VKY S SRER L G++ G GYLCGC C +KV++A+EFEQHA KT+HPNNHIY E
Sbjct: 413 GVTVKYYSWSRERNLKGMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFE 472
Query: 286 NGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRK 333
NGK IY ++QELK P L + ++ V GS N +F WKAS+H+ +
Sbjct: 473 NGKTIYGVVQELKNTPQEKLFDAIQNVTGSDINHKNFNTWKASYHVAR 520
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 78/136 (57%), Positives = 104/136 (76%)
Query: 195 ARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCP 254
+++ E +MSKKV PN +P+NVK LLSTG+LDG VKY+S SRE+ L GI+ G GYLC C
Sbjct: 428 SKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCE 487
Query: 255 LCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG 314
CN SK ++A+EFE+HAG KT+HPNNHIY ENGK IY+++QELK P +L + ++ V G
Sbjct: 488 NCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTG 547
Query: 315 SSFNEGSFRVWKASHH 330
S N+ +FR+WKAS+
Sbjct: 548 SPINQKNFRIWKASYQ 563
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG + Y+ + G +G G +C C CN K ++ +FE HAG +
Sbjct: 451 LLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAG-CKTK 509
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 510 HPNNHIYFENGKTIYAVVQEL 530
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 78/136 (57%), Positives = 104/136 (76%)
Query: 195 ARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCP 254
+++ E +MSKKV PN +P+NVK LLSTG+LDG VKY+S SRE+ L GI+ G GYLC C
Sbjct: 407 SKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCE 466
Query: 255 LCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG 314
CN SK ++A+EFE+HAG KT+HPNNHIY ENGK IY+++QELK P +L + ++ V G
Sbjct: 467 NCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTG 526
Query: 315 SSFNEGSFRVWKASHH 330
S N+ +FR+WKAS+
Sbjct: 527 SPINQKNFRIWKASYQ 542
Score = 39.7 bits (91), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG + Y+ + G +G G +C C CN K ++ +FE HAG +
Sbjct: 430 LLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAG-CKTK 488
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 489 HPNNHIYFENGKTIYAVVQEL 509
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 78/138 (56%), Positives = 102/138 (73%)
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCG 252
+ +N + + SKKV PN +P+NVK LLSTG+LDG VKYIS SRE+ L G++ G GYLCG
Sbjct: 336 VTPKNKDGRPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSREKNLKGLIKGAGYLCG 395
Query: 253 CPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKV 312
C CNF+K ++A+EFE+HA KT+HPNNHIY ENGK IY ++QELK P +L E ++ V
Sbjct: 396 CQECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEMLFEAIQTV 455
Query: 313 AGSSFNEGSFRVWKASHH 330
GS N+ +FR WKAS+
Sbjct: 456 TGSPINQKNFRSWKASYQ 473
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 130/227 (57%), Gaps = 25/227 (11%)
Query: 106 ETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDY 165
+TLS GD + ES+ R N+ + S + + P H+ + ++F+F R Y
Sbjct: 301 QTLSFGDYQKESTIGSAVRVINSYENFSHDPAIAKDPPHMEAE---ENMSFEF--RNPPY 355
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILD 225
+P D +L ++ + K +KK N +P+NVK LLSTG+ D
Sbjct: 356 ASPRV-------------------DTLLVPKSKDTKTAKKGSTNTFPSNVKSLLSTGMFD 396
Query: 226 GACVKYISTSRE-RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYL 284
G VKY S SRE R L GI+ G GYLCGC CNF++V++A+EFEQHA KT+HPNNHIY
Sbjct: 397 GVTVKYYSWSREVRNLKGIIKGTGYLCGCGNCNFNRVLNAYEFEQHANCKTKHPNNHIYF 456
Query: 285 ENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHL 331
ENGK IY ++QELK P L + ++ V GS N +F WKAS+H+
Sbjct: 457 ENGKTIYGVVQELKNTPQEKLFDAIQNVTGSDINHKNFNTWKASYHV 503
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 11/141 (7%)
Query: 402 KRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRF--GCKQGNGIVCDC---- 455
K +G T +++ LL G+ DG + Y + +R G +G G +C C
Sbjct: 372 KTAKKGSTNTFPSNVKSLL--STGMFDGVTVKYYSWSREVRNLKGIIKGTGYLCGCGNCN 429
Query: 456 CNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCG 515
N+ ++ +FE HA + P HIY NG T++ + L + D + +V G
Sbjct: 430 FNRVLNAYEFEQHANCKTK-HPNNHIYFENGKTIYGVVQELKNTPQEKL--FDAIQNVTG 486
Query: 516 DGENLLLCNGCPLAFHAACLD 536
N N ++H A L+
Sbjct: 487 SDINHKNFNTWKASYHVASLE 507
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 81/130 (62%), Positives = 102/130 (78%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E K+SKKV PN +P+NV+ LLSTG+LDG VKYI+ SRE +L GI+ G GYLCGC CNF
Sbjct: 450 EQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSRE-ELRGIIKGSGYLCGCQSCNF 508
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QELK+ P L +V++ + GS N
Sbjct: 509 SKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPIN 568
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 569 QKSFRLWKES 578
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
L G+ DG + YI + G +G+G +C C CN K I+ +FE HAG +
Sbjct: 469 LLSTGMLDGVPVKYIAWSREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAG-CKTKH 527
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 528 PNNHIYFENGKTIYGIVQEL 547
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 78/130 (60%), Positives = 101/130 (77%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KMSKK PN +P+NV+ L+STG+LDG VKY+S SRE +L GI+ G GYLCGC CNF
Sbjct: 269 EFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRE-ELHGIIKGSGYLCGCQSCNF 327
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V GS N
Sbjct: 328 NKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTVTGSPIN 387
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 388 QKSFRIWKES 397
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAARRQ 476
+ G+ DG + Y+ + G +G+G +C C NK ++ +FE HAG +
Sbjct: 288 LISTGMLDGVPVKYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTK-H 346
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 347 PNNHIYFENGKTIYQIVQEL 366
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 78/130 (60%), Positives = 101/130 (77%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KMSKK PN +P+NV+ L+STG+LDG VKY+S SRE +L GI+ G GYLCGC CNF
Sbjct: 339 EFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRE-ELHGIIKGSGYLCGCQSCNF 397
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V GS N
Sbjct: 398 NKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTVTGSPIN 457
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 458 QKSFRIWKES 467
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAARRQ 476
+ G+ DG + Y+ + G +G+G +C C NK ++ +FE HAG +
Sbjct: 358 LISTGMLDGVPVKYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTK-H 416
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 417 PNNHIYFENGKTIYQIVQEL 436
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 76/137 (55%), Positives = 103/137 (75%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N E+K SKKV N +P+NVK LLSTG+LDG VKY+S SRE+ L+GI+ G GYLCGC
Sbjct: 435 ASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKTLEGIIKGTGYLCGC 494
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
C +K ++A+EFE+HA KT+HPNNHI+ ENGK IY+++QELK P G+L ++ V
Sbjct: 495 KECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGVLFNAIQTVT 554
Query: 314 GSSFNEGSFRVWKASHH 330
GS N+ +FR+WKAS+
Sbjct: 555 GSHINQKNFRIWKASYQ 571
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 127/433 (29%), Positives = 199/433 (45%), Gaps = 72/433 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA 497
++ G +G+VC CCNK +S S+F+ HAG + P +++ +G A S
Sbjct: 604 VKTGLVTKDGVVCTCCNKTVSLSEFKNHAGFN-QNCPCLNLFMGSGKPFASCQLEAWSAE 662
Query: 498 MGQRRT---------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR +DD C VCGDG L+ C+ CP FH ACL ++PE W C
Sbjct: 663 YKARRNGWRLEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCS 722
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
+C C IC S+N + RS DF
Sbjct: 723 SCT--------------------------CWICSELVSDNAE----RSQDFK-------- 744
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
C QC ++H CL+ G+ +++ + +FC +C +++ L V S
Sbjct: 745 ---CSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLS 799
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AEC 727
S I K DG ++ + ++ KA+C S L+ A +I E F ++
Sbjct: 800 WS------ILKCFQEDGMVHSAR-RLALKAEC-----NSKLAVALSIMEESFLSMVDPRT 847
Query: 728 GRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
G D+IP ++Y G + +F G Y+V++ V++S +R+ G +AE+PLVATC +Y
Sbjct: 848 GIDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKY 907
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
+ +G + L + IE +L SL VE LV+ A WT+ FGF+ M E +R L
Sbjct: 908 RRQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INL 966
Query: 846 TIFKGTSMLEKKV 858
+F GT++L+K +
Sbjct: 967 MVFPGTTLLKKTL 979
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 127/433 (29%), Positives = 199/433 (45%), Gaps = 72/433 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA 497
++ G +G+VC CCNK +S S+F+ HAG + P +++ +G A S
Sbjct: 648 VKTGLVTKDGVVCTCCNKTVSLSEFKNHAGFN-QNCPCLNLFMGSGKPFASCQLEAWSAE 706
Query: 498 MGQRRT---------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR +DD C VCGDG L+ C+ CP FH ACL ++PE W C
Sbjct: 707 YKARRNGWRLEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCS 766
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
+C C IC S+N + RS DF
Sbjct: 767 SCT--------------------------CWICSELVSDNAE----RSQDFK-------- 788
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
C QC ++H CL+ G+ +++ + +FC +C +++ L V S
Sbjct: 789 ---CSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLS 843
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AEC 727
S I K DG ++ + ++ KA+C S L+ A +I E F ++
Sbjct: 844 WS------ILKCFQEDGMVHSAR-RLALKAEC-----NSKLAVALSIMEESFLSMVDPRT 891
Query: 728 GRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
G D+IP ++Y G + +F G Y+V++ V++S +R+ G +AE+PLVATC +Y
Sbjct: 892 GIDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKY 951
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
+ +G + L + IE +L SL VE LV+ A WT+ FGF+ M E +R L
Sbjct: 952 RRQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INL 1010
Query: 846 TIFKGTSMLEKKV 858
+F GT++L+K +
Sbjct: 1011 MVFPGTTLLKKTL 1023
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 77/137 (56%), Positives = 101/137 (73%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
+N E K +KK N +P+NVK LLSTGI DG VKY+S SRE+ L GI+ G GYLC C
Sbjct: 426 IVKNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSC 485
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CN SK ++A+EFE+HAGAKT+HPNNHIY ENGK IY+++QELK P +L + ++ V
Sbjct: 486 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVT 545
Query: 314 GSSFNEGSFRVWKASHH 330
GS+ N+ +FR+WKAS+
Sbjct: 546 GSTINQKNFRIWKASYQ 562
Score = 42.7 bits (99), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG ++ Y+ + G +G G +C C CN K ++ +FE HAG A +
Sbjct: 450 LLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTK 508
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 509 HPNNHIYFENGKTIYAVVQEL 529
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 108/296 (36%), Positives = 164/296 (55%), Gaps = 29/296 (9%)
Query: 209 NEYPTNVKKLLSTGILDGACVKYISTSRERQ-LDGIVNGGGYLCGCPLCNFSKVVSAHEF 267
N+ PTN+++LL+TG+L+G VKYI +R L G++ G LC C C KVVS + F
Sbjct: 257 NKIPTNLRELLATGMLEGQPVKYIMRKGKRAVLRGVIKRIGILCSCSSCKGRKVVSPYYF 316
Query: 268 EQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW-- 325
E HAG+ +HP+++I+LENG ++ +++ A L +LE ++K G + E FR
Sbjct: 317 EVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIFRCKSC 376
Query: 326 KASHH-LRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
K+S LR G + C S LE + + S + GR QT
Sbjct: 377 KSSFSTLRSGKFA----------------LFCDS-CLESKGAKNNISSSK-VGRSQTSSA 418
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG 444
++ + G K ++ G ++D +H+++F+ LP+G + Y V G+RL G
Sbjct: 419 KVYKSASPGAKS-------SSVGRLTRKDKGMHKVVFMSGILPEGTDVGYYVGGKRLLDG 471
Query: 445 CKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
+ GI C CC+ +SPSQFE HAG AARR+PY +IY SNG++LH+++ISL GQ
Sbjct: 472 YIKELGIYCHCCSTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHELSISLLKGQ 527
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 9/90 (10%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIV-KGQRLRF-GCKQGNGIVCDCCN----KEISP 462
T K +L LL G+ +G+ + YI+ KG+R G + GI+C C + K +SP
Sbjct: 256 TNKIPTNLRELLA--TGMLEGQPVKYIMRKGKRAVLRGVIKRIGILCSCSSCKGRKVVSP 313
Query: 463 SQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
FE HAG + ++ P +I+ NG LHD+
Sbjct: 314 YYFEVHAG-STKKHPSDYIFLENGNNLHDV 342
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 139/461 (30%), Positives = 211/461 (45%), Gaps = 72/461 (15%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+K RD LH L+F +GL D LTY +K G+ L+ G K+G I+C+CCN+E SPS FE
Sbjct: 387 SKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEE 446
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY +IYT G++LH +A+ L ++ L NG
Sbjct: 447 HAGMGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPNGFD 485
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSE 587
A ++ D + SG C + S++ V LK L+ E C C +
Sbjct: 486 NASVSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHT- 539
Query: 588 NFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWF 641
+ + T+I+C+QCE+ H+ C +K L LKE +
Sbjct: 540 ------------TIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFL 587
Query: 642 CCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTM-NDVQWQMLKKA 698
CC +C + A L++ +EK GI F + +++ W++L
Sbjct: 588 CCQECQSLRARLEE--------------------GLEKCVGITFLRRIRSNICWRLLSGM 627
Query: 699 QCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVK 757
+ K + IF++ F E D+I MV G+N + +F GMY +LT
Sbjct: 628 DASRDV-KLYMPQVIDIFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALLTAS 685
Query: 758 SVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEK 817
+ VVSA +L++ ++AEL L+AT E + KG F L IE L + NV L P +
Sbjct: 686 THVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPE 745
Query: 818 AESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
IW++K GF +S E L +FK +++K +
Sbjct: 746 MAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 786
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 72/125 (57%), Gaps = 3/125 (2%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF-SKVVSAHEFEQHAG 272
+V+ LLSTG+L+G V Y+ E G +NG GY CGC CN+ S +++A EFE+H G
Sbjct: 114 DVRGLLSTGLLEGFRVTYMKDEVEEV--GRINGQGYSCGCSKCNYNSNIMNACEFEEHYG 171
Query: 273 AKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLR 332
+ +HI+L+ G ++ +++ LK L +L + +++ G N + WKAS R
Sbjct: 172 QSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKASFQKR 231
Query: 333 KGLVE 337
K ++
Sbjct: 232 KDYLD 236
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 125/442 (28%), Positives = 196/442 (44%), Gaps = 67/442 (15%)
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
+++ Y + + + G +GI C CC+K ++ S+FE HAG + QPY++IY +G++L
Sbjct: 620 QKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQNIYLESGVSL 678
Query: 490 HDIAISLAMGQRRT------------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
I Q +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 679 LQCQIDAWNRQEHAEKIGFHSVDIDGNDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 738
Query: 538 LLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSH 597
++P W CPNC C C ++ SE D
Sbjct: 739 QMLPPGEWHCPNC--------------------------TCKFCGIA-SETSD-----KD 766
Query: 598 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 657
D S RT I C EK++H C ++ FC +C + L+ ++
Sbjct: 767 DASVNVL--RTCILC---EKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSEYLKKYL 821
Query: 658 SNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ + S I+R + G + ++ +C S L+ A +
Sbjct: 822 GTKHELEAGFSWCLIHRSDEDSEAACRG--------LTQRVEC-----NSKLAIALTVMD 868
Query: 718 ECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 774
ECF P+I G +LI ++Y G N S + G Y+ IL +++A +R G ++A
Sbjct: 869 ECFLPVIDRRSGINLIRNILYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTKIA 928
Query: 775 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 834
E+P + T Y+ +G + LFS IE LCSL VE LV+PA + WT FGF + E
Sbjct: 929 EMPFIGTRHIYRRQGMCRRLFSAIELALCSLKVEKLVIPAVAELTHTWTTVFGFTYLD-E 987
Query: 835 RLLKYQRDFQLTIFKGTSMLEK 856
L + + + +F G ML+K
Sbjct: 988 SLRQEMKSLNMMVFPGIDMLQK 1009
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
L+G + G CGC C SK+++ +FE HAG+K P +IYLE+G
Sbjct: 632 LEGWITRDGIHCGC--C--SKILTVSKFELHAGSKLPQPYQNIYLESG 675
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 126/229 (55%), Gaps = 25/229 (10%)
Query: 106 ETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDY 165
ETLS GD + E++ + R NN + S + + P H+ + + F R Y
Sbjct: 317 ETLSFGDCQKETAMGSSVRVSNNYENFSHDPAITKDPLHIEAEENM-----SFECRNPPY 371
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILD 225
+P D +L + + K +KK N +P+NVK LLSTGI D
Sbjct: 372 ASPRV-------------------DTLLVPKIKDTKTAKKGSTNTFPSNVKSLLSTGIFD 412
Query: 226 GACVKYISTSRE-RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYL 284
G VKY S SRE R L G++ G GYLCGC C +KV++A+EFEQHA KT+HPNNHIY
Sbjct: 413 GVTVKYYSWSREQRNLKGMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYF 472
Query: 285 ENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRK 333
ENGK IY ++QELK P L + ++ V GS N +F WKAS+H+ +
Sbjct: 473 ENGKTIYGVVQELKNTPQEKLFDAIQNVTGSDINHKNFNTWKASYHVAR 521
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 60/141 (42%), Gaps = 11/141 (7%)
Query: 402 KRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIV--KGQRLRFGCKQGNGIVCDC---- 455
K +G T +++ LL G+ DG + Y + QR G +G G +C C
Sbjct: 388 KTAKKGSTNTFPSNVKSLL--STGIFDGVTVKYYSWSREQRNLKGMIKGTGYLCGCGNCK 445
Query: 456 CNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCG 515
NK ++ +FE HA + P HIY NG T++ + L + D + +V G
Sbjct: 446 LNKVLNAYEFEQHANCKTK-HPNNHIYFENGKTIYGVVQELKNTPQEKL--FDAIQNVTG 502
Query: 516 DGENLLLCNGCPLAFHAACLD 536
N N ++H A L+
Sbjct: 503 SDINHKNFNTWKASYHVARLE 523
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 77/142 (54%), Positives = 104/142 (73%), Gaps = 9/142 (6%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE---------RQLDGIVNGGGY 249
E KMSKK PN +P+NV+ L+STG+LDG VKY+S SRE ++L GI+ G GY
Sbjct: 450 EFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHGIIKGSGY 509
Query: 250 LCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVV 309
LCGC CNF+KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + +
Sbjct: 510 LCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFBAI 569
Query: 310 KKVAGSSFNEGSFRVWKASHHL 331
+ V GS N+ SFR+WK + ++
Sbjct: 570 QTVTGSPINQKSFRIWKVNEYI 591
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 434 YIVKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
YI ++ G +G+G +C C NK ++ +FE HAG + P HIY NG T+
Sbjct: 492 YICAHKQELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTK-HPNNHIYFENGKTI 550
Query: 490 HDIAISL 496
+ I L
Sbjct: 551 YQIVQEL 557
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 197/436 (45%), Gaps = 70/436 (16%)
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG---MTLHDIAI 494
G +++G G+GI+C+CC+ +S S+F++HAG R +++ +G M A
Sbjct: 864 GSVVKYGRITGDGIICNCCSDILSISEFKSHAGFKFNRA-CSNLFLDSGRPFMLCQLQAW 922
Query: 495 SLAMGQRRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
S R++ + DD C +CGDG L+ C+ CP FH +CL +PE W
Sbjct: 923 STEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLSIQELPEGNW 982
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD 605
C NC C IC L + S+++
Sbjct: 983 YCLNCT--------------------------CRICG---------DLVNFEEISSSS-- 1005
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP 665
+ C QCE+++H CL++ + + WFC C +I+AALQ + Q
Sbjct: 1006 --DALKCFQCEQKYHGQCLKQRDI--DSGVESHIWFCSGSCQKIYAALQSQLGLTNQFAN 1061
Query: 666 ASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII- 724
S + + H ++ IL ++ A+C S L A I ECF ++
Sbjct: 1062 GFSWTLLRCIHYDQKILSTA-------RLAMMAEC-----NSRLVVALTIMEECFLSMVD 1109
Query: 725 AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATC 782
G D+IP +VY S +F G Y+VIL V++ +R+ G E+AE+PL+ATC
Sbjct: 1110 PRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATC 1169
Query: 783 REYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD 842
+Y+ +G + L + IE +L S V+ LV+ A WT+ FGF + E +
Sbjct: 1170 SKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHK- 1228
Query: 843 FQLTIFKGTSMLEKKV 858
F L +F GT +L+K +
Sbjct: 1229 FNLMVFPGTVLLKKAL 1244
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 75/130 (57%), Positives = 102/130 (78%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E+K ++K PN +P+NV+ L+STG+LDG VKY+S SRE +L GI+ G GYLCGC CN+
Sbjct: 247 ELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYVSLSRE-ELRGIIKGSGYLCGCQSCNY 305
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L +V++ V G+ N
Sbjct: 306 SKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPIN 365
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 366 QKSFRIWKES 375
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
+ G+ DG + Y+ + G +G+G +C C CN K ++ +FE HAG +
Sbjct: 266 LISTGMLDGVPVKYVSLSREELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTK-H 324
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 325 PNNHIYFENGKTIYQIVQEL 344
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 197/436 (45%), Gaps = 70/436 (16%)
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG---MTLHDIAI 494
G +++G G+GI+C+CC+ +S S+F++HAG R +++ +G M A
Sbjct: 864 GSVVKYGRITGDGIICNCCSDILSISEFKSHAGFKFNRA-CSNLFLDSGRPFMLCQLQAW 922
Query: 495 SLAMGQRRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
S R++ + DD C +CGDG L+ C+ CP FH +CL +PE W
Sbjct: 923 STEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLSIQELPEGNW 982
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD 605
C NC C IC L + S+++
Sbjct: 983 YCLNCT--------------------------CRICG---------DLVNFEEISSSS-- 1005
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP 665
+ C QCE+++H CL++ + + WFC C +I+AALQ + Q
Sbjct: 1006 --DALKCFQCEQKYHGQCLKQRDIN--SGVESHIWFCSGSCQKIYAALQSQLGLTNQFAN 1061
Query: 666 ASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII- 724
S + + H ++ IL ++ A+C S L A I ECF ++
Sbjct: 1062 GFSWTLLRCIHYDQKILSTA-------RLAMMAEC-----NSRLVVALTIMEECFLSMVD 1109
Query: 725 AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATC 782
G D+IP +VY S +F G Y+VIL V++ +R+ G E+AE+PL+ATC
Sbjct: 1110 PRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATC 1169
Query: 783 REYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD 842
+Y+ +G + L + IE +L S V+ LV+ A WT+ FGF + E +
Sbjct: 1170 SKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHK- 1228
Query: 843 FQLTIFKGTSMLEKKV 858
F L +F GT +L+K +
Sbjct: 1229 FNLMVFPGTVLLKKAL 1244
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 76/138 (55%), Positives = 101/138 (73%)
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCG 252
+ +N E K +KK N +P+NVK LLSTGI DG VKY+S SRE+ L GI+ G GYLC
Sbjct: 426 IVVKNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCS 485
Query: 253 CPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKV 312
C CN SK ++A+EFE+HAGAKT+HPNNHIY ENGK IY+++QELK +L + ++ V
Sbjct: 486 CDNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNV 545
Query: 313 AGSSFNEGSFRVWKASHH 330
GS+ N+ +FR+WKAS+
Sbjct: 546 TGSTINQKNFRIWKASYQ 563
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG ++ Y+ + G +G G +C C CN K ++ +FE HAG A +
Sbjct: 451 LLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTK 509
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 510 HPNNHIYFENGKTIYAVVQEL 530
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 136/459 (29%), Positives = 205/459 (44%), Gaps = 67/459 (14%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+K RD LH L+F GL D LTY +K G+ L+ G K G GI+C+CC++E +PS FE
Sbjct: 102 SKTRDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNCCSQEFAPSHFEE 161
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY +IYT G TLH +A+ L ++ L NG
Sbjct: 162 HAGMGRRRQPYHNIYTPEGSTLHKLALQL---------------------QDHLNSNGFD 200
Query: 528 LAFHAACLD-PLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPS 586
A ++ D P L SG C + S++ V LK L+ E C C +
Sbjct: 201 NASVSSFSDYPNLTSASG-----CGRQPSTTSGPIVPLKRTLQGRVVETESCYFCGYGHT 255
Query: 587 ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKW 640
+ D +I+C+QCE+ HV C +K L LKE ++
Sbjct: 256 -------------TIGNIDPDMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRF 302
Query: 641 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC 700
CC +C + L++ + + E+ +++ W++L
Sbjct: 303 LCCQECQLLRDRLEEGL-----------------EKCEEIAFLRRIRSNICWRLLSGMDA 345
Query: 701 FEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSV 759
+ K + IF++ F E D+ MV +N + +F GMY +LT +
Sbjct: 346 SRDV-KLFMPQVIDIFKDAFVESTDE-HSDIFSDMVNCKNGDQEKDFRGMYCALLTASTH 403
Query: 760 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 819
VVSA +L++ ++AEL L+AT RE + KG F L IE L + NV L P +
Sbjct: 404 VVSAAILKVRMEQIAELVLIATRRECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMA 463
Query: 820 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
IW++K GF +S E L +FK +++K +
Sbjct: 464 QIWSEKLGFTILSAEEKESVLESHPLVMFKNLVLVQKSL 502
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 123/436 (28%), Positives = 194/436 (44%), Gaps = 78/436 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GI C CC + S S+FEAHAG + R +I+ +G +L + + + R TG
Sbjct: 593 DGIKCSCCQEVFSLSRFEAHAGSSYHRSA-ANIFLEDGRSLLECQMQII---RDITGKGF 648
Query: 507 ----------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+D +C VC G +L+LC+ CP +FH +CL + W CP+C
Sbjct: 649 TKESFSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSC 708
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
G +C + F + D V
Sbjct: 709 CCG---------------------------------------ICGENKFDGGSEQDNVVF 729
Query: 611 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS 670
C QCE+++HVGCLRK G L P WFC C +I LQ + ++ + +L+
Sbjct: 730 SCYQCERQYHVGCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLG-KSFPVGVDNLT 788
Query: 671 TINRKHIE-KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
K I KG+ D L + E S L+ A + ECF+P+
Sbjct: 789 WTLLKPIRSKGLEID----------LPDIEALTEV-YSKLNIALGVMHECFEPVKEPHTR 837
Query: 729 RDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 786
RD++ +++ G +++ F G Y+V+L ++S +R++G +VAE+PL+ T +Y+
Sbjct: 838 RDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYR 897
Query: 787 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLT 846
G L + +E+ L L VE LVLPA + WT FGF KM+ L++ D+
Sbjct: 898 RLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFL-DYSFL 956
Query: 847 IFKGTSMLEKKVQCLP 862
F+ T M +K + +P
Sbjct: 957 DFQDTVMCQKLLMKIP 972
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 116/430 (26%), Positives = 189/430 (43%), Gaps = 81/430 (18%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GI C CC+K ++ S+FE HAG + RQP+++I+ ++G++L I A +++ G
Sbjct: 615 DGIHCGCCSKILAVSKFEIHAG-SKLRQPFQNIFLNSGVSLLQCQID-AWDKQKGAGNIG 672
Query: 507 -----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+DD C +CGDG +L+ C+GCP FH CLD + P W CPNC
Sbjct: 673 FCSVDVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRMFPLGDWHCPNCT---- 728
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIY---- 611
C C+ A +D T
Sbjct: 729 ----------------------CKFCK-------------------AVIEDVTQTVGANT 747
Query: 612 CDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLST 671
C CEK++H C+ K + FC C + ++ +V + + S S
Sbjct: 748 CKMCEKKYHKSCMPKANVTPADTTEPITSFCGKKCKALSEGVKKYVGVKHELEAGFSWSL 807
Query: 672 INRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRD 730
++R+ + G + V+ S L+ A + ECF PII G +
Sbjct: 808 VHRECTNSDLSLSGHPHIVE-------------NNSKLALALTVMDECFLPIIDRRSGVN 854
Query: 731 LIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
++ ++Y G N + FGG Y+ +L +V++ +R G +AE+P + T Y+ +
Sbjct: 855 IVQNVLYNCGSNFNRLNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQ 914
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 848
G + LFS +E L L V+ L++PA +W KFGFR++ + L K R L F
Sbjct: 915 GMCRRLFSVVESALQHLKVKLLIIPATADFSHVWISKFGFRQVE-DSLKKEMRSMNLLTF 973
Query: 849 KGTSMLEKKV 858
G +L+K++
Sbjct: 974 PGIDVLQKEL 983
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 6/84 (7%)
Query: 206 VVPNEYPTNVKKLLSTGILD-GACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVS 263
V +E T + L+ +G L V Y++ R R L+G + G CGC C SK+++
Sbjct: 572 VPSSEKRTVLAWLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGC--C--SKILA 627
Query: 264 AHEFEQHAGAKTRHPNNHIYLENG 287
+FE HAG+K R P +I+L +G
Sbjct: 628 VSKFEIHAGSKLRQPFQNIFLNSG 651
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 197/428 (46%), Gaps = 77/428 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GIVC CC + S F+ HAG R P R+++ +G + + + + G
Sbjct: 897 DGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKVRKGGI 955
Query: 507 ----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG L+ C+ CP FH ACL +PE W CPNC
Sbjct: 956 KNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT----- 1010
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C IC L + + S++ + C QCE
Sbjct: 1011 ---------------------CRICG---------DLVKDREASSSFL----ALKCSQCE 1036
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD---FVSNRAQTIPASSLSTIN 673
++H+ CL++ C +KE+ D FC ++C I++ LQ FV++ A + L I+
Sbjct: 1037 HKYHMPCLKEK--C-VKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIH 1093
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLI 732
D ++ Q ++ KA+C S L+ A I ECF ++ G D+I
Sbjct: 1094 D---------DQKVHSSQ-KLALKAEC-----NSKLAVALTIMEECFLSMVDPRTGIDMI 1138
Query: 733 PVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
P ++Y R + + F G Y+V+L +VS +R+ G VAE+PL+AT +++ KG
Sbjct: 1139 PHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGM 1198
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
+ L + IE++L S+ VE +V+ A WT FGF+ + + ++ L +F G
Sbjct: 1199 CRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-INLMVFPG 1257
Query: 851 TSMLEKKV 858
T +L+K +
Sbjct: 1258 TILLKKSL 1265
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 197/428 (46%), Gaps = 77/428 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GIVC CC + S F+ HAG R P R+++ +G + + + + G
Sbjct: 961 DGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKVRKGGI 1019
Query: 507 ----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG L+ C+ CP FH ACL +PE W CPNC
Sbjct: 1020 KNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT----- 1074
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C IC L + + S++ + C QCE
Sbjct: 1075 ---------------------CRICG---------DLVKDREASSSFL----ALKCSQCE 1100
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD---FVSNRAQTIPASSLSTIN 673
++H+ CL++ C +KE+ D FC ++C I++ LQ FV++ A + L I+
Sbjct: 1101 HKYHMPCLKEK--C-VKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIH 1157
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLI 732
D ++ Q ++ KA+C S L+ A I ECF ++ G D+I
Sbjct: 1158 D---------DQKVHSSQ-KLALKAEC-----NSKLAVALTIMEECFLSMVDPRTGIDMI 1202
Query: 733 PVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
P ++Y R + + F G Y+V+L +VS +R+ G VAE+PL+AT +++ KG
Sbjct: 1203 PHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGM 1262
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
+ L + IE++L S+ VE +V+ A WT FGF+ + + ++ L +F G
Sbjct: 1263 CRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-INLMVFPG 1321
Query: 851 TSMLEKKV 858
T +L+K +
Sbjct: 1322 TILLKKSL 1329
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 130/433 (30%), Positives = 189/433 (43%), Gaps = 84/433 (19%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDI-AISLAMGQRRTTG 505
+GI+C CC K ++ S+F+ HAG R P +I+ +G TL + A S R++
Sbjct: 728 DGIICICCGKVLTLSEFKFHAGFTLNR-PCLNIFMESGEPFTLCLLQAWSTEYKARKSQN 786
Query: 506 GS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ DD C +CG+G L+ C+ CP FH ACL IP+ W C NC
Sbjct: 787 QAVHADENDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----- 841
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
R+C + T D + C QCE
Sbjct: 842 ----------------------------------CRICGNLVIDKDTLDAHDSLQCSQCE 867
Query: 617 KEFHVGCLRKNGLCDLKE-IPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 675
++H CL D +E D WFC C +++ LQ V Q S + +
Sbjct: 868 HKYHEKCLEDR---DKQEGAILDTWFCGQSCQEVYSGLQSQVGLVNQVADGISWTLL--- 921
Query: 676 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECF----DPIIAECGRDL 731
+ I D ++ QW LK A C + L+ A I ECF DP G L
Sbjct: 922 ---RCIHDDQKVHSAQWFALK-AVC-----NTKLAVALTIMEECFVSMFDP---RTGIHL 969
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
IP ++Y G + F G Y+++L V++S +R+ G VAE+PL+ATC +Y+ +G
Sbjct: 970 IPQVLYNWGSEFARLNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYRRQG 1029
Query: 790 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF---RKMSRERLLKYQRDFQLT 846
+ L + IE++L S VE LV+ A WTK FGF + R+RL K L
Sbjct: 1030 MCRLLVTAIEQVLISFKVEKLVISAIPDLVETWTKGFGFIPVDDIERQRLNK----INLM 1085
Query: 847 IFKGTSMLEKKVQ 859
+F GT +L K +
Sbjct: 1086 VFPGTVLLVKSLH 1098
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 189/440 (42%), Gaps = 87/440 (19%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDIAISLAM 498
++ G + GI+C CCN +S + F+ HAG R P +++ +G TL + A
Sbjct: 787 IKDGLIKKEGIMCKCCNMVLSVTNFKNHAGFKQSR-PCLNVFMKSGKPFTLCQLQAWSAE 845
Query: 499 GQRRTT-----------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRC 547
+ R + +DD C +CGDG L+ C+ CP FH ACL +PE W C
Sbjct: 846 YKTRKSRTIKVVRTADDDENDDSCGLCGDGGELICCDNCPSTFHQACLSTEELPEGSWYC 905
Query: 548 PNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDR 607
PNC C IC ++ DI +
Sbjct: 906 PNCT--------------------------CWICGELVNDKEDI-------------NSS 926
Query: 608 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPAS 667
C QCE ++H C + + K D WFC C ++ LQ S
Sbjct: 927 NAFKCSQCEHKYHDSCWKNKTIG--KGGASDTWFCGGSCQAVYFGLQ------------S 972
Query: 668 SLSTINRKHIEKGILF--------DGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFREC 719
+ IN HI G+ + D ++ Q ++ KA+C S L+ A I EC
Sbjct: 973 RVGIIN--HIADGVCWTLLKCIHEDQKVHSAQ-RLALKAEC-----NSKLAVALTIMEEC 1024
Query: 720 FDPII-AECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAEL 776
F ++ G D+IP ++Y + F G Y+V+L V++S +RI G VAE+
Sbjct: 1025 FQSMVDPRTGIDMIPHVLYNWRSEFARLNFHGFYTVVLEKDDVLLSVASIRIHGATVAEM 1084
Query: 777 PLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERL 836
PL+ATC Y+ +G + L + IE +L S VE LV+ A WT+ FGF MS +
Sbjct: 1085 PLIATCSNYRRQGMCRRLMTAIEEMLISFKVEKLVVSAIPDLVETWTEGFGFTPMSNDEK 1144
Query: 837 LKYQRDFQLTIFKGTSMLEK 856
+ L +F GT +L+K
Sbjct: 1145 QSLNK-INLMVFPGTILLKK 1163
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 197/428 (46%), Gaps = 77/428 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GIVC CC + S F+ HAG R P R+++ +G + + + + G
Sbjct: 800 DGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKVRKGGI 858
Query: 507 ----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG L+ C+ CP FH ACL +PE W CPNC
Sbjct: 859 KNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT----- 913
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C IC L + + S++ + C QCE
Sbjct: 914 ---------------------CRICG---------DLVKDREASSSFL----ALKCSQCE 939
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD---FVSNRAQTIPASSLSTIN 673
++H+ CL++ C +KE+ D FC ++C I++ LQ FV++ A + L I+
Sbjct: 940 HKYHMPCLKEK--C-VKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIH 996
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLI 732
D ++ Q ++ KA+C S L+ A I ECF ++ G D+I
Sbjct: 997 D---------DQKVHSSQ-KLALKAEC-----NSKLAVALTIMEECFLSMVDPRTGIDMI 1041
Query: 733 PVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
P ++Y R + + F G Y+V+L +VS +R+ G VAE+PL+AT +++ KG
Sbjct: 1042 PHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGM 1101
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
+ L + IE++L S+ VE +V+ A WT FGF+ + + ++ L +F G
Sbjct: 1102 CRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-INLMVFPG 1160
Query: 851 TSMLEKKV 858
T +L+K +
Sbjct: 1161 TILLKKSL 1168
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 123/433 (28%), Positives = 192/433 (44%), Gaps = 70/433 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDIAISLAM 498
++ G +GI+C CCN +S ++F++HAG R P +++ +G TL + A
Sbjct: 141 IKDGVVTKDGIMCKCCNMVLSVTKFKSHAGFKLNR-PCSNLFMESGKPFTLCQLQAWSAE 199
Query: 499 GQRRTTGG----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+ R +G +DD C +CGDG L+ C+ CP FH ACL +PE W CP
Sbjct: 200 YKSRKSGTQVVRADEDDKNDDSCGLCGDGGELICCDNCPSTFHQACLCTEDLPEGSWYCP 259
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
NC C IC D+ + S +
Sbjct: 260 NCT--------------------------CWICG-------DLVNDKEASSSVGAYK--- 283
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
C QCE ++H C + G + + D WFC C +++ L V
Sbjct: 284 ---CLQCEHKYHGAC--QQGKQTHEGLVSDAWFCSGSCQEVYSGLHSRVGINNPIADGFC 338
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AEC 727
+ + H ++ +L ++ KA+C S L+ A I ECF ++
Sbjct: 339 WTLLRCIHEDQKVL-------SAQRLALKAEC-----NSKLAVALTIMEECFQSMVDPRT 386
Query: 728 GRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
G D+IP +Y G + + F G Y+V+L V+VSA +R+ G VAE+PL+ATC Y
Sbjct: 387 GIDMIPHALYNWGSDFARLNFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIATCSNY 446
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
+ +G + L + IE +L S VE LV+ A WTK FGF +S++ +
Sbjct: 447 RRQGMCRHLMTAIEEMLISYKVEKLVISAIPDLVETWTKGFGFIPVSKDEKQSLNK-INF 505
Query: 846 TIFKGTSMLEKKV 858
+F GT +L+K++
Sbjct: 506 MVFPGTILLKKQL 518
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 12/105 (11%)
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQL-------DGIVN 245
L RN+ K+ K V ++ + + + ++D + + R L DG+V
Sbjct: 89 LLPRNLG-KLGKHYVGGKWSRMGSRTVLSWLIDAGVLSVKDVVQYRNLKDDFVIKDGVVT 147
Query: 246 GGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPI 290
G +C C CN V+S +F+ HAG K P +++++E+GKP
Sbjct: 148 KDGIMCKC--CNM--VLSVTKFKSHAGFKLNRPCSNLFMESGKPF 188
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 125/423 (29%), Positives = 183/423 (43%), Gaps = 64/423 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------ 502
+GI C CC++ I+ S+FE HAG QP +IY G +L + Q
Sbjct: 597 DGIHCSCCDEVITISKFEMHAGSRVG-QPLENIYVHTGSSLLQCLLESWNKQNEPQCKGY 655
Query: 503 ------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+ CP FH +CLD P W C C
Sbjct: 656 NFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------ 709
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
+V G V L P ++ H +AA + CD CE
Sbjct: 710 ---------------SCKVCGQVTIGLHPMDD--------HHEAAAD----VLCKCDLCE 742
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
+++H C++ N + + FC C +H LQ + R S + I R
Sbjct: 743 EKYHPICVQMNNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRRSD 800
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVM 735
++ + N+V +K +C E L+ A + ECF P+I G +LI +
Sbjct: 801 VDSDVSL---CNEVA----QKIKCNSE-----LAVALFVMDECFLPVIDHRSGINLIHNI 848
Query: 736 VY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
+Y G N + F G Y+ IL V+ A LRI G E+AE+P + T Y+ +G +
Sbjct: 849 LYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRR 908
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 853
S IE +L SLNVE LV+PA + W FGF+ + E + R L +F G M
Sbjct: 909 FLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLD-ETTKQRMRKMSLLVFPGVEM 967
Query: 854 LEK 856
L+K
Sbjct: 968 LQK 970
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 78/139 (56%), Positives = 104/139 (74%), Gaps = 2/139 (1%)
Query: 191 GVLCARNME-IKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGY 249
G+ A+ E +K KKV N +P+NV+ LLSTG+LDG VKYI+ SRE +L G++ G GY
Sbjct: 260 GIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKYIAWSRE-ELRGVIKGSGY 318
Query: 250 LCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVV 309
LCGC CNFSKV++A+EFE+HA KT+HPNNHIY ENGK +Y I+QEL++ P +L EV+
Sbjct: 319 LCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNMLFEVI 378
Query: 310 KKVAGSSFNEGSFRVWKAS 328
+ + GS N+ SFR+WK S
Sbjct: 379 QTITGSPINQKSFRLWKES 397
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
L G+ DG + YI + G +G+G +C C CN K I+ +FE HA +
Sbjct: 288 LLSTGMLDGVPVKYIAWSREELRGVIKGSGYLCGCQTCNFSKVINAYEFERHAD-CKTKH 346
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 347 PNNHIYFENGKTVYGIVQEL 366
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 125/423 (29%), Positives = 183/423 (43%), Gaps = 64/423 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------ 502
+GI C CC++ I+ S+FE HAG QP +IY G +L + Q
Sbjct: 570 DGIHCSCCDEVITISKFEMHAGSRVG-QPLENIYVHTGSSLLQCLLESWNKQNEPQCKGY 628
Query: 503 ------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+ CP FH +CLD P W C C
Sbjct: 629 NFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------ 682
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
+V G V L P ++ H +AA + CD CE
Sbjct: 683 ---------------SCKVCGQVTIGLHPMDD--------HHEAAAD----VLCKCDLCE 715
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
+++H C++ N + + FC C +H LQ + R S + I R
Sbjct: 716 EKYHPICVQMNNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRRSD 773
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVM 735
++ + N+V +K +C E L+ A + ECF P+I G +LI +
Sbjct: 774 VDSDVSL---CNEVA----QKIKCNSE-----LAVALFVMDECFLPVIDHRSGINLIHNI 821
Query: 736 VY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
+Y G N + F G Y+ IL V+ A LRI G E+AE+P + T Y+ +G +
Sbjct: 822 LYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRR 881
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 853
S IE +L SLNVE LV+PA + W FGF+ + E + R L +F G M
Sbjct: 882 FLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLD-ETTKQRMRKMSLLVFPGVEM 940
Query: 854 LEK 856
L+K
Sbjct: 941 LQK 943
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 74/131 (56%), Positives = 100/131 (76%), Gaps = 1/131 (0%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCN 257
E+K +KK PN +P+NV+ L+STGILDG VKY+S SRE +L GI+ G GYLCGC CN
Sbjct: 315 QELKTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSRE-ELRGIIKGSGYLCGCQSCN 373
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V G+
Sbjct: 374 YTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTVFGAPI 433
Query: 318 NEGSFRVWKAS 328
N+ +FR WK S
Sbjct: 434 NQKAFRNWKES 444
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + Y+ V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 335 LISTGILDGVPVKYVSVSREELR-GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTK- 392
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 393 HPNNHIYFENGKTIYQIVQEL 413
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 73/129 (56%), Positives = 100/129 (77%), Gaps = 1/129 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E+K +K PN +P+NV+ L+STG+LDG VKYI+ SRE +L G++ G GYLC C CN+
Sbjct: 284 EVKTGRKEAPNSFPSNVRSLISTGMLDGVPVKYIALSRE-ELRGVIKGSGYLCSCQSCNY 342
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L +V++ V G+ N
Sbjct: 343 SKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPIN 402
Query: 319 EGSFRVWKA 327
+ SFR+WKA
Sbjct: 403 QKSFRIWKA 411
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
+ G+ DG + YI + G +G+G +C C CN K ++ +FE HAG +
Sbjct: 303 LISTGMLDGVPVKYIALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKT-KH 361
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 362 PNNHIYFENGKTIYQIVQEL 381
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/451 (26%), Positives = 198/451 (43%), Gaps = 85/451 (18%)
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
+++ Y + + + G +GI C CC+K ++ S+FE HAG + QPY++IY +G++L
Sbjct: 636 QKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQNIYLESGVSL 694
Query: 490 ------------HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
H I +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 695 LQCQIEAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 754
Query: 538 LLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSH 597
++P W CPNC C C ++
Sbjct: 755 QMLPLGEWHCPNCT--------------------------CKFCGIA------------- 775
Query: 598 DFSAATFDDRTVIY----CDQCEKEFHVGCLRKNGLCDLKEIPKD-----KWFCCDDCNR 648
S + D +Y C+ CEK++H C + ++ +P + FC +C
Sbjct: 776 --SGNSEKDDASVYVLQICNLCEKKYHDSCTK-----EMDNLPNNINTSSLSFCGKECKE 828
Query: 649 IHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSL 708
+ L+ ++ + + S S I+R D + ++ +C S
Sbjct: 829 LSEHLKKYLGTKHELEAGFSWSLIHR--------IDEDSEAACRGISQRVEC-----NSK 875
Query: 709 LSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGL 765
L+ A + ECF P+I G +LI ++Y G N S + G Y+ L ++++
Sbjct: 876 LAIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLNYSGFYTATLERGDEIIASAS 935
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
+R G ++AE+P + T Y+ +G + LFS IE LCSL VE LV+PA + + WT
Sbjct: 936 IRFHGTQIAEMPFIGTRHMYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAELTNTWTTV 995
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
FGF + E L + + + +F G ML K
Sbjct: 996 FGFTHLD-ESLRQEMKSLNMMVFPGIDMLMK 1025
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 9/81 (11%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKVVSAHE 266
P K+ + + ++D V+ + R+ L+G + G CGC C SK+++ +
Sbjct: 615 PYMGKRTVLSWLIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGC--C--SKILTVSK 670
Query: 267 FEQHAGAKTRHPNNHIYLENG 287
FE HAG+K P +IYLE+G
Sbjct: 671 FELHAGSKLPQPYQNIYLESG 691
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 74/136 (54%), Positives = 101/136 (74%), Gaps = 1/136 (0%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N E+K SKK+ PN +P+NVK LLSTG+LDG VKY+S SRE+ L G + G GYLC C
Sbjct: 406 ASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREKTLRGTIKGTGYLCSC 465
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
+C +KV++A+EFE+HA KT+HPNNHIY ENGK IY+++QELK P +L ++ V
Sbjct: 466 KVCG-NKVLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFNAIETVT 524
Query: 314 GSSFNEGSFRVWKASH 329
GS+ N+ +F WKAS+
Sbjct: 525 GSAINQKNFLSWKASY 540
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYIV--KGQRLRFGCKQGNGIVCDC---CNKEISPSQFEAHAGMAARR 475
L GL DG + Y+ + + LR G +G G +C C NK ++ +FE HA +
Sbjct: 430 LLSTGLLDGVAVKYVSWSREKTLR-GTIKGTGYLCSCKVCGNKVLNAYEFERHAN-CKTK 487
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 488 HPNNHIYFENGKTIYAVVQEL 508
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 122/436 (27%), Positives = 192/436 (44%), Gaps = 81/436 (18%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GI C CC + S S+FEAHAG + R +I+ +G +L + + + R TG
Sbjct: 848 DGIKCSCCQEVFSLSRFEAHAGSSYHRSA-ANIFLEDGRSLLECQMQII---RDITGKGF 903
Query: 507 ----------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+D +C VC G +L+LC+ CP +FH +CL +PE W CP+C
Sbjct: 904 TKESFSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSC 963
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
G +C + F + D V
Sbjct: 964 CCG---------------------------------------ICGENKFDGGSEQDNVVF 984
Query: 611 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS 670
C QCE++ CLRK G L P WFC C +I LQ + ++ + +L+
Sbjct: 985 SCYQCERQC---CLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLG-KSFPVGVDNLT 1040
Query: 671 TINRKHIE-KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
K I KG+ D L + E S L+ A + ECF+P+
Sbjct: 1041 WTLLKPIRSKGLEID----------LPDIEALTEV-YSKLNIALGVMHECFEPVKEPHTR 1089
Query: 729 RDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 786
RD++ +++ G +++ F G Y+V+L ++S +R++G +VAE+PL+ T +Y+
Sbjct: 1090 RDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYR 1149
Query: 787 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLT 846
G L + +E+ L L VE LVLPA + WT FGF KM+ L++ D+
Sbjct: 1150 RLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFL-DYSFL 1208
Query: 847 IFKGTSMLEKKVQCLP 862
F+ T M +K + +P
Sbjct: 1209 DFQDTVMCQKLLMKIP 1224
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 100/130 (76%), Gaps = 2/130 (1%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KM +K PN +P+NV+ L+STG+LDG VKY+S +RE +L GI+ G GYLCGC CNF
Sbjct: 315 EYKM-RKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTRE-ELRGIIKGSGYLCGCQSCNF 372
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SK+++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ + G+ N
Sbjct: 373 SKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPIN 432
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 433 QKSFRIWKES 442
Score = 42.7 bits (99), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + Y+ V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 333 LISTGMLDGVPVKYVSVTREELR-GIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTK- 390
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 391 HPNNHIYFENGKTIYQIVQEL 411
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 100/130 (76%), Gaps = 2/130 (1%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KM +K PN +P+NV+ L+STG+LDG VKY+S +RE +L GI+ G GYLCGC CNF
Sbjct: 321 EYKM-RKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTRE-ELRGIIKGSGYLCGCQSCNF 378
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SK+++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ + G+ N
Sbjct: 379 SKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPIN 438
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 439 QKSFRIWKES 448
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + Y+ V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 339 LISTGMLDGVPVKYVSVTREELR-GIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTK- 396
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 397 HPNNHIYFENGKTIYQIVQEL 417
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/131 (55%), Positives = 99/131 (75%), Gaps = 1/131 (0%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCN 257
E+K +K PN +P+NV+ L+STGILDG VKYIS SRE +L GI+ G GYLCGC CN
Sbjct: 354 QELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSRE-ELRGIIKGSGYLCGCQSCN 412
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V G+
Sbjct: 413 YTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTVFGAPI 472
Query: 318 NEGSFRVWKAS 328
++ +FR WK S
Sbjct: 473 HQKAFRNWKES 483
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + YI V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 374 LISTGILDGVPVKYISVSREELR-GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTK- 431
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 432 HPNNHIYFENGKTIYQIVQEL 452
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 260 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 318
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENGK IY I+QEL+ P IL +V++
Sbjct: 319 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 378
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 379 TVFGSPINQKAFRIWKES 396
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 224 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 280
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 281 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 339
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 340 GCKTK-HPNNHIYFENGKTIYQIVQEL 365
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 270 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 328
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENGK IY I+QEL+ P IL +V++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 389 TVFGSPINQKAFRIWKES 406
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 234 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 290
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 291 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 349
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 350 GCKTK-HPNNHIYFENGKTIYQIVQEL 375
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 99/130 (76%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E+K ++K PN +P+NV+ L+STG+LDG VKYIS SR ++L GI+ G GYLCGC CN+
Sbjct: 310 ELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYISLSR-KELRGIIKGSGYLCGCQSCNY 368
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHI ENGK IY I+QEL+ P +L + ++ V G+ N
Sbjct: 369 SKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPESMLFDAIQTVFGAPIN 428
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 429 QKSFRIWKES 438
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 270 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 328
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENGK IY I+QEL+ P IL +V++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 389 TVFGSPINQKAFRIWKES 406
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 234 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 290
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 291 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 349
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 350 GCKTK-HPNNHIYFENGKTIYQIVQEL 375
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/125 (64%), Positives = 96/125 (76%), Gaps = 1/125 (0%)
Query: 735 MVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
MVYGRN+ GQEFGGMY +L V S VVSA +LRIFG +VAELPLVAT GKG FQ L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSM 853
FSCIERLL +NV++LVLPAAE+AESIWT KFGF +M + L Y+++F Q+ FKGT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 854 LEKKV 858
L K V
Sbjct: 121 LHKLV 125
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 181/409 (44%), Gaps = 84/409 (20%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL-------HDIAISLAMGQR 501
+GIVC+CC + S FEAHAG R +P I +G +L D S +G+
Sbjct: 417 DGIVCNCCRVNFTVSGFEAHAG-CTRHRPSISILLEDGRSLFKCQREARDQKGSHCIGEA 475
Query: 502 RTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS 561
+ +D++C +CG G +L+LC+ CP AFH CL +P+ W CP C
Sbjct: 476 NSEANNDNVCSICGFGGDLVLCDRCPSAFHLGCLGLDRVPDGDWFCPTC----------- 524
Query: 562 VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
C IC P C+ A ++ + C QCE++FH
Sbjct: 525 ---------------CCKIC-YRPK-------CKQE---CADGNENNFLVCVQCEQKFHF 558
Query: 622 GCLRKNGLCDL---KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 678
GC++ I K WFC C + L+ + +
Sbjct: 559 GCVKTTRFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKV--------------- 603
Query: 679 KGILFDGTMNDVQWQMLKKAQCFEE----------KEKSLLSSATAIFRECFDPII-AEC 727
+++ W +LK ++ +EK L++A + E F+P I A
Sbjct: 604 --------ADNINWTLLKNVSSDDDGGDFTSNEFSQEKHKLNAALGVLYEGFNPTIDALS 655
Query: 728 GRDLIPVMVYGRNISGQE--FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
GR+LI +V+ R+ + F G Y+VIL V+S +RIFG++VAE+ VAT ++
Sbjct: 656 GRELIKDLVFSRDSEHKRLNFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQH 715
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 834
+G+G + L +E L L V LVL ++E A + WTK FGF +M+ E
Sbjct: 716 RGRGMCRLLMDELEEQLTRLGVGRLVLHSSEDAINTWTKSFGFARMTSE 764
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 74/138 (53%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 270 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 328
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENG+ IY I+QEL+ P IL +V++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQ 388
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 389 TVFGSPINQKAFRIWKES 406
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 234 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 290
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 291 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 349
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 350 GCKTK-HPNNHIYFENGRTIYQIVQEL 375
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 118/426 (27%), Positives = 182/426 (42%), Gaps = 73/426 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI C CC+K ++ S+FE HAG + RQP+++I +G++L + Q +
Sbjct: 429 DGIHCRCCSKILTVSKFEIHAG-SKLRQPFQNIVLDSGVSLLQCQVDAWNRQEESERSGF 487
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+GCP FH +CL+ ++P W CPNC
Sbjct: 488 HPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCT----- 542
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV---IYCD 613
C C ++ N + DD TV + C
Sbjct: 543 ---------------------CKFCGMADGSNAE--------------DDTTVSELVTCS 567
Query: 614 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 673
CEK++H C++ FC C + LQ F+ + + S S I+
Sbjct: 568 LCEKKYHTSCIQGVDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIH 627
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR-DLI 732
R G V+ S L+ A + ECF I+ +LI
Sbjct: 628 RTDPGSDTSVRGFPQRVE-------------SNSKLAIALTVMDECFLSIVDRRSEINLI 674
Query: 733 PVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
++Y R N + + G Y+ IL ++ A +RI G ++AE+P + T Y+ +G
Sbjct: 675 HNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGM 734
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
+ LF IE LCSL VE L++PA + WT FGF + E + R + +F G
Sbjct: 735 CRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFNPL-EESHKQELRSLNMLVFPG 793
Query: 851 TSMLEK 856
T ML+K
Sbjct: 794 TDMLQK 799
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 121/431 (28%), Positives = 190/431 (44%), Gaps = 76/431 (17%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC S F HAG ++ +P I+ +G +L D + + R
Sbjct: 716 GIRCTCCQNLYGLSGFANHAGGSSNCRPSACIFLKDGRSLLDCMMEVMQDHRTREITEKP 775
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D++C VC G L+LC+ CP A+H CL+ IP+ W CP+CR
Sbjct: 776 HNDLFEGENDNICSVCNYGGELILCDQCPSAYHKNCLNLEGIPDGDWFCPSCR------- 828
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
C IC + E + D + C QCE +
Sbjct: 829 -------------------CGICGQNKIEETE---------------DGHFLTCIQCEHK 854
Query: 619 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 678
+HV CLR NG D WFC ++C R++ LQ+ + + + A +L+ K++
Sbjct: 855 YHVECLR-NGEKDDSRRCMKNWFCGEECERVYTGLQNLLG-KPVLVGADNLTWTLVKYVN 912
Query: 679 KGIL-FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMV 736
G +D+ + K LS A ++ ECF+P+ RD++ ++
Sbjct: 913 SETCGVGGAESDLVVENYSK-----------LSVALSVMHECFEPLHNPFSSRDIVEDVI 961
Query: 737 YGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
+ + ++ F G Y+V+L ++S +RIFG ++AE+PLV T +Y+ G + L
Sbjct: 962 FNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIAEVPLVGTRFQYRRLGMCRVL 1021
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQRDFQLTIFKGT 851
+E+ L L VE LVLPA WT FGF +M+ R + L D+ F+GT
Sbjct: 1022 MDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNFERSQFL----DYSFLDFQGT 1077
Query: 852 SMLEKKVQCLP 862
M +K + P
Sbjct: 1078 VMCQKLLTRFP 1088
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 118/426 (27%), Positives = 182/426 (42%), Gaps = 73/426 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI C CC+K ++ S+FE HAG + RQP+++I +G++L + Q +
Sbjct: 597 DGIHCRCCSKILTVSKFEIHAG-SKLRQPFQNIVLDSGVSLLQCQVDAWNRQEESERSGF 655
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+GCP FH +CL+ ++P W CPNC
Sbjct: 656 HPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCT----- 710
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV---IYCD 613
C C ++ N + DD TV + C
Sbjct: 711 ---------------------CKFCGMADGSNAE--------------DDTTVSELVTCS 735
Query: 614 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 673
CEK++H C++ FC C + LQ F+ + + S S I+
Sbjct: 736 LCEKKYHTSCIQGVDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIH 795
Query: 674 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR-DLI 732
R G V+ S L+ A + ECF I+ +LI
Sbjct: 796 RTDPGSDTSVRGFPQRVE-------------SNSKLAIALTVMDECFLSIVDRRSEINLI 842
Query: 733 PVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 790
++Y R N + + G Y+ IL ++ A +RI G ++AE+P + T Y+ +G
Sbjct: 843 HNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGM 902
Query: 791 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKG 850
+ LF IE LCSL VE L++PA + WT FGF + E + R + +F G
Sbjct: 903 CRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFNPL-EESHKQELRSLNMLVFPG 961
Query: 851 TSMLEK 856
T ML+K
Sbjct: 962 TDMLQK 967
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 124/422 (29%), Positives = 184/422 (43%), Gaps = 69/422 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 556 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 614
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 615 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 667
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ A G + G + +++ C CE+
Sbjct: 668 ---TCKFCDAAVASGGKDGNSI----------------------------SLLSCGMCER 696
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CL ++ FC C + LQ ++ + + S S I+R
Sbjct: 697 RYHQLCLNDEAH-KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD- 754
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
T +D QM AQ E K L+ AI ECF PI+ G DLI ++
Sbjct: 755 --------TDSDTNSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVL 802
Query: 737 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
Y G N + + G Y+ IL ++SA LR G ++AE+P + T Y+ +G + L
Sbjct: 803 YNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRL 862
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
F IE + SL VE LV+PA WT FGF + + + K R +F G ML
Sbjct: 863 FDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDML 921
Query: 855 EK 856
+K
Sbjct: 922 QK 923
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 124/422 (29%), Positives = 184/422 (43%), Gaps = 69/422 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 556 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 614
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 615 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 667
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ A G + G + +++ C CE+
Sbjct: 668 ---TCKFCDAAVASGGKDGNFI----------------------------SLLSCGMCER 696
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CL ++ FC C + LQ ++ + + S S I+R
Sbjct: 697 RYHQLCLNDEAH-KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD- 754
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
T +D QM AQ E K L+ AI ECF PI+ G DLI ++
Sbjct: 755 --------TDSDTNSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVL 802
Query: 737 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
Y G N + + G Y+ IL ++SA LR G ++AE+P + T Y+ +G + L
Sbjct: 803 YNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRL 862
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
F IE + SL VE LV+PA WT FGF + + + K R +F G ML
Sbjct: 863 FDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDML 921
Query: 855 EK 856
+K
Sbjct: 922 QK 923
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 157/614 (25%), Positives = 245/614 (39%), Gaps = 133/614 (21%)
Query: 35 DNVESDSFPN--GKHAKEEALNDDIKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQ 92
++ +SDS N G KE+ LN D+ T + D+ + K +S +
Sbjct: 219 NDKQSDSLVNCGGGVEKEKELNGDV--------------TEKMMDVDKTDNKEQLDSLME 264
Query: 93 VLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHV------- 145
L+ EE GD +T+S + D + + +P+ V
Sbjct: 265 ---------LAMEELPMSGDCKTKSEAIKA--------DCAYGSALATVPESVENSKVST 307
Query: 146 SSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKK 205
SS +R+ T K K+ ++ E+ T +D S + +E+KMSKK
Sbjct: 308 SSEKPLRRFTRSCLKPKQQAMSASPAED----TKAEDALESDEASAIGTTSKLEMKMSKK 363
Query: 206 VVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAH 265
V + PT +K LL+TG+L+G V+Y E+ L G++ G G LC C C +KVV+ +
Sbjct: 364 VALVKIPTKLKGLLATGLLEGLPVRYARGRPEKGLQGVIQGSGILCFCQNCGGTKVVTPN 423
Query: 266 EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW 325
+FE HAG+ + P +IYL+NGK + ++ K AP L E + R
Sbjct: 424 QFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADAL-------------EAAIRNA 470
Query: 326 KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKE 385
+ RK V + K ++ S + C S + S T+ S V D + E
Sbjct: 471 TGAGDARKSTVCLNCKASLPEASFGRPRLQCDS-CMTSKKSQTTPSQVGDANCSRDGQLE 529
Query: 386 ML---------EERKRGV-----------KRPFMHQKRTAEGGT----------KKRDND 415
+ + K G+ + GT ++D
Sbjct: 530 FIFLLNYYWADDLYKLGLPDLRGLQWSPSSNSVLKSTERMSSGTCPPSKVHGRLTRKDLR 589
Query: 416 LHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARR 475
+H+L+F + LPDG L Y V+G+ +SPSQFEAHAG A+RR
Sbjct: 590 MHKLVFEGDVLPDGTALAYYVRGK--------------------VSPSQFEAHAGCASRR 629
Query: 476 QPYRHIYTSNGMTLHDIAISLAMGQRR-----TTGGSDDMCHVCGDGENLLLCNGCPLAF 530
+P + + + + ++ + G + G D+C +C DG +LL C+ CP AF
Sbjct: 630 KPGWYWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICADGGDLLCCDNCPRAF 689
Query: 531 HAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLKGGLEAPG----------------- 572
H C+ IP W C C RS + G
Sbjct: 690 HTECVSLPNIPRGTWYCKYCENMFLKEKFDRSANAIAAGRVAGIDALEQITKRSIRIVDT 749
Query: 573 --AEVGGCVICRLS 584
AEVG CV+CR +
Sbjct: 750 LHAEVGVCVLCRFA 763
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 126/446 (28%), Positives = 189/446 (42%), Gaps = 95/446 (21%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG----MTLHDIAISL 496
++ G GIVC+CC K +S S F AHAG + Q ++ +G + L + +
Sbjct: 880 IKDGMVTWEGIVCNCCKKNLSVSDFMAHAGRS-HPQSSLGLFLESGKSYTLCLVEAWSAE 938
Query: 497 AMGQRRTTGG--------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+M +R G SDD C CGDG LL C+ CP +H ACL +PE W C
Sbjct: 939 SMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCH 998
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
NC C +C SE +TF
Sbjct: 999 NCT--------------------------CQVCGGPFSEK-----------EVSTF--SA 1019
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
+ C QC +H C+ + L L++ WFC C I L
Sbjct: 1020 IFKCFQCGDAYHDTCIEQEKL-PLEDQISQTWFCGKYCKEIFIGL--------------- 1063
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLK------------KAQCFEEKEKSLLSSATAIF 716
R H+ + D +D+ W +L+ K C E L + A +
Sbjct: 1064 -----RSHVGTDNILD---SDLSWSILRCNNDGQKLHSVQKIACLAECNMKL-AVALTLL 1114
Query: 717 RECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 773
ECF ++ G D+IP ++Y G N + ++ G Y+VIL ++ +R+ G +
Sbjct: 1115 EECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKA 1174
Query: 774 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM-S 832
AELP +AT +++ +G + L S IE++LCS NV+ LVL A + S W FGF+ +
Sbjct: 1175 AELPFIATSVDFRRQGMCRILMSIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPIED 1234
Query: 833 RERLLKYQRDFQLTIFKGTSMLEKKV 858
ER K + L +F GTS+L K++
Sbjct: 1235 AER--KQLHNVNLMLFPGTSLLTKRL 1258
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/430 (27%), Positives = 185/430 (43%), Gaps = 81/430 (18%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL------------HDIAISL 496
+GI C CC+K ++ S+FE HAG + RQP+++I+ +G++L +S
Sbjct: 634 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIFLESGLSLLQCQRDAWNRQEESKLLSF 692
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ +DD C +CGDG +L+ C+GCP FH +CLD L+ P W CPNC
Sbjct: 693 HTVEIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCT----- 747
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C C ++ I +C+ + S + + C CE
Sbjct: 748 ---------------------CKYCGVA-----SIDICQGDNTSVSE-----ISTCILCE 776
Query: 617 KEFHVGC-------LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 669
K+FH C + +GL FC C + +LQ + + + S
Sbjct: 777 KKFHESCNLEMDTPVHSSGLV--------TSFCGKSCRELFESLQKNLGVKHELDAGFSW 828
Query: 670 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
S I R T D + +Q E K L+ A + ECF PI+ G
Sbjct: 829 SLIRR-----------TSEDSDVSVRGLSQRIESNSK--LAVALTVMDECFLPIVDRRSG 875
Query: 729 RDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 786
+LI ++Y G N + G Y+ IL ++SA +R G ++AE+P + T Y+
Sbjct: 876 INLIHNVLYNCGSNFYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYR 935
Query: 787 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLT 846
+G + LF IE L VE L++PA + W FGF + L + R +
Sbjct: 936 RQGMCRRLFCAIESALRVFKVEKLIIPAIAELMHTWNVIFGFSPL-EPSLKQEMRLMNML 994
Query: 847 IFKGTSMLEK 856
+F GT ML+K
Sbjct: 995 VFPGTDMLQK 1004
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 20/119 (16%)
Query: 185 SSGANDGVLCARN-----MEIKMSKKVVPNE----YPTNVKKLLSTGILDGACV------ 229
SSG+N VL R + ++ S + + +E P K+ L + ++D V
Sbjct: 556 SSGSNSRVLHGRKGRKLGLLVRGSSRGLDSENDGYVPYTGKRTLLSWLIDSGTVQLSQKV 615
Query: 230 KYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+Y++ + R L+G + G CGC C SK+++ +FE HAG+K R P +I+LE+G
Sbjct: 616 RYMNRRQTRVMLEGWITRDGIHCGC--C--SKILTVSKFEIHAGSKLRQPFQNIFLESG 670
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/430 (27%), Positives = 184/430 (42%), Gaps = 81/430 (18%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL------------HDIAISL 496
+GI C CC+K ++ S+FE HAG + RQP+++I+ +G++L +S
Sbjct: 516 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIFLESGLSLLQCQRDAWNRQEESKLLSF 574
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ +DD C +CGDG +L+ C+GCP FH +CLD L+ P W CPNC
Sbjct: 575 HTVEIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCT----- 629
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C C ++ I +C+ + S + + C CE
Sbjct: 630 ---------------------CKYCGVA-----SIDICQGDNTSVSE-----ISTCILCE 658
Query: 617 KEFHVGC-------LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 669
K+FH C + +GL FC C + +LQ + + + S
Sbjct: 659 KKFHESCNLEMDTPVHSSGLV--------TSFCGKSCRELFESLQKNLGVKHELDAGFSW 710
Query: 670 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 728
S I R + + G ++ S L+ A + ECF PI+ G
Sbjct: 711 SLIRRTSEDSDVSVRGLSQRIE-------------SNSKLAVALTVMDECFLPIVDRRSG 757
Query: 729 RDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 786
+LI ++Y G N + G Y+ IL ++SA +R G ++AE+P + T Y+
Sbjct: 758 INLIHNVLYNCGSNFYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYR 817
Query: 787 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLT 846
+G + LF IE L VE L++PA + W FGF + L + R +
Sbjct: 818 RQGMCRRLFCAIESALRVFKVEKLIIPAIAELMHTWNVIFGFSPL-EPSLKQEMRLMNML 876
Query: 847 IFKGTSMLEK 856
+F GT ML+K
Sbjct: 877 VFPGTDMLQK 886
Score = 46.6 bits (109), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 20/119 (16%)
Query: 185 SSGANDGVLCARN-----MEIKMSKKVVPNE----YPTNVKKLLSTGILDGACV------ 229
SSG+N VL R + ++ S + + +E P K+ L + ++D V
Sbjct: 438 SSGSNSRVLHGRKGRKLGLLVRGSSRGLDSENDGYVPYTGKRTLLSWLIDSGTVQLSQKV 497
Query: 230 KYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+Y++ + R L+G + G CGC C SK+++ +FE HAG+K R P +I+LE+G
Sbjct: 498 RYMNRRQTRVMLEGWITRDGIHCGC--C--SKILTVSKFEIHAGSKLRQPFQNIFLESG 552
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 184/422 (43%), Gaps = 69/422 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG---- 505
GI CDCC+K ++ S+FE HAG A QP+++IY +G +L + Q+ T
Sbjct: 539 GIHCDCCSKILTVSRFEIHAGSKAC-QPFQNIYLESGASLLQCQVRAWNMQKDATNVGLH 597
Query: 506 --------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 598 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHCPNC------- 650
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ A G + G NF +++ C CE+
Sbjct: 651 ---TCKFCDAAVASGGKDG-----------NF-----------------LSLLSCSMCER 679
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CL ++ FC C + LQ ++ + + S S I+R
Sbjct: 680 RYHQLCLSDEAQ-KVQSFGSASSFCGPKCLELFEKLQKYLGVKNEIEGGYSWSLIHRVD- 737
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
T +D+ Q+ AQ E K L+ AI ECF PI+ G +LI ++
Sbjct: 738 --------TDSDINSQL--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVNLIRNVL 785
Query: 737 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
Y G N + + G Y+ IL ++SA LR G ++AE+P + T Y+ +G + L
Sbjct: 786 YNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGTQLAEMPFIGTRHIYRRQGMCRRL 845
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
F IE + SL VE LV+PA WT FGF + + + K R +F G ML
Sbjct: 846 FDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDML 904
Query: 855 EK 856
+K
Sbjct: 905 QK 906
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 105/160 (65%), Gaps = 14/160 (8%)
Query: 182 DLGSSGANDGVLCARNM-------------EIKMSKKVVPNEYPTNVKKLLSTGILDGAC 228
+ GSS + GV+ + + E K SKK +P+NV+ L+STG+LDG
Sbjct: 238 ETGSSQTSSGVVSEQQVAKPSLEPVPKNKAETKSSKKEASTSFPSNVRSLISTGMLDGVP 297
Query: 229 VKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGK 288
V Y+S SRE +L G++ G GYLCGC C F+KV++A+ FE+HAG KT+HPNNHIY ENGK
Sbjct: 298 VTYVSISRE-ELRGVIKGSGYLCGCQTCEFTKVLNAYAFERHAGCKTKHPNNHIYFENGK 356
Query: 289 PIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
IY I+QEL+ P IL +V++ V GS N+ +FR+WK S
Sbjct: 357 TIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKES 396
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 224 VQDPIGTLDIVYGQETGSSQTSSGVVSEQQ---VAKPSLEPVPKNKAETKSSKKEASTSF 280
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHA 469
+ + G+ DG +TY+ + + LR G +G+G +C C K ++ FE HA
Sbjct: 281 PSNVRSLISTGMLDGVPVTYVSISREELR-GVIKGSGYLCGCQTCEFTKVLNAYAFERHA 339
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 340 GCKT-KHPNNHIYFENGKTIYQIVQEL 365
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 122/449 (27%), Positives = 200/449 (44%), Gaps = 69/449 (15%)
Query: 426 LPDGERLTYI--VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
+P ++ Y+ K + L G +GI C CC++ + S+FE HAGM +P ++I
Sbjct: 154 VPLNAKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIHAGMKLC-EPSQNIIL 212
Query: 484 SNGMTLHDIAISLAMGQRRTTGG------------SDDMCHVCGDGENLLLCNGCPLAFH 531
G++L + Q + +DD C +CGDG +L+ C+GCP FH
Sbjct: 213 ETGISLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFH 272
Query: 532 AACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDI 591
+CLD P W C C S + D + N D+
Sbjct: 273 QSCLDIQKFPSGDWHCIYCSCKFCGMFSGNTD--------------------QMNYNLDV 312
Query: 592 RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR-KNGLCDLKEIPKDKWFCCDDCNRIH 650
+D ++ C CE+++H C + ++ + D P FC C +
Sbjct: 313 -------------NDSALLTCQLCEEKYHHMCTQGEDSILDDSSSPS---FCGKTCRELF 356
Query: 651 AALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLS 710
LQ + + + S + + R + FD ++N + +K +C S L+
Sbjct: 357 EQLQMLLGVKHELEDGFSWTLVQRTEVG----FDISLNGIP----QKVEC-----NSKLA 403
Query: 711 SATAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLR 767
A +I ECF PI+ + G +LI ++Y G N + + G ++ IL ++SA +R
Sbjct: 404 VALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIR 463
Query: 768 IFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFG 827
I G ++AE+P + T Y+ +G + L + IE L SLNVE LV+PA + WT FG
Sbjct: 464 IHGNKLAEMPFIGTRHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFG 523
Query: 828 FRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
F+ + K R+ + +F GT ML+K
Sbjct: 524 FKPLEVSS-RKEMRNMNMLVFHGTDMLQK 551
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 134/503 (26%), Positives = 206/503 (40%), Gaps = 94/503 (18%)
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLH-----RLLFLPNGLPDGERLTYIVKGQ 439
E LE+ + G KR RTA G R + + N + GE+++Y +G
Sbjct: 481 ETLEKPRDGQKR---QSSRTAMQGVTPRSSKCKPRCALSWMIDNNLVSPGEKVSY--RGS 535
Query: 440 R----LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA-- 493
+ L G GI C+CC K + + FE+HAG R P +I +G +L D
Sbjct: 536 KGPGELTRGRITREGIECNCCQKIFTLTGFESHAGSTNHR-PAANIILEDGRSLLDCQRK 594
Query: 494 -------------ISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLI 540
Q + G +D +C VC DG +L++C+ CP FH C+ I
Sbjct: 595 KKPRIKMQRVTREAKWKGRQNQHQGETDYICSVCHDGGDLIVCDHCPSTFHKNCVGLEDI 654
Query: 541 PESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFS 600
PE W + K ++ P
Sbjct: 655 PEGEW----FCPPCCCGICGENKFKYNVQEP----------------------------- 681
Query: 601 AATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK-EIPKDKWFCCDDCNRIHAALQDFVSN 659
D ++ CDQCE+++H+GCLR G+ LK + PKD WFC + C I LQ +
Sbjct: 682 ----KDSRLLSCDQCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGK 737
Query: 660 RAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK---EKSLLSSATAIF 716
P + T+ W+ + C E + S L A +
Sbjct: 738 SVVVGPDNLTWTL-------------------WKFMDSDSCDVEAPTGKHSKLDLAVEVI 778
Query: 717 RECFDPII-AECGRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 773
ECF+P GRD+ +++ R N++ F G Y+V+L +++ +R+FG +V
Sbjct: 779 HECFEPATETYTGRDIAEDVIFSRECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKV 838
Query: 774 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 833
AE+PLV T ++ G + L +E+ L +L VE L+LPA W FGF K++
Sbjct: 839 AEIPLVGTRFLFRRLGMCKILMDELEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTD 898
Query: 834 ERLLKYQRDFQLTIFKGTSMLEK 856
++Y D F GT +K
Sbjct: 899 AEKMQYL-DHTFLDFPGTIKCQK 920
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 201/435 (46%), Gaps = 76/435 (17%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA 497
++ G +G+VC CCN+ +S S+F+ HAG + P +++ +G A S
Sbjct: 636 VKTGLVTKDGVVCTCCNRTVSLSEFKNHAGFN-QNCPCLNLFMGSGKPFASCQLEAWSAE 694
Query: 498 MGQRRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR S DD C VCGDG L+ C+ CP FH ACL ++PE W C
Sbjct: 695 YKARRNGWRSEEASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCS 754
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
+C S + S+N + RS DF
Sbjct: 755 SCSCQICSELV--------------------------SDNGE----RSQDFK-------- 776
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP--A 666
C QC ++H CL+ G+ +++ + +FC +C +++ L S+R I A
Sbjct: 777 ---CSQCAHKYHGICLQ--GISKRRKLFPETYFCGKNCEKVYTGL----SSRVGVINPNA 827
Query: 667 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 725
LS I K DG ++ + ++ KA+C S L+ A +I E F ++
Sbjct: 828 DGLSW----SILKCFQEDGKVHSAR-RLALKAEC-----NSKLAVALSIMEESFLSMVDP 877
Query: 726 ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
G D+IP ++Y G N + +F G Y+++L V++S +R+ G VAE+PLVATC
Sbjct: 878 RTGIDMIPHVLYNWGSNFARLDFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLVATCS 937
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF 843
+Y+ +G + L + IE +L SL VE LV+ A WT+ FGF+ M E +R
Sbjct: 938 KYRRQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-I 996
Query: 844 QLTIFKGTSMLEKKV 858
L +F GT +L K +
Sbjct: 997 NLMVFPGTILLMKTL 1011
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 132/436 (30%), Positives = 192/436 (44%), Gaps = 75/436 (17%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDI-AISLA 497
++ G G+VC+CC K +S S F AHAG + Q ++ +G TL + A S
Sbjct: 904 IKDGLVTWEGVVCNCCKKTLSVSGFMAHAGFS-HPQSSLGLFLESGKSYTLCQVEAWSAE 962
Query: 498 MGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR+ SDD C CGDG LL C+ CP +H ACL +PE W C
Sbjct: 963 FMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHPACLSAKELPEGSWYCH 1022
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
NC C IC SE +TF
Sbjct: 1023 NCT--------------------------CQICGGPVSEK-----------EVSTF--SA 1043
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
+ C QC +H C+ + L L++ WFC C I L+ V + I S
Sbjct: 1044 IFKCFQCGDAYHDTCIEQEKL-PLEDQISQTWFCGKYCKEIFIGLRSHVGT--ENILDSE 1100
Query: 669 LS-TINRKHIEKGILFDGTMNDVQ-WQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 725
LS +I R + ND Q ++K C E L + A + ECF ++
Sbjct: 1101 LSWSILRCN-----------NDGQKLHSVQKIACLAECNMKL-AVALTLLEECFIRMVDP 1148
Query: 726 ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
G D+IP ++Y G N + ++ G Y+VIL ++ +R+ G + AELP +AT
Sbjct: 1149 RTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSV 1208
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM-SRERLLKYQRD 842
+Y+ +G + L + IE++LCS NV+ LVL A + S W FGF+ + ER K +
Sbjct: 1209 DYRRQGMCRILMNIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPIEDAER--KQLHN 1266
Query: 843 FQLTIFKGTSMLEKKV 858
L +F GTS+L K++
Sbjct: 1267 VNLMLFPGTSLLTKRL 1282
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 192/436 (44%), Gaps = 78/436 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT---LHDIAISLAMG---QRR 502
+GI C CCN+ S +F HAG + PYR+I +G+ LH + + M +R+
Sbjct: 930 DGINCSCCNEVYSVLEFVTHAGSEVNK-PYRNILV-DGLDIDLLHCLINAWNMQSDVERQ 987
Query: 503 T--------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
+DD C +CGDG NL+ C+GCP FH +CL ++P W C NC
Sbjct: 988 DFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEVLPSDYWCCANC---- 1043
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD-DRTVIYCD 613
C C S D + T D D ++ C
Sbjct: 1044 ----------------------SCKFCHE-----------HSSDGAEDTADVDYSLHTCS 1070
Query: 614 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 673
QCE+++H C + FC C + LQ+ ++ + P S +
Sbjct: 1071 QCEEQYHEACSPETDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQ 1130
Query: 674 RKHIEKGILFDGTMNDVQWQML---KKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGR 729
R H DV ++L K+ +C S ++ A ++ ECF PII + G
Sbjct: 1131 RIH-----------EDVPEEVLALDKRVEC-----NSRIAVALSLMDECFLPIIDQRTGI 1174
Query: 730 DLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQG 787
+LI +VY G N + +F G Y IL +++A +RI G ++AE+P + T Y+
Sbjct: 1175 NLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRR 1234
Query: 788 KGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTI 847
+G + L IE +L SLNVE L++PA + WT KFGF + + + + +
Sbjct: 1235 QGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDSEKQEV-KSISMLV 1293
Query: 848 FKGTSMLEKK-VQCLP 862
F GT +L+K ++ LP
Sbjct: 1294 FPGTGLLQKPLLKALP 1309
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 132/452 (29%), Positives = 197/452 (43%), Gaps = 86/452 (19%)
Query: 431 RLTYIVKGQR-----LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSN 485
++ Y VKG+ L G +GI C+CC S FE HA ++ +P I+ +
Sbjct: 644 KVYYKVKGRHRKVCTLADGKITRDGIKCNCCMGIYSFVGFENHASGSSTCRPSARIFLED 703
Query: 486 GMTLHDIAISLAMGQ--RRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAAC 534
G +L D I + R TTG S D +C VC G L+LC+ CP +FH C
Sbjct: 704 GRSLLDCQIKMMHDHKTRETTGKSFSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTC 763
Query: 535 LDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
L IP W CP+C G R +D EVG
Sbjct: 764 LGLEDIPNGDWFCPSCCCGICGQ--RKID-------GDDEVG------------------ 796
Query: 595 RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI----H 650
++ C QCE ++HV CL +NG D+ WFC DC +I H
Sbjct: 797 -------------QLLPCIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKIYEGLH 842
Query: 651 AALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLS 710
L + VS + + + IN E D + +D+ + K L+
Sbjct: 843 KLLGEPVSVGVDNLTWTLVKFINPDSCEH----DSSKSDLLAESYSK-----------LN 887
Query: 711 SATAIFRECFDPIIAE-CGRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLR 767
A ++ ECF+P+ RDL+ +++ R ++ F G Y+V+L ++S +R
Sbjct: 888 LAISVMHECFEPLKESLTNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVATVR 947
Query: 768 IFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFG 827
++G++VAE+PLV T +Y+ +G L +E+ L L VE LVLPA WT+ FG
Sbjct: 948 VYGKKVAEIPLVGTRLQYRRRGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRSFG 1007
Query: 828 FRKMS---RERLLKYQRDFQLTIFKGTSMLEK 856
F KM+ R + L D+ F+G M +K
Sbjct: 1008 FAKMTNLERSQFL----DYTFLDFQGAIMCQK 1035
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 124/424 (29%), Positives = 185/424 (43%), Gaps = 69/424 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 18 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 76
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 77 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 129
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ A G + G + +++ C CE+
Sbjct: 130 ---TCKFCDAAVASGGKDGNFI----------------------------SLLSCGMCER 158
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CL ++ FC C + LQ ++ + + S S I+R
Sbjct: 159 RYHQLCLNDEAH-KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD- 216
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
T +D QM AQ E K L+ AI ECF PI+ G DLI ++
Sbjct: 217 --------TDSDTNSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVL 264
Query: 737 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
Y G N + + G Y+ IL ++SA LR G ++AE+P + T Y+ +G + L
Sbjct: 265 YNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRL 324
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
F IE + SL VE LV+PA WT FGF + + + K R +F G ML
Sbjct: 325 FDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDML 383
Query: 855 EKKV 858
+K +
Sbjct: 384 QKPL 387
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 72/91 (79%), Positives = 82/91 (90%)
Query: 237 ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQE 296
+R+LDGI++GGGYLCGC CNFSKV+SA+EFEQHAGAKTRHPNNHIYLENGKPIYSIIQE
Sbjct: 4 QRELDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQE 63
Query: 297 LKTAPLGILEEVVKKVAGSSFNEGSFRVWKA 327
LKTAPL +++ V+K VAGSS NE FRVWK
Sbjct: 64 LKTAPLSMIDGVIKDVAGSSINEEFFRVWKG 94
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 42/98 (42%), Gaps = 23/98 (23%)
Query: 439 QRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI-- 492
QR G G G +C C CN K +S +FE HAG A R P HIY NG ++ I
Sbjct: 4 QRELDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAG-AKTRHPNNHIYLENGKPIYSIIQ 62
Query: 493 -----AISLAMGQRRTTGGSD-----------DMCHVC 514
+S+ G + GS D+C +C
Sbjct: 63 ELKTAPLSMIDGVIKDVAGSSINEEFFRVWKGDICFIC 100
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/439 (28%), Positives = 181/439 (41%), Gaps = 84/439 (19%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQR----RTT 504
+GI CC + SP FEAHAG + + S+ + SL GQR R T
Sbjct: 278 DGIKNSCCQEVFSPRGFEAHAGSS--------FHQSDANIFLEDEGSLLEGQRQMVHRIT 329
Query: 505 GGS------------------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
G S DD+C VC G +L+LC+ CP FH +CL +PE W
Sbjct: 330 GKSFTKESSHGKKSNGDQCNNDDICSVCHYGGDLVLCDQCPSCFHQSCLGLKELPEGDWF 389
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDD 606
CP+C C R+C + F + +D
Sbjct: 390 CPSC---------------------------CC------------RICGENRFDEYSEED 410
Query: 607 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 666
C QCE ++HVGCLRK L+ P FC C +I L +
Sbjct: 411 NFKFSCHQCELQYHVGCLRKQRHVKLETYPDGTRFCSTQCEKIFLGLLKLLGKPIPVGVD 470
Query: 667 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 725
+ T+ + I + D N ++ K L+ A + ECF+PI
Sbjct: 471 NLTWTLLKPTISEWFDMDVPDNKALTEVYSK-----------LNIALNVMHECFEPIKEP 519
Query: 726 ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
GRDL+ +++ G ++ F G Y V+L ++S +R+ G +VAE+PLV T
Sbjct: 520 HTGRDLVEDVIFCRGSDLKRLNFRGFYIVLLERNDELISVATIRVHGEKVAEVPLVGTRS 579
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF 843
+Y+ G + L + IE+ L L VE L LPAA W FGF KM+ L + D+
Sbjct: 580 QYRRLGMCRILINEIEKKLVELGVERLTLPAAPSVLDTWVTSFGFSKMTDSERLTF-LDY 638
Query: 844 QLTIFKGTSMLEKKVQCLP 862
F+ T M +K + +P
Sbjct: 639 TFLDFQDTVMCQKLLMKIP 657
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 121/436 (27%), Positives = 187/436 (42%), Gaps = 76/436 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS-----------NGMTLHDIAISLA 497
+GI C CC+K +S +F AHAG PYR+I N + A
Sbjct: 931 DGIDCSCCSKVLSVLEFVAHAGSEVN-TPYRNILVDGQDIDLLHCLINAWNMQSDAEKQD 989
Query: 498 MGQRRTTGG--SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
G +DD C +CGDG NL+ C+GCP FH +CL +P W C NC
Sbjct: 990 FFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEELPSDYWCCANC----- 1044
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD-DRTVIYCDQ 614
C C S+D + T D D ++ C Q
Sbjct: 1045 ---------------------SCKFCHE-----------HSNDGAEDTADVDSSLHTCSQ 1072
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 674
CE+++H C +N FC C + LQ+ ++ + P S + R
Sbjct: 1073 CEEQYHEACSPENDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQR 1132
Query: 675 KHIEKGILFDGTMNDVQWQML---KKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRD 730
H DV ++L + +C S ++ A ++ ECF PI+ + G +
Sbjct: 1133 IH-----------EDVPEEVLPLDTRVEC-----NSKIAVALSLMDECFLPIVDQRTGIN 1176
Query: 731 LIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
LI +VY G N + +F G Y IL +++A +RI G ++AE+P + T Y+ +
Sbjct: 1177 LIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQ 1236
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 848
G + L IE +L SLNVE L++PA + WT +FGF + + + + +F
Sbjct: 1237 GMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSRFGFSPLEDSEKEEV-KSISMLVF 1295
Query: 849 KGTSMLEKK-VQCLPE 863
GT +L+K ++ LP+
Sbjct: 1296 PGTGLLQKPLLKALPK 1311
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 124/424 (29%), Positives = 185/424 (43%), Gaps = 69/424 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 38 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 96
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 97 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 149
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ A G + G + +++ C CE+
Sbjct: 150 ---TCKFCDAAVASGGKDGNFI----------------------------SLLSCGMCER 178
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CL ++ FC C + LQ ++ + + S S I+R
Sbjct: 179 RYHQLCLNDEAH-KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD- 236
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
T +D QM AQ E K L+ AI ECF PI+ G DLI ++
Sbjct: 237 --------TDSDTNSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVL 284
Query: 737 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
Y G N + + G Y+ IL ++SA LR G ++AE+P + T Y+ +G + L
Sbjct: 285 YNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRL 344
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSML 854
F IE + SL VE LV+PA WT FGF + + + K R +F G ML
Sbjct: 345 FDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDML 403
Query: 855 EKKV 858
+K +
Sbjct: 404 QKPL 407
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 67/137 (48%), Positives = 99/137 (72%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
+N E K +KK N +P+NVK LLSTGI DG VKY + SRE+ L G++ G GYLC C
Sbjct: 384 VVKNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYCTWSREKNLQGVIKGTGYLCSC 443
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
+C K ++A+EFE+HAGAK++HPN+HI+ ENGK +Y+++QELK +P +L + ++ V
Sbjct: 444 DICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQEMLFDAIQTVT 503
Query: 314 GSSFNEGSFRVWKASHH 330
G++ N+ +FR WK+S+
Sbjct: 504 GATINQRNFRTWKSSYQ 520
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/125 (63%), Positives = 96/125 (76%), Gaps = 1/125 (0%)
Query: 735 MVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
MVYGR++ GQEFGGMY IL V S VVSA +LR+FG+++AELPLVAT GKG FQ L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSM 853
FSCIERLL L V+ LVLPAAE+AESIWT+KFGF ++ ++L Y+R Q+ FKGTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 854 LEKKV 858
L+K V
Sbjct: 121 LQKTV 125
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 140/278 (50%), Gaps = 30/278 (10%)
Query: 593 LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
LC DF F D+TV+ CDQCEKE+H+GCLR++ + D++ +P+ +WFC D+C RI A
Sbjct: 1514 LCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVRIRDA 1573
Query: 653 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 712
L + V+ +P + +WQ+L + + ++ LS+
Sbjct: 1574 LGEDVAAGEVLMPGNPA--------------------YRWQIL-RGKNGRQQTWHALSTV 1612
Query: 713 TAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVIL-------TVKSVVVSAG 764
I +E FDPII G DL+P MV +F GMYS++L + V A
Sbjct: 1613 LNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVLAA 1672
Query: 765 LLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTK 824
L R+ G +AE+PLVAT + + +G +AL + L L V +VLPA A+ W +
Sbjct: 1673 LFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLGLGVRAIVLPATADAQPAW-R 1731
Query: 825 KFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLP 862
+ F+ + + + ++ IF T++L + + +P
Sbjct: 1732 QLQFQDLDEPSTRVARSEHRMVIFPHTTVLARPLIRVP 1769
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 48/88 (54%), Gaps = 9/88 (10%)
Query: 418 RLLF--LPNGLPDGERLTYIVKGQRLRFGCKQ-------GNGIVCDCCNKEISPSQFEAH 468
R LF LP GL + + Y ++G+RL G Q +GI+C C+ IS S FEAH
Sbjct: 1351 RTLFDGLPGGLKHDDIVWYTLQGKRLLRGVVQLQACGAATSGILCGHCDTIISASAFEAH 1410
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISL 496
AG RR PY I T +GMTL IA L
Sbjct: 1411 AGHKHRRNPYEAILTDDGMTLKRIAERL 1438
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 68/131 (51%), Positives = 97/131 (74%), Gaps = 1/131 (0%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCN 257
+IK ++K PN +PTNV+ L+STG+LDG VKY+S +RE +L GI+ G YLCGC CN
Sbjct: 296 QDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVARE-ELRGIIKGTTYLCGCQSCN 354
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++K ++A EFE+HAG K++HPNNHIY ENGK IY I+QEL++ P L + ++ + G+
Sbjct: 355 YAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQTIFGAPI 414
Query: 318 NEGSFRVWKAS 328
N+ +FR+WK S
Sbjct: 415 NQKAFRIWKES 425
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 64/150 (42%), Gaps = 16/150 (10%)
Query: 352 SIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKK 411
S + S+ A E+ + ++ V T T E + + K+ +K T +
Sbjct: 256 SSVHVSTTAHEKELEASNSDVVASTPLVTTKKPESVSKNKQDIKSTRKESPNTFPTNVRS 315
Query: 412 RDNDLHRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFE 466
+ G+ DG + Y+ V + LR G +G +C C CN K ++ +FE
Sbjct: 316 ---------LISTGMLDGVPVKYVSVAREELR-GIIKGTTYLCGCQSCNYAKGLNAFEFE 365
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISL 496
HAG ++ P HIY NG T++ I L
Sbjct: 366 KHAGCKSK-HPNNHIYFENGKTIYQIVQEL 394
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 192/437 (43%), Gaps = 83/437 (18%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 727 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 785
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 786 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 845
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
C C+ ++ + D + D ++
Sbjct: 846 --------------------------SCKFCQEHSRQD-------AQDIAEV---DSSLC 869
Query: 611 YCDQCEKEFHVGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 662
C QCE+++H GC + + CDL FC C + L++ ++ +
Sbjct: 870 TCSQCEEKYHPGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKD 921
Query: 663 TIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 722
P S I R H + + + ++ +C S ++ A ++ ECF P
Sbjct: 922 LEPEFSCRIIQRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLP 968
Query: 723 IIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
I+ + G +LI +VY G N +F G Y +L +++A +RI G ++AE+P +
Sbjct: 969 IVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFI 1028
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 839
T Y+ +G + L IE +L SLNVE L++PA + WT KFGF + +
Sbjct: 1029 GTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEV 1088
Query: 840 QRDFQLTIFKGTSMLEK 856
+ L +F GT +L+K
Sbjct: 1089 KSTSML-VFPGTGLLQK 1104
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 192/437 (43%), Gaps = 83/437 (18%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 882 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 940
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 941 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 1000
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
C C+ ++ + D + D ++
Sbjct: 1001 --------------------------SCKFCQEHSRQD-------AQDIAEV---DSSLC 1024
Query: 611 YCDQCEKEFHVGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 662
C QCE+++H GC + + CDL FC C + L++ ++ +
Sbjct: 1025 TCSQCEEKYHPGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKD 1076
Query: 663 TIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 722
P S I R H + + + ++ +C S ++ A ++ ECF P
Sbjct: 1077 LEPEFSCRIIQRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLP 1123
Query: 723 IIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
I+ + G +LI +VY G N +F G Y +L +++A +RI G ++AE+P +
Sbjct: 1124 IVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFI 1183
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 839
T Y+ +G + L IE +L SLNVE L++PA + WT KFGF + +
Sbjct: 1184 GTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEV 1243
Query: 840 QRDFQLTIFKGTSMLEK 856
+ L +F GT +L+K
Sbjct: 1244 KSTSML-VFPGTGLLQK 1259
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 192/437 (43%), Gaps = 83/437 (18%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 863 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 921
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 922 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 981
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
C C+ ++ + D + D ++
Sbjct: 982 --------------------------SCKFCQEHSRQD-------AQDIAEV---DSSLC 1005
Query: 611 YCDQCEKEFHVGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 662
C QCE+++H GC + + CDL FC C + L++ ++ +
Sbjct: 1006 TCSQCEEKYHPGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKD 1057
Query: 663 TIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 722
P S I R H + + + ++ +C S ++ A ++ ECF P
Sbjct: 1058 LEPEFSCRIIQRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLP 1104
Query: 723 IIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
I+ + G +LI +VY G N +F G Y +L +++A +RI G ++AE+P +
Sbjct: 1105 IVDQRTGINLIRNVVYNCGSNFVRMDFRGFYIFVLERGDEIIAAASVRIHGTKLAEMPFI 1164
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 839
T Y+ +G + L IE +L SLNVE L++PA + WT KFGF + +
Sbjct: 1165 GTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEV 1224
Query: 840 QRDFQLTIFKGTSMLEK 856
+ L +F GT +L+K
Sbjct: 1225 KSTSML-VFPGTGLLQK 1240
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 131/436 (30%), Positives = 192/436 (44%), Gaps = 73/436 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRH-IYTSNGMTLHDI-AISLAM 498
L+ G +GI+C CC+K +S S F+AHA ++ R + + TL I A +
Sbjct: 577 LKDGWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSLNLCLQSGKSFTLCQIEAWNAEY 636
Query: 499 GQRRTTG---------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
RR+ G+DD C CGDG LL C+ CP +H +CL +P+ W C N
Sbjct: 637 MDRRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHN 696
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLS--PSENFDIRLCRSHDFSAATFDDR 607
C ICR+ P +I FSA
Sbjct: 697 C-----------------------------ICRICGCPVTEKEIS-----SFSA------ 716
Query: 608 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPAS 667
+I C QC H C+ + G +E+ D+WFC C I+ L V S
Sbjct: 717 -IIKCLQCGAAHHDTCV-EMGATAFEEMDSDEWFCGTHCKEIYLGLHGCVGVE------S 768
Query: 668 SLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AE 726
SL I + M+ VQ K A E K L+ A + ECF ++
Sbjct: 769 SLGDGLSWTILRCNSGGQKMHSVQ----KIAHAIECNSK--LAVALTLMEECFAQMVDTR 822
Query: 727 CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCRE 784
G ++IP ++Y G + + G Y+VIL ++ A +R+ G + AELP +ATCRE
Sbjct: 823 TGINMIPHVLYNQGSKYARLNYQGFYTVILEKGEEILCAASIRVHGMKAAELPFIATCRE 882
Query: 785 YQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-F 843
++ KG + L + IE +L S +V+ LVL A + S W FGF+ + E + Q D
Sbjct: 883 HRRKGMCRRLINTIEEMLKSFHVKMLVLSAIPELVSTWVSGFGFKPI--EEYERKQLDTI 940
Query: 844 QLTIFKGTSMLEKKVQ 859
L +F GTS+L K ++
Sbjct: 941 NLMLFPGTSLLIKSLE 956
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 117/437 (26%), Positives = 192/437 (43%), Gaps = 83/437 (18%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 335 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 393
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 394 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 453
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
C C+ ++ + D + D ++
Sbjct: 454 --------------------------SCKFCQEHSRQD-------AQDIAEV---DSSLC 477
Query: 611 YCDQCEKEFHVGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 662
C QCE+++H GC + + CDL FC C + L++ ++ +
Sbjct: 478 TCSQCEEKYHPGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKD 529
Query: 663 TIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 722
P S I R H + + + ++ +C S ++ A ++ ECF P
Sbjct: 530 LEPEFSCRIIQRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLP 576
Query: 723 IIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
I+ + G +LI +VY G N +F G Y +L +++A +RI G ++AE+P +
Sbjct: 577 IVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFI 636
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 839
T Y+ +G + L IE +L SLNVE L++PA + WT KFGF + +
Sbjct: 637 GTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEV 696
Query: 840 QRDFQLTIFKGTSMLEK 856
+ + +F GT +L+K
Sbjct: 697 -KSTSMLVFPGTGLLQK 712
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 119/426 (27%), Positives = 186/426 (43%), Gaps = 67/426 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI CDCC + + S FEAHAG + QP ++I+ NG +L + Q +
Sbjct: 17 DGIQCDCCGETFAISDFEAHAG-SKSCQPLKNIFLENGPSLLHCQLESWHRQDESDRKGF 75
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG NL+ C+ CP FH +CL+ +P W C C
Sbjct: 76 HFVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNCTYC------ 129
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C C ++ + C+ + AA ++ C CE
Sbjct: 130 --------------------SCKFCGMAGGDA-----CQMDENDAAA--RPALLTCCLCE 162
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
+++H C+ + + FC C +H LQ + + + + + + R
Sbjct: 163 EKYHHSCIPAED--TINDYHSSLSFCGKKCQELHDKLQALLGVKHEMEEGFAWTVVRRFD 220
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVM 735
+ I G M +K +C S ++ A I ECF P+ G +LI +
Sbjct: 221 VGSDITLSG--------MHRKVEC-----NSKVAVALHIMDECFLPMPDHRSGVNLIRNI 267
Query: 736 VY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
VY G N + + G + IL V+SA +RI G ++AE+P + T Y+ +G +
Sbjct: 268 VYNFGSNFNRLNYCGFLTAILERGDEVISAASIRIHGNQLAEMPFIGTRHMYRRQGMCRR 327
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQ-RDFQLTIFKGTS 852
L IE LCSLNVE LV+PA + WT FGF+++ E L K + R ++ F G
Sbjct: 328 LLGAIETALCSLNVEKLVIPAISELRETWTSVFGFKQL--EGLSKQKMRYMKMVAFPGVD 385
Query: 853 MLEKKV 858
ML+K +
Sbjct: 386 MLQKPL 391
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 185/425 (43%), Gaps = 71/425 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC + + + FE HA A +I+ +G +L + + +++
Sbjct: 635 GIKCSCCRRVFTINGFEVHAN-GASCSGAANIFLDDGRSLLECQVEAYKKRKKAQPPDML 693
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D C VC G L+LC+GCP AFHA CL +P+ W C +C
Sbjct: 694 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSC-------- 745
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
C C F ++ ++ + I C QCE +
Sbjct: 746 ------------------CCGAC-----GQFFLKT------TSTNAKEEKFISCKQCELK 776
Query: 619 FHVGCLRKNGLCD-LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CLR +G CD L +I +KWFC DC I L D + + + + L+ + +
Sbjct: 777 YHPSCLRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIG-KPREVSVEKLTWRLVQSL 835
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
E + D K + E +LS A + E F+P+ GRDL ++
Sbjct: 836 EPNMYGDDA---------SKIEAAAENH-CILSVALDVMHELFEPVKRPHGGRDLAEDVI 885
Query: 737 YGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
+ R F G Y+V+L + +VS +RI G++VAE+P + T +++ +G + L
Sbjct: 886 FSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVL 945
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQRDFQLTIFKGT 851
+ +E++L L VE LVLPA + W FGF KM+ R+ LK F L F T
Sbjct: 946 INELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK----FTLLEFGRT 1001
Query: 852 SMLEK 856
+ EK
Sbjct: 1002 ILCEK 1006
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 153 bits (387), Expect = 4e-34, Method: Composition-based stats.
Identities = 70/99 (70%), Positives = 76/99 (76%)
Query: 451 IVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDM 510
IVC CC EISPSQFE+HAGM+ARRQPYRHIYTSNG++LHDIAISLA GQ TTG DDM
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
C GDG +L+ C CP AFHAACLD PE W CPN
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHCPN 102
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 192/437 (43%), Gaps = 83/437 (18%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 198 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 256
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 257 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 316
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
C C+ ++ + D + D ++
Sbjct: 317 --------------------------SCKFCQEHSRQD-------AQDIAEV---DSSLC 340
Query: 611 YCDQCEKEFHVGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 662
C QCE+++H GC + + CDL FC C + L++ ++ +
Sbjct: 341 TCSQCEEKYHPGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKD 392
Query: 663 TIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 722
P S I R H + + + ++ +C S ++ A ++ ECF P
Sbjct: 393 LEPEFSCRIIQRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLP 439
Query: 723 IIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
I+ + G +LI +VY G N +F G Y +L +++A +RI G ++AE+P +
Sbjct: 440 IVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFI 499
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 839
T Y+ +G + L IE +L SLNVE L++PA + WT KFGF + +
Sbjct: 500 GTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEV 559
Query: 840 QRDFQLTIFKGTSMLEK 856
+ L +F GT +L+K
Sbjct: 560 KSTSML-VFPGTGLLQK 575
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 191/427 (44%), Gaps = 68/427 (15%)
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT--- 504
GNGI C CC+K + S FE HAG + P ++IY G +L + Q +
Sbjct: 494 GNGIHCGCCDKIFTISDFELHAG-SKLADPLKNIYVGEGTSLLQCLLDSWNKQDESERKG 552
Query: 505 ---------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+DD C VCGDG +L+ C+GCP FH CLD P W C C
Sbjct: 553 FHFVDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHQGCLDIKKFPSGDWHCIYCCCKFC 612
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
S+S S + + +N ++ + + ++ C C
Sbjct: 613 GSVSGSSNQR--------------------DDNDELIVSK-------------LLTCQLC 639
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 675
E+++H C+ N + +D +FC + C + L+ + + + S + I R
Sbjct: 640 EEKYHRSCIEANDAN--TDDSRDVFFCGNRCQELSERLEMLLGVKHEMEDGYSWTFIRRS 697
Query: 676 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPV 734
+ FD + ++ QM+ +C S L+ A +I ECF P I G +LI
Sbjct: 698 DVG----FDASQ--IKPQMV---EC-----NSKLAVAVSIMDECFMPYIDHRSGINLIHS 743
Query: 735 MVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQ 792
++Y R N + + G + IL ++SA +RI G ++AE+P + T Y+ +G +
Sbjct: 744 ILYNRGSNFNRLNYSGFVTAILERGDEIISAASIRIRGNQLAEMPFIGTRYMYRRQGMCR 803
Query: 793 ALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM-SRERLLKYQRDFQLTIFKGT 851
L + +E L SLNVE LV+PA + WT FGF + S + + + ++ L +F
Sbjct: 804 RLLNAVEWGLGSLNVELLVIPAISELRETWTSVFGFESLESTSKQILHNKN--LLVFPHV 861
Query: 852 SMLEKKV 858
ML+KK+
Sbjct: 862 DMLQKKI 868
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 121/429 (28%), Positives = 189/429 (44%), Gaps = 74/429 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS--NGMTLHDIAISL---AMGQRR- 502
+GI C CC+K ++ +F AHAG +PYR+I + LH + I+ + +R+
Sbjct: 958 DGINCSCCSKVVTVLEFVAHAG-GQLSKPYRNILVDGLDNDLLHCLIIAWDKQSDSERQA 1016
Query: 503 ----TTGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+T G +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC NC
Sbjct: 1017 FFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRCTNC----- 1071
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
C +C E+ + HD D + C QC
Sbjct: 1072 ---------------------SCKLCH----EHLN------HDAPDNAEID-PLHSCSQC 1099
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS---LSTI 672
EK++H C + FC C + LQ+ ++ P + + I
Sbjct: 1100 EKKYHPSCSPETEKLSSVSSQAGNHFCQQSCRLLFEELQNLLAVEKDLGPEYACRIIQCI 1159
Query: 673 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 731
+ E + DG + +C S ++ A ++ ECF PI+ + G +L
Sbjct: 1160 HEDAPETVLDLDG-----------RVEC-----NSKIAVALSLMDECFLPIVDQRTGINL 1203
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
I +VY G N +F G Y IL +VSA +RI G ++AE+P + T Y+ +G
Sbjct: 1204 IRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKLAEMPFIGTRHMYRRQG 1263
Query: 790 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 849
+ L IE +L SL VE L++PA + WT KFGF + + + + +F
Sbjct: 1264 MCRRLLDGIEMILSSLKVEKLIIPAINELVDTWTSKFGFSPLEVSDKQEV-KSINMLVFP 1322
Query: 850 GTSMLEKKV 858
GT +L+K +
Sbjct: 1323 GTGLLQKSL 1331
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 117/435 (26%), Positives = 188/435 (43%), Gaps = 93/435 (21%)
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQP---YRHIYTSNGMTLHDIAISLAMGQRRTT 504
G+GI C+CC K + F+ HA R + +I+ +G +L D + + +
Sbjct: 353 GSGIKCNCCGKVYTLCGFDYHASGKQGRTATSIFSNIFLEDGRSLLDCQMQIMHDHTKNL 412
Query: 505 GG--------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
G +D +C VC G L+LC+ CP +FH +CL + +P+ W C +C
Sbjct: 413 GEEPLERWQSSKDQVENDHICSVCHYGGELILCDQCPSSFHKSCLGLMDVPDGDWFCSSC 472
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
C IC C D S + +D V+
Sbjct: 473 --------------------------CCKICGQ----------CLKRD-SDLSMEDDGVL 495
Query: 611 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS 670
C QCE+++HV CL L+ PK+ WFC C +I L + + + IP
Sbjct: 496 DCTQCERKYHVVCLGNKREECLEYFPKEHWFCSKRCQQIFLGLHELLGKK---IPVG--- 549
Query: 671 TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKS----------LLSSATAIFRECF 720
++++ W +LK Q ++ E S +L+ A + E F
Sbjct: 550 ----------------LHNLTWTLLKSIQFNDQCEASDIEALSENYSMLNIALDMMHEFF 593
Query: 721 DPII-AECGRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELP 777
DP+ RDL+ +++ + ++ F G Y+V+L +S +R++G +VAE+P
Sbjct: 594 DPVEEPHTKRDLLKDVIFSKRSELNRLNFHGFYTVLLQKDDEFISVATVRVYGEKVAEIP 653
Query: 778 LVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RE 834
LV T +Y+ G L + +E+ L L V+ L+LPA A + W FGF K++ R
Sbjct: 654 LVGTRFQYRRLGMCCILMNVLEKKLRELGVQRLILPAVPSALNTWIGSFGFSKLTELDRL 713
Query: 835 RLLKYQ-RDFQLTIF 848
+LL Y DFQ TI
Sbjct: 714 QLLDYTFLDFQDTIM 728
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 119/432 (27%), Positives = 189/432 (43%), Gaps = 82/432 (18%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT---LHDIAISLAM---GQRR 502
+GI C CC++ IS +F HAG + PYR+I +G+ LH + + M +R+
Sbjct: 935 DGINCSCCSEVISVPEFVTHAGSEVNK-PYRNILV-DGLDIDLLHCLINAWNMQSDAERQ 992
Query: 503 T--------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
+DD C +CGDG NL+ C+GCP FH +CL +P W C NC
Sbjct: 993 DFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNC---- 1048
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD-DRTVIYCD 613
C C S D + T D D ++ C
Sbjct: 1049 ----------------------SCKFCHE-----------HSSDDAEDTADVDSSLHTCS 1075
Query: 614 QCEKEFHVGCLRKNGLCDLKEIPKD------KWFCCDDCNRIHAALQDFVSNRAQTIPAS 667
QCE++ C D+ I + FC C + LQ+ ++ + P
Sbjct: 1076 QCEEQCTEACSP-----DIDSIATNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEY 1130
Query: 668 SLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE- 726
S + R H E + + + K+ +C S ++ A ++ ECF PI+ +
Sbjct: 1131 SCRVVQRIHEE--------VPEEVLALDKRVEC-----NSKIAVALSLMDECFLPIVDQR 1177
Query: 727 CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCRE 784
G +LI +VY G N + +F G Y +IL +++A +RI G ++AE+P + T
Sbjct: 1178 TGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRIHGTKLAEMPFIGTRNM 1237
Query: 785 YQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQ 844
Y+ +G + L IE +L SLN+E L++PA + WT KFGF + + +
Sbjct: 1238 YRRQGMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEV-KSVS 1296
Query: 845 LTIFKGTSMLEK 856
+ +F GT +L+K
Sbjct: 1297 MLVFPGTGLLQK 1308
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 95/137 (69%), Gaps = 1/137 (0%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N +I +KK + +P NVK LLSTG+ DG VKY+S +RE+ + G++ G GYLC C
Sbjct: 456 VSKNKQIN-TKKDPSSHFPLNVKSLLSTGMFDGVPVKYVSWTREKSVRGVIKGSGYLCSC 514
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CN S ++A+EFE+HA KT+HPNNHIY ENGK IY+++QELK P L EV++ V
Sbjct: 515 KDCNSSNCLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVT 574
Query: 314 GSSFNEGSFRVWKASHH 330
G N+ +F+ WKAS+
Sbjct: 575 GCPINQKNFQTWKASYQ 591
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 95/137 (69%), Gaps = 1/137 (0%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N +I +KK + +P NVK LLSTG+ DG VKY+S +RE+ + G++ G GYLC C
Sbjct: 431 VSKNKQIN-TKKDPSSHFPLNVKSLLSTGMFDGVPVKYVSWTREKSVRGVIKGSGYLCSC 489
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CN S ++A+EFE+HA KT+HPNNHIY ENGK IY+++QELK P L EV++ V
Sbjct: 490 KDCNSSNCLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVT 549
Query: 314 GSSFNEGSFRVWKASHH 330
G N+ +F+ WKAS+
Sbjct: 550 GCPINQKNFQTWKASYQ 566
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 65/106 (61%), Positives = 83/106 (78%)
Query: 223 ILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHI 282
+LDG VKYI+ SRE++L GI+ G GYLCGC CNFSKV++A+EFE+HAG KT+HPNNHI
Sbjct: 1 MLDGVPVKYIAWSREKELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNNHI 60
Query: 283 YLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
Y ENGK IY I+QELK+ P L +V++ + GS N+ SFR+WK S
Sbjct: 61 YFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSFRLWKES 106
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 8/77 (10%)
Query: 426 LPDGERLTYIV--KGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQPYR 479
+ DG + YI + + LR G +G+G +C C CN K I+ +FE HAG + P
Sbjct: 1 MLDGVPVKYIAWSREKELR-GIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTK-HPNN 58
Query: 480 HIYTSNGMTLHDIAISL 496
HIY NG T++ I L
Sbjct: 59 HIYFENGKTIYGIVQEL 75
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/442 (27%), Positives = 184/442 (41%), Gaps = 96/442 (21%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI-----SLAMGQRRT 503
NGI C CCNK + + FE H + R HI +G +L D I + + + +
Sbjct: 828 NGIKCCCCNKLYTINGFEIHVSGTSSRSA-AHILLEDGKSLLDCQILWNKKTRSFKNQAS 886
Query: 504 TGGSDD--------MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
T G D +C +C G L+LC+ CP +FH +CL +PE W CP+C G
Sbjct: 887 TCGKGDYSKDENDYICSICHFGGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCG-- 944
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSA-ATFDDRTVIYCDQ 614
+C + S A D + C Q
Sbjct: 945 -------------------------------------ICGQNKLSEHANIVDGPFLTCYQ 967
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV--------SNRAQTI-- 664
CE ++HV CLR G K WFC C +I+ LQ + N ++
Sbjct: 968 CECKYHVQCLR--GTKKFGSCSKPHWFCNKHCKQIYWGLQKLLGKSIPVGGDNLTWSLLK 1025
Query: 665 -PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPI 723
P+S + N H+E T+ + Q S L+ A + ECF+P+
Sbjct: 1026 SPSSDTNYFNPPHLE-------TLTENQ---------------SKLNVALRVMHECFEPV 1063
Query: 724 IAE-CGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVA 780
+ RD++ +++ R + F G Y+V+L +++ +R++G +VAE+PLV
Sbjct: 1064 REQHTRRDIVEDVIFSRRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVG 1123
Query: 781 TCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLL 837
T +Y+ G L + +E L L V+ LVLPA WT FGF KM+ R L
Sbjct: 1124 TRFQYRRLGMCHILMNELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSERSEFL 1183
Query: 838 KYQ-RDFQLTIFKGTSMLEKKV 858
Y +FQ T+ +L+ V
Sbjct: 1184 NYTFLNFQETVMCQKFLLKNTV 1205
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 138/501 (27%), Positives = 210/501 (41%), Gaps = 109/501 (21%)
Query: 401 QKRTAEGGTKKRDND------LHRLLFLP------NGLPDGERLTYIVKGQRLRFGCKQG 448
Q R A GT +R+ + R L P N DG +T+ R
Sbjct: 221 QPRKAAKGTSRRNQRTALSYLVDRELISPGDRVHCNVTRDGRLVTW-------RGSITNE 273
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMG----QRRTT 504
I CDCC+ S+FEAH G + + +P +I+ +G +L D L Q+ T
Sbjct: 274 GFIKCDCCSNLFPISKFEAHTG-STKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETK 332
Query: 505 GG-------------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
+D +C VC G L+LC+ CP AFH +CL IP W
Sbjct: 333 ATEKKVNHNDNADSDTHGLDKNDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNW 392
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD 605
CP+C C++ +D D ++FD
Sbjct: 393 YCPSC-----------------------------CCKICGQVTYDF------DDQVSSFD 417
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCD--LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQT 663
+ + C QCE+ H+GC++ + + + I ++ WFC C IH LQ+
Sbjct: 418 T-SFVRCVQCEQNVHIGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNL------- 469
Query: 664 IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC-FEEKEKSLLSSATAIFRECFDP 722
+ K I ++ W ++K E + L+ A + + F P
Sbjct: 470 -------------LWKQIPVGDARENLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRP 516
Query: 723 IIAECGR-DLIP-VMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
+ + DLI V + R+ S + F G Y+ IL K+ VV+ +R++G EVAE+PLV
Sbjct: 517 VKDPITKNDLIEDVFLSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLV 576
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERL 836
AT +Y+ G + L + +E L + V+ L LPA +A + WTK FGF KM+ R L
Sbjct: 577 ATRLKYRRHGMCRRLLNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDL 636
Query: 837 LKYQ-RDFQLTIFKGTSMLEK 856
+KY FQ T+ +LEK
Sbjct: 637 IKYTFLGFQHTVRCQKDLLEK 657
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 116/472 (24%), Positives = 204/472 (43%), Gaps = 94/472 (19%)
Query: 426 LPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG-------------- 470
L +G ++Y+ K ++ G +GI+C CCN+ S + F+ HAG
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 471 ----MAARRQPYRHIYTS------NGMTLHDIAISLAMGQRRT--------TGGSDDMCH 512
+ ++Q + I + NG D AM + + +DD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VCGDG L+ C+ CP FH CL +PE W CP C +S RS+
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC---CCASCGRSL---------- 179
Query: 573 AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 632
+D + T ++YCDQCE+E+H C+ + +
Sbjct: 180 ------------------------YDPTIQT----EILYCDQCEREYHSNCVPGSAM--- 208
Query: 633 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 692
K D FC C +I L+ V + S + + +H +D + + +
Sbjct: 209 KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRSEH------YDQSEENSKL 262
Query: 693 QMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQ--EFGGM 749
+ + + L+ A + +ECF P+I D++ ++Y R + +F G
Sbjct: 263 ESVADL-------NTRLALALTVIQECFRPMIDPRSNIDMVSHILYNRRGEDKRMDFRGF 315
Query: 750 YSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVEN 809
Y+V+L + ++S +R+ G AE+P + T +Y+ +G + L + I+++L +L V+
Sbjct: 316 YTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQT 375
Query: 810 LVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 861
LVLPA + WT FGF+K++ + ++ + + F G+S+L+K + L
Sbjct: 376 LVLPAIAEFIETWTSAFGFQKLTAAQGIQLM-ELNIVTFPGSSVLQKPLTWL 426
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 121/434 (27%), Positives = 181/434 (41%), Gaps = 91/434 (20%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPY-RHIYTSNGMTLHDI-AISLAMGQRRTTG-- 505
GI+C+CC K +S S F+AHAG R + + TL + A S + RR+
Sbjct: 967 GILCNCCTKTLSISDFKAHAGCRLRLSSLGLFLQSGKSYTLCQVEAWSAELMSRRSDAYG 1026
Query: 506 -------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
+DD C CGDG LL C+ CP +H ACL +PE W C NC
Sbjct: 1027 RKVEAVDENDDTCGFCGDGGELLCCDNCPSTYHEACLSSQELPEGSWYCHNCT------- 1079
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
C C +E ++F D ++ C QC
Sbjct: 1080 -------------------CRSCGNPVNEK-----------EVSSFSD--ILKCLQCGDA 1107
Query: 619 FHVGCLRKNGL-CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H C+ + L D K D WFC C I L V + IN
Sbjct: 1108 YHNTCIDRVMLPSDGKR--SDTWFCGRYCKEIFMGLHSQVGVE---------NVIN---- 1152
Query: 678 EKGILFDGTMNDVQWQMLK---KAQCFEEKEK--------SLLSSATAIFRECFDPII-A 725
ND+ W +L+ Q +K + L+ A + ECF ++
Sbjct: 1153 ----------NDLSWTILRCNSDGQRLHSAQKIGLMTECNTKLAVALTLLEECFIRMVDP 1202
Query: 726 ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
G D+IP ++Y G N + ++ G Y+VIL ++ +R+ G + AELP +AT
Sbjct: 1203 RTGVDMIPHVLYNKGSNFARLDYKGFYTVILEKGDEILCVASIRLHGTKAAELPFIATSV 1262
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF 843
+Y+ +G + L IE++L S +VE LVL A + + W FGF+ + + K R+
Sbjct: 1263 DYRRQGMCRRLLDIIEKMLRSFHVEMLVLSAIPELVNTWVSGFGFKPIEDDE-KKQLRNV 1321
Query: 844 QLTIFKGTSMLEKK 857
L +F G S+L K+
Sbjct: 1322 NLMLFPGASLLTKR 1335
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 113/427 (26%), Positives = 188/427 (44%), Gaps = 74/427 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI C CC+K+ + +F AHAG + PYR++ +G+ + ++ +++
Sbjct: 904 DGIQCRCCSKDFALLEFVAHAGGQVSK-PYRNVLV-DGLDKDLLHCLISAWDKQSDSERQ 961
Query: 504 ------TGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
T G +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC NC
Sbjct: 962 SFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRCANC---- 1017
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQ 614
C C+ S+D + + ++ C Q
Sbjct: 1018 ----------------------CCKFCQE-----------HSNDDAPDIAEVDSLCTCSQ 1044
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKD--KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 672
CE+ +H C + + +P FC C + LQ+ ++ + P + I
Sbjct: 1045 CEENYHPVCSPETE--NPSSVPSQAGDLFCQQSCRLLFEELQNLLAVKKDLEPEFACRII 1102
Query: 673 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 731
H + + + + ++ +C S ++ A ++ ECF PII + G +L
Sbjct: 1103 KCIHED--------VPETALALDERVEC-----NSKIAVALSLMDECFLPIIDQRTGINL 1149
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
I +VY G N +F G Y IL +VSA +RI G + AE+P + T Y+ +G
Sbjct: 1150 IRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKCAEMPFIGTRNMYRRQG 1209
Query: 790 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 849
+ L IE +L SL V+ L++PA + WT KFGF + + + + +F
Sbjct: 1210 MCRRLLDGIEMILSSLKVQKLIIPAISELVDTWTSKFGFSPLEVSEKQEV-KSISMLVFP 1268
Query: 850 GTSMLEK 856
GT +L+K
Sbjct: 1269 GTGLLQK 1275
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 138/501 (27%), Positives = 210/501 (41%), Gaps = 109/501 (21%)
Query: 401 QKRTAEGGTKKRDND------LHRLLFLP------NGLPDGERLTYIVKGQRLRFGCKQG 448
Q R A GT +R+ + R L P N DG +T+ R
Sbjct: 198 QPRKAAKGTSRRNQRTALSYLVDRELISPGDRVHCNVTRDGRLVTW-------RGSITNE 250
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMG----QRRTT 504
I CDCC+ S+FEAH G + + +P +I+ +G +L D L Q+ T
Sbjct: 251 GFIKCDCCSNLFPISKFEAHTG-STKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETK 309
Query: 505 G-------------------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
+D +C VC G L+LC+ CP AFH +CL IP W
Sbjct: 310 ATEKKVNHNDNADSDTHGLDKNDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNW 369
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD 605
CP+C C++ +D D ++FD
Sbjct: 370 YCPSC-----------------------------CCKICGQVTYDF------DDQVSSFD 394
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCD--LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQT 663
+ + C QCE+ H+GC++ + + + I ++ WFC C IH LQ+
Sbjct: 395 T-SFVRCVQCEQNVHIGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNL------- 446
Query: 664 IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC-FEEKEKSLLSSATAIFRECFDP 722
+ K I ++ W ++K E + L+ A + + F P
Sbjct: 447 -------------LWKQIPVGDARENLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRP 493
Query: 723 IIAECGR-DLIP-VMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 779
+ + DLI V + R+ S + F G Y+ IL K+ VV+ +R++G EVAE+PLV
Sbjct: 494 VKDPITKNDLIEDVFLSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLV 553
Query: 780 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERL 836
AT +Y+ G + L + +E L + V+ L LPA +A + WTK FGF KM+ R L
Sbjct: 554 ATRLKYRRHGMCRRLLNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDL 613
Query: 837 LKYQ-RDFQLTIFKGTSMLEK 856
+KY FQ T+ +LEK
Sbjct: 614 IKYTFLGFQHTVRCQKDLLEK 634
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 146 bits (368), Expect = 6e-32, Method: Composition-based stats.
Identities = 72/114 (63%), Positives = 84/114 (73%)
Query: 745 EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCS 804
EFGGMY ILTV VVSA R+ G+EVAELPLVAT + QG+G FQAL++CIERLLC
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 805 LNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
L V +LVLPAAE AES+W KF F KM +E L RDFQ+ F+GTSML+K V
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPV 130
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 121/438 (27%), Positives = 188/438 (42%), Gaps = 78/438 (17%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL---HDIAISLA 497
L+ G +GI C CC + S+F+ HAG+ + P +++ G + A S+
Sbjct: 905 LKDGNITASGIQCQCCGTTFTMSKFKCHAGLR-QEVPSLNLFLDTGKSYSLCQLQAWSIE 963
Query: 498 MGQRR---------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
R+ +DD C CGDG L+ C+ CP ++H ACL IP+ W C
Sbjct: 964 QKVRKEHAKDTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCS 1023
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
+C C IC E D S
Sbjct: 1024 SCL--------------------------CDIC----GEVID---------SKELVTSLP 1044
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
+ C QCE+++HV C+ C+ E WFC C+ I+ + V +P
Sbjct: 1045 ALDCSQCERQYHVKCVSAKVPCN--EDGSGTWFCGRKCHEIYMTFRSRVG-----VP--- 1094
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSL-LSSATAIFRECFDPII-AE 726
H++ + F N+ ++ + E ++ L AT+I ECF PI+
Sbjct: 1095 ------DHMDDDLCFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECFLPILDPR 1148
Query: 727 CGRDLIPVMVYG-RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
G D+IP ++Y R+ + G Y+V+L +VS +R+ G +AE+PLVAT +E
Sbjct: 1149 TGIDIIPSILYNWRSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKEN 1208
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRD 842
+ +G + L IE +L SL VE L+L A WT FGFR++ ++RL K
Sbjct: 1209 RQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDKKRLSK---- 1264
Query: 843 FQLTIFKGTSMLEKKVQC 860
+L GT +L+K + C
Sbjct: 1265 VRLAAVPGTVLLKKDLLC 1282
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 151/560 (26%), Positives = 229/560 (40%), Gaps = 123/560 (21%)
Query: 352 SIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGG--T 409
++ SCS+ + P +C + + + + + R V RP Q R EG T
Sbjct: 224 TMASCSNNNNKRPAPPAACKTATSSKKNKKKKASLQQAR---VLRP---QPRNEEGNALT 277
Query: 410 KKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIV---CDCC----NK---- 458
R L LL L ++L Y K G G+G+V C C NK
Sbjct: 278 TARARTLLSLLIDKKILAPRDQLIYTTK-----RGLITGDGMVKCMCGGCINNNNKRRVA 332
Query: 459 EISPSQFEAHA----GMAARRQPYRHIYTSNGMTLHDIAISLAMG--------------- 499
E + ++F H ++ RQP+ ++ +G +L + L M
Sbjct: 333 EYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKKY 392
Query: 500 ----------QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
+R+ D +C VC D LL+C+ CP FH AC+ P+ W CP
Sbjct: 393 LPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPA 452
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV 609
C C IC S ++ + FS+ DR V
Sbjct: 453 C--------------------------TCAICGSSDLDDPPATTT-TQGFSS----DRMV 481
Query: 610 IYCDQCEKEFHVGCLRK--NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP-A 666
I C+QC +E+HVGC+R+ NGL + + W C + C++I+ L++ +A A
Sbjct: 482 ISCEQCRREYHVGCMRERDNGLWYPEADGEGPWLCSEACSKIYLRLEELAVVQAPCRSVA 541
Query: 667 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 725
S LS + + +G DG EE+E + L A + RECF +I
Sbjct: 542 SGLSLVV---LRRGAARDG----------------EEEEHAKLCMALDVLRECFVTLIEP 582
Query: 726 ECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
DL +V+ + +F G Y V L +++ LR++G EVAE+PLV T
Sbjct: 583 RTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRF 642
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT-KKFGFRKMSRERLLKYQRD 842
+ +G + L I++LL + VE LVLPA + + WT FGFR+M Q D
Sbjct: 643 ARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFREMG-------QAD 695
Query: 843 FQ------LTIFKGTSMLEK 856
Q + F+GT M K
Sbjct: 696 RQDVAHHAILRFQGTIMCHK 715
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/450 (26%), Positives = 194/450 (43%), Gaps = 104/450 (23%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA-------- 497
+ IVCDCC+ + + FE+HAG R +P I +G +L D A+S +
Sbjct: 36 DAIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSV 94
Query: 498 ---------------------MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLD 536
+ + + +D++C +CG G +L LC+ CP AFH CL
Sbjct: 95 VNENQKKNHSIVKENRKKNHCVVKENSEAKNDNVCSICGFGGDLALCDRCPSAFHLGCLG 154
Query: 537 PLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRS 596
+P W CP C C IC + + C+
Sbjct: 155 LNRVPIGEWFCPTC--------------------------CCKICYRPKCK----QECKD 184
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
H D ++ C QCE+++H GC++ G+ + + + WFC C + L+
Sbjct: 185 HK-------DNNILVCVQCEQKYHFGCVKAVGI-EFNHM--ENWFCSVVCGNMFLCLKKL 234
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE----KSLLSSA 712
+ + +++ W ++K ++KE +S L+ A
Sbjct: 235 LGKPIKV-----------------------ADNLTWTLVKNVSSVDDKEFNQKESKLNMA 271
Query: 713 TAIFRECFDPII-AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIF 769
+ E F+P A GR+LI +V+ R F G Y+VIL V+S +RI+
Sbjct: 272 LGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIY 331
Query: 770 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 829
G++VAE+ VAT +Y+ +G L IE+ L L VE L+L ++E A +IWTK FGF
Sbjct: 332 GQKVAEVVFVATKEQYRRQGICHLLMDEIEKQLTRLGVEKLLLHSSEDAMNIWTKSFGFA 391
Query: 830 KMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
+M+ + ++ D F+ ++M K ++
Sbjct: 392 RMASKDKCQFI-DHTFLEFQNSTMCLKALK 420
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 164/367 (44%), Gaps = 71/367 (19%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C +CGDG L+ C+ CP ++H ACL IP+ W C +CR
Sbjct: 1000 NDDTCGLCGDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR--------------- 1044
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+V G V+ P H F C QCE+++H+ C+
Sbjct: 1045 ------CDVCGEVVSSKEPRTPL-------HAFE-----------CSQCERQYHIKCISG 1080
Query: 627 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG----IL 682
LC+ + P WFC C +I+ +L+ V IP H++ G IL
Sbjct: 1081 KVLCNEESGPG-TWFCGRRCQQIYTSLRSRVG-----IP---------DHLDDGFSCTIL 1125
Query: 683 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG--R 739
+ D + +M E L+ A +I ECF PI G D++P+++Y
Sbjct: 1126 HNN--GDQKVRMAADIALLAECNMKLII-ALSILEECFLPIFDPRTGMDIMPLILYNWRS 1182
Query: 740 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 799
N ++ G Y+++L ++S +R+ G VAE+PL+ATC E + +G + + IE
Sbjct: 1183 NFVHLDYKGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGMCRRIVDYIE 1242
Query: 800 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLEK 856
++L SL VE L+L A WT FGFR + +++L K +L GT +L+K
Sbjct: 1243 QMLKSLKVEMLLLSAIPSLVDTWTSAFGFRPIEDCDKKKLSK----IRLASVPGTVLLKK 1298
Query: 857 KVQCLPE 863
+ E
Sbjct: 1299 DLHEFSE 1305
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/455 (26%), Positives = 189/455 (41%), Gaps = 86/455 (18%)
Query: 434 YIVKGQR---LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLH 490
Y +KG L G +GI C+CC K S S F+AH +P +++ NG +L
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHVTGNNICRPAENLFLGNGKSLV 108
Query: 491 DIAISLAMGQ----------RRTTGGS----------------DDMCHVCGDGENLLLCN 524
+ L + R T GS D +C +C G +L+ C+
Sbjct: 109 SCQVELMRKKIMRFNQEPVVRATGTGSRSKFRLLAPLGSENCNDYVCSICHYGGDLICCD 168
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLS 584
CP +FHA CL+ +PE W
Sbjct: 169 RCPSSFHATCLNIERVPEGDW-------------------------------------FC 191
Query: 585 PSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 644
P I D A F D +++ C QCE++FH C ++ G+ +E WFCC
Sbjct: 192 PCCCCGICGDSQFDKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCK 247
Query: 645 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK 704
C + LQ + + + I + + V + + A+ +
Sbjct: 248 TCEMMQWGLQQLLGKPILVGHNLTCTLIKPMQYQ-------AEDRVDYDLAAMAENY--- 297
Query: 705 EKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVV 761
S LS A + ECFDP+ + RDL+ +++ G N++ F G Y+V+L ++
Sbjct: 298 --SKLSVALEVMHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELI 355
Query: 762 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 821
+ LLRI+G +VAE+PL+ T +++ G + L + IE+ L +L V+ LVLPA+ +
Sbjct: 356 AVALLRIYGDKVAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNT 415
Query: 822 WTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
WT FGF M+ L + F F T M +K
Sbjct: 416 WTTSFGFTPMTESDRLDFL-GFTFLDFHDTIMCKK 449
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 108/427 (25%), Positives = 181/427 (42%), Gaps = 76/427 (17%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L H+ + Q +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC------- 692
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
C C + H+ S ++ C CE+
Sbjct: 693 -------------------SCKFCEKDEAAK--------HETSTLP----SLSSCRLCEE 721
Query: 618 EFHVGCLRKNGLCDLKEIPKDKW---FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 674
++H C+ ++G +P ++ FC C + LQ F+ + S S + R
Sbjct: 722 KYHQACINQDGT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRR 776
Query: 675 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIP 733
+ + D+ ++ A+ ++ A ++ ECF P++ G +L+
Sbjct: 777 FELPSEV----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQ 823
Query: 734 VMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 791
+VY G N +F + +L +++ +RI G ++AE+P + T Y+ +G
Sbjct: 824 NIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMC 883
Query: 792 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 851
+ L IE L SL V+ LV+PA + WT FGF ++ + K ++ L +F G
Sbjct: 884 RRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVN-DSEKKTIKNLNLLVFPGV 942
Query: 852 SMLEKKV 858
ML K +
Sbjct: 943 DMLGKSL 949
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 126/440 (28%), Positives = 188/440 (42%), Gaps = 75/440 (17%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G GI CDCC+K ++ ++FE HAG + +QPY +I+ +G ++L +
Sbjct: 766 KGTRARLVGRVTREGICCDCCSKILTVAKFELHAG-SKEQQPYANIFLEDGGLSLFQCLL 824
Query: 495 SL--AMGQRRTTG---------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
+ Q G DD C +CGDG +LL C+ C FH ACL + +P
Sbjct: 825 NAWDKQAQNEKKGFYKIDPADDPDDDTCGICGDGGDLLCCDRCTSTFHVACLG-IEMPSG 883
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAAT 603
W C NC IC + C S + T
Sbjct: 884 DWFCRNC-----------------------------IC----------KFCGSAE--ERT 902
Query: 604 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRA 661
++ C QC +++H C + + P FC C +I+ L+ + +
Sbjct: 903 SSPAELLSCLQCSRKYHQVCAQGIEREFVSTTPSASIDCFCSPGCTKIYKRLKRLLGLKN 962
Query: 662 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFD 721
S S + D + KKAQ K+ L A ++ ECF
Sbjct: 963 DLEAGFSWSLVR-------CFADTEATSTK----KKAQLVHCNSKTAL--AFSVLDECFL 1009
Query: 722 PIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPL 778
P I E G ++I +VY G + S F G Y+ IL V+SA +RI G + AE+P
Sbjct: 1010 PRIDERSGINIIHNVVYNCGSDFSRLNFSGFYTFILERGDEVISAATVRIHGTDFAEMPF 1069
Query: 779 VATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLK 838
+ T Y+ +G L IE LCSLNV LV+PA + ++ W+ FGF+ + + K
Sbjct: 1070 IGTRGMYRHQGMCHRLLDAIESALCSLNVRRLVIPAIPELQNTWSTVFGFKPVGPTKKQK 1129
Query: 839 YQRDFQLTIFKGTSMLEKKV 858
+ L I GT +LEK++
Sbjct: 1130 I-KSVNLLIIHGTGLLEKRL 1148
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 227 ACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLE 285
A VKY++ R +L G V G C C C SK+++ +FE HAG+K + P +I+LE
Sbjct: 758 AKVKYMNKKGTRARLVGRVTREGICCDC--C--SKILTVAKFELHAGSKEQQPYANIFLE 813
Query: 286 NG 287
+G
Sbjct: 814 DG 815
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 115/450 (25%), Positives = 194/450 (43%), Gaps = 104/450 (23%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA-------- 497
+ IVCDCC+ + + FE+HAG R +P I +G +L D A+S +
Sbjct: 36 DAIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSV 94
Query: 498 ---------------------MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLD 536
+ + ++ +D++C +CG G +L LC+ CP AFH CL
Sbjct: 95 VNENQKKNHSIVKENRKKNHCVVKEKSEANNDNVCSICGFGGDLALCDRCPSAFHLGCLG 154
Query: 537 PLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRS 596
+P W CP C C IC + + C+
Sbjct: 155 LNRVPIGEWFCPTC--------------------------CCKICYRPKCK----QECKD 184
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 656
H D ++ C QCE+++H GC++ G+ + + + WFC C + L+
Sbjct: 185 HK-------DNNILVCVQCEQKYHFGCVKAVGI-EFNHM--ENWFCSVVCGNMFLCLKKL 234
Query: 657 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE----KSLLSSA 712
+ + +++ W ++K ++KE +S L+ A
Sbjct: 235 LGKPIKV-----------------------ADNLTWTLVKNVSSVDDKEFNQKESKLNMA 271
Query: 713 TAIFRECFDPII-AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIF 769
+ E F+P A GR+LI +V+ R F G Y+VIL V+S +RI+
Sbjct: 272 LGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIY 331
Query: 770 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 829
G++VAE+ VAT +Y+ +G L IE+ L L VE L+L ++E A + WT+ FGF
Sbjct: 332 GQKVAEVVFVATKEQYRRQGMCHLLMDEIEKQLTRLGVEKLLLHSSEDAMNTWTRSFGFA 391
Query: 830 KMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
+M+ + ++ D F+ ++M K ++
Sbjct: 392 RMASKDKCQFI-DHTFLEFQNSTMCLKALK 420
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 111/416 (26%), Positives = 178/416 (42%), Gaps = 87/416 (20%)
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS-NGMTLHDI---- 492
G+++ G G+GI C CCN I+ S+F+ HAG R PY+ I+ S G++L
Sbjct: 179 GEQILQGVLTGDGIWCSCCNTVITVSEFQLHAGDEPNR-PYQRIFISETGLSLLTCQAEA 237
Query: 493 ----------AISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPE 542
L + + DD C VC DG NL+ C+ CP +H +CL P+
Sbjct: 238 WNQQGIPELQGYHLIEPREDVSDKYDDACVVCADGGNLICCDKCPSTYHISCLQMEDEPQ 297
Query: 543 SGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAA 602
WRCP C C + C +H F
Sbjct: 298 GEWRCPAC--------------------------AC-------------KFCHTHAF--- 315
Query: 603 TFDDRTVIYCDQCEKEFHVGCLRKNG--LCDLK-EIPKDKW-FCCDDCNRIHAALQDFVS 658
D +V C QC+K++H C R+N L DL + P FC C++I+ L+ V
Sbjct: 316 ---DISVFTCSQCDKKYHWECFRENEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVG 372
Query: 659 NRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRE 718
R + + + + R E G+ + + + + C S ++ A A+ E
Sbjct: 373 VRNELDEGLTWTLLRRMDPEAGVYLEESYD--------RTLC-----NSKIAVAVAVMEE 419
Query: 719 CFDPIIAECGR-DLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAE 775
CF+P+I + +++ ++Y G N F G Y+ IL +S +RI G ++AE
Sbjct: 420 CFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVASMRIHGNKLAE 479
Query: 776 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 831
+P +AT Y+ G L IE +V+ LV+P+ E+ W + +GF+ +
Sbjct: 480 MPFIATRPSYRRLGMCHKLLVAIE------SVQYLVIPSIEQRVRRWEESYGFQAI 529
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 111/408 (27%), Positives = 175/408 (42%), Gaps = 67/408 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC + + S FE HA + R +I+ +G +L + + +++
Sbjct: 635 GIKCSCCRRIFTISGFEVHANGGSCRAAA-NIFLDDGRSLLECQVEAYETRKKAQPPDIL 693
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D +C VC G L+LC+GCP AFHA CL +P+ W C +C G
Sbjct: 694 KMKLRQGENDVICSVCHYGGKLILCDGCPSAFHANCLGLEEVPDGDWFCESCCCGACGQF 753
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
K E E F I C QCE +
Sbjct: 754 FLKATSKYAKE-----------------EKF--------------------ISCKQCELK 776
Query: 619 FHVGCLRKNGLCD-LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CLR +G D L +KWFC DC I L + + + + + L+ +
Sbjct: 777 YHPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELI-GKPREVGVEKLTWRLVQSF 835
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
E + D + + + + + C LS A + E F+P+ GRDL ++
Sbjct: 836 EPNMYGD---DAYKIEAVAENHC-------KLSVALDVMHELFEPVKRPHGGRDLAEDVI 885
Query: 737 YGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
+ R F G Y+V+L +V+ +RI G++VAE+P + T +++ +G + L
Sbjct: 886 FSRWSKFKRLNFSGFYTVLLERNEELVTVATVRILGKKVAEMPFIGTRFQHRQRGMCRVL 945
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKY 839
+ +E++L L VE LVLPA + W FGF KM+ R+ LK+
Sbjct: 946 INELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKEFLKF 993
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 187/422 (44%), Gaps = 81/422 (19%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQ--PYRHIYTSNG--MTLHDI-AIS 495
L+ G GI C CC+ + S F+ HAG+ RQ P +++ +G TL + A S
Sbjct: 915 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGL---RQEIPSLNLFLGSGKSYTLCQLQAWS 971
Query: 496 LAMGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
+ R+ +DD C +CGDG L+ C+ CP ++H CL IP+ W
Sbjct: 972 IEHKARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWY 1031
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDD 606
C C C IC E +++ RS
Sbjct: 1032 CYRCL--------------------------CDIC----GEVINLKELRS---------S 1052
Query: 607 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 666
+ C QCE+++H C+ LC+ + P WFC C +I+ L+ V TI
Sbjct: 1053 LPALECAQCERQYHAKCIYGKLLCNEEGGPC-AWFCGRRCQQIYMNLRSRVGIPIHTIDG 1111
Query: 667 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 725
S T+ R + ++ + T D+ A+C + L A +I ECF PII A
Sbjct: 1112 FS-CTVLRNNGDQRV---STAADIAIL----AECNMK-----LVIALSIMEECFLPIIDA 1158
Query: 726 ECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
G D+IP ++Y + ++ G Y+V+L ++S +R+ G VAE+PL+ATC
Sbjct: 1159 RTGIDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCL 1218
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF--------RKMSRER 835
E + +G + L IE++L SL VE L+L A WT FGF + +SR R
Sbjct: 1219 ENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLR 1278
Query: 836 LL 837
L+
Sbjct: 1279 LV 1280
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 187/422 (44%), Gaps = 81/422 (19%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQ--PYRHIYTSNG--MTLHDI-AIS 495
L+ G GI C CC+ + S F+ HAG+ RQ P +++ +G TL + A S
Sbjct: 966 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGL---RQEIPSLNLFLGSGKSYTLCQLQAWS 1022
Query: 496 LAMGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
+ R+ +DD C +CGDG L+ C+ CP ++H CL IP+ W
Sbjct: 1023 IEHKARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWY 1082
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDD 606
C C C IC E +++ RS
Sbjct: 1083 CYRCL--------------------------CDIC----GEVINLKELRS---------S 1103
Query: 607 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 666
+ C QCE+++H C+ LC+ + P WFC C +I+ L+ V TI
Sbjct: 1104 LPALECAQCERQYHAKCIYGKLLCNEEGGPC-AWFCGRRCQQIYMNLRSRVGIPIHTIDG 1162
Query: 667 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 725
S T+ R + ++ + T D+ A+C + L A +I ECF PII A
Sbjct: 1163 FS-CTVLRNNGDQRV---STAADIAIL----AECNMK-----LVIALSIMEECFLPIIDA 1209
Query: 726 ECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
G D+IP ++Y + ++ G Y+V+L ++S +R+ G VAE+PL+ATC
Sbjct: 1210 RTGIDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCL 1269
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF--------RKMSRER 835
E + +G + L IE++L SL VE L+L A WT FGF + +SR R
Sbjct: 1270 ENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLR 1329
Query: 836 LL 837
L+
Sbjct: 1330 LV 1331
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 112/433 (25%), Positives = 178/433 (41%), Gaps = 101/433 (23%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARR-------QPYRHIYTSNGMTLHDIAISLAMGQR 501
+GI C CC K + S FE HAG+ + R +P I+ +G +L + I + +
Sbjct: 618 DGIKCSCCGKVYTLSGFELHAGIKSCRSGASKYCKPAASIFLDDGRSLLECQIQMMRDKE 677
Query: 502 RTT---------------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
+ G+D +C VC G L+LC+ CP +FH CL +P+ W
Sbjct: 678 MSNHKAETPDSLKGSWDRDGNDHVCSVCHYGGELILCDHCPSSFHKRCLGMKDVPDGDWF 737
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDD 606
CP+C C IC + + + DF
Sbjct: 738 CPSC--------------------------CCKICGQNKLKK------DTKDFIDG---- 761
Query: 607 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 666
V+ C QCE ++H+ CL + K+ PK+ FC C + ++ + + A
Sbjct: 762 --VLNCTQCEHQYHIMCLSNSWTDKWKDHPKENSFCSKKC-------EVYMQSDQHKLDA 812
Query: 667 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPI-IA 725
T+ + S L A + ECF+PI
Sbjct: 813 FDDETLVETY------------------------------SKLKIALDVVHECFEPIEEP 842
Query: 726 ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 783
GRDL+ +++ G ++ F G Y+++L +VS +RI G +VAE+PLV T
Sbjct: 843 RTGRDLMKDVIFSNGSELNRLNFQGFYTILLEKNDELVSVATVRIHGDKVAEIPLVGTRF 902
Query: 784 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF 843
+++ G + L +E+ L L V+ LVLPA + WT FGF KM+ L++ D+
Sbjct: 903 QFRQLGMCRILMDVLEKKLMELGVQRLVLPAVPGVLNTWTGSFGFSKMTDSERLQFV-DY 961
Query: 844 QLTIFKGTSMLEK 856
F+ T M +K
Sbjct: 962 TFLDFQDTVMCQK 974
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/458 (25%), Positives = 191/458 (41%), Gaps = 92/458 (20%)
Query: 434 YIVKGQR---LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLH 490
Y +KG L G +GI C+CC K S + F+AH +P +++ NG +L
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLTGFQAHVTGNNICRPAENLFLGNGKSLV 108
Query: 491 DIAISLAMGQ----------RRTTGGS----------------DDMCHVCGDGENLLLCN 524
+ L + R GS D +C +C G +L+ C+
Sbjct: 109 SCQVELMRKKIMMFDQGPAVRAAGTGSRSKFRSLAPLGSENCNDYVCSICHYGGDLICCD 168
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLS 584
CP +FHAACL+ +PE W
Sbjct: 169 RCPSSFHAACLNIESVPEGDW-------------------------------------FC 191
Query: 585 PSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 644
P I D A F D +++ C QCE++FH C ++ G+ +E WFCC
Sbjct: 192 PCCCCGICGDSQFDKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCK 247
Query: 645 DCNRIHAALQDFVSNR---AQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCF 701
C + LQ + Q + + + + + ++ + + A+ +
Sbjct: 248 TCEMMQWGLQQLLGKPILVGQNLTCTLIKPMQYQAEDRE----------DYDLAAMAENY 297
Query: 702 EEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKS 758
S LS A + ECFDP+ + RDL+ +++ G N++ F G Y+V+L
Sbjct: 298 -----SKLSVALEVMHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERND 352
Query: 759 VVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKA 818
+++ LLRI+G +VAE+PL+ T +++ G + L + IE+ L +L V+ LVLPA+
Sbjct: 353 ELIAVALLRIYGDKVAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSV 412
Query: 819 ESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
+ WT FGF M+ L + F F T M +K
Sbjct: 413 LNTWTTSFGFTPMTESDRLDFL-GFTFLDFHDTIMCKK 449
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 114/425 (26%), Positives = 177/425 (41%), Gaps = 97/425 (22%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC + + + FE HA A +I+ +G +L + + +++
Sbjct: 626 GIKCSCCRRVFTINGFEVHAN-GASCSGAANIFLDDGRSLLECQVEAYKKRKKAQPPDML 684
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D C VC G L+LC+GCP AFHA CL +P+ W C +C
Sbjct: 685 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSC-------- 736
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
C C F ++ ++ + I C QCE +
Sbjct: 737 ------------------CCGAC-----GQFFLKT------TSTNAKEEKFISCKQCELK 767
Query: 619 FHVGCLRKNGLCD-LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H CLR +G CD L +I +KWFC DC + + + A I A++
Sbjct: 768 YHPSCLRYDGACDSLDKILGEKWFCSKDCE--ESLEPNMYGDDASKIEAAA--------- 816
Query: 678 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 736
+ +LS A + E F+P+ GRDL ++
Sbjct: 817 --------------------------ENHCILSVALDVMHELFEPVKRPHGGRDLAEDVI 850
Query: 737 YGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
+ R F G Y+V+L + +VS +RI G++VAE+P + T +++ +G + L
Sbjct: 851 FSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVL 910
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQRDFQLTIFKGT 851
+ +E++L L VE LVLPA + W FGF KM+ R+ LK F L F T
Sbjct: 911 INELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK----FTLLEFGRT 966
Query: 852 SMLEK 856
+ EK
Sbjct: 967 ILCEK 971
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 116/429 (27%), Positives = 178/429 (41%), Gaps = 79/429 (18%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT-----LHDIAISLAM----GQ 500
GI C CC+K ++ +FE HAG + ++PY +I+ G L D + G
Sbjct: 665 GIFCGCCSKILTVGKFELHAG-SKEKKPYANIFLEGGKVSLLQCLLDAWEKHTLCENKGF 723
Query: 501 RRTTGGSD---DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ G D D C +CGDG +L+ C+ C FH CL + +P W C +C
Sbjct: 724 YKIDKGEDEHDDTCAICGDGGDLVCCDHCASTFHLDCLG-IKLPSGDWYCRSC------- 775
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDF--SAATFDDRTVIYCDQC 615
LCR F + ++ C QC
Sbjct: 776 -----------------------------------LCRFCGFPQEKPSSSPELLLSCLQC 800
Query: 616 EKEFHVGCLRKNGL---CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 672
+++H C G C + D FC C +I+ L + + S S +
Sbjct: 801 SRKYHQTCSSGTGTDFDCTIPGTSID-CFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLV 859
Query: 673 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 731
+ F K AQC S ++ A + ECF P I E G ++
Sbjct: 860 H--------CFPNDQAMPPKNKEKMAQC-----NSKIALAFTVLDECFQPHIDERSGINM 906
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
I + Y G + S +F G Y+ IL V+SA +RI G ++AE+P + T Y+ +G
Sbjct: 907 IHNVAYNCGSDFSRLDFSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTRGMYRHQG 966
Query: 790 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 849
+ L + IE LCSLNV+ LV+ A + E+ WT FGF+ + + + + L I
Sbjct: 967 MLRRLLNGIESALCSLNVQKLVVSAVTEMENTWTTVFGFKPVQPSKKQRI-KSLNLLIMN 1025
Query: 850 GTSMLEKKV 858
GT +LEK++
Sbjct: 1026 GTGLLEKRL 1034
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 7/85 (8%)
Query: 207 VPNEYPTNVKK-LLSTGIL-DGACVKYISTSRERQ-LDGIVNGGGYLCGCPLCNFSKVVS 263
+P E+ V ++ G++ + VKY++ +R R+ L G + G CGC C SK+++
Sbjct: 621 IPYEWKRTVYSWMIDLGVVSEDMQVKYMNNNRTREMLAGKITREGIFCGC--C--SKILT 676
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGK 288
+FE HAG+K + P +I+LE GK
Sbjct: 677 VGKFELHAGSKEKKPYANIFLEGGK 701
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 123/439 (28%), Positives = 188/439 (42%), Gaps = 83/439 (18%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT-----LHDIAIS 495
L+ G +GI C CC + S+F+ HAG+ + P +++ G + L +I
Sbjct: 894 LKDGNITTSGIRCRCCGTTFTMSKFKCHAGLR-QEVPSLNLFLGTGKSYSLCLLQAWSIE 952
Query: 496 LAMGQRRT-------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+ + R +DD C CGDG L+ C+ CP ++H ACL IP+ W C
Sbjct: 953 QKVRKERVKDTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLPCQDIPDGNWYCS 1012
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
+C + S +L+ L A
Sbjct: 1013 SCLCNICGEVITSKELRTSLPA-------------------------------------- 1034
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA-- 666
+ C QCE+++HV C+ C+ E WFC C +I+ + V +P
Sbjct: 1035 -LECSQCERQYHVKCVSAKVSCN--EDGPGTWFCGRKCQQIYMIFRSRVG-----VPDHV 1086
Query: 667 -SSLS-TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
+ LS TI R + +K + G ++ A+C + L A +I ECF PI+
Sbjct: 1087 DNDLSCTILRNNGDKKVRTAG-------EIALMAECNMK-----LMIALSIMEECFLPIL 1134
Query: 725 -AECGRDLIPVMVYGRNISGQEFG--GMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 781
G D+IP ++Y F G Y+V+L +VS +R+ G VAE+PLVAT
Sbjct: 1135 DPRTGIDIIPSILYNWRSDFIHFNHKGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVAT 1194
Query: 782 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM--SRERLLKY 839
E + +G + L IE +L SL VE L+L A WT FGFR++ S ++ L
Sbjct: 1195 STENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLVETWTSTFGFREIDDSDKKRLSM 1254
Query: 840 QRDFQLTIFKGTSMLEKKV 858
R L GT +L+K +
Sbjct: 1255 VR---LAAVPGTVLLKKNL 1270
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 139 bits (349), Expect = 8e-30, Method: Composition-based stats.
Identities = 86/278 (30%), Positives = 141/278 (50%), Gaps = 30/278 (10%)
Query: 593 LCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
LC +F F D+TV+ CDQCEKE+H+GCLRK+ + D++ +P+ +WFC D+C RI
Sbjct: 1788 LCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRKHKMVDMQAVPEGEWFCSDECVRIREL 1847
Query: 653 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 712
L K +E+G +WQ ++ + ++ L +
Sbjct: 1848 LT--------------------KSLEEGETTMSGNPAYRWQFIRGRDGTKATARA-LKTV 1886
Query: 713 TAIFRECFDPIIAE-CGRDLIPVMVYGRNISGQEFGGMYSVILTVKSV-------VVSAG 764
I +E FDPII G DL+P MV+ + +F GMYS++L + V AG
Sbjct: 1887 LEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQGMYSILLRYRGADKEARGRPVLAG 1946
Query: 765 LLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTK 824
L+R+ G +AE+PLVAT + + +G +AL + L +L V +VLPA A W +
Sbjct: 1947 LVRVLGSSMAEVPLVATRYDCRRQGHLRALVEGLRHRLIALGVRAMVLPATADALPAW-R 2005
Query: 825 KFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLP 862
+ F + + + + ++ IF TS++ +++ +P
Sbjct: 2006 QLAFMDLDEGSVRVARGEHRMIIFPHTSVVVRQLIRVP 2043
Score = 79.0 bits (193), Expect = 1e-11, Method: Composition-based stats.
Identities = 40/77 (51%), Positives = 48/77 (62%), Gaps = 6/77 (7%)
Query: 423 PNGLPDGERLTYIVKGQRLRFGC------KQGNGIVCDCCNKEISPSQFEAHAGMAARRQ 476
P+ L DGER+ Y ++GQRL G G+GI+CDCC+K IS S FE+HAG RR
Sbjct: 1532 PDSLQDGERVHYTIQGQRLLSGTVVIVQRTAGSGILCDCCSKVISASAFESHAGHKHRRN 1591
Query: 477 PYRHIYTSNGMTLHDIA 493
PY I T NG TL IA
Sbjct: 1592 PYESIMTDNGTTLKRIA 1608
Score = 39.3 bits (90), Expect = 8.5, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 50/116 (43%), Gaps = 16/116 (13%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTG---ILDGACVKYISTSRERQLDGIV------NGGG 248
M ++ P + L G + DG V Y + +R L G V G G
Sbjct: 1507 MRASAARTSASQRRPIRSRTLFDGGPDSLQDGERVHY-TIQGQRLLSGTVVIVQRTAGSG 1565
Query: 249 YLCGCPLCNFSKVVSAHEFEQHAGAK-TRHPNNHIYLENGKPIYSIIQELKTAPLG 303
LC C C SKV+SA FE HAG K R+P I +NG + I + + T PL
Sbjct: 1566 ILCDC--C--SKVISASAFESHAGHKHRRNPYESIMTDNGTTLKRIAERI-TPPLA 1616
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 113/428 (26%), Positives = 183/428 (42%), Gaps = 71/428 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI CDCC + + FE+HAG + QP ++I NG +L + Q +
Sbjct: 17 DGIQCDCCGETFAILDFESHAG-SKSCQPLKNICLENGHSLLQCQLESWNKQDESDRKGF 75
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG NL+ C+ CP FH +CL+ P W C C
Sbjct: 76 HFVDTDDQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNCTYC------ 129
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C C ++ + C+ + A ++ C CE
Sbjct: 130 --------------------SCKFCGMAGGDT-----CQMDENDTAA--QPALLACCLCE 162
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKH 676
+++H C+ + + FC C ++ LQ + + + + + + R
Sbjct: 163 EKYHHSCILAEN--TVNDGYSSVSFCGKKCQELYDKLQALLGVKHEMEEGFAWTLVRRFD 220
Query: 677 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVM 735
+ I G M +K +C S ++ A I ECF P+ G +LI +
Sbjct: 221 VGSDISLSG--------MHRKVEC-----NSKVAVALHIMDECFLPMPDHRSGVNLIRNI 267
Query: 736 VY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
VY G N + + G + IL ++SA +RI G +AE+P + T Y+ +G +
Sbjct: 268 VYNFGSNFNRLNYSGFLTAILERGDEIISAASIRIHGNHLAEMPFIGTRHMYRRQGMCRR 327
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKG 850
L S IE LCSLNVE LV+PA + WT FGF+ + S++++ R+ ++ F G
Sbjct: 328 LLSAIETALCSLNVEKLVIPAISELRETWTSVFGFKPLEGSSKQKM----RNMKMVAFPG 383
Query: 851 TSMLEKKV 858
ML+K +
Sbjct: 384 IDMLQKPL 391
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/453 (26%), Positives = 191/453 (42%), Gaps = 101/453 (22%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G +GI C CC+K ++ ++FE HAG + +QPY +I+ +G TL +
Sbjct: 649 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYENIFLEDGGATLSQCLV 707
Query: 495 SLAMGQRRT-----------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
Q ++ DD C +CGDG +LL C+ CP FH ACL + +P
Sbjct: 708 DAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLG-IKMPSG 766
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAAT 603
W C +C ICR S + + + ++
Sbjct: 767 DWHCSSC-----------------------------ICRFCGS---------TQEITTSS 788
Query: 604 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRA 661
+ ++ C QC +++H C +K FC C +I+ L+ + +
Sbjct: 789 AE---LLSCLQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK- 844
Query: 662 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE-------------KSL 708
IE G W +++ CF +K S
Sbjct: 845 -------------NAIEAGF---------SWSLVR---CFPDKLAAPPKGKAHLIHCNSK 879
Query: 709 LSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGL 765
+ A ++ ECF P I E G ++I ++Y G + + F Y+ IL V+SA
Sbjct: 880 TAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAA 939
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
+RI G ++AE+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT
Sbjct: 940 VRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTV 999
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
FGF+ + + K + + I GT +LEK++
Sbjct: 1000 FGFKPVEPSKRQKI-KSLNILIIHGTGLLEKRL 1031
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 45/72 (62%), Gaps = 6/72 (8%)
Query: 218 LLSTGIL-DGACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKT 275
++ G++ + A VKY++ R +L+G + G CGC C SK+++ +FE HAG+K
Sbjct: 631 MIDMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGC--C--SKILTVAKFELHAGSKE 686
Query: 276 RHPNNHIYLENG 287
+ P +I+LE+G
Sbjct: 687 QQPYENIFLEDG 698
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/453 (26%), Positives = 191/453 (42%), Gaps = 101/453 (22%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G +GI C CC+K ++ ++FE HAG + +QPY +I+ +G TL +
Sbjct: 648 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYENIFLEDGGATLSQCLV 706
Query: 495 SLAMGQRRT-----------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
Q ++ DD C +CGDG +LL C+ CP FH ACL + +P
Sbjct: 707 DAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLG-IKMPSG 765
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAAT 603
W C +C ICR S + + + ++
Sbjct: 766 DWHCSSC-----------------------------ICRFCGS---------TQEITTSS 787
Query: 604 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRA 661
+ ++ C QC +++H C +K FC C +I+ L+ + +
Sbjct: 788 AE---LLSCLQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK- 843
Query: 662 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE-------------KSL 708
IE G W +++ CF +K S
Sbjct: 844 -------------NAIEAGF---------SWSLVR---CFPDKLAAPPKGKAHLIHCNSK 878
Query: 709 LSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGL 765
+ A ++ ECF P I E G ++I ++Y G + + F Y+ IL V+SA
Sbjct: 879 TAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAA 938
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
+RI G ++AE+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT
Sbjct: 939 VRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTV 998
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
FGF+ + + K + + I GT +LEK++
Sbjct: 999 FGFKPVEPSKRQKI-KSLNILIIHGTGLLEKRL 1030
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 45/72 (62%), Gaps = 6/72 (8%)
Query: 218 LLSTGIL-DGACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKT 275
++ G++ + A VKY++ R +L+G + G CGC C SK+++ +FE HAG+K
Sbjct: 630 MIDMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGC--C--SKILTVAKFELHAGSKE 685
Query: 276 RHPNNHIYLENG 287
+ P +I+LE+G
Sbjct: 686 QQPYENIFLEDG 697
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 115/472 (24%), Positives = 182/472 (38%), Gaps = 112/472 (23%)
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGS 507
G+GI C CCN+ ++ S F AHAG + + P ++IYT +L + Q + S
Sbjct: 846 GDGIHCGCCNEIVTISDFGAHAG-SKQSDPLKNIYTEEETSLLQCLLDSWNKQDESELKS 904
Query: 508 ------------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
DD C VCGDG +L+ C+GCP FH +CLD P W C C
Sbjct: 905 FHFFDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAYC----- 959
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
C CRL + + F ++ C C
Sbjct: 960 ---------------------CCKFCRL-------VGGSSNQSVVNDEFTMPALLTCHLC 991
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 675
E++FH+ C+ NG + KD FC + C + L+ + + + S S I R
Sbjct: 992 EEKFHISCVEANG--GKTDDSKDALFCGNKCQELSERLEMLLGVKHEIEDGFSWSFIRRS 1049
Query: 676 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPV 734
+ D+ + +C S L+ A +I ECF P I G +L+
Sbjct: 1050 DV---------GCDLSLTNPQLVEC-----NSKLAVALSIMNECFMPYIDHRSGTNLLRS 1095
Query: 735 MVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQ 792
++Y G N ++ G +VIL ++ +R+ G +AE+P + T Y+ +G +
Sbjct: 1096 ILYNCGSNFKRLDYSGFITVILERGDEIICVASIRVHGNRLAEMPYIGTRYMYRRQGMCR 1155
Query: 793 ALFSCIE----------------------------------------------RLLCSLN 806
L + IE + L SL+
Sbjct: 1156 RLLNAIESEAVYGLSVNLGACPMVVTVRVCNYSMNGSRGFVFDVLRGNDWGGPKALSSLD 1215
Query: 807 VENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
VE LV+PA + WT FGF + ++ + + L +F +L+KK+
Sbjct: 1216 VELLVIPAISELRETWTSVFGFEPL-KQTSKQITNNMNLLVFPHVDLLQKKI 1266
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 119/453 (26%), Positives = 191/453 (42%), Gaps = 101/453 (22%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G +GI C CC+K ++ ++FE HAG + +QPY +I+ +G TL +
Sbjct: 226 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYENIFLEDGGATLSQCLV 284
Query: 495 SLAMGQRRT-----------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
Q ++ DD C +CGDG +LL C+ CP FH ACL + +P
Sbjct: 285 DAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLG-IKMPSG 343
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAAT 603
W C +C ICR S + + + ++
Sbjct: 344 DWHCRSC-----------------------------ICRFCGS---------TQEITTSS 365
Query: 604 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRA 661
+ ++ C QC +++H C +K FC C +I+ L+ + +
Sbjct: 366 AE---LLSCLQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK- 421
Query: 662 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE-------------KSL 708
IE G W +++ CF +K S
Sbjct: 422 -------------NAIEAGF---------SWSLVR---CFPDKLAAPPKGKAHLIHCNSK 456
Query: 709 LSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGL 765
+ A ++ ECF P I E G ++I ++Y G + + F Y+ IL V+SA
Sbjct: 457 TAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAA 516
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
+RI G ++AE+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT
Sbjct: 517 VRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTV 576
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
FGF+ + + K + + I GT +LEK++
Sbjct: 577 FGFKPVEPSKRQKI-KSLNILIIHGTGLLEKRL 608
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 56/109 (51%), Gaps = 10/109 (9%)
Query: 185 SSGANDGVLCARNMEIKMSKKV---VPNEYP-TNVKKLLSTGIL-DGACVKYISTSRER- 238
SSG L R M V VP + T + ++ G++ + A VKY++ R
Sbjct: 171 SSGVRGCALLVRGSTHSMEGNVDGYVPYRWKRTVLSWMIDMGVVSEDAKVKYMNKKGTRA 230
Query: 239 QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+L+G + G CGC SK+++ +FE HAG+K + P +I+LE+G
Sbjct: 231 RLEGRITRDGIHCGC----CSKILTVAKFELHAGSKEQQPYENIFLEDG 275
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 107/427 (25%), Positives = 177/427 (41%), Gaps = 66/427 (15%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L HD + Q +
Sbjct: 678 GIRCNCCDEVFSVLDFEVHAG-GKRNQPFKSLYLEGGNSLLQCLHDFMNKQSESQHKGYH 736
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 737 FVDFCSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCNCSCKFCEK 796
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ ++ L + + C +C S ++ T D
Sbjct: 797 VEAAIHDTSALHS----LSSCRLCEEKCSNHY-----------PHTLAD----------- 830
Query: 618 EFHVGCLRKNGLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 674
H C+ ++G +P ++ FC C + LQ + + S S + R
Sbjct: 831 --HQACINQDGT-----VPGERSTDSFCGKYCQELFEELQLLIGVKHPLPEGFSWSFLRR 883
Query: 675 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIP 733
+ + D+ ++ A+ ++ A ++ ECF P++ G +L+
Sbjct: 884 FELPSEV----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQ 930
Query: 734 VMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 791
+VY N F + +L +++ +RI G ++AE+P + T Y+ +G
Sbjct: 931 NIVYNFWSNFHRLNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMC 990
Query: 792 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 851
+ L IE L SL V LV+PA + WT FGF ++ E K ++ L +F G
Sbjct: 991 RRLMDGIESALGSLKVAKLVIPAVPELIDTWTSGFGFTPVN-ESEKKTIKNLNLLVFPGV 1049
Query: 852 SMLEKKV 858
ML K +
Sbjct: 1050 DMLGKSL 1056
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1179
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/433 (24%), Positives = 179/433 (41%), Gaps = 78/433 (18%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L H+ + Q +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC------- 692
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
C C + H+ S + ++C K
Sbjct: 693 -------------------SCKFCEKDEAAK--------HETSTLPSLSSCRLCEEKCSK 725
Query: 618 EF------HVGCLRKNGLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASS 668
+ H C+ ++G +P ++ FC C + LQ F+ + S
Sbjct: 726 HYPHTLADHQACINQDGT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFS 780
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-EC 727
S + R + + D+ ++ A+ ++ A ++ ECF P++
Sbjct: 781 WSFLRRFELPSEV----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRS 827
Query: 728 GRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
G +L+ +VY G N +F + +L +++ +RI G ++AE+P + T Y
Sbjct: 828 GVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMY 887
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
+ +G + L IE L SL V+ LV+PA + WT FGF ++ + K ++ L
Sbjct: 888 RRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVN-DSEKKTIKNLNL 946
Query: 846 TIFKGTSMLEKKV 858
+F G ML K +
Sbjct: 947 LVFPGVDMLGKSL 959
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 117/433 (27%), Positives = 174/433 (40%), Gaps = 91/433 (21%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPY-RHIYTSNGMTLHDIAISLAMGQRRTTGGS- 507
GI+C+CC K S S F+ H G + + + + TL + A R S
Sbjct: 956 GILCNCCTKTFSISDFKVHGGCSLPKSSLGLFLQSGKSYTLCQVEAWSAEFLSRKCDASG 1015
Query: 508 ---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 1016 RKVEAMDENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCT------- 1068
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
C C SE +TF ++ C QC
Sbjct: 1069 -------------------CRSCGNPLSEK-----------EVSTFS--AILKCLQCGDS 1096
Query: 619 FHVGCLRKNGL-CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+H C+ + L C K+ + WFC C I L + V I
Sbjct: 1097 YHDTCIDQEMLPCGDKQ--SNIWFCGRYCKEIFIGLHNHVG------------------I 1136
Query: 678 EKGILFDGTMNDVQWQMLK---KAQCFEEKEK--------SLLSSATAIFRECFDPII-A 725
E D N++ W +LK Q +K + L+ A I ECF ++
Sbjct: 1137 EN--FLD---NELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDP 1191
Query: 726 ECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 785
G D+IP ++ N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y
Sbjct: 1192 RTGVDMIPHVL--SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDY 1249
Query: 786 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 845
+ +G + L IE +L S +VE LVL A + + W FGF+ + + K R+ L
Sbjct: 1250 RRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNL 1308
Query: 846 TIFKGTSMLEKKV 858
+F GTS+L K++
Sbjct: 1309 MLFPGTSLLTKRL 1321
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/451 (24%), Positives = 186/451 (41%), Gaps = 91/451 (20%)
Query: 426 LPDGERLTYIVKGQRLRF---GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIY 482
+P ++ Y+ K ++ RF G +GI CDCCNK + ++FEAHAG QP+ +IY
Sbjct: 774 VPLDGKVQYL-KRRKARFVTKGSITTDGIQCDCCNKTFTSAEFEAHAG-GKSCQPFENIY 831
Query: 483 TSNGMTLHDIAISLAMGQRRTT------------GGSDDMCHVCGDGENLLLCNGCPLAF 530
G +L + + + +DD C +CGDG +L+ C+ CP F
Sbjct: 832 LETGSSLLQCQLDSWYKEDDSAHKGFHFIDIDGEDPNDDTCGICGDGGDLICCDSCPSTF 891
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFD 590
H +CL+ P W C C C C +
Sbjct: 892 HQSCLEIRKFPSGLWHCMYCL--------------------------CKFCGMVGG---- 921
Query: 591 IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIH 650
C+ AA ++ C CE ++H C ++ + + P FC ++C ++
Sbjct: 922 -NTCQRDGNMAAV--SHALVTCHLCEDKYHHSCFQEKDIINAD--PGSPSFCGNNCQELY 976
Query: 651 AALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLS 710
LQ + + S + + R + I G + W K C S ++
Sbjct: 977 ERLQMLFGVKQELEAGFSWTFVRRFDVSSDISVSG----MSW----KVDC-----NSKVA 1023
Query: 711 SATAIFRECFDPIIA-ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLR 767
A I ECF P++ + G +LI +VY G N + + G ++ +L +++A +R
Sbjct: 1024 VALQIMDECFVPMVDHKSGVNLIRNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIR 1083
Query: 768 IFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFG 827
F +P+ + S + LCSLNV LV+PA + WT FG
Sbjct: 1084 YF----YSMPV--------------SFHSSLSMGLCSLNVGKLVIPAISELTGTWTSVFG 1125
Query: 828 FRKM--SRERLLKYQRDFQLTIFKGTSMLEK 856
F+ + S ++++ R+ + +F G ML+K
Sbjct: 1126 FKHLEGSDKQIM---RNMNMMVFPGVDMLQK 1153
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 6/73 (8%)
Query: 219 LSTGILDGACVKYISTSRERQL-DGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRH 277
L T LDG V+Y+ + R + G + G C C CN K ++ EFE HAG K+
Sbjct: 771 LGTVPLDGK-VQYLKRRKARFVTKGSITTDGIQCDC--CN--KTFTSAEFEAHAGGKSCQ 825
Query: 278 PNNHIYLENGKPI 290
P +IYLE G +
Sbjct: 826 PFENIYLETGSSL 838
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 152/368 (41%), Gaps = 94/368 (25%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VCGDG LL C+ CP FH ACL + +P+ W C CR
Sbjct: 452 SDDACGVCGDGGELLCCDSCPSTFHPACLA-MKVPQGWWACHYCR--------------- 495
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
CV+C + DD+ + C C ++H C R+
Sbjct: 496 -----------CVLCMAN--------------------DDQGLSTCQHCSLKYHEVC-RR 523
Query: 627 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 686
L + + I +C + C ++ A L D V H E G
Sbjct: 524 PSLSNGRGIGA---YCSETCKKVSARLSDMVGVT--------------NHTEDGF----- 561
Query: 687 MNDVQWQMLK--KAQCFEEKEKSL-------LSSATAIFRECFDPIIAECGR-DLIPVMV 736
W +LK K + ++ + L+ A + ECF+P + D++ V
Sbjct: 562 ----SWALLKIQKDEAVSSQDTAAVLECNVKLAVALGVLNECFNPAKDRRTKIDMLHQAV 617
Query: 737 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 794
Y G + G Y+++L ++A LLRI G +VAE+P AT Y+ +G + L
Sbjct: 618 YSLGSEFKRVSYEGFYTMVLDKDGETIAAALLRIHGTKVAEMPFAATLPAYRKQGMMRRL 677
Query: 795 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM----SRERLLKYQRDFQLTIFKG 850
+ +E++L S+ V+ LV+PA WT+ F FR + SRE + R L + G
Sbjct: 678 VNAVEQVLASVQVDKLVIPAIAALVDTWTRSFSFRPLLDPESREEI----RRRSLVVIAG 733
Query: 851 TSMLEKKV 858
T++L K V
Sbjct: 734 TTLLHKPV 741
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 146/313 (46%), Gaps = 57/313 (18%)
Query: 212 PTNVKKLLSTGILDGACVKYISTS-RERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQH 270
P N K+L++TG+L+G Y+ S R QL GI G +C C +C ++VVS FE H
Sbjct: 312 PRNAKELMATGLLEG---HYVHCSCRGEQLTGIFQDMGVVCNCRICKGTQVVSISAFEAH 368
Query: 271 AGAKTRHPNNHIYLENGKPIYSIIQELKTAP------LGILEEVVKKVAGSSFNEGSFRV 324
+G+ + HP+++IYLENGK + I+ + + L L+ + ++ G S
Sbjct: 369 SGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNILRALQHAIGEIQGIS-------- 420
Query: 325 WKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
K ++ G E E ISC ++ + G + H++
Sbjct: 421 -KEMTCVKCGKHEGGE------------FISCKGAKC------SAAYHAECVGVKSPHLE 461
Query: 385 EML----EERKRGVKRPFMHQKRTAEGGTKK----------------RDNDLHRLLFLPN 424
+ E+ + +P + KR+ G K+ RD LH+ LFLP
Sbjct: 462 DWFCAKCEKTQARKPQPLLKVKRSPAGTDKEDARSKGKEQTMIARSARDAHLHKALFLPG 521
Query: 425 GLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS 484
GL DG L Y + Q + G KQG GI C CCN+EI+ S FE HA AR+ PY I +
Sbjct: 522 GLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITCSAFERHARCEARQNPYGSILLA 581
Query: 485 NGMTLHDIAISLA 497
+G +L D+ LA
Sbjct: 582 DGRSLKDMCKELA 594
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 158/360 (43%), Gaps = 65/360 (18%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG L+ C+ CP ++H ACL IP+ W C +C + S +L
Sbjct: 8 NDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCLCDICGEVIDSKELVT 67
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
L A + C QCE+++HV C+
Sbjct: 68 SLPA---------------------------------------LDCSQCERQYHVKCVSA 88
Query: 627 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 686
C+ E WFC C+ I+ + V +P H++ + F
Sbjct: 89 KVPCN--EDGSGTWFCGRKCHEIYMTFRSRVG-----VP---------DHMDDDLCFTVL 132
Query: 687 MNDVQWQMLKKAQCFEEKEKSL-LSSATAIFRECFDPII-AECGRDLIPVMVYG-RNISG 743
N+ ++ + E ++ L AT+I ECF PI+ G D+IP ++Y R+
Sbjct: 133 RNNGDKKVRTAEEIALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLH 192
Query: 744 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 803
+ G Y+V+L +VS +R+ G +AE+PLVAT +E + +G + L IE +L
Sbjct: 193 FNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLK 252
Query: 804 SLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLEKKVQC 860
SL VE L+L A WT FGFR++ ++RL K +L GT +L+K + C
Sbjct: 253 SLKVEMLLLSAIPHLAETWTSTFGFREIDESDKKRLSK----VRLAAVPGTVLLKKDLLC 308
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 147/374 (39%), Gaps = 100/374 (26%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VC DG LL C+ CP FH ACL + +PE W C CR
Sbjct: 511 SDDACGVCADGGELLCCDSCPSTFHPACLA-MKVPEGLWACHYCR--------------- 554
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL-- 624
CV+C + DD+ + C C ++H C
Sbjct: 555 -----------CVLCMAN--------------------DDQGLSRCQHCTLKYHEICRPS 583
Query: 625 ----RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 680
R NG +C + C ++ A L D + H E G
Sbjct: 584 LSNGRGNGA-----------YCSETCKKVSAQLSDMIG--------------ITNHTEDG 618
Query: 681 ILFDGTMNDVQWQMLK--KAQCFEEKEKS-------LLSSATAIFRECFDPIIAECGR-D 730
W +LK K + + L+ A + ECF+P+ + D
Sbjct: 619 F---------SWALLKIQKDEPVSSQNSPDVLECNVKLAVALGVLNECFNPVKDRRTKID 669
Query: 731 LIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 788
++ VY G + G Y+++L ++SA LLRI G +VAE+P T Y+ +
Sbjct: 670 MLHQAVYSLGSEFKRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGTLPAYRKQ 729
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 848
G + L + +E++L S+ VE LV+PA W K F F+ + E L + R L +
Sbjct: 730 GMMRRLVNAVEQVLASVQVEKLVIPAIAALVDTWKKSFSFKALDPE-LKEEIRRRSLVVI 788
Query: 849 KGTSMLEKKVQCLP 862
GT++L+K V P
Sbjct: 789 TGTTLLQKPVVAAP 802
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 557
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 164/357 (45%), Gaps = 78/357 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQGHSSSMSRSVDLKGG 567
D+C VC G +LLLC+GCP AFH ACL +PE W D+ G
Sbjct: 257 DVCCVCHWGGDLLLCDGCPSAFHHACLGLSSLPEEDLW----------FCPCCCCDICGS 306
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
+E+P + ++ C+QC++ FH+ CL+++
Sbjct: 307 MESPA---------------------------------NSKLMACEQCQRRFHLTCLKED 333
Query: 628 GLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTM 687
+ WFC CNR+ +AL++ + ++ G
Sbjct: 334 SCI----VSSRGWFCSSQCNRVFSALENLLGSKIAV---------------------GND 368
Query: 688 NDVQWQMLK---KAQCFEEKEKSLLSSATAIFRECFDPI-IAECGRDLIPVMVYGRNISG 743
D+ W +++ + + +++++ S L SA I + F+P GRDL+ ++Y ++ +G
Sbjct: 369 GDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTNDVFSGRDLVEELIYRKDRTG 428
Query: 744 QEFG-GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 802
G G Y+V++ K+ ++ +R+ ++V E+PLVAT Y+ G + L +E+ +
Sbjct: 429 --VGRGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVATLSSYRRSGMCRVLMDELEKQM 485
Query: 803 CSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
+ V LVLPAA++ + WT++FGF M+ L+ + L F GT M K +Q
Sbjct: 486 SQMGVCRLVLPAAKEVVTTWTERFGFSVMNSSERLELVKHGMLD-FVGTIMCHKFLQ 541
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 111/472 (23%), Positives = 197/472 (41%), Gaps = 101/472 (21%)
Query: 426 LPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG-------------- 470
L +G ++Y+ K ++ G +GI+C CCN+ S + F+ HAG
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 471 ----MAARRQPYRHIYTS------NGMTLHDIAISLAMGQRRT--------TGGSDDMCH 512
+ ++Q + I + NG D AM + + +DD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VCGDG L+ C+ CP FH CL +PE W CP C +S RS+
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC---CCASCGRSL---------- 179
Query: 573 AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 632
+D + T ++Y +H C+ C +
Sbjct: 180 ------------------------YDPTIQT----EILY-------YHSNCVPG---CAM 201
Query: 633 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 692
K D FC C +I L+ V + S + + +H +D + + +
Sbjct: 202 KYESSDNQFCSRKCFKIFRGLRKLVGRVNKVDDMYSWTLLRSEH------YDQSAENSKL 255
Query: 693 QMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQ--EFGGM 749
+ + + L+ A + +ECF P+I D++ ++Y R + +F G
Sbjct: 256 ESVADL-------NTRLALALTVIQECFRPMIDPRSNIDMVSHILYNRRGEDKRMDFRGF 308
Query: 750 YSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVEN 809
Y+V+L + ++S +R+ G AE+P + T +Y+ +G + L + I+++L +L V+
Sbjct: 309 YTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQT 368
Query: 810 LVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 861
LVLPA + WT FGF+K++ + ++ + + F G+S+L+K + L
Sbjct: 369 LVLPAIAEFIETWTSAFGFQKLTAAQGIQLM-ELNIVTFPGSSVLQKPLTWL 419
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 150/364 (41%), Gaps = 91/364 (25%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VC DG LL C+ C FH CL + +PE W C CR
Sbjct: 526 SDDACGVCADGGELLCCDSCTSTFHPECLA-IKVPEGSWSCHYCR--------------- 569
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC--L 624
CV LC S+D D + + C QC +++H C L
Sbjct: 570 -----------CV-------------LCMSND------DLQGLSTCQQCARKYHESCRPL 599
Query: 625 RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 684
NG CD+ +C + C ++ + L + +
Sbjct: 600 PGNG-CDIGT------YCGETCKKLFSQLAQVTG-----------------------VTN 629
Query: 685 GTMNDVQWQMLK----KAQCFEEKEKSL-----LSSATAIFRECFDPIIAECGR-DLIPV 734
T + W +L+ + EE L L+ A +F ECF+P+ + D++
Sbjct: 630 PTGDGFWWALLRIQKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQ 689
Query: 735 MVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQ 792
VY G + G Y+++L +VSA LLRI G +VAE+P T Y+ +G +
Sbjct: 690 AVYSLGSQFKRLSYEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMR 749
Query: 793 ALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTS 852
L S +E++L S+ VE LV+PA + W + F FR + +L + + L + GT+
Sbjct: 750 RLVSAVEQVLASVQVEKLVIPAIDSLVDTWKRSFFFRPVD-PQLREELKRLSLVVITGTT 808
Query: 853 MLEK 856
+L K
Sbjct: 809 LLHK 812
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 150/362 (41%), Gaps = 70/362 (19%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 4 NDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCT--------------- 48
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C C SE +TF ++ C QC +H C+ +
Sbjct: 49 -----------CRSCGNPLSEK-----------EVSTFS--AILKCLQCGDSYHDTCIDQ 84
Query: 627 NGL-CDLKEIPKDKWFCCDDCNRI------HAALQDFVSNRAQTIPASSLSTINRKHIEK 679
L C K+ + WFC C I H +++F+ N + + H K
Sbjct: 85 EMLPCGDKQ--SNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKLHSSK 142
Query: 680 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY- 737
I N + L+ A I ECF ++ G D+IP ++Y
Sbjct: 143 KIAHMTECN------------------TKLAVALTILEECFVRMVDPRTGVDMIPHVLYN 184
Query: 738 -GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 796
G N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G + L
Sbjct: 185 KGSNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMD 244
Query: 797 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
IE +L S +VE LVL A + + W FGF+ + + K R+ L +F GTS+L K
Sbjct: 245 TIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSLLTK 303
Query: 857 KV 858
++
Sbjct: 304 RL 305
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 145/360 (40%), Gaps = 86/360 (23%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
SD C +C +G L+ C CPL FH C+ L +P+ W C C H R+
Sbjct: 264 AAANSDQCCGICNEGGELVCCETCPLTFHMECVSLLEVPKDAWFCFRCLCCHCGEPLRTQ 323
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C+QCE+ FH G
Sbjct: 324 P------------------------------------------------CEQCERCFHPG 335
Query: 623 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 682
C CD + D +FC C I L + V+ +++ + R
Sbjct: 336 C------CDDAILAGDFFFCSSGCWNIFQRLAEMVA---------TVNPLGR-------- 372
Query: 683 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRN 740
+++ W +L++ +C + LL+ A + FDP++ +C D + MV+ R+
Sbjct: 373 -----SELSWSLLRRGRC----DDKLLAEALQVISSRFDPVL-DCWTQLDYLDAMVFSRS 422
Query: 741 ISGQ--EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 798
+F G Y+ +L + VV +LRI G +AE+P +AT +G+G ++LF+ +
Sbjct: 423 HHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQGICRSLFTAV 482
Query: 799 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
E +L L VE +VL AA+ E +W F F M R +L L G L+K V
Sbjct: 483 EEMLARLGVEMMVLLAAKDTEKMWKNSFEFHAMDR-KLKARTVALGLVALNGAGFLQKSV 541
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 426 LPDGERLTYIVKGQRLRFGCKQG----NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHI 481
+ + E++ YI +R R K+G +G+VC CC++ + S FEAH G RR P +I
Sbjct: 67 IAEQEKVVYIGGTKRFRHLLKEGMVTSDGLVCSCCDELFNLSGFEAHTGSKLRR-PAANI 125
Query: 482 YTSNGMTL 489
+ + L
Sbjct: 126 FVGDEAQL 133
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 33/264 (12%)
Query: 606 DRTVIYCDQCEKEFHVGCLRKNGLCDL-KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 664
D ++ CDQC ++FHV C R GL L ++ WFC D C + + LQ + +++
Sbjct: 44 DGFILSCDQCPRKFHVACARSRGLIKLERKGTCYSWFCSDKCEYVFSGLQHLL---GKSV 100
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ--CFE----EKEKSLLSSATAIFRE 718
P + +++ W +LK+ + CF+ S L A + E
Sbjct: 101 PVGT-------------------DNLTWTLLKRVEPDCFDLEVLSANNSKLKLALEVMHE 141
Query: 719 CFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAE 775
CF+P A G+DL+ +++ G N++ F G Y+V+L + + + +R+FG +VAE
Sbjct: 142 CFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVRVFGDKVAE 201
Query: 776 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRER 835
+P VAT +Y+ G + L + +ER L +L VE LVLPAA W K FGF M+
Sbjct: 202 VPFVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFGFSVMTYSD 261
Query: 836 LLKYQRDFQLTIFKGTSMLEKKVQ 859
K D+ + F+GT + +K ++
Sbjct: 262 -KKAHSDYPILFFQGTVLCQKFLK 284
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 108/444 (24%), Positives = 175/444 (39%), Gaps = 128/444 (28%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL-----HDIAI- 494
L+ G +G+VC CC++ + S FEAH G RR P +I+ + L ++A
Sbjct: 2 LKEGMVTSDGLVCSCCDELFNLSGFEAHTGSKLRR-PAANIFVGDEAQLSIADCQNVAFK 60
Query: 495 --------SLAMGQRR--------------------------TTGGSDDMCHVCGDGENL 520
L + +RR SD C +C +G L
Sbjct: 61 METLESLPGLPVARRRKFDSYCQSDEDSGLTTVSSGSDVDYEAAANSDQCCGICNEGGEL 120
Query: 521 LLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVI 580
+ C CPL FH C+ L +P+ W C C H R+
Sbjct: 121 VCCETCPLTFHMECVSLLEVPKDAWFCFRCLCCHCGEPLRT------------------- 161
Query: 581 CRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 640
P C+QCE+ FH GC CD + D +
Sbjct: 162 ---QP--------------------------CEQCERCFHPGC------CDDAILAGDFF 186
Query: 641 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQC 700
FC C + L + V+ +++ + R +++ W +L++ +C
Sbjct: 187 FCSSGCWNLFQRLAEMVA---------TVNPLGR-------------SELSWSLLRRGRC 224
Query: 701 FEEKEKSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQ--EFGGMYSVILTV 756
+ LL+ A + FDP++ +C D + MV+ R+ +F G Y+ +L
Sbjct: 225 ----DDKLLAEALQLISSRFDPVL-DCWTQLDYLDAMVFSRSHHSPRLDFSGFYTAVLQR 279
Query: 757 KSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAE 816
+ VV +LRI +AE+P +AT +G+G ++LF+ +E +L L VE + L AA+
Sbjct: 280 GAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGVETMALLAAK 339
Query: 817 KAESIWTKKFGFRKMSRERLLKYQ 840
E +W F F + R+ LK Q
Sbjct: 340 DTEKMWKNSFEFHAVDRK--LKAQ 361
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 165/388 (42%), Gaps = 72/388 (18%)
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+IYTS G+TLHDIA+ L + G + D NL
Sbjct: 2 NIYTSEGVTLHDIALQLHRLNLNSNGFGNASVSSFSDYPNL------------------- 42
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDF 599
SG C + S++ V LK L+ + C CR +E
Sbjct: 43 -TSSG-----CGKEPSTTSGPIVPLKRTLQERVVQTESCYFCRYGDTE------------ 84
Query: 600 SAATFDDRTVIYCDQCEKEFHVGC-------LRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
D T+ +C+QCE+ HV C ++K L LKE ++ CC++C + A
Sbjct: 85 -FGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEILKEYMCFRFLCCEECQSLRAR 143
Query: 653 LQDFVSNRAQTIPASSLSTINRKHIEKG---ILFDGTMNDVQWQMLKKAQCFEEKEKSLL 709
L+ +EKG +++ W++L KA K +
Sbjct: 144 LEG---------------------VEKGEEIAFLRQIRSNICWRLLSKADA-SRDVKLYM 181
Query: 710 SSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRI 768
S A IF++ F + D+ MVYG+N +G+ +F GMY V+LT + VVSA +L++
Sbjct: 182 SQAIDIFKDAFVE-STDAHSDIFSDMVYGKNGAGEKDFRGMYCVVLTASTHVVSAAILKV 240
Query: 769 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 828
+ AEL L+AT E + KG F+ L IE L + NV L+ P + IW++K GF
Sbjct: 241 RVEQFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSLLMAPVDPEMAQIWSEKLGF 300
Query: 829 RKMSRERLLKYQRDFQLTIFKGTSMLEK 856
+S E L +F+ +++K
Sbjct: 301 TILSAEEKKSMLESHPLVMFQDLVLVQK 328
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/107 (58%), Positives = 81/107 (75%), Gaps = 1/107 (0%)
Query: 753 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 812
I+ + VV+AGLLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNV ++V+
Sbjct: 80 IIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIVV 139
Query: 813 PAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 858
PAAE+ E +W KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 140 PAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPV 186
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 632 LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI--NRKHIEKGIL 682
LKE+PK WFC DC RI++ LQ + A+ + S I N + G+L
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSCFGIIQTNATVVAAGLL 93
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 151/365 (41%), Gaps = 80/365 (21%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 4 NDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCT--------------- 48
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C C SE +TF ++ C QC +H C+ +
Sbjct: 49 -----------CRSCGNPLSEK-----------EVSTFS--AILKCLQCGDSYHDTCIDQ 84
Query: 627 NGL-CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L C K+ + WFC C I L + V IE D
Sbjct: 85 EMLPCGDKQ--SNIWFCGRYCKEIFIGLHNHVG------------------IEN--FLD- 121
Query: 686 TMNDVQWQMLK---KAQCFEEKEK--------SLLSSATAIFRECFDPII-AECGRDLIP 733
N++ W +LK Q +K + L+ A I ECF ++ G D+IP
Sbjct: 122 --NELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIP 179
Query: 734 VMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
++ N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G +
Sbjct: 180 HVL--SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRR 237
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 853
L IE +L S +VE LVL A + + W FGF+ + + K R+ L +F GTS+
Sbjct: 238 LMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSL 296
Query: 854 LEKKV 858
L K++
Sbjct: 297 LTKRL 301
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 148/360 (41%), Gaps = 70/360 (19%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 4 NDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNCT--------------- 48
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C C SE +TF ++ C QC +H C+ +
Sbjct: 49 -----------CRSCGNPLSEK-----------EVSTFS--AILKCLQCGDSYHDTCIDQ 84
Query: 627 NGL-CDLKEIPKDKWFCCDDCNRI------HAALQDFVSNRAQTIPASSLSTINRKHIEK 679
L C K+ + WFC C I H +++F+ N + + H K
Sbjct: 85 EMLPCGDKQ--SNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGRKLHSSK 142
Query: 680 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG 738
I N + L+ A I ECF ++ G D+IP ++
Sbjct: 143 KIAHMTECN------------------TKLAVALTILEECFVRMVDPRTGVDMIPHVL-- 182
Query: 739 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 798
N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G + L I
Sbjct: 183 SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTI 242
Query: 799 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
E +L S +VE LVL A + + W FGF+ + + K R+ L +F GTS+L K++
Sbjct: 243 EMMLRSFHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSLLTKRL 301
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 158/354 (44%), Gaps = 78/354 (22%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQGHSSSMSRSVDLKGG 567
D+C VC G +LLLC+GCP AFH CL +PE W CP C SM V+ K
Sbjct: 255 DVCCVCHWGGDLLLCDGCPSAFHHTCLGLSSLPEEDLWFCPCCCCDICGSMESPVNSK-- 312
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
++ C+QC++ FH+ CL++
Sbjct: 313 -----------------------------------------LMACEQCQRRFHLKCLKEE 331
Query: 628 -GLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 686
G+ + WFC CNR+ +AL++ + + G
Sbjct: 332 PGIVSCR-----GWFCSSQCNRVSSALENLIGCKIAV---------------------GN 365
Query: 687 MNDVQWQMLK---KAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNIS 742
D+ W +++ + + +++++ S L SA I + F+P GRDL+ +++ ++ +
Sbjct: 366 NGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTKDVFSGRDLVEELIFRKDRT 425
Query: 743 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 802
G G Y+V++ K ++ +R+ ++V E+PLVAT Y+ G + L +E+ +
Sbjct: 426 GVG-RGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMCRVLVDELEKQM 483
Query: 803 CSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
+ V LVLPAA++ S WT++FGF M L+ + L F GT M K
Sbjct: 484 SQMGVCRLVLPAAKEVVSTWTQRFGFSVMESSERLELVKHGMLD-FVGTVMCHK 536
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 150/355 (42%), Gaps = 93/355 (26%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
D +C VC DG LLLC+ CP AFH AC+ PE W CP CR
Sbjct: 564 DSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------------- 607
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
C +C S + D D +A F D+T+IYC+
Sbjct: 608 ----------CGVCGGS---DLD-------DDTAEGFTDKTIIYCEA------------- 634
Query: 628 GLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTM 687
+ IP T+ SLST+ R+ + T+
Sbjct: 635 -----RSIP-------------------------TTVEGVSLSTLRRRR------YMSTV 658
Query: 688 NDV-QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISG 743
+ +WQ ++ + + L +A + ECFD ++ DL +V+ + +
Sbjct: 659 TRITRWQHEEE----DAADHGQLCAALDVLHECFDDMVEPRTQTDLAADIVFNQESGLCR 714
Query: 744 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 803
F G Y V L +++ G LR+FG +VAELPLV T ++ +G + L + +E++L
Sbjct: 715 LNFRGYYVVGLEKAGELITVGTLRVFGNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLR 774
Query: 804 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+ V LVLPA + +WT GF M+R +++ + + FKGT+M +K +
Sbjct: 775 QVGVRRLVLPAVPELLPMWTASLGFHAMTRSDVMEMAVEHAILSFKGTTMCQKTL 829
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/443 (23%), Positives = 175/443 (39%), Gaps = 94/443 (21%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL--------HDIAISLAMGQR 501
GI C+CC++ S FE HAG R QP++ +Y G +L + + S G
Sbjct: 428 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 486
Query: 502 RTTGGS----DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
GS DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 487 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC------- 539
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
C C + H+ S ++ C CE+
Sbjct: 540 -------------------SCKFCEKDEAAK--------HETSTLP----SLSSCRLCEE 568
Query: 618 EFHVGCLRKNGLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 674
++H C+ ++G +P ++ FC C + LQ F+ + S S + R
Sbjct: 569 KYHQACINQDGT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRR 623
Query: 675 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIP 733
+ + D+ ++ A+ ++ A ++ ECF P++ G +L+
Sbjct: 624 FELPSEV----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQ 670
Query: 734 VMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 791
+VY G N +F + +L +++ +RI G ++AE+P + T Y+ +G
Sbjct: 671 NIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMC 730
Query: 792 QALFSCIERLLCSLNVENLVLPAAEKAESIWT----------------KKFGFRKMSRER 835
+ L IE + + + L +E +W FGF ++ +
Sbjct: 731 RRLMDGIESFVAYFS--QMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVN-DS 787
Query: 836 LLKYQRDFQLTIFKGTSMLEKKV 858
K ++ L +F G ML K +
Sbjct: 788 EKKTIKNLNLLVFPGVDMLGKSL 810
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 102/420 (24%), Positives = 167/420 (39%), Gaps = 101/420 (24%)
Query: 437 KGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT--SNGMTLH---D 491
K + L G + G+ C+CCN+ ++ FE HAG +R PY HI+ SN L +
Sbjct: 125 KKEVLMKGRARREGVRCECCNEVMTVWDFETHAGSVLQR-PYEHIHVARSNSSLLQCQFE 183
Query: 492 IAISLAMGQRRT-------TGGSD---DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP 541
+ S +RRT G SD D C +C DG +L+ C C H C+ IP
Sbjct: 184 VWQSNVEVERRTFNEIVPRNGASDKHDDACLICADGGDLICCEKCWSTSHLKCMGLERIP 243
Query: 542 ESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSA 601
+ W CP C H +
Sbjct: 244 QGDWICPYCVCKHCNK-------------------------------------------- 259
Query: 602 ATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD-CNRIHAALQDFVSNR 660
+D+ + C QC+K++H CL N DL + CD C ++ LQ V +
Sbjct: 260 ---NDKDLQTCVQCDKKYHCQCLVSNKELDLN--ASGETLACDSHCGEVYEKLQSLVGVK 314
Query: 661 AQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSL---------LSS 711
+ +E G W +L++ + K L ++
Sbjct: 315 HE--------------LEGGFC---------WTLLQRMEPDNLDFKDLHLITECNSKIAL 351
Query: 712 ATAIFRECFDPIIAECGR-DLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRI 768
A + ECF II + +++ + Y R N++ F G Y+ IL ++SA +R+
Sbjct: 352 AWEVLDECFTTIIDRHTQINVVQSVAYSRGSNLNRINFRGFYTAILEKNDDIISAATIRV 411
Query: 769 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 828
G ++AE+P + T Y+ G + L +E + + VE+L++P+ ++ +W K GF
Sbjct: 412 HGTDLAEMPFIGTRHLYRQNGMSRMLLVTLESIFSVMGVEHLIIPSVQELTEMWEGKCGF 471
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 148/357 (41%), Gaps = 74/357 (20%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VC DG LL C+ C FH CL + +P+ W C CR
Sbjct: 436 SDDACGVCADGGELLCCDFCTSTFHPECLA-IEVPDGSWSCHYCR--------------- 479
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC--L 624
C +C + DD+ + C +C ++H C L
Sbjct: 480 -----------CTLC--------------------MSNDDQDLSTCQECACKYHESCRPL 508
Query: 625 RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 684
NG D+ +C + C ++ A L + + T S S + R H ++
Sbjct: 509 LGNGR-DIGA------YCGEICKKLSAKLSEVIGVMNSTEDGFSWSLL-RIHEDEPASSQ 560
Query: 685 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNI 741
G M V + +K L+ A + +CF+P+ + D++ VY G
Sbjct: 561 G-MPAVLERNVK------------LAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLGSQF 607
Query: 742 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 801
+ G Y++IL +VS LLRI GR+VAE+P T Y+ +G + S +E++
Sbjct: 608 KRLSYEGFYTMILEKDGEIVSTALLRIHGRKVAEMPFAGTLPAYRKQGMMHRVVSAVEQV 667
Query: 802 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
L S+ VE L++PA W + F FR + +L + + L + GT+ML K V
Sbjct: 668 LASVQVETLIIPAIASMVDTWKRSFSFRPVD-PQLREELKRLSLVVITGTTMLHKPV 723
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 119/254 (46%), Gaps = 46/254 (18%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+K RD LH L+F +GL D LTY +K G+ L+ G K+G I+C+CCN+E SPS FE
Sbjct: 341 SKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEE 400
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY +IYT G++LH +A+ L ++ L NG
Sbjct: 401 HAGMGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPNGFD 439
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSE 587
A ++ D + SG C + S++ V LK L+ E C C +
Sbjct: 440 NASVSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHT- 493
Query: 588 NFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWF 641
+ + T+I+C+QCE+ H+ C +K L LKE +
Sbjct: 494 ------------TIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFL 541
Query: 642 CCDDCNRIHAALQD 655
CC +C + A L++
Sbjct: 542 CCQECQSLRARLEE 555
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 72/125 (57%), Gaps = 3/125 (2%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF-SKVVSAHEFEQHAG 272
+V+ LLSTG+L+G V Y+ E G +NG GY CGC CN+ S +++A EFE+H G
Sbjct: 68 DVRGLLSTGLLEGFRVTYMKDEVEEV--GRINGQGYSCGCSKCNYNSNIMNACEFEEHYG 125
Query: 273 AKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLR 332
+ +HI+L+ G ++ +++ LK L +L + +++ G N + WKAS R
Sbjct: 126 QSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKASFQKR 185
Query: 333 KGLVE 337
K ++
Sbjct: 186 KDYLD 190
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 145/369 (39%), Gaps = 85/369 (23%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
+RR + SD +C C G +L+LC+ CP FH CL+ +P W CP+C
Sbjct: 135 KRRNSSLSDTICSFCHYGGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC--------- 185
Query: 560 RSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
C LC D S +T C QC + +
Sbjct: 186 ------------------CC------------ELCGKGDSSTST------NACLQCARAY 209
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
HV CL K+G + P + FC C + A L +
Sbjct: 210 HVHCLTKDGCLLPTDYPSEN-FCSKSCYELCAQLHQLL---------------------- 246
Query: 680 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATA-------IFRECFDPII-AECGRDL 731
GI +++ + W + + ++ SS + ECF + +D+
Sbjct: 247 GISNPTSVDGLTWTLTRSSKDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDM 306
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
+ ++Y G F G Y+V+L +VS LRI G + AE+PLVAT ++ +G
Sbjct: 307 VTDLIYNSGSKFKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVATPFNFRRQG 366
Query: 790 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLT 846
+ L + +LL VE L+LPA + +W FGF +M R++L Y
Sbjct: 367 MCRLLMQEVLKLLNKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQLSGY----SFV 422
Query: 847 IFKGTSMLE 855
F+GT ML+
Sbjct: 423 GFQGTMMLQ 431
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 115/473 (24%), Positives = 173/473 (36%), Gaps = 133/473 (28%)
Query: 449 NGIVCDCCNKEISPSQFEAHA----------------------------------GMAAR 474
GI C CCN + + FE HA M AR
Sbjct: 411 TGIRCRCCNTVVPVAVFETHARCERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAR 470
Query: 475 R--------------QPYRHIYTSNGMTLHD----IAISLAMGQRRTTGG----SDDMCH 512
Q R + + M L D ++ S Q + GG SDD C
Sbjct: 471 EKAMASLEQEKEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRHQVKKNGGGKDCSDDACG 530
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VC DG LL C+ CP FH CL + +S W + RQ ++ + L+ APG
Sbjct: 531 VCADGGQLLCCDTCPSTFHPDCLAIQFMIKS-WLLFD-RQQLTTIYGQQPWLQ---TAPG 585
Query: 573 AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 632
A + D + CR F +G
Sbjct: 586 AAISA------------DHQYCRPLQSPG-----------------FEIGA--------- 607
Query: 633 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 692
+C + C ++ + L D + +N H E G F + +Q
Sbjct: 608 --------YCSETCKKMSSHLSDMIG------------VMN--HTEDG--FSWALLKIQK 643
Query: 693 QMLKKAQ----CFEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQE 745
L ++ E K L+ A + ECF+P+ + D++ VY G
Sbjct: 644 DELVTSEDMPVILESNVK--LAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVN 701
Query: 746 FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSL 805
+ G Y+++L ++S LLR GR++AE+P T YQ +G + L +E++L SL
Sbjct: 702 YEGFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASL 761
Query: 806 NVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
VENLV+PA W + F FR M E + + L GT++L+K +
Sbjct: 762 QVENLVIPAVADLVETWKRSFSFRPMQAE-VRDEAKKLSLVAITGTTLLQKPI 813
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 115/473 (24%), Positives = 173/473 (36%), Gaps = 133/473 (28%)
Query: 449 NGIVCDCCNKEISPSQFEAHA----------------------------------GMAAR 474
GI C CCN + + FE HA M AR
Sbjct: 413 TGIRCRCCNTVVPVAVFETHARCERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAR 472
Query: 475 R--------------QPYRHIYTSNGMTLHD----IAISLAMGQRRTTGG----SDDMCH 512
Q R + + M L D ++ S Q + GG SDD C
Sbjct: 473 EKAMASLEQEKEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRHQVKKNGGGKDCSDDACG 532
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VC DG LL C+ CP FH CL + +S W + RQ ++ + L+ APG
Sbjct: 533 VCADGGQLLCCDTCPSTFHPDCLAIQFMIKS-WLLFD-RQQLTTIYGQQPWLQ---TAPG 587
Query: 573 AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 632
A + D + CR F +G
Sbjct: 588 AAISA------------DHQYCRPLQSPG-----------------FEIGA--------- 609
Query: 633 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 692
+C + C ++ + L D + +N H E G F + +Q
Sbjct: 610 --------YCSETCKKMSSHLSDMIG------------VMN--HTEDG--FSWALLKIQK 645
Query: 693 QMLKKAQ----CFEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQE 745
L ++ E K L+ A + ECF+P+ + D++ VY G
Sbjct: 646 DELVTSEDMPVILESNVK--LAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVN 703
Query: 746 FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSL 805
+ G Y+++L ++S LLR GR++AE+P T YQ +G + L +E++L SL
Sbjct: 704 YEGFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASL 763
Query: 806 NVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
VENLV+PA W + F FR M E + + L GT++L+K +
Sbjct: 764 QVENLVIPAVADLVETWKRSFSFRPMQAE-VRDEAKKLSLVAITGTTLLQKPI 815
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/443 (22%), Positives = 175/443 (39%), Gaps = 84/443 (18%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L H+ + Q +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC S
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC----SCK 695
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
+ + + C +C S+++ T D
Sbjct: 696 FCEKDEAAKHETSTLPSLSSCRLCEEKCSKHY-----------PHTLAD----------- 733
Query: 618 EFHVGCLRKNGLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 674
H C+ ++G +P ++ FC C + LQ F+ + S S + R
Sbjct: 734 --HQACINQDGT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRR 786
Query: 675 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIP 733
+ + D+ ++ A+ ++ A ++ ECF P++ G +L+
Sbjct: 787 FELPSEV----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQ 833
Query: 734 VMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 791
+VY G N +F + +L +++ +RI G ++AE+P + T Y+ +G
Sbjct: 834 NIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMC 893
Query: 792 QALFSCIERLLCSLNVENLVLPAAEKAESIWT----------------KKFGFRKMSRER 835
+ L IE + + + L +E +W FGF ++ +
Sbjct: 894 RRLMDGIESFVAYFS--QMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVN-DS 950
Query: 836 LLKYQRDFQLTIFKGTSMLEKKV 858
K ++ L +F G ML K +
Sbjct: 951 EKKTIKNLNLLVFPGVDMLGKSL 973
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 168/398 (42%), Gaps = 71/398 (17%)
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
M RRQPY +IYT G++LH +A+ L ++ L NG A
Sbjct: 1 MGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPNGFDNAS 39
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFD 590
++ D + SG C + S++ V LK L+ E C C +
Sbjct: 40 VSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHT---- 90
Query: 591 IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCD 644
+ + T+I+C+QCE+ H+ C +K L LKE + CC
Sbjct: 91 ---------TIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQ 141
Query: 645 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTM-NDVQWQMLKKAQCF 701
+C + A L++ +EK GI F + +++ W++L
Sbjct: 142 ECQSLRARLEE--------------------GLEKCVGITFLRRIRSNICWRLLSGMDAS 181
Query: 702 EEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVV 760
+ K + IF++ F E D+I MV G+N + +F GMY +LT + V
Sbjct: 182 RDV-KLYMPQVIDIFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALLTASTHV 239
Query: 761 VSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAES 820
VSA +L++ ++AEL L+AT E + KG F L IE L + NV L P +
Sbjct: 240 VSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQ 299
Query: 821 IWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
IW++K GF +S E L +FK +++K +
Sbjct: 300 IWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 337
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/88 (61%), Positives = 66/88 (75%), Gaps = 1/88 (1%)
Query: 772 EVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 831
++AELPLVAT GKG FQ LFSCIERLL L V+ LVLPAAE+AESIWT+KFGF ++
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 832 SRERLLKYQRD-FQLTIFKGTSMLEKKV 858
++L Y+R Q+ FKGTSML+K V
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTV 223
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 38/257 (14%)
Query: 605 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 664
D +I CD C FH CL D++ + LQ + + +
Sbjct: 294 DGGDLICCDGCPSTFHQSCL------DIQ---------------LFEQLQMLLGVKHELE 332
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
S + + R + FD ++N + +K +C S L+ A +I ECF PI+
Sbjct: 333 DGFSWTLVQRTEVG----FDISLNGIP----QKVEC-----NSKLAVALSIMDECFLPIV 379
Query: 725 AE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 781
+ G +LI ++Y G N + + G ++ IL ++SA +RI G ++AE+P + T
Sbjct: 380 DQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGT 439
Query: 782 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 841
Y+ +G + L + IE L SLNVE LV+PA + WT FGF+ + K R
Sbjct: 440 RHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPLEVSS-RKEMR 498
Query: 842 DFQLTIFKGTSMLEKKV 858
+ + +F GT ML+K +
Sbjct: 499 NMNMLVFHGTDMLQKPL 515
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 76/170 (44%), Gaps = 23/170 (13%)
Query: 389 ERKRGVKRPFMHQKRTAEGGTKKRDN--------DLHRLLFLPNGLPDGERLTYI--VKG 438
+R + KR + + + EG T D L + +P ++ Y+ K
Sbjct: 146 QRTQNTKRFALLARHSKEGLTTDTDGYVPYSGKRTLLSWMVDLGTVPLNAKVQYMNRRKT 205
Query: 439 QRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAM 498
+ L G +GI C CC++ + S+FE HAGM +P ++I G++L +
Sbjct: 206 RALLEGWISRDGIRCGCCSEIFTISKFEIHAGMKLC-EPSQNIILETGISLLQCQLDSWN 264
Query: 499 GQRRTTGG------------SDDMCHVCGDGENLLLCNGCPLAFHAACLD 536
Q + +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 265 KQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFHQSCLD 314
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 112/229 (48%), Gaps = 24/229 (10%)
Query: 641 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQML---KK 697
FC C + LQ+ ++ + P S + R H DV ++L K+
Sbjct: 27 FCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIH-----------EDVPEEVLALDKR 75
Query: 698 AQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVIL 754
+C S ++ A ++ ECF PII + G +LI +VY G N + +F G Y IL
Sbjct: 76 VEC-----NSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFIL 130
Query: 755 TVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPA 814
+++A +RI G ++AE+P + T Y+ +G + L IE +L SLNVE L++PA
Sbjct: 131 ERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPA 190
Query: 815 AEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKK-VQCLP 862
+ WT KFGF + + + + +F GT +L+K ++ LP
Sbjct: 191 ITELVDTWTSKFGFSPLEDSEKQEV-KSISMLVFPGTGLLQKPLLKALP 238
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 134/354 (37%), Gaps = 92/354 (25%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG----MTLHDIAISL 496
++ G GIVC+CC K +S S F AHAG + Q ++ +G + L + +
Sbjct: 880 IKDGMVTWEGIVCNCCKKNLSVSDFMAHAGRS-HPQSSLGLFLESGKSYTLCLVEAWSAE 938
Query: 497 AMGQRRTTGG--------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+M +R G SDD C CGDG LL C+ CP +H ACL +PE W C
Sbjct: 939 SMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCH 998
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT 608
NC C +C SE +TF
Sbjct: 999 NCT--------------------------CQVCGGPFSEK-----------EVSTF--SA 1019
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
+ C QC +H C+ + L L++ WFC C I L
Sbjct: 1020 IFKCFQCGDAYHDTCIEQEKL-PLEDQISQTWFCGKYCKEIFIGL--------------- 1063
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLK------------KAQCFEEKEKSLLSSATAIF 716
R H+ + D +D+ W +L+ K C E L+ A +
Sbjct: 1064 -----RSHVGTDNILD---SDLSWSILRCNNDGQKLHSVQKIACLAECNMK-LAVALTLL 1114
Query: 717 RECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLR 767
ECF ++ G D+IP ++Y G N + ++ G Y+VIL ++ +R
Sbjct: 1115 EECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIR 1168
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 149/373 (39%), Gaps = 78/373 (20%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT-----LHDI----AISLAMGQ 500
GI C CC+K ++ ++FE HAG + ++PY +I+ G L D S G
Sbjct: 685 GIFCGCCSKILTIAKFELHAG-SKEKKPYANIFLEGGKVSLLQCLLDAWEKHTQSENKGF 743
Query: 501 RRTTGGSD---DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ G D D C +CGDG +L+ C+ C FH CL + +P W C +C
Sbjct: 744 YKIDKGDDEHDDTCAICGDGGDLVCCDHCASTFHLDCLG-IKLPSGDWYCRSC------- 795
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDF--SAATFDDRTVIYCDQC 615
LCR F + ++ C QC
Sbjct: 796 -----------------------------------LCRFCGFPQEKPSSSPELLLSCLQC 820
Query: 616 EKEFHVGCLRKNGL---CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 672
+++H C G C + D FC C +I+ L + + S S +
Sbjct: 821 SRKYHQTCSSGTGTDSGCTMPGTSID-CFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLV 879
Query: 673 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 731
+ D M + + L AQC S + A + ECF P I + G ++
Sbjct: 880 H------CFANDQAMPNKNKEKL--AQC-----NSKTALAFTVLDECFQPHIDDRSGINM 926
Query: 732 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 789
I + Y G + S +F G Y+ IL V++A +RI G ++AE+P + T Y+ +G
Sbjct: 927 IHNVAYNCGSDFSRLDFSGFYAFILERGDEVIAAASVRIHGTDLAEMPFIGTRGMYRHQG 986
Query: 790 CFQALFSCIERLL 802
+ L + IE ++
Sbjct: 987 MCRRLLNGIESVI 999
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 5/61 (8%)
Query: 229 VKYISTSRERQ-LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
VKY++++R R+ L G + G CGC SK+++ +FE HAG+K + P +I+LE G
Sbjct: 665 VKYMNSNRTREMLAGKITREGIFCGC----CSKILTIAKFELHAGSKEKKPYANIFLEGG 720
Query: 288 K 288
K
Sbjct: 721 K 721
>gi|224090647|ref|XP_002309044.1| predicted protein [Populus trichocarpa]
gi|222855020|gb|EEE92567.1| predicted protein [Populus trichocarpa]
Length = 467
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/65 (56%), Positives = 52/65 (80%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
+IK +KK+ N +P+NV+ LLSTG+LDG VKY++ S+E++L G++ G GYLCGC CNF
Sbjct: 382 DIKTAKKLPSNNFPSNVRSLLSTGMLDGVPVKYVAWSQEKELRGVIKGSGYLCGCQTCNF 441
Query: 259 SKVVS 263
SKV+S
Sbjct: 442 SKVIS 446
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 82/149 (55%), Gaps = 5/149 (3%)
Query: 718 ECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 774
ECF PII + G +LI +VY G N + +F G Y IL +++A +RI G ++A
Sbjct: 3 ECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLA 62
Query: 775 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 834
E+P + T Y+ +G + L IE +L SLNVE L++PA + WT KFGF +
Sbjct: 63 EMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDS 122
Query: 835 RLLKYQRDFQLTIFKGTSMLEKK-VQCLP 862
+ + + +F GT +L+K ++ LP
Sbjct: 123 EKQEV-KSISMLVFPGTGLLQKPLLKALP 150
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 10/156 (6%)
Query: 707 SLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSA 763
S L A ++ ECF+P+ RDL+ +++ R ++ F G Y+V+L ++S
Sbjct: 977 SKLHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISV 1036
Query: 764 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 823
+R++G++VAE+PLV T +Y+ G L +E+ L L VE LVLPA WT
Sbjct: 1037 ATVRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWT 1096
Query: 824 KKFGFRKMS---RERLLKYQRDFQLTIFKGTSMLEK 856
+ FGF KM+ R + L D+ F+G M +K
Sbjct: 1097 RSFGFAKMTNLERSQFL----DYTFLDFQGAIMCQK 1128
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 94/235 (40%), Gaps = 57/235 (24%)
Query: 431 RLTYIVKGQR-----LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSN 485
++ Y VKG+ L G +GI C+CC S FE HA + +P I+ +
Sbjct: 548 KVYYKVKGRHRKVCTLADGKITRDGIKCNCCMGIYSFVGFENHASGNSTCRPSASIFLED 607
Query: 486 GMTLHDIAISLAMGQR--RTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAAC 534
G +L D I + + T+G S D +C VC G L+LC+ CP +FH C
Sbjct: 608 GRSLLDCLIKMMHDHKTMETSGKSFSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTC 667
Query: 535 LDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
L IP W CP+C G R +D +E +L P
Sbjct: 668 LGLEDIPNGDWFCPSCCCGICG--QRKIDRDDEVE------------QLLP--------- 704
Query: 595 RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 649
C QCE ++HV CL +NG D+ WFC DC ++
Sbjct: 705 -----------------CIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKL 741
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 88.2 bits (217), Expect = 2e-14, Method: Composition-based stats.
Identities = 40/71 (56%), Positives = 56/71 (78%)
Query: 788 KGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTI 847
+G FQALF CIERLL SL V++ VLPAA++AESIWT++FGF K++++ L +Y + + T+
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTV 62
Query: 848 FKGTSMLEKKV 858
F+GTS L K V
Sbjct: 63 FQGTSTLHKLV 73
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 76/140 (54%), Gaps = 11/140 (7%)
Query: 709 LSSATAIFRECFDPII-AECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGL 765
L A +I ECF PII A G D+IP ++Y + ++ G Y+V+L ++S
Sbjct: 1122 LVIALSIMEECFLPIIDARTGIDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVAS 1181
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
+R+ G VAE+PL+ATC E + +G + L IE++L SL VE L+L A WT
Sbjct: 1182 IRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMA 1241
Query: 826 FGF--------RKMSRERLL 837
FGF + +SR RL+
Sbjct: 1242 FGFVPIDDLDRKNLSRLRLV 1261
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 17/109 (15%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQ--PYRHIYTSNG--MTLHDI-AIS 495
L+ G GI C CC+ + S F+ HAG+ RQ P +++ +G TL + A S
Sbjct: 966 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGL---RQEIPSLNLFLGSGKSYTLCQLQAWS 1022
Query: 496 LAMGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACL 535
+ R+ +DD C +CGDG L+ C+ CP ++H CL
Sbjct: 1023 IEHKARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCL 1071
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 86.3 bits (212), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 66/102 (64%), Gaps = 1/102 (0%)
Query: 210 EYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFE 268
++P N+++LL+TG+L+G V YI S++ L G++ G C C CN SK VSA+ FE
Sbjct: 405 KHPGNIRELLNTGLLEGMPVMYIIPHSKKAVLKGVITGCNIRCFCLSCNGSKAVSAYYFE 464
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
QHAG+ +HP ++IYL NG + +++ +PL LE+ ++
Sbjct: 465 QHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIR 506
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 79/160 (49%), Gaps = 10/160 (6%)
Query: 709 LSSATAIFRECFDPIIAECGRDLIPVMVYGRNIS---------GQEFGGMYSVILTVKSV 759
+ A +F+ F P++ + GRDL+ ++ G G F G + +L +
Sbjct: 221 IDQALRLFKSSFSPLLMDNGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRGA 280
Query: 760 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 819
VV+A LR+FGR AELP VAT Y+ G + L +E LL S V LV+P+ +
Sbjct: 281 VVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPLL 340
Query: 820 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 859
+W KFGF ++ + + + D+ + + TS L V+
Sbjct: 341 PMWAAKFGFTPLTEQEVAAIE-DWVVDTDRETSTLSTVVR 379
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 120/269 (44%), Gaps = 46/269 (17%)
Query: 609 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC----CDDCNRIHAALQDFVSNRAQTI 664
++ CD+C FH C+ L+ P+ WFC C C S+
Sbjct: 423 LLMCDRCPSMFHHACV------GLESTPQGDWFCPACTCAICG----------SSDLDDP 466
Query: 665 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 724
PA++ + +G D + + Q ++++ EE+E + L A + RECF +I
Sbjct: 467 PATTTT--------QGFSSDRMVISCE-QCRRESRDGEEEEHAKLCMALDVLRECFVTLI 517
Query: 725 -AECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 781
DL +V+ + +F G Y V L +++ LR++G EVAE+PLV T
Sbjct: 518 EPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGT 577
Query: 782 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT-KKFGFRKMSRERLLKYQ 840
+ +G + L I++LL + VE LVLPA + + WT FG R+M Q
Sbjct: 578 RFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIREMG-------Q 630
Query: 841 RDFQ------LTIFKGTSMLEKKVQCLPE 863
D Q + F+GT M K++ P+
Sbjct: 631 ADRQDVAHHAILRFQGTIMCHKQLPPQPQ 659
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 114/313 (36%), Gaps = 84/313 (26%)
Query: 352 SIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGG--T 409
++ SCS+ + P +C + +++ + + R V RP Q R EG T
Sbjct: 224 TMASCSNNNNKRPAPPAACKTATSSKKKKKKKASLQQAR---VLRP---QPRNEEGNALT 277
Query: 410 KKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIV---CDCC----NK---- 458
R L LL L ++L Y K G G+G+V C C NK
Sbjct: 278 PARARTLLSLLIDKKILAPRDQLIYTTKR-----GLITGDGMVKCMCGGCINNNNKRRVA 332
Query: 459 EISPSQFEAHA----GMAARRQPYRHIYTSNGMTLHDIAISLAMG--------------- 499
E + ++F H ++ RQP+ ++ +G +L + L M
Sbjct: 333 EYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKKY 392
Query: 500 ----------QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
+R+ D +C VC D LL+C+ CP FH AC+ P+ W CP
Sbjct: 393 LPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPA 452
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV 609
C C IC S ++ + FS+ DR V
Sbjct: 453 C--------------------------TCAICGSSDLDDPPA-TTTTQGFSS----DRMV 481
Query: 610 IYCDQCEKEFHVG 622
I C+QC +E G
Sbjct: 482 ISCEQCRRESRDG 494
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 65/117 (55%), Gaps = 2/117 (1%)
Query: 196 RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGC 253
+N+E+KMSKK+ + P VK+L TG+L+G V Y+ + + L G + G LC C
Sbjct: 67 KNLELKMSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSC 126
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CN +V+ +FE HA + R +I ENGK + ++ +TAPL LE ++
Sbjct: 127 AFCNGRRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQ 183
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 80.1 bits (196), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 93/200 (46%), Gaps = 27/200 (13%)
Query: 355 SCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDN 414
S S L + S S S D G+ H++ L V+RP +G +
Sbjct: 476 SASPLHHQTEKSTGSSSHHVDGGKSSKHVRSTLS-----VRRPVRGDNSEGDGFVPSSEK 530
Query: 415 DLHRLLFLPNG-LPDGERLTYIVKGQRLRFGCKQG----NGIVCDCCNKEISPSQFEAHA 469
+ +G L E++ Y+ QR +G +GI C CC+K +S S+FE HA
Sbjct: 531 QTILAWLIDSGTLKLSEKVMYM--NQRRTHAMLEGWITRDGIHCGCCSKILSVSKFEIHA 588
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-------------SDDMCHVCGD 516
G + RQP+++I+ + G++L I A +++ G +DD C +CGD
Sbjct: 589 G-SKLRQPFQNIFLNTGVSLFQCQID-AWDKQKGAGNIGFCSVDVIADDPNDDACGICGD 646
Query: 517 GENLLLCNGCPLAFHAACLD 536
G +L+ C+GCP FH CLD
Sbjct: 647 GGDLVCCDGCPSTFHQRCLD 666
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 101/231 (43%), Gaps = 41/231 (17%)
Query: 605 DDRTVIYCDQCEKEFHVGCL-------------RKNGLCDLKEIPKDKWFCCDDCNRIHA 651
D ++ CD C FH CL R N C L I N +
Sbjct: 646 DGGDLVCCDGCPSTFHQRCLDIRGHLMPDWIFLRFNYRCFLLVIGIAPIV---HANSVGQ 702
Query: 652 ALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDGTMNDVQ----WQMLKKAQCFEE--- 703
L+ + R Q IPA + RK++ +G+ + G ++++ W ++ + +C +
Sbjct: 703 LLKMLLRPRMQ-IPAKCV----RKNLSEGVKKYVGVKHELEAGFSWSLVHR-ECADSDLF 756
Query: 704 --------KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSV 752
+ S L+ A + ECF PI+ G +++ ++Y G N + FGG Y+
Sbjct: 757 LGEHPHIVENNSKLALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTA 816
Query: 753 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 803
+L VV++ +R G +AE+P + T Y+ +G + LFS IE + C
Sbjct: 817 LLERGDEVVASASIRFHGNHLAEMPFIGTRHVYRHQGMCRRLFSVIESVKC 867
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 11/87 (12%)
Query: 212 PTNVKKLLSTGILDGACVK------YISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSA 264
P++ K+ + ++D +K Y++ R L+G + G CGC C SK++S
Sbjct: 526 PSSEKQTILAWLIDSGTLKLSEKVMYMNQRRTHAMLEGWITRDGIHCGC--C--SKILSV 581
Query: 265 HEFEQHAGAKTRHPNNHIYLENGKPIY 291
+FE HAG+K R P +I+L G ++
Sbjct: 582 SKFEIHAGSKLRQPFQNIFLNTGVSLF 608
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 78/151 (51%), Gaps = 4/151 (2%)
Query: 709 LSSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGL 765
L A I ECF II D+ +V+ R + F G Y ++L +VS G
Sbjct: 8 LCIAVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGT 67
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
RI G++ AELPL+ T Y+ +G + L + +E+LL L VE L+LPA + WT
Sbjct: 68 FRICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCS 127
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
FGF MS L+ + L+ F+GT+M +K
Sbjct: 128 FGFTVMSNSERLELAGNSILS-FQGTTMCQK 157
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 69/125 (55%), Gaps = 1/125 (0%)
Query: 735 MVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 793
MV ++ +G+ +F G+Y +LT + VVSA +L++ EVAEL L+AT E + KG F
Sbjct: 109 MVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSL 168
Query: 794 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 853
L S IE L + NV L P + IW++K G+ +S E+ L +F S+
Sbjct: 169 LLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFANLSL 228
Query: 854 LEKKV 858
++K +
Sbjct: 229 VQKSL 233
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 75.1 bits (183), Expect = 2e-10, Method: Composition-based stats.
Identities = 50/156 (32%), Positives = 79/156 (50%), Gaps = 4/156 (2%)
Query: 704 KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVV 760
+E L A + ECF ++ DL +V+ R + F G Y + L +
Sbjct: 4 EEHGKLCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGEL 63
Query: 761 VSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAES 820
++ G LR++G++VAELPLV T ++ +G L + +E LL VE LVLPA +
Sbjct: 64 ITVGTLRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQ 123
Query: 821 IWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
WT FGF+ M++ + L + + F+GT+M K
Sbjct: 124 TWTGSFGFQVMTQSQKLDIAQH-TIMCFQGTTMCHK 158
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 74.3 bits (181), Expect = 3e-10, Method: Composition-based stats.
Identities = 52/206 (25%), Positives = 88/206 (42%), Gaps = 30/206 (14%)
Query: 640 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ 699
+FC +C R + AL+ R++ T H ++
Sbjct: 67 YFCGKECQRSYEALEAATGRRSRIRDEPEQYTFELVHYKQD------------------- 107
Query: 700 CFEEKEKSLLSSATAIFRECFDPIIAECGRDLI---------PVMVYGRNISGQEFGGMY 750
+ +S + +A +FR F P+I E GRDL+ P G F
Sbjct: 108 --DRTVRSAVETAMRMFRTSFAPLIMENGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFR 165
Query: 751 SVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENL 810
IL + +++A LR+FG + AE+P V+T ++ G + L +E LL + V L
Sbjct: 166 LAILRMGGTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCL 225
Query: 811 VLPAAEKAESIWTKKFGFRKMSRERL 836
V+P+ + +WT KFGF K++ + +
Sbjct: 226 VIPSINELLPMWTNKFGFAKIATDEV 251
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 104/248 (41%), Gaps = 58/248 (23%)
Query: 653 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 712
LQ+ ++ + P S + R H E + + + K+ +C S ++ A
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEE--------VPEEVLALDKRVEC-----NSKIAVA 1288
Query: 713 TAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLR-- 767
++ ECF PI+ + G +LI +VY G N + +F G Y +IL +++A +R
Sbjct: 1289 LSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLK 1348
Query: 768 ---------------------------------------IFGREVAELPLVATCREYQGK 788
I G ++AE+P + T Y+ +
Sbjct: 1349 EKNILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQ 1408
Query: 789 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 848
G + L IE +L SLN+E L++PA + WT KFGF + + + + +F
Sbjct: 1409 GMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEV-KSVSMLVF 1467
Query: 849 KGTSMLEK 856
GT +L+K
Sbjct: 1468 PGTGLLQK 1475
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 16/116 (13%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT---LHDIAISLAM---GQRR 502
+GI C CC++ IS +F HAG + PYR+I +G+ LH + + M +R+
Sbjct: 935 DGINCSCCSEVISVPEFVTHAGSEVNK-PYRNILV-DGLDIDLLHCLINAWNMQSDAERQ 992
Query: 503 T--------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+DD C +CGDG NL+ C+GCP FH +CL +P W C NC
Sbjct: 993 DFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNC 1048
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
LRI G +VAE+PLVAT +Y+ +G Q L +E++L L+VE LVLPA + +W
Sbjct: 95 LRIHGHKVAEMPLVATAFKYRRQGMCQVLVHELEKMLSQLHVERLVLPAISERSELWQSL 154
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
FGF +MS L+ R F F+GT+M +K
Sbjct: 155 FGFSEMSSAERLELLR-FPFLGFQGTTMFQK 184
>gi|256052547|ref|XP_002569826.1| chromodomain helicase DNA binding protein [Schistosoma mansoni]
gi|353229870|emb|CCD76041.1| putative chromodomain helicase DNA binding protein [Schistosoma
mansoni]
Length = 1966
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 12/139 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMS---RSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +S+S + +
Sbjct: 330 DYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGTWSCPHCEKEGITSVSKGNKECE 389
Query: 564 LKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC 623
G A ++ G ++ + P+ +F D +I CD C +H+ C
Sbjct: 390 DSGSEPAVTSDKDGKIVSAVHPTSP---EKDEHQEFCTECRDGGDLICCDNCPASYHIAC 446
Query: 624 LRKNGLCDLKEIPKDKWFC 642
L + L IP+ W C
Sbjct: 447 L----IPPLANIPEGVWLC 461
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C DG +L+ C+ CP ++H ACL P L IPE W CP C
Sbjct: 421 EFCTECRDGGDLICCDNCPASYHIACLIPPLANIPEGVWLCPRC 464
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 75/145 (51%), Gaps = 2/145 (1%)
Query: 715 IFRECFDPIIAECGRDLIPVMVYGRN-ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 773
IF++ F E D+I MV G+N ++F GMY +LT + VVSA +L++ ++
Sbjct: 62 IFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQI 120
Query: 774 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 833
AEL L+AT E + KG F L IE L + NV L P + IW++K GF +S
Sbjct: 121 AELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSA 180
Query: 834 ERLLKYQRDFQLTIFKGTSMLEKKV 858
E L +FK +++K +
Sbjct: 181 EEKESMLESHPLVMFKNLVLVQKSL 205
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 71/172 (41%), Gaps = 14/172 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + SM V+ K
Sbjct: 51 DYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGSWSCPHCEK-EGISMGSQVEGKA 109
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ + SP ++ +F D +I C+ C +H+ CL
Sbjct: 110 TGTKMAPDKSAKQVAAASPEKD------EHQEFCTECHDGGDLICCENCPVSYHLDCL-- 161
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
+ L IP+ W C C + A + ++ R P S + HI
Sbjct: 162 --IPPLTNIPEGVWLCPRCGCKPLKARVSKILTWRWFEPPKESDELDHTHHI 211
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 306 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 365
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 366 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 413
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 414 ----PLPEIPNGEWLC 425
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 385 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 428
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 225 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 284
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 285 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 332
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 333 ----PLPEIPNGEWLC 344
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 304 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 347
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 352 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 411
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 412 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 459
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 460 ----PLPEIPNGEWLC 471
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 431 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 474
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 353 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 412
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 413 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 460
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 461 ----PLPEIPNGEWLC 472
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 432 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 475
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 404
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 405 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 452
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 453 ----PLPEIPNGEWLC 464
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 427
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 428 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 475
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 476 ----PLPEIPNGEWLC 487
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 447 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 490
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 422 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 481
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 482 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 529
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 530 ----PLPEIPNGEWLC 541
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 501 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 544
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 404
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 405 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 452
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 453 ----PLPEIPNGEWLC 464
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 358 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 417
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 418 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 465
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 466 ----PLPEIPNGEWLC 477
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 437 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 480
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 365 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 424
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 425 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 472
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 473 ----PLPEIPNGEWLC 484
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 444 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 487
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 429 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 476
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 477 ----PLPEIPNGEWLC 488
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 448 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 491
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 472 ----PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 434
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 435 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 482
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 483 ----PLPEIPNGEWLC 494
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 454 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 497
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 392 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 451
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 452 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 499
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 500 ----PLPEIPNGEWLC 511
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 471 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 514
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 427
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 428 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 475
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 476 ----PLPEIPNGEWLC 487
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 447 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 490
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 361 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 420
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 421 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 468
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 469 ----PLPEIPNGEWLC 480
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 440 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 483
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 116 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 175
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 176 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 223
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 224 ----PLPEIPNGEWLC 235
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 195 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 238
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 290 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 349
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 350 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP 397
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 398 ----PLPEIPNGEWLC 409
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 369 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 412
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 260 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 319
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG P E D + +F D ++ CD C +H+ CL
Sbjct: 320 --EEILEEVGG------DPEEEDDHHM----EFCRVCKDGGELLCCDPCPSSYHIHCLNP 367
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 368 P----LPEIPNGEWLC 379
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP+C
Sbjct: 339 EFCRVCKDGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 69/161 (42%), Gaps = 17/161 (10%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R DL
Sbjct: 387 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEARD-DLS- 444
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
EA G + + ++ I CR D ++ CD C +H+ CL
Sbjct: 445 --EAEGEDDDDRRDEGMEEEDDHHIEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 496
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 666
L EIP +W C C + +Q ++ R PA
Sbjct: 497 ----PLPEIPNGEWICPRCKCPPMKGKVQKVLTWRWGEAPA 533
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 53/88 (60%), Gaps = 4/88 (4%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSD 508
+GI C CC+K ++ S+FE HAG + RQP+++I+ ++G + I +D
Sbjct: 582 DGIHCGCCSKILAVSKFEIHAG-SKLRQPFQNIFLNSGGAGN---IGFCSVDVIADDPND 637
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLD 536
D C +CGDG +L+ C+GCP FH CLD
Sbjct: 638 DACGICGDGGDLVCCDGCPSTFHQRCLD 665
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 56/128 (43%), Gaps = 23/128 (17%)
Query: 707 SLLSSATAIFRECFDPIIAE--------CGRDLIPVMVYG---------------RNISG 743
S L+ A + ECF PII C R+ V+ +G N +
Sbjct: 767 SKLALALTVMDECFLPIIDRRSGHCKKFCLRNFTTVIFFGISLCWFVCLYIAFRRSNFNR 826
Query: 744 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 803
FGG Y+ +L +V++ +R G +AE+P + T Y+ +G + LFS +E +
Sbjct: 827 LNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESVSS 886
Query: 804 SLNVENLV 811
+ +V L
Sbjct: 887 TADVAKLT 894
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 6/84 (7%)
Query: 206 VVPNEYPTNVKKLLSTGILD-GACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVS 263
V +E T + L+ +G L V Y++ R R L+G + G CGC C SK+++
Sbjct: 539 VPSSEKRTVLAWLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGC--C--SKILA 594
Query: 264 AHEFEQHAGAKTRHPNNHIYLENG 287
+FE HAG+K R P +I+L +G
Sbjct: 595 VSKFEIHAGSKLRQPFQNIFLNSG 618
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 735 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 782
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 783 IQWEPKDDEDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 836
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 837 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGLPPPK 892
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 893 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 931
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 337 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 384
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 385 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 438
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 439 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPK 494
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 495 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 533
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 418
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 419 IQWEPKDDEDDEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 472
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 473 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPK 528
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 529 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 567
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 324 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 371
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 372 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 425
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 426 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPK 481
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 482 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 520
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 330 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 377
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 378 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 431
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 432 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPK 487
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 488 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 526
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 482 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 529
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 530 IQWEPKEDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 583
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + + PA + + +E GI
Sbjct: 584 ----PLAEIPNGEWLCPRCTCPPLKGKVQRILHWKWTEPPAPFMVGLPGPEVEPGIPPPK 639
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 640 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 678
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 429 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 482
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 483 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPK 538
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 539 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 577
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 61/136 (44%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG ++ + CR D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDA----EEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 478
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 479 ----PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ +
Sbjct: 372 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN--- 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ G E+ V+ ++ + CR D ++ CD C +H+ CL
Sbjct: 429 ---SEGEEILEDVVGDAEEEDDHHMEFCR------VCKDGGELLCCDACPSSYHIHCLNP 479
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 480 ----PLPEIPNGEWLC 491
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 451 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ +
Sbjct: 372 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN--- 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ G E+ V+ ++ + CR D ++ CD C +H+ CL
Sbjct: 429 ---SEGEEILEDVVGDAEEEDDHHMEFCR------VCKDGGELLCCDACPSSYHIHCLNP 479
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 480 ----PLPEIPNGEWLC 491
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 451 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ +
Sbjct: 365 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN--- 421
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ G E+ V+ ++ + CR D ++ CD C +H+ CL
Sbjct: 422 ---SEGEEILEDVVGDAEEEDDHHMEFCR------VCKDGGELLCCDACPSSYHIHCLNP 472
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 473 ----PLPEIPNGEWLC 484
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 444 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 487
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 320 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 367
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P E E+ + CR D ++ CD C +H+ CL
Sbjct: 368 IQWEPKDEEEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 421
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 422 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPK 477
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 478 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 516
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 342 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 389
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 390 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 443
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 444 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPR 499
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 500 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 538
>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Strongylocentrotus purpuratus]
Length = 2202
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 55/136 (40%), Gaps = 38/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP AFH CLDP L PE W CPNC
Sbjct: 349 DYCEVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWSCPNCE--------------- 393
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G I P++ + CR D ++ C+QC +H+ CL
Sbjct: 394 ----------GEGIPEPEPADEH-MEFCR------VCHDGGELLCCEQCPSSYHIFCLNP 436
Query: 627 NGLCDLKEIPKDKWFC 642
L++IP D W C
Sbjct: 437 ----PLRKIPDDDWVC 448
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C CP ++H CL+P L IP+ W CP C
Sbjct: 408 EFCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPRC 451
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 70 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 117
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 118 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 171
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 172 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPR 227
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 228 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 266
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 1/92 (1%)
Query: 767 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 826
RI G ++AE+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 827 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
GF+ + + K + + I GT +LEK++
Sbjct: 61 GFKPVEPSKRQKI-KSLNILIIHGTGLLEKRL 91
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 346 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 393
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 394 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 447
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 448 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFVVGLPGPEVEPGMPPPR 503
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 504 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 542
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 355 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 402
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 403 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 456
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 457 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFVVGLPGPEVEPGMPPPR 512
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 513 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 551
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 82/219 (37%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 346 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 393
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 394 IQWEPKDDDEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 447
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E G+
Sbjct: 448 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFVVGLPGPEVEPGMPPPR 503
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 504 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 542
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 355 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE-DASE 413
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G E E GG + ++ + CR D ++ CD C +H+ CL
Sbjct: 414 GEE--DNEAGGE-----AEEDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLN- 459
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 460 ---PPLPEIPNGEWIC 472
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 432 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 475
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 59/136 (43%), Gaps = 21/136 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R +G
Sbjct: 413 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREEGSEG 472
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E +VG ++ + CR D ++ CD C +H+ CL
Sbjct: 473 --EDDNGDVGEM-------EDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLNP 517
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 518 ----PLPEIPNGEWIC 529
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 489 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 532
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 74/151 (49%), Gaps = 18/151 (11%)
Query: 709 LSSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGL 765
L A + E F II RDL +V+ R + F G Y+++
Sbjct: 8 LCIALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL------------ 55
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
+ G++ AELPL+ T +Y+ +G + L + +E+LL L VE L+LP + WT
Sbjct: 56 --VCGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGS 113
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 856
FGF +MS +Y + L+ F+GT+M +K
Sbjct: 114 FGFTEMSYSDRFQYAANIILS-FQGTTMCQK 143
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 66/160 (41%), Gaps = 12/160 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP AFH CLDP L PE W CP+C + + SRS
Sbjct: 366 DYCEVCQQGGEIMLCDTCPRAFHLVCLDPELEEAPEGSWSCPHCEKEGVVAASRSTTPAT 425
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G G R S N + + +F D +I C +C +H CL
Sbjct: 426 G----GDMSQNPQNIRKSAQPNEEEK-DEHQEFCNECKDGGDLICCAKCPVSYHPECL-- 478
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIP 665
L EIP+ W C C + A + ++ R P
Sbjct: 479 --YPPLSEIPEGPWLCPRCGCGPLKAKVHKILTWRWMEPP 516
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 815 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 862
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 863 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 916
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 917 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 972
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 973 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 1011
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 56/136 (41%), Gaps = 29/136 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + P W CP+C + + D
Sbjct: 340 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEEPPGGKWSCPHCE----NDLVNDNDAVT 395
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
EA A+ G C RLCR D ++ CD C +H CL
Sbjct: 396 SKEAAPAKAGNMEFC----------RLCR---------DGGELLCCDSCPSSYHRYCL-- 434
Query: 627 NGLCDLKEIPKDKWFC 642
+ L IP+ W C
Sbjct: 435 --IPPLTTIPEGDWHC 448
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
G+ + C +C DG LL C+ CP ++H CL P L IPE W CP C
Sbjct: 404 AGNMEFCRLCRDGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 447 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 494
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 495 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 548
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 549 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 604
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 605 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 643
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 61/136 (44%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLD + PE W CP+C + ++ + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHMVCLDSDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 404
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E EVGG L ++ + CR D ++ CD C +H+ CL
Sbjct: 405 --EEILEEVGG----DLEEEDDHHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 452
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 453 ----PLPEIPNGEWLC 464
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 415
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P E E+ + CR D ++ CD C +H+ CL
Sbjct: 416 IQWEPKDEEEEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 469
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 470 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSVPPPK 525
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 526 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 564
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 354 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE-DASE 412
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G E E GG + ++ + CR D ++ CD C +H+ CL
Sbjct: 413 GEE--DNEAGG-----EAEEDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLNP 459
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 460 P----LPEIPNGEWIC 471
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 431 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 474
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 354 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE-DASE 412
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G E E GG + ++ + CR D ++ CD C +H+ CL
Sbjct: 413 GEE--DNEAGG-----EAEEDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLNP 459
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 460 P----LPEIPNGEWIC 471
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 431 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 474
>gi|297820632|ref|XP_002878199.1| hypothetical protein ARALYDRAFT_907299 [Arabidopsis lyrata subsp.
lyrata]
gi|297324037|gb|EFH54458.1| hypothetical protein ARALYDRAFT_907299 [Arabidopsis lyrata subsp.
lyrata]
Length = 173
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/50 (60%), Positives = 38/50 (76%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
+ ++ YP+NVK LL TGIL+GA VKYIST R+L GI++ GGYLCGC
Sbjct: 21 QGILDLSYPSNVKNLLETGILEGAPVKYISTPHVRELQGIIHSGGYLCGC 70
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 594 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 641
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 642 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 695
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 696 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 751
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 752 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 790
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 59/140 (42%), Gaps = 16/140 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CL+P L PE W CP+C Q +
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEDFED 438
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
+ + +EVG V ++ + CR D ++ CD C +H+
Sbjct: 439 FEEDSEDRVISEVGVRVATGAEEEDDDHMEFCR------VCKDGGELLCCDTCTSSYHIH 492
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL L EIP +W C
Sbjct: 493 CLNP----PLPEIPNGEWLC 508
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWR 546
+ ++ + +A G + C VC DG LL C+ C ++H CL+P L IP W
Sbjct: 448 ISEVGVRVATGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWL 507
Query: 547 CPNC 550
CP C
Sbjct: 508 CPRC 511
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 68/163 (41%), Gaps = 22/163 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R +G
Sbjct: 257 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREEGSEG 316
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ +VG ++ + CR D ++ CD C +H+ CL
Sbjct: 317 --DEDNGDVGEM-------EDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLNP 361
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASS 668
L EIP +W C C + +Q ++ R P +
Sbjct: 362 ----PLPEIPNGEWICPRCTCPSMKGKVQKILTWRWGEPPTPT 400
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 392 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 445
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 446 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 501
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 502 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 58/136 (42%), Gaps = 18/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ +
Sbjct: 366 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN--- 422
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ G E + ++ + CR D ++ CD C +H+ CL
Sbjct: 423 ---SEGEETMEDAVGDAEEEDDHHMEFCR------VCKDGGELLCCDACPSSYHIHCLNP 473
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 474 ----PLPEIPNGEWLC 485
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 445 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 488
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 392 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 445
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 446 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 501
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 502 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHE 266
V E+ + K LL +G+L+G V+Y+ E G+V G LC C C + +A
Sbjct: 503 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLGSGVVKSGVILCNCRHCKSHQGFNASS 562
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQE 296
FE+H G+ RHP++ I+L+NG+ + ++++
Sbjct: 563 FEKHVGSTARHPSDFIFLDNGRRLREVLED 592
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 414 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 461
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 462 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 515
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 516 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 571
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 572 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 610
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHE 266
V E+ + K LL +G+L+G V+Y+ E G+V G LC C C + +A
Sbjct: 530 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLGSGVVKSGVILCNCRHCKSHQGFNASS 589
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQE 296
FE+H G+ RHP++ I+L+NG+ + ++++
Sbjct: 590 FEKHVGSTARHPSDFIFLDNGRRLREVLED 619
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 392 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 445
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 446 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 501
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 502 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 392 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 445
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 446 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 501
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 502 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 60/138 (43%), Gaps = 20/138 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRS--VDL 564
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 305 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEARDDLSDG 364
Query: 565 KGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
+G E + G + ++ I CR D ++ CD C +H+ CL
Sbjct: 365 EGEDEEDRRDEG------VEEEDDHHIEFCR------VCKDGGELLCCDTCPSSYHIHCL 412
Query: 625 RKNGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 413 NP----PLPEIPNGEWIC 426
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+ C VC DG LL C+ CP ++H CL+P L IP W CP C+
Sbjct: 386 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 430
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 81/219 (36%), Gaps = 33/219 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C K
Sbjct: 347 DYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHCE-------------KE 393
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ + E+ + CR D ++ CD C +H+ CL
Sbjct: 394 GIQWEPKDDDEEDEEGGEEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHLHCLNP 447
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + + PA ++ + + I
Sbjct: 448 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWKWTEPPAPFITVLPPPDADPSIPLPK 503
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 504 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 542
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 392 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 445
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 446 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 501
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 502 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 54 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 101
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 102 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 155
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 156 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 211
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 212 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 250
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 81/219 (36%), Gaps = 32/219 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 329 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 376
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 377 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 430
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 685
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 431 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPK 486
Query: 686 TMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 487 PLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 525
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 66/163 (40%), Gaps = 18/163 (11%)
Query: 494 ISLAMGQRRTTGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
++ A + +TG S+D C VC + E L+ C CP +FH C+ P WRC C
Sbjct: 975 VTSAEDEEESTGEVPESNDFCEVCLNDEQLISCGSCPRSFHLICIQMKRAPRRDWRCLAC 1034
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
G DLK +E A + S E+F I C R I
Sbjct: 1035 TAGVKKYKQELKDLKKIIEEKEA-----FEAKDSNEEDFSINQC----LKCGELLSRGHI 1085
Query: 611 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 653
C C +++H+ C DL + PK W+C C + +L
Sbjct: 1086 ECIGCGRKYHLAC------ADLTKRPKGDWYCKKRCEPGYVSL 1122
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/143 (23%), Positives = 50/143 (34%), Gaps = 26/143 (18%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMS 559
G ++ C VC + ++ C C L FH C+ P L IP W C C
Sbjct: 910 GLTNARCKVCRGKATPDRMIRCETCDLVFHLPCIKPALREIPRGEWFCKACTPETVPDSP 969
Query: 560 RSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
R E G + S+DF +D +I C C + F
Sbjct: 970 RKKPKVTSAEDEEESTG---------------EVPESNDFCEVCLNDEQLISCGSCPRSF 1014
Query: 620 HVGCLRKNGLCDLKEIPKDKWFC 642
H+ C++ +K P+ W C
Sbjct: 1015 HLICIQ------MKRAPRRDWRC 1031
>gi|443719422|gb|ELU09603.1| hypothetical protein CAPTEDRAFT_166178 [Capitella teleta]
Length = 1711
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 88/226 (38%), Gaps = 53/226 (23%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C + SS++ + + G
Sbjct: 154 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEQAPEGKWSCPHCVR---SSVNIAFEENG 210
Query: 567 GLEAPGAE-VGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
+E E + C +C+ D ++ CD C +HV CL
Sbjct: 211 PVEQDDDEHMEFCRVCK----------------------DGGELLCCDTCTSAYHVFCLA 248
Query: 626 KNGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQ-----------TIPAS--SLST 671
N + +P W C C + A +Q + R + P S +
Sbjct: 249 PN----VANVPDGIWHCPRCSCQPLKARVQKILFWRWKVPMKVEDELDHVTPHSPNKKAQ 304
Query: 672 INRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR 717
I R+H E+ V+W L C E L A+FR
Sbjct: 305 IEREHKEREFF-------VKWHELSYWHCEWISELQLDVYHPAMFR 343
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 59/136 (43%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D+
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEARE-DVSD 439
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E G E G ++ + CR D ++ CD C +H+ CL
Sbjct: 440 AEEDNG-ETGEM------EEDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLNP 486
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 487 ----PLPEIPNGEWIC 498
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 458 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 501
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 57/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R G
Sbjct: 367 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREDGSDG 426
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + G ++ + CR D ++ CD C +H+ CL
Sbjct: 427 --EEDNGDAGDM------EEDDHHMEFCR------VCKDGGELLCCDSCPSSYHIHCLNP 472
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 473 ----PLPEIPNGEWIC 484
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 444 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 487
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 30/44 (68%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D CHVC DG LL C+ CPL++H CL+P L IPE WRCP C
Sbjct: 1 DFCHVCKDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRC 44
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 58/142 (40%), Gaps = 31/142 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C K
Sbjct: 362 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCE-------------KE 408
Query: 567 GLEAPGAEVGGCVICRLSPS------ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFH 620
G++ E + + + E+ + CR D ++ CD C +H
Sbjct: 409 GVQWEAKEDNSEIDDDMDDTVGDPEEEDHHMEFCR------VCKDGGELLCCDACPSSYH 462
Query: 621 VGCLRKNGLCDLKEIPKDKWFC 642
+ CL L EIP +W C
Sbjct: 463 IHCLNP----PLPEIPNGEWLC 480
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 440 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 483
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 57/145 (39%), Gaps = 37/145 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHSSS 557
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++ +S
Sbjct: 361 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDNSEI 420
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
D G E + C +C+ D ++ CD C
Sbjct: 421 DDDMDDTVGDPEEEDHHMEFCRVCK----------------------DGGELLCCDACPS 458
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFC 642
+H+ CL L EIP +W C
Sbjct: 459 SYHIHCLNP----PLPEIPNGEWLC 479
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 439 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 482
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 57/145 (39%), Gaps = 37/145 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHSSS 557
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++ +S
Sbjct: 362 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDNSEL 421
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
D G E + C +C+ D ++ CD C
Sbjct: 422 DDDLDDAVGDPEEEDHHMEFCRVCK----------------------DGGELLCCDVCPS 459
Query: 618 EFHVGCLRKNGLCDLKEIPKDKWFC 642
+H+ CL L EIP +W C
Sbjct: 460 SYHIHCLNP----PLPEIPNGEWLC 480
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 440 EFCRVCKDGGELLCCDVCPSSYHIHCLNPPLPEIPNGEWLCPRC 483
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 58/140 (41%), Gaps = 16/140 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CL+P L PE W CP+C Q +
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEDFED 440
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
+ + +EV V ++ + CR D ++ CD C +H+
Sbjct: 441 FEEDSEDRVISEVSSGVPAGGDDEDDDHMEFCR------VCKDGGELLCCDTCTSSYHIH 494
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL L EIP +W C
Sbjct: 495 CLNP----PLPEIPNGEWLC 510
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 8/54 (14%)
Query: 505 GGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
GG D+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 460 GGDDEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 513
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 55/136 (40%), Gaps = 25/136 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 328 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 374
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ + E+ + CR D ++ CD C +H+ CL
Sbjct: 375 GIQWEAKDDEEEEEEAPGEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 428
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 429 ----PLPEIPNGEWLC 440
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 400 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 443
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 55/136 (40%), Gaps = 25/136 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 352 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 398
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ + E+ + CR D ++ CD C +H+ CL
Sbjct: 399 GIQWEAKDDEEEEDEVAGEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 452
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 453 ----PLPEIPNGEWLC 464
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|334821760|gb|AEG90850.1| chromodomain-helicase-DNA-binding protein 5 [Apostichopus
japonicus]
Length = 110
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 57/136 (41%), Gaps = 33/136 (24%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C G + + V+ +
Sbjct: 5 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEAAPEGKWSCPQCEAGAAPPSAHVVEQE- 63
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
++ + CR D ++ CDQC +H+ CL
Sbjct: 64 --------------------QDEHMEFCRVRQ------DGGELLCCDQCPSSYHIFCLNP 97
Query: 627 NGLCDLKEIPKDKWFC 642
LK+IP +W C
Sbjct: 98 P----LKKIPDGEWLC 109
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 57/141 (40%), Gaps = 30/141 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-----GHSSSMSRS 561
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 457 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 516
Query: 562 VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
+ +GG E E C R+C+ D ++ CD C +H+
Sbjct: 517 EEERGGKERRREEDDHMEYC----------RVCK---------DGGELLCCDACISSYHI 557
Query: 622 GCLRKNGLCDLKEIPKDKWFC 642
CL L +IP +W C
Sbjct: 558 HCLNP----PLPDIPNGEWLC 574
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+RR + C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 525 RRREEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 577
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 63/161 (39%), Gaps = 25/161 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 328 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 375
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 376 IQWEPKDDDDDEDEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 429
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 666
L EIP +W C C + +Q + R PA
Sbjct: 430 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 466
>gi|198428271|ref|XP_002124973.1| PREDICTED: similar to RING and PHD-finger domain-containing protein
KIAA1542 [Ciona intestinalis]
Length = 1966
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 6/76 (7%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--I 540
G+ +H++ + + T +D C VCG GEN LLLC+GC L +H ACL+P L +
Sbjct: 121 GVFVHELVVEKKINVVET-HENDTNCQVCGSGENEETLLLCDGCDLGYHCACLNPPLEQV 179
Query: 541 PESGWRCPNCRQGHSS 556
P W C NCR H++
Sbjct: 180 PSDEWFCINCRPSHAA 195
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 605 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
++ T++ CD C+ +H CL L+++P D+WFC +C HAA
Sbjct: 153 NEETLLLCDGCDLGYHCACLN----PPLEQVPSDEWFCI-NCRPSHAA 195
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 58/140 (41%), Gaps = 16/140 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CL+P L PE W CP+C Q +
Sbjct: 392 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEEFED 451
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
+ + +EV V ++ + CR D ++ CD C +H+
Sbjct: 452 FEEDSEDRVISEVSLGVPTGAEEEDDDHMEFCR------VCKDGGELLCCDTCTSSYHIH 505
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL L EIP +W C
Sbjct: 506 CLNP----PLPEIPNGEWLC 521
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 2/65 (3%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGW 545
+ ++++ + G + C VC DG LL C+ C ++H CL+P L IP W
Sbjct: 460 VISEVSLGVPTGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEW 519
Query: 546 RCPNC 550
CP C
Sbjct: 520 LCPRC 524
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 25/136 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 505 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 551
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ + + E+ + CR D ++ CD C +H+ CL
Sbjct: 552 GIQWEAKDEEEDEEEPVGEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 605
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 606 ----PLPEIPNGEWLC 617
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 577 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 620
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 63/161 (39%), Gaps = 25/161 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 432 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 479
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 480 IQWEPKDDDDDEDEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 533
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 666
L EIP +W C C + +Q + R PA
Sbjct: 534 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 570
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 63/161 (39%), Gaps = 25/161 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 318 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 365
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 366 IQWEPKDDDDDEDEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 419
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 666
L EIP +W C C + +Q + R PA
Sbjct: 420 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 456
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 63/161 (39%), Gaps = 25/161 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 432 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 479
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 480 IQWEPKDDDDDEDEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 533
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 666
L EIP +W C C + +Q + R PA
Sbjct: 534 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 570
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 28/123 (22%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS---SSMSRS 561
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG + + +
Sbjct: 303 NEDECAVCRDGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCLQGRALQDTPHAEE 362
Query: 562 VDLKGGLEAPGAEVGG-CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFH 620
+ LE+PGA G C +C D + C C FH
Sbjct: 363 PRPQEPLESPGAAPGARCGVC----------------------GDGAEALRCAHCAAAFH 400
Query: 621 VGC 623
+ C
Sbjct: 401 LRC 403
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 63/161 (39%), Gaps = 25/161 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ CL
Sbjct: 429 IQWEPKDDDDDEDEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCLNP 482
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 666
L EIP +W C C + +Q + R PA
Sbjct: 483 ----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 519
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 57/141 (40%), Gaps = 30/141 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-----GHSSSMSRS 561
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 203 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 262
Query: 562 VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
+ +GG E E C R+C+ D ++ CD C +H+
Sbjct: 263 EEERGGKERRREEDDHMEYC----------RVCK---------DGGELLCCDACISSYHI 303
Query: 622 GCLRKNGLCDLKEIPKDKWFC 642
CL L +IP +W C
Sbjct: 304 HCLNP----PLPDIPNGEWLC 320
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+RR + C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 271 RRREEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|242036737|ref|XP_002465763.1| hypothetical protein SORBIDRAFT_01g045403 [Sorghum bicolor]
gi|241919617|gb|EER92761.1| hypothetical protein SORBIDRAFT_01g045403 [Sorghum bicolor]
Length = 158
Score = 63.5 bits (153), Expect = 4e-07, Method: Composition-based stats.
Identities = 31/72 (43%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 247 GGYLCGCPL---CNF-SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPL 302
GGY C CP C + KV+SA +FE+HAGA++ +PN HI L NGK +Y + +L+ P
Sbjct: 3 GGYACACPASAGCGYHGKVLSARQFEKHAGAESNNPNGHILLRNGKSLYQLFHDLRHVPA 62
Query: 303 GILEEVVKKVAG 314
L + AG
Sbjct: 63 EALAAKFLEFAG 74
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 81/221 (36%), Gaps = 35/221 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 346 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 392
Query: 567 GL--EAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
G+ E + E+ + CR D ++ CD C +H+ CL
Sbjct: 393 GIQWEPKDDDDEEEEGGCEEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCL 446
Query: 625 RKNGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILF 683
L EIP +W C C + +Q + R PA + + +E +
Sbjct: 447 NP----PLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPP 502
Query: 684 DGTMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 503 PKPLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 543
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 71/185 (38%), Gaps = 54/185 (29%)
Query: 460 ISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN 519
++P + + G+ +R+P R + S G+++ D C VC G
Sbjct: 318 VAPRKAKTKFGVKNKRKPKRKMNDSKS------------GEQQE---HQDYCEVCQQGGE 362
Query: 520 LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGG 577
++LC+ CP A+H CLDP L PE W CP+C EA G +
Sbjct: 363 IILCDTCPRAYHLVCLDPELEETPEGKWSCPHC------------------EAEGTQEQD 404
Query: 578 CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 637
N RLC+ D ++ CD C +H+ CL L EIP
Sbjct: 405 ------DDEHNEFCRLCK---------DGGELLCCDSCTSAYHIFCLNP----PLSEIPD 445
Query: 638 DKWFC 642
W C
Sbjct: 446 GDWKC 450
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 409 NEFCRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 64/144 (44%), Gaps = 21/144 (14%)
Query: 428 DGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG 486
DGE L Y K GQ + G + GI +K + + FE AG R P H +T+ G
Sbjct: 20 DGELLRYKSKHGQLMAVGRAREGGIEVAGTSKLLGYTAFEDIAGSKYHR-PAEHTHTTTG 78
Query: 487 MTLHDIAISLAMGQRRTTGG-------------------SDDMCHVCGDGENLLLCNGCP 527
TL +A+ G G +DD+CH+CG G +L+ C CP
Sbjct: 79 RTLQSLALLAGGGAGGGAGKRGGKAAPPPEPVVELLEDDNDDLCHICGLGGDLMCCETCP 138
Query: 528 LAFHAACLDPLLIPESGWRCPNCR 551
FHAACL PE + CP CR
Sbjct: 139 GVFHAACLGLAAPPEGDYHCPLCR 162
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 58/140 (41%), Gaps = 16/140 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CL+P L PE W CP+C Q +
Sbjct: 275 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEEFED 334
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
+ + +EV V ++ + CR D ++ CD C +H+
Sbjct: 335 FEEDSEDRVISEVSLGVPMGAEEEDDDHMEFCR------VCKDGGELLCCDTCTSSYHIH 388
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL L EIP +W C
Sbjct: 389 CLNP----PLPEIPNGEWLC 404
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 2/65 (3%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGW 545
+ ++++ + MG + C VC DG LL C+ C ++H CL+P L IP W
Sbjct: 343 VISEVSLGVPMGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEW 402
Query: 546 RCPNC 550
CP C
Sbjct: 403 LCPRC 407
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 55/136 (40%), Gaps = 25/136 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 320 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 366
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ E E+ + CR D ++ CD C +H+ CL
Sbjct: 367 GIQWEPKEEEDEEEEGGEEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHLHCLNP 420
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 421 ----PLPEIPNGEWLC 432
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 392 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 55/136 (40%), Gaps = 25/136 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 335 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 381
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ E E+ + CR D ++ CD C +H+ CL
Sbjct: 382 GIQWEPKEEEDEEEEGGEEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHLHCLNP 435
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 436 ----PLPEIPNGEWLC 447
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 407 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|291225093|ref|XP_002732536.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1B-like
[Saccoglossus kowalevskii]
Length = 1438
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 59/143 (41%), Gaps = 19/143 (13%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C +C GD + LLLC+ C FH CL P L +P+ W CP C+ + SR D
Sbjct: 1092 CKICRKKGDEDKLLLCDECNQPFHLYCLRPALSYVPKGDWMCPACKPSVARRNSRGRDY- 1150
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
AE+ G + D + DD+ ++YC +C +H C
Sbjct: 1151 -------AELNGGSDSDEYDETDSDESEAEHDEMCCMCDDDQELVYCSRCPAAYHRECHD 1203
Query: 626 KNGLCDLKEIPKDKWFC--CDDC 646
L+ P+ KW C C +C
Sbjct: 1204 P----PLRNFPRGKWVCSACTNC 1222
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D+MC +C D + L+ C+ CP A+H C DP L P W C C
Sbjct: 1175 DEMCCMCDDDQELVYCSRCPAAYHRECHDPPLRNFPRGKWVCSAC 1219
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 54/136 (39%), Gaps = 25/136 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C K
Sbjct: 330 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCE-------------KE 376
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ E+ + CR D ++ CD C +H+ CL
Sbjct: 377 GIQWEAKGEEEEEEEAAGEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHIHCLNP 430
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 431 ----PLPEIPNGEWLC 442
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 402 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 445
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 56/138 (40%), Gaps = 32/138 (23%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H C DP L PE W CP+C +G + + +V K
Sbjct: 416 DYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEAPEGRWSCPHC-EGEGITAA-TVTEKA 473
Query: 567 GLEAPGAEVGG--CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
G A + C IC+ D ++ CD C +H CL
Sbjct: 474 GRNAADDDEHSEFCRICK----------------------DGGELLCCDSCTSAYHTFCL 511
Query: 625 RKNGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 512 NP----PLSEIPDGDWKC 525
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 485 EFCRICKDGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 47/109 (43%), Gaps = 36/109 (33%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
D +C VC DG LLLC+ CP AFH AC+ PE W CP CR
Sbjct: 465 DSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------------- 508
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
C +C S + D +A F D+T+IYC+QCE
Sbjct: 509 ----------CGVCGGSDLD----------DDTAEGFTDKTIIYCEQCE 537
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 57/137 (41%), Gaps = 28/137 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +G
Sbjct: 356 DYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHCEK------------EG 403
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
P + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 404 IQWEPKDDDEEDEDLCEEADDHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLN 454
Query: 626 KNGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 455 P----PLPEIPNGEWLC 467
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 427 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 470
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 66/146 (45%), Gaps = 29/146 (19%)
Query: 712 ATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFG 770
A I ECF II D+ +V+ R I GQ+F
Sbjct: 774 AVDILHECFVTIIEPRTQSDISEDIVFNREICGQKF------------------------ 809
Query: 771 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 830
AELPL+ T Y+ +G + L + +E+LL L VE L+LPA + WT FGF
Sbjct: 810 ---AELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGFTV 866
Query: 831 MSRERLLKYQRDFQLTIFKGTSMLEK 856
MS L+ + L+ F+GT+M +K
Sbjct: 867 MSNSERLELAGNSILS-FQGTTMCQK 891
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 56/139 (40%), Gaps = 22/139 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + + +
Sbjct: 386 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEK-------EGIQWEA 438
Query: 567 GLEAPGAEVGGCVICRLSPSENFD---IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC 623
E E R E D I CR D ++ CD C +H+ C
Sbjct: 439 KDELSEGEGEDEEDRRDEGVEEEDDHHIEFCR------VCKDGGELLCCDTCPSSYHIHC 492
Query: 624 LRKNGLCDLKEIPKDKWFC 642
L L EIP +W C
Sbjct: 493 LNP----PLPEIPNGEWIC 507
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+ C VC DG LL C+ CP ++H CL+P L IP W CP C+
Sbjct: 467 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 511
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 57/137 (41%), Gaps = 28/137 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 276 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 323
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
P E ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 324 IQWEPKEEDEEEEEGGEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLN 374
Query: 626 KNGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 375 P----PLPEIPNGEWLC 387
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 347 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 390
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 58/128 (45%), Gaps = 18/128 (14%)
Query: 517 GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAE 574
G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G E E
Sbjct: 290 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG--EEILEE 347
Query: 575 VGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 634
VGG P E D + +F D ++ CD C +H+ CL L E
Sbjct: 348 VGG------DPEEEDDHHM----EFCRVCKDGGELLCCDTCPSSYHIHCLNP----PLPE 393
Query: 635 IPKDKWFC 642
IP +W C
Sbjct: 394 IPNGEWLC 401
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 361 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 404
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 57/128 (44%), Gaps = 21/128 (16%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG----HSSSMSR 560
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG + S
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRIQQNLSQPEE 352
Query: 561 SVDLKGGLEAPGAEVGG-CVICRLSPSENFDIRLC---------RSHDFSAATFDDRTVI 610
S L+ E PG + C +C ++ D+ C R H F A T +
Sbjct: 353 SRPLEPSAETPGPTLSARCGVC----GDSTDVLRCAHCAAAFHWRCH-FPMAAVRPGTNL 407
Query: 611 YCDQCEKE 618
C C E
Sbjct: 408 RCKSCSAE 415
>gi|395536811|ref|XP_003770405.1| PREDICTED: autoimmune regulator-like [Sarcophilus harrisii]
Length = 529
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 56/138 (40%), Gaps = 12/138 (8%)
Query: 442 RFGCKQGNGIVCDCCNKEI-SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
RF G C E +P +FE +G R P T ++ +S G
Sbjct: 178 RFPLGNGGSKKCIQVGGEFYTPGKFEEPSGKNKTRSPKPPTRTKATQGRAELQLSQQSGG 237
Query: 501 RRTTG---------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
T +DD C VC DG L+ C+GCP AFH ACL+P L IP WRC
Sbjct: 238 PATPAQPPELHLHQKNDDECAVCRDGGELICCDGCPRAFHLACLEPPLTEIPSGMWRCGC 297
Query: 550 CRQGHSSSMSRSVDLKGG 567
C G R D + G
Sbjct: 298 CIVGKVHQDGRHGDQRDG 315
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 33/50 (66%), Gaps = 3/50 (6%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
+DD C VC DG L+ C+GCP AFH ACLDP L IP W+C CR GH
Sbjct: 259 NDDECTVCKDGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCR-GH 307
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTA 436
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTA 436
>gi|301776989|ref|XP_002923917.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Ailuropoda melanoleuca]
Length = 1948
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 78/218 (35%), Gaps = 33/218 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 392
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ C
Sbjct: 393 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCXTP 446
Query: 627 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 686
+L + C C + +Q + R PA + + +E G+
Sbjct: 447 TPTLNLGTL------LCPQCPPLKGKVQRILHWRWTEPPAPFMVGLPGSDVEPGVPPPKP 500
Query: 687 MND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 501 LEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 538
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 432 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCIQGTTA 483
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 303 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTA 354
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 57/137 (41%), Gaps = 20/137 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+GCP A+H CL+P L PE W CP C +K
Sbjct: 345 DYCEVCKQGGEIILCDGCPRAYHLVCLEPPLDQPPEGSWPCPTC-------------VKN 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSH-DFSAATFDDRTVIYCDQCEKEFHVGCLR 625
G++ E H +F + D ++ CD C +H+ CL
Sbjct: 392 GIKPKVRGAEKDEDYDDLEEEEEAEENMDEHMEFCSRCKDGGDLLICDTCPHSYHLNCLN 451
Query: 626 KNGLCDLKEIPKDKWFC 642
++++P+ +W C
Sbjct: 452 P----PVEKVPEGEWSC 464
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C DG +LL+C+ CP ++H CL+P + +PE W CP C
Sbjct: 424 EFCSRCKDGGDLLICDTCPHSYHLNCLNPPVEKVPEGEWSCPRC 467
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 45/99 (45%), Gaps = 11/99 (11%)
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA----ISLAMGQRRTTGGSDDMCHVCGD 516
+P +FE G + R P + + S GQ+ +DD C VC D
Sbjct: 236 TPGKFEDAGGKSKNRSPKPPVRPKASQVPQHVRSLSRFSSLWGQK-----NDDECAVCRD 290
Query: 517 GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
G L+ C+GCP AFH CL P L IP WRC C +G
Sbjct: 291 GGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRCVEG 329
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 53/143 (37%), Gaps = 53/143 (37%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMS 559
G D C VC GD +LL C GC L +HA CLD P +GW+CP C+
Sbjct: 211 AGEDSWCAVCDSAGDLTDLLFCTGCGLHYHATCLDTGATPILRAGWQCPECKV------- 263
Query: 560 RSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
C CR P E D ++ CD CEK
Sbjct: 264 ------------------CQTCR-QPGE------------------DSKMLVCDSCEKGC 286
Query: 620 HVGCLRKNGLCDLKEIPKDKWFC 642
H CL+ + +P D+W C
Sbjct: 287 HTFCLQPA----MDSVPSDRWKC 305
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 64/149 (42%), Gaps = 33/149 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q S+S
Sbjct: 878 VCEVCGEASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSPGFH 937
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C +CR ENF ++ ++ C C++
Sbjct: 938 CEWQNNYTHCGPCASLVTCPVCR----ENF--------------MEEELLLQCQYCDRWV 979
Query: 620 HVGCLRKNGLCDLKEIPK--DKWFCCDDC 646
H C L E+ + D+ F C C
Sbjct: 980 HAVC---ESLYTEDEVEQASDEGFACTSC 1005
>gi|255646689|gb|ACU23818.1| unknown [Glycine max]
Length = 296
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 776 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF---RKMS 832
+PL+ATC +Y+ +G + L S IE++L S VE LV+ A WTK FGF +
Sbjct: 1 MPLIATCSQYRRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDIE 60
Query: 833 RERLLKYQRDFQLTIFKGTSMLEKKV 858
R+RL K L +F GT +L K +
Sbjct: 61 RQRLNK----INLMVFPGTVLLVKSL 82
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 51/126 (40%), Gaps = 33/126 (26%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 352
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
SR +L E PG A C +C D V+ C C
Sbjct: 353 SRPPELPA--ETPGPAPSARCSVC----------------------GDGTEVLRCAHCAA 388
Query: 618 EFHVGC 623
FH C
Sbjct: 389 AFHWRC 394
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 51/126 (40%), Gaps = 33/126 (26%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 355
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
SR +L E PG A C +C D V+ C C
Sbjct: 356 SRPPELPA--ETPGPAPSARCSVC----------------------GDGTEVLRCAHCAA 391
Query: 618 EFHVGC 623
FH C
Sbjct: 392 AFHWRC 397
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 51/126 (40%), Gaps = 33/126 (26%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 356
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
SR +L E PG A C +C D V+ C C
Sbjct: 357 SRPPELPA--ETPGPAPSARCSVC----------------------GDGTEVLRCAHCAA 392
Query: 618 EFHVGC 623
FH C
Sbjct: 393 AFHWRC 398
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 51/126 (40%), Gaps = 33/126 (26%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 351
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEK 617
SR +L E PG A C +C D V+ C C
Sbjct: 352 SRPPELPA--ETPGPAPSARCSVC----------------------GDGTEVLRCAHCAA 387
Query: 618 EFHVGC 623
FH C
Sbjct: 388 AFHWRC 393
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/258 (22%), Positives = 91/258 (35%), Gaps = 69/258 (26%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H C DP L PE W CP+C +G D
Sbjct: 326 DYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWSCPHC-EGEGIKEQEEDD--- 381
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ C +C+ D ++ CD C +HV CL
Sbjct: 382 -------HMEFCRVCK----------------------DGGELLCCDTCPSAYHVHCLNP 412
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQT-----------------IPASS 668
+K IP +W C C + + ++ R +PAS+
Sbjct: 413 ----PMKMIPDGEWHCPRCSCEPLKGRVAKILTWRWTEPIQEEGKMEELDHTHPHLPAST 468
Query: 669 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECG 728
+NRK + + V+W L C E L A++R I +
Sbjct: 469 RKLMNRKPLREFF--------VKWHDLSYWHCSWISELQLDVYHPAMYRA----YIRKND 516
Query: 729 RDLIPVMVYGRNISGQEF 746
D P + G + G+++
Sbjct: 517 MDEPPPLEDGSSYGGKDY 534
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG S
Sbjct: 307 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQGRS 357
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 72/175 (41%), Gaps = 35/175 (20%)
Query: 501 RRTTGGSD--DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
RR G D ++C CG L+LC+ CP+ +H CLDP L + + W CP C ++
Sbjct: 1415 RRGLGALDHNELCQSCGHAGQLILCHDCPIVYHCECLDPPLSKLTQDHWFCPLCVMDRTT 1474
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
+ + S + G + C CR +I CD C
Sbjct: 1475 NGADSEEEMGSNDGEIEHEDVCSRCRHGGE----------------------LICCDTCP 1512
Query: 617 KEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLST 671
K FH+ C + L+++PK W C++C + + + R T PA S+
Sbjct: 1513 KAFHMECCKPV----LRKVPKGHWE-CENCKKGTKS----AAIRVPTGPAKKFSS 1558
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG +
Sbjct: 332 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQGRA 382
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG +
Sbjct: 293 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQGRA 343
>gi|196010575|ref|XP_002115152.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
gi|190582535|gb|EDV22608.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
Length = 709
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 76/180 (42%), Gaps = 32/180 (17%)
Query: 502 RTTGGS--DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHS 555
R +G S +D C C +G +LL C CP AFH C DP L IP W C C+ +
Sbjct: 36 RRSGRSFNNDTCDSCNEGGDLLCCERCPSAFHLHCCDPPLDEDGIPAGEWICKKCKAEYQ 95
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSE-------NFDIRLC------RSHDFSAA 602
+ +R + + L +P A + + + +PSE + I L +S+ +A
Sbjct: 96 DN-TRPIAKQSELRSPFAVLVKDNV-KQNPSEFRLPKEMQYHIPLPGSKKRKKSNGHGSA 153
Query: 603 TFDDR-------TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD 655
T R +I+CD C FH+ C+ L P W C + A+QD
Sbjct: 154 TKGGRRGNSFLKPLIHCDYCPLSFHLDCMDP----PLTTTPSGLWMCPNHAEHFLPAMQD 209
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 58/137 (42%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 439 DYCEVCQQGGEIILCDSCPRAYHLVCLDPELDKAPEGKWSCPHCEK-------EGVQWEP 491
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 492 KEEEDEYEGEMDDAEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 542
Query: 626 KNGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 543 P----PLPEIPNGEWLC 555
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 515 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPEIPNGEWLCPRC 558
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 55/137 (40%), Gaps = 14/137 (10%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C + + +
Sbjct: 388 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCEK-------EGIQWEA 440
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSH-DFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E C R S H +F D ++ CD C +H+ CL
Sbjct: 441 KEEDFEEFEEECDDVRDVESGLGGEEEEDDHMEFCRVCKDGGELLCCDSCPSSYHIHCLN 500
Query: 626 KNGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 501 P----PLPEIPNGEWLC 513
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 473 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWLCPRC 516
>gi|357116142|ref|XP_003559843.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
distachyon]
Length = 1679
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNCRQGHSS 556
G T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C ++
Sbjct: 413 GSDDTDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGEWYCPECLMQRNN 472
Query: 557 SMSRSVDLKGGLEAPGAEVGG 577
L G E G + G
Sbjct: 473 GSRNMAKLGRGAEVLGIDPHG 493
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 51/137 (37%), Gaps = 41/137 (29%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +G
Sbjct: 374 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN------------EG 421
Query: 567 GLEAPGAEVGG-CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E C IC+ D ++ CD C +H CL
Sbjct: 422 PAEQDDDEHQEFCRICK----------------------DGGELLCCDSCPSAYHTHCLN 459
Query: 626 KNGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 460 P----PLVEIPDGDWKC 472
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ CP A+H CL+P L IP+ W+CP C
Sbjct: 432 EFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRC 475
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 55/136 (40%), Gaps = 22/136 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C K
Sbjct: 340 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-------------KE 386
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
G++ E +E + +F D ++ CD C +H+ CL
Sbjct: 387 GIQ---WEPKEDDEDEEDGAEEEEEEEDDHMEFCRVCKDGGELLCCDTCPSSYHLHCLNP 443
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP +W C
Sbjct: 444 ----PLPEIPNGEWLC 455
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 415 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 51/137 (37%), Gaps = 41/137 (29%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +G
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN------------EG 424
Query: 567 GLEAPGAEVGG-CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E C IC+ D ++ CD C +H CL
Sbjct: 425 PAEQDDDEHQEFCRICK----------------------DGGELLCCDSCPSAYHTHCLN 462
Query: 626 KNGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 463 P----PLVEIPDGDWKC 475
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ CP A+H CL+P L IP+ W+CP C
Sbjct: 435 EFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRC 478
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSCLQG 347
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 59.7 bits (143), Expect = 7e-06, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 98 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 146
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG +
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSCLQGRA 349
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 67/163 (41%), Gaps = 24/163 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 483
Query: 626 KNGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P S
Sbjct: 484 P----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 522
>gi|170592228|ref|XP_001900871.1| CHD4 protein [Brugia malayi]
gi|158591738|gb|EDP30342.1| CHD4 protein, putative [Brugia malayi]
Length = 1846
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 52/138 (37%), Gaps = 38/138 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-GHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP C G + +
Sbjct: 257 DYCEVCQQGGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCESTGATKDDEEEKKIT 316
Query: 566 GGLEA-PGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
+E + GG ++C CD C +H CL
Sbjct: 317 TNMEYCRTCKEGGWLLC------------------------------CDTCPSSYHAYCL 346
Query: 625 RKNGLCDLKEIPKDKWFC 642
+ L EIP+ W C
Sbjct: 347 NPS----LTEIPEGDWSC 360
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 320 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 363
>gi|268557732|ref|XP_002636856.1| C. briggsae CBR-LET-418 protein [Caenorhabditis briggsae]
Length = 1849
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 32/50 (64%), Gaps = 3/50 (6%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IP-ESGWRCPNC 550
T +DD C +C + ENLLLC+ C AFHA C+DP L +P E W CP C
Sbjct: 309 TKQNDDFCKICKETENLLLCDSCVCAFHAYCIDPPLTQVPQEETWACPRC 358
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 57/141 (40%), Gaps = 37/141 (26%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
++D C C G L+LC+ CP A+H C+D PE W CP+C
Sbjct: 254 NNDFCETCKVGGELVLCDTCPRAYHTGCMDE-DPPEGDWSCPHC---------------- 296
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E G VI +N D ++C+ + ++ CD C FH C+
Sbjct: 297 ------IEHGPEVIKEEPTKQNDDFCKICKETE---------NLLLCDSCVCAFHAYCID 341
Query: 626 KNGLCDLKEIPKDKWFCCDDC 646
L ++P+++ + C C
Sbjct: 342 P----PLTQVPQEETWACPRC 358
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 49/141 (34%), Gaps = 49/141 (34%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C
Sbjct: 457 DYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCE--------------- 501
Query: 567 GLEAPGAEVGG-----CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
E P E C IC+ D ++ CD C +H
Sbjct: 502 -AEGPADEDDDEHQEFCRICK----------------------DGGELLCCDNCPSAYHT 538
Query: 622 GCLRKNGLCDLKEIPKDKWFC 642
CL L +IP W C
Sbjct: 539 FCLNP----PLDDIPDGDWRC 555
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 515 EFCRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPRC 558
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+DD C VC DG L+ C+GCP AFH CL+P L IP W+C CR
Sbjct: 156 NDDECAVCKDGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERCR 202
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 45/96 (46%), Gaps = 10/96 (10%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 356
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
SR +L E P ++CRL R C
Sbjct: 357 SRPPELPA--ETPDQSPLQILLCRLDSHARHTGRSC 390
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 45/96 (46%), Gaps = 10/96 (10%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 352
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
SR +L E P ++CRL R C
Sbjct: 353 SRPPELPA--ETPDQSPLQILLCRLDSHARHTGRSC 386
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 45/96 (46%), Gaps = 10/96 (10%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 351
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
SR +L E P ++CRL R C
Sbjct: 352 SRPPELPA--ETPDQSPLQILLCRLDSHARHTGRSC 385
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 52/136 (38%), Gaps = 39/136 (28%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C
Sbjct: 414 DYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC---------------- 457
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
EA G P++ D +F D ++ CD C +H CL
Sbjct: 458 --EAEG------------PADEDD---DEHQEFCRVCKDGGELLCCDNCPSAYHTFCLNP 500
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 501 ----PLDDIPDGEWRC 512
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 472 EFCRVCKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGEWRCPRC 515
>gi|17569817|ref|NP_510140.1| Protein CHD-3 [Caenorhabditis elegans]
gi|6165993|sp|Q22516.2|CHD3_CAEEL RecName: Full=Chromodomain-helicase-DNA-binding protein 3 homolog;
Short=CHD-3
gi|3879819|emb|CAA91810.1| Protein CHD-3 [Caenorhabditis elegans]
gi|11095331|gb|AAG29837.1| CHD-3 [Caenorhabditis elegans]
Length = 1787
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 29/44 (65%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +C + N+LLC+ CP ++HA C+DP L IPE W CP C
Sbjct: 329 DYCRICKETSNILLCDTCPSSYHAYCIDPPLTEIPEGEWSCPRC 372
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 57/136 (41%), Gaps = 38/136 (27%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C VC L+LC+ C A+H AC+D + PE W CP+C
Sbjct: 268 CEVCNQDGELMLCDTCTRAYHVACIDENMEQPPEGDWSCPHC------------------ 309
Query: 569 EAPGAEVGGCVICRLSPSE-NFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E G +V +I P++ N D R+C+ + ++ CD C +H C+
Sbjct: 310 EEHGPDV---LIVEEEPAKANMDYCRICK---------ETSNILLCDTCPSSYHAYCIDP 357
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP+ +W C
Sbjct: 358 ----PLTEIPEGEWSC 369
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 45/96 (46%), Gaps = 10/96 (10%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 355
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
SR +L E P ++CRL R C
Sbjct: 356 SRPPELPA--ETPDQSPLQILLCRLDSHARHTGRSC 389
>gi|146421865|ref|XP_001486876.1| hypothetical protein PGUG_00253 [Meyerozyma guilliermondii ATCC
6260]
Length = 709
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 60/144 (41%), Gaps = 27/144 (18%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
TKKR H L P G +T ++G+RL G K+ K ISP + A
Sbjct: 197 TKKRRKRTHNKLETPKQPTRGSSVTPALEGRRLIRGSKRI---------KVISPKKHAA- 246
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
+A +SNG I+ A + TG +DD C CG + C+ CP
Sbjct: 247 TNLAP---------SSNG------TITAAASESEQTGDNDDFCSTCGGSGVFICCDSCPK 291
Query: 529 AFHAACLDPLL--IPESGWRCPNC 550
+FH C DP + PE W C C
Sbjct: 292 SFHFLCCDPPIEECPEDNWNCREC 315
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 339 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 391
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 392 KEEEEDYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 442
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 443 P----PLPDIPNGEWLC 455
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 415 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 458
>gi|449683391|ref|XP_002153791.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Hydra
magnipapillata]
Length = 1699
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 58/140 (41%), Gaps = 40/140 (28%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
+ +C VCG+G ++LLC+ C +H CLDP L +PE W CP C S + D
Sbjct: 164 NSVCDVCGEGGDILLCDTCTCVWHLTCLDPPLDEVPEGDWSCPKCEDELSGPVEEDED-- 221
Query: 566 GGLEAPGAEVGGCVICRLSPSENFD---IRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
ENF ++CR D ++ CD C +H+
Sbjct: 222 --------------------DENFHGDYCKICR---------DGGELLCCDFCPGTYHMR 252
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
C++ L +P+ +W C
Sbjct: 253 CVKPQ----LITVPEGEWKC 268
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLI--PESGWRCPNCR 551
D C +C DG LL C+ CP +H C+ P LI PE W+CP C+
Sbjct: 228 DYCKICRDGGELLCCDFCPGTYHMRCVKPQLITVPEGEWKCPLCK 272
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 56/136 (41%), Gaps = 15/136 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R +G
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHCEKEGIQWEAREESSEG 435
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 436 EEENDDGRRDDGDVEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLNP 486
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 487 ----PLPDIPNGEWIC 498
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|255081825|ref|XP_002508131.1| SNF2 super family [Micromonas sp. RCC299]
gi|226523407|gb|ACO69389.1| SNF2 super family [Micromonas sp. RCC299]
Length = 2064
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/265 (24%), Positives = 94/265 (35%), Gaps = 104/265 (39%)
Query: 511 CHVCGDGE---NLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--------------- 550
C CG G+ NL+LC+ CP +H CL P L +P W CP C
Sbjct: 28 CERCGLGDDEPNLVLCDDCPRGWHVYCLRPKLPHVPRGSWSCPRCAPAKKGSGGGGGGDD 87
Query: 551 -----------------RQGHSSSMSRS-----VDLKGGLEAP---------GAE----- 574
R G +RS V G +AP GAE
Sbjct: 88 ARGSPAAPASGDRAKRRRTGTRDETTRSTRRSGVAAAGSTKAPDDDEMSDREGAEDDEMR 147
Query: 575 VGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 634
CV C L EN ++ CD C+ H+ CLR L +
Sbjct: 148 TMRCVTCDLGDDEN-------------------KMVLCDGCDAGHHLYCLRPK----LSQ 184
Query: 635 IPKDKWFCC-----DDCNRIHA-------ALQDFVSNR------------AQTIPASSLS 670
+P+ +WFC +D R A AL+ V+N ++ +P + L+
Sbjct: 185 VPRGRWFCPACEIREDARRRSAEATAATKALRVAVANEYADGRVVRAILGSRLVPETRLA 244
Query: 671 TINRKHIEKGILFDGTMNDVQWQML 695
T+ R +G F+G +V++ L
Sbjct: 245 TL-RADATRGPAFEGRSAEVEFLAL 268
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 203 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 255
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 256 KEEEEEYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 306
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 307 P----PLPDIPNGEWLC 319
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 279 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 52/136 (38%), Gaps = 33/136 (24%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + K
Sbjct: 257 DYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEAPEGHWSCPSCEAAGIPQKDEEEEKK- 315
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ C +C+ + LC CD C +H C+
Sbjct: 316 ----VATNMEYCRVCK-----DVGWLLC-----------------CDTCPSSYHAYCMNP 349
Query: 627 NGLCDLKEIPKDKWFC 642
L E+P+ +W C
Sbjct: 350 ----PLTEVPEGEWSC 361
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC D LL C+ CP ++HA C++P L +PE W CP C
Sbjct: 321 EYCRVCKDVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPRC 364
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 433 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 485
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 486 KEEEEEYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 536
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 537 P----PLPDIPNGEWLC 549
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 509 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 552
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 336 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 388
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 389 KEEEEDYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 439
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 440 P----PLPDIPNGEWLC 452
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 412 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 455
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 337 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 389
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 390 KEEEEDYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 440
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 441 P----PLPDIPNGEWLC 453
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 413 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 337 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 389
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 390 KEEEEDYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 440
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 441 P----PLPDIPNGEWLC 453
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 413 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 56/136 (41%), Gaps = 15/136 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R +G
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHCEKEGIQWEAREESSEG 435
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 436 EEENDDGRRDDGDVEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLNP 486
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 487 ----PLPDIPNGEWIC 498
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|312077956|ref|XP_003141528.1| CHromoDomain protein family member [Loa loa]
Length = 1696
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 57/137 (41%), Gaps = 36/137 (26%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP C
Sbjct: 112 DYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTC---------------- 155
Query: 567 GLEAPGAEVGGCVICRLSPSEN-FDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E+ GA P E+ + ++ + ++ + ++ CD C +H CL
Sbjct: 156 --ESTGA-----------PKEDEEEKKVTTNMEYCRTCKEGGWLLCCDTCPSSYHAYCLN 202
Query: 626 KNGLCDLKEIPKDKWFC 642
+ L EIP+ W C
Sbjct: 203 PS----LTEIPEGDWSC 215
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 175 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 218
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 68/172 (39%), Gaps = 41/172 (23%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVD 563
G D C C G L+LC+ CPL+FH C+DP L +P W C C
Sbjct: 999 GHSDRCARCRRGGELILCDSCPLSFHLDCVDPPLLGVPPDIWLCQLC------------- 1045
Query: 564 LKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSH-DFSAATFDDRTVIYCDQCEKEFHVG 622
LEA + + GC S+ D SH D A + +I CD C FH+
Sbjct: 1046 ---VLEAESSPLEGC-------SDGTD-----SHCDVCARCYKHGQLILCDVCPLAFHLR 1090
Query: 623 CLRKNGLCDLKEIPKDKW---FCCDDCNRIHAALQDFVSNRAQTIPASSLST 671
C L ++P KW C DC + + FV NR S ST
Sbjct: 1091 CTDP----PLLKVPSGKWTCQICVKDCQPVSSG--GFV-NRGPAKSKSGKST 1135
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 47/136 (34%), Gaps = 7/136 (5%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C CG L+LC CP A+H C +P L IP W C C S + KG
Sbjct: 912 DECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWICEVC-TDKSQKKPTGIKFKG 970
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
S E I D A +I CD C FH+ C+
Sbjct: 971 KHRKGLLPTSSSPSSLSSDLETLGIVADGHSDRCARCRRGGELILCDSCPLSFHLDCVDP 1030
Query: 627 NGLCDLKEIPKDKWFC 642
L +P D W C
Sbjct: 1031 ----PLLGVPPDIWLC 1042
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD E +LLC+GC H CL P + IP W C +CR
Sbjct: 778 CRICRRKGDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYCVDCR 823
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 435 EFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 478
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 434
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 435 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 463
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 464 A----LDTIPDGDWRC 475
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 447 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 499
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 500 KEEEEDYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 550
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 551 P----PLPDIPNGEWLC 563
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 523 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 566
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 397 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 449
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 450 KEEEEEYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 500
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 501 P----PLPDIPNGEWLC 513
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 473 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 516
>gi|402590896|gb|EJW84826.1| chromodomain-helicase-DNA-binding protein 4 [Wuchereria bancrofti]
Length = 1519
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 52/138 (37%), Gaps = 38/138 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-GHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP C G + +
Sbjct: 171 DYCEVCQQGGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCESTGAAKDDEEEKKIT 230
Query: 566 GGLEA-PGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
+E + GG ++C CD C +H CL
Sbjct: 231 TNMEYCRTCKEGGWLLC------------------------------CDTCPSSYHAYCL 260
Query: 625 RKNGLCDLKEIPKDKWFC 642
+ L EIP+ W C
Sbjct: 261 NPS----LTEIPEGDWSC 274
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 234 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 49/141 (34%), Gaps = 49/141 (34%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C
Sbjct: 379 DYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCEA-------------- 424
Query: 567 GLEAPGAEVGG-----CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
E P E C +C+ D ++ CD C +H
Sbjct: 425 --EGPADEDDDEHQEFCRVCK----------------------DGGEMLCCDSCPSAYHT 460
Query: 622 GCLRKNGLCDLKEIPKDKWFC 642
CL L +IP W C
Sbjct: 461 WCL----TPPLDDIPDGDWRC 477
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG +L C+ CP A+H CL P L IP+ WRCP C
Sbjct: 437 EFCRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPRC 480
>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4802
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 82/187 (43%), Gaps = 27/187 (14%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H + + +A+ R G C VC GD +L+C+ C +H CL
Sbjct: 360 TSCGLHYHGMCLDMAVTPLRRAGWQCPECKVCQTCKNHGDDTKMLVCDMCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
P + +P +GWRC NCR V ++ G G ++C +N D LC
Sbjct: 420 PAMESLPTNGWRCKNCR----------VCIQCGTRTSGHWHHNSLLCENC-FQNQDPALC 468
Query: 595 RSHDFSAATFDD---RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHA 651
S + D + +++C C++ H+ C R+N EI + + C +C R A
Sbjct: 469 CS--MCSCILDPEHHKDLLFCQTCKRWLHLECERQN--SGQTEINPREDYVCFNC-RSPA 523
Query: 652 ALQDFVS 658
A Q ++
Sbjct: 524 AEQALLT 530
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 28/143 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ W+C +C Q +++
Sbjct: 1015 VCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTPGLR 1074
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C IC + ++S T ++ C QC++ F
Sbjct: 1075 CEWQSNYTQCAPCASLSTCPICLV--------------NYSEGT----VIVQCRQCDRWF 1116
Query: 620 HVGCLRKNGLCDLKEIPKDKWFC 642
H C + DL++ ++ + C
Sbjct: 1117 HASCQGLHSEDDLEKAAENSFDC 1139
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 52/135 (38%), Gaps = 50/135 (37%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD + L C C L +H CLD + P +GW+CP C+
Sbjct: 346 LCDSPGDLLDQLFCTSCGLHYHGMCLDMAVTPLRRAGWQCPECK---------------- 389
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
+C+ C++H DD ++ CD C+K +H CL+
Sbjct: 390 ------------VCQT----------CKNHG------DDTKMLVCDMCDKGYHTFCLQPA 421
Query: 628 GLCDLKEIPKDKWFC 642
++ +P + W C
Sbjct: 422 ----MESLPTNGWRC 432
>gi|393911013|gb|EJD76123.1| LET-418 protein [Loa loa]
Length = 1755
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 57/137 (41%), Gaps = 36/137 (26%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP C
Sbjct: 171 DYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTC---------------- 214
Query: 567 GLEAPGAEVGGCVICRLSPSEN-FDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E+ GA P E+ + ++ + ++ + ++ CD C +H CL
Sbjct: 215 --ESTGA-----------PKEDEEEKKVTTNMEYCRTCKEGGWLLCCDTCPSSYHAYCLN 261
Query: 626 KNGLCDLKEIPKDKWFC 642
+ L EIP+ W C
Sbjct: 262 PS----LTEIPEGDWSC 274
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 234 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|328717947|ref|XP_001943997.2| PREDICTED: hypothetical protein LOC100159693, partial
[Acyrthosiphon pisum]
Length = 2904
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 53/144 (36%), Gaps = 55/144 (38%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD NL++C C +H CL L+P +GW+C NCR
Sbjct: 388 ICSTLGDVSNLMMCTACGSHYHGVCLGLALLPGVRAGWQCGNCR---------------- 431
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
C +CR P+E + LC C+K +H GCLR
Sbjct: 432 ---------ICQVCR-QPAEQTKVMLCEG------------------CDKAYHPGCLRPQ 463
Query: 628 GLCDLKEIPKDKWFC-----CDDC 646
+ IPK W C C DC
Sbjct: 464 ----VTTIPKIGWKCKCCRVCTDC 483
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 58/150 (38%), Gaps = 43/150 (28%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D L+LC+ C +++H C DP L +P W+C C Q L
Sbjct: 773 VCEGCGQRNDESRLILCDECDISYHIYCTDPKLDYVPRGTWKCKWCAQC----------L 822
Query: 565 KGGLEAPG---------AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
G PG E G C + PS C+ + D++ +I C QC
Sbjct: 823 TCGSNDPGFNCSWLNNYTECGPCASRSICPS-------CQE-----SYTDNQLIIKCSQC 870
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 645
++ H G CD E D C +D
Sbjct: 871 DRWLH-------GKCDKIENEDDAEKCAED 893
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 62/157 (39%), Gaps = 29/157 (18%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTG---GSDDMCHVC---GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + LA+ G G+ +C VC + ++LC GC A+H CL
Sbjct: 402 TACGSHYHGVCLGLALLPGVRAGWQCGNCRICQVCRQPAEQTKVMLCEGCDKAYHPGCLR 461
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGG--------CVICRLSPS 586
P + IP+ GW+C CR V G PGA + C C +
Sbjct: 462 PQVTTIPKIGWKCKCCR----------VCTDCGSRTPGAGLSSRWHAHYTVCDSCYQQRN 511
Query: 587 ENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC 623
+ LC +AA R ++ C C K H C
Sbjct: 512 KGSSCPLCHRAYRAAA---HREMVQCISCRKYIHGAC 545
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IP-ESGWRCPNC 550
+DD C +C + ENLLLC+ C AFHA C+DP L +P E W CP C
Sbjct: 311 NDDFCKICKETENLLLCDTCVCAFHAYCMDPPLTQVPQEETWNCPRC 357
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/145 (24%), Positives = 57/145 (39%), Gaps = 36/145 (24%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C G LLLC+ CP A+H C+D + PE W CP+C +
Sbjct: 250 NNDFCEECKQGGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHCIE------------ 297
Query: 565 KGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
G I + P + D DF + ++ CD C FH C+
Sbjct: 298 -----------HGPEIVKEEPQKVND-------DFCKICKETENLLLCDTCVCAFHAYCM 339
Query: 625 RKNGLCDLKEIPKDKWFCCDDCNRI 649
L ++P+++ + C C +
Sbjct: 340 DP----PLTQVPQEETWNCPRCELV 360
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG
Sbjct: 293 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSCLQG 341
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 293 EFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 336
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 233 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 292
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 293 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 321
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 322 A----LDTIPDGDWRC 333
>gi|297805014|ref|XP_002870391.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316227|gb|EFH46650.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 351
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 766 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 825
L+ G +AE+ + T Y+ +G + LFS +E L +L VE LV+PA +W K
Sbjct: 248 LQFHGNRLAEMQFIGTRHVYRHQGMCRRLFSVVESTLQNLKVELLVIPATADLSHVWISK 307
Query: 826 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
FGF K + L K R L F G +L+K++
Sbjct: 308 FGF-KYVEDSLKKELRSMNLLAFPGIDVLQKEL 339
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 423 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 466
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 363 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 422
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 423 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 451
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 452 ----PLDTIPDGDWRC 463
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 63/152 (41%), Gaps = 35/152 (23%)
Query: 500 QRRTTGGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCR-Q 552
+R+ GGS + C VC DG +L+LC+ CP +FH +C++ IP+ W CP C +
Sbjct: 443 ERKEGGGSGNESDHFWYCEVCKDGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTGE 502
Query: 553 G--HSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVI 610
G S S + E C +C+ R D VI
Sbjct: 503 GLPEDGDSSNSAQEEEEGEEETEHDQFCKVCK------------RGGD----------VI 540
Query: 611 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
CD C FH+ CL L E+P+ W C
Sbjct: 541 LCDFCSCVFHLRCLNP----PLGEVPEGDWKC 568
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSV 562
D C VC G +++LC+ C FH CL+P L +PE W+CP C+ S + V
Sbjct: 527 DQFCKVCKRGGDVILCDFCSCVFHLRCLNPPLGEVPEGDWKCPRCKISEKSPKGKVV 583
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 425 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 468
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 365 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 424
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 425 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 453
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 454 ----PLDTIPDGDWRC 465
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 429 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 457
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 458 ----PLDTIPDGDWRC 469
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 438 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 466
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 467 ----PLDTIPDGDWRC 478
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 439 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 467
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 468 ----PLDTIPDGDWRC 479
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 438 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 466
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 467 ----PLDTIPDGDWRC 478
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 65/174 (37%), Gaps = 39/174 (22%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W C C Q + D
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYC-QAEGNQEQEDDDEHQ 434
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C IC+ D ++ CD C +H CL
Sbjct: 435 EF---------CRICK----------------------DGGELLCCDSCPSAYHRFCLNP 463
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
L+E+P +W C C + + ++ R + PA S + +R+ K
Sbjct: 464 ----PLEEVPDGEWKCPRCSCPPLDGKVAKILTWRWKEQPAKSKAPRSREFFVK 513
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 444 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 487
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 384 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 443
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 444 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 472
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 473 ----PLDTIPDGDWRC 484
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 438 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 466
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 467 ----PLDTIPDGDWRC 478
>gi|194874449|ref|XP_001973400.1| GG13363 [Drosophila erecta]
gi|190655183|gb|EDV52426.1| GG13363 [Drosophila erecta]
Length = 711
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CL+P L PE W CP+C
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|351713762|gb|EHB16681.1| Chromodomain-helicase-DNA-binding protein 5 [Heterocephalus glaber]
Length = 2263
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 84/231 (36%), Gaps = 43/231 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 412 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC---------------- 455
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P E GCV+ L + + L RS R ++ QC + G L +
Sbjct: 456 ---VPSFEAQGCVLSVLGGALCYPATL-RSPSILGPQCVSR-LMETGQCTRVGPGGSLAR 510
Query: 627 NGL---CDLK-----EIPKDKWFCC-----DDCNRIHAALQDFVSNRAQTIPASSLSTIN 673
G CD K E P+ C + C + +Q + R PA + +
Sbjct: 511 GGRGWPCDNKLTAKCEGPRTAMSCSGPEDGETCPPLKGKVQRILHWRWTEPPAPFMVGLP 570
Query: 674 RKHIEKGILFDGTMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+E G+ + V+W L C KE L T ++R
Sbjct: 571 GPDVEPGVPPPKPLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 621
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 429 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 457
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 458 ----PLDTIPDGDWRC 469
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 373 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 433 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 461
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 462 ----PLDTIPDGDWRC 473
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 302 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCCLQG 350
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 373 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 433 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 461
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 462 ----PLDTIPDGDWRC 473
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C +H CL
Sbjct: 439 EF---------CRVCK----------------------DGGELLCCDSCPSAYHTFCLNP 467
Query: 627 NGLCDLKEIPKDKWFC 642
L IP W C
Sbjct: 468 ----PLDTIPDGDWRC 479
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 80/183 (43%), Gaps = 27/183 (14%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H I + +A+ R G C VC G+ +L+C+ C +H CL
Sbjct: 360 TSCGLHYHGICLDMAVTPLRRAGWQCPECKVCQTCKNPGEDTKMLVCDMCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
P++ +P +GWRC NCR V L+ G G ++C +N D LC
Sbjct: 420 PVIDTLPTNGWRCQNCR----------VCLQCGTRTGGQWHHTSLLCENC-VQNQDPALC 468
Query: 595 RSHDFSAATFDD---RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHA 651
+ D + +++C C++ H+ C R+N EI + + C +C R A
Sbjct: 469 CP--MCSCILDPEHHKDLVFCHTCKRWLHLECERQN--SGQAEIHPREDYVCSNC-RSPA 523
Query: 652 ALQ 654
A Q
Sbjct: 524 AEQ 526
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ W+C +C Q +++
Sbjct: 1041 VCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTPGLR 1100
Query: 562 VDLKGG--LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + L AP A + C IC + D+S T ++ C QC++ F
Sbjct: 1101 CEWQNNYTLCAPCASLSTCPICLV--------------DYSEGTI----IVQCRQCDRWF 1142
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C + D+++ D F C C
Sbjct: 1143 HASCQSLHSEEDIEKA-ADSSFDCTMC 1168
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 49/135 (36%), Gaps = 50/135 (37%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD + L C C L +H CLD + P +GW+CP C+
Sbjct: 346 LCDSPGDLLDQLFCTSCGLHYHGICLDMAVTPLRRAGWQCPECKV--------------- 390
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
C C+ +P E D ++ CD C+K +H CL+
Sbjct: 391 ----------CQTCK-NPGE------------------DTKMLVCDMCDKGYHTFCLQPV 421
Query: 628 GLCDLKEIPKDKWFC 642
+ +P + W C
Sbjct: 422 ----IDTLPTNGWRC 432
>gi|222637620|gb|EEE67752.1| hypothetical protein OsJ_25457 [Oryza sativa Japonica Group]
Length = 1646
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 423 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 472
>gi|115473879|ref|NP_001060538.1| Os07g0661500 [Oryza sativa Japonica Group]
gi|113612074|dbj|BAF22452.1| Os07g0661500 [Oryza sativa Japonica Group]
Length = 1752
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 438 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q M
Sbjct: 426 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQATVQDM 479
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 64/161 (39%), Gaps = 24/161 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L P+ W CP+C + V +
Sbjct: 402 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELERAPQGKWSCPHCEK-------EGVQWEA 454
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
R ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 455 KELEEEEMEEPKRERREEEDDHMEFCRVCK---------DGGELLCCDACVSSYHIHCLN 505
Query: 626 KNGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIP 665
L +IP +W C C ++ +Q + R P
Sbjct: 506 P----PLPDIPHGEWLCPRCTCPQLKGKVQKILHWRWGVPP 542
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|50510226|dbj|BAD31424.1| PHD finger transcription factor-like protein [Oryza sativa Japonica
Group]
Length = 1696
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 438 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C Q
Sbjct: 252 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQ 299
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSM 558
++D C VC DG LL C+GCP AFH ACL P L IP WRC +C Q M
Sbjct: 293 NEDECAVCRDGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQATVPVM 346
>gi|302855516|ref|XP_002959250.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
gi|300255380|gb|EFJ39692.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
Length = 358
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 114/254 (44%), Gaps = 20/254 (7%)
Query: 610 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 669
+ CD C H+GC+ G+ +++P WF C C + L+ P+S+
Sbjct: 123 LRCDTCGCWVHLGCV---GVEVAEQVPPRPWFHCRACRSTYLRLEAAAERNPH--PSSAS 177
Query: 670 STINRKHIEKGILFDGTMNDVQWQM-LKKAQCFEEKEKSLLSSATAIFRECFDPIIAECG 728
T L+ T +D Q M + + + + P++A+ G
Sbjct: 178 PT--------HALYVLTPSDHQAAMAVALNRGLTPAVAAGGGAGGRGPVGAVLPLLAQ-G 228
Query: 729 RDLIPVMVYGRNISGQEF-GGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQG 787
+ + +G+ +E+ GG YS +L + V+A +FG + A+L L+AT +++
Sbjct: 229 FNEDAIRGFGQ--PAREYDGGKYSAVLLNRGQPVAAATFNVFGAD-AQLCLLATAVQHRL 285
Query: 788 KGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTI 847
KG AL + +E LL + V L++ + A +W + G+R + + L+ R +
Sbjct: 286 KGNGSALVADLEALLADVGVSRLLVQSRGVALPLWLGRLGYRLVPPQEALQLHRTLPIAY 345
Query: 848 FKGTSMLEKKVQCL 861
+ ++++K+++ L
Sbjct: 346 YD-CALMQKQLRPL 358
>gi|125559491|gb|EAZ05027.1| hypothetical protein OsI_27209 [Oryza sativa Indica Group]
Length = 1696
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 438 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 343
>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
Length = 1048
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 65/163 (39%), Gaps = 17/163 (10%)
Query: 501 RRTTGGSDDMCHVCGDGENLLLCNG-CPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
R T +DD C +CG+ L+ C+G C AFH CL L P +RC C G+ + +
Sbjct: 432 RTTMTANDDTCRICGELGQLICCDGGCRGAFHLECLSILQPPTGEFRCDECSTGNHTCYT 491
Query: 560 RSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATF-----DDRTVIYCDQ 614
+ GA++ + C+ R C F A F ++ CD
Sbjct: 492 --------CDKVGADL---IKCQFPHCNKLYHRGCAEKQFKADNFCLVCGTGGDLVVCDG 540
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 657
C +H C++ K + +WFC D + + D V
Sbjct: 541 CPGAYHAACIKSTFAFTGKPDEQGQWFCHDCLTGTKSMIHDVV 583
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 6/67 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLI------PESGWRCPNCRQGHSSSMSR 560
+D+ C VCG G +L++C+GCP A+HAAC+ + W C +C G S +
Sbjct: 522 ADNFCLVCGTGGDLVVCDGCPGAYHAACIKSTFAFTGKPDEQGQWFCHDCLTGTKSMIHD 581
Query: 561 SVDLKGG 567
V K G
Sbjct: 582 VVWAKYG 588
>gi|351706869|gb|EHB09788.1| PHD finger protein 21B [Heterocephalus glaber]
Length = 482
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NLL C+ CP A+H CL P L P+ GW CP C+Q
Sbjct: 303 DELCAACRRGSNLLCCSACPGAYHLGCLHPPLKTAPKGGWLCPKCQQ 349
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 343
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQ 342
>gi|397629214|gb|EJK69257.1| hypothetical protein THAOC_09499 [Thalassiosira oceanica]
Length = 734
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D+C VCGDG +LL C+GC ++H CL+P + +PE W CP+C
Sbjct: 672 DVCEVCGDGGDLLCCDGCINSYHQRCLNPPMEQVPEGQWFCPSC 715
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 242 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCCLQG 290
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 345
>gi|358334996|dbj|GAA53428.1| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
sinensis]
Length = 3518
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 53/146 (36%), Gaps = 57/146 (39%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES----GWRCPNCRQGHSSSMSRSVDLK 565
+C GD LL C GC +H +CL+P L P GW+C C+
Sbjct: 43 LCECLGDISELLFCTGCGSHYHGSCLEPSLQPNPTIRIGWQCAECK-------------- 88
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C+IC S EN ++ CD C+K FH CLR
Sbjct: 89 -----------ACLICNESKDEN-------------------KMLVCDVCDKGFHTYCLR 118
Query: 626 KNGLCDLKEIPKDKWFC-----CDDC 646
C IP++ + C C DC
Sbjct: 119 PPVSC----IPRNGFKCERCRVCSDC 140
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 62/163 (38%), Gaps = 47/163 (28%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCP------NCRQGHSSSM 558
+C CG N LLLC+ C +++H CLDP L +P+ GW+C NC Q
Sbjct: 559 VCEGCGGTTNESLLLLCDDCDISYHTYCLDPPLQEVPKGGWKCSECVVCTNCGQ------ 612
Query: 559 SRSVDLKGGLE------APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYC 612
R L G AP A + C +C L A + +I C
Sbjct: 613 -RDPGLNGKWHANYSMCAPCASLATCPVCTL------------------AYREGELLIRC 653
Query: 613 DQCEKEFHVGC--LRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 653
C + H GC LR D E+ D + C C A +
Sbjct: 654 ALCSRWSHAGCDQLRTE---DELELATDMGYNCLLCREAGAEM 693
>gi|242041381|ref|XP_002468085.1| hypothetical protein SORBIDRAFT_01g039297 [Sorghum bicolor]
gi|241921939|gb|EER95083.1| hypothetical protein SORBIDRAFT_01g039297 [Sorghum bicolor]
Length = 134
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 39/59 (66%), Gaps = 4/59 (6%)
Query: 247 GGYLCGCPL---CNF-SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP 301
GGY C CP C + KV+SA +FE+HAG+++++ N HI L NGK +Y + EL+ P
Sbjct: 3 GGYACACPASARCGYRDKVLSALQFEKHAGSESKNQNGHILLLNGKSLYDLFHELRDVP 61
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 340
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 380 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 427
>gi|414887990|tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1679
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
G G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 412 GSAEVDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 465
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 344
>gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1712
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
G G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 412 GSAEVDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 465
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 58/160 (36%), Gaps = 39/160 (24%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP C
Sbjct: 400 DFCEVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWSCPTCEAD------------- 446
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
GG + F R+C+ D ++ CD C +H CL
Sbjct: 447 ---------GGVAEDDDDEHQEF-CRICK---------DGGELLCCDMCPSAYHTFCL-- 485
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIP 665
L +IP W C C + +Q ++ R P
Sbjct: 486 --TPPLDDIPDGDWRCPRCSCPPLQYKVQKILTWRWTDKP 523
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 342
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 342
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 342
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 306 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCCLQG 354
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 74/205 (36%), Gaps = 43/205 (20%)
Query: 477 PYRHIYTSNGMTLHDIAISLAMGQRRTTG---GSDDMCHVCGDGENLLLCNGCPLAFHAA 533
P R T G L A++ G+ G D C VC G ++LC+ CP A+H
Sbjct: 254 PIRRAKTKEGKVLGCPAVA---GEEEVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLV 310
Query: 534 CLDPLL--IPESGWRCPNCRQ--------GHSSSMSRSVDLKGGLEAPGAEVGGCVICRL 583
CLDP L PE W CP+C + + +G E + C +C+
Sbjct: 311 CLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCK- 369
Query: 584 SPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCC 643
D ++ CD C +H+ CL L +IP +W C
Sbjct: 370 ---------------------DGGELLCCDACISSYHIHCLNP----PLPDIPNGEWLCP 404
Query: 644 D-DCNRIHAALQDFVSNRAQTIPAS 667
C + +Q + R P S
Sbjct: 405 RCTCPVLKGRVQKILHWRWGEPPVS 429
>gi|242051184|ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
gi|241926713|gb|EER99857.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
Length = 1688
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
G G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 417 GSAEMDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 470
>gi|390478260|ref|XP_003735458.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator, partial
[Callithrix jacchus]
Length = 330
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q S M
Sbjct: 77 NEDECAVCRDGGELICCDGCPRAFHLACLTPPLREIPSGTWRCSSCLQATVSEM 130
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG--GSDDMCHVCGDGENLLLCNG 525
H +R+P + G+ + +A+ Q+ T +DD C VC DG L+ C+G
Sbjct: 251 HTSSHQQREPSSRGLATPGLQ-YVVAVLTCYPQKMTFAVEHNDDECAVCKDGGELICCDG 309
Query: 526 CPLAFHAACLDPLL--IPESGWRCPNCR 551
CP AFH +CL P L IP WRC C+
Sbjct: 310 CPRAFHLSCLVPPLTSIPRGTWRCQLCQ 337
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C Q
Sbjct: 274 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQ 321
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
+DD C VC DG L+ C+GCP AFH ACL+P L IP WRC C G
Sbjct: 286 NDDECAVCRDGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCCIVG 334
>gi|281346162|gb|EFB21746.1| hypothetical protein PANDA_013140 [Ailuropoda melanoleuca]
Length = 1814
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 76/218 (34%), Gaps = 31/218 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +G
Sbjct: 215 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK------------EG 262
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
P + E+ + CR D ++ CD C +H+ C
Sbjct: 263 IQWEPKDDDDEEEEGGCEEEEDDHMEFCR------VCKDGGELLCCDACPSSYHLHCXTP 316
Query: 627 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 686
L C C + +Q + R PA + + +E G+
Sbjct: 317 TPSPALNL----GTLLCPQCPPLKGKVQRILHWRWTEPPAPFMVGLPGSDVEPGVPPPKP 372
Query: 687 MND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 717
+ V+W L C KE L T ++R
Sbjct: 373 LEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 410
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ +
Sbjct: 398 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 457
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ + R+C+ D ++ CD C +H+ CL
Sbjct: 458 EEDGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 503
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 504 ----PLPDIPNGEWLC 515
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 475 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 518
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 255 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 307
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 308 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 360
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P S
Sbjct: 361 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 398
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 474 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 526
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 527 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 579
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P S
Sbjct: 580 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 617
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 429
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 430 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDTCISSYHIHCLNP 482
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P S
Sbjct: 483 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 520
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ D
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEDDDY 439
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 429
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 430 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDTCISSYHIHCLNP 482
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P S
Sbjct: 483 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 520
>gi|340371689|ref|XP_003384377.1| PREDICTED: hypothetical protein LOC100637285 [Amphimedon
queenslandica]
Length = 400
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D CH+C DG LL C+ CP A+H CL P L IP+ W CP C
Sbjct: 318 DYCHICQDGGELLCCDRCPKAYHLQCLYPPLKKIPDGTWHCPRC 361
>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Acyrthosiphon pisum]
Length = 2002
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHC 418
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C VC DG LL C+ CP A+H CL P + +P+ W+CP C
Sbjct: 434 FCRVCKDGGELLCCDSCPAAYHTFCLSPPITDVPDGDWKCPRC 476
>gi|449486846|ref|XP_004174324.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Taeniopygia guttata]
Length = 2088
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 340 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCPQ 385
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP----ESGWRCPNC 550
+DD C +C + ENLLLC+ C +FHA C+DP L+ + W CP C
Sbjct: 315 NDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRC 362
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 57/142 (40%), Gaps = 35/142 (24%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C G L+LC+ CP A+H C+D + PE W CP+C +
Sbjct: 254 NNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCME------------ 301
Query: 565 KGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
G I + P++ D DF + ++ CD C FH C+
Sbjct: 302 -----------HGPEIVKEEPAKQND-------DFCKICKETENLLLCDNCTCSFHAYCM 343
Query: 625 RKNGLCDLKEIPKDKWFCCDDC 646
L +L P+D+ + C C
Sbjct: 344 -DPPLLELP--PQDESWACPRC 362
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ D
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEDDDY 439
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 531 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 583
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 584 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 636
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 637 ----PLPDIPNGEWLC 648
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 608 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 651
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 52/137 (37%), Gaps = 45/137 (32%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGG 567
CH + LLLC+GC +H C P + IP+ W C CR + +
Sbjct: 1613 FCHSGDKEDQLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRN-------- 1664
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV-IYCDQCEKEFHVGCLRK 626
C++C ++T+ + CDQC K +H+ CL+
Sbjct: 1665 ----------CIVC--------------------GKPGNKTISVLCDQCPKAYHIECLQP 1694
Query: 627 NGLCDLKEIPKDKWFCC 643
L ++P+ KW C
Sbjct: 1695 ----PLAKVPRGKWLCV 1707
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP----ESGWRCPNC 550
+DD C +C + ENLLLC+ C +FHA C+DP L+ + W CP C
Sbjct: 315 NDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRC 362
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 57/142 (40%), Gaps = 35/142 (24%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C G L+LC+ CP A+H C+D + PE W CP+C +
Sbjct: 254 NNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCME------------ 301
Query: 565 KGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
G I + P++ D DF + ++ CD C FH C+
Sbjct: 302 -----------HGPEIVKEEPAKQND-------DFCKICKETENLLLCDNCTCSFHAYCM 343
Query: 625 RKNGLCDLKEIPKDKWFCCDDC 646
L +L P+D+ + C C
Sbjct: 344 -DPPLLELP--PQDESWACPRC 362
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 69/159 (43%), Gaps = 35/159 (22%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
++ C+VC +G ++ C+GC FH +CL+ +P C +C +G D KG
Sbjct: 1077 EEECYVCREGGQVVSCDGCQRVFHLSCLNIRRMPRGKLYCKHCSEG---------DTKGA 1127
Query: 568 LEAPGAEVGGCVICRLSP-------SENFDIRLCRSH----DFSAATFDDRTVIY----- 611
E G RLS EN +IR+ S+ + +D I
Sbjct: 1128 EEKSVGGDGRRQSLRLSADGRHDDVEENDEIRMKSSNRELESGAVGPWDVECFICKLYGE 1187
Query: 612 ---CDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
CD C K FH+ C+ +K P+++WF CD+C+
Sbjct: 1188 LLGCDGCPKAFHLACI------GIKSWPQEEWF-CDECD 1219
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 75/201 (37%), Gaps = 54/201 (26%)
Query: 451 IVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA-----ISLAMGQRRTTG 505
+ C C++ + E G RRQ R +++G HD I + R
Sbjct: 1114 LYCKHCSEGDTKGAEEKSVGGDGRRQSLR--LSADGR--HDDVEENDEIRMKSSNRELES 1169
Query: 506 GS----DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS 561
G+ D C +C LL C+GCP AFH AC+ P+ W C C
Sbjct: 1170 GAVGPWDVECFICKLYGELLGCDGCPKAFHLACIGIKSWPQEEWFCDEC----------- 1218
Query: 562 VDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
++ C +C + + +C S D S C+K FH+
Sbjct: 1219 ------------DMQTCGVCGRNKIKLNSHVICGSEDGSKG------------CDKVFHL 1254
Query: 622 GCLRKNGLCDLKEIPKDKWFC 642
C++ L+++P+ WFC
Sbjct: 1255 KCVK------LEKVPESDWFC 1269
>gi|196003152|ref|XP_002111443.1| hypothetical protein TRIADDRAFT_23466 [Trichoplax adhaerens]
gi|190585342|gb|EDV25410.1| hypothetical protein TRIADDRAFT_23466 [Trichoplax adhaerens]
Length = 1296
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 5/75 (6%)
Query: 481 IYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDP 537
+Y + ++ A + + R +D C +CG G+N LLLC+GC ++H CL P
Sbjct: 256 LYFIVIIIMYQNAFTFYLANYRKENYADTACMLCGLGDNEEFLLLCDGCDDSYHTYCLIP 315
Query: 538 LL--IPESGWRCPNC 550
L IP WRCP C
Sbjct: 316 PLQSIPPGDWRCPKC 330
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C +G
Sbjct: 409 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSCLRG 457
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 431 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 483
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 484 ----PLPDIPNGEWLC 495
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 455 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 67/166 (40%), Gaps = 21/166 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 395 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 447
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 448 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 500
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLST 671
L +IP +W C C + +Q + R P ++ ++
Sbjct: 501 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVAAPAS 542
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ +
Sbjct: 391 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 450
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 451 EEEGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 496
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 497 ----PLPDIPNGEWLC 508
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 468 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 511
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ +
Sbjct: 347 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 406
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 407 EEEGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 452
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 453 ----PLPDIPNGEWLC 464
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 424 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 467
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 355 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 407
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 408 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 460
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 461 ----PLPDIPNGEWLC 472
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 432 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 475
>gi|291231854|ref|XP_002735870.1| PREDICTED: PHD finger protein 21B-like [Saccoglossus kowalevskii]
Length = 143
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
G DD+C VC LL+C+ C L +H CLDP L IP W CP C+ G+
Sbjct: 4 GHDDICAVCKQSGELLMCDTCSLVYHLGCLDPPLSFIPAGIWICPQCQAGN 54
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 55/144 (38%), Gaps = 36/144 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
+ +G E + C +C+ D ++ CD C
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK----------------------DGGELLCCDACISS 477
Query: 619 FHVGCLRKNGLCDLKEIPKDKWFC 642
+H+ CL L +IP +W C
Sbjct: 478 YHIHCLNP----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ +
Sbjct: 212 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 271
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 272 EEEGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 317
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 318 ----PLPDIPNGEWLC 329
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 289 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 332
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 55/144 (38%), Gaps = 36/144 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
+ +G E + C +C+ D ++ CD C
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK----------------------DGGELLCCDACISS 477
Query: 619 FHVGCLRKNGLCDLKEIPKDKWFC 642
+H+ CL L +IP +W C
Sbjct: 478 YHIHCLNP----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like
[Oreochromis niloticus]
Length = 4907
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 54/143 (37%), Gaps = 53/143 (37%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMS 559
G + C VC G+ +LL C GC L +HAACL+ P +GW+CP C+
Sbjct: 212 AGEESWCAVCDSAGELTDLLFCTGCGLHYHAACLEIGATPIQRAGWQCPECKV------- 264
Query: 560 RSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
C CR P E D ++ CD C+K +
Sbjct: 265 ------------------CQTCR-QPGE------------------DSKMLVCDACDKGY 287
Query: 620 HVGCLRKNGLCDLKEIPKDKWFC 642
H CL+ + +P D W C
Sbjct: 288 HTFCLQPA----MDSLPSDPWKC 306
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 64/149 (42%), Gaps = 33/149 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q S+S
Sbjct: 897 VCEVCGKASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSPGFH 956
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C +CR ENF ++ ++ C C++
Sbjct: 957 CEWQNNYTHCGPCASLVTCPVCR----ENF--------------MEEELLLQCQYCDRWV 998
Query: 620 HVGCLRKNGLCDLKEIPK--DKWFCCDDC 646
H C L E+ + D+ F C C
Sbjct: 999 HAVC---ESLYTEDEVEQASDEGFACTSC 1024
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 386 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 438
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 439 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 491
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 492 ----PLPDIPNGEWLC 503
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 463 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 506
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ D +
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEDEEY 434
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 435 EEEGEEEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 480
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 481 ----PLPDIPNGEWLC 492
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 452 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 495
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 430
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 431 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 483
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 484 ----PLPDIPNGEWLC 495
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 455 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 23/137 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 392 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 444
Query: 567 GLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
E E G + ++ + R+C+ D ++ CD C +H+ CL
Sbjct: 445 KEEEEEYEEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLN 495
Query: 626 KNGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 496 PP----LPDIPNGEWLC 508
Score = 46.2 bits (108), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 468 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 511
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 439 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 491
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 492 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 544
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 545 ----PLPDIPNGEWLC 556
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 516 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 559
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 438 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 490
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 491 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 543
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 544 ----PLPDIPNGEWLC 555
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 515 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 558
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 47/190 (24%), Positives = 78/190 (41%), Gaps = 44/190 (23%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C S + +
Sbjct: 1120 VCEACGKATDPGRLLLCDDCDISYHTFCLDPPLQTVPKGGWKCKWC-----VSCTNCKAI 1174
Query: 565 KGGLE----------APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQ 614
GL AP A + C +C C+++ ++ ++ C Q
Sbjct: 1175 TPGLRCEWQNNYTQCAPCASLSACPVC------------CQNY------IEEELILQCRQ 1216
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI---HAALQDFVSNRAQTIPASSLST 671
C + H C N ++ E+ D F C C H ++ + AQ IP + S
Sbjct: 1217 CIRWSHASCQNLNTEAEV-ELAADSGFDCAACKPFVVSHEVIEPTID--AQIIPKTRDSD 1273
Query: 672 INRKHIEKGI 681
+ + + G+
Sbjct: 1274 VLKTFTQDGV 1283
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC GD +L+C+ C +H CL
Sbjct: 685 TTCGQHYHGMCLDIAVTPLKRAGWQCPDCKVCQNCKHSGDDNQMLVCDTCDKGYHTFCLQ 744
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 745 PVMDSVPTNGWKCKNCR 761
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/144 (22%), Positives = 51/144 (35%), Gaps = 55/144 (38%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD + L C C +H CLD + P +GW+CP+C+
Sbjct: 671 LCDSSGDLLDQLFCTTCGQHYHGMCLDIAVTPLKRAGWQCPDCKV--------------- 715
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
C C+ S DD ++ CD C+K +H CL+
Sbjct: 716 ----------CQNCKHS-------------------GDDNQMLVCDTCDKGYHTFCLQPV 746
Query: 628 GLCDLKEIPKDKWFC-----CDDC 646
+ +P + W C C +C
Sbjct: 747 ----MDSVPTNGWKCKNCRICTEC 766
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 347 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 399
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 400 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 452
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 453 ----PLPDIPNGEWLC 464
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 424 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 467
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 366 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 418
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 419 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 471
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 472 ----PLPDIPNGEWLC 483
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 443 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 486
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 425 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 477
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 478 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 530
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 531 ----PLPDIPNGEWLC 542
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 502 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 545
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 55/144 (38%), Gaps = 36/144 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
+ +G E + C +C+ D ++ CD C
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK----------------------DGGELLCCDACISS 477
Query: 619 FHVGCLRKNGLCDLKEIPKDKWFC 642
+H+ CL L +IP +W C
Sbjct: 478 YHIHCLNP----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 428
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 429 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 481
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 482 ----PLPDIPNGEWLC 493
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 453 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 496
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C CGDG L+ C+GCP AFH ACL P L +P WRC +C
Sbjct: 246 NEDECAACGDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSC 291
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 432
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 433 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 486 ----PLPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 422
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 423 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 475
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 476 ----PLPDIPNGEWLC 487
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 447 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 62/153 (40%), Gaps = 41/153 (26%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHS 555
+C CGD + LLLC+ C +++H CLDP L +P+ W+C C G
Sbjct: 1071 VCEACGDASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKWCVWCVQCGSTSPGVH 1130
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
S R+ L G P + C C+ A +D ++ C QC
Sbjct: 1131 SDWQRNYSLCG----PCCSLSRCPACQ------------------QAYAEDDLILQCQQC 1168
Query: 616 EKEFHVGCLRKNGLC--DLKEIPKDKWFCCDDC 646
++ H C GLC D E+ D+ F C C
Sbjct: 1169 DRWVHATC---QGLCTEDEVEVAADEGFDCSLC 1198
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 69/189 (36%), Gaps = 65/189 (34%)
Query: 500 QRRTTGGSD-DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES----GWRCPNCRQGH 554
Q TG S +C GD LL+C+ C +H +CLDP + P GW+CP CR
Sbjct: 455 QHARTGSSQCRLCAGSGDSGGLLMCSCCGSCYHGSCLDPPVTPSPLSRVGWQCPQCR--- 511
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQ 614
+ RS L+G D V+ C +
Sbjct: 512 ---VCRSCSLQG---------------------------------------DSGVLLCAR 529
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFC----CDDCNRIHAALQDFVSNRAQTIPASSLS 670
C+K +H CL L + P W C C D +AA + ++ PAS S
Sbjct: 530 CDKAYHAHCLTP----PLDDAPHAAWTCKAETCPDLVLTNAA-------QLKSGPASLQS 578
Query: 671 TINRKHIEK 679
++EK
Sbjct: 579 AAANDNVEK 587
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 43/83 (51%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 299
L G++ G LC C LC +V+S +FE HA + R +I ENGK + +++ +
Sbjct: 35 LRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRG 94
Query: 300 APLGILEEVVKKVAGSSFNEGSF 322
APL LE ++ + S E F
Sbjct: 95 APLHDLEATIQNIVCSPPEEKYF 117
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 20/136 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 374 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 426
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 427 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 479
Query: 627 NGLCDLKEIPKDKWFC 642
L +IP +W C
Sbjct: 480 ----PLPDIPNGEWLC 491
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 451 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 494
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 35/59 (59%), Gaps = 7/59 (11%)
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
AM Q+ ++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 2 AMAQK-----NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 55
>gi|190344475|gb|EDK36155.2| hypothetical protein PGUG_00253 [Meyerozyma guilliermondii ATCC
6260]
Length = 709
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 58/144 (40%), Gaps = 27/144 (18%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
TKKR H L P G +T +G+R G K+ K ISP + A
Sbjct: 197 TKKRRKRTHNKLETPKQPTRGSSVTPASEGRRSIRGSKRI---------KVISPKKHAA- 246
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
+A +SNG I+ A + TG +DD C CG + C+ CP
Sbjct: 247 TNLAP---------SSNG------TITAAASESEQTGDNDDFCSTCGGSGVFICCDSCPK 291
Query: 529 AFHAACLDPLL--IPESGWRCPNC 550
+FH C DP + PE W C C
Sbjct: 292 SFHFLCCDPPIEECPEDNWNCREC 315
>gi|444519036|gb|ELV12526.1| PHD finger protein 21B [Tupaia chinensis]
Length = 489
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C GCP A+H +CLDP L P+ W CP C+Q
Sbjct: 310 DEHCAACTRGANLQPCGGCPGAYHLSCLDPPLKTAPKGTWLCPKCQQ 356
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 55/146 (37%), Gaps = 57/146 (39%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C + G+ L C C +H +CLDP + + +GW+CPNC+
Sbjct: 382 ICDLPGNLSESLFCTSCGQHYHGSCLDPPVSIDPVVRAGWQCPNCK-------------- 427
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
IC + CR DD ++ CD C+K +H CL+
Sbjct: 428 --------------IC----------QTCRQ------PGDDNKMLVCDTCDKGYHTFCLK 457
Query: 626 KNGLCDLKEIPKDKWFC-----CDDC 646
+ IPK+ W C C DC
Sbjct: 458 PAMIT----IPKNGWKCKTCRVCTDC 479
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C
Sbjct: 837 VCEGCGKSSDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWC 882
>gi|431905346|gb|ELK10391.1| Chromodomain-helicase-DNA-binding protein 4 [Pteropus alecto]
Length = 1854
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C VC G ++LC+ CP A+H CLDP + PE W CP+C +
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEK 416
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 55/146 (37%), Gaps = 57/146 (39%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C + G+ L C C +H +CLDP + + +GW+CPNC+
Sbjct: 381 ICDLPGNLSESLFCTSCGQHYHGSCLDPPVSIDPVVRAGWQCPNCK-------------- 426
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
IC + CR DD ++ CD C+K +H CL+
Sbjct: 427 --------------IC----------QTCRQ------PGDDNKMLVCDTCDKGYHTFCLK 456
Query: 626 KNGLCDLKEIPKDKWFC-----CDDC 646
+ IPK+ W C C DC
Sbjct: 457 PAMIT----IPKNGWKCKTCRVCTDC 478
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C
Sbjct: 817 VCEGCGKSSDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWC 862
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 55/146 (37%), Gaps = 57/146 (39%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C + G+ L C C +H +CLDP + + +GW+CPNC+
Sbjct: 398 ICDLPGNLSESLFCTSCGQHYHGSCLDPPVSIDPVVRAGWQCPNCK-------------- 443
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
IC + CR DD ++ CD C+K +H CL+
Sbjct: 444 --------------IC----------QTCRQ------PGDDNKMLVCDTCDKGYHTFCLK 473
Query: 626 KNGLCDLKEIPKDKWFC-----CDDC 646
+ IPK+ W C C DC
Sbjct: 474 PAMIT----IPKNGWKCKTCRVCTDC 495
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C
Sbjct: 834 VCEGCGKSSDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWC 879
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 61/166 (36%), Gaps = 48/166 (28%)
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
M R + + +C + E +LLC+GC +H CL P L IP+ W C C
Sbjct: 967 VMWMRSVSNAACRVCRKKSNPEQMLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPTQ 1026
Query: 555 SSSMSRS---VDLKGGLEAPGAEVGG---------------CVICRLSPSENFDIRLCRS 596
S R+ V++ E +V C IC SP E
Sbjct: 1027 LSPRKRTKAPVEVSSEEEDDNEKVDEDGDEDEEEEDLNQEVCNICE-SPGE--------- 1076
Query: 597 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
+I CD C K FH+ C+ DLK +P+ W C
Sbjct: 1077 ------------LILCDFCPKSFHLDCI------DLKRLPRGTWKC 1104
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQG 553
++C++C L+LC+ CP +FH C+D +P W+CP C G
Sbjct: 1066 EVCNICESPGELILCDFCPKSFHLDCIDLKRLPRGTWKCPPCVLG 1110
>gi|332838463|ref|XP_508960.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 isoform 3 [Pan troglodytes]
Length = 1825
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP + PE W CP+C
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|432909970|ref|XP_004078255.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1882
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 2/61 (3%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + R + +
Sbjct: 385 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCCPPMKGKVQRVITWRW 444
Query: 567 G 567
G
Sbjct: 445 G 445
>gi|156379809|ref|XP_001631648.1| predicted protein [Nematostella vectensis]
gi|156218692|gb|EDO39585.1| predicted protein [Nematostella vectensis]
Length = 824
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDP--LLIPESGWRCPNCR 551
D C VC DG LL C+ CP+A+H CL P + +P W+CP C+
Sbjct: 12 DYCRVCKDGGQLLCCDKCPMAYHLKCLIPPMMRVPTGEWKCPRCQ 56
>gi|17562600|ref|NP_504523.1| Protein LET-418 [Caenorhabditis elegans]
gi|403399446|sp|G5EBZ4.1|LE418_CAEEL RecName: Full=Protein let-418; AltName: Full=Lethal protein 418
gi|11095333|gb|AAG29838.1| LET-418 [Caenorhabditis elegans]
gi|351020697|emb|CCD62685.1| Protein LET-418 [Caenorhabditis elegans]
Length = 1829
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IP-ESGWRCPNC 550
+D+ C +C + ENLLLC+ C +FHA C+DP L +P E W CP C
Sbjct: 316 NDEFCKICKETENLLLCDSCVCSFHAYCIDPPLTEVPKEETWSCPRC 362
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/145 (24%), Positives = 56/145 (38%), Gaps = 36/145 (24%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C LLLC+ CP A+H C+D + PE W C +C + H + +
Sbjct: 255 NNDYCEECKQDGELLLCDTCPRAYHTVCIDENMEEPPEGDWSCAHCIE-HGPEVVKEEPA 313
Query: 565 KGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
K E C IC+ + ++ CD C FH C+
Sbjct: 314 KQNDEF-------CKICK----------------------ETENLLLCDSCVCSFHAYCI 344
Query: 625 RKNGLCDLKEIPKDKWFCCDDCNRI 649
L E+PK++ + C C +
Sbjct: 345 DP----PLTEVPKEETWSCPRCETV 365
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 992 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1051
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C IC R R D+ +I C QC++
Sbjct: 1052 CEWQNNYTQCAPCASLSTCPIC---------YRTYR---------DEELIIQCRQCDRWM 1093
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ I D F C C
Sbjct: 1094 HAICQNLNTEEEVENI-ADMGFDCTIC 1119
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 36/160 (22%), Positives = 64/160 (40%), Gaps = 29/160 (18%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + + + + G C VC G+ +L+C+ C +H CL
Sbjct: 362 TTCGQHYHGMCLDIQVTPLKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHTFCLQ 421
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICR--LSPSENFDIR 592
P++ +P +GW+C NCR V + G C++C +N
Sbjct: 422 PVMDSVPTNGWKCKNCR----------VCAECGTRTSCQWHHNCLVCDSCYQQQDNLSCP 471
Query: 593 LCRS---HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 629
C DF + +++C C++ H+ C R G+
Sbjct: 472 FCDKLCLQDFQ------KDMLHCHMCKRWIHMECDRSPGI 505
>gi|356510796|ref|XP_003524120.1| PREDICTED: uncharacterized protein LOC100793933 [Glycine max]
Length = 2325
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 36/70 (51%), Gaps = 2/70 (2%)
Query: 491 DIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP 548
++A +++ G C +C G NLL C+ CP +H CLDP L IP W+CP
Sbjct: 53 EVATDQISSKKKGNDGYYYECVICDVGGNLLCCDSCPRTYHLQCLDPPLKRIPNGKWQCP 112
Query: 549 NCRQGHSSSM 558
+C +G M
Sbjct: 113 SCFEGKDQRM 122
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+DD C C DG L+ C+GCP AFH CLDP L IP W+C C
Sbjct: 251 NDDECAACKDGGELICCDGCPQAFHLTCLDPPLTSIPSGPWQCDWC 296
>gi|345494441|ref|XP_001603951.2| PREDICTED: lysine-specific demethylase lid [Nasonia vitripennis]
Length = 1704
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 379 ICHNCGKGDNEENMLLCDGCDDSYHTFCLLPPLTEIPKGDWRCPKC 424
>gi|444728247|gb|ELW68711.1| Chromodomain-helicase-DNA-binding protein 5 [Tupaia chinensis]
Length = 2128
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 366 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 409
>gi|414590323|tpg|DAA40894.1| TPA: hypothetical protein ZEAMMB73_603732 [Zea mays]
Length = 345
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 43/94 (45%), Gaps = 4/94 (4%)
Query: 460 ISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN 519
+SPS E A P H T + HD IS + + R G C C G N
Sbjct: 29 VSPSN-ENEATSPTSDSPMSHGSTKKKLK-HDTNISPSAKKIRGHDGYFFECVECDLGGN 86
Query: 520 LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
LL C+ CP +H CL+P L +P W+CP CR
Sbjct: 87 LLCCDSCPRVYHLECLNPPLKRLPPGKWQCPRCR 120
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 962 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1021
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C IC R R D+ +I C QC++
Sbjct: 1022 CEWQNNYTQCAPCASLSTCPIC---------YRTYR---------DEELIIQCRQCDRWM 1063
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ I D F C C
Sbjct: 1064 HAICQNLNTEEEVENI-ADMGFDCTIC 1089
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/159 (22%), Positives = 63/159 (39%), Gaps = 29/159 (18%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + + + + G C VC G+ +L+C+ C +H CL
Sbjct: 330 TTCGQHYHGMCLDIQVTPLKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHTFCLQ 389
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICR--LSPSENFDIR 592
P++ +P +GW+C NCR V + G C++C +N
Sbjct: 390 PVMDAVPTNGWKCKNCR----------VCAECGTRTSCQWHHNCLVCDSCYQQQDNLSCP 439
Query: 593 LCRS---HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 628
C DF + +++C C++ H+ C R G
Sbjct: 440 FCEKLCLQDFQ------KDMLHCHMCKRWIHIDCDRSPG 472
>gi|307199491|gb|EFN80104.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Harpegnathos saltator]
Length = 1948
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 8/55 (14%)
Query: 504 TGGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
TG +DD C VC DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 418 TGAADDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 472
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 51/136 (37%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------ITG 419
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
+ + C +C+ D ++ CD C +H CL
Sbjct: 420 AADDDDEHMEFCRVCK----------------------DGGELLCCDSCTSAYHTHCLNP 457
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 458 ----PLSEIPDGDWKC 469
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP AFH CL+P L +P W CP C
Sbjct: 476 EFCRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPRC 519
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 48/136 (35%), Gaps = 39/136 (28%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C ++
Sbjct: 418 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWSCPHCEGEGIQEQEEDEHME- 476
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C +C+ D ++ CD C FH CL
Sbjct: 477 ----------FCRVCK----------------------DGGELLCCDSCPAAFHTFCLNP 504
Query: 627 NGLCDLKEIPKDKWFC 642
LK +P KW C
Sbjct: 505 ----PLKNVPTGKWNC 516
>gi|281352213|gb|EFB27797.1| hypothetical protein PANDA_013886 [Ailuropoda melanoleuca]
Length = 640
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + ++ + +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + R+C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEDEGEKEEEDDHME-----YCRVCK---------DGGELLCCDACISSYHIHCLNP 485
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P S
Sbjct: 486 P----LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 523
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 98 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 145
>gi|332026170|gb|EGI66312.1| Lysine-specific demethylase 5A [Acromyrmex echinatior]
Length = 1637
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|307209985|gb|EFN86754.1| Histone demethylase JARID1A [Harpegnathos saltator]
Length = 1625
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 306 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 351
>gi|390367018|ref|XP_003731167.1| PREDICTED: PHD finger protein 21A-like isoform 1
[Strongylocentrotus purpuratus]
gi|390367020|ref|XP_003731168.1| PREDICTED: PHD finger protein 21A-like isoform 2
[Strongylocentrotus purpuratus]
Length = 501
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+ G +D+C VC LL+C+ C L +H CLDP L +P W CP C+
Sbjct: 351 SDGHEDICAVCRQIGELLMCDTCNLVYHLTCLDPPLAAVPPGAWSCPECK 400
>gi|307166621|gb|EFN60647.1| Histone demethylase JARID1A [Camponotus floridanus]
Length = 1566
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 308 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 353
>gi|328786362|ref|XP_003250774.1| PREDICTED: lysine-specific demethylase lid isoform 1 [Apis
mellifera]
Length = 1643
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|118341427|gb|AAI27573.1| LOC568230 protein [Danio rerio]
gi|118341429|gb|AAI27575.1| LOC568230 protein [Danio rerio]
Length = 430
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C VC G ++LC+ CP A+H CL+P L PE W CP+C +
Sbjct: 384 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCEK 429
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 493 AISLAMGQRRTTGGSDDM-------------CHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+ISLA G+ ++ G D C VC G NLL C+ CP +H CL+P L
Sbjct: 85 SISLAKGKVKSEGHRDQFSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPL 144
Query: 540 --IPESGWRCPNCRQ 552
IP W CP C Q
Sbjct: 145 KRIPMGKWHCPTCNQ 159
>gi|219111827|ref|XP_002177665.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410550|gb|EEC50479.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1255
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 40/89 (44%), Gaps = 16/89 (17%)
Query: 478 YRHIYTSNGMTLHDIAISL-------------AMGQRRTTGGSDDMCHVCGDGENLLLCN 524
YR +YT NG L D S+ GS D C+VC + NLL C+
Sbjct: 608 YRTVYTPNGFFLIDQVTSVDDFGEIMPKENVATSAASLIEDGSVDYCNVCRNHGNLLCCD 667
Query: 525 GCPLAFHAACL---DPLLIPESGWRCPNC 550
CP AFH+ C+ D L E+ W CP C
Sbjct: 668 YCPRAFHSECIHVKDEELDSEAPWECPVC 696
>gi|328786364|ref|XP_003250775.1| PREDICTED: lysine-specific demethylase lid isoform 2 [Apis
mellifera]
Length = 1617
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 304 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 349
>gi|380014866|ref|XP_003691437.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase
lid-like [Apis florea]
Length = 1643
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|358412582|ref|XP_003582344.1| PREDICTED: PHD finger protein 21B isoform 1 [Bos taurus]
gi|359066204|ref|XP_003586213.1| PREDICTED: PHD finger protein 21B isoform 2 [Bos taurus]
Length = 477
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 298 DELCAACKRGTNLQPCGTCPRAYHLSCLDPPLKTAPKGVWVCPKCQQ 344
>gi|119893354|ref|XP_876176.2| PREDICTED: PHD finger protein 21B isoform 2 [Bos taurus]
gi|297475482|ref|XP_002688025.1| PREDICTED: PHD finger protein 21B isoform 1 [Bos taurus]
gi|296486907|tpg|DAA29020.1| TPA: PHD finger protein 21B [Bos taurus]
Length = 489
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 310 DELCAACKRGTNLQPCGTCPRAYHLSCLDPPLKTAPKGVWVCPKCQQ 356
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 237 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 282
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 224 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 269
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 56/151 (37%), Gaps = 58/151 (38%)
Query: 506 GSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSR 560
G + C VC G+ +LL C GC +HAACL+ P +GW+CP C+
Sbjct: 210 GEEACCAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKV-------- 261
Query: 561 SVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFH 620
C CR P E D ++ CD C+K +H
Sbjct: 262 -----------------CQTCR-KPGE------------------DSKMLVCDACDKGYH 285
Query: 621 VGCLRKNGLCDLKEIPKDKWFC-----CDDC 646
CL+ + +P D W C C DC
Sbjct: 286 TFCLQPA----MDSLPTDPWKCKRCRVCTDC 312
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 60/145 (41%), Gaps = 26/145 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRSVD 563
+MC D LLLC+ C +++H CLDP L +P+ GW+C C Q S++ +
Sbjct: 740 EMCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTPGFHCE 799
Query: 564 LKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
+ P A + C +CR ENF ++ ++ C C++ H
Sbjct: 800 WQNNYTHCGPCASLVTCPVCR----ENF--------------MEEELLLQCQYCDRWVHA 841
Query: 622 GCLRKNGLCDLKEIPKDKWFCCDDC 646
C D E D+ F C C
Sbjct: 842 VC-ESLYTEDEVEQASDEGFACTYC 865
>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
rubripes]
Length = 3715
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 56/151 (37%), Gaps = 58/151 (38%)
Query: 506 GSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSR 560
G + C VC G+ +LL C GC +HAACL+ P +GW+CP C+
Sbjct: 81 GEEACCAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKV-------- 132
Query: 561 SVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFH 620
C CR P E D ++ CD C+K +H
Sbjct: 133 -----------------CQTCR-KPGE------------------DSKMLVCDACDKGYH 156
Query: 621 VGCLRKNGLCDLKEIPKDKWFC-----CDDC 646
CL+ + +P D W C C DC
Sbjct: 157 TFCLQPA----MDSLPTDPWKCKRCRVCTDC 183
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 60/145 (41%), Gaps = 26/145 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRSVD 563
+MC D LLLC+ C +++H CLDP L +P+ GW+C C Q S++ +
Sbjct: 470 EMCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTPGFHCE 529
Query: 564 LKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
+ P A + C +CR ENF ++ ++ C C++ H
Sbjct: 530 WQNNYTHCGPCASLVTCPVCR----ENF--------------MEEELLLQCQYCDRWVHA 571
Query: 622 GCLRKNGLCDLKEIPKDKWFCCDDC 646
C D E D+ F C C
Sbjct: 572 VC-ESLYTEDEVEQASDEGFACTYC 595
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 52/132 (39%), Gaps = 41/132 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEA 570
C VCG G LL C+GCP AFH C+ IPE+ W C C L+
Sbjct: 1195 CSVCGLGGELLCCDGCPRAFHVTCIGLEKIPETEWFCNEC----------------NLQT 1238
Query: 571 PGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 630
GA C R+ + +C S D + C++ FH+ C +
Sbjct: 1239 CGA----CKKNRIRLDSHV---ICGSEDGTKG------------CDRVFHLKCAK----- 1274
Query: 631 DLKEIPKDKWFC 642
L +P D W+C
Sbjct: 1275 -LDAVPADDWYC 1285
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACL--DPLLIPESGWRCPNC 550
C++C DG LL C+ CP AFH CL +IP+S W C C
Sbjct: 161 CNICKDGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMC 202
Score = 46.6 bits (109), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 39/182 (21%), Positives = 68/182 (37%), Gaps = 53/182 (29%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------------------- 550
C++C +G L+ C+GCP FH +C+ +P C C
Sbjct: 1062 CYICSEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHECDTSVNPVLPAKKNGKAAAKR 1121
Query: 551 ------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHD 598
R+ + + S D + G ++ A V RL ++ +R +
Sbjct: 1122 PRRSQSPTSRGRRRKQAKTKSGDSDSESGADSDSA-VSATSSHRLPSRKSSPVRPAHVEN 1180
Query: 599 FSAATFDDRT-------------VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 645
+ D ++ CD C + FHV C+ L++IP+ +WF C++
Sbjct: 1181 HTGPKNPDDQWDVDCSVCGLGGELLCCDGCPRAFHVTCI------GLEKIPETEWF-CNE 1233
Query: 646 CN 647
CN
Sbjct: 1234 CN 1235
>gi|47206538|emb|CAF92234.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 299 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 342
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 493 AISLAMGQRRTTGGSDDM-------------CHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+ISLA G+ ++ G D C VC G NLL C+ CP +H CL+P L
Sbjct: 85 SISLAKGKVKSEGHRDQFSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPL 144
Query: 540 --IPESGWRCPNCRQ 552
IP W CP C Q
Sbjct: 145 KRIPMGKWHCPTCNQ 159
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 493 AISLAMGQRRTTGGSDDM-------------CHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+ISLA G+ ++ G D C VC G NLL C+ CP +H CL+P L
Sbjct: 92 SISLAKGKVKSEGHHDQFSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPL 151
Query: 540 --IPESGWRCPNCRQ 552
IP W CP C Q
Sbjct: 152 KRIPMGKWHCPTCNQ 166
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
+DD C VC DG L+ C+GCP +FH +CL P L IP WRC C +S
Sbjct: 27 SNDDECSVCRDGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDACNTQRPTS 80
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 4 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 52
>gi|444322063|ref|XP_004181687.1| hypothetical protein TBLA_0G02280 [Tetrapisispora blattae CBS 6284]
gi|387514732|emb|CCH62168.1| hypothetical protein TBLA_0G02280 [Tetrapisispora blattae CBS 6284]
Length = 769
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR 551
++D C CG + + C+ CP +FH CLDP L +PE W CPNC+
Sbjct: 375 NEDFCSSCGQTGSFVCCDTCPKSFHFLCLDPPLDPDNLPEGNWSCPNCQ 423
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 213 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 258
>gi|449690917|ref|XP_004212502.1| PREDICTED: nucleosome-remodeling factor subunit BPTF-like, partial
[Hydra magnipapillata]
Length = 588
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMS 559
SDD C CG +LL C CP +H CL P L +PE+ W CP C + H +S
Sbjct: 280 SDDYCRKCGRMGDLLCCELCPAVYHLECLSPPLLEVPENEWFCPICAEQHVKGVS 334
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 256 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 308
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 309 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 361
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P +
Sbjct: 362 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVA 399
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 66/153 (43%), Gaps = 29/153 (18%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 985 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1044
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C IC CR++ ++ ++ C QC++
Sbjct: 1045 CEWQNNYTQCAPCASLSSCPIC------------CRNYR------EEDLILQCRQCDRWM 1086
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
H C N +++ + D F C C A
Sbjct: 1087 HAVCQNLNTEEEVENV-ADIGFDCSMCRPYMPA 1118
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/219 (21%), Positives = 89/219 (40%), Gaps = 31/219 (14%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 343 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 402
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDI--- 591
P++ +P +GW+C NCR + ++ G + C+IC + ++
Sbjct: 403 PVMKSVPTNGWKCKNCR----------ICIECGTRSSAQWHHNCLICDTCYQQQDNLCPF 452
Query: 592 -RLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIH 650
C + +F + ++YC+ C++ H+ C + +I +D + C C +
Sbjct: 453 CGKCYNPEFQ------KDMLYCNMCKRWVHLECDKPTDHELDSQIKED--YICMYCKHLG 504
Query: 651 AALQDFVSNRAQTIPASSLSTINRKHIEKGILF-DGTMN 688
A + +P + IE +LF D T N
Sbjct: 505 AEIDSLHPGNELEMPELPTDYSSGMEIEDEVLFLDPTAN 543
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 324 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 373
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 374 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 399
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 400 CLQPV----MKSVPTNGWKC 415
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 264 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 309
>gi|357624184|gb|EHJ75056.1| hypothetical protein KGM_20601 [Danaus plexippus]
Length = 1448
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
MCH+CG G E +LLC+GC ++H CL P L +P+ WRCP C
Sbjct: 322 MCHICGRGDIEEQMLLCDGCDDSYHTFCLVPPLADVPKGDWRCPVC 367
>gi|297815628|ref|XP_002875697.1| hypothetical protein ARALYDRAFT_905616 [Arabidopsis lyrata subsp.
lyrata]
gi|297321535|gb|EFH51956.1| hypothetical protein ARALYDRAFT_905616 [Arabidopsis lyrata subsp.
lyrata]
Length = 1570
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 414 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 463
Query: 564 LKGGLEAPGAEVGGCVICRLSP 585
KG A G + G V + P
Sbjct: 464 KKGPKIAHGTSLRGAVQFGMDP 485
>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
Length = 5559
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1414 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1473
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1474 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1515
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1516 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEP 1567
Query: 680 GIL-FDG 685
F+G
Sbjct: 1568 QYFRFEG 1574
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|334188015|ref|NP_198371.3| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
gi|332006563|gb|AED93946.1| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
Length = 1539
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 409 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 458
Query: 564 LKGGLEAPGAEVGGCVICRLSP 585
KG A G + G V + P
Sbjct: 459 KKGPKIAHGTSLRGAVQFGMDP 480
>gi|432937609|ref|XP_004082462.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A-like [Oryzias latipes]
Length = 1475
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
C +C GD +N+LLC+GC H CL P L +PE W CP+CR SS
Sbjct: 1140 CRICRRKGDADNMLLCDGCDRGHHTHCLRPRLKSVPEGDWFCPDCRPKQRSS 1191
>gi|76155653|gb|AAX26942.2| SJCHGC07786 protein [Schistosoma japonicum]
Length = 234
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 41/95 (43%), Gaps = 7/95 (7%)
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC---GDGENL 520
+ MAA R + T + L + L R D C VC D +NL
Sbjct: 91 ELRTRTWMAAWRTEVENARTLTRLNL--LHACLDACVRWEKSVEDARCRVCRRKTDDDNL 148
Query: 521 LLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
LLC+GC LAFH CL P L +P W CP CR
Sbjct: 149 LLCDGCNLAFHLYCLRPPLKRVPTGDWFCPTCRPA 183
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 64/162 (39%), Gaps = 21/162 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + V +
Sbjct: 256 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEK-------EGVQWEA 308
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E E G ++ + CR D ++ CD C +H+ CL
Sbjct: 309 KEEEEEYEEEGEEEGEKEEEDDH-MEYCR------VCKDGGELLCCDACISSYHIHCLNP 361
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L +IP +W C C + +Q + R P +
Sbjct: 362 ----PLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVA 399
>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Ovis aries]
Length = 5387
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1486 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1545
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1546 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1587
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1588 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEP 1639
Query: 680 GIL-FDG 685
F+G
Sbjct: 1640 QYFRFEG 1646
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|242009521|ref|XP_002425532.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
gi|212509407|gb|EEB12794.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
Length = 1963
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 52/138 (37%), Gaps = 47/138 (34%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IP+ W C C+ + +
Sbjct: 1684 CQFCQSGDNEDKLLLCDGCDRGYHMYCFKPKMESIPDGDWYCHECKNKSNGEKN------ 1737
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C++C P +N+ + C+ C + +H+ CL
Sbjct: 1738 ------------CIVCGKRPIKNY--------------------VICEHCPRIYHIECLN 1765
Query: 626 KNGLCDLKEIPKDKWFCC 643
L ++P+ KW C
Sbjct: 1766 P----PLSKVPRAKWNCV 1779
>gi|8978355|dbj|BAA98208.1| unnamed protein product [Arabidopsis thaliana]
Length = 1515
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 409 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 458
Query: 564 LKGGLEAPGAEVGGCVICRLSP 585
KG A G + G V + P
Sbjct: 459 KKGPKIAHGTSLRGAVQFGMDP 480
>gi|403175396|ref|XP_003334222.2| hypothetical protein PGTG_15759 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171584|gb|EFP89803.2| hypothetical protein PGTG_15759 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 392
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 13/80 (16%)
Query: 476 QPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAFHA 532
+P R TS+ T HD+ S+ G D+C +CG D N+LLC+ C +H
Sbjct: 289 RPIRRPSTSHPSTSHDLDTSMDHG--------GDICEICGSDEDDPNILLCDSCDKGYHL 340
Query: 533 ACLDP--LLIPESGWRCPNC 550
CL P L +PE W C C
Sbjct: 341 QCLTPPLLTVPEGNWYCDAC 360
>gi|196015577|ref|XP_002117645.1| hypothetical protein TRIADDRAFT_61645 [Trichoplax adhaerens]
gi|190579814|gb|EDV19903.1| hypothetical protein TRIADDRAFT_61645 [Trichoplax adhaerens]
Length = 533
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
DD+C VC G LL+C+ C +H CLDP L IP+ W CP+C
Sbjct: 394 DDVCCVCNKGGELLICDTCNSVYHLRCLDPPLSSIPDGMWMCPDC 438
>gi|339244153|ref|XP_003378002.1| putative PHD finger protein [Trichinella spiralis]
gi|316973126|gb|EFV56753.1| putative PHD finger protein [Trichinella spiralis]
Length = 864
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 79/197 (40%), Gaps = 51/197 (25%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWR---CPNCRQGHSSSMSRS 561
+C CG G++ LLLC+ C +++H CLDP L IP+ WR C C+ ++ +
Sbjct: 266 VCEGCGTGDDEQHLLLCDECDVSYHMYCLDPPLDSIPQGAWRCKWCSTCQFCGATPPNGM 325
Query: 562 VDLKGGLEA--PGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+D L A A + C C L E + +I CD C +
Sbjct: 326 LDSIKNLRACFKCASLYSCCFCHLQYKE------------------EDMIILCDICHRWS 367
Query: 620 HVGCLRKNGLC--DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
H C NGLC D+ + D F C C P + S+ +
Sbjct: 368 HANC---NGLCAEDILKKGLDAGFICVYCR-----------------PGDACSSAAMHFV 407
Query: 678 EKGILFDGT-MNDVQWQ 693
+G+L + ++ VQW+
Sbjct: 408 IEGVLLTKSGLSTVQWR 424
>gi|260943552|ref|XP_002616074.1| hypothetical protein CLUG_03315 [Clavispora lusitaniae ATCC 42720]
gi|238849723|gb|EEQ39187.1| hypothetical protein CLUG_03315 [Clavispora lusitaniae ATCC 42720]
Length = 766
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 66/185 (35%), Gaps = 44/185 (23%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
Q T +DD C CG + C+ CP +FH C P L +PE W C CR +
Sbjct: 227 QPEDTTQNDDYCATCGGTGEFICCDSCPKSFHLLCCGPPLREVPEDNWNCSECRAAQGMT 286
Query: 558 MSRSVDLKGGLEAPGAEVGG------CVICRLSPSENFDI----------------RLCR 595
RS + G A + G C+ RL + D+ R R
Sbjct: 287 QRRSWNSVGVFGALLNALHGRNPREYCLPRRLRENTFIDVSAGPSGQYTDASLKPERTSR 346
Query: 596 SHDFSAATFDDR------------------TVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 637
+ + DR T++ CD C ++H+ CL +C K +
Sbjct: 347 ADELDIDALYDRAGRPFLCHRCRMSGRANKTLVACDYCPLKWHLDCL-PEPVCSAKTLGM 405
Query: 638 DKWFC 642
KW C
Sbjct: 406 -KWRC 409
>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
Length = 5503
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1392 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1451
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1452 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1493
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1494 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEP 1545
Query: 680 GIL-FDG 685
F+G
Sbjct: 1546 QYFRFEG 1552
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|301775083|ref|XP_002922956.1| PREDICTED: PHD finger protein 21B-like [Ailuropoda melanoleuca]
Length = 484
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W+CP C+Q
Sbjct: 305 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWQCPKCQQ 351
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 62/148 (41%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1154 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1213
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1214 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1255
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+ P D+ F C C
Sbjct: 1256 HAGCESLFTEDDVDHAP-DEGFDCVSCQ 1282
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 580 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 639
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 640 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 681
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 682 HAVCQNLNTEEEVENV-ADIGFDCSMC 707
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 29/153 (18%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 960 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1019
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1020 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 1061
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
H C N +++ + D F C C A
Sbjct: 1062 HAVCQNLNTEEEVENV-ADIGFDCSMCRPYMPA 1093
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 314 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 373
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 374 PVMKSVPTNGWKCKNCR 390
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 295 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 344
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 345 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 370
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 371 CLQPV----MKSVPTNGWKC 386
>gi|25412286|pir||B84645 hypothetical protein At2g25170 [imported] - Arabidopsis thaliana
Length = 1359
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES--GWRCPNC 550
T ++ C CG+ NL+ CN C AFHA CL P L S WRCP C
Sbjct: 45 TDAKENACQACGESTNLVSCNTCTYAFHAKCLVPPLKDASVENWRCPEC 93
>gi|195999462|ref|XP_002109599.1| hypothetical protein TRIADDRAFT_53787 [Trichoplax adhaerens]
gi|190587723|gb|EDV27765.1| hypothetical protein TRIADDRAFT_53787 [Trichoplax adhaerens]
Length = 1866
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 52/146 (35%), Gaps = 48/146 (32%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
C +C G+N LLLC+ C H C P L +PE W CP C
Sbjct: 1630 FCQICRKGDNESLLLLCDKCDCGTHTYCCTPKLTSVPEGDWFCPKC-------------- 1675
Query: 565 KGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 624
A + C IC + D+ I C++C +H+ CL
Sbjct: 1676 -----ASECDKDYCCICEID--------------------DNEKAIACERCNSVYHLACL 1710
Query: 625 RKNGLCDLKEIPKDKWFCCDDCNRIH 650
+K PK W+C D RI+
Sbjct: 1711 DPV----IKRYPKTGWYCRDCRKRIY 1732
Score = 46.6 bits (109), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 509 DMCHVC--GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C +C D E + C C +H ACLDP++ P++GW C +CR+
Sbjct: 1682 DYCCICEIDDNEKAIACERCNSVYHLACLDPVIKRYPKTGWYCRDCRK 1729
>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
Length = 4932
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 867 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 926
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 927 CEWQNSYTHCGPCASLVTCPICH------------------APYMEEDLLIQCRHCERWM 968
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 969 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAVPVAPPELVPMKVKEPEP 1020
Query: 680 GIL-FDG 685
F+G
Sbjct: 1021 QYFRFEG 1027
>gi|302793688|ref|XP_002978609.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
gi|300153958|gb|EFJ20595.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
Length = 1349
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC--LDPLLIPESGWRCPNCR-QGHSS 556
Q+ T+ + D C +CG NL+ C+GCP A+H+ C + +PE W CP C Q ++
Sbjct: 464 QKHTSDWNSDDCVLCGMDGNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPECTLQKIAT 523
Query: 557 SMSRSVDLKGGLEAPGAEVGG 577
++R+ + G E G + G
Sbjct: 524 EVTRARKVVRGFECFGIDTYG 544
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 521 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 580
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 581 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 622
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 623 HAVCQNLNTEEEVENV-ADIGFDCSMC 648
>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
melanoleuca]
Length = 5483
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1429 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1488
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1489 CEWQNSYTHCGPCASLVTCPICH------------------APYMEEDLLIQCRHCERWM 1530
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1531 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAVPVAPPELVPMKVKEPEP 1582
Query: 680 GIL-FDG 685
F+G
Sbjct: 1583 QYFRFEG 1589
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
Length = 5154
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1008 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1067
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1068 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1109
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1110 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEP 1161
Query: 680 GIL-FDG 685
F+G
Sbjct: 1162 QYFRFEG 1168
>gi|330805108|ref|XP_003290529.1| hypothetical protein DICPUDRAFT_81259 [Dictyostelium purpureum]
gi|325079359|gb|EGC32963.1| hypothetical protein DICPUDRAFT_81259 [Dictyostelium purpureum]
Length = 1463
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 5/58 (8%)
Query: 507 SDDMCHVCGDG-----ENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
SDD+C VC DG ++ C+GC +A H C LLIPE W C C S +S
Sbjct: 448 SDDVCAVCFDGTSDDTNQIVYCDGCDIAVHQDCYGILLIPEGQWFCQKCESPEKSQIS 505
>gi|334188013|ref|NP_851094.2| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
gi|332006562|gb|AED93945.1| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
Length = 1706
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 43/91 (47%), Gaps = 12/91 (13%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 409 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 458
Query: 564 LKGGLEAPGAEVGGCVICRLSPSENFDIRLC 594
KG A G + G V + P + C
Sbjct: 459 KKGPKIAHGTSLRGAVQFGMDPHGRLFLGTC 489
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 73/189 (38%), Gaps = 50/189 (26%)
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSN--GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGE 518
+P E G+A R+ TS M L+ + S+A + C C G+
Sbjct: 1889 APDPNEIPKGLAVWREAVNRAQTSAQLAMCLYSLESSIAWDKSIMKAN----CQFCHSGD 1944
Query: 519 N---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGA 573
N LLLC+GC +H C P + IPE W C C M+++ +
Sbjct: 1945 NEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCHEC-------MNKATGER-------- 1989
Query: 574 EVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 633
C++C ++T R +I C+ C + +H C+ +
Sbjct: 1990 ---NCIVC----------------GKKSSTSGTR-LILCELCPRAYHTDCIHP----IMH 2025
Query: 634 EIPKDKWFC 642
++P+ KW+C
Sbjct: 2026 KVPRGKWYC 2034
>gi|18400745|ref|NP_565587.1| CHD3-type chromatin-remodeling factor PICKLE [Arabidopsis thaliana]
gi|75193642|sp|Q9S775.1|PKL_ARATH RecName: Full=CHD3-type chromatin-remodeling factor PICKLE;
AltName: Full=Protein GYMNOS
gi|6318930|gb|AAF07084.1|AF185578_1 GYMNOS/PICKLE [Arabidopsis thaliana]
gi|6478518|gb|AAF13875.1|AF185577_1 chromatin remodeling factor CHD3 [Arabidopsis thaliana]
gi|330252572|gb|AEC07666.1| CHD3-type chromatin-remodeling factor PICKLE [Arabidopsis thaliana]
Length = 1384
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES--GWRCPNC 550
T ++ C CG+ NL+ CN C AFHA CL P L S WRCP C
Sbjct: 45 TDAKENACQACGESTNLVSCNTCTYAFHAKCLVPPLKDASVENWRCPEC 93
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 171
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 172 HAVCQNLNTEEEVENV-ADIGFDCSMC 197
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1002 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1061
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1062 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 1103
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1104 HAVCQNLNTEEEVENV-ADIGFDCSMC 1129
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 390 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 415
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 416 CLQPV----MKSVPTNGWKC 431
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 64/147 (43%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1060 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGPR 1119
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C S N+ R D ++ C QC++
Sbjct: 1120 CEWQNNYTQCAPCASLSTCPVC----SRNY-----REEDL---------ILQCRQCDRWM 1161
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1162 HAVCQNLNTEDEVENV-ADIGFDCTMC 1187
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 45/211 (21%), Positives = 82/211 (38%), Gaps = 32/211 (15%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 404 TTCGQHYHGMCLDIAVTALKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHTFCLQ 463
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICR--LSPSENFDIR 592
P++ +P +GW+C NCR + + G C++C P +N
Sbjct: 464 PVMDSVPTNGWKCKNCR----------ICAECGTRTSSQWHHNCLVCDNCYQPQDNTACP 513
Query: 593 L---CRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCN 647
C DF + +++C C++ H+ C K +L K+ + CC
Sbjct: 514 FCGKCYQPDFQ------KDMLHCQMCKRWIHIEC-DKPADTELDSQLKEDYVCMCCKQLG 566
Query: 648 RIHAALQDFVSNRAQTIPASSLSTINRKHIE 678
+ ++T+P S ++N E
Sbjct: 567 DLEPVQPCDEMETSETVPQSGTDSVNEMETE 597
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1003 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1062
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1063 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 1104
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1105 HAVCQNLNTEEEVENV-ADIGFDCSMC 1130
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 390 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 415
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 416 CLQPV----MKSVPTNGWKC 431
>gi|405973316|gb|EKC38036.1| Lysine-specific demethylase 5A [Crassostrea gigas]
Length = 1883
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 31/46 (67%), Gaps = 5/46 (10%)
Query: 510 MCHVC--GDGEN-LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C GDGE +LLC+GC AFH CL P + +P+ WRCP C
Sbjct: 289 ICHMCNRGDGEEYMLLCDGCDDAFHTYCLIPPMPEVPKGDWRCPKC 334
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1004 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1063
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1064 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 1105
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1106 HAVCQNLNTEEEVENV-ADIGFDCSMC 1131
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ R G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLRRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C CR S +D ++ CD C+K +H
Sbjct: 390 ---------------CQNCRQS-------------------GEDSKMLVCDTCDKGYHTF 415
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 416 CLQPV----MKSVPTNGWKC 431
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1003 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1062
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1063 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 1104
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1105 HAVCQNLNTEEEVENV-ADIGFDCSMC 1130
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ R G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLRRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C CR S +D ++ CD C+K +H
Sbjct: 390 ---------------CQNCRQS-------------------GEDSKMLVCDTCDKGYHTF 415
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 416 CLQPV----MKSVPTNGWKC 431
>gi|297821893|ref|XP_002878829.1| PKL/SSL2 [Arabidopsis lyrata subsp. lyrata]
gi|297324668|gb|EFH55088.1| PKL/SSL2 [Arabidopsis lyrata subsp. lyrata]
Length = 1399
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES--GWRCPNC 550
T ++ C CG+ NL+ CN C AFHA CL P L S WRCP C
Sbjct: 45 TDAKENACQACGESANLVSCNTCTYAFHAKCLVPPLKDASVENWRCPEC 93
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 171
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 172 HAVCQNLNTEEEVENV-ADIGFDCSMC 197
>gi|449665037|ref|XP_002164686.2| PREDICTED: uncharacterized protein LOC100213470 [Hydra
magnipapillata]
Length = 1382
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 57/141 (40%), Gaps = 25/141 (17%)
Query: 511 CHVCGDG------ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSV 562
C +C G ++L++C+ C LA+H AC+ P L +PES W CP C S+S +
Sbjct: 965 CKICNRGKGRYHKDHLVICDACSLAYHLACIKPPLKEVPESTWECPECT---SNSQREKI 1021
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDI-RLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
E S++ DI +C + D VI CD C +H
Sbjct: 1022 SNGKSRERSSKTTESESESSNEDSDHEDICHVCET---------DGLVILCDTCPLSYHF 1072
Query: 622 GCLRKNGLCDLKEIPKDKWFC 642
C L+ P+ W C
Sbjct: 1073 DCHNP----PLRHAPRGPWSC 1089
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
+D+CHVC ++LC+ CPL++H C +P L P W C CR+ +S
Sbjct: 1048 EDICHVCETDGLVILCDTCPLSYHFDCHNPPLRHAPRGPWSCYECRKNSKNS 1099
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
troglodytes]
Length = 4026
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 171
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 172 HAVCQNLNTEEEVENV-ADIGFDCSMC 197
>gi|440895709|gb|ELR47837.1| PHD finger protein 21B, partial [Bos grunniens mutus]
Length = 476
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 297 DELCAACKRGTNLQPCGTCPGAYHLSCLDPPLKTAPKGMWVCPKCQQ 343
>gi|340710729|ref|XP_003393938.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase
lid-like [Bombus terrestris]
Length = 1644
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|209880517|ref|XP_002141698.1| PHD / zinc finger (C3HC4 type) family protein [Cryptosporidium
muris RN66]
gi|209557304|gb|EEA07349.1| PHD / zinc finger (C3HC4 type) family protein [Cryptosporidium
muris RN66]
Length = 855
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 9/82 (10%)
Query: 478 YRHIYTSNGMTLHDIAISLAMGQRRTTGGSD-------DMCHVCGDGENLLLCNGCPLAF 530
Y HI T N + + I + G D ++CH+ E LLLC+GC +
Sbjct: 224 YDHIPTENSIAIITIPVERRTLNNDELDGDDPFADFACEVCHLNDHEEVLLLCDGCDCGY 283
Query: 531 HAACLDPLL--IPESGWRCPNC 550
H CLDP L +P W CP C
Sbjct: 284 HTYCLDPPLDSVPSGEWFCPRC 305
>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
Length = 4957
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 62/148 (41%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 849 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 908
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 909 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 950
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+ P D+ F C C
Sbjct: 951 HAGCESLFTEDDVDHAP-DEGFDCVSCQ 977
>gi|351700053|gb|EHB02972.1| Bromodomain adjacent to zinc finger domain protein 1A [Heterocephalus
glaber]
Length = 1857
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
MC GD EN++LC+GC H C+ P L +PE W CP CR S
Sbjct: 1454 MCRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRS 1502
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCR 551
+DD C VCG +LL C+GCP +FH C+ +PE W CP C+
Sbjct: 1052 NDDECTVCGIEGDLLCCDGCPGSFHRQCIGVARLPEGKWLCPECK 1096
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1110
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1111 HAVCQNLNTEEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|219112141|ref|XP_002177822.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410707|gb|EEC50636.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 462
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +CGDG +LL+C+GC +H C+ P L IPE W C +C
Sbjct: 347 DACAICGDGGSLLICDGCEGEYHMDCVQPSLAEIPEGHWECDDC 390
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 615 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 674
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 675 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 716
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 717 HAVCQNLNTEEEVENV-ADIGFDCSMC 742
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 515 GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
G+ +L+C+ C +H CL P++ +P +GW+C NCR
Sbjct: 4 GEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCR 42
>gi|350412813|ref|XP_003489772.1| PREDICTED: lysine-specific demethylase lid-like isoform 1 [Bombus
impatiens]
Length = 1644
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4527
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 28/143 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ W+C C Q +++
Sbjct: 721 VCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVTCTQCGATTPGLR 780
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C IC + D+S T T++ C QC++ F
Sbjct: 781 CEWQKNYTQCAPCASLMTCPICLV--------------DYSEGT----TILQCRQCDRWF 822
Query: 620 HVGCLRKNGLCDLKEIPKDKWFC 642
H C + D+++ ++ + C
Sbjct: 823 HASCQSLHSEEDVEKAAENGFNC 845
>gi|328773887|gb|EGF83924.1| hypothetical protein BATDEDRAFT_84646 [Batrachochytrium
dendrobatidis JAM81]
Length = 828
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 61/143 (42%), Gaps = 30/143 (20%)
Query: 510 MCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-----------RQG 553
+C VC GD + LL C+ C +H CL+P L +PE W C C +Q
Sbjct: 623 LCTVCNNAGDDDKLLFCDTCDRGYHMYCLNPPLEVLPEGSWLCSECAVCKSCKKRPEKQE 682
Query: 554 HSSSMSRSVDLKGGLE--------APGAEVGGCVICRLSPS--ENFDI-RLCR--SHDFS 600
+ M R V + L P G +C ++F+ R C H +S
Sbjct: 683 GTEDMWRHVVIPPSLSLQEIQIKPPPATSALGTYLCTYCTDCYDHFEADRFCPLCIHVYS 742
Query: 601 AATFDDRTVIYCDQCEKEFHVGC 623
+ DD ++ CD+C++ HVGC
Sbjct: 743 EDS-DDLAMVCCDECDRWVHVGC 764
>gi|350412816|ref|XP_003489773.1| PREDICTED: lysine-specific demethylase lid-like isoform 2 [Bombus
impatiens]
Length = 1618
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 304 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 349
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 895 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 954
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 955 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 996
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 997 HAVCQNLNTEEEVENV-ADIGFDCSMC 1022
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 291 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 350
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 351 PVMKSVPTNGWKCKNCR 367
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 272 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 321
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 322 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 347
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 348 CLQPV----MKSVPTNGWKC 363
>gi|357139082|ref|XP_003571114.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Brachypodium distachyon]
Length = 924
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 44/89 (49%), Gaps = 14/89 (15%)
Query: 481 IYTSNGMT-------LHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAF 530
Y SNG LHDI +S A + D +C VCG D +++LLC+ C +
Sbjct: 26 FYLSNGNAGSEIHDELHDI-LSAATNLPKAPW-EDGVCKVCGIDRDDDSVLLCDKCDSEY 83
Query: 531 HAACLDPLL--IPESGWRCPNCRQGHSSS 557
H CL+P L IP+ W CP+C G S
Sbjct: 84 HTYCLNPPLARIPQGNWYCPSCMSGQKKS 112
>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
Length = 3310
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 454 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 513
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 514 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 555
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 556 HAVCQNLNTEEEVENV-ADIGFDCSMC 581
>gi|383853507|ref|XP_003702264.1| PREDICTED: lysine-specific demethylase lid-like isoform 2
[Megachile rotundata]
Length = 1616
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 302 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPRC 347
>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
Length = 2475
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 65/155 (41%), Gaps = 33/155 (21%)
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAA 533
Y + +TLHD S +G+ G C VC GD NLLLC+ C +++H
Sbjct: 450 QTYHTYCVTLHDKLNSAVVGR----GWRCLDCTVCEGCGTGGDEANLLLCDECDVSYHIY 505
Query: 534 CLDPLL--IPESGWR---CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSEN 588
C+ PLL IP+ WR C CR+ + + S + GL P A + C C N
Sbjct: 506 CMKPLLDKIPQGPWRCQWCSRCRRCNHKAASGNDLTSQGLCFPCASLRKCPRC----ERN 561
Query: 589 FDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGC 623
+ + + +I C QC K H C
Sbjct: 562 YQL--------------NEKLIRCSQCSKWQHGAC 582
>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
Length = 3314
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 454 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 513
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 514 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 555
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 556 HAVCQNLNTEEEVENV-ADIGFDCSMC 581
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1003 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1062
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1063 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1104
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1105 HAVCQNLNTEEEVENV-ADIGFDCSMC 1130
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1110
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1111 HAVCQNLNTEEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
D C++C DG +LL C+ CP +FH C+ IPE W C C
Sbjct: 122 DHCNICKDGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRC 163
>gi|301625847|ref|XP_002942113.1| PREDICTED: PHD finger protein 21B-like [Xenopus (Silurana)
tropicalis]
Length = 565
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H C+DP L +P+ W CP C+Q
Sbjct: 400 DEYCSACKRGSNLQTCGACPRAYHLDCVDPPLKVLPKGVWLCPKCQQ 446
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1110
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1111 HAVCQNLNTEEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1110
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1111 HAVCQNLNTEEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1110
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1111 HAVCQNLNTEEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
Length = 5488
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 74/187 (39%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1390 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1449
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1450 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1491
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q ++ I L I K E
Sbjct: 1492 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYIVKPVVPIAPPELVPIKVKEPEP 1543
Query: 680 GIL-FDG 685
F+G
Sbjct: 1544 QYFRFEG 1550
Score = 46.2 bits (108), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 53/153 (34%), Gaps = 58/153 (37%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CTVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC-----CDDCNRIHAAL 653
++E+P W C C C A L
Sbjct: 305 P----PMEELPAHSWKCKACRICRTCGASSAEL 333
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 63/153 (41%), Gaps = 29/153 (18%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 925 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 984
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 985 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1026
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 652
H C N +++ + D F C C A
Sbjct: 1027 HAVCQNLNTEEEVENV-ADIGFDCSMCRPYMPA 1058
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 276 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 335
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 336 PVMKSVPTNGWKCKNCR 352
>gi|195130615|ref|XP_002009747.1| GI15065 [Drosophila mojavensis]
gi|193908197|gb|EDW07064.1| GI15065 [Drosophila mojavensis]
Length = 886
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 9/73 (12%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR---- 551
+R G + D+C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPSGQWLCHSCRMAKV 104
Query: 552 -QGHSSSMSRSVD 563
Q SSS + SV+
Sbjct: 105 SQPTSSSKASSVE 117
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSR 560
+D+C C G L+ C+ CP AFH C P+L +P+ W C NC++G S+ R
Sbjct: 1422 EDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCKKGTKSAAIR 1476
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+DD C C DG +LL C+ C +FH CL+P L IPE W C +C+
Sbjct: 65 NDDFCSSCKDGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSCK 111
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 4/45 (8%)
Query: 598 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
DF ++ D ++ CD CEK FH+ CL L+EIP+ W+C
Sbjct: 67 DFCSSCKDGGDLLCCDSCEKSFHLMCLNP----PLEEIPEGDWYC 107
>gi|383853505|ref|XP_003702263.1| PREDICTED: lysine-specific demethylase lid-like isoform 1
[Megachile rotundata]
Length = 1642
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 328 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPRC 373
>gi|156392707|ref|XP_001636189.1| predicted protein [Nematostella vectensis]
gi|156223290|gb|EDO44126.1| predicted protein [Nematostella vectensis]
Length = 1071
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 33/67 (49%), Gaps = 10/67 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDL 564
D C CG+G +LL C+ CP AFH +C DP L IP+ W C CR D
Sbjct: 49 DNCDSCGEGGDLLCCDQCPCAFHLSCCDPPLEEDDIPDGEWLCIECRNKQKQ------DQ 102
Query: 565 KGGLEAP 571
K G E P
Sbjct: 103 KNGKEVP 109
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 510 MCHVCGDGE---NLLLCNGCPLAFHAACLD-PLLIPESG-WRCPN 549
+C C E L+ C+ CPL FH C++ PL P SG W CPN
Sbjct: 212 LCFTCSRTELFGKLVQCDFCPLGFHMDCINPPLTTPPSGMWMCPN 256
>gi|50290691|ref|XP_447778.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527089|emb|CAG60725.1| unnamed protein product [Candida glabrata]
Length = 671
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 10/65 (15%)
Query: 496 LAMGQRRTTGGS------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGW 545
+AM +++++ G+ DD C C + L C+ CP +FH CL+P L +PE W
Sbjct: 255 IAMAKKKSSDGTTQEIENDDYCSACFQTGSFLCCDTCPKSFHFLCLNPPLDPDHLPEGDW 314
Query: 546 RCPNC 550
CP C
Sbjct: 315 SCPQC 319
>gi|348551108|ref|XP_003461372.1| PREDICTED: PHD finger protein 21B [Cavia porcellus]
Length = 513
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G +LL C+ CP A+H CL P L P GW CP C+Q
Sbjct: 334 DEHCAACRRGADLLHCSACPDAYHLGCLHPPLKTAPRGGWLCPKCQQ 380
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 941 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1000
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1001 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1042
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1043 HAVCQNLNTEEEVENV-ADIGFDCSMC 1068
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 292 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 351
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 352 PVMKSVPTNGWKCKNCR 368
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 273 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 322
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 323 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 348
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 349 CLQPV----MKSVPTNGWKC 364
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 956 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1015
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1016 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1057
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1058 HAVCQNLNTEEEVENV-ADIGFDCSMC 1083
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 307 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 366
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 367 PVMKSVPTNGWKCKNCR 383
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 288 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 337
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 338 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 363
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 364 CLQPV----MKSVPTNGWKC 379
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1046 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1105
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1106 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1147
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1148 HAVCQNLNTEEEVENV-ADIGFDCSMC 1173
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 398 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 457
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 458 PVMKSVPTNGWKCKNCR 474
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 379 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 428
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 429 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 454
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 455 CLQPV----MKSVPTNGWKC 470
>gi|255565329|ref|XP_002523656.1| chromodomain helicase DNA binding protein, putative [Ricinus
communis]
gi|223537108|gb|EEF38742.1| chromodomain helicase DNA binding protein, putative [Ricinus
communis]
Length = 2257
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 2/64 (3%)
Query: 491 DIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP 548
++ L+ +++ G C +C G NLL C+ CP +H CLDP L IP W+CP
Sbjct: 58 ELGSDLSSSKKKGNDGYYYECVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCP 117
Query: 549 NCRQ 552
C Q
Sbjct: 118 KCYQ 121
>gi|115444767|ref|NP_001046163.1| Os02g0192400 [Oryza sativa Japonica Group]
gi|46389826|dbj|BAD15389.1| PHD finger-like protein [Oryza sativa Japonica Group]
gi|50726413|dbj|BAD34024.1| PHD finger-like protein [Oryza sativa Japonica Group]
gi|113535694|dbj|BAF08077.1| Os02g0192400 [Oryza sativa Japonica Group]
Length = 929
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPES 543
LHDI A D +C VCG D +++LLC+ C +H CL+P L IPE
Sbjct: 41 LHDILT--AANSLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPPLARIPEG 98
Query: 544 GWRCPNCRQGHSSS 557
W CP+C G + +
Sbjct: 99 NWYCPSCMLGQTKA 112
>gi|281344197|gb|EFB19781.1| hypothetical protein PANDA_012011 [Ailuropoda melanoleuca]
Length = 487
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W+CP C+Q
Sbjct: 308 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWQCPKCQQ 354
>gi|356564737|ref|XP_003550605.1| PREDICTED: uncharacterized protein LOC100794210 [Glycine max]
Length = 1608
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGH-SSSMSRSV 562
G+ D C +CG LL C+GCP +H+ C+ + IPE W CP C+ +++R
Sbjct: 381 GNGDECRLCGMDGTLLCCDGCPAVYHSRCIGVMKMHIPEGAWYCPECKIDMIGPTIARGT 440
Query: 563 DLKGGLEAPGAEVGGCVI 580
LKG E G ++ G V
Sbjct: 441 SLKGA-EVFGKDLYGQVF 457
>gi|71028128|ref|XP_763707.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350661|gb|EAN31424.1| hypothetical protein TP04_0072 [Theileria parva]
Length = 964
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 29/48 (60%), Gaps = 5/48 (10%)
Query: 508 DDMCHVCGDGENL---LLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
DD C +CG+ +N LLC+ C FH CL+P L IPES W C +C
Sbjct: 484 DDGCQICGNDDNWNQQLLCDNCDKGFHTYCLNPPLTRIPESNWYCQHC 531
>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
Length = 2185
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 171
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 172 HAVCQNLNTEEEVENV-ADIGFDCSMC 197
>gi|432114496|gb|ELK36344.1| Histone-lysine N-methyltransferase MLL2 [Myotis davidii]
Length = 3462
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 75/187 (40%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1416 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1475
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1476 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1517
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1518 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVIKPAAPVAPPELVPMKVKEPEP 1569
Query: 680 GIL-FDG 685
F+G
Sbjct: 1570 QYFRFEG 1576
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 261 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 307
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 308 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 336
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 337 P----PMEELPAHSWKCKACRVCRSCGAGSAELNPN 368
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 277 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 336
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 337 PPMEELPAHSWKCKACR 353
>gi|358336343|dbj|GAA54879.1| tyrosine-protein kinase BAZ1B, partial [Clonorchis sinensis]
Length = 1921
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 5/49 (10%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
D C +C D +NLLLC+GC AFH CL P L +P W CP+CR
Sbjct: 1423 DARCRICRHKSDDDNLLLCDGCNRAFHLYCLRPPLRRVPAGDWYCPSCR 1471
>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
abelii]
Length = 5559
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 74/187 (39%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1420 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1479
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1480 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1521
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V I L + K E
Sbjct: 1522 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPVAPIAPPELVPMKVKEPEP 1573
Query: 680 GIL-FDG 685
F+G
Sbjct: 1574 QYFRFEG 1580
Score = 45.8 bits (107), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 49/137 (35%), Gaps = 53/137 (38%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC 642
++E+P W C
Sbjct: 305 P----PMEELPAHSWKC 317
Score = 39.7 bits (91), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|224071533|ref|XP_002303505.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222840937|gb|EEE78484.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 2327
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 491 DIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP 548
+++ L+ +++ G C +C G NLL C+ CP +H CLDP L IP W+CP
Sbjct: 55 ELSSDLSTSKKKGNDGYYYECVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCP 114
Query: 549 NC 550
C
Sbjct: 115 KC 116
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 91/226 (40%), Gaps = 32/226 (14%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 922 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 981
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C + E I CR D S F
Sbjct: 982 CEWQNNYTQCAPCASLSSCPVCYRNYREEDLILQCRQCDSSMKVF--------------L 1027
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
+G +R+ G K +D + +L V R ++ P L IN+ +
Sbjct: 1028 LIGIVRRKGQDPPKTYTQDGVCLTESGMTQLQSLTVTVPRRKRSKPKLKLKIINQNSV-- 1085
Query: 680 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLS----SATAIFRECFD 721
+L T D+Q + + + + +E L+ S ++ RE D
Sbjct: 1086 AVL--QTPPDIQSEHSRDGEMDDSREGELMDCDGKSESSPEREAVD 1129
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 265 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 324
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 325 PVMKSVPTNGWKCKNCR 341
>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
abelii]
Length = 5293
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 74/187 (39%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1154 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1213
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1214 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1255
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V I L + K E
Sbjct: 1256 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPVAPIAPPELVPMKVKEPEP 1307
Query: 680 GIL-FDG 685
F+G
Sbjct: 1308 QYFRFEG 1314
Score = 45.8 bits (107), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 49/137 (35%), Gaps = 53/137 (38%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC 642
++E+P W C
Sbjct: 305 P----PMEELPAHSWKC 317
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|296086550|emb|CBI32139.3| unnamed protein product [Vitis vinifera]
Length = 1789
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 7/62 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSV 562
+C VCG D +N+LLC+ C +H CL+P L IPE W CP+C Q S SRS
Sbjct: 939 LCKVCGVDKDDDNVLLCDACDSEYHTYCLNPPLARIPEGNWYCPSCVAAQRLSQGTSRSA 998
Query: 563 DL 564
++
Sbjct: 999 EV 1000
>gi|224125208|ref|XP_002329920.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222871157|gb|EEF08288.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 2332
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
C +C G NLL C+ CP +H CLDP L IP W+CP C Q
Sbjct: 65 CVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCPKCSQ 108
>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Megachile rotundata]
Length = 1967
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 8/55 (14%)
Query: 504 TGGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
TG ++D C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 420 TGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 474
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 51/136 (37%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------ITG 421
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + C IC+ D ++ CD C +H CL
Sbjct: 422 AAEDDDEHMEFCRICK----------------------DGGELLCCDSCTSAYHTHCLNP 459
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 460 ----PLSEIPDGDWKC 471
>gi|296192024|ref|XP_002743885.1| PREDICTED: PHD finger protein 21B [Callithrix jacchus]
Length = 489
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 310 DEHCAACKRGANLQPCGACPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 356
>gi|359473443|ref|XP_002264500.2| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Vitis
vinifera]
Length = 2060
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 7/62 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSV 562
+C VCG D +N+LLC+ C +H CL+P L IPE W CP+C Q S SRS
Sbjct: 1042 LCKVCGVDKDDDNVLLCDACDSEYHTYCLNPPLARIPEGNWYCPSCVAAQRLSQGTSRSA 1101
Query: 563 DL 564
++
Sbjct: 1102 EV 1103
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1007 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1066
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1067 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1108
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1109 HAVCQNLNTEEEVENV-ADIGFDCSMC 1134
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 391 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 416
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 417 CLQPV----MKSVPTNGWKC 432
>gi|350645335|emb|CCD59958.1| zinc finger protein, putative [Schistosoma mansoni]
Length = 1690
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +C D +NLLLC+GC LAFH CL P L +P W CP C
Sbjct: 1250 DARCRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCPTC 1297
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 508 DDMCHVCGDGE-NLLLCNGCPLAFHAACLDPLL--IPES-GWRCPNCRQGHSSS 557
D C VC D +++LC+ CP FH C DP L IP GW+C CR S
Sbjct: 1391 DTSCLVCSDSTGDIVLCSNCPNIFHLDCHDPPLHHIPRGYGWQCSICRSNKKRS 1444
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 960 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1019
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1020 CEWQNNYTQCAPCASLSSCPVC---------YRNYREDDL---------ILQCRQCDRWM 1061
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1062 HAVCQNLNTEEEVENV-ADIGFDCSMC 1087
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 316 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 375
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 376 PVMKSVPTNGWKCKNCR 392
>gi|195048447|ref|XP_001992528.1| GH24800 [Drosophila grimshawi]
gi|193893369|gb|EDV92235.1| GH24800 [Drosophila grimshawi]
Length = 887
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 9/73 (12%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR---- 551
+R G + D+C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPTGQWLCHSCRMAKV 104
Query: 552 -QGHSSSMSRSVD 563
Q S+S S SV+
Sbjct: 105 AQPTSNSKSNSVE 117
>gi|256070387|ref|XP_002571524.1| zinc finger protein [Schistosoma mansoni]
Length = 1690
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +C D +NLLLC+GC LAFH CL P L +P W CP C
Sbjct: 1250 DARCRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCPTC 1297
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 508 DDMCHVCGDGE-NLLLCNGCPLAFHAACLDPLL--IPES-GWRCPNCRQGHSSS 557
D C VC D +++LC+ CP FH C DP L IP GW+C CR S
Sbjct: 1391 DTSCLVCSDSTGDIVLCSNCPNIFHLDCHDPPLHHIPRGYGWQCSICRSNKKRS 1444
>gi|222622353|gb|EEE56485.1| hypothetical protein OsJ_05715 [Oryza sativa Japonica Group]
Length = 1949
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPES 543
LHDI A D +C VCG D +++LLC+ C +H CL+P L IPE
Sbjct: 1061 LHDILT--AANSLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPPLARIPEG 1118
Query: 544 GWRCPNCRQGHSSS 557
W CP+C G + +
Sbjct: 1119 NWYCPSCMLGQTKA 1132
>gi|307180140|gb|EFN68184.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Camponotus floridanus]
Length = 1960
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 428 EFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 471
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 50/136 (36%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W C +C +G + G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCAHC-EGEG--------IAG 418
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + C +C+ D ++ CD C +H CL
Sbjct: 419 AAEDDDEHMEFCRVCK----------------------DGGELLCCDSCTSAYHTHCLNP 456
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 457 ----PLSEIPDGDWKC 468
>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
gorilla gorilla]
Length = 5284
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1154 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1213
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1214 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1255
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1256 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1282
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 49/137 (35%), Gaps = 53/137 (38%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC 642
++E+P W C
Sbjct: 305 P----PMEELPAHSWKC 317
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|118091868|ref|XP_426440.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Gallus gallus]
Length = 1571
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD E+++LC+GC +H C+ P L IPE W CP CR
Sbjct: 1163 CKVCRKKGDAESMVLCDGCDRGYHTYCIRPKLKIIPEGDWFCPECR 1208
>gi|115472215|ref|NP_001059706.1| Os07g0497100 [Oryza sativa Japonica Group]
gi|113611242|dbj|BAF21620.1| Os07g0497100, partial [Oryza sativa Japonica Group]
Length = 306
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 490 HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRC 547
HD IS + + R G C C G NLL C+ CP +H CL+P L P W+C
Sbjct: 58 HDTNISPSTKKIRGHDGYFYECVECDLGGNLLCCDSCPRTYHLECLNPPLKRAPPGNWQC 117
Query: 548 PNCR 551
P CR
Sbjct: 118 PRCR 121
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 38 EEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 82
>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
boliviensis boliviensis]
Length = 5498
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1447 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1506
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1507 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1548
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1549 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1575
Score = 45.8 bits (107), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|395538828|ref|XP_003771376.1| PREDICTED: lysine-specific demethylase 5A [Sarcophilus harrisii]
Length = 1479
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A S+ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 63 AFSMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 122
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 123 DWRCPKC 129
>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
porcellus]
Length = 5577
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1429 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1488
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1489 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1530
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1531 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1557
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 49/137 (35%), Gaps = 53/137 (38%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE N+ C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCNMFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC 642
++E+P W C
Sbjct: 305 P----PMEELPAHSWKC 317
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|338721435|ref|XP_001488154.3| PREDICTED: PHD finger protein 21B [Equus caballus]
Length = 494
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 305 DEHCAACKRGANLQACGTCPGAYHLSCLDPPLRTAPKGVWVCPKCQQ 351
>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Callithrix jacchus]
Length = 5289
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1414 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1473
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1474 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1515
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1516 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1542
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 55/156 (35%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + SN
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNSN 336
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|341890749|gb|EGT46684.1| hypothetical protein CAEBREN_05565 [Caenorhabditis brenneri]
Length = 1728
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C +C D ++ LLC CP A+HA CL+P L +P+ W CP C
Sbjct: 309 CQICKDNDHTLLCATCPNAYHAYCLNPPLDEMPDDDWFCPRC 350
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 54/134 (40%), Gaps = 35/134 (26%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C C L++C+ C +H C+DP P+ W C +C +K G
Sbjct: 247 CAECDQAGELVMCSTCERMYHCVCIDPNSDEPPKGVWSCVHC-------------MKNG- 292
Query: 569 EAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 628
PG + VI R ++ + ++C+ +D + + C C +H CL
Sbjct: 293 --PGFPIDPNVIVR----KHTNCQICKDNDHT---------LLCATCPNAYHAYCLNP-- 335
Query: 629 LCDLKEIPKDKWFC 642
L E+P D WFC
Sbjct: 336 --PLDEMPDDDWFC 347
>gi|255577029|ref|XP_002529399.1| hypothetical protein RCOM_0623590 [Ricinus communis]
gi|223531147|gb|EEF32995.1| hypothetical protein RCOM_0623590 [Ricinus communis]
Length = 275
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 10/126 (7%)
Query: 687 MNDVQWQMLKKAQCFEEKEK-------SLLSSATAIFRECFDPIIAECGR-DLIPVMVYG 738
+N++ W +LK + K S LS A + ECF P+ + D + +++
Sbjct: 27 LNNLTWTLLKSNHSSDHKPDASDIENYSKLSIALHVMHECFQPVEEPRTKGDFLKDVIFR 86
Query: 739 R--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 796
+ ++ F G Y+V+L ++ +R++G +VAE+PLV T +Y+ G L +
Sbjct: 87 KRSELNRLNFRGFYTVLLQKDDEFITVATVRVYGEKVAEIPLVGTRVQYRRLGMCGILMN 146
Query: 797 CIERLL 802
+E+ L
Sbjct: 147 VLEKNL 152
>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Pan paniscus]
Length = 5373
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1261 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1320
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1321 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1362
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1363 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1389
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
sapiens]
Length = 4539
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 431 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 490
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 491 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 532
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 533 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 559
>gi|194687197|ref|XP_001787707.1| PREDICTED: PHD finger protein 21B-like, partial [Bos taurus]
Length = 211
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 32 DELCAACKRGTNLQPCGTCPRAYHLSCLDPPLKTAPKGVWVCPKCQQ 78
>gi|297808265|ref|XP_002872016.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317853|gb|EFH48275.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1562
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCR-QGHS 555
G G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +
Sbjct: 401 GDSSDLDGNSDECRLCGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWYCPECTIKKMG 460
Query: 556 SSMSRSVDLKGGL 568
+++ L+G +
Sbjct: 461 PTVAHKTSLRGAV 473
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1157 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1216
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1217 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1258
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1259 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1285
Score = 46.2 bits (108), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
Length = 5538
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1428 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1487
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1488 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1529
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1530 HAGCESLFTEDDVEQA-ADEGFDCISCQ 1556
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|402226259|gb|EJU06319.1| hypothetical protein DACRYDRAFT_60996 [Dacryopinax sp. DJM-731 SS1]
Length = 1730
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 39/83 (46%), Gaps = 17/83 (20%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDM----------CHVC---GDGENLLLCNGCPLAFHA 532
GMT D+ A QR S D+ C +C D N+LLC+GC AFH
Sbjct: 411 GMT--DLMPPFAQSQREERAQSPDIRLEPLQPGDACEICRIDNDNTNMLLCDGCDAAFHM 468
Query: 533 ACLDPLL--IPESGWRCPNCRQG 553
CLDP L IP+S W C C G
Sbjct: 469 YCLDPPLTYIPKSQWFCHACLFG 491
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 955 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1014
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1015 CEWQNNYTQCAPCASLSTCPVC---------YRHYREEDL---------ILQCRQCDRWM 1056
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1057 HAICQNLNTEEEVENV-ADIGFDCSMC 1082
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 308 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 367
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 368 PVMKSVPTNGWKCKNCR 384
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 289 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 338
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 339 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 364
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 365 CLQPV----MKSVPTNGWKC 380
>gi|224051421|ref|XP_002200551.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Taeniopygia guttata]
Length = 1559
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD E+++LC+GC +H C+ P L IPE W CP CR
Sbjct: 1156 CKVCRKKGDAESMVLCDGCDRGYHTYCIRPKLKAIPEGDWFCPECR 1201
>gi|158293040|ref|XP_314337.4| AGAP004854-PA [Anopheles gambiae str. PEST]
gi|157016915|gb|EAA09709.5| AGAP004854-PA [Anopheles gambiae str. PEST]
Length = 1494
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 9/56 (16%)
Query: 504 TGGSDDM----CHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
T G D M CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 346 THGYDPMAKYICHMCNRGDVEESMLLCDGCDASYHTFCLMPPLQDIPKGDWRCPKC 401
>gi|410898760|ref|XP_003962865.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Takifugu rubripes]
Length = 1329
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
C +C GD +N+LLC+GC H CL P L +P+ W CP+CR SS
Sbjct: 1051 CRICRRKGDADNMLLCDGCDRGHHTHCLRPRLKAVPQGDWFCPDCRPKQRSS 1102
>gi|326427650|gb|EGD73220.1| hypothetical protein PTSG_04935 [Salpingoeca sp. ATCC 50818]
Length = 2055
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 47/101 (46%), Gaps = 14/101 (13%)
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGS------DDMCHVC 514
+P+ A A A R R T+ G I+L+ G R G+ DDMC C
Sbjct: 276 NPASVSAGASRRASRAISRSASTAPGSGAE---IALSHGGHRPPLGTEDHSLGDDMCEEC 332
Query: 515 GDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
G E LLLC+ C +H CL+P L +P WRCP+C
Sbjct: 333 KHGDYAELLLLCDKCDKGYHTFCLNPPLSKVPSGDWRCPSC 373
>gi|308802818|ref|XP_003078722.1| PHD finger protein BR140/LIN-49 (ISS) [Ostreococcus tauri]
gi|116057175|emb|CAL51602.1| PHD finger protein BR140/LIN-49 (ISS) [Ostreococcus tauri]
Length = 1206
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 46/91 (50%), Gaps = 9/91 (9%)
Query: 509 DMCHVCGDGEN--------LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSR 560
D+CHVCG+GE+ ++ C+GC + H +C + +PE W C C++G ++
Sbjct: 267 DVCHVCGEGESDYWHDDDEIVFCDGCDVQVHMSCYGVMALPEGQWFCRGCKEGVTNGPLV 326
Query: 561 SVDLKGGLEAPGAEVGGCVICRLSPSENFDI 591
S G+ A GG ++ R+ P D+
Sbjct: 327 SCGTPRGICAMCPHPGGALV-RVEPPSKRDV 356
>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
gorilla gorilla]
Length = 5550
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1420 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1479
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1480 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1521
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1522 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1548
Score = 45.8 bits (107), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 49/137 (35%), Gaps = 53/137 (38%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC 642
++E+P W C
Sbjct: 305 P----PMEELPAHSWKC 317
Score = 39.7 bits (91), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 427 EFCRVCKDGGELLCCDSCTSAYHTHCLNPPLTEIPDGDWKCPRC 470
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 50/140 (35%), Gaps = 40/140 (28%)
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSV 562
G D C VC G ++LC+ CP A+H CL+P L W CP+C
Sbjct: 366 GDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCEN---------- 415
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
G LE + C +C+ D ++ CD C +H
Sbjct: 416 --DGALEDDDEHMEFCRVCK----------------------DGGELLCCDSCTSAYHTH 451
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL L EIP W C
Sbjct: 452 CLNP----PLTEIPDGDWKC 467
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 35 EEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Nomascus leucogenys]
Length = 5407
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1441 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1500
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1501 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1542
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1543 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1569
Score = 46.2 bits (108), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 49/135 (36%), Gaps = 50/135 (37%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD +LL C C L +H CL+ + P SGW+CP C+
Sbjct: 223 LCDKPGDLIDLLFCTSCGLHYHGTCLEITVSPLKRSGWQCPECK---------------- 266
Query: 568 LEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 627
C CR P E D ++ CD C+K +H CL+
Sbjct: 267 ---------VCQTCR-QPGE------------------DTMMLVCDACDKGYHTFCLKPA 298
Query: 628 GLCDLKEIPKDKWFC 642
C +P D W C
Sbjct: 299 IEC----LPTDSWKC 309
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 72/171 (42%), Gaps = 41/171 (23%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWR---CPNCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+ C +C Q + +
Sbjct: 859 VCEVCGKATDPSRLLLCDDCDISYHTYCLDPPLHTVPKGGWKCRWCVSCMQCGAVTPGFR 918
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C L + +I C CE+
Sbjct: 919 SEWQNNYTHCAPCASLVSCPVCHLK------------------YLEGDLLIQCRHCERWL 960
Query: 620 HVGCLRKNGLCDLKEIPK--DKWFCCDDCNRIHAALQDFVSNRAQTIPASS 668
H C L +E+ + D+ F C C Q F+ + +IPAS+
Sbjct: 961 HAVC---ENLFTEEEVEQAADEGFDCSSC-------QPFIP-KPISIPAST 1000
>gi|428169481|gb|EKX38414.1| hypothetical protein GUITHDRAFT_144200 [Guillardia theta CCMP2712]
Length = 994
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 4/47 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACL---DPLLIPESGWRCPNCRQ 552
D C CGD +LL C+GCP A+HAAC+ DP +P+ W C C++
Sbjct: 435 DYCDDCGDAGDLLCCDGCPAAYHAACIGVADPSKLPDP-WYCNACKR 480
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 64/147 (43%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CL P L +P+ GW+C CR ++S
Sbjct: 974 VCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1033
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ +D ++ C QC++
Sbjct: 1034 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EDDLILQCRQCDRWM 1075
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1076 HAVCQNLNTEEEVENV-ADIGFDCSMC 1101
>gi|357474041|ref|XP_003607305.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
gi|355508360|gb|AES89502.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
Length = 1573
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 43/82 (52%), Gaps = 8/82 (9%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACL----DPLLIPESGWRCPNCRQG-HSSS 557
TT + D C +C NL+ C+GCP AFH+ C+ D L PE W CP C G H +S
Sbjct: 791 TTDLNSDECCLCKMDGNLICCDGCPAAFHSRCVGIASDSL--PEGDWYCPECAIGTHRAS 848
Query: 558 MSRSVDLKGGLEAPGAEVGGCV 579
M L+G + G + GC+
Sbjct: 849 MKSRRSLRGA-DLLGMDPHGCL 869
>gi|224092566|ref|XP_002309665.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222855641|gb|EEE93188.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 1340
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAAC-LDPLLIP-ESGWRCPNC 550
+D C CG+ ENLL C C A+H+ C L PL P S WRCP C
Sbjct: 48 AKEDSCQACGESENLLNCETCTYAYHSKCLLPPLKAPFPSNWRCPEC 94
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1429 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1488
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1489 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1530
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1531 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1557
Score = 46.2 bits (108), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|326920735|ref|XP_003206624.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Meleagris gallopavo]
Length = 1567
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD E+++LC+GC +H C+ P L IPE W CP CR
Sbjct: 1159 CKVCRKKGDAESMVLCDGCDRGYHTYCIRPKLKIIPEGDWFCPECR 1204
>gi|328713195|ref|XP_001950157.2| PREDICTED: hypothetical protein LOC100158992 [Acyrthosiphon pisum]
Length = 1118
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 5/49 (10%)
Query: 508 DDMCHVCGDGEN---LLLCNGCPLAFHAACLDP--LLIPESGWRCPNCR 551
DDMC C + +LLC+ C +HA+CL P ++IP+ W CP CR
Sbjct: 890 DDMCQKCNKSDQPEWILLCDTCNQGWHASCLRPPLMVIPDGDWYCPPCR 938
>gi|297262270|ref|XP_001099471.2| PREDICTED: histone-lysine N-methyltransferase MLL2 [Macaca mulatta]
Length = 5505
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1376 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1435
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1436 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1477
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1478 HAGCESLFTEDDVEQA-ADEGFDCISCQ 1504
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|225431531|ref|XP_002281774.1| PREDICTED: histone-lysine N-methyltransferase ATXR5 [Vitis
vinifera]
gi|296088576|emb|CBI37567.3| unnamed protein product [Vitis vinifera]
Length = 374
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 8/68 (11%)
Query: 507 SDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---RQGHSSSM 558
SD C CG G++ LLLC+ C FH +CL P++ IP W CP+C R+G S S
Sbjct: 58 SDVRCVQCGSGDHDDELLLCDKCDRGFHMSCLRPIVVRIPIGTWLCPSCSGQRRGRSLSQ 117
Query: 559 SRSVDLKG 566
++ +D G
Sbjct: 118 TKIIDFFG 125
>gi|170046220|ref|XP_001850672.1| jumonji/ARID domain-containing protein 1A [Culex quinquefasciatus]
gi|167869058|gb|EDS32441.1| jumonji/ARID domain-containing protein 1A [Culex quinquefasciatus]
Length = 1443
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 232 ICHMCNRGDVEESMLLCDGCDASYHTFCLLPPLQEIPKGDWRCPKC 277
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 29/147 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1028 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGPR 1087
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1088 CEWQNNYTQCAPCASLSICPVC------------CRNYR------EEDLILQCRQCDRWM 1129
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDC 646
H C N +++ + D F C C
Sbjct: 1130 HAVCQNLNTEDEVENV-ADIGFDCTMC 1155
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/173 (21%), Positives = 70/173 (40%), Gaps = 30/173 (17%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 369 TTCGQHYHGMCLDIAVTPLKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHTFCLQ 428
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICR--LSPSENFDIR 592
P++ +P +GW+C NCR + + G C++C P +N
Sbjct: 429 PIMDSVPTNGWKCKNCR----------ICAECGTRTSSQWHHNCLVCDSCYQPQDNLSCP 478
Query: 593 L---CRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
C D + +++C C++ H+ C K +L K+++ C
Sbjct: 479 FCGKCYQPDLQ------KDMLHCHMCKRWIHIEC-DKPADTELDSQLKEEYVC 524
>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2-like, partial [Taeniopygia guttata]
Length = 4299
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 79/190 (41%), Gaps = 44/190 (23%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q ++S
Sbjct: 791 VCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCVQCGAASPGFH 850
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C CR E + +D +I C C++
Sbjct: 851 CEWQNNYTHCAPCASLVVCPFCR----EKY--------------VEDDLLIQCRHCDRWL 892
Query: 620 HVGCLRKNGLCDLKEIPK--DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
H C + L +E+ + D+ F C C Q +V + +PA S I K
Sbjct: 893 HAAC---DSLFTEEEVEQAADEGFDCSAC-------QPYV---VKPVPAPSAEMIKAKDP 939
Query: 678 EKGIL-FDGT 686
E F+G
Sbjct: 940 EPQYFRFEGV 949
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 54/156 (34%), Gaps = 61/156 (39%)
Query: 508 DDMCHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC G GE +L C C FH ACLD L P SGW+CP C+
Sbjct: 123 DARCSVCDGPGELRDLAFCTSCGQHFHGACLDISLTPRKRSGWQCPQCK----------- 171
Query: 563 DLKGGLEAPGAEVGGCVICR-LSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHV 621
+C+ L P + D ++ C+ C+K +H
Sbjct: 172 -----------------VCQNLQPGQ------------------DSAMLVCETCDKGYHT 196
Query: 622 GCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAA 652
C + +P W C C DC + A
Sbjct: 197 SCTEPAA----QGLPTTSWKCKNCWVCSDCGQRPAG 228
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 35/75 (46%), Gaps = 7/75 (9%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC-----GDGENLLLCNGCPLAFHAACLDP 537
TS G H + +++ R+ +G C VC G +L+C C +H +C +P
Sbjct: 142 TSCGQHFHGACLDISLTPRKRSGWQCPQCKVCQNLQPGQDSAMLVCETCDKGYHTSCTEP 201
Query: 538 LL--IPESGWRCPNC 550
+P + W+C NC
Sbjct: 202 AAQGLPTTSWKCKNC 216
>gi|431899989|gb|ELK07924.1| PHD finger protein 21B [Pteropus alecto]
Length = 454
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 234 DEHCAACKRGANLQPCGACPGAYHLSCLDPPLRTAPKGVWVCPRCQQ 280
>gi|312381860|gb|EFR27503.1| hypothetical protein AND_05764 [Anopheles darlingi]
Length = 1549
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 413 ICHMCNRGDVEESMLLCDGCDASYHTFCLMPPLHDIPKGDWRCPKC 458
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 60/159 (37%), Gaps = 58/159 (36%)
Query: 498 MGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C C GD NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 367 LGQVPLLLNGDVTCMQCCGMGDVSNLVMCSICGQHYHGSCVGLALLPGVRAGWQCASCR- 425
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYC 612
C +CR + D S V+ C
Sbjct: 426 ------------------------VCQVCR------------QPEDVSK-------VMLC 442
Query: 613 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDC 646
++CEK +H CLR + IPK W C C DC
Sbjct: 443 ERCEKAYHPSCLRP----IVTSIPKYGWKCKCCRVCTDC 477
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 50/128 (39%), Gaps = 36/128 (28%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C G +
Sbjct: 766 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDPGFN 825
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
SS ++ G P A C+ C+ A + +I C QC
Sbjct: 826 SSWQKNYTQCG----PCASHTACISCQ------------------EAYNEGDLIIQCIQC 863
Query: 616 EKEFHVGC 623
E+ H C
Sbjct: 864 ERWLHCAC 871
>gi|157128338|ref|XP_001661408.1| hypothetical protein AaeL_AAEL011092 [Aedes aegypti]
gi|108872617|gb|EAT36842.1| AAEL011092-PA [Aedes aegypti]
Length = 1354
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 335 ICHMCNRGDVEESMLLCDGCDASYHTFCLLPPLQEIPKGDWRCPKC 380
>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Bombus impatiens]
Length = 1965
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 430 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 473
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 51/136 (37%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 420
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + C IC+ D ++ CD C +H CL
Sbjct: 421 AAEDDDEHMEFCRICK----------------------DGGELLCCDSCTSAYHTHCLNP 458
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 459 ----PLSEIPDGDWKC 470
>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Papio anubis]
Length = 5547
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1402 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1461
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1462 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1503
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1504 HAGCESLFTEDDVEQX-ADEGFDCISCQ 1530
Score = 45.8 bits (107), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
familiaris]
Length = 5552
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 74/187 (39%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1411 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1470
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC ++ +I C CE+
Sbjct: 1471 CEWQNSYTHCGPCASLVTCPICH------------------TPYVEEDLLIQCRHCERWM 1512
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H GC D+++ D+ F C C Q +V A + L + K E
Sbjct: 1513 HAGCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEP 1564
Query: 680 GIL-FDG 685
F+G
Sbjct: 1565 QYFRFEG 1571
Score = 45.8 bits (107), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 305 P----PMEELPAHSWKCKACRVCRACGAGSAELNPN 336
Score = 39.7 bits (91), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
norvegicus]
Length = 5543
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 54/124 (43%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1486
Query: 620 HVGC 623
H GC
Sbjct: 1487 HAGC 1490
Score = 46.6 bits (109), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++++P W C C C A D N
Sbjct: 305 P----PIEDLPAHSWKCKTCRICRACGAGSADLNPN 336
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPIEDLPAHSWKCKTCR 321
>gi|395504252|ref|XP_003756470.1| PREDICTED: bromodomain adjacent to zinc finger domain protein
1A-like, partial [Sarcophilus harrisii]
Length = 1378
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
MC GD E+++LC+GC H C+ P L IPE W CP CR
Sbjct: 974 MCRKKGDAESMVLCDGCDRGHHTYCVRPKLKTIPEGDWFCPECR 1017
>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Rattus norvegicus]
Length = 5543
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 54/124 (43%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1486
Query: 620 HVGC 623
H GC
Sbjct: 1487 HAGC 1490
Score = 46.6 bits (109), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 276 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 304
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++++P W C C C A D N
Sbjct: 305 P----PIEDLPAHSWKCKTCRICRACGAGSADLNPN 336
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPIEDLPAHSWKCKTCR 321
>gi|242013407|ref|XP_002427399.1| FYVE-containing protein, putative [Pediculus humanus corporis]
gi|212511773|gb|EEB14661.1| FYVE-containing protein, putative [Pediculus humanus corporis]
Length = 1102
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 31/56 (55%), Gaps = 4/56 (7%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR 551
+R G + D C CG+G NLL C+ CP +FH C DP L IP W C +C+
Sbjct: 46 KRPGKGHNHDSCDSCGEGGNLLCCDKCPASFHLQCHDPPLEEDDIPIGQWLCHSCK 101
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 43/111 (38%), Gaps = 27/111 (24%)
Query: 446 KQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG 505
K GNG+ P + + +A+++PY NGM + +
Sbjct: 260 KAGNGV---------GPGKITSRRNSSAKKKPYE---LDNGMVPLPVKVCYE-------- 299
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
CH L+ C+ C L FH CLDP L +P W CPN Q +
Sbjct: 300 -----CHRSCRVAPLVSCDYCSLLFHLDCLDPPLASMPTGKWMCPNHPQHY 345
>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Apis mellifera]
Length = 1966
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 430 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 473
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 51/136 (37%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 420
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + C IC+ D ++ CD C +H CL
Sbjct: 421 AAEDDDEHMEFCRICK----------------------DGGELLCCDSCTSAYHTHCLNP 458
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 459 ----PLSEIPDGDWKC 470
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 429 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 472
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 51/136 (37%), Gaps = 37/136 (27%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 419
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + C IC+ D ++ CD C +H CL
Sbjct: 420 AAEDDDEHMEFCRICK----------------------DGGELLCCDSCTSAYHTHCLNP 457
Query: 627 NGLCDLKEIPKDKWFC 642
L EIP W C
Sbjct: 458 ----PLSEIPDGDWKC 469
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 35 EEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|224143281|ref|XP_002324903.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222866337|gb|EEF03468.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 1334
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAAC-LDPLLIP-ESGWRCPNC 550
+D C CG+ ENLL C C A+H C L PL P S WRCP C
Sbjct: 49 AKEDSCQACGESENLLNCETCTYAYHPKCLLPPLKAPFPSNWRCPEC 95
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 59/147 (40%), Gaps = 36/147 (24%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C Q G +
Sbjct: 753 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAQCQTCGSNDPGFN 812
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
SS +S G P A CV+C+ A + +I C QC
Sbjct: 813 SSWQKSYTQCG----PCASHTACVVCQ------------------EAYQEGDLIIQCVQC 850
Query: 616 EKEFHVGCLRKNGLCDLKEIPKDKWFC 642
E+ H GC D + ++ + C
Sbjct: 851 ERWLHCGCDSIKSESDAERCAEEGYVC 877
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 57/143 (39%), Gaps = 55/143 (38%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C+ GD NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 370 CYGMGDVSNLVMCSVCGQHYHGSCVGLALLPGVRAGWQCVSCRV---------------- 413
Query: 569 EAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 628
C +CR + D S V+ C++C+K +H GCLR
Sbjct: 414 ---------CQVCR------------QPEDVSK-------VMLCERCDKAYHPGCLRP-- 443
Query: 629 LCDLKEIPKDKWFC-----CDDC 646
+ IPK W C C DC
Sbjct: 444 --IVTSIPKYGWKCKCCRVCTDC 464
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 71/184 (38%), Gaps = 48/184 (26%)
Query: 510 MCHVCGDGENL---LLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C VC E++ +LC C A+H CL P++ IP+ GW+C CR V
Sbjct: 413 VCQVCRQPEDVSKVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCR----------VCT 462
Query: 565 KGGLEAPGAEVGG--------CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCE 616
G PGA + C C ++ F LCR + AA + R ++ C C+
Sbjct: 463 DCGSRTPGAGLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKA-YRAAAY--REMVQCHGCK 519
Query: 617 KEFHVGCLRKNGLCD----------LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 666
K H G CD KE D + C C I V+ R +I
Sbjct: 520 KFVH-------GTCDPEADPLTYQQRKEAKPDYEYVCLHCKSIA-----MVARRKDSIDE 567
Query: 667 SSLS 670
LS
Sbjct: 568 GDLS 571
>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Bombus terrestris]
Length = 1974
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 439 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 482
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 60/162 (37%), Gaps = 38/162 (23%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 429
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
E + C IC+ D ++ CD C +H CL
Sbjct: 430 AAEDDDEHMEFCRICK----------------------DGGELLCCDSCTSAYHTHCLNP 467
Query: 627 NGLCDLKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 667
L EIP W C C I + ++ R + P +
Sbjct: 468 ----PLSEIPDGDWKCPRCSCPPIRGKVAKILTWRWKECPET 505
>gi|384254264|gb|EIE27738.1| hypothetical protein COCSUDRAFT_39323 [Coccomyxa subellipsoidea
C-169]
Length = 1967
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 27/45 (60%), Gaps = 5/45 (11%)
Query: 511 CHVCGD---GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C CG GE +LLC+GC +H CLDP L IPE W CP+C
Sbjct: 796 CEECGKNDRGEEMLLCDGCDHGYHTDCLDPPLKEIPEGDWFCPSC 840
>gi|427797709|gb|JAA64306.1| Putative lysine-specific demethylase lid, partial [Rhipicephalus
pulchellus]
Length = 1499
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G E++LLC+GC ++H CL P L IP WRCP C
Sbjct: 33 VCHTCGRGDDEESMLLCDGCDDSYHTFCLLPPLPEIPRGDWRCPRC 78
>gi|189239068|ref|XP_971434.2| PREDICTED: similar to zinc finger protein [Tribolium castaneum]
Length = 1308
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/170 (23%), Positives = 64/170 (37%), Gaps = 21/170 (12%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C +C D EN+LLC+GC L H CL P L IP W C C Q + S K
Sbjct: 1004 CRICRRKNDSENMLLCDGCNLGVHLYCLKPKLKSIPPGDWFCDKCEQEKKPEVVESPPKK 1063
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRT-----------VIYCDQ 614
+ S N + + ++ T ++ CD+
Sbjct: 1064 RPIFRDEEIDEEESEIEQLESNNEEEDTEDGEEEEEQDENEETSLLTFFLFWGEMVQCDK 1123
Query: 615 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 664
C +H+ C+ L+ P+ WFC C + + +++ S+ + I
Sbjct: 1124 CTDSYHIECVEP----PLRRAPRGPWFCT-KCKKNNDRKRNYDSSEEEVI 1168
>gi|218199663|gb|EEC82090.1| hypothetical protein OsI_26092 [Oryza sativa Indica Group]
Length = 2275
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 490 HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRC 547
HD IS + + R G C C G NLL C+ CP +H CL+P L P W+C
Sbjct: 58 HDTNISPSTKKIRGHDGYFYECVECDLGGNLLCCDSCPRTYHLECLNPPLKRAPPGNWQC 117
Query: 548 PNCR 551
P CR
Sbjct: 118 PRCR 121
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 49/132 (37%), Gaps = 41/132 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEA 570
C VCG G LL C+GCP AFH C+ IP++ W C C
Sbjct: 1242 CSVCGLGGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNEC-------------------- 1281
Query: 571 PGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 630
+ C C+ + +C S D + C++ FH+ C +
Sbjct: 1282 ---NLQTCGACKKNKIRLDSHVICGSEDGTKG------------CDRVFHLKCAK----- 1321
Query: 631 DLKEIPKDKWFC 642
L +P D W+C
Sbjct: 1322 -LDAVPADDWYC 1332
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 66/180 (36%), Gaps = 57/180 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------------------- 550
C++C +G LL C+GCP FH +C+ IP C C
Sbjct: 1117 CYICTEGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHECDTTVKPVFPVNGAKKNGKAA 1176
Query: 551 ----RQGHSSSMSRSVDLKGGL----------EAPGAEVGGCVICRLS---------PSE 587
R+ +S + R + L E GAE V S P +
Sbjct: 1177 SKRPRRSNSPTSRRRPRKQAKLGKAKSDDSESEDSGAESDSAVSTTSSARPASRPKAPED 1236
Query: 588 NFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
+D+ D S ++ CD C + FHV C+ L EIP +WF C++CN
Sbjct: 1237 QWDV------DCSVCGLGGE-LLCCDGCPRAFHVNCI------GLAEIPDTEWF-CNECN 1282
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACL--DPLLIPESGWRCPNC 550
C++C DG LL C+ CP AFH +CL +IP+S W C C
Sbjct: 161 CNICKDGGQLLCCDRCPRAFHMSCLGMSVDMIPDSEWYCKMC 202
>gi|307208076|gb|EFN85607.1| PHD finger protein 12 [Harpegnathos saltator]
Length = 633
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC----LDPLLIPESGWRCPNCR 551
+RR G + D C C DG L+ C+ CP ++H C +DP+ IP W C CR
Sbjct: 48 KRRGRGHNHDFCDACKDGGELICCDRCPASYHLQCHYPAVDPMDIPNGEWLCYTCR 103
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 23/45 (51%), Gaps = 5/45 (11%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
+C CG L+ C+ CPL FH CLDP L P W CPN
Sbjct: 183 LCFECGRSCRKAPLIACDYCPLYFHQDCLDPPLTTFPIGRWMCPN 227
>gi|297817898|ref|XP_002876832.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322670|gb|EFH53091.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 386
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 766 LRIF-GREVAELPLVATCREYQGKGCFQALFSCIERL------LCSLNVENLVLPAAEKA 818
L++F G +AE+ + T Y+ +G + LFS +E + L +L VE LV+PA
Sbjct: 247 LQLFHGNRLAEMQFIGTRHVYRHQGMCRRLFSVVESMSFDVKTLQNLKVELLVIPATADL 306
Query: 819 ESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 858
+W KFGF K + L K R L F G +L+K++
Sbjct: 307 SHVWISKFGF-KYVEDSLKKELRSMNLLAFPGIDVLQKEL 345
>gi|9937325|gb|AAG02418.1|AF286030_1 regulator Ustilago maydis 1 protein [Ustilago maydis]
Length = 2289
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--I 540
G++ H A S Q + MC +C DG N+LLC+ C +H CL P L I
Sbjct: 517 GVSPHLEADSYLRAQAGNQAQEEQMCEICLRGEDGPNMLLCDECNRGYHMYCLQPALTSI 576
Query: 541 PESGWRCPNCRQG 553
P+S W CP C G
Sbjct: 577 PKSQWFCPPCLVG 589
>gi|302803789|ref|XP_002983647.1| hypothetical protein SELMODRAFT_155996 [Selaginella moellendorffii]
gi|300148484|gb|EFJ15143.1| hypothetical protein SELMODRAFT_155996 [Selaginella moellendorffii]
Length = 1296
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
+D C +C G +L C+ C +H CLDP + +P+ WRCP C +
Sbjct: 47 EDFCTICKSGGKVLCCDACTAVYHLQCLDPPMKSVPKGSWRCPKCEE 93
>gi|355785070|gb|EHH65921.1| hypothetical protein EGM_02787, partial [Macaca fascicularis]
Length = 493
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 314 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 360
>gi|302817780|ref|XP_002990565.1| hypothetical protein SELMODRAFT_185367 [Selaginella moellendorffii]
gi|300141733|gb|EFJ08442.1| hypothetical protein SELMODRAFT_185367 [Selaginella moellendorffii]
Length = 1296
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
+D C +C G +L C+ C +H CLDP + +P+ WRCP C +
Sbjct: 47 EDFCTICKSGGKVLCCDACTAVYHLQCLDPPMKSVPKGSWRCPKCEE 93
>gi|222637088|gb|EEE67220.1| hypothetical protein OsJ_24338 [Oryza sativa Japonica Group]
Length = 2258
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 490 HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRC 547
HD IS + + R G C C G NLL C+ CP +H CL+P L P W+C
Sbjct: 57 HDTNISPSTKKIRGHDGYFYECVECDLGGNLLCCDSCPRTYHLECLNPPLKRAPPGNWQC 116
Query: 548 PNCR 551
P CR
Sbjct: 117 PRCR 120
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 8 EEYCRVCRDGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRC 52
>gi|311254947|ref|XP_003126022.1| PREDICTED: PHD finger protein 21B [Sus scrofa]
Length = 495
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 316 DEHCATCKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPKCQQ 362
>gi|71014560|ref|XP_758729.1| hypothetical protein UM02582.1 [Ustilago maydis 521]
gi|46098519|gb|EAK83752.1| hypothetical protein UM02582.1 [Ustilago maydis 521]
Length = 2289
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--I 540
G++ H A S Q + MC +C DG N+LLC+ C +H CL P L I
Sbjct: 517 GVSPHLEADSYLRAQAGNQAQEEQMCEICLRGEDGPNMLLCDECNRGYHMYCLQPALTSI 576
Query: 541 PESGWRCPNCRQG 553
P+S W CP C G
Sbjct: 577 PKSQWFCPPCLVG 589
>gi|241841259|ref|XP_002415325.1| Jumonji/ARID domain-containing protein, putative [Ixodes
scapularis]
gi|215509537|gb|EEC18990.1| Jumonji/ARID domain-containing protein, putative [Ixodes
scapularis]
Length = 1356
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G E++LLC+GC ++H CL P L IP WRCP C
Sbjct: 57 VCHTCGRGDDEESMLLCDGCDDSYHTFCLLPPLQEIPRGDWRCPRC 102
>gi|195402015|ref|XP_002059606.1| GJ14859 [Drosophila virilis]
gi|194147313|gb|EDW63028.1| GJ14859 [Drosophila virilis]
Length = 884
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHS 555
+R G + D+C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSCRMAKV 104
Query: 556 SSMSRSVDLKGGLEAPGAEVGG 577
S + S P A G
Sbjct: 105 SQPACSSKASSVERVPSAGSGS 126
>gi|358374395|dbj|GAA90987.1| PHD finger domain protein [Aspergillus kawachii IFO 4308]
Length = 846
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNC 550
++D CH C LL C+GCP +FH +CL+P L PE W CP C
Sbjct: 459 NNDFCHNCNGSGQLLCCDGCPNSFHFSCLNPPLDPANPPEGDWFCPKC 506
>gi|300793885|ref|NP_001179869.1| bromodomain adjacent to zinc finger domain protein 1A [Bos taurus]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1153 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|403283154|ref|XP_003932992.1| PREDICTED: PHD finger protein 21B [Saimiri boliviensis boliviensis]
Length = 678
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 499 DEHCAACKRGANLQPCGACPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 545
>gi|357476459|ref|XP_003608515.1| Chromodomain-helicase-DNA-binding protein, partial [Medicago
truncatula]
gi|355509570|gb|AES90712.1| Chromodomain-helicase-DNA-binding protein, partial [Medicago
truncatula]
Length = 1406
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C +C G NLL C+ CP +H CLDP L IP W+CP+C
Sbjct: 73 CVICDLGGNLLCCDSCPRTYHLQCLDPPLKRIPMGKWQCPSC 114
>gi|145259007|ref|XP_001402242.1| PHD finger domain protein [Aspergillus niger CBS 513.88]
gi|134074859|emb|CAK38971.1| unnamed protein product [Aspergillus niger]
gi|350631896|gb|EHA20265.1| hypothetical protein ASPNIDRAFT_51426 [Aspergillus niger ATCC 1015]
Length = 848
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNC 550
++D CH C LL C+GCP +FH +CL+P L PE W CP C
Sbjct: 461 NNDFCHNCNGSGQLLCCDGCPNSFHFSCLNPPLDPANPPEGDWFCPKC 508
>gi|395838235|ref|XP_003792024.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Otolemur garnettii]
Length = 1525
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1120 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1165
>gi|198437529|ref|XP_002126456.1| PREDICTED: similar to Bromodomain adjacent to zinc finger domain
protein 1A (ATP-utilizing chromatin assembly and
remodeling factor 1) (hACF1) (ATP-dependent
chromatin-remodeling protein) (Williams syndrome
transcription factor-related chromatin-remodeling fa...
[Ciona intestinalis]
Length = 1458
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 6/78 (7%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C +C GDGE +LLC+ C H CL P L +P W CP+C+ S R V
Sbjct: 1178 CRICRRKGDGEKMLLCDNCDRGHHMYCLRPALKIVPSGDWFCPDCKPRQSRISPRKVVRT 1237
Query: 566 GGLEAPGAEVGGCVICRL 583
A VG + C+L
Sbjct: 1238 KSFSQVSA-VGADLACQL 1254
>gi|440909285|gb|ELR59209.1| Bromodomain adjacent to zinc finger domain protein 1A, partial [Bos
grunniens mutus]
Length = 1532
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1126 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1171
>gi|431901376|gb|ELK08402.1| Histone-lysine N-methyltransferase MLL2 [Pteropus alecto]
Length = 5640
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 74/187 (39%), Gaps = 37/187 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1430 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1489
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1490 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1531
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 679
H C D+++ D+ F C C Q +V A + L + K E
Sbjct: 1532 HASCESLFTEDDVEQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEP 1583
Query: 680 GIL-FDG 685
F+G
Sbjct: 1584 QYFRFEG 1590
Score = 46.2 bits (108), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 55/156 (35%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 248 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 294
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 295 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 323
Query: 626 KNGLCDLKEIPKDKWFC--CDDCNRIHAALQDFVSN 659
++E+P W C C C A + N
Sbjct: 324 P----PMEELPPHSWKCKACRVCRACGAGSAELNPN 355
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 264 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 323
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 324 PPMEELPPHSWKCKACR 340
>gi|73962719|ref|XP_537409.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Canis lupus familiaris]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|344298527|ref|XP_003420943.1| PREDICTED: PHD finger protein 21B, partial [Loxodonta africana]
Length = 405
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 298 DEHCVACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPQCQQ 344
>gi|410962140|ref|XP_003987633.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Felis catus]
Length = 1416
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1012 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1055
>gi|395838233|ref|XP_003792023.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Otolemur garnettii]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1152 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1197
>gi|346421429|ref|NP_001231088.1| bromodomain adjacent to zinc finger domain protein 1A [Sus scrofa]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|413935348|gb|AFW69899.1| hypothetical protein ZEAMMB73_132865 [Zea mays]
Length = 387
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 499 GQRRTTGGSDDMCHVCGDGE---NLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
G D C CG G+ L+LC+GC FH CL P+L +P W CP+CR
Sbjct: 20 GPEAAAADDDVRCEACGSGDAAAELMLCDGCDCGFHIFCLRPILPRVPAGDWYCPSCRAP 79
Query: 554 HSS 556
SS
Sbjct: 80 ASS 82
>gi|355671930|gb|AER94956.1| bromodomain adjacent to zinc finger domain, 1A [Mustela putorius
furo]
Length = 1221
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 815 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 860
>gi|62088090|dbj|BAD92492.1| bromodomain adjacent to zinc finger domain, 1A isoform b variant
[Homo sapiens]
Length = 1188
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 759 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 802
>gi|281342876|gb|EFB18460.1| hypothetical protein PANDA_011886 [Ailuropoda melanoleuca]
Length = 1527
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1123 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1166
>gi|410965818|ref|XP_003989437.1| PREDICTED: PHD finger protein 21B [Felis catus]
Length = 545
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 366 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPKCQQ 412
>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 4990
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 58/159 (36%), Gaps = 60/159 (37%)
Query: 500 QRRTTGGSDDMCHVCGDGENL---LLCNGCPLAFHAACLDP--LLIPE--SGWRCPNCRQ 552
Q T G + C +C + + L C C +H CL P L PE +GW+CP+C+
Sbjct: 1819 QAETIAGEEAFCVLCCQADKIGKQLFCTSCGHHYHGGCLHPSVALSPEVRAGWQCPDCKV 1878
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYC 612
C +CR P E D ++ C
Sbjct: 1879 -------------------------CQMCR-QPGE------------------DSKMLVC 1894
Query: 613 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDC 646
D C+K +H CL+ + IPK+ W C C DC
Sbjct: 1895 DTCDKGYHTFCLKPV----MTAIPKNGWKCKNCRVCGDC 1929
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 50/124 (40%), Gaps = 23/124 (18%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGG 567
MC G+ +L+C+ C +H CL P++ IP++GW+C NCR V G
Sbjct: 1881 MCRQPGEDSKMLVCDTCDKGYHTFCLKPVMTAIPKNGWKCKNCR----------VCGDCG 1930
Query: 568 LEAPGAEVGG--------CVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
PG+ C C ++ LC + F + +I C C+K
Sbjct: 1931 SRTPGSGPSSRWHLNYSVCDSCYQQRNKGLSCPLCGK---AYRQFTQKAMIQCGTCKKHV 1987
Query: 620 HVGC 623
H C
Sbjct: 1988 HAEC 1991
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 51/125 (40%), Gaps = 30/125 (24%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H CLDP L +P+ W+C C + +
Sbjct: 2196 VCEGCGKPHDEGRLLLCDECDISYHIYCLDPPLDQVPKGTWKCKWC----------VMCI 2245
Query: 565 KGGLEAPG------AEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKE 618
G PG C CR D +CR H + DD +I C QC +
Sbjct: 2246 NCGTTTPGFGCNWQNNYTQCGPCR----SKIDCPVCR-HKYQ----DDEMIIQCLQCNRW 2296
Query: 619 FHVGC 623
H C
Sbjct: 2297 LHALC 2301
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 967 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1026
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1027 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 1068
Query: 620 HVGC 623
H C
Sbjct: 1069 HAVC 1072
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 320 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 379
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 380 PVMKSVPTNGWKCKNCR 396
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 49/140 (35%), Gaps = 53/140 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 301 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 350
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C C+ S +D ++ CD C+K +H
Sbjct: 351 ---------------CQNCKQS-------------------GEDSKMLVCDTCDKGYHTF 376
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
CL+ +K +P + W C
Sbjct: 377 CLQPV----MKSVPTNGWKC 392
>gi|380798887|gb|AFE71319.1| lysine-specific demethylase 5A, partial [Macaca mulatta]
Length = 1653
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 237 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 296
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 297 DWRCPKC 303
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 36 EEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 36 EEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|444722488|gb|ELW63180.1| Bromodomain adjacent to zinc finger domain protein 1A [Tupaia
chinensis]
Length = 1836
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1461 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1506
>gi|32967605|ref|NP_872589.1| bromodomain adjacent to zinc finger domain protein 1A isoform b [Homo
sapiens]
gi|119586307|gb|EAW65903.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_d [Homo
sapiens]
Length = 1524
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
harrisii]
Length = 5047
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 74/189 (39%), Gaps = 41/189 (21%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q + S
Sbjct: 969 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAVSPGFH 1028
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C CR A ++ +I C CE+
Sbjct: 1029 CEWQNSYTHCGPCASLVTCPACR------------------APYVEEDLLIQCRHCERWM 1070
Query: 620 HVGCLRKNGLCDLKEIPK--DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 677
H GC L +E+ + D+ F C C Q +V I L + K
Sbjct: 1071 HAGC---ESLFTEEEVEQAADEGFDCASC-------QPYVVKPVVPIAPPELVPVKVKEP 1120
Query: 678 EKGIL-FDG 685
E F+G
Sbjct: 1121 EPQYFRFEG 1129
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 56/156 (35%), Gaps = 58/156 (37%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC G+ +LL C C +H ACLD L +GW+CP+C+
Sbjct: 225 DAHCVVCDGLGELRDLLFCTSCGQHYHGACLDTALTARKRAGWQCPDCKV---------- 274
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C CR P E D ++ C+ C+K +H
Sbjct: 275 ---------------CQTCR-QPGE------------------DSMMLVCEACDKGYHTF 300
Query: 623 CLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAAL 653
CL+ ++ +P D W C C C A L
Sbjct: 301 CLKPA----IQSLPPDSWKCKTCRVCRACGACPAEL 332
>gi|119586305|gb|EAW65901.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_b [Homo
sapiens]
Length = 1523
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1118 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1163
>gi|443717120|gb|ELU08315.1| hypothetical protein CAPTEDRAFT_106550 [Capitella teleta]
Length = 806
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 54/149 (36%), Gaps = 29/149 (19%)
Query: 508 DDMCHVCGDGE-----NLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS------- 555
D +C +C DGE +L C+ C LA H C IPE W C C Q S
Sbjct: 257 DAVCSICMDGECQNTNVILFCDMCNLAVHQECYGVPYIPEGQWLCRRCLQSPSRAVKCCL 316
Query: 556 ----------SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFD 605
+S SR + L P EV I L P +N D S
Sbjct: 317 CPNKGGAFKQTSTSRWAHVVCALWIP--EVAFANIVFLEPIDNVDRIPAARWKLSCYICK 374
Query: 606 DRTVIYCDQCEK-----EFHVGCLRKNGL 629
R C QC K FHV C ++ GL
Sbjct: 375 QRGTGACIQCHKTNCYTAFHVTCAQQAGL 403
>gi|29387265|gb|AAH48307.1| JARID1A protein [Homo sapiens]
gi|31753201|gb|AAH53893.1| JARID1A protein [Homo sapiens]
gi|83406060|gb|AAI10917.1| JARID1A protein [Homo sapiens]
Length = 1102
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREVVWVKVG 1768
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2113 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|355693214|gb|EHH27817.1| hypothetical protein EGK_18107, partial [Macaca mulatta]
Length = 1519
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1116 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1159
>gi|326912468|ref|XP_003202572.1| PREDICTED: PHD finger protein 21B-like, partial [Meleagris
gallopavo]
Length = 445
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H CLDP L P+ W CP C+Q
Sbjct: 264 DEYCTACKRGVNLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQQ 310
>gi|291392893|ref|XP_002712830.1| PREDICTED: retinoblastoma binding protein 2 isoform 2 [Oryctolagus
cuniculus]
Length = 1648
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|410046801|ref|XP_003313790.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
troglodytes]
Length = 2476
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 29/148 (19%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1026 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1085
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C IC A ++ +I C CE+
Sbjct: 1086 CEWQNSYTHCGPCASLVTCPICH------------------APYVEEDLLIQCRHCERWM 1127
Query: 620 HVGCLRKNGLCDLKEIPKDKWFCCDDCN 647
H GC D+++ D+ F C C
Sbjct: 1128 HAGCESLFTEDDVEQA-ADEGFDCVSCQ 1154
>gi|403263951|ref|XP_003924260.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Saimiri boliviensis boliviensis]
Length = 1240
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 835 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 880
>gi|119609377|gb|EAW88971.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_b
[Homo sapiens]
gi|162319392|gb|AAI56462.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
gi|225000544|gb|AAI72533.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
Length = 1641
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|7920301|gb|AAF70601.1|AF213467_1 ATP-dependent chromatin remodelling protein [Homo sapiens]
Length = 1556
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|397501128|ref|XP_003846181.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A [Pan paniscus]
Length = 1240
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 837 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 880
>gi|332842506|ref|XP_003314441.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Pan troglodytes]
Length = 1524
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|194207285|ref|XP_001490944.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Equus caballus]
Length = 1601
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1195 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1240
>gi|32967603|ref|NP_038476.2| bromodomain adjacent to zinc finger domain protein 1A isoform a [Homo
sapiens]
gi|116241266|sp|Q9NRL2.2|BAZ1A_HUMAN RecName: Full=Bromodomain adjacent to zinc finger domain protein 1A;
AltName: Full=ATP-dependent chromatin-remodeling protein;
AltName: Full=ATP-utilizing chromatin assembly and
remodeling factor 1; Short=hACF1; AltName: Full=CHRAC
subunit ACF1; AltName: Full=Williams syndrome
transcription factor-related chromatin-remodeling factor
180; Short=WCRF180; AltName: Full=hWALp1
gi|6942227|gb|AAF32366.1|AF221130_1 chromatin remodeling factor WCRF180 [Homo sapiens]
gi|119586304|gb|EAW65900.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_a [Homo
sapiens]
gi|162319394|gb|AAI56492.1| Bromodomain adjacent to zinc finger domain, 1A [synthetic construct]
Length = 1556
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|397499371|ref|XP_003820427.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A [Pan
paniscus]
Length = 1717
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|119586306|gb|EAW65902.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_c [Homo
sapiens]
gi|119586309|gb|EAW65905.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_c [Homo
sapiens]
Length = 1555
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1150 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1195
>gi|355778515|gb|EHH63551.1| hypothetical protein EGM_16543, partial [Macaca fascicularis]
Length = 1426
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1116 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1159
>gi|354498657|ref|XP_003511431.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Cricetulus griseus]
gi|344245398|gb|EGW01502.1| Bromodomain adjacent to zinc finger domain protein 1A [Cricetulus
griseus]
Length = 1525
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1121 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1166
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1712 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREVVWVKVG 1767
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2112 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2162
>gi|302816250|ref|XP_002989804.1| hypothetical protein SELMODRAFT_428350 [Selaginella moellendorffii]
gi|300142370|gb|EFJ09071.1| hypothetical protein SELMODRAFT_428350 [Selaginella moellendorffii]
Length = 2150
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 501 RRTTG--GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+RT G GS C VC G NLL C+ CP +H CL P L P WRCP+C
Sbjct: 236 KRTRGSDGSFFECMVCQSGGNLLCCDHCPRVYHLHCLSPPLKRAPTGKWRCPDC 289
>gi|443695479|gb|ELT96377.1| hypothetical protein CAPTEDRAFT_147496 [Capitella teleta]
Length = 1490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
MC++CG G E++LLC+ C AFH CL P L +P+ WRCP C
Sbjct: 282 MCNLCGRGDGEESMLLCDSCDDAFHMHCLIPPLHEVPKGDWRCPKC 327
>gi|441595313|ref|XP_004087232.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Nomascus leucogenys]
Length = 1533
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1128 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1173
>gi|426371192|ref|XP_004052536.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Gorilla gorilla gorilla]
Length = 1589
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 257 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 316
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 317 DWRCPKC 323
>gi|403263949|ref|XP_003924259.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Saimiri boliviensis boliviensis]
Length = 1524
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1523 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1578
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1923 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1973
>gi|426248832|ref|XP_004018162.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A [Ovis aries]
Length = 1647
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1242 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1285
>gi|417406655|gb|JAA49977.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
Length = 1649
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 229 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 288
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 289 DWRCPKC 295
>gi|241859183|ref|XP_002416180.1| fetal alzheimer antigen, putative [Ixodes scapularis]
gi|215510394|gb|EEC19847.1| fetal alzheimer antigen, putative [Ixodes scapularis]
Length = 2457
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSV-DL 564
DD C VC +LL C CP FH ACLDP L +P W C C+ S ++ + D+
Sbjct: 321 DDHCRVCHKLGDLLCCETCPAVFHLACLDPPLTDVPTEDWICTVCQANQVSGVTDCISDI 380
Query: 565 -KGGL 568
KGGL
Sbjct: 381 EKGGL 385
>gi|432091446|gb|ELK24528.1| Lysine-specific demethylase 5A [Myotis davidii]
Length = 1097
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|332842504|ref|XP_003314440.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Pan troglodytes]
Length = 1556
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1153 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|332838263|ref|XP_003313474.1| PREDICTED: lysine-specific demethylase 5A [Pan troglodytes]
Length = 1688
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|119586308|gb|EAW65904.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_e [Homo
sapiens]
Length = 1560
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1157 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1200
>gi|410354793|gb|JAA44000.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1688
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|387541282|gb|AFJ71268.1| bromodomain adjacent to zinc finger domain protein 1A isoform a
[Macaca mulatta]
Length = 1556
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|355785795|gb|EHH65978.1| Lysine-specific demethylase 5A [Macaca fascicularis]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|390467343|ref|XP_002752254.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A,
partial [Callithrix jacchus]
Length = 1595
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 152 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 211
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 212 DWRCPKC 218
>gi|355563873|gb|EHH20373.1| Lysine-specific demethylase 5A [Macaca mulatta]
gi|383420467|gb|AFH33447.1| lysine-specific demethylase 5A [Macaca mulatta]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|344254289|gb|EGW10393.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
Length = 4002
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1799 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1858
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C +C A ++ +I C CE+
Sbjct: 1859 CEWQNSYTHCGPCASLVTCPVCH------------------APYVEEDLLIQCRHCERWM 1900
Query: 620 HVGC 623
H GC
Sbjct: 1901 HAGC 1904
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 50/137 (36%), Gaps = 53/137 (38%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C++
Sbjct: 642 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKE------------- 688
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C CR +D ++ C+ C+K +H CL+
Sbjct: 689 ------------CQACR-------------------KPGNDSKMLVCETCDKGYHTFCLK 717
Query: 626 KNGLCDLKEIPKDKWFC 642
++E+P W C
Sbjct: 718 P----PIEELPAHSWKC 730
>gi|301756621|ref|XP_002914152.1| PREDICTED: lysine-specific demethylase 5A-like [Ailuropoda
melanoleuca]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|431907355|gb|ELK11328.1| Bromodomain adjacent to zinc finger domain protein 1A [Pteropus
alecto]
Length = 1550
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1144 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1189
>gi|410223430|gb|JAA08934.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
gi|410300510|gb|JAA28855.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
gi|410354795|gb|JAA44001.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|351710793|gb|EHB13712.1| Lysine-specific demethylase 5A [Heterocephalus glaber]
Length = 1694
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|348552013|ref|XP_003461823.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A-like
[Cavia porcellus]
Length = 1635
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 359 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 418
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 419 DWRCPKC 425
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/177 (23%), Positives = 67/177 (37%), Gaps = 61/177 (34%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC G+ +LL C C L +H CL+ + P SGW+C C+
Sbjct: 231 DSRCMVCDAPGELRDLLFCTSCGLHYHGTCLEITVTPRKRSGWQCHECKV---------- 280
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
C CRLS +D ++ C+ CEK +H
Sbjct: 281 ---------------CQTCRLS-------------------GEDSRMLVCEACEKCYHTY 306
Query: 623 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA-SSLSTINRKHIE 678
CL+ ++ +P D W C + L DF ++ P + + ++ +H+E
Sbjct: 307 CLKPA----IESVPADSWKCKTE-------LADFGTHHTTLFPNYTPAADVSVEHME 352
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 64/150 (42%), Gaps = 33/150 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q + S
Sbjct: 1022 VCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLNTVPKGGWKCKWCVCCVQCGAVSPGFH 1081
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C IC++ ++ +I C CE+
Sbjct: 1082 CEWQNNYTHCAPCASLVTCPICQVK------------------YVEEDLLIQCQHCERWM 1123
Query: 620 HVGCLRKNGLCDLKEIPK--DKWFCCDDCN 647
H C + L +E+ + D+ F C C
Sbjct: 1124 HAVC---DNLFTEEEVEQAADEGFDCTSCQ 1150
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 8/73 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H + + + R+ +G C VC G+ +L+C C +H CL
Sbjct: 250 TSCGLHYHGTCLEITVTPRKRSGWQCHECKVCQTCRLSGEDSRMLVCEACEKCYHTYCLK 309
Query: 537 PLL--IPESGWRC 547
P + +P W+C
Sbjct: 310 PAIESVPADSWKC 322
>gi|301774883|ref|XP_002922870.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Ailuropoda melanoleuca]
Length = 1641
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1235 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1280
>gi|121718370|ref|XP_001276189.1| PHD transcription factor (Rum1), putative [Aspergillus clavatus
NRRL 1]
gi|119404387|gb|EAW14763.1| PHD transcription factor (Rum1), putative [Aspergillus clavatus
NRRL 1]
Length = 1707
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 5/50 (10%)
Query: 509 DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
D C +CG GE+ +L+C+ C +H CLDP L +PE W CP C G
Sbjct: 441 DKCEICGKGEDRPSILVCDSCDQGYHKNCLDPPLTTVPEYDWHCPKCLVG 490
>gi|410261294|gb|JAA18613.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|332249134|ref|XP_003273720.1| PREDICTED: lysine-specific demethylase 5A [Nomascus leucogenys]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|431892152|gb|ELK02599.1| Lysine-specific demethylase 5A [Pteropus alecto]
Length = 1692
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDRLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|435778|gb|AAB28544.1| retinoblastoma binding protein 2 [Homo sapiens]
Length = 1722
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|417406677|gb|JAA49985.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|126340225|ref|XP_001373115.1| PREDICTED: lysine-specific demethylase 5A [Monodelphis domestica]
Length = 1689
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 273 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 332
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 333 DWRCPKC 339
>gi|110618244|ref|NP_001036068.1| lysine-specific demethylase 5A [Homo sapiens]
gi|215274124|sp|P29375.3|KDM5A_HUMAN RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
demethylase JARID1A; AltName: Full=Jumonji/ARID
domain-containing protein 1A; AltName:
Full=Retinoblastoma-binding protein 2; Short=RBBP-2
gi|168275608|dbj|BAG10524.1| histone demethylase JARID1A [synthetic construct]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 58/159 (36%)
Query: 498 MGQRRTTGGSDDMC-HVCGDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C H CG G+ NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 358 LGQVPLLLNGDITCMHCCGMGDVSNLVMCSICGQHYHGSCVGLALLPGVRAGWQCVSCR- 416
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYC 612
C +CR + D S V+ C
Sbjct: 417 ------------------------VCQVCR------------QPEDVSK-------VMLC 433
Query: 613 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDC 646
++CEK +H CLR + IPK W C C DC
Sbjct: 434 ERCEKAYHPSCLRP----IVTSIPKYGWKCKCCRVCTDC 468
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 53/128 (41%), Gaps = 36/128 (28%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C G +
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDPGFN 816
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
SS ++ G P A C+ C+ + +E D+ +I C QC
Sbjct: 817 SSWQKNYTQCG----PCASHTACISCQETYTEG-DL-----------------IIQCIQC 854
Query: 616 EKEFHVGC 623
E+ H C
Sbjct: 855 ERWLHCAC 862
>gi|297297682|ref|XP_001082156.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Macaca mulatta]
Length = 1646
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1243 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1286
>gi|119609376|gb|EAW88970.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_a
[Homo sapiens]
Length = 1315
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 58/159 (36%)
Query: 498 MGQRRTTGGSDDMC-HVCGDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C H CG G+ NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 358 LGQVPLLLNGDITCMHCCGMGDVSNLVMCSICGQHYHGSCVGLALLPGVRAGWQCVSCR- 416
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYC 612
C +CR + D S V+ C
Sbjct: 417 ------------------------VCQVCR------------QPEDVSK-------VMLC 433
Query: 613 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDC 646
++CEK +H CLR + IPK W C C DC
Sbjct: 434 ERCEKAYHPSCLRP----IVTSIPKYGWKCKCCRVCTDC 468
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 53/128 (41%), Gaps = 36/128 (28%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C G +
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDPGFN 816
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQC 615
SS ++ G P A C+ C+ + +E D+ +I C QC
Sbjct: 817 SSWQKNYTQCG----PCASHAACISCQETYTEG-DL-----------------IIQCIQC 854
Query: 616 EKEFHVGC 623
E+ H C
Sbjct: 855 ERWLHCAC 862
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 948 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSSGPR 1007
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 1008 CEWQNNYTQCAPCASLSSCPVC------------CRNYR------EEDLILQCRQCDRWM 1049
Query: 620 HVGC 623
H C
Sbjct: 1050 HAVC 1053
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 306 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 365
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 366 PVMKSVPTNGWKCKNCR 382
>gi|417406599|gb|JAA49949.1| Putative chromatin remodeling complex wstf-iswi large subunit
[Desmodus rotundus]
Length = 1557
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1152 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1197
>gi|410963533|ref|XP_003988319.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Felis catus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|449273319|gb|EMC82842.1| PHD finger protein 21B, partial [Columba livia]
Length = 446
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 5/73 (6%)
Query: 482 YTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL-- 539
Y +N + L AI L + ++ D+ C C G NL C CP A+H CLDP L
Sbjct: 242 YLNNPLFLSTRAILLLL---QSEIHHDEHCTSCKRGINLQPCGTCPRAYHLNCLDPPLKT 298
Query: 540 IPESGWRCPNCRQ 552
P+ W CP C+Q
Sbjct: 299 APKGVWVCPKCQQ 311
>gi|329663691|ref|NP_001192554.1| lysine-specific demethylase 5A [Bos taurus]
gi|296487060|tpg|DAA29173.1| TPA: JARID1A variant protein-like [Bos taurus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
anubis]
Length = 1431
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 54/128 (42%), Gaps = 28/128 (21%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 745 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 804
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 805 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 846
Query: 620 HVGCLRKN 627
H C N
Sbjct: 847 HAVCQNLN 854
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 96 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 155
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 156 PVMKSVPTNGWKCKNCR 172
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 70/213 (32%), Gaps = 62/213 (29%)
Query: 435 IVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI 494
+V G R + G CC ++ + A A Q + HI+ +
Sbjct: 13 VVSGSTERCAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHIF---------LLC 63
Query: 495 SLAMGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPN 549
+ Q D C VC GD + C C +H CLD + P +GW+CP
Sbjct: 64 PEHIDQAPERSKEDANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPE 123
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTV 609
C+ C C+ S +D +
Sbjct: 124 CKV-------------------------CQNCKQS-------------------GEDSKM 139
Query: 610 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 642
+ CD C+K +H CL+ +K +P + W C
Sbjct: 140 LVCDTCDKGYHTFCLQPV----MKSVPTNGWKC 168
>gi|355697494|gb|AES00689.1| lysine -specific demethylase 5A [Mustela putorius furo]
Length = 1504
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 294 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 353
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 354 DWRCPKC 360
>gi|291392891|ref|XP_002712829.1| PREDICTED: retinoblastoma binding protein 2 isoform 1 [Oryctolagus
cuniculus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|426227070|ref|XP_004007650.1| PREDICTED: lysine-specific demethylase 5A [Ovis aries]
Length = 1793
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 377 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 436
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 437 DWRCPKC 443
>gi|344253284|gb|EGW09388.1| Lysine-specific demethylase 5A [Cricetulus griseus]
Length = 1608
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 192 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 251
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 252 DWRCPKC 258
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2106 GKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1710 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1765
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2110 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2160
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C C DG +LL C C AFH CLDP + +PE W C +C Q
Sbjct: 88 NNDFCDACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSCEQN 136
>gi|242004184|ref|XP_002423004.1| Jumonji/ARID domain-containing protein 1D, putative [Pediculus
humanus corporis]
gi|212505920|gb|EEB10266.1| Jumonji/ARID domain-containing protein 1D, putative [Pediculus
humanus corporis]
Length = 1680
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH C G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 256 ICHNCERGDAEENMLLCDGCDDSYHTFCLLPPLTEIPKGDWRCPKC 301
>gi|194375257|dbj|BAG62741.1| unnamed protein product [Homo sapiens]
Length = 1049
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 233 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 292
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 293 DWRCPKC 299
>gi|195449007|ref|XP_002071907.1| GK24911 [Drosophila willistoni]
gi|194167992|gb|EDW82893.1| GK24911 [Drosophila willistoni]
Length = 884
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 35/81 (43%), Gaps = 4/81 (4%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHS 555
+R G + D C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDFCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPTGQWLCHSCRMAKM 104
Query: 556 SSMSRSVDLKGGLEAPGAEVG 576
S S P A G
Sbjct: 105 SHPPSSSKASSVERVPSAGSG 125
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 43/106 (40%), Gaps = 18/106 (16%)
Query: 462 PSQFEAHAGMAA--RRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSD----------- 508
P++ E HA + QP + ++T NG A QRR + +
Sbjct: 199 PAELELHAQFPGHGKVQPVQPVHTDNGHHGRKTGAGSAGNQRRNSKPFELDSQGLVPLPA 258
Query: 509 DMCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
C C L+ C+ CPL +H CLDP L +P W CPN
Sbjct: 259 KTCFYCKRSCKRAPLISCDYCPLYYHQDCLDPPLTALPAGLWMCPN 304
>gi|74141547|dbj|BAE38548.1| unnamed protein product [Mus musculus]
Length = 1093
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
Length = 5588
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C +C A ++ +I C CE+
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPVCH------------------APYVEEDLLIQCRHCERWM 1486
Query: 620 HVGC 623
H GC
Sbjct: 1487 HAGC 1490
>gi|354476339|ref|XP_003500382.1| PREDICTED: lysine-specific demethylase 5A [Cricetulus griseus]
Length = 1671
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 255 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 314
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 315 DWRCPKC 321
>gi|6683494|dbj|BAA89209.1| bromodomain adjacent to zinc finger domain 1A [Homo sapiens]
Length = 1674
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1271 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1314
>gi|4884108|emb|CAB43261.1| hypothetical protein [Homo sapiens]
Length = 811
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 408 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 451
>gi|443894534|dbj|GAC71882.1| DNA-binding protein jumonji/RBP2/SMCY [Pseudozyma antarctica T-34]
Length = 2474
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
G + MC +C DG N+LLC+ C +H CL P L IP+S W CP C G
Sbjct: 532 GAEEQMCEICLRGEDGPNMLLCDECNRGYHMYCLQPPLTSIPKSQWFCPPCLVG 585
>gi|68533053|dbj|BAE06081.1| JARID1A variant protein [Homo sapiens]
Length = 1731
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 315 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 374
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 375 DWRCPKC 381
>gi|390469001|ref|XP_002807272.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Callithrix jacchus]
Length = 1552
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1148 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1193
>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=Lysine N-methyltransferase 2D; Short=KMT2D
Length = 5588
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + P A + C +C A ++ +I C CE+
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPVCH------------------APYVEEDLLIQCRHCERWM 1486
Query: 620 HVGC 623
H GC
Sbjct: 1487 HAGC 1490
>gi|402875961|ref|XP_003901758.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A [Papio anubis]
Length = 1752
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1368 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1411
>gi|291403724|ref|XP_002718181.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1A isoform 2
[Oryctolagus cuniculus]
Length = 1525
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|195340386|ref|XP_002036794.1| GM12479 [Drosophila sechellia]
gi|194130910|gb|EDW52953.1| GM12479 [Drosophila sechellia]
Length = 876
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 38/73 (52%), Gaps = 9/73 (12%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR---- 551
+R G + D C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSCRMSKL 104
Query: 552 -QGHSSSMSRSVD 563
Q +SS + SV+
Sbjct: 105 SQPPASSKASSVE 117
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 520 LLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
L+ C+ CPL FH CLDP L +P W CPN
Sbjct: 269 LISCDYCPLYFHQDCLDPPLTALPAGLWMCPN 300
>gi|74201318|dbj|BAE26113.1| unnamed protein product [Mus musculus]
Length = 1094
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1608 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1663
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2003 GKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2058
>gi|359323071|ref|XP_854690.3| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Canis lupus familiaris]
Length = 1688
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|291403722|ref|XP_002718180.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1A isoform 1
[Oryctolagus cuniculus]
Length = 1557
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|71122345|gb|AAH99835.1| Jarid1a protein [Rattus norvegicus]
Length = 1099
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|51593641|gb|AAH80691.1| Jarid1a protein, partial [Mus musculus]
Length = 1102
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|395743750|ref|XP_002822777.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A,
partial [Pongo abelii]
Length = 1613
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 314 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 373
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 374 DWRCPKC 380
>gi|297682047|ref|XP_002818745.1| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Pongo
abelii]
Length = 1215
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 54/128 (42%), Gaps = 28/128 (21%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 671 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 730
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 731 CEWQNNYTQCAPCASLSSCPVC---------YRNYREEDL---------ILQCRQCDRWM 772
Query: 620 HVGCLRKN 627
H C N
Sbjct: 773 HAVCQNLN 780
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 22 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 81
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 82 PVMKSVPTNGWKCKNCR 98
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1402 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1457
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1797 GKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1852
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1608 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1663
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2003 GKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2058
>gi|148667219|gb|EDK99635.1| mCG129751 [Mus musculus]
Length = 1710
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|392340216|ref|XP_002726529.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Rattus norvegicus]
Length = 1639
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1341 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1396
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1736 GKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1791
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|363728201|ref|XP_425507.3| PREDICTED: PHD finger protein 21B [Gallus gallus]
Length = 579
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H CLDP L P+ W CP C+Q
Sbjct: 398 DEYCTACKRGVNLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQQ 444
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1444 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1499
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1839 GKRRSQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1894
>gi|349604671|gb|AEQ00159.1| Bromodomain adjacent to zinc finger domain protein 1A-like protein,
partial [Equus caballus]
Length = 410
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 146 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 191
>gi|392347603|ref|XP_002729471.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Rattus norvegicus]
Length = 1722
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1709 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1764
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2106 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2159
>gi|344278059|ref|XP_003410814.1| PREDICTED: lysine-specific demethylase 5A [Loxodonta africana]
Length = 1693
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1710 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1765
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2107 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2160
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1709 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1764
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2106 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2159
>gi|367000541|ref|XP_003685006.1| hypothetical protein TPHA_0C04220 [Tetrapisispora phaffii CBS 4417]
gi|357523303|emb|CCE62572.1| hypothetical protein TPHA_0C04220 [Tetrapisispora phaffii CBS 4417]
Length = 669
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNC 550
+DD C C + L C+ CP +FH CL+P L +PE W CP C
Sbjct: 264 NDDFCSTCQQSGSFLCCDTCPRSFHFLCLNPPLDPDNLPEGDWSCPQC 311
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1768
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 GKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1714 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1769
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2111 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2164
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1354 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1409
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1768
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 28/57 (49%), Gaps = 6/57 (10%)
Query: 498 MGQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2107 QGKRRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1709 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1764
Score = 42.7 bits (99), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2106 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2159
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1768
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 GKRRSQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|149049582|gb|EDM02036.1| rCG29703 [Rattus norvegicus]
Length = 1526
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2108 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|226958545|ref|NP_666109.2| lysine-specific demethylase 5A [Mus musculus]
gi|150383496|sp|Q3UXZ9.2|KDM5A_MOUSE RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
demethylase JARID1A; AltName: Full=Jumonji/ARID
domain-containing protein 1A; AltName:
Full=Retinoblastoma-binding protein 2; Short=RBBP-2
Length = 1690
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|403286751|ref|XP_003934640.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Saimiri boliviensis boliviensis]
Length = 1952
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 536 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 595
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 596 DWRCPKC 602
>gi|395538667|ref|XP_003771296.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Sarcophilus harrisii]
Length = 2011
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 523 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 566
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 673 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 728
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1711 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1766
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2111 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2161
>gi|296084643|emb|CBI25766.3| unnamed protein product [Vitis vinifera]
Length = 126
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
Query: 707 SLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSA 763
S ++ A A+ ECF+P+I + +++ ++Y G N F G Y+ IL +S
Sbjct: 20 SKIAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISV 79
Query: 764 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 801
+RI G ++AE+P +AT Y+ +G L IE +
Sbjct: 80 ASMRIHGNKLAEMPFIATRPSYRRQGMCHKLLVAIESV 117
>gi|156045475|ref|XP_001589293.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980]
gi|154694321|gb|EDN94059.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 883
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/140 (22%), Positives = 57/140 (40%), Gaps = 29/140 (20%)
Query: 508 DDMCHVCGDGEN-----LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
D +C +C ++ +L C+GCPLA H C IP+ W C C++ ++ +
Sbjct: 288 DPVCEICTKPDSKRPNLILFCDGCPLAVHQKCYSVPKIPDGDWFCKKCQRNRVAAEAARA 347
Query: 563 DLKGGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVG 622
+ L E+ C +CR S+ + +I C+ C+ H
Sbjct: 348 NENDALNDSDDEI-KCAVCRGLDSKKPN-----------------EIILCENCDYAVHQT 389
Query: 623 CLRKNGLCDLKEIPKDKWFC 642
C D+ + P+++W C
Sbjct: 390 C------GDIPKKPREEWLC 403
>gi|66810249|ref|XP_638848.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|60467522|gb|EAL65544.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 1678
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 5/58 (8%)
Query: 507 SDDMCHVCGDG-----ENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
+DD+C VC DG ++ C+GC +A H C LLIPE W C C S+S
Sbjct: 478 ADDVCGVCFDGTSDDTNQIVYCDGCDIAVHQDCYGILLIPEGHWFCQKCESPDKDSIS 535
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1768
Score = 42.4 bits (98), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2110 RRTQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1426 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1481
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1826 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1876
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1159 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1214
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1607 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1662
Score = 42.7 bits (99), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2004 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2057
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1710 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1765
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2110 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2160
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1768
Score = 42.4 bits (98), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2110 RRTQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1440 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1495
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1837 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1890
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1442 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1497
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1839 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1892
>gi|14626491|gb|AAK70213.1| MLL3-like protein [Mus musculus]
Length = 420
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 28/128 (21%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 46 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 105
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C CR++ ++ ++ C QC++
Sbjct: 106 CEWQNNYTQCAPCASLSSCPVC------------CRNY------REEDLILQCRQCDRWM 147
Query: 620 HVGCLRKN 627
H C N
Sbjct: 148 HAVCQNLN 155
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1441 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1496
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1838 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1891
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1441 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1496
Score = 42.4 bits (98), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1838 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1891
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 28/124 (22%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---RQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C R ++S
Sbjct: 973 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGPR 1032
Query: 562 VDLKGGLE--APGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEF 619
+ + AP A + C +C R R D ++ C QC++
Sbjct: 1033 GEWQNNYTQCAPCASLSACPVCH---------RNYREEDL---------ILQCRQCDRWM 1074
Query: 620 HVGC 623
H C
Sbjct: 1075 HAVC 1078
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 331 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 390
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GWRC NCR
Sbjct: 391 PVMKSVPTNGWRCKNCR 407
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1442 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1497
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1839 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1892
>gi|380023668|ref|XP_003695637.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis florea]
Length = 2272
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 511 CHVCGD--GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSVDL 564
C VCG G+NL+LC CP A+H C +P++ +P W C NC +Q + SR
Sbjct: 2043 CLVCGKRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKRNSSRRSHT 2102
Query: 565 KGG 567
KGG
Sbjct: 2103 KGG 2105
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/198 (21%), Positives = 74/198 (37%), Gaps = 53/198 (26%)
Query: 454 DCCNKEISPSQFEAHAGMAARRQPYRHIYTSN--GMTLHDIAISLAMGQRRTTGGSDDMC 511
D N+ P + G+ R+ +TS M L+ + S+A + C
Sbjct: 1934 DQSNQTPVPQEETTPRGLNNWREATARAHTSAQLAMALYMLEASIAWDKSIMKAN----C 1989
Query: 512 HVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
C G+N LLLC+GC +H C P + IP+ W C C M+++ +
Sbjct: 1990 QFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHEC-------MNKATGER- 2041
Query: 567 GLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 626
C++C +N + C+ C + +H C
Sbjct: 2042 ----------NCLVCGKRVGKNL--------------------VLCELCPRAYHTDC--H 2069
Query: 627 NGLCDLKEIPKDKWFCCD 644
N + + ++P+ KW+C +
Sbjct: 2070 NPV--MPKMPRGKWYCSN 2085
>gi|189530702|ref|XP_693055.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Danio rerio]
Length = 742
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD EN+LLC+GC +H C+ P L +P W CP CR
Sbjct: 391 CKVCRRKGDAENMLLCDGCGRGYHIFCVRPKLKAVPSEDWFCPECR 436
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1608 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1663
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2005 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2058
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1405 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1460
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1802 RRTQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1855
>gi|328792710|ref|XP_623473.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis mellifera]
Length = 2293
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 511 CHVCGD--GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSVDL 564
C VCG G+NL+LC CP A+H C +P++ +P W C NC +Q + SR
Sbjct: 2064 CLVCGKRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKRNSSRRSHT 2123
Query: 565 KGG 567
KGG
Sbjct: 2124 KGG 2126
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 31/139 (22%), Positives = 54/139 (38%), Gaps = 47/139 (33%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IP+ W C C M+++ +
Sbjct: 2010 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHEC-------MNKATGER 2062
Query: 566 GGLEAPGAEVGGCVICRLSPSENFDIRLCRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 625
C++C +N + C+ C + +H C
Sbjct: 2063 -----------NCLVCGKRVGKNL--------------------VLCELCPRAYHTDC-- 2089
Query: 626 KNGLCDLKEIPKDKWFCCD 644
N + + ++P+ KW+C +
Sbjct: 2090 HNPV--MPKMPRGKWYCSN 2106
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1354 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1409
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1740 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1793
>gi|255085606|ref|XP_002505234.1| predicted protein [Micromonas sp. RCC299]
gi|226520503|gb|ACO66492.1| predicted protein [Micromonas sp. RCC299]
Length = 1628
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 41/85 (48%), Gaps = 16/85 (18%)
Query: 508 DDMCHVCGD---GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSV 562
DD C VCG +L+C C +H ACLDP L +PE W CP C + + SRS
Sbjct: 977 DDGCQVCGGDVAAGVVLICEECTGEYHCACLDPPLESVPEGEWFCPACERAQETRDSRSR 1036
Query: 563 D---LKG--------GLEAPGAEVG 576
+ L+G GL P E+G
Sbjct: 1037 ELFALRGTKLDLVTRGLPTPDGELG 1061
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1400 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1455
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGDG L+ C GCP +HA CL+ P W CP
Sbjct: 1795 GKRRSQGEVTKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1850
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1605 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1660
Score = 46.2 bits (108), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 499 GQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGDG L+ C GCP +HA CL+ P W CP
Sbjct: 2000 GKRRSQGEVTKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2055
>gi|357472039|ref|XP_003606304.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507359|gb|AES88501.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 856
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 22/87 (25%)
Query: 503 TTGGSD-----DMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHS 555
+TG D D C +CG L+ C+GCP A+H+ C+ + IPE W CP C+
Sbjct: 406 STGDGDVDRNGDECRLCGMDGTLICCDGCPSAYHSRCIGVMKMFIPEGPWYCPECK---- 461
Query: 556 SSMSRSVDLKG-----GLEAPGAEVGG 577
+D+ G G GAEV G
Sbjct: 462 ------IDMAGPTIAKGTSLRGAEVFG 482
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1607 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1662
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2004 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2057
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1768
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 28/57 (49%), Gaps = 6/57 (10%)
Query: 498 MGQRRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
G+RR+ G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2107 QGKRRSQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|340718584|ref|XP_003397745.1| PREDICTED: PHD finger protein 12-like [Bombus terrestris]
Length = 638
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 7/72 (9%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC----LDPLLIPESGWRCPNCRQGHS 555
+RR G + D+C C DG L+ C+ CP ++H C +DP IP W C CR
Sbjct: 48 KRRGRGHNHDICDACRDGGELICCDKCPASYHLQCHYPAVDPADIPNGEWLCYACR---C 104
Query: 556 SSMSRSVDLKGG 567
+S +D KG
Sbjct: 105 ASKRNLLDTKGN 116
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 23/45 (51%), Gaps = 5/45 (11%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
+C CG L+ C+ CPL FH CLDP L P W CPN
Sbjct: 193 LCFECGRSCRKAPLIACDYCPLYFHQDCLDPPLTAFPIGRWMCPN 237
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1702 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1757
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2102 QGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2152
>gi|297261284|ref|XP_001107725.2| PREDICTED: PHD finger protein 21B-like, partial [Macaca mulatta]
Length = 537
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 472 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 518
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1404 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1459
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1801 RRTQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1854
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1445 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1500
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 501 RRTTGG----SDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
RRT G +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1842 RRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1895
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,743,782,312
Number of Sequences: 23463169
Number of extensions: 595738044
Number of successful extensions: 1471670
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1389
Number of HSP's successfully gapped in prelim test: 3847
Number of HSP's that attempted gapping in prelim test: 1453527
Number of HSP's gapped (non-prelim): 16966
length of query: 863
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 711
effective length of database: 8,792,793,679
effective search space: 6251676305769
effective search space used: 6251676305769
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)